Utilization of protein intrinsic disorder knowledge in structural proteomics
Oldfield, Christopher J.; Xue, Bin; Van, Ya-Yue; Ulrich, Eldon L.; Markley, John L.; Dunker, A. Keith; Uversky, Vladimir N.
2014-01-01
Intrinsically disordered proteins (IDPs) and proteins with long disordered regions are highly abundant in various proteomes. Despite their lack of well-defined ordered structure, these proteins and regions are frequently involved in crucial biological processes. Although in recent years these proteins have attracted the attention of many researchers, IDPs represent a significant challenge for structural characterization since these proteins can impact many of the processes in the structure determination pipeline. Here we investigate the effects of IDPs on the structure determination process and the utility of disorder prediction in selecting and improving proteins for structural characterization. Examination of the extent of intrinsic disorder in existing crystal structures found that relatively few protein crystal structures contain extensive regions of intrinsic disorder. Although intrinsic disorder is not the only cause of crystallization failures and many structured proteins cannot be crystallized, filtering out highly disordered proteins from structure-determination target lists is still likely to be cost effective. Therefore it is desirable to avoid highly disordered proteins from structure-determination target lists and we show that disorder prediction can be applied effectively to enrich structure determination pipelines with proteins more likely to yield crystal structures. For structural investigation of specific proteins, disorder prediction can be used to improve targets for structure determination. Finally, a framework for considering intrinsic disorder in the structure determination pipeline is proposed. PMID:23232152
Challenges in NMR-based structural genomics
NASA Astrophysics Data System (ADS)
Sue, Shih-Che; Chang, Chi-Fon; Huang, Yao-Te; Chou, Ching-Yu; Huang, Tai-huang
2005-05-01
Understanding the functions of the vast number of proteins encoded in many genomes that have been completely sequenced recently is the main challenge for biologists in the post-genomics era. Since the function of a protein is determined by its exact three-dimensional structure it is paramount to determine the 3D structures of all proteins. This need has driven structural biologists to undertake the structural genomics project aimed at determining the structures of all known proteins. Several centers for structural genomics studies have been established throughout the world. Nuclear magnetic resonance (NMR) spectroscopy has played a major role in determining protein structures in atomic details and in a physiologically relevant solution state. Since the number of new genes being discovered daily far exceeds the number of structures determined by both NMR and X-ray crystallography, a high-throughput method for speeding up the process of protein structure determination is essential for the success of the structural genomics effort. In this article we will describe NMR methods currently being employed for protein structure determination. We will also describe methods under development which may drastically increase the throughput, as well as point out areas where opportunities exist for biophysicists to make significant contribution in this important field.
Structural determination of intact proteins using mass spectrometry
Kruppa, Gary [San Francisco, CA; Schoeniger, Joseph S [Oakland, CA; Young, Malin M [Livermore, CA
2008-05-06
The present invention relates to novel methods of determining the sequence and structure of proteins. Specifically, the present invention allows for the analysis of intact proteins within a mass spectrometer. Therefore, preparatory separations need not be performed prior to introducing a protein sample into the mass spectrometer. Also disclosed herein are new instrumental developments for enhancing the signal from the desired modified proteins, methods for producing controlled protein fragments in the mass spectrometer, eliminating complex microseparations, and protein preparatory chemical steps necessary for cross-linking based protein structure determination.Additionally, the preferred method of the present invention involves the determination of protein structures utilizing a top-down analysis of protein structures to search for covalent modifications. In the preferred method, intact proteins are ionized and fragmented within the mass spectrometer.
Recent developments in structural proteomics for protein structure determination.
Liu, Hsuan-Liang; Hsu, Jyh-Ping
2005-05-01
The major challenges in structural proteomics include identifying all the proteins on the genome-wide scale, determining their structure-function relationships, and outlining the precise three-dimensional structures of the proteins. Protein structures are typically determined by experimental approaches such as X-ray crystallography or nuclear magnetic resonance (NMR) spectroscopy. However, the knowledge of three-dimensional space by these techniques is still limited. Thus, computational methods such as comparative and de novo approaches and molecular dynamic simulations are intensively used as alternative tools to predict the three-dimensional structures and dynamic behavior of proteins. This review summarizes recent developments in structural proteomics for protein structure determination; including instrumental methods such as X-ray crystallography and NMR spectroscopy, and computational methods such as comparative and de novo structure prediction and molecular dynamics simulations.
Structures of membrane proteins
Vinothkumar, Kutti R.; Henderson, Richard
2010-01-01
In reviewing the structures of membrane proteins determined up to the end of 2009, we present in words and pictures the most informative examples from each family. We group the structures together according to their function and architecture to provide an overview of the major principles and variations on the most common themes. The first structures, determined 20 years ago, were those of naturally abundant proteins with limited conformational variability, and each membrane protein structure determined was a major landmark. With the advent of complete genome sequences and efficient expression systems, there has been an explosion in the rate of membrane protein structure determination, with many classes represented. New structures are published every month and more than 150 unique membrane protein structures have been determined. This review analyses the reasons for this success, discusses the challenges that still lie ahead, and presents a concise summary of the key achievements with illustrated examples selected from each class. PMID:20667175
Automated crystallographic system for high-throughput protein structure determination.
Brunzelle, Joseph S; Shafaee, Padram; Yang, Xiaojing; Weigand, Steve; Ren, Zhong; Anderson, Wayne F
2003-07-01
High-throughput structural genomic efforts require software that is highly automated, distributive and requires minimal user intervention to determine protein structures. Preliminary experiments were set up to test whether automated scripts could utilize a minimum set of input parameters and produce a set of initial protein coordinates. From this starting point, a highly distributive system was developed that could determine macromolecular structures at a high throughput rate, warehouse and harvest the associated data. The system uses a web interface to obtain input data and display results. It utilizes a relational database to store the initial data needed to start the structure-determination process as well as generated data. A distributive program interface administers the crystallographic programs which determine protein structures. Using a test set of 19 protein targets, 79% were determined automatically.
SDSL-ESR-based protein structure characterization.
Strancar, Janez; Kavalenka, Aleh; Urbancic, Iztok; Ljubetic, Ajasja; Hemminga, Marcus A
2010-03-01
As proteins are key molecules in living cells, knowledge about their structure can provide important insights and applications in science, biotechnology, and medicine. However, many protein structures are still a big challenge for existing high-resolution structure-determination methods, as can be seen in the number of protein structures published in the Protein Data Bank. This is especially the case for less-ordered, more hydrophobic and more flexible protein systems. The lack of efficient methods for structure determination calls for urgent development of a new class of biophysical techniques. This work attempts to address this problem with a novel combination of site-directed spin labelling electron spin resonance spectroscopy (SDSL-ESR) and protein structure modelling, which is coupled by restriction of the conformational spaces of the amino acid side chains. Comparison of the application to four different protein systems enables us to generalize the new method and to establish a general procedure for determination of protein structure.
Serrano, Pedro; Dutta, Samit K; Proudfoot, Andrew; Mohanty, Biswaranjan; Susac, Lukas; Martin, Bryan; Geralt, Michael; Jaroszewski, Lukasz; Godzik, Adam; Elsliger, Marc; Wilson, Ian A; Wüthrich, Kurt
2016-11-01
For more than a decade, the Joint Center for Structural Genomics (JCSG; www.jcsg.org) worked toward increased three-dimensional structure coverage of the protein universe. This coordinated quest was one of the main goals of the four high-throughput (HT) structure determination centers of the Protein Structure Initiative (PSI; www.nigms.nih.gov/Research/specificareas/PSI). To achieve the goals of the PSI, the JCSG made use of the complementarity of structure determination by X-ray crystallography and nuclear magnetic resonance (NMR) spectroscopy to increase and diversify the range of targets entering the HT structure determination pipeline. The overall strategy, for both techniques, was to determine atomic resolution structures for representatives of large protein families, as defined by the Pfam database, which had no structural coverage and could make significant contributions to biological and biomedical research. Furthermore, the experimental structures could be leveraged by homology modeling to further expand the structural coverage of the protein universe and increase biological insights. Here, we describe what could be achieved by this structural genomics approach, using as an illustration the contributions from 20 NMR structure determinations out of a total of 98 JCSG NMR structures, which were selected because they are the first three-dimensional structure representations of the respective Pfam protein families. The information from this small sample is representative for the overall results from crystal and NMR structure determination in the JCSG. There are five new folds, which were classified as domains of unknown functions (DUF), three of the proteins could be functionally annotated based on three-dimensional structure similarity with previously characterized proteins, and 12 proteins showed only limited similarity with previous deposits in the Protein Data Bank (PDB) and were classified as DUFs. © 2016 Federation of European Biochemical Societies.
Ikeya, Teppei; Terauchi, Tsutomu; Güntert, Peter; Kainosho, Masatsune
2006-07-01
Recently we have developed the stereo-array isotope labeling (SAIL) technique to overcome the conventional molecular size limitation in NMR protein structure determination by employing complete stereo- and regiospecific patterns of stable isotopes. SAIL sharpens signals and simplifies spectra without the loss of requisite structural information, thus making large classes of proteins newly accessible to detailed solution structure determination. The automated structure calculation program CYANA can efficiently analyze SAIL-NOESY spectra and calculate structures without manual analysis. Nevertheless, the original SAIL method might not be capable of determining the structures of proteins larger than 50 kDa or membrane proteins, for which the spectra are characterized by many broadened and overlapped peaks. Here we have carried out simulations of new SAIL patterns optimized for minimal relaxation and overlap, to evaluate the combined use of SAIL and CYANA for solving the structures of larger proteins and membrane proteins. The modified approach reduces the number of peaks to nearly half of that observed with uniform labeling, while still yielding well-defined structures and is expected to enable NMR structure determinations of these challenging systems.
Masica, David L; Ash, Jason T; Ndao, Moise; Drobny, Gary P; Gray, Jeffrey J
2010-12-08
Protein-biomineral interactions are paramount to materials production in biology, including the mineral phase of hard tissue. Unfortunately, the structure of biomineral-associated proteins cannot be determined by X-ray crystallography or solution nuclear magnetic resonance (NMR). Here we report a method for determining the structure of biomineral-associated proteins. The method combines solid-state NMR (ssNMR) and ssNMR-biased computational structure prediction. In addition, the algorithm is able to identify lattice geometries most compatible with ssNMR constraints, representing a quantitative, novel method for investigating crystal-face binding specificity. We use this method to determine most of the structure of human salivary statherin interacting with the mineral phase of tooth enamel. Computation and experiment converge on an ensemble of related structures and identify preferential binding at three crystal surfaces. The work represents a significant advance toward determining structure of biomineral-adsorbed protein using experimentally biased structure prediction. This method is generally applicable to proteins that can be chemically synthesized. Copyright © 2010 Elsevier Ltd. All rights reserved.
Rapid and reliable protein structure determination via chemical shift threading.
Hafsa, Noor E; Berjanskii, Mark V; Arndt, David; Wishart, David S
2018-01-01
Protein structure determination using nuclear magnetic resonance (NMR) spectroscopy can be both time-consuming and labor intensive. Here we demonstrate how chemical shift threading can permit rapid, robust, and accurate protein structure determination using only chemical shift data. Threading is a relatively old bioinformatics technique that uses a combination of sequence information and predicted (or experimentally acquired) low-resolution structural data to generate high-resolution 3D protein structures. The key motivations behind using NMR chemical shifts for protein threading lie in the fact that they are easy to measure, they are available prior to 3D structure determination, and they contain vital structural information. The method we have developed uses not only sequence and chemical shift similarity but also chemical shift-derived secondary structure, shift-derived super-secondary structure, and shift-derived accessible surface area to generate a high quality protein structure regardless of the sequence similarity (or lack thereof) to a known structure already in the PDB. The method (called E-Thrifty) was found to be very fast (often < 10 min/structure) and to significantly outperform other shift-based or threading-based structure determination methods (in terms of top template model accuracy)-with an average TM-score performance of 0.68 (vs. 0.50-0.62 for other methods). Coupled with recent developments in chemical shift refinement, these results suggest that protein structure determination, using only NMR chemical shifts, is becoming increasingly practical and reliable. E-Thrifty is available as a web server at http://ethrifty.ca .
Improved in-cell structure determination of proteins at near-physiological concentration
Ikeya, Teppei; Hanashima, Tomomi; Hosoya, Saori; Shimazaki, Manato; Ikeda, Shiro; Mishima, Masaki; Güntert, Peter; Ito, Yutaka
2016-01-01
Investigating three-dimensional (3D) structures of proteins in living cells by in-cell nuclear magnetic resonance (NMR) spectroscopy opens an avenue towards understanding the structural basis of their functions and physical properties under physiological conditions inside cells. In-cell NMR provides data at atomic resolution non-invasively, and has been used to detect protein-protein interactions, thermodynamics of protein stability, the behavior of intrinsically disordered proteins, etc. in cells. However, so far only a single de novo 3D protein structure could be determined based on data derived only from in-cell NMR. Here we introduce methods that enable in-cell NMR protein structure determination for a larger number of proteins at concentrations that approach physiological ones. The new methods comprise (1) advances in the processing of non-uniformly sampled NMR data, which reduces the measurement time for the intrinsically short-lived in-cell NMR samples, (2) automatic chemical shift assignment for obtaining an optimal resonance assignment, and (3) structure refinement with Bayesian inference, which makes it possible to calculate accurate 3D protein structures from sparse data sets of conformational restraints. As an example application we determined the structure of the B1 domain of protein G at about 250 μM concentration in living E. coli cells. PMID:27910948
G-protein-coupled receptor structures were not built in a day.
Blois, Tracy M; Bowie, James U
2009-07-01
Among the most exciting recent developments in structural biology is the structure determination of G-protein-coupled receptors (GPCRs), which comprise the largest class of membrane proteins in mammalian cells and have enormous importance for disease and drug development. The GPCR structures are perhaps the most visible examples of a nascent revolution in membrane protein structure determination. Like other major milestones in science, however, such as the sequencing of the human genome, these achievements were built on a hidden foundation of technological developments. Here, we describe some of the methods that are fueling the membrane protein structure revolution and have enabled the determination of the current GPCR structures, along with new techniques that may lead to future structures.
Protein Structure Determination from Pseudocontact Shifts Using ROSETTA
Schmitz, Christophe; Vernon, Robert; Otting, Gottfried; Baker, David; Huber, Thomas
2013-01-01
Paramagnetic metal ions generate pseudocontact shifts (PCSs) in nuclear magnetic resonance spectra that are manifested as easily measurable changes in chemical shifts. Metals can be incorporated into proteins through metal binding tags, and PCS data constitute powerful long-range restraints on the positions of nuclear spins relative to the coordinate system of the magnetic susceptibility anisotropy tensor (Δχ-tensor) of the metal ion. We show that three-dimensional structures of proteins can reliably be determined using PCS data from a single metal binding site combined with backbone chemical shifts. The program PCS-ROSETTA automatically determines the Δχ-tensor and metal position from the PCS data during the structure calculations, without any prior knowledge of the protein structure. The program can determine structures accurately for proteins of up to 150 residues, offering a powerful new approach to protein structure determination that relies exclusively on readily measurable backbone chemical shifts and easily discriminates between correctly and incorrectly folded conformations. PMID:22285518
Mathematical methods for protein science
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hart, W.; Istrail, S.; Atkins, J.
1997-12-31
Understanding the structure and function of proteins is a fundamental endeavor in molecular biology. Currently, over 100,000 protein sequences have been determined by experimental methods. The three dimensional structure of the protein determines its function, but there are currently less than 4,000 structures known to atomic resolution. Accordingly, techniques to predict protein structure from sequence have an important role in aiding the understanding of the Genome and the effects of mutations in genetic disease. The authors describe current efforts at Sandia to better understand the structure of proteins through rigorous mathematical analyses of simple lattice models. The efforts have focusedmore » on two aspects of protein science: mathematical structure prediction, and inverse protein folding.« less
Racemic & quasi-racemic protein crystallography enabled by chemical protein synthesis.
Kent, Stephen Bh
2018-04-04
A racemic protein mixture can be used to form centrosymmetric crystals for structure determination by X-ray diffraction. Both the unnatural d-protein and the corresponding natural l-protein are made by total chemical synthesis based on native chemical ligation-chemoselective condensation of unprotected synthetic peptide segments. Racemic protein crystallography is important for structure determination of the many natural protein molecules that are refractory to crystallization. Racemic mixtures facilitate the crystallization of recalcitrant proteins, and give diffraction-quality crystals. Quasi-racemic crystallization, using a single d-protein molecule, can facilitate the determination of the structures of a series of l-protein analog molecules. Copyright © 2018 Elsevier Ltd. All rights reserved.
Non-Uniform Sampling and J-UNIO Automation for Efficient Protein NMR Structure Determination.
Didenko, Tatiana; Proudfoot, Andrew; Dutta, Samit Kumar; Serrano, Pedro; Wüthrich, Kurt
2015-08-24
High-resolution structure determination of small proteins in solution is one of the big assets of NMR spectroscopy in structural biology. Improvements in the efficiency of NMR structure determination by advances in NMR experiments and automation of data handling therefore attracts continued interest. Here, non-uniform sampling (NUS) of 3D heteronuclear-resolved [(1)H,(1)H]-NOESY data yielded two- to three-fold savings of instrument time for structure determinations of soluble proteins. With the 152-residue protein NP_372339.1 from Staphylococcus aureus and the 71-residue protein NP_346341.1 from Streptococcus pneumonia we show that high-quality structures can be obtained with NUS NMR data, which are equally well amenable to robust automated analysis as the corresponding uniformly sampled data. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Protein structure determination by exhaustive search of Protein Data Bank derived databases.
Stokes-Rees, Ian; Sliz, Piotr
2010-12-14
Parallel sequence and structure alignment tools have become ubiquitous and invaluable at all levels in the study of biological systems. We demonstrate the application and utility of this same parallel search paradigm to the process of protein structure determination, benefitting from the large and growing corpus of known structures. Such searches were previously computationally intractable. Through the method of Wide Search Molecular Replacement, developed here, they can be completed in a few hours with the aide of national-scale federated cyberinfrastructure. By dramatically expanding the range of models considered for structure determination, we show that small (less than 12% structural coverage) and low sequence identity (less than 20% identity) template structures can be identified through multidimensional template scoring metrics and used for structure determination. Many new macromolecular complexes can benefit significantly from such a technique due to the lack of known homologous protein folds or sequences. We demonstrate the effectiveness of the method by determining the structure of a full-length p97 homologue from Trichoplusia ni. Example cases with the MHC/T-cell receptor complex and the EmoB protein provide systematic estimates of minimum sequence identity, structure coverage, and structural similarity required for this method to succeed. We describe how this structure-search approach and other novel computationally intensive workflows are made tractable through integration with the US national computational cyberinfrastructure, allowing, for example, rapid processing of the entire Structural Classification of Proteins protein fragment database.
Three key residues form a critical contact network in a protein folding transition state
NASA Astrophysics Data System (ADS)
Vendruscolo, Michele; Paci, Emanuele; Dobson, Christopher M.; Karplus, Martin
2001-02-01
Determining how a protein folds is a central problem in structural biology. The rate of folding of many proteins is determined by the transition state, so that a knowledge of its structure is essential for understanding the protein folding reaction. Here we use mutation measurements-which determine the role of individual residues in stabilizing the transition state-as restraints in a Monte Carlo sampling procedure to determine the ensemble of structures that make up the transition state. We apply this approach to the experimental data for the 98-residue protein acylphosphatase, and obtain a transition-state ensemble with the native-state topology and an average root-mean-square deviation of 6Å from the native structure. Although about 20 residues with small positional fluctuations form the structural core of this transition state, the native-like contact network of only three of these residues is sufficient to determine the overall fold of the protein. This result reveals how a nucleation mechanism involving a small number of key residues can lead to folding of a polypeptide chain to its unique native-state structure.
NMR in the SPINE Structural Proteomics project.
Ab, E; Atkinson, A R; Banci, L; Bertini, I; Ciofi-Baffoni, S; Brunner, K; Diercks, T; Dötsch, V; Engelke, F; Folkers, G E; Griesinger, C; Gronwald, W; Günther, U; Habeck, M; de Jong, R N; Kalbitzer, H R; Kieffer, B; Leeflang, B R; Loss, S; Luchinat, C; Marquardsen, T; Moskau, D; Neidig, K P; Nilges, M; Piccioli, M; Pierattelli, R; Rieping, W; Schippmann, T; Schwalbe, H; Travé, G; Trenner, J; Wöhnert, J; Zweckstetter, M; Kaptein, R
2006-10-01
This paper describes the developments, role and contributions of the NMR spectroscopy groups in the Structural Proteomics In Europe (SPINE) consortium. Focusing on the development of high-throughput (HTP) pipelines for NMR structure determinations of proteins, all aspects from sample preparation, data acquisition, data processing, data analysis to structure determination have been improved with respect to sensitivity, automation, speed, robustness and validation. Specific highlights are protonless (13)C-direct detection methods and inferential structure determinations (ISD). In addition to technological improvements, these methods have been applied to deliver over 60 NMR structures of proteins, among which are five that failed to crystallize. The inclusion of NMR spectroscopy in structural proteomics pipelines improves the success rate for protein structure determinations.
Automated protein NMR structure determination using wavelet de-noised NOESY spectra.
Dancea, Felician; Günther, Ulrich
2005-11-01
A major time-consuming step of protein NMR structure determination is the generation of reliable NOESY cross peak lists which usually requires a significant amount of manual interaction. Here we present a new algorithm for automated peak picking involving wavelet de-noised NOESY spectra in a process where the identification of peaks is coupled to automated structure determination. The core of this method is the generation of incremental peak lists by applying different wavelet de-noising procedures which yield peak lists of a different noise content. In combination with additional filters which probe the consistency of the peak lists, good convergence of the NOESY-based automated structure determination could be achieved. These algorithms were implemented in the context of the ARIA software for automated NOE assignment and structure determination and were validated for a polysulfide-sulfur transferase protein of known structure. The procedures presented here should be commonly applicable for efficient protein NMR structure determination and automated NMR peak picking.
NMR-based automated protein structure determination.
Würz, Julia M; Kazemi, Sina; Schmidt, Elena; Bagaria, Anurag; Güntert, Peter
2017-08-15
NMR spectra analysis for protein structure determination can now in many cases be performed by automated computational methods. This overview of the computational methods for NMR protein structure analysis presents recent automated methods for signal identification in multidimensional NMR spectra, sequence-specific resonance assignment, collection of conformational restraints, and structure calculation, as implemented in the CYANA software package. These algorithms are sufficiently reliable and integrated into one software package to enable the fully automated structure determination of proteins starting from NMR spectra without manual interventions or corrections at intermediate steps, with an accuracy of 1-2 Å backbone RMSD in comparison with manually solved reference structures. Copyright © 2017 Elsevier Inc. All rights reserved.
Mixing and Matching Detergents for Membrane Protein NMR Structure Determination
DOE Office of Scientific and Technical Information (OSTI.GOV)
Columbus, Linda; Lipfert, Jan; Jambunathan, Kalyani
2009-10-21
One major obstacle to membrane protein structure determination is the selection of a detergent micelle that mimics the native lipid bilayer. Currently, detergents are selected by exhaustive screening because the effects of protein-detergent interactions on protein structure are poorly understood. In this study, the structure and dynamics of an integral membrane protein in different detergents is investigated by nuclear magnetic resonance (NMR) and electron paramagnetic resonance (EPR) spectroscopy and small-angle X-ray scattering (SAXS). The results suggest that matching of the micelle dimensions to the protein's hydrophobic surface avoids exchange processes that reduce the completeness of the NMR observations. Based onmore » these dimensions, several mixed micelles were designed that improved the completeness of NMR observations. These findings provide a basis for the rational design of mixed micelles that may advance membrane protein structure determination by NMR.« less
Automation of NMR structure determination of proteins.
Altieri, Amanda S; Byrd, R Andrew
2004-10-01
The automation of protein structure determination using NMR is coming of age. The tedious processes of resonance assignment, followed by assignment of NOE (nuclear Overhauser enhancement) interactions (now intertwined with structure calculation), assembly of input files for structure calculation, intermediate analyses of incorrect assignments and bad input data, and finally structure validation are all being automated with sophisticated software tools. The robustness of the different approaches continues to deal with problems of completeness and uniqueness; nevertheless, the future is very bright for automation of NMR structure generation to approach the levels found in X-ray crystallography. Currently, near completely automated structure determination is possible for small proteins, and the prospect for medium-sized and large proteins is good. Copyright 2004 Elsevier Ltd.
In Silico Analysis for the Study of Botulinum Toxin Structure
NASA Astrophysics Data System (ADS)
Suzuki, Tomonori; Miyazaki, Satoru
2010-01-01
Protein-protein interactions play many important roles in biological function. Knowledge of protein-protein complex structure is required for understanding the function. The determination of protein-protein complex structure by experimental studies remains difficult, therefore computational prediction of protein structures by structure modeling and docking studies is valuable method. In addition, MD simulation is also one of the most popular methods for protein structure modeling and characteristics. Here, we attempt to predict protein-protein complex structure and property using some of bioinformatic methods, and we focus botulinum toxin complex as target structure.
My 65 years in protein chemistry.
Scheraga, Harold A
2015-05-01
This is a tour of a physical chemist through 65 years of protein chemistry from the time when emphasis was placed on the determination of the size and shape of the protein molecule as a colloidal particle, with an early breakthrough by James Sumner, followed by Linus Pauling and Fred Sanger, that a protein was a real molecule, albeit a macromolecule. It deals with the recognition of the nature and importance of hydrogen bonds and hydrophobic interactions in determining the structure, properties, and biological function of proteins until the present acquisition of an understanding of the structure, thermodynamics, and folding pathways from a linear array of amino acids to a biological entity. Along the way, with a combination of experiment and theoretical interpretation, a mechanism was elucidated for the thrombin-induced conversion of fibrinogen to a fibrin blood clot and for the oxidative-folding pathways of ribonuclease A. Before the atomic structure of a protein molecule was determined by x-ray diffraction or nuclear magnetic resonance spectroscopy, experimental studies of the fundamental interactions underlying protein structure led to several distance constraints which motivated the theoretical approach to determine protein structure, and culminated in the Empirical Conformational Energy Program for Peptides (ECEPP), an all-atom force field, with which the structures of fibrous collagen-like proteins and the 46-residue globular staphylococcal protein A were determined. To undertake the study of larger globular proteins, a physics-based coarse-grained UNited-RESidue (UNRES) force field was developed, and applied to the protein-folding problem in terms of structure, thermodynamics, dynamics, and folding pathways. Initially, single-chain and, ultimately, multiple-chain proteins were examined, and the methodology was extended to protein-protein interactions and to nucleic acids and to protein-nucleic acid interactions. The ultimate results led to an understanding of a variety of biological processes underlying natural and disease phenomena.
Feng, Yingang
2017-01-01
The use of NMR methods to determine the three-dimensional structures of carbohydrates and glycoproteins is still challenging, in part because of the lack of standard protocols. In order to increase the convenience of structure determination, the topology and parameter files for carbohydrates in the program Crystallography & NMR System (CNS) were investigated and new files were developed to be compatible with the standard simulated annealing protocols for proteins and nucleic acids. Recalculating the published structures of protein-carbohydrate complexes and glycosylated proteins demonstrates that the results are comparable to the published structures which employed more complex procedures for structure calculation. Integrating the new carbohydrate parameters into the standard structure calculation protocol will facilitate three-dimensional structural study of carbohydrates and glycosylated proteins by NMR spectroscopy.
2017-01-01
The use of NMR methods to determine the three-dimensional structures of carbohydrates and glycoproteins is still challenging, in part because of the lack of standard protocols. In order to increase the convenience of structure determination, the topology and parameter files for carbohydrates in the program Crystallography & NMR System (CNS) were investigated and new files were developed to be compatible with the standard simulated annealing protocols for proteins and nucleic acids. Recalculating the published structures of protein-carbohydrate complexes and glycosylated proteins demonstrates that the results are comparable to the published structures which employed more complex procedures for structure calculation. Integrating the new carbohydrate parameters into the standard structure calculation protocol will facilitate three-dimensional structural study of carbohydrates and glycosylated proteins by NMR spectroscopy. PMID:29232406
High-throughput crystallization screening.
Skarina, Tatiana; Xu, Xiaohui; Evdokimova, Elena; Savchenko, Alexei
2014-01-01
Protein structure determination by X-ray crystallography is dependent on obtaining a single protein crystal suitable for diffraction data collection. Due to this requirement, protein crystallization represents a key step in protein structure determination. The conditions for protein crystallization have to be determined empirically for each protein, making this step also a bottleneck in the structure determination process. Typical protein crystallization practice involves parallel setup and monitoring of a considerable number of individual protein crystallization experiments (also called crystallization trials). In these trials the aliquots of purified protein are mixed with a range of solutions composed of a precipitating agent, buffer, and sometimes an additive that have been previously successful in prompting protein crystallization. The individual chemical conditions in which a particular protein shows signs of crystallization are used as a starting point for further crystallization experiments. The goal is optimizing the formation of individual protein crystals of sufficient size and quality to make them suitable for diffraction data collection. Thus the composition of the primary crystallization screen is critical for successful crystallization.Systematic analysis of crystallization experiments carried out on several hundred proteins as part of large-scale structural genomics efforts allowed the optimization of the protein crystallization protocol and identification of a minimal set of 96 crystallization solutions (the "TRAP" screen) that, in our experience, led to crystallization of the maximum number of proteins.
Structural Genomics and Drug Discovery for Infectious Diseases
DOE Office of Scientific and Technical Information (OSTI.GOV)
Anderson, W.F.
The application of structural genomics methods and approaches to proteins from organisms causing infectious diseases is making available the three dimensional structures of many proteins that are potential drug targets and laying the groundwork for structure aided drug discovery efforts. There are a number of structural genomics projects with a focus on pathogens that have been initiated worldwide. The Center for Structural Genomics of Infectious Diseases (CSGID) was recently established to apply state-of-the-art high throughput structural biology technologies to the characterization of proteins from the National Institute for Allergy and Infectious Diseases (NIAID) category A-C pathogens and organisms causing emerging,more » or re-emerging infectious diseases. The target selection process emphasizes potential biomedical benefits. Selected proteins include known drug targets and their homologs, essential enzymes, virulence factors and vaccine candidates. The Center also provides a structure determination service for the infectious disease scientific community. The ultimate goal is to generate a library of structures that are available to the scientific community and can serve as a starting point for further research and structure aided drug discovery for infectious diseases. To achieve this goal, the CSGID will determine protein crystal structures of 400 proteins and protein-ligand complexes using proven, rapid, highly integrated, and cost-effective methods for such determination, primarily by X-ray crystallography. High throughput crystallographic structure determination is greatly aided by frequent, convenient access to high-performance beamlines at third-generation synchrotron X-ray sources.« less
NIAS-Server: Neighbors Influence of Amino acids and Secondary Structures in Proteins.
Borguesan, Bruno; Inostroza-Ponta, Mario; Dorn, Márcio
2017-03-01
The exponential growth in the number of experimentally determined three-dimensional protein structures provide a new and relevant knowledge about the conformation of amino acids in proteins. Only a few of probability densities of amino acids are publicly available for use in structure validation and prediction methods. NIAS (Neighbors Influence of Amino acids and Secondary structures) is a web-based tool used to extract information about conformational preferences of amino acid residues and secondary structures in experimental-determined protein templates. This information is useful, for example, to characterize folds and local motifs in proteins, molecular folding, and can help the solution of complex problems such as protein structure prediction, protein design, among others. The NIAS-Server and supplementary data are available at http://sbcb.inf.ufrgs.br/nias .
Dissecting the relationship between protein structure and sequence variation
NASA Astrophysics Data System (ADS)
Shahmoradi, Amir; Wilke, Claus; Wilke Lab Team
2015-03-01
Over the past decade several independent works have shown that some structural properties of proteins are capable of predicting protein evolution. The strength and significance of these structure-sequence relations, however, appear to vary widely among different proteins, with absolute correlation strengths ranging from 0 . 1 to 0 . 8 . Here we present the results from a comprehensive search for the potential biophysical and structural determinants of protein evolution by studying more than 200 structural and evolutionary properties in a dataset of 209 monomeric enzymes. We discuss the main protein characteristics responsible for the general patterns of protein evolution, and identify sequence divergence as the main determinant of the strengths of virtually all structure-evolution relationships, explaining ~ 10 - 30 % of observed variation in sequence-structure relations. In addition to sequence divergence, we identify several protein structural properties that are moderately but significantly coupled with the strength of sequence-structure relations. In particular, proteins with more homogeneous back-bone hydrogen bond energies, large fractions of helical secondary structures and low fraction of beta sheets tend to have the strongest sequence-structure relation. BEACON-NSF center for the study of evolution in action.
PDBStat: a universal restraint converter and restraint analysis software package for protein NMR.
Tejero, Roberto; Snyder, David; Mao, Binchen; Aramini, James M; Montelione, Gaetano T
2013-08-01
The heterogeneous array of software tools used in the process of protein NMR structure determination presents organizational challenges in the structure determination and validation processes, and creates a learning curve that limits the broader use of protein NMR in biology. These challenges, including accurate use of data in different data formats required by software carrying out similar tasks, continue to confound the efforts of novices and experts alike. These important issues need to be addressed robustly in order to standardize protein NMR structure determination and validation. PDBStat is a C/C++ computer program originally developed as a universal coordinate and protein NMR restraint converter. Its primary function is to provide a user-friendly tool for interconverting between protein coordinate and protein NMR restraint data formats. It also provides an integrated set of computational methods for protein NMR restraint analysis and structure quality assessment, relabeling of prochiral atoms with correct IUPAC names, as well as multiple methods for analysis of the consistency of atomic positions indicated by their convergence across a protein NMR ensemble. In this paper we provide a detailed description of the PDBStat software, and highlight some of its valuable computational capabilities. As an example, we demonstrate the use of the PDBStat restraint converter for restrained CS-Rosetta structure generation calculations, and compare the resulting protein NMR structure models with those generated from the same NMR restraint data using more traditional structure determination methods. These results demonstrate the value of a universal restraint converter in allowing the use of multiple structure generation methods with the same restraint data for consensus analysis of protein NMR structures and the underlying restraint data.
PDBStat: A Universal Restraint Converter and Restraint Analysis Software Package for Protein NMR
Tejero, Roberto; Snyder, David; Mao, Binchen; Aramini, James M.; Montelione, Gaetano T
2013-01-01
The heterogeneous array of software tools used in the process of protein NMR structure determination presents organizational challenges in the structure determination and validation processes, and creates a learning curve that limits the broader use of protein NMR in biology. These challenges, including accurate use of data in different data formats required by software carrying out similar tasks, continue to confound the efforts of novices and experts alike. These important issues need to be addressed robustly in order to standardize protein NMR structure determination and validation. PDBStat is a C/C++ computer program originally developed as a universal coordinate and protein NMR restraint converter. Its primary function is to provide a user-friendly tool for interconverting between protein coordinate and protein NMR restraint data formats. It also provides an integrated set of computational methods for protein NMR restraint analysis and structure quality assessment, relabeling of prochiral atoms with correct IUPAC names, as well as multiple methods for analysis of the consistency of atomic positions indicated by their convergence across a protein NMR ensemble. In this paper we provide a detailed description of the PDBStat software, and highlight some of its valuable computational capabilities. As an example, we demonstrate the use of the PDBStat restraint converter for restrained CS-Rosetta structure generation calculations, and compare the resulting protein NMR structure models with those generated from the same NMR restraint data using more traditional structure determination methods. These results demonstrate the value of a universal restraint converter in allowing the use of multiple structure generation methods with the same restraint data for consensus analysis of protein NMR structures and the underlying restraint data. PMID:23897031
SFG analysis of surface bound proteins: a route towards structure determination.
Weidner, Tobias; Castner, David G
2013-08-14
The surface of a material is rapidly covered with proteins once that material is placed in a biological environment. The structure and function of these bound proteins play a key role in the interactions and communications of the material with the biological environment. Thus, it is crucial to gain a molecular level understanding of surface bound protein structure. While X-ray diffraction and solution phase NMR methods are well established for determining the structure of proteins in the crystalline or solution phase, there is not a corresponding single technique that can provide the same level of structural detail about proteins at surfaces or interfaces. However, recent advances in sum frequency generation (SFG) vibrational spectroscopy have significantly increased our ability to obtain structural information about surface bound proteins and peptides. A multi-technique approach of combining SFG with (1) protein engineering methods to selectively introduce mutations and isotopic labels, (2) other experimental methods such as time-of-flight secondary ion mass spectrometry (ToF-SIMS) and near edge X-ray absorption fine structure (NEXAFS) to provide complementary information, and (3) molecular dynamic (MD) simulations to extend the molecular level experimental results is a particularly promising route for structural characterization of surface bound proteins and peptides. By using model peptides and small proteins with well-defined structures, methods have been developed to determine the orientation of both backbone and side chains to the surface.
SFG analysis of surface bound proteins: A route towards structure determination
Weidner, Tobias; Castner, David G.
2013-01-01
The surface of a material is rapidly covered with proteins once that material is placed in a biological environment. The structure and function of these bound proteins play a key role in the interactions and communications of the material with the biological environment. Thus, it is crucial to gain a molecular level understanding of surface bound protein structure. While X-ray diffraction and solution phase NMR methods are well established for determining the structure of proteins in the crystalline or solution phase, there is not a corresponding single technique that can provide the same level of structural detail about proteins at surfaces or interfaces. However, recent advances in sum frequency generation (SFG) vibrational spectroscopy have significantly increased our ability to obtain structural information about surface bound proteins and peptides. A multi-technique approach of combining SFG with (1) protein engineering methods to selectively introduce mutations and isotopic labels, (2) other experimental methods such as time-of-flight secondary ion mass spectrometry (ToF-SIMS) and near edge x-ray absorption fine structure (NEXAFS) to provide complementary information, and (3) molecular dynamic (MD) simulations to extend the molecular level experimental results is a particularly promising route for structural characterization of surface bound proteins and peptides. By using model peptides and small proteins with well-defined structures, methods have been developed to determine the orientation of both backbone and side chains to the surface. PMID:23727992
My 65 years in protein chemistry
Scheraga, Harold A.
2015-01-01
This is a tour of a physical chemist through 65 years of protein chemistry from the time when emphasis was placed on the determination of the size and shape of the protein molecule as a colloidal particle, with an early breakthrough by James Sumner, followed by Linus Pauling and Fred Sanger, that a protein was a real molecule, albeit a macromolecule. It deals with the recognition of the nature and importance of hydrogen bonds and hydrophobic interactions in determining the structure, properties, and biological function of proteins until the present acquisition of an understanding of the structure, thermodynamics, and folding pathways from a linear array of amino acids to a biological entity. Along the way, with a combination of experiment and theoretical interpretation, a mechanism was elucidated for the thrombin-induced conversion of fibrinogen to a fibrin blood clot and for the oxidative-folding pathways of ribonuclease A. Before the atomic structure of a protein molecule was determined by x-ray diffraction or nuclear magnetic resonance spectroscopy, experimental studies of the fundamental interactions underlying protein structure led to several distance constraints which motivated the theoretical approach to determine protein structure, and culminated in the Empirical Conformational Energy Program for Peptides (ECEPP), an all-atom force field, with which the structures of fibrous collagen-like proteins and the 46-residue globular staphylococcal protein A were determined. To undertake the study of larger globular proteins, a physics-based coarse-grained UNited-RESidue (UNRES) force field was developed, and applied to the protein-folding problem in terms of structure, thermodynamics, dynamics, and folding pathways. Initially, single-chain and, ultimately, multiple-chain proteins were examined, and the methodology was extended to protein–protein interactions and to nucleic acids and to protein–nucleic acid interactions. The ultimate results led to an understanding of a variety of biological processes underlying natural and disease phenomena. PMID:25850343
Cell-free protein synthesis for structure determination by X-ray crystallography.
Watanabe, Miki; Miyazono, Ken-ichi; Tanokura, Masaru; Sawasaki, Tatsuya; Endo, Yaeta; Kobayashi, Ichizo
2010-01-01
Structure determination has been difficult for those proteins that are toxic to the cells and cannot be prepared in a large amount in vivo. These proteins, even when biologically very interesting, tend to be left uncharacterized in the structural genomics projects. Their cell-free synthesis can bypass the toxicity problem. Among the various cell-free systems, the wheat-germ-based system is of special interest due to the following points: (1) Because the gene is placed under a plant translational signal, its toxic expression in a bacterial host is reduced. (2) It has only little codon preference and, especially, little discrimination between methionine and selenomethionine (SeMet), which allows easy preparation of selenomethionylated proteins for crystal structure determination by SAD and MAD methods. (3) Translation is uncoupled from transcription, so that the toxicity of the translation product on DNA and its transcription, if any, can be bypassed. We have shown that the wheat-germ-based cell-free protein synthesis is useful for X-ray crystallography of one of the 4-bp cutter restriction enzymes, which are expected to be very toxic to all forms of cells retaining the genome. Our report on its structure represents the first report of structure determination by X-ray crystallography using protein overexpressed with the wheat-germ-based cell-free protein expression system. This will be a method of choice for cytotoxic proteins when its cost is not a problem. Its use will become popular when the crystal structure determination technology has evolved to require only a tiny amount of protein.
Potrzebowski, Wojciech; André, Ingemar
2015-07-01
For highly oriented fibrillar molecules, three-dimensional structures can often be determined from X-ray fiber diffraction data. However, because of limited information content, structure determination and validation can be challenging. We demonstrate that automated structure determination of protein fibers can be achieved by guiding the building of macromolecular models with fiber diffraction data. We illustrate the power of our approach by determining the structures of six bacteriophage viruses de novo using fiber diffraction data alone and together with solid-state NMR data. Furthermore, we demonstrate the feasibility of molecular replacement from monomeric and fibrillar templates by solving the structure of a plant virus using homology modeling and protein-protein docking. The generated models explain the experimental data to the same degree as deposited reference structures but with improved structural quality. We also developed a cross-validation method for model selection. The results highlight the power of fiber diffraction data as structural constraints.
G-LoSA for Prediction of Protein-Ligand Binding Sites and Structures.
Lee, Hui Sun; Im, Wonpil
2017-01-01
Recent advances in high-throughput structure determination and computational protein structure prediction have significantly enriched the universe of protein structure. However, there is still a large gap between the number of available protein structures and that of proteins with annotated function in high accuracy. Computational structure-based protein function prediction has emerged to reduce this knowledge gap. The identification of a ligand binding site and its structure is critical to the determination of a protein's molecular function. We present a computational methodology for predicting small molecule ligand binding site and ligand structure using G-LoSA, our protein local structure alignment and similarity measurement tool. All the computational procedures described here can be easily implemented using G-LoSA Toolkit, a package of standalone software programs and preprocessed PDB structure libraries. G-LoSA and G-LoSA Toolkit are freely available to academic users at http://compbio.lehigh.edu/GLoSA . We also illustrate a case study to show the potential of our template-based approach harnessing G-LoSA for protein function prediction.
Adjusting protein graphs based on graph entropy.
Peng, Sheng-Lung; Tsay, Yu-Wei
2014-01-01
Measuring protein structural similarity attempts to establish a relationship of equivalence between polymer structures based on their conformations. In several recent studies, researchers have explored protein-graph remodeling, instead of looking a minimum superimposition for pairwise proteins. When graphs are used to represent structured objects, the problem of measuring object similarity become one of computing the similarity between graphs. Graph theory provides an alternative perspective as well as efficiency. Once a protein graph has been created, its structural stability must be verified. Therefore, a criterion is needed to determine if a protein graph can be used for structural comparison. In this paper, we propose a measurement for protein graph remodeling based on graph entropy. We extend the concept of graph entropy to determine whether a graph is suitable for representing a protein. The experimental results suggest that when applied, graph entropy helps a conformational on protein graph modeling. Furthermore, it indirectly contributes to protein structural comparison if a protein graph is solid.
Adjusting protein graphs based on graph entropy
2014-01-01
Measuring protein structural similarity attempts to establish a relationship of equivalence between polymer structures based on their conformations. In several recent studies, researchers have explored protein-graph remodeling, instead of looking a minimum superimposition for pairwise proteins. When graphs are used to represent structured objects, the problem of measuring object similarity become one of computing the similarity between graphs. Graph theory provides an alternative perspective as well as efficiency. Once a protein graph has been created, its structural stability must be verified. Therefore, a criterion is needed to determine if a protein graph can be used for structural comparison. In this paper, we propose a measurement for protein graph remodeling based on graph entropy. We extend the concept of graph entropy to determine whether a graph is suitable for representing a protein. The experimental results suggest that when applied, graph entropy helps a conformational on protein graph modeling. Furthermore, it indirectly contributes to protein structural comparison if a protein graph is solid. PMID:25474347
New paradigm in ankyrin repeats: Beyond protein-protein interaction module.
Islam, Zeyaul; Nagampalli, Raghavendra Sashi Krishna; Fatima, Munazza Tamkeen; Ashraf, Ghulam Md
2018-04-01
Classically, ankyrin repeat (ANK) proteins are built from tandems of two or more repeats and form curved solenoid structures that are associated with protein-protein interactions. These are short, widespread structural motif of around 33 amino acids repeats in tandem, having a canonical helix-loop-helix fold, found individually or in combination with other domains. The multiplicity of structural pattern enables it to form assemblies of diverse sizes, required for their abilities to confer multiple binding and structural roles of proteins. Three-dimensional structures of these repeats determined to date reveal a degree of structural variability that translates into the considerable functional versatility of this protein superfamily. Recent work on the ANK has proposed novel structural information, especially protein-lipid, protein-sugar and protein-protein interaction. Self-assembly of these repeats was also shown to prevent the associated protein in forming filaments. In this review, we summarize the latest findings and how the new structural information has increased our understanding of the structural determinants of ANK proteins. We discussed latest findings on how these proteins participate in various interactions to diversify the ANK roles in numerous biological processes, and explored the emerging and evolving field of designer ankyrins and its framework for protein engineering emphasizing on biotechnological applications. Copyright © 2017 Elsevier B.V. All rights reserved.
Nealon, John Oliver; Philomina, Limcy Seby
2017-01-01
The elucidation of protein–protein interactions is vital for determining the function and action of quaternary protein structures. Here, we discuss the difficulty and importance of establishing protein quaternary structure and review in vitro and in silico methods for doing so. Determining the interacting partner proteins of predicted protein structures is very time-consuming when using in vitro methods, this can be somewhat alleviated by use of predictive methods. However, developing reliably accurate predictive tools has proved to be difficult. We review the current state of the art in predictive protein interaction software and discuss the problem of scoring and therefore ranking predictions. Current community-based predictive exercises are discussed in relation to the growth of protein interaction prediction as an area within these exercises. We suggest a fusion of experimental and predictive methods that make use of sparse experimental data to determine higher resolution predicted protein interactions as being necessary to drive forward development. PMID:29206185
X-ray laser diffraction for structure determination of the rhodopsin-arrestin complex
NASA Astrophysics Data System (ADS)
Zhou, X. Edward; Gao, Xiang; Barty, Anton; Kang, Yanyong; He, Yuanzheng; Liu, Wei; Ishchenko, Andrii; White, Thomas A.; Yefanov, Oleksandr; Han, Gye Won; Xu, Qingping; de Waal, Parker W.; Suino-Powell, Kelly M.; Boutet, Sébastien; Williams, Garth J.; Wang, Meitian; Li, Dianfan; Caffrey, Martin; Chapman, Henry N.; Spence, John C. H.; Fromme, Petra; Weierstall, Uwe; Stevens, Raymond C.; Cherezov, Vadim; Melcher, Karsten; Xu, H. Eric
2016-04-01
Serial femtosecond X-ray crystallography (SFX) using an X-ray free electron laser (XFEL) is a recent advancement in structural biology for solving crystal structures of challenging membrane proteins, including G-protein coupled receptors (GPCRs), which often only produce microcrystals. An XFEL delivers highly intense X-ray pulses of femtosecond duration short enough to enable the collection of single diffraction images before significant radiation damage to crystals sets in. Here we report the deposition of the XFEL data and provide further details on crystallization, XFEL data collection and analysis, structure determination, and the validation of the structural model. The rhodopsin-arrestin crystal structure solved with SFX represents the first near-atomic resolution structure of a GPCR-arrestin complex, provides structural insights into understanding of arrestin-mediated GPCR signaling, and demonstrates the great potential of this SFX-XFEL technology for accelerating crystal structure determination of challenging proteins and protein complexes.
X-ray laser diffraction for structure determination of the rhodopsin-arrestin complex.
Zhou, X Edward; Gao, Xiang; Barty, Anton; Kang, Yanyong; He, Yuanzheng; Liu, Wei; Ishchenko, Andrii; White, Thomas A; Yefanov, Oleksandr; Han, Gye Won; Xu, Qingping; de Waal, Parker W; Suino-Powell, Kelly M; Boutet, Sébastien; Williams, Garth J; Wang, Meitian; Li, Dianfan; Caffrey, Martin; Chapman, Henry N; Spence, John C H; Fromme, Petra; Weierstall, Uwe; Stevens, Raymond C; Cherezov, Vadim; Melcher, Karsten; Xu, H Eric
2016-04-12
Serial femtosecond X-ray crystallography (SFX) using an X-ray free electron laser (XFEL) is a recent advancement in structural biology for solving crystal structures of challenging membrane proteins, including G-protein coupled receptors (GPCRs), which often only produce microcrystals. An XFEL delivers highly intense X-ray pulses of femtosecond duration short enough to enable the collection of single diffraction images before significant radiation damage to crystals sets in. Here we report the deposition of the XFEL data and provide further details on crystallization, XFEL data collection and analysis, structure determination, and the validation of the structural model. The rhodopsin-arrestin crystal structure solved with SFX represents the first near-atomic resolution structure of a GPCR-arrestin complex, provides structural insights into understanding of arrestin-mediated GPCR signaling, and demonstrates the great potential of this SFX-XFEL technology for accelerating crystal structure determination of challenging proteins and protein complexes.
X-ray laser diffraction for structure determination of the rhodopsin-arrestin complex
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, X. Edward; Gao, Xiang; Barty, Anton
Here, serial femtosecond X-ray crystallography (SFX) using an X-ray free electron laser (XFEL) is a recent advancement in structural biology for solving crystal structures of challenging membrane proteins, including G-protein coupled receptors (GPCRs), which often only produce microcrystals. An XFEL delivers highly intense X-ray pulses of femtosecond duration short enough to enable the collection of single diffraction images before significant radiation damage to crystals sets in. Here we report the deposition of the XFEL data and provide further details on crystallization, XFEL data collection and analysis, structure determination, and the validation of the structural model. The rhodopsin-arrestin crystal structure solvedmore » with SFX represents the first near-atomic resolution structure of a GPCR-arrestin complex, provides structural insights into understanding of arrestin-mediated GPCR signaling, and demonstrates the great potential of this SFX-XFEL technology for accelerating crystal structure determination of challenging proteins and protein complexes.« less
X-ray laser diffraction for structure determination of the rhodopsin-arrestin complex
Zhou, X. Edward; Gao, Xiang; Barty, Anton; Kang, Yanyong; He, Yuanzheng; Liu, Wei; Ishchenko, Andrii; White, Thomas A.; Yefanov, Oleksandr; Han, Gye Won; Xu, Qingping; de Waal, Parker W.; Suino-Powell, Kelly M.; Boutet, Sébastien; Williams, Garth J.; Wang, Meitian; Li, Dianfan; Caffrey, Martin; Chapman, Henry N.; Spence, John C.H.; Fromme, Petra; Weierstall, Uwe; Stevens, Raymond C.; Cherezov, Vadim; Melcher, Karsten; Xu, H. Eric
2016-01-01
Serial femtosecond X-ray crystallography (SFX) using an X-ray free electron laser (XFEL) is a recent advancement in structural biology for solving crystal structures of challenging membrane proteins, including G-protein coupled receptors (GPCRs), which often only produce microcrystals. An XFEL delivers highly intense X-ray pulses of femtosecond duration short enough to enable the collection of single diffraction images before significant radiation damage to crystals sets in. Here we report the deposition of the XFEL data and provide further details on crystallization, XFEL data collection and analysis, structure determination, and the validation of the structural model. The rhodopsin-arrestin crystal structure solved with SFX represents the first near-atomic resolution structure of a GPCR-arrestin complex, provides structural insights into understanding of arrestin-mediated GPCR signaling, and demonstrates the great potential of this SFX-XFEL technology for accelerating crystal structure determination of challenging proteins and protein complexes. PMID:27070998
X-ray laser diffraction for structure determination of the rhodopsin-arrestin complex
Zhou, X. Edward; Gao, Xiang; Barty, Anton; ...
2016-04-12
Here, serial femtosecond X-ray crystallography (SFX) using an X-ray free electron laser (XFEL) is a recent advancement in structural biology for solving crystal structures of challenging membrane proteins, including G-protein coupled receptors (GPCRs), which often only produce microcrystals. An XFEL delivers highly intense X-ray pulses of femtosecond duration short enough to enable the collection of single diffraction images before significant radiation damage to crystals sets in. Here we report the deposition of the XFEL data and provide further details on crystallization, XFEL data collection and analysis, structure determination, and the validation of the structural model. The rhodopsin-arrestin crystal structure solvedmore » with SFX represents the first near-atomic resolution structure of a GPCR-arrestin complex, provides structural insights into understanding of arrestin-mediated GPCR signaling, and demonstrates the great potential of this SFX-XFEL technology for accelerating crystal structure determination of challenging proteins and protein complexes.« less
Dal Palù, Alessandro; Pontelli, Enrico; He, Jing; Lu, Yonggang
2007-01-01
The paper describes a novel framework, constructed using Constraint Logic Programming (CLP) and parallelism, to determine the association between parts of the primary sequence of a protein and alpha-helices extracted from 3D low-resolution descriptions of large protein complexes. The association is determined by extracting constraints from the 3D information, regarding length, relative position and connectivity of helices, and solving these constraints with the guidance of a secondary structure prediction algorithm. Parallelism is employed to enhance performance on large proteins. The framework provides a fast, inexpensive alternative to determine the exact tertiary structure of unknown proteins.
NASA Astrophysics Data System (ADS)
Ward, Meaghan E.; Brown, Leonid S.; Ladizhansky, Vladimir
2015-04-01
Studies of the structure, dynamics, and function of membrane proteins (MPs) have long been considered one of the main applications of solid-state NMR (SSNMR). Advances in instrumentation, and the plethora of new SSNMR methodologies developed over the past decade have resulted in a number of high-resolution structures and structural models of both bitopic and polytopic α-helical MPs. The necessity to retain lipids in the sample, the high proportion of one type of secondary structure, differential dynamics, and the possibility of local disorder in the loop regions all create challenges for structure determination. In this Perspective article we describe our recent efforts directed at determining the structure and functional dynamics of Anabaena Sensory Rhodopsin, a heptahelical transmembrane (7TM) protein. We review some of the established and emerging methods which can be utilized for SSNMR-based structure determination, with a particular focus on those used for ASR, a bacterial protein which shares its 7TM architecture with G-protein coupled receptors.
Structure determination of an integral membrane protein at room temperature from crystals in situ
DOE Office of Scientific and Technical Information (OSTI.GOV)
Axford, Danny; Foadi, James; Imperial College London, London SW7 2AZ
2015-05-14
The X-ray structure determination of an integral membrane protein using synchrotron diffraction data measured in situ at room temperature is demonstrated. The structure determination of an integral membrane protein using synchrotron X-ray diffraction data collected at room temperature directly in vapour-diffusion crystallization plates (in situ) is demonstrated. Exposing the crystals in situ eliminates manual sample handling and, since it is performed at room temperature, removes the complication of cryoprotection and potential structural anomalies induced by sample cryocooling. Essential to the method is the ability to limit radiation damage by recording a small amount of data per sample from many samplesmore » and subsequently assembling the resulting data sets using specialized software. The validity of this procedure is established by the structure determination of Haemophilus influenza TehA at 2.3 Å resolution. The method presented offers an effective protocol for the fast and efficient determination of membrane-protein structures at room temperature using third-generation synchrotron beamlines.« less
Present and future of membrane protein structure determination by electron crystallography.
Ubarretxena-Belandia, Iban; Stokes, David L
2010-01-01
Membrane proteins are critical to cell physiology, playing roles in signaling, trafficking, transport, adhesion, and recognition. Despite their relative abundance in the proteome and their prevalence as targets of therapeutic drugs, structural information about membrane proteins is in short supply. This chapter describes the use of electron crystallography as a tool for determining membrane protein structures. Electron crystallography offers distinct advantages relative to the alternatives of X-ray crystallography and NMR spectroscopy. Namely, membrane proteins are placed in their native membranous environment, which is likely to favor a native conformation and allow changes in conformation in response to physiological ligands. Nevertheless, there are significant logistical challenges in finding appropriate conditions for inducing membrane proteins to form two-dimensional arrays within the membrane and in using electron cryo-microscopy to collect the data required for structure determination. A number of developments are described for high-throughput screening of crystallization trials and for automated imaging of crystals with the electron microscope. These tools are critical for exploring the necessary range of factors governing the crystallization process. There have also been recent software developments to facilitate the process of structure determination. However, further innovations in the algorithms used for processing images and electron diffraction are necessary to improve throughput and to make electron crystallography truly viable as a method for determining atomic structures of membrane proteins. Copyright © 2010 Elsevier Inc. All rights reserved.
Present and future of membrane protein structure determination by electron crystallography
Ubarretxena-Belandia, Iban; Stokes, David L.
2011-01-01
Membrane proteins are critical to cell physiology, playing roles in signaling, trafficking, transport, adhesion, and recognition. Despite their relative abundance in the proteome and their prevalence as targets of therapeutic drugs, structural information about membrane proteins is in short supply. This review describes the use of electron crystallography as a tool for determining membrane protein structures. Electron crystallography offers distinct advantages relative to the alternatives of X-ray crystallography and NMR spectroscopy. Namely, membrane proteins are placed in their native membranous environment, which is likely to favor a native conformation and allow changes in conformation in response to physiological ligands. Nevertheless, there are significant logistical challenges in finding appropriate conditions for inducing membrane proteins to form two-dimensional arrays within the membrane and in using electron cryo-microscopy to collect the data required for structure determination. A number of developments are described for high-throughput screening of crystallization trials and for automated imaging of crystals with the electron microscope. These tools are critical for exploring the necessary range of factors governing the crystallization process. There have also been recent software developments to facilitate the process of structure determination. However, further innovations in the algorithms used for processing images and electron diffraction are necessary to improve throughput and to make electron crystallography truly viable as a method for determining atomic structures of membrane proteins. PMID:21115172
Protein crystallization X-ray diffraction data collection Protein structure determination Obtaining structures of protein-ligand complexes Site-directed mutagenesis Structure-function relationship Enzymatic CelA," Science (2013) "Sequence, Structure, and Evolution of Cellulases in Glycoside
Structural Genomics of Protein Phosphatases
DOE Office of Scientific and Technical Information (OSTI.GOV)
Almo,S.; Bonanno, J.; Sauder, J.
The New York SGX Research Center for Structural Genomics (NYSGXRC) of the NIGMS Protein Structure Initiative (PSI) has applied its high-throughput X-ray crystallographic structure determination platform to systematic studies of all human protein phosphatases and protein phosphatases from biomedically-relevant pathogens. To date, the NYSGXRC has determined structures of 21 distinct protein phosphatases: 14 from human, 2 from mouse, 2 from the pathogen Toxoplasma gondii, 1 from Trypanosoma brucei, the parasite responsible for African sleeping sickness, and 2 from the principal mosquito vector of malaria in Africa, Anopheles gambiae. These structures provide insights into both normal and pathophysiologic processes, including transcriptionalmore » regulation, regulation of major signaling pathways, neural development, and type 1 diabetes. In conjunction with the contributions of other international structural genomics consortia, these efforts promise to provide an unprecedented database and materials repository for structure-guided experimental and computational discovery of inhibitors for all classes of protein phosphatases.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stols, L.; Sanville Millard, C.; Dementieva, I.
2004-03-01
A simplified approach developed recently for the production of heterologous proteins in Escherichia coli uses 2-liter polyethylene terephthalate beverage bottles as disposable culture vessels [Sanville Millard, C. et al. 2003. Protein Expr. Purif. 29, 311-320]. The method greatly reduces the time and effort needed to produce native proteins for structural or functional studies. We now demonstrate that the approach is also well suited for production of proteins in defined media with incorporation of selenomethionine to facilitate structure determination by multiwavelength anomalous diffraction. Induction of a random set of Bacillus stearothermophilus target genes under the new protocols generated soluble selenomethionyl proteinsmore » in good yield. Several selenomethionyl proteins were purified in good yields and three were subjected to amino acid analysis. Incorporation of selenomethionine was determined to be greater than 95% in one protein and greater than 98% in the other two. In the preceding paper [Zhao et al., this issue, pp. 87-93], the approach is further extended to production of [U-15N]- or [U-13C, U-15N]-labeled proteins. The approach thus appears suitable for high-throughput production of proteins for structure determination by X-ray crystallography or nuclear magnetic resonance spectroscopy.« less
Bhardwaj, Anshul; Casjens, Sherwood R; Cingolani, Gino
2014-02-01
Protein fibers are widespread in nature, but only a limited number of high-resolution structures have been determined experimentally. Unlike globular proteins, fibers are usually recalcitrant to form three-dimensional crystals, preventing single-crystal X-ray diffraction analysis. In the absence of three-dimensional crystals, X-ray fiber diffraction is a powerful tool to determine the internal symmetry of a fiber, but it rarely yields atomic resolution structural information on complex protein fibers. An 85-residue-long minimal coiled-coil repeat unit (MiCRU) was previously identified in the trimeric helical core of tail needle gp26, a fibrous protein emanating from the tail apparatus of the bacteriophage P22 virion. Here, evidence is provided that an MiCRU can be inserted in frame inside the gp26 helical core to generate a rationally extended fiber (gp26-2M) which, like gp26, retains a trimeric quaternary structure in solution. The 2.7 Å resolution crystal structure of this engineered fiber, which measures ∼320 Å in length and is only 20-35 Å wide, was determined. This structure, the longest for a trimeric protein fiber to be determined to such a high resolution, reveals the architecture of 22 consecutive trimerization heptads and provides a framework to decipher the structural determinants for protein fiber assembly, stability and flexibility.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bhardwaj, Anshul; Casjens, Sherwood R.; Cingolani, Gino, E-mail: gino.cingolani@jefferson.edu
2014-02-01
This study presents the crystal structure of a ∼320 Å long protein fiber generated by in-frame extension of its repeated helical coiled-coil core. Protein fibers are widespread in nature, but only a limited number of high-resolution structures have been determined experimentally. Unlike globular proteins, fibers are usually recalcitrant to form three-dimensional crystals, preventing single-crystal X-ray diffraction analysis. In the absence of three-dimensional crystals, X-ray fiber diffraction is a powerful tool to determine the internal symmetry of a fiber, but it rarely yields atomic resolution structural information on complex protein fibers. An 85-residue-long minimal coiled-coil repeat unit (MiCRU) was previously identifiedmore » in the trimeric helical core of tail needle gp26, a fibrous protein emanating from the tail apparatus of the bacteriophage P22 virion. Here, evidence is provided that an MiCRU can be inserted in frame inside the gp26 helical core to generate a rationally extended fiber (gp26-2M) which, like gp26, retains a trimeric quaternary structure in solution. The 2.7 Å resolution crystal structure of this engineered fiber, which measures ∼320 Å in length and is only 20–35 Å wide, was determined. This structure, the longest for a trimeric protein fiber to be determined to such a high resolution, reveals the architecture of 22 consecutive trimerization heptads and provides a framework to decipher the structural determinants for protein fiber assembly, stability and flexibility.« less
Relation between native ensembles and experimental structures of proteins
Best, Robert B.; Lindorff-Larsen, Kresten; DePristo, Mark A.; Vendruscolo, Michele
2006-01-01
Different experimental structures of the same protein or of proteins with high sequence similarity contain many small variations. Here we construct ensembles of “high-sequence similarity Protein Data Bank” (HSP) structures and consider the extent to which such ensembles represent the structural heterogeneity of the native state in solution. We find that different NMR measurements probing structure and dynamics of given proteins in solution, including order parameters, scalar couplings, and residual dipolar couplings, are remarkably well reproduced by their respective high-sequence similarity Protein Data Bank ensembles; moreover, we show that the effects of uncertainties in structure determination are insufficient to explain the results. These results highlight the importance of accounting for native-state protein dynamics in making comparisons with ensemble-averaged experimental data and suggest that even a modest number of structures of a protein determined under different conditions, or with small variations in sequence, capture a representative subset of the true native-state ensemble. PMID:16829580
What determines the spectrum of protein native state structures?
Lezon, Timothy R; Banavar, Jayanth R; Lesk, Arthur M; Maritan, Amos
2006-05-01
We present a brief summary of the key factors underlying protein structure, as developed in the investigations of Pauling, Ramachandran, and Rose. We then outline a simplified physical model of proteins that focuses on geometry and symmetry. Although this model superficially appears unrelated to the detailed chemical descriptions commonly applied to proteins, we show that it captures the essential elements of the chemistry and provides a unified framework for understanding the common characteristics of folded proteins. We suggest that the spectrum of protein native state structures is determined by geometry and symmetry and the role of the sequence is to choose its native state structure from this predetermined menu. 2006 Wiley-Liss, Inc.
2015-01-01
Identifying determinant(s) of protein thermostability is key for rational and data-driven protein engineering. By analyzing more than 130 pairs of mesophilic/(hyper)thermophilic proteins, we identified the quality (residue-wise energy) of hydrophobic interactions as a key factor for protein thermostability. This distinguishes our study from previous ones that investigated predominantly structural determinants. Considering this key factor, we successfully discriminated between pairs of mesophilic/(hyper)thermophilic proteins (discrimination accuracy: ∼80%) and searched for structural weak spots in E. coli dihydrofolate reductase (classification accuracy: 70%). PMID:24437522
Craig, George D.; Glass, Robert; Rupp, Bernhard
1997-01-01
A method for forming synthetic crystals of proteins in a carrier fluid by use of the dipole moments of protein macromolecules that self-align in the Helmholtz layer adjacent to an electrode. The voltage gradients of such layers easily exceed 10.sup.6 V/m. The synthetic protein crystals are subjected to x-ray crystallography to determine the conformational structure of the protein involved.
Covering complete proteomes with X-ray structures: A current snapshot
Mizianty, Marcin J.; Fan, Xiao; Yan, Jing; ...
2014-10-23
Structural genomics programs have developed and applied structure-determination pipelines to a wide range of protein targets, facilitating the visualization of macromolecular interactions and the understanding of their molecular and biochemical functions. The fundamental question of whether three-dimensional structures of all proteins and all functional annotations can be determined using X-ray crystallography is investigated. A first-of-its-kind large-scale analysis of crystallization propensity for all proteins encoded in 1953 fully sequenced genomes was performed. It is shown that current X-ray crystallographic knowhow combined with homology modeling can provide structures for 25% of modeling families (protein clusters for which structural models can be obtainedmore » through homology modeling), with at least one structural model produced for each Gene Ontology functional annotation. The coverage varies between superkingdoms, with 19% for eukaryotes, 35% for bacteria and 49% for archaea, and with those of viruses following the coverage values of their hosts. It is shown that the crystallization propensities of proteomes from the taxonomic superkingdoms are distinct. The use of knowledge-based target selection is shown to substantially increase the ability to produce X-ray structures. It is demonstrated that the human proteome has one of the highest attainable coverage values among eukaryotes, and GPCR membrane proteins suitable for X-ray structure determination were determined.« less
Gottstein, Daniel; Reckel, Sina; Dötsch, Volker; Güntert, Peter
2012-06-06
Nuclear magnetic resonance (NMR) structure calculations of the α-helical integral membrane proteins DsbB, GlpG, and halorhodopsin show that distance restraints from paramagnetic relaxation enhancement (PRE) can provide sufficient structural information to determine their structure with an accuracy of about 1.5 Å in the absence of other long-range conformational restraints. Our systematic study with simulated NMR data shows that about one spin label per transmembrane helix is necessary for obtaining enough PRE distance restraints to exclude wrong topologies, such as pseudo mirror images, if only limited other NMR restraints are available. Consequently, an experimentally realistic amount of PRE data enables α-helical membrane protein structure determinations that would not be feasible with the very limited amount of conventional NOESY data normally available for these systems. These findings are in line with our recent first de novo NMR structure determination of a heptahelical integral membrane protein, proteorhodopsin, that relied extensively on PRE data. Copyright © 2012 Elsevier Ltd. All rights reserved.
Kister, Alexander
2015-01-01
We present an alternative approach to protein 3D folding prediction based on determination of rules that specify distribution of “favorable” residues, that are mainly responsible for a given fold formation, and “unfavorable” residues, that are incompatible with that fold, in polypeptide sequences. The process of determining favorable and unfavorable residues is iterative. The starting assumptions are based on the general principles of protein structure formation as well as structural features peculiar to a protein fold under investigation. The initial assumptions are tested one-by-one for a set of all known proteins with a given structure. The assumption is accepted as a “rule of amino acid distribution” for the protein fold if it holds true for all, or near all, structures. If the assumption is not accepted as a rule, it can be modified to better fit the data and then tested again in the next step of the iterative search algorithm, or rejected. We determined the set of amino acid distribution rules for a large group of beta sandwich-like proteins characterized by a specific arrangement of strands in two beta sheets. It was shown that this set of rules is highly sensitive (~90%) and very specific (~99%) for identifying sequences of proteins with specified beta sandwich fold structure. The advantage of the proposed approach is that it does not require that query proteins have a high degree of homology to proteins with known structure. So long as the query protein satisfies residue distribution rules, it can be confidently assigned to its respective protein fold. Another advantage of our approach is that it allows for a better understanding of which residues play an essential role in protein fold formation. It may, therefore, facilitate rational protein engineering design. PMID:25625198
Elucidating Peptide and Protein Structure and Dynamics: UV Resonance Raman Spectroscopy
Oladepo, Sulayman A.; Xiong, Kan; Hong, Zhenmin; Asher, Sanford A.
2011-01-01
UV resonance Raman spectroscopy (UVRR) is a powerful method that has the requisite selectivity and sensitivity to incisively monitor biomolecular structure and dynamics in solution. In this perspective, we highlight applications of UVRR for studying peptide and protein structure and the dynamics of protein and peptide folding. UVRR spectral monitors of protein secondary structure, such as the Amide III3 band and the Cα-H band frequencies and intensities can be used to determine Ramachandran Ψ angle distributions for peptide bonds. These incisive, quantitative glimpses into conformation can be combined with kinetic T-jump methodologies to monitor the dynamics of biomolecular conformational transitions. The resulting UVRR structural insight is impressive in that it allows differentiation of, for example, different α-helix-like states that enable differentiating π- and 310- states from pure α-helices. These approaches can be used to determine the Gibbs free energy landscape of individual peptide bonds along the most important protein (un)folding coordinate. Future work will find spectral monitors that probe peptide bond activation barriers that control protein (un)folding mechanisms. In addition, UVRR studies of sidechain vibrations will probe the role of side chains in determining protein secondary, tertiary and quaternary structures. PMID:21379371
Craig, G.D.; Glass, R.; Rupp, B.
1997-01-28
A method is disclosed for forming synthetic crystals of proteins in a carrier fluid by use of the dipole moments of protein macromolecules that self-align in the Helmholtz layer adjacent to an electrode. The voltage gradients of such layers easily exceed 10{sup 6}V/m. The synthetic protein crystals are subjected to x-ray crystallography to determine the conformational structure of the protein involved. 2 figs.
Modeling complexes of modeled proteins.
Anishchenko, Ivan; Kundrotas, Petras J; Vakser, Ilya A
2017-03-01
Structural characterization of proteins is essential for understanding life processes at the molecular level. However, only a fraction of known proteins have experimentally determined structures. This fraction is even smaller for protein-protein complexes. Thus, structural modeling of protein-protein interactions (docking) primarily has to rely on modeled structures of the individual proteins, which typically are less accurate than the experimentally determined ones. Such "double" modeling is the Grand Challenge of structural reconstruction of the interactome. Yet it remains so far largely untested in a systematic way. We present a comprehensive validation of template-based and free docking on a set of 165 complexes, where each protein model has six levels of structural accuracy, from 1 to 6 Å C α RMSD. Many template-based docking predictions fall into acceptable quality category, according to the CAPRI criteria, even for highly inaccurate proteins (5-6 Å RMSD), although the number of such models (and, consequently, the docking success rate) drops significantly for models with RMSD > 4 Å. The results show that the existing docking methodologies can be successfully applied to protein models with a broad range of structural accuracy, and the template-based docking is much less sensitive to inaccuracies of protein models than the free docking. Proteins 2017; 85:470-478. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Thermostabilisation of membrane proteins for structural studies
Magnani, Francesca; Serrano-Vega, Maria J.; Shibata, Yoko; Abdul-Hussein, Saba; Lebon, Guillaume; Miller-Gallacher, Jennifer; Singhal, Ankita; Strege, Annette; Thomas, Jennifer A.; Tate, Christopher G.
2017-01-01
The thermostability of an integral membrane protein in detergent solution is a key parameter that dictates the likelihood of obtaining well-diffracting crystals suitable for structure determination. However, many mammalian membrane proteins are too unstable for crystallisation. We developed a thermostabilisation strategy based on systematic mutagenesis coupled to a radioligand-binding thermostability assay that can be applied to receptors, ion channels and transporters. It takes approximately 6-12 months to thermostabilise a G protein-coupled receptor (GPCR) containing 300 amino acid residues. The resulting thermostabilised membrane proteins are more easily crystallised and result in high-quality structures. This methodology has facilitated structure-based drug design applied to GPCRs, because it is possible to determine multiple structures of the thermostabilised receptors bound to low affinity ligands. Protocols and advice are given on how to develop thermostability assays for membrane proteins and how to combine mutations to make an optimally stable mutant suitable for structural studies. PMID:27466713
NMR studies of protein-nucleic acid interactions.
Varani, Gabriele; Chen, Yu; Leeper, Thomas C
2004-01-01
Protein-DNA and protein-RNA complexes play key functional roles in every living organism. Therefore, the elucidation of their structure and dynamics is an important goal of structural and molecular biology. Nuclear magnetic resonance (NMR) studies of protein and nucleic acid complexes have common features with studies of protein-protein complexes: the interaction surfaces between the molecules must be carefully delineated, the relative orientation of the two species needs to be accurately and precisely determined, and close intermolecular contacts defined by nuclear Overhauser effects (NOEs) must be obtained. However, differences in NMR properties (e.g., chemical shifts) and biosynthetic pathways for sample productions generate important differences. Chemical shift differences between the protein and nucleic acid resonances can aid the NMR structure determination process; however, the relatively limited dispersion of the RNA ribose resonances makes the process of assigning intermolecular NOEs more difficult. The analysis of the resulting structures requires computational tools unique to nucleic acid interactions. This chapter summarizes the most important elements of the structure determination by NMR of protein-nucleic acid complexes and their analysis. The main emphasis is on recent developments (e.g., residual dipolar couplings and new Web-based analysis tools) that have facilitated NMR studies of these complexes and expanded the type of biological problems to which NMR techniques of structural elucidation can now be applied.
A New Method for Determining Structure Ensemble: Application to a RNA Binding Di-Domain Protein.
Liu, Wei; Zhang, Jingfeng; Fan, Jing-Song; Tria, Giancarlo; Grüber, Gerhard; Yang, Daiwen
2016-05-10
Structure ensemble determination is the basis of understanding the structure-function relationship of a multidomain protein with weak domain-domain interactions. Paramagnetic relaxation enhancement has been proven a powerful tool in the study of structure ensembles, but there exist a number of challenges such as spin-label flexibility, domain dynamics, and overfitting. Here we propose a new (to our knowledge) method to describe structure ensembles using a minimal number of conformers. In this method, individual domains are considered rigid; the position of each spin-label conformer and the structure of each protein conformer are defined by three and six orthogonal parameters, respectively. First, the spin-label ensemble is determined by optimizing the positions and populations of spin-label conformers against intradomain paramagnetic relaxation enhancements with a genetic algorithm. Subsequently, the protein structure ensemble is optimized using a more efficient genetic algorithm-based approach and an overfitting indicator, both of which were established in this work. The method was validated using a reference ensemble with a set of conformers whose populations and structures are known. This method was also applied to study the structure ensemble of the tandem di-domain of a poly (U) binding protein. The determined ensemble was supported by small-angle x-ray scattering and nuclear magnetic resonance relaxation data. The ensemble obtained suggests an induced fit mechanism for recognition of target RNA by the protein. Copyright © 2016 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Integrated Structural Biology for α-Helical Membrane Protein Structure Determination.
Xia, Yan; Fischer, Axel W; Teixeira, Pedro; Weiner, Brian; Meiler, Jens
2018-04-03
While great progress has been made, only 10% of the nearly 1,000 integral, α-helical, multi-span membrane protein families are represented by at least one experimentally determined structure in the PDB. Previously, we developed the algorithm BCL::MP-Fold, which samples the large conformational space of membrane proteins de novo by assembling predicted secondary structure elements guided by knowledge-based potentials. Here, we present a case study of rhodopsin fold determination by integrating sparse and/or low-resolution restraints from multiple experimental techniques including electron microscopy, electron paramagnetic resonance spectroscopy, and nuclear magnetic resonance spectroscopy. Simultaneous incorporation of orthogonal experimental restraints not only significantly improved the sampling accuracy but also allowed identification of the correct fold, which is demonstrated by a protein size-normalized transmembrane root-mean-square deviation as low as 1.2 Å. The protocol developed in this case study can be used for the determination of unknown membrane protein folds when limited experimental restraints are available. Copyright © 2018 Elsevier Ltd. All rights reserved.
Asymmetric scoring functions for proteins
NASA Astrophysics Data System (ADS)
Lezon, Timothy; Holter, Neal; Maritan, Amos; Banavar, Jayanth
2003-03-01
The protein folding problem entails the prediction of the native state structure of a protein given the sequence of amino acids. In a coarse-grained description of a protein, an important ingredient for attempting this task is the determination of the effective energies of interaction between amino acids. We will discuss a simple approach for determining such interaction potentials from a training set of protein sequences and their experimentally determined native state structures. The key new ingredient in our study is the incorporation of the lack of symmetry in the effective interactions between amino acids. Our results, obtained using a set of 513 proteins, and their implications will be discussed.
Langó, Tamás; Róna, Gergely; Hunyadi-Gulyás, Éva; Turiák, Lilla; Varga, Julia; Dobson, László; Várady, György; Drahos, László; Vértessy, Beáta G; Medzihradszky, Katalin F; Szakács, Gergely; Tusnády, Gábor E
2017-02-13
Transmembrane proteins play crucial role in signaling, ion transport, nutrient uptake, as well as in maintaining the dynamic equilibrium between the internal and external environment of cells. Despite their important biological functions and abundance, less than 2% of all determined structures are transmembrane proteins. Given the persisting technical difficulties associated with high resolution structure determination of transmembrane proteins, additional methods, including computational and experimental techniques remain vital in promoting our understanding of their topologies, 3D structures, functions and interactions. Here we report a method for the high-throughput determination of extracellular segments of transmembrane proteins based on the identification of surface labeled and biotin captured peptide fragments by LC/MS/MS. We show that reliable identification of extracellular protein segments increases the accuracy and reliability of existing topology prediction algorithms. Using the experimental topology data as constraints, our improved prediction tool provides accurate and reliable topology models for hundreds of human transmembrane proteins.
Unraveling the meaning of chemical shifts in protein NMR.
Berjanskii, Mark V; Wishart, David S
2017-11-01
Chemical shifts are among the most informative parameters in protein NMR. They provide wealth of information about protein secondary and tertiary structure, protein flexibility, and protein-ligand binding. In this report, we review the progress in interpreting and utilizing protein chemical shifts that has occurred over the past 25years, with a particular focus on the large body of work arising from our group and other Canadian NMR laboratories. More specifically, this review focuses on describing, assessing, and providing some historical context for various chemical shift-based methods to: (1) determine protein secondary and super-secondary structure; (2) derive protein torsion angles; (3) assess protein flexibility; (4) predict residue accessible surface area; (5) refine 3D protein structures; (6) determine 3D protein structures and (7) characterize intrinsically disordered proteins. This review also briefly covers some of the methods that we previously developed to predict chemical shifts from 3D protein structures and/or protein sequence data. It is hoped that this review will help to increase awareness of the considerable utility of NMR chemical shifts in structural biology and facilitate more widespread adoption of chemical-shift based methods by the NMR spectroscopists, structural biologists, protein biophysicists, and biochemists worldwide. This article is part of a Special Issue entitled: Biophysics in Canada, edited by Lewis Kay, John Baenziger, Albert Berghuis and Peter Tieleman. Copyright © 2017 Elsevier B.V. All rights reserved.
Automated structure determination of proteins with the SAIL-FLYA NMR method.
Takeda, Mitsuhiro; Ikeya, Teppei; Güntert, Peter; Kainosho, Masatsune
2007-01-01
The labeling of proteins with stable isotopes enhances the NMR method for the determination of 3D protein structures in solution. Stereo-array isotope labeling (SAIL) provides an optimal stereospecific and regiospecific pattern of stable isotopes that yields sharpened lines, spectral simplification without loss of information, and the ability to collect rapidly and evaluate fully automatically the structural restraints required to solve a high-quality solution structure for proteins up to twice as large as those that can be analyzed using conventional methods. Here, we describe a protocol for the preparation of SAIL proteins by cell-free methods, including the preparation of S30 extract and their automated structure analysis using the FLYA algorithm and the program CYANA. Once efficient cell-free expression of the unlabeled or uniformly labeled target protein has been achieved, the NMR sample preparation of a SAIL protein can be accomplished in 3 d. A fully automated FLYA structure calculation can be completed in 1 d on a powerful computer system.
Accurate protein structure modeling using sparse NMR data and homologous structure information.
Thompson, James M; Sgourakis, Nikolaos G; Liu, Gaohua; Rossi, Paolo; Tang, Yuefeng; Mills, Jeffrey L; Szyperski, Thomas; Montelione, Gaetano T; Baker, David
2012-06-19
While information from homologous structures plays a central role in X-ray structure determination by molecular replacement, such information is rarely used in NMR structure determination because it can be incorrect, both locally and globally, when evolutionary relationships are inferred incorrectly or there has been considerable evolutionary structural divergence. Here we describe a method that allows robust modeling of protein structures of up to 225 residues by combining (1)H(N), (13)C, and (15)N backbone and (13)Cβ chemical shift data, distance restraints derived from homologous structures, and a physically realistic all-atom energy function. Accurate models are distinguished from inaccurate models generated using incorrect sequence alignments by requiring that (i) the all-atom energies of models generated using the restraints are lower than models generated in unrestrained calculations and (ii) the low-energy structures converge to within 2.0 Å backbone rmsd over 75% of the protein. Benchmark calculations on known structures and blind targets show that the method can accurately model protein structures, even with very remote homology information, to a backbone rmsd of 1.2-1.9 Å relative to the conventional determined NMR ensembles and of 0.9-1.6 Å relative to X-ray structures for well-defined regions of the protein structures. This approach facilitates the accurate modeling of protein structures using backbone chemical shift data without need for side-chain resonance assignments and extensive analysis of NOESY cross-peak assignments.
The Prediction of Botulinum Toxin Structure Based on in Silico and in Vitro Analysis
NASA Astrophysics Data System (ADS)
Suzuki, Tomonori; Miyazaki, Satoru
2011-01-01
Many of biological system mediated through protein-protein interactions. Knowledge of protein-protein complex structure is required for understanding the function. The determination of huge size and flexible protein-protein complex structure by experimental studies remains difficult, costly and five-consuming, therefore computational prediction of protein structures by homolog modeling and docking studies is valuable method. In addition, MD simulation is also one of the most powerful methods allowing to see the real dynamics of proteins. Here, we predict protein-protein complex structure of botulinum toxin to analyze its property. These bioinformatics methods are useful to report the relation between the flexibility of backbone structure and the activity.
HARMONY: a server for the assessment of protein structures
Pugalenthi, G.; Shameer, K.; Srinivasan, N.; Sowdhamini, R.
2006-01-01
Protein structure validation is an important step in computational modeling and structure determination. Stereochemical assessment of protein structures examine internal parameters such as bond lengths and Ramachandran (φ,ψ) angles. Gross structure prediction methods such as inverse folding procedure and structure determination especially at low resolution can sometimes give rise to models that are incorrect due to assignment of misfolds or mistracing of electron density maps. Such errors are not reflected as strain in internal parameters. HARMONY is a procedure that examines the compatibility between the sequence and the structure of a protein by assigning scores to individual residues and their amino acid exchange patterns after considering their local environments. Local environments are described by the backbone conformation, solvent accessibility and hydrogen bonding patterns. We are now providing HARMONY through a web server such that users can submit their protein structure files and, if required, the alignment of homologous sequences. Scores are mapped on the structure for subsequent examination that is useful to also recognize regions of possible local errors in protein structures. HARMONY server is located at PMID:16844999
Membrane protein structure determination — The next generation☆☆☆
Moraes, Isabel; Evans, Gwyndaf; Sanchez-Weatherby, Juan; Newstead, Simon; Stewart, Patrick D. Shaw
2014-01-01
The field of Membrane Protein Structural Biology has grown significantly since its first landmark in 1985 with the first three-dimensional atomic resolution structure of a membrane protein. Nearly twenty-six years later, the crystal structure of the beta2 adrenergic receptor in complex with G protein has contributed to another landmark in the field leading to the 2012 Nobel Prize in Chemistry. At present, more than 350 unique membrane protein structures solved by X-ray crystallography (http://blanco.biomol.uci.edu/mpstruc/exp/list, Stephen White Lab at UC Irvine) are available in the Protein Data Bank. The advent of genomics and proteomics initiatives combined with high-throughput technologies, such as automation, miniaturization, integration and third-generation synchrotrons, has enhanced membrane protein structure determination rate. X-ray crystallography is still the only method capable of providing detailed information on how ligands, cofactors, and ions interact with proteins, and is therefore a powerful tool in biochemistry and drug discovery. Yet the growth of membrane protein crystals suitable for X-ray diffraction studies amazingly remains a fine art and a major bottleneck in the field. It is often necessary to apply as many innovative approaches as possible. In this review we draw attention to the latest methods and strategies for the production of suitable crystals for membrane protein structure determination. In addition we also highlight the impact that third-generation synchrotron radiation has made in the field, summarizing the latest strategies used at synchrotron beamlines for screening and data collection from such demanding crystals. This article is part of a Special Issue entitled: Structural and biophysical characterisation of membrane protein-ligand binding. PMID:23860256
Gruss, Fabian; Hiller, Sebastian; Maier, Timm
2015-01-01
TamA is an Omp85 protein involved in autotransporter assembly in the outer membrane of Escherichia coli. It comprises a C-terminal 16-stranded transmembrane β-barrel as well as three periplasmic POTRA domains, and is a challenging target for structure determination. Here, we present a method for crystal structure determination of TamA, including recombinant expression in E. coli, detergent extraction, chromatographic purification, and bicelle crystallization in combination with seeding. As a result, crystals in space group P21212 are obtained, which diffract to 2.3 Å resolution. This protocol also serves as a template for structure determination of other outer membrane proteins, in particular of the Omp85 family.
Zeng, Jianyang; Roberts, Kyle E.; Zhou, Pei
2011-01-01
Abstract A major bottleneck in protein structure determination via nuclear magnetic resonance (NMR) is the lengthy and laborious process of assigning resonances and nuclear Overhauser effect (NOE) cross peaks. Recent studies have shown that accurate backbone folds can be determined using sparse NMR data, such as residual dipolar couplings (RDCs) or backbone chemical shifts. This opens a question of whether we can also determine the accurate protein side-chain conformations using sparse or unassigned NMR data. We attack this question by using unassigned nuclear Overhauser effect spectroscopy (NOESY) data, which records the through-space dipolar interactions between protons nearby in three-dimensional (3D) space. We propose a Bayesian approach with a Markov random field (MRF) model to integrate the likelihood function derived from observed experimental data, with prior information (i.e., empirical molecular mechanics energies) about the protein structures. We unify the side-chain structure prediction problem with the side-chain structure determination problem using unassigned NMR data, and apply the deterministic dead-end elimination (DEE) and A* search algorithms to provably find the global optimum solution that maximizes the posterior probability. We employ a Hausdorff-based measure to derive the likelihood of a rotamer or a pairwise rotamer interaction from unassigned NOESY data. In addition, we apply a systematic and rigorous approach to estimate the experimental noise in NMR data, which also determines the weighting factor of the data term in the scoring function derived from the Bayesian framework. We tested our approach on real NMR data of three proteins: the FF Domain 2 of human transcription elongation factor CA150 (FF2), the B1 domain of Protein G (GB1), and human ubiquitin. The promising results indicate that our algorithm can be applied in high-resolution protein structure determination. Since our approach does not require any NOE assignment, it can accelerate the NMR structure determination process. PMID:21970619
Resource for structure related information on transmembrane proteins
NASA Astrophysics Data System (ADS)
Tusnády, Gábor E.; Simon, István
Transmembrane proteins are involved in a wide variety of vital biological processes including transport of water-soluble molecules, flow of information and energy production. Despite significant efforts to determine the structures of these proteins, only a few thousand solved structures are known so far. Here, we review the various resources for structure-related information on these types of proteins ranging from the 3D structure to the topology and from the up-to-date databases to the various Internet sites and servers dealing with structure prediction and structure analysis. Abbreviations: 3D, three dimensional; PDB, Protein Data Bank; TMP, transmembrane protein.
Impact of genetic variation on three dimensional structure and function of proteins
Bhattacharya, Roshni; Rose, Peter W.; Burley, Stephen K.
2017-01-01
The Protein Data Bank (PDB; http://wwpdb.org) was established in 1971 as the first open access digital data resource in biology with seven protein structures as its initial holdings. The global PDB archive now contains more than 126,000 experimentally determined atomic level three-dimensional (3D) structures of biological macromolecules (proteins, DNA, RNA), all of which are freely accessible via the Internet. Knowledge of the 3D structure of the gene product can help in understanding its function and role in disease. Of particular interest in the PDB archive are proteins for which 3D structures of genetic variant proteins have been determined, thus revealing atomic-level structural differences caused by the variation at the DNA level. Herein, we present a systematic and qualitative analysis of such cases. We observe a wide range of structural and functional changes caused by single amino acid differences, including changes in enzyme activity, aggregation propensity, structural stability, binding, and dissociation, some in the context of large assemblies. Structural comparison of wild type and mutated proteins, when both are available, provide insights into atomic-level structural differences caused by the genetic variation. PMID:28296894
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mandal, Kalyaneswar; Pentelute, Brad L.; Tereshko, Valentina
2009-04-08
Racemic protein crystallography, enabled by total chemical synthesis, has allowed us to determine the X-ray structure of native scorpion toxin BmBKTx1; direct methods were used for phase determination. This is the first example of a protein racemate that crystallized in space group I41/a.
The structure of a cholesterol-trapping protein
Date February 28, 2003 Date Berkeley Lab Science Beat Berkeley Lab Science Beat The structure of a Institute researchers determined the three-dimensional structure of a protein that controls cholesterol level in the bloodstream. Knowing the structure of the protein, a cellular receptor that ensnares
Text Mining for Protein Docking
Badal, Varsha D.; Kundrotas, Petras J.; Vakser, Ilya A.
2015-01-01
The rapidly growing amount of publicly available information from biomedical research is readily accessible on the Internet, providing a powerful resource for predictive biomolecular modeling. The accumulated data on experimentally determined structures transformed structure prediction of proteins and protein complexes. Instead of exploring the enormous search space, predictive tools can simply proceed to the solution based on similarity to the existing, previously determined structures. A similar major paradigm shift is emerging due to the rapidly expanding amount of information, other than experimentally determined structures, which still can be used as constraints in biomolecular structure prediction. Automated text mining has been widely used in recreating protein interaction networks, as well as in detecting small ligand binding sites on protein structures. Combining and expanding these two well-developed areas of research, we applied the text mining to structural modeling of protein-protein complexes (protein docking). Protein docking can be significantly improved when constraints on the docking mode are available. We developed a procedure that retrieves published abstracts on a specific protein-protein interaction and extracts information relevant to docking. The procedure was assessed on protein complexes from Dockground (http://dockground.compbio.ku.edu). The results show that correct information on binding residues can be extracted for about half of the complexes. The amount of irrelevant information was reduced by conceptual analysis of a subset of the retrieved abstracts, based on the bag-of-words (features) approach. Support Vector Machine models were trained and validated on the subset. The remaining abstracts were filtered by the best-performing models, which decreased the irrelevant information for ~ 25% complexes in the dataset. The extracted constraints were incorporated in the docking protocol and tested on the Dockground unbound benchmark set, significantly increasing the docking success rate. PMID:26650466
Objective identification of residue ranges for the superposition of protein structures
2011-01-01
Background The automation of objectively selecting amino acid residue ranges for structure superpositions is important for meaningful and consistent protein structure analyses. So far there is no widely-used standard for choosing these residue ranges for experimentally determined protein structures, where the manual selection of residue ranges or the use of suboptimal criteria remain commonplace. Results We present an automated and objective method for finding amino acid residue ranges for the superposition and analysis of protein structures, in particular for structure bundles resulting from NMR structure calculations. The method is implemented in an algorithm, CYRANGE, that yields, without protein-specific parameter adjustment, appropriate residue ranges in most commonly occurring situations, including low-precision structure bundles, multi-domain proteins, symmetric multimers, and protein complexes. Residue ranges are chosen to comprise as many residues of a protein domain that increasing their number would lead to a steep rise in the RMSD value. Residue ranges are determined by first clustering residues into domains based on the distance variance matrix, and then refining for each domain the initial choice of residues by excluding residues one by one until the relative decrease of the RMSD value becomes insignificant. A penalty for the opening of gaps favours contiguous residue ranges in order to obtain a result that is as simple as possible, but not simpler. Results are given for a set of 37 proteins and compared with those of commonly used protein structure validation packages. We also provide residue ranges for 6351 NMR structures in the Protein Data Bank. Conclusions The CYRANGE method is capable of automatically determining residue ranges for the superposition of protein structure bundles for a large variety of protein structures. The method correctly identifies ordered regions. Global structure superpositions based on the CYRANGE residue ranges allow a clear presentation of the structure, and unnecessary small gaps within the selected ranges are absent. In the majority of cases, the residue ranges from CYRANGE contain fewer gaps and cover considerably larger parts of the sequence than those from other methods without significantly increasing the RMSD values. CYRANGE thus provides an objective and automatic method for standardizing the choice of residue ranges for the superposition of protein structures. PMID:21592348
DePietro, Paul J; Julfayev, Elchin S; McLaughlin, William A
2013-10-21
Protein Structure Initiative:Biology (PSI:Biology) is the third phase of PSI where protein structures are determined in high-throughput to characterize their biological functions. The transition to the third phase entailed the formation of PSI:Biology Partnerships which are composed of structural genomics centers and biomedical science laboratories. We present a method to examine the impact of protein structures determined under the auspices of PSI:Biology by measuring their rates of annotations. The mean numbers of annotations per structure and per residue are examined. These are designed to provide measures of the amount of structure to function connections that can be leveraged from each structure. One result is that PSI:Biology structures are found to have a higher rate of annotations than structures determined during the first two phases of PSI. A second result is that the subset of PSI:Biology structures determined through PSI:Biology Partnerships have a higher rate of annotations than those determined exclusive of those partnerships. Both results hold when the annotation rates are examined either at the level of the entire protein or for annotations that are known to fall at specific residues within the portion of the protein that has a determined structure. We conclude that PSI:Biology determines structures that are estimated to have a higher degree of biomedical interest than those determined during the first two phases of PSI based on a broad array of biomedical annotations. For the PSI:Biology Partnerships, we see that there is an associated added value that represents part of the progress toward the goals of PSI:Biology. We interpret the added value to mean that team-based structural biology projects that utilize the expertise and technologies of structural genomics centers together with biological laboratories in the community are conducted in a synergistic manner. We show that the annotation rates can be used in conjunction with established metrics, i.e. the numbers of structures and impact of publication records, to monitor the progress of PSI:Biology towards its goals of examining structure to function connections of high biomedical relevance. The metric provides an objective means to quantify the overall impact of PSI:Biology as it uses biomedical annotations from external sources.
2013-01-01
Background Protein Structure Initiative:Biology (PSI:Biology) is the third phase of PSI where protein structures are determined in high-throughput to characterize their biological functions. The transition to the third phase entailed the formation of PSI:Biology Partnerships which are composed of structural genomics centers and biomedical science laboratories. We present a method to examine the impact of protein structures determined under the auspices of PSI:Biology by measuring their rates of annotations. The mean numbers of annotations per structure and per residue are examined. These are designed to provide measures of the amount of structure to function connections that can be leveraged from each structure. Results One result is that PSI:Biology structures are found to have a higher rate of annotations than structures determined during the first two phases of PSI. A second result is that the subset of PSI:Biology structures determined through PSI:Biology Partnerships have a higher rate of annotations than those determined exclusive of those partnerships. Both results hold when the annotation rates are examined either at the level of the entire protein or for annotations that are known to fall at specific residues within the portion of the protein that has a determined structure. Conclusions We conclude that PSI:Biology determines structures that are estimated to have a higher degree of biomedical interest than those determined during the first two phases of PSI based on a broad array of biomedical annotations. For the PSI:Biology Partnerships, we see that there is an associated added value that represents part of the progress toward the goals of PSI:Biology. We interpret the added value to mean that team-based structural biology projects that utilize the expertise and technologies of structural genomics centers together with biological laboratories in the community are conducted in a synergistic manner. We show that the annotation rates can be used in conjunction with established metrics, i.e. the numbers of structures and impact of publication records, to monitor the progress of PSI:Biology towards its goals of examining structure to function connections of high biomedical relevance. The metric provides an objective means to quantify the overall impact of PSI:Biology as it uses biomedical annotations from external sources. PMID:24139526
Ryabov, Yaroslav; Fushman, David
2008-01-01
We present a simple and robust approach that uses the overall rotational diffusion tensor as a structural constraint for domain positioning in multidomain proteins and protein-protein complexes. This method offers the possibility to use NMR relaxation data for detailed structure characterization of such systems provided the structures of individual domains are available. The proposed approach extends the concept of using long-range information contained in the overall rotational diffusion tensor. In contrast to the existing approaches, we use both the principal axes and principal values of protein’s rotational diffusion tensor to determine not only the orientation but also the relative positioning of the individual domains in a protein. This is achieved by finding the domain arrangement in a molecule that provides the best possible agreement with all components of the overall rotational diffusion tensor derived from experimental data. The accuracy of the proposed approach is demonstrated for two protein systems with known domain arrangement and parameters of the overall tumbling: the HIV-1 protease homodimer and Maltose Binding Protein. The accuracy of the method and its sensitivity to domain positioning is also tested using computer-generated data for three protein complexes, for which the experimental diffusion tensors are not available. In addition, the proposed method is applied here to determine, for the first time, the structure of both open and closed conformations of Lys48-linked di-ubiquitin chain, where domain motions render impossible accurate structure determination by other methods. The proposed method opens new avenues for improving structure characterization of proteins in solution. PMID:17550252
Shao, W; Fernandez, E; Wilken, J; Thompson, D A; Siani, M A; West, J; Lolis, E; Schweitzer, B I
1998-12-11
The determination of high resolution three-dimensional structures by X-ray crystallography or nuclear magnetic resonance (NMR) is a time-consuming process. Here we describe an approach to circumvent the cloning and expression of a recombinant protein as well as screening for heavy atom derivatives. The selenomethionine-modified chemokine macrophage inflammatory protein-II (MIP-II) from human herpesvirus-8 has been produced by total chemical synthesis, crystallized, and characterized by NMR. The protein has a secondary structure typical of other chemokines and forms a monomer in solution. These results indicate that total chemical synthesis can be used to accelerate the determination of three-dimensional structures of new proteins identified in genome programs.
Ye, Shuji; Li, Hongchun; Yang, Weilai; Luo, Yi
2014-01-29
Accurate determination of protein structures at the interface is essential to understand the nature of interfacial protein interactions, but it can only be done with a few, very limited experimental methods. Here, we demonstrate for the first time that sum frequency generation vibrational spectroscopy can unambiguously differentiate the interfacial protein secondary structures by combining surface-sensitive amide I and amide III spectral signals. This combination offers a powerful tool to directly distinguish random-coil (disordered) and α-helical structures in proteins. From a systematic study on the interactions between several antimicrobial peptides (including LKα14, mastoparan X, cecropin P1, melittin, and pardaxin) and lipid bilayers, it is found that the spectral profiles of the random-coil and α-helical structures are well separated in the amide III spectra, appearing below and above 1260 cm(-1), respectively. For the peptides with a straight backbone chain, the strength ratio for the peaks of the random-coil and α-helical structures shows a distinct linear relationship with the fraction of the disordered structure deduced from independent NMR experiments reported in the literature. It is revealed that increasing the fraction of negatively charged lipids can induce a conformational change of pardaxin from random-coil to α-helical structures. This experimental protocol can be employed for determining the interfacial protein secondary structures and dynamics in situ and in real time without extraneous labels.
High-Resolution Protein Structure Determination by Serial Femtosecond Crystallography
Boutet, Sébastien; Lomb, Lukas; Williams, Garth J.; Barends, Thomas R. M.; Aquila, Andrew; Doak, R. Bruce; Weierstall, Uwe; DePonte, Daniel P.; Steinbrener, Jan; Shoeman, Robert L.; Messerschmidt, Marc; Barty, Anton; White, Thomas A.; Kassemeyer, Stephan; Kirian, Richard A.; Seibert, M. Marvin; Montanez, Paul A.; Kenney, Chris; Herbst, Ryan; Hart, Philip; Pines, Jack; Haller, Gunther; Gruner, Sol M.; Philipp, Hugh T.; Tate, Mark W.; Hromalik, Marianne; Koerner, Lucas J.; van Bakel, Niels; Morse, John; Ghonsalves, Wilfred; Arnlund, David; Bogan, Michael J.; Caleman, Carl; Fromme, Raimund; Hampton, Christina Y.; Hunter, Mark S.; Johansson, Linda C.; Katona, Gergely; Kupitz, Christopher; Liang, Mengning; Martin, Andrew V.; Nass, Karol; Redecke, Lars; Stellato, Francesco; Timneanu, Nicusor; Wang, Dingjie; Zatsepin, Nadia A.; Schafer, Donald; Defever, James; Neutze, Richard; Fromme, Petra; Spence, John C. H.; Chapman, Henry N.; Schlichting, Ilme
2013-01-01
Structure determination of proteins and other macromolecules has historically required the growth of high-quality crystals sufficiently large to diffract x-rays efficiently while withstanding radiation damage. We applied serial femtosecond crystallography (SFX) using an x-ray free-electron laser (XFEL) to obtain high-resolution structural information from microcrystals (less than 1 micrometer by 1 micrometer by 3 micrometers) of the well-characterized model protein lysozyme. The agreement with synchrotron data demonstrates the immediate relevance of SFX for analyzing the structure of the large group of difficult-to-crystallize molecules. PMID:22653729
Lua, Rhonald C; Wilson, Stephen J; Konecki, Daniel M; Wilkins, Angela D; Venner, Eric; Morgan, Daniel H; Lichtarge, Olivier
2016-01-04
The structure and function of proteins underlie most aspects of biology and their mutational perturbations often cause disease. To identify the molecular determinants of function as well as targets for drugs, it is central to characterize the important residues and how they cluster to form functional sites. The Evolutionary Trace (ET) achieves this by ranking the functional and structural importance of the protein sequence positions. ET uses evolutionary distances to estimate functional distances and correlates genotype variations with those in the fitness phenotype. Thus, ET ranks are worse for sequence positions that vary among evolutionarily closer homologs but better for positions that vary mostly among distant homologs. This approach identifies functional determinants, predicts function, guides the mutational redesign of functional and allosteric specificity, and interprets the action of coding sequence variations in proteins, people and populations. Now, the UET database offers pre-computed ET analyses for the protein structure databank, and on-the-fly analysis of any protein sequence. A web interface retrieves ET rankings of sequence positions and maps results to a structure to identify functionally important regions. This UET database integrates several ways of viewing the results on the protein sequence or structure and can be found at http://mammoth.bcm.tmc.edu/uet/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Ye, Shuji; Wei, Feng; Li, Hongchun; Tian, Kangzhen; Luo, Yi
2013-01-01
In situ and real-time characterization of molecular structures and orientation of proteins at interfaces is essential to understand the nature of interfacial protein interaction. Such work will undoubtedly provide important clues to control biointerface in a desired manner. Sum frequency generation vibrational spectroscopy (SFG-VS) has been demonstrated to be a powerful technique to study the interfacial structures and interactions at the molecular level. This paper first systematically introduced the methods for the calculation of the Raman polarizability tensor, infrared transition dipole moment, and SFG molecular hyperpolarizability tensor elements of proteins/peptides with the secondary structures of α-helix, 310-helix, antiparallel β-sheet, and parallel β-sheet, as well as the methodology to determine the orientation of interfacial protein secondary structures using SFG amide I spectra. After that, recent progresses on the determination of protein structure and orientation at different interfaces by SFG-VS were then reviewed, which provides a molecular-level understanding of the structures and interactions of interfacial proteins, specially understanding the nature of driving force behind such interactions. Although this review has focused on analysis of amide I spectra, it will be expected to offer a basic idea for the spectral analysis of amide III SFG signals and other complicated molecular systems such as RNA and DNA. Copyright © 2013 Elsevier Inc. All rights reserved.
Dynamics of endoglucanase catalytic domains: implications towards thermostability
USDA-ARS?s Scientific Manuscript database
The function of proteins is controlled by their dynamics inherently determined by their structure. Exploring the protein structure-dynamics relationship is important to develop an understanding of protein function that allows tapping the potential of economically important proteins, such as endogluc...
Higashiura, Akifumi; Ohta, Kazunori; Masaki, Mika; Sato, Masaru; Inaka, Koji; Tanaka, Hiroaki; Nakagawa, Atsushi
2013-11-01
Recently, many technical improvements in macromolecular X-ray crystallography have increased the number of structures deposited in the Protein Data Bank and improved the resolution limit of protein structures. Almost all high-resolution structures have been determined using a synchrotron radiation source in conjunction with cryocooling techniques, which are required in order to minimize radiation damage. However, optimization of cryoprotectant conditions is a time-consuming and difficult step. To overcome this problem, the high-pressure cryocooling method was developed (Kim et al., 2005) and successfully applied to many protein-structure analyses. In this report, using the high-pressure cryocooling method, the X-ray crystal structure of bovine H-protein was determined at 0.86 Å resolution. Structural comparisons between high- and ambient-pressure cryocooled crystals at ultra-high resolution illustrate the versatility of this technique. This is the first ultra-high-resolution X-ray structure obtained using the high-pressure cryocooling method.
Siponen, Marina I.; Wisniewska, Magdalena; Lehtiö, Lari; Johansson, Ida; Svensson, Linda; Raszewski, Grzegorz; Nilsson, Lennart; Sigvardsson, Mikael; Berglund, Helena
2010-01-01
The early B-cell factor (EBF) transcription factors are central regulators of development in several organs and tissues. This protein family shows low sequence similarity to other protein families, which is why structural information for the functional domains of these proteins is crucial to understand their biochemical features. We have used a modular approach to determine the crystal structures of the structured domains in the EBF family. The DNA binding domain reveals a striking resemblance to the DNA binding domains of the Rel homology superfamily of transcription factors but contains a unique zinc binding structure, termed zinc knuckle. Further the EBF proteins contain an IPT/TIG domain and an atypical helix-loop-helix domain with a novel type of dimerization motif. The data presented here provide insights into unique structural features of the EBF proteins and open possibilities for detailed molecular investigations of this important transcription factor family. PMID:20592035
Shen, Hong-Bin; Yi, Dong-Liang; Yao, Li-Xiu; Yang, Jie; Chou, Kuo-Chen
2008-10-01
In the postgenomic age, with the avalanche of protein sequences generated and relatively slow progress in determining their structures by experiments, it is important to develop automated methods to predict the structure of a protein from its sequence. The membrane proteins are a special group in the protein family that accounts for approximately 30% of all proteins; however, solved membrane protein structures only represent less than 1% of known protein structures to date. Although a great success has been achieved for developing computational intelligence techniques to predict secondary structures in both globular and membrane proteins, there is still much challenging work in this regard. In this review article, we firstly summarize the recent progress of automation methodology development in predicting protein secondary structures, especially in membrane proteins; we will then give some future directions in this research field.
Vögeli, Beat; Orts, Julien; Strotz, Dean; Chi, Celestine; Minges, Martina; Wälti, Marielle Aulikki; Güntert, Peter; Riek, Roland
2014-04-01
Confined by the Boltzmann distribution of the energies of the states, a multitude of structural states are inherent to biomolecules. For a detailed understanding of a protein's function, its entire structural landscape at atomic resolution and insight into the interconversion between all the structural states (i.e. dynamics) are required. Whereas dedicated trickery with NMR relaxation provides aspects of local dynamics, and 3D structure determination by NMR is well established, only recently have several attempts been made to formulate a more comprehensive description of the dynamics and the structural landscape of a protein. Here, a perspective is given on the use of exact NOEs (eNOEs) for the elucidation of structural ensembles of a protein describing the covered conformational space. Copyright © 2013 Elsevier Inc. All rights reserved.
Life in the fast lane for protein crystallization and X-ray crystallography
NASA Technical Reports Server (NTRS)
Pusey, Marc L.; Liu, Zhi-Jie; Tempel, Wolfram; Praissman, Jeremy; Lin, Dawei; Wang, Bi-Cheng; Gavira, Jose A.; Ng, Joseph D.
2005-01-01
The common goal for structural genomic centers and consortiums is to decipher as quickly as possible the three-dimensional structures for a multitude of recombinant proteins derived from known genomic sequences. Since X-ray crystallography is the foremost method to acquire atomic resolution for macromolecules, the limiting step is obtaining protein crystals that can be useful of structure determination. High-throughput methods have been developed in recent years to clone, express, purify, crystallize and determine the three-dimensional structure of a protein gene product rapidly using automated devices, commercialized kits and consolidated protocols. However, the average number of protein structures obtained for most structural genomic groups has been very low compared to the total number of proteins purified. As more entire genomic sequences are obtained for different organisms from the three kingdoms of life, only the proteins that can be crystallized and whose structures can be obtained easily are studied. Consequently, an astonishing number of genomic proteins remain unexamined. In the era of high-throughput processes, traditional methods in molecular biology, protein chemistry and crystallization are eclipsed by automation and pipeline practices. The necessity for high-rate production of protein crystals and structures has prevented the usage of more intellectual strategies and creative approaches in experimental executions. Fundamental principles and personal experiences in protein chemistry and crystallization are minimally exploited only to obtain "low-hanging fruit" protein structures. We review the practical aspects of today's high-throughput manipulations and discuss the challenges in fast pace protein crystallization and tools for crystallography. Structural genomic pipelines can be improved with information gained from low-throughput tactics that may help us reach the higher-bearing fruits. Examples of recent developments in this area are reported from the efforts of the Southeast Collaboratory for Structural Genomics (SECSG).
Life in the Fast Lane for Protein Crystallization and X-Ray Crystallography
NASA Technical Reports Server (NTRS)
Pusey, Marc L.; Liu, Zhi-Jie; Tempel, Wolfram; Praissman, Jeremy; Lin, Dawei; Wang, Bi-Cheng; Gavira, Jose A.; Ng, Joseph D.
2004-01-01
The common goal for structural genomic centers and consortiums is to decipher as quickly as possible the three-dimensional structures for a multitude of recombinant proteins derived from known genomic sequences. Since X-ray crystallography is the foremost method to acquire atomic resolution for macromolecules, the limiting step is obtaining protein crystals that can be useful of structure determination. High-throughput methods have been developed in recent years to clone, express, purify, crystallize and determine the three-dimensional structure of a protein gene product rapidly using automated devices, commercialized kits and consolidated protocols. However, the average number of protein structures obtained for most structural genomic groups has been very low compared to the total number of proteins purified. As more entire genomic sequences are obtained for different organisms from the three kingdoms of life, only the proteins that can be crystallized and whose structures can be obtained easily are studied. Consequently, an astonishing number of genomic proteins remain unexamined. In the era of high-throughput processes, traditional methods in molecular biology, protein chemistry and crystallization are eclipsed by automation and pipeline practices. The necessity for high rate production of protein crystals and structures has prevented the usage of more intellectual strategies and creative approaches in experimental executions. Fundamental principles and personal experiences in protein chemistry and crystallization are minimally exploited only to obtain "low-hanging fruit" protein structures. We review the practical aspects of today s high-throughput manipulations and discuss the challenges in fast pace protein crystallization and tools for crystallography. Structural genomic pipelines can be improved with information gained from low-throughput tactics that may help us reach the higher-bearing fruits. Examples of recent developments in this area are reported from the efforts of the Southeast Collaboratory for Structural Genomics (SECSG).
Li de La Sierra-Gallay, Ines; Collinet, Bruno; Graille, Marc; Quevillon-Cheruel, Sophie; Liger, Dominique; Minard, Philippe; Blondeau, Karine; Henckes, Gilles; Aufrère, Robert; Leulliot, Nicolas; Zhou, Cong-Zhao; Sorel, Isabelle; Ferrer, Jean-Luc; Poupon, Anne; Janin, Joël; van Tilbeurgh, Herman
2004-03-01
The protein product of the YGR205w gene of Saccharomyces cerevisiae was targeted as part of our yeast structural genomics project. YGR205w codes for a small (290 amino acids) protein with unknown structure and function. The only recognizable sequence feature is the presence of a Walker A motif (P loop) indicating a possible nucleotide binding/converting function. We determined the three-dimensional crystal structure of Se-methionine substituted protein using multiple anomalous diffraction. The structure revealed a well known mononucleotide fold and strong resemblance to the structure of small metabolite phosphorylating enzymes such as pantothenate and phosphoribulo kinase. Biochemical experiments show that YGR205w binds specifically ATP and, less tightly, ADP. The structure also revealed the presence of two bound sulphate ions, occupying opposite niches in a canyon that corresponds to the active site of the protein. One sulphate is bound to the P-loop in a position that corresponds to the position of beta-phosphate in mononucleotide protein ATP complex, suggesting the protein is indeed a kinase. The nature of the phosphate accepting substrate remains to be determined. Copyright 2004 Wiley-Liss, Inc.
Use of 13Cα Chemical-Shifts in Protein Structure Determination
Vila, Jorge A.; Ripoll, Daniel R.; Scheraga, Harold A.
2008-01-01
A physics-based method, aimed at determining protein structures by using NOE-derived distances together with observed and computed 13C chemical shifts, is proposed. The approach makes use of 13Cα chemical shifts, computed at the density functional level of theory, to obtain torsional constraints for all backbone and side-chain torsional angles without making a priori use of the occupancy of any region of the Ramachandran map by the amino acid residues. The torsional constraints are not fixed but are changed dynamically in each step of the procedure, following an iterative self-consistent approach intended to identify a set of conformations for which the computed 13Cα chemical shifts match the experimental ones. A test is carried out on a 76-amino acid all-α-helical protein, namely the B. Subtilis acyl carrier protein. It is shown that, starting from randomly generated conformations, the final protein models are more accurate than an existing NMR-derived structure model of this protein, in terms of both the agreement between predicted and observed 13Cα chemical shifts and some stereochemical quality indicators, and of similar accuracy as one of the protein models solved at a high level of resolution. The results provide evidence that this methodology can be used not only for structure determination but also for additional protein structure refinement of NMR-derived models deposited in the Protein Data Bank. PMID:17516673
Johnson, Derrick E.; Xue, Bin; Sickmeier, Megan D.; Meng, Jingwei; Cortese, Marc S.; Oldfield, Christopher J.; Le Gall, Tanguy; Dunker, A. Keith; Uversky, Vladimir N.
2012-01-01
The identification of intrinsically disordered proteins (IDPs) among the targets that fail to form satisfactory crystal structures in the Protein Structure Initiative represent a key to reducing the costs and time for determining three-dimensional structures of proteins. To help in this endeavor, several Protein Structure Initiative Centers were asked to send samples of both crystallizable proteins and proteins that failed to crystallize. The abundance of intrinsic disorder in these proteins was evaluated via computational analysis using Predictors of Natural Disordered Regions (PONDR®) and the potential cleavage sites and corresponding fragments were determined. Then, the target proteins were analyzed for intrinsic disorder by their resistance to limited proteolysis. The rates of tryptic digestion of sample target proteins were compared to those of lysozyme/myoglobin, apo-myoglobin and α-casein as standards of ordered, partially disordered and completely disordered proteins, respectively. At the next stage, the protein samples were subjected to both far-UV and near-UV circular dichroism (CD) analysis. For most of the samples, a good agreement between CD data, predictions of disorder and the rates of limited tryptic digestion was established. Further experimentation is being performed on a smaller subset of these samples in order to obtain more detailed information on the ordered/disordered nature of the proteins. PMID:22651963
NASA Astrophysics Data System (ADS)
Shen, Lei; Ulrich, Nathan W.; Mello, Charlene M.; Chen, Zhan
2015-01-01
Surface immobilized peptides/proteins have important applications such as antimicrobial coating and biosensing. We report a study of such peptides/proteins using sum frequency generation vibrational spectroscopy and ATR-FTIR. Immobilization on surfaces via physical adsorption and chemical coupling revealed that structures of chemically immobilized peptides are determined by immobilization sites, chemical environments, and substrate surfaces. In addition, controlling enzyme orientation by engineering the surface immobilization site demonstrated that structures can be well-correlated to measured chemical activity. This research facilitates the development of immobilized peptides/proteins with improved activities by optimizing their surface orientation and structure.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Osipiuk, J.; Gornicki, P.; Maj, L.
The structure of the YlxR protein of unknown function from Streptococcus pneumonia was determined to 1.35 Angstroms. YlxR is expressed from the nusA/infB operon in bacteria and belongs to a small protein family (COG2740) that shares a conserved sequence motif GRGA(Y/W). The family shows no significant amino-acid sequence similarity with other proteins. Three-wavelength diffraction MAD data were collected to 1.7 Angstroms from orthorhombic crystals using synchrotron radiation and the structure was determined using a semi-automated approach. The YlxR structure resembles a two-layer {alpha}/{beta} sandwich with the overall shape of a cylinder and shows no structural homology to proteins of knownmore » structure. Structural analysis revealed that the YlxR structure represents a new protein fold that belongs to the {alpha}-{beta} plait superfamily. The distribution of the electrostatic surface potential shows a large positively charged patch on one side of the protein, a feature often found in nucleic acid-binding proteins. Three sulfate ions bind to this positively charged surface. Analysis of potential binding sites uncovered several substantial clefts, with the largest spanning 3/4 of the protein. A similar distribution of binding sites and a large sharply bent cleft are observed in RNA-binding proteins that are unrelated in sequence and structure. It is proposed that YlxR is an RNA-binding protein.« less
Streptococcus pneumonia YlxR at 1.35 A shows a putative new fold.
Osipiuk, J; Górnicki, P; Maj, L; Dementieva, I; Laskowski, R; Joachimiak, A
2001-11-01
The structure of the YlxR protein of unknown function from Streptococcus pneumonia was determined to 1.35 A. YlxR is expressed from the nusA/infB operon in bacteria and belongs to a small protein family (COG2740) that shares a conserved sequence motif GRGA(Y/W). The family shows no significant amino-acid sequence similarity with other proteins. Three-wavelength diffraction MAD data were collected to 1.7 A from orthorhombic crystals using synchrotron radiation and the structure was determined using a semi-automated approach. The YlxR structure resembles a two-layer alpha/beta sandwich with the overall shape of a cylinder and shows no structural homology to proteins of known structure. Structural analysis revealed that the YlxR structure represents a new protein fold that belongs to the alpha-beta plait superfamily. The distribution of the electrostatic surface potential shows a large positively charged patch on one side of the protein, a feature often found in nucleic acid-binding proteins. Three sulfate ions bind to this positively charged surface. Analysis of potential binding sites uncovered several substantial clefts, with the largest spanning 3/4 of the protein. A similar distribution of binding sites and a large sharply bent cleft are observed in RNA-binding proteins that are unrelated in sequence and structure. It is proposed that YlxR is an RNA-binding protein.
Banigan, James R; Mandal, Kalyaneswar; Sawaya, Michael R; Thammavongsa, Vilasak; Hendrickx, Antoni P A; Schneewind, Olaf; Yeates, Todd O; Kent, Stephen B H
2010-10-01
The 50-residue snake venom protein L-omwaprin and its enantiomer D-omwaprin were prepared by total chemical synthesis. Radial diffusion assays were performed against Bacillus megaterium and Bacillus anthracis; both L- and D-omwaprin showed antibacterial activity against B. megaterium. The native protein enantiomer, made of L-amino acids, failed to crystallize readily. However, when a racemic mixture containing equal amounts of L- and D-omwaprin was used, diffraction quality crystals were obtained. The racemic protein sample crystallized in the centrosymmetric space group P2(1)/c and its structure was determined at atomic resolution (1.33 A) by a combination of Patterson and direct methods based on the strong scattering from the sulfur atoms in the eight cysteine residues per protein. Racemic crystallography once again proved to be a valuable method for obtaining crystals of recalcitrant proteins and for determining high-resolution X-ray structures by direct methods.
Protein Structure Determination using Metagenome sequence data
Ovchinnikov, Sergey; Park, Hahnbeom; Varghese, Neha; Huang, Po-Ssu; Pavlopoulos, Georgios A.; Kim, David E.; Kamisetty, Hetunandan; Kyrpides, Nikos C.; Baker, David
2017-01-01
Despite decades of work by structural biologists, there are still ~5200 protein families with unknown structure outside the range of comparative modeling. We show that Rosetta structure prediction guided by residue-residue contacts inferred from evolutionary information can accurately model proteins that belong to large families, and that metagenome sequence data more than triples the number of protein families with sufficient sequences for accurate modeling. We then integrate metagenome data, contact based structure matching and Rosetta structure calculations to generate models for 614 protein families with currently unknown structures; 206 are membrane proteins and 137 have folds not represented in the PDB. This approach provides the representative models for large protein families originally envisioned as the goal of the protein structure initiative at a fraction of the cost. PMID:28104891
NASA Astrophysics Data System (ADS)
Wanapun, Duangporn; Wampler, Ronald D.; Begue, Nathan J.; Simpson, Garth J.
2008-03-01
A new method for sensitive determination of protein secondary structure via multi-photon absorption is considered theoretically. Perturbation theory is developed to describe the polarization-dependent two-photon absorption (TPA) of α-helix and β-sheet protein secondary structures. The exciton coupling interactions responsible for relatively weak electronic circular dichroism in one-photon absorption are predicted to give rise to large changes in the TPA cross-section (>200%) for circular versus linear incident polarizations, defined as CLD. The CLD effect in TPA is electric dipole-allowed, which explains the much greater sensitivity. These predictions suggest TPA should be a viable means of sensitively probing protein secondary structure.
Native sulfur/chlorine SAD phasing for serial femtosecond crystallography
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nakane, Takanori; Song, Changyong; POSTECH, Pohang 790-784
Sulfur SAD phasing facilitates the structure determination of diverse native proteins using femtosecond X-rays from free-electron lasers via serial femtosecond crystallography. Serial femtosecond crystallography (SFX) allows structures to be determined with minimal radiation damage. However, phasing native crystals in SFX is not very common. Here, the structure determination of native lysozyme from single-wavelength anomalous diffraction (SAD) by utilizing the anomalous signal of sulfur and chlorine at a wavelength of 1.77 Å is successfully demonstrated. This sulfur SAD method can be applied to a wide range of proteins, which will improve the determination of native crystal structures.
Oezguen, Numan; Zhou, Bin; Negi, Surendra S.; Ivanciuc, Ovidiu; Schein, Catherine H.; Labesse, Gilles; Braun, Werner
2008-01-01
Similarities in sequences and 3D structures of allergenic proteins provide vital clues to identify clinically relevant IgE cross-reactivities. However, experimental 3D structures are available in the Protein Data Bank for only 5% (45/829) of all allergens catalogued in the Structural Database of Allergenic Proteins (SDAP, http://fermi.utmb.edu/SDAP). Here, an automated procedure was used to prepare 3D-models of all allergens where there was no experimentally determined 3D structure or high identity (95%) to another protein of known 3D structure. After a final selection by quality criteria, 433 reliable 3D models were retained and are available from our SDAP Website. The new 3D models extensively enhance our knowledge of allergen structures. As an example of their use, experimentally derived “continuous IgE epitopes” were mapped on 3 experimentally determined structures and 13 of our 3D-models of allergenic proteins. Large portions of these continuous sequences are not entirely on the surface and therefore cannot interact with IgE or other proteins. Only the surface exposed residues are constituents of “conformational IgE epitopes” which are not in all cases continuous in sequence. The surface exposed parts of the experimental determined continuous IgE epitopes showed a distinct statistical distribution as compared to their presence in typical protein-protein interfaces. The amino acids Ala, Ser, Asn, Gly and particularly Lys have a high propensity to occur in IgE binding sites. The 3D-models will facilitate further analysis of the common properties of IgE binding sites of allergenic proteins. PMID:18621419
NASA Astrophysics Data System (ADS)
Krokhotin, Andrey; Dokholyan, Nikolay V.
2017-07-01
Most proteins fold into unique three-dimensional (3D) structures that determine their biological functions, such as catalytic activity or macromolecular binding. Misfolded proteins can pose a threat through aberrant interactions with other proteins leading to a number of diseases including Alzheimer's disease, Parkinson's disease, and amyotrophic lateral sclerosis [1,2]. What does determine 3D structure of proteins? The first clue to this question came more than fifty years ago when Anfinsen demonstrated that unfolded proteins can spontaneously fold to their native 3D structures [3,4]. Anfinsen's experiments lead to the conclusion that proteins fold to unique native structure corresponding to the stable and kinetically accessible free energy minimum, and protein native structure is solely determined by its amino acid sequence. The question of how exactly proteins find their free energy minimum proved to be a difficult problem. One of the puzzles, initially pointed out by Levinthal, was an inconsistency between observed protein folding times and theoretical estimates. A self-avoiding polymer model of a globular protein of 100-residues length on a cubic lattice can sample at least 1047 states. Based on the assumption that conformational sampling occurs at the highest vibrational mode of proteins (∼picoseconds), predicted folding time by searching among all the possible conformations leads to ∼1027 years (much larger than the age of the universe) [5]. In contrast, observed protein folding time range from microseconds to minutes. Due to tremendous theoretical progress in protein folding field that has been achieved in past decades, the source of this inconsistency is currently understood that is thoroughly described in the review by Finkelstein et al. [6].
Bayesian peak picking for NMR spectra.
Cheng, Yichen; Gao, Xin; Liang, Faming
2014-02-01
Protein structure determination is a very important topic in structural genomics, which helps people to understand varieties of biological functions such as protein-protein interactions, protein-DNA interactions and so on. Nowadays, nuclear magnetic resonance (NMR) has often been used to determine the three-dimensional structures of protein in vivo. This study aims to automate the peak picking step, the most important and tricky step in NMR structure determination. We propose to model the NMR spectrum by a mixture of bivariate Gaussian densities and use the stochastic approximation Monte Carlo algorithm as the computational tool to solve the problem. Under the Bayesian framework, the peak picking problem is casted as a variable selection problem. The proposed method can automatically distinguish true peaks from false ones without preprocessing the data. To the best of our knowledge, this is the first effort in the literature that tackles the peak picking problem for NMR spectrum data using Bayesian method. Copyright © 2013. Production and hosting by Elsevier Ltd.
Structural Conservation of the Myoviridae Phage Tail Sheath Protein Fold
DOE Office of Scientific and Technical Information (OSTI.GOV)
Aksyuk, Anastasia A.; Kurochkina, Lidia P.; Fokine, Andrei
2012-02-21
Bacteriophage phiKZ is a giant phage that infects Pseudomonas aeruginosa, a human pathogen. The phiKZ virion consists of a 1450 {angstrom} diameter icosahedral head and a 2000 {angstrom}-long contractile tail. The structure of the whole virus was previously reported, showing that its tail organization in the extended state is similar to the well-studied Myovirus bacteriophage T4 tail. The crystal structure of a tail sheath protein fragment of phiKZ was determined to 2.4 {angstrom} resolution. Furthermore, crystal structures of two prophage tail sheath proteins were determined to 1.9 and 3.3 {angstrom} resolution. Despite low sequence identity between these proteins, all ofmore » these structures have a similar fold. The crystal structure of the phiKZ tail sheath protein has been fitted into cryo-electron-microscopy reconstructions of the extended tail sheath and of a polysheath. The structural rearrangement of the phiKZ tail sheath contraction was found to be similar to that of phage T4.« less
Optimization of protein-protein docking for predicting Fc-protein interactions.
Agostino, Mark; Mancera, Ricardo L; Ramsland, Paul A; Fernández-Recio, Juan
2016-11-01
The antibody crystallizable fragment (Fc) is recognized by effector proteins as part of the immune system. Pathogens produce proteins that bind Fc in order to subvert or evade the immune response. The structural characterization of the determinants of Fc-protein association is essential to improve our understanding of the immune system at the molecular level and to develop new therapeutic agents. Furthermore, Fc-binding peptides and proteins are frequently used to purify therapeutic antibodies. Although several structures of Fc-protein complexes are available, numerous others have not yet been determined. Protein-protein docking could be used to investigate Fc-protein complexes; however, improved approaches are necessary to efficiently model such cases. In this study, a docking-based structural bioinformatics approach is developed for predicting the structures of Fc-protein complexes. Based on the available set of X-ray structures of Fc-protein complexes, three regions of the Fc, loosely corresponding to three turns within the structure, were defined as containing the essential features for protein recognition and used as restraints to filter the initial docking search. Rescoring the filtered poses with an optimal scoring strategy provided a success rate of approximately 80% of the test cases examined within the top ranked 20 poses, compared to approximately 20% by the initial unrestrained docking. The developed docking protocol provides a significant improvement over the initial unrestrained docking and will be valuable for predicting the structures of currently undetermined Fc-protein complexes, as well as in the design of peptides and proteins that target Fc. Copyright © 2016 John Wiley & Sons, Ltd.
From protein structure to function via single crystal optical spectroscopy
Ronda, Luca; Bruno, Stefano; Bettati, Stefano; Storici, Paola; Mozzarelli, Andrea
2015-01-01
The more than 100,000 protein structures determined by X-ray crystallography provide a wealth of information for the characterization of biological processes at the molecular level. However, several crystallographic “artifacts,” including conformational selection, crystallization conditions and radiation damages, may affect the quality and the interpretation of the electron density maps, thus limiting the relevance of structure determinations. Moreover, for most of these structures, no functional data have been obtained in the crystalline state, thus posing serious questions on their validity in infereing protein mechanisms. In order to solve these issues, spectroscopic methods have been applied for the determination of equilibrium and kinetic properties of proteins in the crystalline state. These methods are UV-vis spectrophotometry, spectrofluorimetry, IR, EPR, Raman, and resonance Raman spectroscopy. Some of these approaches have been implemented with on-line instruments at X-ray synchrotron beamlines. Here, we provide an overview of investigations predominantly carried out in our laboratory by single crystal polarized absorption UV-vis microspectrophotometry, the most applied technique for the functional characterization of proteins in the crystalline state. Studies on hemoglobins, pyridoxal 5′-phosphate dependent enzymes and green fluorescent protein in the crystalline state have addressed key biological issues, leading to either straightforward structure-function correlations or limitations to structure-based mechanisms. PMID:25988179
DNA Nanotubes for NMR Structure Determination of Membrane Proteins
Bellot, Gaëtan; McClintock, Mark A.; Chou, James J; Shih, William M.
2013-01-01
Structure determination of integral membrane proteins by solution NMR represents one of the most important challenges of structural biology. A Residual-Dipolar-Coupling-based refinement approach can be used to solve the structure of membrane proteins up to 40 kDa in size, however, a weak-alignment medium that is detergent-resistant is required. Previously, availability of media suitable for weak alignment of membrane proteins was severely limited. We describe here a protocol for robust, large-scale synthesis of detergent-resistant DNA nanotubes that can be assembled into dilute liquid crystals for application as weak-alignment media in solution NMR structure determination of membrane proteins in detergent micelles. The DNA nanotubes are heterodimers of 400nm-long six-helix bundles each self-assembled from a M13-based p7308 scaffold strand and >170 short oligonucleotide staple strands. Compatibility with proteins bearing considerable positive charge as well as modulation of molecular alignment, towards collection of linearly independent restraints, can be introduced by reducing the negative charge of DNA nanotubes via counter ions and small DNA binding molecules. This detergent-resistant liquid-crystal media offers a number of properties conducive for membrane protein alignment, including high-yield production, thermal stability, buffer compatibility, and structural programmability. Production of sufficient nanotubes for 4–5 NMR experiments can be completed in one week by a single individual. PMID:23518667
Johann Deisenhofer, Crystallography, and Proteins
research using X-ray crystallography to elucidate for the first time the three-dimensional structure of a large membrane-bound protein molecule. This structure helped explain the process of photosynthesis, by a protein structure determination that relied on complementary features of two different beam lines
Overcoming barriers to membrane protein structure determination.
Bill, Roslyn M; Henderson, Peter J F; Iwata, So; Kunji, Edmund R S; Michel, Hartmut; Neutze, Richard; Newstead, Simon; Poolman, Bert; Tate, Christopher G; Vogel, Horst
2011-04-01
After decades of slow progress, the pace of research on membrane protein structures is beginning to quicken thanks to various improvements in technology, including protein engineering and microfocus X-ray diffraction. Here we review these developments and, where possible, highlight generic new approaches to solving membrane protein structures based on recent technological advances. Rational approaches to overcoming the bottlenecks in the field are urgently required as membrane proteins, which typically comprise ~30% of the proteomes of organisms, are dramatically under-represented in the structural database of the Protein Data Bank.
A 'periodic table' for protein structures.
Taylor, William R
2002-04-11
Current structural genomics programs aim systematically to determine the structures of all proteins coded in both human and other genomes, providing a complete picture of the number and variety of protein structures that exist. In the past, estimates have been made on the basis of the incomplete sample of structures currently known. These estimates have varied greatly (between 1,000 and 10,000; see for example refs 1 and 2), partly because of limited sample size but also owing to the difficulties of distinguishing one structure from another. This distinction is usually topological, based on the fold of the protein; however, in strict topological terms (neglecting to consider intra-chain cross-links), protein chains are open strings and hence are all identical. To avoid this trivial result, topologies are determined by considering secondary links in the form of intra-chain hydrogen bonds (secondary structure) and tertiary links formed by the packing of secondary structures. However, small additions to or loss of structure can make large changes to these perceived topologies and such subjective solutions are neither robust nor amenable to automation. Here I formalize both secondary and tertiary links to allow the rigorous and automatic definition of protein topology.
Fourier-based classification of protein secondary structures.
Shu, Jian-Jun; Yong, Kian Yan
2017-04-15
The correct prediction of protein secondary structures is one of the key issues in predicting the correct protein folded shape, which is used for determining gene function. Existing methods make use of amino acids properties as indices to classify protein secondary structures, but are faced with a significant number of misclassifications. The paper presents a technique for the classification of protein secondary structures based on protein "signal-plotting" and the use of the Fourier technique for digital signal processing. New indices are proposed to classify protein secondary structures by analyzing hydrophobicity profiles. The approach is simple and straightforward. Results show that the more types of protein secondary structures can be classified by means of these newly-proposed indices. Copyright © 2017 Elsevier Inc. All rights reserved.
Chakravorty, Dhruva K.; Wang, Bing; Lee, Chul Won; Guerra, Alfredo J.; Giedroc, David P.; Merz, Kenneth M.
2013-01-01
Correctly calculating the structure of metal coordination sites in a protein during the process of nuclear magnetic resonance (NMR) structure determination and refinement continues to be a challenging task. In this study, we present an accurate and convenient means by which to include metal ions in the NMR structure determination process using molecular dynamics (MD) constrained by NMR-derived data to obtain a realistic and physically viable description of the metal binding site(s). This method provides the framework to accurately portray the metal ions and its binding residues in a pseudo-bond or dummy-cation like approach, and is validated by quantum mechanical/molecular mechanical (QM/MM) MD calculations constrained by NMR-derived data. To illustrate this approach, we refine the zinc coordination complex structure of the zinc sensing transcriptional repressor protein Staphylococcus aureus CzrA, generating over 130 ns of MD and QM/MM MD NMR-data compliant sampling. In addition to refining the first coordination shell structure of the Zn(II) ion, this protocol benefits from being performed in a periodically replicated solvation environment including long-range electrostatics. We determine that unrestrained (not based on NMR data) MD simulations correlated to the NMR data in a time-averaged ensemble. The accurate solution structure ensemble of the metal-bound protein accurately describes the role of conformational dynamics in allosteric regulation of DNA binding by zinc and serves to validate our previous unrestrained MD simulations of CzrA. This methodology has potentially broad applicability in the structure determination of metal ion bound proteins, protein folding and metal template protein-design studies. PMID:23609042
Bayesian Peak Picking for NMR Spectra
Cheng, Yichen; Gao, Xin; Liang, Faming
2013-01-01
Protein structure determination is a very important topic in structural genomics, which helps people to understand varieties of biological functions such as protein-protein interactions, protein–DNA interactions and so on. Nowadays, nuclear magnetic resonance (NMR) has often been used to determine the three-dimensional structures of protein in vivo. This study aims to automate the peak picking step, the most important and tricky step in NMR structure determination. We propose to model the NMR spectrum by a mixture of bivariate Gaussian densities and use the stochastic approximation Monte Carlo algorithm as the computational tool to solve the problem. Under the Bayesian framework, the peak picking problem is casted as a variable selection problem. The proposed method can automatically distinguish true peaks from false ones without preprocessing the data. To the best of our knowledge, this is the first effort in the literature that tackles the peak picking problem for NMR spectrum data using Bayesian method. PMID:24184964
Ramakrishnan, Gayatri; Ochoa-Montaño, Bernardo; Raghavender, Upadhyayula S; Mudgal, Richa; Joshi, Adwait G; Chandra, Nagasuma R; Sowdhamini, Ramanathan; Blundell, Tom L; Srinivasan, Narayanaswamy
2015-01-01
The availability of the genome sequence of Mycobacterium tuberculosis H37Rv has encouraged determination of large numbers of protein structures and detailed definition of the biological information encoded therein; yet, the functions of many proteins in M. tuberculosis remain unknown. The emergence of multidrug resistant strains makes it a priority to exploit recent advances in homology recognition and structure prediction to re-analyse its gene products. Here we report the structural and functional characterization of gene products encoded in the M. tuberculosis genome, with the help of sensitive profile-based remote homology search and fold recognition algorithms resulting in an enhanced annotation of the proteome where 95% of the M. tuberculosis proteins were identified wholly or partly with information on structure or function. New information includes association of 244 proteins with 205 domain families and a separate set of new association of folds to 64 proteins. Extending structural information across uncharacterized protein families represented in the M. tuberculosis proteome, by determining superfamily relationships between families of known and unknown structures, has contributed to an enhancement in the knowledge of structural content. In retrospect, such superfamily relationships have facilitated recognition of probable structure and/or function for several uncharacterized protein families, eventually aiding recognition of probable functions for homologous proteins corresponding to such families. Gene products unique to mycobacteria for which no functions could be identified are 183. Of these 18 were determined to be M. tuberculosis specific. Such pathogen-specific proteins are speculated to harbour virulence factors required for pathogenesis. A re-annotated proteome of M. tuberculosis, with greater completeness of annotated proteins and domain assigned regions, provides a valuable basis for experimental endeavours designed to obtain a better understanding of pathogenesis and to accelerate the process of drug target discovery. Copyright © 2014 Elsevier Ltd. All rights reserved.
The structure and dynamics in solution of Cu(I) pseudoazurin from Paracoccus pantotrophus.
Thompson, G. S.; Leung, Y. C.; Ferguson, S. J.; Radford, S. E.; Redfield, C.
2000-01-01
The solution structure and backbone dynamics of Cu(I) pseudoazurin, a 123 amino acid electron transfer protein from Paracoccus pantotrophus, have been determined using NMR methods. The structure was calculated to high precision, with a backbone RMS deviation for secondary structure elements of 0.35+/-0.06 A, using 1,498 distance and 55 torsion angle constraints. The protein has a double-wound Greek-key fold with two alpha-helices toward its C-terminus, similar to that of its oxidized counterpart determined by X-ray crystallography. Comparison of the Cu(I) solution structure with the X-ray structure of the Cu(II) protein shows only small differences in the positions of some of the secondary structure elements. Order parameters S2, measured for amide nitrogens, indicate that the backbone of the protein is rigid on the picosecond to nanosecond timescale. PMID:10850794
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sachleben, Joseph R.; Adhikari, Aashish N.; Gawlak, Grzegorz
2016-11-10
We determined the NMR structure of a highly aromatic (13%) protein of unknown function, Aq1974 from Aquifex aeolicus (PDB ID: 5SYQ). The unusual sequence of this protein has a tryptophan content five times the normal (six tryptophan residues of 114 or 5.2% while the average tryptophan content is 1.0%) with the tryptophans occurring in a WXW motif. It has no detectable sequence homology with known protein structures. Although its NMR spectrum suggested that the protein was rich in β-sheet, upon resonance assignment and solution structure determination, the protein was found to be primarily α-helical with a small two-stranded β-sheet withmore » a novel fold that we have termed an Aromatic Claw. As this fold was previously unknown and the sequence unique, we submitted the sequence to CASP10 as a target for blind structural prediction. At the end of the competition, the sequence was classified a hard template based model; the structural relationship between the template and the experimental structure was small and the predictions all failed to predict the structure. CSRosetta was found to predict the secondary structure and its packing; however, it was found that there was little correlation between CSRosetta score and the RMSD between the CSRosetta structure and the NMR determined one. This work demonstrates that even in relatively small proteins, we do not yet have the capacity to accurately predict the fold for all primary sequences. The experimental discovery of new folds helps guide the improvement of structural prediction methods.« less
Pilla, Kala Bharath; Otting, Gottfried; Huber, Thomas
2017-03-07
Computational and nuclear magnetic resonance hybrid approaches provide efficient tools for 3D structure determination of small proteins, but currently available algorithms struggle to perform with larger proteins. Here we demonstrate a new computational algorithm that assembles the 3D structure of a protein from its constituent super-secondary structural motifs (Smotifs) with the help of pseudocontact shift (PCS) restraints for backbone amide protons, where the PCSs are produced from different metal centers. The algorithm, DINGO-PCS (3D assembly of Individual Smotifs to Near-native Geometry as Orchestrated by PCSs), employs the PCSs to recognize, orient, and assemble the constituent Smotifs of the target protein without any other experimental data or computational force fields. Using a universal Smotif database, the DINGO-PCS algorithm exhaustively enumerates any given Smotif. We benchmarked the program against ten different protein targets ranging from 100 to 220 residues with different topologies. For nine of these targets, the method was able to identify near-native Smotifs. Copyright © 2017 Elsevier Ltd. All rights reserved.
Ban, Yajing; L Prates, Luciana; Yu, Peiqiang
2017-10-18
This study was conducted to (1) determine protein and carbohydrate molecular structure profiles and (2) quantify the relationship between structural features and protein bioavailability of newly developed carinata and canola seeds for dairy cows by using Fourier transform infrared molecular spectroscopy. Results showed similarity in protein structural makeup within the entire protein structural region between carinata and canola seeds. The highest area ratios related to structural CHO, total CHO, and cellulosic compounds were obtained for carinata seeds. Carinata and canola seeds showed similar carbohydrate and protein molecular structures by multivariate analyses. Carbohydrate molecular structure profiles were highly correlated to protein rumen degradation and intestinal digestion characteristics. In conclusion, the molecular spectroscopy can detect inherent structural characteristics in carinata and canola seeds in which carbohydrate-relative structural features are related to protein metabolism and utilization. Protein and carbohydrate spectral profiles could be used as predictors of rumen protein bioavailability in cows.
Fast computational methods for predicting protein structure from primary amino acid sequence
Agarwal, Pratul Kumar [Knoxville, TN
2011-07-19
The present invention provides a method utilizing primary amino acid sequence of a protein, energy minimization, molecular dynamics and protein vibrational modes to predict three-dimensional structure of a protein. The present invention also determines possible intermediates in the protein folding pathway. The present invention has important applications to the design of novel drugs as well as protein engineering. The present invention predicts the three-dimensional structure of a protein independent of size of the protein, overcoming a significant limitation in the prior art.
New insights into structural determinants of prion protein folding and stability.
Benetti, Federico; Legname, Giuseppe
2015-01-01
Prions are the etiological agent of fatal neurodegenerative diseases called prion diseases or transmissible spongiform encephalopathies. These maladies can be sporadic, genetic or infectious disorders. Prions are due to post-translational modifications of the cellular prion protein leading to the formation of a β-sheet enriched conformer with altered biochemical properties. The molecular events causing prion formation in sporadic prion diseases are still elusive. Recently, we published a research elucidating the contribution of major structural determinants and environmental factors in prion protein folding and stability. Our study highlighted the crucial role of octarepeats in stabilizing prion protein; the presence of a highly enthalpically stable intermediate state in prion-susceptible species; and the role of disulfide bridge in preserving native fold thus avoiding the misfolding to a β-sheet enriched isoform. Taking advantage from these findings, in this work we present new insights into structural determinants of prion protein folding and stability.
Wong, Sienna; Jin, J-P
2017-01-01
Study of folded structure of proteins provides insights into their biological functions, conformational dynamics and molecular evolution. Current methods of elucidating folded structure of proteins are laborious, low-throughput, and constrained by various limitations. Arising from these methods is the need for a sensitive, quantitative, rapid and high-throughput method not only analysing the folded structure of proteins, but also to monitor dynamic changes under physiological or experimental conditions. In this focused review, we outline the foundation and limitations of current protein structure-determination methods prior to discussing the advantages of an emerging antibody epitope analysis for applications in structural, conformational and evolutionary studies of proteins. We discuss the application of this method using representative examples in monitoring allosteric conformation of regulatory proteins and the determination of the evolutionary lineage of related proteins and protein isoforms. The versatility of the method described herein is validated by the ability to modulate a variety of assay parameters to meet the needs of the user in order to monitor protein conformation. Furthermore, the assay has been used to clarify the lineage of troponin isoforms beyond what has been depicted by sequence homology alone, demonstrating the nonlinear evolutionary relationship between primary structure and tertiary structure of proteins. The antibody epitope analysis method is a highly adaptable technique of protein conformation elucidation, which can be easily applied without the need for specialized equipment or technical expertise. When applied in a systematic and strategic manner, this method has the potential to reveal novel and biomedically meaningful information for structure-function relationship and evolutionary lineage of proteins. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Neumann, Sindy; Hartmann, Holger; Martin-Galiano, Antonio J; Fuchs, Angelika; Frishman, Dmitrij
2012-03-01
Structural bioinformatics of membrane proteins is still in its infancy, and the picture of their fold space is only beginning to emerge. Because only a handful of three-dimensional structures are available, sequence comparison and structure prediction remain the main tools for investigating sequence-structure relationships in membrane protein families. Here we present a comprehensive analysis of the structural families corresponding to α-helical membrane proteins with at least three transmembrane helices. The new version of our CAMPS database (CAMPS 2.0) covers nearly 1300 eukaryotic, prokaryotic, and viral genomes. Using an advanced classification procedure, which is based on high-order hidden Markov models and considers both sequence similarity as well as the number of transmembrane helices and loop lengths, we identified 1353 structurally homogeneous clusters roughly corresponding to membrane protein folds. Only 53 clusters are associated with experimentally determined three-dimensional structures, and for these clusters CAMPS is in reasonable agreement with structure-based classification approaches such as SCOP and CATH. We therefore estimate that ∼1300 structures would need to be determined to provide a sufficient structural coverage of polytopic membrane proteins. CAMPS 2.0 is available at http://webclu.bio.wzw.tum.de/CAMPS2.0/. Copyright © 2011 Wiley Periodicals, Inc.
NASA Technical Reports Server (NTRS)
Weaver, D. L.
1982-01-01
Theoretical methods and solutions of the dynamics of protein folding, protein aggregation, protein structure, and the origin of life are discussed. The elements of a dynamic model representing the initial stages of protein folding are presented. The calculation and experimental determination of the model parameters are discussed. The use of computer simulation for modeling protein folding is considered.
Crystallization of PTP Domains.
Levy, Colin; Adams, James; Tabernero, Lydia
2016-01-01
Protein crystallography is the most powerful method to obtain atomic resolution information on the three-dimensional structure of proteins. An essential step towards determining the crystallographic structure of a protein is to produce good quality crystals from a concentrated sample of purified protein. These crystals are then used to obtain X-ray diffraction data necessary to determine the 3D structure by direct phasing or molecular replacement if the model of a homologous protein is available. Here, we describe the main approaches and techniques to obtain suitable crystals for X-ray diffraction. We include tools and guidance on how to evaluate and design the protein construct, how to prepare Se-methionine derivatized protein, how to assess the stability and quality of the sample, and how to crystallize and prepare crystals for diffraction experiments. While general strategies for protein crystallization are summarized, specific examples of the application of these strategies to the crystallization of PTP domains are discussed.
The fine art of integral membrane protein crystallisation.
Birch, James; Axford, Danny; Foadi, James; Meyer, Arne; Eckhardt, Annette; Thielmann, Yvonne; Moraes, Isabel
2018-05-18
Integral membrane proteins are among the most fascinating and important biomolecules as they play a vital role in many biological functions. Knowledge of their atomic structures is fundamental to the understanding of their biochemical function and key in many drug discovery programs. However, over the years, structure determination of integral membrane proteins has proven to be far from trivial, hence they are underrepresented in the protein data bank. Low expression levels, insolubility and instability are just a few of the many hurdles one faces when studying these proteins. X-ray crystallography has been the most used method to determine atomic structures of membrane proteins. However, the production of high quality membrane protein crystals is always very challenging, often seen more as art than a rational experiment. Here we review valuable approaches, methods and techniques to successful membrane protein crystallisation. Copyright © 2018 Diamond Light Source LTD. Published by Elsevier Inc. All rights reserved.
Dafforn, Timothy R; Rajendra, Jacindra; Halsall, David J; Serpell, Louise C; Rodger, Alison
2004-01-01
High-resolution structure determination of soluble globular proteins relies heavily on x-ray crystallography techniques. Such an approach is often ineffective for investigations into the structure of fibrous proteins as these proteins generally do not crystallize. Thus investigations into fibrous protein structure have relied on less direct methods such as x-ray fiber diffraction and circular dichroism. Ultraviolet linear dichroism has the potential to provide additional information on the structure of such biomolecular systems. However, existing systems are not optimized for the requirements of fibrous proteins. We have designed and built a low-volume (200 microL), low-wavelength (down to 180 nm), low-pathlength (100 microm), high-alignment flow-alignment system (couette) to perform ultraviolet linear dichroism studies on the fibers formed by a range of biomolecules. The apparatus has been tested using a number of proteins for which longer wavelength linear dichroism spectra had already been measured. The new couette cell has also been used to obtain data on two medically important protein fibers, the all-beta-sheet amyloid fibers of the Alzheimer's derived protein Abeta and the long-chain assemblies of alpha1-antitrypsin polymers.
Rigidity of transmembrane proteins determines their cluster shape
NASA Astrophysics Data System (ADS)
Jafarinia, Hamidreza; Khoshnood, Atefeh; Jalali, Mir Abbas
2016-01-01
Protein aggregation in cell membrane is vital for the majority of biological functions. Recent experimental results suggest that transmembrane domains of proteins such as α -helices and β -sheets have different structural rigidities. We use molecular dynamics simulation of a coarse-grained model of protein-embedded lipid membranes to investigate the mechanisms of protein clustering. For a variety of protein concentrations, our simulations under thermal equilibrium conditions reveal that the structural rigidity of transmembrane domains dramatically affects interactions and changes the shape of the cluster. We have observed stable large aggregates even in the absence of hydrophobic mismatch, which has been previously proposed as the mechanism of protein aggregation. According to our results, semiflexible proteins aggregate to form two-dimensional clusters, while rigid proteins, by contrast, form one-dimensional string-like structures. By assuming two probable scenarios for the formation of a two-dimensional triangular structure, we calculate the lipid density around protein clusters and find that the difference in lipid distribution around rigid and semiflexible proteins determines the one- or two-dimensional nature of aggregates. It is found that lipids move faster around semiflexible proteins than rigid ones. The aggregation mechanism suggested in this paper can be tested by current state-of-the-art experimental facilities.
Lee, Woonghee; Kim, Jin Hae; Westler, William M; Markley, John L
2011-06-15
PONDEROSA (Peak-picking Of Noe Data Enabled by Restriction of Shift Assignments) accepts input information consisting of a protein sequence, backbone and sidechain NMR resonance assignments, and 3D-NOESY ((13)C-edited and/or (15)N-edited) spectra, and returns assignments of NOESY crosspeaks, distance and angle constraints, and a reliable NMR structure represented by a family of conformers. PONDEROSA incorporates and integrates external software packages (TALOS+, STRIDE and CYANA) to carry out different steps in the structure determination. PONDEROSA implements internal functions that identify and validate NOESY peak assignments and assess the quality of the calculated three-dimensional structure of the protein. The robustness of the analysis results from PONDEROSA's hierarchical processing steps that involve iterative interaction among the internal and external modules. PONDEROSA supports a variety of input formats: SPARKY assignment table (.shifts) and spectrum file formats (.ucsf), XEASY proton file format (.prot), and NMR-STAR format (.star). To demonstrate the utility of PONDEROSA, we used the package to determine 3D structures of two proteins: human ubiquitin and Escherichia coli iron-sulfur scaffold protein variant IscU(D39A). The automatically generated structural constraints and ensembles of conformers were as good as or better than those determined previously by much less automated means. The program, in the form of binary code along with tutorials and reference manuals, is available at http://ponderosa.nmrfam.wisc.edu/.
Kemege, Kyle E.; Hickey, John M.; Lovell, Scott; Battaile, Kevin P.; Zhang, Yang; Hefty, P. Scott
2011-01-01
Chlamydia trachomatis is a medically important pathogen that encodes a relatively high percentage of proteins with unknown function. The three-dimensional structure of a protein can be very informative regarding the protein's functional characteristics; however, determining protein structures experimentally can be very challenging. Computational methods that model protein structures with sufficient accuracy to facilitate functional studies have had notable successes. To evaluate the accuracy and potential impact of computational protein structure modeling of hypothetical proteins encoded by Chlamydia, a successful computational method termed I-TASSER was utilized to model the three-dimensional structure of a hypothetical protein encoded by open reading frame (ORF) CT296. CT296 has been reported to exhibit functional properties of a divalent cation transcription repressor (DcrA), with similarity to the Escherichia coli iron-responsive transcriptional repressor, Fur. Unexpectedly, the I-TASSER model of CT296 exhibited no structural similarity to any DNA-interacting proteins or motifs. To validate the I-TASSER-generated model, the structure of CT296 was solved experimentally using X-ray crystallography. Impressively, the ab initio I-TASSER-generated model closely matched (2.72-Å Cα root mean square deviation [RMSD]) the high-resolution (1.8-Å) crystal structure of CT296. Modeled and experimentally determined structures of CT296 share structural characteristics of non-heme Fe(II) 2-oxoglutarate-dependent enzymes, although key enzymatic residues are not conserved, suggesting a unique biochemical process is likely associated with CT296 function. Additionally, functional analyses did not support prior reports that CT296 has properties shared with divalent cation repressors such as Fur. PMID:21965559
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kemege, Kyle E.; Hickey, John M.; Lovell, Scott
2012-02-13
Chlamydia trachomatis is a medically important pathogen that encodes a relatively high percentage of proteins with unknown function. The three-dimensional structure of a protein can be very informative regarding the protein's functional characteristics; however, determining protein structures experimentally can be very challenging. Computational methods that model protein structures with sufficient accuracy to facilitate functional studies have had notable successes. To evaluate the accuracy and potential impact of computational protein structure modeling of hypothetical proteins encoded by Chlamydia, a successful computational method termed I-TASSER was utilized to model the three-dimensional structure of a hypothetical protein encoded by open reading frame (ORF)more » CT296. CT296 has been reported to exhibit functional properties of a divalent cation transcription repressor (DcrA), with similarity to the Escherichia coli iron-responsive transcriptional repressor, Fur. Unexpectedly, the I-TASSER model of CT296 exhibited no structural similarity to any DNA-interacting proteins or motifs. To validate the I-TASSER-generated model, the structure of CT296 was solved experimentally using X-ray crystallography. Impressively, the ab initio I-TASSER-generated model closely matched (2.72-{angstrom} C{alpha} root mean square deviation [RMSD]) the high-resolution (1.8-{angstrom}) crystal structure of CT296. Modeled and experimentally determined structures of CT296 share structural characteristics of non-heme Fe(II) 2-oxoglutarate-dependent enzymes, although key enzymatic residues are not conserved, suggesting a unique biochemical process is likely associated with CT296 function. Additionally, functional analyses did not support prior reports that CT296 has properties shared with divalent cation repressors such as Fur.« less
Purely Structural Protein Scoring Functions Using Support Vector Machine and Ensemble Learning.
Mirzaei, Shokoufeh; Sidi, Tomer; Keasar, Chen; Crivelli, Silvia
2016-08-24
The function of a protein is determined by its structure, which creates a need for efficient methods of protein structure determination to advance scientific and medical research. Because current experimental structure determination methods carry a high price tag, computational predictions are highly desirable. Given a protein sequence, computational methods produce numerous 3D structures known as decoys. However, selection of the best quality decoys is challenging as the end users can handle only a few ones. Therefore, scoring functions are central to decoy selection. They combine measurable features into a single number indicator of decoy quality. Unfortunately, current scoring functions do not consistently select the best decoys. Machine learning techniques offer great potential to improve decoy scoring. This paper presents two machine-learning based scoring functions to predict the quality of proteins structures, i.e., the similarity between the predicted structure and the experimental one without knowing the latter. We use different metrics to compare these scoring functions against three state-of-the-art scores. This is a first attempt at comparing different scoring functions using the same non-redundant dataset for training and testing and the same features. The results show that adding informative features may be more significant than the method used.
De Novo Protein Structure Prediction
NASA Astrophysics Data System (ADS)
Hung, Ling-Hong; Ngan, Shing-Chung; Samudrala, Ram
An unparalleled amount of sequence data is being made available from large-scale genome sequencing efforts. The data provide a shortcut to the determination of the function of a gene of interest, as long as there is an existing sequenced gene with similar sequence and of known function. This has spurred structural genomic initiatives with the goal of determining as many protein folds as possible (Brenner and Levitt, 2000; Burley, 2000; Brenner, 2001; Heinemann et al., 2001). The purpose of this is twofold: First, the structure of a gene product can often lead to direct inference of its function. Second, since the function of a protein is dependent on its structure, direct comparison of the structures of gene products can be more sensitive than the comparison of sequences of genes for detecting homology. Presently, structural determination by crystallography and NMR techniques is still slow and expensive in terms of manpower and resources, despite attempts to automate the processes. Computer structure prediction algorithms, while not providing the accuracy of the traditional techniques, are extremely quick and inexpensive and can provide useful low-resolution data for structure comparisons (Bonneau and Baker, 2001). Given the immense number of structures which the structural genomic projects are attempting to solve, there would be a considerable gain even if the computer structure prediction approach were applicable to a subset of proteins.
Modeling Structure and Dynamics of Protein Complexes with SAXS Profiles
Schneidman-Duhovny, Dina; Hammel, Michal
2018-01-01
Small-angle X-ray scattering (SAXS) is an increasingly common and useful technique for structural characterization of molecules in solution. A SAXS experiment determines the scattering intensity of a molecule as a function of spatial frequency, termed SAXS profile. SAXS profiles can be utilized in a variety of molecular modeling applications, such as comparing solution and crystal structures, structural characterization of flexible proteins, assembly of multi-protein complexes, and modeling of missing regions in the high-resolution structure. Here, we describe protocols for modeling atomic structures based on SAXS profiles. The first protocol is for comparing solution and crystal structures including modeling of missing regions and determination of the oligomeric state. The second protocol performs multi-state modeling by finding a set of conformations and their weights that fit the SAXS profile starting from a single-input structure. The third protocol is for protein-protein docking based on the SAXS profile of the complex. We describe the underlying software, followed by demonstrating their application on interleukin 33 (IL33) with its primary receptor ST2 and DNA ligase IV-XRCC4 complex. PMID:29605933
High-throughput Crystallography for Structural Genomics
Joachimiak, Andrzej
2009-01-01
Protein X-ray crystallography recently celebrated its 50th anniversary. The structures of myoglobin and hemoglobin determined by Kendrew and Perutz provided the first glimpses into the complex protein architecture and chemistry. Since then, the field of structural molecular biology has experienced extraordinary progress and now over 53,000 proteins structures have been deposited into the Protein Data Bank. In the past decade many advances in macromolecular crystallography have been driven by world-wide structural genomics efforts. This was made possible because of third-generation synchrotron sources, structure phasing approaches using anomalous signal and cryo-crystallography. Complementary progress in molecular biology, proteomics, hardware and software for crystallographic data collection, structure determination and refinement, computer science, databases, robotics and automation improved and accelerated many processes. These advancements provide the robust foundation for structural molecular biology and assure strong contribution to science in the future. In this report we focus mainly on reviewing structural genomics high-throughput X-ray crystallography technologies and their impact. PMID:19765976
Medrano, Francisco Javier; de Souza, Cristiane Santos; Romero, Antonio; Balan, Andrea
2014-01-01
The uptake of maltose and related sugars in Gram-negative bacteria is mediated by an ABC transporter encompassing a periplasmic component (the maltose-binding protein or MalE), a pore-forming membrane protein (MalF and MalG) and a membrane-associated ATPase (MalK). In the present study, the structure determination of the apo form of the putative maltose/trehalose-binding protein (Xac-MalE) from the citrus pathogen Xanthomonas citri in space group P6522 is described. The crystals contained two protein molecules in the asymmetric unit and diffracted to 2.8 Å resolution. Xac-MalE conserves the structural and functional features of sugar-binding proteins and a ligand-binding pocket with similar characteristics to eight different orthologues, including the residues for maltose and trehalose interaction. This is the first structure of a sugar-binding protein from a phytopathogenic bacterium, which is highly conserved in all species from the Xanthomonas genus. PMID:24817711
VoroMQA: Assessment of protein structure quality using interatomic contact areas.
Olechnovič, Kliment; Venclovas, Česlovas
2017-06-01
In the absence of experimentally determined protein structure many biological questions can be addressed using computational structural models. However, the utility of protein structural models depends on their quality. Therefore, the estimation of the quality of predicted structures is an important problem. One of the approaches to this problem is the use of knowledge-based statistical potentials. Such methods typically rely on the statistics of distances and angles of residue-residue or atom-atom interactions collected from experimentally determined structures. Here, we present VoroMQA (Voronoi tessellation-based Model Quality Assessment), a new method for the estimation of protein structure quality. Our method combines the idea of statistical potentials with the use of interatomic contact areas instead of distances. Contact areas, derived using Voronoi tessellation of protein structure, are used to describe and seamlessly integrate both explicit interactions between protein atoms and implicit interactions of protein atoms with solvent. VoroMQA produces scores at atomic, residue, and global levels, all in the fixed range from 0 to 1. The method was tested on the CASP data and compared to several other single-model quality assessment methods. VoroMQA showed strong performance in the recognition of the native structure and in the structural model selection tests, thus demonstrating the efficacy of interatomic contact areas in estimating protein structure quality. The software implementation of VoroMQA is freely available as a standalone application and as a web server at http://bioinformatics.lt/software/voromqa. Proteins 2017; 85:1131-1145. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Structure of the ordered hydration of amino acids in proteins: analysis of crystal structures
DOE Office of Scientific and Technical Information (OSTI.GOV)
Biedermannová, Lada, E-mail: lada.biedermannova@ibt.cas.cz; Schneider, Bohdan
2015-10-27
The hydration of protein crystal structures was studied at the level of individual amino acids. The dependence of the number of water molecules and their preferred spatial localization on various parameters, such as solvent accessibility, secondary structure and side-chain conformation, was determined. Crystallography provides unique information about the arrangement of water molecules near protein surfaces. Using a nonredundant set of 2818 protein crystal structures with a resolution of better than 1.8 Å, the extent and structure of the hydration shell of all 20 standard amino-acid residues were analyzed as function of the residue conformation, secondary structure and solvent accessibility. Themore » results show how hydration depends on the amino-acid conformation and the environment in which it occurs. After conformational clustering of individual residues, the density distribution of water molecules was compiled and the preferred hydration sites were determined as maxima in the pseudo-electron-density representation of water distributions. Many hydration sites interact with both main-chain and side-chain amino-acid atoms, and several occurrences of hydration sites with less canonical contacts, such as carbon–donor hydrogen bonds, OH–π interactions and off-plane interactions with aromatic heteroatoms, are also reported. Information about the location and relative importance of the empirically determined preferred hydration sites in proteins has applications in improving the current methods of hydration-site prediction in molecular replacement, ab initio protein structure prediction and the set-up of molecular-dynamics simulations.« less
Sivakolundu, Sivashankar G; Mabrouk, Patricia Ann
2003-05-01
The complete solution structure of ferrocytochrome c in 30% acetonitrile/70% water has been determined using high-field 1D and 2D (1)H NMR methods and deposited in the Protein Data Bank with codes 1LC1 and 1LC2. This is the first time a complete solution protein structure has been determined for a protein in nonaqueous media. Ferrocyt c retains a native protein secondary structure (five alpha-helices and two omega loops) in 30% acetonitrile. H18 and M80 residues are the axial heme ligands, as in aqueous solution. Residues believed to be axial heme ligands in the alkaline-like conformers of ferricyt c, specifically H33 and K72, are positioned close to the heme iron. The orientations of both heme propionates are markedly different in 30% acetonitrile/70% water. Comparative structural analysis of reduced cyt c in 30% acetonitrile/70% water solution with cyt c in different environments has given new insight into the cyt c folding mechanism, the electron transfer pathway, and cell apoptosis.
The role of protein structural analysis in the next generation sequencing era.
Yue, Wyatt W; Froese, D Sean; Brennan, Paul E
2014-01-01
Proteins are macromolecules that serve a cell's myriad processes and functions in all living organisms via dynamic interactions with other proteins, small molecules and cellular components. Genetic variations in the protein-encoding regions of the human genome account for >85% of all known Mendelian diseases, and play an influential role in shaping complex polygenic diseases. Proteins also serve as the predominant target class for the design of small molecule drugs to modulate their activity. Knowledge of the shape and form of proteins, by means of their three-dimensional structures, is therefore instrumental to understanding their roles in disease and their potentials for drug development. In this chapter we outline, with the wide readership of non-structural biologists in mind, the various experimental and computational methods available for protein structure determination. We summarize how the wealth of structure information, contributed to a large extent by the technological advances in structure determination to date, serves as a useful tool to decipher the molecular basis of genetic variations for disease characterization and diagnosis, particularly in the emerging era of genomic medicine, and becomes an integral component in the modern day approach towards rational drug development.
A Circular Dichroism Reference Database for Membrane Proteins
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wallace,B.; Wien, F.; Stone, T.
2006-01-01
Membrane proteins are a major product of most genomes and the target of a large number of current pharmaceuticals, yet little information exists on their structures because of the difficulty of crystallising them; hence for the most part they have been excluded from structural genomics programme targets. Furthermore, even methods such as circular dichroism (CD) spectroscopy which seek to define secondary structure have not been fully exploited because of technical limitations to their interpretation for membrane embedded proteins. Empirical analyses of circular dichroism (CD) spectra are valuable for providing information on secondary structures of proteins. However, the accuracy of themore » results depends on the appropriateness of the reference databases used in the analyses. Membrane proteins have different spectral characteristics than do soluble proteins as a result of the low dielectric constants of membrane bilayers relative to those of aqueous solutions (Chen & Wallace (1997) Biophys. Chem. 65:65-74). To date, no CD reference database exists exclusively for the analysis of membrane proteins, and hence empirical analyses based on current reference databases derived from soluble proteins are not adequate for accurate analyses of membrane protein secondary structures (Wallace et al (2003) Prot. Sci. 12:875-884). We have therefore created a new reference database of CD spectra of integral membrane proteins whose crystal structures have been determined. To date it contains more than 20 proteins, and spans the range of secondary structures from mostly helical to mostly sheet proteins. This reference database should enable more accurate secondary structure determinations of membrane embedded proteins and will become one of the reference database options in the CD calculation server DICHROWEB (Whitmore & Wallace (2004) NAR 32:W668-673).« less
Li, Congmin; Lim, Sunghyuk; Braunewell, Karl H; Ames, James B
2016-01-01
Visinin-like protein 3 (VILIP-3) belongs to a family of Ca2+-myristoyl switch proteins that regulate signal transduction in the brain and retina. Here we analyze Ca2+ binding, characterize Ca2+-induced conformational changes, and determine the NMR structure of myristoylated VILIP-3. Three Ca2+ bind cooperatively to VILIP-3 at EF2, EF3 and EF4 (KD = 0.52 μM and Hill slope of 1.8). NMR assignments, mutagenesis and structural analysis indicate that the covalently attached myristoyl group is solvent exposed in Ca2+-bound VILIP-3, whereas Ca2+-free VILIP-3 contains a sequestered myristoyl group that interacts with protein residues (E26, Y64, V68), which are distinct from myristate contacts seen in other Ca2+-myristoyl switch proteins. The myristoyl group in VILIP-3 forms an unusual L-shaped structure that places the C14 methyl group inside a shallow protein groove, in contrast to the much deeper myristoyl binding pockets observed for recoverin, NCS-1 and GCAP1. Thus, the myristoylated VILIP-3 protein structure determined in this study is quite different from those of other known myristoyl switch proteins (recoverin, NCS-1, and GCAP1). We propose that myristoylation serves to fine tune the three-dimensional structures of neuronal calcium sensor proteins as a means of generating functional diversity.
Relationships between residue Voronoi volume and sequence conservation in proteins.
Liu, Jen-Wei; Cheng, Chih-Wen; Lin, Yu-Feng; Chen, Shao-Yu; Hwang, Jenn-Kang; Yen, Shih-Chung
2018-02-01
Functional and biophysical constraints can cause different levels of sequence conservation in proteins. Previously, structural properties, e.g., relative solvent accessibility (RSA) and packing density of the weighted contact number (WCN), have been found to be related to protein sequence conservation (CS). The Voronoi volume has recently been recognized as a new structural property of the local protein structural environment reflecting CS. However, for surface residues, it is sensitive to water molecules surrounding the protein structure. Herein, we present a simple structural determinant termed the relative space of Voronoi volume (RSV); it uses the Voronoi volume and the van der Waals volume of particular residues to quantify the local structural environment. RSV (range, 0-1) is defined as (Voronoi volume-van der Waals volume)/Voronoi volume of the target residue. The concept of RSV describes the extent of available space for every protein residue. RSV and Voronoi profiles with and without water molecules (RSVw, RSV, VOw, and VO) were compared for 554 non-homologous proteins. RSV (without water) showed better Pearson's correlations with CS than did RSVw, VO, or VOw values. The mean correlation coefficient between RSV and CS was 0.51, which is comparable to the correlation between RSA and CS (0.49) and that between WCN and CS (0.56). RSV is a robust structural descriptor with and without water molecules and can quantitatively reflect evolutionary information in a single protein structure. Therefore, it may represent a practical structural determinant to study protein sequence, structure, and function relationships. Copyright © 2017 Elsevier B.V. All rights reserved.
DNA nanotubes for NMR structure determination of membrane proteins.
Bellot, Gaëtan; McClintock, Mark A; Chou, James J; Shih, William M
2013-04-01
Finding a way to determine the structures of integral membrane proteins using solution nuclear magnetic resonance (NMR) spectroscopy has proved to be challenging. A residual-dipolar-coupling-based refinement approach can be used to resolve the structure of membrane proteins up to 40 kDa in size, but to do this you need a weak-alignment medium that is detergent-resistant and it has thus far been difficult to obtain such a medium suitable for weak alignment of membrane proteins. We describe here a protocol for robust, large-scale synthesis of detergent-resistant DNA nanotubes that can be assembled into dilute liquid crystals for application as weak-alignment media in solution NMR structure determination of membrane proteins in detergent micelles. The DNA nanotubes are heterodimers of 400-nm-long six-helix bundles, each self-assembled from a M13-based p7308 scaffold strand and >170 short oligonucleotide staple strands. Compatibility with proteins bearing considerable positive charge as well as modulation of molecular alignment, toward collection of linearly independent restraints, can be introduced by reducing the negative charge of DNA nanotubes using counter ions and small DNA-binding molecules. This detergent-resistant liquid-crystal medium offers a number of properties conducive for membrane protein alignment, including high-yield production, thermal stability, buffer compatibility and structural programmability. Production of sufficient nanotubes for four or five NMR experiments can be completed in 1 week by a single individual.
Mitchell, Carter A; Shi, Ce; Aldrich, Courtney C; Gulick, Andrew M
2012-04-17
Many bacteria use large modular enzymes for the synthesis of polyketide and peptide natural products. These multidomain enzymes contain integrated carrier domains that deliver bound substrates to multiple catalytic domains, requiring coordination of these chemical steps. Nonribosomal peptide synthetases (NRPSs) load amino acids onto carrier domains through the activity of an upstream adenylation domain. Our lab recently determined the structure of an engineered two-domain NRPS containing fused adenylation and carrier domains. This structure adopted a domain-swapped dimer that illustrated the interface between these two domains. To continue our investigation, we now examine PA1221, a natural two-domain protein from Pseudomonas aeruginosa. We have determined the amino acid specificity of this new enzyme and used domain specific mutations to demonstrate that loading the downstream carrier domain within a single protein molecule occurs more quickly than loading of a nonfused carrier domain intermolecularly. Finally, we have determined crystal structures of both apo- and holo-PA1221 proteins, the latter using a valine-adenosine vinylsulfonamide inhibitor that traps the adenylation domain-carrier domain interaction. The protein adopts an interface similar to that seen with the prior adenylation domain-carrier protein construct. A comparison of these structures with previous structures of multidomain NRPSs suggests that a large conformational change within the NRPS adenylation domains guides the carrier domain into the active site for thioester formation.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bunker, Richard D.; Mandal, Kalyaneswar; Bashiri, Ghader
Racemic protein crystallography was used to determine the X-ray structure of the predicted Mycobacterium tuberculosis protein Rv1738, which had been completely recalcitrant to crystallization in its natural L-form. Native chemical ligation was used to synthesize both L-protein and D-protein enantiomers of Rv1738. Crystallization of the racemic {D-protein + L-protein} mixture was immediately successful. The resulting crystals diffracted to high resolution and also enabled facile structure determination because of the quantized phases of the data from centrosymmetric crystals. The X-ray structure of Rv1738 revealed striking similarity with bacterial hibernation factors, despite minimal sequence similarity. As a result, we predict that Rv1738,more » which is highly up-regulated in conditions that mimic the onset of persistence, helps trigger dormancy by association with the bacterial ribosome.« less
Bunker, Richard D.; Mandal, Kalyaneswar; Bashiri, Ghader; ...
2015-04-07
Racemic protein crystallography was used to determine the X-ray structure of the predicted Mycobacterium tuberculosis protein Rv1738, which had been completely recalcitrant to crystallization in its natural L-form. Native chemical ligation was used to synthesize both L-protein and D-protein enantiomers of Rv1738. Crystallization of the racemic {D-protein + L-protein} mixture was immediately successful. The resulting crystals diffracted to high resolution and also enabled facile structure determination because of the quantized phases of the data from centrosymmetric crystals. The X-ray structure of Rv1738 revealed striking similarity with bacterial hibernation factors, despite minimal sequence similarity. As a result, we predict that Rv1738,more » which is highly up-regulated in conditions that mimic the onset of persistence, helps trigger dormancy by association with the bacterial ribosome.« less
A Markov Random Field Framework for Protein Side-Chain Resonance Assignment
NASA Astrophysics Data System (ADS)
Zeng, Jianyang; Zhou, Pei; Donald, Bruce Randall
Nuclear magnetic resonance (NMR) spectroscopy plays a critical role in structural genomics, and serves as a primary tool for determining protein structures, dynamics and interactions in physiologically-relevant solution conditions. The current speed of protein structure determination via NMR is limited by the lengthy time required in resonance assignment, which maps spectral peaks to specific atoms and residues in the primary sequence. Although numerous algorithms have been developed to address the backbone resonance assignment problem [68,2,10,37,14,64,1,31,60], little work has been done to automate side-chain resonance assignment [43, 48, 5]. Most previous attempts in assigning side-chain resonances depend on a set of NMR experiments that record through-bond interactions with side-chain protons for each residue. Unfortunately, these NMR experiments have low sensitivity and limited performance on large proteins, which makes it difficult to obtain enough side-chain resonance assignments. On the other hand, it is essential to obtain almost all of the side-chain resonance assignments as a prerequisite for high-resolution structure determination. To overcome this deficiency, we present a novel side-chain resonance assignment algorithm based on alternative NMR experiments measuring through-space interactions between protons in the protein, which also provide crucial distance restraints and are normally required in high-resolution structure determination. We cast the side-chain resonance assignment problem into a Markov Random Field (MRF) framework, and extend and apply combinatorial protein design algorithms to compute the optimal solution that best interprets the NMR data. Our MRF framework captures the contact map information of the protein derived from NMR spectra, and exploits the structural information available from the backbone conformations determined by orientational restraints and a set of discretized side-chain conformations (i.e., rotamers). A Hausdorff-based computation is employed in the scoring function to evaluate the probability of side-chain resonance assignments to generate the observed NMR spectra. The complexity of the assignment problem is first reduced by using a dead-end elimination (DEE) algorithm, which prunes side-chain resonance assignments that are provably not part of the optimal solution. Then an A* search algorithm is used to find a set of optimal side-chain resonance assignments that best fit the NMR data. We have tested our algorithm on NMR data for five proteins, including the FF Domain 2 of human transcription elongation factor CA150 (FF2), the B1 domain of Protein G (GB1), human ubiquitin, the ubiquitin-binding zinc finger domain of the human Y-family DNA polymerase Eta (pol η UBZ), and the human Set2-Rpb1 interacting domain (hSRI). Our algorithm assigns resonances for more than 90% of the protons in the proteins, and achieves about 80% correct side-chain resonance assignments. The final structures computed using distance restraints resulting from the set of assigned side-chain resonances have backbone RMSD 0.5 - 1.4 Å and all-heavy-atom RMSD 1.0 - 2.2 Å from the reference structures that were determined by X-ray crystallography or traditional NMR approaches. These results demonstrate that our algorithm can be successfully applied to automate side-chain resonance assignment and high-quality protein structure determination. Since our algorithm does not require any specific NMR experiments for measuring the through-bond interactions with side-chain protons, it can save a significant amount of both experimental cost and spectrometer time, and hence accelerate the NMR structure determination process.
Sequence Determinants of Compaction in Intrinsically Disordered Proteins
Marsh, Joseph A.; Forman-Kay, Julie D.
2010-01-01
Abstract Intrinsically disordered proteins (IDPs), which lack folded structure and are disordered under nondenaturing conditions, have been shown to perform important functions in a large number of cellular processes. These proteins have interesting structural properties that deviate from the random-coil-like behavior exhibited by chemically denatured proteins. In particular, IDPs are often observed to exhibit significant compaction. In this study, we have analyzed the hydrodynamic radii of a number of IDPs to investigate the sequence determinants of this compaction. Net charge and proline content are observed to be strongly correlated with increased hydrodynamic radii, suggesting that these are the dominant contributors to compaction. Hydrophobicity and secondary structure, on the other hand, appear to have negligible effects on compaction, which implies that the determinants of structure in folded and intrinsically disordered proteins are profoundly different. Finally, we observe that polyhistidine tags seem to increase IDP compaction, which suggests that these tags have significant perturbing effects and thus should be removed before any structural characterizations of IDPs. Using the relationships observed in this analysis, we have developed a sequence-based predictor of hydrodynamic radius for IDPs that shows substantial improvement over a simple model based upon chain length alone. PMID:20483348
Meeting Report: Structural Determination of Environmentally Responsive Proteins
Reinlib, Leslie
2005-01-01
The three-dimensional structure of gene products continues to be a missing lynchpin between linear genome sequences and our understanding of the normal and abnormal function of proteins and pathways. Enhanced activity in this area is likely to lead to better understanding of how discrete changes in molecular patterns and conformation underlie functional changes in protein complexes and, with it, sensitivity of an individual to an exposure. The National Institute of Environmental Health Sciences convened a workshop of experts in structural determination and environmental health to solicit advice for future research in structural resolution relative to environmentally responsive proteins and pathways. The highest priorities recommended by the workshop were to support studies of structure, analysis, control, and design of conformational and functional states at molecular resolution for environmentally responsive molecules and complexes; promote understanding of dynamics, kinetics, and ligand responses; investigate the mechanisms and steps in posttranslational modifications, protein partnering, impact of genetic polymorphisms on structure/function, and ligand interactions; and encourage integrated experimental and computational approaches. The workshop participants also saw value in improving the throughput and purity of protein samples and macromolecular assemblies; developing optimal processes for design, production, and assembly of macromolecular complexes; encouraging studies on protein–protein and macromolecular interactions; and examining assemblies of individual proteins and their functions in pathways of interest for environmental health. PMID:16263521
2017-01-01
ExoU is a 74 kDa cytotoxin that undergoes substantial conformational changes as part of its function, that is, it has multiple thermodynamically stable conformations that interchange depending on its environment. Such flexible proteins pose unique challenges to structural biology: (1) not only is it often difficult to determine structures by X-ray crystallography for all biologically relevant conformations because of the flat energy landscape (2) but also experimental conditions can easily perturb the biologically relevant conformation. The first challenge can be overcome by applying orthogonal structural biology techniques that are capable of observing alternative, biologically relevant conformations. The second challenge can be addressed by determining the structure in the same biological state with two independent techniques under different experimental conditions. If both techniques converge to the same structural model, the confidence that an unperturbed biologically relevant conformation is observed increases. To this end, we determine the structure of the C-terminal domain of the effector protein, ExoU, from data obtained by electron paramagnetic resonance spectroscopy in conjunction with site-directed spin labeling and in silico de novo structure determination. Our protocol encompasses a multimodule approach, consisting of low-resolution topology sampling, clustering, and high-resolution refinement. The resulting model was compared with an ExoU model in complex with its chaperone SpcU obtained previously by X-ray crystallography. The two models converged to a minimal RMSD100 of 3.2 Å, providing evidence that the unbound structure of ExoU matches the fold observed in complex with SpcU. PMID:28691114
Protein secondary structure determination by constrained single-particle cryo-electron tomography.
Bartesaghi, Alberto; Lecumberry, Federico; Sapiro, Guillermo; Subramaniam, Sriram
2012-12-05
Cryo-electron microscopy (cryo-EM) is a powerful technique for 3D structure determination of protein complexes by averaging information from individual molecular images. The resolutions that can be achieved with single-particle cryo-EM are frequently limited by inaccuracies in assigning molecular orientations based solely on 2D projection images. Tomographic data collection schemes, however, provide powerful constraints that can be used to more accurately determine molecular orientations necessary for 3D reconstruction. Here, we propose "constrained single-particle tomography" as a general strategy for 3D structure determination in cryo-EM. A key component of our approach is the effective use of images recorded in tilt series to extract high-resolution information and correct for the contrast transfer function. By incorporating geometric constraints into the refinement to improve orientational accuracy of images, we reduce model bias and overrefinement artifacts and demonstrate that protein structures can be determined at resolutions of ∼8 Å starting from low-dose tomographic tilt series. Copyright © 2012 Elsevier Ltd. All rights reserved.
Mandal, Kalyaneswar; Pentelute, Brad L; Tereshko, Valentina; Thammavongsa, Vilasak; Schneewind, Olaf; Kossiakoff, Anthony A; Kent, Stephen B H
2009-01-01
We describe the use of racemic crystallography to determine the X-ray structure of the natural product plectasin, a potent antimicrobial protein recently isolated from fungus. The protein enantiomers l-plectasin and d-plectasin were prepared by total chemical synthesis; interestingly, l-plectasin showed the expected antimicrobial activity, while d-plectasin was devoid of such activity. The mirror image proteins were then used for racemic crystallization. Synchrotron X-ray diffraction data were collected to atomic resolution from a racemic plectasin crystal; the racemate crystallized in the achiral centrosymmetric space group with one l-plectasin molecule and one d-plectasin molecule forming the unit cell. Dimer-like intermolecular interactions between the protein enantiomers were observed, which may account for the observed extremely low solvent content (13%–15%) and more highly ordered nature of the racemic crystals. The structure of the plectasin molecule was well defined for all 40 amino acids and was generally similar to the previously determined NMR structure, suggesting minimal impact of the crystal packing on the plectasin conformation. PMID:19472324
How precise are reported protein coordinate data?
Konagurthu, Arun S; Allison, Lloyd; Abramson, David; Stuckey, Peter J; Lesk, Arthur M
2014-03-01
Atomic coordinates in the Worldwide Protein Data Bank (wwPDB) are generally reported to greater precision than the experimental structure determinations have actually achieved. By using information theory and data compression to study the compressibility of protein atomic coordinates, it is possible to quantify the amount of randomness in the coordinate data and thereby to determine the realistic precision of the reported coordinates. On average, the value of each C(α) coordinate in a set of selected protein structures solved at a variety of resolutions is good to about 0.1 Å.
NASA Technical Reports Server (NTRS)
1992-01-01
Malic Enzyme is a target protein for drug design because it is a key protein in the life cycle of intestinal parasites. After 2 years of effort on Earth, investigators were unable to produce any crystals that were of high enough quality and for this reason the structure of this important protein could not be determined. Crystals obtained from one STS-50 were of superior quality allowing the structure to be determined. This is just one example why access to space is so vital for these studies. Principal Investigator is Larry DeLucas.
Kazmier, Kelli; Alexander, Nathan S.; Meiler, Jens; Mchaourab, Hassane S.
2010-01-01
A hybrid protein structure determination approach combining sparse Electron Paramagnetic Resonance (EPR) distance restraints and Rosetta de novo protein folding has been previously demonstrated to yield high quality models (Alexander et al., 2008). However, widespread application of this methodology to proteins of unknown structures is hindered by the lack of a general strategy to place spin label pairs in the primary sequence. In this work, we report the development of an algorithm that optimally selects spin labeling positions for the purpose of distance measurements by EPR. For the α-helical subdomain of T4 lysozyme (T4L), simulated restraints that maximize sequence separation between the two spin labels while simultaneously ensuring pairwise connectivity of secondary structure elements yielded vastly improved models by Rosetta folding. 50% of all these models have the correct fold compared to only 21% and 8% correctly folded models when randomly placed restraints or no restraints are used, respectively. Moreover, the improvements in model quality require a limited number of optimized restraints, the number of which is determined by the pairwise connectivities of T4L α-helices. The predicted improvement in Rosetta model quality was verified by experimental determination of distances between spin labels pairs selected by the algorithm. Overall, our results reinforce the rationale for the combined use of sparse EPR distance restraints and de novo folding. By alleviating the experimental bottleneck associated with restraint selection, this algorithm sets the stage for extending computational structure determination to larger, traditionally elusive protein topologies of critical structural and biochemical importance. PMID:21074624
General overview on structure prediction of twilight-zone proteins.
Khor, Bee Yin; Tye, Gee Jun; Lim, Theam Soon; Choong, Yee Siew
2015-09-04
Protein structure prediction from amino acid sequence has been one of the most challenging aspects in computational structural biology despite significant progress in recent years showed by critical assessment of protein structure prediction (CASP) experiments. When experimentally determined structures are unavailable, the predictive structures may serve as starting points to study a protein. If the target protein consists of homologous region, high-resolution (typically <1.5 Å) model can be built via comparative modelling. However, when confronted with low sequence similarity of the target protein (also known as twilight-zone protein, sequence identity with available templates is less than 30%), the protein structure prediction has to be initiated from scratch. Traditionally, twilight-zone proteins can be predicted via threading or ab initio method. Based on the current trend, combination of different methods brings an improved success in the prediction of twilight-zone proteins. In this mini review, the methods, progresses and challenges for the prediction of twilight-zone proteins were discussed.
X-ray Diffraction from Membrane Protein Nanocrystals
Hunter, M.S.; DePonte, D.P.; Shapiro, D.A.; Kirian, R.A.; Wang, X.; Starodub, D.; Marchesini, S.; Weierstall, U.; Doak, R.B.; Spence, J.C.H.; Fromme, P.
2011-01-01
Membrane proteins constitute >30% of the proteins in an average cell, and yet the number of currently known structures of unique membrane proteins is <300. To develop new concepts for membrane protein structure determination, we have explored the serial nanocrystallography method, in which fully hydrated protein nanocrystals are delivered to an x-ray beam within a liquid jet at room temperature. As a model system, we have collected x-ray powder diffraction data from the integral membrane protein Photosystem I, which consists of 36 subunits and 381 cofactors. Data were collected from crystals ranging in size from 100 nm to 2 μm. The results demonstrate that there are membrane protein crystals that contain <100 unit cells (200 total molecules) and that 3D crystals of membrane proteins, which contain <200 molecules, may be suitable for structural investigation. Serial nanocrystallography overcomes the problem of x-ray damage, which is currently one of the major limitations for x-ray structure determination of small crystals. By combining serial nanocrystallography with x-ray free-electron laser sources in the future, it may be possible to produce molecular-resolution electron-density maps using membrane protein crystals that contain only a few hundred or thousand unit cells. PMID:21190672
Eggimann, Becky L.; Vostrikov, Vitaly V.; Veglia, Gianluigi; Siepmann, J. Ilja
2013-01-01
We present a fast and simple protocol to obtain moderate-resolution backbone structures of helical proteins. This approach utilizes a combination of sparse backbone NMR data (residual dipolar couplings and paramagnetic relaxation enhancements) or EPR data with a residue-based force field and Monte Carlo/simulated annealing protocol to explore the folding energy landscape of helical proteins. By using only backbone NMR data, which are relatively easy to collect and analyze, and strategically placed spin relaxation probes, we show that it is possible to obtain protein structures with correct helical topology and backbone RMS deviations well below 4 Å. This approach offers promising alternatives for the structural determination of proteins in which nuclear Overha-user effect data are difficult or impossible to assign and produces initial models that will speed up the high-resolution structure determination by NMR spectroscopy. PMID:24639619
Structure of Lmaj006129AAA, a hypothetical protein from Leishmania major
DOE Office of Scientific and Technical Information (OSTI.GOV)
Arakaki, Tracy; Le Trong, Isolde; Structural Genomics of Pathogenic Protozoa
2006-03-01
The crystal structure of a conserved hypothetical protein from L. major, Pfam sequence family PF04543, structural genomics target ID Lmaj006129AAA, has been determined at a resolution of 1.6 Å. The gene product of structural genomics target Lmaj006129 from Leishmania major codes for a 164-residue protein of unknown function. When SeMet expression of the full-length gene product failed, several truncation variants were created with the aid of Ginzu, a domain-prediction method. 11 truncations were selected for expression, purification and crystallization based upon secondary-structure elements and disorder. The structure of one of these variants, Lmaj006129AAH, was solved by multiple-wavelength anomalous diffraction (MAD)more » using ELVES, an automatic protein crystal structure-determination system. This model was then successfully used as a molecular-replacement probe for the parent full-length target, Lmaj006129AAA. The final structure of Lmaj006129AAA was refined to an R value of 0.185 (R{sub free} = 0.229) at 1.60 Å resolution. Structure and sequence comparisons based on Lmaj006129AAA suggest that proteins belonging to Pfam sequence families PF04543 and PF01878 may share a common ligand-binding motif.« less
ERIC Educational Resources Information Center
National Institute of General Medical Sciences (NIGMS), 2007
2007-01-01
This booklet reveals how structural biology provides insight into health and disease and is useful in developing new medications. It contains a general introduction to proteins, coverage of the techniques used to determine protein structures, and a chapter on structure-based drug design. The booklet features "Student Snapshots," designed to…
Isom, Daniel G; Marguet, Philippe R; Oas, Terrence G; Hellinga, Homme W
2011-04-01
Protein thermodynamic stability is a fundamental physical characteristic that determines biological function. Furthermore, alteration of thermodynamic stability by macromolecular interactions or biochemical modifications is a powerful tool for assessing the relationship between protein structure, stability, and biological function. High-throughput approaches for quantifying protein stability are beginning to emerge that enable thermodynamic measurements on small amounts of material, in short periods of time, and using readily accessible instrumentation. Here we present such a method, fast quantitative cysteine reactivity, which exploits the linkage between protein stability, sidechain protection by protein structure, and structural dynamics to characterize the thermodynamic and kinetic properties of proteins. In this approach, the reaction of a protected cysteine and thiol-reactive fluorogenic indicator is monitored over a gradient of temperatures after a short incubation time. These labeling data can be used to determine the midpoint of thermal unfolding, measure the temperature dependence of protein stability, quantify ligand-binding affinity, and, under certain conditions, estimate folding rate constants. Here, we demonstrate the fQCR method by characterizing these thermodynamic and kinetic properties for variants of Staphylococcal nuclease and E. coli ribose-binding protein engineered to contain single, protected cysteines. These straightforward, information-rich experiments are likely to find applications in protein engineering and functional genomics. Copyright © 2010 Wiley-Liss, Inc.
Recent advances in automated protein design and its future challenges.
Setiawan, Dani; Brender, Jeffrey; Zhang, Yang
2018-04-25
Protein function is determined by protein structure which is in turn determined by the corresponding protein sequence. If the rules that cause a protein to adopt a particular structure are understood, it should be possible to refine or even redefine the function of a protein by working backwards from the desired structure to the sequence. Automated protein design attempts to calculate the effects of mutations computationally with the goal of more radical or complex transformations than are accessible by experimental techniques. Areas covered: The authors give a brief overview of the recent methodological advances in computer-aided protein design, showing how methodological choices affect final design and how automated protein design can be used to address problems considered beyond traditional protein engineering, including the creation of novel protein scaffolds for drug development. Also, the authors address specifically the future challenges in the development of automated protein design. Expert opinion: Automated protein design holds potential as a protein engineering technique, particularly in cases where screening by combinatorial mutagenesis is problematic. Considering solubility and immunogenicity issues, automated protein design is initially more likely to make an impact as a research tool for exploring basic biology in drug discovery than in the design of protein biologics.
Density functional study of molecular interactions in secondary structures of proteins.
Takano, Yu; Kusaka, Ayumi; Nakamura, Haruki
2016-01-01
Proteins play diverse and vital roles in biology, which are dominated by their three-dimensional structures. The three-dimensional structure of a protein determines its functions and chemical properties. Protein secondary structures, including α-helices and β-sheets, are key components of the protein architecture. Molecular interactions, in particular hydrogen bonds, play significant roles in the formation of protein secondary structures. Precise and quantitative estimations of these interactions are required to understand the principles underlying the formation of three-dimensional protein structures. In the present study, we have investigated the molecular interactions in α-helices and β-sheets, using ab initio wave function-based methods, the Hartree-Fock method (HF) and the second-order Møller-Plesset perturbation theory (MP2), density functional theory, and molecular mechanics. The characteristic interactions essential for forming the secondary structures are discussed quantitatively.
HBNG: Graph theory based visualization of hydrogen bond networks in protein structures.
Tiwari, Abhishek; Tiwari, Vivek
2007-07-09
HBNG is a graph theory based tool for visualization of hydrogen bond network in 2D. Digraphs generated by HBNG facilitate visualization of cooperativity and anticooperativity chains and rings in protein structures. HBNG takes hydrogen bonds list files (output from HBAT, HBEXPLORE, HBPLUS and STRIDE) as input and generates a DOT language script and constructs digraphs using freeware AT and T Graphviz tool. HBNG is useful in the enumeration of favorable topologies of hydrogen bond networks in protein structures and determining the effect of cooperativity and anticooperativity on protein stability and folding. HBNG can be applied to protein structure comparison and in the identification of secondary structural regions in protein structures. Program is available from the authors for non-commercial purposes.
Protein Models Docking Benchmark 2
Anishchenko, Ivan; Kundrotas, Petras J.; Tuzikov, Alexander V.; Vakser, Ilya A.
2015-01-01
Structural characterization of protein-protein interactions is essential for our ability to understand life processes. However, only a fraction of known proteins have experimentally determined structures. Such structures provide templates for modeling of a large part of the proteome, where individual proteins can be docked by template-free or template-based techniques. Still, the sensitivity of the docking methods to the inherent inaccuracies of protein models, as opposed to the experimentally determined high-resolution structures, remains largely untested, primarily due to the absence of appropriate benchmark set(s). Structures in such a set should have pre-defined inaccuracy levels and, at the same time, resemble actual protein models in terms of structural motifs/packing. The set should also be large enough to ensure statistical reliability of the benchmarking results. We present a major update of the previously developed benchmark set of protein models. For each interactor, six models were generated with the model-to-native Cα RMSD in the 1 to 6 Å range. The models in the set were generated by a new approach, which corresponds to the actual modeling of new protein structures in the “real case scenario,” as opposed to the previous set, where a significant number of structures were model-like only. In addition, the larger number of complexes (165 vs. 63 in the previous set) increases the statistical reliability of the benchmarking. We estimated the highest accuracy of the predicted complexes (according to CAPRI criteria), which can be attained using the benchmark structures. The set is available at http://dockground.bioinformatics.ku.edu. PMID:25712716
Brodie, Nicholas I; Popov, Konstantin I; Petrotchenko, Evgeniy V; Dokholyan, Nikolay V; Borchers, Christoph H
2017-07-01
We present an integrated experimental and computational approach for de novo protein structure determination in which short-distance cross-linking data are incorporated into rapid discrete molecular dynamics (DMD) simulations as constraints, reducing the conformational space and achieving the correct protein folding on practical time scales. We tested our approach on myoglobin and FK506 binding protein-models for α helix-rich and β sheet-rich proteins, respectively-and found that the lowest-energy structures obtained were in agreement with the crystal structure, hydrogen-deuterium exchange, surface modification, and long-distance cross-linking validation data. Our approach is readily applicable to other proteins with unknown structures.
Advances in structural and functional analysis of membrane proteins by electron crystallography
Wisedchaisri, Goragot; Reichow, Steve L.; Gonen, Tamir
2011-01-01
Summary Electron crystallography is a powerful technique for the study of membrane protein structure and function in the lipid environment. When well-ordered two-dimensional crystals are obtained the structure of both protein and lipid can be determined and lipid-protein interactions analyzed. Protons and ionic charges can be visualized by electron crystallography and the protein of interest can be captured for structural analysis in a variety of physiologically distinct states. This review highlights the strengths of electron crystallography and the momentum that is building up in automation and the development of high throughput tools and methods for structural and functional analysis of membrane proteins by electron crystallography. PMID:22000511
Advances in structural and functional analysis of membrane proteins by electron crystallography.
Wisedchaisri, Goragot; Reichow, Steve L; Gonen, Tamir
2011-10-12
Electron crystallography is a powerful technique for the study of membrane protein structure and function in the lipid environment. When well-ordered two-dimensional crystals are obtained the structure of both protein and lipid can be determined and lipid-protein interactions analyzed. Protons and ionic charges can be visualized by electron crystallography and the protein of interest can be captured for structural analysis in a variety of physiologically distinct states. This review highlights the strengths of electron crystallography and the momentum that is building up in automation and the development of high throughput tools and methods for structural and functional analysis of membrane proteins by electron crystallography. Copyright © 2011 Elsevier Ltd. All rights reserved.
Treatment of Second-Order Structures of Proteins Using Oxygen Radio Frequency Plasma
NASA Astrophysics Data System (ADS)
Hayashi, Nobuya; Nakahigashi, Akari; Liu, Hao; Goto, Masaaki
2010-08-01
Decomposition characteristics of second-order structures of proteins are determined using an oxygen radio frequency (RF) plasma sterilizer in order to prevent infectious proteins from contaminating medical equipment in hospitals. The removal of casein protein as a test protein with a concentration of 50 mg/cm2 on the plane substrate requires approximately 8 h when singlet atomic oxygen is irradiated. The peak intensity of Fourier transform infrared spectroscopy (FTIR) spectra of the β-sheet structures decreases at approximately the same rate as those of the α-helix and first-order structures of proteins. Active oxygen has a sufficient oxidation energy to dissociate hydrogen bonds within the β-sheet structure.
Formulation and in vitro characterization of protein-loaded liposomes
NASA Astrophysics Data System (ADS)
Kuzimski, Lauren
Background/Objective: Protein-based drugs are increasingly used to treat a variety of conditions including cancer and cardio-vascular disease. Due to the immune system's innate ability to degrade the foreign particles quickly, protein-based treatments are generally short-lived. To address this limitation, the objective of the study was to: 1) develop protein-loaded liposomes; 2) characterize size, stability, encapsulation efficiency and rate of protein release; and 3) determine intracellular uptake and distribution; and 4) protein structural changes. Method: Liposomes were loaded with a fluorescent-albumin using freeze-thaw (F/T) methodology. Albumin encapsulation and release were quantified by fluorescence spectroscopic techniques. Flow cytometry was used to determine liposome uptake by macrophages. Epifluorescence microscopy was used to determine cellular distribution of liposomes. Stability was determined using dynamic light scattering by measuring liposome size over one month period. Protein structure was determined using circular dichroism (CD). Result: Encapsulation of albumin in liposome was ˜90% and was dependent on F/T rates, with fifteen cycles yielding the highest encapsulation efficacy (p < 0.05). Albumin-loaded liposomes demonstrated consistent size (<300nm). Release of encapsulated albumin in physiological buffer at 25°C was ˜60% in 72 h. Fluorescence imaging suggested an endosomal route of cellular entry for the FITC-albumin liposome with maximum uptake rates in immune cells (30% at 2hour incubation). CD suggested protein structure is minimally impacted by freeze-thaw methodology. Conclusion: Using F/T as a loading method, we were able to successfully achieve a protein-loaded liposome that was under 300nm, had encapsulation of ˜90%. Synthesized liposomes demonstrated a burst release of encapsulate protein (60%) at 72 hours. Cellular trafficking confirmed endosomal uptake, and minimal protein damage was noticed in CD.
A Parametric Rosetta Energy Function Analysis with LK Peptides on SAM Surfaces.
Lubin, Joseph H; Pacella, Michael S; Gray, Jeffrey J
2018-05-08
Although structures have been determined for many soluble proteins and an increasing number of membrane proteins, experimental structure determination methods are limited for complexes of proteins and solid surfaces. An economical alternative or complement to experimental structure determination is molecular simulation. Rosetta is one software suite that models protein-surface interactions, but Rosetta is normally benchmarked on soluble proteins. For surface interactions, the validity of the energy function is uncertain because it is a combination of independent parameters from energy functions developed separately for solution proteins and mineral surfaces. Here, we assess the performance of the RosettaSurface algorithm and test the accuracy of its energy function by modeling the adsorption of leucine/lysine (LK)-repeat peptides on methyl- and carboxy-terminated self-assembled monolayers (SAMs). We investigated how RosettaSurface predictions for this system compare with the experimental results, which showed that on both surfaces, LK-α peptides folded into helices and LK-β peptides held extended structures. Utilizing this model system, we performed a parametric analysis of Rosetta's Talaris energy function and determined that adjusting solvation parameters offered improved predictive accuracy. Simultaneously increasing lysine carbon hydrophilicity and the hydrophobicity of the surface methyl head groups yielded computational predictions most closely matching the experimental results. De novo models still should be interpreted skeptically unless bolstered in an integrative approach with experimental data.
Fingerprint-Based Structure Retrieval Using Electron Density
Yin, Shuangye; Dokholyan, Nikolay V.
2010-01-01
We present a computational approach that can quickly search a large protein structural database to identify structures that fit a given electron density, such as determined by cryo-electron microscopy. We use geometric invariants (fingerprints) constructed using 3D Zernike moments to describe the electron density, and reduce the problem of fitting of the structure to the electron density to simple fingerprint comparison. Using this approach, we are able to screen the entire Protein Data Bank and identify structures that fit two experimental electron densities determined by cryo-electron microscopy. PMID:21287628
Fingerprint-based structure retrieval using electron density.
Yin, Shuangye; Dokholyan, Nikolay V
2011-03-01
We present a computational approach that can quickly search a large protein structural database to identify structures that fit a given electron density, such as determined by cryo-electron microscopy. We use geometric invariants (fingerprints) constructed using 3D Zernike moments to describe the electron density, and reduce the problem of fitting of the structure to the electron density to simple fingerprint comparison. Using this approach, we are able to screen the entire Protein Data Bank and identify structures that fit two experimental electron densities determined by cryo-electron microscopy. Copyright © 2010 Wiley-Liss, Inc.
Crystallization of Membrane Proteins by Vapor Diffusion
Delmar, Jared A.; Bolla, Jani Reddy; Su, Chih-Chia; Yu, Edward W.
2016-01-01
X-ray crystallography remains the most robust method to determine protein structure at the atomic level. However, the bottlenecks of protein expression and purification often discourage further study. In this chapter, we address the most common problems encountered at these stages. Based on our experiences in expressing and purifying antimicrobial efflux proteins, we explain how a pure and homogenous protein sample can be successfully crystallized by the vapor diffusion method. We present our current protocols and methodologies for this technique. Case studies show step-by-step how we have overcome problems related to expression and diffraction, eventually producing high quality membrane protein crystals for structural determinations. It is our hope that a rational approach can be made of the often anecdotal process of membrane protein crystallization. PMID:25950974
A discrete search algorithm for finding the structure of protein backbones and side chains.
Sallaume, Silas; Martins, Simone de Lima; Ochi, Luiz Satoru; Da Silva, Warley Gramacho; Lavor, Carlile; Liberti, Leo
2013-01-01
Some information about protein structure can be obtained by using Nuclear Magnetic Resonance (NMR) techniques, but they provide only a sparse set of distances between atoms in a protein. The Molecular Distance Geometry Problem (MDGP) consists in determining the three-dimensional structure of a molecule using a set of known distances between some atoms. Recently, a Branch and Prune (BP) algorithm was proposed to calculate the backbone of a protein, based on a discrete formulation for the MDGP. We present an extension of the BP algorithm that can calculate not only the protein backbone, but the whole three-dimensional structure of proteins.
Gold, Nicola D; Jackson, Richard M
2006-02-03
The rapid growth in protein structural data and the emergence of structural genomics projects have increased the need for automatic structure analysis and tools for function prediction. Small molecule recognition is critical to the function of many proteins; therefore, determination of ligand binding site similarity is important for understanding ligand interactions and may allow their functional classification. Here, we present a binding sites database (SitesBase) that given a known protein-ligand binding site allows rapid retrieval of other binding sites with similar structure independent of overall sequence or fold similarity. However, each match is also annotated with sequence similarity and fold information to aid interpretation of structure and functional similarity. Similarity in ligand binding sites can indicate common binding modes and recognition of similar molecules, allowing potential inference of function for an uncharacterised protein or providing additional evidence of common function where sequence or fold similarity is already known. Alternatively, the resource can provide valuable information for detailed studies of molecular recognition including structure-based ligand design and in understanding ligand cross-reactivity. Here, we show examples of atomic similarity between superfamily or more distant fold relatives as well as between seemingly unrelated proteins. Assignment of unclassified proteins to structural superfamiles is also undertaken and in most cases substantiates assignments made using sequence similarity. Correct assignment is also possible where sequence similarity fails to find significant matches, illustrating the potential use of binding site comparisons for newly determined proteins.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Close, Devin W.; Paul, Craig Don; Langan, Patricia S.
In this paper, we describe the engineering and X-ray crystal structure of Thermal Green Protein (TGP), an extremely stable, highly soluble, non-aggregating green fluorescent protein. TGP is a soluble variant of the fluorescent protein eCGP123, which despite being highly stable, has proven to be aggregation-prone. The X-ray crystal structure of eCGP123, also determined within the context of this paper, was used to carry out rational surface engineering to improve its solubility, leading to TGP. The approach involved simultaneously eliminating crystal lattice contacts while increasing the overall negative charge of the protein. Despite intentional disruption of lattice contacts and introduction ofmore » high entropy glutamate side chains, TGP crystallized readily in a number of different conditions and the X-ray crystal structure of TGP was determined to 1.9 Å resolution. The structural reasons for the enhanced stability of TGP and eCGP123 are discussed. We demonstrate the utility of using TGP as a fusion partner in various assays and significantly, in amyloid assays in which the standard fluorescent protein, EGFP, is undesirable because of aberrant oligomerization.« less
Close, Devin W.; Paul, Craig Don; Langan, Patricia S.; ...
2015-05-08
In this paper, we describe the engineering and X-ray crystal structure of Thermal Green Protein (TGP), an extremely stable, highly soluble, non-aggregating green fluorescent protein. TGP is a soluble variant of the fluorescent protein eCGP123, which despite being highly stable, has proven to be aggregation-prone. The X-ray crystal structure of eCGP123, also determined within the context of this paper, was used to carry out rational surface engineering to improve its solubility, leading to TGP. The approach involved simultaneously eliminating crystal lattice contacts while increasing the overall negative charge of the protein. Despite intentional disruption of lattice contacts and introduction ofmore » high entropy glutamate side chains, TGP crystallized readily in a number of different conditions and the X-ray crystal structure of TGP was determined to 1.9 Å resolution. The structural reasons for the enhanced stability of TGP and eCGP123 are discussed. We demonstrate the utility of using TGP as a fusion partner in various assays and significantly, in amyloid assays in which the standard fluorescent protein, EGFP, is undesirable because of aberrant oligomerization.« less
Protein folding and misfolding: mechanism and principles
Englander, S. Walter; Mayne, Leland; Krishna, Mallela M. G.
2012-01-01
Two fundamentally different views of how proteins fold are now being debated. Do proteins fold through multiple unpredictable routes directed only by the energetically downhill nature of the folding landscape or do they fold through specific intermediates in a defined pathway that systematically puts predetermined pieces of the target native protein into place? It has now become possible to determine the structure of protein folding intermediates, evaluate their equilibrium and kinetic parameters, and establish their pathway relationships. Results obtained for many proteins have serendipitously revealed a new dimension of protein structure. Cooperative structural units of the native protein, called foldons, unfold and refold repeatedly even under native conditions. Much evidence obtained by hydrogen exchange and other methods now indicates that cooperative foldon units and not individual amino acids account for the unit steps in protein folding pathways. The formation of foldons and their ordered pathway assembly systematically puts native-like foldon building blocks into place, guided by a sequential stabilization mechanism in which prior native-like structure templates the formation of incoming foldons with complementary structure. Thus the same propensities and interactions that specify the final native state, encoded in the amino-acid sequence of every protein, determine the pathway for getting there. Experimental observations that have been interpreted differently, in terms of multiple independent pathways, appear to be due to chance misfolding errors that cause different population fractions to block at different pathway points, populate different pathway intermediates, and fold at different rates. This paper summarizes the experimental basis for these three determining principles and their consequences. Cooperative native-like foldon units and the sequential stabilization process together generate predetermined stepwise pathways. Optional misfolding errors are responsible for 3-state and heterogeneous kinetic folding. PMID:18405419
Contemporary Methodology for Protein Structure Determination.
ERIC Educational Resources Information Center
Hunkapiller, Michael W.; And Others
1984-01-01
Describes the nature and capabilities of methods used to characterize protein and peptide structure, indicating that they have undergone changes which have improved the speed, reliability, and applicability of the process. Also indicates that high-performance liquid chromatography and gel electrophoresis have made purifying proteins and peptides a…
Gaia: automated quality assessment of protein structure models.
Kota, Pradeep; Ding, Feng; Ramachandran, Srinivas; Dokholyan, Nikolay V
2011-08-15
Increasing use of structural modeling for understanding structure-function relationships in proteins has led to the need to ensure that the protein models being used are of acceptable quality. Quality of a given protein structure can be assessed by comparing various intrinsic structural properties of the protein to those observed in high-resolution protein structures. In this study, we present tools to compare a given structure to high-resolution crystal structures. We assess packing by calculating the total void volume, the percentage of unsatisfied hydrogen bonds, the number of steric clashes and the scaling of the accessible surface area. We assess covalent geometry by determining bond lengths, angles, dihedrals and rotamers. The statistical parameters for the above measures, obtained from high-resolution crystal structures enable us to provide a quality-score that points to specific areas where a given protein structural model needs improvement. We provide these tools that appraise protein structures in the form of a web server Gaia (http://chiron.dokhlab.org). Gaia evaluates the packing and covalent geometry of a given protein structure and provides quantitative comparison of the given structure to high-resolution crystal structures. dokh@unc.edu Supplementary data are available at Bioinformatics online.
The Structure of the Mouse Serotonin 5-HT3 Receptor in Lipid Vesicles.
Kudryashev, Mikhail; Castaño-Díez, Daniel; Deluz, Cédric; Hassaine, Gherici; Grasso, Luigino; Graf-Meyer, Alexandra; Vogel, Horst; Stahlberg, Henning
2016-01-05
The function of membrane proteins is best understood if their structure in the lipid membrane is known. Here, we determined the structure of the mouse serotonin 5-HT3 receptor inserted in lipid bilayers to a resolution of 12 Å without stabilizing antibodies by cryo electron tomography and subtomogram averaging. The reconstruction reveals protein secondary structure elements in the transmembrane region, the extracellular pore, and the transmembrane channel pathway, showing an overall similarity to the available X-ray model of the truncated 5-HT3 receptor determined in the presence of a stabilizing nanobody. Structural analysis of the 5-HT3 receptor embedded in a lipid bilayer allowed the position of the membrane to be determined. Interactions between the densely packed receptors in lipids were visualized, revealing that the interactions were maintained by the short horizontal helices. In combination with methodological improvements, our approach enables the structural analysis of membrane proteins in response to voltage and ligand gating. Copyright © 2016 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Parish, D.; Benach, J; Liu, G
2008-01-01
The structure of the 142-residue protein Q8ZP25 SALTY encoded in the genome of Salmonella typhimurium LT2 was determined independently by NMR and X-ray crystallography, and the structure of the 140-residue protein HYAE ECOLI encoded in the genome of Escherichia coli was determined by NMR. The two proteins belong to Pfam (Finn et al. 34:D247-D251, 2006) PF07449, which currently comprises 50 members, and belongs itself to the 'thioredoxin-like clan'. However, protein HYAE ECOLI and the other proteins of Pfam PF07449 do not contain the canonical Cys-X-X-Cys active site sequence motif of thioredoxin. Protein HYAE ECOLI was previously classified as a (NiFe)more » hydrogenase-1 specific chaperone interacting with the twin-arginine translocation (Tat) signal peptide. The structures presented here exhibit the expected thioredoxin-like fold and support the view that members of Pfam family PF07449 specifically interact with Tat signal peptides.« less
Gunčar, Gregor; Wang, Ching-I A.; Forwood, Jade K.; Teh, Trazel; Catanzariti, Ann-Maree; Ellis, Jeffrey G.; Dodds, Peter N.; Kobe, Boštjan
2007-01-01
Metal-binding sites are ubiquitous in proteins and can be readily utilized for phasing. It is shown that a protein crystal structure can be solved using single-wavelength anomalous diffraction based on the anomalous signal of a cobalt ion measured on a conventional monochromatic X-ray source. The unique absorption edge of cobalt (1.61 Å) is compatible with the Cu Kα wavelength (1.54 Å) commonly available in macromolecular crystallography laboratories. This approach was applied to the determination of the structure of Melampsora lini avirulence protein AvrL567-A, a protein with a novel fold from the fungal pathogen flax rust that induces plant disease resistance in flax plants. This approach using cobalt ions may be applicable to all cobalt-binding proteins and may be advantageous when synchrotron radiation is not readily available. PMID:17329816
Gi- and Gs-coupled GPCRs show different modes of G-protein binding.
Van Eps, Ned; Altenbach, Christian; Caro, Lydia N; Latorraca, Naomi R; Hollingsworth, Scott A; Dror, Ron O; Ernst, Oliver P; Hubbell, Wayne L
2018-03-06
More than two decades ago, the activation mechanism for the membrane-bound photoreceptor and prototypical G protein-coupled receptor (GPCR) rhodopsin was uncovered. Upon light-induced changes in ligand-receptor interaction, movement of specific transmembrane helices within the receptor opens a crevice at the cytoplasmic surface, allowing for coupling of heterotrimeric guanine nucleotide-binding proteins (G proteins). The general features of this activation mechanism are conserved across the GPCR superfamily. Nevertheless, GPCRs have selectivity for distinct G-protein family members, but the mechanism of selectivity remains elusive. Structures of GPCRs in complex with the stimulatory G protein, G s , and an accessory nanobody to stabilize the complex have been reported, providing information on the intermolecular interactions. However, to reveal the structural selectivity filters, it will be necessary to determine GPCR-G protein structures involving other G-protein subtypes. In addition, it is important to obtain structures in the absence of a nanobody that may influence the structure. Here, we present a model for a rhodopsin-G protein complex derived from intermolecular distance constraints between the activated receptor and the inhibitory G protein, G i , using electron paramagnetic resonance spectroscopy and spin-labeling methodologies. Molecular dynamics simulations demonstrated the overall stability of the modeled complex. In the rhodopsin-G i complex, G i engages rhodopsin in a manner distinct from previous GPCR-G s structures, providing insight into specificity determinants. Copyright © 2018 the Author(s). Published by PNAS.
Where to attach dye molecules to a protein: lessons from the computer program WHAT IF
NASA Astrophysics Data System (ADS)
Altenberg-Greulich, B.; Vriend, G.
2001-10-01
Genomic and proteomic projects are producing a flood of data that all require interpretation which often is best performed based on a three dimensional structure of the molecule(s) involved. These structures can be determined experimentally, or modelled by homology. Because of the complexity of the questions and the heterogeneity of the data, the software used for modelling proteins must become even more versatile. We describe several case studies in which the questions asked, the data, and the requirements on the software all are very different. It is shown how structural knowledge about a protein helps to determine the best place to bind a fluorescent dye. Such dyes are needed to determine protein-protein, protein-DNA interactions or intrinsic fluorescence microscopy. Further, using dyes you can trace molecules in the cell and thus get a handle on subcellular localisation. The first example (OCT-1) involves the search for free amino groups in a protein-DNA complex. The second example (BPTI) is a case, in which the amino acid distribution shows that amino groups are spread all over the structure, so that the natural structure has to be modified to get an answer. The third example (HFE) involves a model built by homology. In this case the amino group distribution can also be predicted. All these studies were performed using the WHAT IF software package. This package is available including source code, documentation, etc. See http://www.cmbi.kun.nl/whatif/
Integrating NOE and RDC using sum-of-squares relaxation for protein structure determination.
Khoo, Y; Singer, A; Cowburn, D
2017-07-01
We revisit the problem of protein structure determination from geometrical restraints from NMR, using convex optimization. It is well-known that the NP-hard distance geometry problem of determining atomic positions from pairwise distance restraints can be relaxed into a convex semidefinite program (SDP). However, often the NOE distance restraints are too imprecise and sparse for accurate structure determination. Residual dipolar coupling (RDC) measurements provide additional geometric information on the angles between atom-pair directions and axes of the principal-axis-frame. The optimization problem involving RDC is highly non-convex and requires a good initialization even within the simulated annealing framework. In this paper, we model the protein backbone as an articulated structure composed of rigid units. Determining the rotation of each rigid unit gives the full protein structure. We propose solving the non-convex optimization problems using the sum-of-squares (SOS) hierarchy, a hierarchy of convex relaxations with increasing complexity and approximation power. Unlike classical global optimization approaches, SOS optimization returns a certificate of optimality if the global optimum is found. Based on the SOS method, we proposed two algorithms-RDC-SOS and RDC-NOE-SOS, that have polynomial time complexity in the number of amino-acid residues and run efficiently on a standard desktop. In many instances, the proposed methods exactly recover the solution to the original non-convex optimization problem. To the best of our knowledge this is the first time SOS relaxation is introduced to solve non-convex optimization problems in structural biology. We further introduce a statistical tool, the Cramér-Rao bound (CRB), to provide an information theoretic bound on the highest resolution one can hope to achieve when determining protein structure from noisy measurements using any unbiased estimator. Our simulation results show that when the RDC measurements are corrupted by Gaussian noise of realistic variance, both SOS based algorithms attain the CRB. We successfully apply our method in a divide-and-conquer fashion to determine the structure of ubiquitin from experimental NOE and RDC measurements obtained in two alignment media, achieving more accurate and faster reconstructions compared to the current state of the art.
1992-06-01
Malic Enzyme is a target protein for drug design because it is a key protein in the life cycle of intestinal parasites. After 2 years of effort on Earth, investigators were unable to produce any crystals that were of high enough quality and for this reason the structure of this important protein could not be determined. Crystals obtained from one STS-50 were of superior quality allowing the structure to be determined. This is just one example why access to space is so vital for these studies. Principal Investigator is Larry DeLucas.
Modeling the assembly order of multimeric heteroprotein complexes
Esquivel-Rodriguez, Juan; Terashi, Genki; Christoffer, Charles; Shin, Woong-Hee
2018-01-01
Protein-protein interactions are the cornerstone of numerous biological processes. Although an increasing number of protein complex structures have been determined using experimental methods, relatively fewer studies have been performed to determine the assembly order of complexes. In addition to the insights into the molecular mechanisms of biological function provided by the structure of a complex, knowing the assembly order is important for understanding the process of complex formation. Assembly order is also practically useful for constructing subcomplexes as a step toward solving the entire complex experimentally, designing artificial protein complexes, and developing drugs that interrupt a critical step in the complex assembly. There are several experimental methods for determining the assembly order of complexes; however, these techniques are resource-intensive. Here, we present a computational method that predicts the assembly order of protein complexes by building the complex structure. The method, named Path-LzerD, uses a multimeric protein docking algorithm that assembles a protein complex structure from individual subunit structures and predicts assembly order by observing the simulated assembly process of the complex. Benchmarked on a dataset of complexes with experimental evidence of assembly order, Path-LZerD was successful in predicting the assembly pathway for the majority of the cases. Moreover, when compared with a simple approach that infers the assembly path from the buried surface area of subunits in the native complex, Path-LZerD has the strong advantage that it can be used for cases where the complex structure is not known. The path prediction accuracy decreased when starting from unbound monomers, particularly for larger complexes of five or more subunits, for which only a part of the assembly path was correctly identified. As the first method of its kind, Path-LZerD opens a new area of computational protein structure modeling and will be an indispensable approach for studying protein complexes. PMID:29329283
Modeling the assembly order of multimeric heteroprotein complexes.
Peterson, Lenna X; Togawa, Yoichiro; Esquivel-Rodriguez, Juan; Terashi, Genki; Christoffer, Charles; Roy, Amitava; Shin, Woong-Hee; Kihara, Daisuke
2018-01-01
Protein-protein interactions are the cornerstone of numerous biological processes. Although an increasing number of protein complex structures have been determined using experimental methods, relatively fewer studies have been performed to determine the assembly order of complexes. In addition to the insights into the molecular mechanisms of biological function provided by the structure of a complex, knowing the assembly order is important for understanding the process of complex formation. Assembly order is also practically useful for constructing subcomplexes as a step toward solving the entire complex experimentally, designing artificial protein complexes, and developing drugs that interrupt a critical step in the complex assembly. There are several experimental methods for determining the assembly order of complexes; however, these techniques are resource-intensive. Here, we present a computational method that predicts the assembly order of protein complexes by building the complex structure. The method, named Path-LzerD, uses a multimeric protein docking algorithm that assembles a protein complex structure from individual subunit structures and predicts assembly order by observing the simulated assembly process of the complex. Benchmarked on a dataset of complexes with experimental evidence of assembly order, Path-LZerD was successful in predicting the assembly pathway for the majority of the cases. Moreover, when compared with a simple approach that infers the assembly path from the buried surface area of subunits in the native complex, Path-LZerD has the strong advantage that it can be used for cases where the complex structure is not known. The path prediction accuracy decreased when starting from unbound monomers, particularly for larger complexes of five or more subunits, for which only a part of the assembly path was correctly identified. As the first method of its kind, Path-LZerD opens a new area of computational protein structure modeling and will be an indispensable approach for studying protein complexes.
Pires, Mathias M.; Cantor, Maurício; Guimarães, Paulo R.; de Aguiar, Marcus A. M.; dos Reis, Sérgio F.; Coltri, Patricia P.
2015-01-01
The network structure of biological systems provides information on the underlying processes shaping their organization and dynamics. Here we examined the structure of the network depicting protein interactions within the spliceosome, the macromolecular complex responsible for splicing in eukaryotic cells. We show the interactions of less connected spliceosome proteins are nested subsets of the connections of the highly connected proteins. At the same time, the network has a modular structure with groups of proteins sharing similar interaction patterns. We then investigated the role of affinity and specificity in shaping the spliceosome network by adapting a probabilistic model originally designed to reproduce food webs. This food-web model was as successful in reproducing the structure of protein interactions as it is in reproducing interactions among species. The good performance of the model suggests affinity and specificity, partially determined by protein size and the timing of association to the complex, may be determining network structure. Moreover, because network models allow building ensembles of realistic networks while encompassing uncertainty they can be useful to examine the dynamics and vulnerability of intracelullar processes. Unraveling the mechanisms organizing the spliceosome interactions is important to characterize the role of individual proteins on splicing catalysis and regulation. PMID:26443080
Xie, Jianming [San Diego, CA; Wang, Lei [San Diego, CA; Wu, Ning [Boston, MA; Schultz, Peter G [La Jolla, CA
2008-07-15
Translation systems and other compositions including orthogonal aminoacyl tRNA-synthetases that preferentially charge an orthogonal tRNA with an iodinated or brominated amino acid are provided. Nucleic acids encoding such synthetases are also described, as are methods and kits for producing proteins including heavy atom-containing amino acids, e.g., brominated or iodinated amino acids. Methods of determining the structure of a protein, e.g., a protein into which a heavy atom has been site-specifically incorporated through use of an orthogonal tRNA/aminoacyl tRNA-synthetase pair, are also described.
Structure of the GH1 domain of guanylate kinase-associated protein from Rattus norvegicus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tong, Junsen; Yang, Huiseon; Eom, Soo Hyun
2014-09-12
Graphical abstract: - Highlights: • The crystal structure of GKAP homology domain 1 (GH1) was determined. • GKAP GH1 is a three-helix bundle connected by short flexible loops. • The predicted helix α4 associates weakly with the helix α3, suggesting dynamic nature of the GH1 domain. - Abstract: Guanylate-kinase-associated protein (GKAP) is a scaffolding protein that links NMDA receptor-PSD-95 to Shank–Homer complexes by protein–protein interactions at the synaptic junction. GKAP family proteins are characterized by the presence of a C-terminal conserved GKAP homology domain 1 (GH1) of unknown structure and function. In this study, crystal structure of the GH1 domainmore » of GKAP from Rattus norvegicus was determined in fusion with an N-terminal maltose-binding protein at 2.0 Å resolution. The structure of GKAP GH1 displays a three-helix bundle connected by short flexible loops. The predicted helix α4 which was not visible in the crystal structure associates weakly with the helix α3 suggesting dynamic nature of the GH1 domain. The strict conservation of GH1 domain across GKAP family members and the lack of a catalytic active site required for enzyme activity imply that the GH1 domain might serve as a protein–protein interaction module for the synaptic protein clustering.« less
Taking structure searches to the next dimension.
Schafferhans, Andrea; Rost, Burkhard
2014-07-08
Structure comparisons are now the first step when a new experimental high-resolution protein structure has been determined. In this issue of Structure, Wiederstein and colleagues describe their latest tool for comparing structures, which gives us the unprecedented power to discover crucial structural connections between whole complexes of proteins in the full structural database in real time. Copyright © 2014 Elsevier Ltd. All rights reserved.
Identification of helix capping and β-turn motifs from NMR chemical shifts
Shen, Yang; Bax, Ad
2012-01-01
We present an empirical method for identification of distinct structural motifs in proteins on the basis of experimentally determined backbone and 13Cβ chemical shifts. Elements identified include the N-terminal and C-terminal helix capping motifs and five types of β-turns: I, II, I′, II′ and VIII. Using a database of proteins of known structure, the NMR chemical shifts, together with the PDB-extracted amino acid preference of the helix capping and β-turn motifs are used as input data for training an artificial neural network algorithm, which outputs the statistical probability of finding each motif at any given position in the protein. The trained neural networks, contained in the MICS (motif identification from chemical shifts) program, also provide a confidence level for each of their predictions, and values ranging from ca 0.7–0.9 for the Matthews correlation coefficient of its predictions far exceed that attainable by sequence analysis. MICS is anticipated to be useful both in the conventional NMR structure determination process and for enhancing on-going efforts to determine protein structures solely on the basis of chemical shift information, where it can aid in identifying protein database fragments suitable for use in building such structures. PMID:22314702
Warepam, Marina; Sharma, Gurumayum Suraj; Dar, Tanveer Ali; Khan, Md. Khurshid Alam; Singh, Laishram Rajendrakumar
2014-01-01
Osmolytes are low molecular weight organic molecules accumulated by organisms to assist proper protein folding, and to provide protection to the structural integrity of proteins under denaturing stress conditions. It is known that osmolyte-induced protein folding is brought by unfavorable interaction of osmolytes with the denatured/unfolded states. The interaction of osmolyte with the native state does not significantly contribute to the osmolyte-induced protein folding. We have therefore investigated if different denatured states of a protein (generated by different denaturing agents) interact differently with the osmolytes to induce protein folding. We observed that osmolyte-assisted refolding of protein obtained from heat-induced denatured state produces native molecules with higher enzyme activity than those initiated from GdmCl- or urea-induced denatured state indicating that the structural property of the initial denatured state during refolding by osmolytes determines the catalytic efficiency of the folded protein molecule. These conclusions have been reached from the systematic measurements of enzymatic kinetic parameters (K m and k cat), thermodynamic stability (T m and ΔH m) and secondary and tertiary structures of the folded native proteins obtained from refolding of various denatured states (due to heat-, urea- and GdmCl-induced denaturation) of RNase-A in the presence of various osmolytes. PMID:25313668
Laskowski, Roman A
2009-01-01
PDBsum (http://www.ebi.ac.uk/pdbsum) provides summary information about each experimentally determined structural model in the Protein Data Bank (PDB). Here we describe some of its most recent features, including figures from the structure's key reference, citation data, Pfam domain diagrams, topology diagrams and protein-protein interactions. Furthermore, it now accepts users' own PDB format files and generates a private set of analyses for each uploaded structure.
Protein classification using sequential pattern mining.
Exarchos, Themis P; Papaloukas, Costas; Lampros, Christos; Fotiadis, Dimitrios I
2006-01-01
Protein classification in terms of fold recognition can be employed to determine the structural and functional properties of a newly discovered protein. In this work sequential pattern mining (SPM) is utilized for sequence-based fold recognition. One of the most efficient SPM algorithms, cSPADE, is employed for protein primary structure analysis. Then a classifier uses the extracted sequential patterns for classifying proteins of unknown structure in the appropriate fold category. The proposed methodology exhibited an overall accuracy of 36% in a multi-class problem of 17 candidate categories. The classification performance reaches up to 65% when the three most probable protein folds are considered.
Fusion proteins as alternate crystallization paths to difficult structure problems
NASA Technical Reports Server (NTRS)
Carter, Daniel C.; Rueker, Florian; Ho, Joseph X.; Lim, Kap; Keeling, Kim; Gilliland, Gary; Ji, Xinhua
1994-01-01
The three-dimensional structure of a peptide fusion product with glutathione transferase from Schistosoma japonicum (SjGST) has been solved by crystallographic methods to 2.5 A resolution. Peptides or proteins can be fused to SjGST and expressed in a plasmid for rapid synthesis in Escherichia coli. Fusion proteins created by this commercial method can be purified rapidly by chromatography on immobilized glutathione. The potential utility of using SjGST fusion proteins as alternate paths to the crystallization and structure determination of proteins is demonstrated.
Advances in Homology Protein Structure Modeling
Xiang, Zhexin
2007-01-01
Homology modeling plays a central role in determining protein structure in the structural genomics project. The importance of homology modeling has been steadily increasing because of the large gap that exists between the overwhelming number of available protein sequences and experimentally solved protein structures, and also, more importantly, because of the increasing reliability and accuracy of the method. In fact, a protein sequence with over 30% identity to a known structure can often be predicted with an accuracy equivalent to a low-resolution X-ray structure. The recent advances in homology modeling, especially in detecting distant homologues, aligning sequences with template structures, modeling of loops and side chains, as well as detecting errors in a model, have contributed to reliable prediction of protein structure, which was not possible even several years ago. The ongoing efforts in solving protein structures, which can be time-consuming and often difficult, will continue to spur the development of a host of new computational methods that can fill in the gap and further contribute to understanding the relationship between protein structure and function. PMID:16787261
Solution structure of an antifreeze protein CfAFP-501 from Choristoneura fumiferana.
Li, Congmin; Guo, Xianrong; Jia, Zongchao; Xia, Bin; Jin, Changwen
2005-07-01
Antifreeze proteins (AFPs) are widely employed by various organisms as part of their overwintering survival strategy. AFPs have the unique ability to suppress the freezing point of aqueous solution and inhibit ice recrystallization through binding to the ice seed crystals and restricting their growth. The solution structure of CfAFP-501 from spruce budworm has been determined by NMR spectroscopy. Our result demonstrates that CfAFP-501 retains its rigid and highly regular structure in solution. Overall, the solution structure is similar to the crystal structure except the N- and C-terminal regions. NMR spin-relaxation experiments further indicate the overall rigidity of the protein and identify a collection of residues with greater flexibilities. Furthermore, Pro91 shows a cis conformation in solution instead of the trans conformation determined in the crystal structure.
Zemla, Adam T; Lang, Dorothy M; Kostova, Tanya; Andino, Raul; Ecale Zhou, Carol L
2011-06-02
Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory--still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could help overcome these difficulties by facilitating the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV (structure-alignment sequence variability), a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus, and we demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique, or that share structural similarity with proteins that would be considered distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local structural alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein. StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected residues at a given sequence position. StralSV is provided as a web service at http://proteinmodel.org/AS2TS/STRALSV/.
Lefebvre, Benoit; Batoko, Henri; Duby, Geoffrey; Boutry, Marc
2004-07-01
The structural determinants involved in the targeting of multitransmembrane-span proteins to the plasma membrane (PM) remain poorly understood. The plasma membrane H+ -ATPase (PMA) from Nicotiana plumbaginifolia, a well-characterized 10 transmembrane-span enzyme, was used as a model to identify structural elements essential for targeting to the PM. When PMA2 and PMA4, representatives of the two main PMA subfamilies, were fused to green fluorescent protein (GFP), the chimeras were shown to be still functional and to be correctly and rapidly targeted to the PM in transgenic tobacco. By contrast, chimeric proteins containing various combinations of PMA transmembrane spanning domains accumulated in the Golgi apparatus and not in the PM and displayed slow traffic properties through the secretory pathway. Individual deletion of three of the four cytosolic domains did not prevent PM targeting, but deletion of the large loop or of its nucleotide binding domain resulted in GFP fluorescence accumulating exclusively in the endoplasmic reticulum. The results show that, at least for this polytopic protein, the PM is not the default pathway and that, in contrast with single-pass membrane proteins, cytosolic structural determinants are required for correct targeting.
Lefebvre, Benoit; Batoko, Henri; Duby, Geoffrey; Boutry, Marc
2004-01-01
The structural determinants involved in the targeting of multitransmembrane-span proteins to the plasma membrane (PM) remain poorly understood. The plasma membrane H+-ATPase (PMA) from Nicotiana plumbaginifolia, a well-characterized 10 transmembrane–span enzyme, was used as a model to identify structural elements essential for targeting to the PM. When PMA2 and PMA4, representatives of the two main PMA subfamilies, were fused to green fluorescent protein (GFP), the chimeras were shown to be still functional and to be correctly and rapidly targeted to the PM in transgenic tobacco. By contrast, chimeric proteins containing various combinations of PMA transmembrane spanning domains accumulated in the Golgi apparatus and not in the PM and displayed slow traffic properties through the secretory pathway. Individual deletion of three of the four cytosolic domains did not prevent PM targeting, but deletion of the large loop or of its nucleotide binding domain resulted in GFP fluorescence accumulating exclusively in the endoplasmic reticulum. The results show that, at least for this polytopic protein, the PM is not the default pathway and that, in contrast with single-pass membrane proteins, cytosolic structural determinants are required for correct targeting. PMID:15208389
Brodie, Nicholas I.; Popov, Konstantin I.; Petrotchenko, Evgeniy V.; Dokholyan, Nikolay V.; Borchers, Christoph H.
2017-01-01
We present an integrated experimental and computational approach for de novo protein structure determination in which short-distance cross-linking data are incorporated into rapid discrete molecular dynamics (DMD) simulations as constraints, reducing the conformational space and achieving the correct protein folding on practical time scales. We tested our approach on myoglobin and FK506 binding protein—models for α helix–rich and β sheet–rich proteins, respectively—and found that the lowest-energy structures obtained were in agreement with the crystal structure, hydrogen-deuterium exchange, surface modification, and long-distance cross-linking validation data. Our approach is readily applicable to other proteins with unknown structures. PMID:28695211
Automatic protein structure solution from weak X-ray data
NASA Astrophysics Data System (ADS)
Skubák, Pavol; Pannu, Navraj S.
2013-11-01
Determining new protein structures from X-ray diffraction data at low resolution or with a weak anomalous signal is a difficult and often an impossible task. Here we propose a multivariate algorithm that simultaneously combines the structure determination steps. In tests on over 140 real data sets from the protein data bank, we show that this combined approach can automatically build models where current algorithms fail, including an anisotropically diffracting 3.88 Å RNA polymerase II data set. The method seamlessly automates the process, is ideal for non-specialists and provides a mathematical framework for successfully combining various sources of information in image processing.
ERIC Educational Resources Information Center
National Inst. of General Medical Sciences (NIH), Bethesda, MD.
This booklet, geared toward an advanced high school or early college-level audience, explains how structural biology provides insight into health and disease and is useful in developing new medications. This publication contains a general introduction to proteins, coverage of the techniques used to determine protein structures, and a chapter on…
Application of far-infrared spectroscopy to the structural identification of protein materials.
Han, Yanchen; Ling, Shengjie; Qi, Zeming; Shao, Zhengzhong; Chen, Xin
2018-05-03
Although far-infrared (IR) spectroscopy has been shown to be a powerful tool to determine peptide structure and to detect structural transitions in peptides, it has been overlooked in the characterization of proteins. Herein, we used far-IR spectroscopy to monitor the structure of four abundant non-bioactive proteins, namely, soybean protein isolate (SPI), pea protein isolate (PPI) and two types of silk fibroins (SFs), domestic Bombyx mori and wild Antheraea pernyi. The two globular proteins SPI and PPI result in broad and weak far-IR bands (between 50 and 700 cm-1), in agreement with those of some other bioactive globular proteins previously studied (lysozyme, myoglobin, hemoglobin, etc.) that generally only have random amino acid sequences. Interestingly, the two SFs, which are characterized by a structure composed of highly repetitive motifs, show several sharp far-IR characteristic absorption peaks. Moreover, some of these characteristic peaks (such as the peaks at 260 and 428 cm-1 in B. mori, and the peaks at 245 and 448 cm-1 in A. pernyi) are sensitive to conformational changes; hence, they can be directly used to monitor conformational transitions in SFs. Furthermore, since SF absorption bands clearly differ from those of globular proteins and different SFs even show distinct adsorption bands, far-IR spectroscopy can be applied to distinguish and determine the specific SF component within protein blends.
Lee, Woonghee; Kim, Jin Hae; Westler, William M.; Markley, John L.
2011-01-01
Summary: PONDEROSA (Peak-picking Of Noe Data Enabled by Restriction of Shift Assignments) accepts input information consisting of a protein sequence, backbone and sidechain NMR resonance assignments, and 3D-NOESY (13C-edited and/or 15N-edited) spectra, and returns assignments of NOESY crosspeaks, distance and angle constraints, and a reliable NMR structure represented by a family of conformers. PONDEROSA incorporates and integrates external software packages (TALOS+, STRIDE and CYANA) to carry out different steps in the structure determination. PONDEROSA implements internal functions that identify and validate NOESY peak assignments and assess the quality of the calculated three-dimensional structure of the protein. The robustness of the analysis results from PONDEROSA's hierarchical processing steps that involve iterative interaction among the internal and external modules. PONDEROSA supports a variety of input formats: SPARKY assignment table (.shifts) and spectrum file formats (.ucsf), XEASY proton file format (.prot), and NMR-STAR format (.star). To demonstrate the utility of PONDEROSA, we used the package to determine 3D structures of two proteins: human ubiquitin and Escherichia coli iron-sulfur scaffold protein variant IscU(D39A). The automatically generated structural constraints and ensembles of conformers were as good as or better than those determined previously by much less automated means. Availability: The program, in the form of binary code along with tutorials and reference manuals, is available at http://ponderosa.nmrfam.wisc.edu/. Contact: whlee@nmrfam.wisc.edu; markley@nmrfam.wisc.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:21511715
What are the structural features that drive partitioning of proteins in aqueous two-phase systems?
Wu, Zhonghua; Hu, Gang; Wang, Kui; Zaslavsky, Boris Yu; Kurgan, Lukasz; Uversky, Vladimir N
2017-01-01
Protein partitioning in aqueous two-phase systems (ATPSs) represents a convenient, inexpensive, and easy to scale-up protein separation technique. Since partition behavior of a protein dramatically depends on an ATPS composition, it would be highly beneficial to have reliable means for (even qualitative) prediction of partitioning of a target protein under different conditions. Our aim was to understand which structural features of proteins contribute to partitioning of a query protein in a given ATPS. We undertook a systematic empirical analysis of relations between 57 numerical structural descriptors derived from the corresponding amino acid sequences and crystal structures of 10 well-characterized proteins and the partition behavior of these proteins in 29 different ATPSs. This analysis revealed that just a few structural characteristics of proteins can accurately determine behavior of these proteins in a given ATPS. However, partition behavior of proteins in different ATPSs relies on different structural features. In other words, we could not find a unique set of protein structural features derived from their crystal structures that could be used for the description of the protein partition behavior of all proteins in all ATPSs analyzed in this study. We likely need to gain better insight into relationships between protein-solvent interactions and protein structure peculiarities, in particular given limitations of the used here crystal structures, to be able to construct a model that accurately predicts protein partition behavior across all ATPSs. Copyright © 2016 Elsevier B.V. All rights reserved.
Determination and Quantification of Molecular Interactions in Protein Films: A Review.
Hammann, Felicia; Schmid, Markus
2014-12-10
Protein based films are nowadays also prepared with the aim of replacing expensive, crude oil-based polymers as environmentally friendly and renewable alternatives. The protein structure determines the ability of protein chains to form intra- and intermolecular bonds, whereas the degree of cross-linking depends on the amino acid composition and molecular weight of the protein, besides the conditions used in film preparation and processing. The functionality varies significantly depending on the type of protein and affects the resulting film quality and properties. This paper reviews the methods used in examination of molecular interactions in protein films and discusses how these intermolecular interactions can be quantified. The qualitative determination methods can be distinguished by structural analysis of solutions (electrophoretic analysis, size exclusion chromatography) and analysis of solid films (spectroscopy techniques, X-ray scattering methods). To quantify molecular interactions involved, two methods were found to be the most suitable: protein film swelling and solubility. The importance of non-covalent and covalent interactions in protein films can be investigated using different solvents. The research was focused on whey protein, whereas soy protein and wheat gluten were included as further examples of proteins.
Determination Quantification of Molecular Interactions in Protein Films: A Review
Hammann, Felicia; Schmid, Markus
2014-01-01
Protein based films are nowadays also prepared with the aim of replacing expensive, crude oil-based polymers as environmentally friendly and renewable alternatives. The protein structure determines the ability of protein chains to form intra- and intermolecular bonds, whereas the degree of cross-linking depends on the amino acid composition and molecular weight of the protein, besides the conditions used in film preparation and processing. The functionality varies significantly depending on the type of protein and affects the resulting film quality and properties. This paper reviews the methods used in examination of molecular interactions in protein films and discusses how these intermolecular interactions can be quantified. The qualitative determination methods can be distinguished by structural analysis of solutions (electrophoretic analysis, size exclusion chromatography) and analysis of solid films (spectroscopy techniques, X-ray scattering methods). To quantify molecular interactions involved, two methods were found to be the most suitable: protein film swelling and solubility. The importance of non-covalent and covalent interactions in protein films can be investigated using different solvents. The research was focused on whey protein, whereas soy protein and wheat gluten were included as further examples of proteins. PMID:28788285
Structures of BIR domains from human NAIP and cIAP2.
Herman, Maria Dolores; Moche, Martin; Flodin, Susanne; Welin, Martin; Trésaugues, Lionel; Johansson, Ida; Nilsson, Martina; Nordlund, Pär; Nyman, Tomas
2009-11-01
The inhibitor of apoptosis (IAP) family of proteins contains key modulators of apoptosis and inflammation that interact with caspases through baculovirus IAP-repeat (BIR) domains. Overexpression of IAP proteins frequently occurs in cancer cells, thus counteracting the activated apoptotic program. The IAP proteins have therefore emerged as promising targets for cancer therapy. In this work, X-ray crystallography was used to determine the first structures of BIR domains from human NAIP and cIAP2. Both structures harbour an N-terminal tetrapeptide in the conserved peptide-binding groove. The structures reveal that these two proteins bind the tetrapeptides in a similar mode as do other BIR domains. Detailed interactions are described for the P1'-P4' side chains of the peptide, providing a structural basis for peptide-specific recognition. An arginine side chain in the P3' position reveals favourable interactions with its hydrophobic moiety in the binding pocket, while hydrophobic residues in the P2' and P4' pockets make similar interactions to those seen in other BIR domain-peptide complexes. The structures also reveal how a serine in the P1' position is accommodated in the binding pockets of NAIP and cIAP2. In addition to shedding light on the specificity determinants of these two proteins, the structures should now also provide a framework for future structure-based work targeting these proteins.
Structures of BIR domains from human NAIP and cIAP2
Herman, Maria Dolores; Moche, Martin; Flodin, Susanne; Welin, Martin; Trésaugues, Lionel; Johansson, Ida; Nilsson, Martina; Nordlund, Pär; Nyman, Tomas
2009-01-01
The inhibitor of apoptosis (IAP) family of proteins contains key modulators of apoptosis and inflammation that interact with caspases through baculovirus IAP-repeat (BIR) domains. Overexpression of IAP proteins frequently occurs in cancer cells, thus counteracting the activated apoptotic program. The IAP proteins have therefore emerged as promising targets for cancer therapy. In this work, X-ray crystallography was used to determine the first structures of BIR domains from human NAIP and cIAP2. Both structures harbour an N-terminal tetrapeptide in the conserved peptide-binding groove. The structures reveal that these two proteins bind the tetrapeptides in a similar mode as do other BIR domains. Detailed interactions are described for the P1′–P4′ side chains of the peptide, providing a structural basis for peptide-specific recognition. An arginine side chain in the P3′ position reveals favourable interactions with its hydrophobic moiety in the binding pocket, while hydrophobic residues in the P2′ and P4′ pockets make similar interactions to those seen in other BIR domain–peptide complexes. The structures also reveal how a serine in the P1′ position is accommodated in the binding pockets of NAIP and cIAP2. In addition to shedding light on the specificity determinants of these two proteins, the structures should now also provide a framework for future structure-based work targeting these proteins. PMID:19923725
Structure and Sequence Search on Aptamer-Protein Docking
NASA Astrophysics Data System (ADS)
Xiao, Jiajie; Bonin, Keith; Guthold, Martin; Salsbury, Freddie
2015-03-01
Interactions between proteins and deoxyribonucleic acid (DNA) play a significant role in the living systems, especially through gene regulation. However, short nucleic acids sequences (aptamers) with specific binding affinity to specific proteins exhibit clinical potential as therapeutics. Our capillary and gel electrophoresis selection experiments show that specific sequences of aptamers can be selected that bind specific proteins. Computationally, given the experimentally-determined structure and sequence of a thrombin-binding aptamer, we can successfully dock the aptamer onto thrombin in agreement with experimental structures of the complex. In order to further study the conformational flexibility of this thrombin-binding aptamer and to potentially develop a predictive computational model of aptamer-binding, we use GPU-enabled molecular dynamics simulations to both examine the conformational flexibility of the aptamer in the absence of binding to thrombin, and to determine our ability to fold an aptamer. This study should help further de-novo predictions of aptamer sequences by enabling the study of structural and sequence-dependent effects on aptamer-protein docking specificity.
Lee, Woonghee; Stark, Jaime L; Markley, John L
2014-11-01
Peak-picking Of Noe Data Enabled by Restriction Of Shift Assignments-Client Server (PONDEROSA-C/S) builds on the original PONDEROSA software (Lee et al. in Bioinformatics 27:1727-1728. doi: 10.1093/bioinformatics/btr200, 2011) and includes improved features for structure calculation and refinement. PONDEROSA-C/S consists of three programs: Ponderosa Server, Ponderosa Client, and Ponderosa Analyzer. PONDEROSA-C/S takes as input the protein sequence, a list of assigned chemical shifts, and nuclear Overhauser data sets ((13)C- and/or (15)N-NOESY). The output is a set of assigned NOEs and 3D structural models for the protein. Ponderosa Analyzer supports the visualization, validation, and refinement of the results from Ponderosa Server. These tools enable semi-automated NMR-based structure determination of proteins in a rapid and robust fashion. We present examples showing the use of PONDEROSA-C/S in solving structures of four proteins: two that enable comparison with the original PONDEROSA package, and two from the Critical Assessment of automated Structure Determination by NMR (Rosato et al. in Nat Methods 6:625-626. doi: 10.1038/nmeth0909-625 , 2009) competition. The software package can be downloaded freely in binary format from http://pine.nmrfam.wisc.edu/download_packages.html. Registered users of the National Magnetic Resonance Facility at Madison can submit jobs to the PONDEROSA-C/S server at http://ponderosa.nmrfam.wisc.edu, where instructions, tutorials, and instructions can be found. Structures are normally returned within 1-2 days.
Zeng, Jianyang; Zhou, Pei; Donald, Bruce Randall
2011-01-01
One bottleneck in NMR structure determination lies in the laborious and time-consuming process of side-chain resonance and NOE assignments. Compared to the well-studied backbone resonance assignment problem, automated side-chain resonance and NOE assignments are relatively less explored. Most NOE assignment algorithms require nearly complete side-chain resonance assignments from a series of through-bond experiments such as HCCH-TOCSY or HCCCONH. Unfortunately, these TOCSY experiments perform poorly on large proteins. To overcome this deficiency, we present a novel algorithm, called NASCA (NOE Assignment and Side-Chain Assignment), to automate both side-chain resonance and NOE assignments and to perform high-resolution protein structure determination in the absence of any explicit through-bond experiment to facilitate side-chain resonance assignment, such as HCCH-TOCSY. After casting the assignment problem into a Markov Random Field (MRF), NASCA extends and applies combinatorial protein design algorithms to compute optimal assignments that best interpret the NMR data. The MRF captures the contact map information of the protein derived from NOESY spectra, exploits the backbone structural information determined by RDCs, and considers all possible side-chain rotamers. The complexity of the combinatorial search is reduced by using a dead-end elimination (DEE) algorithm, which prunes side-chain resonance assignments that are provably not part of the optimal solution. Then an A* search algorithm is employed to find a set of optimal side-chain resonance assignments that best fit the NMR data. These side-chain resonance assignments are then used to resolve the NOE assignment ambiguity and compute high-resolution protein structures. Tests on five proteins show that NASCA assigns resonances for more than 90% of side-chain protons, and achieves about 80% correct assignments. The final structures computed using the NOE distance restraints assigned by NASCA have backbone RMSD 0.8 – 1.5 Å from the reference structures determined by traditional NMR approaches. PMID:21706248
Takeda, Mitsuhiro; Chang, Chung-ke; Ikeya, Teppei; Güntert, Peter; Chang, Yuan-hsiang; Hsu, Yen-lan; Huang, Tai-huang; Kainosho, Masatsune
2008-07-18
The C-terminal domain (CTD) of the severe acute respiratory syndrome coronavirus (SARS-CoV) nucleocapsid protein (NP) contains a potential RNA-binding region in its N-terminal portion and also serves as a dimerization domain by forming a homodimer with a molecular mass of 28 kDa. So far, the structure determination of the SARS-CoV NP CTD in solution has been impeded by the poor quality of NMR spectra, especially for aromatic resonances. We have recently developed the stereo-array isotope labeling (SAIL) method to overcome the size problem of NMR structure determination by utilizing a protein exclusively composed of stereo- and regio-specifically isotope-labeled amino acids. Here, we employed the SAIL method to determine the high-quality solution structure of the SARS-CoV NP CTD by NMR. The SAIL protein yielded less crowded and better resolved spectra than uniform (13)C and (15)N labeling, and enabled the homodimeric solution structure of this protein to be determined. The NMR structure is almost identical with the previously solved crystal structure, except for a disordered putative RNA-binding domain at the N-terminus. Studies of the chemical shift perturbations caused by the binding of single-stranded DNA and mutational analyses have identified the disordered region at the N-termini as the prime site for nucleic acid binding. In addition, residues in the beta-sheet region also showed significant perturbations. Mapping of the locations of these residues onto the helical model observed in the crystal revealed that these two regions are parts of the interior lining of the positively charged helical groove, supporting the hypothesis that the helical oligomer may form in solution.
Karlsen, Morten L; Thorsen, Thor S; Johner, Niklaus; Ammendrup-Johnsen, Ina; Erlendsson, Simon; Tian, Xinsheng; Simonsen, Jens B; Høiberg-Nielsen, Rasmus; Christensen, Nikolaj M; Khelashvili, George; Streicher, Werner; Teilum, Kaare; Vestergaard, Bente; Weinstein, Harel; Gether, Ulrik; Arleth, Lise; Madsen, Kenneth L
2015-07-07
PICK1 is a neuronal scaffolding protein containing a PDZ domain and an auto-inhibited BAR domain. BAR domains are membrane-sculpting protein modules generating membrane curvature and promoting membrane fission. Previous data suggest that BAR domains are organized in lattice-like arrangements when stabilizing membranes but little is known about structural organization of BAR domains in solution. Through a small-angle X-ray scattering (SAXS) analysis, we determine the structure of dimeric and tetrameric complexes of PICK1 in solution. SAXS and biochemical data reveal a strong propensity of PICK1 to form higher-order structures, and SAXS analysis suggests an offset, parallel mode of BAR-BAR oligomerization. Furthermore, unlike accessory domains in other BAR domain proteins, the positioning of the PDZ domains is flexible, enabling PICK1 to perform long-range, dynamic scaffolding of membrane-associated proteins. Together with functional data, these structural findings are compatible with a model in which oligomerization governs auto-inhibition of BAR domain function. Copyright © 2015 Elsevier Ltd. All rights reserved.
Time-resolved structural studies with serial crystallography: A new light on retinal proteins
Panneels, Valérie; Wu, Wenting; Tsai, Ching-Ju; Nogly, Przemek; Rheinberger, Jan; Jaeger, Kathrin; Cicchetti, Gregor; Gati, Cornelius; Kick, Leonhard M.; Sala, Leonardo; Capitani, Guido; Milne, Chris; Padeste, Celestino; Pedrini, Bill; Li, Xiao-Dan; Standfuss, Jörg; Abela, Rafael; Schertler, Gebhard
2015-01-01
Structural information of the different conformational states of the two prototypical light-sensitive membrane proteins, bacteriorhodopsin and rhodopsin, has been obtained in the past by X-ray cryo-crystallography and cryo-electron microscopy. However, these methods do not allow for the structure determination of most intermediate conformations. Recently, the potential of X-Ray Free Electron Lasers (X-FELs) for tracking the dynamics of light-triggered processes by pump-probe serial femtosecond crystallography has been demonstrated using 3D-micron-sized crystals. In addition, X-FELs provide new opportunities for protein 2D-crystal diffraction, which would allow to observe the course of conformational changes of membrane proteins in a close-to-physiological lipid bilayer environment. Here, we describe the strategies towards structural dynamic studies of retinal proteins at room temperature, using injector or fixed-target based serial femtosecond crystallography at X-FELs. Thanks to recent progress especially in sample delivery methods, serial crystallography is now also feasible at synchrotron X-ray sources, thus expanding the possibilities for time-resolved structure determination. PMID:26798817
Combining functional and structural genomics to sample the essential Burkholderia structome.
Baugh, Loren; Gallagher, Larry A; Patrapuvich, Rapatbhorn; Clifton, Matthew C; Gardberg, Anna S; Edwards, Thomas E; Armour, Brianna; Begley, Darren W; Dieterich, Shellie H; Dranow, David M; Abendroth, Jan; Fairman, James W; Fox, David; Staker, Bart L; Phan, Isabelle; Gillespie, Angela; Choi, Ryan; Nakazawa-Hewitt, Steve; Nguyen, Mary Trang; Napuli, Alberto; Barrett, Lynn; Buchko, Garry W; Stacy, Robin; Myler, Peter J; Stewart, Lance J; Manoil, Colin; Van Voorhis, Wesley C
2013-01-01
The genus Burkholderia includes pathogenic gram-negative bacteria that cause melioidosis, glanders, and pulmonary infections of patients with cancer and cystic fibrosis. Drug resistance has made development of new antimicrobials critical. Many approaches to discovering new antimicrobials, such as structure-based drug design and whole cell phenotypic screens followed by lead refinement, require high-resolution structures of proteins essential to the parasite. We experimentally identified 406 putative essential genes in B. thailandensis, a low-virulence species phylogenetically similar to B. pseudomallei, the causative agent of melioidosis, using saturation-level transposon mutagenesis and next-generation sequencing (Tn-seq). We selected 315 protein products of these genes based on structure-determination criteria, such as excluding very large and/or integral membrane proteins, and entered them into the Seattle Structural Genomics Center for Infection Disease (SSGCID) structure determination pipeline. To maximize structural coverage of these targets, we applied an "ortholog rescue" strategy for those producing insoluble or difficult to crystallize proteins, resulting in the addition of 387 orthologs (or paralogs) from seven other Burkholderia species into the SSGCID pipeline. This structural genomics approach yielded structures from 31 putative essential targets from B. thailandensis, and 25 orthologs from other Burkholderia species, yielding an overall structural coverage for 49 of the 406 essential gene families, with a total of 88 depositions into the Protein Data Bank. Of these, 25 proteins have properties of a potential antimicrobial drug target i.e., no close human homolog, part of an essential metabolic pathway, and a deep binding pocket. We describe the structures of several potential drug targets in detail. This collection of structures, solubility and experimental essentiality data provides a resource for development of drugs against infections and diseases caused by Burkholderia. All expression clones and proteins created in this study are freely available by request.
Integrative Structure Determination of Protein Assemblies by Satisfaction of Spatial Restraints
NASA Astrophysics Data System (ADS)
Alber, Frank; Chait, Brian T.; Rout, Michael P.; Sali, Andrej
To understand the cell, we need to determine the structures of macromolecular assemblies, many of which consist of tens to hundreds of components. A great variety of experimental data can be used to characterize the assemblies at several levels of resolution, from atomic structures to component configurations. To maximize completeness, resolution, accuracy, precision and efficiency of the structure determination, a computational approach is needed that can use spatial information from a variety of experimental methods. We propose such an approach, defined by its three main components: a hierarchical representation of the assembly, a scoring function consisting of spatial restraints derived from experimental data, and an optimization method that generates structures consistent with the data. We illustrate the approach by determining the configuration of the 456 proteins in the nuclear pore complex from Baker's yeast.
Goonesekere, Nalin Cw
2009-01-01
The large numbers of protein sequences generated by whole genome sequencing projects require rapid and accurate methods of annotation. The detection of homology through computational sequence analysis is a powerful tool in determining the complex evolutionary and functional relationships that exist between proteins. Homology search algorithms employ amino acid substitution matrices to detect similarity between proteins sequences. The substitution matrices in common use today are constructed using sequences aligned without reference to protein structure. Here we present amino acid substitution matrices constructed from the alignment of a large number of protein domain structures from the structural classification of proteins (SCOP) database. We show that when incorporated into the homology search algorithms BLAST and PSI-blast, the structure-based substitution matrices enhance the efficacy of detecting remote homologs.
Kinjo, Akira R.; Bekker, Gert-Jan; Suzuki, Hirofumi; Tsuchiya, Yuko; Kawabata, Takeshi; Ikegawa, Yasuyo; Nakamura, Haruki
2017-01-01
The Protein Data Bank Japan (PDBj, http://pdbj.org), a member of the worldwide Protein Data Bank (wwPDB), accepts and processes the deposited data of experimentally determined macromolecular structures. While maintaining the archive in collaboration with other wwPDB partners, PDBj also provides a wide range of services and tools for analyzing structures and functions of proteins. We herein outline the updated web user interfaces together with RESTful web services and the backend relational database that support the former. To enhance the interoperability of the PDB data, we have previously developed PDB/RDF, PDB data in the Resource Description Framework (RDF) format, which is now a wwPDB standard called wwPDB/RDF. We have enhanced the connectivity of the wwPDB/RDF data by incorporating various external data resources. Services for searching, comparing and analyzing the ever-increasing large structures determined by hybrid methods are also described. PMID:27789697
Khafizov, Kamil; Madrid-Aliste, Carlos; Almo, Steven C; Fiser, Andras
2014-03-11
The exponential growth of protein sequence data provides an ever-expanding body of unannotated and misannotated proteins. The National Institutes of Health-supported Protein Structure Initiative and related worldwide structural genomics efforts facilitate functional annotation of proteins through structural characterization. Recently there have been profound changes in the taxonomic composition of sequence databases, which are effectively redefining the scope and contribution of these large-scale structure-based efforts. The faster-growing bacterial genomic entries have overtaken the eukaryotic entries over the last 5 y, but also have become more redundant. Despite the enormous increase in the number of sequences, the overall structural coverage of proteins--including proteins for which reliable homology models can be generated--on the residue level has increased from 30% to 40% over the last 10 y. Structural genomics efforts contributed ∼50% of this new structural coverage, despite determining only ∼10% of all new structures. Based on current trends, it is expected that ∼55% structural coverage (the level required for significant functional insight) will be achieved within 15 y, whereas without structural genomics efforts, realizing this goal will take approximately twice as long.
Pulse - Accelerator Science in Medicine
the structure of biological molecules. They use the energy that charged particles emit when powerful than the sun and focused on a pinpoint. Deciphering the structure of proteins is key to understanding biological processes and healing disease. To determine a proteinÂs structure, researchers direct
Mote, Kaustubh R.; Gopinath, T.; Veglia, Gianluigi
2013-01-01
The low sensitivity inherent to both the static and magic angle spinning techniques of solid-state NMR (ssNMR) spectroscopy has thus far limited the routine application of multidimensional experiments to determine the structure of membrane proteins in lipid bilayers. Here, we demonstrate the advantage of using a recently developed class of experiments, polarization optimized experiments (POE), for both static and MAS spectroscopy to achieve higher sensitivity and substantial time-savings for 2D and 3D experiments. We used sarcolipin, a single pass membrane protein, reconstituted in oriented bicelles (for oriented ssNMR) and multilamellar vesicles (for MAS ssNMR) as a benchmark. The restraints derived by these experiments are then combined into a hybrid energy function to allow simultaneous determination of structure and topology. The resulting structural ensemble converged to a helical conformation with a backbone RMSD ∼ 0.44 Å, a tilt angle of 24° ± 1°, and an azimuthal angle of 55° ± 6°. This work represents a crucial first step toward obtaining high-resolution structures of large membrane proteins using combined multidimensional O-ssNMR and MAS-ssNMR. PMID:23963722
Diehl, Carl; Wisniewska, Magdalena; Frick, Inga-Maria; Streicher, Werner; Björck, Lars; Malmström, Johan; Wikström, Mats
2016-01-01
Streptococcus pyogenes is one of the most significant bacterial pathogens in the human population mostly causing superficial and uncomplicated infections (pharyngitis and impetigo) but also invasive and life-threatening disease. We have previously identified a virulence determinant, protein sHIP, which is secreted at higher levels by an invasive compared to a non-invasive strain of S. pyogenes. The present work presents a further characterization of the structural and functional properties of this bacterial protein. Biophysical and structural studies have shown that protein sHIP forms stable tetramers both in the crystal and in solution. The tetramers are composed of four helix-loop-helix motifs with the loop regions connecting the helices displaying a high degree of flexibility. Owing to interactions at the tetramer interface, the observed tetramer can be described as a dimer of dimers. We identified three residues at the tetramer interface (Leu84, Leu88, Tyr95), which due to largely non-polar side-chains, could be important determinants for protein oligomerization. Based on these observations, we produced a sHIP variant in which these residues were mutated to alanines. Biophysical experiments clearly indicated that the sHIP mutant appear only as dimers in solution confirming the importance of the interfacial residues for protein oligomerisation. Furthermore, we could show that the sHIP mutant interacts with intact histidine-rich glycoprotein (HRG) and the histidine-rich repeats in HRG, and inhibits their antibacterial activity to the same or even higher extent as compared to the wild type protein sHIP. We determined the crystal structure of the sHIP mutant, which, as a result of the high quality of the data, allowed us to improve the existing structural model of the protein. Finally, by employing NMR spectroscopy in solution, we generated a model for the complex between the sHIP mutant and an HRG-derived heparin-binding peptide, providing further molecular details into the interactions involving protein sHIP.
Gibbons, Don L.; Reilly, Brigid; Ahn, Anna; Vaney, Marie-Christine; Vigouroux, Armelle; Rey, Felix A.; Kielian, Margaret
2004-01-01
The fusion proteins of the alphaviruses and flaviviruses have a similar native structure and convert to a highly stable homotrimer conformation during the fusion of the viral and target membranes. The properties of the alpha- and flavivirus fusion proteins distinguish them from the class I viral fusion proteins, such as influenza virus hemagglutinin, and establish them as the first members of the class II fusion proteins. Understanding how this new class carries out membrane fusion will require analysis of the structural basis for both the interaction of the protein subunits within the homotrimer and their interaction with the viral and target membranes. To this end we report a purification method for the E1 ectodomain homotrimer from the alphavirus Semliki Forest virus. The purified protein is trimeric, detergent soluble, retains the characteristic stability of the starting homotrimer, and is free of lipid and other contaminants. In contrast to the postfusion structures that have been determined for the class I proteins, the E1 homotrimer contains the fusion peptide region responsible for interaction with target membranes. This E1 trimer preparation is an excellent candidate for structural studies of the class II viral fusion proteins, and we report conditions that generate three-dimensional crystals suitable for analysis by X-ray diffraction. Determination of the structure will provide our first high-resolution views of both the low-pH-induced trimeric conformation and the target membrane-interacting region of the alphavirus fusion protein. PMID:15016874
Structure-Functional Basis of Ion Transport in Sodium–Calcium Exchanger (NCX) Proteins
Giladi, Moshe; Shor, Reut; Lisnyansky, Michal; Khananshvili, Daniel
2016-01-01
The membrane-bound sodium–calcium exchanger (NCX) proteins shape Ca2+ homeostasis in many cell types, thus participating in a wide range of physiological and pathological processes. Determination of the crystal structure of an archaeal NCX (NCX_Mj) paved the way for a thorough and systematic investigation of ion transport mechanisms in NCX proteins. Here, we review the data gathered from the X-ray crystallography, molecular dynamics simulations, hydrogen–deuterium exchange mass-spectrometry (HDX-MS), and ion-flux analyses of mutants. Strikingly, the apo NCX_Mj protein exhibits characteristic patterns in the local backbone dynamics at particular helix segments, thereby possessing characteristic HDX profiles, suggesting structure-dynamic preorganization (geometric arrangements of catalytic residues before the transition state) of conserved α1 and α2 repeats at ion-coordinating residues involved in transport activities. Moreover, dynamic preorganization of local structural entities in the apo protein predefines the status of ion-occlusion and transition states, even though Na+ or Ca2+ binding modifies the preceding backbone dynamics nearby functionally important residues. Future challenges include resolving the structural-dynamic determinants governing the ion selectivity, functional asymmetry and ion-induced alternating access. Taking into account the structural similarities of NCX_Mj with the other proteins belonging to the Ca2+/cation exchanger superfamily, the recent findings can significantly improve our understanding of ion transport mechanisms in NCX and similar proteins. PMID:27879668
Structure-Functional Basis of Ion Transport in Sodium-Calcium Exchanger (NCX) Proteins.
Giladi, Moshe; Shor, Reut; Lisnyansky, Michal; Khananshvili, Daniel
2016-11-22
The membrane-bound sodium-calcium exchanger (NCX) proteins shape Ca 2+ homeostasis in many cell types, thus participating in a wide range of physiological and pathological processes. Determination of the crystal structure of an archaeal NCX (NCX_Mj) paved the way for a thorough and systematic investigation of ion transport mechanisms in NCX proteins. Here, we review the data gathered from the X-ray crystallography, molecular dynamics simulations, hydrogen-deuterium exchange mass-spectrometry (HDX-MS), and ion-flux analyses of mutants. Strikingly, the apo NCX_Mj protein exhibits characteristic patterns in the local backbone dynamics at particular helix segments, thereby possessing characteristic HDX profiles, suggesting structure-dynamic preorganization (geometric arrangements of catalytic residues before the transition state) of conserved α₁ and α₂ repeats at ion-coordinating residues involved in transport activities. Moreover, dynamic preorganization of local structural entities in the apo protein predefines the status of ion-occlusion and transition states, even though Na⁺ or Ca 2+ binding modifies the preceding backbone dynamics nearby functionally important residues. Future challenges include resolving the structural-dynamic determinants governing the ion selectivity, functional asymmetry and ion-induced alternating access. Taking into account the structural similarities of NCX_Mj with the other proteins belonging to the Ca 2+ /cation exchanger superfamily, the recent findings can significantly improve our understanding of ion transport mechanisms in NCX and similar proteins.
Structural basis of viral invasion: lessons from paramyxovirus F
Lamb, Robert A.; Jardetzky, Theodore S.
2007-01-01
Summary The structures of glycoproteins that mediate enveloped virus entry into cells have revealed dramatic structural changes that accompany membrane fusion and provided mechanistic insights into this process. The group of class I viral fusion proteins includes the influenza hemagglutinin, paramyxovirus F, HIV env and other mechanistically related fusogens, but these proteins are unrelated in sequence and exhibit clearly distinct structural features. Recently determined crystal structures of the paramyxovirus F protein in two conformations, representing prefusion and postfusion states, reveal a novel protein architecture that undergoes large-scale, irreversible refolding during membrane fusion, extending our understanding of this diverse group of membrane fusion machines. PMID:17870467
Microgravity protein crystallization
McPherson, Alexander; DeLucas, Lawrence James
2015-01-01
Over the past 20 years a variety of technological advances in X-ray crystallography have shortened the time required to determine the structures of large macromolecules (i.e., proteins and nucleic acids) from several years to several weeks or days. However, one of the remaining challenges is the ability to produce diffraction-quality crystals suitable for a detailed structural analysis. Although the development of automated crystallization systems combined with protein engineering (site-directed mutagenesis to enhance protein solubility and crystallization) have improved crystallization success rates, there remain hundreds of proteins that either cannot be crystallized or yield crystals of insufficient quality to support X-ray structure determination. In an attempt to address this bottleneck, an international group of scientists has explored use of a microgravity environment to crystallize macromolecules. This paper summarizes the history of this international initiative along with a description of some of the flight hardware systems and crystallization results. PMID:28725714
Rigidity and pH dependent Morphology of Beta-Lactoglobulin Spherulites
NASA Astrophysics Data System (ADS)
Gayetsky, Lisa; Armstead, Douglas
2008-03-01
Beta-Lactoglobulin is a milk protein that will denature in acidic solution (less than 2.0 pH) and if heated for extended periods (greater than 18 hours) it will form radial structures called Spherulites. Spherulites, along with the amyloid fibrils that compose them, are of practical importance because they form in the human body and cause the amyloidosis diseases. Different amyloidosis are caused by different types of denatured proteins occurring in different parts of the body. Since it is believed that Spherulite formation is a generic protein characteristic, Beta-Lactoglobulin is a legitimate and easy to use protein to study these structures. In this study we are quantifying the shape of Beta-Lactoglobulin Spherulites to determine if the pH of the protein solution has an impact on the morphology due to side chain interactions or other causes. We are also testing the rigidity of these structures to determine the relevance of small shape changes.
Intermediates and the folding of proteins L and G
DOE Office of Scientific and Technical Information (OSTI.GOV)
Brown, Scott; Head-Gordon, Teresa
We use a minimalist protein model, in combination with a sequence design strategy, to determine differences in primary structure for proteins L and G that are responsible for the two proteins folding through distinctly different folding mechanisms. We find that the folding of proteins L and G are consistent with a nucleation-condensation mechanism, each of which is described as helix-assisted {beta}-1 and {beta}-2 hairpin formation, respectively. We determine that the model for protein G exhibits an early intermediate that precedes the rate-limiting barrier of folding and which draws together misaligned secondary structure elements that are stabilized by hydrophobic core contactsmore » involving the third {beta}-strand, and presages the later transition state in which the correct strand alignment of these same secondary structure elements is restored. Finally the validity of the targeted intermediate ensemble for protein G was analyzed by fitting the kinetic data to a two-step first order reversible reaction, proving that protein G folding involves an on-pathway early intermediate, and should be populated and therefore observable by experiment.« less
Intermediates and the folding of proteins L and G
Brown, Scott; Head-Gordon, Teresa
2004-01-01
We use a minimalist protein model, in combination with a sequence design strategy, to determine differences in primary structure for proteins L and G, which are responsible for the two proteins folding through distinctly different folding mechanisms. We find that the folding of proteins L and G are consistent with a nucleation-condensation mechanism, each of which is described as helix-assisted β-1 and β-2 hairpin formation, respectively. We determine that the model for protein G exhibits an early intermediate that precedes the rate-limiting barrier of folding, and which draws together misaligned secondary structure elements that are stabilized by hydrophobic core contacts involving the third β-strand, and presages the later transition state in which the correct strand alignment of these same secondary structure elements is restored. Finally, the validity of the targeted intermediate ensemble for protein G was analyzed by fitting the kinetic data to a two-step first-order reversible reaction, proving that protein G folding involves an on-pathway early intermediate, and should be populated and therefore observable by experiment. PMID:15044729
Raval, Alpan; Piana, Stefano; Eastwood, Michael P; Shaw, David E
2016-01-01
Molecular dynamics (MD) simulation is a well-established tool for the computational study of protein structure and dynamics, but its application to the important problem of protein structure prediction remains challenging, in part because extremely long timescales can be required to reach the native structure. Here, we examine the extent to which the use of low-resolution information in the form of residue-residue contacts, which can often be inferred from bioinformatics or experimental studies, can accelerate the determination of protein structure in simulation. We incorporated sets of 62, 31, or 15 contact-based restraints in MD simulations of ubiquitin, a benchmark system known to fold to the native state on the millisecond timescale in unrestrained simulations. One-third of the restrained simulations folded to the native state within a few tens of microseconds-a speedup of over an order of magnitude compared with unrestrained simulations and a demonstration of the potential for limited amounts of structural information to accelerate structure determination. Almost all of the remaining ubiquitin simulations reached near-native conformations within a few tens of microseconds, but remained trapped there, apparently due to the restraints. We discuss potential methodological improvements that would facilitate escape from these near-native traps and allow more simulations to quickly reach the native state. Finally, using a target from the Critical Assessment of protein Structure Prediction (CASP) experiment, we show that distance restraints can improve simulation accuracy: In our simulations, restraints stabilized the native state of the protein, enabling a reasonable structural model to be inferred. © 2015 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.
Disulfide Trapping for Modeling and Structure Determination of Receptor: Chemokine Complexes.
Kufareva, Irina; Gustavsson, Martin; Holden, Lauren G; Qin, Ling; Zheng, Yi; Handel, Tracy M
2016-01-01
Despite the recent breakthrough advances in GPCR crystallography, structure determination of protein-protein complexes involving chemokine receptors and their endogenous chemokine ligands remains challenging. Here, we describe disulfide trapping, a methodology for generating irreversible covalent binary protein complexes from unbound protein partners by introducing two cysteine residues, one per interaction partner, at selected positions within their interaction interface. Disulfide trapping can serve at least two distinct purposes: (i) stabilization of the complex to assist structural studies and/or (ii) determination of pairwise residue proximities to guide molecular modeling. Methods for characterization of disulfide-trapped complexes are described and evaluated in terms of throughput, sensitivity, and specificity toward the most energetically favorable crosslinks. Due to abundance of native disulfide bonds at receptor:chemokine interfaces, disulfide trapping of their complexes can be associated with intramolecular disulfide shuffling and result in misfolding of the component proteins; because of this, evidence from several experiments is typically needed to firmly establish a positive disulfide crosslink. An optimal pipeline that maximizes throughput and minimizes time and costs by early triage of unsuccessful candidate constructs is proposed. © 2016 Elsevier Inc. All rights reserved.
Protein Denaturation on p-T Axes--Thermodynamics and Analysis.
Smeller, László
2015-01-01
Proteins are essential players in the vast majority of molecular level life processes. Since their structure is in most cases substantial for their correct function, study of their structural changes attracted great interest in the past decades. The three dimensional structure of proteins is influenced by several factors including temperature, pH, presence of chaotropic and cosmotropic agents, or presence of denaturants. Although pressure is an equally important thermodynamic parameter as temperature, pressure studies are considerably less frequent in the literature, probably due to the technical difficulties associated to the pressure studies. Although the first steps in the high-pressure protein study have been done 100 years ago with Bridgman's ground breaking work, the field was silent until the modern spectroscopic techniques allowed the characterization of the protein structural changes, while the protein was under pressure. Recently a number of proteins were studied under pressure, and complete pressure-temperature phase diagrams were determined for several of them. This review summarizes the thermodynamic background of the typical elliptic p-T phase diagram, its limitations and the possible reasons for deviations of the experimental diagrams from the theoretical one. Finally we show some examples of experimentally determined pressure-temperature phase diagrams.
Biswas, Ria; Bagchi, Angshuman
2017-09-11
The tumour necrosis factor (TNF) receptor-associated factor (TRAF) family of proteins having E3 ligase activity are the key molecules involved in cellular immune response pathways. TRAF6 is a unique member of the TRAF superfamily differing from other members of the family, owing to its specific interactions with molecules outside the TNF receptor superfamily. The C-terminal domain of TRAF proteins contains the catalytic residues and are known to be involved in self-oligomerization forming a mushroom-shaped trimeric structure, which is the functional form of the protein. However, the monomeric crystal structure of TRAF6 C-terminal domain has been already determined, but the trimeric structure of the same is still not available. We here applied computational structural modelling and molecular dynamics simulations studies to get insights into the molecular interactions involved in determining the trimeric structure of the TRAF6 C-terminal domain. The non-availability of the trimeric structure of the TRAF6 C-terminal domain prevented the elucidation of the molecular mechanism of many different biological processes. Our results suggest that the trimer complex is transient in nature. The amino acid residues Lys340 and Glu345 in the coiled coil domain in the C-terminus of TRAF6 play a critical role in trimer structure formation. This structural modelling study may therefore be utilized to obtain the experimentally validated trimeric structure of this important protein.
Abendroth, Jan; McCormick, Michael S.; Edwards, Thomas E.; Staker, Bart; Loewen, Roderick; Gifford, Martin; Rifkin, Jeff; Mayer, Chad; Guo, Wenjin; Zhang, Yang; Myler, Peter; Kelley, Angela; Analau, Erwin; Hewitt, Stephen Nakazawa; Napuli, Alberto J.; Kuhn, Peter; Ruth, Ronald D.; Stewart, Lance J.
2010-01-01
Structural genomics discovery projects require ready access to both X-ray and NMR instrumentation which support the collection of experimental data needed to solve large numbers of novel protein structures. The most productive X-ray crystal structure determination laboratories make extensive frequent use of tunable synchrotron X-ray light to solve novel structures by anomalous diffraction methods. This requires that frozen cryo-protected crystals be shipped to large government-run synchrotron facilities for data collection. In an effort to eliminate the need to ship crystals for data collection, we have developed the first laboratory-scale synchrotron light source capable of performing many of the state-of-the-art synchrotron applications in X-ray science. This Compact Light Source is a first-in-class device that uses inverse Compton scattering to generate X-rays of sufficient flux, tunable wavelength and beam size to allow high-resolution X-ray diffraction data collection from protein crystals. We report on benchmarking tests of X-ray diffraction data collection with hen egg white lysozyme, and the successful high-resolution X-ray structure determination of the Glycine cleavage system protein H from Mycobacterium tuberculosis using diffraction data collected with the Compact Light Source X-ray beam. PMID:20364333
Dokmanić, Ivan; Sikić, Mile; Tomić, Sanja
2008-03-01
Metal ions are constituents of many metalloproteins, in which they have either catalytic (metalloenzymes) or structural functions. In this work, the characteristics of various metals were studied (Cu, Zn, Mg, Mn, Fe, Co, Ni, Cd and Ca in proteins with known crystal structure) as well as the specificity of their environments. The analysis was performed on two data sets: the set of protein structures in the Protein Data Bank (PDB) determined with resolution <1.5 A and the set of nonredundant protein structures from the PDB. The former was used to determine the distances between each metal ion and its electron donors and the latter was used to assess the preferred coordination numbers and common combinations of amino-acid residues in the neighbourhood of each metal. Although the metal ions considered predominantly had a valence of two, their preferred coordination number and the type of amino-acid residues that participate in the coordination differed significantly from one metal ion to the next. This study concentrates on finding the specificities of a metal-ion environment, namely the distribution of coordination numbers and the amino-acid residue types that frequently take part in coordination. Furthermore, the correlation between the coordination number and the occurrence of certain amino-acid residues (quartets and triplets) in a metal-ion coordination sphere was analysed. The results obtained are of particular value for the identification and modelling of metal-binding sites in protein structures derived by homology modelling. Knowledge of the geometry and characteristics of the metal-binding sites in metalloproteins of known function can help to more closely determine the biological activity of proteins of unknown function and to aid in design of proteins with specific affinity for certain metals.
Mutations that Cause Human Disease: A Computational/Experimental Approach
DOE Office of Scientific and Technical Information (OSTI.GOV)
Beernink, P; Barsky, D; Pesavento, B
International genome sequencing projects have produced billions of nucleotides (letters) of DNA sequence data, including the complete genome sequences of 74 organisms. These genome sequences have created many new scientific opportunities, including the ability to identify sequence variations among individuals within a species. These genetic differences, which are known as single nucleotide polymorphisms (SNPs), are particularly important in understanding the genetic basis for disease susceptibility. Since the report of the complete human genome sequence, over two million human SNPs have been identified, including a large-scale comparison of an entire chromosome from twenty individuals. Of the protein coding SNPs (cSNPs), approximatelymore » half leads to a single amino acid change in the encoded protein (non-synonymous coding SNPs). Most of these changes are functionally silent, while the remainder negatively impact the protein and sometimes cause human disease. To date, over 550 SNPs have been found to cause single locus (monogenic) diseases and many others have been associated with polygenic diseases. SNPs have been linked to specific human diseases, including late-onset Parkinson disease, autism, rheumatoid arthritis and cancer. The ability to predict accurately the effects of these SNPs on protein function would represent a major advance toward understanding these diseases. To date several attempts have been made toward predicting the effects of such mutations. The most successful of these is a computational approach called ''Sorting Intolerant From Tolerant'' (SIFT). This method uses sequence conservation among many similar proteins to predict which residues in a protein are functionally important. However, this method suffers from several limitations. First, a query sequence must have a sufficient number of relatives to infer sequence conservation. Second, this method does not make use of or provide any information on protein structure, which can be used to understand how an amino acid change affects the protein. The experimental methods that provide the most detailed structural information on proteins are X-ray crystallography and NMR spectroscopy. However, these methods are labor intensive and currently cannot be carried out on a genomic scale. Nonetheless, Structural Genomics projects are being pursued by more than a dozen groups and consortia worldwide and as a result the number of experimentally determined structures is rising exponentially. Based on the expectation that protein structures will continue to be determined at an ever-increasing rate, reliable structure prediction schemes will become increasingly valuable, leading to information on protein function and disease for many different proteins. Given known genetic variability and experimentally determined protein structures, can we accurately predict the effects of single amino acid substitutions? An objective assessment of this question would involve comparing predicted and experimentally determined structures, which thus far has not been rigorously performed. The completed research leveraged existing expertise at LLNL in computational and structural biology, as well as significant computing resources, to address this question.« less
Nagata, Koji
2010-01-01
Peptides and proteins with similar amino acid sequences can have different biological functions. Knowledge of their three-dimensional molecular structures is critically important in identifying their functional determinants. In this review, I describe the results of our and other groups' structure-based functional characterization of insect insulin-like peptides, a crustacean hyperglycemic hormone-family peptide, a mammalian epidermal growth factor-family protein, and an intracellular signaling domain that recognizes proline-rich sequence.
Structure-related statistical singularities along protein sequences: a correlation study.
Colafranceschi, Mauro; Colosimo, Alfredo; Zbilut, Joseph P; Uversky, Vladimir N; Giuliani, Alessandro
2005-01-01
A data set composed of 1141 proteins representative of all eukaryotic protein sequences in the Swiss-Prot Protein Knowledge base was coded by seven physicochemical properties of amino acid residues. The resulting numerical profiles were submitted to correlation analysis after the application of a linear (simple mean) and a nonlinear (Recurrence Quantification Analysis, RQA) filter. The main RQA variables, Recurrence and Determinism, were subsequently analyzed by Principal Component Analysis. The RQA descriptors showed that (i) within protein sequences is embedded specific information neither present in the codes nor in the amino acid composition and (ii) the most sensitive code for detecting ordered recurrent (deterministic) patterns of residues in protein sequences is the Miyazawa-Jernigan hydrophobicity scale. The most deterministic proteins in terms of autocorrelation properties of primary structures were found (i) to be involved in protein-protein and protein-DNA interactions and (ii) to display a significantly higher proportion of structural disorder with respect to the average data set. A study of the scaling behavior of the average determinism with the setting parameters of RQA (embedding dimension and radius) allows for the identification of patterns of minimal length (six residues) as possible markers of zones specifically prone to inter- and intramolecular interactions.
Prediction of Ras-effector interactions using position energy matrices.
Kiel, Christina; Serrano, Luis
2007-09-01
One of the more challenging problems in biology is to determine the cellular protein interaction network. Progress has been made to predict protein-protein interactions based on structural information, assuming that structural similar proteins interact in a similar way. In a previous publication, we have determined a genome-wide Ras-effector interaction network based on homology models, with a high accuracy of predicting binding and non-binding domains. However, for a prediction on a genome-wide scale, homology modelling is a time-consuming process. Therefore, we here successfully developed a faster method using position energy matrices, where based on different Ras-effector X-ray template structures, all amino acids in the effector binding domain are sequentially mutated to all other amino acid residues and the effect on binding energy is calculated. Those pre-calculated matrices can then be used to score for binding any Ras or effector sequences. Based on position energy matrices, the sequences of putative Ras-binding domains can be scanned quickly to calculate an energy sum value. By calibrating energy sum values using quantitative experimental binding data, thresholds can be defined and thus non-binding domains can be excluded quickly. Sequences which have energy sum values above this threshold are considered to be potential binding domains, and could be further analysed using homology modelling. This prediction method could be applied to other protein families sharing conserved interaction types, in order to determine in a fast way large scale cellular protein interaction networks. Thus, it could have an important impact on future in silico structural genomics approaches, in particular with regard to increasing structural proteomics efforts, aiming to determine all possible domain folds and interaction types. All matrices are deposited in the ADAN database (http://adan-embl.ibmc.umh.es/). Supplementary data are available at Bioinformatics online.
Mishra, Avinash; Rana, Prashant Singh; Mittal, Aditya; Jayaram, B
2014-10-01
Root-mean-square-deviation (RMSD), of computationally-derived protein structures from experimentally determined structures, is a critical index to assessing protein-structure-prediction-algorithms (PSPAs). The development of PSPAs to obtain 0Å RMSD from native structures is considered central to computational biology. However, till date it has been quite challenging to measure how far a predicted protein structure is from its native - in the absence of a known experimental/native structure. In this work, we report the development of a metric "D2N" (distance to the native) - that predicts the "RMSD" of any structure without actually knowing the native structure. By combining physico-chemical properties and known universalities in spatial organization of soluble proteins to develop D2N, we demonstrate the ability to predict the distance of a proposed structure to within ±1.5Ǻ error with a remarkable average accuracy of 93.6% for structures below 5Ǻ from the native. We believe that this work opens up a completely new avenue towards assigning reliable structures to whole proteomes even in the absence of experimentally determined native structures. The D2N tool is freely available at http://www.scfbio-iitd.res.in/software/d2n.jsp. Copyright © 2014 Elsevier B.V. All rights reserved.
Brown, Simon H J; Mitchell, Todd W; Oakley, Aaron J; Pham, Huong T; Blanksby, Stephen J
2012-09-01
Since the 1950s, X-ray crystallography has been the mainstay of structural biology, providing detailed atomic-level structures that continue to revolutionize our understanding of protein function. From recent advances in this discipline, a picture has emerged of intimate and specific interactions between lipids and proteins that has driven renewed interest in the structure of lipids themselves and raised intriguing questions as to the specificity and stoichiometry in lipid-protein complexes. Herein we demonstrate some of the limitations of crystallography in resolving critical structural features of ligated lipids and thus determining how these motifs impact protein binding. As a consequence, mass spectrometry must play an important and complementary role in unraveling the complexities of lipid-protein interactions. We evaluate recent advances and highlight ongoing challenges towards the twin goals of (1) complete structure elucidation of low, abundant, and structurally diverse lipids by mass spectrometry alone, and (2) assignment of stoichiometry and specificity of lipid interactions within protein complexes.
NASA Astrophysics Data System (ADS)
Brown, Simon H. J.; Mitchell, Todd W.; Oakley, Aaron J.; Pham, Huong T.; Blanksby, Stephen J.
2012-09-01
Since the 1950s, X-ray crystallography has been the mainstay of structural biology, providing detailed atomic-level structures that continue to revolutionize our understanding of protein function. From recent advances in this discipline, a picture has emerged of intimate and specific interactions between lipids and proteins that has driven renewed interest in the structure of lipids themselves and raised intriguing questions as to the specificity and stoichiometry in lipid-protein complexes. Herein we demonstrate some of the limitations of crystallography in resolving critical structural features of ligated lipids and thus determining how these motifs impact protein binding. As a consequence, mass spectrometry must play an important and complementary role in unraveling the complexities of lipid-protein interactions. We evaluate recent advances and highlight ongoing challenges towards the twin goals of (1) complete structure elucidation of low, abundant, and structurally diverse lipids by mass spectrometry alone, and (2) assignment of stoichiometry and specificity of lipid interactions within protein complexes.
Predicting the helix packing of globular proteins by self-correcting distance geometry.
Mumenthaler, C; Braun, W
1995-05-01
A new self-correcting distance geometry method for predicting the three-dimensional structure of small globular proteins was assessed with a test set of 8 helical proteins. With the knowledge of the amino acid sequence and the helical segments, our completely automated method calculated the correct backbone topology of six proteins. The accuracy of the predicted structures ranged from 2.3 A to 3.1 A for the helical segments compared to the experimentally determined structures. For two proteins, the predicted constraints were not restrictive enough to yield a conclusive prediction. The method can be applied to all small globular proteins, provided the secondary structure is known from NMR analysis or can be predicted with high reliability.
Genome Pool Strategy for Structural Coverage of Protein Families
Jaroszewski, Lukasz; Slabinski, Lukasz; Wooley, John; Deacon, Ashley M.; Lesley, Scott A.; Wilson, Ian. A.; Godzik, Adam
2010-01-01
As noticed by generations of structural biologists, closely homologous proteins may have substantially different crystallization properties and propensities. These observations can be used to systematically introduce additional dimensionality into crystallization trials by targeting homologous proteins from multiple genomes in a “genome pool” strategy. Through extensive use of our recently introduced “crystallization feasibility score” (Slabinski et al., 2007a), we can explain that the genome pool strategy works well because the crystallization feasibility scores are surprisingly broad within families of homologous proteins, with most families containing a range of optimal to very difficult targets. We also show that some families can be regarded as relatively “easy”, where a significant number of proteins are predicted to have optimal crystallization features, and others are “very difficult”, where almost none are predicted to result in a crystal structure. Thus, the outcome of such variable distributions of such crystallizability' preferences leads to uneven structural coverage of known families, with “easier” or “optimal” families having several times more solved structures than “very difficult” ones. Nevertheless, this latter category can be successfully targeted by increasing the number of genomes that are used to select targets from a given family. On average, adding 10 new genomes to the “genome pool” provides more promising targets for 7 “very difficult” families. In contrast, our crystallization feasibility score does not indicate that any specific microbial genomes can be readily classified as “easier” or “very difficult” with respect to providing suitable candidates for crystallization and structure determination. Finally, our analyses show that specific physicochemical properties of the protein sequence favor successful outcomes for structure determination and, hence, the group of proteins with known 3D structures is systematically different from the general pool of known proteins. We, therefore, assess the structural consequences of these differences in protein sequence and protein biophysical properties. PMID:19000818
Observing the overall rocking motion of a protein in a crystal
NASA Astrophysics Data System (ADS)
Ma, Peixiang; Xue, Yi; Coquelle, Nicolas; Haller, Jens D.; Yuwen, Tairan; Ayala, Isabel; Mikhailovskii, Oleg; Willbold, Dieter; Colletier, Jacques-Philippe; Skrynnikov, Nikolai R.; Schanda, Paul
2015-10-01
The large majority of three-dimensional structures of biological macromolecules have been determined by X-ray diffraction of crystalline samples. High-resolution structure determination crucially depends on the homogeneity of the protein crystal. Overall `rocking' motion of molecules in the crystal is expected to influence diffraction quality, and such motion may therefore affect the process of solving crystal structures. Yet, so far overall molecular motion has not directly been observed in protein crystals, and the timescale of such dynamics remains unclear. Here we use solid-state NMR, X-ray diffraction methods and μs-long molecular dynamics simulations to directly characterize the rigid-body motion of a protein in different crystal forms. For ubiquitin crystals investigated in this study we determine the range of possible correlation times of rocking motion, 0.1-100 μs. The amplitude of rocking varies from one crystal form to another and is correlated with the resolution obtainable in X-ray diffraction experiments.
Zhang, Gaihua; Su, Zhen
2012-01-01
Work on protein structure prediction is very useful in biological research. To evaluate their accuracy, experimental protein structures or their derived data are used as the 'gold standard'. However, as proteins are dynamic molecular machines with structural flexibility such a standard may be unreliable. To investigate the influence of the structure flexibility, we analysed 3,652 protein structures of 137 unique sequences from 24 protein families. The results showed that (1) the three-dimensional (3D) protein structures were not rigid: the root-mean-square deviation (RMSD) of the backbone Cα of structures with identical sequences was relatively large, with the average of the maximum RMSD from each of the 137 sequences being 1.06 Å; (2) the derived data of the 3D structure was not constant, e.g. the highest ratio of the secondary structure wobble site was 60.69%, with the sequence alignments from structural comparisons of two proteins in the same family sometimes being completely different. Proteins may have several stable conformations and the data derived from resolved structures as a 'gold standard' should be optimized before being utilized as criteria to evaluate the prediction methods, e.g. sequence alignment from structural comparison. Helix/β-sheet transition exists in normal free proteins. The coil ratio of the 3D structure could affect its resolution as determined by X-ray crystallography.
Wang, Liwen; Qin, Yali; Ilchenko, Serguei; Bohon, Jen; Shi, Wuxian; Cho, Michael W.; Takamoto, Keiji; Chance, Mark R.
2010-01-01
Structural characterization of the HIV envelope protein gp120 is very important to provide an understanding of the protein's immunogenicity and it's binding to cell receptors. So far, crystallographic structure determination of gp120 with an intact V3 loop (in the absence of CD4 co-receptor or antibody) has not been achieved. The third variable region (V3) of the gp120 is immunodominant and contains glycosylation signatures that are essential for co-receptor binding and viral entry to T-cells. In this study, we characterized the structure of the outer domain of gp120 with an intact V3 loop (gp120-OD8) purified from Drosophila S2 cells utilizing mass spectrometry-based approaches. We mapped the glycosylation sites and calculated glycosylation occupancy of gp120-OD8; eleven sites from fifteen glycosylation motifs were determined as having high mannose or hybrid glycosylation structures. The specific glycan moieties of nine glycosylation sites from eight unique glycopeptides were determined by a combination of ECD and CID MS approaches. Hydroxyl radical-mediated protein footprinting coupled with mass spectrometry analysis was employed to provide detailed information on protein structure of gp120-OD8 by directly identifying accessible and hydroxyl radical-reactive side chain residues. Comparison of gp120-OD8 experimental footprinting data with a homology model derived from the ligated CD4/ gp120-OD8 crystal structure revealed a flexible V3 loop structure where the V3 tip may provide contacts with the rest of the protein while residues in the V3 base remain solvent accessible. In addition, the data illustrate interactions between specific sugar moieties and amino acid side chains potentially important to the gp120-OD8 structure. PMID:20825246
Protein Crystal Quality Studies
NASA Technical Reports Server (NTRS)
1998-01-01
Eddie Snell, Post-Doctoral Fellow the National Research Council (NRC) uses a reciprocal space mapping diffractometer for macromolecular crystal quality studies. The diffractometer is used in mapping the structure of macromolecules such as proteins to determine their structure and thus understand how they function with other proteins in the body. This is one of several analytical tools used on proteins crystallized on Earth and in space experiments. Photo credit: NASA/Marshall Space Flight Center (MSFC)
2009-01-01
An important part of characterizing any protein molecule is to determine its size and shape. Sedimentation and gel filtration are hydrodynamic techniques that can be used for this medium resolution structural analysis. This review collects a number of simple calculations that are useful for thinking about protein structure at the nanometer level. Readers are reminded that the Perrin equation is generally not a valid approach to determine the shape of proteins. Instead, a simple guideline is presented, based on the measured sedimentation coefficient and a calculated maximum S, to estimate if a protein is globular or elongated. It is recalled that a gel filtration column fractionates proteins on the basis of their Stokes radius, not molecular weight. The molecular weight can be determined by combining gradient sedimentation and gel filtration, techniques available in most biochemistry laboratories, as originally proposed by Siegel and Monte. Finally, rotary shadowing and negative stain electron microscopy are powerful techniques for resolving the size and shape of single protein molecules and complexes at the nanometer level. A combination of hydrodynamics and electron microscopy is especially powerful. PMID:19495910
Structure of synaptophysin: a hexameric MARVEL-domain channel protein.
Arthur, Christopher P; Stowell, Michael H B
2007-06-01
Synaptophysin I (SypI) is an archetypal member of the MARVEL-domain family of integral membrane proteins and one of the first synaptic vesicle proteins to be identified and cloned. Most all MARVEL-domain proteins are involved in membrane apposition and vesicle-trafficking events, but their precise role in these processes is unclear. We have purified mammalian SypI and determined its three-dimensional (3D) structure by using electron microscopy and single-particle 3D reconstruction. The hexameric structure resembles an open basket with a large pore and tenuous interactions within the cytosolic domain. The structure suggests a model for Synaptophysin's role in fusion and recycling that is regulated by known interactions with the SNARE machinery. This 3D structure of a MARVEL-domain protein provides a structural foundation for understanding the role of these important proteins in a variety of biological processes.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zemla, A; Lang, D; Kostova, T
2010-11-29
Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory - still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could overcome these difficulties and facilitatemore » the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV, a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus and demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique or that shared structural similarity with structures that are distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein. StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected residues at a given sequence position.« less
Bauer, Katharina Christin; Hämmerling, Frank; Kittelmann, Jörg; Dürr, Cathrin; Görlich, Fabian; Hubbuch, Jürgen
2017-04-01
Information about protein-protein interactions provides valuable knowledge about the phase behavior of protein solutions during the biopharmaceutical production process. Up to date it is possible to capture their overall impact by an experimentally determined potential of mean force. For the description of this potential, the second virial coefficient B22, the diffusion interaction parameter kD, the storage modulus G', or the diffusion coefficient D is applied. In silico methods do not only have the potential to predict these parameters, but also to provide deeper understanding of the molecular origin of the protein-protein interactions by correlating the data to the protein's three-dimensional structure. This methodology furthermore allows a lower sample consumption and less experimental effort. Of all in silico methods, QSAR modeling, which correlates the properties of the molecule's structure with the experimental behavior, seems to be particularly suitable for this purpose. To verify this, the study reported here dealt with the determination of a QSAR model for the diffusion coefficient of proteins. This model consisted of diffusion coefficients for six different model proteins at various pH values and NaCl concentrations. The generated QSAR model showed a good correlation between experimental and predicted data with a coefficient of determination R2 = 0.9 and a good predictability for an external test set with R2 = 0.91. The information about the properties affecting protein-protein interactions present in solution was in agreement with experiment and theory. Furthermore, the model was able to give a more detailed picture of the protein properties influencing the diffusion coefficient and the acting protein-protein interactions. Biotechnol. Bioeng. 2017;114: 821-831. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
3D structural fluctuation of IgG1 antibody revealed by individual particle electron tomography
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Xing; Zhang, Lei; Tong, Huimin
2015-05-05
Commonly used methods for determining protein structure, including X-ray crystallography and single-particle reconstruction, often provide a single and unique three-dimensional (3D) structure. However, in these methods, the protein dynamics and flexibility/fluctuation remain mostly unknown. Here, we utilized advances in electron tomography (ET) to study the antibody flexibility and fluctuation through structural determination of individual antibody particles rather than averaging multiple antibody particles together. Through individual-particle electron tomography (IPET) 3D reconstruction from negatively-stained ET images, we obtained 120 ab-initio 3D density maps at an intermediate resolution (~1–3 nm) from 120 individual IgG1 antibody particles. Using these maps as a constraint, wemore » derived 120 conformations of the antibody via structural flexible docking of the crystal structure to these maps by targeted molecular dynamics simulations. Statistical analysis of the various conformations disclosed the antibody 3D conformational flexibility through the distribution of its domain distances and orientations. This blueprint approach, if extended to other flexible proteins, may serve as a useful methodology towards understanding protein dynamics and functions.« less
Simplified Protein Models: Predicting Folding Pathways and Structure Using Amino Acid Sequences
NASA Astrophysics Data System (ADS)
Adhikari, Aashish N.; Freed, Karl F.; Sosnick, Tobin R.
2013-07-01
We demonstrate the ability of simultaneously determining a protein’s folding pathway and structure using a properly formulated model without prior knowledge of the native structure. Our model employs a natural coordinate system for describing proteins and a search strategy inspired by the observation that real proteins fold in a sequential fashion by incrementally stabilizing nativelike substructures or “foldons.” Comparable folding pathways and structures are obtained for the twelve proteins recently studied using atomistic molecular dynamics simulations [K. Lindorff-Larsen, S. Piana, R. O. Dror, D. E. Shaw, Science 334, 517 (2011)], with our calculations running several orders of magnitude faster. We find that nativelike propensities in the unfolded state do not necessarily determine the order of structure formation, a departure from a major conclusion of the molecular dynamics study. Instead, our results support a more expansive view wherein intrinsic local structural propensities may be enhanced or overridden in the folding process by environmental context. The success of our search strategy validates it as an expedient mechanism for folding both in silico and in vivo.
Knutson, Stacy T; Westwood, Brian M; Leuthaeuser, Janelle B; Turner, Brandon E; Nguyendac, Don; Shea, Gabrielle; Kumar, Kiran; Hayden, Julia D; Harper, Angela F; Brown, Shoshana D; Morris, John H; Ferrin, Thomas E; Babbitt, Patricia C; Fetrow, Jacquelyn S
2017-04-01
Protein function identification remains a significant problem. Solving this problem at the molecular functional level would allow mechanistic determinant identification-amino acids that distinguish details between functional families within a superfamily. Active site profiling was developed to identify mechanistic determinants. DASP and DASP2 were developed as tools to search sequence databases using active site profiling. Here, TuLIP (Two-Level Iterative clustering Process) is introduced as an iterative, divisive clustering process that utilizes active site profiling to separate structurally characterized superfamily members into functionally relevant clusters. Underlying TuLIP is the observation that functionally relevant families (curated by Structure-Function Linkage Database, SFLD) self-identify in DASP2 searches; clusters containing multiple functional families do not. Each TuLIP iteration produces candidate clusters, each evaluated to determine if it self-identifies using DASP2. If so, it is deemed a functionally relevant group. Divisive clustering continues until each structure is either a functionally relevant group member or a singlet. TuLIP is validated on enolase and glutathione transferase structures, superfamilies well-curated by SFLD. Correlation is strong; small numbers of structures prevent statistically significant analysis. TuLIP-identified enolase clusters are used in DASP2 GenBank searches to identify sequences sharing functional site features. Analysis shows a true positive rate of 96%, false negative rate of 4%, and maximum false positive rate of 4%. F-measure and performance analysis on the enolase search results and comparison to GEMMA and SCI-PHY demonstrate that TuLIP avoids the over-division problem of these methods. Mechanistic determinants for enolase families are evaluated and shown to correlate well with literature results. © 2017 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.
Takeda, Mitsuhiro; Sugimori, Nozomi; Torizawa, Takuya; Terauchi, Tsutomu; Ono, Akira M; Yagi, Hirokazu; Yamaguchi, Yoshiki; Kato, Koichi; Ikeya, Teppei; Jee, Jungoo; Güntert, Peter; Aceti, David J; Markley, John L; Kainosho, Masatsune
2008-12-01
The product of gene At3g16450.1 from Arabidopsis thaliana is a 32 kDa, 299-residue protein classified as resembling a myrosinase-binding protein (MyroBP). MyroBPs are found in plants as part of a complex with the glucosinolate-degrading enzyme myrosinase, and are suspected to play a role in myrosinase-dependent defense against pathogens. Many MyroBPs and MyroBP-related proteins are composed of repeated homologous sequences with unknown structure. We report here the three-dimensional structure of the At3g16450.1 protein from Arabidopsis, which consists of two tandem repeats. Because the size of the protein is larger than that amenable to high-throughput analysis by uniform (13)C/(15)N labeling methods, we used stereo-array isotope labeling (SAIL) technology to prepare an optimally (2)H/(13)C/(15)N-labeled sample. NMR data sets collected using the SAIL protein enabled us to assign (1)H, (13)C and (15)N chemical shifts to 95.5% of all atoms, even at a low concentration (0.2 mm) of protein product. We collected additional NOESY data and determined the three-dimensional structure using the cyana software package. The structure, the first for a MyroBP family member, revealed that the At3g16450.1 protein consists of two independent but similar lectin-fold domains, each composed of three beta-sheets.
Gomaa, Walaa M S; Mosaad, Gamal M; Yu, Peiqiang
2018-04-21
The objectives of this study were to: (1) Use molecular spectroscopy as a novel technique to quantify protein molecular structures in relation to its chemical profiles and bioenergy values in oil-seeds and co-products from bio-oil processing. (2) Determine and compare: (a) protein molecular structure using Fourier transform infrared (FT/IR-ATR) molecular spectroscopy technique; (b) bioactive compounds, anti-nutritional factors, and chemical composition; and (c) bioenergy values in oil seeds (canola seeds), co-products (meal or pellets) from bio-oil processing plants in Canada in comparison with China. (3) Determine the relationship between protein molecular structural features and nutrient profiles in oil-seeds and co-products from bio-oil processing. Our results showed the possibility to characterize protein molecular structure using FT/IR molecular spectroscopy. Processing induced changes between oil seeds and co-products were found in the chemical, bioenergy profiles and protein molecular structure. However, no strong correlation was found between the chemical and nutrient profiles of oil seeds (canola seeds) and their protein molecular structure. On the other hand, co-products were strongly correlated with protein molecular structure in the chemical profile and bioenergy values. Generally, comparisons of oil seeds (canola seeds) and co-products (meal or pellets) in Canada, in China, and between Canada and China indicated the presence of variations among different crusher plants and bio-oil processing products.
Lipid nanotechnologies for structural studies of membrane-associated proteins.
Stoilova-McPhie, Svetla; Grushin, Kirill; Dalm, Daniela; Miller, Jaimy
2014-11-01
We present a methodology of lipid nanotubes (LNT) and nanodisks technologies optimized in our laboratory for structural studies of membrane-associated proteins at close to physiological conditions. The application of these lipid nanotechnologies for structure determination by cryo-electron microscopy (cryo-EM) is fundamental for understanding and modulating their function. The LNTs in our studies are single bilayer galactosylceramide based nanotubes of ∼20 nm inner diameter and a few microns in length, that self-assemble in aqueous solutions. The lipid nanodisks (NDs) are self-assembled discoid lipid bilayers of ∼10 nm diameter, which are stabilized in aqueous solutions by a belt of amphipathic helical scaffold proteins. By combining LNT and ND technologies, we can examine structurally how the membrane curvature and lipid composition modulates the function of the membrane-associated proteins. As proof of principle, we have engineered these lipid nanotechnologies to mimic the activated platelet's phosphtaidylserine rich membrane and have successfully assembled functional membrane-bound coagulation factor VIII in vitro for structure determination by cryo-EM. The macromolecular organization of the proteins bound to ND and LNT are further defined by fitting the known atomic structures within the calculated three-dimensional maps. The combination of LNT and ND technologies offers a means to control the design and assembly of a wide range of functional membrane-associated proteins and complexes for structural studies by cryo-EM. The presented results confirm the suitability of the developed methodology for studying the functional structure of membrane-associated proteins, such as the coagulation factors, at a close to physiological environment. © 2014 Wiley Periodicals, Inc.
Sequence co-evolution gives 3D contacts and structures of protein complexes
Hopf, Thomas A; Schärfe, Charlotta P I; Rodrigues, João P G L M; Green, Anna G; Kohlbacher, Oliver; Sander, Chris; Bonvin, Alexandre M J J; Marks, Debora S
2014-01-01
Protein–protein interactions are fundamental to many biological processes. Experimental screens have identified tens of thousands of interactions, and structural biology has provided detailed functional insight for select 3D protein complexes. An alternative rich source of information about protein interactions is the evolutionary sequence record. Building on earlier work, we show that analysis of correlated evolutionary sequence changes across proteins identifies residues that are close in space with sufficient accuracy to determine the three-dimensional structure of the protein complexes. We evaluate prediction performance in blinded tests on 76 complexes of known 3D structure, predict protein–protein contacts in 32 complexes of unknown structure, and demonstrate how evolutionary couplings can be used to distinguish between interacting and non-interacting protein pairs in a large complex. With the current growth of sequences, we expect that the method can be generalized to genome-wide elucidation of protein–protein interaction networks and used for interaction predictions at residue resolution. DOI: http://dx.doi.org/10.7554/eLife.03430.001 PMID:25255213
Salvage of failed protein targets by reductive alkylation.
Tan, Kemin; Kim, Youngchang; Hatzos-Skintges, Catherine; Chang, Changsoo; Cuff, Marianne; Chhor, Gekleng; Osipiuk, Jerzy; Michalska, Karolina; Nocek, Boguslaw; An, Hao; Babnigg, Gyorgy; Bigelow, Lance; Joachimiak, Grazyna; Li, Hui; Mack, Jamey; Makowska-Grzyska, Magdalena; Maltseva, Natalia; Mulligan, Rory; Tesar, Christine; Zhou, Min; Joachimiak, Andrzej
2014-01-01
The growth of diffraction-quality single crystals is of primary importance in protein X-ray crystallography. Chemical modification of proteins can alter their surface properties and crystallization behavior. The Midwest Center for Structural Genomics (MCSG) has previously reported how reductive methylation of lysine residues in proteins can improve crystallization of unique proteins that initially failed to produce diffraction-quality crystals. Recently, this approach has been expanded to include ethylation and isopropylation in the MCSG protein crystallization pipeline. Applying standard methods, 180 unique proteins were alkylated and screened using standard crystallization procedures. Crystal structures of 12 new proteins were determined, including the first ethylated and the first isopropylated protein structures. In a few cases, the structures of native and methylated or ethylated states were obtained and the impact of reductive alkylation of lysine residues was assessed. Reductive methylation tends to be more efficient and produces the most alkylated protein structures. Structures of methylated proteins typically have higher resolution limits. A number of well-ordered alkylated lysine residues have been identified, which make both intermolecular and intramolecular contacts. The previous report is updated and complemented with the following new data; a description of a detailed alkylation protocol with results, structural features, and roles of alkylated lysine residues in protein crystals. These contribute to improved crystallization properties of some proteins.
Salvage of Failed Protein Targets by Reductive Alkylation
Tan, Kemin; Kim, Youngchang; Hatzos-Skintges, Catherine; Chang, Changsoo; Cuff, Marianne; Chhor, Gekleng; Osipiuk, Jerzy; Michalska, Karolina; Nocek, Boguslaw; An, Hao; Babnigg, Gyorgy; Bigelow, Lance; Joachimiak, Grazyna; Li, Hui; Mack, Jamey; Makowska-Grzyska, Magdalena; Maltseva, Natalia; Mulligan, Rory; Tesar, Christine; Zhou, Min; Joachimiak, Andrzej
2014-01-01
The growth of diffraction-quality single crystals is of primary importance in protein X-ray crystallography. Chemical modification of proteins can alter their surface properties and crystallization behavior. The Midwest Center for Structural Genomics (MCSG) has previously reported how reductive methylation of lysine residues in proteins can improve crystallization of unique proteins that initially failed to produce diffraction-quality crystals. Recently, this approach has been expanded to include ethylation and isopropylation in the MCSG protein crystallization pipeline. Applying standard methods, 180 unique proteins were alkylated and screened using standard crystallization procedures. Crystal structures of 12 new proteins were determined, including the first ethylated and the first isopropylated protein structures. In a few cases, the structures of native and methylated or ethylated states were obtained and the impact of reductive alkylation of lysine residues was assessed. Reductive methylation tends to be more efficient and produces the most alkylated protein structures. Structures of methylated proteins typically have higher resolution limits. A number of well-ordered alkylated lysine residues have been identified, which make both intermolecular and intramolecular contacts. The previous report is updated and complemented with the following new data; a description of a detailed alkylation protocol with results, structural features, and roles of alkylated lysine residues in protein crystals. These contribute to improved crystallization properties of some proteins. PMID:24590719
Khafizov, Kamil; Madrid-Aliste, Carlos; Almo, Steven C.; Fiser, Andras
2014-01-01
The exponential growth of protein sequence data provides an ever-expanding body of unannotated and misannotated proteins. The National Institutes of Health-supported Protein Structure Initiative and related worldwide structural genomics efforts facilitate functional annotation of proteins through structural characterization. Recently there have been profound changes in the taxonomic composition of sequence databases, which are effectively redefining the scope and contribution of these large-scale structure-based efforts. The faster-growing bacterial genomic entries have overtaken the eukaryotic entries over the last 5 y, but also have become more redundant. Despite the enormous increase in the number of sequences, the overall structural coverage of proteins—including proteins for which reliable homology models can be generated—on the residue level has increased from 30% to 40% over the last 10 y. Structural genomics efforts contributed ∼50% of this new structural coverage, despite determining only ∼10% of all new structures. Based on current trends, it is expected that ∼55% structural coverage (the level required for significant functional insight) will be achieved within 15 y, whereas without structural genomics efforts, realizing this goal will take approximately twice as long. PMID:24567391
Robust, high-throughput solution structural analyses by small angle X-ray scattering (SAXS)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hura, Greg L.; Menon, Angeli L.; Hammel, Michal
2009-07-20
We present an efficient pipeline enabling high-throughput analysis of protein structure in solution with small angle X-ray scattering (SAXS). Our SAXS pipeline combines automated sample handling of microliter volumes, temperature and anaerobic control, rapid data collection and data analysis, and couples structural analysis with automated archiving. We subjected 50 representative proteins, mostly from Pyrococcus furiosus, to this pipeline and found that 30 were multimeric structures in solution. SAXS analysis allowed us to distinguish aggregated and unfolded proteins, define global structural parameters and oligomeric states for most samples, identify shapes and similar structures for 25 unknown structures, and determine envelopes formore » 41 proteins. We believe that high-throughput SAXS is an enabling technology that may change the way that structural genomics research is done.« less
Dal Palù, Alessandro; Dovier, Agostino; Pontelli, Enrico
2010-01-01
Crystal lattices are discrete models of the three-dimensional space that have been effectively employed to facilitate the task of determining proteins' natural conformation. This paper investigates alternative global constraints that can be introduced in a constraint solver over discrete crystal lattices. The objective is to enhance the efficiency of lattice solvers in dealing with the construction of approximate solutions of the protein structure determination problem. Some of them (e.g., self-avoiding-walk) have been explicitly or implicitly already used in previous approaches, while others (e.g., the density constraint) are new. The intrinsic complexities of all of them are studied and preliminary experimental results are discussed.
SAIL--stereo-array isotope labeling.
Kainosho, Masatsune; Güntert, Peter
2009-11-01
Optimal stereospecific and regiospecific labeling of proteins with stable isotopes enhances the nuclear magnetic resonance (NMR) method for the determination of the three-dimensional protein structures in solution. Stereo-array isotope labeling (SAIL) offers sharpened lines, spectral simplification without loss of information and the ability to rapidly collect and automatically evaluate the structural restraints required to solve a high-quality solution structure for proteins up to twice as large as before. This review gives an overview of stable isotope labeling methods for NMR spectroscopy with proteins and provides an in-depth treatment of the SAIL technology.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Baig, M.; Brown, A.; Eswaramoorthy, S.
Klebsiella pneumoniae, a gram-negative enteric bacterium, is found in nosocomial infections which are acquired during hospital stays for about 10% of hospital patients in the United States. The crystal structure of a putative oxidoreductase from K. pneumoniae has been determined. The structural information of this K. pneumoniae protein was used to understand its function. Crystals of the putative oxidoreductase enzyme were obtained by the sitting drop vapor diffusion method using Polyethylene glycol (PEG) 3350, Bis-Tris buffer, pH 5.5 as precipitant. These crystals were used to collect X-ray data at beam line X12C of the National Synchrotron Light Source (NSLS) atmore » Brookhaven National Laboratory (BNL). The crystal structure was determined using the SHELX program and refi ned with CNS 1.1. This protein, which is involved in the catalysis of an oxidation-reduction (redox) reaction, has an alpha/beta structure. It utilizes nicotinamide adenine dinucleotide phosphate (NADP) or nicotine adenine dinucleotide (NAD) to perform its function. This structure could be used to determine the active and co-factor binding sites of the protein, information that could help pharmaceutical companies in drug design and in determining the protein’s relationship to disease treatment such as that for pneumonia and other related pathologies.« less
Structure determination of helical filaments by solid-state NMR spectroscopy
Ahmed, Mumdooh; Spehr, Johannes; König, Renate; Lünsdorf, Heinrich; Rand, Ulfert; Lührs, Thorsten; Ritter, Christiane
2016-01-01
The controlled formation of filamentous protein complexes plays a crucial role in many biological systems and represents an emerging paradigm in signal transduction. The mitochondrial antiviral signaling protein (MAVS) is a central signal transduction hub in innate immunity that is activated by a receptor-induced conversion into helical superstructures (filaments) assembled from its globular caspase activation and recruitment domain. Solid-state NMR (ssNMR) spectroscopy has become one of the most powerful techniques for atomic resolution structures of protein fibrils. However, for helical filaments, the determination of the correct symmetry parameters has remained a significant hurdle for any structural technique and could thus far not be precisely derived from ssNMR data. Here, we solved the atomic resolution structure of helical MAVSCARD filaments exclusively from ssNMR data. We present a generally applicable approach that systematically explores the helical symmetry space by efficient modeling of the helical structure restrained by interprotomer ssNMR distance restraints. Together with classical automated NMR structure calculation, this allowed us to faithfully determine the symmetry that defines the entire assembly. To validate our structure, we probed the protomer arrangement by solvent paramagnetic resonance enhancement, analysis of chemical shift differences relative to the solution NMR structure of the monomer, and mutagenesis. We provide detailed information on the atomic contacts that determine filament stability and describe mechanistic details on the formation of signaling-competent MAVS filaments from inactive monomers. PMID:26733681
Defining an essence of structure determining residue contacts in proteins.
Sathyapriya, R; Duarte, Jose M; Stehr, Henning; Filippis, Ioannis; Lappe, Michael
2009-12-01
The network of native non-covalent residue contacts determines the three-dimensional structure of a protein. However, not all contacts are of equal structural significance, and little knowledge exists about a minimal, yet sufficient, subset required to define the global features of a protein. Characterisation of this "structural essence" has remained elusive so far: no algorithmic strategy has been devised to-date that could outperform a random selection in terms of 3D reconstruction accuracy (measured as the Ca RMSD). It is not only of theoretical interest (i.e., for design of advanced statistical potentials) to identify the number and nature of essential native contacts-such a subset of spatial constraints is very useful in a number of novel experimental methods (like EPR) which rely heavily on constraint-based protein modelling. To derive accurate three-dimensional models from distance constraints, we implemented a reconstruction pipeline using distance geometry. We selected a test-set of 12 protein structures from the four major SCOP fold classes and performed our reconstruction analysis. As a reference set, series of random subsets (ranging from 10% to 90% of native contacts) are generated for each protein, and the reconstruction accuracy is computed for each subset. We have developed a rational strategy, termed "cone-peeling" that combines sequence features and network descriptors to select minimal subsets that outperform the reference sets. We present, for the first time, a rational strategy to derive a structural essence of residue contacts and provide an estimate of the size of this minimal subset. Our algorithm computes sparse subsets capable of determining the tertiary structure at approximately 4.8 A Ca RMSD with as little as 8% of the native contacts (Ca-Ca and Cb-Cb). At the same time, a randomly chosen subset of native contacts needs about twice as many contacts to reach the same level of accuracy. This "structural essence" opens new avenues in the fields of structure prediction, empirical potentials and docking.
Defining an Essence of Structure Determining Residue Contacts in Proteins
Sathyapriya, R.; Duarte, Jose M.; Stehr, Henning; Filippis, Ioannis; Lappe, Michael
2009-01-01
The network of native non-covalent residue contacts determines the three-dimensional structure of a protein. However, not all contacts are of equal structural significance, and little knowledge exists about a minimal, yet sufficient, subset required to define the global features of a protein. Characterisation of this “structural essence” has remained elusive so far: no algorithmic strategy has been devised to-date that could outperform a random selection in terms of 3D reconstruction accuracy (measured as the Ca RMSD). It is not only of theoretical interest (i.e., for design of advanced statistical potentials) to identify the number and nature of essential native contacts—such a subset of spatial constraints is very useful in a number of novel experimental methods (like EPR) which rely heavily on constraint-based protein modelling. To derive accurate three-dimensional models from distance constraints, we implemented a reconstruction pipeline using distance geometry. We selected a test-set of 12 protein structures from the four major SCOP fold classes and performed our reconstruction analysis. As a reference set, series of random subsets (ranging from 10% to 90% of native contacts) are generated for each protein, and the reconstruction accuracy is computed for each subset. We have developed a rational strategy, termed “cone-peeling” that combines sequence features and network descriptors to select minimal subsets that outperform the reference sets. We present, for the first time, a rational strategy to derive a structural essence of residue contacts and provide an estimate of the size of this minimal subset. Our algorithm computes sparse subsets capable of determining the tertiary structure at approximately 4.8 Å Ca RMSD with as little as 8% of the native contacts (Ca-Ca and Cb-Cb). At the same time, a randomly chosen subset of native contacts needs about twice as many contacts to reach the same level of accuracy. This “structural essence” opens new avenues in the fields of structure prediction, empirical potentials and docking. PMID:19997489
Kalinowska, Barbara; Banach, Mateusz; Konieczny, Leszek; Marchewka, Damian; Roterman, Irena
2014-01-01
This work discusses the role of unstructured polypeptide chain fragments in shaping the protein's hydrophobic core. Based on the "fuzzy oil drop" model, which assumes an idealized distribution of hydrophobicity density described by the 3D Gaussian, we can determine which fragments make up the core and pinpoint residues whose location conflicts with theoretical predictions. We show that the structural influence of the water environment determines the positions of disordered fragments, leading to the formation of a hydrophobic core overlaid by a hydrophilic mantle. This phenomenon is further described by studying selected proteins which are known to be unstable and contain intrinsically disordered fragments. Their properties are established quantitatively, explaining the causative relation between the protein's structure and function and facilitating further comparative analyses of various structural models. © 2014 Elsevier Inc. All rights reserved.
Characterization of the Structural Gene Promoter of Aedes aegypti Densovirus
Ward, Todd W.; Kimmick, Michael W.; Afanasiev, Boris N.; Carlson, Jonathan O.
2001-01-01
Aedes aegypti densonucleosis virus (AeDNV) has two promoters that have been shown to be active by reporter gene expression analysis (B. N. Afanasiev, Y. V. Koslov, J. O. Carlson, and B. J. Beaty, Exp. Parasitol. 79:322–339, 1994). Northern blot analysis of cells infected with AeDNV revealed two transcripts 1,200 and 3,500 nucleotides in length that are assumed to express the structural protein (VP) gene and nonstructural protein genes, respectively. Primer extension was used to map the transcriptional start site of the structural protein gene. Surprisingly, the structural protein gene transcript began at an initiator consensus sequence, CAGT, 60 nucleotides upstream from the map unit 61 TATAA sequence previously thought to define the promoter. Constructs with the β-galactosidase gene fused to the structural protein gene were used to determine elements necessary for promoter function. Deletion or mutation of the initiator sequence, CAGT, reduced protein expression by 93%, whereas mutation of the TATAA sequence at map unit 61 had little effect. An additional open reading frame was observed upstream of the structural protein gene that can express β-galactosidase at a low level (20% of that of VP fusions). Expression of the AeDNV structural protein gene was shown to be stimulated by the major nonstructural protein NS1 (Afanasiev et al., Exp. parasitol., 1994). To determine the sequences required for transactivation, expression of structural protein gene–β-galactosidase gene fusion constructs differing in AeDNV genome content was measured with and without NS1. The presence of NS1 led to an 8- to 10-fold increase in expression when either genomic end was present, compared to a 2-fold increase with a construct lacking the genomic ends. An even higher (37-fold) increase in expression occurred with both genomic ends present; however, this was in part due to template replication as shown by Southern blot analysis. These data indicate the location and importance of various elements necessary for efficient protein expression and transactivation from the structural protein gene promoter of AeDNV. PMID:11152505
Tools to evaluate the conformation of protein products.
Manta, Bruno; Obal, Gonzalo; Ricciardi, Alejandro; Pritsch, Otto; Denicola, Ana
2011-06-01
Production of recombinant proteins is a process intensively used in the research laboratory. In addition, the main biotechnology market products are recombinant proteins and monoclonal antibodies. The biological (and clinical) properties of the protein product strongly depend on the conformation of the polypeptide. Therefore, assessment of the correct conformation of the produced protein is crucial. There is no single method to assess every aspect of protein structure or function. Depending on the protein, the methods of choice vary. There are general methods to evaluate not only mass and primary sequence of the protein, but also higher-order structure. This review outlines the principal techniques for determining the conformation of a protein from structural (biophysical methods) to functional (in vitro binding assays) analyses. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Visualizing and Clustering Protein Similarity Networks: Sequences, Structures, and Functions.
Mai, Te-Lun; Hu, Geng-Ming; Chen, Chi-Ming
2016-07-01
Research in the recent decade has demonstrated the usefulness of protein network knowledge in furthering the study of molecular evolution of proteins, understanding the robustness of cells to perturbation, and annotating new protein functions. In this study, we aimed to provide a general clustering approach to visualize the sequence-structure-function relationship of protein networks, and investigate possible causes for inconsistency in the protein classifications based on sequences, structures, and functions. Such visualization of protein networks could facilitate our understanding of the overall relationship among proteins and help researchers comprehend various protein databases. As a demonstration, we clustered 1437 enzymes by their sequences and structures using the minimum span clustering (MSC) method. The general structure of this protein network was delineated at two clustering resolutions, and the second level MSC clustering was found to be highly similar to existing enzyme classifications. The clustering of these enzymes based on sequence, structure, and function information is consistent with each other. For proteases, the Jaccard's similarity coefficient is 0.86 between sequence and function classifications, 0.82 between sequence and structure classifications, and 0.78 between structure and function classifications. From our clustering results, we discussed possible examples of divergent evolution and convergent evolution of enzymes. Our clustering approach provides a panoramic view of the sequence-structure-function network of proteins, helps visualize the relation between related proteins intuitively, and is useful in predicting the structure and function of newly determined protein sequences.
Direct demodulation method for heavy atom position determination in protein crystallography
NASA Astrophysics Data System (ADS)
Zhou, Liang; Liu, Zhong-Chuan; Liu, Peng; Dong, Yu-Hui
2013-01-01
The first step of phasing in any de novo protein structure determination using isomorphous replacement (IR) or anomalous scattering (AD) experiments is to find heavy atom positions. Traditionally, heavy atom positions can be solved by inspecting the difference Patterson maps. Due to the weak signals in isomorphous or anomalous differences and the noisy background in the Patterson map, the search for heavy atoms may become difficult. Here, the direct demodulation (DD) method is applied to the difference Patterson maps to reduce the noisy backgrounds and sharpen the signal peaks. The real space Patterson search by using these optimized maps can locate the heavy atom positions more accurately. It is anticipated that the direct demodulation method can assist in heavy atom position determination and facilitate the de novo structure determination of proteins.
Development of techniques in magnetic resonance and structural studies of the prion protein
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bitter, Hans-Marcus L.
2000-07-01
Magnetic resonance is the most powerful analytical tool used by chemists today. Its applications range from determining structures of large biomolecules to imaging of human brains. Nevertheless, magnetic resonance remains a relatively young field, in which many techniques are currently being developed that have broad applications. In this dissertation, two new techniques are presented, one that enables the determination of torsion angles in solid-state peptides and proteins, and another that involves imaging of heterogenous materials at ultra-low magnetic fields. In addition, structural studies of the prion protein via solid-state NMR are described. More specifically, work is presented in which themore » dependence of chemical shifts on local molecular structure is used to predict chemical shift tensors in solid-state peptides with theoretical ab initio surfaces. These predictions are then used to determine the backbone dihedral angles in peptides. This method utilizes the theoretical chemicalshift tensors and experimentally determined chemical-shift anisotropies (CSAs) to predict the backbone and side chain torsion angles in alanine, leucine, and valine residues. Additionally, structural studies of prion protein fragments are described in which conformationally-dependent chemical-shift measurements were made to gain insight into the structural differences between the various conformational states of the prion protein. These studies are of biological and pathological interest since conformational changes in the prion protein are believed to cause prion diseases. Finally, an ultra-low field magnetic resonance imaging technique is described that enables imaging and characterization of heterogeneous and porous media. The notion of imaging gases at ultra-low fields would appear to be very difficult due to the prohibitively low polarization and spin densities as well as the low sensitivities of conventional Faraday coil detectors. However, Chapter 5 describes how gas imaging at ultra-low fields is realized by incorporating the high sensitivities of a dc superconducting quantum interference device (SQUID) with the high polarizations attainable through optica11y pumping 129Xe gas.« less
Structural genomics: keeping up with expanding knowledge of the protein universe.
Grabowski, Marek; Joachimiak, Andrzej; Otwinowski, Zbyszek; Minor, Wladek
2007-06-01
Structural characterization of the protein universe is the main mission of Structural Genomics (SG) programs. However, progress in gene sequencing technology, set in motion in the 1990s, has resulted in rapid expansion of protein sequence space--a twelvefold increase in the past seven years. For the SG field, this creates new challenges and necessitates a re-assessment of its strategies. Nevertheless, despite the growth of sequence space, at present nearly half of the content of the Swiss-Prot database and over 40% of Pfam protein families can be structurally modeled based on structures determined so far, with SG projects making an increasingly significant contribution. The SG contribution of new Pfam structures nearly doubled from 27.2% in 2003 to 51.6% in 2006.
Mandal, Kalyaneswar; Uppalapati, Maruti; Ault-Riché, Dana; Kenney, John; Lowitz, Joshua; Sidhu, Sachdev S; Kent, Stephen B H
2012-09-11
Total chemical synthesis was used to prepare the mirror image (D-protein) form of the angiogenic protein vascular endothelial growth factor (VEGF-A). Phage display against D-VEGF-A was used to screen designed libraries based on a unique small protein scaffold in order to identify a high affinity ligand. Chemically synthesized D- and L- forms of the protein ligand showed reciprocal chiral specificity in surface plasmon resonance binding experiments: The L-protein ligand bound only to D-VEGF-A, whereas the D-protein ligand bound only to L-VEGF-A. The D-protein ligand, but not the L-protein ligand, inhibited the binding of natural VEGF(165) to the VEGFR1 receptor. Racemic protein crystallography was used to determine the high resolution X-ray structure of the heterochiral complex consisting of {D-protein antagonist + L-protein form of VEGF-A}. Crystallization of a racemic mixture of these synthetic proteins in appropriate stoichiometry gave a racemic protein complex of more than 73 kDa containing six synthetic protein molecules. The structure of the complex was determined to a resolution of 1.6 Å. Detailed analysis of the interaction between the D-protein antagonist and the VEGF-A protein molecule showed that the binding interface comprised a contact surface area of approximately 800 Å(2) in accord with our design objectives, and that the D-protein antagonist binds to the same region of VEGF-A that interacts with VEGFR1-domain 2.
Biotechnology Protein Expression and Purification Facility
NASA Technical Reports Server (NTRS)
2003-01-01
The purpose of the Project Scientist Core Facility is to provide purified proteins, both recombinant and natural, to the Biotechnology Science Team Project Scientists and the NRA-Structural Biology Test Investigators. Having a core facility for this purpose obviates the need for each scientist to develop the necessary expertise and equipment for molecular biology, protein expression, and protein purification. Because of this, they are able to focus their energies as well as their funding on the crystallization and structure determination of their target proteins.
Comparative Protein Structure Modeling Using MODELLER.
Webb, Benjamin; Sali, Andrej
2014-09-08
Functional characterization of a protein sequence is one of the most frequent problems in biology. This task is usually facilitated by accurate three-dimensional (3-D) structure of the studied protein. In the absence of an experimentally determined structure, comparative or homology modeling can sometimes provide a useful 3-D model for a protein that is related to at least one known protein structure. Comparative modeling predicts the 3-D structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described. Copyright © 2014 John Wiley & Sons, Inc.
Nakamura, Akira; Ohtsuka, Jun; Kashiwagi, Tatsuki; Numoto, Nobutaka; Hirota, Noriyuki; Ode, Takahiro; Okada, Hidehiko; Nagata, Koji; Kiyohara, Motosuke; Suzuki, Ei-Ichiro; Kita, Akiko; Wada, Hitoshi; Tanokura, Masaru
2016-02-26
Precise protein structure determination provides significant information on life science research, although high-quality crystals are not easily obtained. We developed a system for producing high-quality protein crystals with high throughput. Using this system, gravity-controlled crystallization are made possible by a magnetic microgravity environment. In addition, in-situ and real-time observation and time-lapse imaging of crystal growth are feasible for over 200 solution samples independently. In this paper, we also report results of crystallization experiments for two protein samples. Crystals grown in the system exhibited magnetic orientation and showed higher and more homogeneous quality compared with the control crystals. The structural analysis reveals that making use of the magnetic microgravity during the crystallization process helps us to build a well-refined protein structure model, which has no significant structural differences with a control structure. Therefore, the system contributes to improvement in efficiency of structural analysis for "difficult" proteins, such as membrane proteins and supermolecular complexes.
Pineda-Lucena, Antonio; Liao, Jack C C; Cort, John R; Yee, Adelinda; Kennedy, Michael A; Edwards, Aled M; Arrowsmith, Cheryl H
2003-05-01
As part of the Northeast Structural Genomics Consortium pilot project focused on small eukaryotic proteins and protein domains, we have determined the NMR structure of the protein encoded by ORF YML108W from Saccharomyces cerevisiae. YML108W belongs to one of the numerous structural proteomics targets whose biological function is unknown. Moreover, this protein does not have sequence similarity to any other protein. The NMR structure of YML108W consists of a four-stranded beta-sheet with strand order 2143 and two alpha-helices, with an overall topology of betabetaalphabetabetaalpha. Strand beta1 runs parallel to beta4, and beta2:beta1 and beta4:beta3 pairs are arranged in an antiparallel fashion. Although this fold belongs to the split betaalphabeta family, it appears to be unique among this family; it is a novel arrangement of secondary structure, thereby expanding the universe of protein folds.
Tuning structure of oppositely charged nanoparticle and protein complexes
NASA Astrophysics Data System (ADS)
Kumar, Sugam; Aswal, V. K.; Callow, P.
2014-04-01
Small-angle neutron scattering (SANS) has been used to probe the structures of anionic silica nanoparticles (LS30) and cationic lyszyme protein (M.W. 14.7kD, I.P. ˜ 11.4) by tuning their interaction through the pH variation. The protein adsorption on nanoparticles is found to be increasing with pH and determined by the electrostatic attraction between two components as well as repulsion between protein molecules. We show the strong electrostatic attraction between nanoparticles and protein molecules leads to protein-mediated aggregation of nanoparticles which are characterized by fractal structures. At pH 5, the protein adsorption gives rise to nanoparticle aggregation having surface fractal morphology with close packing of nanoparticles. The surface fractals transform to open structures of mass fractal morphology at higher pH (7 and 9) on approaching isoelectric point (I.P.).
DOE Office of Scientific and Technical Information (OSTI.GOV)
Judd, R.C.; Caldwell, H.D.
1985-01-01
The objective of this study was to determine if in-gel chloramine-T radioiodination adequately labels OM proteins to allow for accurate and precise structural comparison of these molecules. Therefore, intrinsically /sup 14/C-amino acid labeled proteins and /sup 125/I-labeled proteins were cleaved with two endopeptidic reagents and the peptide fragments separated by HPLC. A comparison of retention times of the fragments, as determined by differential radiation counting, thus indicated whether /sup 125/Ilabeling identified of all the peptide peaks seen in the /sup 14/Clabeled proteins. Results demonstrated that radioiodination yields complete and accurate information about the primary structure of outer membrane proteins. Inmore » addition, it permits the use of extremely small amounts of protein allowing for method optimization and multiple separations to insure reproducibility.« less
Protein crystallization: Eluding the bottleneck of X-ray crystallography
Holcomb, Joshua; Spellmon, Nicholas; Zhang, Yingxue; Doughan, Maysaa; Li, Chunying; Yang, Zhe
2017-01-01
To date, X-ray crystallography remains the gold standard for the determination of macromolecular structure and protein substrate interactions. However, the unpredictability of obtaining a protein crystal remains the limiting factor and continues to be the bottleneck in determining protein structures. A vast amount of research has been conducted in order to circumvent this issue with limited success. No single method has proven to guarantee the crystallization of all proteins. However, techniques using antibody fragments, lipids, carrier proteins, and even mutagenesis of crystal contacts have been implemented to increase the odds of obtaining a crystal with adequate diffraction. In addition, we review a new technique using the scaffolding ability of PDZ domains to facilitate nucleation and crystal lattice formation. Although in its infancy, such technology may be a valuable asset and another method in the crystallography toolbox to further the chances of crystallizing problematic proteins. PMID:29051919
NASA Astrophysics Data System (ADS)
Illing, Gerd; Saenger, Wolfram; Heinemann, Udo
2000-06-01
The Protein Structure Factory will be established to characterize proteins encoded by human genes or cDNAs, which will be selected by criteria of potential structural novelty or medical or biotechnological usefulness. It represents an integrative approach to structure analysis combining bioinformatics techniques, automated gene expression and purification of gene products, generation of a biophysical fingerprint of the proteins and the determination of their three-dimensional structures either by NMR spectroscopy or by X-ray diffraction. The use of synchrotron radiation will be crucial to the Protein Structure Factory: high brilliance and tunable wavelengths are prerequisites for fast data collection, the use of small crystals and multiwavelength anomalous diffraction (MAD) phasing. With the opening of BESSY II, direct access to a third-generation XUV storage ring source with excellent conditions is available nearby. An insertion device with two MAD beamlines and one constant energy station will be set up until 2001.
Recognition of coarse-grained protein tertiary structure.
Lezon, Timothy; Banavar, Jayanth R; Maritan, Amos
2004-05-15
A model of the protein backbone is considered in which each residue is characterized by the location of its C(alpha) atom and one of a discrete set of conformal (phi, psi) states. We investigate the key differences between a description that offers a locally precise fit to known backbone structures and one that provides a globally accurate fit to protein structures. Using a statistical scoring scheme and threading, a protein's local best-fit conformation is highly recognizable, but its global structure cannot be directly determined from an amino acid sequence. The incorporation of information about the conformal states of neighboring residues along the chain allows one to accurately translate the local structure into a global structure. We present a two-step algorithm, which recognizes up to 95% of the tested protein native-state structures to within a 2.5 A root mean square deviation. Copyright 2004 Wiley-Liss, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Porebski, Przemyslaw J.; Klimecka, Maria; Chruszcz, Maksymilian
2012-07-11
Dethiobiotin synthetase (DTBS) is involved in the biosynthesis of biotin in bacteria, fungi, and plants. As humans lack this pathway, DTBS is a promising antimicrobial drug target. We determined structures of DTBS from Helicobacter pylori (hpDTBS) bound with cofactors and a substrate analog, and described its unique characteristics relative to other DTBS proteins. Comparison with bacterial DTBS orthologs revealed considerable structural differences in nucleotide recognition. The C-terminal region of DTBS proteins, which contains two nucleotide-recognition motifs, differs greatly among DTBS proteins from different species. The structure of hpDTBS revealed that this protein is unique and does not contain a C-terminalmore » region containing one of the motifs. The single nucleotide-binding motif in hpDTBS is similar to its counterpart in GTPases; however, isothermal titration calorimetry binding studies showed that hpDTBS has a strong preference for ATP. The structural determinants of ATP specificity were assessed with X-ray crystallographic studies of hpDTBS-ATP and hpDTBS-GTP complexes. The unique mode of nucleotide recognition in hpDTBS makes this protein a good target for H. pylori-specific inhibitors of the biotin synthesis pathway.« less
NASA Astrophysics Data System (ADS)
Xu, Zhijun; Lazim, Raudah; Sun, Tiedong; Mei, Ye; Zhang, Dawei
2012-04-01
Solvent effect on protein conformation and folding mechanism of E6-associated protein (E6ap) peptide are investigated using a recently developed charge update scheme termed as adaptive hydrogen bond-specific charge (AHBC). On the basis of the close agreement between the calculated helix contents from AHBC simulations and experimental results, we observed based on the presented simulations that the two ends of the peptide may simultaneously take part in the formation of the helical structure at the early stage of folding and finally merge to form a helix with lowest backbone RMSD of about 0.9 Å in 40% 2,2,2-trifluoroethanol solution. However, in pure water, the folding may start at the center of the peptide sequence instead of at the two opposite ends. The analysis of the free energy landscape indicates that the solvent may determine the folding clusters of E6ap, which subsequently leads to the different final folded structure. The current study demonstrates new insight to the role of solvent in the determination of protein structure and folding dynamics.
Data Mining of Macromolecular Structures.
van Beusekom, Bart; Perrakis, Anastassis; Joosten, Robbie P
2016-01-01
The use of macromolecular structures is widespread for a variety of applications, from teaching protein structure principles all the way to ligand optimization in drug development. Applying data mining techniques on these experimentally determined structures requires a highly uniform, standardized structural data source. The Protein Data Bank (PDB) has evolved over the years toward becoming the standard resource for macromolecular structures. However, the process selecting the data most suitable for specific applications is still very much based on personal preferences and understanding of the experimental techniques used to obtain these models. In this chapter, we will first explain the challenges with data standardization, annotation, and uniformity in the PDB entries determined by X-ray crystallography. We then discuss the specific effect that crystallographic data quality and model optimization methods have on structural models and how validation tools can be used to make informed choices. We also discuss specific advantages of using the PDB_REDO databank as a resource for structural data. Finally, we will provide guidelines on how to select the most suitable protein structure models for detailed analysis and how to select a set of structure models suitable for data mining.
Pokkuluri, P Raj; Dwulit-Smith, Jeff; Duke, Norma E; Wilton, Rosemarie; Mack, Jamey C; Bearden, Jessica; Rakowski, Ella; Babnigg, Gyorgy; Szurmant, Hendrik; Joachimiak, Andrzej; Schiffer, Marianne
2013-01-01
Anaeromyxobacter dehalogenans is a δ-proteobacterium found in diverse soils and sediments. It is of interest in bioremediation efforts due to its dechlorination and metal-reducing capabilities. To gain an understanding on A. dehalogenans' abilities to adapt to diverse environments we analyzed its signal transduction proteins. The A. dehalogenans genome codes for a large number of sensor histidine kinases (HK) and methyl-accepting chemotaxis proteins (MCP); among these 23 HK and 11 MCP proteins have a sensor domain in the periplasm. These proteins most likely contribute to adaptation to the organism's surroundings. We predicted their three-dimensional folds and determined the structures of two of the periplasmic sensor domains by X-ray diffraction. Most of the domains are predicted to have either PAS-like or helical bundle structures, with two predicted to have solute-binding protein fold, and another predicted to have a 6-phosphogluconolactonase like fold. Atomic structures of two sensor domains confirmed the respective fold predictions. The Adeh_2942 sensor (HK) was found to have a helical bundle structure, and the Adeh_3718 sensor (MCP) has a PAS-like structure. Interestingly, the Adeh_3718 sensor has an acetate moiety bound in a binding site typical for PAS-like domains. Future work is needed to determine whether Adeh_3718 is involved in acetate sensing by A. dehalogenans. PMID:23897711
The hypothetical protein Atu4866 from Agrobacterium tumefaciens adopts a streptavidin-like fold
Ai, Xuanjun; Semesi, Anthony; Yee, Adelinda; Arrowsmith, Cheryl H.; Choy, Wing-Yiu; Li, Shawn S.C.
2008-01-01
Atu4866 is a 79-residue conserved hypothetical protein of unknown function from Agrobacterium tumefaciens. Protein sequence alignments show that it shares ≥60% sequence identity with 20 other hypothetical proteins of bacterial origin. However, the structures and functions of these proteins remain unknown so far. To gain insight into the function of this family of proteins, we have determined the structure of Atu4866 as a target of a structural genomics project using solution NMR spectroscopy. Our results reveal that Atu4866 adopts a streptavidin-like fold featuring a β-barrel/sandwich formed by eight antiparallel β-strands. Further structural analysis identified a continuous patch of conserved residues on the surface of Atu4866 that may constitute a potential ligand-binding site. PMID:18042676
Lipidic cubic phase injector facilitates membrane protein serial femtosecond crystallography.
Weierstall, Uwe; James, Daniel; Wang, Chong; White, Thomas A; Wang, Dingjie; Liu, Wei; Spence, John C H; Bruce Doak, R; Nelson, Garrett; Fromme, Petra; Fromme, Raimund; Grotjohann, Ingo; Kupitz, Christopher; Zatsepin, Nadia A; Liu, Haiguang; Basu, Shibom; Wacker, Daniel; Han, Gye Won; Katritch, Vsevolod; Boutet, Sébastien; Messerschmidt, Marc; Williams, Garth J; Koglin, Jason E; Marvin Seibert, M; Klinker, Markus; Gati, Cornelius; Shoeman, Robert L; Barty, Anton; Chapman, Henry N; Kirian, Richard A; Beyerlein, Kenneth R; Stevens, Raymond C; Li, Dianfan; Shah, Syed T A; Howe, Nicole; Caffrey, Martin; Cherezov, Vadim
2014-01-01
Lipidic cubic phase (LCP) crystallization has proven successful for high-resolution structure determination of challenging membrane proteins. Here we present a technique for extruding gel-like LCP with embedded membrane protein microcrystals, providing a continuously renewed source of material for serial femtosecond crystallography. Data collected from sub-10-μm-sized crystals produced with less than 0.5 mg of purified protein yield structural insights regarding cyclopamine binding to the Smoothened receptor.
Wolf, Maxim Y; Wolf, Yuri I; Koonin, Eugene V
2008-01-01
Background Proteins show a broad range of evolutionary rates. Understanding the factors that are responsible for the characteristic rate of evolution of a given protein arguably is one of the major goals of evolutionary biology. A long-standing general assumption used to be that the evolution rate is, primarily, determined by the specific functional constraints that affect the given protein. These constrains were traditionally thought to depend both on the specific features of the protein's structure and its biological role. The advent of systems biology brought about new types of data, such as expression level and protein-protein interactions, and unexpectedly, a variety of correlations between protein evolution rate and these variables have been observed. The strongest connections by far were repeatedly seen between protein sequence evolution rate and the expression level of the respective gene. It has been hypothesized that this link is due to the selection for the robustness of the protein structure to mistranslation-induced misfolding that is particularly important for highly expressed proteins and is the dominant determinant of the sequence evolution rate. Results This work is an attempt to assess the relative contributions of protein domain structure and function, on the one hand, and expression level on the other hand, to the rate of sequence evolution. To this end, we performed a genome-wide analysis of the effect of the fusion of a pair of domains in multidomain proteins on the difference in the domain-specific evolutionary rates. The mistranslation-induced misfolding hypothesis would predict that, within multidomain proteins, fused domains, on average, should evolve at substantially closer rates than the same domains in different proteins because, within a mutlidomain protein, all domains are translated at the same rate. We performed a comprehensive comparison of the evolutionary rates of mammalian and plant protein domains that are either joined in multidomain proteins or contained in distinct proteins. Substantial homogenization of evolutionary rates in multidomain proteins was, indeed, observed in both animals and plants, although highly significant differences between domain-specific rates remained. The contributions of the translation rate, as determined by the effect of the fusion of a pair of domains within a multidomain protein, and intrinsic, domain-specific structural-functional constraints appear to be comparable in magnitude. Conclusion Fusion of domains in a multidomain protein results in substantial homogenization of the domain-specific evolutionary rates but significant differences between domain-specific evolution rates remain. Thus, the rate of translation and intrinsic structural-functional constraints both exert sizable and comparable effects on sequence evolution. Reviewers This article was reviewed by Sergei Maslov, Dennis Vitkup, Claus Wilke (nominated by Orly Alter), and Allan Drummond (nominated by Joel Bader). For the full reviews, please go to the Reviewers' Reports section. PMID:18840284
Survey of large protein complexes D. vulgaris reveals great structural diversity
DOE Office of Scientific and Technical Information (OSTI.GOV)
Han, B.-G.; Dong, M.; Liu, H.
2009-08-15
An unbiased survey has been made of the stable, most abundant multi-protein complexes in Desulfovibrio vulgaris Hildenborough (DvH) that are larger than Mr {approx} 400 k. The quaternary structures for 8 of the 16 complexes purified during this work were determined by single-particle reconstruction of negatively stained specimens, a success rate {approx}10 times greater than that of previous 'proteomic' screens. In addition, the subunit compositions and stoichiometries of the remaining complexes were determined by biochemical methods. Our data show that the structures of only two of these large complexes, out of the 13 in this set that have recognizable functions,more » can be modeled with confidence based on the structures of known homologs. These results indicate that there is significantly greater variability in the way that homologous prokaryotic macromolecular complexes are assembled than has generally been appreciated. As a consequence, we suggest that relying solely on previously determined quaternary structures for homologous proteins may not be sufficient to properly understand their role in another cell of interest.« less
Jensen, Malene Ringkjøbing; Bernadó, Pau; Houben, Klaartje; Blanchard, Laurence; Marion, Dominque; Ruigrok, Rob W H; Blackledge, Martin
2010-08-01
Intrinsically disordered regions of significant length are present throughout eukaryotic genomes, and are particularly prevalent in viral proteins. Due to their inherent flexibility, these proteins inhabit a conformational landscape that is too complex to be described by classical structural biology. The elucidation of the role that conformational flexibility plays in molecular function will redefine our understanding of the molecular basis of biological function, and the development of appropriate technology to achieve this aim remains one of the major challenges for the future of structural biology. NMR is the technique of choice for studying intrinsically disordered proteins, providing information about structure, flexibility and interactions at atomic resolution even in completely disordered proteins. In particular residual dipolar couplings (RDCs) are sensitive and powerful tools for determining local and long-range structural behaviour in flexible proteins. Here we describe recent applications of the use of RDCs to quantitatively describe the level of local structure in intrinsically disordered proteins involved in replication and transcription in Sendai virus.
A protein-dependent side-chain rotamer library.
Bhuyan, Md Shariful Islam; Gao, Xin
2011-12-14
Protein side-chain packing problem has remained one of the key open problems in bioinformatics. The three main components of protein side-chain prediction methods are a rotamer library, an energy function and a search algorithm. Rotamer libraries summarize the existing knowledge of the experimentally determined structures quantitatively. Depending on how much contextual information is encoded, there are backbone-independent rotamer libraries and backbone-dependent rotamer libraries. Backbone-independent libraries only encode sequential information, whereas backbone-dependent libraries encode both sequential and locally structural information. However, side-chain conformations are determined by spatially local information, rather than sequentially local information. Since in the side-chain prediction problem, the backbone structure is given, spatially local information should ideally be encoded into the rotamer libraries. In this paper, we propose a new type of backbone-dependent rotamer library, which encodes structural information of all the spatially neighboring residues. We call it protein-dependent rotamer libraries. Given any rotamer library and a protein backbone structure, we first model the protein structure as a Markov random field. Then the marginal distributions are estimated by the inference algorithms, without doing global optimization or search. The rotamers from the given library are then re-ranked and associated with the updated probabilities. Experimental results demonstrate that the proposed protein-dependent libraries significantly outperform the widely used backbone-dependent libraries in terms of the side-chain prediction accuracy and the rotamer ranking ability. Furthermore, without global optimization/search, the side-chain prediction power of the protein-dependent library is still comparable to the global-search-based side-chain prediction methods.
Matveev, Vladimir V
2010-06-09
According to the hypothesis explored in this paper, native aggregation is genetically controlled (programmed) reversible aggregation that occurs when interacting proteins form new temporary structures through highly specific interactions. It is assumed that Anfinsen's dogma may be extended to protein aggregation: composition and amino acid sequence determine not only the secondary and tertiary structure of single protein, but also the structure of protein aggregates (associates). Cell function is considered as a transition between two states (two states model), the resting state and state of activity (this applies to the cell as a whole and to its individual structures). In the resting state, the key proteins are found in the following inactive forms: natively unfolded and globular. When the cell is activated, secondary structures appear in natively unfolded proteins (including unfolded regions in other proteins), and globular proteins begin to melt and their secondary structures become available for interaction with the secondary structures of other proteins. These temporary secondary structures provide a means for highly specific interactions between proteins. As a result, native aggregation creates temporary structures necessary for cell activity."One of the principal objects of theoretical research in any department of knowledge is to find the point of view from which the subject appears in its greatest simplicity."Josiah Willard Gibbs (1839-1903).
Structural alphabets derived from attractors in conformational space
2010-01-01
Background The hierarchical and partially redundant nature of protein structures justifies the definition of frequently occurring conformations of short fragments as 'states'. Collections of selected representatives for these states define Structural Alphabets, describing the most typical local conformations within protein structures. These alphabets form a bridge between the string-oriented methods of sequence analysis and the coordinate-oriented methods of protein structure analysis. Results A Structural Alphabet has been derived by clustering all four-residue fragments of a high-resolution subset of the protein data bank and extracting the high-density states as representative conformational states. Each fragment is uniquely defined by a set of three independent angles corresponding to its degrees of freedom, capturing in simple and intuitive terms the properties of the conformational space. The fragments of the Structural Alphabet are equivalent to the conformational attractors and therefore yield a most informative encoding of proteins. Proteins can be reconstructed within the experimental uncertainty in structure determination and ensembles of structures can be encoded with accuracy and robustness. Conclusions The density-based Structural Alphabet provides a novel tool to describe local conformations and it is specifically suitable for application in studies of protein dynamics. PMID:20170534
Manik, Mohammad Kawsar; Yang, Huiseon; Tong, Junsen; Im, Young Jun
2017-04-04
Yeast Osh1 belongs to the oxysterol-binding protein (OSBP) family of proteins and contains multiple targeting modules optimized for lipid transport at the nucleus-vacuole junction (NVJ). The key determinants for NVJ targeting and the role of Osh1 at NVJs have remained elusive because of unknown lipid specificities. In this study, we determined the structures of the ankyrin repeat domain (ANK), and OSBP-related domain (ORD) of Osh1, in complex with Nvj1 and ergosterol, respectively. The Osh1 ANK forms a unique bi-lobed structure that recognizes a cytosolic helical segment of Nvj1. We discovered that Osh1 ORD binds ergosterol and phosphatidylinositol 4-phosphate PI(4)P in a competitive manner, suggesting counter-transport function of the two lipids. Ergosterol is bound to the hydrophobic pocket in a head-down orientation, and the structure of the PI(4)P-binding site in Osh1 is well conserved. Our results suggest that Osh1 performs non-vesicular transport of ergosterol and PI(4)P at the NVJ. Copyright © 2017 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Verkhivker, Gennady M.; Rejto, Paul A.; Bouzida, Djamal; Arthurs, Sandra; Colson, Anthony B.; Freer, Stephan T.; Gehlhaar, Daniel K.; Larson, Veda; Luty, Brock A.; Marrone, Tami; Rose, Peter W.
2001-03-01
Thermodynamic and kinetic aspects of ligand-protein binding are studied for the methotrexate-dihydrofolate reductase system from the binding free energy profile constructed as a function of the order parameter. Thermodynamic stability of the native complex and a cooperative transition to the unique native structure suggest the nucleation kinetic mechanism at the equilibrium transition temperature. Structural properties of the transition state ensemble and the ensemble of nucleation conformations are determined by kinetic simulations of the transmission coefficient and ligand-protein association pathways. Structural analysis of the transition states and the nucleation conformations reconciles different views on the nucleation mechanism in protein folding.
Understand protein functions by comparing the similarity of local structural environments.
Chen, Jiawen; Xie, Zhong-Ru; Wu, Yinghao
2017-02-01
The three-dimensional structures of proteins play an essential role in regulating binding between proteins and their partners, offering a direct relationship between structures and functions of proteins. It is widely accepted that the function of a protein can be determined if its structure is similar to other proteins whose functions are known. However, it is also observed that proteins with similar global structures do not necessarily correspond to the same function, while proteins with very different folds can share similar functions. This indicates that function similarity is originated from the local structural information of proteins instead of their global shapes. We assume that proteins with similar local environments prefer binding to similar types of molecular targets. In order to testify this assumption, we designed a new structural indicator to define the similarity of local environment between residues in different proteins. This indicator was further used to calculate the probability that a given residue binds to a specific type of structural neighbors, including DNA, RNA, small molecules and proteins. After applying the method to a large-scale non-redundant database of proteins, we show that the positive signal of binding probability calculated from the local structural indicator is statistically meaningful. In summary, our studies suggested that the local environment of residues in a protein is a good indicator to recognize specific binding partners of the protein. The new method could be a potential addition to a suite of existing template-based approaches for protein function prediction. Copyright © 2016 Elsevier B.V. All rights reserved.
Structural Biology of Non-Ribosomal Peptide Synthetases
Miller, Bradley R.; Gulick, Andrew M.
2016-01-01
Summary The non-ribosomal peptide synthetases are modular enzymes that catalyze synthesis of important peptide products from a variety of standard and non-proteinogenic amino acid substrates. Within a single module are multiple catalytic domains that are responsible for incorporation of a single residue. After the amino acid is activated and covalently attached to an integrated carrier protein domain, the substrates and intermediates are delivered to neighboring catalytic domains for peptide bond formation or, in some modules, chemical modification. In the final module, the peptide is delivered to a terminal thioesterase domain that catalyzes release of the peptide product. This multi-domain modular architecture raises questions about the structural features that enable this assembly line synthesis in an efficient manner. The structures of the core component domains have been determined and demonstrate insights into the catalytic activity. More recently, multi-domain structures have been determined and are providing clues to the features of these enzyme systems that govern the functional interaction between multiple domains. This chapter describes the structures of NRPS proteins and the strategies that are being used to assist structural studies of these dynamic proteins, including careful consideration of domain boundaries for generation of truncated proteins and the use of mechanism-based inhibitors that trap interactions between the catalytic and carrier protein domains. PMID:26831698
Pukáncsik, Mária; Orbán, Ágnes; Nagy, Kinga; Matsuo, Koichi; Gekko, Kunihiko; Maurin, Damien; Hart, Darren; Kézsmárki, István; Vertessy, Beata G.
2016-01-01
A novel uracil-DNA degrading protein factor (termed UDE) was identified in Drosophila melanogaster with no significant structural and functional homology to other uracil-DNA binding or processing factors. Determination of the 3D structure of UDE is excepted to provide key information on the description of the molecular mechanism of action of UDE catalysis, as well as in general uracil-recognition and nuclease action. Towards this long-term aim, the random library ESPRIT technology was applied to the novel protein UDE to overcome problems in identifying soluble expressing constructs given the absence of precise information on domain content and arrangement. Nine constructs of UDE were chosen to decipher structural and functional relationships. Vacuum ultraviolet circular dichroism (VUVCD) spectroscopy was performed to define the secondary structure content and location within UDE and its truncated variants. The quantitative analysis demonstrated exclusive α-helical content for the full-length protein, which is preserved in the truncated constructs. Arrangement of α-helical bundles within the truncated protein segments suggested new domain boundaries which differ from the conserved motifs determined by sequence-based alignment of UDE homologues. Here we demonstrate that the combination of ESPRIT and VUVCD spectroscopy provides a new structural description of UDE and confirms that the truncated constructs are useful for further detailed functional studies. PMID:27273007
Huang, Wenxi; Liu, Wanting; Jin, Jingjie; Xiao, Qilan; Lu, Ruibin; Chen, Wei; Xiong, Sheng; Zhang, Gong
2018-03-25
Translational pausing coordinates protein synthesis and co-translational folding. It is a common factor that facilitates the correct folding of large, multi-domain proteins. For small proteins, pausing sites rarely occurs in the gene body, and the 3'-end pausing sites are only essential for the folding of a fraction of proteins. The determinant of the necessity of the pausings remains obscure. In this study, we demonstrated that the steady-state structural fluctuation is a predictor of the necessity of pausing-mediated co-translational folding for small proteins. Validated by experiments with 5 model proteins, we found that the rigid protein structures do not, while the flexible structures do need 3'-end pausings to fold correctly. Therefore, rational optimization of translational pausing can improve soluble expression of small proteins with flexible structures, but not the rigid ones. The rigidity of the structure can be quantitatively estimated in silico using molecular dynamic simulation. Nevertheless, we also found that the translational pausing optimization increases the fitness of the expression host, and thus benefits the recombinant protein production, independent from the soluble expression. These results shed light on the structural basis of the translational pausing and provided a practical tool for industrial protein fermentation. Copyright © 2017. Published by Elsevier Inc.
Structural determinants of arrestin functions.
Gurevich, Vsevolod V; Gurevich, Eugenia V
2013-01-01
Arrestins are a small protein family with only four members in mammals. Arrestins demonstrate an amazing versatility, interacting with hundreds of different G protein-coupled receptor (GPCR) subtypes, numerous nonreceptor signaling proteins, and components of the internalization machinery, as well as cytoskeletal elements, including regular microtubules and centrosomes. Here, we focus on the structural determinants that mediate various arrestin functions. The receptor-binding elements in arrestins were mapped fairly comprehensively, which set the stage for the construction of mutants targeting particular GPCRs. The elements engaged by other binding partners are only now being elucidated and in most cases we have more questions than answers. Interestingly, even very limited and imprecise identification of structural requirements for the interaction with very few other proteins has enabled the development of signaling-biased arrestin mutants. More comprehensive understanding of the structural underpinning of different arrestin functions will pave the way for the construction of arrestins that can link the receptor we want to the signaling pathway of our choosing. Copyright © 2013 Elsevier Inc. All rights reserved.
Structural Determinants of Arrestin Functions
Gurevich, Vsevolod V.; Gurevich, Eugenia V.
2015-01-01
Arrestins are a small protein family with only four members in mammals. Arrestins demonstrate an amazing versatility, interacting with hundreds of different G protein-coupled receptor (GPCR) subtypes, numerous nonreceptor signaling proteins, and components of the internalization machinery, as well as cytoskeletal elements, including regular microtubules and centrosomes. Here, we focus on the structural determinants that mediate various arrestin functions. The receptor-binding elements in arrestins were mapped fairly comprehensively, which set the stage for the construction of mutants targeting particular GPCRs. The elements engaged by other binding partners are only now being elucidated and in most cases we have more questions than answers. Interestingly, even very limited and imprecise identification of structural requirements for the interaction with very few other proteins has enabled the development of signaling-biased arrestin mutants. More comprehensive understanding of the structural underpinning of different arrestin functions will pave the way for the construction of arrestins that can link the receptor we want to the signaling pathway of our choosing. PMID:23764050
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yu,P.
2007-01-01
Studying the secondary structure of proteins leads to an understanding of the components that make up a whole protein, and such an understanding of the structure of the whole protein is often vital to understanding its digestive behaviour and nutritive value in animals. The main protein secondary structures are the {alpha}-helix and {beta}-sheet. The percentage of these two structures in protein secondary structures influences protein nutritive value, quality and digestive behaviour. A high percentage of {beta}-sheet structure may partly cause a low access to gastrointestinal digestive enzymes, which results in a low protein value. The objectives of the present studymore » were to use advanced synchrotron-based Fourier transform IR (S-FTIR) microspectroscopy as a new approach to reveal the molecular chemistry of the protein secondary structures of feed tissues affected by heat-processing within intact tissue at a cellular level, and to quantify protein secondary structures using multicomponent peak modelling Gaussian and Lorentzian methods, in relation to protein digestive behaviours and nutritive value in the rumen, which was determined using the Cornell Net Carbohydrate Protein System. The synchrotron-based molecular chemistry research experiment was performed at the National Synchrotron Light Source at Brookhaven National Laboratory, US Department of Energy. The results showed that, with S-FTIR microspectroscopy, the molecular chemistry, ultrastructural chemical make-up and nutritive characteristics could be revealed at a high ultraspatial resolution ({approx}10 {mu}m). S-FTIR microspectroscopy revealed that the secondary structure of protein differed between raw and roasted golden flaxseeds in terms of the percentages and ratio of {alpha}-helixes and {beta}-sheets in the mid-IR range at the cellular level. By using multicomponent peak modelling, the results show that the roasting reduced (P <0.05) the percentage of {alpha}-helixes (from 47.1% to 36.1%: S-FTIR absorption intensity), increased the percentage of {beta}-sheets (from 37.2% to 49.8%: S-FTIR absorption intensity) and reduced the {alpha}-helix to {beta}-sheet ratio (from 0.3 to 0.7) in the golden flaxseeds, which indicated a negative effect of the roasting on protein values, utilisation and bioavailability. These results were proved by the Cornell Net Carbohydrate Protein System in situ animal trial, which also revealed that roasting increased the amount of protein bound to lignin, and well as of the Maillard reaction protein (both of which are poorly used by ruminants), and increased the level of indigestible and undegradable protein in ruminants. The present results demonstrate the potential of highly spatially resolved synchrotron-based infrared microspectroscopy to locate 'pure' protein in feed tissues, and reveal protein secondary structures and digestive behaviour, making a significant step forward in and an important contribution to protein nutritional research. Further study is needed to determine the sensitivities of protein secondary structures to various heat-processing conditions, and to quantify the relationship between protein secondary structures and the nutrient availability and digestive behaviour of various protein sources. Information from the present study arising from the synchrotron-based IR probing of the protein secondary structures of protein sources at the cellular level will be valuable as a guide to maintaining protein quality and predicting digestive behaviours.« less
Network representation of protein interactions: Theory of graph description and analysis.
Kurzbach, Dennis
2016-09-01
A methodological framework is presented for the graph theoretical interpretation of NMR data of protein interactions. The proposed analysis generalizes the idea of network representations of protein structures by expanding it to protein interactions. This approach is based on regularization of residue-resolved NMR relaxation times and chemical shift data and subsequent construction of an adjacency matrix that represents the underlying protein interaction as a graph or network. The network nodes represent protein residues. Two nodes are connected if two residues are functionally correlated during the protein interaction event. The analysis of the resulting network enables the quantification of the importance of each amino acid of a protein for its interactions. Furthermore, the determination of the pattern of correlations between residues yields insights into the functional architecture of an interaction. This is of special interest for intrinsically disordered proteins, since the structural (three-dimensional) architecture of these proteins and their complexes is difficult to determine. The power of the proposed methodology is demonstrated at the example of the interaction between the intrinsically disordered protein osteopontin and its natural ligand heparin. © 2016 The Protein Society.
Structure and expression of a novel compact myelin protein – Small VCP-interacting protein (SVIP)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wu, Jiawen; Peng, Dungeng; Voehler, Markus
2013-10-11
Highlights: •SVIP (small p97/VCP-interacting protein) co-localizes with myelin basic protein (MBP) in compact myelin. •We determined that SVIP is an intrinsically disordered protein (IDP). •The helical content of SVIP increases dramatically during its interaction with negatively charged lipid membrane. •This study provides structural insight into interactions between SVIP and myelin membranes. -- Abstract: SVIP (small p97/VCP-interacting protein) was initially identified as one of many cofactors regulating the valosin containing protein (VCP), an AAA+ ATPase involved in endoplasmic-reticulum-associated protein degradation (ERAD). Our previous study showed that SVIP is expressed exclusively in the nervous system. In the present study, SVIP and VCPmore » were seen to be co-localized in neuronal cell bodies. Interestingly, we also observed that SVIP co-localizes with myelin basic protein (MBP) in compact myelin, where VCP was absent. Furthermore, using nuclear magnetic resonance (NMR) and circular dichroism (CD) spectroscopic measurements, we determined that SVIP is an intrinsically disordered protein (IDP). However, upon binding to the surface of membranes containing a net negative charge, the helical content of SVIP increases dramatically. These findings provide structural insight into interactions between SVIP and myelin membranes.« less
Generation of a consensus protein domain dictionary
Schaeffer, R. Dustin; Jonsson, Amanda L.; Simms, Andrew M.; Daggett, Valerie
2011-01-01
Motivation: The discovery of new protein folds is a relatively rare occurrence even as the rate of protein structure determination increases. This rarity reinforces the concept of folds as reusable units of structure and function shared by diverse proteins. If the folding mechanism of proteins is largely determined by their topology, then the folding pathways of members of existing folds could encompass the full set used by globular protein domains. Results: We have used recent versions of three common protein domain dictionaries (SCOP, CATH and Dali) to generate a consensus domain dictionary (CDD). Surprisingly, 40% of the metafolds in the CDD are not composed of autonomous structural domains, i.e. they are not plausible independent folding units. This finding has serious ramifications for bioinformatics studies mining these domain dictionaries for globular protein properties. However, our main purpose in deriving this CDD was to generate an updated CDD to choose targets for MD simulation as part of our dynameomics effort, which aims to simulate the native and unfolding pathways of representatives of all globular protein consensus folds (metafolds). Consequently, we also compiled a list of representative protein targets of each metafold in the CDD. Availability and implementation: This domain dictionary is available at www.dynameomics.org. Contact: daggett@u.washington.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:21068000
Refolding strategies from inclusion bodies in a structural genomics project.
Trésaugues, Lionel; Collinet, Bruno; Minard, Philippe; Henckes, Gilles; Aufrère, Robert; Blondeau, Karine; Liger, Dominique; Zhou, Cong-Zhao; Janin, Joël; Van Tilbeurgh, Herman; Quevillon-Cheruel, Sophie
2004-01-01
The South-Paris Yeast Structural Genomics Project aims at systematically expressing, purifying and determining the structure of S. cerevisiae proteins with no detectable homology to proteins of known structure. We brought 250 yeast ORFs to expression in E. coli, but 37% of them form inclusion bodies. This important fraction of proteins that are well expressed but lost for structural studies prompted us to test methodologies to recover these proteins. Three different strategies were explored in parallel on a set of 20 proteins: (1) refolding from solubilized inclusion bodies using an original and fast 96-well plates screening test, (2) co-expression of the targets in E. coli with DnaK-DnaJ-GrpE and GroEL-GroES chaperones, and (3) use of the cell-free expression system. Most of the tested proteins (17/20) could be resolubilized at least by one approach, but the subsequent purification proved to be difficult for most of them.
Structural domains and main-chain flexibility in prion proteins.
Blinov, N; Berjanskii, M; Wishart, D S; Stepanova, M
2009-02-24
In this study we describe a novel approach to define structural domains and to characterize the local flexibility in both human and chicken prion proteins. The approach we use is based on a comprehensive theory of collective dynamics in proteins that was recently developed. This method determines the essential collective coordinates, which can be found from molecular dynamics trajectories via principal component analysis. Under this particular framework, we are able to identify the domains where atoms move coherently while at the same time to determine the local main-chain flexibility for each residue. We have verified this approach by comparing our results for the predicted dynamic domain systems with the computed main-chain flexibility profiles and the NMR-derived random coil indexes for human and chicken prion proteins. The three sets of data show excellent agreement. Additionally, we demonstrate that the dynamic domains calculated in this fashion provide a highly sensitive measure of protein collective structure and dynamics. Furthermore, such an analysis is capable of revealing structural and dynamic properties of proteins that are inaccessible to the conventional assessment of secondary structure. Using the collective dynamic simulation approach described here along with a high-temperature simulations of unfolding of human prion protein, we have explored whether locations of relatively low stability could be identified where the unfolding process could potentially be facilitated. According to our analysis, the locations of relatively low stability may be associated with the beta-sheet formed by strands S1 and S2 and the adjacent loops, whereas helix HC appears to be a relatively stable part of the protein. We suggest that this kind of structural analysis may provide a useful background for a more quantitative assessment of potential routes of spontaneous misfolding in prion proteins.
Garcia, J A; Harrich, D; Soultanakis, E; Wu, F; Mitsuyasu, R; Gaynor, R B
1989-01-01
The human immunodeficiency virus (HIV) type 1 LTR is regulated at the transcriptional level by both cellular and viral proteins. Using HeLa cell extracts, multiple regions of the HIV LTR were found to serve as binding sites for cellular proteins. An untranslated region binding protein UBP-1 has been purified and fractions containing this protein bind to both the TAR and TATA regions. To investigate the role of cellular proteins binding to both the TATA and TAR regions and their potential interaction with other HIV DNA binding proteins, oligonucleotide-directed mutagenesis of both these regions was performed followed by DNase I footprinting and transient expression assays. In the TATA region, two direct repeats TC/AAGC/AT/AGCTGC surround the TATA sequence. Mutagenesis of both of these direct repeats or of the TATA sequence interrupted binding over the TATA region on the coding strand, but only a mutation of the TATA sequence affected in vivo assays for tat-activation. In addition to TAR serving as the site of binding of cellular proteins, RNA transcribed from TAR is capable of forming a stable stem-loop structure. To determine the relative importance of DNA binding proteins as compared to secondary structure, oligonucleotide-directed mutations in the TAR region were studied. Local mutations that disrupted either the stem or loop structure were defective in gene expression. However, compensatory mutations which restored base pairing in the stem resulted in complete tat-activation. This indicated a significant role for the stem-loop structure in HIV gene expression. To determine the role of TAR binding proteins, mutations were constructed which extensively changed the primary structure of the TAR region, yet left stem base pairing, stem energy and the loop sequence intact. These mutations resulted in decreased protein binding to TAR DNA and defects in tat-activation, and revealed factor binding specifically to the loop DNA sequence. Further mutagenesis which inverted this stem and loop mutation relative to the HIV LTR mRNA start site resulted in even larger decreases in tat-activation. This suggests that multiple determinants, including protein binding, the loop sequence, and RNA or DNA secondary structure, are important in tat-activation and suggests that tat may interact with cellular proteins binding to DNA to increase HIV gene expression. Images PMID:2721501
Uversky, Vladimir N
2015-03-01
Intrinsically disordered proteins (IDPs) and intrinsically disordered protein regions (IDPRs) are functional proteins or regions that do not have unique 3D structures under functional conditions. Therefore, from the viewpoint of their lack of stable 3D structure, IDPs/IDPRs are inherently unstable. As much as structure and function of normal ordered globular proteins are determined by their amino acid sequences, the lack of unique 3D structure in IDPs/IDPRs and their disorder-based functionality are also encoded in the amino acid sequences. Because of their specific sequence features and distinctive conformational behavior, these intrinsically unstable proteins or regions have several applications in biotechnology. This review introduces some of the most characteristic features of IDPs/IDPRs (such as peculiarities of amino acid sequences of these proteins and regions, their major structural features, and peculiar responses to changes in their environment) and describes how these features can be used in the biotechnology, for example for the proteome-wide analysis of the abundance of extended IDPs, for recombinant protein isolation and purification, as polypeptide nanoparticles for drug delivery, as solubilization tools, and as thermally sensitive carriers of active peptides and proteins. Copyright © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Su, Qiudong; Guo, Minzhuo; Jia, Zhiyuan; Qiu, Feng; Lu, Xuexin; Gao, Yan; Meng, Qingling; Tian, Ruiguang; Bi, Shengli; Yi, Yao
2016-07-01
Hepatitis A virus (HAV) infection can stimulate the production of antibodies to structural and non-structural proteins of the virus. However, vaccination with an inactivated or attenuated HAV vaccine produces antibodies mainly against structural proteins, whereas no or very limited antibodies are produced against the non-structural proteins. Current diagnostic assays to determine exposure to HAV, such as the Abbott HAV AB test, detect antibodies only to the structural proteins and so are not able to distinguish a natural infection from vaccination with an inactivated or attenuated virus. Here, we constructed a recombinant tandem multi-epitope diagnostic antigen (designated 'H1') based on the immune-dominant epitopes of the non-structural proteins of HAV to distinguish the two situations. H1 protein expressed in Escherichia coli and purified by affinity and anion exchange chromatography was applied in a double-antigen sandwich ELISA for the detection of anti-non-structural HAV proteins, which was confirmed to distinguish a natural infection from vaccination with an inactivated or attenuated HAV vaccine. Copyright © 2016 Elsevier B.V. All rights reserved.
xMDFF: molecular dynamics flexible fitting of low-resolution X-ray structures.
McGreevy, Ryan; Singharoy, Abhishek; Li, Qufei; Zhang, Jingfen; Xu, Dong; Perozo, Eduardo; Schulten, Klaus
2014-09-01
X-ray crystallography remains the most dominant method for solving atomic structures. However, for relatively large systems, the availability of only medium-to-low-resolution diffraction data often limits the determination of all-atom details. A new molecular dynamics flexible fitting (MDFF)-based approach, xMDFF, for determining structures from such low-resolution crystallographic data is reported. xMDFF employs a real-space refinement scheme that flexibly fits atomic models into an iteratively updating electron-density map. It addresses significant large-scale deformations of the initial model to fit the low-resolution density, as tested with synthetic low-resolution maps of D-ribose-binding protein. xMDFF has been successfully applied to re-refine six low-resolution protein structures of varying sizes that had already been submitted to the Protein Data Bank. Finally, via systematic refinement of a series of data from 3.6 to 7 Å resolution, xMDFF refinements together with electrophysiology experiments were used to validate the first all-atom structure of the voltage-sensing protein Ci-VSP.
Combining Functional and Structural Genomics to Sample the Essential Burkholderia Structome
Baugh, Loren; Gallagher, Larry A.; Patrapuvich, Rapatbhorn; Clifton, Matthew C.; Gardberg, Anna S.; Edwards, Thomas E.; Armour, Brianna; Begley, Darren W.; Dieterich, Shellie H.; Dranow, David M.; Abendroth, Jan; Fairman, James W.; Fox, David; Staker, Bart L.; Phan, Isabelle; Gillespie, Angela; Choi, Ryan; Nakazawa-Hewitt, Steve; Nguyen, Mary Trang; Napuli, Alberto; Barrett, Lynn; Buchko, Garry W.; Stacy, Robin; Myler, Peter J.; Stewart, Lance J.; Manoil, Colin; Van Voorhis, Wesley C.
2013-01-01
Background The genus Burkholderia includes pathogenic gram-negative bacteria that cause melioidosis, glanders, and pulmonary infections of patients with cancer and cystic fibrosis. Drug resistance has made development of new antimicrobials critical. Many approaches to discovering new antimicrobials, such as structure-based drug design and whole cell phenotypic screens followed by lead refinement, require high-resolution structures of proteins essential to the parasite. Methodology/Principal Findings We experimentally identified 406 putative essential genes in B. thailandensis, a low-virulence species phylogenetically similar to B. pseudomallei, the causative agent of melioidosis, using saturation-level transposon mutagenesis and next-generation sequencing (Tn-seq). We selected 315 protein products of these genes based on structure-determination criteria, such as excluding very large and/or integral membrane proteins, and entered them into the Seattle Structural Genomics Center for Infection Disease (SSGCID) structure determination pipeline. To maximize structural coverage of these targets, we applied an “ortholog rescue” strategy for those producing insoluble or difficult to crystallize proteins, resulting in the addition of 387 orthologs (or paralogs) from seven other Burkholderia species into the SSGCID pipeline. This structural genomics approach yielded structures from 31 putative essential targets from B. thailandensis, and 25 orthologs from other Burkholderia species, yielding an overall structural coverage for 49 of the 406 essential gene families, with a total of 88 depositions into the Protein Data Bank. Of these, 25 proteins have properties of a potential antimicrobial drug target i.e., no close human homolog, part of an essential metabolic pathway, and a deep binding pocket. We describe the structures of several potential drug targets in detail. Conclusions/Significance This collection of structures, solubility and experimental essentiality data provides a resource for development of drugs against infections and diseases caused by Burkholderia. All expression clones and proteins created in this study are freely available by request. PMID:23382856
Ramírez, Rosa; Falcón, Rosabel; Izquierdo, Alienys; García, Angélica; Alvarez, Mayling; Pérez, Ana Beatriz; Soto, Yudira; Muné, Mayra; da Silva, Emiliana Mandarano; Ortega, Oney; Mohana-Borges, Ronaldo; Guzmán, María G
2014-10-01
The NS3 protein is a multifunctional non-structural protein of flaviviruses implicated in the polyprotein processing. The predominance of cytotoxic T cell lymphocytes epitopes on the NS3 protein suggests a protective role of this protein in limiting virus replication. In this work, we studied the antigenicity and immunogenicity of a recombinant NS3 protein of the Dengue virus 2. The full-length NS3 gene was cloned and expressed as a His-tagged fusion protein in Escherichia coli. The pNS3 protein was purified by two chromatography steps. The recombinant NS3 protein was recognized by anti-protease NS3 polyclonal antibody and anti-DENV2 HMAF by Western Blot. This purified protein was able to stimulate the secretion of high levels of gamma interferon and low levels of interleukin-10 and tumor necrosis factor-α in mice splenocytes, suggesting a predominantly Th-1-type T cell response. Immunized BALB/c mice with the purified NS3 protein showed a strong induction of anti-NS3 IgG antibodies, essentially IgG2b, as determined by ELISA. Immunized mice sera with recombinant NS3 protein showed specific recognition of native dengue protein by Western blotting and immunofluorescence techniques. The successfully purified recombinant protein was able to preserv the structural and antigenic determinants of the native dengue protein. The antigenicity shown by the recombinant NS3 protein suggests its possible inclusion into future DENV vaccine preparations.
Huber, Roland G.; Bond, Peter J.
2017-01-01
An improved knowledge of protein-protein interactions is essential for better understanding of metabolic and signaling networks, and cellular function. Progress tends to be based on structure determination and predictions using known structures, along with computational methods based on evolutionary information or detailed atomistic descriptions. We hypothesized that for the case of interactions across a common interface, between proteins from a pair of paralogue families or within a family of paralogues, a relatively simple interface description could distinguish between binding and non-binding pairs. Using binding data for several systems, and large-scale comparative modeling based on known template complex structures, it is found that charge-charge interactions (for groups bearing net charge) are generally a better discriminant than buried non-polar surface. This is particularly the case for paralogue families that are less divergent, with more reliable comparative modeling. We suggest that electrostatic interactions are major determinants of specificity in such systems, an observation that could be used to predict binding partners. PMID:29016650
Ivanov, Stefan M; Cawley, Andrew; Huber, Roland G; Bond, Peter J; Warwicker, Jim
2017-01-01
An improved knowledge of protein-protein interactions is essential for better understanding of metabolic and signaling networks, and cellular function. Progress tends to be based on structure determination and predictions using known structures, along with computational methods based on evolutionary information or detailed atomistic descriptions. We hypothesized that for the case of interactions across a common interface, between proteins from a pair of paralogue families or within a family of paralogues, a relatively simple interface description could distinguish between binding and non-binding pairs. Using binding data for several systems, and large-scale comparative modeling based on known template complex structures, it is found that charge-charge interactions (for groups bearing net charge) are generally a better discriminant than buried non-polar surface. This is particularly the case for paralogue families that are less divergent, with more reliable comparative modeling. We suggest that electrostatic interactions are major determinants of specificity in such systems, an observation that could be used to predict binding partners.
Pilger, Jens; Mazur, Adam; Monecke, Peter; Schreuder, Herman; Elshorst, Bettina; Bartoschek, Stefan; Langer, Thomas; Schiffer, Alexander; Krimm, Isabelle; Wegstroth, Melanie; Lee, Donghan; Hessler, Gerhard; Wendt, K-Ulrich; Becker, Stefan; Griesinger, Christian
2015-05-26
Structure-based drug design (SBDD) is a powerful and widely used approach to optimize affinity of drug candidates. With the recently introduced INPHARMA method, the binding mode of small molecules to their protein target can be characterized even if no spectroscopic information about the protein is known. Here, we show that the combination of the spin-diffusion-based NMR methods INPHARMA, trNOE, and STD results in an accurate scoring function for docking modes and therefore determination of protein-ligand complex structures. Applications are shown on the model system protein kinase A and the drug targets glycogen phosphorylase and soluble epoxide hydrolase (sEH). Multiplexing of several ligands improves the reliability of the scoring function further. The new score allows in the case of sEH detecting two binding modes of the ligand in its binding site, which was corroborated by X-ray analysis. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Kumwenda, Benjamin; Litthauer, Derek; Bishop, Özlem Tastan; Reva, Oleg
2013-01-01
Elucidation of evolutionary factors that enhance protein thermostability is a critical problem and was the focus of this work on Thermus species. Pairs of orthologous sequences of T. scotoductus SA-01 and T. thermophilus HB27, with the largest negative minimum folding energy (MFE) as predicted by the UNAFold algorithm, were statistically analyzed. Favored substitutions of amino acids residues and their properties were determined. Substitutions were analyzed in modeled protein structures to determine their locations and contribution to energy differences using PyMOL and FoldX programs respectively. Dominant trends in amino acid substitutions consistent with differences in thermostability between orthologous sequences were observed. T. thermophilus thermophilic proteins showed an increase in non-polar, tiny, and charged amino acids. An abundance of alanine substituted by serine and threonine, as well as arginine substituted by glutamine and lysine was observed in T. thermophilus HB27. Structural comparison showed that stabilizing mutations occurred on surfaces and loops in protein structures. PMID:24023508
Structure of the virulence-associated protein VapD from the intracellular pathogen Rhodococcus equi
DOE Office of Scientific and Technical Information (OSTI.GOV)
Whittingham, Jean L.; Blagova, Elena V.; Finn, Ciaran E.
2014-08-01
VapD is one of a set of highly homologous virulence-associated proteins from the multi-host pathogen Rhodococcus equi. The crystal structure reveals an eight-stranded β-barrel with a novel fold and a glycine rich ‘bald’ surface. Rhodococcus equi is a multi-host pathogen that infects a range of animals as well as immune-compromised humans. Equine and porcine isolates harbour a virulence plasmid encoding a homologous family of virulence-associated proteins associated with the capacity of R. equi to divert the normal processes of endosomal maturation, enabling bacterial survival and proliferation in alveolar macrophages. To provide a basis for probing the function of the Vapmore » proteins in virulence, the crystal structure of VapD was determined. VapD is a monomer as determined by multi-angle laser light scattering. The structure reveals an elliptical, compact eight-stranded β-barrel with a novel strand topology and pseudo-twofold symmetry, suggesting evolution from an ancestral dimer. Surface-associated octyl-β-d-glucoside molecules may provide clues to function. Circular-dichroism spectroscopic analysis suggests that the β-barrel structure is preceded by a natively disordered region at the N-terminus. Sequence comparisons indicate that the core folds of the other plasmid-encoded virulence-associated proteins from R. equi strains are similar to that of VapD. It is further shown that sequences encoding putative R. equi Vap-like proteins occur in diverse bacterial species. Finally, the functional implications of the structure are discussed in the light of the unique structural features of VapD and its partial structural similarity to other β-barrel proteins.« less
Slama-Schwok, A; Zakrzewska, K; Léger, G; Leroux, Y; Takahashi, M; Käs, E; Debey, P
2000-01-01
Using spectroscopic methods, we have studied the structural changes induced in both protein and DNA upon binding of the High-Mobility Group I (HMG-I) protein to a 21-bp sequence derived from mouse satellite DNA. We show that these structural changes depend on the stoichiometry of the protein/DNA complexes formed, as determined by Job plots derived from experiments using pyrene-labeled duplexes. Circular dichroism and melting temperature experiments extended in the far ultraviolet range show that while native HMG-I is mainly random coiled in solution, it adopts a beta-turn conformation upon forming a 1:1 complex in which the protein first binds to one of two dA.dT stretches present in the duplex. HMG-I structure in the 1:1 complex is dependent on the sequence of its DNA target. A 3:1 HMG-I/DNA complex can also form and is characterized by a small increase in the DNA natural bend and/or compaction coupled to a change in the protein conformation, as determined from fluorescence resonance energy transfer (FRET) experiments. In addition, a peptide corresponding to an extended DNA-binding domain of HMG-I induces an ordered condensation of DNA duplexes. Based on the constraints derived from pyrene excimer measurements, we present a model of these nucleated structures. Our results illustrate an extreme case of protein structure induced by DNA conformation that may bear on the evolutionary conservation of the DNA-binding motifs of HMG-I. We discuss the functional relevance of the structural flexibility of HMG-I associated with the nature of its DNA targets and the implications of the binding stoichiometry for several aspects of chromatin structure and gene regulation. PMID:10777751
NASA Astrophysics Data System (ADS)
Becker, J. Susanne; Zoriy, Miroslav; Przybylski, Michael; Becker, J. Sabine
2007-03-01
The combination of atomic and molecular mass spectrometric methods was applied for characterization and identification of several human proteins from Alzheimer's diseased brain. A brain protein mixture was separated by two-dimensional (2D) gel electrophoresis and the protein spots were fast screened by microlocal analysis using LA-ICP-MS (laser ablation inductively coupled plasma mass spectrometry) in respect to phosphorus, sulfur, copper, zinc and iron content. Five selected protein spots in 2D gel containing these elements were investigated after tryptic digestion by matrix assisted laser desorption ionization Fourier transform ion cyclotron resonance mass spectrometry (MALDI-FTICR-MS). Than element concentrations (P, Cu, Zn and Fe) were determined in three identified human brain proteins by LA-ICP-MS in the 2D gel. Results of structure analysis of human brain proteins by MALDI-FTICR-MS were combined with those of the direct determination of phosphorus, copper, zinc and iron concentrations in protein spots with LA-ICP-MS. From the results of atomic and molecular mass spectrometric techniques the human brain proteins were characterized in respect to their structure, sequence, phosphorylation state and metal content as well.
Ramya, L; Gautham, N; Chaloin, Laurent; Kajava, Andrey V
2015-09-01
Significant progress has been made in the determination of the protein structures with their number today passing over a hundred thousand structures. The next challenge is the understanding and prediction of protein-protein and protein-ligand interactions. In this work we address this problem by analyzing curved solenoid proteins. Many of these proteins are considered as "hub molecules" for their high potential to interact with many different molecules and to be a scaffold for multisubunit protein machineries. Our analysis of these structures through molecular dynamics simulations reveals that the mobility of the side-chains on the concave surfaces of the solenoids is lower than on the convex ones. This result provides an explanation to the observed preferential binding of the ligands, including small and flexible ligands, to the concave surface of the curved solenoid proteins. The relationship between the landscapes and dynamic properties of the protein surfaces can be further generalized to the other types of protein structures and eventually used in the computer algorithms, allowing prediction of protein-ligand interactions by analysis of protein surfaces. © 2015 Wiley Periodicals, Inc.
Lezon, Timothy R.; Bahar, Ivet
2010-01-01
Comparison of elastic network model predictions with experimental data has provided important insights on the dominant role of the network of inter-residue contacts in defining the global dynamics of proteins. Most of these studies have focused on interpreting the mean-square fluctuations of residues, or deriving the most collective, or softest, modes of motions that are known to be insensitive to structural and energetic details. However, with increasing structural data, we are in a position to perform a more critical assessment of the structure-dynamics relations in proteins, and gain a deeper understanding of the major determinants of not only the mean-square fluctuations and lowest frequency modes, but the covariance or the cross-correlations between residue fluctuations and the shapes of higher modes. A systematic study of a large set of NMR-determined proteins is analyzed using a novel method based on entropy maximization to demonstrate that the next level of refinement in the elastic network model description of proteins ought to take into consideration properties such as contact order (or sequential separation between contacting residues) and the secondary structure types of the interacting residues, whereas the types of amino acids do not play a critical role. Most importantly, an optimal description of observed cross-correlations requires the inclusion of destabilizing, as opposed to exclusively stabilizing, interactions, stipulating the functional significance of local frustration in imparting native-like dynamics. This study provides us with a deeper understanding of the structural basis of experimentally observed behavior, and opens the way to the development of more accurate models for exploring protein dynamics. PMID:20585542
Lezon, Timothy R; Bahar, Ivet
2010-06-17
Comparison of elastic network model predictions with experimental data has provided important insights on the dominant role of the network of inter-residue contacts in defining the global dynamics of proteins. Most of these studies have focused on interpreting the mean-square fluctuations of residues, or deriving the most collective, or softest, modes of motions that are known to be insensitive to structural and energetic details. However, with increasing structural data, we are in a position to perform a more critical assessment of the structure-dynamics relations in proteins, and gain a deeper understanding of the major determinants of not only the mean-square fluctuations and lowest frequency modes, but the covariance or the cross-correlations between residue fluctuations and the shapes of higher modes. A systematic study of a large set of NMR-determined proteins is analyzed using a novel method based on entropy maximization to demonstrate that the next level of refinement in the elastic network model description of proteins ought to take into consideration properties such as contact order (or sequential separation between contacting residues) and the secondary structure types of the interacting residues, whereas the types of amino acids do not play a critical role. Most importantly, an optimal description of observed cross-correlations requires the inclusion of destabilizing, as opposed to exclusively stabilizing, interactions, stipulating the functional significance of local frustration in imparting native-like dynamics. This study provides us with a deeper understanding of the structural basis of experimentally observed behavior, and opens the way to the development of more accurate models for exploring protein dynamics.
NASA Technical Reports Server (NTRS)
Luo, Ming (Inventor); Sha, Bingdong (Inventor)
2000-01-01
The matrix protein, M1, of influenza virus strain A/PR/8/34 has been purified from virions and crystallized. The crystals consist of a stable fragment (18 Kd) of the M1 protein. X-ray diffraction studies indicated that the crystals have a space group of P3.sub.t 21 or P3.sub.2 21. Vm calculations showed that there are two monomers in an asymmetric unit. A crystallized N-terminal domain of M1, wherein the N-terminal domain of M1 is crystallized such that the three dimensional structure of the crystallized N-terminal domain of M1 can be determined to a resolution of about 2.1 .ANG. or better, and wherein the three dimensional structure of the uncrystallized N-terminal domain of M1 cannot be determined to a resolution of about 2.1 .ANG. or better. A method of purifying M1 and a method of crystallizing M1. A method of using the three-dimensional crystal structure of M1 to screen for antiviral, influenza virus treating or preventing compounds. A method of using the three-dimensional crystal structure of M1 to screen for improved binding to or inhibition of influenza virus M1. The use of the three-dimensional crystal structure of the M1 protein of influenza virus in the manufacture of an inhibitor of influenza virus M1. The use of the three-dimensional crystal structure of the M1 protein of influenza virus in the screening of candidates for inhibition of influenza virus M1.
Tian, Ye; Schwieters, Charles D; Opella, Stanley J; Marassi, Francesca M
2017-01-01
Structure determination of proteins by NMR is unique in its ability to measure restraints, very accurately, in environments and under conditions that closely mimic those encountered in vivo. For example, advances in solid-state NMR methods enable structure determination of membrane proteins in detergent-free lipid bilayers, and of large soluble proteins prepared by sedimentation, while parallel advances in solution NMR methods and optimization of detergent-free lipid nanodiscs are rapidly pushing the envelope of the size limit for both soluble and membrane proteins. These experimental advantages, however, are partially squandered during structure calculation, because the commonly used force fields are purely repulsive and neglect solvation, Van der Waals forces and electrostatic energy. Here we describe a new force field, and updated energy functions, for protein structure calculations with EEFx implicit solvation, electrostatics, and Van der Waals Lennard-Jones forces, in the widely used program Xplor-NIH. The new force field is based primarily on CHARMM22, facilitating calculations with a wider range of biomolecules. The new EEFx energy function has been rewritten to enable OpenMP parallelism, and optimized to enhance computation efficiency. It implements solvation, electrostatics, and Van der Waals energy terms together, thus ensuring more consistent and efficient computation of the complete nonbonded energy lists. Updates in the related python module allow detailed analysis of the interaction energies and associated parameters. The new force field and energy function work with both soluble proteins and membrane proteins, including those with cofactors or engineered tags, and are very effective in situations where there are sparse experimental restraints. Results obtained for NMR-restrained calculations with a set of five soluble proteins and five membrane proteins show that structures calculated with EEFx have significant improvements in accuracy, precision, and conformation, and that structure refinement can be obtained by short relaxation with EEFx to obtain improvements in these key metrics. These developments broaden the range of biomolecular structures that can be calculated with high fidelity from NMR restraints.
FTIR Spectroscopy of Protein Isolates of Salt-Tolerant Soybean Mutants
NASA Astrophysics Data System (ADS)
Akyuz, S.; Akyuz, T.; Celik, O.; Atak, C.
2018-01-01
The effect of salinity on the conformation of proteins of four salt-tolerant M2 generation mutants of soybean plants (S04-05/150-2, S04-05/150-8, S04-05/150-106, and S04-05/150-114) was investigated using Fourier transform infrared (FTIR) spectroscopy. Salinity is one of the important abiotic stress factors that limits growth and productivity of plants. The mutants belonging to the M2 generation were determined as tolerant to 90 mM NaCl. The relative contents of α-helix, β-sheet, turn, and irregular conformations for the soybean protein isolates were determined depending on the analysis of the amide I region. The comparison of the secondary structures of soybean proteins of the mutants with those of the control group indicated that the α-helix structure percentage was diminished while β-turn and disordered structures were increased as a result of the salt stress.
Direct detection of x-rays for protein crystallography employing a thick, large area CCD
Atac, Muzaffer; McKay, Timothy
1999-01-01
An apparatus and method for directly determining the crystalline structure of a protein crystal. The crystal is irradiated by a finely collimated x-ray beam. The interaction of the x-ray beam with the crystal produces scattered x-rays. These scattered x-rays are detected by means of a large area, thick CCD which is capable of measuring a significant number of scattered x-rays which impact its surface. The CCD is capable of detecting the position of impact of the scattered x-ray on the surface of the CCD and the quantity of scattered x-rays which impact the same cell or pixel. This data is then processed in real-time and the processed data is outputted to produce a image of the structure of the crystal. If this crystal is a protein the molecular structure of the protein can be determined from the data received.
Cook, Jonathan D; Soto-Montoya, Hazel; Korpela, Markus K; Lee, Jeffrey E
2015-07-24
Segment 5, ORF 1 of the infectious salmon anemia virus (ISAV) genome, encodes for the ISAV F protein, which is responsible for viral-host endosomal membrane fusion during a productive ISAV infection. The entry machinery of ISAV is composed of a complex of the ISAV F and ISAV hemagglutinin esterase (HE) proteins in an unknown stoichiometry prior to receptor engagement by ISAV HE. Following binding of the receptor to ISAV HE, dissociation of the ISAV F protein from HE, and subsequent endocytosis, the ISAV F protein resolves into a fusion-competent oligomeric state. Here, we present a 2.1 Å crystal structure of the fusion core of the ISAV F protein determined at low pH. This structure has allowed us to unambiguously demonstrate that the ISAV entry machinery exhibits typical class I viral fusion protein architecture. Furthermore, we have determined stabilizing factors that accommodate the pH-dependent mode of ISAV transmission, and our structure has allowed the identification of a central coil that is conserved across numerous and varied post-fusion viral glycoprotein structures. We then discuss a mechanistic model of ISAV fusion that parallels the paramyxoviral class I fusion strategy wherein attachment and fusion are relegated to separate proteins in a similar fashion to ISAV fusion. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Bonomi, Massimiliano; Pellarin, Riccardo; Kim, Seung Joong; Russel, Daniel; Sundin, Bryan A.; Riffle, Michael; Jaschob, Daniel; Ramsden, Richard; Davis, Trisha N.; Muller, Eric G. D.; Sali, Andrej
2014-01-01
The use of in vivo Förster resonance energy transfer (FRET) data to determine the molecular architecture of a protein complex in living cells is challenging due to data sparseness, sample heterogeneity, signal contributions from multiple donors and acceptors, unequal fluorophore brightness, photobleaching, flexibility of the linker connecting the fluorophore to the tagged protein, and spectral cross-talk. We addressed these challenges by using a Bayesian approach that produces the posterior probability of a model, given the input data. The posterior probability is defined as a function of the dependence of our FRET metric FRETR on a structure (forward model), a model of noise in the data, as well as prior information about the structure, relative populations of distinct states in the sample, forward model parameters, and data noise. The forward model was validated against kinetic Monte Carlo simulations and in vivo experimental data collected on nine systems of known structure. In addition, our Bayesian approach was validated by a benchmark of 16 protein complexes of known structure. Given the structures of each subunit of the complexes, models were computed from synthetic FRETR data with a distance root-mean-squared deviation error of 14 to 17 Å. The approach is implemented in the open-source Integrative Modeling Platform, allowing us to determine macromolecular structures through a combination of in vivo FRETR data and data from other sources, such as electron microscopy and chemical cross-linking. PMID:25139910
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bajaj, R. Alexandra; Arbing, Mark A.; Shin, Annie
The structure of Msmeg_6760, a protein of unknown function, has been determined. Biochemical and bioinformatics analyses determined that Msmeg_6760 interacts with a protein encoded in the same operon, Msmeg_6762, and predicted that the operon is a toxin–antitoxin (TA) system. Structural comparison of Msmeg_6760 with proteins of known function suggests that Msmeg_6760 binds a hydrophobic ligand in a buried cavity lined by large hydrophobic residues. Access to this cavity could be controlled by a gate–latch mechanism. The function of the Msmeg_6760 toxin is unknown, but structure-based predictions revealed that Msmeg_6760 and Msmeg_6762 are homologous to Rv2034 and Rv2035, a predicted novelmore » TA system involved inMycobacterium tuberculosislatency during macrophage infection. The Msmeg_6760 toxin fold has not been previously described for bacterial toxins and its unique structural features suggest that toxin activation is likely to be mediated by a novel mechanism.« less
Structure solution of DNA-binding proteins and complexes with ARCIMBOLDO libraries
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pröpper, Kevin; Instituto de Biologia Molecular de Barcelona; Meindl, Kathrin
2014-06-01
The structure solution of DNA-binding protein structures and complexes based on the combination of location of DNA-binding protein motif fragments with density modification in a multi-solution frame is described. Protein–DNA interactions play a major role in all aspects of genetic activity within an organism, such as transcription, packaging, rearrangement, replication and repair. The molecular detail of protein–DNA interactions can be best visualized through crystallography, and structures emphasizing insight into the principles of binding and base-sequence recognition are essential to understanding the subtleties of the underlying mechanisms. An increasing number of high-quality DNA-binding protein structure determinations have been witnessed despite themore » fact that the crystallographic particularities of nucleic acids tend to pose specific challenges to methods primarily developed for proteins. Crystallographic structure solution of protein–DNA complexes therefore remains a challenging area that is in need of optimized experimental and computational methods. The potential of the structure-solution program ARCIMBOLDO for the solution of protein–DNA complexes has therefore been assessed. The method is based on the combination of locating small, very accurate fragments using the program Phaser and density modification with the program SHELXE. Whereas for typical proteins main-chain α-helices provide the ideal, almost ubiquitous, small fragments to start searches, in the case of DNA complexes the binding motifs and DNA double helix constitute suitable search fragments. The aim of this work is to provide an effective library of search fragments as well as to determine the optimal ARCIMBOLDO strategy for the solution of this class of structures.« less
Structure elucidation of dimeric transmembrane domains of bitopic proteins.
Bocharov, Eduard V; Volynsky, Pavel E; Pavlov, Konstantin V; Efremov, Roman G; Arseniev, Alexander S
2010-01-01
The interaction between transmembrane helices is of great interest because it directly determines biological activity of a membrane protein. Either destroying or enhancing such interactions can result in many diseases related to dysfunction of different tissues in human body. One much studied form of membrane proteins known as bitopic protein is a dimer containing two membrane-spanning helices associating laterally. Establishing structure-function relationship as well as rational design of new types of drugs targeting membrane proteins requires precise structural information about this class of objects. At present time, to investigate spatial structure and internal dynamics of such transmembrane helical dimers, several strategies were developed based mainly on a combination of NMR spectroscopy, optical spectroscopy, protein engineering and molecular modeling. These approaches were successfully applied to homo- and heterodimeric transmembrane fragments of several bitopic proteins, which play important roles in normal and in pathological conditions of human organism.
Structural genomics: keeping up with expanding knowledge of the protein universe
Grabowski, Marek; Joachimiak, Andrzej; Otwinowski, Zbyszek; Minor, Wladek
2010-01-01
Structural characterization of the protein universe is the main mission of Structural Genomics (SG) programs. However, progress in gene sequencing technology, set in motion in the 1990s, has resulted in rapid expansion of protein sequence space — a twelvefold increase in the past seven years. For the SG field, this creates new challenges and necessitates a reassessment of its strategies. Nevertheless, despite the growth of sequence space, at present nearly half of the content of the Swiss-Prot database and over 40% of Pfam protein families can be structurally modeled based on structures determined so far, with SG projects making an increasingly significant contribution. The SG contribution of new Pfam structures nearly doubled from 27.2% in 2003 to 51.6% in 2006. PMID:17587562
F2Dock: Fast Fourier Protein-Protein Docking
Bajaj, Chandrajit; Chowdhury, Rezaul; Siddavanahalli, Vinay
2009-01-01
The functions of proteins is often realized through their mutual interactions. Determining a relative transformation for a pair of proteins and their conformations which form a stable complex, reproducible in nature, is known as docking. It is an important step in drug design, structure determination and understanding function and structure relationships. In this paper we extend our non-uniform fast Fourier transform docking algorithm to include an adaptive search phase (both translational and rotational) and thereby speed up its execution. We have also implemented a multithreaded version of the adaptive docking algorithm for even faster execution on multicore machines. We call this protein-protein docking code F2Dock (F2 = Fast Fourier). We have calibrated F2Dock based on an extensive experimental study on a list of benchmark complexes and conclude that F2Dock works very well in practice. Though all docking results reported in this paper use shape complementarity and Coulombic potential based scores only, F2Dock is structured to incorporate Lennard-Jones potential and re-ranking docking solutions based on desolvation energy. PMID:21071796
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vetting, Matthew W., E-mail: vetting@aecom.yu.edu; Hegde, Subray S.; Blanchard, John S.
2009-05-01
A method to modify proteins with glutaraldehyde under reducing conditions is presented. Treatment with glutaraldehyde and dimethylaminoborane was found to result in cyclic pentylation of free amines and facilitated the structural determination of a protein previously recalcitrant to the formation of diffraction quality crystals. The pentapeptide-repeat protein EfsQnr from Enterococcus faecalis protects DNA gyrase from inhibition by fluoroquinolones. EfsQnr was cloned and purified to homogeneity, but failed to produce diffraction-quality crystals in initial crystallization screens. Treatment of EfsQnr with glutaraldehyde and the strong reducing agent borane–dimethylamine resulted in a derivatized protein which produced crystals that diffracted to 1.6 Å resolution;more » their structure was subsequently determined by single-wavelength anomalous dispersion. Analysis of the derivatized protein using Fourier transform ion cyclotron resonance mass spectrometry indicated a mass increase of 68 Da per free amino group. Electron-density maps about a limited number of structurally ordered lysines indicated that the modification was a cyclic pentylation of free amines, producing piperidine groups.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cardarelli, Lia; Lam, Robert; Tuite, Ashleigh
2010-08-17
The final step in the morphogenesis of long-tailed double-stranded DNA bacteriophages is the joining of the DNA-filled head to the tail. The connector is a specialized structure of the head that serves as the interface for tail attachment and the point of egress for DNA from the head during infection. Here, we report the determination of a 2.1 {angstrom} crystal structure of gp6 of bacteriophage HK97. Through structural comparisons, functional studies, and bioinformatic analysis, gp6 has been determined to be a component of the connector of phage HK97 that is evolutionarily related to gp15, a well-characterized connector component of bacteriophagemore » SPP1. Whereas the structure of gp15 was solved in a monomeric form, gp6 crystallized as an oligomeric ring with the dimensions expected for a connector protein. Although this ring is composed of 13 subunits, which does not match the symmetry of the connector within the phage, sequence conservation and modeling of this structure into the cryo-electron microscopy density of the SPP1 connector indicate that this oligomeric structure represents the arrangement of gp6 subunits within the mature phage particle. Through sequence searches and genomic position analysis, we determined that gp6 is a member of a large family of connector proteins that are present in long-tailed phages. We have also identified gp7 of HK97 as a homologue of gp16 of phage SPP1, which is the second component of the connector of this phage. These proteins are members of another large protein family involved in connector assembly.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cardarelli, Lia; Lam, Robert; Tuite, Ashleigh
2011-11-23
The final step in the morphogenesis of long-tailed double-stranded DNA bacteriophages is the joining of the DNA-filled head to the tail. The connector is a specialized structure of the head that serves as the interface for tail attachment and the point of egress for DNA from the head during infection. Here, we report the determination of a 2.1 Å crystal structure of gp6 of bacteriophage HK97. Through structural comparisons, functional studies, and bioinformatic analysis, gp6 has been determined to be a component of the connector of phage HK97 that is evolutionarily related to gp15, a well-characterized connector component of bacteriophagemore » SPP1. Whereas the structure of gp15 was solved in a monomeric form, gp6 crystallized as an oligomeric ring with the dimensions expected for a connector protein. Although this ring is composed of 13 subunits, which does not match the symmetry of the connector within the phage, sequence conservation and modeling of this structure into the cryo-electron microscopy density of the SPP1 connector indicate that this oligomeric structure represents the arrangement of gp6 subunits within the mature phage particle. Through sequence searches and genomic position analysis, we determined that gp6 is a member of a large family of connector proteins that are present in long-tailed phages. We have also identified gp7 of HK97 as a homologue of gp16 of phage SPP1, which is the second component of the connector of this phage. These proteins are members of another large protein family involved in connector assembly.« less
Structural Polypeptides of the Granulosis Virus of Plodia interpunctella†
Tweeten, Kathleen A.; Bulla, Lee A.; Consigli, Richard A.
1980-01-01
Techniques were developed for the isolation and purification of three structural components of Plodia interpunctella granulosis virus: granulin, enveloped nucleocapsids, and nucleocapsids. The polypeptide composition and distribution of protein in each viral component were determined by sodium dodecyl sulfate discontinuous and gradient polyacrylamide slab gel electrophoresis. Enveloped nucleocapsids consisted of 15 structural proteins ranging in molecular weight from 12,600 to 97,300. Five of these proteins, having approximate molecular weights of 17,800, 39,700, 42,400, 48,200, and 97,300, were identified as envelope proteins by surface radioiodination of the enveloped nucleocapsids. Present in purified nucleocapsids were eight polypeptides. The predominant proteins in this structural component had molecular weights of 12,500 and 31,000. Whereas no evidence of polypeptide glycosylation was obtained, six of the viral proteins were observed to be phosphorylated. Images PMID:16789191
Antigenic Determinants of Alpha-Like Proteins of Streptococcus agalactiae
Maeland, Johan A.; Bevanger, Lars; Lyng, Randi Valsoe
2004-01-01
The majority of group B streptococcus (GBS) isolates express one or more of a family of surface-anchored proteins that vary by strain and that form ladder-like patterns on Western blotting due to large repeat units. These proteins, which are important as GBS serotype markers and as inducers of protective antibodies, include the alpha C (Cα) and R4 proteins and the recently described alpha-like protein 2 (Alp2), encoded by alp2, and Alp3, encoded by alp3. In this study, we examined antigenic determinants possessed by Alp2 and Alp3 by testing of antibodies raised in rabbits, mainly by using enzyme-linked immunosorbent assays (ELISA) and an ELISA absorption test. The results showed that Alp2 and Alp3 shared an antigenic determinant, which may be a unique immunological marker of the Alp variants of GBS proteins. Alp2, in addition, possessed an antigenic determinant which showed specificity for Alp2 and a third determinant which showed serological cross-reactivity with Cα. Alp3, in addition to the determinant common to Alp2 and Alp3, harbored an antigenic site which also was present in the R4 protein, whereas no Alp3-specific antigenic site was detected. These ELISA-based results were confirmed by Western blotting and a fluorescent-antibody test. The results are consistent with highly complex antigenic structures of the alpha-like proteins in a fashion which is in agreement with the recently described structural mosaicism of the alp2 and alp3 genes. The results are expected to influence GBS serotyping, immunoprotection studies, and GBS vaccine developments. PMID:15539502
Assfalg, Michael; Gianolio, Eliana; Zanzoni, Serena; Tomaselli, Simona; Russo, Vito Lo; Cabella, Claudia; Ragona, Laura; Aime, Silvio; Molinari, Henriette
2007-11-01
The binding affinities of a selected series of Gd(III) chelates bearing bile acid residues, potential hepatospecific MRI contrast agents, to a liver cytosolic bile acid transporter, have been determined through relaxivity measurements. The Ln(III) complexes of compound 1 were selected for further NMR structural analysis aimed at assessing the molecular determinants of binding. A number of NMR experiments have been carried out on the bile acid-like adduct, using both diamagnetic Y(III) and paramagnetic Gd(III) complexes, bound to a liver bile acid binding protein. The identified protein "hot spots" defined a single binding site located at the protein portal region. The presented findings will serve in a medicinal chemistry approach for the design of hepatocytes-selective gadolinium chelates for liver malignancies detection.
Crystal structure of the YDR533c S. cerevisiae protein, a class II member of the Hsp31 family.
Graille, Marc; Quevillon-Cheruel, Sophie; Leulliot, Nicolas; Zhou, Cong-Zhao; Li de la Sierra Gallay, Ines; Jacquamet, Lilian; Ferrer, Jean-Luc; Liger, Dominique; Poupon, Anne; Janin, Joel; van Tilbeurgh, Herman
2004-05-01
The ORF YDR533c from Saccharomyces cerevisiae codes for a 25.5 kDa protein of unknown biochemical function. Transcriptome analysis of yeast has shown that this gene is activated in response to various stress conditions together with proteins belonging to the heat shock family. In order to clarify its biochemical function, we determined the crystal structure of YDR533c to 1.85 A resolution by the single anomalous diffraction method. The protein possesses an alpha/beta hydrolase fold and a putative Cys-His-Glu catalytic triad common to a large enzyme family containing proteases, amidotransferases, lipases, and esterases. The protein has strong structural resemblance with the E. coli Hsp31 protein and the intracellular protease I from Pyrococcus horikoshii, which are considered class I and class III members of the Hsp31 family, respectively. Detailed structural analysis strongly suggests that the YDR533c protein crystal structure is the first one of a class II member of the Hsp31 family.
XLinkDB 2.0: integrated, large-scale structural analysis of protein crosslinking data
Schweppe, Devin K.; Zheng, Chunxiang; Chavez, Juan D.; Navare, Arti T.; Wu, Xia; Eng, Jimmy K.; Bruce, James E.
2016-01-01
Motivation: Large-scale chemical cross-linking with mass spectrometry (XL-MS) analyses are quickly becoming a powerful means for high-throughput determination of protein structural information and protein–protein interactions. Recent studies have garnered thousands of cross-linked interactions, yet the field lacks an effective tool to compile experimental data or access the network and structural knowledge for these large scale analyses. We present XLinkDB 2.0 which integrates tools for network analysis, Protein Databank queries, modeling of predicted protein structures and modeling of docked protein structures. The novel, integrated approach of XLinkDB 2.0 enables the holistic analysis of XL-MS protein interaction data without limitation to the cross-linker or analytical system used for the analysis. Availability and Implementation: XLinkDB 2.0 can be found here, including documentation and help: http://xlinkdb.gs.washington.edu/. Contact: jimbruce@uw.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153666
Structure of a Trypanosoma Brucei Alpha/Beta--Hydrolase Fold Protein With Unknown Function
DOE Office of Scientific and Technical Information (OSTI.GOV)
Merritt, E.A.; Holmes, M.; Buckner, F.S.
2009-05-26
The structure of a structural genomics target protein, Tbru020260AAA from Trypanosoma brucei, has been determined to a resolution of 2.2 {angstrom} using multiple-wavelength anomalous diffraction at the Se K edge. This protein belongs to Pfam sequence family PF08538 and is only distantly related to previously studied members of the {alpha}/{beta}-hydrolase fold family. Structural superposition onto representative {alpha}/{beta}-hydrolase fold proteins of known function indicates that a possible catalytic nucleophile, Ser116 in the T. brucei protein, lies at the expected location. However, the present structure and by extension the other trypanosomatid members of this sequence family have neither sequence nor structural similaritymore » at the location of other active-site residues typical for proteins with this fold. Together with the presence of an additional domain between strands {beta}6 and {beta}7 that is conserved in trypanosomatid genomes, this suggests that the function of these homologs has diverged from other members of the fold family.« less
RaptorX server: a resource for template-based protein structure modeling.
Källberg, Morten; Margaryan, Gohar; Wang, Sheng; Ma, Jianzhu; Xu, Jinbo
2014-01-01
Assigning functional properties to a newly discovered protein is a key challenge in modern biology. To this end, computational modeling of the three-dimensional atomic arrangement of the amino acid chain is often crucial in determining the role of the protein in biological processes. We present a community-wide web-based protocol, RaptorX server ( http://raptorx.uchicago.edu ), for automated protein secondary structure prediction, template-based tertiary structure modeling, and probabilistic alignment sampling.Given a target sequence, RaptorX server is able to detect even remotely related template sequences by means of a novel nonlinear context-specific alignment potential and probabilistic consistency algorithm. Using the protocol presented here it is thus possible to obtain high-quality structural models for many target protein sequences when only distantly related protein domains have experimentally solved structures. At present, RaptorX server can perform secondary and tertiary structure prediction of a 200 amino acid target sequence in approximately 30 min.
Keates, Tracy; Cooper, Christopher D O; Savitsky, Pavel; Allerston, Charles K; Phillips, Claire; Hammarström, Martin; Daga, Neha; Berridge, Georgina; Mahajan, Pravin; Burgess-Brown, Nicola A; Müller, Susanne; Gräslund, Susanne; Gileadi, Opher
2012-06-15
The generation of affinity reagents to large numbers of human proteins depends on the ability to express the target proteins as high-quality antigens. The Structural Genomics Consortium (SGC) focuses on the production and structure determination of human proteins. In a 7-year period, the SGC has deposited crystal structures of >800 human protein domains, and has additionally expressed and purified a similar number of protein domains that have not yet been crystallised. The targets include a diversity of protein domains, with an attempt to provide high coverage of protein families. The family approach provides an excellent basis for characterising the selectivity of affinity reagents. We present a summary of the approaches used to generate purified human proteins or protein domains, a test case demonstrating the ability to rapidly generate new proteins, and an optimisation study on the modification of >70 proteins by biotinylation in vivo. These results provide a unique synergy between large-scale structural projects and the recent efforts to produce a wide coverage of affinity reagents to the human proteome. Copyright © 2011 Elsevier B.V. All rights reserved.
Keates, Tracy; Cooper, Christopher D.O.; Savitsky, Pavel; Allerston, Charles K.; Phillips, Claire; Hammarström, Martin; Daga, Neha; Berridge, Georgina; Mahajan, Pravin; Burgess-Brown, Nicola A.; Müller, Susanne; Gräslund, Susanne; Gileadi, Opher
2012-01-01
The generation of affinity reagents to large numbers of human proteins depends on the ability to express the target proteins as high-quality antigens. The Structural Genomics Consortium (SGC) focuses on the production and structure determination of human proteins. In a 7-year period, the SGC has deposited crystal structures of >800 human protein domains, and has additionally expressed and purified a similar number of protein domains that have not yet been crystallised. The targets include a diversity of protein domains, with an attempt to provide high coverage of protein families. The family approach provides an excellent basis for characterising the selectivity of affinity reagents. We present a summary of the approaches used to generate purified human proteins or protein domains, a test case demonstrating the ability to rapidly generate new proteins, and an optimisation study on the modification of >70 proteins by biotinylation in vivo. These results provide a unique synergy between large-scale structural projects and the recent efforts to produce a wide coverage of affinity reagents to the human proteome. PMID:22027370
Rate Kinetics and Molecular Dynamics of the Structural Transitions in Amyloidogenic Proteins
NASA Astrophysics Data System (ADS)
Steckmann, Timothy M.
Amyloid fibril aggregation is associated with several horrific diseases such as Alzheimer's, Creutzfeld-Jacob, diabetes, Parkinson's and others. The process of amyloid aggregation involves forming myriad different metastable intermediate aggregates. Amyloid fibrils are composed of proteins that originate in an innocuous alpha-helix or random-coil structure. The alpha-helices convert their structure to beta-strands that aggregate into beta-sheets, and then into protofibrils, and ultimately into fully formed amyloid fibrils. On the basis of experimental data, I have developed a mathematical model for the kinetics of the reaction pathways and determined rate parameters for peptide secondary structural conversion and aggregation during the entire fibrillogenesis process from random coil to fibrils, including the molecular species that accelerate the conversions. The specific steps of the model and the rate constants that are determined by fitting to experimental data provide insight on the molecular species involved in the fibril formation process. To better understand the molecular basis of the protein structural transitions and aggregation, I report on molecular dynamics (MD) computational studies on the formation of amyloid protofibrillar structures in the small model protein ccbeta, which undergoes many of the structural transitions of the larger, naturally occurring amyloid forming proteins. Two different structural transition processes involving hydrogen bonds are observed for aggregation into fibrils: the breaking of intrachain hydrogen bonds to allow beta-hairpin proteins to straighten, and the subsequent formation of interchain hydrogen bonds during aggregation into amyloid fibrils. For my MD simulations, I found that the temperature dependence of these two different structural transition processes results in the existence of a temperature window that the ccbeta protein experiences during the process of forming protofibrillar structures. Both the mathematical modeling of the kinetics and the MD simulations show that molecular structural heterogeneity is a major factor in the process. The MD simulations also show that intrachain and interchain hydrogen bonds breaking and forming is strongly correlated to the process of amyloid formation.
Wlodawer, Alexander; Minor, Wladek; Dauter, Zbigniew; Jaskolski, Mariusz
2015-01-01
The number of macromolecular structures deposited in the Protein Data Bank now exceeds 45 000, with the vast majority determined using crystallographic methods. Thousands of studies describing such structures have been published in the scientific literature, and 14 Nobel prizes in chemistry or medicine have been awarded to protein crystallographers. As important as these structures are for understanding the processes that take place in living organisms and also for practical applications such as drug design, many non-crystallographers still have problems with critical evaluation of the structural literature data. This review attempts to provide a brief outline of technical aspects of crystallography and to explain the meaning of some parameters that should be evaluated by users of macromolecular structures in order to interpret, but not over-interpret, the information present in the coordinate files and in their description. A discussion of the extent of the information that can be gleaned from the coordinates of structures solved at different resolution, as well as problems and pitfalls encountered in structure determination and interpretation are also covered. PMID:18034855
Zook, James D.; Molugu, Trivikram R.; Jacobsen, Neil E.; Lin, Guangxin; Soll, Jürgen; Cherry, Brian R.; Brown, Michael F.; Fromme, Petra
2013-01-01
Solving high-resolution structures for membrane proteins continues to be a daunting challenge in the structural biology community. In this study we report our high-resolution NMR results for a transmembrane protein, outer envelope protein of molar mass 16 kDa (OEP16), an amino acid transporter from the outer membrane of chloroplasts. Three-dimensional, high-resolution NMR experiments on the 13C, 15N, 2H-triply-labeled protein were used to assign protein backbone resonances and to obtain secondary structure information. The results yield over 95% assignment of N, HN, CO, Cα, and Cβ chemical shifts, which is essential for obtaining a high resolution structure from NMR data. Chemical shift analysis from the assignment data reveals experimental evidence for the first time on the location of the secondary structure elements on a per residue basis. In addition T 1Z and T2 relaxation experiments were performed in order to better understand the protein dynamics. Arginine titration experiments yield an insight into the amino acid residues responsible for protein transporter function. The results provide the necessary basis for high-resolution structural determination of this important plant membrane protein. PMID:24205117
Mandal, Kalyaneswar; Uppalapati, Maruti; Ault-Riché, Dana; Kenney, John; Lowitz, Joshua; Sidhu, Sachdev S.; Kent, Stephen B.H.
2012-01-01
Total chemical synthesis was used to prepare the mirror image (D-protein) form of the angiogenic protein vascular endothelial growth factor (VEGF-A). Phage display against D-VEGF-A was used to screen designed libraries based on a unique small protein scaffold in order to identify a high affinity ligand. Chemically synthesized D- and L- forms of the protein ligand showed reciprocal chiral specificity in surface plasmon resonance binding experiments: The L-protein ligand bound only to D-VEGF-A, whereas the D-protein ligand bound only to L-VEGF-A. The D-protein ligand, but not the L-protein ligand, inhibited the binding of natural VEGF165 to the VEGFR1 receptor. Racemic protein crystallography was used to determine the high resolution X-ray structure of the heterochiral complex consisting of {D-protein antagonist + L-protein form ofVEGF-A}. Crystallization of a racemic mixture of these synthetic proteins in appropriate stoichiometry gave a racemic protein complex of more than 73 kDa containing six synthetic protein molecules. The structure of the complex was determined to a resolution of 1.6 Å. Detailed analysis of the interaction between the D-protein antagonist and the VEGF-A protein molecule showed that the binding interface comprised a contact surface area of approximately 800 Å2 in accord with our design objectives, and that the D-protein antagonist binds to the same region of VEGF-A that interacts with VEGFR1-domain 2. PMID:22927390
Ayuso-Tejedor, Sara; Angarica, Vladimir Espinosa; Bueno, Marta; Campos, Luis A; Abián, Olga; Bernadó, Pau; Sancho, Javier; Jiménez, M Angeles
2010-07-23
Partly unfolded protein conformations close to the native state may play important roles in protein function and in protein misfolding. Structural analyses of such conformations which are essential for their fully physicochemical understanding are complicated by their characteristic low populations at equilibrium. We stabilize here with a single mutation the equilibrium intermediate of apoflavodoxin thermal unfolding and determine its solution structure by NMR. It consists of a large native region identical with that observed in the X-ray structure of the wild-type protein plus an unfolded region. Small-angle X-ray scattering analysis indicates that the calculated ensemble of structures is consistent with the actual degree of expansion of the intermediate. The unfolded region encompasses discontinuous sequence segments that cluster in the 3D structure of the native protein forming the FMN cofactor binding loops and the binding site of a variety of partner proteins. Analysis of the apoflavodoxin inner interfaces reveals that those becoming destabilized in the intermediate are more polar than other inner interfaces of the protein. Natively folded proteins contain hydrophobic cores formed by the packing of hydrophobic surfaces, while natively unfolded proteins are rich in polar residues. The structure of the apoflavodoxin thermal intermediate suggests that the regions of natively folded proteins that are easily responsive to thermal activation may contain cores of intermediate hydrophobicity. Copyright (c) 2010 Elsevier Ltd. All rights reserved.
PDB_TM: selection and membrane localization of transmembrane proteins in the protein data bank.
Tusnády, Gábor E; Dosztányi, Zsuzsanna; Simon, István
2005-01-01
PDB_TM is a database for transmembrane proteins with known structures. It aims to collect all transmembrane proteins that are deposited in the protein structure database (PDB) and to determine their membrane-spanning regions. These assignments are based on the TMDET algorithm, which uses only structural information to locate the most likely position of the lipid bilayer and to distinguish between transmembrane and globular proteins. This algorithm was applied to all PDB entries and the results were collected in the PDB_TM database. By using TMDET algorithm, the PDB_TM database can be automatically updated every week, keeping it synchronized with the latest PDB updates. The PDB_TM database is available at http://www.enzim.hu/PDB_TM.
Childers, Christine L; Green, Stuart R; Dawson, Neal J; Storey, Kenneth B
2016-09-01
The effect of protein stability on kinetic function is monitored with many techniques that often require large amounts of expensive substrates and specialized equipment not universally available. We present differential scanning fluorimetry (DSF), a simple high-throughput assay performed in real-time thermocyclers, as a technique for analysis of protein unfolding. Furthermore, we demonstrate a correlation between the half-maximal rate of protein unfolding (Knd), and protein unfolding by urea (I50). This demonstrates that DSF methods can determine the structural stability of an enzyme's active site and can compare the relative structural stability of homologous enzymes with a high degree of sequence similarity. Copyright © 2016 Elsevier Inc. All rights reserved.
[Biochemical characteristics and antigenic structures of Chlamydia].
Puy, H; Fuentes, V; Eb, F; Orfila, J
1989-01-01
New biotechnology in immunology and molecular biology has enabled the identification and definition of the structure of glycolipids and especially membrane proteins of Chlamydia. Chlamydia antigen lipopolysaccharide, major outer membrane protein, protein 74 kDa, eukaryotic cell binding protein and cysteine rich proteins are all carriers of antigenic determinants, genus, species or type specific. They are very usefull for diagnosis of Chlamydial infections and epidemiological studies. These membranous antigens have an important role in the pathogenesis of these bacteries. Finally these studies have contributed to the isolation of a new species: C. pneumoniae (TWAR strains).
Predictive and comparative analysis of Ebolavirus proteins
Cong, Qian; Pei, Jimin; Grishin, Nick V
2015-01-01
Ebolavirus is the pathogen for Ebola Hemorrhagic Fever (EHF). This disease exhibits a high fatality rate and has recently reached a historically epidemic proportion in West Africa. Out of the 5 known Ebolavirus species, only Reston ebolavirus has lost human pathogenicity, while retaining the ability to cause EHF in long-tailed macaque. Significant efforts have been spent to determine the three-dimensional (3D) structures of Ebolavirus proteins, to study their interaction with host proteins, and to identify the functional motifs in these viral proteins. Here, in light of these experimental results, we apply computational analysis to predict the 3D structures and functional sites for Ebolavirus protein domains with unknown structure, including a zinc-finger domain of VP30, the RNA-dependent RNA polymerase catalytic domain and a methyltransferase domain of protein L. In addition, we compare sequences of proteins that interact with Ebolavirus proteins from RESTV-resistant primates with those from RESTV-susceptible monkeys. The host proteins that interact with GP and VP35 show an elevated level of sequence divergence between the RESTV-resistant and RESTV-susceptible species, suggesting that they may be responsible for host specificity. Meanwhile, we detect variable positions in protein sequences that are likely associated with the loss of human pathogenicity in RESTV, map them onto the 3D structures and compare their positions to known functional sites. VP35 and VP30 are significantly enriched in these potential pathogenicity determinants and the clustering of such positions on the surfaces of VP35 and GP suggests possible uncharacterized interaction sites with host proteins that contribute to the virulence of Ebolavirus. PMID:26158395
Razban, Rostam M; Gilson, Amy I; Durfee, Niamh; Strobelt, Hendrik; Dinkla, Kasper; Choi, Jeong-Mo; Pfister, Hanspeter; Shakhnovich, Eugene I
2018-05-08
Protein evolution spans time scales and its effects span the length of an organism. A web app named ProteomeVis is developed to provide a comprehensive view of protein evolution in the S. cerevisiae and E. coli proteomes. ProteomeVis interactively creates protein chain graphs, where edges between nodes represent structure and sequence similarities within user-defined ranges, to study the long time scale effects of protein structure evolution. The short time scale effects of protein sequence evolution are studied by sequence evolutionary rate (ER) correlation analyses with protein properties that span from the molecular to the organismal level. We demonstrate the utility and versatility of ProteomeVis by investigating the distribution of edges per node in organismal protein chain universe graphs (oPCUGs) and putative ER determinants. S. cerevisiae and E. coli oPCUGs are scale-free with scaling constants of 1.79 and 1.56, respectively. Both scaling constants can be explained by a previously reported theoretical model describing protein structure evolution (Dokholyan et al., 2002). Protein abundance most strongly correlates with ER among properties in ProteomeVis, with Spearman correlations of -0.49 (p-value<10-10) and -0.46 (p-value<10-10) for S. cerevisiae and E. coli, respectively. This result is consistent with previous reports that found protein expression to be the most important ER determinant (Zhang and Yang, 2015). ProteomeVis is freely accessible at http://proteomevis.chem.harvard.edu. Supplementary data are available at Bioinformatics. shakhnovich@chemistry.harvard.edu.
Predictive and comparative analysis of Ebolavirus proteins.
Cong, Qian; Pei, Jimin; Grishin, Nick V
2015-01-01
Ebolavirus is the pathogen for Ebola Hemorrhagic Fever (EHF). This disease exhibits a high fatality rate and has recently reached a historically epidemic proportion in West Africa. Out of the 5 known Ebolavirus species, only Reston ebolavirus has lost human pathogenicity, while retaining the ability to cause EHF in long-tailed macaque. Significant efforts have been spent to determine the three-dimensional (3D) structures of Ebolavirus proteins, to study their interaction with host proteins, and to identify the functional motifs in these viral proteins. Here, in light of these experimental results, we apply computational analysis to predict the 3D structures and functional sites for Ebolavirus protein domains with unknown structure, including a zinc-finger domain of VP30, the RNA-dependent RNA polymerase catalytic domain and a methyltransferase domain of protein L. In addition, we compare sequences of proteins that interact with Ebolavirus proteins from RESTV-resistant primates with those from RESTV-susceptible monkeys. The host proteins that interact with GP and VP35 show an elevated level of sequence divergence between the RESTV-resistant and RESTV-susceptible species, suggesting that they may be responsible for host specificity. Meanwhile, we detect variable positions in protein sequences that are likely associated with the loss of human pathogenicity in RESTV, map them onto the 3D structures and compare their positions to known functional sites. VP35 and VP30 are significantly enriched in these potential pathogenicity determinants and the clustering of such positions on the surfaces of VP35 and GP suggests possible uncharacterized interaction sites with host proteins that contribute to the virulence of Ebolavirus.
Kryshtafovych, Andriy; Moult, John; Bales, Patrick; Bazan, J. Fernando; Biasini, Marco; Burgin, Alex; Chen, Chen; Cochran, Frank V.; Craig, Timothy K.; Das, Rhiju; Fass, Deborah; Garcia-Doval, Carmela; Herzberg, Osnat; Lorimer, Donald; Luecke, Hartmut; Ma, Xiaolei; Nelson, Daniel C.; van Raaij, Mark J.; Rohwer, Forest; Segall, Anca; Seguritan, Victor; Zeth, Kornelius; Schwede, Torsten
2014-01-01
For the last two decades, CASP has assessed the state of the art in techniques for protein structure prediction and identified areas which required further development. CASP would not have been possible without the prediction targets provided by the experimental structural biology community. In the latest experiment, CASP10, over 100 structures were suggested as prediction targets, some of which appeared to be extraordinarily difficult for modeling. In this paper, authors of some of the most challenging targets discuss which specific scientific question motivated the experimental structure determination of the target protein, which structural features were especially interesting from a structural or functional perspective, and to what extent these features were correctly reproduced in the predictions submitted to CASP10. Specifically, the following targets will be presented: the acid-gated urea channel, a difficult to predict trans-membrane protein from the important human pathogen Helicobacter pylori; the structure of human interleukin IL-34, a recently discovered helical cytokine; the structure of a functionally uncharacterized enzyme OrfY from Thermoproteus tenax formed by a gene duplication and a novel fold; an ORFan domain of mimivirus sulfhydryl oxidase R596; the fibre protein gp17 from bacteriophage T7; the Bacteriophage CBA-120 tailspike protein; a virus coat protein from metagenomic samples of the marine environment; and finally an unprecedented class of structure prediction targets based on engineered disulfide-rich small proteins. PMID:24318984
Computational modeling of membrane proteins
Leman, Julia Koehler; Ulmschneider, Martin B.; Gray, Jeffrey J.
2014-01-01
The determination of membrane protein (MP) structures has always trailed that of soluble proteins due to difficulties in their overexpression, reconstitution into membrane mimetics, and subsequent structure determination. The percentage of MP structures in the protein databank (PDB) has been at a constant 1-2% for the last decade. In contrast, over half of all drugs target MPs, only highlighting how little we understand about drug-specific effects in the human body. To reduce this gap, researchers have attempted to predict structural features of MPs even before the first structure was experimentally elucidated. In this review, we present current computational methods to predict MP structure, starting with secondary structure prediction, prediction of trans-membrane spans, and topology. Even though these methods generate reliable predictions, challenges such as predicting kinks or precise beginnings and ends of secondary structure elements are still waiting to be addressed. We describe recent developments in the prediction of 3D structures of both α-helical MPs as well as β-barrels using comparative modeling techniques, de novo methods, and molecular dynamics (MD) simulations. The increase of MP structures has (1) facilitated comparative modeling due to availability of more and better templates, and (2) improved the statistics for knowledge-based scoring functions. Moreover, de novo methods have benefitted from the use of correlated mutations as restraints. Finally, we outline current advances that will likely shape the field in the forthcoming decade. PMID:25355688
BCL::MP-Fold: membrane protein structure prediction guided by EPR restraints
Fischer, Axel W.; Alexander, Nathan S.; Woetzel, Nils; Karakaş, Mert; Weiner, Brian E.; Meiler, Jens
2016-01-01
For many membrane proteins, the determination of their topology remains a challenge for methods like X-ray crystallography and nuclear magnetic resonance (NMR) spectroscopy. Electron paramagnetic resonance (EPR) spectroscopy has evolved as an alternative technique to study structure and dynamics of membrane proteins. The present study demonstrates the feasibility of membrane protein topology determination using limited EPR distance and accessibility measurements. The BCL::MP-Fold algorithm assembles secondary structure elements (SSEs) in the membrane using a Monte Carlo Metropolis (MCM) approach. Sampled models are evaluated using knowledge-based potential functions and agreement with the EPR data and a knowledge-based energy function. Twenty-nine membrane proteins of up to 696 residues are used to test the algorithm. The protein-size-normalized root-mean-square-deviation (RMSD100) value of the most accurate model is better than 8 Å for twenty-seven, better than 6 Å for twenty-two, and better than 4 Å for fifteen out of twenty-nine proteins, demonstrating the algorithm’s ability to sample the native topology. The average enrichment could be improved from 1.3 to 2.5, showing the improved discrimination power by using EPR data. PMID:25820805
The structure of Ca2+-loaded S100A2 at 1.3-Å resolution.
Koch, Michael; Fritz, Günter
2012-05-01
S100A2 is an EF-hand calcium ion (Ca(2+))-binding protein that activates the tumour suppressor p53. In order to understand the molecular mechanisms underlying the Ca(2+) -induced activation of S100A2, the structure of Ca(2+)-bound S100A2 was determined at 1.3 Å resolution by X-ray crystallography. The structure was compared with Ca(2+) -free S100A2 and with other S100 proteins. Binding of Ca(2+) to S100A2 induces small structural changes in the N-terminal EF-hand, but a large conformational change in the C-terminal EF-hand, reorienting helix III by approximately 90°. This movement is accompanied by the exposure of a hydrophobic cavity between helix III and helix IV that represents the target protein interaction site. This molecular reorganization is associated with the breaking and new formation of intramolecular hydrophobic contacts. The target binding site exhibits unique features; in particular, the hydrophobic cavity is larger than in other Ca(2+)-loaded S100 proteins. The structural data underline that the shape and size of the hydrophobic cavity are major determinants for target specificity of S100 proteins and suggest that the binding mode for S100A2 is different from that of other p53-interacting S100 proteins. Database Structural data are available in the Protein Data Bank database under the accession number 4DUQ © 2012 The Authors Journal compilation © 2012 FEBS.
Three-Dimensional Structures Reveal Multiple ADP/ATP Binding Modes
DOE Office of Scientific and Technical Information (OSTI.GOV)
C Simmons; C Magee; D Smith
The creation of synthetic enzymes with predefined functions represents a major challenge in future synthetic biology applications. Here, we describe six structures of de novo proteins that have been determined using protein crystallography to address how simple enzymes perform catalysis. Three structures are of a protein, DX, selected for its stability and ability to tightly bind ATP. Despite the addition of ATP to the crystallization conditions, the presence of a bound but distorted ATP was found only under excess ATP conditions, with ADP being present under equimolar conditions or when crystallized for a prolonged period of time. A bound ADPmore » cofactor was evident when Asp was substituted for Val at residue 65, but ATP in a linear configuration is present when Phe was substituted for Tyr at residue 43. These new structures complement previously determined structures of DX and the protein with the Phe 43 to Tyr substitution [Simmons, C. R., et al. (2009) ACS Chem. Biol. 4, 649-658] and together demonstrate the multiple ADP/ATP binding modes from which a model emerges in which the DX protein binds ATP in a configuration that represents a transitional state for the catalysis of ATP to ADP through a slow, metal-free reaction capable of multiple turnovers. This unusual observation suggests that design-free methods can be used to generate novel protein scaffolds that are tailor-made for catalysis.« less
Troshin, Petr V; Morris, Chris; Prince, Stephen M; Papiz, Miroslav Z
2008-12-01
Membrane Protein Structure Initiative (MPSI) exploits laboratory competencies to work collaboratively and distribute work among the different sites. This is possible as protein structure determination requires a series of steps, starting with target selection, through cloning, expression, purification, crystallization and finally structure determination. Distributed sites create a unique set of challenges for integrating and passing on information on the progress of targets. This role is played by the Protein Information Management System (PIMS), which is a laboratory information management system (LIMS), serving as a hub for MPSI, allowing collaborative structural proteomics to be carried out in a distributed fashion. It holds key information on the progress of cloning, expression, purification and crystallization of proteins. PIMS is employed to track the status of protein targets and to manage constructs, primers, experiments, protocols, sample locations and their detailed histories: thus playing a key role in MPSI data exchange. It also serves as the centre of a federation of interoperable information resources such as local laboratory information systems and international archival resources, like PDB or NCBI. During the challenging task of PIMS integration, within the MPSI, we discovered a number of prerequisites for successful PIMS integration. In this article we share our experiences and provide invaluable insights into the process of LIMS adaptation. This information should be of interest to partners who are thinking about using LIMS as a data centre for their collaborative efforts.
Improta, Roberto; Vitagliano, Luigi; Esposito, Luciana
2015-11-01
The elucidation of the mutual influence between peptide bond geometry and local conformation has important implications for protein structure refinement, validation, and prediction. To gain insights into the structural determinants and the energetic contributions associated with protein/peptide backbone plasticity, we here report an extensive analysis of the variability of the peptide bond angles by combining statistical analyses of protein structures and quantum mechanics calculations on small model peptide systems. Our analyses demonstrate that all the backbone bond angles strongly depend on the peptide conformation and unveil the existence of regular trends as function of ψ and/or φ. The excellent agreement of the quantum mechanics calculations with the statistical surveys of protein structures validates the computational scheme here employed and demonstrates that the valence geometry of protein/peptide backbone is primarily dictated by local interactions. Notably, for the first time we show that the position of the H(α) hydrogen atom, which is an important parameter in NMR structural studies, is also dependent on the local conformation. Most of the trends observed may be satisfactorily explained by invoking steric repulsive interactions; in some specific cases the valence bond variability is also influenced by hydrogen-bond like interactions. Moreover, we can provide a reliable estimate of the energies involved in the interplay between geometry and conformations. © 2015 Wiley Periodicals, Inc.
Computational methods for constructing protein structure models from 3D electron microscopy maps.
Esquivel-Rodríguez, Juan; Kihara, Daisuke
2013-10-01
Protein structure determination by cryo-electron microscopy (EM) has made significant progress in the past decades. Resolutions of EM maps have been improving as evidenced by recently reported structures that are solved at high resolutions close to 3Å. Computational methods play a key role in interpreting EM data. Among many computational procedures applied to an EM map to obtain protein structure information, in this article we focus on reviewing computational methods that model protein three-dimensional (3D) structures from a 3D EM density map that is constructed from two-dimensional (2D) maps. The computational methods we discuss range from de novo methods, which identify structural elements in an EM map, to structure fitting methods, where known high resolution structures are fit into a low-resolution EM map. A list of available computational tools is also provided. Copyright © 2013 Elsevier Inc. All rights reserved.
Modeling the Structure of Helical Assemblies with Experimental Constraints in Rosetta.
André, Ingemar
2018-01-01
Determining high-resolution structures of proteins with helical symmetry can be challenging due to limitations in experimental data. In such instances, structure-based protein simulations driven by experimental data can provide a valuable approach for building models of helical assemblies. This chapter describes how the Rosetta macromolecular package can be used to model homomeric protein assemblies with helical symmetry in a range of modeling scenarios including energy refinement, symmetrical docking, comparative modeling, and de novo structure prediction. Data-guided structure modeling of helical assemblies with experimental information from electron density, X-ray fiber diffraction, solid-state NMR, and chemical cross-linking mass spectrometry is also described.
Equilibrium simulations of proteins using molecular fragment replacement and NMR chemical shifts.
Boomsma, Wouter; Tian, Pengfei; Frellsen, Jes; Ferkinghoff-Borg, Jesper; Hamelryck, Thomas; Lindorff-Larsen, Kresten; Vendruscolo, Michele
2014-09-23
Methods of protein structure determination based on NMR chemical shifts are becoming increasingly common. The most widely used approaches adopt the molecular fragment replacement strategy, in which structural fragments are repeatedly reassembled into different complete conformations in molecular simulations. Although these approaches are effective in generating individual structures consistent with the chemical shift data, they do not enable the sampling of the conformational space of proteins with correct statistical weights. Here, we present a method of molecular fragment replacement that makes it possible to perform equilibrium simulations of proteins, and hence to determine their free energy landscapes. This strategy is based on the encoding of the chemical shift information in a probabilistic model in Markov chain Monte Carlo simulations. First, we demonstrate that with this approach it is possible to fold proteins to their native states starting from extended structures. Second, we show that the method satisfies the detailed balance condition and hence it can be used to carry out an equilibrium sampling from the Boltzmann distribution corresponding to the force field used in the simulations. Third, by comparing the results of simulations carried out with and without chemical shift restraints we describe quantitatively the effects that these restraints have on the free energy landscapes of proteins. Taken together, these results demonstrate that the molecular fragment replacement strategy can be used in combination with chemical shift information to characterize not only the native structures of proteins but also their conformational fluctuations.
Fast iodide-SAD phasing for high-throughput membrane protein structure determination.
Melnikov, Igor; Polovinkin, Vitaly; Kovalev, Kirill; Gushchin, Ivan; Shevtsov, Mikhail; Shevchenko, Vitaly; Mishin, Alexey; Alekseev, Alexey; Rodriguez-Valera, Francisco; Borshchevskiy, Valentin; Cherezov, Vadim; Leonard, Gordon A; Gordeliy, Valentin; Popov, Alexander
2017-05-01
We describe a fast, easy, and potentially universal method for the de novo solution of the crystal structures of membrane proteins via iodide-single-wavelength anomalous diffraction (I-SAD). The potential universality of the method is based on a common feature of membrane proteins-the availability at the hydrophobic-hydrophilic interface of positively charged amino acid residues with which iodide strongly interacts. We demonstrate the solution using I-SAD of four crystal structures representing different classes of membrane proteins, including a human G protein-coupled receptor (GPCR), and we show that I-SAD can be applied using data collection strategies based on either standard or serial x-ray crystallography techniques.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Geerds, Christina; Wohlmann, Jens; Haas, Albert
The structure of VapB, a member of the Vap protein family that is involved in virulence of the bacterial pathogen R. equi, was determined by SAD phasing and reveals an eight-stranded antiparallel β-barrel similar to avidin, suggestive of a binding function. Made up of two Greek-key motifs, the topology of VapB is unusual or even unique. Members of the virulence-associated protein (Vap) family from the pathogen Rhodococcus equi regulate virulence in an unknown manner. They do not share recognizable sequence homology with any protein of known structure. VapB and VapA are normally associated with isolates from pigs and horses, respectively.more » To contribute to a molecular understanding of Vap function, the crystal structure of a protease-resistant VapB fragment was determined at 1.4 Å resolution. The structure was solved by SAD phasing employing the anomalous signal of one endogenous S atom and two bound Co ions with low occupancy. VapB is an eight-stranded antiparallel β-barrel with a single helix. Structural similarity to avidins suggests a potential binding function. Unlike other eight- or ten-stranded β-barrels found in avidins, bacterial outer membrane proteins, fatty-acid-binding proteins and lysozyme inhibitors, Vaps do not have a next-neighbour arrangement but consist of two Greek-key motifs with strand order 41238567, suggesting an unusual or even unique topology.« less
Protein Crystal Quality Studies
NASA Technical Reports Server (NTRS)
1998-01-01
Eddie Snell (standing), Post-Doctoral Fellow the National Research Council (NRC),and Marc Pusey of Marshall Space Flight Center (MSFC) use a reciprocal space mapping diffractometer for marcromolecular crystal quality studies. The diffractometer is used in mapping the structure of marcromolecules such as proteins to determine their structure and thus understand how they function with other proteins in the body. This is one of several analytical tools used on proteins crystalized on Earth and in space experiments. Photo credit: NASA/Marshall Space Flight Center (MSFC)
Miyakawa, Takuya; Sawano, Yoriko; Miyazono, Ken-ichi; Miyauchi, Yumiko; Hatano, Ken-ichi
2013-01-01
STK_08120 is a member of the thermoacidophile-specific DUF3211 protein family from Sulfolobus tokodaii strain 7. Its molecular function remains obscure, and sequence similarities for obtaining functional remarks are not available. In this study, the crystal structure of STK_08120 was determined at 1.79-Å resolution to predict its probable function using structure similarity searches. The structure adopts an α/β structure of a helix-grip fold, which is found in the START domain proteins with cavities for hydrophobic substrates or ligands. The detailed structural features implied that fatty acids are the primary ligand candidates for STK_08120, and binding assays revealed that the protein bound long-chain saturated fatty acids (>C14) and their trans-unsaturated types with an affinity equal to that for major fatty acid binding proteins in mammals and plants. Moreover, the structure of an STK_08120-myristic acid complex revealed a unique binding mode among fatty acid binding proteins. These results suggest that the thermoacidophile-specific protein family DUF3211 functions as a fatty acid carrier with a novel binding mode. PMID:23836863
PhyreStorm: A Web Server for Fast Structural Searches Against the PDB.
Mezulis, Stefans; Sternberg, Michael J E; Kelley, Lawrence A
2016-02-22
The identification of structurally similar proteins can provide a range of biological insights, and accordingly, the alignment of a query protein to a database of experimentally determined protein structures is a technique commonly used in the fields of structural and evolutionary biology. The PhyreStorm Web server has been designed to provide comprehensive, up-to-date and rapid structural comparisons against the Protein Data Bank (PDB) combined with a rich and intuitive user interface. It is intended that this facility will enable biologists inexpert in bioinformatics access to a powerful tool for exploring protein structure relationships beyond what can be achieved by sequence analysis alone. By partitioning the PDB into similar structures, PhyreStorm is able to quickly discard the majority of structures that cannot possibly align well to a query protein, reducing the number of alignments required by an order of magnitude. PhyreStorm is capable of finding 93±2% of all highly similar (TM-score>0.7) structures in the PDB for each query structure, usually in less than 60s. PhyreStorm is available at http://www.sbg.bio.ic.ac.uk/phyrestorm/. Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.
Sanchez-Martinez, M; Crehuet, R
2014-12-21
We present a method based on the maximum entropy principle that can re-weight an ensemble of protein structures based on data from residual dipolar couplings (RDCs). The RDCs of intrinsically disordered proteins (IDPs) provide information on the secondary structure elements present in an ensemble; however even two sets of RDCs are not enough to fully determine the distribution of conformations, and the force field used to generate the structures has a pervasive influence on the refined ensemble. Two physics-based coarse-grained force fields, Profasi and Campari, are able to predict the secondary structure elements present in an IDP, but even after including the RDC data, the re-weighted ensembles differ between both force fields. Thus the spread of IDP ensembles highlights the need for better force fields. We distribute our algorithm in an open-source Python code.
Gallagher, D T; Karageorgos, I; Hudgens, J W; Galvin, C V
2018-02-01
The reported data describe the crystallization, crystal packing, structure determination and twinning of the unliganded Fab (antigen-binding fragment) from the NISTmAb (standard reference material 8671). The raw atomic coordinates are available as Protein Data Bank structure 5K8A and biological aspects are described in the article, (Karageorgos et al., 2017) [1]. Crystal data show that the packing is unique, and show the basis for the crystal's twinned growth. Twinning is a common and often serious problem in protein structure determination by x-ray crystallography [2]. In the present case the twinning is due to a small deviation (about 0.3 nm) from 4-fold symmetry in the primary intermolecular interface. The deviation produces pseudosymmetry, generating slightly different conformations of the protein, and alternating strong and weak forms of key packing interfaces throughout the lattice.
1998-06-16
Eddie Snell, Post-Doctoral Fellow the National Research Council (NRC) uses a reciprocal space mapping diffractometer for macromolecular crystal quality studies. The diffractometer is used in mapping the structure of macromolecules such as proteins to determine their structure and thus understand how they function with other proteins in the body. This is one of several analytical tools used on proteins crystallized on Earth and in space experiments. Photo credit: NASA/Marshall Space Flight Center (MSFC)
The Protein Data Bank at 40: Reflecting on the Past to Prepare for the Future
Berman, Helen M.; Kleywegt, Gerard J.; Nakamura, Haruki; Markley, John L.
2012-01-01
A symposium celebrating the 40th anniversary of the Protein Data Bank archive (PDB), organized by the Worldwide Protein Data Bank, was held at Cold Spring Harbor Laboratory (CSHL) October 28–30, 2011. PDB40’s distinguished speakers highlighted four decades of innovation in structural biology, from the early era of structural determination to future directions for the field. PMID:22404998
Corsaro, Alessandro; Thellung, Stefano; Bucciarelli, Tonino; Scotti, Luca; Chiovitti, Katia; Villa, Valentina; D'Arrigo, Cristina; Aceto, Antonio; Florio, Tullio
2011-03-01
Mutations in prion protein are thought to be causative of inherited prion diseases favoring the spontaneous conversion of the normal prion protein into the scrapie-like pathological prion protein. We previously reported that, by controlled thermal denaturation, human prion protein fragment 90-231 acquires neurotoxic properties when transformed in a β-rich conformation, resembling the scrapie-like conformation. In this study we generated prion protein fragment 90-231 bearing mutations identified in familial prion diseases (D202N and E200K), to analyze their role in the induction of a neurotoxic conformation. Prion protein fragment 90-231(wild type) and the D202N mutant were not toxic in native conformation but induced cell death only after thermal denaturation. Conversely, prion protein fragment 90-231(E200K) was highly toxic in its native structure, suggesting that E200K mutation per se favors the acquisition of a peptide neurotoxic conformation. To identify the structural determinants of prion protein fragment 90-231 toxicity, we show that while the wild type peptide is structured in α-helix, hPrP90-231 E200K is spontaneously refolded in a β-structured conformer characterized by increased proteinase K resistance and propensity to generate fibrils. However, the most significant difference induced by E200K mutation in prion protein fragment 90-231 structure in native conformation we observed, was an increase in the exposure of hydrophobic amino-acids on protein surface that was detected in wild type and D202N proteins only after thermal denaturation. In conclusion, we propose that increased hydrophobicity is one of the main determinants of toxicity induced by different mutations in prion protein-derived peptides. Copyright © 2010 Elsevier Ltd. All rights reserved.
TAP score: torsion angle propensity normalization applied to local protein structure evaluation
Tosatto, Silvio CE; Battistutta, Roberto
2007-01-01
Background Experimentally determined protein structures may contain errors and require validation. Conformational criteria based on the Ramachandran plot are mainly used to distinguish between distorted and adequately refined models. While the readily available criteria are sufficient to detect totally wrong structures, establishing the more subtle differences between plausible structures remains more challenging. Results A new criterion, called TAP score, measuring local sequence to structure fitness based on torsion angle propensities normalized against the global minimum and maximum is introduced. It is shown to be more accurate than previous methods at estimating the validity of a protein model in terms of commonly used experimental quality parameters on two test sets representing the full PDB database and a subset of obsolete PDB structures. Highly selective TAP thresholds are derived to recognize over 90% of the top experimental structures in the absence of experimental information. Both a web server and an executable version of the TAP score are available at . Conclusion A novel procedure for energy normalization (TAP) has significantly improved the possibility to recognize the best experimental structures. It will allow the user to more reliably isolate problematic structures in the context of automated experimental structure determination. PMID:17504537
Serum Albumin Domain Structures in Human Blood Serum by Mass Spectrometry and Computational Biology.
Belsom, Adam; Schneider, Michael; Fischer, Lutz; Brock, Oliver; Rappsilber, Juri
2016-03-01
Chemical cross-linking combined with mass spectrometry has proven useful for studying protein-protein interactions and protein structure, however the low density of cross-link data has so far precluded its use in determining structures de novo. Cross-linking density has been typically limited by the chemical selectivity of the standard cross-linking reagents that are commonly used for protein cross-linking. We have implemented the use of a heterobifunctional cross-linking reagent, sulfosuccinimidyl 4,4'-azipentanoate (sulfo-SDA), combining a traditional sulfo-N-hydroxysuccinimide (sulfo-NHS) ester and a UV photoactivatable diazirine group. This diazirine yields a highly reactive and promiscuous carbene species, the net result being a greatly increased number of cross-links compared with homobifunctional, NHS-based cross-linkers. We present a novel methodology that combines the use of this high density photo-cross-linking data with conformational space search to investigate the structure of human serum albumin domains, from purified samples, and in its native environment, human blood serum. Our approach is able to determine human serum albumin domain structures with good accuracy: root-mean-square deviation to crystal structure are 2.8/5.6/2.9 Å (purified samples) and 4.5/5.9/4.8Å (serum samples) for domains A/B/C for the first selected structure; 2.5/4.9/2.9 Å (purified samples) and 3.5/5.2/3.8 Å (serum samples) for the best out of top five selected structures. Our proof-of-concept study on human serum albumin demonstrates initial potential of our approach for determining the structures of more proteins in the complex biological contexts in which they function and which they may require for correct folding. Data are available via ProteomeXchange with identifier PXD001692. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
TALEs from a spring--superelasticity of Tal effector protein structures.
Flechsig, Holger
2014-01-01
Transcription activator-like effectors (TALEs) are DNA-related proteins that recognise and bind specific target sequences to manipulate gene expression. Recently determined crystal structures show that their common architecture reveals a superhelical overall structure that may undergo drastic conformational changes. To establish a link between structure and dynamics in TALE proteins we have employed coarse-grained elastic-network modelling of currently available structural data and implemented a force-probe setup that allowed us to investigate their mechanical behaviour in computer experiments. Based on the measured force-extension curves we conclude that TALEs exhibit superelastic dynamical properties allowing for large-scale global conformational changes along their helical axis, which represents the soft direction in such proteins. For moderate external forcing the TALE models behave like linear springs, obeying Hooke's law, and the investigated structures can be characterised and compared by a corresponding spring constant. We show that conformational flexibility underlying the large-scale motions is not homogeneously distributed over the TALE structure, but instead soft spot residues around which strain is accumulated and which turn out to represent key agents in the transmission of conformational motions are identified. They correspond to the RVD loop residues that have been experimentally determined to play an eminent role in the binding process of target DNA.
TALEs from a Spring – Superelasticity of Tal Effector Protein Structures
Flechsig, Holger
2014-01-01
Transcription activator-like effectors (TALEs) are DNA-related proteins that recognise and bind specific target sequences to manipulate gene expression. Recently determined crystal structures show that their common architecture reveals a superhelical overall structure that may undergo drastic conformational changes. To establish a link between structure and dynamics in TALE proteins we have employed coarse-grained elastic-network modelling of currently available structural data and implemented a force-probe setup that allowed us to investigate their mechanical behaviour in computer experiments. Based on the measured force-extension curves we conclude that TALEs exhibit superelastic dynamical properties allowing for large-scale global conformational changes along their helical axis, which represents the soft direction in such proteins. For moderate external forcing the TALE models behave like linear springs, obeying Hooke's law, and the investigated structures can be characterised and compared by a corresponding spring constant. We show that conformational flexibility underlying the large-scale motions is not homogeneously distributed over the TALE structure, but instead soft spot residues around which strain is accumulated and which turn out to represent key agents in the transmission of conformational motions are identified. They correspond to the RVD loop residues that have been experimentally determined to play an eminent role in the binding process of target DNA. PMID:25313859
Refinement of NMR structures using implicit solvent and advanced sampling techniques.
Chen, Jianhan; Im, Wonpil; Brooks, Charles L
2004-12-15
NMR biomolecular structure calculations exploit simulated annealing methods for conformational sampling and require a relatively high level of redundancy in the experimental restraints to determine quality three-dimensional structures. Recent advances in generalized Born (GB) implicit solvent models should make it possible to combine information from both experimental measurements and accurate empirical force fields to improve the quality of NMR-derived structures. In this paper, we study the influence of implicit solvent on the refinement of protein NMR structures and identify an optimal protocol of utilizing these improved force fields. To do so, we carry out structure refinement experiments for model proteins with published NMR structures using full NMR restraints and subsets of them. We also investigate the application of advanced sampling techniques to NMR structure refinement. Similar to the observations of Xia et al. (J.Biomol. NMR 2002, 22, 317-331), we find that the impact of implicit solvent is rather small when there is a sufficient number of experimental restraints (such as in the final stage of NMR structure determination), whether implicit solvent is used throughout the calculation or only in the final refinement step. The application of advanced sampling techniques also seems to have minimal impact in this case. However, when the experimental data are limited, we demonstrate that refinement with implicit solvent can substantially improve the quality of the structures. In particular, when combined with an advanced sampling technique, the replica exchange (REX) method, near-native structures can be rapidly moved toward the native basin. The REX method provides both enhanced sampling and automatic selection of the most native-like (lowest energy) structures. An optimal protocol based on our studies first generates an ensemble of initial structures that maximally satisfy the available experimental data with conventional NMR software using a simplified force field and then refines these structures with implicit solvent using the REX method. We systematically examine the reliability and efficacy of this protocol using four proteins of various sizes ranging from the 56-residue B1 domain of Streptococcal protein G to the 370-residue Maltose-binding protein. Significant improvement in the structures was observed in all cases when refinement was based on low-redundancy restraint data. The proposed protocol is anticipated to be particularly useful in early stages of NMR structure determination where a reliable estimate of the native fold from limited data can significantly expedite the overall process. This refinement procedure is also expected to be useful when redundant experimental data are not readily available, such as for large multidomain biomolecules and in solid-state NMR structure determination.
Underestimated Halogen Bonds Forming with Protein Backbone in Protein Data Bank.
Zhang, Qian; Xu, Zhijian; Shi, Jiye; Zhu, Weiliang
2017-07-24
Halogen bonds (XBs) are attracting increasing attention in biological systems. Protein Data Bank (PDB) archives experimentally determined XBs in biological macromolecules. However, no software for structure refinement in X-ray crystallography takes into account XBs, which might result in the weakening or even vanishing of experimentally determined XBs in PDB. In our previous study, we showed that side-chain XBs forming with protein side chains are underestimated in PDB on the basis of the phenomenon that the proportion of side-chain XBs to overall XBs decreases as structural resolution becomes lower and lower. However, whether the dominant backbone XBs forming with protein backbone are overlooked is still a mystery. Here, with the help of the ratio (R F ) of the observed XBs' frequency of occurrence to their frequency expected at random, we demonstrated that backbone XBs are largely overlooked in PDB, too. Furthermore, three cases were discovered possessing backbone XBs in high resolution structures while losing the XBs in low resolution structures. In the last two cases, even at 1.80 Å resolution, the backbone XBs were lost, manifesting the urgent need to consider XBs in the refinement process during X-ray crystallography study.
Kim, Do Jin; Bitto, Eduard; Bingman, Craig A; Kim, Hyun-Jung; Han, Byung Woo; Phillips, George N
2015-07-01
Members of the universal stress protein (USP) family are conserved in a phylogenetically diverse range of prokaryotes, fungi, protists, and plants and confer abilities to respond to a wide range of environmental stresses. Arabidopsis thaliana contains 44 USP domain-containing proteins, and USP domain is found either in a small protein with unknown physiological function or in an N-terminal portion of a multi-domain protein, usually a protein kinase. Here, we report the first crystal structure of a eukaryotic USP-like protein encoded from the gene At3g01520. The crystal structure of the protein At3g01520 was determined by the single-wavelength anomalous dispersion method and refined to an R factor of 21.8% (Rfree = 26.1%) at 2.5 Å resolution. The crystal structure includes three At3g01520 protein dimers with one AMP molecule bound to each protomer, comprising a Rossmann-like α/β overall fold. The bound AMP and conservation of residues in the ATP-binding loop suggest that the protein At3g01520 also belongs to the ATP-binding USP subfamily members. © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc.
Watching proteins function with picosecond X-ray crystallography and molecular dynamics simulations.
NASA Astrophysics Data System (ADS)
Anfinrud, Philip
2006-03-01
Time-resolved electron density maps of myoglobin, a ligand-binding heme protein, have been stitched together into movies that unveil with < 2-å spatial resolution and 150-ps time-resolution the correlated protein motions that accompany and/or mediate ligand migration within the hydrophobic interior of a protein. A joint analysis of all-atom molecular dynamics (MD) calculations and picosecond time-resolved X-ray structures provides single-molecule insights into mechanisms of protein function. Ensemble-averaged MD simulations of the L29F mutant of myoglobin following ligand dissociation reproduce the direction, amplitude, and timescales of crystallographically-determined structural changes. This close agreement with experiments at comparable resolution in space and time validates the individual MD trajectories, which identify and structurally characterize a conformational switch that directs dissociated ligands to one of two nearby protein cavities. This unique combination of simulation and experiment unveils functional protein motions and illustrates at an atomic level relationships among protein structure, dynamics, and function. In collaboration with Friedrich Schotte and Gerhard Hummer, NIH.
Cloning, production, and purification of proteins for a medium-scale structural genomics project.
Quevillon-Cheruel, Sophie; Collinet, Bruno; Trésaugues, Lionel; Minard, Philippe; Henckes, Gilles; Aufrère, Robert; Blondeau, Karine; Zhou, Cong-Zhao; Liger, Dominique; Bettache, Nabila; Poupon, Anne; Aboulfath, Ilham; Leulliot, Nicolas; Janin, Joël; van Tilbeurgh, Herman
2007-01-01
The South-Paris Yeast Structural Genomics Pilot Project (http://www.genomics.eu.org) aims at systematically expressing, purifying, and determining the three-dimensional structures of Saccharomyces cerevisiae proteins. We have already cloned 240 yeast open reading frames in the Escherichia coli pET system. Eighty-two percent of the targets can be expressed in E. coli, and 61% yield soluble protein. We have currently purified 58 proteins. Twelve X-ray structures have been solved, six are in progress, and six other proteins gave crystals. In this chapter, we present the general experimental flowchart applied for this project. One of the main difficulties encountered in this pilot project was the low solubility of a great number of target proteins. We have developed parallel strategies to recover these proteins from inclusion bodies, including refolding, coexpression with chaperones, and an in vitro expression system. A limited proteolysis protocol, developed to localize flexible regions in proteins that could hinder crystallization, is also described.
Zook, James; Mo, Gina; Sisco, Nicholas J; Craciunescu, Felicia M; Hansen, Debra T; Baravati, Bobby; Cherry, Brian R; Sykes, Kathryn; Wachter, Rebekka; Van Horn, Wade D; Fromme, Petra
2015-06-02
Tularemia is a potentially fatal bacterial infection caused by Francisella tularensis, and is endemic to North America and many parts of northern Europe and Asia. The outer membrane lipoprotein, Flpp3, has been identified as a virulence determinant as well as a potential subunit template for vaccine development. Here we present the first structure for the soluble domain of Flpp3 from the highly infectious Type A SCHU S4 strain, derived through high-resolution solution nuclear magnetic resonance (NMR) spectroscopy; the first structure of a lipoprotein from the genus Francisella. The Flpp3 structure demonstrates a globular protein with an electrostatically polarized surface containing an internal cavity-a putative binding site based on the structurally homologous Bet v1 protein family of allergens. NMR-based relaxation studies suggest loop regions that potentially modulate access to the internal cavity. The Flpp3 structure may add to the understanding of F. tularensis virulence and contribute to the development of effective vaccines. Copyright © 2015 Elsevier Ltd. All rights reserved.
The interface of protein structure, protein biophysics, and molecular evolution
Liberles, David A; Teichmann, Sarah A; Bahar, Ivet; Bastolla, Ugo; Bloom, Jesse; Bornberg-Bauer, Erich; Colwell, Lucy J; de Koning, A P Jason; Dokholyan, Nikolay V; Echave, Julian; Elofsson, Arne; Gerloff, Dietlind L; Goldstein, Richard A; Grahnen, Johan A; Holder, Mark T; Lakner, Clemens; Lartillot, Nicholas; Lovell, Simon C; Naylor, Gavin; Perica, Tina; Pollock, David D; Pupko, Tal; Regan, Lynne; Roger, Andrew; Rubinstein, Nimrod; Shakhnovich, Eugene; Sjölander, Kimmen; Sunyaev, Shamil; Teufel, Ashley I; Thorne, Jeffrey L; Thornton, Joseph W; Weinreich, Daniel M; Whelan, Simon
2012-01-01
Abstract The interface of protein structural biology, protein biophysics, molecular evolution, and molecular population genetics forms the foundations for a mechanistic understanding of many aspects of protein biochemistry. Current efforts in interdisciplinary protein modeling are in their infancy and the state-of-the art of such models is described. Beyond the relationship between amino acid substitution and static protein structure, protein function, and corresponding organismal fitness, other considerations are also discussed. More complex mutational processes such as insertion and deletion and domain rearrangements and even circular permutations should be evaluated. The role of intrinsically disordered proteins is still controversial, but may be increasingly important to consider. Protein geometry and protein dynamics as a deviation from static considerations of protein structure are also important. Protein expression level is known to be a major determinant of evolutionary rate and several considerations including selection at the mRNA level and the role of interaction specificity are discussed. Lastly, the relationship between modeling and needed high-throughput experimental data as well as experimental examination of protein evolution using ancestral sequence resurrection and in vitro biochemistry are presented, towards an aim of ultimately generating better models for biological inference and prediction. PMID:22528593
Use of conserved key amino acid positions to morph protein folds.
Reddy, Boojala V B; Li, Wilfred W; Bourne, Philip E
2002-07-15
By using three-dimensional (3D) structure alignments and a previously published method to determine Conserved Key Amino Acid Positions (CKAAPs) we propose a theoretical method to design mutations that can be used to morph the protein folds. The original Paracelsus challenge, met by several groups, called for the engineering of a stable but different structure by modifying less than 50% of the amino acid residues. We have used the sequences from the Protein Data Bank (PDB) identifiers 1ROP, and 2CRO, which were previously used in the Paracelsus challenge by those groups, and suggest mutation to CKAAPs to morph the protein fold. The total number of mutations suggested is less than 40% of the starting sequence theoretically improving the challenge results. From secondary structure prediction experiments of the proposed mutant sequence structures, we observe that each of the suggested mutant protein sequences likely folds to a different, non-native potentially stable target structure. These results are an early indicator that analyses using structure alignments leading to CKAAPs of a given structure are of value in protein engineering experiments. Copyright 2002 Wiley Periodicals, Inc.
Proteins Are the Body's Worker Molecules
... molecular structures. Many of these new technologies are robots that automate previously labor-intensive steps in structure determination. Thanks to these robots, it is possible to solve structures faster than ...
Time-Resolved Macromolecular Crystallography at Modern X-Ray Sources.
Schmidt, Marius
2017-01-01
Time-resolved macromolecular crystallography unifies protein structure determination with chemical kinetics. With the advent of fourth generation X-ray sources the time-resolution can be on the order of 10-40 fs, which opens the ultrafast time scale to structure determination. Fundamental motions and transitions associated with chemical reactions in proteins can now be observed. Moreover, new experimental approaches at synchrotrons allow for the straightforward investigation of all kind of reactions in biological macromolecules. Here, recent developments in the field are reviewed.
The SARS coronavirus nucleocapsid protein--forms and functions.
Chang, Chung-ke; Hou, Ming-Hon; Chang, Chi-Fon; Hsiao, Chwan-Deng; Huang, Tai-huang
2014-03-01
The nucleocapsid phosphoprotein of the severe acute respiratory syndrome coronavirus (SARS-CoV N protein) packages the viral genome into a helical ribonucleocapsid (RNP) and plays a fundamental role during viral self-assembly. It is a protein with multifarious activities. In this article we will review our current understanding of the N protein structure and its interaction with nucleic acid. Highlights of the progresses include uncovering the modular organization, determining the structures of the structural domains, realizing the roles of protein disorder in protein-protein and protein-nucleic acid interactions, and visualizing the ribonucleoprotein (RNP) structure inside the virions. It was also demonstrated that N-protein binds to nucleic acid at multiple sites with a coupled-allostery manner. We propose a SARS-CoV RNP model that conforms to existing data and bears resemblance to the existing RNP structures of RNA viruses. The model highlights the critical role of modular organization and intrinsic disorder of the N protein in the formation and functions of the dynamic RNP capsid in RNA viruses. This paper forms part of a symposium in Antiviral Research on "From SARS to MERS: 10 years of research on highly pathogenic human coronaviruses." Copyright © 2014 Elsevier B.V. All rights reserved.
Crystal structures of ASK1-inhibtor complexes provide a platform for structure-based drug design
Singh, Onkar; Shillings, Anthony; Craggs, Peter; Wall, Ian; Rowland, Paul; Skarzynski, Tadeusz; Hobbs, Clare I; Hardwick, Phil; Tanner, Rob; Blunt, Michelle; Witty, David R; Smith, Kathrine J
2013-01-01
ASK1, a member of the MAPK Kinase Kinase family of proteins has been shown to play a key role in cancer, neurodegeneration and cardiovascular diseases and is emerging as a possible drug target. Here we describe a ‘replacement-soaking’ method that has enabled the high-throughput X-ray structure determination of ASK1/ligand complexes. Comparison of the X-ray structures of five ASK1/ligand complexes from 3 different chemotypes illustrates that the ASK1 ATP binding site is able to accommodate a range of chemical diversity and different binding modes. The replacement-soaking system is also able to tolerate some protein flexibility. This crystal system provides a robust platform for ASK1/ligand structure determination and future structure based drug design. PMID:23776076
Lobley, Carina M C; Aller, Pierre; Douangamath, Alice; Reddivari, Yamini; Bumann, Mario; Bird, Louise E; Nettleship, Joanne E; Brandao-Neto, Jose; Owens, Raymond J; O'Toole, Paul W; Walsh, Martin A
2012-12-01
The structure of ribose 5-phosphate isomerase from the probiotic bacterium Lactobacillus salivarius UCC188 has been determined at 1.72 Å resolution. The structure was solved by molecular replacement, which identified the functional homodimer in the asymmetric unit. Despite only showing 57% sequence identity to its closest homologue, the structure adopted the typical α and β D-ribose 5-phosphate isomerase fold. Comparison to other related structures revealed high homology in the active site, allowing a model of the substrate-bound protein to be proposed. The determination of the structure was expedited by the use of in situ crystallization-plate screening on beamline I04-1 at Diamond Light Source to identify well diffracting protein crystals prior to routine cryocrystallography.
Accurate high-throughput structure mapping and prediction with transition metal ion FRET
Yu, Xiaozhen; Wu, Xiongwu; Bermejo, Guillermo A.; Brooks, Bernard R.; Taraska, Justin W.
2013-01-01
Mapping the landscape of a protein’s conformational space is essential to understanding its functions and regulation. The limitations of many structural methods have made this process challenging for most proteins. Here, we report that transition metal ion FRET (tmFRET) can be used in a rapid, highly parallel screen, to determine distances from multiple locations within a protein at extremely low concentrations. The distances generated through this screen for the protein Maltose Binding Protein (MBP) match distances from the crystal structure to within a few angstroms. Furthermore, energy transfer accurately detects structural changes during ligand binding. Finally, fluorescence-derived distances can be used to guide molecular simulations to find low energy states. Our results open the door to rapid, accurate mapping and prediction of protein structures at low concentrations, in large complex systems, and in living cells. PMID:23273426
Meyer, Philippe; Liger, Dominique; Leulliot, Nicolas; Quevillon-Cheruel, Sophie; Zhou, Cong-Zhao; Borel, Franck; Ferrer, Jean-Luc; Poupon, Anne; Janin, Joël; van Tilbeurgh, Herman
2005-12-01
We have determined the three-dimensional crystal structure of the protein encoded by the open reading frame YFL030w from Saccharomyces cerevisiae to a resolution of 2.6 A using single wavelength anomalous diffraction. YFL030w is a 385 amino-acid protein with sequence similarity to the aminotransferase family. The structure of the protein reveals a homodimer adopting the fold-type I of pyridoxal 5'-phosphate (PLP)-dependent aminotransferases. The PLP co-factor is covalently bound to the active site in the crystal structure. The protein shows close structural resemblance with the human alanine:glyoxylate aminotransferase (EC 2.6.1.44), an enzyme involved in the hereditary kidney stone disease primary hyperoxaluria type 1. In this paper we show that YFL030w codes for an alanine:glyoxylate aminotransferase, highly specific for its amino donor and acceptor substrates.
Kim, Jaewon; Lee, Jihun; Brych, Stephen R; Logan, Timothy M; Blaber, Michael
2005-02-01
The beta-turn is the most common type of nonrepetitive structure in globular proteins, comprising ~25% of all residues; however, a detailed understanding of effects of specific residues upon beta-turn stability and conformation is lacking. Human acidic fibroblast growth factor (FGF-1) is a member of the beta-trefoil superfold and contains a total of five beta-hairpin structures (antiparallel beta-sheets connected by a reverse turn). beta-Turns related by the characteristic threefold structural symmetry of this superfold exhibit different primary structures, and in some cases, different secondary structures. As such, they represent a useful system with which to study the role that turn sequences play in determining structure, stability, and folding of the protein. Two turns related by the threefold structural symmetry, the beta4/beta5 and beta8/beta9 turns, were subjected to both sequence-swapping and poly-glycine substitution mutations, and the effects upon stability, folding, and structure were investigated. In the wild-type protein these turns are of identical length, but exhibit different conformations. These conformations were observed to be retained during sequence-swapping and glycine substitution mutagenesis. The results indicate that the beta-turn structure at these positions is not determined by the turn sequence. Structural analysis suggests that residues flanking the turn are a primary structural determinant of the conformation within the turn.
Accurate Structural Correlations from Maximum Likelihood Superpositions
Theobald, Douglas L; Wuttke, Deborah S
2008-01-01
The cores of globular proteins are densely packed, resulting in complicated networks of structural interactions. These interactions in turn give rise to dynamic structural correlations over a wide range of time scales. Accurate analysis of these complex correlations is crucial for understanding biomolecular mechanisms and for relating structure to function. Here we report a highly accurate technique for inferring the major modes of structural correlation in macromolecules using likelihood-based statistical analysis of sets of structures. This method is generally applicable to any ensemble of related molecules, including families of nuclear magnetic resonance (NMR) models, different crystal forms of a protein, and structural alignments of homologous proteins, as well as molecular dynamics trajectories. Dominant modes of structural correlation are determined using principal components analysis (PCA) of the maximum likelihood estimate of the correlation matrix. The correlations we identify are inherently independent of the statistical uncertainty and dynamic heterogeneity associated with the structural coordinates. We additionally present an easily interpretable method (“PCA plots”) for displaying these positional correlations by color-coding them onto a macromolecular structure. Maximum likelihood PCA of structural superpositions, and the structural PCA plots that illustrate the results, will facilitate the accurate determination of dynamic structural correlations analyzed in diverse fields of structural biology. PMID:18282091
Controlling the shape of membrane protein polyhedra
NASA Astrophysics Data System (ADS)
Li, Di; Kahraman, Osman; Haselwandter, Christoph A.
2017-03-01
Membrane proteins and lipids can self-assemble into membrane protein polyhedral nanoparticles (MPPNs). MPPNs have a closed spherical surface and a polyhedral protein arrangement, and may offer a new route for structure determination of membrane proteins and targeted drug delivery. We develop here a general analytic model of how MPPN self-assembly depends on bilayer-protein interactions and lipid bilayer mechanical properties. We find that the bilayer-protein hydrophobic thickness mismatch is a key molecular control parameter for MPPN shape that can be used to bias MPPN self-assembly towards highly symmetric and uniform MPPN shapes. Our results suggest strategies for optimizing MPPN shape for structural studies of membrane proteins and targeted drug delivery.
Acyl carrier protein structural classification and normal mode analysis
Cantu, David C; Forrester, Michael J; Charov, Katherine; Reilly, Peter J
2012-01-01
All acyl carrier protein primary and tertiary structures were gathered into the ThYme database. They are classified into 16 families by amino acid sequence similarity, with members of the different families having sequences with statistically highly significant differences. These classifications are supported by tertiary structure superposition analysis. Tertiary structures from a number of families are very similar, suggesting that these families may come from a single distant ancestor. Normal vibrational mode analysis was conducted on experimentally determined freestanding structures, showing greater fluctuations at chain termini and loops than in most helices. Their modes overlap more so within families than between different families. The tertiary structures of three acyl carrier protein families that lacked any known structures were predicted as well. PMID:22374859
Christensen, Anders S.; Linnet, Troels E.; Borg, Mikael; Boomsma, Wouter; Lindorff-Larsen, Kresten; Hamelryck, Thomas; Jensen, Jan H.
2013-01-01
We present the ProCS method for the rapid and accurate prediction of protein backbone amide proton chemical shifts - sensitive probes of the geometry of key hydrogen bonds that determine protein structure. ProCS is parameterized against quantum mechanical (QM) calculations and reproduces high level QM results obtained for a small protein with an RMSD of 0.25 ppm (r = 0.94). ProCS is interfaced with the PHAISTOS protein simulation program and is used to infer statistical protein ensembles that reflect experimentally measured amide proton chemical shift values. Such chemical shift-based structural refinements, starting from high-resolution X-ray structures of Protein G, ubiquitin, and SMN Tudor Domain, result in average chemical shifts, hydrogen bond geometries, and trans-hydrogen bond (h3 JNC') spin-spin coupling constants that are in excellent agreement with experiment. We show that the structural sensitivity of the QM-based amide proton chemical shift predictions is needed to obtain this agreement. The ProCS method thus offers a powerful new tool for refining the structures of hydrogen bonding networks to high accuracy with many potential applications such as protein flexibility in ligand binding. PMID:24391900
Structural perturbations on huntingtin N17 domain during its folding on 2D-nanomaterials
NASA Astrophysics Data System (ADS)
Zhang, Leili; Feng, Mei; Zhou, Ruhong; Luan, Binquan
2017-09-01
A globular protein’s folded structure in its physiological environment is largely determined by its amino acid sequence. Recently, newly discovered transformer proteins as well as intrinsically disordered proteins may adopt the folding-upon-binding mechanism where their secondary structures are highly dependent on their binding partners. Due to the various applications of nanomaterials in biological sensors and potential wearable devices, it is important to discover possible conformational changes of proteins on nanomaterials. Here, through molecular dynamics simulations, we show that the first 17 residues of the huntingtin protein (HTT-N17) exhibit appreciable differences during its folding on 2D-nanomaterials, such as graphene and MoS2 nanosheets. Namely, the protein is disordered on the graphene surface but is helical on the MoS2 surface. Despite that the amphiphilic environment at the nanosheet-water interface promotes the folding of the amphipathic proteins (such as HTT-N17), competitions between protein-nanosheet and intra-protein interactions yield very different protein conformations. Therefore, as engineered binding partners, nanomaterials might significantly affect the structures of adsorbed proteins.
Discovering rules for protein-ligand specificity using support vector inductive logic programming.
Kelley, Lawrence A; Shrimpton, Paul J; Muggleton, Stephen H; Sternberg, Michael J E
2009-09-01
Structural genomics initiatives are rapidly generating vast numbers of protein structures. Comparative modelling is also capable of producing accurate structural models for many protein sequences. However, for many of the known structures, functions are not yet determined, and in many modelling tasks, an accurate structural model does not necessarily tell us about function. Thus, there is a pressing need for high-throughput methods for determining function from structure. The spatial arrangement of key amino acids in a folded protein, on the surface or buried in clefts, is often the determinants of its biological function. A central aim of molecular biology is to understand the relationship between such substructures or surfaces and biological function, leading both to function prediction and to function design. We present a new general method for discovering the features of binding pockets that confer specificity for particular ligands. Using a recently developed machine-learning technique which couples the rule-discovery approach of inductive logic programming with the statistical learning power of support vector machines, we are able to discriminate, with high precision (90%) and recall (86%) between pockets that bind FAD and those that bind NAD on a large benchmark set given only the geometry and composition of the backbone of the binding pocket without the use of docking. In addition, we learn rules governing this specificity which can feed into protein functional design protocols. An analysis of the rules found suggests that key features of the binding pocket may be tied to conformational freedom in the ligand. The representation is sufficiently general to be applicable to any discriminatory binding problem. All programs and data sets are freely available to non-commercial users at http://www.sbg.bio.ic.ac.uk/svilp_ligand/.
Fast photochemical oxidation of proteins (FPOP) maps the epitope of EGFR binding to adnectin.
Yan, Yuetian; Chen, Guodong; Wei, Hui; Huang, Richard Y-C; Mo, Jingjie; Rempel, Don L; Tymiak, Adrienne A; Gross, Michael L
2014-12-01
Epitope mapping is an important tool for the development of monoclonal antibodies, mAbs, as therapeutic drugs. Recently, a class of therapeutic mAb alternatives, adnectins, has been developed as targeted biologics. They are derived from the 10th type III domain of human fibronectin ((10)Fn3). A common approach to map the epitope binding of these therapeutic proteins to their binding partners is X-ray crystallography. Although the crystal structure is known for Adnectin 1 binding to human epidermal growth factor receptor (EGFR), we seek to determine complementary binding in solution and to test the efficacy of footprinting for this purpose. As a relatively new tool in structural biology and complementary to X-ray crystallography, protein footprinting coupled with mass spectrometry is promising for protein-protein interaction studies. We report here the use of fast photochemical oxidation of proteins (FPOP) coupled with MS to map the epitope of EGFR-Adnectin 1 at both the peptide and amino-acid residue levels. The data correlate well with the previously determined epitopes from the crystal structure and are consistent with HDX MS data, which are presented in an accompanying paper. The FPOP-determined binding interface involves various amino-acid and peptide regions near the N terminus of EGFR. The outcome adds credibility to oxidative labeling by FPOP for epitope mapping and motivates more applications in the therapeutic protein area as a stand-alone method or in conjunction with X-ray crystallography, NMR, site-directed mutagenesis, and other orthogonal methods.
Fast Photochemical Oxidation of Proteins (FPOP) Maps the Epitope of EGFR Binding to Adnectin
NASA Astrophysics Data System (ADS)
Yan, Yuetian; Chen, Guodong; Wei, Hui; Huang, Richard Y.-C.; Mo, Jingjie; Rempel, Don L.; Tymiak, Adrienne A.; Gross, Michael L.
2014-12-01
Epitope mapping is an important tool for the development of monoclonal antibodies, mAbs, as therapeutic drugs. Recently, a class of therapeutic mAb alternatives, adnectins, has been developed as targeted biologics. They are derived from the 10th type III domain of human fibronectin (10Fn3). A common approach to map the epitope binding of these therapeutic proteins to their binding partners is X-ray crystallography. Although the crystal structure is known for Adnectin 1 binding to human epidermal growth factor receptor (EGFR), we seek to determine complementary binding in solution and to test the efficacy of footprinting for this purpose. As a relatively new tool in structural biology and complementary to X-ray crystallography, protein footprinting coupled with mass spectrometry is promising for protein-protein interaction studies. We report here the use of fast photochemical oxidation of proteins (FPOP) coupled with MS to map the epitope of EGFR-Adnectin 1 at both the peptide and amino-acid residue levels. The data correlate well with the previously determined epitopes from the crystal structure and are consistent with HDX MS data, which are presented in an accompanying paper. The FPOP-determined binding interface involves various amino-acid and peptide regions near the N terminus of EGFR. The outcome adds credibility to oxidative labeling by FPOP for epitope mapping and motivates more applications in the therapeutic protein area as a stand-alone method or in conjunction with X-ray crystallography, NMR, site-directed mutagenesis, and other orthogonal methods.
A Practical Approach to Protein Crystallography.
Ilari, Andrea; Savino, Carmelinda
2017-01-01
Macromolecular crystallography is a powerful tool for structural biology. The resolution of a protein crystal structure is becoming much easier than in the past, thanks to developments in computing, automation of crystallization techniques and high-flux synchrotron sources to collect diffraction datasets. The aim of this chapter is to provide practical procedures to determine a protein crystal structure, illustrating the new techniques, experimental methods, and software that have made protein crystallography a tool accessible to a larger scientific community.It is impossible to give more than a taste of what the X-ray crystallographic technique entails in one brief chapter and there are different ways to solve a protein structure. Since the number of structures available in the Protein Data Bank (PDB) is becoming ever larger (the protein data bank now contains more than 100,000 entries) and therefore the probability to find a good model to solve the structure is ever increasing, we focus our attention on the Molecular Replacement method. Indeed, whenever applicable, this method allows the resolution of macromolecular structures starting from a single data set and a search model downloaded from the PDB, with the aid only of computer work.
Orientation determination of interfacial beta-sheet structures in situ.
Nguyen, Khoi Tan; King, John Thomas; Chen, Zhan
2010-07-01
Structural information such as orientations of interfacial proteins and peptides is important for understanding properties and functions of such biological molecules, which play crucial roles in biological applications and processes such as antimicrobial selectivity, membrane protein activity, biocompatibility, and biosensing performance. The alpha-helical and beta-sheet structures are the most widely encountered secondary structures in peptides and proteins. In this paper, for the first time, a method to quantify the orientation of the interfacial beta-sheet structure using a combined attenuated total reflectance Fourier transformation infrared spectroscopic (ATR-FTIR) and sum frequency generation (SFG) vibrational spectroscopic study was developed. As an illustration of the methodology, the orientation of tachyplesin I, a 17 amino acid peptide with an antiparallel beta-sheet, adsorbed to polymer surfaces as well as associated with a lipid bilayer was determined using the regular and chiral SFG spectra, together with polarized ATR-FTIR amide I signals. Both the tilt angle (theta) and the twist angle (psi) of the beta-sheet at interfaces are determined. The developed method in this paper can be used to obtain in situ structural information of beta-sheet components in complex molecules. The combination of this method and the existing methodology that is currently used to investigate alpha-helical structures will greatly broaden the application of optical spectroscopy in physical chemistry, biochemistry, biophysics, and structural biology.
Takeda, Mitsuhiro; Ono, Akira M; Terauchi, Tsutomu; Kainosho, Masatsune
2010-01-01
The extensive collection of NOE constraint data involving the aromatic ring signals is essential for accurate protein structure determination, although it is often hampered in practice by the pervasive signal overlapping and tight spin couplings for aromatic rings. We have prepared various types of stereo-array isotope labeled phenylalanines (epsilon- and zeta-SAIL Phe) and tyrosine (epsilon-SAIL Tyr) to overcome these problems (Torizawa et al. 2005), and proven that these SAIL amino acids provide dramatic spectral simplification and sensitivity enhancement for the aromatic ring NMR signals. In addition to these SAIL aromatic amino acids, we recently synthesized delta-SAIL Phe and delta-SAIL Tyr, which allow us to observe and assign delta-(13)C/(1)H signals very efficiently. Each of the various types of SAIL Phe and SAIL Tyr yields well-resolved resonances for the delta-, epsilon- or zeta-(13)C/(1)H signals, respectively, which can readily be assigned by simple and robust pulse sequences. Since the delta-, epsilon-, and zeta-proton signals of Phe/Tyr residues give rise to complementary NOE constraints, the concomitant use of various types of SAIL-Phe and SAIL-Tyr would generate more accurate protein structures, as compared to those obtained by using conventional uniformly (13)C, (15)N-double labeled proteins. We illustrated this with the case of an 18.2 kDa protein, Escherichia coli peptidyl-prolyl cis-trans isomerase b (EPPIb), and concluded that the combined use of zeta-SAIL Phe and epsilon-SAIL Tyr would be practically the best choice for protein structural determinations.
Crystallization of Proteins from Crude Bovine Rod Outer Segments☆
Baker, Bo Y.; Gulati, Sahil; Shi, Wuxian; Wang, Benlian; Stewart, Phoebe L.; Palczewski, Krzysztof
2015-01-01
Obtaining protein crystals suitable for X-ray diffraction studies comprises the greatest challenge in the determination of protein crystal structures, especially for membrane proteins and protein complexes. Although high purity has been broadly accepted as one of the most significant requirements for protein crystallization, a recent study of the Escherichia coli proteome showed that many proteins have an inherent propensity to crystallize and do not require a highly homogeneous sample (Totir et al., 2012). As exemplified by RPE65 (Kiser, Golczak, Lodowski, Chance, & Palczewski, 2009), there also are cases of mammalian proteins crystallized from less purified samples. To test whether this phenomenon can be applied more broadly to the study of proteins from higher organisms, we investigated the protein crystallization profile of bovine rod outer segment (ROS) crude extracts. Interestingly, multiple protein crystals readily formed from such extracts, some of them diffracting to high resolution that allowed structural determination. A total of seven proteins were crystallized, one of which was a membrane protein. Successful crystallization of proteins from heterogeneous ROS extracts demonstrates that many mammalian proteins also have an intrinsic propensity to crystallize from complex biological mixtures. By providing an alternative approach to heterologous expression to achieve crystallization, this strategy could be useful for proteins and complexes that are difficult to purify or obtain by recombinant techniques. PMID:25950977
NASA Astrophysics Data System (ADS)
Takemura, Kazuhiro; Guo, Hao; Sakuraba, Shun; Matubayasi, Nobuyuki; Kitao, Akio
2012-12-01
We propose a method to evaluate binding free energy differences among distinct protein-protein complex model structures through all-atom molecular dynamics simulations in explicit water using the solution theory in the energy representation. Complex model structures are generated from a pair of monomeric structures using the rigid-body docking program ZDOCK. After structure refinement by side chain optimization and all-atom molecular dynamics simulations in explicit water, complex models are evaluated based on the sum of their conformational and solvation free energies, the latter calculated from the energy distribution functions obtained from relatively short molecular dynamics simulations of the complex in water and of pure water based on the solution theory in the energy representation. We examined protein-protein complex model structures of two protein-protein complex systems, bovine trypsin/CMTI-1 squash inhibitor (PDB ID: 1PPE) and RNase SA/barstar (PDB ID: 1AY7), for which both complex and monomer structures were determined experimentally. For each system, we calculated the energies for the crystal complex structure and twelve generated model structures including the model most similar to the crystal structure and very different from it. In both systems, the sum of the conformational and solvation free energies tended to be lower for the structure similar to the crystal. We concluded that our energy calculation method is useful for selecting low energy complex models similar to the crystal structure from among a set of generated models.
Takemura, Kazuhiro; Guo, Hao; Sakuraba, Shun; Matubayasi, Nobuyuki; Kitao, Akio
2012-12-07
We propose a method to evaluate binding free energy differences among distinct protein-protein complex model structures through all-atom molecular dynamics simulations in explicit water using the solution theory in the energy representation. Complex model structures are generated from a pair of monomeric structures using the rigid-body docking program ZDOCK. After structure refinement by side chain optimization and all-atom molecular dynamics simulations in explicit water, complex models are evaluated based on the sum of their conformational and solvation free energies, the latter calculated from the energy distribution functions obtained from relatively short molecular dynamics simulations of the complex in water and of pure water based on the solution theory in the energy representation. We examined protein-protein complex model structures of two protein-protein complex systems, bovine trypsin/CMTI-1 squash inhibitor (PDB ID: 1PPE) and RNase SA/barstar (PDB ID: 1AY7), for which both complex and monomer structures were determined experimentally. For each system, we calculated the energies for the crystal complex structure and twelve generated model structures including the model most similar to the crystal structure and very different from it. In both systems, the sum of the conformational and solvation free energies tended to be lower for the structure similar to the crystal. We concluded that our energy calculation method is useful for selecting low energy complex models similar to the crystal structure from among a set of generated models.
Fukuda, Yohta; Miura, Yoshimasa; Mizohata, Eiichi; Inoue, Tsuyoshi
2017-08-01
Upon stopping metabolic processes, some tardigrades can undergo anhydrobiosis. Secretory abundant heat-soluble (SAHS) proteins have been reported as candidates for anhydrobiosis-related proteins in tardigrades, which seem to protect extracellular components and/or secretory organelles. We determined structures of a SAHS protein from Ramazzottius varieornatus (RvSAHS1), which is one of the toughest tardigrades. RvSAHS1 shows a β-barrel structure similar to fatty acid-binding proteins (FABPs), in which hydrophilic residues form peculiar hydrogen bond networks, which would provide RvSAHS1 with better tolerance against dehydration. We identified two putative ligand-binding sites: one that superimposes on those of some FABPs and the other, unique to and conserved in SAHS proteins. These results indicate that SAHS proteins constitute a new FABP family. © 2017 Federation of European Biochemical Societies.
Xia, Bing; Mamonov, Artem; Leysen, Seppe; Allen, Karen N; Strelkov, Sergei V; Paschalidis, Ioannis Ch; Vajda, Sandor; Kozakov, Dima
2015-07-30
The protein-protein docking server ClusPro is used by thousands of laboratories, and models built by the server have been reported in over 300 publications. Although the structures generated by the docking include near-native ones for many proteins, selecting the best model is difficult due to the uncertainty in scoring. Small angle X-ray scattering (SAXS) is an experimental technique for obtaining low resolution structural information in solution. While not sufficient on its own to uniquely predict complex structures, accounting for SAXS data improves the ranking of models and facilitates the identification of the most accurate structure. Although SAXS profiles are currently available only for a small number of complexes, due to its simplicity the method is becoming increasingly popular. Since combining docking with SAXS experiments will provide a viable strategy for fairly high-throughput determination of protein complex structures, the option of using SAXS restraints is added to the ClusPro server. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.
Experimental Protein Structure Verification by Scoring with a Single, Unassigned NMR Spectrum.
Courtney, Joseph M; Ye, Qing; Nesbitt, Anna E; Tang, Ming; Tuttle, Marcus D; Watt, Eric D; Nuzzio, Kristin M; Sperling, Lindsay J; Comellas, Gemma; Peterson, Joseph R; Morrissey, James H; Rienstra, Chad M
2015-10-06
Standard methods for de novo protein structure determination by nuclear magnetic resonance (NMR) require time-consuming data collection and interpretation efforts. Here we present a qualitatively distinct and novel approach, called Comparative, Objective Measurement of Protein Architectures by Scoring Shifts (COMPASS), which identifies the best structures from a set of structural models by numerical comparison with a single, unassigned 2D (13)C-(13)C NMR spectrum containing backbone and side-chain aliphatic signals. COMPASS does not require resonance assignments. It is particularly well suited for interpretation of magic-angle spinning solid-state NMR spectra, but also applicable to solution NMR spectra. We demonstrate COMPASS with experimental data from four proteins--GB1, ubiquitin, DsbA, and the extracellular domain of human tissue factor--and with reconstructed spectra from 11 additional proteins. For all these proteins, with molecular mass up to 25 kDa, COMPASS distinguished the correct fold, most often within 1.5 Å root-mean-square deviation of the reference structure. Copyright © 2015 Elsevier Ltd. All rights reserved.
Evaluation of Software for Introducing Protein Structure: Visualization and Simulation
ERIC Educational Resources Information Center
White, Brian; Kahriman, Azmin; Luberice, Lois; Idleh, Farhia
2010-01-01
Communicating an understanding of the forces and factors that determine a protein's structure is an important goal of many biology and biochemistry courses at a variety of levels. Many educators use computer software that allows visualization of these complex molecules for this purpose. Although visualization is in wide use and has been associated…
Mechanistic aspects of protein corona formation: insulin adsorption onto gold nanoparticle surfaces
NASA Astrophysics Data System (ADS)
Grass, Stefan; Treuel, Lennart
2014-02-01
In biological fluids, an adsorption layer of proteins, a "protein corona" forms around nanoparticles (NPs) largely determining their biological identity. In many interactions with NPs proteins can undergo structural changes. Here, we study the adsorption of insulin onto gold NPs (mean hydrodynamic particle diameter 80 ± 18 nm), focusing on the structural consequences of the adsorption process for the protein. We use surface enhanced Raman scattering (SERS) spectroscopy to study changes in the protein's secondary structure as well as the impact on integrity and conformations of disulfide bonds immediately on the NP surface. A detailed comparison to SERS spectra of cysteine and cystine provides first mechanistic insights into the causes for these conformational changes. Potential biological and toxicological implications of these findings are also discussed.
Prediction of Water Binding to Protein Hydration Sites with a Discrete, Semiexplicit Solvent Model.
Setny, Piotr
2015-12-08
Buried water molecules are ubiquitous in protein structures and are found at the interface of most protein-ligand complexes. Determining their distribution and thermodynamic effect is a challenging yet important task, of great of practical value for the modeling of biomolecular structures and their interactions. In this study, we present a novel method aimed at the prediction of buried water molecules in protein structures and estimation of their binding free energies. It is based on a semiexplicit, discrete solvation model, which we previously introduced in the context of small molecule hydration. The method is applicable to all macromolecular structures described by a standard all-atom force field, and predicts complete solvent distribution within a single run with modest computational cost. We demonstrate that it indicates positions of buried hydration sites, including those filled by more than one water molecule, and accurately differentiates them from sterically accessible to water but void regions. The obtained estimates of water binding free energies are in fair agreement with reference results determined with the double decoupling method.
Thermodynamic prediction of protein neutrality.
Bloom, Jesse D; Silberg, Jonathan J; Wilke, Claus O; Drummond, D Allan; Adami, Christoph; Arnold, Frances H
2005-01-18
We present a simple theory that uses thermodynamic parameters to predict the probability that a protein retains the wild-type structure after one or more random amino acid substitutions. Our theory predicts that for large numbers of substitutions the probability that a protein retains its structure will decline exponentially with the number of substitutions, with the severity of this decline determined by properties of the structure. Our theory also predicts that a protein can gain extra robustness to the first few substitutions by increasing its thermodynamic stability. We validate our theory with simulations on lattice protein models and by showing that it quantitatively predicts previously published experimental measurements on subtilisin and our own measurements on variants of TEM1 beta-lactamase. Our work unifies observations about the clustering of functional proteins in sequence space, and provides a basis for interpreting the response of proteins to substitutions in protein engineering applications.
Thermodynamic prediction of protein neutrality
Bloom, Jesse D.; Silberg, Jonathan J.; Wilke, Claus O.; Drummond, D. Allan; Adami, Christoph; Arnold, Frances H.
2005-01-01
We present a simple theory that uses thermodynamic parameters to predict the probability that a protein retains the wild-type structure after one or more random amino acid substitutions. Our theory predicts that for large numbers of substitutions the probability that a protein retains its structure will decline exponentially with the number of substitutions, with the severity of this decline determined by properties of the structure. Our theory also predicts that a protein can gain extra robustness to the first few substitutions by increasing its thermodynamic stability. We validate our theory with simulations on lattice protein models and by showing that it quantitatively predicts previously published experimental measurements on subtilisin and our own measurements on variants of TEM1 β-lactamase. Our work unifies observations about the clustering of functional proteins in sequence space, and provides a basis for interpreting the response of proteins to substitutions in protein engineering applications. PMID:15644440
Lu, Hui-Meng; Yin, Da-Chuan; Ye, Ya-Jing; Luo, Hui-Min; Geng, Li-Qiang; Li, Hai-Sheng; Guo, Wei-Hong; Shang, Peng
2009-01-01
As the most widely utilized technique to determine the 3-dimensional structure of protein molecules, X-ray crystallography can provide structure of the highest resolution among the developed techniques. The resolution obtained via X-ray crystallography is known to be influenced by many factors, such as the crystal quality, diffraction techniques, and X-ray sources, etc. In this paper, the authors found that the protein sequence could also be one of the factors. We extracted information of the resolution and the sequence of proteins from the Protein Data Bank (PDB), classified the proteins into different clusters according to the sequence similarity, and statistically analyzed the relationship between the sequence similarity and the best resolution obtained. The results showed that there was a pronounced correlation between the sequence similarity and the obtained resolution. These results indicate that protein structure itself is one variable that may affect resolution when X-ray crystallography is used.
Baltoumas, Fotis A; Theodoropoulou, Margarita C; Hamodrakas, Stavros J
2016-06-01
A significant amount of experimental evidence suggests that G-protein coupled receptors (GPCRs) do not act exclusively as monomers but also form biologically relevant dimers and oligomers. However, the structural determinants, stoichiometry and functional importance of GPCR oligomerization remain topics of intense speculation. In this study we attempted to evaluate the nature and dynamics of GPCR oligomeric interactions. A representative set of GPCR homodimers were studied through Coarse-Grained Molecular Dynamics simulations, combined with interface analysis and concepts from network theory for the construction and analysis of dynamic structural networks. Our results highlight important structural determinants that seem to govern receptor dimer interactions. A conserved dynamic behavior was observed among different GPCRs, including receptors belonging in different GPCR classes. Specific GPCR regions were highlighted as the core of the interfaces. Finally, correlations of motion were observed between parts of the dimer interface and GPCR segments participating in ligand binding and receptor activation, suggesting the existence of mechanisms through which dimer formation may affect GPCR function. The results of this study can be used to drive experiments aimed at exploring GPCR oligomerization, as well as in the study of transmembrane protein-protein interactions in general.
NASA Astrophysics Data System (ADS)
Baltoumas, Fotis A.; Theodoropoulou, Margarita C.; Hamodrakas, Stavros J.
2016-06-01
A significant amount of experimental evidence suggests that G-protein coupled receptors (GPCRs) do not act exclusively as monomers but also form biologically relevant dimers and oligomers. However, the structural determinants, stoichiometry and functional importance of GPCR oligomerization remain topics of intense speculation. In this study we attempted to evaluate the nature and dynamics of GPCR oligomeric interactions. A representative set of GPCR homodimers were studied through Coarse-Grained Molecular Dynamics simulations, combined with interface analysis and concepts from network theory for the construction and analysis of dynamic structural networks. Our results highlight important structural determinants that seem to govern receptor dimer interactions. A conserved dynamic behavior was observed among different GPCRs, including receptors belonging in different GPCR classes. Specific GPCR regions were highlighted as the core of the interfaces. Finally, correlations of motion were observed between parts of the dimer interface and GPCR segments participating in ligand binding and receptor activation, suggesting the existence of mechanisms through which dimer formation may affect GPCR function. The results of this study can be used to drive experiments aimed at exploring GPCR oligomerization, as well as in the study of transmembrane protein-protein interactions in general.
Structure, Function, and Evolution of Biogenic Amine-binding Proteins in Soft Ticks
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mans, Ben J.; Ribeiro, Jose M.C.; Andersen, John F.
2008-08-19
Two highly abundant lipocalins, monomine and monotonin, have been isolated from the salivary gland of the soft tick Argas monolakensis and shown to bind histamine and 5-hydroxytryptamine (5-HT), respectively. The crystal structures of monomine and a paralog of monotonin were determined in the presence of ligands to compare the determinants of ligand binding. Both the structures and binding measurements indicate that the proteins have a single binding site rather than the two sites previously described for the female-specific histamine-binding protein (FS-HBP), the histamine-binding lipocalin of the tick Rhipicephalus appendiculatus. The binding sites of monomine and monotonin are similar to themore » lower, low affinity site of FS-HBP. The interaction of the protein with the aliphatic amine group of the ligand is very similar for the all of the proteins, whereas specificity is determined by interactions with the aromatic portion of the ligand. Interestingly, protein interaction with the imidazole ring of histamine differs significantly between the low affinity binding site of FS-HBP and monomine, suggesting that histamine binding has evolved independently in the two lineages. From the conserved features of these proteins, a tick lipocalin biogenic amine-binding motif could be derived that was used to predict biogenic amine-binding function in other tick lipocalins. Heterologous expression of genes from salivary gland libraries led to the discovery of biogenic amine-binding proteins in soft (Ornithodoros) and hard (Ixodes) tick genera. The data generated were used to reconstruct the most probable evolutionary pathway for the evolution of biogenic amine-binding in tick lipocalins.« less
Zhang, Hong; Wang, Guangwen; Li, Jian; Nie, Yuchun; Shi, Xuanling; Lian, Gewei; Wang, Wei; Yin, Xiaolei; Zhao, Yang; Qu, Xiuxia; Ding, Mingxiao; Deng, Hongkui
2004-07-01
Severe acute respiratory syndrome (SARS) is a life-threatening disease caused by a newly identified coronavirus (CoV), SARS-CoV. The spike (S) glycoprotein of CoV is the major structural protein responsible for induction of host immune response and virus neutralization by antibodies. Hence, knowledge of neutralization determinants on the S protein is helpful for designing protective vaccines. To analyze the antigenic structure of the SARS-CoV S2 domain, the carboxyl-terminal half of the S protein, we first used sera from convalescent SARS patients to test the antigenicity of 12 overlapping fragments spanning the entire S2 and identified two antigenic determinants (Leu 803 to Ala 828 and Pro 1061 to Ser 1093). To determine whether neutralizing antibodies can be elicited by these two determinants, we immunized animals and found that both of them could induce the S2-specific antisera. In some animals, however, only one determinant (Leu 803 to Ala 828) was able to induce the antisera with the binding ability to the native S protein and the neutralizing activity to the SARS-CoV pseudovirus. This determinant is highly conserved across different SARS-CoV isolates. Identification of a conserved antigenic determinant on the S2 domain of the SARS-CoV S protein, which has the potential for inducing neutralizing antibodies, has implications in the development of effective vaccines against SARS-CoV.
Kryshtafovych, Andriy; Moult, John; Bales, Patrick; Bazan, J Fernando; Biasini, Marco; Burgin, Alex; Chen, Chen; Cochran, Frank V; Craig, Timothy K; Das, Rhiju; Fass, Deborah; Garcia-Doval, Carmela; Herzberg, Osnat; Lorimer, Donald; Luecke, Hartmut; Ma, Xiaolei; Nelson, Daniel C; van Raaij, Mark J; Rohwer, Forest; Segall, Anca; Seguritan, Victor; Zeth, Kornelius; Schwede, Torsten
2014-02-01
For the last two decades, CASP has assessed the state of the art in techniques for protein structure prediction and identified areas which required further development. CASP would not have been possible without the prediction targets provided by the experimental structural biology community. In the latest experiment, CASP10, more than 100 structures were suggested as prediction targets, some of which appeared to be extraordinarily difficult for modeling. In this article, authors of some of the most challenging targets discuss which specific scientific question motivated the experimental structure determination of the target protein, which structural features were especially interesting from a structural or functional perspective, and to what extent these features were correctly reproduced in the predictions submitted to CASP10. Specifically, the following targets will be presented: the acid-gated urea channel, a difficult to predict transmembrane protein from the important human pathogen Helicobacter pylori; the structure of human interleukin (IL)-34, a recently discovered helical cytokine; the structure of a functionally uncharacterized enzyme OrfY from Thermoproteus tenax formed by a gene duplication and a novel fold; an ORFan domain of mimivirus sulfhydryl oxidase R596; the fiber protein gene product 17 from bacteriophage T7; the bacteriophage CBA-120 tailspike protein; a virus coat protein from metagenomic samples of the marine environment; and finally, an unprecedented class of structure prediction targets based on engineered disulfide-rich small proteins. Copyright © 2013 The Authors. Wiley Periodicals, Inc.
Protein structure database search and evolutionary classification.
Yang, Jinn-Moon; Tung, Chi-Hua
2006-01-01
As more protein structures become available and structural genomics efforts provide structural models in a genome-wide strategy, there is a growing need for fast and accurate methods for discovering homologous proteins and evolutionary classifications of newly determined structures. We have developed 3D-BLAST, in part, to address these issues. 3D-BLAST is as fast as BLAST and calculates the statistical significance (E-value) of an alignment to indicate the reliability of the prediction. Using this method, we first identified 23 states of the structural alphabet that represent pattern profiles of the backbone fragments and then used them to represent protein structure databases as structural alphabet sequence databases (SADB). Our method enhanced BLAST as a search method, using a new structural alphabet substitution matrix (SASM) to find the longest common substructures with high-scoring structured segment pairs from an SADB database. Using personal computers with Intel Pentium4 (2.8 GHz) processors, our method searched more than 10 000 protein structures in 1.3 s and achieved a good agreement with search results from detailed structure alignment methods. [3D-BLAST is available at http://3d-blast.life.nctu.edu.tw].
Buried and accessible surface area control intrinsic protein flexibility.
Marsh, Joseph A
2013-09-09
Proteins experience a wide variety of conformational dynamics that can be crucial for facilitating their diverse functions. How is the intrinsic flexibility required for these motions encoded in their three-dimensional structures? Here, the overall flexibility of a protein is demonstrated to be tightly coupled to the total amount of surface area buried within its fold. A simple proxy for this, the relative solvent-accessible surface area (Arel), therefore shows excellent agreement with independent measures of global protein flexibility derived from various experimental and computational methods. Application of Arel on a large scale demonstrates its utility by revealing unique sequence and structural properties associated with intrinsic flexibility. In particular, flexibility as measured by Arel shows little correspondence with intrinsic disorder, but instead tends to be associated with multiple domains and increased α-helical structure. Furthermore, the apparent flexibility of monomeric proteins is found to be useful for identifying quaternary-structure errors in published crystal structures. There is also a strong tendency for the crystal structures of more flexible proteins to be solved to lower resolutions. Finally, local solvent accessibility is shown to be a primary determinant of local residue flexibility. Overall, this work provides both fundamental mechanistic insight into the origin of protein flexibility and a simple, practical method for predicting flexibility from protein structures. © 2013 Elsevier Ltd. All rights reserved.
Physical–chemical determinants of coil conformations in globular proteins
Perskie, Lauren L; Rose, George D
2010-01-01
We present a method with the potential to generate a library of coil segments from first principles. Proteins are built from α-helices and/or β-strands interconnected by these coil segments. Here, we investigate the conformational determinants of short coil segments, with particular emphasis on chain turns. Toward this goal, we extracted a comprehensive set of two-, three-, and four-residue turns from X-ray–elucidated proteins and classified them by conformation. A remarkably small number of unique conformers account for most of this experimentally determined set, whereas remaining members span a large number of rare conformers, many occurring only once in the entire protein database. Factors determining conformation were identified via Metropolis Monte Carlo simulations devised to test the effectiveness of various energy terms. Simulated structures were validated by comparison to experimental counterparts. After filtering rare conformers, we found that 98% of the remaining experimentally determined turn population could be reproduced by applying a hydrogen bond energy term to an exhaustively generated ensemble of clash-free conformers in which no backbone polar group lacks a hydrogen-bond partner. Further, at least 90% of longer coil segments, ranging from 5- to 20 residues, were found to be structural composites of these shorter primitives. These results are pertinent to protein structure prediction, where approaches can be divided into either empirical or ab initio methods. Empirical methods use database-derived information; ab initio methods rely on physical–chemical principles exclusively. Replacing the database-derived coil library with one generated from first principles would transform any empirically based method into its corresponding ab initio homologue. PMID:20512968
Gupta, Vibha; Gupta, Rakesh K.; Khare, Garima; Salunke, Dinakar M.; Tyagi, Anil K.
2009-01-01
Emergence of tuberculosis as a global health threat has necessitated an urgent search for new antitubercular drugs entailing determination of 3-dimensional structures of a large number of mycobacterial proteins for structure-based drug design. The essential requirement of ferritins/bacterioferritins (proteins involved in iron storage and homeostasis) for the survival of several prokaryotic pathogens makes these proteins very attractive targets for structure determination and inhibitor design. Bacterioferritins (Bfrs) differ from ferritins in that they have additional noncovalently bound haem groups. The physiological role of haem in Bfrs is not very clear but studies indicate that the haem group is involved in mediating release of iron from Bfr by facilitating reduction of the iron core. To further enhance our understanding, we have determined the crystal structure of the selenomethionyl analog of bacterioferritin A (SeMet-BfrA) from Mycobacterium tuberculosis (Mtb). Unexpectedly, electron density observed in the crystals of SeMet-BfrA analogous to haem location in bacterioferritins, shows a demetallated and degraded product of haem. This unanticipated observation is a consequence of the altered spatial electronic environment around the axial ligands of haem (in lieu of Met52 modification to SeMet52). Furthermore, the structure of Mtb SeMet-BfrA displays a possible lost protein interaction with haem propionates due to formation of a salt bridge between Arg53-Glu57, which appears to be unique to Mtb BfrA, resulting in slight modulation of haem binding pocket in this organism. The crystal structure of Mtb SeMet-BfrA provides novel leads to physiological function of haem in Bfrs. If validated as a drug target, it may also serve as a scaffold for designing specific inhibitors. In addition, this study provides evidence against the general belief that a selenium derivative of a protein represents its true physiological native structure. PMID:19946376
Dubois, G C; Robinson, E A; Inman, J K; Perham, R N; Appella, E
1981-01-01
Methylamine buffers can be used for the rapid quantitative removal of acetimidoyl groups from proteins and peptides modified by treatment with ethyl or methyl acetimidate. The half-life for displacement of acetimidoyl groups from fully amidinated proteins incubated in 3.44 M-methylamine/HCl buffer at pH 11.5 and 25 degrees C was approx. 26 min; this half life is 29 times less than that observed in ammonia/HCl buffer under the same conditions of pH and amine concentration. Incubation of acetimidated proteins with methylamine for 4 h resulted in greater than 95% removal of acetimidoyl groups. No deleterious effects on primary structure were detected by amino acid analysis or by automated Edman degradation. Reversible amidination of lysine residues, in conjunction with tryptic digestion, has been successfully applied to the determination of the amino acid sequence of an acetimidated mouse immunoglobulin heavy chain peptide. The regeneration of amino groups in amidinated proteins and peptides by methylaminolysis makes amidination a valuable alternative to citraconoylation and maleoylation in structural studies. PMID:6803762
Visualizing ligand molecules in Twilight electron density.
Weichenberger, Christian X; Pozharski, Edwin; Rupp, Bernhard
2013-02-01
Three-dimensional models of protein structures determined by X-ray crystallography are based on the interpretation of experimentally derived electron-density maps. The real-space correlation coefficient (RSCC) provides an easily comprehensible, objective measure of the residue-based fit of atom coordinates to electron density. Among protein structure models, protein-ligand complexes are of special interest, given their contribution to understanding the molecular underpinnings of biological activity and to drug design. For consumers of such models, it is not trivial to determine the degree to which ligand-structure modelling is biased by subjective electron-density interpretation. A standalone script, Twilight, is presented for the analysis, visualization and annotation of a pre-filtered set of 2815 protein-ligand complexes deposited with the PDB as of 15 January 2012 with ligand RSCC values that are below a threshold of 0.6. It also provides simplified access to the visualization of any protein-ligand complex available from the PDB and annotated by the Uppsala Electron Density Server. The script runs on various platforms and is available for download at http://www.ruppweb.org/twilight/.
Protein Folding—How and Why: By Hydrogen Exchange, Fragment Separation, and Mass Spectrometry
Englander, S. Walter; Mayne, Leland; Kan, Zhong-Yuan; Hu, Wenbing
2017-01-01
Advanced hydrogen exchange (HX) methodology can now determine the structure of protein folding intermediates and their progression in folding pathways. Key developments over time include the HX pulse labeling method with nuclear magnetic resonance analysis, development of the fragment separation method, the addition to it of mass spectrometric (MS) analysis, and recent improvements in the HX MS technique and data analysis. Also, the discovery of protein foldons and their role supplies an essential interpretive link. Recent work using HX pulse labeling with HX MS analysis finds that a number of proteins fold by stepping through a reproducible sequence of native-like intermediates in an ordered pathway. The stepwise nature of the pathway is dictated by the cooperative foldon unit construction of the protein. The pathway order is determined by a sequential stabilization principle; prior native-like structure guides the formation of adjacent native-like structure. This view does not match the funneled energy landscape paradigm of a very large number of folding tracks, which was framed before foldons were known. PMID:27145881
Characterization of the motion of membrane proteins using high-speed atomic force microscopy
NASA Astrophysics Data System (ADS)
Casuso, Ignacio; Khao, Jonathan; Chami, Mohamed; Paul-Gilloteaux, Perrine; Husain, Mohamed; Duneau, Jean-Pierre; Stahlberg, Henning; Sturgis, James N.; Scheuring, Simon
2012-08-01
For cells to function properly, membrane proteins must be able to diffuse within biological membranes. The functions of these membrane proteins depend on their position and also on protein-protein and protein-lipid interactions. However, so far, it has not been possible to study simultaneously the structure and dynamics of biological membranes. Here, we show that the motion of unlabelled membrane proteins can be characterized using high-speed atomic force microscopy. We find that the molecules of outer membrane protein F (OmpF) are widely distributed in the membrane as a result of diffusion-limited aggregation, and while the overall protein motion scales roughly with the local density of proteins in the membrane, individual protein molecules can also diffuse freely or become trapped by protein-protein interactions. Using these measurements, and the results of molecular dynamics simulations, we determine an interaction potential map and an interaction pathway for a membrane protein, which should provide new insights into the connection between the structures of individual proteins and the structures and dynamics of supramolecular membranes.
Structural Determination of Biomolecules in Microfluidic Systems
NASA Astrophysics Data System (ADS)
Butler, John C.; Menard, Etienne; Rogers, John A.; Wong, Gerard C. L.
2004-03-01
Supramolecular biological complexes are often too large to be crystallized for structural studies. Here, we explore the use of microfluidic arrays to order a model self-assembled cytoskeletal system. Filamentous actin (F-actin) is a negatively charged protein rod and is a key structural component in the eukaryotic cytoskeleton. In this context, F-actin can self-assemble with actin binding proteins (ABP) in a highly regulated manner to dynamically form structures for a wide range of biomechanical functions. In this work, we will systematically study the action of 3 types of actin binding proteins (a-actinin, fimbrin, cofilin) on the self-assembled structures of F-actin that have been aligned in microfluidic arrays.
ECOD: An Evolutionary Classification of Protein Domains
Kinch, Lisa N.; Pei, Jimin; Shi, Shuoyong; Kim, Bong-Hyun; Grishin, Nick V.
2014-01-01
Understanding the evolution of a protein, including both close and distant relationships, often reveals insight into its structure and function. Fast and easy access to such up-to-date information facilitates research. We have developed a hierarchical evolutionary classification of all proteins with experimentally determined spatial structures, and presented it as an interactive and updatable online database. ECOD (Evolutionary Classification of protein Domains) is distinct from other structural classifications in that it groups domains primarily by evolutionary relationships (homology), rather than topology (or “fold”). This distinction highlights cases of homology between domains of differing topology to aid in understanding of protein structure evolution. ECOD uniquely emphasizes distantly related homologs that are difficult to detect, and thus catalogs the largest number of evolutionary links among structural domain classifications. Placing distant homologs together underscores the ancestral similarities of these proteins and draws attention to the most important regions of sequence and structure, as well as conserved functional sites. ECOD also recognizes closer sequence-based relationships between protein domains. Currently, approximately 100,000 protein structures are classified in ECOD into 9,000 sequence families clustered into close to 2,000 evolutionary groups. The classification is assisted by an automated pipeline that quickly and consistently classifies weekly releases of PDB structures and allows for continual updates. This synchronization with PDB uniquely distinguishes ECOD among all protein classifications. Finally, we present several case studies of homologous proteins not recorded in other classifications, illustrating the potential of how ECOD can be used to further biological and evolutionary studies. PMID:25474468
ECOD: an evolutionary classification of protein domains.
Cheng, Hua; Schaeffer, R Dustin; Liao, Yuxing; Kinch, Lisa N; Pei, Jimin; Shi, Shuoyong; Kim, Bong-Hyun; Grishin, Nick V
2014-12-01
Understanding the evolution of a protein, including both close and distant relationships, often reveals insight into its structure and function. Fast and easy access to such up-to-date information facilitates research. We have developed a hierarchical evolutionary classification of all proteins with experimentally determined spatial structures, and presented it as an interactive and updatable online database. ECOD (Evolutionary Classification of protein Domains) is distinct from other structural classifications in that it groups domains primarily by evolutionary relationships (homology), rather than topology (or "fold"). This distinction highlights cases of homology between domains of differing topology to aid in understanding of protein structure evolution. ECOD uniquely emphasizes distantly related homologs that are difficult to detect, and thus catalogs the largest number of evolutionary links among structural domain classifications. Placing distant homologs together underscores the ancestral similarities of these proteins and draws attention to the most important regions of sequence and structure, as well as conserved functional sites. ECOD also recognizes closer sequence-based relationships between protein domains. Currently, approximately 100,000 protein structures are classified in ECOD into 9,000 sequence families clustered into close to 2,000 evolutionary groups. The classification is assisted by an automated pipeline that quickly and consistently classifies weekly releases of PDB structures and allows for continual updates. This synchronization with PDB uniquely distinguishes ECOD among all protein classifications. Finally, we present several case studies of homologous proteins not recorded in other classifications, illustrating the potential of how ECOD can be used to further biological and evolutionary studies.
DWARF – a data warehouse system for analyzing protein families
Fischer, Markus; Thai, Quan K; Grieb, Melanie; Pleiss, Jürgen
2006-01-01
Background The emerging field of integrative bioinformatics provides the tools to organize and systematically analyze vast amounts of highly diverse biological data and thus allows to gain a novel understanding of complex biological systems. The data warehouse DWARF applies integrative bioinformatics approaches to the analysis of large protein families. Description The data warehouse system DWARF integrates data on sequence, structure, and functional annotation for protein fold families. The underlying relational data model consists of three major sections representing entities related to the protein (biochemical function, source organism, classification to homologous families and superfamilies), the protein sequence (position-specific annotation, mutant information), and the protein structure (secondary structure information, superimposed tertiary structure). Tools for extracting, transforming and loading data from public available resources (ExPDB, GenBank, DSSP) are provided to populate the database. The data can be accessed by an interface for searching and browsing, and by analysis tools that operate on annotation, sequence, or structure. We applied DWARF to the family of α/β-hydrolases to host the Lipase Engineering database. Release 2.3 contains 6138 sequences and 167 experimentally determined protein structures, which are assigned to 37 superfamilies 103 homologous families. Conclusion DWARF has been designed for constructing databases of large structurally related protein families and for evaluating their sequence-structure-function relationships by a systematic analysis of sequence, structure and functional annotation. It has been applied to predict biochemical properties from sequence, and serves as a valuable tool for protein engineering. PMID:17094801
Ikeya, Teppei; Takeda, Mitsuhiro; Yoshida, Hitoshi; Terauchi, Tsutomu; Jee, Jun-Goo; Kainosho, Masatsune; Güntert, Peter
2009-08-01
Stereo-array isotope labeling (SAIL) has been combined with the fully automated NMR structure determination algorithm FLYA to determine the three-dimensional structure of the protein ubiquitin from different sets of input NMR spectra. SAIL provides a complete stereo- and regio-specific pattern of stable isotopes that results in sharper resonance lines and reduced signal overlap, without information loss. Here we show that as a result of the superior quality of the SAIL NMR spectra, reliable, fully automated analyses of the NMR spectra and structure calculations are possible using fewer input spectra than with conventional uniformly 13C/15N-labeled proteins. FLYA calculations with SAIL ubiquitin, using a single three-dimensional "through-bond" spectrum (and 2D HSQC spectra) in addition to the 13C-edited and 15N-edited NOESY spectra for conformational restraints, yielded structures with an accuracy of 0.83-1.15 A for the backbone RMSD to the conventionally determined solution structure of SAIL ubiquitin. NMR structures can thus be determined almost exclusively from the NOESY spectra that yield the conformational restraints, without the need to record many spectra only for determining intermediate, auxiliary data of the chemical shift assignments. The FLYA calculations for this report resulted in 252 ubiquitin structure bundles, obtained with different input data but identical structure calculation and refinement methods. These structures cover the entire range from highly accurate structures to seriously, but not trivially, wrong structures, and thus constitute a valuable database for the substantiation of structure validation methods.
Golden rule for buttressing vulnerable soluble proteins.
Fernández, Ariel; Berry, R Stephen
2010-05-07
Local weaknesses in the structure of soluble proteins have received little attention. The structure may be inherently weak at sites where hydration of the protein backbone is locally hampered by formation of an intramolecular hydrogen bond which in turn is not fully stabilized through burial within a hydrophobic environment. The result is insufficient compensation for the thermodynamic cost of dehydrating the backbone polar groups. This work shows that these structural deficiencies, the unburied backbone hydrogen bonds, are compensated in natural proteins by disulfide bonds that are needed to maintain the structural integrity. Examination of all PDB-reported soluble structures reveals that, after suitable normalization, the number of disulfide bonds, X, correlates tightly with the number of unburied backbone hydrogen bonds, Y, beyond the baseline level Y = 20, revealing a simple balance relation: Y = 5X + 20. This equation introduces a 1:5 ratio associated with the buttressing of soluble proteins with structural deficiencies. The results are justified on thermodynamic grounds and have implications for biomolecular engineering as they introduce two constants of universal applicability determining the architecture of soluble proteins.
Structure prediction of polyglutamine disease proteins: comparison of methods
2014-01-01
Background The expansion of polyglutamine (poly-Q) repeats in several unrelated proteins is associated with at least ten neurodegenerative diseases. The length of the poly-Q regions plays an important role in the progression of the diseases. The number of glutamines (Q) is inversely related to the onset age of these polyglutamine diseases, and the expansion of poly-Q repeats has been associated with protein misfolding. However, very little is known about the structural changes induced by the expansion of the repeats. Computational methods can provide an alternative to determine the structure of these poly-Q proteins, but it is important to evaluate their performance before large scale prediction work is done. Results In this paper, two popular protein structure prediction programs, I-TASSER and Rosetta, have been used to predict the structure of the N-terminal fragment of a protein associated with Huntington's disease with 17 glutamines. Results show that both programs have the ability to find the native structures, but I-TASSER performs better for the overall task. Conclusions Both I-TASSER and Rosetta can be used for structure prediction of proteins with poly-Q repeats. Knowledge of poly-Q structure may significantly contribute to development of therapeutic strategies for poly-Q diseases. PMID:25080018
Scarafoni, Alessio; Gualtieri, Elisa; Barbiroli, Alberto; Carpen, Aristodemo; Negri, Armando; Duranti, Marcello
2011-09-14
The present paper reports the purification and biochemical characterization of an albumin identified in mature lentil seeds with high sequence similarity to pea PA2. These proteins are found in many edible seeds and are considered potentially detrimental for human health due to the potential allergenicity and lectin-like activity. Thus, the description of their possible presence in food and the assessment of the molecular properties are relevant. The M(r), pI, and N-terminal sequence of this protein have been determined. The work included the study of (i) the binding properties to hemine to assess the presence of hemopexin structural domains and (ii) the binding properties of the protein to thiamin. In addition, the structural changes induced by heating have been evaluated by means of spectroscopic techniques. Denaturation temperature has also been determined. The present work provides new insights about the structural molecular features and the ligand-binding properties and dynamics of this kind of seed albumin.
New strategy for protein interactions and application to structure-based drug design
NASA Astrophysics Data System (ADS)
Zou, Xiaoqin
One of the greatest challenges in computational biophysics is to predict interactions between biological molecules, which play critical roles in biological processes and rational design of therapeutic drugs. Biomolecular interactions involve delicate interplay between multiple interactions, including electrostatic interactions, van der Waals interactions, solvent effect, and conformational entropic effect. Accurate determination of these complex and subtle interactions is challenging. Moreover, a biological molecule such as a protein usually consists of thousands of atoms, and thus occupies a huge conformational space. The large degrees of freedom pose further challenges for accurate prediction of biomolecular interactions. Here, I will present our development of physics-based theory and computational modeling on protein interactions with other molecules. The major strategy is to extract microscopic energetics from the information embedded in the experimentally-determined structures of protein complexes. I will also present applications of the methods to structure-based therapeutic design. Supported by NSF CAREER Award DBI-0953839, NIH R01GM109980, and the American Heart Association (Midwest Affiliate) [13GRNT16990076].
Kinetics and Mechanism of Mammalian Mitochondrial Ribosome Assembly.
Bogenhagen, Daniel F; Ostermeyer-Fay, Anne G; Haley, John D; Garcia-Diaz, Miguel
2018-02-13
Mammalian mtDNA encodes only 13 proteins, all essential components of respiratory complexes, synthesized by mitochondrial ribosomes. Mitoribosomes contain greatly truncated RNAs transcribed from mtDNA, including a structural tRNA in place of 5S RNA as a scaffold for binding 82 nucleus-encoded proteins, mitoribosomal proteins (MRPs). Cryoelectron microscopy (cryo-EM) studies have determined the structure of the mitoribosome, but its mechanism of assembly is unknown. Our SILAC pulse-labeling experiments determine the rates of mitochondrial import of MRPs and their assembly into intact mitoribosomes, providing a basis for distinguishing MRPs that bind at early and late stages in mitoribosome assembly to generate a working model for mitoribosome assembly. Mitoribosome assembly is a slow process initiated at the mtDNA nucleoid driven by excess synthesis of individual MRPs. MRPs that are tightly associated in the structure frequently join the complex in a coordinated manner. Clinically significant MRP mutations reported to date affect proteins that bind early on during assembly. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
Quantum-mechanics-derived 13Cα chemical shift server (CheShift) for protein structure validation
Vila, Jorge A.; Arnautova, Yelena A.; Martin, Osvaldo A.; Scheraga, Harold A.
2009-01-01
A server (CheShift) has been developed to predict 13Cα chemical shifts of protein structures. It is based on the generation of 696,916 conformations as a function of the φ, ψ, ω, χ1 and χ2 torsional angles for all 20 naturally occurring amino acids. Their 13Cα chemical shifts were computed at the DFT level of theory with a small basis set and extrapolated, with an empirically-determined linear regression formula, to reproduce the values obtained with a larger basis set. Analysis of the accuracy and sensitivity of the CheShift predictions, in terms of both the correlation coefficient R and the conformational-averaged rmsd between the observed and predicted 13Cα chemical shifts, was carried out for 3 sets of conformations: (i) 36 x-ray-derived protein structures solved at 2.3 Å or better resolution, for which sets of 13Cα chemical shifts were available; (ii) 15 pairs of x-ray and NMR-derived sets of protein conformations; and (iii) a set of decoys for 3 proteins showing an rmsd with respect to the x-ray structure from which they were derived of up to 3 Å. Comparative analysis carried out with 4 popular servers, namely SHIFTS, SHIFTX, SPARTA, and PROSHIFT, for these 3 sets of conformations demonstrated that CheShift is the most sensitive server with which to detect subtle differences between protein models and, hence, to validate protein structures determined by either x-ray or NMR methods, if the observed 13Cα chemical shifts are available. CheShift is available as a web server. PMID:19805131
Structural analysis of a set of proteins resulting from a bacterial genomics project.
Badger, J; Sauder, J M; Adams, J M; Antonysamy, S; Bain, K; Bergseid, M G; Buchanan, S G; Buchanan, M D; Batiyenko, Y; Christopher, J A; Emtage, S; Eroshkina, A; Feil, I; Furlong, E B; Gajiwala, K S; Gao, X; He, D; Hendle, J; Huber, A; Hoda, K; Kearins, P; Kissinger, C; Laubert, B; Lewis, H A; Lin, J; Loomis, K; Lorimer, D; Louie, G; Maletic, M; Marsh, C D; Miller, I; Molinari, J; Muller-Dieckmann, H J; Newman, J M; Noland, B W; Pagarigan, B; Park, F; Peat, T S; Post, K W; Radojicic, S; Ramos, A; Romero, R; Rutter, M E; Sanderson, W E; Schwinn, K D; Tresser, J; Winhoven, J; Wright, T A; Wu, L; Xu, J; Harris, T J R
2005-09-01
The targets of the Structural GenomiX (SGX) bacterial genomics project were proteins conserved in multiple prokaryotic organisms with no obvious sequence homolog in the Protein Data Bank of known structures. The outcome of this work was 80 structures, covering 60 unique sequences and 49 different genes. Experimental phase determination from proteins incorporating Se-Met was carried out for 45 structures with most of the remainder solved by molecular replacement using members of the experimentally phased set as search models. An automated tool was developed to deposit these structures in the Protein Data Bank, along with the associated X-ray diffraction data (including refined experimental phases) and experimentally confirmed sequences. BLAST comparisons of the SGX structures with structures that had appeared in the Protein Data Bank over the intervening 3.5 years since the SGX target list had been compiled identified homologs for 49 of the 60 unique sequences represented by the SGX structures. This result indicates that, for bacterial structures that are relatively easy to express, purify, and crystallize, the structural coverage of gene space is proceeding rapidly. More distant sequence-structure relationships between the SGX and PDB structures were investigated using PDB-BLAST and Combinatorial Extension (CE). Only one structure, SufD, has a truly unique topology compared to all folds in the PDB. Copyright 2005 Wiley-Liss, Inc.
Huang, Li-shar; Borders, Toni M.; Shen, John T.; Wang, Chung-Jen; Berry, Edward
2006-01-01
Synopsis A multi-subunit mitochondrial membrane protein complex involved in the Krebs Cycle and respiratory chain has been crystallized in a form suitable for near-atomic resolution structure determination. A procedure is presented for preparation of diffraction-quality crystals of a vertebrate mitochondrial respiratory Complex II. The crystals have the potential to diffract to at least 2.0 Å with optimization of post-crystal-growth treatment and cryoprotection. This should allow determination of the structure of this important and medically relevant membrane protein complex at near-atomic resolution and provide great detail of the mode of binding of substrates and inhibitors at the two substrate-binding sites. PMID:15805592
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ecale Zhou, C L; Zemla, A T; Roe, D
2005-01-29
Specific and sensitive ligand-based protein detection assays that employ antibodies or small molecules such as peptides, aptamers, or other small molecules require that the corresponding surface region of the protein be accessible and that there be minimal cross-reactivity with non-target proteins. To reduce the time and cost of laboratory screening efforts for diagnostic reagents, we developed new methods for evaluating and selecting protein surface regions for ligand targeting. We devised combined structure- and sequence-based methods for identifying 3D epitopes and binding pockets on the surface of the A chain of ricin that are conserved with respect to a set ofmore » ricin A chains and unique with respect to other proteins. We (1) used structure alignment software to detect structural deviations and extracted from this analysis the residue-residue correspondence, (2) devised a method to compare corresponding residues across sets of ricin structures and structures of closely related proteins, (3) devised a sequence-based approach to determine residue infrequency in local sequence context, and (4) modified a pocket-finding algorithm to identify surface crevices in close proximity to residues determined to be conserved/unique based on our structure- and sequence-based methods. In applying this combined informatics approach to ricin A we identified a conserved/unique pocket in close proximity (but not overlapping) the active site that is suitable for bi-dentate ligand development. These methods are generally applicable to identification of surface epitopes and binding pockets for development of diagnostic reagents, therapeutics, and vaccines.« less
Algorithm, applications and evaluation for protein comparison by Ramanujan Fourier transform.
Zhao, Jian; Wang, Jiasong; Hua, Wei; Ouyang, Pingkai
2015-12-01
The amino acid sequence of a protein determines its chemical properties, chain conformation and biological functions. Protein sequence comparison is of great importance to identify similarities of protein structures and infer their functions. Many properties of a protein correspond to the low-frequency signals within the sequence. Low frequency modes in protein sequences are linked to the secondary structures, membrane protein types, and sub-cellular localizations of the proteins. In this paper, we present Ramanujan Fourier transform (RFT) with a fast algorithm to analyze the low-frequency signals of protein sequences. The RFT method is applied to similarity analysis of protein sequences with the Resonant Recognition Model (RRM). The results show that the proposed fast RFT method on protein comparison is more efficient than commonly used discrete Fourier transform (DFT). RFT can detect common frequencies as significant feature for specific protein families, and the RFT spectrum heat-map of protein sequences demonstrates the information conservation in the sequence comparison. The proposed method offers a new tool for pattern recognition, feature extraction and structural analysis on protein sequences. Copyright © 2015 Elsevier Ltd. All rights reserved.
Insight into the Structure of Amyloid Fibrils from the Analysis of Globular Proteins
Trovato, Antonio; Chiti, Fabrizio; Maritan, Amos; Seno, Flavio
2006-01-01
The conversion from soluble states into cross-β fibrillar aggregates is a property shared by many different proteins and peptides and was hence conjectured to be a generic feature of polypeptide chains. Increasing evidence is now accumulating that such fibrillar assemblies are generally characterized by a parallel in-register alignment of β-strands contributed by distinct protein molecules. Here we assume a universal mechanism is responsible for β-structure formation and deduce sequence-specific interaction energies between pairs of protein fragments from a statistical analysis of the native folds of globular proteins. The derived fragment–fragment interaction was implemented within a novel algorithm, prediction of amyloid structure aggregation (PASTA), to investigate the role of sequence heterogeneity in driving specific aggregation into ordered self-propagating cross-β structures. The algorithm predicts that the parallel in-register arrangement of sequence portions that participate in the fibril cross-β core is favoured in most cases. However, the antiparallel arrangement is correctly discriminated when present in fibrils formed by short peptides. The predictions of the most aggregation-prone portions of initially unfolded polypeptide chains are also in excellent agreement with available experimental observations. These results corroborate the recent hypothesis that the amyloid structure is stabilised by the same physicochemical determinants as those operating in folded proteins. They also suggest that side chain–side chain interaction across neighbouring β-strands is a key determinant of amyloid fibril formation and of their self-propagating ability. PMID:17173479
Protein Folding and Self-Organized Criticality
NASA Astrophysics Data System (ADS)
Bajracharya, Arun; Murray, Joelle
Proteins are known to fold into tertiary structures that determine their functionality in living organisms. However, the complex dynamics of protein folding and the way they consistently fold into the same structures is not fully understood. Self-organized criticality (SOC) has provided a framework for understanding complex systems in various systems (earthquakes, forest fires, financial markets, and epidemics) through scale invariance and the associated power law behavior. In this research, we use a simple hydrophobic-polar lattice-bound computational model to investigate self-organized criticality as a possible mechanism for generating complexity in protein folding.
Membrane-spanning α-helical barrels as tractable protein-design targets.
Niitsu, Ai; Heal, Jack W; Fauland, Kerstin; Thomson, Andrew R; Woolfson, Derek N
2017-08-05
The rational ( de novo ) design of membrane-spanning proteins lags behind that for water-soluble globular proteins. This is due to gaps in our knowledge of membrane-protein structure, and experimental difficulties in studying such proteins compared to water-soluble counterparts. One limiting factor is the small number of experimentally determined three-dimensional structures for transmembrane proteins. By contrast, many tens of thousands of globular protein structures provide a rich source of 'scaffolds' for protein design, and the means to garner sequence-to-structure relationships to guide the design process. The α-helical coiled coil is a protein-structure element found in both globular and membrane proteins, where it cements a variety of helix-helix interactions and helical bundles. Our deep understanding of coiled coils has enabled a large number of successful de novo designs. For one class, the α-helical barrels-that is, symmetric bundles of five or more helices with central accessible channels-there are both water-soluble and membrane-spanning examples. Recent computational designs of water-soluble α-helical barrels with five to seven helices have advanced the design field considerably. Here we identify and classify analogous and more complicated membrane-spanning α-helical barrels from the Protein Data Bank. These provide tantalizing but tractable targets for protein engineering and de novo protein design.This article is part of the themed issue 'Membrane pores: from structure and assembly, to medicine and technology'. © 2017 The Author(s).
Sixty-five years of the long march in protein secondary structure prediction: the final stretch?
Yang, Yuedong; Gao, Jianzhao; Wang, Jihua; Heffernan, Rhys; Hanson, Jack; Paliwal, Kuldip; Zhou, Yaoqi
2018-01-01
Abstract Protein secondary structure prediction began in 1951 when Pauling and Corey predicted helical and sheet conformations for protein polypeptide backbone even before the first protein structure was determined. Sixty-five years later, powerful new methods breathe new life into this field. The highest three-state accuracy without relying on structure templates is now at 82–84%, a number unthinkable just a few years ago. These improvements came from increasingly larger databases of protein sequences and structures for training, the use of template secondary structure information and more powerful deep learning techniques. As we are approaching to the theoretical limit of three-state prediction (88–90%), alternative to secondary structure prediction (prediction of backbone torsion angles and Cα-atom-based angles and torsion angles) not only has more room for further improvement but also allows direct prediction of three-dimensional fragment structures with constantly improved accuracy. About 20% of all 40-residue fragments in a database of 1199 non-redundant proteins have <6 Å root-mean-squared distance from the native conformations by SPIDER2. More powerful deep learning methods with improved capability of capturing long-range interactions begin to emerge as the next generation of techniques for secondary structure prediction. The time has come to finish off the final stretch of the long march towards protein secondary structure prediction. PMID:28040746
Synchrotron IR microspectroscopy for protein structure analysis: Potential and questions
Yu, Peiqiang
2006-01-01
Synchrotron radiation-based Fourier transform infrared microspectroscopy (S-FTIR) has been developed as a rapid, direct, non-destructive, bioanalytical technique. This technique takes advantage of synchrotron light brightness and small effective source size and is capable of exploring the molecular chemical make-up within microstructures of a biological tissue without destruction of inherent structures at ultra-spatial resolutions within cellular dimension. To date there has been very little application of this advanced technique to the study of pure protein inherent structure at a cellular level in biological tissues. In this review, a novel approach was introduced to show the potential of the newly developed, advancedmore » synchrotron-based analytical technology, which can be used to localize relatively “pure“ protein in the plant tissues and relatively reveal protein inherent structure and protein molecular chemical make-up within intact tissue at cellular and subcellular levels. Several complex protein IR spectra data analytical techniques (Gaussian and Lorentzian multi-component peak modeling, univariate and multivariate analysis, principal component analysis (PCA), and hierarchical cluster analysis (CLA) are employed to relatively reveal features of protein inherent structure and distinguish protein inherent structure differences between varieties/species and treatments in plant tissues. By using a multi-peak modeling procedure, RELATIVE estimates (but not EXACT determinations) for protein secondary structure analysis can be made for comparison purpose. The issues of pro- and anti-multi-peaking modeling/fitting procedure for relative estimation of protein structure were discussed. By using the PCA and CLA analyses, the plant molecular structure can be qualitatively separate one group from another, statistically, even though the spectral assignments are not known. The synchrotron-based technology provides a new approach for protein structure research in biological tissues at ultraspatial resolutions.« less
Yamamoto, Norifumi
2014-08-21
The conformational conversion of proteins into an aggregation-prone form is a common feature of various neurodegenerative disorders including Alzheimer's, Huntington's, Parkinson's, and prion diseases. In the early stage of prion diseases, secondary structure conversion in prion protein (PrP) causing β-sheet expansion facilitates the formation of a pathogenic isoform with a high content of β-sheets and strong aggregation tendency to form amyloid fibrils. Herein, we propose a straightforward method to extract essential information regarding the secondary structure conversion of proteins from molecular simulations, named secondary structure principal component analysis (SSPCA). The definite existence of a PrP isoform with an increased β-sheet structure was confirmed in a free-energy landscape constructed by mapping protein structural data into a reduced space according to the principal components determined by the SSPCA. We suggest a "spot" of structural ambivalence in PrP-the C-terminal part of helix 2-that lacks a strong intrinsic secondary structure, thus promoting a partial α-helix-to-β-sheet conversion. This result is important to understand how the pathogenic conformational conversion of PrP is initiated in prion diseases. The SSPCA has great potential to solve various challenges in studying highly flexible molecular systems, such as intrinsically disordered proteins, structurally ambivalent peptides, and chameleon sequences.
The architecture of the DNA replication origin recognition complex in Saccharomyces cerevisiae
Chen, Zhiqiang; Speck, Christian; Wendel, Patricia; Tang, Chunyan; Stillman, Bruce; Li, Huilin
2008-01-01
The origin recognition complex (ORC) is conserved in all eukaryotes. The six proteins of the Saccharomyces cerevisiae ORC that form a stable complex bind to origins of DNA replication and recruit prereplicative complex (pre-RC) proteins, one of which is Cdc6. To further understand the function of ORC we recently determined by single-particle reconstruction of electron micrographs a low-resolution, 3D structure of S. cerevisiae ORC and the ORC–Cdc6 complex. In this article, the spatial arrangement of the ORC subunits within the ORC structure is described. In one approach, a maltose binding protein (MBP) was systematically fused to the N or the C termini of the five largest ORC subunits, one subunit at a time, generating 10 MBP-fused ORCs, and the MBP density was localized in the averaged, 2D EM images of the MBP-fused ORC particles. Determining the Orc1–5 structure and comparing it with the native ORC structure localized the Orc6 subunit near Orc2 and Orc3. Finally, subunit–subunit interactions were determined by immunoprecipitation of ORC subunits synthesized in vitro. Based on the derived ORC architecture and existing structures of archaeal Orc1–DNA structures, we propose a model for ORC and suggest how ORC interacts with origin DNA and Cdc6. The studies provide a basis for understanding the overall structure of the pre-RC. PMID:18647841
Elucidation of the structure of retroviral proteases: a reminiscence.
Jaskolski, Mariusz; Miller, Maria; Mohana Rao, J K; Gustchina, Alla; Wlodawer, Alexander
2015-11-01
Determinations of only a very few protein structures had consequences comparable to the impact exerted by the structure of the protease encoded by HIV-1, published just over 25 years ago. The structure of this relatively small protein and its cousins from other retroviruses provided a clear target for a spectacularly successful structure-assisted drug design effort that offered new hope for controlling the then-escalating AIDS epidemic. This reminiscence is limited primarily to work conducted at the National Cancer Institute, and is not meant to be a comprehensive history of the field, but is rather an attempt to provide a very personal account of how the structures of this most thoroughly studied crystallographic target were determined. Published 2015. This article is a U.S. Government work and is in the public domain in the USA.
Armour, Brianna L; Barnes, Steve R; Moen, Spencer O; Smith, Eric; Raymond, Amy C; Fairman, James W; Stewart, Lance J; Staker, Bart L; Begley, Darren W; Edwards, Thomas E; Lorimer, Donald D
2013-06-28
Pandemic outbreaks of highly virulent influenza strains can cause widespread morbidity and mortality in human populations worldwide. In the United States alone, an average of 41,400 deaths and 1.86 million hospitalizations are caused by influenza virus infection each year (1). Point mutations in the polymerase basic protein 2 subunit (PB2) have been linked to the adaptation of the viral infection in humans (2). Findings from such studies have revealed the biological significance of PB2 as a virulence factor, thus highlighting its potential as an antiviral drug target. The structural genomics program put forth by the National Institute of Allergy and Infectious Disease (NIAID) provides funding to Emerald Bio and three other Pacific Northwest institutions that together make up the Seattle Structural Genomics Center for Infectious Disease (SSGCID). The SSGCID is dedicated to providing the scientific community with three-dimensional protein structures of NIAID category A-C pathogens. Making such structural information available to the scientific community serves to accelerate structure-based drug design. Structure-based drug design plays an important role in drug development. Pursuing multiple targets in parallel greatly increases the chance of success for new lead discovery by targeting a pathway or an entire protein family. Emerald Bio has developed a high-throughput, multi-target parallel processing pipeline (MTPP) for gene-to-structure determination to support the consortium. Here we describe the protocols used to determine the structure of the PB2 subunit from four different influenza A strains.
Analysis of the Free-Energy Surface of Proteins from Reversible Folding Simulations
Allen, Lucy R.; Krivov, Sergei V.; Paci, Emanuele
2009-01-01
Computer generated trajectories can, in principle, reveal the folding pathways of a protein at atomic resolution and possibly suggest general and simple rules for predicting the folded structure of a given sequence. While such reversible folding trajectories can only be determined ab initio using all-atom transferable force-fields for a few small proteins, they can be determined for a large number of proteins using coarse-grained and structure-based force-fields, in which a known folded structure is by construction the absolute energy and free-energy minimum. Here we use a model of the fast folding helical λ-repressor protein to generate trajectories in which native and non-native states are in equilibrium and transitions are accurately sampled. Yet, representation of the free-energy surface, which underlies the thermodynamic and dynamic properties of the protein model, from such a trajectory remains a challenge. Projections over one or a small number of arbitrarily chosen progress variables often hide the most important features of such surfaces. The results unequivocally show that an unprojected representation of the free-energy surface provides important and unbiased information and allows a simple and meaningful description of many-dimensional, heterogeneous trajectories, providing new insight into the possible mechanisms of fast-folding proteins. PMID:19593364
Analysis of the free-energy surface of proteins from reversible folding simulations.
Allen, Lucy R; Krivov, Sergei V; Paci, Emanuele
2009-07-01
Computer generated trajectories can, in principle, reveal the folding pathways of a protein at atomic resolution and possibly suggest general and simple rules for predicting the folded structure of a given sequence. While such reversible folding trajectories can only be determined ab initio using all-atom transferable force-fields for a few small proteins, they can be determined for a large number of proteins using coarse-grained and structure-based force-fields, in which a known folded structure is by construction the absolute energy and free-energy minimum. Here we use a model of the fast folding helical lambda-repressor protein to generate trajectories in which native and non-native states are in equilibrium and transitions are accurately sampled. Yet, representation of the free-energy surface, which underlies the thermodynamic and dynamic properties of the protein model, from such a trajectory remains a challenge. Projections over one or a small number of arbitrarily chosen progress variables often hide the most important features of such surfaces. The results unequivocally show that an unprojected representation of the free-energy surface provides important and unbiased information and allows a simple and meaningful description of many-dimensional, heterogeneous trajectories, providing new insight into the possible mechanisms of fast-folding proteins.
Neu, Ursula; Wang, Jianbo; Macejak, Dennis; Garcea, Robert L; Stehle, Thilo
2011-07-01
The Karolinska Institutet and Washington University polyomaviruses (KIPyV and WUPyV, respectively) are recently discovered human viruses that infect the respiratory tract. Although they have not yet been linked to disease, they are prevalent in populations worldwide, with initial infection occurring in early childhood. Polyomavirus capsids consist of 72 pentamers of the major capsid protein viral protein 1 (VP1), which determines antigenicity and receptor specificity. The WUPyV and KIPyV VP1 proteins are distant in evolution from VP1 proteins of known structure such as simian virus 40 or murine polyomavirus. We present here the crystal structures of unassembled recombinant WUPyV and KIPyV VP1 pentamers at resolutions of 2.9 and 2.55 Å, respectively. The WUPyV and KIPyV VP1 core structures fold into the same β-sandwich that is a hallmark of all polyomavirus VP1 proteins crystallized to date. However, differences in sequence translate into profoundly different surface loop structures in KIPyV and WUPyV VP1 proteins. Such loop structures have not been observed for other polyomaviruses, and they provide initial clues about the possible interactions of these viruses with cell surface receptors.
Weerth, R. Sophia; Michalska, Karolina; Bingman, Craig A.; ...
2014-12-18
Here, proteins belonging to the cupin superfamily have a wide range of catalytic and noncatalytic functions. Cupin proteins commonly have the capacity to bind a metal ion with the metal frequently determining the function of the protein. We have been investigating the function of homologous cupin proteins that are conserved in more than 40 species of bacteria. In conclusion, to gain insights into the potential function of these proteins we have solved the structure of Plu4264 from Photorhabdus luminescens TTO1 at a resolution of 1.35 Å and identified manganese as the likely natural metal ligand of the protein. Proteins 2015;more » 83:383–388.« less
Visualizing water molecules in transmembrane proteins using radiolytic labeling methods†
Orban, Tivadar; Gupta, Sayan; Palczewski, Krzysztof; Chance, Mark R.
2010-01-01
Essential to cells and their organelles, water is both shuttled to where it is needed and trapped within cellular compartments and structures. Moreover, ordered waters within protein structures often co-localize with strategically placed polar or charged groups critical for protein function. Yet it is unclear if these ordered water molecules provide structural stabilization, mediate conformational changes in signaling, neutralize charged residues, or carry out a combination of all these functions. Structures of many integral membrane proteins, including G protein-coupled receptors (GPCRs), reveal the presence of ordered water molecules that may act like prosthetic groups in a manner quite unlike bulk water. Identification of ‘ordered’ waters within a crystalline protein structure requires sufficient occupancy of water to enable its detection in the protein's X-ray diffraction pattern and thus the observed waters likely represent a subset of tightly-bound functional waters. In this review, we highlight recent studies that suggest the structures of ordered waters within GPCRs are as conserved (and thus as important) as conserved side chains. In addition, methods of radiolysis, coupled to structural mass spectrometry (protein footprinting), reveal dynamic changes in water structure that mediate transmembrane signaling. The idea of water as a prosthetic group mediating chemical reaction dynamics is not new in fields such as catalysis. However, the concept of water as a mediator of conformational dynamics in signaling is just emerging, owing to advances in both crystallographic structure determination and new methods of protein footprinting. Although oil and water do not mix, understanding the roles of water is essential to understanding the function of membrane proteins. PMID:20047303
Kendrick, B S; Kerwin, B A; Chang, B S; Philo, J S
2001-12-15
Characterizing the solution structure of protein-polymer conjugates and protein-ligand interactions is important in fields such as biotechnology and biochemistry. Size-exclusion high-performance liquid chromatography with online classical light scattering (LS), refractive index (RI), and UV detection offers a powerful tool in such characterization. Novel methods are presented utilizing LS, RI, and UV signals to rapidly determine the degree of conjugation and the molecular mass of the protein conjugate. Baseline resolution of the chromatographic peaks is not required; peaks need only be sufficiently separated to represent relatively pure fractions. An improved technique for determining the polypeptide-only mass of protein conjugates is also described. These techniques are applied to determining the degree of erythropoietin glycosylation, the degree of polyethylene glycol conjugation to RNase A and brain-derived neurotrophic factor, and the solution association states of these molecules. Calibration methods for the RI, UV, and LS detectors will also be addressed, as well as online methods to determine protein extinction coefficients and dn/dc values both unconjugated and conjugated protein molecules. (c)2001 Elsevier Science.
Crystal structure of a designed, thermostable, heterotrimeric coiled coil.
Nautiyal, S.; Alber, T.
1999-01-01
Electrostatic interactions are often critical for determining the specificity of protein-protein complexes. To study the role of electrostatic interactions for assembly of helical bundles, we previously designed a thermostable, heterotrimeric coiled coil, ABC, in which charged residues were employed to drive preferential association of three distinct, 34-residue helices. To investigate the basis for heterotrimer specificity, we have used multiwavelength anomalous diffraction (MAD) analysis to determine the 1.8 A resolution crystal structure of ABC. The structure shows that ABC forms a heterotrimeric coiled coil with the intended arrangement of parallel chains. Over half of the ion pairs engineered to restrict helix associations were apparent in the experimental electron density map. As seen in other trimeric coiled coils, ABC displays acute knobs-into-holes packing and a buried anion coordinated by core polar amino acids. These interactions validate the design strategy and illustrate how packing and polar contacts determine structural uniqueness. PMID:10210186
Tracing Primordial Protein Evolution through Structurally Guided Stepwise Segment Elongation*
Watanabe, Hideki; Yamasaki, Kazuhiko; Honda, Shinya
2014-01-01
The understanding of how primordial proteins emerged has been a fundamental and longstanding issue in biology and biochemistry. For a better understanding of primordial protein evolution, we synthesized an artificial protein on the basis of an evolutionary hypothesis, segment-based elongation starting from an autonomously foldable short peptide. A 10-residue protein, chignolin, the smallest foldable polypeptide ever reported, was used as a structural support to facilitate higher structural organization and gain-of-function in the development of an artificial protein. Repetitive cycles of segment elongation and subsequent phage display selection successfully produced a 25-residue protein, termed AF.2A1, with nanomolar affinity against the Fc region of immunoglobulin G. AF.2A1 shows exquisite molecular recognition ability such that it can distinguish conformational differences of the same molecule. The structure determined by NMR measurements demonstrated that AF.2A1 forms a globular protein-like conformation with the chignolin-derived β-hairpin and a tryptophan-mediated hydrophobic core. Using sequence analysis and a mutation study, we discovered that the structural organization and gain-of-function emerged from the vicinity of the chignolin segment, revealing that the structural support served as the core in both structural and functional development. Here, we propose an evolutionary model for primordial proteins in which a foldable segment serves as the evolving core to facilitate structural and functional evolution. This study provides insights into primordial protein evolution and also presents a novel methodology for designing small sized proteins useful for industrial and pharmaceutical applications. PMID:24356963
Sun, Yunxiang; Ming, Dengming
2014-01-01
Energetic frustration is becoming an important topic for understanding the mechanisms of protein folding, which is a long-standing big biological problem usually investigated by the free energy landscape theory. Despite the significant advances in probing the effects of folding frustrations on the overall features of protein folding pathways and folding intermediates, detailed characterizations of folding frustrations at an atomic or residue level are still lacking. In addition, how and to what extent folding frustrations interact with protein topology in determining folding mechanisms remains unclear. In this paper, we tried to understand energetic frustrations in the context of protein topology structures or native-contact networks by comparing the energetic frustrations of five homologous Im9 alpha-helix proteins that share very similar topology structures but have a single hydrophilic-to-hydrophobic mutual mutation. The folding simulations were performed using a coarse-grained Gō-like model, while non-native hydrophobic interactions were introduced as energetic frustrations using a Lennard-Jones potential function. Energetic frustrations were then examined at residue level based on φ-value analyses of the transition state ensemble structures and mapped back to native-contact networks. Our calculations show that energetic frustrations have highly heterogeneous influences on the folding of the four helices of the examined structures depending on the local environment of the frustration centers. Also, the closer the introduced frustration is to the center of the native-contact network, the larger the changes in the protein folding. Our findings add a new dimension to the understanding of protein folding the topology determination in that energetic frustrations works closely with native-contact networks to affect the protein folding.
Nannenga, Brent L; Iadanza, Matthew G; Vollmar, Breanna S; Gonen, Tamir
2013-01-01
Electron cryomicroscopy, or cryoEM, is an emerging technique for studying the three-dimensional structures of proteins and large macromolecular machines. Electron crystallography is a branch of cryoEM in which structures of proteins can be studied at resolutions that rival those achieved by X-ray crystallography. Electron crystallography employs two-dimensional crystals of a membrane protein embedded within a lipid bilayer. The key to a successful electron crystallographic experiment is the crystallization, or reconstitution, of the protein of interest. This unit describes ways in which protein can be expressed, purified, and reconstituted into well-ordered two-dimensional crystals. A protocol is also provided for negative stain electron microscopy as a tool for screening crystallization trials. When large and well-ordered crystals are obtained, the structures of both protein and its surrounding membrane can be determined to atomic resolution.
NASA Astrophysics Data System (ADS)
Kutuzova, G. D.; Ugarova, N. N.; Berezin, Ilya V.
1984-11-01
The principal structural and physicochemical factors determining the stability of protein macromolecules in solution and the characteristics of the structure of the proteins from thermophilic microorganisms are examined. The mechanism of the changes in the thermal stability of proteins and enzymes after the chemical modification of their functional side groups and the experimental data concerning the influence of chemical modification on the thermal stability of proteins are analysed. The dependence of the stabilisation effect and of the changes in the structure of protein macromolecules on the degree of modification and on the nature of the modified groups and the groups introduced into proteins in the course of modification (their charge and hydrophobic properties) is demonstrated. The great practical value of the method of chemical modification for the preparation of stabilised forms of biocatalysts is shown in relation to specific examples. The bibliography includes 178 references.
Marsella, Luca; Sirocco, Francesco; Trovato, Antonio; Seno, Flavio; Tosatto, Silvio C.E.
2009-01-01
Motivation: Proteins with solenoid repeats evolve more quickly than non-repetitive ones and their periodicity may be rapidly hidden at sequence level, while still evident in structure. In order to identify these repeats, we propose here a novel method based on a metric characterizing amino-acid properties (polarity, secondary structure, molecular volume, codon diversity, electric charge) using five previously derived numerical functions. Results: The five spectra of the candidate sequences coding for structural repeats, obtained by Discrete Fourier Transform (DFT), show common features allowing determination of repeat periodicity with excellent results. Moreover it is possible to introduce a phase space parameterized by two quantities related to the Fourier spectra which allow for a clear distinction between a non-homologous set of globular proteins and proteins with solenoid repeats. The DFT method is shown to be competitive with other state of the art methods in the detection of solenoid structures, while improving its performance especially in the identification of periodicities, since it is able to recognize the actual repeat length in most cases. Moreover it highlights the relevance of local structural propensities in determining solenoid repeats. Availability: A web tool implementing the algorithm presented in the article (REPETITA) is available with additional details on the data sets at the URL: http://protein.bio.unipd.it/repetita/. Contact: silvio.tosatto@unipd.it PMID:19478001
The structure and host entry of an invertebrate parvovirus.
Meng, Geng; Zhang, Xinzheng; Plevka, Pavel; Yu, Qian; Tijssen, Peter; Rossmann, Michael G
2013-12-01
The 3.5-Å resolution X-ray crystal structure of mature cricket parvovirus (Acheta domesticus densovirus [AdDNV]) has been determined. Structural comparisons show that vertebrate and invertebrate parvoviruses have evolved independently, although there are common structural features among all parvovirus capsid proteins. It was shown that raising the temperature of the AdDNV particles caused a loss of their genomes. The structure of these emptied particles was determined by cryo-electron microscopy to 5.5-Å resolution, and the capsid structure was found to be the same as that for the full, mature virus except for the absence of the three ordered nucleotides observed in the crystal structure. The viral protein 1 (VP1) amino termini could be externalized without significant damage to the capsid. In vitro, this externalization of the VP1 amino termini is accompanied by the release of the viral genome.
The Structure and Host Entry of an Invertebrate Parvovirus
Meng, Geng; Zhang, Xinzheng; Plevka, Pavel; Yu, Qian; Tijssen, Peter
2013-01-01
The 3.5-Å resolution X-ray crystal structure of mature cricket parvovirus (Acheta domesticus densovirus [AdDNV]) has been determined. Structural comparisons show that vertebrate and invertebrate parvoviruses have evolved independently, although there are common structural features among all parvovirus capsid proteins. It was shown that raising the temperature of the AdDNV particles caused a loss of their genomes. The structure of these emptied particles was determined by cryo-electron microscopy to 5.5-Å resolution, and the capsid structure was found to be the same as that for the full, mature virus except for the absence of the three ordered nucleotides observed in the crystal structure. The viral protein 1 (VP1) amino termini could be externalized without significant damage to the capsid. In vitro, this externalization of the VP1 amino termini is accompanied by the release of the viral genome. PMID:24027306
Crystallization of the Large Membrane Protein Complex Photosystem I in a Microfluidic Channel
Abdallah, Bahige G.; Kupitz, Christopher; Fromme, Petra; Ros, Alexandra
2014-01-01
Traditional macroscale protein crystallization is accomplished non-trivially by exploring a range of protein concentrations and buffers in solution until a suitable combination is attained. This methodology is time consuming and resource intensive, hindering protein structure determination. Even more difficulties arise when crystallizing large membrane protein complexes such as photosystem I (PSI) due to their large unit cells dominated by solvent and complex characteristics that call for even stricter buffer requirements. Structure determination techniques tailored for these ‘difficult to crystallize’ proteins such as femtosecond nanocrystallography are being developed, yet still need specific crystal characteristics. Here, we demonstrate a simple and robust method to screen protein crystallization conditions at low ionic strength in a microfluidic device. This is realized in one microfluidic experiment using low sample amounts, unlike traditional methods where each solution condition is set up separately. Second harmonic generation microscopy via Second Order Nonlinear Imaging of Chiral Crystals (SONICC) was applied for the detection of nanometer and micrometer sized PSI crystals within microchannels. To develop a crystallization phase diagram, crystals imaged with SONICC at specific channel locations were correlated to protein and salt concentrations determined by numerical simulations of the time-dependent diffusion process along the channel. Our method demonstrated that a portion of the PSI crystallization phase diagram could be reconstructed in excellent agreement with crystallization conditions determined by traditional methods. We postulate that this approach could be utilized to efficiently study and optimize crystallization conditions for a wide range of proteins that are poorly understood to date. PMID:24191698
Breakdown of the Debye polarization ansatz at protein-water interfaces
NASA Astrophysics Data System (ADS)
Fernández Stigliano, Ariel
2013-06-01
The topographical and physico-chemical complexity of protein-water interfaces scales down to the sub-nanoscale range. At this level of confinement, we demonstrate that the dielectric structure of interfacial water entails a breakdown of the Debye ansatz that postulates the alignment of polarization with the protein electrostatic field. The tendencies to promote anomalous polarization are determined for each residue type and a particular kind of structural defect is shown to provide the predominant causal context.
1998-06-16
Eddie Snell (standing), Post-Doctoral Fellow the National Research Council (NRC),and Marc Pusey of Marshall Space Flight Center (MSFC) use a reciprocal space mapping diffractometer for marcromolecular crystal quality studies. The diffractometer is used in mapping the structure of marcromolecules such as proteins to determine their structure and thus understand how they function with other proteins in the body. This is one of several analytical tools used on proteins crystalized on Earth and in space experiments. Photo credit: NASA/Marshall Space Flight Center (MSFC)