Approaches to ab initio molecular replacement of α-helical transmembrane proteins.
Thomas, Jens M H; Simkovic, Felix; Keegan, Ronan; Mayans, Olga; Zhang, Chengxin; Zhang, Yang; Rigden, Daniel J
2017-12-01
α-Helical transmembrane proteins are a ubiquitous and important class of proteins, but present difficulties for crystallographic structure solution. Here, the effectiveness of the AMPLE molecular replacement pipeline in solving α-helical transmembrane-protein structures is assessed using a small library of eight ideal helices, as well as search models derived from ab initio models generated both with and without evolutionary contact information. The ideal helices prove to be surprisingly effective at solving higher resolution structures, but ab initio-derived search models are able to solve structures that could not be solved with the ideal helices. The addition of evolutionary contact information results in a marked improvement in the modelling and makes additional solutions possible.
Predicting protein structures with a multiplayer online game.
Cooper, Seth; Khatib, Firas; Treuille, Adrien; Barbero, Janos; Lee, Jeehyung; Beenen, Michael; Leaver-Fay, Andrew; Baker, David; Popović, Zoran; Players, Foldit
2010-08-05
People exert large amounts of problem-solving effort playing computer games. Simple image- and text-recognition tasks have been successfully 'crowd-sourced' through games, but it is not clear if more complex scientific problems can be solved with human-directed computing. Protein structure prediction is one such problem: locating the biologically relevant native conformation of a protein is a formidable computational challenge given the very large size of the search space. Here we describe Foldit, a multiplayer online game that engages non-scientists in solving hard prediction problems. Foldit players interact with protein structures using direct manipulation tools and user-friendly versions of algorithms from the Rosetta structure prediction methodology, while they compete and collaborate to optimize the computed energy. We show that top-ranked Foldit players excel at solving challenging structure refinement problems in which substantial backbone rearrangements are necessary to achieve the burial of hydrophobic residues. Players working collaboratively develop a rich assortment of new strategies and algorithms; unlike computational approaches, they explore not only the conformational space but also the space of possible search strategies. The integration of human visual problem-solving and strategy development capabilities with traditional computational algorithms through interactive multiplayer games is a powerful new approach to solving computationally-limited scientific problems.
Maximum likelihood density modification by pattern recognition of structural motifs
Terwilliger, Thomas C.
2004-04-13
An electron density for a crystallographic structure having protein regions and solvent regions is improved by maximizing the log likelihood of a set of structure factors $\{F_h\}$ using a local log-likelihood function: $\ln[\,p(\rho(x) \mid \mathrm{PROT})\,p_{\mathrm{PROT}}(x) + p(\rho(x) \mid \mathrm{SOLV})\,p_{\mathrm{SOLV}}(x) + p(\rho(x) \mid \mathrm{H})\,p_{\mathrm{H}}(x)\,]$, where $p_{\mathrm{PROT}}(x)$ is the probability that $x$ is in the protein region, $p(\rho(x) \mid \mathrm{PROT})$ is the conditional probability for $\rho(x)$ given that $x$ is in the protein region, $p_{\mathrm{SOLV}}(x)$ and $p(\rho(x) \mid \mathrm{SOLV})$ are the corresponding quantities for the solvent region, $p_{\mathrm{H}}(x)$ is the probability that there is a structural motif at a known location, with a known orientation, in the vicinity of the point $x$, and $p(\rho(x) \mid \mathrm{H})$ is the probability distribution for electron density at this point given that the structural motif actually is present. One appropriate structural motif is a helical structure within the crystallographic structure.
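To make the three-term sum in this abstract concrete, the following is a minimal sketch of evaluating the local log-likelihood at a single map point. It is illustrative only: the abstract does not say how the conditional densities are modelled, so the Gaussian form, the `MODELS` parameters and all function names here are assumptions, not the patented method.

```python
import math

def gaussian_pdf(value, mean, sigma):
    """Normal density, used here as a stand-in for p(rho | region)."""
    z = (value - mean) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

def local_log_likelihood(rho, p_prot, p_solv, p_motif, models):
    """ln[p(rho|PROT) p_PROT + p(rho|SOLV) p_SOLV + p(rho|H) p_H] at one point."""
    total = (
        gaussian_pdf(rho, *models["PROT"]) * p_prot
        + gaussian_pdf(rho, *models["SOLV"]) * p_solv
        + gaussian_pdf(rho, *models["H"]) * p_motif
    )
    return math.log(total)

# Hypothetical (mean, sigma) of the density expected in each environment.
MODELS = {"PROT": (0.5, 0.4), "SOLV": (0.0, 0.1), "H": (1.0, 0.2)}

# High density at a point that probably lies inside a placed helix template.
ll_helix = local_log_likelihood(1.0, p_prot=0.3, p_solv=0.1, p_motif=0.6, models=MODELS)
# The same density value at a point believed to be solvent.
ll_solvent = local_log_likelihood(1.0, p_prot=0.1, p_solv=0.9, p_motif=0.0, models=MODELS)
```

With these made-up numbers the high density value is far more likely where a motif is expected than where solvent is expected, which is the kind of signal a density-modification procedure would sum over the map.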
Rosetta Structure Prediction as a Tool for Solving Difficult Molecular Replacement Problems.
DiMaio, Frank
2017-01-01
Molecular replacement (MR), a method for solving the crystallographic phase problem using phases derived from a model of the target structure, has proven extremely valuable, accounting for the vast majority of structures solved by X-ray crystallography. However, when the resolution of data is low, or the starting model is very dissimilar to the target protein, solving structures via molecular replacement may be very challenging. In recent years, protein structure prediction methodology has emerged as a powerful tool in model building and model refinement for difficult molecular replacement problems. This chapter describes some of the tools available in Rosetta for model building and model refinement specifically geared toward difficult molecular replacement cases.
Solving coiled-coil protein structures
Dauter, Zbigniew
2015-02-26
With the availability of more than 100,000 entries stored in the Protein Data Bank (PDB) that can be used as search models, molecular replacement (MR) is currently the most popular method of solving crystal structures of macromolecules. Significant methodological efforts have been directed in recent years towards making this approach more powerful and practical. This resulted in the creation of several computer programs, highly automated and user friendly, that are able to successfully solve many structures even by researchers who, although interested in structures of biomolecules, are not very experienced in crystallography.
Researchers at the Frederick National Lab (FNL) have collaborated in solving the three-dimensional structure of a key protein in Alzheimer’s disease, providing new insight into the basic mechanisms that give rise to the devastating illness. The pro
NCI Scientists Solve Structure of Protein that Enables MERS Virus to Spread | Poster
Scientists at the Frederick National Lab have produced three crystal structures that reveal a specific part of a protein that can be targeted to fight the Middle East respiratory syndrome coronavirus (MERS-CoV), which causes an emerging viral respiratory illness. Senior Investigator David Waugh, Ph.D., Macromolecular Crystallography Laboratory, has solved the structure of an
Advances in Homology Protein Structure Modeling
Xiang, Zhexin
2007-01-01
Homology modeling plays a central role in determining protein structure in the structural genomics project. The importance of homology modeling has been steadily increasing because of the large gap that exists between the overwhelming number of available protein sequences and experimentally solved protein structures, and also, more importantly, because of the increasing reliability and accuracy of the method. In fact, a protein sequence with over 30% identity to a known structure can often be predicted with an accuracy equivalent to a low-resolution X-ray structure. The recent advances in homology modeling, especially in detecting distant homologues, aligning sequences with template structures, modeling of loops and side chains, as well as detecting errors in a model, have contributed to reliable prediction of protein structure, which was not possible even several years ago. The ongoing efforts in solving protein structures, which can be time-consuming and often difficult, will continue to spur the development of a host of new computational methods that can fill in the gap and further contribute to understanding the relationship between protein structure and function. PMID:16787261
X-ray laser diffraction for structure determination of the rhodopsin-arrestin complex
NASA Astrophysics Data System (ADS)
Zhou, X. Edward; Gao, Xiang; Barty, Anton; Kang, Yanyong; He, Yuanzheng; Liu, Wei; Ishchenko, Andrii; White, Thomas A.; Yefanov, Oleksandr; Han, Gye Won; Xu, Qingping; de Waal, Parker W.; Suino-Powell, Kelly M.; Boutet, Sébastien; Williams, Garth J.; Wang, Meitian; Li, Dianfan; Caffrey, Martin; Chapman, Henry N.; Spence, John C. H.; Fromme, Petra; Weierstall, Uwe; Stevens, Raymond C.; Cherezov, Vadim; Melcher, Karsten; Xu, H. Eric
2016-04-01
Serial femtosecond X-ray crystallography (SFX) using an X-ray free electron laser (XFEL) is a recent advancement in structural biology for solving crystal structures of challenging membrane proteins, including G-protein coupled receptors (GPCRs), which often only produce microcrystals. An XFEL delivers highly intense X-ray pulses of femtosecond duration short enough to enable the collection of single diffraction images before significant radiation damage to crystals sets in. Here we report the deposition of the XFEL data and provide further details on crystallization, XFEL data collection and analysis, structure determination, and the validation of the structural model. The rhodopsin-arrestin crystal structure solved with SFX represents the first near-atomic resolution structure of a GPCR-arrestin complex, provides structural insights into understanding of arrestin-mediated GPCR signaling, and demonstrates the great potential of this SFX-XFEL technology for accelerating crystal structure determination of challenging proteins and protein complexes.
X-ray laser diffraction for structure determination of the rhodopsin-arrestin complex.
Zhou, X Edward; Gao, Xiang; Barty, Anton; Kang, Yanyong; He, Yuanzheng; Liu, Wei; Ishchenko, Andrii; White, Thomas A; Yefanov, Oleksandr; Han, Gye Won; Xu, Qingping; de Waal, Parker W; Suino-Powell, Kelly M; Boutet, Sébastien; Williams, Garth J; Wang, Meitian; Li, Dianfan; Caffrey, Martin; Chapman, Henry N; Spence, John C H; Fromme, Petra; Weierstall, Uwe; Stevens, Raymond C; Cherezov, Vadim; Melcher, Karsten; Xu, H Eric
2016-04-12
Serial femtosecond X-ray crystallography (SFX) using an X-ray free electron laser (XFEL) is a recent advancement in structural biology for solving crystal structures of challenging membrane proteins, including G-protein coupled receptors (GPCRs), which often only produce microcrystals. An XFEL delivers highly intense X-ray pulses of femtosecond duration short enough to enable the collection of single diffraction images before significant radiation damage to crystals sets in. Here we report the deposition of the XFEL data and provide further details on crystallization, XFEL data collection and analysis, structure determination, and the validation of the structural model. The rhodopsin-arrestin crystal structure solved with SFX represents the first near-atomic resolution structure of a GPCR-arrestin complex, provides structural insights into understanding of arrestin-mediated GPCR signaling, and demonstrates the great potential of this SFX-XFEL technology for accelerating crystal structure determination of challenging proteins and protein complexes.
X-ray laser diffraction for structure determination of the rhodopsin-arrestin complex
Zhou, X. Edward; Gao, Xiang; Barty, Anton; Kang, Yanyong; He, Yuanzheng; Liu, Wei; Ishchenko, Andrii; White, Thomas A.; Yefanov, Oleksandr; Han, Gye Won; Xu, Qingping; de Waal, Parker W.; Suino-Powell, Kelly M.; Boutet, Sébastien; Williams, Garth J.; Wang, Meitian; Li, Dianfan; Caffrey, Martin; Chapman, Henry N.; Spence, John C.H.; Fromme, Petra; Weierstall, Uwe; Stevens, Raymond C.; Cherezov, Vadim; Melcher, Karsten; Xu, H. Eric
2016-01-01
Serial femtosecond X-ray crystallography (SFX) using an X-ray free electron laser (XFEL) is a recent advancement in structural biology for solving crystal structures of challenging membrane proteins, including G-protein coupled receptors (GPCRs), which often only produce microcrystals. An XFEL delivers highly intense X-ray pulses of femtosecond duration short enough to enable the collection of single diffraction images before significant radiation damage to crystals sets in. Here we report the deposition of the XFEL data and provide further details on crystallization, XFEL data collection and analysis, structure determination, and the validation of the structural model. The rhodopsin-arrestin crystal structure solved with SFX represents the first near-atomic resolution structure of a GPCR-arrestin complex, provides structural insights into understanding of arrestin-mediated GPCR signaling, and demonstrates the great potential of this SFX-XFEL technology for accelerating crystal structure determination of challenging proteins and protein complexes. PMID:27070998
Structural basis for the fast maturation of Arthropoda green fluorescent protein
Evdokimov, Artem G; Pokross, Matthew E; Egorov, Nikolay S; Zaraisky, Andrey G; Yampolsky, Ilya V; Merzlyak, Ekaterina M; Shkoporov, Andrey N; Sander, Ian; Lukyanov, Konstantin A; Chudakov, Dmitriy M
2006-01-01
Since the cloning of Aequorea victoria green fluorescent protein (GFP) in 1992, a family of known GFP-like proteins has been growing rapidly. Today, it includes more than a hundred proteins with different spectral characteristics cloned from Cnidaria species. For some of these proteins, crystal structures have been solved, showing diversity in chromophore modifications and conformational states. However, we are still far from a complete understanding of the origin, functions and evolution of the GFP family. Novel proteins of the family were recently cloned from evolutionarily distant marine Copepoda species, phylum Arthropoda, demonstrating an extremely rapid generation of fluorescent signal. Here, we have generated a non-aggregating mutant of Copepoda fluorescent protein and solved its high-resolution crystal structure. It was found that the protein β-barrel contains a pore, leading to the chromophore. Using site-directed mutagenesis, we showed that this feature is critical for the fast maturation of the chromophore. PMID:16936637
NCI Scientists Solve Structure of Protein that Enables MERS Virus to Spread | Poster
Scientists at the Frederick National Lab have produced three crystal structures that reveal a specific part of a protein that can be targeted to fight the Middle East respiratory syndrome coronavirus (MERS-CoV), which causes an emerging viral respiratory illness. Senior Investigator David Waugh, Ph.D., Macromolecular Crystallography Laboratory, has solved the structure of an enzyme known as the 3C-like protease (3CLpro), which, if blocked, can prevent the virus from replicating...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jacques, David A.; Streamer, Margaret; Rowland, Susan L.
2009-06-01
The crystal structure of Sda, a DNA-replication/damage checkpoint inhibitor of sporulation in B. subtilis, has been solved via the MAD method. The subunit arrangement in the crystal has enabled a reappraisal of previous biophysical data, resulting in a new model for the behaviour of the protein in solution. The crystal structure of the DNA-damage checkpoint inhibitor of sporulation, Sda, from Bacillus subtilis, has been solved by the MAD technique using selenomethionine-substituted protein. The structure closely resembles that previously solved by NMR, as well as the structure of a homologue from Geobacillus stearothermophilus solved in complex with the histidine kinase KinB. The structure contains three molecules in the asymmetric unit. The unusual trimeric arrangement, which lacks simple internal symmetry, appears to be preserved in solution based on an essentially ideal fit to previously acquired scattering data for Sda in solution. This interpretation contradicts previous findings that Sda was monomeric or dimeric in solution. This study demonstrates the difficulties that can be associated with the characterization of small proteins and the value of combining multiple biophysical techniques. It also emphasizes the importance of understanding the physical principles behind these techniques and therefore their limitations.
Automated de novo phasing and model building of coiled-coil proteins.
Rämisch, Sebastian; Lizatović, Robert; André, Ingemar
2015-03-01
Models generated by de novo structure prediction can be very useful starting points for molecular replacement for systems where suitable structural homologues cannot be readily identified. Protein-protein complexes and de novo-designed proteins are examples of systems that can be challenging to phase. In this study, the potential of de novo models of protein complexes for use as starting points for molecular replacement is investigated. The approach is demonstrated using homomeric coiled-coil proteins, which are excellent model systems for oligomeric systems. Despite the stereotypical fold of coiled coils, initial phase estimation can be difficult and many structures have to be solved with experimental phasing. A method was developed for automatic structure determination of homomeric coiled coils from X-ray diffraction data. In a benchmark set of 24 coiled coils, ranging from dimers to pentamers with resolutions down to 2.5 Å, 22 systems were automatically solved, 11 of which had previously been solved by experimental phasing. The generated models contained 71-103% of the residues present in the deposited structures, had the correct sequence and had free R values that deviated on average by 0.01 from those of the respective reference structures. The electron-density maps were of sufficient quality that only minor manual editing was necessary to produce final structures. The method, named CCsolve, combines methods for de novo structure prediction, initial phase estimation and automated model building into one pipeline. CCsolve is robust against errors in the initial models and can readily be modified to make use of alternative crystallographic software. The results demonstrate the feasibility of de novo phasing of protein-protein complexes, an approach that could also be employed for other small systems beyond coiled coils.
The use of experimental structures to model protein dynamics.
Katebi, Ataur R; Sankar, Kannan; Jia, Kejue; Jernigan, Robert L
2015-01-01
The number of solved protein structures deposited in the Protein Data Bank (PDB) has increased dramatically in recent years. For some specific proteins, this number is very high: for example, there are over 550 solved structures for HIV-1 protease, one protein that is essential for the life cycle of human immunodeficiency virus (HIV), which causes acquired immunodeficiency syndrome (AIDS) in humans. The large number of structures for the same protein and its variants includes a sample of different conformational states of the protein. A rich set of structures solved experimentally for the same protein has information buried within the dataset that can explain the functional dynamics and structural mechanism of the protein. To extract the dynamics information and functional mechanism from the experimental structures, this chapter focuses on two methods: Principal Component Analysis (PCA) and Elastic Network Models (ENM). PCA is a widely used statistical dimensionality reduction technique to classify and visualize high-dimensional data. On the other hand, ENMs are well-established, simple biophysical methods for modeling the functionally important global motions of proteins. This chapter covers the basics of these two. Moreover, an improved ENM version that utilizes the variations found within a given set of structures for a protein is described. As a practical example, we have extracted the functional dynamics and mechanism of HIV-1 protease dimeric structure by using a set of 329 PDB structures of this protein. We have described, step by step, how to select a set of protein structures, how to extract the needed information from the PDB files for PCA, how to extract the dynamics information using PCA, how to calculate ENM modes, how to measure the congruency between the dynamics computed from the principal components (PCs) and the ENM modes, and how to compute entropies using the PCs.
We provide the computer programs or references to software tools to accomplish each step and show how to use these programs and tools. We also include computer programs to generate movies based on PCs and ENM modes and describe how to visualize them.
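As a rough illustration of the PCA step described in this abstract, the sketch below extracts the dominant motion from a tiny ensemble of pre-superposed structures. It is not the chapter's software: the toy coordinates, the function names, and the use of power iteration in place of a full eigendecomposition are all assumptions made to keep the example dependency-free.

```python
import math

def center(rows):
    """Subtract the ensemble mean from every flattened coordinate."""
    n = len(rows)
    means = [sum(col) / n for col in zip(*rows)]
    return [[v - m for v, m in zip(row, means)] for row in rows]

def covariance(centered):
    """Sample covariance matrix of the centered coordinates."""
    n, d = len(centered), len(centered[0])
    return [[sum(r[i] * r[j] for r in centered) / (n - 1) for j in range(d)]
            for i in range(d)]

def first_principal_component(cov, iterations=100):
    """Dominant eigenvalue and eigenvector of cov by power iteration."""
    d = len(cov)
    # Uneven start vector, to lower the chance of starting orthogonal to the mode.
    vec = [1.0 / (i + 1) for i in range(d)]
    for _ in range(iterations):
        new = [sum(cov[i][j] * vec[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in new))
        vec = [x / norm for x in new]
    eigenvalue = sum(vec[i] * sum(cov[i][j] * vec[j] for j in range(d))
                     for i in range(d))
    return eigenvalue, vec

# Hypothetical ensemble: four structures, two atoms each, flattened as
# [x1, y1, z1, x2, y2, z2]; the two atoms move apart along x.
ensemble = [
    [0.0, 0.0, 0.0, 1.0, 0.0, 0.0],
    [-0.5, 0.0, 0.0, 1.5, 0.0, 0.0],
    [-1.0, 0.0, 0.0, 2.0, 0.0, 0.0],
    [-1.5, 0.0, 0.0, 2.5, 0.0, 0.0],
]

cov = covariance(center(ensemble))
variance, mode = first_principal_component(cov)
fraction = variance / sum(cov[i][i] for i in range(len(cov)))
```

Here `mode` has opposite-signed x components for the two atoms, i.e. an opening motion, and `fraction` is the share of total variance that the first PC captures. A real ensemble would first need structural superposition and a consistent atom selection, which this sketch takes for granted.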
The Use of Experimental Structures to Model Protein Dynamics
Katebi, Ataur R.; Sankar, Kannan; Jia, Kejue; Jernigan, Robert L.
2014-01-01
Summary: The number of solved protein structures deposited in the Protein Data Bank (PDB) has increased dramatically in recent years. For some specific proteins, this number is very high – for example, there are over 550 solved structures for HIV-1 protease, one protein that is essential for the life cycle of human immunodeficiency virus (HIV), which causes acquired immunodeficiency syndrome (AIDS) in humans. The large number of structures for the same protein and its variants includes a sample of different conformational states of the protein. A rich set of structures solved experimentally for the same protein has information buried within the dataset that can explain the functional dynamics and structural mechanism of the protein. To extract the dynamics information and functional mechanism from the experimental structures, this chapter focuses on two methods – Principal Component Analysis (PCA) and Elastic Network Models (ENM). PCA is a widely used statistical dimensionality reduction technique to classify and visualize high-dimensional data. On the other hand, ENMs are well-established, simple biophysical methods for modeling the functionally important global motions of proteins. This chapter covers the basics of these two. Moreover, an improved ENM version that utilizes the variations found within a given set of structures for a protein is described. As a practical example, we have extracted the functional dynamics and mechanism of HIV-1 protease dimeric structure by using a set of 329 PDB structures of this protein. We have described, step by step, how to select a set of protein structures, how to extract the needed information from the PDB files for PCA, how to extract the dynamics information using PCA, how to calculate ENM modes, how to measure the congruency between the dynamics computed from the principal components (PCs) and the ENM modes, and how to compute entropies using the PCs.
We provide the computer programs or references to software tools to accomplish each step and show how to use these programs and tools. We also include computer programs to generate movies based on PCs and ENM modes and describe how to visualize them. PMID:25330965
A Practical Approach to Protein Crystallography.
Ilari, Andrea; Savino, Carmelinda
2017-01-01
Macromolecular crystallography is a powerful tool for structural biology. The resolution of a protein crystal structure is becoming much easier than in the past, thanks to developments in computing, automation of crystallization techniques and high-flux synchrotron sources to collect diffraction datasets. The aim of this chapter is to provide practical procedures to determine a protein crystal structure, illustrating the new techniques, experimental methods, and software that have made protein crystallography a tool accessible to a larger scientific community. It is impossible to give more than a taste of what the X-ray crystallographic technique entails in one brief chapter and there are different ways to solve a protein structure. Since the number of structures available in the Protein Data Bank (PDB) is becoming ever larger (the PDB now contains more than 100,000 entries) and therefore the probability of finding a good model to solve the structure is ever increasing, we focus our attention on the Molecular Replacement method. Indeed, whenever applicable, this method allows the resolution of macromolecular structures starting from a single data set and a search model downloaded from the PDB, with the aid only of computer work.
Xu, Xianzhong; Pulavarti, Surya V S R K; Eletsky, Alexander; Huang, Yuanpeng Janet; Acton, Thomas B; Xiao, Rong; Everett, John K; Montelione, Gaetano T; Szyperski, Thomas
2014-12-01
High-quality solution NMR structures of three homeodomains from human proteins ALX4, ZHX1 and CASP8AP2 were solved. These domains were chosen as targets of a biomedical theme project pursued by the Northeast Structural Genomics Consortium. This project focuses on increasing the structural coverage of human proteins associated with cancer.
Structure based alignment and clustering of proteins (STRALCP)
Zemla, Adam T.; Zhou, Carol E.; Smith, Jason R.; Lam, Marisa W.
2013-06-18
Disclosed are computational methods of clustering a set of protein structures based on local and pair-wise global similarity values. Pair-wise local and global similarity values are generated based on pair-wise structural alignments for each protein in the set of protein structures. Initially, the protein structures are clustered based on pair-wise local similarity values. The protein structures are then clustered based on pair-wise global similarity values. For each given cluster both a representative structure and spans of conserved residues are identified. The representative protein structure is used to assign newly-solved protein structures to a group. The spans are used to characterize conservation and assign a "structural footprint" to the cluster.
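The two-stage procedure in this abstract (group by local similarity, then by global similarity, then pick a representative) can be sketched as follows. This is a generic illustration only: the single-linkage thresholding, the cutoffs, the rule for choosing a representative and the toy scores are all assumptions, and none of it is taken from the disclosed method.

```python
def threshold_clusters(items, similarity, cutoff):
    """Single-linkage grouping: link two items when similarity >= cutoff."""
    parent = {item: item for item in items}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    for i, a in enumerate(items):
        for b in items[i + 1:]:
            if similarity(a, b) >= cutoff:
                parent[find(a)] = find(b)

    groups = {}
    for item in items:
        groups.setdefault(find(item), []).append(item)
    return list(groups.values())

def two_stage_clusters(items, local_sim, global_sim, local_cutoff, global_cutoff):
    """Cluster on local similarity first, then split each cluster on global."""
    result = []
    for coarse in threshold_clusters(items, local_sim, local_cutoff):
        result.extend(threshold_clusters(coarse, global_sim, global_cutoff))
    return result

def representative(cluster, similarity):
    """Member with the highest summed similarity to the other members."""
    return max(cluster, key=lambda a: sum(similarity(a, b) for b in cluster if b != a))

# Hypothetical pairwise scores (0-100) for four structures.
LOCAL = {frozenset("AB"): 90, frozenset("AC"): 85, frozenset("BC"): 88,
         frozenset("AD"): 20, frozenset("BD"): 20, frozenset("CD"): 20}
GLOBAL = {frozenset("AB"): 80, frozenset("AC"): 40, frozenset("BC"): 45,
          frozenset("AD"): 10, frozenset("BD"): 10, frozenset("CD"): 10}

def local_sim(a, b):
    return LOCAL[frozenset((a, b))]

def global_sim(a, b):
    return GLOBAL[frozenset((a, b))]

clusters = two_stage_clusters(["A", "B", "C", "D"], local_sim, global_sim, 70, 60)
rep = representative(["A", "B", "C"], global_sim)
```

In this toy case A, B and C share a local motif, but only A and B are globally similar, so the second stage separates C. A newly solved structure could then be assigned to a group by comparing it against each cluster's representative.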
Keegan, Ronan M; Bibby, Jaclyn; Thomas, Jens; Xu, Dong; Zhang, Yang; Mayans, Olga; Winn, Martyn D; Rigden, Daniel J
2015-02-01
AMPLE clusters and truncates ab initio protein structure predictions, producing search models for molecular replacement. Here, an interesting degree of complementarity is shown between targets solved using the different ab initio modelling programs QUARK and ROSETTA. Search models derived from either program collectively solve almost all of the all-helical targets in the test set. Initial solutions produced by Phaser after only 5 min perform surprisingly well, improving the prospects for in situ structure solution by AMPLE during synchrotron visits. Taken together, the results show the potential for AMPLE to run more quickly and successfully solve more targets than previously suspected.
Resource for structure related information on transmembrane proteins
NASA Astrophysics Data System (ADS)
Tusnády, Gábor E.; Simon, István
Transmembrane proteins are involved in a wide variety of vital biological processes including transport of water-soluble molecules, flow of information and energy production. Despite significant efforts to determine the structures of these proteins, only a few thousand solved structures are known so far. Here, we review the various resources for structure-related information on these types of proteins ranging from the 3D structure to the topology and from the up-to-date databases to the various Internet sites and servers dealing with structure prediction and structure analysis. Abbreviations: 3D, three dimensional; PDB, Protein Data Bank; TMP, transmembrane protein.
Chakravorty, Arghya; Jia, Zhe; Li, Lin; Zhao, Shan; Alexov, Emil
2018-02-13
Typically, the ensemble average polar component of solvation energy ($\Delta G^{\mathrm{polar}}_{\mathrm{solv}}$) of a macromolecule is computed using molecular dynamics (MD) or Monte Carlo (MC) simulations to generate a conformational ensemble, and then a single/rigid conformation solvation energy calculation is performed on each snapshot. The primary objective of this work is to demonstrate that a Poisson-Boltzmann (PB)-based approach using a Gaussian-based smooth dielectric function for macromolecular modeling previously developed by us (Li et al. J. Chem. Theory Comput. 2013, 9 (4), 2126-2136) can reproduce that ensemble average ($\Delta G^{\mathrm{polar}}_{\mathrm{solv}}$) of a protein from a single structure. We show that the Gaussian-based dielectric model reproduces the ensemble average $\Delta G^{\mathrm{polar}}_{\mathrm{solv}}$ ($\langle \Delta G^{\mathrm{polar}}_{\mathrm{solv}} \rangle$) from an energy-minimized structure of a protein regardless of the minimization environment (structure minimized in vacuo, implicit or explicit waters, or crystal structure); the best case, however, is when it is paired with an in vacuo-minimized structure. In other minimization environments (implicit or explicit waters or crystal structure), the traditional two-dielectric model can still be selected with which the model produces correct solvation energies. Our observations from this work reflect how the ability to appropriately mimic the motion of residues, especially the salt bridge residues, influences a dielectric model's ability to reproduce the ensemble average value of polar solvation free energy from a single in vacuo-minimized structure.
Tactile Teaching: Exploring Protein Structure/Function Using Physical Models
ERIC Educational Resources Information Center
Herman, Tim; Morris, Jennifer; Colton, Shannon; Batiza, Ann; Patrick, Michael; Franzen, Margaret; Goodsell, David S.
2006-01-01
The technology now exists to construct physical models of proteins based on atomic coordinates of solved structures. We review here our recent experiences in using physical models to teach concepts of protein structure and function at both the high school and the undergraduate levels. At the high school level, physical models are used in a…
DOE Office of Scientific and Technical Information (OSTI.GOV)
Brynes, Laura; /Rensselaer Poly.
2007-10-31
Guiana Extended-Spectrum-1 (GES-1) and Aminoglycoside phosphotransferase (2')-Ic (APH(2')-Ic) are two bacteria-produced enzymes that essentially perform the same task: they provide resistance to an array of antibiotics. Both enzymes are part of a growing resistance problem in the medical world. In order to overcome the ever-growing arsenal of antibiotic-resistance enzymes, it is necessary to understand the molecular basis of their action. Accurate structures of these proteins have become an invaluable tool to do this. Using protein crystallography techniques and X-ray diffraction, the protein structure of GES-1 bound to imipenem (an inhibitor) has been solved. Also, APH(2')-Ic has been successfully crystallized, but its structure could not be solved by molecular replacement using APH(2')-Ib as a search model. The structure of GES-1 with bound imipenem was solved to a resolution of 1.89 Å, and though the inhibitor is bound with only moderate occupancy, the structure shows crucial interactions inside the active site that render the enzyme unable to complete the hydrolysis of the β-lactam ring. The APH(2')-Ic dataset could not be matched to the model, APH(2')-Ib, with which it shares 25% sequence identity. The structural information gained from GES-1, and future studies using isomorphous replacement to solve the APH(2')-Ic structure, can aid directly in the creation of novel drugs to combat both of these classes of resistance enzymes.
Overcoming barriers to membrane protein structure determination.
Bill, Roslyn M; Henderson, Peter J F; Iwata, So; Kunji, Edmund R S; Michel, Hartmut; Neutze, Richard; Newstead, Simon; Poolman, Bert; Tate, Christopher G; Vogel, Horst
2011-04-01
After decades of slow progress, the pace of research on membrane protein structures is beginning to quicken thanks to various improvements in technology, including protein engineering and microfocus X-ray diffraction. Here we review these developments and, where possible, highlight generic new approaches to solving membrane protein structures based on recent technological advances. Rational approaches to overcoming the bottlenecks in the field are urgently required as membrane proteins, which typically comprise ~30% of the proteomes of organisms, are dramatically under-represented in the structural database of the Protein Data Bank.
High-throughput Cloning and Expression of Integral Membrane Proteins in Escherichia coli
Bruni, Renato
2014-01-01
Recently, several structural genomics centers have been established and a remarkable number of three-dimensional structures of soluble proteins have been solved. For membrane proteins, the number of structures solved has been significantly trailing those for their soluble counterparts, not least because over-expression and purification of membrane proteins is a much more arduous process. By using high throughput technologies, a large number of membrane protein targets can be screened simultaneously and a greater number of expression and purification conditions can be employed, leading to a higher probability of successfully determining the structure of membrane proteins. This unit describes the cloning, expression and screening of membrane proteins using high throughput methodologies developed in our laboratory. Basic Protocol 1 deals with the cloning of inserts into expression vectors by ligation-independent cloning. Basic Protocol 2 describes the expression and purification of the target proteins on a miniscale. Lastly, for the targets that express at the miniscale, basic protocols 3 and 4 outline the methods employed for the expression and purification of targets at the midi-scale, as well as a procedure for detergent screening and identification of detergent(s) in which the target protein is stable. PMID:24510647
Keegan, Ronan M.; Bibby, Jaclyn; Thomas, Jens; Xu, Dong; Zhang, Yang; Mayans, Olga; Winn, Martyn D.; Rigden, Daniel J.
2015-01-01
AMPLE clusters and truncates ab initio protein structure predictions, producing search models for molecular replacement. Here, an interesting degree of complementarity is shown between targets solved using the different ab initio modelling programs QUARK and ROSETTA. Search models derived from either program collectively solve almost all of the all-helical targets in the test set. Initial solutions produced by Phaser after only 5 min perform surprisingly well, improving the prospects for in situ structure solution by AMPLE during synchrotron visits. Taken together, the results show the potential for AMPLE to run more quickly and successfully solve more targets than previously suspected. PMID:25664744
The First Mammalian Aldehyde Oxidase Crystal Structure
Coelho, Catarina; Mahro, Martin; Trincão, José; Carvalho, Alexandra T. P.; Ramos, Maria João; Terao, Mineko; Garattini, Enrico; Leimkühler, Silke; Romão, Maria João
2012-01-01
Aldehyde oxidases (AOXs) are homodimeric proteins belonging to the xanthine oxidase family of molybdenum-containing enzymes. Each 150-kDa monomer contains a FAD redox cofactor, two spectroscopically distinct [2Fe-2S] clusters, and a molybdenum cofactor located within the protein active site. AOXs are characterized by broad range substrate specificity, oxidizing different aldehydes and aromatic N-heterocycles. Despite increasing recognition of its role in the metabolism of drugs and xenobiotics, the physiological function of the protein is still largely unknown. We have crystallized and solved the crystal structure of mouse liver aldehyde oxidase 3 to 2.9 Å. This is the first mammalian AOX whose structure has been solved. The structure provides important insights into the protein active center and further evidence on the catalytic differences characterizing AOX and xanthine oxidoreductase. The mouse liver aldehyde oxidase 3 three-dimensional structure, combined with kinetic and mutagenesis data, molecular docking, and molecular dynamics studies, makes a decisive contribution to understanding the molecular basis of its rather broad substrate specificity. PMID:23019336
Website on Protein Interaction and Protein Structure Related Work
NASA Technical Reports Server (NTRS)
Samanta, Manoj; Liang, Shoudan; Biegel, Bryan (Technical Monitor)
2003-01-01
In today's world, three seemingly diverse fields - computer information technology, nanotechnology and biotechnology - are joining forces to enlarge our scientific knowledge and solve complex technological problems. Our group is dedicated to conducting theoretical research exploring the challenges in this area. The major areas of research include: 1) Yeast Protein Interactions; 2) Protein Structures; and 3) Current Transport through Small Molecules.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sliwiak, Joanna; Jaskolski, Mariusz, E-mail: mariuszj@amu.edu.pl; A. Mickiewicz University, Grunwaldzka 6, 60-780 Poznan
With the implementation of a molecular-replacement likelihood target that accounts for translational noncrystallographic symmetry, it became possible to solve the crystal structure of a protein with seven tetrameric assemblies arrayed translationally along the c axis. The new algorithm found 56 protein molecules in reduced symmetry (P1), which was used to resolve space-group ambiguity caused by severe twinning. Translational noncrystallographic symmetry (tNCS) is a pathology of protein crystals in which multiple copies of a molecule or assembly are found in similar orientations. Structure solution is problematic because this breaks the assumptions used in current likelihood-based methods. To cope with such cases, new likelihood approaches have been developed and implemented in Phaser to account for the statistical effects of tNCS in molecular replacement. Using these new approaches, it was possible to solve the crystal structure of a protein exhibiting an extreme form of this pathology with seven tetrameric assemblies arrayed along the c axis. To resolve space-group ambiguities caused by tetartohedral twinning, the structure was initially solved by placing 56 copies of the monomer in space group P1 and using the symmetry of the solution to define the true space group, C2. The resulting structure of Hyp-1, a pathogenesis-related class 10 (PR-10) protein from the medicinal herb St John’s wort, reveals the binding modes of the fluorescent probe 8-anilino-1-naphthalene sulfonate (ANS), providing insight into the function of the protein in binding or storing hydrophobic ligands.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jacques, David A.; Streamer, Margaret; Rowland, Susan L.
2009-09-02
The crystal structure of the DNA-damage checkpoint inhibitor of sporulation, Sda, from Bacillus subtilis, has been solved by the MAD technique using selenomethionine-substituted protein. The structure closely resembles that previously solved by NMR, as well as the structure of a homologue from Geobacillus stearothermophilus solved in complex with the histidine kinase KinB. The structure contains three molecules in the asymmetric unit. The unusual trimeric arrangement, which lacks simple internal symmetry, appears to be preserved in solution based on an essentially ideal fit to previously acquired scattering data for Sda in solution. This interpretation contradicts previous findings that Sda was monomeric or dimeric in solution. This study demonstrates the difficulties that can be associated with the characterization of small proteins and the value of combining multiple biophysical techniques. It also emphasizes the importance of understanding the physical principles behind these techniques and therefore their limitations.
Functional and genomic analyses of alpha-solenoid proteins.
Fournier, David; Palidwor, Gareth A; Shcherbinin, Sergey; Szengel, Angelika; Schaefer, Martin H; Perez-Iratxeta, Carol; Andrade-Navarro, Miguel A
2013-01-01
Alpha-solenoids are flexible protein structural domains formed by ensembles of alpha-helical repeats (Armadillo and HEAT repeats among others). While homology can be used to detect many of these repeats, some alpha-solenoids have very little sequence homology to proteins of known structure and we expect that many remain undetected. We previously developed a method for detection of alpha-helical repeats based on a neural network trained on a dataset of protein structures. Here we improved the detection algorithm and updated the training dataset using recently solved structures of alpha-solenoids. Unexpectedly, we identified occurrences of alpha-solenoids in solved protein structures that escaped attention, for example within the core of the catalytic subunit of PI3KC. Our results expand the current set of known alpha-solenoids. Application of our tool to the protein universe allowed us to detect their significant enrichment in proteins interacting with many proteins, confirming that alpha-solenoids are generally involved in protein-protein interactions. We then studied the taxonomic distribution of alpha-solenoids to discuss an evolutionary scenario for the emergence of this type of domain, speculating that alpha-solenoids have emerged in multiple taxa in independent events by convergent evolution. We observe a higher rate of alpha-solenoids in eukaryotic genomes and in some prokaryotic families, such as Cyanobacteria and Planctomycetes, which could be associated to increased cellular complexity. The method is available at http://cbdm.mdc-berlin.de/~ard2/.
Proteins Are the Body's Worker Molecules
... molecular structures. Many of these new technologies are robots that automate previously labor-intensive steps in structure determination. Thanks to these robots, it is possible to solve structures faster than ...
Shen, Hong-Bin; Yi, Dong-Liang; Yao, Li-Xiu; Yang, Jie; Chou, Kuo-Chen
2008-10-01
In the postgenomic age, with the avalanche of protein sequences generated and relatively slow progress in determining their structures by experiments, it is important to develop automated methods to predict the structure of a protein from its sequence. The membrane proteins are a special group in the protein family that accounts for approximately 30% of all proteins; however, solved membrane protein structures only represent less than 1% of known protein structures to date. Although a great success has been achieved for developing computational intelligence techniques to predict secondary structures in both globular and membrane proteins, there is still much challenging work in this regard. In this review article, we firstly summarize the recent progress of automation methodology development in predicting protein secondary structures, especially in membrane proteins; we will then give some future directions in this research field.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chang, C.; Coggill, P.; Bateman, A.
Many Gram-positive lactic acid bacteria (LAB) produce anti-bacterial peptides and small proteins called bacteriocins, which enable them to compete against other bacteria in the environment. These peptides fall structurally into three different classes, I, II, III, with class IIa being pediocin-like single entities and class IIb being two-peptide bacteriocins. Self-protective cognate immunity proteins are usually co-transcribed with these toxins. Several examples of cognates for IIa have already been solved structurally. Streptococcus pyogenes, closely related to LAB, is one of the most common human pathogens, so knowledge of how it competes against other LAB species is likely to prove invaluable. We have solved the crystal structure of the gene-product of locus Spy-2152 from S. pyogenes, (PDB: 2fu2), and found it to comprise an anti-parallel four-helix bundle that is structurally similar to other bacteriocin immunity proteins. Sequence analyses indicate this protein to be a possible immunity protein protective against class IIa or IIb bacteriocins. However, given that S. pyogenes appears to lack any IIa pediocin-like proteins but does possess class IIb bacteriocins, we suggest this protein confers immunity to IIb-like peptides. Combined structural, genomic and proteomic analyses have allowed the identification and in silico characterization of a new putative immunity protein from S. pyogenes, possibly the first structure of an immunity protein protective against potential class IIb two-peptide bacteriocins. We have named the two pairs of putative bacteriocins found in S. pyogenes pyogenecin 1, 2, 3 and 4.
Fusion proteins as alternate crystallization paths to difficult structure problems
NASA Technical Reports Server (NTRS)
Carter, Daniel C.; Rueker, Florian; Ho, Joseph X.; Lim, Kap; Keeling, Kim; Gilliland, Gary; Ji, Xinhua
1994-01-01
The three-dimensional structure of a peptide fusion product with glutathione transferase from Schistosoma japonicum (SjGST) has been solved by crystallographic methods to 2.5 Å resolution. Peptides or proteins can be fused to SjGST and expressed in a plasmid for rapid synthesis in Escherichia coli. Fusion proteins created by this commercial method can be purified rapidly by chromatography on immobilized glutathione. The potential utility of using SjGST fusion proteins as alternate paths to the crystallization and structure determination of proteins is demonstrated.
Moghadasi, Mohammad; Kozakov, Dima; Mamonov, Artem B.; Vakili, Pirooz; Vajda, Sandor; Paschalidis, Ioannis Ch.
2013-01-01
We introduce a message-passing algorithm to solve the Side Chain Positioning (SCP) problem. SCP is a crucial component of protein docking refinement, which is a key step of an important class of problems in computational structural biology called protein docking. We model SCP as a combinatorial optimization problem and formulate it as a Maximum Weighted Independent Set (MWIS) problem. We then employ a modified and convergent belief-propagation algorithm to solve a relaxation of MWIS and develop randomized estimation heuristics that use the relaxed solution to obtain an effective MWIS feasible solution. Using a benchmark set of protein complexes we demonstrate that our approach leads to more accurate docking predictions compared to a baseline algorithm that does not solve the SCP. PMID:23515575
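The MWIS formulation mentioned in this abstract can be illustrated with a toy sketch. The abstract does not give the graph construction, so the following is an assumption: each node is a hypothetical (residue, rotamer) pair with a weight, and an edge joins two nodes that cannot both be chosen (two rotamers of the same residue, or two rotamers that clash). The solver shown is a plain greedy heuristic, not the belief-propagation relaxation the authors describe; it is included only to make the independent-set constraint concrete.

```python
def greedy_mwis(weights, edges):
    """Greedy heuristic for Maximum Weighted Independent Set.

    weights: dict mapping node -> weight (higher is better).
    edges:   iterable of (u, v) pairs that may not both be selected.
    Returns the selected nodes in the order they were picked.
    Note: this is a simple baseline, not guaranteed to be optimal.
    """
    # Build an adjacency map so conflicts can be looked up quickly.
    adjacency = {node: set() for node in weights}
    for u, v in edges:
        adjacency[u].add(v)
        adjacency[v].add(u)

    chosen, blocked = [], set()
    # Visit nodes from heaviest to lightest (ties broken by node label).
    for node in sorted(weights, key=lambda n: (-weights[n], n)):
        if node in blocked:
            continue
        chosen.append(node)
        blocked.add(node)
        blocked |= adjacency[node]  # neighbours can no longer be selected
    return chosen


# Hypothetical toy instance: two residues with two rotamers each.
weights = {("R1", 0): 3.0, ("R1", 1): 2.0, ("R2", 0): 2.5, ("R2", 1): 1.0}
edges = [
    (("R1", 0), ("R1", 1)),  # one rotamer per residue
    (("R2", 0), ("R2", 1)),  # one rotamer per residue
    (("R1", 0), ("R2", 0)),  # assumed steric clash
]
selection = greedy_mwis(weights, edges)
```

On this instance the greedy choice has total weight 4.0, while the pair ("R1", 1) and ("R2", 0) would reach 4.5, which is one reason a relaxation-based method with better guarantees may be preferable to a naive heuristic.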
Brodie, Nicholas I; Popov, Konstantin I; Petrotchenko, Evgeniy V; Dokholyan, Nikolay V; Borchers, Christoph H
2017-07-01
We present an integrated experimental and computational approach for de novo protein structure determination in which short-distance cross-linking data are incorporated into rapid discrete molecular dynamics (DMD) simulations as constraints, reducing the conformational space and achieving the correct protein folding on practical time scales. We tested our approach on myoglobin and FK506 binding protein (models for α helix-rich and β sheet-rich proteins, respectively) and found that the lowest-energy structures obtained were in agreement with the crystal structure, hydrogen-deuterium exchange, surface modification, and long-distance cross-linking validation data. Our approach is readily applicable to other proteins with unknown structures.
Functional and Genomic Analyses of Alpha-Solenoid Proteins
Fournier, David; Palidwor, Gareth A.; Shcherbinin, Sergey; Szengel, Angelika; Schaefer, Martin H.; Perez-Iratxeta, Carol; Andrade-Navarro, Miguel A.
2013-01-01
Alpha-solenoids are flexible protein structural domains formed by ensembles of alpha-helical repeats (Armadillo and HEAT repeats among others). While homology can be used to detect many of these repeats, some alpha-solenoids have very little sequence homology to proteins of known structure and we expect that many remain undetected. We previously developed a method for detection of alpha-helical repeats based on a neural network trained on a dataset of protein structures. Here we improved the detection algorithm and updated the training dataset using recently solved structures of alpha-solenoids. Unexpectedly, we identified occurrences of alpha-solenoids in solved protein structures that escaped attention, for example within the core of the catalytic subunit of PI3KC. Our results expand the current set of known alpha-solenoids. Application of our tool to the protein universe allowed us to detect their significant enrichment in proteins interacting with many proteins, confirming that alpha-solenoids are generally involved in protein-protein interactions. We then studied the taxonomic distribution of alpha-solenoids to discuss an evolutionary scenario for the emergence of this type of domain, speculating that alpha-solenoids have emerged in multiple taxa in independent events by convergent evolution. We observe a higher rate of alpha-solenoids in eukaryotic genomes and in some prokaryotic families, such as Cyanobacteria and Planctomycetes, which could be associated to increased cellular complexity. The method is available at http://cbdm.mdc-berlin.de/~ard2/. PMID:24278209
Al Nasr, Kamal; Ranjan, Desh; Zubair, Mohammad; Chen, Lin; He, Jing
2014-01-01
Electron cryomicroscopy is becoming a major experimental technique in solving the structures of large molecular assemblies. More and more three-dimensional images have been obtained at the medium resolutions between 5 and 10 Å. At this resolution range, major α-helices can be detected as cylindrical sticks and β-sheets can be detected as plane-like regions. A critical question in de novo modeling from cryo-EM images is to determine the match between the detected secondary structures from the image and those on the protein sequence. We formulate this matching problem as a constrained graph problem and present an O(Δ^2 N^2 2^N) algorithm for this NP-hard problem. The algorithm incorporates the dynamic programming approach into a constrained K-shortest path algorithm. Our method, DP-TOSS, has been tested using α-proteins with a maximum of 33 helices and α-β proteins with up to five helices and 12 β-strands. The correct match was ranked within the top 35 for 19 of the 20 α-proteins and all nine α-β proteins tested. The results demonstrate that DP-TOSS improves accuracy, time and memory space in deriving the topologies of the secondary structure elements for proteins with a large number of secondary structures and a complex skeleton.
A simple and fast heuristic for protein structure comparison.
Pelta, David A; González, Juan R; Moreno Vega, Marcos
2008-03-25
Protein structure comparison is a key problem in bioinformatics. Several methods exist for comparing proteins, one of which is solving the Maximum Contact Map Overlap problem (MAX-CMO). Although this problem may be solved using exact algorithms, researchers require approximate algorithms that obtain good-quality solutions using fewer computational resources than exact ones. We propose a variable neighborhood search metaheuristic for solving MAX-CMO. We analyze this strategy in two aspects: 1) from an optimization point of view, the strategy is tested on two different datasets, obtaining an error of 3.5% (over 2702 pairs) and 1.7% (over 161 pairs) with respect to optimal values, thus leading to highly accurate solutions in a simpler and less expensive way than exact algorithms; 2) in terms of protein structure classification, we conduct experiments on three datasets and show that it is feasible to detect structural similarities at SCOP's family and CATH's architecture levels using normalized overlap values. Some limitations and the role of normalization are outlined for doing classification at SCOP's fold level. We designed, implemented and tested a new tool for solving MAX-CMO, based on a well-known metaheuristic technique. The good balance between solution quality and computational effort makes it a valuable tool. Moreover, to the best of our knowledge, this is the first time the MAX-CMO measure is tested at SCOP's fold and CATH's architecture levels with encouraging results.
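To make the objective in this abstract concrete, the sketch below scores one candidate alignment by counting the contacts it preserves between two contact maps. It does not search for the best alignment, which is the hard part that the variable neighborhood search addresses. The distance threshold, the function names, and the normalization by the smaller contact count are assumptions made for illustration; the abstract does not specify which normalization the authors use.

```python
from itertools import combinations


def contact_map(coords, threshold=7.5):
    """Return the set of residue index pairs (i, j), i < j, whose
    representative atoms lie within `threshold` of each other.
    The threshold value is an assumed default."""
    contacts = set()
    for i, j in combinations(range(len(coords)), 2):
        dist = sum((a - b) ** 2 for a, b in zip(coords[i], coords[j])) ** 0.5
        if dist <= threshold:
            contacts.add((i, j))
    return contacts


def overlap(contacts_a, contacts_b, alignment):
    """Count contacts of protein A that map onto contacts of protein B.

    alignment: dict mapping residue indices of A to residue indices of B
    (assumed to be one-to-one and order-preserving).
    """
    shared = 0
    for i, j in contacts_a:
        if i in alignment and j in alignment:
            a, b = alignment[i], alignment[j]
            if (min(a, b), max(a, b)) in contacts_b:
                shared += 1
    return shared


def normalized_overlap(contacts_a, contacts_b, alignment):
    """Overlap divided by the smaller contact count (one possible choice)."""
    denom = min(len(contacts_a), len(contacts_b))
    return overlap(contacts_a, contacts_b, alignment) / denom if denom else 0.0


# Hypothetical toy maps given directly as contact sets.
map_a = {(0, 1), (1, 2), (0, 2)}
map_b = {(0, 1), (1, 3)}
candidate = {0: 0, 1: 1, 2: 3}
score = overlap(map_a, map_b, candidate)
```

A metaheuristic for MAX-CMO would evaluate a function like `overlap` repeatedly while it perturbs the candidate alignment, keeping the alignment with the highest count.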
Calderone, V; Fragai, M; Gallo, G; Luchinat, C
2017-06-01
The X-ray structure of human apo-S100Z has been solved and compared with that of the zebrafish calcium-bound S100Z, which is the closest in sequence. Human apo-S100A12, which shows only 43% sequence identity to human S100Z, has been used as template model to solve the crystallographic phase problem. Although a significant buried surface area between the two physiological dimers is present in the asymmetric unit of human apo-S100Z, the protein does not form the superhelical arrangement in the crystal as observed for the zebrafish calcium-bound S100Z and human calcium-bound S100A4. These findings further demonstrate that calcium plays a fundamental role in triggering quaternary structure formation in several S100s. Solving the X-ray structure of human apo-S100Z by standard molecular replacement procedures turned out to be a challenge and required trying different models and different software tools, among which only one was successful. The model that allowed structure solution was the one with one of the lowest sequence identities with the target protein among the S100 family in the apo state. Based on the previously solved zebrafish holo-S100Z, a putative human holo-S100Z structure has then been calculated through homology modeling; the differences between the experimental human apo and calculated holo structure have been compared to those existing for other members of the family.
Extant fold-switching proteins are widespread.
Porter, Lauren L; Looger, Loren L
2018-06-05
A central tenet of biology is that globular proteins have a unique 3D structure under physiological conditions. Recent work has challenged this notion by demonstrating that some proteins switch folds, a process that involves remodeling of secondary structure in response to a few mutations (evolved fold switchers) or cellular stimuli (extant fold switchers). To date, extant fold switchers have been viewed as rare byproducts of evolution, but their frequency has been neither quantified nor estimated. By systematically and exhaustively searching the Protein Data Bank (PDB), we found ∼100 extant fold-switching proteins. Furthermore, we gathered multiple lines of evidence suggesting that these proteins are widespread in nature. Based on these lines of evidence, we hypothesized that the frequency of extant fold-switching proteins may be underrepresented by the structures in the PDB. Thus, we sought to identify other putative extant fold switchers with only one solved conformation. To do this, we identified two characteristic features of our ∼100 extant fold-switching proteins, incorrect secondary structure predictions and likely independent folding cooperativity, and searched the PDB for other proteins with similar features. Reassuringly, this method identified dozens of other proteins in the literature with indication of a structural change but only one solved conformation in the PDB. Thus, we used it to estimate that 0.5-4% of PDB proteins switch folds. These results demonstrate that extant fold-switching proteins are likely more common than the PDB reflects, which has implications for cell biology, genomics, and human health. Copyright © 2018 the Author(s). Published by PNAS.
Structure-function insights of membrane and soluble proteins revealed by electron crystallography.
Dreaden, Tina M; Devarajan, Bharanidharan; Barry, Bridgette A; Schmidt-Krey, Ingeborg
2013-01-01
Electron crystallography is emerging as an important method in solving protein structures. While it has found extensive applications in the understanding of membrane protein structure and function at a wide range of resolutions, from revealing oligomeric arrangements to atomic models, electron crystallography has also provided invaluable information on the soluble α/β-tubulin which could not be obtained by any other method to date. Examples of critical insights from selected structures of membrane proteins as well as α/β-tubulin are described here, demonstrating the vast potential of electron crystallography that is first beginning to unfold.
Brodie, Nicholas I.; Popov, Konstantin I.; Petrotchenko, Evgeniy V.; Dokholyan, Nikolay V.; Borchers, Christoph H.
2017-01-01
We present an integrated experimental and computational approach for de novo protein structure determination in which short-distance cross-linking data are incorporated into rapid discrete molecular dynamics (DMD) simulations as constraints, reducing the conformational space and achieving the correct protein folding on practical time scales. We tested our approach on myoglobin and FK506 binding protein—models for α helix–rich and β sheet–rich proteins, respectively—and found that the lowest-energy structures obtained were in agreement with the crystal structure, hydrogen-deuterium exchange, surface modification, and long-distance cross-linking validation data. Our approach is readily applicable to other proteins with unknown structures. PMID:28695211
SAIL--stereo-array isotope labeling.
Kainosho, Masatsune; Güntert, Peter
2009-11-01
Optimal stereospecific and regiospecific labeling of proteins with stable isotopes enhances the nuclear magnetic resonance (NMR) method for the determination of the three-dimensional protein structures in solution. Stereo-array isotope labeling (SAIL) offers sharpened lines, spectral simplification without loss of information and the ability to rapidly collect and automatically evaluate the structural restraints required to solve a high-quality solution structure for proteins up to twice as large as before. This review gives an overview of stable isotope labeling methods for NMR spectroscopy with proteins and provides an in-depth treatment of the SAIL technology.
An experimental point of view on hydration/solvation in halophilic proteins
Talon, Romain; Coquelle, Nicolas; Madern, Dominique; Girard, Eric
2014-01-01
Protein-solvent interactions govern the behaviors of proteins isolated from extreme halophiles. In this work, we compared the solvent envelopes of two orthologous tetrameric malate dehydrogenases (MalDHs) from halophilic and non-halophilic bacteria. The crystal structure of the MalDH from the non-halophilic bacterium Chloroflexus aurantiacus (Ca MalDH) solved, de novo, at 1.7 Å resolution exhibits numerous water molecules in its solvation shell. We observed that a large number of these water molecules are arranged in pentagonal polygons in the first hydration shell of Ca MalDH. Some of them are clustered in large networks, which cover non-polar amino acid surface. The crystal structure of MalDH from the extreme halophilic bacterium Salinibacter ruber (Sr) solved at 1.55 Å resolution shows that its surface is strongly enriched in acidic amino acids. The structural comparison of these two models is the first direct observation of the relative impact of acidic surface enrichment on the water structure organization between a halophilic protein and its non-adapted counterpart. The data show that surface acidic amino acids disrupt pentagonal water networks in the hydration shell. These crystallographic observations are discussed with respect to halophilic protein behaviors in solution. PMID:24600446
Serial Millisecond Crystallography of Membrane Proteins.
Jaeger, Kathrin; Dworkowski, Florian; Nogly, Przemyslaw; Milne, Christopher; Wang, Meitian; Standfuss, Joerg
2016-01-01
Serial femtosecond crystallography (SFX) at X-ray free-electron lasers (XFELs) is a powerful method to determine high-resolution structures of pharmaceutically relevant membrane proteins. Recently, the technology has been adapted to carry out serial millisecond crystallography (SMX) at synchrotron sources, where beamtime is more abundant. In an injector-based approach, crystals grown in lipidic cubic phase (LCP) or embedded in viscous medium are delivered directly into the unattenuated beam of a microfocus beamline. Pilot experiments show the application of microjet-based SMX for solving the structure of a membrane protein and compatibility of the method with de novo phasing. Planned synchrotron upgrades, faster detectors and software developments will go hand-in-hand with developments at free-electron lasers to provide a powerful methodology for solving structures from microcrystals at room temperature, ligand screening or crystal optimization for time-resolved studies with minimal or no radiation damage.
Design of structurally distinct proteins using strategies inspired by evolution
Jacobs, T. M.; Williams, B.; Williams, T.; ...
2016-05-06
Natural recombination combines pieces of preexisting proteins to create new tertiary structures and functions. In this paper, we describe a computational protocol, called SEWING, which is inspired by this process and builds new proteins from connected or disconnected pieces of existing structures. Helical proteins designed with SEWING contain structural features absent from other de novo designed proteins and, in some cases, remain folded at more than 100°C. High-resolution structures of the designed proteins CA01 and DA05R1 were solved by x-ray crystallography (2.2 angstrom resolution) and nuclear magnetic resonance, respectively, and there was excellent agreement with the design models. Finally, this method provides a new strategy to rapidly create large numbers of diverse and designable protein scaffolds.
NMR-based automated protein structure determination.
Würz, Julia M; Kazemi, Sina; Schmidt, Elena; Bagaria, Anurag; Güntert, Peter
2017-08-15
NMR spectra analysis for protein structure determination can now in many cases be performed by automated computational methods. This overview of the computational methods for NMR protein structure analysis presents recent automated methods for signal identification in multidimensional NMR spectra, sequence-specific resonance assignment, collection of conformational restraints, and structure calculation, as implemented in the CYANA software package. These algorithms are sufficiently reliable and integrated into one software package to enable the fully automated structure determination of proteins starting from NMR spectra without manual interventions or corrections at intermediate steps, with an accuracy of 1-2 Å backbone RMSD in comparison with manually solved reference structures. Copyright © 2017 Elsevier Inc. All rights reserved.
The Evolving Contribution of Mass Spectrometry to Integrative Structural Biology
NASA Astrophysics Data System (ADS)
Faini, Marco; Stengel, Florian; Aebersold, Ruedi
2016-06-01
Protein complexes are key catalysts and regulators for the majority of cellular processes. Unveiling their assembly and structure is essential to understanding their function and mechanism of action. Although conventional structural techniques such as X-ray crystallography and NMR have solved the structure of important protein complexes, they cannot consistently deal with dynamic and heterogeneous assemblies, limiting their applications to small scale experiments. A novel methodological paradigm, integrative structural biology, aims at overcoming such limitations by combining complementary data sources into a comprehensive structural model. Recent applications have shown that a range of mass spectrometry (MS) techniques are able to generate interaction and spatial restraints (cross-linking MS) information on native complexes or to study the stoichiometry and connectivity of entire assemblies (native MS) rapidly, reliably, and from small amounts of substrate. Although these techniques by themselves do not solve structures, they do provide invaluable structural information and are thus ideally suited to contribute to integrative modeling efforts. The group of Brian Chait has made seminal contributions in the use of mass spectrometric techniques to study protein complexes. In this perspective, we honor the contributions of the Chait group and discuss concepts and milestones of integrative structural biology. We also review recent examples of integration of structural MS techniques with an emphasis on cross-linking MS. We then speculate on future MS applications that would unravel the dynamic nature of protein complexes upon diverse cellular states.
A simple and fast heuristic for protein structure comparison
Pelta, David A; González, Juan R; Moreno Vega, Marcos
2008-01-01
Background: Protein structure comparison is a key problem in bioinformatics. Several methods exist for comparing proteins, one of which is solving the Maximum Contact Map Overlap problem (MAX-CMO). Although this problem may be solved using exact algorithms, researchers require approximate algorithms that obtain good-quality solutions using fewer computational resources than exact ones. Results: We propose a variable neighborhood search metaheuristic for solving MAX-CMO. We analyze this strategy in two aspects: 1) from an optimization point of view, the strategy is tested on two different datasets, obtaining an error of 3.5% (over 2702 pairs) and 1.7% (over 161 pairs) with respect to optimal values, thus leading to highly accurate solutions in a simpler and less expensive way than exact algorithms; 2) in terms of protein structure classification, we conduct experiments on three datasets and show that it is feasible to detect structural similarities at SCOP's family and CATH's architecture levels using normalized overlap values. Some limitations and the role of normalization are outlined for doing classification at SCOP's fold level. Conclusion: We designed, implemented and tested a new tool for solving MAX-CMO, based on a well-known metaheuristic technique. The good balance between solution quality and computational effort makes it a valuable tool. Moreover, to the best of our knowledge, this is the first time the MAX-CMO measure is tested at SCOP's fold and CATH's architecture levels with encouraging results. Software is available for download at . PMID:18366735
Functional Implications of Domain Organization Within Prokaryotic Rhomboid Proteases.
Panigrahi, Rashmi; Lemieux, M Joanne
2015-01-01
Intramembrane proteases are membrane embedded enzymes that cleave transmembrane substrates. This interesting class of enzyme and its water mediated substrate cleavage mechanism occurring within the hydrophobic lipid bilayer has drawn the attention of researchers. Rhomboids are a family of ubiquitous serine intramembrane proteases. Bacterial forms of rhomboid proteases are mainly composed of six transmembrane helices that are preceded by a soluble N-terminal domain. Several crystal structures of the membrane domain of the E. coli rhomboid protease ecGlpG have been solved. Independently, the ecGlpG N-terminal cytoplasmic domain structure was solved using both NMR and protein crystallography. Despite these structures, we still do not know the structure of the full-length protein, nor do we know the functional role of these domains in the cell. This chapter will review the structural and functional roles of the different domains associated with prokaryotic rhomboid proteases. Lastly, we will address questions remaining in the field.
Multiple graph regularized protein domain ranking.
Wang, Jim Jing-Yan; Bensmail, Halima; Gao, Xin
2012-11-19
Protein domain ranking is a fundamental task in structural biology. Most protein domain ranking methods rely on the pairwise comparison of protein domains while neglecting the global manifold structure of the protein domain database. Recently, graph regularized ranking that exploits the global structure of the graph defined by the pairwise similarities has been proposed. However, the existing graph regularized ranking methods are very sensitive to the choice of the graph model and parameters, and this remains a difficult problem for most of the protein domain ranking methods. To tackle this problem, we have developed the Multiple Graph regularized Ranking algorithm, MultiG-Rank. Instead of using a single graph to regularize the ranking scores, MultiG-Rank approximates the intrinsic manifold of protein domain distribution by combining multiple initial graphs for the regularization. Graph weights are learned with ranking scores jointly and automatically, by alternately minimizing an objective function in an iterative algorithm. Experimental results on a subset of the ASTRAL SCOP protein domain database demonstrate that MultiG-Rank achieves a better ranking performance than single graph regularized ranking methods and pairwise similarity based ranking methods. The problem of graph model and parameter selection in graph regularized protein domain ranking can be solved effectively by combining multiple graphs. This aspect of generalization introduces a new frontier in applying multiple graphs to solving protein domain ranking applications.
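As an illustrative aside (not part of the abstract above), the idea of regularizing ranking scores with a combination of several graphs can be sketched as follows. This is a simplified stand-in, not MultiG-Rank itself: the graph weights are fixed by hand rather than learned jointly with the scores, and the scores are obtained with a standard manifold-ranking style iteration. All names, the damping value `alpha`, and the toy graphs are assumptions.

```python
import math
from typing import List

Matrix = List[List[float]]


def combine_graphs(graphs: List[Matrix], weights: List[float]) -> Matrix:
    """Weighted sum of several affinity matrices over the same nodes."""
    n = len(graphs[0])
    return [[sum(w * g[i][j] for g, w in zip(graphs, weights))
             for j in range(n)] for i in range(n)]


def symmetric_normalize(w: Matrix) -> Matrix:
    """Compute D^(-1/2) W D^(-1/2); rows of isolated nodes stay zero."""
    degree = [sum(row) for row in w]
    n = len(w)
    return [[w[i][j] / math.sqrt(degree[i] * degree[j])
             if degree[i] > 0 and degree[j] > 0 else 0.0
             for j in range(n)] for i in range(n)]


def rank_scores(s: Matrix, query: List[float], alpha: float = 0.5,
                iterations: int = 200) -> List[float]:
    """Iterate f <- alpha * S f + (1 - alpha) * y from the query vector y."""
    f = list(query)
    n = len(f)
    for _ in range(iterations):
        f = [alpha * sum(s[i][j] * f[j] for j in range(n))
             + (1 - alpha) * query[i] for i in range(n)]
    return f


# Two hypothetical graphs over four domains; together they form a chain 0-1-2-3.
graph_one = [[0, 1, 0, 0], [1, 0, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0]]
graph_two = [[0, 0, 0, 0], [0, 0, 1, 0], [0, 1, 0, 0], [0, 0, 0, 0]]
combined = combine_graphs([graph_one, graph_two], [0.5, 0.5])
scores = rank_scores(symmetric_normalize(combined), [1.0, 0.0, 0.0, 0.0])
print(scores)  # domain 0 is the query; scores fall off along the chain
```

Neither single graph connects domain 0 to domain 3, but the combination does, which is the intuition behind using several graphs at once. Learning the weights, as the abstract describes, would replace the hand-set `[0.5, 0.5]`.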
Multiple graph regularized protein domain ranking
2012-01-01
Background Protein domain ranking is a fundamental task in structural biology. Most protein domain ranking methods rely on the pairwise comparison of protein domains while neglecting the global manifold structure of the protein domain database. Recently, graph regularized ranking that exploits the global structure of the graph defined by the pairwise similarities has been proposed. However, the existing graph regularized ranking methods are very sensitive to the choice of the graph model and parameters, and this remains a difficult problem for most of the protein domain ranking methods. Results To tackle this problem, we have developed the Multiple Graph regularized Ranking algorithm, MultiG-Rank. Instead of using a single graph to regularize the ranking scores, MultiG-Rank approximates the intrinsic manifold of protein domain distribution by combining multiple initial graphs for the regularization. Graph weights are learned with ranking scores jointly and automatically, by alternately minimizing an objective function in an iterative algorithm. Experimental results on a subset of the ASTRAL SCOP protein domain database demonstrate that MultiG-Rank achieves a better ranking performance than single graph regularized ranking methods and pairwise similarity based ranking methods. Conclusion The problem of graph model and parameter selection in graph regularized protein domain ranking can be solved effectively by combining multiple graphs. This aspect of generalization introduces a new frontier in applying multiple graphs to solving protein domain ranking applications. PMID:23157331
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tirado-Lee, Leidamarie; Lee, Allen; Rees, Douglas C.
2014-10-02
molA (HI1472) from H. influenzae encodes a periplasmic binding protein (PBP) that delivers substrate to the ABC transporter MolB₂C₂ (formerly HI1470/71). The structures of MolA with molybdate and tungstate in the binding pocket were solved to 1.6 and 1.7 Å resolution, respectively. The MolA-binding protein binds molybdate and tungstate, but not other oxyanions such as sulfate and phosphate, making it the first class III molybdate-binding protein structurally solved. The ≈100 μM binding affinity for tungstate and molybdate is significantly lower than observed for the class II ModA molybdate-binding proteins that have nanomolar to low micromolar affinity for molybdate. The presence of two molybdate loci in H. influenzae suggests multiple transport systems for one substrate, with molABC constituting a low-affinity molybdate locus.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ghosh, Raka; Chakrabarti, Chandana, E-mail: chandana.chakrabarti@saha.ac.in
2005-08-01
A thaumatin-like antifungal protein, NP24-I, has been isolated from ripe tomato fruits. It was crystallized by the vapour-diffusion method and data were collected to 2.45 Å. The structure was solved by molecular replacement. NP24 is a 24 kDa (207-amino-acid) antifungal thaumatin-like protein (TLP) found in tomato fruits. An isoform of the protein, NP24-I, is reported to play a possible role in ripening of the fruit in addition to its antifungal properties. The protein has been isolated and purified and crystallized by the hanging-drop vapour-diffusion method. The crystals belong to the tetragonal space group P4₃, with unit-cell parameters a = b = 61.01, c = 62.90 Å and one molecule per asymmetric unit. X-ray diffraction data were processed to a resolution of 2.45 Å and the structure was solved by molecular replacement.
Abendroth, Jan; McCormick, Michael S.; Edwards, Thomas E.; Staker, Bart; Loewen, Roderick; Gifford, Martin; Rifkin, Jeff; Mayer, Chad; Guo, Wenjin; Zhang, Yang; Myler, Peter; Kelley, Angela; Analau, Erwin; Hewitt, Stephen Nakazawa; Napuli, Alberto J.; Kuhn, Peter; Ruth, Ronald D.; Stewart, Lance J.
2010-01-01
Structural genomics discovery projects require ready access to both X-ray and NMR instrumentation which support the collection of experimental data needed to solve large numbers of novel protein structures. The most productive X-ray crystal structure determination laboratories make extensive frequent use of tunable synchrotron X-ray light to solve novel structures by anomalous diffraction methods. This requires that frozen cryo-protected crystals be shipped to large government-run synchrotron facilities for data collection. In an effort to eliminate the need to ship crystals for data collection, we have developed the first laboratory-scale synchrotron light source capable of performing many of the state-of-the-art synchrotron applications in X-ray science. This Compact Light Source is a first-in-class device that uses inverse Compton scattering to generate X-rays of sufficient flux, tunable wavelength and beam size to allow high-resolution X-ray diffraction data collection from protein crystals. We report on benchmarking tests of X-ray diffraction data collection with hen egg white lysozyme, and the successful high-resolution X-ray structure determination of the Glycine cleavage system protein H from Mycobacterium tuberculosis using diffraction data collected with the Compact Light Source X-ray beam. PMID:20364333
Dal Palù, Alessandro; Pontelli, Enrico; He, Jing; Lu, Yonggang
2007-01-01
The paper describes a novel framework, constructed using Constraint Logic Programming (CLP) and parallelism, to determine the association between parts of the primary sequence of a protein and alpha-helices extracted from 3D low-resolution descriptions of large protein complexes. The association is determined by extracting constraints from the 3D information, regarding length, relative position and connectivity of helices, and solving these constraints with the guidance of a secondary structure prediction algorithm. Parallelism is employed to enhance performance on large proteins. The framework provides a fast, inexpensive alternative to determine the exact tertiary structure of unknown proteins.
Protein Structure Prediction by Protein Threading
NASA Astrophysics Data System (ADS)
Xu, Ying; Liu, Zhijie; Cai, Liming; Xu, Dong
The seminal work of Bowie, Lüthy, and Eisenberg (Bowie et al., 1991) on "the inverse protein folding problem" laid the foundation of protein structure prediction by protein threading. By using simple measures for fitness of different amino acid types to local structural environments defined in terms of solvent accessibility and protein secondary structure, the authors derived a simple and yet profoundly novel approach to assessing if a protein sequence fits well with a given protein structural fold. Their follow-up work (Elofsson et al., 1996; Fischer and Eisenberg, 1996; Fischer et al., 1996a,b) and the work by Jones, Taylor, and Thornton (Jones et al., 1992) on protein fold recognition led to the development of a new brand of powerful tools for protein structure prediction, which we now term "protein threading." These computational tools have played a key role in extending the utility of all the experimentally solved structures by X-ray crystallography and nuclear magnetic resonance (NMR), providing structural models and functional predictions for many of the proteins encoded in the hundreds of genomes that have been sequenced up to now.
The first mammalian aldehyde oxidase crystal structure: insights into substrate specificity.
Coelho, Catarina; Mahro, Martin; Trincão, José; Carvalho, Alexandra T P; Ramos, Maria João; Terao, Mineko; Garattini, Enrico; Leimkühler, Silke; Romão, Maria João
2012-11-23
Aldehyde oxidases have pharmacological relevance, and AOX3 is the major drug-metabolizing enzyme in rodents. The crystal structure of mouse AOX3 with kinetics and molecular docking studies provides insights into its enzymatic characteristics. Differences in substrate and inhibitor specificities can be rationalized by comparing the AOX3 and xanthine oxidase structures. The first aldehyde oxidase structure represents a major advance for drug design and mechanistic studies. Aldehyde oxidases (AOXs) are homodimeric proteins belonging to the xanthine oxidase family of molybdenum-containing enzymes. Each 150-kDa monomer contains a FAD redox cofactor, two spectroscopically distinct [2Fe-2S] clusters, and a molybdenum cofactor located within the protein active site. AOXs are characterized by broad range substrate specificity, oxidizing different aldehydes and aromatic N-heterocycles. Despite increasing recognition of its role in the metabolism of drugs and xenobiotics, the physiological function of the protein is still largely unknown. We have crystallized and solved the crystal structure of mouse liver aldehyde oxidase 3 to 2.9 Å. This is the first mammalian AOX whose structure has been solved. The structure provides important insights into the protein active center and further evidence on the catalytic differences characterizing AOX and xanthine oxidoreductase. The mouse liver aldehyde oxidase 3 three-dimensional structure combined with kinetic, mutagenesis data, molecular docking, and molecular dynamics studies make a decisive contribution to understand the molecular basis of its rather broad substrate specificity.
Introduction to bioinformatics.
Can, Tolga
2014-01-01
Bioinformatics is an interdisciplinary field mainly involving molecular biology and genetics, computer science, mathematics, and statistics. Data intensive, large-scale biological problems are addressed from a computational point of view. The most common problems are modeling biological processes at the molecular level and making inferences from collected data. A bioinformatics solution usually involves the following steps: Collect statistics from biological data. Build a computational model. Solve a computational modeling problem. Test and evaluate a computational algorithm. This chapter gives a brief introduction to bioinformatics by first providing an introduction to biological terminology and then discussing some classical bioinformatics problems organized by the types of data sources. Sequence analysis is the analysis of DNA and protein sequences for clues regarding function and includes subproblems such as identification of homologs, multiple sequence alignment, searching sequence patterns, and evolutionary analyses. Protein structures are three-dimensional data and the associated problems are structure prediction (secondary and tertiary), analysis of protein structures for clues regarding function, and structural alignment. Gene expression data is usually represented as matrices and analysis of microarray data mostly involves statistics analysis, classification, and clustering approaches. Biological networks such as gene regulatory networks, metabolic pathways, and protein-protein interaction networks are usually modeled as graphs and graph theoretic approaches are used to solve associated problems such as construction and analysis of large-scale networks.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hellberg, Kristina; Grimsrud, Paul A.; Kruse, Andrew C.
2012-07-11
Fatty acid binding proteins (FABP) have been characterized as facilitating the intracellular solubilization and transport of long-chain fatty acyl carboxylates via noncovalent interactions. More recent work has shown that the adipocyte FABP is also covalently modified in vivo on Cys117 with 4-hydroxy-2-nonenal (4-HNE), a bioactive aldehyde linked to oxidative stress and inflammation. To evaluate 4-HNE binding and modification, the crystal structures of adipocyte FABP covalently and noncovalently bound to 4-HNE have been solved to 1.9 Å and 2.3 Å resolution, respectively. While the 4-HNE in the noncovalently modified protein is coordinated similarly to a carboxylate of a fatty acid, the covalent form shows a novel coordination through a water molecule at the polar end of the lipid. Other defining features between the two structures with 4-HNE and previously solved structures of the protein include a peptide flip between residues Ala36 and Lys37 and the rotation of the side chain of Phe57 into its closed conformation. Representing the first structure of an endogenous target protein covalently modified by 4-HNE, these results define a new class of in vivo ligands for FABPs and extend their physiological substrates to include bioactive aldehydes.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Alonso-García, Noelia; García-Rubio, Inés; Academia General Militar, Carretera de Huesca s/n, 50090 Zaragoza
The structure of the FnIII-3, 4 region of integrin β4 was solved using a hybrid approach that combines crystallographic structures, SAXS, DEER and molecular modelling. The structure helps in understanding how integrin β4 might bind to other hemidesmosomal proteins and mediate signalling. Integrin α6β4 is a major component of hemidesmosomes that mediate the stable anchorage of epithelial cells to the underlying basement membrane. Integrin α6β4 has also been implicated in cell proliferation and migration and in carcinoma progression. The third and fourth fibronectin type III domains (FnIII-3, 4) of integrin β4 mediate binding to the hemidesmosomal proteins BPAG1e and BPAG2, and participate in signalling. Here, it is demonstrated that X-ray crystallography, small-angle X-ray scattering and double electron–electron resonance (DEER) complement each other to solve the structure of the FnIII-3, 4 region. The crystal structures of the individual FnIII-3 and FnIII-4 domains were solved and the relative arrangement of the FnIII domains was elucidated by combining DEER with site-directed spin labelling. Multiple structures of the interdomain linker were modelled by Monte Carlo methods complying with DEER constraints, and the final structures were selected against experimental scattering data. FnIII-3, 4 has a compact and cambered flat structure with an evolutionary conserved surface that is likely to correspond to a protein-interaction site. Finally, this hybrid method is of general application for the study of other macromolecules and complexes.
Learning about protein solubility from bacterial inclusion bodies
Martínez-Alonso, Mónica; González-Montalbán, Nuria; García-Fruitós, Elena; Villaverde, Antonio
2009-01-01
The progressive solving of the conformation of aggregated proteins and the conceptual understanding of the biology of inclusion bodies in recombinant bacteria is providing exciting insights on protein folding and quality. Interestingly, newest data also show an unexpected functional and structural complexity of soluble recombinant protein species and picture the whole bacterial cell factory scenario as more intricate than formerly believed. PMID:19133126
Protein structure estimation from NMR data by matrix completion.
Li, Zhicheng; Li, Yang; Lei, Qiang; Zhao, Qing
2017-09-01
Knowledge of protein structures is very important to understand their corresponding physical and chemical properties. Nuclear Magnetic Resonance (NMR) spectroscopy is one of the main methods to measure protein structure. In this paper, we propose a two-stage approach to calculate the structure of a protein from a highly incomplete distance matrix, where most data are obtained from NMR. We first randomly "guess" a small part of unobservable distances by utilizing the triangle inequality, which is crucial for the second stage. Then we use matrix completion to calculate the protein structure from the obtained incomplete distance matrix. We apply the accelerated proximal gradient algorithm to solve the corresponding optimization problem. Furthermore, the recovery error of our method is analyzed, and its efficiency is demonstrated by several practical examples.
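As an illustrative aside (not part of the abstract above), the first stage it describes, guessing unobserved distances with the help of the triangle inequality, might look like the sketch below. For an unobserved pair (i, j), every third atom k with both distances known gives the bounds |d_ik - d_kj| <= d_ij <= d_ik + d_kj. Sampling uniformly between the tightest bounds is an assumption, as are all names; the abstract does not say how the guess is drawn, and the matrix-completion stage is not shown.

```python
import random
from typing import List, Optional, Tuple

# Partial distance matrix: None marks a distance that was not observed.
PartialMatrix = List[List[Optional[float]]]


def triangle_bounds(d: PartialMatrix, i: int, j: int) -> Optional[Tuple[float, float]]:
    """Tightest (lower, upper) bounds on d[i][j] from common neighbours k."""
    lower, upper = 0.0, float("inf")
    for k in range(len(d)):
        if k == i or k == j:
            continue
        d_ik, d_kj = d[i][k], d[k][j]
        if d_ik is None or d_kj is None:
            continue
        lower = max(lower, abs(d_ik - d_kj))
        upper = min(upper, d_ik + d_kj)
    if upper == float("inf"):
        return None  # no third atom constrains this pair
    return lower, upper


def guess_distance(d: PartialMatrix, i: int, j: int,
                   rng: random.Random) -> Optional[float]:
    """Draw a guess for d[i][j] uniformly between its triangle bounds."""
    bounds = triangle_bounds(d, i, j)
    if bounds is None:
        return None
    lower, upper = bounds
    return rng.uniform(lower, upper)


# Three hypothetical atoms on a line at positions 0, 3 and 5; d[0][2] is missing.
toy = [[0.0, 3.0, None],
       [3.0, 0.0, 2.0],
       [None, 2.0, 0.0]]
print(triangle_bounds(toy, 0, 2))
```

With noisy measurements the lower bound can exceed the upper bound; a fuller treatment would need to detect and handle that case before the completed matrix is passed on.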
Baculovirus-mediated expression of GPCRs in insect cells.
Saarenpää, Tuulia; Jaakola, Veli-Pekka; Goldman, Adrian
2015-01-01
G-protein-coupled receptors (GPCRs) are a large family of seven-transmembrane proteins that influence a considerable number of cellular events. For this reason, they are among the receptor types most studied for their pharmacological and structural properties. Structures of several GPCR types have been solved using almost all expression systems, including Escherichia coli, yeast, mammalian, and insect cells. So far, however, most solved GPCR structures have been obtained using the baculovirus insect cell expression system. This is mainly due to cost-effectiveness, posttranslational modification efficiency, and overall effortless maintenance. The system has evolved so much that variables such as vector type, purification tags, cell line, and growth conditions can be varied and optimized in countless ways to suit the needs of new constructs. Here, we present the array of techniques that enable the rapid and efficient optimization of expression steps for maximal protein quality and quantity, including our emendations.
A series of PDB related databases for everyday needs.
Joosten, Robbie P; te Beek, Tim A H; Krieger, Elmar; Hekkelman, Maarten L; Hooft, Rob W W; Schneider, Reinhard; Sander, Chris; Vriend, Gert
2011-01-01
The Protein Data Bank (PDB) is the world-wide repository of macromolecular structure information. We present a series of databases that run parallel to the PDB. Each database holds one entry, if possible, for each PDB entry. DSSP holds the secondary structure of the proteins. PDBREPORT holds reports on the structure quality and lists errors. HSSP holds a multiple sequence alignment for all proteins. The PDBFINDER holds easy to parse summaries of the PDB file content, augmented with essentials from the other systems. PDB_REDO holds re-refined, and often improved, copies of all structures solved by X-ray. WHY_NOT summarizes why certain files could not be produced. All these systems are updated weekly. The data sets can be used for the analysis of properties of protein structures in areas ranging from structural genomics, to cancer biology and protein design.
Cottee, Matthew A; Muschalik, Nadine; Johnson, Steven; Leveson, Joanna; Raff, Jordan W; Lea, Susan M
2015-01-01
Sas-6 and Ana2/STIL proteins are required for centriole duplication and the homo-oligomerisation properties of Sas-6 help establish the ninefold symmetry of the central cartwheel that initiates centriole assembly. Ana2/STIL proteins are poorly conserved, but they all contain a predicted Central Coiled-Coil Domain (CCCD). Here we show that the Drosophila Ana2 CCCD forms a tetramer, and we solve its structure to 0.8 Å, revealing that it adopts an unusual parallel-coil topology. We also solve the structure of the Drosophila Sas-6 N-terminal domain to 2.9 Å revealing that it forms higher-order oligomers through canonical interactions. Point mutations that perturb Sas-6 or Ana2 homo-oligomerisation in vitro strongly perturb centriole assembly in vivo. Thus, efficient centriole duplication in flies requires the homo-oligomerisation of both Sas-6 and Ana2, and the Ana2 CCCD tetramer structure provides important information on how these proteins might cooperate to form a cartwheel structure. DOI: http://dx.doi.org/10.7554/eLife.07236.001 PMID:26002084
How precise are reported protein coordinate data?
Konagurthu, Arun S; Allison, Lloyd; Abramson, David; Stuckey, Peter J; Lesk, Arthur M
2014-03-01
Atomic coordinates in the Worldwide Protein Data Bank (wwPDB) are generally reported to greater precision than the experimental structure determinations have actually achieved. By using information theory and data compression to study the compressibility of protein atomic coordinates, it is possible to quantify the amount of randomness in the coordinate data and thereby to determine the realistic precision of the reported coordinates. On average, the value of each C(α) coordinate in a set of selected protein structures solved at a variety of resolutions is good to about 0.1 Å.
Hassaïne, Ghérici; Deluz, Cédric; Grasso, Luigino; Wyss, Romain; Hovius, Ruud; Stahlberg, Henning; Tomizaki, Takashi; Desmyter, Aline; Moreau, Christophe; Peclinovska, Lucie; Minniberger, Sonja; Mebarki, Lamia; Li, Xiao-Dan; Vogel, Horst; Nury, Hugues
2017-01-01
There is growing interest in the use of mammalian protein expression systems, and in the use of antibody-derived chaperones, for structural studies. Here, we describe protocols ranging from the production of recombinant membrane proteins in stable inducible cell lines to biophysical characterization of purified membrane proteins in complex with llama antibody domains. These protocols were used to solve the structure of the mouse 5-HT3 serotonin receptor but are of broad applicability for crystallization or cryo-electron microscopy projects.
The protein structure prediction problem could be solved using the current PDB library
Zhang, Yang; Skolnick, Jeffrey
2005-01-01
For single-domain proteins, we examine the completeness of the structures in the current Protein Data Bank (PDB) library for use in full-length model construction of unknown sequences. To address this issue, we employ a comprehensive benchmark set of 1,489 medium-size proteins that cover the PDB at the level of 35% sequence identity and identify templates by structure alignment. With homologous proteins excluded, we can always find similar folds to native with an average rms deviation (RMSD) from native of 2.5 Å with ≈82% alignment coverage. These template structures often contain a significant number of insertions/deletions. The tasser algorithm was applied to build full-length models, where continuous fragments are excised from the top-scoring templates and reassembled under the guide of an optimized force field, which includes consensus restraints taken from the templates and knowledge-based statistical potentials. For almost all targets (except for 2/1,489), the resultant full-length models have an RMSD to native below 6 Å (97% of them below 4 Å). On average, the RMSD of full-length models is 2.25 Å, with aligned regions improved from 2.5 Å to 1.88 Å, comparable with the accuracy of low-resolution experimental structures. Furthermore, starting from state-of-the-art structural alignments, we demonstrate a methodology that can consistently bring template-based alignments closer to native. These results are highly suggestive that the protein-folding problem can in principle be solved based on the current PDB library by developing efficient fold recognition algorithms that can recover such initial alignments. PMID:15653774
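As an illustrative aside (not part of the abstract above), the two figures it reports for each template, RMSD to native and alignment coverage, can be computed as sketched below. The sketch assumes the two coordinate sets are already optimally superposed and paired residue by residue; the superposition step itself is not shown. Function names and the toy coordinates are hypothetical.

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, float, float]


def rmsd(model: Sequence[Point], native: Sequence[Point]) -> float:
    """Root-mean-square deviation between paired, pre-superposed atoms."""
    if len(model) != len(native) or not model:
        raise ValueError("need two non-empty coordinate lists of equal length")
    total = 0.0
    for (mx, my, mz), (nx, ny, nz) in zip(model, native):
        total += (mx - nx) ** 2 + (my - ny) ** 2 + (mz - nz) ** 2
    return math.sqrt(total / len(model))


def alignment_coverage(n_aligned: int, target_length: int) -> float:
    """Fraction of target residues that have an aligned template residue."""
    if target_length <= 0:
        raise ValueError("target_length must be positive")
    return n_aligned / target_length


# Toy example: two atoms, one displaced by 2 Å along z.
toy_model = [(0.0, 0.0, 0.0), (1.0, 0.0, 2.0)]
toy_native = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
print(round(rmsd(toy_model, toy_native), 3))
print(alignment_coverage(82, 100))
```

Because RMSD is computed only over aligned residues, it is usually read together with coverage: a low RMSD over a small aligned fraction says less than the same RMSD over most of the chain.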
Lappala, Anna; Nishima, Wataru; Miner, Jacob; Fenimore, Paul; Fischer, Will; Hraber, Peter; Zhang, Ming; McMahon, Benjamin; Tung, Chang-Shung
2018-05-10
Membrane fusion proteins are responsible for viral entry into host cells—a crucial first step in viral infection. These proteins undergo large conformational changes from pre-fusion to fusion-initiation structures, and, despite differences in viral genomes and disease etiology, many fusion proteins are arranged as trimers. Structural information for both pre-fusion and fusion-initiation states is critical for understanding virus neutralization by the host immune system. In the case of Ebola virus glycoprotein (EBOV GP) and Zika virus envelope protein (ZIKV E), pre-fusion state structures have been identified experimentally, but only partial structures of fusion-initiation states have been described. While the fusion-initiation structure is in an energetically unfavorable state that is difficult to solve experimentally, the existing structural information combined with computational approaches enabled the modeling of fusion-initiation state structures of both proteins. These structural models provide an improved understanding of four different neutralizing antibodies in the prevention of viral host entry.
Geometrical tile design for complex neighborhoods.
Czeizler, Eugen; Kari, Lila
2009-01-01
Recent research has shown that tile systems are one of the most suitable theoretical frameworks for the spatial study and modeling of self-assembly processes, such as the formation of DNA and protein oligomeric structures. A Wang tile is a unit square, with glues on its edges, attaching to other tiles and forming larger and larger structures. Although quite intuitive, the idea of glues placed on the edges of a tile is not always natural for simulating the interactions occurring in some real systems. For example, when considering protein self-assembly, the shape of a protein is the main determinant of its functions and its interactions with other proteins. Our goal is to use geometric tiles, i.e., square tiles with geometrical protrusions on their edges, for simulating tiled paths (zippers) with complex neighborhoods, by ribbons of geometric tiles with simple, local neighborhoods. This paper is a step toward solving the general case of an arbitrary neighborhood, by proposing geometric tile designs that solve the case of a "tall" von Neumann neighborhood, the case of the f-shaped neighborhood, and the case of a 3 x 5 "filled" rectangular neighborhood. The techniques can be combined and generalized to solve the problem in the case of any neighborhood, centered at the tile of reference, and included in a 3 x (2k + 1) rectangle.
Introducing the Levinthal's Protein Folding Paradox and Its Solution
ERIC Educational Resources Information Center
Martínez, Leandro
2014-01-01
The protein folding (Levinthal's) paradox states that it would not be possible in a physically meaningful time for a protein to reach the native (functional) conformation by a random search of the enormously large number of possible structures. This paradox has been solved: it was shown that small biases toward the native conformation result…
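As an illustrative aside (not part of the abstract above), the scale of the random search it mentions can be made concrete with a short calculation. The inputs are common textbook-style assumptions rather than values from the abstract: a chain of 100 residues, 3 conformations per residue, and 10^13 conformations sampled per second. Only the unbiased search is estimated; the effect of biases toward the native state is not modelled.

```python
SECONDS_PER_YEAR = 365.25 * 24 * 3600


def random_search_years(n_residues: int, states_per_residue: int,
                        samples_per_second: float) -> float:
    """Years needed to visit every conformation once at a fixed sampling rate."""
    conformations = states_per_residue ** n_residues  # exact integer
    seconds = conformations / samples_per_second
    return seconds / SECONDS_PER_YEAR


# Assumed inputs: 100 residues, 3 states each, 1e13 samples per second.
years = random_search_years(100, 3, 1e13)
print(f"{years:.2e} years")
```

With these assumptions the exhaustive search takes on the order of 10^27 years, far longer than the age of the universe (roughly 1.4 x 10^10 years), which is the mismatch the paradox points to.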
Cornilescu, Gabriel; Lee, Byeong Ryong; Cornilescu, Claudia C; Wang, Guangshun; Peterkofsky, Alan; Clore, G Marius
2002-11-01
The solution structure of the complex between the cytoplasmic A domain (IIA(Mtl)) of the mannitol transporter II(Mannitol) and the histidine-containing phosphocarrier protein (HPr) of the Escherichia coli phosphotransferase system has been solved by NMR, including the use of conjoined rigid body/torsion angle dynamics, and residual dipolar couplings, coupled with cross-validation, to permit accurate orientation of the two proteins. A convex surface on HPr, formed by helices 1 and 2, interacts with a complementary concave depression on the surface of IIA(Mtl) formed by helix 3, portions of helices 2 and 4, and beta-strands 2 and 3. The majority of intermolecular contacts are hydrophobic, with a small number of electrostatic interactions at the periphery of the interface. The active site histidines, His-15 of HPr and His-65 of IIA(Mtl), are in close spatial proximity, and a pentacoordinate phosphoryl transition state can be readily accommodated with no change in protein-protein orientation and only minimal perturbations of the backbone immediately adjacent to the histidines. Comparison with two previously solved structures of complexes of HPr with partner proteins of the phosphotransferase system, the N-terminal domain of enzyme I (EIN) and enzyme IIA(Glucose) (IIA(Glc)), reveals a number of common features despite the fact that EIN, IIA(Glc), and IIA(Mtl) bear no structural resemblance to one another. Thus, entirely different underlying structural elements can form binding surfaces for HPr that are similar in terms of both shape and residue composition. These structural comparisons illustrate the roles of surface and residue complementarity, redundancy, incremental build-up of specificity and conformational side chain plasticity in the formation of transient specific protein-protein complexes in signal transduction pathways.
NASA Astrophysics Data System (ADS)
Boyko, K. M.; Nikolaeva, A. Yu.; Kachalova, G. S.; Bonchuk, A. N.; Popov, V. O.
2017-11-01
The spatial organization of the genome is controlled by a special class of architectural proteins, including proteins containing BTB domains that are able to dimerize or multimerize. The centrosomal protein 190 is one of such architectural proteins. The purification, crystallization, and preliminary X-ray diffraction study of the BTB domain of the centrosomal protein 190 are reported. The crystallization conditions were found by the vapor-diffusion technique. The crystals diffracted to 1.5 Å resolution and belonged to sp. gr. P3221. The structure was solved by the molecular replacement method. The structure refinement is currently underway.
NASA Astrophysics Data System (ADS)
Chen, Xing-Ru; Wang, Xiao-Ting; Hao, Mei-Qi; Zhou, Yong-Hui; Cui, Wen-Qiang; Xing, Xiao-Xu; Xu, Chang-Geng; Bai, Jing-Wen; Li, Yan-Hua
2017-11-01
The imidazole glycerophosphate dehydratase (IGPD) protein is a therapeutic target for herbicide discovery. It is also regarded as a possible target in Staphylococcus xylosus (S. xylosus) for solving mastitis in the dairy cow. The 3D structure of IGPD protein is essential for discovering novel inhibitors during high-throughput virtual screening. However, to date, the 3D structure of IGPD protein of S. xylosus has not been solved. In this study, a series of computational techniques including homology modeling, Ramachandran Plots, and Verify 3D were performed in order to construct an appropriate 3D model of IGPD protein of S. xylosus. Nine hits were identified from 2500 compounds by docking studies. Then, these 9 compounds were first tested in vitro in S. xylosus biofilm formation using crystal violet staining. One of the potential compounds, baicalin was shown to significantly inhibit S. xylosus biofilm formation. Finally, the baicalin was further evaluated, which showed better inhibition of biofilm formation capability in S. xylosus by scanning electron microscopy. Hence, we have predicted the structure of IGPD protein of S. xylosus using computational techniques. We further discovered the IGPD protein was targeted by baicalin compound which inhibited the biofilm formation in S. xylosus. Our findings here would provide implications for the further development of novel IGPD inhibitors for the treatment of dairy mastitis.
Chen, Xing-Ru; Wang, Xiao-Ting; Hao, Mei-Qi; Zhou, Yong-Hui; Cui, Wen-Qiang; Xing, Xiao-Xu; Xu, Chang-Geng; Bai, Jing-Wen; Li, Yan-Hua
2017-01-01
The imidazole glycerophosphate dehydratase (IGPD) protein is a therapeutic target for herbicide discovery. It is also regarded as a possible target in Staphylococcus xylosus (S. xylosus) for treating mastitis in dairy cows. The 3D structure of the IGPD protein is essential for discovering novel inhibitors by high-throughput virtual screening. However, to date, the 3D structure of the IGPD protein of S. xylosus has not been solved. In this study, a series of computational techniques including homology modeling, Ramachandran plots, and Verify 3D were used to construct an appropriate 3D model of the IGPD protein of S. xylosus. Nine hits were identified from 2,500 compounds by docking studies. These nine compounds were then tested in vitro for their effect on S. xylosus biofilm formation using crystal violet staining. One of the candidate compounds, baicalin, was shown to significantly inhibit S. xylosus biofilm formation. Finally, baicalin was further evaluated by scanning electron microscopy, which also showed inhibition of S. xylosus biofilm formation. Hence, we have predicted the structure of the IGPD protein of S. xylosus using computational techniques. We further found that the IGPD protein is targeted by baicalin, which inhibited biofilm formation in S. xylosus. Our findings have implications for the further development of novel IGPD inhibitors for the treatment of dairy mastitis.
Ikeya, Teppei; Terauchi, Tsutomu; Güntert, Peter; Kainosho, Masatsune
2006-07-01
Recently we have developed the stereo-array isotope labeling (SAIL) technique to overcome the conventional molecular size limitation in NMR protein structure determination by employing complete stereo- and regiospecific patterns of stable isotopes. SAIL sharpens signals and simplifies spectra without the loss of requisite structural information, thus making large classes of proteins newly accessible to detailed solution structure determination. The automated structure calculation program CYANA can efficiently analyze SAIL-NOESY spectra and calculate structures without manual analysis. Nevertheless, the original SAIL method might not be capable of determining the structures of proteins larger than 50 kDa or membrane proteins, for which the spectra are characterized by many broadened and overlapped peaks. Here we have carried out simulations of new SAIL patterns optimized for minimal relaxation and overlap, to evaluate the combined use of SAIL and CYANA for solving the structures of larger proteins and membrane proteins. The modified approach reduces the number of peaks to nearly half of that observed with uniform labeling, while still yielding well-defined structures and is expected to enable NMR structure determinations of these challenging systems.
Apaydin, Mehmet Serkan; Çatay, Bülent; Patrick, Nicholas; Donald, Bruce R
2011-05-01
Nuclear magnetic resonance (NMR) spectroscopy is an important experimental technique that allows one to study protein structure and dynamics in solution. An important bottleneck in NMR protein structure determination is the assignment of NMR peaks to the corresponding nuclei. Structure-based assignment (SBA) aims to solve this problem with the help of a template protein that is homologous to the target, and has applications in the study of structure-activity relationships and of protein-protein and protein-ligand interactions. We formulate SBA as a linear assignment problem with additional nuclear Overhauser effect constraints, which can be solved within the nuclear vector replacement (NVR) framework (Langmead, C., Yan, A., Lilien, R., Wang, L. and Donald, B. (2003) A Polynomial-Time Nuclear Vector Replacement Algorithm for Automated NMR Resonance Assignments. Proc. the 7th Annual Int. Conf. Research in Computational Molecular Biology (RECOMB), Berlin, Germany, April 10-13, pp. 176-187. ACM Press, New York, NY. J. Comp. Bio., (2004), 11, pp. 277-298; Langmead, C. and Donald, B. (2004) An expectation/maximization nuclear vector replacement algorithm for automated NMR resonance assignments. J. Biomol. NMR, 29, 111-138). Our approach uses NVR's scoring function and data types and also gives the option of using CH and NH residual dipolar couplings (RDCs), instead of the NH RDCs that NVR requires. We test our technique on NVR's data set as well as on four new proteins. Our results are comparable to NVR's assignment accuracy on NVR's test set, but higher on novel proteins. Our approach allows partial assignments. It is also complete and can return the optimum as well as near-optimum assignments. Furthermore, it allows us to analyze the information content of each data type and is easily extendable to accept new forms of input data, such as additional RDCs.
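As a rough illustration of the linear assignment formulation mentioned in the abstract above, the following Python sketch assigns peaks to residues by minimizing a total cost. It is only a toy under stated assumptions: the cost matrix is invented, the function name best_assignment is hypothetical, the search is brute force rather than a polynomial-time solver, and neither NVR's scoring function nor the nuclear Overhauser effect constraints are modeled.

```python
from itertools import permutations


def best_assignment(cost):
    """Return (total_cost, assignment) minimizing the sum of cost[peak][residue].

    assignment[peak] is the residue index given to that peak.
    Brute force over all permutations, so suitable for toy sizes only.
    """
    n = len(cost)
    best_total, best_perm = float("inf"), None
    for perm in permutations(range(n)):
        total = sum(cost[peak][perm[peak]] for peak in range(n))
        if total < best_total:
            best_total, best_perm = total, perm
    return best_total, list(best_perm)


# Hypothetical 3x3 cost matrix: rows are peaks, columns are residues.
toy_cost = [
    [4, 1, 3],
    [2, 0, 5],
    [3, 2, 2],
]
total, assignment = best_assignment(toy_cost)
print(total, assignment)
```

For realistic problem sizes a dedicated assignment solver would be needed; the exhaustive search here is chosen only because it makes the objective easy to read.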
Analysis of RNA structure using small-angle X-ray scattering
Cantara, William A.; Olson, Erik D.; Musier-Forsyth, Karin
2016-01-01
In addition to their role in correctly attaching specific amino acids to cognate tRNAs, aminoacyl-tRNA synthetases (aaRS) have been found to possess many alternative functions and often bind to and act on other nucleic acids. In contrast to the well-defined 3D structure of tRNA, the structures of many of the other RNAs recognized by aaRSs have not been solved. Despite advances in the use of X-ray crystallography (XRC), nuclear magnetic resonance (NMR) spectroscopy and cryo-electron microscopy (cryo-EM) for structural characterization of biomolecules, significant challenges to solving RNA structures still exist. Recently, small-angle X-ray scattering (SAXS) has been increasingly employed to characterize the 3D structures of RNAs and RNA-protein complexes. SAXS is capable of providing low-resolution tertiary structure information under physiological conditions and with less intensive sample preparation and data analysis requirements than XRC, NMR and cryo-EM. In this article, we describe best practices involved in the process of RNA and RNA-protein sample preparation, SAXS data collection, data analysis, and structural model building. PMID:27777026
ProTSAV: A protein tertiary structure analysis and validation server.
Singh, Ankita; Kaushik, Rahul; Mishra, Avinash; Shanker, Asheesh; Jayaram, B
2016-01-01
Quality assessment of predicted model structures of proteins is as important as protein tertiary structure prediction itself. A highly efficient quality assessment of predicted model structures directs further research on function. Here we present a new server, ProTSAV, capable of evaluating predicted model structures based on some popular online servers and standalone tools. ProTSAV furnishes the user with a single quality score for an individual protein structure, along with a graphical representation and ranking when multiple protein structures are assessed. The server is validated on ~64,446 protein structures including experimental structures from RCSB and predicted model structures for CASP targets and from public decoy sets. ProTSAV succeeds in predicting the quality of protein structures with a specificity of 100% and a sensitivity of 98% on experimentally solved structures and achieves a specificity of 88% and a sensitivity of 91% on predicted protein structures of CASP11 targets under 2 Å. The server overcomes the limitations of any single server/method and is seen to be robust in helping in quality assessment. ProTSAV is freely available at http://www.scfbio-iitd.res.in/software/proteomics/protsav.jsp.
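For readers unfamiliar with the two metrics quoted in the abstract above, the following Python sketch shows the standard definitions of sensitivity and specificity in terms of confusion-matrix counts. The counts used are invented for illustration and are not taken from the ProTSAV validation set; the function names are likewise hypothetical.

```python
def sensitivity(true_pos, false_neg):
    """Fraction of actual positives that are correctly identified."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Fraction of actual negatives that are correctly identified."""
    return true_neg / (true_neg + false_pos)


# Hypothetical counts, chosen only so the ratios are easy to check by hand.
print(sensitivity(true_pos=91, false_neg=9))
print(specificity(true_neg=88, false_pos=12))
```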
Bamford, Vicki A; Armour, Maria; Mitchell, Sue A; Cartron, Michaël; Andrews, Simon C; Watson, Kimberly A
2008-09-01
YqjH is a cytoplasmic FAD-containing protein from Escherichia coli; based on homology to ViuB of Vibrio cholerae, it potentially acts as a ferri-siderophore reductase. This work describes its overexpression, purification, crystallization and structure solution at 3.0 Å resolution. YqjH shares high sequence similarity with a number of known siderophore-interacting proteins and its structure was solved by molecular replacement using the siderophore-interacting protein from Shewanella putrefaciens as the search model. The YqjH structure resembles those of other members of the NAD(P)H:flavin oxidoreductase superfamily.
Automated structure determination of proteins with the SAIL-FLYA NMR method.
Takeda, Mitsuhiro; Ikeya, Teppei; Güntert, Peter; Kainosho, Masatsune
2007-01-01
The labeling of proteins with stable isotopes enhances the NMR method for the determination of 3D protein structures in solution. Stereo-array isotope labeling (SAIL) provides an optimal stereospecific and regiospecific pattern of stable isotopes that yields sharpened lines, spectral simplification without loss of information, and the ability to collect rapidly and evaluate fully automatically the structural restraints required to solve a high-quality solution structure for proteins up to twice as large as those that can be analyzed using conventional methods. Here, we describe a protocol for the preparation of SAIL proteins by cell-free methods, including the preparation of S30 extract and their automated structure analysis using the FLYA algorithm and the program CYANA. Once efficient cell-free expression of the unlabeled or uniformly labeled target protein has been achieved, the NMR sample preparation of a SAIL protein can be accomplished in 3 d. A fully automated FLYA structure calculation can be completed in 1 d on a powerful computer system.
X-ray laser diffraction for structure determination of the rhodopsin-arrestin complex
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, X. Edward; Gao, Xiang; Barty, Anton; ...
2016-04-12
Serial femtosecond X-ray crystallography (SFX) using an X-ray free electron laser (XFEL) is a recent advancement in structural biology for solving crystal structures of challenging membrane proteins, including G-protein coupled receptors (GPCRs), which often only produce microcrystals. An XFEL delivers highly intense X-ray pulses of femtosecond duration short enough to enable the collection of single diffraction images before significant radiation damage to crystals sets in. Here we report the deposition of the XFEL data and provide further details on crystallization, XFEL data collection and analysis, structure determination, and the validation of the structural model. The rhodopsin-arrestin crystal structure solved with SFX represents the first near-atomic resolution structure of a GPCR-arrestin complex, provides structural insights into understanding of arrestin-mediated GPCR signaling, and demonstrates the great potential of this SFX-XFEL technology for accelerating crystal structure determination of challenging proteins and protein complexes.
Structure of the choline-binding domain of Spr1274 in Streptococcus pneumoniae.
Zhang, Zhenyi; Li, Wenzhe; Frolet, Cecile; Bao, Rui; di Guilmi, Anne Marie; Vernet, Thierry; Chen, Yuxing
2009-08-01
Spr1274 is a putative choline-binding protein that is bound to the cell wall of Streptococcus pneumoniae through noncovalent interactions with the choline moieties of teichoic and lipoteichoic acids. Its function is still unknown. The crystal structure of the choline-binding domain of Spr1274 (residues 44-129) was solved at 2.38 Å resolution with three molecules in the asymmetric unit. It may provide a structural basis for functional analysis of choline-binding proteins.
Assessment of Protein Side-Chain Conformation Prediction Methods in Different Residue Environments
Peterson, Lenna X.; Kang, Xuejiao; Kihara, Daisuke
2016-01-01
Computational prediction of side-chain conformation is an important component of protein structure prediction. Accurate side-chain prediction is crucial for practical applications of protein structure models that need atomic detailed resolution such as protein and ligand design. We evaluated the accuracy of eight side-chain prediction methods in reproducing the side-chain conformations of experimentally solved structures deposited to the Protein Data Bank. Prediction accuracy was evaluated for a total of four different structural environments (buried, surface, interface, and membrane-spanning) in three different protein types (monomeric, multimeric, and membrane). Overall, the highest accuracy was observed for buried residues in monomeric and multimeric proteins. Notably, side-chains at protein interfaces and membrane-spanning regions were better predicted than surface residues even though the methods did not all use multimeric and membrane proteins for training. Thus, we conclude that the current methods are as practically useful for modeling protein docking interfaces and membrane-spanning regions as for modeling monomers. PMID:24619909
Computational methods for constructing protein structure models from 3D electron microscopy maps.
Esquivel-Rodríguez, Juan; Kihara, Daisuke
2013-10-01
Protein structure determination by cryo-electron microscopy (EM) has made significant progress in the past decades. Resolutions of EM maps have been improving as evidenced by recently reported structures that are solved at high resolutions close to 3 Å. Computational methods play a key role in interpreting EM data. Among many computational procedures applied to an EM map to obtain protein structure information, in this article we focus on reviewing computational methods that model protein three-dimensional (3D) structures from a 3D EM density map that is constructed from two-dimensional (2D) maps. The computational methods we discuss range from de novo methods, which identify structural elements in an EM map, to structure fitting methods, where known high resolution structures are fit into a low-resolution EM map. A list of available computational tools is also provided.
Sheffler, Will; Baker, David
2009-01-01
We present a novel method called RosettaHoles for visual and quantitative assessment of underpacking in the protein core. RosettaHoles generates a set of spherical cavity balls that fill the empty volume between atoms in the protein interior. For visualization, the cavity balls are aggregated into contiguous overlapping clusters and small cavities are discarded, leaving an uncluttered representation of the unfilled regions of space in a structure. For quantitative analysis, the cavity ball data are used to estimate the probability of observing a given cavity in a high-resolution crystal structure. RosettaHoles provides excellent discrimination between real and computationally generated structures, is predictive of incorrect regions in models, identifies problematic structures in the Protein Data Bank, and promises to be a useful validation tool for newly solved experimental structures. PMID:19177366
Three-dimensional structure of Erwinia carotovora L-asparaginase
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kislitsyn, Yu. A.; Kravchenko, O. V.; Nikonov, S. V.
2006-10-15
The three-dimensional structure of Erwinia carotovora L-asparaginase, which has antitumor activity and is used for the treatment of acute lymphoblastic leukemia, was solved at 3 Å resolution and refined to Rcryst = 20% and Rfree = 28%. Crystals of recombinant Erwinia carotovora L-asparaginase were grown by the hanging-drop vapor-diffusion method from protein solutions in a HEPES buffer (pH 6.5) and PEG MME 5000 solutions in a cacodylate buffer (pH 6.5) as the precipitant. Three-dimensional X-ray diffraction data were collected up to 3 Å resolution from one crystal at room temperature. The structure was solved by the molecular replacement method using the coordinates of Erwinia chrysanthemi L-asparaginase as the starting model. The coordinates refined with the use of the CNS program package were deposited in the Protein Data Bank (PDB code 1ZCF).
Weerth, R. Sophia; Michalska, Karolina; Bingman, Craig A.; ...
2014-12-18
Proteins belonging to the cupin superfamily have a wide range of catalytic and noncatalytic functions. Cupin proteins commonly have the capacity to bind a metal ion, with the metal frequently determining the function of the protein. We have been investigating the function of homologous cupin proteins that are conserved in more than 40 species of bacteria. To gain insights into the potential function of these proteins we have solved the structure of Plu4264 from Photorhabdus luminescens TTO1 at a resolution of 1.35 Å and identified manganese as the likely natural metal ligand of the protein. Proteins 2015; 83:383–388.
RaptorX server: a resource for template-based protein structure modeling.
Källberg, Morten; Margaryan, Gohar; Wang, Sheng; Ma, Jianzhu; Xu, Jinbo
2014-01-01
Assigning functional properties to a newly discovered protein is a key challenge in modern biology. To this end, computational modeling of the three-dimensional atomic arrangement of the amino acid chain is often crucial in determining the role of the protein in biological processes. We present a community-wide web-based protocol, RaptorX server (http://raptorx.uchicago.edu), for automated protein secondary structure prediction, template-based tertiary structure modeling, and probabilistic alignment sampling. Given a target sequence, RaptorX server is able to detect even remotely related template sequences by means of a novel nonlinear context-specific alignment potential and probabilistic consistency algorithm. Using the protocol presented here it is thus possible to obtain high-quality structural models for many target protein sequences when only distantly related protein domains have experimentally solved structures. At present, RaptorX server can perform secondary and tertiary structure prediction of a 200 amino acid target sequence in approximately 30 min.
Applications of graph theory in protein structure identification
2011-01-01
There is growing interest in the identification of proteins on the proteome-wide scale. Among the different kinds of protein structure identification methods, graph-theoretic methods are particularly effective. Owing to their lower costs, higher effectiveness and many other advantages, they have drawn increasing attention from researchers. Specifically, graph-theoretic methods have been widely used in homology identification, side-chain cluster identification, peptide sequencing and so on. This paper reviews several methods for solving protein structure identification problems using graph theory. We mainly introduce classical methods and mathematical models including homology modeling based on clique finding, identification of side-chain clusters in protein structures based on the graph spectrum, and de novo peptide sequencing via tandem mass spectrometry using the spectrum graph model. In addition, concluding remarks and future priorities for each method are given. PMID:22165974
Computational 3D structures of drug-targeting proteins in the 2009-H1N1 influenza A virus
NASA Astrophysics Data System (ADS)
Du, Qi-Shi; Wang, Shu-Qing; Huang, Ri-Bo; Chou, Kuo-Chen
2010-01-01
The neuraminidase (NA) and M2 proton channel of influenza virus are drug-targeting proteins, based on which several drugs have been developed. However, these once-powerful drugs have encountered drug resistance in H5N1 and H1N1 influenza. To address this problem, computational 3D structures of the NA and M2 proteins of the 2009-H1N1 influenza virus were built using molecular modeling techniques and computational chemistry methods. Based on the models, the structural features of the NA and M2 proteins were analyzed, the docking structures of drug-protein complexes were computed, and the residue mutations were annotated. The results may help to solve the drug-resistance problem and stimulate the design of more effective drugs against the 2009-H1N1 influenza pandemic.
Geometrical Tile Design for Complex Neighborhoods
Czeizler, Eugen; Kari, Lila
2009-01-01
Recent research has shown that tile systems are one of the most suitable theoretical frameworks for the spatial study and modeling of self-assembly processes, such as the formation of DNA and protein oligomeric structures. A Wang tile is a unit square, with glues on its edges, attaching to other tiles and forming larger and larger structures. Although quite intuitive, the idea of glues placed on the edges of a tile is not always natural for simulating the interactions occurring in some real systems. For example, when considering protein self-assembly, the shape of a protein is the main determinant of its functions and its interactions with other proteins. Our goal is to use geometric tiles, i.e., square tiles with geometrical protrusions on their edges, for simulating tiled paths (zippers) with complex neighborhoods, by ribbons of geometric tiles with simple, local neighborhoods. This paper is a step toward solving the general case of an arbitrary neighborhood, by proposing geometric tile designs that solve the case of a “tall” von Neumann neighborhood, the case of the f-shaped neighborhood, and the case of a 3 × 5 “filled” rectangular neighborhood. The techniques can be combined and generalized to solve the problem in the case of any neighborhood, centered at the tile of reference, and included in a 3 × (2k + 1) rectangle. PMID:19956398
Reddy Chichili, Vishnu Priyanka; Kumar, Veerendra; Sivaraman, J.
2016-01-01
Protein-protein interactions are key events controlling several biological processes. We have developed and employed a method to trap transiently interacting protein complexes for structural studies using glycine-rich linkers to fuse interacting partners, one of which is unstructured. Initial steps involve isothermal titration calorimetry to identify the minimum binding region of the unstructured protein in its interaction with its stable binding partner. This is followed by computational analysis to identify the approximate site of the interaction and to design an appropriate linker length. Subsequently, fused constructs are generated and characterized using size exclusion chromatography and dynamic light scattering experiments. The structure of the chimeric protein is then solved by crystallization, and validated both in vitro and in vivo by substituting key interacting residues of the full length, unlinked proteins with alanine. This protocol offers the opportunity to study crucial and currently unattainable transient protein interactions involved in various biological processes. PMID:26985443
Takeda, Mitsuhiro; Chang, Chung-ke; Ikeya, Teppei; Güntert, Peter; Chang, Yuan-hsiang; Hsu, Yen-lan; Huang, Tai-huang; Kainosho, Masatsune
2008-07-18
The C-terminal domain (CTD) of the severe acute respiratory syndrome coronavirus (SARS-CoV) nucleocapsid protein (NP) contains a potential RNA-binding region in its N-terminal portion and also serves as a dimerization domain by forming a homodimer with a molecular mass of 28 kDa. So far, the structure determination of the SARS-CoV NP CTD in solution has been impeded by the poor quality of NMR spectra, especially for aromatic resonances. We have recently developed the stereo-array isotope labeling (SAIL) method to overcome the size problem of NMR structure determination by utilizing a protein exclusively composed of stereo- and regio-specifically isotope-labeled amino acids. Here, we employed the SAIL method to determine the high-quality solution structure of the SARS-CoV NP CTD by NMR. The SAIL protein yielded less crowded and better resolved spectra than uniform (13)C and (15)N labeling, and enabled the homodimeric solution structure of this protein to be determined. The NMR structure is almost identical with the previously solved crystal structure, except for a disordered putative RNA-binding domain at the N-terminus. Studies of the chemical shift perturbations caused by the binding of single-stranded DNA and mutational analyses have identified the disordered region at the N-termini as the prime site for nucleic acid binding. In addition, residues in the beta-sheet region also showed significant perturbations. Mapping of the locations of these residues onto the helical model observed in the crystal revealed that these two regions are parts of the interior lining of the positively charged helical groove, supporting the hypothesis that the helical oligomer may form in solution.
Bamford, Vicki A.; Armour, Maria; Mitchell, Sue A.; Cartron, Michaël; Andrews, Simon C.; Watson, Kimberly A.
2008-01-01
YqjH is a cytoplasmic FAD-containing protein from Escherichia coli; based on homology to ViuB of Vibrio cholerae, it potentially acts as a ferri-siderophore reductase. This work describes its overexpression, purification, crystallization and structure solution at 3.0 Å resolution. YqjH shares high sequence similarity with a number of known siderophore-interacting proteins and its structure was solved by molecular replacement using the siderophore-interacting protein from Shewanella putrefaciens as the search model. The YqjH structure resembles those of other members of the NAD(P)H:flavin oxidoreductase superfamily. PMID:18765906
Cloning, production, and purification of proteins for a medium-scale structural genomics project.
Quevillon-Cheruel, Sophie; Collinet, Bruno; Trésaugues, Lionel; Minard, Philippe; Henckes, Gilles; Aufrère, Robert; Blondeau, Karine; Zhou, Cong-Zhao; Liger, Dominique; Bettache, Nabila; Poupon, Anne; Aboulfath, Ilham; Leulliot, Nicolas; Janin, Joël; van Tilbeurgh, Herman
2007-01-01
The South-Paris Yeast Structural Genomics Pilot Project (http://www.genomics.eu.org) aims at systematically expressing, purifying, and determining the three-dimensional structures of Saccharomyces cerevisiae proteins. We have already cloned 240 yeast open reading frames in the Escherichia coli pET system. Eighty-two percent of the targets can be expressed in E. coli, and 61% yield soluble protein. We have currently purified 58 proteins. Twelve X-ray structures have been solved, six are in progress, and six other proteins gave crystals. In this chapter, we present the general experimental flowchart applied for this project. One of the main difficulties encountered in this pilot project was the low solubility of a great number of target proteins. We have developed parallel strategies to recover these proteins from inclusion bodies, including refolding, coexpression with chaperones, and an in vitro expression system. A limited proteolysis protocol, developed to localize flexible regions in proteins that could hinder crystallization, is also described.
Pandey, Aditya; Shin, Kyungsoo; Patterson, Robin E; Liu, Xiang-Qin; Rainey, Jan K
2016-12-01
Membrane proteins are still heavily under-represented in the protein data bank (PDB), owing to multiple bottlenecks. The typical low abundance of membrane proteins in their natural hosts makes it necessary to overexpress these proteins either in heterologous systems or through in vitro translation/cell-free expression. Heterologous expression of proteins, in turn, leads to multiple obstacles, owing to the unpredictability of compatibility of the target protein for expression in a given host. The highly hydrophobic and (or) amphipathic nature of membrane proteins also leads to challenges in producing a homogeneous, stable, and pure sample for structural studies. Circumventing these hurdles has become possible through the introduction of novel protein production protocols; efficient protein isolation and sample preparation methods; and, improvement in hardware and software for structural characterization. Combined, these advances have made the past 10-15 years very exciting and eventful for the field of membrane protein structural biology, with an exponential growth in the number of solved membrane protein structures. In this review, we focus on both the advances and diversity of protein production and purification methods that have allowed this growth in structural knowledge of membrane proteins through X-ray crystallography, nuclear magnetic resonance (NMR) spectroscopy, and cryo-electron microscopy (cryo-EM).
Pandey, Aditya; Shin, Kyungsoo; Patterson, Robin E.; Liu, Xiang-Qin; Rainey, Jan K.
2017-01-01
Membrane proteins are still heavily underrepresented in the protein data bank (PDB) due to multiple bottlenecks. The typical low abundance of membrane proteins in their natural hosts makes it necessary to overexpress these proteins either in heterologous systems or through in vitro translation/cell-free expression. Heterologous expression of proteins, in turn, leads to multiple obstacles due to the unpredictability of compatibility of the target protein for expression in a given host. The highly hydrophobic and/or amphipathic nature of membrane proteins also leads to challenges in producing a homogeneous, stable, and pure sample for structural studies. Circumventing these hurdles has become possible through introduction of novel protein production protocols; efficient protein isolation and sample preparation methods; and, improvement in hardware and software for structural characterization. Combined, these advances have made the past 10–15 years very exciting and eventful for the field of membrane protein structural biology, with an exponential growth in the number of solved membrane protein structures. In this review, we focus on both the advances and diversity of protein production and purification methods that have allowed this growth in structural knowledge of membrane proteins through X-ray crystallography, nuclear magnetic resonance (NMR) spectroscopy, and cryo-electron microscopy (cryo-EM). PMID:27010607
A fragmentation and reassembly method for ab initio phasing.
Shrestha, Rojan; Zhang, Kam Y J
2015-02-01
Ab initio phasing with de novo models has become a viable approach for structural solution from protein crystallographic diffraction data. This approach takes advantage of the known protein sequence information, predicts de novo models and uses them for structure determination by molecular replacement. However, even the current state-of-the-art de novo modelling method has a limit as to the accuracy of the model predicted, which is sometimes insufficient to be used as a template for successful molecular replacement. A fragment-assembly phasing method has been developed that starts from an ensemble of low-accuracy de novo models, disassembles them into fragments, places them independently in the crystallographic unit cell by molecular replacement and then reassembles them into a whole structure that can provide sufficient phase information to enable complete structure determination by automated model building. Tests on ten protein targets showed that the method could solve structures for eight of these targets, although the predicted de novo models cannot be used as templates for successful molecular replacement since the best model for each target is on average more than 4.0 Å away from the native structure. The method has extended the applicability of the ab initio phasing by de novo models approach. The method can be used to solve structures when the best de novo models are still of low accuracy.
Cura, Vincent; Troffer-Charlier, Nathalie; Lambert, Marie-Annick; Bonnefond, Luc; Cavarelli, Jean
2014-01-01
Protein arginine methyltransferase 7 (PRMT7) is a unique but less characterized member of the family of protein arginine methyltransferases (PRMTs) that plays a role in male germline gene imprinting. PRMT7 is the only known PRMT member that catalyzes the monomethylation but not the dimethylation of the target arginine residues and harbours two catalytic domains in tandem. PRMT7 genes from five different species were cloned and expressed in Escherichia coli and Sf21 insect cells. Four gave soluble proteins from Sf21 cells, of which two were homogeneous and one gave crystals. The mouse PRMT7 structure was solved by the single anomalous dispersion method using a crystal soaked with thimerosal that diffracted to beyond 2.1 Å resolution. The crystal belonged to space group P4(3)2(1)2, with unit-cell parameters a = b = 97.4, c = 168.1 Å and one PRMT7 monomer in the asymmetric unit. The structure of another crystal form belonging to space group I222 was solved by molecular replacement. PMID:24419624
Bent, Andrew F; Mann, Greg; Houssen, Wael E; Mykhaylyk, Vitaliy; Duman, Ramona; Thomas, Louise; Jaspars, Marcel; Wagner, Armin; Naismith, James H
2016-11-01
Determination of protein crystal structures requires that the phases are derived independently of the observed measurement of diffraction intensities. Many techniques have been developed to obtain phases, including heavy-atom substitution, molecular replacement and substitution during protein expression of the amino acid methionine with selenomethionine. Although the use of selenium-containing methionine has transformed the experimental determination of phases it is not always possible, either because the variant protein cannot be produced or does not crystallize. Phasing of structures by measuring the anomalous diffraction from S atoms could in theory be almost universal since almost all proteins contain methionine or cysteine. Indeed, many structures have been solved by the so-called native sulfur single-wavelength anomalous diffraction (S-SAD) phasing method. However, the anomalous effect is weak at the wavelengths where data are normally recorded (between 1 and 2 Å) and this limits the potential of this method to well diffracting crystals. Longer wavelengths increase the strength of the anomalous signal but at the cost of increasing air absorption and scatter, which degrade the precision of the anomalous measurement, consequently hindering phase determination. A new instrument, the long-wavelength beamline I23 at Diamond Light Source, was designed to work at significantly longer wavelengths compared with standard synchrotron beamlines in order to open up the native S-SAD method to projects of increasing complexity. Here, the first novel structure, that of the oxidase domain involved in the production of the natural product patellamide, solved on this beamline is reported using data collected to a resolution of 3.15 Å at a wavelength of 3.1 Å. The oxidase is an example of a protein that does not crystallize as the selenium variant and for which no suitable homology model for molecular replacement was available. 
Initial attempts collecting anomalous diffraction data for native sulfur phasing on a standard macromolecular crystallography beamline using a wavelength of 1.77 Å did not yield a structure. The new beamline thus has the potential to facilitate structure determination by native S-SAD phasing for what would previously have been regarded as very challenging cases with modestly diffracting crystals and low sulfur content.
A general protocol for the generation of Nanobodies for structural biology
Pardon, Els; Laeremans, Toon; Triest, Sarah; Rasmussen, Søren G. F.; Wohlkönig, Alexandre; Ruf, Armin; Muyldermans, Serge; Hol, Wim G. J.; Kobilka, Brian K.; Steyaert, Jan
2015-01-01
There is growing interest in using antibodies as auxiliary proteins to crystallize proteins. Here, we describe a general protocol for the generation of Nanobodies to be used as crystallization chaperones for the structural investigation of diverse conformational states of flexible (membrane) proteins and complexes thereof. Our technology has a competitive advantage over other recombinant crystallization chaperones in that we fully exploit the natural humoral response against native antigens. Accordingly, we provide detailed protocols for immunization with native proteins and for the selection by phage display of in vivo matured Nanobodies that bind conformational epitopes of functional proteins. Three representative examples illustrate that the outlined procedures are robust, making it possible to solve the structures of the most challenging proteins by Nanobody-assisted X-ray crystallography in a time span of 6 to 12 months. PMID:24577359
BALBES: a molecular-replacement pipeline.
Long, Fei; Vagin, Alexei A; Young, Paul; Murshudov, Garib N
2008-01-01
The number of macromolecular structures solved and deposited in the Protein Data Bank (PDB) is higher than 40 000. Using this information in macromolecular crystallography (MX) should in principle increase the efficiency of MX structure solution. This paper describes a molecular-replacement pipeline, BALBES, that makes extensive use of this repository. It uses a reorganized database taken from the PDB with multimeric as well as domain organization. A system manager written in Python controls the workflow of the process. Testing the current version of the pipeline using entries from the PDB has shown that this approach has huge potential and that around 75% of structures can be solved automatically without user intervention.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Aryal, Baikuntha P.; Brugarolas, Pedro; He, Chuan
2012-05-25
Radiolabeled biomolecules are routinely used for clinical diagnostics. ⁹⁹ᵐTc is the most commonly used radioactive tracer in radiopharmaceuticals. ¹⁸⁸Re and ¹⁸⁶Re are also commonly used as radioactive tracers in medicine. However, currently available methods for radiolabeling are lengthy and involve several steps in bioconjugation processes. In this work we present a strategy to engineer proteins that may selectively recognize the perrhenate (ReO₄⁻) ion as a new way to label proteins. We found that a molybdate (MoO₄²⁻)-binding protein (ModA) from Escherichia coli can bind perrhenate with high affinity. Using fluorescence and isothermal titration calorimetry measurements, we determined the dissociation constant of ModA for ReO₄⁻ to be 541 nM and we solved a crystal structure of ModA with a bound ReO₄⁻. On the basis of the structure we created a mutant protein containing a disulfide linkage, which exhibited increased affinity for perrhenate (Kd = 104 nM). High-resolution crystal structures of ModA (1.7 Å) and A11C/R153C mutant (2.0 Å) were solved with bound perrhenate. Both structures show that a perrhenate ion occupies the molybdate binding site using the same amino acid residues that are involved in molybdate binding. The overall structure of the perrhenate-bound ModA is unchanged compared with that of the molybdate-bound form. In the mutant protein, the bound perrhenate is further stabilized by the engineered disulfide bond.
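To put the two reported dissociation constants (541 nM and 104 nM) in perspective, the following is a minimal sketch that assumes a simple 1:1 binding equilibrium with the free perrhenate concentration held fixed. That model and the chosen concentrations are illustrative assumptions, not taken from the study.

```python
# Sketch only: assumes simple 1:1 binding, fraction bound = [L] / (Kd + [L]),
# where [L] is the free ligand concentration. Concentrations are illustrative.

KD_WILD_TYPE_NM = 541.0  # Kd of ModA for perrhenate, as reported in the abstract
KD_MUTANT_NM = 104.0     # Kd of the disulfide mutant, as reported in the abstract


def fraction_bound(ligand_nM, kd_nM):
    """Fraction of protein occupied by ligand at a given free ligand concentration."""
    return ligand_nM / (kd_nM + ligand_nM)


for conc_nM in (50.0, 104.0, 541.0, 5000.0):
    wild_type = fraction_bound(conc_nM, KD_WILD_TYPE_NM)
    mutant = fraction_bound(conc_nM, KD_MUTANT_NM)
    print(f"[ReO4-] = {conc_nM:7.1f} nM  wild type: {wild_type:.2f}  mutant: {mutant:.2f}")
```

Under this assumption, each protein is half-saturated when the free ligand concentration equals its Kd, so the mutant reaches half-occupancy at roughly a fifth of the concentration the wild type needs.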
Protein 3D Structure and Electron Microscopy Map Retrieval Using 3D-SURFER2.0 and EM-SURFER.
Han, Xusi; Wei, Qing; Kihara, Daisuke
2017-12-08
With the rapid growth in the number of solved protein structures stored in the Protein Data Bank (PDB) and the Electron Microscopy Data Bank (EMDB), it is essential to develop tools to perform real-time structure similarity searches against the entire structure database. Since conventional structure alignment methods need to sample different orientations of proteins in the three-dimensional space, they are time consuming and unsuitable for rapid, real-time database searches. To this end, we have developed 3D-SURFER and EM-SURFER, which utilize 3D Zernike descriptors (3DZD) to conduct high-throughput protein structure comparison, visualization, and analysis. Taking an atomic structure or an electron microscopy map of a protein or a protein complex as input, the 3DZD of a query protein is computed and compared with the 3DZD of all other proteins in PDB or EMDB. In addition, local geometrical characteristics of a query protein can be analyzed using VisGrid and LIGSITEcsc in 3D-SURFER. This article describes how to use 3D-SURFER and EM-SURFER to carry out protein surface shape similarity searches, local geometric feature analysis, and interpretation of the search results. © 2017 by John Wiley & Sons, Inc.
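The descriptor-based search described above can be sketched as a nearest-neighbour lookup over fixed-length vectors. This is a minimal illustration, assuming each structure has already been reduced to a rotation-invariant numeric descriptor and that similarity is scored by Euclidean distance. The entry names and three-component vectors are made up, and `rank_by_similarity` is a hypothetical helper, not part of 3D-SURFER or EM-SURFER.

```python
import math

# Sketch only: toy descriptors stand in for real 3D Zernike descriptors,
# which are much longer vectors computed from a surface or density map.


def euclidean_distance(a, b):
    """Euclidean distance between two equal-length descriptor vectors."""
    if len(a) != len(b):
        raise ValueError("descriptors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def rank_by_similarity(query, database, top_k=3):
    """Return the top_k (name, distance) pairs closest to the query descriptor."""
    scored = sorted(
        (euclidean_distance(query, descriptor), name)
        for name, descriptor in database.items()
    )
    return [(name, dist) for dist, name in scored[:top_k]]


# Hypothetical database of precomputed descriptors.
toy_db = {
    "entry_A": [0.90, 0.10, 0.40],
    "entry_B": [0.20, 0.80, 0.50],
    "entry_C": [0.85, 0.15, 0.35],
}

hits = rank_by_similarity([0.88, 0.12, 0.38], toy_db, top_k=2)
for name, dist in hits:
    print(f"{name}: distance {dist:.3f}")
```

Because no orientation sampling is needed when the descriptor is rotation invariant, each comparison is a single vector operation, which is what makes a whole-database scan practical in real time.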
A benchmark testing ground for integrating homology modeling and protein docking.
Bohnuud, Tanggis; Luo, Lingqi; Wodak, Shoshana J; Bonvin, Alexandre M J J; Weng, Zhiping; Vajda, Sandor; Schueler-Furman, Ora; Kozakov, Dima
2017-01-01
Protein docking procedures carry out the task of predicting the structure of a protein-protein complex starting from the known structures of the individual protein components. More often than not, however, the structure of one or both components is not known, but can be derived by homology modeling on the basis of known structures of related proteins deposited in the Protein Data Bank (PDB). Thus, the problem is to develop methods that optimally integrate homology modeling and docking with the goal of predicting the structure of a complex directly from the amino acid sequences of its component proteins. One possibility is to use the best available homology modeling and docking methods. However, the models built for the individual subunits often differ to a significant degree from the bound conformation in the complex, often much more so than the differences observed between free and bound structures of the same protein, and therefore additional conformational adjustments, both at the backbone and side chain levels need to be modeled to achieve an accurate docking prediction. In particular, even homology models of overall good accuracy frequently include localized errors that unfavorably impact docking results. The predicted reliability of the different regions in the model can also serve as a useful input for the docking calculations. Here we present a benchmark dataset that should help to explore and solve combined modeling and docking problems. This dataset comprises a subset of the experimentally solved 'target' complexes from the widely used Docking Benchmark from the Weng Lab (excluding antibody-antigen complexes). This subset is extended to include the structures from the PDB related to those of the individual components of each complex, and hence represent potential templates for investigating and benchmarking integrated homology modeling and docking approaches. 
Template sets can be dynamically customized by specifying ranges in sequence similarity and in PDB release dates, or using other filtering options, such as excluding sets of specific structures from the template list. Multiple sequence alignments, as well as structural alignments of the templates to their corresponding subunits in the target are also provided. The resource is accessible online or can be downloaded at http://cluspro.org/benchmark, and is updated on a weekly basis in synchrony with new PDB releases. Proteins 2016; 85:10-16. © 2016 Wiley Periodicals, Inc.
Zook, James D.; Molugu, Trivikram R.; Jacobsen, Neil E.; Lin, Guangxin; Soll, Jürgen; Cherry, Brian R.; Brown, Michael F.; Fromme, Petra
2013-01-01
Solving high-resolution structures for membrane proteins continues to be a daunting challenge in the structural biology community. In this study we report our high-resolution NMR results for a transmembrane protein, outer envelope protein of molar mass 16 kDa (OEP16), an amino acid transporter from the outer membrane of chloroplasts. Three-dimensional, high-resolution NMR experiments on the 13C, 15N, 2H-triply-labeled protein were used to assign protein backbone resonances and to obtain secondary structure information. The results yield over 95% assignment of N, HN, CO, Cα, and Cβ chemical shifts, which is essential for obtaining a high resolution structure from NMR data. Chemical shift analysis from the assignment data reveals experimental evidence for the first time on the location of the secondary structure elements on a per residue basis. In addition, T1Z and T2 relaxation experiments were performed in order to better understand the protein dynamics. Arginine titration experiments yield an insight into the amino acid residues responsible for protein transporter function. The results provide the necessary basis for high-resolution structural determination of this important plant membrane protein. PMID:24205117
Rocchia, W; Neshich, G
2007-10-05
STING and Java Protein Dossier provide a collection of physical-chemical parameters, describing protein structure, stability, function, and interaction, considered one of the most comprehensive among the available protein databases of similar type. Particular attention in STING is paid to the electrostatic potential. It makes use of DelPhi, a well-known tool that calculates this physical-chemical quantity for biomolecules by solving the Poisson Boltzmann equation. In this paper, we describe a modification to the DelPhi program aimed at integrating it within the STING environment. We also outline how the "amino acid electrostatic potential" and the "surface amino acid electrostatic potential" are calculated (over all Protein Data Bank (PDB) content) and how the corresponding values are made searchable in STING_DB. In addition, we show that the STING and Java Protein Dossier are also capable of providing these particular parameter values for the analysis of protein structures modeled in computers or being experimentally solved, but not yet deposited in the PDB. Furthermore, we compare the calculated electrostatic potential values obtained by using the earlier version of DelPhi and those by STING, for the biologically relevant case of lysozyme-antibody interaction. Finally, we describe the STING capacity to make queries (at both residue and atomic levels) across the whole PDB, by looking at a specific case where the electrostatic potential parameter plays a crucial role in terms of a particular protein function, such as ligand binding. BlueStar STING is available at http://www.cbi.cnptia.embrapa.br.
Bunney, Tom D.; Cole, Ambrose R.; Broncel, Malgorzata; Esposito, Diego; Tate, Edward W.; Katan, Matilda
2014-01-01
Protein AMPylation, the transfer of AMP from ATP to protein targets, has been recognized as a new mechanism of host-cell disruption by some bacterial effectors that typically contain a FIC-domain. Eukaryotic genomes also encode one FIC-domain protein, HYPE, which has remained poorly characterized. Here we describe the structure of human HYPE, solved by X-ray crystallography, representing the first structure of a eukaryotic FIC-domain protein. We demonstrate that HYPE forms stable dimers with structurally and functionally integrated FIC-domains and with TPR-motifs exposed for protein-protein interactions. As HYPE also uniquely possesses a transmembrane helix, dimerization is likely to affect its positioning and function in the membrane vicinity. The low rate of autoAMPylation of the wild-type HYPE could be due to autoinhibition, consistent with the mechanism proposed for a number of putative FIC AMPylators. Our findings also provide a basis to further consider possible alternative cofactors of HYPE and distinct modes of target-recognition. PMID:25435325
Recent advances in racemic protein crystallography.
Yan, Bingjia; Ye, Linzhi; Xu, Weiliang; Liu, Lei
2017-09-15
Solution of the three-dimensional structures of proteins is a critical step in deciphering the molecular mechanisms of their bioactivities. Among the many approaches for obtaining protein crystals, racemic protein crystallography has been developed as a unique method to solve the structures of an increasing number of proteins. By exploiting unnatural protein enantiomers in crystallization and resolution, racemic protein crystallography offers two major advantages: (1) it increases the success rate of protein crystallization, and (2) it obviates the phase problem in X-ray diffraction. The requirement for unnatural protein enantiomers necessitates chemical protein synthesis, which has hitherto been accomplished through solid-phase peptide synthesis and chemical ligation reactions. This review highlights the fundamental ideas of racemic protein crystallography and surveys the achievements in the field over the five years from early 2012 to late 2016. Copyright © 2017. Published by Elsevier Ltd.
Potrzebowski, Wojciech; André, Ingemar
2015-07-01
For highly oriented fibrillar molecules, three-dimensional structures can often be determined from X-ray fiber diffraction data. However, because of limited information content, structure determination and validation can be challenging. We demonstrate that automated structure determination of protein fibers can be achieved by guiding the building of macromolecular models with fiber diffraction data. We illustrate the power of our approach by determining the structures of six bacteriophage viruses de novo using fiber diffraction data alone and together with solid-state NMR data. Furthermore, we demonstrate the feasibility of molecular replacement from monomeric and fibrillar templates by solving the structure of a plant virus using homology modeling and protein-protein docking. The generated models explain the experimental data to the same degree as deposited reference structures but with improved structural quality. We also developed a cross-validation method for model selection. The results highlight the power of fiber diffraction data as structural constraints.
The helical structure of DNA facilitates binding
NASA Astrophysics Data System (ADS)
Berg, Otto G.; Mahmutovic, Anel; Marklund, Emil; Elf, Johan
2016-09-01
The helical structure of DNA imposes constraints on the rate of diffusion-limited protein binding. Here we solve the reaction-diffusion equations for DNA-like geometries and extend with simulations when necessary. We find that the helical structure can make binding to the DNA more than twice as fast compared to a case where DNA would be reactive only along one side. We also find that this rate advantage remains when the contributions from steric constraints and rotational diffusion of the DNA-binding protein are included. Furthermore, we find that the association rate is insensitive to changes in the steric constraints on the DNA in the helix geometry, while it is much more dependent on the steric constraints on the DNA-binding protein. We conclude that the helical structure of DNA facilitates the nonspecific binding of transcription factors and structural DNA-binding proteins in general.
Structure and dynamics of the influenza A M2 channel: a comparison of three structures.
Leonov, Hadas; Arkin, Isaiah T
2009-11-01
The M2 protein is an essential component of the Influenza virus' infectivity cycle. It is a homo-tetrameric bundle forming a pH-gated H(+) channel. The structure of M2 was solved by three different groups, using different techniques, protein sequences and pH environment. For example, solid-state NMR spectroscopy was used on a protein in lipid bilayers, while X-ray crystallography and solution NMR spectroscopy were applied on a protein in detergent micelles. The resulting structures from the above efforts are rather distinct. Herein, we examine the different structures under uniform conditions such as a lipid bilayer and specified protonation state. We employ extensive molecular dynamics simulations, in several protonation states, representing both closed and open forms of the channel. Exploring the properties of each of these structures has shown that the X-ray structure is more stable than the other structures according to various criteria, although its water conductance and water-wire formation do not correlate to the protonation state of the channel.
Vergis, James M.; Purdy, Michael D.; Wiener, Michael C.
2015-01-01
Structural studies on integral membrane proteins are routinely performed on protein–detergent complexes (PDCs) consisting of purified protein solubilized in a particular detergent. Of all the membrane protein crystal structures solved to date, a subset of only four detergents has been used in more than half of these structures. Unfortunately, many membrane proteins are not well behaved in these four detergents and/or fail to yield well-diffracting crystals. Identification of detergents that maintain the solubility and stability of a membrane protein is a critical step and can be a lengthy and “protein-expensive” process. We have developed an assay that characterizes the stability and size of membrane proteins exchanged into a panel of 94 commercially available and chemically diverse detergents. This differential filtration assay (DFA), using a set of filtered microplates, requires sub-milligram quantities of purified protein and small quantities of detergents and other reagents and is performed in its entirety in several hours. PMID:20667442
Gunčar, Gregor; Wang, Ching-I A.; Forwood, Jade K.; Teh, Trazel; Catanzariti, Ann-Maree; Ellis, Jeffrey G.; Dodds, Peter N.; Kobe, Boštjan
2007-01-01
Metal-binding sites are ubiquitous in proteins and can be readily utilized for phasing. It is shown that a protein crystal structure can be solved using single-wavelength anomalous diffraction based on the anomalous signal of a cobalt ion measured on a conventional monochromatic X-ray source. The unique absorption edge of cobalt (1.61 Å) is compatible with the Cu Kα wavelength (1.54 Å) commonly available in macromolecular crystallography laboratories. This approach was applied to the determination of the structure of Melampsora lini avirulence protein AvrL567-A, a protein with a novel fold from the fungal pathogen flax rust that induces plant disease resistance in flax plants. This approach using cobalt ions may be applicable to all cobalt-binding proteins and may be advantageous when synchrotron radiation is not readily available. PMID:17329816
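The compatibility between the cobalt absorption edge and the Cu Kα wavelength is easier to see in energy units. The sketch below converts the two wavelengths quoted in the abstract using the standard relation E = hc/λ. The function name is illustrative, and the conclusion drawn in the comments is the general rule that anomalous scattering is appreciable when the photon energy lies just above an element's absorption edge.

```python
# Sketch only: converts the wavelengths quoted in the abstract to photon energies.

HC_KEV_ANGSTROM = 12.39842  # Planck constant times speed of light, in keV * angstrom


def wavelength_to_energy_kev(wavelength_angstrom):
    """Photon energy in keV for a wavelength given in angstroms."""
    return HC_KEV_ANGSTROM / wavelength_angstrom


cobalt_edge_kev = wavelength_to_energy_kev(1.61)  # cobalt absorption edge from the abstract
cu_k_alpha_kev = wavelength_to_energy_kev(1.54)   # Cu K-alpha wavelength from the abstract

print(f"Co edge:    {cobalt_edge_kev:.2f} keV")
print(f"Cu K-alpha: {cu_k_alpha_kev:.2f} keV")
# Cu K-alpha photons are slightly more energetic than the cobalt edge,
# so a home source sits on the high-energy side of the edge.
print(f"Difference: {cu_k_alpha_kev - cobalt_edge_kev:.2f} keV above the edge")
```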
fRMSDPred: Predicting Local RMSD Between Structural Fragments Using Sequence Information
2007-04-04
machine learning approaches for estimating the RMSD value of a pair of protein fragments. These estimated fragment-level RMSD values can be used to construct the alignment, assess the quality of an alignment, and identify high-quality alignment segments. We present algorithms to solve this fragment-level RMSD prediction problem using a supervised learning framework based on support vector regression and classification that incorporates protein profiles, predicted secondary structure, effective information encoding schemes, and novel second-order pairwise exponential kernel
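For reference, the quantity being predicted can be computed directly when coordinates are available. The following is a minimal sketch of the RMSD between two equal-length fragments, assuming the fragments are given as lists of (x, y, z) positions that have already been superposed. The optimal superposition step (for example, the Kabsch algorithm) is deliberately omitted, and the coordinates are made up.

```python
import math

# Sketch only: plain RMSD over paired atoms, with no superposition step.


def fragment_rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two pre-superposed fragments."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("fragments must be non-empty and of equal length")
    squared = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared / len(coords_a))


# Hypothetical three-residue fragment and a copy translated by (3, 4, 0).
fragment = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
shifted = [(x + 3.0, y + 4.0, z) for x, y, z in fragment]

print(fragment_rmsd(fragment, fragment))
print(fragment_rmsd(fragment, shifted))
```

A sequence-based predictor such as the one described in this record aims to estimate this value when the coordinates of one or both fragments are not known.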
How cryo-electron microscopy and X-ray crystallography complement each other.
Wang, Hong-Wei; Wang, Jia-Wei
2017-01-01
With the ability to resolve structures of macromolecules at atomic resolution, X-ray crystallography has been the most powerful tool in modern structural biology. At the same time, recent technical improvements have triggered a resolution revolution in the single particle cryo-EM method. While the two methods are different in many respects, from sample preparation to structure determination, they both have the power to solve macromolecular structures at atomic resolution. It is important to understand the unique advantages and caveats of the two methods in solving structures and to appreciate the complementary nature of the two methods in structural biology. In this review we provide some examples, and discuss how X-ray crystallography and cryo-EM can be combined in deciphering structures of macromolecules for our full understanding of their biological mechanisms. © 2016 The Protein Society.
Ukleja, Marta; Valpuesta, José María; Dziembowski, Andrzej; Cuellar, Jorge
2016-10-01
Large protein assemblies are usually the effectors of major cellular processes. The intricate cell homeostasis network is divided into numerous interconnected pathways, each controlled by a set of protein machines. One of these master regulators is the CCR4-NOT complex, which ultimately controls protein expression levels. This multisubunit complex assembles around a scaffold platform, which enables a wide variety of well-studied functions from mRNA synthesis to transcript decay, as well as other tasks still being identified. Solving the structure of the entire CCR4-NOT complex will help to define the distribution of its functions. The recently published three-dimensional reconstruction of the complex, in combination with the known crystal structures of some of the components, has begun to address this. Methodological improvements in structural biology, especially in cryoelectron microscopy, encourage further structural and protein-protein interaction studies, which will advance our comprehension of the gene expression machinery. © 2016 WILEY Periodicals, Inc.
Arana-Daniel, Nancy; Gallegos, Alberto A; López-Franco, Carlos; Alanís, Alma Y; Morales, Jacob; López-Franco, Adriana
2016-01-01
With the increasing power of computers, the amount of data that can be processed in small periods of time has grown exponentially, as has the importance of classifying large-scale data efficiently. Support vector machines have shown good results classifying large amounts of high-dimensional data, such as data generated by protein structure prediction, spam recognition, medical diagnosis, optical character recognition and text classification, etc. Most state of the art approaches for large-scale learning use traditional optimization methods, such as quadratic programming or gradient descent, which makes the use of evolutionary algorithms for training support vector machines an area to be explored. The present paper proposes an approach that is simple to implement based on evolutionary algorithms and Kernel-Adatron for solving large-scale classification problems, focusing on protein structure prediction. The functional properties of proteins depend upon their three-dimensional structures. Knowing the structures of proteins is crucial for biology and can lead to improvements in areas such as medicine, agriculture and biofuels.
Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A
2017-04-01
Functional sites define the diversity of protein functions and are the central object of research into the structural and functional organization of proteins. The mechanisms underlying the emergence of protein functional sites and their variability during evolution include duplication, shuffling, insertion and deletion of exons in genes. The study of the correlation between site structure and exon structure serves as the basis for an in-depth understanding of site organization. In this regard, the development of software resources that allow the mutual projection of the exon structure of genes and the primary and tertiary structures of the encoded proteins remains a pressing problem. Previously, we developed the SitEx system, which provides information about protein and gene sequences with mapped exon borders and amino acid positions of protein functional sites. The database included information on proteins with known 3D structure. However, data on orthologs were not available. Therefore, we added the projection of site positions onto the exon structures of orthologs in SitEx 2.0. We implemented a database search using site conservation variability and site discontinuity across the exon structure. Including information on orthologs expands the possibilities of using SitEx to solve problems in the analysis of the structural and functional organization of proteins. Database URL: http://www-bionet.sscc.ru/sitex/ .
Structure of CC chemokine receptor 2 with orthosteric and allosteric antagonists
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zheng, Yi; Qin, Ling; Zacarías, Natalia V. Ortiz
CC chemokine receptor 2 (CCR2) is one of 19 members of the chemokine receptor subfamily of human class A G-protein-coupled receptors. CCR2 is expressed on monocytes, immature dendritic cells, and T-cell subpopulations, and mediates their migration towards endogenous CC chemokine ligands such as CCL2 (ref. 1). CCR2 and its ligands are implicated in numerous inflammatory and neurodegenerative diseases (ref. 2) including atherosclerosis, multiple sclerosis, asthma, neuropathic pain, and diabetic nephropathy, as well as cancer (ref. 3). These disease associations have motivated numerous preclinical studies and clinical trials (ref. 4; see http://www.clinicaltrials.gov) in search of therapies that target the CCR2–chemokine axis. To aid drug discovery efforts (ref. 5), here we solve a structure of CCR2 in a ternary complex with an orthosteric (BMS-681 (ref. 6)) and allosteric (CCR2-RA-[R] (ref. 7)) antagonist. BMS-681 inhibits chemokine binding by occupying the orthosteric pocket of the receptor in a previously unseen binding mode. CCR2-RA-[R] binds in a novel, highly druggable pocket that is the most intracellular allosteric site observed in class A G-protein-coupled receptors so far; this site spatially overlaps the G-protein-binding site in homologous receptors. CCR2-RA-[R] inhibits CCR2 non-competitively by blocking activation-associated conformational changes and formation of the G-protein-binding interface. The conformational signature of the conserved microswitch residues observed in double-antagonist-bound CCR2 resembles the most inactive G-protein-coupled receptor structures solved so far. Like other protein–protein interactions, receptor–chemokine complexes are considered challenging therapeutic targets for small molecules, and the present structure suggests diverse pocket epitopes that can be exploited to overcome obstacles in drug design.
Protein Structure Prediction with Evolutionary Algorithms
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hart, W.E.; Krasnogor, N.; Pelta, D.A.
1999-02-08
Evolutionary algorithms have been successfully applied to a variety of molecular structure prediction problems. In this paper we reconsider the design of genetic algorithms that have been applied to a simple protein structure prediction problem. Our analysis considers the impact of several algorithmic factors for this problem: the conformational representation, the energy formulation and the way in which infeasible conformations are penalized. Further, we empirically evaluated the impact of these factors on a small set of polymer sequences. Our analysis leads to specific recommendations for GAs as well as other heuristic methods for solving PSP on the HP model.
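To make the three factors named above concrete (representation, energy formulation, penalty for infeasible conformations), here is a minimal sketch of a fitness function for the HP model. It assumes a 2D square lattice, an absolute-move encoding (U/D/L/R) and a fixed penalty per self-collision; none of these choices is taken from the abstract, and the names `decode`, `hp_fitness` and `penalty` are hypothetical.

```python
from typing import Dict, List, Tuple

Coord = Tuple[int, int]

# Assumed representation: one absolute move per bond on a 2D square lattice.
MOVES: Dict[str, Coord] = {"U": (0, 1), "D": (0, -1), "L": (-1, 0), "R": (1, 0)}


def decode(moves: str) -> List[Coord]:
    """Turn a move string into lattice coordinates, starting at the origin."""
    path = [(0, 0)]
    for m in moves:
        dx, dy = MOVES[m]
        x, y = path[-1]
        path.append((x + dx, y + dy))
    return path


def hp_fitness(sequence: str, moves: str, penalty: int = 2) -> int:
    """Lower is better: -1 per H-H contact between residues that are not
    chain neighbours, plus an assumed penalty per lattice-site collision."""
    path = decode(moves)
    if len(path) != len(sequence):
        raise ValueError("need exactly len(sequence) - 1 moves")

    first_at: Dict[Coord, int] = {}  # lattice site -> first residue placed there
    collisions = 0
    for i, site in enumerate(path):
        if site in first_at:
            collisions += 1          # infeasible: site already occupied
        else:
            first_at[site] = i

    contacts = 0
    for (x, y), i in first_at.items():
        if sequence[i] != "H":
            continue
        # Look only right and up so each adjacent pair of sites is counted once.
        for dx, dy in ((1, 0), (0, 1)):
            j = first_at.get((x + dx, y + dy))
            if j is not None and sequence[j] == "H" and abs(i - j) > 1:
                contacts += 1

    return -contacts + penalty * collisions
```

A genetic algorithm would minimise this value over move strings; a different penalty scheme (for example rejecting infeasible conformations outright) changes only the last line.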
Buried and accessible surface area control intrinsic protein flexibility.
Marsh, Joseph A
2013-09-09
Proteins experience a wide variety of conformational dynamics that can be crucial for facilitating their diverse functions. How is the intrinsic flexibility required for these motions encoded in their three-dimensional structures? Here, the overall flexibility of a protein is demonstrated to be tightly coupled to the total amount of surface area buried within its fold. A simple proxy for this, the relative solvent-accessible surface area (Arel), therefore shows excellent agreement with independent measures of global protein flexibility derived from various experimental and computational methods. Application of Arel on a large scale demonstrates its utility by revealing unique sequence and structural properties associated with intrinsic flexibility. In particular, flexibility as measured by Arel shows little correspondence with intrinsic disorder, but instead tends to be associated with multiple domains and increased α-helical structure. Furthermore, the apparent flexibility of monomeric proteins is found to be useful for identifying quaternary-structure errors in published crystal structures. There is also a strong tendency for the crystal structures of more flexible proteins to be solved to lower resolutions. Finally, local solvent accessibility is shown to be a primary determinant of local residue flexibility. Overall, this work provides both fundamental mechanistic insight into the origin of protein flexibility and a simple, practical method for predicting flexibility from protein structures. © 2013 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rice, E.A.; Bannon, G.A.; Glenn, K.C.
2008-11-21
The lysine insensitive Corynebacterium glutamicum dihydrodipicolinate synthase enzyme (cDHDPS) was recently successfully introduced into maize plants to enhance the level of lysine in the grain. To better understand lysine insensitivity of the cDHDPS, we expressed, purified, and kinetically characterized the protein, and solved its X-ray crystal structure. The cDHDPS enzyme has a fold and overall structure that is highly similar to other DHDPS proteins. A noteworthy feature of the active site is the evidence that the catalytic lysine residue forms a Schiff base adduct with pyruvate. Analyses of the cDHDPS structure in the vicinity of the putative binding site for S-lysine revealed that the allosteric binding site in the Escherichia coli DHDPS protein does not exist in cDHDPS due to three non-conservative amino acid substitutions, and this is likely why cDHDPS is not feedback inhibited by lysine.
Bayesian Peak Picking for NMR Spectra
Cheng, Yichen; Gao, Xin; Liang, Faming
2013-01-01
Protein structure determination is a very important topic in structural genomics, which helps people to understand a variety of biological functions such as protein-protein interactions, protein–DNA interactions and so on. Nowadays, nuclear magnetic resonance (NMR) has often been used to determine the three-dimensional structures of proteins in vivo. This study aims to automate the peak picking step, the most important and tricky step in NMR structure determination. We propose to model the NMR spectrum by a mixture of bivariate Gaussian densities and use the stochastic approximation Monte Carlo algorithm as the computational tool to solve the problem. Under the Bayesian framework, the peak picking problem is cast as a variable selection problem. The proposed method can automatically distinguish true peaks from false ones without preprocessing the data. To the best of our knowledge, this is the first effort in the literature that tackles the peak picking problem for NMR spectrum data using a Bayesian method. PMID:24184964
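The spectrum model mentioned above, a mixture of bivariate Gaussian densities, can be sketched as follows. This illustrates only the forward model (intensity at a point given a set of candidate peaks), not the stochastic approximation Monte Carlo sampler or the variable-selection step. The `Peak` fields and function names are hypothetical, and the parameterisation by standard deviations and a correlation coefficient is an assumption.

```python
import math
from dataclasses import dataclass
from typing import Sequence


@dataclass
class Peak:
    weight: float      # peak volume (mixture weight); assumed non-negative
    mx: float          # centre along the first spectral axis
    my: float          # centre along the second spectral axis
    sx: float          # standard deviation along the first axis
    sy: float          # standard deviation along the second axis
    rho: float = 0.0   # correlation between the two axes, |rho| < 1


def bivariate_normal_pdf(x: float, y: float, p: Peak) -> float:
    """Density of a bivariate normal distribution at (x, y)."""
    zx = (x - p.mx) / p.sx
    zy = (y - p.my) / p.sy
    one_minus_r2 = 1.0 - p.rho ** 2
    norm = 2.0 * math.pi * p.sx * p.sy * math.sqrt(one_minus_r2)
    expo = -(zx * zx - 2.0 * p.rho * zx * zy + zy * zy) / (2.0 * one_minus_r2)
    return math.exp(expo) / norm


def mixture_intensity(x: float, y: float, peaks: Sequence[Peak]) -> float:
    """Modelled spectrum intensity: weighted sum of the peak densities."""
    return sum(p.weight * bivariate_normal_pdf(x, y, p) for p in peaks)
```

In a variable-selection view, each candidate peak would carry an include/exclude indicator, and only the included peaks would enter `mixture_intensity`.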
Structural basis for activity of highly efficient RNA mimics of green fluorescent protein
Warner, Katherine Deigan; Chen, Michael C.; Song, Wenjiao; Strack, Rita L.; Thorn, Andrea; Jaffrey, Samie R.; Ferré-D’Amaré, Adrian R.
2014-01-01
Green fluorescent protein (GFP) and its derivatives revolutionized the study of proteins. Spinach is a recently reported in vitro evolved RNA mimic of GFP, which as genetically encoded fusions, makes possible live-cell, real-time imaging of biological RNAs, without resorting to large RNA-binding protein-GFP fusions. To elucidate the molecular basis of Spinach fluorescence, we have solved its co-crystal structure bound to its cognate exogenous chromophore, revealing that Spinach activates the small molecule by immobilizing it between a base triple, a G-quadruplex, and an unpaired guanine. Mutational and NMR analyses indicate that the G-quadruplex is essential for Spinach fluorescence, is also present in other fluorogenic RNAs, and may represent a general strategy for RNAs to induce fluorescence of chromophores. The structure has guided the design of a miniaturized 'Baby Spinach', and provides the foundation for structure-driven design and tuning of fluorescent RNAs. PMID:25026079
Structure of the catalytic domain of Plasmodium falciparum ARF GTPase-activating protein (ARFGAP)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cook, William J.; Senkovich, Olga; Chattopadhyay, Debasish
2012-03-26
The crystal structure of the catalytic domain of the ADP ribosylation factor GTPase-activating protein (ARFGAP) from Plasmodium falciparum has been determined and refined to 2.4 Å resolution. Multiwavelength anomalous diffraction (MAD) data were collected utilizing the Zn²⁺ ion bound at the zinc-finger domain and were used to solve the structure. The overall structure of the domain is similar to those of mammalian ARFGAPs. However, several amino-acid residues in the area where GAP interacts with ARF1 differ in P. falciparum ARFGAP. Moreover, a number of residues that form the dimer interface in the crystal structure are unique in P. falciparum ARFGAP.
Thompson, Jared J; Tabatabaei Ghomi, Hamed; Lill, Markus A
2014-12-01
Knowledge-based methods for analyzing protein structures, such as statistical potentials, primarily consider the distances between pairs of bodies (atoms or groups of atoms). Considerations of several bodies simultaneously are generally used to characterize bonded structural elements or those in close contact with each other, but historically do not consider atoms that are not in direct contact with each other. In this report, we introduce an information-theoretic method for detecting and quantifying distance-dependent through-space multibody relationships between the sidechains of three residues. The technique introduced is capable of producing convergent and consistent results when applied to a sufficiently large database of randomly chosen, experimentally solved protein structures. The results of our study can be shown to reproduce established physico-chemical properties of residues as well as more recently discovered properties and interactions. These results offer insight into the numerous roles that residues play in protein structure, as well as relationships between residue function, protein structure, and evolution. The techniques and insights presented in this work should be useful in the future development of novel knowledge-based tools for the evaluation of protein structure. © 2014 Wiley Periodicals, Inc.
Yamamoto, Norifumi
2014-08-21
The conformational conversion of proteins into an aggregation-prone form is a common feature of various neurodegenerative disorders including Alzheimer's, Huntington's, Parkinson's, and prion diseases. In the early stage of prion diseases, secondary structure conversion in prion protein (PrP) causing β-sheet expansion facilitates the formation of a pathogenic isoform with a high content of β-sheets and strong aggregation tendency to form amyloid fibrils. Herein, we propose a straightforward method to extract essential information regarding the secondary structure conversion of proteins from molecular simulations, named secondary structure principal component analysis (SSPCA). The definite existence of a PrP isoform with an increased β-sheet structure was confirmed in a free-energy landscape constructed by mapping protein structural data into a reduced space according to the principal components determined by the SSPCA. We suggest a "spot" of structural ambivalence in PrP (the C-terminal part of helix 2) that lacks a strong intrinsic secondary structure, thus promoting a partial α-helix-to-β-sheet conversion. This result is important to understand how the pathogenic conformational conversion of PrP is initiated in prion diseases. The SSPCA has great potential to solve various challenges in studying highly flexible molecular systems, such as intrinsically disordered proteins, structurally ambivalent peptides, and chameleon sequences.
Challenges in the Development of Functional Assays of Membrane Proteins
Tiefenauer, Louis; Demarche, Sophie
2012-01-01
Lipid bilayers are natural barriers of biological cells and cellular compartments. Membrane proteins integrated in biological membranes enable vital cell functions such as signal transduction and the transport of ions or small molecules. In order to determine the activity of a protein of interest at defined conditions, the membrane protein has to be integrated into artificial lipid bilayers immobilized on a surface. For the fabrication of such biosensors expertise is required in material science, surface and analytical chemistry, molecular biology and biotechnology. Specifically, techniques are needed for structuring surfaces in the micro- and nanometer scale, chemical modification and analysis, lipid bilayer formation, protein expression, purification and solubilization, and most importantly, protein integration into engineered lipid bilayers. Electrochemical and optical methods are suitable to detect membrane activity-related signals. The importance of structural knowledge to understand membrane protein function is obvious. Presently only a few structures of membrane proteins are solved at atomic resolution. Functional assays together with known structures of individual membrane proteins will contribute to a better understanding of vital biological processes occurring at biological membranes. Such assays will be utilized in the discovery of drugs, since membrane proteins are major drug targets.
Modeling disordered protein interactions from biophysical principles
Christoffer, Charles; Terashi, Genki
2017-01-01
Disordered protein-protein interactions (PPIs), those involving a folded protein and an intrinsically disordered protein (IDP), are prevalent in the cell, including important signaling and regulatory pathways. IDPs do not adopt a single dominant structure in isolation but often become ordered upon binding. To aid understanding of the molecular mechanisms of disordered PPIs, it is crucial to obtain the tertiary structure of the PPIs. However, experimental methods have difficulty in solving disordered PPIs and existing protein-protein and protein-peptide docking methods are not able to model them. Here we present a novel computational method, IDP-LZerD, which models the conformation of a disordered PPI by considering the biophysical binding mechanism of an IDP to a structured protein, whereby a local segment of the IDP initiates the interaction and subsequently the remaining IDP regions explore and coalesce around the initial binding site. On a dataset of 22 disordered PPIs with IDPs up to 69 amino acids, successful predictions were made for 21 bound and 18 unbound receptors. The successful modeling provides additional support for biophysical principles. Moreover, the new technique significantly expands the capability of protein structure modeling and provides crucial insights into the molecular mechanisms of disordered PPIs. PMID:28394890
Structure of Lmaj006129AAA, a hypothetical protein from Leishmania major
DOE Office of Scientific and Technical Information (OSTI.GOV)
Arakaki, Tracy; Le Trong, Isolde; Structural Genomics of Pathogenic Protozoa
2006-03-01
The crystal structure of a conserved hypothetical protein from L. major, Pfam sequence family PF04543, structural genomics target ID Lmaj006129AAA, has been determined at a resolution of 1.6 Å. The gene product of structural genomics target Lmaj006129 from Leishmania major codes for a 164-residue protein of unknown function. When SeMet expression of the full-length gene product failed, several truncation variants were created with the aid of Ginzu, a domain-prediction method. 11 truncations were selected for expression, purification and crystallization based upon secondary-structure elements and disorder. The structure of one of these variants, Lmaj006129AAH, was solved by multiple-wavelength anomalous diffraction (MAD) using ELVES, an automatic protein crystal structure-determination system. This model was then successfully used as a molecular-replacement probe for the parent full-length target, Lmaj006129AAA. The final structure of Lmaj006129AAA was refined to an R value of 0.185 (R_free = 0.229) at 1.60 Å resolution. Structure and sequence comparisons based on Lmaj006129AAA suggest that proteins belonging to Pfam sequence families PF04543 and PF01878 may share a common ligand-binding motif.
NASA Astrophysics Data System (ADS)
Salary, Mohammad Mahdi; Mosallaei, Hossein
2015-06-01
Interactions between the plasmons of noble metal nanoparticles and non-absorbing biomolecules form the basis of plasmonic sensors, which have received much attention. Studying these interactions can help to exploit the full potential of plasmonic sensors in quantification and analysis of biomolecules. Here, a quasi-static continuum model is adopted for this purpose. We present a boundary-element method for computing the optical response of plasmonic particles to molecular binding events by solving the Poisson equation. The model represents biomolecules with their molecular surfaces, thus accurately accounting for the influence of exact binding conformations as well as structural differences between different proteins on the response of plasmonic nanoparticles. The linear systems arising in the method are solved iteratively with the Krylov generalized minimum residual algorithm, and acceleration is achieved by applying the precorrected fast Fourier transform technique. We apply the developed method to investigate interactions of biotinylated gold nanoparticles (nanosphere and nanorod) with four different types of biotin-binding proteins. The interactions are studied at both the ensemble and single-molecule level. Computational results demonstrate the ability of the presented model to analyze realistic nanoparticle-biomolecule configurations. The method can support a comprehensive study of a wide variety of applications, including protein structures, monitoring structural and conformational transitions, and quantification of protein concentrations. In addition, it is suitable for design and optimization of nano-plasmonic sensors.
PROCOS: computational analysis of protein-protein complexes.
Fink, Florian; Hochrein, Jochen; Wolowski, Vincent; Merkl, Rainer; Gronwald, Wolfram
2011-09-01
One of the main challenges in protein-protein docking is a meaningful evaluation of the many putative solutions. Here we present a program (PROCOS) that calculates a probability-like measure to be native for a given complex. In contrast to scores often used for analyzing complex structures, the calculated probabilities offer the advantage of providing a fixed range of expected values. This will allow, in principle, the comparison of models corresponding to different targets that were solved with the same algorithm. Judgments are based on distributions of properties derived from a large database of native and false complexes. For complex analysis PROCOS uses these property distributions of native and false complexes together with a support vector machine (SVM). PROCOS was compared to the established scoring schemes of ZRANK and DFIRE. Employing a set of experimentally solved native complexes, high probability values above 50% were obtained for 90% of these structures. Next, the performance of PROCOS was tested on the 40 binary targets of the Dockground decoy set, on 14 targets of the RosettaDock decoy set and on 9 targets that participated in the CAPRI scoring evaluation. Again the advantage of using a probability-based scoring system becomes apparent and a reasonable number of near native complexes was found within the top ranked complexes. In conclusion, a novel fully automated method is presented that allows the reliable evaluation of protein-protein complexes. Copyright © 2011 Wiley Periodicals, Inc.
Leite, Wellington C; Galvão, Carolina W; Saab, Sérgio C; Iulek, Jorge; Etto, Rafael M; Steffens, Maria B R; Chitteni-Pattu, Sindhu; Stanage, Tyler; Keck, James L; Cox, Michael M
2016-01-01
The bacterial RecA protein plays a role in the complex system of DNA damage repair. Here, we report the functional and structural characterization of the Herbaspirillum seropedicae RecA protein (HsRecA). HsRecA protein is more efficient at displacing SSB protein from ssDNA than Escherichia coli RecA protein. HsRecA also promotes DNA strand exchange more efficiently. The three dimensional structure of HsRecA-ADP/ATP complex has been solved to 1.7 Å resolution. HsRecA protein contains a small N-terminal domain, a central core ATPase domain and a large C-terminal domain, that are similar to homologous bacterial RecA proteins. Comparative structural analysis showed that the N-terminal polymerization motif of archaeal and eukaryotic RecA family proteins are also present in bacterial RecAs. Reconstruction of electrostatic potential from the hexameric structure of HsRecA-ADP/ATP revealed a high positive charge along the inner side, where ssDNA is bound inside the filament. The properties of this surface may explain the greater capacity of HsRecA protein to bind ssDNA, forming a contiguous nucleoprotein filament, displace SSB and promote DNA exchange relative to EcRecA. Our functional and structural analyses provide insight into the molecular mechanisms of polymerization of bacterial RecA as a helical nucleoprotein filament.
Bayesian peak picking for NMR spectra.
Cheng, Yichen; Gao, Xin; Liang, Faming
2014-02-01
Protein structure determination is a very important topic in structural genomics, which helps people to understand a variety of biological functions such as protein-protein interactions, protein-DNA interactions and so on. Nowadays, nuclear magnetic resonance (NMR) has often been used to determine the three-dimensional structures of proteins in vivo. This study aims to automate the peak picking step, the most important and tricky step in NMR structure determination. We propose to model the NMR spectrum by a mixture of bivariate Gaussian densities and use the stochastic approximation Monte Carlo algorithm as the computational tool to solve the problem. Under the Bayesian framework, the peak picking problem is cast as a variable selection problem. The proposed method can automatically distinguish true peaks from false ones without preprocessing the data. To the best of our knowledge, this is the first effort in the literature that tackles the peak picking problem for NMR spectrum data using a Bayesian method. Copyright © 2013. Production and hosting by Elsevier Ltd.
Research Team Engineers a Better Plastic-Degrading Enzyme | News | NREL
polyethylene terephthalate, or PET. While working to solve the crystal structure of PETase-a recently determine its structure to aid in protein engineering, but we ended up going a step further and accidentally discovery that PETase can also degrade polyethylene furandicarboxylate, or PEF, a bio-based substitute for
Structural analysis of a set of proteins resulting from a bacterial genomics project.
Badger, J; Sauder, J M; Adams, J M; Antonysamy, S; Bain, K; Bergseid, M G; Buchanan, S G; Buchanan, M D; Batiyenko, Y; Christopher, J A; Emtage, S; Eroshkina, A; Feil, I; Furlong, E B; Gajiwala, K S; Gao, X; He, D; Hendle, J; Huber, A; Hoda, K; Kearins, P; Kissinger, C; Laubert, B; Lewis, H A; Lin, J; Loomis, K; Lorimer, D; Louie, G; Maletic, M; Marsh, C D; Miller, I; Molinari, J; Muller-Dieckmann, H J; Newman, J M; Noland, B W; Pagarigan, B; Park, F; Peat, T S; Post, K W; Radojicic, S; Ramos, A; Romero, R; Rutter, M E; Sanderson, W E; Schwinn, K D; Tresser, J; Winhoven, J; Wright, T A; Wu, L; Xu, J; Harris, T J R
2005-09-01
The targets of the Structural GenomiX (SGX) bacterial genomics project were proteins conserved in multiple prokaryotic organisms with no obvious sequence homolog in the Protein Data Bank of known structures. The outcome of this work was 80 structures, covering 60 unique sequences and 49 different genes. Experimental phase determination from proteins incorporating Se-Met was carried out for 45 structures with most of the remainder solved by molecular replacement using members of the experimentally phased set as search models. An automated tool was developed to deposit these structures in the Protein Data Bank, along with the associated X-ray diffraction data (including refined experimental phases) and experimentally confirmed sequences. BLAST comparisons of the SGX structures with structures that had appeared in the Protein Data Bank over the intervening 3.5 years since the SGX target list had been compiled identified homologs for 49 of the 60 unique sequences represented by the SGX structures. This result indicates that, for bacterial structures that are relatively easy to express, purify, and crystallize, the structural coverage of gene space is proceeding rapidly. More distant sequence-structure relationships between the SGX and PDB structures were investigated using PDB-BLAST and Combinatorial Extension (CE). Only one structure, SufD, has a truly unique topology compared to all folds in the PDB. Copyright 2005 Wiley-Liss, Inc.
MacDonald, James T.; Kabasakal, Burak V.; Godding, David; Kraatz, Sebastian; Henderson, Louie; Barber, James; Freemont, Paul S.; Murray, James W.
2016-01-01
The ability to design and construct structures with atomic level precision is one of the key goals of nanotechnology. Proteins offer an attractive target for atomic design because they can be synthesized chemically or biologically and can self-assemble. However, the generalized protein folding and design problem is unsolved. One approach to simplifying the problem is to use a repetitive protein as a scaffold. Repeat proteins are intrinsically modular, and their folding and structures are better understood than large globular domains. Here, we have developed a class of synthetic repeat proteins based on the pentapeptide repeat family of beta-solenoid proteins. We have constructed length variants of the basic scaffold and computationally designed de novo loops projecting from the scaffold core. The experimentally solved 3.56-Å resolution crystal structure of one designed loop matches closely the designed hairpin structure, showing the computational design of a backbone extension onto a synthetic protein core without the use of backbone fragments from known structures. Two other loop designs were not clearly resolved in the crystal structures, and one loop appeared to be in an incorrect conformation. We have also shown that the repeat unit can accommodate whole-domain insertions by inserting a domain into one of the designed loops. PMID:27573845
Jahandideh, Samad; Srinivasasainagendra, Vinodh; Zhi, Degui
2012-11-07
RNA-protein interaction plays an important role in various cellular processes, such as protein synthesis, gene regulation, post-transcriptional gene regulation, alternative splicing, and infections by RNA viruses. In this study, using Gene Ontology Annotated (GOA) and Structural Classification of Proteins (SCOP) databases an automatic procedure was designed to capture structurally solved RNA-binding protein domains in different subclasses. Subsequently, we applied tuned multi-class SVM (TMCSVM), Random Forest (RF), and multi-class ℓ1/ℓq-regularized logistic regression (MCRLR) for analysis and classifying RNA-binding protein domains based on a comprehensive set of sequence and structural features. In this study, we compared prediction accuracy of three different state-of-the-art predictor methods. From our results, TMCSVM outperforms the other methods and suggests the potential of TMCSVM as a useful tool for facilitating the multi-class prediction of RNA-binding protein domains. On the other hand, MCRLR by elucidating importance of features for their contribution in predictive accuracy of RNA-binding protein domains subclasses, helps us to provide some biological insights into the roles of sequences and structures in protein-RNA interactions.
Functional Insights from Structural Genomics
DOE Office of Scientific and Technical Information (OSTI.GOV)
Forouhar, F.; Kuzin, A.; Seetharaman, J.
2007-01-01
Structural genomics efforts have produced structural information, either directly or by modeling, for thousands of proteins over the past few years. While many of these proteins have known functions, a large percentage of them have not been characterized at the functional level. The structural information has provided valuable functional insights on some of these proteins, through careful structural analyses, serendipity, and structure-guided functional screening. Some of the success stories based on structures solved at the Northeast Structural Genomics Consortium (NESG) are reported here. These include a novel methyl salicylate esterase with important role in plant innate immunity, a novel RNA methyltransferase (H. influenzae yggJ (HI0303)), a novel spermidine/spermine N-acetyltransferase (B. subtilis PaiA), a novel methyltransferase or AdoMet binding protein (A. fulgidus AF_0241), an ATP:cob(I)alamin adenosyltransferase (B. subtilis YvqK), a novel carboxysome pore (E. coli EutN), a proline racemase homolog with a disrupted active site (B. melitensis BME11586), an FMN-dependent enzyme (S. pneumoniae SP_1951), and a 12-stranded β-barrel with a novel fold (V. parahaemolyticus VPA1032).
Khvostichenko, Daria S.; Schieferstein, Jeremy M.; Pawate, Ashtamurthy S.; ...
2014-08-21
Crystallization from lipidic mesophase matrices is a promising route to diffraction-quality crystals and structures of membrane proteins. The microfluidic approach reported here eliminates two bottlenecks of the standard mesophase-based crystallization protocols: (i) manual preparation of viscous mesophases and (ii) manual harvesting of often small and fragile protein crystals. In the approach reported here, protein-loaded mesophases are formulated in an X-ray transparent microfluidic chip using only 60 nL of the protein solution per crystallization trial. The X-ray transparency of the chip enables diffraction data collection from multiple crystals residing in microfluidic wells, eliminating the normally required manual harvesting and mounting of individual crystals. In addition, we validated our approach by on-chip crystallization of photosynthetic reaction center, a membrane protein from Rhodobacter sphaeroides, followed by solving its structure to a resolution of 2.5 Å using X-ray diffraction data collected on-chip under ambient conditions. A moderate conformational change in hydrophilic chains of the protein was observed when comparing the on-chip, room temperature structure with known structures for which data were acquired under cryogenic conditions.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Khvostichenko, Daria S.; Schieferstein, Jeremy M.; Pawate, Ashtamurthy S.
2014-10-01
Crystallization from lipidic mesophase matrices is a promising route to diffraction-quality crystals and structures of membrane proteins. The microfluidic approach reported here eliminates two bottlenecks of the standard mesophase-based crystallization protocols: (i) manual preparation of viscous mesophases and (ii) manual harvesting of often small and fragile protein crystals. In the approach reported here, protein-loaded mesophases are formulated in an X-ray transparent microfluidic chip using only 60 nL of the protein solution per crystallization trial. The X-ray transparency of the chip enables diffraction data collection from multiple crystals residing in microfluidic wells, eliminating the normally required manual harvesting and mounting of individual crystals. We validated our approach by on-chip crystallization of photosynthetic reaction center, a membrane protein from Rhodobacter sphaeroides, followed by solving its structure to a resolution of 2.5 Å using X-ray diffraction data collected on-chip under ambient conditions. A moderate conformational change in hydrophilic chains of the protein was observed when comparing the on-chip, room temperature structure with known structures for which data were acquired under cryogenic conditions.
ERIC Educational Resources Information Center
Hernandez-Cortes, Patricia
2012-01-01
Vitellogenin (Vtg) is a lipid transfer protein that carries yolk to the ovary. The vitellogenin receptor (VtgR) mediates the uptake of Vtg into the oocyte of oviparous animals; its structure includes eight ligand-binding repeats (LBR). The binding site of VtgR and Vtg and the location of the interaction within the molecules are at these LBR.…
Cao, Han; Ng, Marcus C K; Jusoh, Siti Azma; Tai, Hio Kuan; Siu, Shirley W I
2017-09-01
α-Helical transmembrane proteins are the most important drug targets in rational drug development. However, solving the experimental structures of these proteins remains difficult, therefore computational methods to accurately and efficiently predict the structures are in great demand. We present an improved structure prediction method TMDIM based on Park et al. (Proteins 57:577-585, 2004) for predicting bitopic transmembrane protein dimers. Three major algorithmic improvements are introduction of the packing type classification, the multiple-condition decoy filtering, and the cluster-based candidate selection. In a test of predicting nine known bitopic dimers, approximately 78% of our predictions achieved a successful fit (RMSD <2.0 Å) and 78% of the cases are better predicted than the two other methods compared. Our method provides an alternative for modeling TM bitopic dimers of unknown structures for further computational studies. TMDIM is freely available on the web at https://cbbio.cis.umac.mo/TMDIM. Website is implemented in PHP, MySQL and Apache, with all major browsers supported.
TMDIM: an improved algorithm for the structure prediction of transmembrane domains of bitopic dimers
NASA Astrophysics Data System (ADS)
Cao, Han; Ng, Marcus C. K.; Jusoh, Siti Azma; Tai, Hio Kuan; Siu, Shirley W. I.
2017-09-01
α-Helical transmembrane proteins are the most important drug targets in rational drug development. However, solving the experimental structures of these proteins remains difficult, therefore computational methods to accurately and efficiently predict the structures are in great demand. We present an improved structure prediction method TMDIM based on Park et al. (Proteins 57:577-585, 2004) for predicting bitopic transmembrane protein dimers. Three major algorithmic improvements are introduction of the packing type classification, the multiple-condition decoy filtering, and the cluster-based candidate selection. In a test of predicting nine known bitopic dimers, approximately 78% of our predictions achieved a successful fit (RMSD <2.0 Å) and 78% of the cases are better predicted than the two other methods compared. Our method provides an alternative for modeling TM bitopic dimers of unknown structures for further computational studies. TMDIM is freely available on the web at https://cbbio.cis.umac.mo/TMDIM. Website is implemented in PHP, MySQL and Apache, with all major browsers supported.
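The success criterion quoted above (RMSD < 2.0 Å) can be illustrated with a short sketch. It assumes that the predicted and reference coordinates are already superimposed and listed in the same atom order; the abstract does not say which atoms are compared or how superposition is done, so those steps are left out. The function names are hypothetical.

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, float, float]


def rmsd(pred: Sequence[Point], ref: Sequence[Point]) -> float:
    """Root-mean-square deviation between two equally long, pre-aligned
    coordinate lists (same atom order, same frame of reference)."""
    if len(pred) != len(ref) or not pred:
        raise ValueError("coordinate lists must be non-empty and equally long")
    squared = sum(
        (a - b) ** 2
        for p, r in zip(pred, ref)
        for a, b in zip(p, r)
    )
    return math.sqrt(squared / len(pred))


def is_successful_fit(pred: Sequence[Point], ref: Sequence[Point],
                      cutoff: float = 2.0) -> bool:
    """Apply the RMSD < cutoff (in Å) criterion quoted in the abstract."""
    return rmsd(pred, ref) < cutoff
```

With nine targets, seven predictions passing this check would give roughly the 78% success rate reported.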
Parmodel: a web server for automated comparative modeling of proteins.
Uchôa, Hugo Brandão; Jorge, Guilherme Eberhart; Freitas Da Silveira, Nelson José; Camera, João Carlos; Canduri, Fernanda; De Azevedo, Walter Filgueira
2004-12-24
Parmodel is a web server for automated comparative modeling and evaluation of protein structures. The aim of this tool is to help inexperienced users to perform modeling, assessment, visualization, and optimization of protein models, as well as to help crystallographers evaluate structures solved experimentally. It is subdivided into four modules: Parmodel Modeling, Parmodel Assessment, Parmodel Visualization, and Parmodel Optimization. The main module is Parmodel Modeling, which allows several models to be built for the same protein in reduced time by distributing the modeling processes on a Beowulf cluster. Parmodel automates and integrates the main software packages used in comparative modeling, such as MODELLER, Whatcheck, Procheck, Raster3D, Molscript, and Gromacs. This web server is freely accessible at .
Kihara, Daisuke; Sael, Lee; Chikhi, Rayan; Esquivel-Rodriguez, Juan
2011-09-01
The tertiary structures of proteins have been solved at an increasing pace in recent years. To capitalize on the enormous effort invested in accumulating the structure data, efficient and effective computational methods need to be developed for comparing, searching, and investigating interactions of protein structures. We introduce the 3D Zernike descriptor (3DZD), an emerging technique to describe molecular surfaces. The 3DZD is a series expansion of a mathematical three-dimensional function, and thus a tertiary structure is represented compactly by a vector of coefficients of terms in the series. A strong advantage of the 3DZD is that it is invariant to rotation of the target object to be represented. These two characteristics of the 3DZD allow rapid comparison of surface shapes, which is sufficient for real-time structure database screening. In this article, we review various applications of the 3DZD, which have been recently proposed.
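Because each structure is reduced to a fixed-length, rotation-invariant coefficient vector, database screening comes down to comparing vectors. The sketch below illustrates that idea only: it assumes the descriptors have already been computed elsewhere, uses Euclidean distance as one plausible dissimilarity measure (an assumption, not stated in the abstract), and the function names are hypothetical.

```python
import math


def descriptor_distance(desc_a, desc_b):
    """Euclidean distance between two descriptor vectors of equal length."""
    if len(desc_a) != len(desc_b):
        raise ValueError("descriptors must have the same number of coefficients")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(desc_a, desc_b)))


def rank_by_similarity(query, database):
    """Return database keys ordered from most to least similar to the query.

    `database` maps a structure name to its precomputed descriptor vector.
    """
    return sorted(database, key=lambda name: descriptor_distance(query, database[name]))
```

No alignment or rotation search is needed at query time, which is what makes this kind of comparison fast enough for screening large collections.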
A hetero-micro-seeding strategy for readily crystallizing closely related protein variants.
Islam, Mohammad M; Kuroda, Yutaka
2017-11-04
Protein crystallization remains difficult to rationalize and screening for optimal crystallization conditions is a tedious and time consuming procedure. Here, we report a hetero-micro-seeding strategy for producing high resolution crystals of closely related protein variants, where micro crystals from a readily crystallized variant are used as seeds to develop crystals of other variants less amenable to crystallization. We applied this strategy to Bovine Pancreatic Trypsin Inhibitor (BPTI) variants, which would not crystallize using standard crystallization practice. Out of six variants in our analysis, only one called BPTI-[5,55]A14G formed well behaving crystals; and the remaining five (A14GA38G, A14GA38V, A14GA38L, A14GA38I, and A14GA38K) could be crystallized only using micro-seeds from the BPTI-[5,55]A14G crystal. All hetero-seeded crystals diffracted at high resolution with minimum mosaicity, retaining the same space group and cell dimension. Moreover, hetero-micro-seeding did not introduce any biases into the mutant's structure toward the seed structure, as demonstrated by A14GA38I structures solved using micro-seeds from A14GA38G, A14GA38L and A14GA38I. Though hetero-micro-seeding is a simple and almost naïve strategy, this is the first direct demonstration of its workability. We believe that hetero-micro-seeding, which is contrasting with the popular idea that crystallization requires highly purified proteins, could contribute a new tool for rapidly solving protein structures in mutational analysis studies. Copyright © 2017 Elsevier Inc. All rights reserved.
Structure of CC Chemokine Receptor 2 with Orthosteric and Allosteric Antagonists
Zheng, Yi; Qin, Ling; Ortiz Zacarías, Natalia V.; de Vries, Henk; Han, Gye Won; Gustavsson, Martin; Dabros, Marta; Zhao, Chunxia; Cherney, Robert J.; Carter, Percy; Stamos, Dean; Abagyan, Ruben; Cherezov, Vadim; Stevens, Raymond C.; IJzerman, Adriaan P.; Heitman, Laura H.; Tebben, Andrew; Kufareva, Irina; Handel, Tracy M.
2016-01-01
CC chemokine receptor 2 (CCR2) is one of 19 members of the chemokine receptor subfamily of human Class A G protein-coupled receptors (GPCRs). CCR2 is expressed on monocytes, immature dendritic cells and T cell subpopulations, and mediates their migration towards endogenous CC chemokine ligands such as CCL2. CCR2 and its ligands are implicated in numerous inflammatory and neurodegenerative diseases including atherosclerosis, multiple sclerosis, asthma, neuropathic pain, and diabetic nephropathy, as well as cancer. These disease associations have motivated numerous preclinical studies and clinical trials (see ClinicalTrials.gov) in search of therapies that target the CCR2:chemokine axis. To aid drug discovery efforts, we solved a structure of CCR2 in a ternary complex with an orthosteric (BMS-681) and allosteric (CCR2-RA-[R]) antagonist. BMS-681 inhibits chemokine binding by occupying the orthosteric pocket of the receptor in a previously unseen binding mode. CCR2-RA-[R] binds in a novel, highly druggable pocket that is the most intracellular allosteric site observed in Class A GPCRs to date; this site spatially overlaps the G protein-binding site in homologous receptors. CCR2-RA-[R] inhibits CCR2 non-competitively by blocking activation-associated conformational changes and formation of the G protein-binding interface. The conformational signature of the conserved microswitch residues observed in double-antagonist-bound CCR2 resembles the most inactive GPCR structures solved to date. Like other protein:protein interactions, receptor:chemokine complexes are considered challenging therapeutic targets for small molecules, and the present structure suggests diverse pocket epitopes that can be exploited to overcome drug design obstacles. PMID:27926736
Serial Femtosecond Crystallography of G Protein-Coupled Receptors
Liu, Wei; Wacker, Daniel; Gati, Cornelius; Han, Gye Won; James, Daniel; Wang, Dingjie; Nelson, Garrett; Weierstall, Uwe; Katritch, Vsevolod; Barty, Anton; Zatsepin, Nadia A.; Li, Dianfan; Messerschmidt, Marc; Boutet, Sébastien; Williams, Garth J.; Koglin, Jason E.; Seibert, M. Marvin; Wang, Chong; Shah, Syed T.A.; Basu, Shibom; Fromme, Raimund; Kupitz, Christopher; Rendek, Kimberley N.; Grotjohann, Ingo; Fromme, Petra; Kirian, Richard A.; Beyerlein, Kenneth R.; White, Thomas A.; Chapman, Henry N.; Caffrey, Martin; Spence, John C.H.; Stevens, Raymond C.; Cherezov, Vadim
2014-01-01
X-ray crystallography of G protein-coupled receptors and other membrane proteins is hampered by difficulties associated with growing sufficiently large crystals that withstand radiation damage and yield high-resolution data at synchrotron sources. Here we used an x-ray free-electron laser (XFEL) with individual 50-fs duration x-ray pulses to minimize radiation damage and obtained a high-resolution room temperature structure of a human serotonin receptor using sub-10 µm microcrystals grown in a membrane mimetic matrix known as lipidic cubic phase. Compared to the structure solved by traditional microcrystallography from cryo-cooled crystals of about two orders of magnitude larger volume, the room temperature XFEL structure displays a distinct distribution of thermal motions and conformations of residues that likely more accurately represent the receptor structure and dynamics in a cellular environment. PMID:24357322
Criteria to Extract High-Quality Protein Data Bank Subsets for Structure Users.
Carugo, Oliviero; Djinović-Carugo, Kristina
2016-01-01
It is often necessary to build subsets of the Protein Data Bank to extract structural trends and average values. For this purpose it is mandatory that the subsets are non-redundant and of high quality. The first problem can be solved relatively easily at the sequence level or at the structural level. The second, on the contrary, needs special attention. It is not sufficient, in fact, to consider only the crystallographic resolution; other features must be taken into account: the absence of strings of residues from the electron density maps and from the files deposited in the Protein Data Bank; the B-factor values; the appropriate validation of the structural models; the quality of the electron density maps, which is not uniform; and the temperature of the diffraction experiments. More stringent criteria produce smaller subsets, which can be enlarged with more tolerant selection criteria. The incessant growth of the Protein Data Bank and especially of the number of high-resolution structures is allowing the use of more stringent selection criteria, with a consequent improvement of the quality of the subsets of the Protein Data Bank.
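The trade-off between stringency and subset size can be sketched as a simple filter. Everything in this example is hypothetical: the record fields, the threshold values, and the temperature window are placeholders chosen for illustration, not criteria taken from the article, and redundancy removal is not shown.

```python
from dataclasses import dataclass


@dataclass
class Entry:
    pdb_id: str            # placeholder identifier
    resolution: float      # crystallographic resolution, Å
    missing_residues: int  # residues absent from the deposited model
    mean_b_factor: float   # average B-factor, Å^2
    temperature: float     # data-collection temperature, K


@dataclass
class Criteria:
    max_resolution: float
    max_missing_residues: int
    max_mean_b_factor: float
    min_temperature: float
    max_temperature: float


def select_subset(entries, criteria):
    """Keep only the entries that satisfy every quality criterion."""
    return [
        e for e in entries
        if e.resolution <= criteria.max_resolution
        and e.missing_residues <= criteria.max_missing_residues
        and e.mean_b_factor <= criteria.max_mean_b_factor
        and criteria.min_temperature <= e.temperature <= criteria.max_temperature
    ]
```

Tightening any threshold can only shrink the selected subset, which mirrors the observation that more stringent criteria produce smaller subsets.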
X-ray diffraction study of Penicillium vitale catalase in the complex with aminotriazole
DOE Office of Scientific and Technical Information (OSTI.GOV)
Borovik, A. A.; Grebenko, A. I.; Melik-Adamyan, V. R., E-mail: mawr@ns.crys.ras.ru
2011-07-15
The three-dimensional structure of the enzyme catalase from Penicillium vitale in a complex with the inhibitor aminotriazole was solved and refined by protein X-ray crystallography methods. An analysis of the three-dimensional structure of the complex showed that the inhibition of the enzyme occurs as a result of the covalent binding of aminotriazole to the amino-acid residue His64 in the active site of the enzyme. An investigation of the three-dimensional structure of the complex resulted in the amino-acid residues being more precisely identified. The binding sites of saccharide residues and calcium ions in the protein molecule were found.
An estimated 5% of new protein structures solved today represent a new Pfam family
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mistry, Jaina; Kloppmann, Edda; Rost, Burkhard
2013-11-01
This study uses the Pfam database to show that the sequence redundancy of protein structures deposited in the PDB is increasing. The possible reasons behind this trend are discussed. High-resolution structural knowledge is key to understanding how proteins function at the molecular level. The number of entries in the Protein Data Bank (PDB), the repository of all publicly available protein structures, continues to increase, with more than 8000 structures released in 2012 alone. The authors of this article have studied how structural coverage of the protein-sequence space has changed over time by monitoring the number of Pfam families that acquired their first representative structure each year from 1976 to 2012. Twenty years ago, for every 100 new PDB entries released, an estimated 20 Pfam families acquired their first structure. By 2012, this decreased to only about five families per 100 structures. The reasons behind the slower pace at which previously uncharacterized families are being structurally covered were investigated. It was found that although more than 50% of current Pfam families are still without a structural representative, this set is enriched in families that are small, functionally uncharacterized or rich in problem features such as intrinsically disordered and transmembrane regions. While these are important constraints, the reasons why it may not yet be time to give up the pursuit of a targeted but more comprehensive structural coverage of the protein-sequence space are discussed.
Low-resolution structure of Drosophila translin
Kumar, Vinay; Gupta, Gagan D.
2012-01-01
Crystals of native Drosophila melanogaster translin diffracted to 7 Å resolution. Reductive methylation of the protein improved crystal quality. The native and methylated proteins showed similar profiles in size-exclusion chromatography analyses but the methylated protein displayed reduced DNA-binding activity. Crystals of the methylated protein diffracted to 4.2 Å resolution at BM14 of the ESRF synchrotron. Crystals with 49% solvent content belonged to monoclinic space group P21 with eight protomers in the asymmetric unit. Only 2% of low-resolution structures with similar low percentage solvent content were found in the PDB. The crystal structure, solved by molecular replacement method, refined to Rwork (Rfree) of 0.24 (0.29) with excellent stereochemistry. The crystal structure clearly shows that drosophila protein exists as an octamer, and not as a decamer as expected from gel-filtration elution profiles. The similar octameric quaternary fold in translin orthologs and in translin–TRAX complexes suggests an up-down dimer as the basic structural subunit of translin-like proteins. The drosophila oligomer displays asymmetric assembly and increased radius of gyration that accounts for the observed differences between the elution profiles of human and drosophila proteins on gel-filtration columns. This study demonstrates clearly that low-resolution X-ray structure can be useful in understanding complex biological oligomers. PMID:23650579
Thoden, James B; Holden, Hazel M
2014-06-01
Unusual di- and trideoxysugars are often found on the O-antigens of Gram-negative bacteria, on the S-layers of Gram-positive bacteria, and on various natural products. One such sugar is 3-acetamido-3,6-dideoxy-D-glucose. A key step in its biosynthesis, catalyzed by a 3,4-ketoisomerase, is the conversion of thymidine diphosphate (dTDP)-4-keto-6-deoxyglucose to dTDP-3-keto-6-deoxyglucose. Here we report an X-ray analysis of a 3,4-ketoisomerase from Thermoanaerobacterium thermosaccharolyticum. For this investigation, the wild-type enzyme, referred to as QdtA, was crystallized in the presence of dTDP and its structure solved to 2.0-Å resolution. The dimeric enzyme adopts a three-dimensional architecture that is characteristic for proteins belonging to the cupin superfamily. In order to trap the dTDP-4-keto-6-deoxyglucose substrate into the active site, a mutant protein, H51N, was subsequently constructed, and the structure of this protein in complex with the dTDP-sugar ligand was solved to 1.9-Å resolution. Taken together, the structures suggest that His 51 serves as a catalytic base, that Tyr 37 likely functions as a catalytic acid, and that His 53 provides a proton shuttle between the C-3' hydroxyl and the C-4' keto group of the hexose. This study reports the first three-dimensional structure of a 3,4-ketoisomerase in complex with its dTDP-sugar substrate and thus sheds new molecular insight into this fascinating class of enzymes. © 2014 The Protein Society.
pK(A) in proteins solving the Poisson-Boltzmann equation with finite elements.
Sakalli, Ilkay; Knapp, Ernst-Walter
2015-11-05
Knowledge on pK(A) values is an eminent factor to understand the function of proteins in living systems. We present a novel approach demonstrating that the finite element (FE) method of solving the linearized Poisson-Boltzmann equation (lPBE) can successfully be used to compute pK(A) values in proteins with high accuracy as a possible replacement to finite difference (FD) method. For this purpose, we implemented the software molecular Finite Element Solver (mFES) in the framework of the Karlsberg+ program to compute pK(A) values. This work focuses on a comparison between pK(A) computations obtained with the well-established FD method and with the new developed FE method mFES, solving the lPBE using protein crystal structures without conformational changes. Accurate and coarse model systems are set up with mFES using a similar number of unknowns compared with the FD method. Our FE method delivers results for computations of pK(A) values and interaction energies of titratable groups, which are comparable in accuracy. We introduce different thermodynamic cycles to evaluate pK(A) values and we show for the FE method how different parameters influence the accuracy of computed pK(A) values. © 2015 Wiley Periodicals, Inc.
Kemege, Kyle E.; Hickey, John M.; Lovell, Scott; Battaile, Kevin P.; Zhang, Yang; Hefty, P. Scott
2011-01-01
Chlamydia trachomatis is a medically important pathogen that encodes a relatively high percentage of proteins with unknown function. The three-dimensional structure of a protein can be very informative regarding the protein's functional characteristics; however, determining protein structures experimentally can be very challenging. Computational methods that model protein structures with sufficient accuracy to facilitate functional studies have had notable successes. To evaluate the accuracy and potential impact of computational protein structure modeling of hypothetical proteins encoded by Chlamydia, a successful computational method termed I-TASSER was utilized to model the three-dimensional structure of a hypothetical protein encoded by open reading frame (ORF) CT296. CT296 has been reported to exhibit functional properties of a divalent cation transcription repressor (DcrA), with similarity to the Escherichia coli iron-responsive transcriptional repressor, Fur. Unexpectedly, the I-TASSER model of CT296 exhibited no structural similarity to any DNA-interacting proteins or motifs. To validate the I-TASSER-generated model, the structure of CT296 was solved experimentally using X-ray crystallography. Impressively, the ab initio I-TASSER-generated model closely matched (2.72-Å Cα root mean square deviation [RMSD]) the high-resolution (1.8-Å) crystal structure of CT296. Modeled and experimentally determined structures of CT296 share structural characteristics of non-heme Fe(II) 2-oxoglutarate-dependent enzymes, although key enzymatic residues are not conserved, suggesting a unique biochemical process is likely associated with CT296 function. Additionally, functional analyses did not support prior reports that CT296 has properties shared with divalent cation repressors such as Fur. PMID:21965559
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kemege, Kyle E.; Hickey, John M.; Lovell, Scott
2012-02-13
Chlamydia trachomatis is a medically important pathogen that encodes a relatively high percentage of proteins with unknown function. The three-dimensional structure of a protein can be very informative regarding the protein's functional characteristics; however, determining protein structures experimentally can be very challenging. Computational methods that model protein structures with sufficient accuracy to facilitate functional studies have had notable successes. To evaluate the accuracy and potential impact of computational protein structure modeling of hypothetical proteins encoded by Chlamydia, a successful computational method termed I-TASSER was utilized to model the three-dimensional structure of a hypothetical protein encoded by open reading frame (ORF) CT296. CT296 has been reported to exhibit functional properties of a divalent cation transcription repressor (DcrA), with similarity to the Escherichia coli iron-responsive transcriptional repressor, Fur. Unexpectedly, the I-TASSER model of CT296 exhibited no structural similarity to any DNA-interacting proteins or motifs. To validate the I-TASSER-generated model, the structure of CT296 was solved experimentally using X-ray crystallography. Impressively, the ab initio I-TASSER-generated model closely matched (2.72-Å Cα root mean square deviation [RMSD]) the high-resolution (1.8-Å) crystal structure of CT296. Modeled and experimentally determined structures of CT296 share structural characteristics of non-heme Fe(II) 2-oxoglutarate-dependent enzymes, although key enzymatic residues are not conserved, suggesting a unique biochemical process is likely associated with CT296 function. Additionally, functional analyses did not support prior reports that CT296 has properties shared with divalent cation repressors such as Fur.
Bunker, Richard D; Mandal, Kalyaneswar; Bashiri, Ghader; Chaston, Jessica J; Pentelute, Bradley L; Lott, J Shaun; Kent, Stephen B H; Baker, Edward N
2015-04-07
Protein 3D structure can be a powerful predictor of function, but it often faces a critical roadblock at the crystallization step. Rv1738, a protein from Mycobacterium tuberculosis that is strongly implicated in the onset of nonreplicating persistence, and thereby latent tuberculosis, resisted extensive attempts at crystallization. Chemical synthesis of the L- and D-enantiomeric forms of Rv1738 enabled facile crystallization of the D/L-racemic mixture. The structure was solved by an ab initio approach that took advantage of the quantized phases characteristic of diffraction by centrosymmetric crystals. The structure, containing L- and D-dimers in a centrosymmetric space group, revealed unexpected homology with bacterial hibernation-promoting factors that bind to ribosomes and suppress translation. This suggests that the functional role of Rv1738 is to contribute to the shutdown of ribosomal protein synthesis during the onset of nonreplicating persistence of M. tuberculosis.
Progress in protein crystallography.
Dauter, Zbigniew; Wlodawer, Alexander
2016-01-01
Macromolecular crystallography evolved enormously from the pioneering days, when structures were solved by "wizards" performing all complicated procedures almost by hand. In the current situation crystal structures of large systems can be often solved very effectively by various powerful automatic programs in days or hours, or even minutes. Such progress is to a large extent coupled to the advances in many other fields, such as genetic engineering, computer technology, availability of synchrotron beam lines and many other techniques, creating the highly interdisciplinary science of macromolecular crystallography. Due to this unprecedented success crystallography is often treated as one of the analytical methods and practiced by researchers interested in structures of macromolecules, but not highly competent in the procedures involved in the process of structure determination. One should therefore take into account that the contemporary, highly automatic systems can produce results almost without human intervention, but the resulting structures must be carefully checked and validated before their release into the public domain.
Lobley, Carina M C; Aller, Pierre; Douangamath, Alice; Reddivari, Yamini; Bumann, Mario; Bird, Louise E; Nettleship, Joanne E; Brandao-Neto, Jose; Owens, Raymond J; O'Toole, Paul W; Walsh, Martin A
2012-12-01
The structure of ribose 5-phosphate isomerase from the probiotic bacterium Lactobacillus salivarius UCC188 has been determined at 1.72 Å resolution. The structure was solved by molecular replacement, which identified the functional homodimer in the asymmetric unit. Despite only showing 57% sequence identity to its closest homologue, the structure adopted the typical α and β D-ribose 5-phosphate isomerase fold. Comparison to other related structures revealed high homology in the active site, allowing a model of the substrate-bound protein to be proposed. The determination of the structure was expedited by the use of in situ crystallization-plate screening on beamline I04-1 at Diamond Light Source to identify well diffracting protein crystals prior to routine cryocrystallography.
Structural analysis of β-glucosidase mutants derived from a hyperthermophilic tetrameric structure
Nakabayashi, Makoto; Kataoka, Misumi; Mishima, Yumiko; Maeno, Yuka; Ishikawa, Kazuhiko
2014-01-01
β-Glucosidase from Pyrococcus furiosus (BGLPf) is a hyperthermophilic tetrameric enzyme which can degrade cellooligosaccharides to glucose under hyperthermophilic conditions and thus holds promise for the saccharification of lignocellulosic biomass at high temperature. Prior to the production of large amounts of this enzyme, detailed information regarding the oligomeric structure of the enzyme is required. Several crystals of BGLPf have been prepared over the past ten years, but its crystal structure had not been solved until recently. In 2011, the first crystal structure of BGLPf was solved and a model was constructed at somewhat low resolution (2.35 Å). In order to obtain more detailed structural data on BGLPf, the relationship between its tetrameric structure and the quality of the crystal was re-examined. A dimeric form of BGLPf was constructed and its crystal structure was solved at a resolution of 1.70 Å using protein-engineering methods. Furthermore, using the high-resolution crystal structural data for the dimeric form, a monomeric form of BGLPf was constructed which retained the intrinsic activity of the tetrameric form. The thermostability of BGLPf is affected by its oligomeric structure. Here, the biophysical and biochemical properties of engineered dimeric and monomeric BGLPfs are reported, which are promising prototype models to apply to the saccharification reaction. Furthermore, details regarding the oligomeric structures of BGLPf and the reasons why the mutations yielded improved crystal structures are discussed. PMID:24598756
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leite, Wellington C.; Galvão, Carolina W.; Saab, Sérgio C.
The bacterial RecA protein plays a role in the complex system of DNA damage repair. Here, we report the functional and structural characterization of the Herbaspirillum seropedicae RecA protein (HsRecA). HsRecA protein is more efficient at displacing SSB protein from ssDNA than Escherichia coli RecA protein. HsRecA also promotes DNA strand exchange more efficiently. The three dimensional structure of HsRecA-ADP/ATP complex has been solved to 1.7 Å resolution. HsRecA protein contains a small N-terminal domain, a central core ATPase domain and a large C-terminal domain, that are similar to homologous bacterial RecA proteins. Comparative structural analysis showed that the N-terminal polymerization motif of archaeal and eukaryotic RecA family proteins are also present in bacterial RecAs. Reconstruction of electrostatic potential from the hexameric structure of HsRecA-ADP/ATP revealed a high positive charge along the inner side, where ssDNA is bound inside the filament. The properties of this surface may explain the greater capacity of HsRecA protein to bind ssDNA, forming a contiguous nucleoprotein filament, displace SSB and promote DNA exchange relative to EcRecA. In conclusion, our functional and structural analyses provide insight into the molecular mechanisms of polymerization of bacterial RecA as a helical nucleoprotein filament.
Galvão, Carolina W.; Saab, Sérgio C.; Iulek, Jorge; Etto, Rafael M.; Steffens, Maria B. R.; Chitteni-Pattu, Sindhu; Stanage, Tyler; Keck, James L.; Cox, Michael M.
2016-01-01
The bacterial RecA protein plays a role in the complex system of DNA damage repair. Here, we report the functional and structural characterization of the Herbaspirillum seropedicae RecA protein (HsRecA). HsRecA protein is more efficient at displacing SSB protein from ssDNA than Escherichia coli RecA protein. HsRecA also promotes DNA strand exchange more efficiently. The three dimensional structure of HsRecA-ADP/ATP complex has been solved to 1.7 Å resolution. HsRecA protein contains a small N-terminal domain, a central core ATPase domain and a large C-terminal domain, that are similar to homologous bacterial RecA proteins. Comparative structural analysis showed that the N-terminal polymerization motif of archaeal and eukaryotic RecA family proteins are also present in bacterial RecAs. Reconstruction of electrostatic potential from the hexameric structure of HsRecA-ADP/ATP revealed a high positive charge along the inner side, where ssDNA is bound inside the filament. The properties of this surface may explain the greater capacity of HsRecA protein to bind ssDNA, forming a contiguous nucleoprotein filament, displace SSB and promote DNA exchange relative to EcRecA. Our functional and structural analyses provide insight into the molecular mechanisms of polymerization of bacterial RecA as a helical nucleoprotein filament. PMID:27447485
Beebe, Emily T.; Makino, Shin-ichi; Nozawa, Akira; Matsubara, Yuko; Frederick, Ronnie O.; Primm, John G.; Goren, Michael A.; Fox, Brian G.
2010-01-01
The use of the Protemist XE, an automated discontinuous-batch protein synthesis robot, in cell-free translation is reported. The soluble Galdieria sulphuraria protein DCN1 was obtained in greater than 2 mg total synthesis yield per mL of reaction mixture from the Protemist XE, and the structure was subsequently solved by X-ray crystallography using material from one 10 mL synthesis (PDB ID: 3KEV). The Protemist XE was also capable of membrane protein translation. Thus human sigma-1 receptor was translated in the presence of unilamellar liposomes and bacteriorhodopsin was translated directly into detergent micelles in the presence of all-trans-retinal. The versatility, ease of use, and compact size of the Protemist XE robot demonstrate its suitability for large-scale synthesis of many classes of proteins. PMID:20637905
DOE Office of Scientific and Technical Information (OSTI.GOV)
Clore, G. Marius; Venditti, Vincenzo
2013-10-01
The bacterial phosphotransferase system (PTS) couples phosphoryl transfer, via a series of bimolecular protein–protein interactions, to sugar transport across the membrane. The multitude of complexes in the PTS provides a paradigm for studying protein interactions, and for understanding how the same binding surface can specifically recognize a diverse array of targets. Fifteen years of work aimed at solving the solution structures of all soluble protein–protein complexes of the PTS has served as a test bed for developing NMR and integrated hybrid approaches to study larger complexes in solution and to probe transient, spectroscopically invisible states, including encounter complexes. We review these approaches, highlighting the problems that can be tackled with these methods, and summarize the current findings on protein interactions.
Matching multiple rigid domain decompositions of proteins
Flynn, Emily; Streinu, Ileana
2017-01-01
We describe efficient methods for consistently coloring and visualizing collections of rigid cluster decompositions obtained from variations of a protein structure, and lay the foundation for more complex setups that may involve different computational and experimental methods. The focus here is on three biological applications: the conceptually simpler problems of visualizing results of dilution and mutation analyses, and the more complex task of matching decompositions of multiple NMR models of the same protein. Implemented into the KINARI web server application, the improved visualization techniques give useful information about protein folding cores, help examining the effect of mutations on protein flexibility and function, and provide insights into the structural motions of PDB proteins solved with solution NMR. These tools have been developed with the goal of improving and validating rigidity analysis as a credible coarse-grained model capturing essential information about a protein’s slow motions near the native state. PMID:28141528
Moon, Andrea F; Mueller, Geoffrey A; Zhong, Xuejun; Pedersen, Lars C
2010-01-01
Protein crystallographers are often confronted with recalcitrant proteins not readily crystallizable, or which crystallize in problematic forms. A variety of techniques have been used to surmount such obstacles: crystallization using carrier proteins or antibody complexes, chemical modification, surface entropy reduction, proteolytic digestion, and additive screening. Here we present a synergistic approach for successful crystallization of proteins that do not form diffraction quality crystals using conventional methods. This approach combines favorable aspects of carrier-driven crystallization with surface entropy reduction. We have generated a series of maltose binding protein (MBP) fusion constructs containing different surface mutations designed to reduce surface entropy and encourage crystal lattice formation. The MBP advantageously increases protein expression and solubility, and provides a streamlined purification protocol. Using this technique, we have successfully solved the structures of three unrelated proteins that were previously unattainable. This crystallization technique represents a valuable rescue strategy for protein structure solution when conventional methods fail. PMID:20196072
Banerjee, Ankan; Tsai, Chi-Lin; Chaudhury, Paushali; ...
2015-05-01
Archaea employ the archaellum, a type IV pilus-like nanomachine, for swimming motility. In the crenarchaeon Sulfolobus acidocaldarius, the archaellum consists of seven proteins: FlaB/X/G/F/H/I/J. FlaF is conserved and essential for archaellum assembly but no FlaF structures exist. Here, we truncated the FlaF N terminus and solved 1.5-Å and 1.65-Å resolution crystal structures of this monotopic membrane protein. Structures revealed an N-terminal α-helix and an eight-strand β-sandwich, immunoglobulin-like fold with striking similarity to S-layer proteins. Crystal structures, X-ray scattering, and mutational analyses suggest dimer assembly is needed for in vivo function. The sole cell envelope component of S. acidocaldarius is a paracrystalline S-layer, and FlaF specifically bound to S-layer protein, suggesting that its interaction domain is located in the pseudoperiplasm with its N-terminal helix in the membrane. From these data, FlaF may act as the previously unknown archaellum stator protein that anchors the rotating archaellum to the archaeal cell envelope.
Nonlinear optical methods for the analysis of protein nanocrystals and biological tissues
NASA Astrophysics Data System (ADS)
Dow, Ximeng You
Structural biology underpins rational drug design and fundamental understanding of protein function. X-ray diffraction (XRD) has been the gold standard for solving high-resolution protein structures. Second harmonic generation (SHG) microscopy has been developed by the Simpson lab as a sensitive, crystal-specific detection method for identifying protein crystals and helping to optimize crystallization conditions. Protein nanocrystals have been widely used for structure determination of membrane proteins in serial femtosecond nanocrystallography. In this thesis work, novel nonlinear optical methods were developed to address the challenges associated with the detection and characterization of protein nanocrystals. SHG-correlation spectroscopy (SHG-CS) was developed to take advantage of the diffusing motion of the nanocrystals and retrieve their size distribution and crystal quality. A polarization-dependent SHG imaging technique was developed to measure the relative orientation as well as the internal structure of the sample. Two-photon-excited fluorescence has been used in the Simpson lab as a measurement complementary to the inherent SHG signal from the crystals. A novel instrumentation development was also introduced in this thesis work to greatly improve the speed of fluorescence lifetime imaging (FLIM).
Quaternary structure of a G-protein-coupled receptor heterotetramer in complex with Gi and Gs.
Navarro, Gemma; Cordomí, Arnau; Zelman-Femiak, Monika; Brugarolas, Marc; Moreno, Estefania; Aguinaga, David; Perez-Benito, Laura; Cortés, Antoni; Casadó, Vicent; Mallol, Josefa; Canela, Enric I; Lluís, Carme; Pardo, Leonardo; García-Sáez, Ana J; McCormick, Peter J; Franco, Rafael
2016-04-05
G-protein-coupled receptors (GPCRs), in the form of monomers or homodimers that bind heterotrimeric G proteins, are fundamental in the transfer of extracellular stimuli to intracellular signaling pathways. Different GPCRs may also interact to form heteromers that are novel signaling units. Despite the exponential growth in the number of solved GPCR crystal structures, the structural properties of heteromers remain unknown. We used single-particle tracking experiments in cells expressing functional adenosine A1-A2A receptors fused to fluorescent proteins to show the loss of Brownian movement of the A1 receptor in the presence of the A2A receptor, and a preponderance of cell surface 2:2 receptor heteromers (dimer of dimers). Using computer modeling, aided by bioluminescence resonance energy transfer assays to monitor receptor homomerization and heteromerization and G-protein coupling, we predict the interacting interfaces and propose a quaternary structure of the GPCR tetramer in complex with two G proteins. The combination of results points to a molecular architecture formed by a rhombus-shaped heterotetramer, which is bound to two different interacting heterotrimeric G proteins (Gi and Gs). These novel results constitute an important advance in understanding the molecular intricacies involved in GPCR function.
Functional classification of protein structures by local structure matching in graph representation.
Mills, Caitlyn L; Garg, Rohan; Lee, Joslynn S; Tian, Liang; Suciu, Alexandru; Cooperman, Gene; Beuning, Penny J; Ondrechen, Mary Jo
2018-03-31
As a result of high-throughput protein structure initiatives, over 14,400 protein structures have been solved by structural genomics (SG) centers and participating research groups. While the totality of SG data represents a tremendous contribution to genomics and structural biology, reliable functional information for these proteins is generally lacking. Better functional predictions for SG proteins will add substantial value to the structural information already obtained. Our method described herein, Graph Representation of Active Sites for Prediction of Function (GRASP-Func), predicts quickly and accurately the biochemical function of proteins by representing residues at the predicted local active site as graphs rather than in Cartesian coordinates. We compare the GRASP-Func method to our previously reported method, structurally aligned local sites of activity (SALSA), using the ribulose phosphate binding barrel (RPBB), 6-hairpin glycosidase (6-HG), and Concanavalin A-like Lectins/Glucanase (CAL/G) superfamilies as test cases. In each of the superfamilies, SALSA and the much faster method GRASP-Func yield similar correct classification of previously characterized proteins, providing a validated benchmark for the new method. In addition, we analyzed SG proteins using our SALSA and GRASP-Func methods to predict function. Forty-one SG proteins in the RPBB superfamily, nine SG proteins in the 6-HG superfamily, and one SG protein in the CAL/G superfamily were successfully classified into one of the functional families in their respective superfamily by both methods. This improved, faster, validated computational method can yield more reliable predictions of function that can be used for a wide variety of applications by the community. © 2018 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.
ProDaMa: an open source Python library to generate protein structure datasets.
Armano, Giuliano; Manconi, Andrea
2009-10-02
The huge difference between the number of known sequences and known tertiary structures has justified the use of automated methods for protein analysis. Although a general methodology to solve these problems has not yet been devised, researchers are engaged in developing more accurate techniques and algorithms whose training plays a relevant role in determining their performance. From this perspective, particular importance is given to the training data used in experiments, and researchers are often engaged in the generation of specialized datasets that meet their requirements. To facilitate the task of generating specialized datasets we devised and implemented ProDaMa, an open source Python library that provides classes for retrieving, organizing, updating, analyzing, and filtering protein data. ProDaMa has been used to generate specialized datasets useful for secondary structure prediction and to develop a collaborative web application aimed at generating and sharing protein structure datasets. The library, the related database, and the documentation are freely available at the URL http://iasc.diee.unica.it/prodama.
Structure of the human DNA-repair protein RAD52 containing surface mutations.
Saotome, Mika; Saito, Kengo; Onodera, Keiichi; Kurumizaka, Hitoshi; Kagawa, Wataru
2016-08-01
The Rad52 protein is a eukaryotic single-strand DNA-annealing protein that is involved in the homologous recombinational repair of DNA double-strand breaks. The isolated N-terminal half of the human RAD52 protein (RAD52(1-212)) forms an undecameric ring structure with a surface that is mostly positively charged. In the present study, it was found that RAD52(1-212) containing alanine mutations of the charged surface residues (Lys102, Lys133 and Glu202) is highly amenable to crystallization. The structure of the mutant RAD52(1-212) was solved at 2.4 Å resolution. The structure revealed an association between the symmetry-related RAD52(1-212) rings, in which a partially unfolded, C-terminal region of RAD52 extended into the DNA-binding groove of the neighbouring ring in the crystal. The alanine mutations probably reduced the surface entropy of the RAD52(1-212) ring and stabilized the ring-ring association observed in the crystal.
Huenges, M; Rölz, C; Gschwind, R; Peteranderl, R; Berglechner, F; Richter, G; Bacher, A; Kessler, H; Gemmecker, G
1998-01-01
The NusB protein of Escherichia coli is involved in the regulation of rRNA biosynthesis by transcriptional antitermination. In cooperation with several other proteins, it binds to a dodecamer motif designated rrn boxA on the nascent rRNA. The antitermination proteins of E. coli are recruited in the replication cycle of bacteriophage lambda, where they play an important role in switching from the lysogenic to the lytic cycle. Multidimensional heteronuclear NMR experiments were performed with recombinant NusB protein labelled with 13C, 15N and 2H. The three-dimensional structure of the protein was solved from 1926 NMR-derived distances and 80 torsion angle restraints. The protein folds into an alpha/alpha-helical topology consisting of six helices; the arginine-rich N-terminus appears to be disordered. Complexation of the protein with an RNA dodecamer equivalent to the rrn boxA site results in chemical shift changes of numerous amide signals. The overall packing of the protein appears to be conserved, but the flexible N-terminus adopts a more rigid structure upon RNA binding, indicating that the N-terminus functions as an arginine-rich RNA-binding motif (ARM). PMID:9670024
Structural basis for complement evasion by Lyme disease pathogen Borrelia burgdorferi.
Bhattacharjee, Arnab; Oeemig, Jesper S; Kolodziejczyk, Robert; Meri, Taru; Kajander, Tommi; Lehtinen, Markus J; Iwaï, Hideo; Jokiranta, T Sakari; Goldman, Adrian
2013-06-28
Borrelia burgdorferi spirochetes that cause Lyme borreliosis survive for a long time in human serum because they successfully evade the complement system, an important arm of innate immunity. The outer surface protein E (OspE) of B. burgdorferi is needed for this because it recruits complement regulator factor H (FH) onto the bacterial surface to evade complement-mediated cell lysis. To understand this process at the molecular level, we used a structural approach. First, we solved the solution structure of OspE by NMR, revealing a fold that has not been seen before in proteins involved in complement regulation. Next, we solved the x-ray structure of the complex between OspE and the FH C-terminal domains 19 and 20 (FH19-20) at 2.83 Å resolution. The structure shows that OspE binds FH19-20 in a way similar to, but not identical with, that used by endothelial cells to bind FH via glycosaminoglycans. The observed interaction of OspE with FH19-20 allows the full function of FH in down-regulation of complement activation on the bacteria. This reveals the molecular basis for how B. burgdorferi evades innate immunity and suggests how OspE could be used as a potential vaccine antigen.
Huang, Sheng Yu; Chen, Sung Fang; Chen, Chun Hao; Huang, Hsuan Wei; Wu, Wen Guey; Sung, Wang Chou
2014-09-02
Snake venom consists of toxin proteins with multiple disulfide linkages to generate unique structures and biological functions. Determination of these cysteine connections usually requires the purification of each protein followed by structural analysis. In this study, dimethyl labeling coupled with LC-MS/MS and the RADAR algorithm was developed to identify the disulfide bonds in crude snake venom. Without any protein separation, the disulfide linkages of several cytotoxins and PLA2 could be solved, including more than 20 disulfide bonds. The results show that this method is capable of analyzing protein mixtures. In addition, the approach was also used to compare native cytotoxin 3 (CTX III) and its scrambled isomer, another category of protein mixture, for unknown disulfide bonds. Two disulfide-linked peptides were observed in the native CTX III, and 10 in its scrambled form, X-CTX III. This is the first study that reports a platform for the global cysteine connection analysis on a protein mixture. The proposed method is simple and automatic, offering an efficient tool for structural and functional studies of venom proteins.
Integrating NOE and RDC using sum-of-squares relaxation for protein structure determination.
Khoo, Y; Singer, A; Cowburn, D
2017-07-01
We revisit the problem of protein structure determination from geometrical restraints from NMR, using convex optimization. It is well-known that the NP-hard distance geometry problem of determining atomic positions from pairwise distance restraints can be relaxed into a convex semidefinite program (SDP). However, often the NOE distance restraints are too imprecise and sparse for accurate structure determination. Residual dipolar coupling (RDC) measurements provide additional geometric information on the angles between atom-pair directions and axes of the principal-axis-frame. The optimization problem involving RDC is highly non-convex and requires a good initialization even within the simulated annealing framework. In this paper, we model the protein backbone as an articulated structure composed of rigid units. Determining the rotation of each rigid unit gives the full protein structure. We propose solving the non-convex optimization problems using the sum-of-squares (SOS) hierarchy, a hierarchy of convex relaxations with increasing complexity and approximation power. Unlike classical global optimization approaches, SOS optimization returns a certificate of optimality if the global optimum is found. Based on the SOS method, we propose two algorithms, RDC-SOS and RDC-NOE-SOS, that have polynomial time complexity in the number of amino-acid residues and run efficiently on a standard desktop. In many instances, the proposed methods exactly recover the solution to the original non-convex optimization problem. To the best of our knowledge this is the first time SOS relaxation is introduced to solve non-convex optimization problems in structural biology. We further introduce a statistical tool, the Cramér-Rao bound (CRB), to provide an information theoretic bound on the highest resolution one can hope to achieve when determining protein structure from noisy measurements using any unbiased estimator. Our simulation results show that when the RDC measurements are corrupted by Gaussian noise of realistic variance, both SOS based algorithms attain the CRB. We successfully apply our method in a divide-and-conquer fashion to determine the structure of ubiquitin from experimental NOE and RDC measurements obtained in two alignment media, achieving more accurate and faster reconstructions compared to the current state of the art.
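The SDP relaxation of the distance geometry problem mentioned in this abstract rests on a standard identity: squared pairwise distances are linear in the Gram matrix G of the coordinates, with d_ij^2 = G_ii + G_jj - 2 G_ij. The short Python sketch below only checks that identity numerically. The coordinates and function names are illustrative assumptions that do not come from the paper, and the sketch does not implement the RDC-SOS or RDC-NOE-SOS algorithms.

```python
import math

def gram_matrix(points):
    """Gram matrix G[i][j] = <x_i, x_j> for a list of 3D points."""
    return [[sum(a * b for a, b in zip(p, q)) for q in points] for p in points]

def squared_distance_from_gram(G, i, j):
    """Squared distance between points i and j, written as a linear
    function of Gram-matrix entries: G_ii + G_jj - 2 * G_ij."""
    return G[i][i] + G[j][j] - 2.0 * G[i][j]

# Hypothetical coordinates, chosen only for illustration.
points = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (1.5, 2.0, 0.0), (0.0, 2.0, 1.0)]
G = gram_matrix(points)

# Compare the Gram-based expression with the directly computed distance.
for i in range(len(points)):
    for j in range(i + 1, len(points)):
        direct = math.dist(points[i], points[j]) ** 2
        via_gram = squared_distance_from_gram(G, i, j)
        assert math.isclose(direct, via_gram, abs_tol=1e-9)
```

In a relaxation of this kind, G itself becomes the unknown: the distance restraints stay linear in G, the requirement that G be positive semidefinite is kept, and the non-convex condition that G has rank three is dropped.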
The mechanism of protein export enhancement by the SecDF membrane component
Tsukazaki, Tomoya; Nureki, Osamu
2011-01-01
Protein transport across membranes is a fundamental and essential cellular activity in all organisms. In bacteria, protein export across the cytoplasmic membrane, driven by dynamic interplays between the protein-conducting SecYEG channel (Sec translocon) and the SecA ATPase, is enhanced by the proton motive force (PMF) and a membrane-integrated Sec component, SecDF. However, the structure and function of SecDF have remained unclear. We solved the first crystal structure of SecDF, consisting of a pseudo-symmetrical 12-helix transmembrane domain and two protruding periplasmic domains. Based on the structural features, we proposed that SecDF functions as a membrane-integrated chaperone, which drives protein movement without using the major energetic currency, ATP, but with remarkable cycles of conformational changes, powered by the proton gradient across the membrane. By a series of biochemical and biophysical approaches, several functionally important residues in the transmembrane region have been identified and our model of the SecDF function has been verified. PMID:27857601
DOE Office of Scientific and Technical Information (OSTI.GOV)
Knapik, Aleksandra Alicja; Petkowski, Janusz Jurand; Otwinowski, Zbyszek
2014-10-02
RutC is the third enzyme in the Escherichia coli rut pathway of uracil degradation. RutC belongs to the highly conserved YjgF family of proteins. The structure of the RutC protein was determined and refined to 1.95 Å resolution. This crystal belonged to space group P21212 and contained six molecules in the asymmetric unit. The structure was solved by SAD phasing and was refined to an Rwork of 19.3% (Rfree = 21.7%). Moreover, the final model revealed that this protein has a Bacillus chorismate mutase-like fold and forms a homotrimer with a hydrophobic cavity in the center of the structure and ligand-binding clefts between two subunits. A likely function for RutC is the reduction of peroxy-aminoacrylate to aminoacrylate as a part of a detoxification process.
An improved stochastic fractal search algorithm for 3D protein structure prediction.
Zhou, Changjun; Sun, Chuan; Wang, Bin; Wang, Xiaojun
2018-05-03
Protein structure prediction (PSP) is a significant area for biological information research, disease treatment, and drug development. In this paper, three-dimensional structures of proteins are predicted from known amino acid sequences, and the structure prediction problem is transformed into a typical NP problem by an AB off-lattice model. This work applies a novel improved Stochastic Fractal Search algorithm (ISFS) to solve the problem. The Stochastic Fractal Search algorithm (SFS) is an effective evolutionary algorithm that performs well in exploring the search space but sometimes falls into local minima. To overcome this weakness, Lévy flight and internal feedback information are introduced in ISFS. In the experiments, simulations are conducted with the ISFS algorithm on Fibonacci sequences and real peptide sequences. Experimental results show that ISFS performs more efficiently and robustly in finding the global minimum and avoiding local minima.
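To make the objective in such a search concrete, the Python sketch below evaluates one commonly used form of the AB off-lattice energy (a bending term plus a species-dependent Lennard-Jones-like term, after Stillinger and co-workers) and generates the Fibonacci AB test sequences. The abstract does not state the exact energy function used, so the functional form, the coupling constants, and all function names here are assumptions. The ISFS search itself is not implemented.

```python
import math

def fibonacci_sequence(n):
    """n-th Fibonacci AB string: S0 = 'A', S1 = 'B', S(k+1) = S(k-1) + S(k)."""
    prev, cur = "A", "B"
    if n == 0:
        return prev
    for _ in range(n - 1):
        prev, cur = cur, prev + cur
    return cur

def coupling(a, b):
    """Assumed species coupling: AA attracts strongly, BB weakly, AB repels."""
    if a == "A" and b == "A":
        return 1.0
    if a == "B" and b == "B":
        return 0.5
    return -0.5

def ab_energy(sequence, coords):
    """Energy of a chain with one 3D coordinate per residue (assumed form)."""
    n = len(sequence)
    # Bond vectors between consecutive residues.
    bonds = [tuple(q - p for p, q in zip(coords[i], coords[i + 1]))
             for i in range(n - 1)]
    # Bending term: (1 - cos(theta)) / 4 for each pair of successive bonds.
    bend = 0.0
    for u, v in zip(bonds, bonds[1:]):
        cos_theta = sum(a * b for a, b in zip(u, v)) / (
            math.hypot(*u) * math.hypot(*v))
        bend += 0.25 * (1.0 - cos_theta)
    # Non-bonded term over residue pairs that are not chain neighbours.
    nonbonded = 0.0
    for i in range(n - 2):
        for j in range(i + 2, n):
            r = math.dist(coords[i], coords[j])
            c = coupling(sequence[i], sequence[j])
            nonbonded += 4.0 * (r ** -12 - c * r ** -6)
    return bend + nonbonded

# Example: a fully extended chain for the 13-residue Fibonacci sequence.
seq = fibonacci_sequence(6)
extended = [(float(i), 0.0, 0.0) for i in range(len(seq))]
print(seq, ab_energy(seq, extended))
```

A search algorithm would then vary the chain conformation, typically parameterised by bend and torsion angles with unit bond lengths, to minimise `ab_energy`.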
Breathing, bubbling, and bending: DNA flexibility from multimicrosecond simulations.
Zeida, Ari; Machado, Matías Rodrigo; Dans, Pablo Daniel; Pantano, Sergio
2012-08-01
Bending of the seemingly stiff DNA double helix is a fundamental physical process for any living organism. Specialized proteins recognize DNA inducing and stabilizing sharp curvatures of the double helix. However, experimental evidence suggests a high protein-independent flexibility of DNA. On the basis of coarse-grained simulations, we propose that DNA experiences thermally induced kinks associated with the spontaneous formation of internal bubbles. Comparison of the protein-induced DNA curvature calculated from the Protein Data Bank with that sampled by our simulations suggests that thermally induced distortions can account for ~80% of the DNA curvature present in experimentally solved structures.
Xu, Dong; Jaroszewski, Lukasz; Li, Zhanwen; Godzik, Adam
2015-01-01
Motivation: Most proteins consist of multiple domains, independent structural and evolutionary units that are often reshuffled in genomic rearrangements to form new protein architectures. Template-based modeling methods can often detect homologous templates for individual domains, but templates that could be used to model the entire query protein are often not available. Results: We have developed a fast docking algorithm ab initio domain assembly (AIDA) for assembling multi-domain protein structures, guided by the ab initio folding potential. This approach can be extended to discontinuous domains (i.e. domains with ‘inserted’ domains). When tested on experimentally solved structures of multi-domain proteins, the relative domain positions were accurately found among top 5000 models in 86% of cases. AIDA server can use domain assignments provided by the user or predict them from the provided sequence. The latter approach is particularly useful for automated protein structure prediction servers. The blind test consisting of 95 CASP10 targets shows that domain boundaries could be successfully determined for 97% of targets. Availability and implementation: The AIDA package as well as the benchmark sets used here are available for download at http://ffas.burnham.org/AIDA/. Contact: adam@sanfordburnham.org Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25701568
From protein structure to function via single crystal optical spectroscopy
Ronda, Luca; Bruno, Stefano; Bettati, Stefano; Storici, Paola; Mozzarelli, Andrea
2015-01-01
The more than 100,000 protein structures determined by X-ray crystallography provide a wealth of information for the characterization of biological processes at the molecular level. However, several crystallographic “artifacts,” including conformational selection, crystallization conditions and radiation damage, may affect the quality and the interpretation of the electron density maps, thus limiting the relevance of structure determinations. Moreover, for most of these structures, no functional data have been obtained in the crystalline state, thus posing serious questions on their validity in inferring protein mechanisms. In order to solve these issues, spectroscopic methods have been applied for the determination of equilibrium and kinetic properties of proteins in the crystalline state. These methods are UV-vis spectrophotometry, spectrofluorimetry, IR, EPR, Raman, and resonance Raman spectroscopy. Some of these approaches have been implemented with on-line instruments at X-ray synchrotron beamlines. Here, we provide an overview of investigations predominantly carried out in our laboratory by single crystal polarized absorption UV-vis microspectrophotometry, the most applied technique for the functional characterization of proteins in the crystalline state. Studies on hemoglobins, pyridoxal 5′-phosphate dependent enzymes and green fluorescent protein in the crystalline state have addressed key biological issues, leading to either straightforward structure-function correlations or limitations to structure-based mechanisms. PMID:25988179
Dehzangi, Abdollah; Paliwal, Kuldip; Sharma, Alok; Dehzangi, Omid; Sattar, Abdul
2013-01-01
A better understanding of the structural class of a given protein reveals important information about its overall folding type and its domain. It can also be used directly to provide critical information on the general tertiary structure of a protein, which has a profound impact on protein function determination and drug design. Despite tremendous enhancements made by pattern recognition-based approaches to solve this problem, it remains an unsolved issue for bioinformatics that demands more attention and exploration. In this study, we propose a novel feature extraction model that incorporates physicochemical and evolutionary-based information simultaneously. We also propose overlapped segmented distribution and autocorrelation-based feature extraction methods to provide more local and global discriminatory information. The proposed feature extraction methods are explored for the 15 most promising attributes, selected from a wide range of physicochemical-based attributes. Finally, by applying an ensemble of different classifiers, namely Adaboost.M1, LogitBoost, naive Bayes, multilayer perceptron (MLP), and support vector machine (SVM), we show enhancement of the protein structural class prediction accuracy for four popular benchmarks.
Structure and DNA-binding of meiosis-specific protein Hop2
NASA Astrophysics Data System (ADS)
Zhou, Donghua; Moktan, Hem; Pezza, Roberto
2014-03-01
Here we report structure elucidation of the DNA binding domain of homologous pairing protein 2 (Hop2), which is important to gene diversity when sperm and eggs are produced. Together with another protein, Mnd1, Hop2 enhances the strand invasion activity of recombinase Dmc1 by over 30 times, facilitating proper synapsis of homologous chromosomes. However, the structural and biochemical bases for the function of Hop2 and Mnd1 have not been well understood. As a first step toward such understanding, we recently solved the structure for the N-terminus of Hop2 (1-84) using solution NMR. This fragment shows a typical winged-head conformation with recognized DNA binding activity. DNA interacting sites were then investigated by chemical shift perturbations in a titration experiment. Information on these sites was used to guide protein-DNA docking with MD simulation, revealing that helix 3 is stably lodged in the DNA major groove and that wing 1 (connecting strands 2 and 3) transiently comes in contact with the minor groove on the nanosecond time scale. Mutagenesis analysis further confirmed the DNA binding sites in this fragment of the protein.
The neuronal porosome complex in health and disease
Naik, Akshata R; Lewis, Kenneth T
2015-01-01
Cup-shaped secretory portals at the cell plasma membrane called porosomes mediate the precision release of intravesicular material from cells. Membrane-bound secretory vesicles transiently dock and fuse at the base of porosomes facing the cytosol to expel pressurized intravesicular contents from the cell during secretion. The structure, isolation, composition, and functional reconstitution of the neuronal porosome complex have greatly progressed, providing a molecular understanding of its function in health and disease. Neuronal porosomes are 15 nm cup-shaped lipoprotein structures composed of nearly 40 proteins, compared to the 120 nm nuclear pore complex composed of >500 protein molecules. Membrane proteins compose the porosome complex, making it practically impossible to solve its atomic structure. However, atomic force microscopy and small-angle X-ray solution scattering studies have provided three-dimensional structural details of the native neuronal porosome at sub-nanometer resolution, providing insights into the molecular mechanism of its function. The participation of several porosome proteins previously implicated in neurotransmission and neurological disorders further attests to the crosstalk between porosome proteins and their coordinated involvement in the release of neurotransmitter at the synapse. PMID:26264442
Use of 13Cα Chemical-Shifts in Protein Structure Determination
Vila, Jorge A.; Ripoll, Daniel R.; Scheraga, Harold A.
2008-01-01
A physics-based method, aimed at determining protein structures by using NOE-derived distances together with observed and computed 13C chemical shifts, is proposed. The approach makes use of 13Cα chemical shifts, computed at the density functional level of theory, to obtain torsional constraints for all backbone and side-chain torsional angles without making a priori use of the occupancy of any region of the Ramachandran map by the amino acid residues. The torsional constraints are not fixed but are changed dynamically in each step of the procedure, following an iterative self-consistent approach intended to identify a set of conformations for which the computed 13Cα chemical shifts match the experimental ones. A test is carried out on a 76-amino acid all-α-helical protein, namely the B. subtilis acyl carrier protein. It is shown that, starting from randomly generated conformations, the final protein models are more accurate than an existing NMR-derived structure model of this protein, in terms of both the agreement between predicted and observed 13Cα chemical shifts and some stereochemical quality indicators, and of similar accuracy to one of the protein models solved at a high level of resolution. The results provide evidence that this methodology can be used not only for structure determination but also for additional protein structure refinement of NMR-derived models deposited in the Protein Data Bank. PMID:17516673
A novel inert crystal delivery medium for serial femtosecond crystallography
DOE Office of Scientific and Technical Information (OSTI.GOV)
Conrad, Chelsie E.; Basu, Shibom; James, Daniel
Serial femtosecond crystallography (SFX) has opened a new era in crystallography by permitting nearly damage-free, room-temperature structure determination of challenging proteins such as membrane proteins. In SFX, femtosecond X-ray free-electron laser pulses produce diffraction snapshots from nanocrystals and microcrystals delivered in a liquid jet, which leads to high protein consumption. A slow-moving stream of agarose has been developed as a new crystal delivery medium for SFX. It has low background scattering, is compatible with both soluble and membrane proteins, and can deliver the protein crystals at a wide range of temperatures down to 4°C. Using this crystal-laden agarose stream, the structure of a multi-subunit complex, phycocyanin, was solved to 2.5 Å resolution using 300 µg of microcrystals embedded into the agarose medium post-crystallization. The agarose delivery method reduces protein consumption by at least 100-fold and has the potential to be used for a diverse population of proteins, including membrane protein complexes.
A novel inert crystal delivery medium for serial femtosecond crystallography
Conrad, Chelsie E.; Basu, Shibom; James, Daniel; ...
2015-06-30
Serial femtosecond crystallography (SFX) has opened a new era in crystallography by permitting nearly damage-free, room-temperature structure determination of challenging proteins such as membrane proteins. In SFX, femtosecond X-ray free-electron laser pulses produce diffraction snapshots from nanocrystals and microcrystals delivered in a liquid jet, which leads to high protein consumption. A slow-moving stream of agarose has been developed as a new crystal delivery medium for SFX. It has low background scattering, is compatible with both soluble and membrane proteins, and can deliver the protein crystals at a wide range of temperatures down to 4°C. Using this crystal-laden agarose stream, the structure of a multi-subunit complex, phycocyanin, was solved to 2.5 Å resolution using 300 µg of microcrystals embedded into the agarose medium post-crystallization. The agarose delivery method reduces protein consumption by at least 100-fold and has the potential to be used for a diverse population of proteins, including membrane protein complexes.
Structural Insights into Functional Overlapping and Differentiation among Myosin V Motors
Nascimento, Andrey F. Z.; Trindade, Daniel M.; Tonoli, Celisa C. C.; de Giuseppe, Priscila O.; Assis, Leandro H. P.; Honorato, Rodrigo V.; de Oliveira, Paulo S. L.; Mahajan, Pravin; Burgess-Brown, Nicola A.; von Delft, Frank; Larson, Roy E.; Murakami, Mario T.
2013-01-01
Myosin V (MyoV) motors have been implicated in the intracellular transport of diverse cargoes including vesicles, organelles, RNA-protein complexes, and regulatory proteins. Here, we have solved the cargo-binding domain (CBD) structures of the three human MyoV paralogs (Va, Vb, and Vc), revealing subtle structural changes that drive functional differentiation and a novel redox mechanism controlling the CBD dimerization process, which is unique for the MyoVc subclass. Moreover, the cargo- and motor-binding sites were structurally assigned, indicating the conservation of residues involved in the recognition of adaptors for peroxisome transport and providing high resolution insights into motor domain inhibition by CBD. These results contribute to understanding the structural requirements for cargo transport, autoinhibition, and regulatory mechanisms in myosin V motors. PMID:24097982
Research Associate | Center for Cancer Research
PROGRAM DESCRIPTION: The Basic Science Program (BSP) pursues independent, multidisciplinary research in basic and applied molecular biology, immunology, retrovirology, cancer biology, and human genetics. Research efforts and support are an integral part of the Center for Cancer Research (CCR) at the Frederick National Laboratory for Cancer Research (FNLCR). KEY ROLES/RESPONSIBILITIES - Research Associate III: Dr. Zbigniew Dauter is the head investigator of the Synchrotron Radiation Research Section (SRRS) of CCR’s Macromolecular Crystallography Laboratory. The Synchrotron Radiation Research Section is located at Argonne National Laboratory, Argonne, Illinois; this is the site of the largest U.S. synchrotron facility. The SRRS uses X-ray diffraction techniques to solve crystal structures of various proteins and nucleic acids of biological and medical relevance. The section also specializes in analyzing crystal structures at extremely high resolution and accuracy, in developing methods of effective diffraction data collection, and in using weak anomalous dispersion effects to solve structures of macromolecules. The areas of expertise are structural and molecular biology, macromolecular crystallography, and diffraction data collection. Dr. Dauter requires research support in these areas, and the individual will engage in the purification and preparation of samples, crystallize proteins using various techniques, derivatize them with heavy atoms/anomalous scatterers, and establish conditions for cryogenic freezing. The individual will also participate in diffraction data collection at the Advanced Photon Source. In addition, the candidate will perform spectroscopic and chromatographic analyses of protein and nucleic acid samples in the context of their purity, oligomeric state and photophysical properties.
Analysis of crystallization data in the Protein Data Bank
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kirkwood, Jobie; Hargreaves, David; O’Keefe, Simon
In a large-scale study using data from the Protein Data Bank, some of the many reported findings regarding the crystallization of proteins were investigated. The Protein Data Bank (PDB) is the largest available repository of solved protein structures and contains a wealth of information on successful crystallization. Many centres have used their own experimental data to draw conclusions about proteins and the conditions in which they crystallize. Here, data from the PDB were used to reanalyse some of these results. The most successful crystallization reagents were identified, the link between solution pH and the isoelectric point of the protein was investigated and the possibility of predicting whether a protein will crystallize was explored.
Structural studies of human glioma pathogenesis-related protein 1
DOE Office of Scientific and Technical Information (OSTI.GOV)
Asojo, Oluwatoyin A., E-mail: oasojo@unmc.edu; Koski, Raymond A.; Bonafé, Nathalie
2011-10-01
Structural analysis of a truncated soluble domain of human glioma pathogenesis-related protein 1, a membrane protein implicated in the proliferation of aggressive brain cancer, is presented. Human glioma pathogenesis-related protein 1 (GLIPR1) is a membrane protein that is highly upregulated in brain cancers but is barely detectable in normal brain tissue. GLIPR1 is composed of a signal peptide that directs its secretion, a conserved cysteine-rich CAP (cysteine-rich secretory proteins, antigen 5 and pathogenesis-related 1 proteins) domain and a transmembrane domain. GLIPR1 is currently being investigated as a candidate for prostate cancer gene therapy and for glioblastoma targeted therapy. Crystal structures of a truncated soluble domain of the human GLIPR1 protein (sGLIPR1) solved by molecular replacement using a truncated polyalanine search model of the CAP domain of stecrisp, a snake-venom cysteine-rich secretory protein (CRISP), are presented. The correct molecular-replacement solution could only be obtained by removing all loops from the search model. The native structure was refined to 1.85 Å resolution and that of a Zn²⁺ complex was refined to 2.2 Å resolution. The latter structure revealed that the putative binding cavity coordinates Zn²⁺ similarly to snake-venom CRISPs, which are involved in Zn²⁺-dependent mechanisms of inflammatory modulation. Both sGLIPR1 structures have extensive flexible loop/turn regions and unique charge distributions that were not observed in any of the previously reported CAP protein structures. A model is also proposed for the structure of full-length membrane-bound GLIPR1.
xMDFF: molecular dynamics flexible fitting of low-resolution X-ray structures.
McGreevy, Ryan; Singharoy, Abhishek; Li, Qufei; Zhang, Jingfen; Xu, Dong; Perozo, Eduardo; Schulten, Klaus
2014-09-01
X-ray crystallography remains the most dominant method for solving atomic structures. However, for relatively large systems, the availability of only medium-to-low-resolution diffraction data often limits the determination of all-atom details. A new molecular dynamics flexible fitting (MDFF)-based approach, xMDFF, for determining structures from such low-resolution crystallographic data is reported. xMDFF employs a real-space refinement scheme that flexibly fits atomic models into an iteratively updating electron-density map. It addresses significant large-scale deformations of the initial model to fit the low-resolution density, as tested with synthetic low-resolution maps of D-ribose-binding protein. xMDFF has been successfully applied to re-refine six low-resolution protein structures of varying sizes that had already been submitted to the Protein Data Bank. Finally, via systematic refinement of a series of data from 3.6 to 7 Å resolution, xMDFF refinements together with electrophysiology experiments were used to validate the first all-atom structure of the voltage-sensing protein Ci-VSP.
Crystal structures of OrfX2 and P47 from a Botulinum neurotoxin OrfX-type gene cluster.
Gustafsson, Robert; Berntsson, Ronnie P-A; Martínez-Carranza, Markel; El Tekle, Geniver; Odegrip, Richard; Johnson, Eric A; Stenmark, Pål
2017-11-01
Botulinum neurotoxins are highly toxic substances and are all encoded together with one of two alternative gene clusters, the HA or the OrfX gene cluster. Very little is known about the function and structure of the proteins encoded in the OrfX gene cluster, which in addition to the toxin contains five proteins (OrfX1, OrfX2, OrfX3, P47, and NTNH). We here present the structures of OrfX2 and P47, solved to 2.1 and 1.8 Å, respectively. We show that they belong to the TULIP protein superfamily, which are often involved in lipid binding. OrfX1 and OrfX2 were both found to bind phosphatidylinositol lipids. © 2017 Federation of European Biochemical Societies.
Membrane protein structure determination — The next generation
Moraes, Isabel; Evans, Gwyndaf; Sanchez-Weatherby, Juan; Newstead, Simon; Stewart, Patrick D. Shaw
2014-01-01
The field of Membrane Protein Structural Biology has grown significantly since its first landmark in 1985 with the first three-dimensional atomic resolution structure of a membrane protein. Nearly twenty-six years later, the crystal structure of the beta2 adrenergic receptor in complex with G protein has contributed to another landmark in the field leading to the 2012 Nobel Prize in Chemistry. At present, more than 350 unique membrane protein structures solved by X-ray crystallography (http://blanco.biomol.uci.edu/mpstruc/exp/list, Stephen White Lab at UC Irvine) are available in the Protein Data Bank. The advent of genomics and proteomics initiatives combined with high-throughput technologies, such as automation, miniaturization, integration and third-generation synchrotrons, has enhanced membrane protein structure determination rate. X-ray crystallography is still the only method capable of providing detailed information on how ligands, cofactors, and ions interact with proteins, and is therefore a powerful tool in biochemistry and drug discovery. Yet the growth of membrane protein crystals suitable for X-ray diffraction studies amazingly remains a fine art and a major bottleneck in the field. It is often necessary to apply as many innovative approaches as possible. In this review we draw attention to the latest methods and strategies for the production of suitable crystals for membrane protein structure determination. In addition we also highlight the impact that third-generation synchrotron radiation has made in the field, summarizing the latest strategies used at synchrotron beamlines for screening and data collection from such demanding crystals. This article is part of a Special Issue entitled: Structural and biophysical characterisation of membrane protein-ligand binding. PMID:23860256
DNA Nanotubes for NMR Structure Determination of Membrane Proteins
Bellot, Gaëtan; McClintock, Mark A.; Chou, James J; Shih, William M.
2013-01-01
Structure determination of integral membrane proteins by solution NMR represents one of the most important challenges of structural biology. A Residual-Dipolar-Coupling-based refinement approach can be used to solve the structure of membrane proteins up to 40 kDa in size; however, a weak-alignment medium that is detergent-resistant is required. Previously, availability of media suitable for weak alignment of membrane proteins was severely limited. We describe here a protocol for robust, large-scale synthesis of detergent-resistant DNA nanotubes that can be assembled into dilute liquid crystals for application as weak-alignment media in solution NMR structure determination of membrane proteins in detergent micelles. The DNA nanotubes are heterodimers of 400-nm-long six-helix bundles, each self-assembled from an M13-based p7308 scaffold strand and >170 short oligonucleotide staple strands. Compatibility with proteins bearing considerable positive charge, as well as modulation of molecular alignment towards collection of linearly independent restraints, can be introduced by reducing the negative charge of DNA nanotubes via counter ions and small DNA-binding molecules. This detergent-resistant liquid-crystal medium offers a number of properties conducive to membrane protein alignment, including high-yield production, thermal stability, buffer compatibility, and structural programmability. Production of sufficient nanotubes for 4–5 NMR experiments can be completed in one week by a single individual. PMID:23518667
De Novo Protein Structure Prediction
NASA Astrophysics Data System (ADS)
Hung, Ling-Hong; Ngan, Shing-Chung; Samudrala, Ram
An unparalleled amount of sequence data is being made available from large-scale genome sequencing efforts. The data provide a shortcut to the determination of the function of a gene of interest, as long as there is an existing sequenced gene with similar sequence and of known function. This has spurred structural genomic initiatives with the goal of determining as many protein folds as possible (Brenner and Levitt, 2000; Burley, 2000; Brenner, 2001; Heinemann et al., 2001). The purpose of this is twofold: First, the structure of a gene product can often lead to direct inference of its function. Second, since the function of a protein is dependent on its structure, direct comparison of the structures of gene products can be more sensitive than the comparison of sequences of genes for detecting homology. Presently, structural determination by crystallography and NMR techniques is still slow and expensive in terms of manpower and resources, despite attempts to automate the processes. Computer structure prediction algorithms, while not providing the accuracy of the traditional techniques, are extremely quick and inexpensive and can provide useful low-resolution data for structure comparisons (Bonneau and Baker, 2001). Given the immense number of structures which the structural genomic projects are attempting to solve, there would be a considerable gain even if the computer structure prediction approach were applicable to a subset of proteins.
The solution structure of the pentatricopeptide repeat protein PPR10 upon binding atpH RNA
Gully, Benjamin S.; Cowieson, Nathan; Stanley, Will A.; Shearston, Kate; Small, Ian D.; Barkan, Alice; Bond, Charles S.
2015-01-01
The pentatricopeptide repeat (PPR) protein family is a large family of RNA-binding proteins that is characterized by tandem arrays of a degenerate 35-amino-acid motif which form an α-solenoid structure. PPR proteins influence the editing, splicing, translation and stability of specific RNAs in mitochondria and chloroplasts. Zea mays PPR10 is amongst the best studied PPR proteins, where sequence-specific binding to two RNA transcripts, atpH and psaJ, has been demonstrated to follow a recognition code where the identity of two amino acids per repeat determines the base-specificity. A recently solved ZmPPR10:psaJ complex crystal structure suggested a homodimeric complex with considerably fewer sequence-specific protein–RNA contacts than inferred previously. Here we describe the solution structure of the ZmPPR10:atpH complex using size-exclusion chromatography-coupled synchrotron small-angle X-ray scattering (SEC-SY-SAXS). Our results support prior evidence that PPR10 binds RNA as a monomer, and that it does so in a manner that is commensurate with a canonical and predictable RNA-binding mode across much of the RNA–protein interface. PMID:25609698
Machado Benelli, Elaine; Buck, Martin; Polikarpov, Igor; Maltempi de Souza, Emanuel; Cruz, Leonardo M; Pedrosa, Fábio O
2002-07-01
PII-like proteins are signal transduction proteins found in bacteria, archaea and eukaryotes. They mediate a variety of cellular responses. A second PII-like protein, called GlnK, has been found in several organisms. In the diazotroph Herbaspirillum seropedicae, PII protein is involved in sensing nitrogen levels and controlling nitrogen fixation genes. In this work, the crystal structure of the unliganded H. seropedicae PII was solved by X-ray diffraction. H. seropedicae PII has a Gly residue, Gly108 preceding Pro109 and the main-chain forms a beta turn. The glycine at position 108 allows a bend in the C-terminal main-chain, thereby modifying the surface of the cleft between monomers and potentially changing function. The structure suggests that the C-terminal region of PII proteins may be involved in specificity of function, and nonenteric diazotrophs are found to have the C-terminal consensus XGXDAX(107-112). We are also proposing binding sites for ATP and 2-oxoglutarate based on the structural alignment of PII with PII-ATP/GlnK-ATP, 5-carboxymethyl-2-hydroxymuconate isomerase and 4-oxalocrotonate tautomerase bound to the inhibitor 2-oxo-3-pentynoate.
Structure of a rare non-standard sequence k-turn bound by L7Ae protein
Huang, Lin; Lilley, David M.J.
2014-01-01
Kt-23 from Thelohania solenopsae is a rare RNA kink turn (k-turn) where an adenine replaces the normal guanine at the 2n position. L7Ae is a member of a strongly conserved family of proteins that bind a range of k-turn structures in the ribosome, box C/D and H/ACA small nucleolar RNAs and U4 small nuclear RNA. We have solved the crystal structure of T. solenopsae Kt-23 RNA bound to Archaeoglobus fulgidus L7Ae protein at a resolution of 2.95 Å. The protein binds in the major groove displayed on the outer face of the k-turn, in a manner similar to complexes with standard k-turn structures. The k-turn adopts a standard N3 class conformation, with a single hydrogen bond from A2b N6 to A2n N3. This contrasts with the structure of the same sequence located in the SAM-I riboswitch, where it adopts an N1 structure, showing the inherent plasticity of k-turn structure. This potentially can affect any tertiary interactions in which the RNA participates. PMID:24482444
Conlan, Andrea R.; Paddock, Mark L.; Axelrod, Herbert L.; Cohen, Aina E.; Abresch, Edward C.; Wiley, Sandra; Roy, Melinda; Nechushtai, Rachel; Jennings, Patricia A.
2009-01-01
A primary role for mitochondrial dysfunction is indicated in the pathogenesis of insulin resistance. A widely used drug for the treatment of type 2 diabetes is pioglitazone, a member of the thiazolidinedione class of molecules. MitoNEET, a 2Fe–2S outer mitochondrial membrane protein, binds pioglitazone [Colca et al. (2004), Am. J. Physiol. Endocrinol. Metab. 286, E252–E260]. The soluble domain of the human mitoNEET protein has been expressed C-terminal to the superfolder green fluorescent protein and the mitoNEET protein has been isolated. Comparison of the crystal structure of mitoNEET isolated from cleavage of the fusion protein (1.4 Å resolution, R factor = 20.2%) with other solved structures shows that the CDGSH domains are superimposable, indicating proper assembly of mitoNEET. Furthermore, there is considerable flexibility in the position of the cytoplasmic tethering arms, resulting in two different conformations in the crystal structure. This flexibility affords multiple orientations on the outer mitochondrial membrane. PMID:19574633
Ambrosi, Emmanuele; Capaldi, Stefano; Bovi, Michele; Saccomani, Gianmaria; Perduca, Massimiliano; Monaco, Hugo L.
2011-01-01
The SOUL protein is known to induce apoptosis by provoking the mitochondrial permeability transition, and a sequence homologous with the BH3 (Bcl-2 homology 3) domains has recently been identified in the protein, thus making it a potential new member of the BH3-only protein family. In the present study, we provide NMR, SPR (surface plasmon resonance) and crystallographic evidence that a peptide spanning residues 147–172 in SOUL interacts with the anti-apoptotic protein Bcl-xL. We have crystallized SOUL alone and the complex of its BH3 domain peptide with Bcl-xL, and solved their three-dimensional structures. The SOUL monomer is a single domain organized as a distorted β-barrel with eight anti-parallel strands and two α-helices. The BH3 domain extends across 15 residues at the end of the second helix and eight amino acids in the chain following it. There are important structural differences in the BH3 domain in the intact SOUL molecule and the same sequence bound to Bcl-xL. PMID:21639858
Penttinen, Leena; Rutanen, Chiara; Saloheimo, Markku; Kruus, Kristiina; Rouvinen, Juha; Hakulinen, Nina
2018-01-01
Coupled binuclear copper (CBC) enzymes have a conserved type 3 copper site that binds molecular oxygen to oxidize various mono- and diphenolic compounds. In this study, we found a new crystal form of catechol oxidase from Aspergillus oryzae (AoCO4) and solved two new structures from two different crystals at 1.8-Å and at 2.5-Å resolutions. These structures showed different copper site forms (met/deoxy and deoxy) and also differed from the copper site observed in the previously solved structure of AoCO4. We also analysed the electron density maps of all of the 56 CBC enzyme structures available in the protein data bank (PDB) and found that many of the published structures have vague copper sites. Some of the copper sites were then re-refined to find a better fit to the observed electron density. General problems in the refinement of metalloproteins and metal centres are discussed.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chakraborti, Subhendu; Bahnson, Brian J.
2010-05-25
Human senescence marker protein 30 (SMP30), which functions enzymatically as a lactonase, hydrolyzes various carbohydrate lactones. The penultimate step in vitamin-C biosynthesis is catalyzed by this enzyme in nonprimate mammals. It has also been implicated as an organophosphate hydrolase, with the ability to hydrolyze diisopropyl phosphofluoridate and other nerve agents. SMP30 was originally identified as an aging marker protein, whose expression decreased androgen-independently in aging cells. SMP30 is also referred to as regucalcin and has been suggested to have functions in calcium homeostasis. The crystal structure of the human enzyme has been solved from X-ray diffraction data collected to a resolution of 1.4 Å. The protein has a 6-bladed β-propeller fold, and it contains a single metal ion. Crystal structures have been solved with the metal site bound with either a Ca²⁺ or a Zn²⁺ atom. The catalytic role of the metal ion has been confirmed by mutagenesis of the metal coordinating residues. Kinetic studies using the substrate gluconolactone showed a kcat preference of divalent cations in the order Zn²⁺ > Mn²⁺ > Ca²⁺ > Mg²⁺. Notably, the Ca²⁺ had a significantly higher value of Kd compared to those of the other metal ions tested (566, 82, 7, and 0.6 µM for Ca²⁺, Mg²⁺, Zn²⁺, and Mn²⁺, respectively), suggesting that the Ca²⁺-bound form may be physiologically relevant for stressed cells with an elevated free calcium level.
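To make the reasoning behind the Kd comparison in the abstract above concrete, here is a minimal Python sketch. It assumes a simple single-site equilibrium, in which the fraction of metal sites occupied is [M] / (Kd + [M]). That model is an assumption for illustration and is not stated in the abstract. The Kd values are the ones quoted above, read as micromolar. The free-metal concentrations in the loop are hypothetical inputs, not measurements.

```python
# Dissociation constants quoted in the abstract, taken here as micromolar.
KD_UM = {"Ca2+": 566.0, "Mg2+": 82.0, "Zn2+": 7.0, "Mn2+": 0.6}


def fraction_bound(free_metal_um: float, kd_um: float) -> float:
    """Occupancy of a single binding site at equilibrium (assumed model)."""
    if free_metal_um < 0 or kd_um <= 0:
        raise ValueError("concentration must be >= 0 and Kd must be > 0")
    return free_metal_um / (kd_um + free_metal_um)


if __name__ == "__main__":
    # Hypothetical free Ca2+ levels, chosen only to show the trend.
    for free_ca in (1.0, 100.0, 566.0, 5000.0):
        occupancy = fraction_bound(free_ca, KD_UM["Ca2+"])
        print(f"free Ca2+ = {free_ca:7.1f} uM -> fraction bound = {occupancy:.3f}")
```

Under this assumed model, a site with a high Kd is only appreciably occupied once the free metal concentration approaches the Kd, which is the sense in which an elevated free calcium level would favour the Ca²⁺-bound form.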
Wlodawer, Alexander; Minor, Wladek; Dauter, Zbigniew; Jaskolski, Mariusz
2015-01-01
The number of macromolecular structures deposited in the Protein Data Bank now exceeds 45 000, with the vast majority determined using crystallographic methods. Thousands of studies describing such structures have been published in the scientific literature, and 14 Nobel prizes in chemistry or medicine have been awarded to protein crystallographers. As important as these structures are for understanding the processes that take place in living organisms and also for practical applications such as drug design, many non-crystallographers still have problems with critical evaluation of the structural literature data. This review attempts to provide a brief outline of technical aspects of crystallography and to explain the meaning of some parameters that should be evaluated by users of macromolecular structures in order to interpret, but not over-interpret, the information present in the coordinate files and in their description. A discussion of the extent of the information that can be gleaned from the coordinates of structures solved at different resolution, as well as problems and pitfalls encountered in structure determination and interpretation are also covered. PMID:18034855
Zhang, Jian; Yang, Jianyi; Jang, Richard; Zhang, Yang
2015-01-01
SUMMARY Experimental structure determination remains very difficult for G protein-coupled receptors (GPCRs). We propose a new hybrid protocol to construct GPCR structure models that integrates experimental mutagenesis data with ab initio transmembrane (TM) helix assembly simulations. The method was tested on 24 known GPCRs where the ab initio TM-helix assembly procedure constructed the correct fold for 20 cases. When combined with weak-homology and sparse mutagenesis restraints, the method generated correct folds for all the tested cases with an average C-alpha RMSD of 2.4 Å in the TM-regions. The new hybrid protocol was applied to model all 1026 GPCRs in the human genome, of which 923 have a high confidence score and are expected to have correct folds; these contain many pharmaceutically important families with no previously solved structures, including Trace amine, Prostanoids, Releasing hormones, Melanocortins, Vasopressin and Neuropeptide Y receptors. The results demonstrate new progress on genome-wide structure modeling of transmembrane proteins. PMID:26190572
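The C-alpha RMSD quoted in the abstract above is a root-mean-square distance between matched C-alpha atoms of a model and a reference structure. A minimal sketch of that calculation follows. It assumes the two coordinate sets are already superposed and listed in the same residue order. The function name `ca_rmsd` and the toy coordinates are hypothetical. This is not the authors' evaluation code, and it performs no optimal superposition.

```python
import math
from typing import Sequence, Tuple

Coord = Tuple[float, float, float]


def ca_rmsd(model: Sequence[Coord], reference: Sequence[Coord]) -> float:
    """RMSD in the input units (e.g. angstroms) over paired C-alpha atoms.

    Assumes both structures are already superposed and in the same order.
    """
    if len(model) != len(reference) or len(model) == 0:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared_sum = 0.0
    for (xm, ym, zm), (xr, yr, zr) in zip(model, reference):
        squared_sum += (xm - xr) ** 2 + (ym - yr) ** 2 + (zm - zr) ** 2
    return math.sqrt(squared_sum / len(model))


if __name__ == "__main__":
    # Toy example: every atom is displaced by 2 units along z.
    model = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    reference = [(0.0, 0.0, 2.0), (1.0, 0.0, 2.0)]
    print(ca_rmsd(model, reference))
```

In practice the structures are first aligned by an optimal rigid-body fit before the RMSD is taken; that step is deliberately left out of this sketch.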
NMR Spectroscopy and Its Value: A Primer
ERIC Educational Resources Information Center
Veeraraghavan, Sudha
2008-01-01
Nuclear magnetic resonance (NMR) spectroscopy is widely used by chemists. Furthermore, the use of NMR spectroscopy to solve structures of macromolecules or to examine protein-ligand interactions is popular. Yet, few students entering graduate education in biological sciences have been introduced to this method or its utility. Over the last six…
ERIC Educational Resources Information Center
Davis-McGibony, C. Michele
2010-01-01
The jigsaw technique has been used in a fourth-year biochemistry course to increase problem-solving abilities of the students. The jigsaw method is a cooperative-learning technique that involves a group structure. Students start with a "home" group. That group is responsible for learning an assigned portion of a task. Then the instructor separates…
Immersive Protein Gaming for Bio Edutainment
ERIC Educational Resources Information Center
Cai, Yiyu; Lu, Baifang; Zheng, Jianmin; Li, Lin
2006-01-01
Games have long been used as a tool for teaching important subject matter, from concept building to problem solving. Through fun learning, students may further develop their curiosities and interest in their study. This article addresses the issue of learning biomolecular structures by virtual reality gaming. A bio edutainment solution featuring…
High-resolution structure of a retroviral protease folded as a monomer
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gilski, Miroslaw; Polish Academy of Sciences, 61-704 Poznan; Kazmierczyk, Maciej
2011-11-01
The crystal structure of Mason–Pfizer monkey virus protease folded as a monomer has been solved by molecular replacement using a model generated by players of the online game Foldit. The structure shows at high resolution the details of a retroviral protease folded as a monomer which can guide rational design of protease dimerization inhibitors as retroviral drugs. Mason–Pfizer monkey virus (M-PMV), a D-type retrovirus assembling in the cytoplasm, causes simian acquired immunodeficiency syndrome (SAIDS) in rhesus monkeys. Its pepsin-like aspartic protease (retropepsin) is an integral part of the expressed retroviral polyproteins. As in all retroviral life cycles, release and dimerization of the protease (PR) is strictly required for polyprotein processing and virion maturation. Biophysical and NMR studies have indicated that in the absence of substrates or inhibitors M-PMV PR should fold into a stable monomer, but the crystal structure of this protein could not be solved by molecular replacement despite countless attempts. Ultimately, a solution was obtained in mr-rosetta using a model constructed by players of the online protein-folding game Foldit. The structure indeed shows a monomeric protein, with the N- and C-termini completely disordered. On the other hand, the flap loop, which normally gates access to the active site of homodimeric retropepsins, is clearly traceable in the electron density. The flap has an unusual curled shape and a different orientation from both the open and closed states known from dimeric retropepsins. The overall fold of the protein follows the retropepsin canon, but the Cα deviations are large and the active-site ‘DTG’ loop (here NTG) deviates up to 2.7 Å from the standard conformation. This structure of a monomeric retropepsin determined at high resolution (1.6 Å) provides important extra information for the design of dimerization inhibitors that might be developed as drugs for the treatment of retroviral infections, including AIDS.
Strop, P.; Marinescu, A. M.; Mayo, S. L.
2000-01-01
Six helix surface positions of protein G (Gbeta1) were redesigned using a computational protein design algorithm, resulting in the fivefold mutant Gbeta1m2. Gbeta1m2 is well folded with a circular dichroism spectrum nearly identical to that of Gbeta1, and a melting temperature of 91 °C, approximately 6 °C higher than that of Gbeta1. The crystal structure of Gbeta1m2 was solved to 2.0 Å resolution by molecular replacement. The absence of hydrogen bond or salt bridge interactions between the designed residues in Gbeta1m2 suggests that the increased stability of Gbeta1m2 is due to increased helix propensity and more favorable helix dipole interactions. PMID:10933505
(Hyper)thermophilic enzymes: production and purification.
Falcicchio, Pierpaolo; Levisson, Mark; Kengen, Servé W M; Koutsopoulos, Sotirios
2014-01-01
The discovery of thermophilic and hyperthermophilic microorganisms, thriving at environmental temperatures near or above 100 °C, has revolutionized our ideas about the upper temperature limit at which life can exist. The characterization of (hyper)thermostable proteins has broadened our understanding and presented new opportunities for solving one of the most challenging problems in biophysics: how is structural stability and biological function maintained at high temperatures where "normal" proteins undergo dramatic structural changes? In our laboratory we have purified and studied many thermostable and hyperthermostable proteins in an attempt to determine the molecular basis of heat stability. Here, we present methods to express such proteins and enzymes in E. coli and provide a general protocol for overproduction and purification. The ability to produce enzymes that retain their stability and activity at elevated temperatures creates exciting opportunities for a wide range of biocatalytic applications.
Wu, Wei; Park, Kyung-Tae; Holyoak, Todd; Lutkenhaus, Joe
2011-01-01
Summary The three Min proteins spatially regulate Z ring positioning in E. coli and are dynamically associated with the membrane. MinD binds to vesicles in the presence of ATP and can recruit MinC or MinE. Biochemical and genetic evidence indicate the binding sites for these two proteins on MinD overlap. Here we solved the structure of a hydrolytic-deficient mutant of MinD truncated for the C-terminal amphipathic helix involved in binding to the membrane. The structure solved in the presence of ATP is a dimer and reveals the face of MinD abutting the membrane. Using a combination of random and extensive site-directed mutagenesis additional residues important for MinE and MinC binding were identified. The location of these residues on the MinD structure confirms that the binding sites overlap and reveals that the binding sites are at the dimer interface and exposed to the cytosol. The location of the binding sites at the dimer interface offers a simple explanation for the ATP-dependency of MinC and MinE binding to MinD. PMID:21231967
Venko, Katja; Roy Choudhury, A; Novič, Marjana
2017-01-01
The structural and functional details of transmembrane proteins are vastly underexplored, mostly due to experimental difficulties regarding their solubility and stability. Currently, the majority of transmembrane protein structures are still unknown and this presents a huge experimental and computational challenge. Nowadays, thanks to X-ray crystallography or NMR spectroscopy, over 3000 structures of membrane proteins have been solved, among them only a few hundred unique ones. Due to the vast biological and pharmaceutical interest in the elucidation of the structure and the functional mechanisms of transmembrane proteins, several computational methods have been developed to overcome the experimental gap. If combined with experimental data, the computational information enables rapid, low-cost and successful predictions of the molecular structure of unsolved proteins. The reliability of the predictions depends on the availability and accuracy of experimental data associated with structural information. In this review, the following methods are proposed for in silico structure elucidation: sequence-dependent predictions of transmembrane regions, predictions of transmembrane helix-helix interactions, helix arrangements in membrane models, and testing their stability with molecular dynamics simulations. We also demonstrate the usage of the computational methods listed above by proposing a model for the molecular structure of the transmembrane protein bilitranslocase. Bilitranslocase is a bilirubin membrane transporter, which shares similar tissue distribution and functional properties with some of the members of the Organic Anion Transporter family and is the only member classified in the Bilirubin Transporter Family. Regarding its unique properties, bilitranslocase is a potentially interesting drug target.
Local Structural Differences in Homologous Proteins: Specificities in Different SCOP Classes
Joseph, Agnel Praveen; Valadié, Hélène; Srinivasan, Narayanaswamy; de Brevern, Alexandre G.
2012-01-01
The constant increase in the number of solved protein structures is of great help in understanding the basic principles behind protein folding and evolution. 3-D structural knowledge is valuable in designing and developing methods for comparison, modelling and prediction of protein structures. These approaches for structure analysis can be directly implicated in studying protein function and for drug design. The backbone of a protein structure favours certain local conformations which include α-helices, β-strands and turns. Libraries of a limited number of local conformations (Structural Alphabets) were developed in the past to obtain a useful categorization of backbone conformation. Protein Block (PB) is one such Structural Alphabet that gave a reasonable structure approximation of 0.42 Å. In this study, we use the PB description of local structures to analyse conformations that are preferred sites for structural variations and insertions among groups of related folds. This knowledge can be utilized in improving tools for structure comparison that work by analysing local structure similarities. Conformational differences between homologous proteins are known to occur often in the regions comprising turns and loops. Interestingly, these differences are found to have specific preferences depending upon the structural classes of proteins. Such class-specific preferences are mainly seen in the all-β class with changes involving short helical conformations and hairpin turns. A test carried out on a benchmark dataset also indicates that the use of knowledge on the class-specific variations can improve the performance of a PB-based structure comparison approach. The preference for the indel sites also seems to be confined to a few backbone conformations involving β-turns and helix C-caps. These are mainly associated with short loops joining the regular secondary structures that mediate a reversal in the chain direction. Rare β-turns of type I’ and II’ are also identified as preferred sites for insertions. PMID:22745680
Amporndanai, Kangsa; O’Neill, Paul M.
2018-01-01
Cytochrome bc1, a dimeric multi-subunit electron-transport protein embedded in the inner mitochondrial membrane, is a major drug target for the treatment and prevention of malaria and toxoplasmosis. Structural studies of cytochrome bc1 from mammalian homologues co-crystallized with lead compounds have underpinned structure-based drug design to develop compounds with higher potency and selectivity. However, owing to the limited amount of cytochrome bc1 that may be available from parasites, all efforts have been focused on homologous cytochrome bc1 complexes from mammalian species, which has resulted in the failure of some drug candidates owing to toxicity in the host. Crystallographic studies of the native parasite proteins are not feasible owing to limited availability of the proteins. Here, it is demonstrated that cytochrome bc1 is highly amenable to single-particle cryo-EM (which uses significantly less protein) by solving the apo and two inhibitor-bound structures to ∼4.1 Å resolution, revealing clear inhibitor density at the binding site. Therefore, cryo-EM is proposed as a viable alternative method for structure-based drug discovery using both host and parasite enzymes. PMID:29765610
Eukaryotic ribonucleases P/MRP: the crystal structure of the P3 domain.
Perederina, Anna; Esakova, Olga; Quan, Chao; Khanova, Elena; Krasilnikov, Andrey S
2010-02-17
Ribonuclease (RNase) P is a site-specific endoribonuclease found in all kingdoms of life. Typical RNase P consists of a catalytic RNA component and a protein moiety. In the eukaryotes, the RNase P lineage has split into two, giving rise to a closely related enzyme, RNase MRP, which has similar components but has evolved to have different specificities. The eukaryotic RNases P/MRP have acquired an essential helix-loop-helix protein-binding RNA domain P3 that has an important function in eukaryotic enzymes and distinguishes them from bacterial and archaeal RNases P. Here, we present a crystal structure of the P3 RNA domain from Saccharomyces cerevisiae RNase MRP in a complex with RNase P/MRP proteins Pop6 and Pop7 solved to 2.7 Å. The structure suggests similar structural organization of the P3 RNA domains in RNases P/MRP and possible functions of the P3 domains and proteins bound to them in the stabilization of the holoenzymes' structures as well as in interactions with substrates. It provides the first insight into the structural organization of the eukaryotic enzymes of the RNase P/MRP family.
Prediction of Protein-Protein Interaction Sites by Random Forest Algorithm with mRMR and IFS
Li, Bi-Qing; Feng, Kai-Yan; Chen, Lei; Huang, Tao; Cai, Yu-Dong
2012-01-01
Prediction of protein-protein interaction (PPI) sites is one of the most challenging problems in computational biology. Although great progress has been made by employing various machine learning approaches with numerous characteristic features, the problem is still far from being solved. In this study, we developed a novel predictor based on Random Forest (RF) algorithm with the Minimum Redundancy Maximal Relevance (mRMR) method followed by incremental feature selection (IFS). We incorporated features of physicochemical/biochemical properties, sequence conservation, residual disorder, secondary structure and solvent accessibility. We also included five 3D structural features to predict protein-protein interaction sites and achieved an overall accuracy of 0.672997 and MCC of 0.347977. Feature analysis showed that 3D structural features such as Depth Index (DPX) and surface curvature (SC) contributed most to the prediction of protein-protein interaction sites. It was also shown via site-specific feature analysis that the features of individual residues from PPI sites contribute most to the determination of protein-protein interaction sites. It is anticipated that our prediction method will become a useful tool for identifying PPI sites, and that the feature analysis described in this paper will provide useful insights into the mechanisms of interaction. PMID:22937126
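The two figures of merit quoted in this abstract (overall accuracy and the Matthews correlation coefficient, MCC) can both be computed from a binary confusion matrix. The following is a minimal sketch of those standard definitions; the function names and the example counts are hypothetical and are not taken from the paper.

```python
import math


def accuracy(tp, tn, fp, fn):
    """Fraction of residues classified correctly."""
    return (tp + tn) / (tp + tn + fp + fn)


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient for a binary classifier.

    Returns 0.0 when any marginal total is zero, a common convention
    for the otherwise undefined case.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical counts for interface (positive) vs non-interface residues.
tp, tn, fp, fn = 60, 70, 30, 40
print(f"accuracy = {accuracy(tp, tn, fp, fn):.3f}")
print(f"MCC      = {mcc(tp, tn, fp, fn):.3f}")
```

MCC is often reported alongside accuracy for interaction-site prediction because interface residues are usually a minority class, and accuracy alone can look favourable for a predictor that rarely predicts the positive class.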
Computational protein design: a review
NASA Astrophysics Data System (ADS)
Coluzza, Ivan
2017-04-01
Proteins are one of the most versatile modular assembling systems in nature. Experimentally, more than 110 000 protein structures have been identified and more are deposited every day in the Protein Data Bank. Such an enormous structural variety is to a first approximation controlled by the sequence of amino acids along the peptide chain of each protein. Understanding how the structural and functional properties of the target can be encoded in this sequence is the main objective of protein design. Unfortunately, rational protein design remains one of the major challenges across the disciplines of biology, physics and chemistry. The implications of solving this problem are enormous and branch into materials science, drug design, evolution and even cryptography. For instance, in the field of drug design an effective computational method to design protein-based ligands for biological targets such as viruses, bacteria or tumour cells, could give a significant boost to the development of new therapies with reduced side effects. In materials science, self-assembly is a highly desired property and soon artificial proteins could represent a new class of designable self-assembling materials. The scope of this review is to describe the state of the art in computational protein design methods and give the reader an outline of what developments could be expected in the near future.
THGS: a web-based database of Transmembrane Helices in Genome Sequences
Fernando, S. A.; Selvarani, P.; Das, Soma; Kumar, Ch. Kiran; Mondal, Sukanta; Ramakumar, S.; Sekar, K.
2004-01-01
Transmembrane Helices in Genome Sequences (THGS) is an interactive web-based database, developed to search the transmembrane helices in the user-interested gene sequences available in the Genome Database (GDB). The proposed database has provision to search sequence motifs in transmembrane and globular proteins. In addition, the motif can be searched in the other sequence databases (Swiss-Prot and PIR) or in the macromolecular structure database, Protein Data Bank (PDB). Further, the 3D structure of the corresponding queried motif, if it is available in the solved protein structures deposited in the Protein Data Bank, can also be visualized using the widely used graphics package RASMOL. All the sequence databases used in the present work are updated frequently and hence the results produced are up to date. The database THGS is freely available via the world wide web and can be accessed at http://pranag.physics.iisc.ernet.in/thgs/ or http://144.16.71.10/thgs/. PMID:14681375
Observing the overall rocking motion of a protein in a crystal
NASA Astrophysics Data System (ADS)
Ma, Peixiang; Xue, Yi; Coquelle, Nicolas; Haller, Jens D.; Yuwen, Tairan; Ayala, Isabel; Mikhailovskii, Oleg; Willbold, Dieter; Colletier, Jacques-Philippe; Skrynnikov, Nikolai R.; Schanda, Paul
2015-10-01
The large majority of three-dimensional structures of biological macromolecules have been determined by X-ray diffraction of crystalline samples. High-resolution structure determination crucially depends on the homogeneity of the protein crystal. Overall 'rocking' motion of molecules in the crystal is expected to influence diffraction quality, and such motion may therefore affect the process of solving crystal structures. Yet, so far overall molecular motion has not directly been observed in protein crystals, and the timescale of such dynamics remains unclear. Here we use solid-state NMR, X-ray diffraction methods and μs-long molecular dynamics simulations to directly characterize the rigid-body motion of a protein in different crystal forms. For ubiquitin crystals investigated in this study we determine the range of possible correlation times of rocking motion, 0.1-100 μs. The amplitude of rocking varies from one crystal form to another and is correlated with the resolution obtainable in X-ray diffraction experiments.
Takeda, Mitsuhiro; Sugimori, Nozomi; Torizawa, Takuya; Terauchi, Tsutomu; Ono, Akira Mei; Yagi, Hirokazu; Yamaguchi, Yoshiki; Kato, Koichi; Ikeya, Teppei; Jee, JunGoo; Güntert, Peter; Aceti, David J.; Markley, John L.; Kainosho, Masatsune
2009-01-01
The product of gene At3g16450.1 from Arabidopsis thaliana is a 32 kDa, 299-residue protein classified as resembling a myrosinase-binding protein (MyroBP). MyroBPs are found in plants as part of a complex with the glucosinolate-degrading enzyme, myrosinase, and are suspected to play a role in myrosinase-dependent defense against pathogens. Many MyroBPs and MyroBP-related proteins are composed of repeated homologous sequences with unknown structure. We report here the three-dimensional structure of the At3g16450.1 protein from Arabidopsis, which consists of two tandem repeats. Because the size of the protein is larger than that amenable to high-throughput analysis by uniformly 13C/15N labeling methods, we used our stereo-array isotope labeling (SAIL) technology to prepare an optimally 2H/13C/15N-labeled sample. NMR data sets collected with the SAIL-protein enabled us to assign 1H, 13C and 15N chemical shifts to 95.5% of all atoms, even at the low concentration (0.2 mM) of the protein product. We collected additional NOESY data and solved the three-dimensional structure with the CYANA software package. The structure, the first for a MyroBP family member, revealed that the At3g16450.1 protein consists of two independent, but similar, lectin-fold domains composed of three β-sheets. PMID:19021763
Structure of the immature HIV-1 capsid in intact virus particles at 8.8 Å resolution
NASA Astrophysics Data System (ADS)
Schur, Florian K. M.; Hagen, Wim J. H.; Rumlová, Michaela; Ruml, Tomáš; Müller, Barbara; Kräusslich, Hans-Georg; Briggs, John A. G.
2015-01-01
Human immunodeficiency virus type 1 (HIV-1) assembly proceeds in two stages. First, the 55 kilodalton viral Gag polyprotein assembles into a hexameric protein lattice at the plasma membrane of the infected cell, inducing budding and release of an immature particle. Second, Gag is cleaved by the viral protease, leading to internal rearrangement of the virus into the mature, infectious form. Immature and mature HIV-1 particles are heterogeneous in size and morphology, preventing high-resolution analysis of their protein arrangement in situ by conventional structural biology methods. Here we apply cryo-electron tomography and sub-tomogram averaging methods to resolve the structure of the capsid lattice within intact immature HIV-1 particles at subnanometre resolution, allowing unambiguous positioning of all α-helices. The resulting model reveals tertiary and quaternary structural interactions that mediate HIV-1 assembly. Strikingly, these interactions differ from those predicted by the current model based on in vitro-assembled arrays of Gag-derived proteins from Mason-Pfizer monkey virus. To validate this difference, we solve the structure of the capsid lattice within intact immature Mason-Pfizer monkey virus particles. Comparison with the immature HIV-1 structure reveals that retroviral capsid proteins, while having conserved tertiary structures, adopt different quaternary arrangements during virus assembly. The approach demonstrated here should be applicable to determine structures of other proteins at subnanometre resolution within heterogeneous environments.
Bijelic, Aleksandar; Molitor, Christian; Mauracher, Stephan G; Al-Oweini, Rami; Kortz, Ulrich; Rompel, Annette
2015-01-01
As synchrotron radiation becomes more intense, detectors become faster and structure-solving software becomes more elaborate, obtaining single crystals suitable for data collection is now the bottleneck in macromolecular crystallography. Hence, there is a need for novel and advanced crystallisation agents with the ability to crystallise proteins that are otherwise challenging. Here, an Anderson–Evans-type polyoxometalate (POM), specifically Na6[TeW6O24]⋅22 H2O (TEW), is employed as a crystallisation additive. Its effects on protein crystallisation are demonstrated with hen egg-white lysozyme (HEWL), which co-crystallises with TEW in the vicinity of (or within) the liquid–liquid phase separation (LLPS) region. The X-ray structure (PDB ID: 4PHI) determination revealed that TEW molecules are part of the crystal lattice, thus demonstrating specific binding to HEWL with electrostatic interactions and hydrogen bonds. The negatively charged TEW polyoxotungstate binds to sites with a positive electrostatic potential located between two (or more) symmetry-related protein chains. Thus, TEW facilitates the formation of protein–protein interfaces of otherwise repulsive surfaces, and thereby the realisation of a stable crystal lattice. In addition to retaining the isomorphicity of the protein structure, the anomalous scattering of the POMs was used for macromolecular phasing. The results suggest that hexatungstotellurate(VI) has great potential as a crystallisation additive to promote both protein crystallisation and structure elucidation. PMID:25521080
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Zhongchuan; Xie, Tian; Key Laboratory of Environmental Microbiology of Sichuan Province, Chengdu 610041, People’s Republic of
2016-03-24
The crystal structure of CotA complexed with 2,2-azinobis-(3-ethylbenzothiazoline-6-sulfonate) in a hole motif has been solved; this novel binding site could be a potential structure-based target for protein engineering of CotA laccase. The CotA laccase from Bacillus subtilis is an abundant component of the spore outer coat and has been characterized as a typical laccase. The crystal structure of CotA complexed with 2,2-azinobis-(3-ethylbenzothiazoline-6-sulfonate) (ABTS) in a hole motif has been solved. The novel binding site was about 26 Å away from the T1 binding pocket. Comparison with known structures of other laccases revealed that the hole is a specific feature of CotA. The key residues Arg476 and Ser360 were directly bound to ABTS. Site-directed mutagenesis studies revealed that the residues Arg146, Arg429 and Arg476, which are located at the bottom of the novel binding site, are essential for the oxidation of ABTS and syringaldazine. In particular, a Thr480Phe variant was identified to be almost 3.5 times more specific for ABTS than for syringaldazine compared with the wild type. These results suggest this novel binding site for ABTS could be a potential target for protein engineering of CotA laccases.
Addy, Christine; Ohara, Masato; Kawai, Fumihiro; Kidera, Akinori; Ikeguchi, Mitsunori; Fuchigami, Sotaro; Osawa, Masanori; Shimada, Ichio; Park, Sam-Yong; Tame, Jeremy R H; Heddle, Jonathan G
2007-02-01
Intracellular nickel is required by Escherichia coli as a cofactor for a number of enzymes and is necessary for anaerobic respiration. However, high concentrations of nickel are toxic, so both import and export systems have evolved to control the cellular level of the metal. The nik operon in E. coli encodes a nickel-uptake system that includes the periplasmic nickel-binding protein NikA. The crystal structures of wild-type NikA both bound to nickel and in the apo form have been solved previously. The liganded structure appeared to show an unusual interaction between the nickel and the protein in which no direct bonds are formed. The highly unusual nickel coordination suggested by the crystal structure contrasted strongly with earlier X-ray spectroscopic studies. The known nickel-binding site has been probed by extensive mutagenesis and isothermal titration calorimetry and it has been found that even large numbers of disruptive mutations appear to have little effect on the nickel affinity. The crystal structure of a binding-site mutant with nickel bound has been solved and it is found that nickel is bound to two histidine residues at a position distant from the previously characterized binding site. This novel site immediately resolves the conflict between the crystal structures and other biophysical analyses. The physiological relevance of the two binding sites is discussed.
Izoré, Thierry; Duman, Ramona; Kureisaite-Ciziene, Danguole; Löwe, Jan
2014-01-01
Polymerising proteins of the actin family are nearly ubiquitous. Crenactins, restricted to Crenarchaea, are more closely related to actin than bacterial MreB. Crenactins occur in gene clusters hinting at an unknown, but conserved function. We solved the crystal structure of crenactin at 3.2 Å resolution. The protein crystallises as a continuous right-handed helix with 8 subunits per complete turn, spanning 419 Å. The structure of crenactin shows several loops that are longer than in actin, but overall, crenactin is closely related to eukaryotic actin, with an RMSD of 1.6 Å. Crenactin filaments imaged by electron microscopy showed polymers with very similar helical parameters. PMID:24486010
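The helical parameters quoted in this abstract imply a per-subunit rise and twist by simple arithmetic. The sketch below assumes that the 419 Å figure is the pitch of one complete turn of 8 subunits, which is one reading of the abstract rather than something it states explicitly; the function name is hypothetical.

```python
def helical_parameters(subunits_per_turn, pitch_angstrom):
    """Return (rise per subunit in Å, twist per subunit in degrees).

    Assumes an ideal helix in which the subunits are evenly spaced
    along and around the helix axis.
    """
    rise = pitch_angstrom / subunits_per_turn
    twist = 360.0 / subunits_per_turn
    return rise, twist


rise, twist = helical_parameters(subunits_per_turn=8, pitch_angstrom=419.0)
print(f"rise  = {rise:.3f} Å per subunit")
print(f"twist = {twist:.1f} degrees per subunit")
```

Under that assumption each subunit advances about 52.4 Å along the axis and rotates 45° about it.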
From Structure-Function Analyses to Protein Engineering for Practical Applications of DNA Ligase
Tanabe, Maiko; Nishida, Hirokazu
2015-01-01
DNA ligases are indispensable in all living cells and ubiquitous in all organs. DNA ligases are broadly utilized in molecular biology research fields, such as genetic engineering and DNA sequencing technologies. Here we review the utilization of DNA ligases in a variety of in vitro gene manipulations, developed over the past several decades. During this period, fewer protein engineering attempts for DNA ligases have been made, as compared to those for DNA polymerases. We summarize the recent progress in the elucidation of the DNA ligation mechanisms obtained from the tertiary structures solved thus far, in each step of the ligation reaction scheme. We also present some examples of engineered DNA ligases, developed from the viewpoint of their three-dimensional structures. PMID:26508902
Helix Unwinding and Base Flipping Enable Human MTERF1 to Terminate Mitochondrial Transcription
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yakubovskaya, E.; Mejia, E; Byrnes, J
2010-01-01
Defects in mitochondrial gene expression are associated with aging and disease. Mterf proteins have been implicated in modulating transcription, replication and protein synthesis. We have solved the structure of a member of this family, the human mitochondrial transcriptional terminator MTERF1, bound to dsDNA containing the termination sequence. The structure indicates that upon sequence recognition MTERF1 unwinds the DNA molecule, promoting eversion of three nucleotides. Base flipping is critical for stable binding and transcriptional termination. Additional structural and biochemical results provide insight into the DNA binding mechanism and explain how MTERF1 recognizes its target sequence. Finally, we have demonstrated that the mitochondrial pathogenic G3249A and G3244A mutations interfere with key interactions for sequence recognition, eliminating termination. Our results provide insight into the role of mterf proteins and suggest a link between mitochondrial disease and the regulation of mitochondrial transcription.
Constructing failure in big biology: The socio-technical anatomy of Japan's Protein 3000 Project.
Fukushima, Masato
2016-02-01
This study focuses on the 5-year Protein 3000 Project launched in 2002, the largest biological project in Japan. The project aimed to overcome Japan's alleged failure to contribute fully to the Human Genome Project, by determining 3000 protein structures, 30 percent of the global target. Despite its achievement of this goal, the project was fiercely criticized in various sectors of society and was often branded an awkward failure. This article tries to solve the mystery of why such failure discourse was prevalent. Three explanatory factors are offered: first, because some goals were excluded during project development, there was a dynamic of failed expectations; second, structural genomics, while promoting collaboration with the international community, became an 'anti-boundary object', only the absence of which bound heterogeneous domestic actors; third, there developed an urgent sense of international competition in order to obtain patents on such structural information.
NASA Astrophysics Data System (ADS)
Siewny, Matthew; Kmetko, Jan
2010-10-01
We work out a novel protocol for measuring the solvent content (the fraction of crystal volume occupied by solvent) in biological crystals by the technique of fluorescence recovery after photobleaching (FRAP). Crystals of proteins with widely varying known solvent content (lysozyme, thaumatin, catalase, and ferritin) were grown in their native solution doped with sodium fluorescein dye and hydroxylamine (to prevent dye from binding to amine groups of the proteins). The crystals were irradiated by broadband, high-intensity light through knife slits, leaving a rectangular area of bleached dye within the crystals. Measuring the flow of dye out of the bleached area allowed us to construct a curve relating the diffusion coefficient of dye to the channel size within the crystals, by solving the diffusion equation analytically. This curve may be used to measure the solvent content of any biological crystal in its native solution and help determine the number of proteins in the crystallographic asymmetric unit cell in x-ray structure solving procedures.
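The abstract does not give the geometry or boundary conditions used in the analytical solution. As an illustration only, the sketch below uses a textbook case: one-dimensional diffusion in an unbounded medium after a strip of half-width a is completely bleached at t = 0. In that case the normalized fluorescence at the centre of the strip recovers as 1 - erf(a / (2·sqrt(D·t))), and D can be recovered from a single measurement by bisection. The function names, units and numerical values are assumptions, not the authors' protocol.

```python
import math


def center_recovery(D, t, half_width):
    """Normalized fluorescence (0 = fully bleached, 1 = fully recovered)
    at the centre of a bleached strip, for 1D diffusion in an unbounded
    medium with diffusion coefficient D."""
    if t <= 0:
        return 0.0
    return 1.0 - math.erf(half_width / (2.0 * math.sqrt(D * t)))


def fit_diffusion_coefficient(observed, t, half_width,
                              lo=1e-6, hi=1e6, iterations=200):
    """Estimate D from one recovery measurement by bisection.

    Recovery increases monotonically with D, so the bracket [lo, hi]
    is halved on a logarithmic scale until it collapses.
    """
    for _ in range(iterations):
        mid = math.sqrt(lo * hi)
        if center_recovery(mid, t, half_width) < observed:
            lo = mid
        else:
            hi = mid
    return math.sqrt(lo * hi)


# Hypothetical numbers: half-width 20 µm, observation at 10 s, D = 5 µm²/s.
signal = center_recovery(D=5.0, t=10.0, half_width=20.0)
estimate = fit_diffusion_coefficient(signal, t=10.0, half_width=20.0)
print(f"recovery = {signal:.4f}, fitted D = {estimate:.4f}")
```

In practice a full recovery curve over many time points would be fitted rather than a single value, and the appropriate solution depends on the actual bleach geometry and crystal dimensions.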
Constraint Logic Programming approach to protein structure prediction.
Dal Palù, Alessandro; Dovier, Agostino; Fogolari, Federico
2004-11-30
The protein structure prediction problem is one of the most challenging problems in biological sciences. Many approaches have been proposed using database information and/or simplified protein models. The protein structure prediction problem can be cast in the form of an optimization problem. Notwithstanding its importance, the problem has very seldom been tackled by Constraint Logic Programming, a declarative programming paradigm suitable for solving combinatorial optimization problems. Constraint Logic Programming techniques have been applied to the protein structure prediction problem on the face-centered cube lattice model. Molecular dynamics techniques, endowed with the notion of constraint, have also been exploited. Even using a very simplified model, Constraint Logic Programming on the face-centered cube lattice model allowed us to obtain acceptable results for a few small proteins. As a test implementation, their (known) secondary structure and the presence of disulfide bridges are used as constraints. Simplified structures obtained in this way have been converted to all-atom models with plausible structure. Results have been compared with a similar approach using a well-established technique such as molecular dynamics. The results obtained on small proteins show that Constraint Logic Programming techniques can be employed for studying simplified protein models, which can be converted into realistic all-atom models. The advantage of Constraint Logic Programming over other, much more explored, methodologies resides in the rapid software prototyping, in the easy way of encoding heuristics, and in exploiting all the advances made in this research area, e.g. in constraint propagation and its use for pruning the huge search space.
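To give a feel for the search space on the face-centered cubic lattice, the sketch below enumerates self-avoiding chains by plain backtracking in Python. It is not Constraint Logic Programming and not the authors' implementation; it only illustrates the simplest constraint involved (no two residues may occupy the same lattice site) and how violating branches are pruned as soon as they arise. All names are hypothetical.

```python
from itertools import product

# The 12 nearest-neighbour steps of the FCC lattice: every vector with
# exactly two non-zero components, each equal to +1 or -1.
FCC_STEPS = [v for v in product((-1, 0, 1), repeat=3)
             if sum(abs(c) for c in v) == 2]


def count_self_avoiding_walks(n_steps):
    """Count chains of n_steps bonds that start at the origin and never
    revisit a lattice site."""

    def extend(pos, visited, remaining):
        if remaining == 0:
            return 1
        total = 0
        for dx, dy, dz in FCC_STEPS:
            nxt = (pos[0] + dx, pos[1] + dy, pos[2] + dz)
            if nxt in visited:
                # Constraint violated: prune this branch immediately.
                continue
            visited.add(nxt)
            total += extend(nxt, visited, remaining - 1)
            visited.remove(nxt)
        return total

    origin = (0, 0, 0)
    return extend(origin, {origin}, n_steps)


for n in range(1, 5):
    print(n, count_self_avoiding_walks(n))
```

The count grows by roughly a factor of ten per added bond, which is why additional constraints such as known secondary structure and disulfide bridges, together with constraint propagation, matter for keeping the search tractable.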
Insights into the Specificity of Lysine Acetyltransferases
Tucker, Alex C.; Taylor, Keenan C.; Rank, Katherine C.; ...
2014-11-07
Reversible lysine acetylation by protein acetyltransferases is a conserved regulatory mechanism that controls diverse cellular pathways. Gcn5-related N-acetyltransferases (GNATs), named after their founding member, are found in all domains of life. GNATs are known for their role as histone acetyltransferases, but non-histone bacterial protein acetyltransferases have been identified. Only structures of GNAT complexes with short histone peptide substrates are available in databases. Given the biological importance of this modification and the abundance of lysine in polypeptides, how specificity is attained for larger protein substrates is central to understanding acetyl-lysine-regulated networks. In this paper, we report the structure of a GNAT in complex with a globular protein substrate solved to 1.9 Å. GNAT binds the protein substrate with extensive surface interactions distinct from those reported for GNAT-peptide complexes. Finally, our data reveal determinants needed for the recognition of a protein substrate and provide insight into the specificity of GNATs.
Statistical inference of protein structural alignments using information and compression.
Collier, James H; Allison, Lloyd; Lesk, Arthur M; Stuckey, Peter J; Garcia de la Banda, Maria; Konagurthu, Arun S
2017-04-01
Structural molecular biology depends crucially on computational techniques that compare protein three-dimensional structures and generate structural alignments (the assignment of one-to-one correspondences between subsets of amino acids based on atomic coordinates). Despite its importance, the structural alignment problem has not been formulated, much less solved, in a consistent and reliable way. To overcome these difficulties, we present here a statistical framework for the precise inference of structural alignments, built on the Bayesian and information-theoretic principle of Minimum Message Length (MML). The quality of any alignment is measured by its explanatory power, that is, the amount of lossless compression achieved to explain the protein coordinates using that alignment. We have implemented this approach in MMLigner, the first program able to infer statistically significant structural alignments. We also demonstrate the reliability of MMLigner's alignment results when compared with the state of the art. Importantly, MMLigner can also discover different structural alignments of comparable quality, a challenging problem for oligomers and protein complexes. Source code, binaries and an interactive web version are available at http://lcb.infotech.monash.edu.au/mmligner. arun.konagurthu@monash.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Solution structure of leptospiral LigA4 Big domain
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mei, Song; Zhang, Jiahai; Zhang, Xuecheng
Pathogenic Leptospira species express immunoglobulin-like proteins which serve as adhesins to bind to the extracellular matrices of host cells. Leptospiral immunoglobulin-like protein A (LigA), a surface-exposed protein containing tandem repeats of bacterial immunoglobulin-like (Big) domains, has been proved to be involved in the interaction of pathogenic Leptospira with the mammalian host. In this study, the solution structure of the fourth Big domain of LigA (LigA4 Big domain) from Leptospira interrogans was solved by nuclear magnetic resonance (NMR). The structure of the LigA4 Big domain displays a similar bacterial immunoglobulin-like fold compared with other Big domains, implying some common structural aspects of the Big domain family. On the other hand, it displays some structural characteristics significantly different from the classic Ig-like domain. Furthermore, a Stains-all assay and NMR chemical shift perturbation revealed the Ca2+ binding property of the LigA4 Big domain. - Highlights: • Determining the solution structure of a bacterial immunoglobulin-like domain from a surface protein of Leptospira. • The solution structure shows some structural characteristics significantly different from the classic Ig-like domains. • A potential Ca2+-binding site was identified by Stains-all and NMR chemical shift perturbation.
Johnson, Kenneth A.; Ve, Thomas; Larsen, Øivind; Pedersen, Rolf B.; Lillehaug, Johan R.; Jensen, Harald B.; Helland, Ronny; Karlsen, Odd A.
2014-01-01
CorA is a copper-repressible protein previously identified in the methanotrophic bacterium Methylomicrobium album BG8. In this work, we demonstrate that CorA is located on the cell surface and binds one copper ion per protein molecule, which, based on X-ray Absorption Near Edge Structure analysis, is in the reduced state (Cu(I)). The structure of endogenously expressed CorA was solved using X-ray crystallography. The 1.6 Å three-dimensional structure confirmed the binding of copper and revealed that the copper atom was coordinated in a mononuclear binding site defined by two histidines, one water molecule, and the tryptophan metabolite, kynurenine. This arrangement of the copper-binding site is similar to that of its homologous protein MopE* from Methylococcus capsulatus Bath, confirming the importance of kynurenine for copper binding in these proteins. Our findings show that CorA has an overall fold similar to MopE, including the unique copper(I)-binding site and most of the secondary structure elements. We suggest that CorA plays a role in the M. album BG8 copper acquisition. PMID:24498370
Gordon, Sherald H; Harry-O'kuru, Rogers E; Mohamed, Abdellatif A
2017-11-01
Infrared analysis of proteins and polysaccharides by the well known KBr disk technique is notoriously frustrated and defeated by absorbed water interference in the important amide and hydroxyl regions of spectra. This interference has too often been overlooked or ignored even when the resulting distortion is critical or even fatal, as in quantitative analyses of protein secondary structure, because the water has been impossible to measure or eliminate. Therefore, a new chemometric method was devised that corrects spectra of materials in KBr disks by mathematically eliminating the water interference. A new concept termed the Beer-Lambert law absorbance ratio (R-matrix) model was augmented with water concentration ratios computed via an exponential decay kinetic model of the water absorption process in KBr, which rendered the otherwise indeterminate system of linear equations determinate and thus possible to solve in a formal analytic manner. Consequently, the heretofore baffling KBr water elimination problem is now solved once and for all. Using the new formal solution, efforts to eliminate water interference from KBr disks in research will be defeated no longer. Resulting spectra of protein were much more accurate than attenuated total reflection (ATR) spectra corrected using the well-accepted Advanced ATR Correction Algorithm. Published by Elsevier B.V.
A Real-Time All-Atom Structural Search Engine for Proteins
Gonzalez, Gabriel; Hannigan, Brett; DeGrado, William F.
2014-01-01
Protein designers use a wide variety of software tools for de novo design, yet their repertoire still lacks a fast and interactive all-atom search engine. To solve this, we have built the Suns program: a real-time, atomic search engine integrated into the PyMOL molecular visualization system. Users build atomic-level structural search queries within PyMOL and receive a stream of search results aligned to their query within a few seconds. This instant feedback cycle enables a new “designability”-inspired approach to protein design where the designer searches for and interactively incorporates native-like fragments from proven protein structures. We demonstrate the use of Suns to interactively build protein motifs, tertiary interactions, and to identify scaffolds compatible with hot-spot residues. The official web site and installer are located at http://www.degradolab.org/suns/ and the source code is hosted at https://github.com/godotgildor/Suns (PyMOL plugin, BSD license), https://github.com/Gabriel439/suns-cmd (command line client, BSD license), and https://github.com/Gabriel439/suns-search (search engine server, GPLv2 license). PMID:25079944
Molecular architectures and functions of radical enzymes and their (re)activating proteins.
Shibata, Naoki; Toraya, Tetsuo
2015-10-01
Certain proteins utilize the high reactivity of radicals for catalysing chemically challenging reactions. These proteins contain or form a radical and are therefore named 'radical enzymes'. Radicals are introduced by enzymes themselves or by (re)activating proteins called (re)activases. The X-ray structures of radical enzymes and their (re)activases revealed some structural features of these molecular apparatuses which solved common enigmas of radical enzymes, i.e. how the enzymes form or introduce radicals at the active sites, how they use the high reactivity of radicals for catalysis, how they suppress undesired side reactions of highly reactive radicals and how they are (re)activated when inactivated by extinction of radicals. This review highlights molecular architectures of radical B12 enzymes, radical SAM enzymes, tyrosyl radical enzymes, glycyl radical enzymes and their (re)activating proteins that support their functions. For generalization, comparisons of the recently reported structures of radical enzymes with those of canonical radical enzymes are summarized here. © The Authors 2015. Published by Oxford University Press on behalf of the Japanese Biochemical Society. All rights reserved.
Solving the mystery of the internal structure of casein micelles.
Ingham, B; Erlangga, G D; Smialowska, A; Kirby, N M; Wang, C; Matia-Merino, L; Haverkamp, R G; Carr, A J
2015-04-14
The interpretation of milk X-ray and neutron scattering data in relation to the internal structure of the casein micelle is an ongoing debate. We performed resonant X-ray scattering measurements on liquid milk and conclusively identified key scattering features, namely those corresponding to the size of and the distance between colloidal calcium phosphate particles. An X-ray scattering feature commonly assigned to the particle size is instead due to protein inhomogeneities.
Direct demodulation method for heavy atom position determination in protein crystallography
NASA Astrophysics Data System (ADS)
Zhou, Liang; Liu, Zhong-Chuan; Liu, Peng; Dong, Yu-Hui
2013-01-01
The first step of phasing in any de novo protein structure determination using isomorphous replacement (IR) or anomalous scattering (AD) experiments is to find heavy atom positions. Traditionally, heavy atom positions can be solved by inspecting the difference Patterson maps. Due to the weak signals in isomorphous or anomalous differences and the noisy background in the Patterson map, the search for heavy atoms may become difficult. Here, the direct demodulation (DD) method is applied to the difference Patterson maps to reduce the noisy backgrounds and sharpen the signal peaks. The real space Patterson search by using these optimized maps can locate the heavy atom positions more accurately. It is anticipated that the direct demodulation method can assist in heavy atom position determination and facilitate the de novo structure determination of proteins.
Holm, Liisa; Laakso, Laura M
2016-07-08
The Dali server (http://ekhidna2.biocenter.helsinki.fi/dali) is a network service for comparing protein structures in 3D. In favourable cases, comparing 3D structures may reveal biologically interesting similarities that are not detectable by comparing sequences. The Dali server has been running in various places for over 20 years and is used routinely by crystallographers on newly solved structures. The latest update of the server provides enhanced analytics for the study of sequence and structure conservation. The server performs three types of structure comparisons: (i) Protein Data Bank (PDB) search compares one query structure against those in the PDB and returns a list of similar structures; (ii) pairwise comparison compares one query structure against a list of structures specified by the user; and (iii) all against all structure comparison returns a structural similarity matrix, a dendrogram and a multidimensional scaling projection of a set of structures specified by the user. Structural superimpositions are visualized using the Java-free WebGL viewer PV. The structural alignment view is enhanced by sequence similarity searches against Uniprot. The combined structure-sequence alignment information is compressed to a stack of aligned sequence logos. In the stack, each structure is structurally aligned to the query protein and represented by a sequence logo. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Chemical Shift Assignments of the C-terminal Eps15 Homology Domain-3 EH Domain
Caplan, Steve; Sorgen, Paul L.
2013-01-01
The C-terminal Eps15 homology (EH) domain 3 (EHD3) belongs to a eukaryotic family of endocytic regulatory proteins and is involved in the recycling of various receptors from the early endosome to the endocytic recycling compartment or in retrograde transport from the endosomes to the Golgi. EH domains are highly conserved in the EHD family and function as protein-protein interaction units that bind to Asn-Pro-Phe (NPF) motif-containing proteins. The EH domain of EHD1 was the first C-terminal EH domain from the EHD family to be solved by NMR. The differences observed between this domain and proteins with N-terminal EH domains helped describe a mechanism for the differential binding of NPF-containing proteins. Here, structural studies were expanded to include the EHD3 EH domain. While the EHD1 and EHD3 EH domains are highly homologous, they have different protein partners. A comparison of these structures will help determine the selectivity in protein binding between the EHD family members and lead to a better understanding of their unique roles in endocytic regulation. PMID:23754701
Rissanen, Ilona; Grimes, Jonathan M.; Pawlowski, Alice; Mäntynen, Sari; Harlos, Karl; Bamford, Jaana K.H.; Stuart, David I.
2013-01-01
It has proved difficult to classify viruses unless they are closely related since their rapid evolution hinders detection of remote evolutionary relationships in their genetic sequences. However, structure varies more slowly than sequence, allowing deeper evolutionary relationships to be detected. Bacteriophage P23-77 is an example of a newly identified viral lineage, with members inhabiting extreme environments. We have solved multiple crystal structures of the major capsid proteins VP16 and VP17 of bacteriophage P23-77. They fit the 14 Å resolution cryo-electron microscopy reconstruction of the entire virus exquisitely well, allowing us to propose a model for both the capsid architecture and viral assembly, quite different from previously published models. The structures of the capsid proteins and their mode of association to form the viral capsid suggest that the P23-77-like and adeno-PRD1 lineages of viruses share an extremely ancient common ancestor. PMID:23623731
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bera, Asim K.; Atanasova, Vesna; Gamage, Swarna
2010-06-01
The structure of EhpF from P. agglomerans has been solved alone and in complex with phenazine-1,6-dicarboxylate. Apo EhpF was solved and refined in two different space groups at 1.95 and 2.3 Å resolution and the EhpF–phenazine-1,6-dicarboxylate complex structure was determined at 2.8 Å resolution. The structure of EhpF, a 41 kDa protein that functions in the biosynthetic pathway leading to the broad-spectrum antimicrobial compound d-alanylgriseoluteic acid (AGA), is reported. A cluster of approximately 16 genes, including ehpF, located on a 200 kbp plasmid native to certain strains of Pantoea agglomerans encodes the proteins that are required for the conversion of chorismic acid to AGA. Phenazine-1,6-dicarboxylate has been identified as an intermediate in AGA biosynthesis and deletion of ehpF results in accumulation of this compound in vivo. The crystallographic data presented here reveal that EhpF is an atypical member of the acyl-CoA synthase or ANL superfamily of adenylating enzymes. These enzymes typically catalyze two-step reactions involving adenylation of a carboxylate substrate followed by transfer of the substrate from AMP to coenzyme A or another phosphopantetheine. EhpF is distinguished by the absence of the C-terminal domain that is characteristic of enzymes from this family and is involved in phosphopantetheine binding and in the second half of the canonical two-step reaction that is typically observed. Based on the structure of EhpF and a bioinformatic analysis, it is proposed that EhpF and EhpG convert phenazine-1,6-dicarboxylate to 6-formylphenazine-1-carboxylate via an adenylyl intermediate.
Yan, Si; Guo, Changmiao; Hou, Guangjin; Zhang, Huilan; Lu, Xingyu; Williams, John Charles; Polenova, Tatyana
2015-11-24
Microtubules and their associated proteins perform a broad array of essential physiological functions, including mitosis, polarization and differentiation, cell migration, and vesicle and organelle transport. As such, they have been extensively studied at multiple levels of resolution (e.g., from structural biology to cell biology). Despite these efforts, there remain significant gaps in our knowledge concerning how microtubule-binding proteins bind to microtubules, how dynamics connect different conformational states, and how these interactions and dynamics affect cellular processes. Structures of microtubule-associated proteins assembled on polymeric microtubules are not known at atomic resolution. Here, we report a structure of the cytoskeleton-associated protein glycine-rich (CAP-Gly) domain of dynactin motor on polymeric microtubules, solved by magic angle spinning NMR spectroscopy. We present the intermolecular interface of CAP-Gly with microtubules, derived by recording direct dipolar contacts between CAP-Gly and tubulin using double rotational echo double resonance (dREDOR)-filtered experiments. Our results indicate that the structure adopted by CAP-Gly varies, particularly around its loop regions, permitting its interaction with multiple binding partners and with the microtubules. To our knowledge, this study reports the first atomic-resolution structure of a microtubule-associated protein on polymeric microtubules. Our approach lays the foundation for atomic-resolution structural analysis of other microtubule-associated motors.
Zhou, Peng; Wang, Congcong; Tian, Feifei; Ren, Yanrong; Yang, Chao; Huang, Jian
2013-01-01
Quantitative structure-activity relationship (QSAR), a regression modeling methodology that establishes statistical correlation between structure feature and apparent behavior for a series of congeneric molecules quantitatively, has been widely used to evaluate the activity, toxicity and property of various small-molecule compounds such as drugs, toxicants and surfactants. However, it is surprising to see that such useful technique has only very limited applications to biomacromolecules, albeit the solved 3D atom-resolution structures of proteins, nucleic acids and their complexes have accumulated rapidly in past decades. Here, we present a proof-of-concept paradigm for the modeling, prediction and interpretation of the binding affinity of 144 sequence-nonredundant, structure-available and affinity-known protein complexes (Kastritis et al. Protein Sci 20:482-491, 2011) using a biomacromolecular QSAR (BioQSAR) scheme. We demonstrate that the modeling performance and predictive power of BioQSAR are comparable to or even better than that of traditional knowledge-based strategies, mechanism-type methods and empirical scoring algorithms, while BioQSAR possesses certain additional features compared to the traditional methods, such as adaptability, interpretability, deep-validation and high-efficiency. The BioQSAR scheme could be readily modified to infer the biological behavior and functions of other biomacromolecules, if their X-ray crystal structures, NMR conformation assemblies or computationally modeled structures are available.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jia, Xiaofei; Singh, Rajendra; Homann, Stefanie
The HIV-1 protein Nef inhibits antigen presentation by class I major histocompatibility complex (MHC-I). We determined the mechanism of this activity by solving the crystal structure of a protein complex comprising Nef, the MHC-I cytoplasmic domain (MHC-I CD) and the μ1 subunit of the clathrin adaptor protein complex 1. A ternary, cooperative interaction clamps the MHC-I CD into a narrow binding groove at the Nef-μ1 interface, which encompasses the cargo-recognition site of μ1 and the proline-rich strand of Nef. The Nef C terminus induces a previously unobserved conformational change in μ1, whereas the N terminus binds the Nef core to position it optimally for complex formation. Positively charged patches on μ1 recognize acidic clusters in Nef and MHC-I. The structure shows how Nef functions as a clathrin-associated sorting protein to alter the specificity of host membrane trafficking and enable viral evasion of adaptive immunity.
Squeglia, Flavia; Bachert, Beth; Romano, Maria; Lukomski, Slawomir; Berisio, Rita
2013-09-01
Streptococcal collagen-like proteins (Scls) are widely expressed by the well recognized human pathogen Streptococcus pyogenes. These surface proteins contain a signature central collagen-like region and an amino-terminal globular domain, termed the variable domain, which is protruded away from the cell surface by the collagen-like domain. Despite their recognized importance in bacterial pathogenicity, no structural information is presently available on proteins of the Scl class. The variable domain of Scl2 from invasive M3-type S. pyogenes has successfully been crystallized using vapour-diffusion methods. The crystals diffracted to 1.5 Å resolution and belonged to space group H32, with unit-cell parameters a = 44.23, b = 44.23, c = 227.83 Å. The crystal structure was solved by single-wavelength anomalous dispersion using anomalous signal from a europium chloride derivative.
Structure and proposed mechanism of α-glycerophosphate oxidase from Mycoplasma pneumoniae
Elkhal, Callia K.; Kean, Kelsey M.; Parsonage, Derek; ...
2015-03-14
The formation of hydrogen peroxide (H₂O₂) by the FAD-dependent α-glycerophosphate oxidase (GlpO) is important for the pathogenesis of Streptococcus pneumoniae and Mycoplasma pneumoniae. The structurally known GlpO from Streptococcus sp. (SspGlpO) is similar to the pneumococcal protein (SpGlpO) and provides a guide for drug design against that target. However, M. pneumoniae GlpO (MpGlpO), having <20% sequence identity with structurally known GlpOs, appears to represent a second type of GlpO that we designate as Type II GlpOs. Here, the recombinant His-tagged MpGlpO structure is described at ~2.5 Å resolution, solved by molecular replacement using as a search model the Bordetella pertussis protein 3253 (Bp3253), a protein of unknown function solved by structural genomics efforts. Recombinant MpGlpO is an active oxidase with a turnover number of ~580 min⁻¹ while Bp3253 showed no GlpO activity. No substantial differences exist between the oxidized and dithionite-reduced MpGlpO structures. Although no liganded structures were determined, a comparison with the tartrate-bound Bp3253 structure and consideration of residue conservation patterns guided the construction of a model for α-glycerophosphate (Glp) recognition and turnover by MpGlpO. The predicted binding mode also appears relevant for the type I GlpOs (such as SspGlpO) despite differences in substrate recognition residues, and it implicates a histidine conserved in type I and II Glp oxidases and dehydrogenases as the catalytic acid/base. This work provides a solid foundation for guiding further studies of the mitochondrial Glp dehydrogenases as well as for continued studies of M. pneumoniae and S. pneumoniae glycerol metabolism and the development of novel therapeutics targeting MpGlpO and SpGlpO.
Protein space: a natural method for realizing the nature of protein universe.
Yu, Chenglong; Deng, Mo; Cheng, Shiu-Yuen; Yau, Shek-Chung; He, Rong L; Yau, Stephen S-T
2013-02-07
Current methods cannot tell us what the nature of the protein universe is concretely. They are based on different models of amino acid substitution and multiple sequence alignment which is an NP-hard problem and requires manual intervention. Protein structural analysis also gives a direction for mapping the protein universe. Unfortunately, now only a minuscule fraction of proteins' 3-dimensional structures are known. Furthermore, the phylogenetic tree representations are not unique for any existing tree construction methods. Here we develop a novel method to realize the nature of protein universe. We show the protein universe can be realized as a protein space in 60-dimensional Euclidean space using a distance based on a normalized distribution of amino acids. Every protein is in one-to-one correspondence with a point in protein space, where proteins with similar properties stay close together. Thus the distance between two points in protein space represents the biological distance of the corresponding two proteins. We also propose a natural graphical representation for inferring phylogenies. The representation is natural and unique based on the biological distances of proteins in protein space. This will solve the fundamental question of how proteins are distributed in the protein universe. Copyright © 2012 Elsevier Ltd. All rights reserved.
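The idea of placing every sequence at a point in a 60-dimensional space can be illustrated with a short sketch. It assumes, as one plausible reading of "a normalized distribution of amino acids", three numbers per residue type (count, mean position, and a length-normalized second central moment), which gives 20 × 3 = 60 coordinates. The function names and the exact normalization are illustrative assumptions and are not taken from the abstract.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residue types


def protein_vector(seq):
    """Return 60 numbers: counts, mean positions, scaled second moments."""
    n = len(seq)
    counts, means, moments = [], [], []
    for aa in AMINO_ACIDS:
        # 1-based positions at which this residue type occurs
        positions = [i + 1 for i, res in enumerate(seq) if res == aa]
        k = len(positions)
        mu = sum(positions) / k if k else 0.0
        # Second central moment divided by k * n (assumed normalization)
        d2 = sum((p - mu) ** 2 for p in positions) / (k * n) if k else 0.0
        counts.append(float(k))
        means.append(mu)
        moments.append(d2)
    return counts + means + moments


def protein_distance(seq_a, seq_b):
    """Euclidean distance between two sequences in the 60-dimensional space."""
    return math.dist(protein_vector(seq_a), protein_vector(seq_b))
```

Because the vector is computed from a single sequence, no alignment step is needed, which is the property the abstract contrasts with alignment-based methods.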
Mode localization in the cooperative dynamics of protein recognition
NASA Astrophysics Data System (ADS)
Copperman, J.; Guenza, M. G.
2016-07-01
The biological function of proteins is encoded in their structure and expressed through the mediation of their dynamics. This paper presents a study on the correlation between local fluctuations, binding, and biological function for two sample proteins, starting from the Langevin Equation for Protein Dynamics (LE4PD). The LE4PD is a microscopic and residue-specific coarse-grained approach to protein dynamics, which starts from the static structural ensemble of a protein and predicts the dynamics analytically. It has been shown to be accurate in its prediction of NMR relaxation experiments and Debye-Waller factors. The LE4PD is solved in a set of diffusive modes which span a vast range of time scales of the protein dynamics, and provides a detailed picture of the mode-dependent localization of the fluctuation as a function of the primary structure of the protein. To investigate the dynamics of protein complexes, the theory is implemented here to treat the coarse-grained dynamics of interacting macromolecules. As an example, calculations of the dynamics of monomeric and dimerized HIV protease and the free Insulin Growth Factor II Receptor (IGF2R) domain 11 and its IGF2R:IGF2 complex are presented. Either simulation-derived or experimentally measured NMR conformers are used as input structural ensembles to the theory. The picture that emerges suggests a dynamical heterogeneous protein where biologically active regions provide energetically comparable conformational states that are trapped by a reacting partner in agreement with the conformation-selection mechanism of binding.
Hafsa, Noor E.; Arndt, David; Wishart, David S.
2015-01-01
The Chemical Shift Index or CSI 3.0 (http://csi3.wishartlab.com) is a web server designed to accurately identify the location of secondary and super-secondary structures in protein chains using only nuclear magnetic resonance (NMR) backbone chemical shifts and their corresponding protein sequence data. Unlike earlier versions of CSI, which only identified three types of secondary structure (helix, β-strand and coil), CSI 3.0 now identifies a total of 11 types of secondary and super-secondary structures, including helices, β-strands, coil regions, five common β-turns (type I, II, I′, II′ and VIII), β hairpins as well as interior and edge β-strands. CSI 3.0 accepts experimental NMR chemical shift data in multiple formats (NMR Star 2.1, NMR Star 3.1 and SHIFTY) and generates colorful CSI plots (bar graphs) and secondary/super-secondary structure assignments. The output can be readily used as constraints for structure determination and refinement or the images may be used for presentations and publications. CSI 3.0 uses a pipeline of several well-tested, previously published programs to identify the secondary and super-secondary structures in protein chains. Comparisons of secondary and super-secondary structure assignments made via standard coordinate analysis programs such as DSSP, STRIDE and VADAR on high-resolution protein structures solved by X-ray and NMR with those made with CSI 3.0 show >90% agreement. PMID:25979265
Misra, Rajeev
2012-01-01
In the last decade, there has been an explosion of publications on the assembly of β-barrel outer membrane proteins (OMPs), which carry out diverse cellular functions, including solute transport, protein secretion, and assembly of protein and lipid components of the outer membrane. Of the three outer membrane model systems—Gram-negative bacteria, mitochondria and chloroplasts—research on bacterial and mitochondrial systems has so far led the way in dissecting the β-barrel OMP assembly pathways. Many exciting discoveries have been made, including the identification of β-barrel OMP assembly machineries in bacteria and mitochondria, and potentially the core assembly component in chloroplasts. The atomic structures of all five components of the bacterial β-barrel assembly machinery (BAM) complex, except the β-barrel domain of the core BamA protein, have been solved. Structures reveal that these proteins contain domains/motifs known to facilitate protein-protein interactions, which are at the heart of the assembly pathways. While structural information has been valuable, most of our current understanding of the β-barrel OMP assembly pathways has come from genetic, molecular biology, and biochemical analyses. This paper provides a comparative account of the β-barrel OMP assembly pathways in Gram-negative bacteria, mitochondria, and chloroplasts. PMID:27335668
Dang, Bobo; Kubota, Tomoya; Mandal, Kalyaneswar; Bezanilla, Francisco; Kent, Stephen B H
2013-08-14
We have re-examined the utility of native chemical ligation at -Gln/Glu-Cys- [Glx-Cys] and -Asn/Asp-Cys- [Asx-Cys] sites. Using the improved thioaryl catalyst 4-mercaptophenylacetic acid (MPAA), native chemical ligation could be performed at -Gln-Cys- and Asn-Cys- sites without side reactions. After optimization, ligation at a -Glu-Cys- site could also be used as a ligation site, with minimal levels of byproduct formation. However, -Asp-Cys- is not appropriate for use as a site for native chemical ligation because of formation of significant amounts of β-linked byproduct. The feasibility of native chemical ligation at -Gln-Cys- enabled a convergent total chemical synthesis of the enantiomeric forms of the ShK toxin protein molecule. The D-ShK protein molecule was ~50,000-fold less active in blocking the Kv1.3 channel than the L-ShK protein molecule. Racemic protein crystallography was used to obtain high-resolution X-ray diffraction data for ShK toxin. The structure was solved by direct methods and showed significant differences from the previously reported NMR structures in some regions of the ShK protein molecule.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rantanen, Mika K.; Lehtiö, Lari; Rajagopal, Lakshmi
Two S. agalactiae proteins, the inorganic pyrophosphatase and the serine/threonine phosphatase, were crystallized and diffraction data were collected and processed from these crystals. The data from the two protein crystals extended to 2.80 and 2.65 Å, respectively. Streptococcus agalactiae, which infects human neonates and causes sepsis and meningitis, has recently been shown to possess a eukaryotic-like serine/threonine protein phosphorylation signalling cascade. Through their target proteins, the S. agalactiae Ser/Thr kinase and Ser/Thr phosphatase together control the growth as well as the morphology and virulence of this organism. One of the targets is the S. agalactiae family II inorganic pyrophosphatase. The inorganic pyrophosphatase and the serine/threonine phosphatase have therefore been purified and crystallized and diffraction data have been collected from their crystals. The data were processed using XDS. The inorganic pyrophosphatase crystals diffracted to 2.80 Å and the Ser/Thr phosphatase crystals to 2.65 Å. Initial structure-solution experiments indicate that structure solution will be successful in both cases. Solving the structure of the proteins involved in this cascade is the first step towards understanding this phenomenon in atomic detail.
The rate of cis-trans conformation errors is increasing in low-resolution crystal structures.
Croll, Tristan Ian
2015-03-01
Cis-peptide bonds (with the exception of X-Pro) are exceedingly rare in native protein structures, yet a check for these is not currently included in the standard workflow for some common crystallography packages nor in the automated quality checks that are applied during submission to the Protein Data Bank. This appears to be leading to a growing rate of inclusion of spurious cis-peptide bonds in low-resolution structures both in absolute terms and as a fraction of solved residues. Most concerningly, it is possible for structures to contain very large numbers (>1%) of spurious cis-peptide bonds while still achieving excellent quality reports from MolProbity, leading to concerns that ignoring such errors is allowing software to overfit maps without producing telltale errors in, for example, the Ramachandran plot.
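A check of the kind the abstract says is missing from standard workflows can be sketched in a few lines. The peptide-bond torsion ω is the dihedral defined by Cα(i), C(i), N(i+1) and Cα(i+1); it is close to 180° for a trans bond and close to 0° for a cis bond. The 30° tolerance and the function names below are assumptions chosen for illustration, not values taken from the paper or from any crystallography package.

```python
import math


def dihedral(p0, p1, p2, p3):
    """Signed dihedral angle in degrees for four 3D points."""
    b0 = [a - b for a, b in zip(p0, p1)]
    b1 = [b - a for a, b in zip(p1, p2)]
    b2 = [b - a for a, b in zip(p2, p3)]
    norm = math.sqrt(sum(c * c for c in b1))
    b1 = [c / norm for c in b1]

    def dot(u, v):
        return sum(a * b for a, b in zip(u, v))

    def cross(u, v):
        return [u[1] * v[2] - u[2] * v[1],
                u[2] * v[0] - u[0] * v[2],
                u[0] * v[1] - u[1] * v[0]]

    # Project b0 and b2 onto the plane perpendicular to the central bond
    v = [a - dot(b0, b1) * c for a, c in zip(b0, b1)]
    w = [a - dot(b2, b1) * c for a, c in zip(b2, b1)]
    return math.degrees(math.atan2(dot(cross(b1, v), w), dot(v, w)))


def is_cis(ca_i, c_i, n_next, ca_next, tolerance=30.0):
    """Flag a peptide bond as cis when |omega| is within the tolerance of 0."""
    return abs(dihedral(ca_i, c_i, n_next, ca_next)) < tolerance
```

Running `is_cis` over every consecutive residue pair of a model and reporting the non-proline hits gives the fraction of cis bonds that can then be compared with the very low rate expected in native structures.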
Protein 3D Structure Computed from Evolutionary Sequence Variation
Sheridan, Robert; Hopf, Thomas A.; Pagnani, Andrea; Zecchina, Riccardo; Sander, Chris
2011-01-01
The evolutionary trajectory of a protein through sequence space is constrained by its function. Collections of sequence homologs record the outcomes of millions of evolutionary experiments in which the protein evolves according to these constraints. Deciphering the evolutionary record held in these sequences and exploiting it for predictive and engineering purposes presents a formidable challenge. The potential benefit of solving this challenge is amplified by the advent of inexpensive high-throughput genomic sequencing. In this paper we ask whether we can infer evolutionary constraints from a set of sequence homologs of a protein. The challenge is to distinguish true co-evolution couplings from the noisy set of observed correlations. We address this challenge using a maximum entropy model of the protein sequence, constrained by the statistics of the multiple sequence alignment, to infer residue pair couplings. Surprisingly, we find that the strength of these inferred couplings is an excellent predictor of residue-residue proximity in folded structures. Indeed, the top-scoring residue couplings are sufficiently accurate and well-distributed to define the 3D protein fold with remarkable accuracy. We quantify this observation by computing, from sequence alone, all-atom 3D structures of fifteen test proteins from different fold classes, ranging in size from 50 to 260 residues, including a G-protein coupled receptor. These blinded inferences are de novo, i.e., they do not use homology modeling or sequence-similar fragments from known structures. The co-evolution signals provide sufficient information to determine accurate 3D protein structure to 2.7–4.8 Å Cα-RMSD error relative to the observed structure, over at least two-thirds of the protein (method called EVfold, details at http://EVfold.org).
This discovery provides insight into essential interactions constraining protein evolution and will facilitate a comprehensive survey of the universe of protein structures, new strategies in protein and drug design, and the identification of functional genetic variants in normal and disease genomes. PMID:22163331
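To make the starting point of this abstract concrete, the sketch below scores pairs of alignment columns by mutual information. This is deliberately the simpler, raw correlation measure (the "noisy set of observed correlations" the abstract mentions), not the maximum entropy model used by EVfold; the function names and the four-sequence toy alignment in the usage note are invented for illustration.

```python
import math
from collections import Counter
from itertools import combinations


def mutual_information(column_a, column_b):
    """Mutual information (in nats) between two alignment columns."""
    n = len(column_a)
    freq_a = Counter(column_a)
    freq_b = Counter(column_b)
    freq_ab = Counter(zip(column_a, column_b))
    mi = 0.0
    for (a, b), count in freq_ab.items():
        p_joint = count / n
        p_independent = (freq_a[a] / n) * (freq_b[b] / n)
        mi += p_joint * math.log(p_joint / p_independent)
    return mi


def rank_column_pairs(alignment):
    """Return ((i, j), score) for all column pairs, highest score first."""
    columns = list(zip(*alignment))  # transpose: one tuple per column
    scores = {
        (i, j): mutual_information(columns[i], columns[j])
        for i, j in combinations(range(len(columns)), 2)
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)
```

For the toy alignment `["ARLD", "ARLE", "GELD", "GELE"]`, columns 0 and 1 vary together and rank first, while the conserved column 2 and the independently varying column 3 score zero against every other column. Real alignments add indirect (transitive) correlations, which is the problem the global model in the abstract is meant to address.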
Structure of a putative acetyltransferase (PA1377) from Pseudomonas aeruginosa
DOE Office of Scientific and Technical Information (OSTI.GOV)
Davies, Anna M.; Tata, Renée; Chauviac, François-Xavier
2008-05-01
The crystal structure of an acetyltransferase encoded by the gene PA1377 from Pseudomonas aeruginosa has been determined at 2.25 Å resolution. Comparison with a related acetyltransferase revealed a structural difference in the active site that was taken to reflect a difference in substrate binding and/or specificity between the two enzymes. Gene PA1377 from Pseudomonas aeruginosa encodes a 177-amino-acid conserved hypothetical protein of unknown function. The structure of this protein (termed pitax) has been solved in space group I222 to 2.25 Å resolution. Pitax belongs to the GCN5-related N-acetyltransferase family and contains all four sequence motifs conserved among family members. The β-strand structure in one of these motifs (motif A) is disrupted, which is believed to affect binding of the substrate that accepts the acetyl group from acetyl-CoA.
Genome Pool Strategy for Structural Coverage of Protein Families
Jaroszewski, Lukasz; Slabinski, Lukasz; Wooley, John; Deacon, Ashley M.; Lesley, Scott A.; Wilson, Ian. A.; Godzik, Adam
2010-01-01
As noticed by generations of structural biologists, closely homologous proteins may have substantially different crystallization properties and propensities. These observations can be used to systematically introduce additional dimensionality into crystallization trials by targeting homologous proteins from multiple genomes in a “genome pool” strategy. Through extensive use of our recently introduced “crystallization feasibility score” (Slabinski et al., 2007a), we can explain that the genome pool strategy works well because the crystallization feasibility scores are surprisingly broad within families of homologous proteins, with most families containing a range of optimal to very difficult targets. We also show that some families can be regarded as relatively “easy”, where a significant number of proteins are predicted to have optimal crystallization features, and others are “very difficult”, where almost none are predicted to result in a crystal structure. Thus, the outcome of such variable distributions of such “crystallizability” preferences leads to uneven structural coverage of known families, with “easier” or “optimal” families having several times more solved structures than “very difficult” ones. Nevertheless, this latter category can be successfully targeted by increasing the number of genomes that are used to select targets from a given family. On average, adding 10 new genomes to the “genome pool” provides more promising targets for 7 “very difficult” families. In contrast, our crystallization feasibility score does not indicate that any specific microbial genomes can be readily classified as “easier” or “very difficult” with respect to providing suitable candidates for crystallization and structure determination.
Finally, our analyses show that specific physicochemical properties of the protein sequence favor successful outcomes for structure determination and, hence, the group of proteins with known 3D structures is systematically different from the general pool of known proteins. We, therefore, assess the structural consequences of these differences in protein sequence and protein biophysical properties. PMID:19000818
Urvoas, Agathe; Guellouz, Asma; Valerio-Lepiniec, Marie; Graille, Marc; Durand, Dominique; Desravines, Danielle C; van Tilbeurgh, Herman; Desmadril, Michel; Minard, Philippe
2010-11-26
Repeat proteins have a modular organization and a regular architecture that make them attractive models for design and directed evolution experiments. HEAT repeat proteins, although very common, have not been used as a scaffold for artificial proteins, probably because they are made of long and irregular repeats. Here, we present and validate a consensus sequence for artificial HEAT repeat proteins. The sequence was defined from the structure-based sequence analysis of a thermostable HEAT-like repeat protein. Appropriate sequences were identified for the N- and C-caps. A library of genes coding for artificial proteins based on this sequence design, named αRep, was assembled using new and versatile methodology based on circular amplification. Proteins picked randomly from this library are expressed as soluble proteins. The biophysical properties of proteins with different numbers of repeats and different combinations of side chains in hypervariable positions were characterized. Circular dichroism and differential scanning calorimetry experiments showed that all these proteins are folded cooperatively and are very stable (T(m) >70 °C). Stability of these proteins increases with the number of repeats. Detailed gel filtration and small-angle X-ray scattering studies showed that the purified proteins form either monomers or dimers. The X-ray structure of a stable dimeric variant structure was solved. The protein is folded with a highly regular topology and the repeat structure is organized, as expected, as pairs of alpha helices. In this protein variant, the dimerization interface results directly from the variable surface enriched in aromatic residues located in the randomized positions of the repeats. The dimer was crystallized both in an apo and in a PEG-bound form, revealing a very well defined binding crevice and some structure flexibility at the interface. 
This fortuitous binding site could later prove to be a useful binding site for other low molecular mass partners. Copyright © 2010 Elsevier Ltd. All rights reserved.
Membrane protein properties revealed through data-rich electrostatics calculations
Guerriero, Christopher J.; Brodsky, Jeffrey L.; Grabe, Michael
2015-01-01
The electrostatic properties of membrane proteins often reveal many of their key biophysical characteristics, such as ion channel selectivity and the stability of charged membrane-spanning segments. The Poisson-Boltzmann (PB) equation is the gold standard for calculating protein electrostatics, and the software APBSmem enables the solution of the PB equation in the presence of a membrane. Here, we describe significant advances to APBSmem including: full automation of system setup, per-residue energy decomposition, incorporation of PDB2PQR, calculation of membrane induced pKa shifts, calculation of non-polar energies, and command-line scripting for large scale calculations. We highlight these new features with calculations carried out on a number of membrane proteins, including the recently solved structure of the ion channel TRPV1 and a large survey of 1,614 membrane proteins of known structure. This survey provides a comprehensive list of residues with large electrostatic penalties for being embedded in the membrane, potentially revealing interesting functional information. PMID:26118532
Membrane Protein Properties Revealed through Data-Rich Electrostatics Calculations.
Marcoline, Frank V; Bethel, Neville; Guerriero, Christopher J; Brodsky, Jeffrey L; Grabe, Michael
2015-08-04
The electrostatic properties of membrane proteins often reveal many of their key biophysical characteristics, such as ion channel selectivity and the stability of charged membrane-spanning segments. The Poisson-Boltzmann (PB) equation is the gold standard for calculating protein electrostatics, and the software APBSmem enables the solution of the PB equation in the presence of a membrane. Here, we describe significant advances to APBSmem, including full automation of system setup, per-residue energy decomposition, incorporation of PDB2PQR, calculation of membrane-induced pKa shifts, calculation of non-polar energies, and command-line scripting for large-scale calculations. We highlight these new features with calculations carried out on a number of membrane proteins, including the recently solved structure of the ion channel TRPV1 and a large survey of 1,614 membrane proteins of known structure. This survey provides a comprehensive list of residues with large electrostatic penalties for being embedded in the membrane, potentially revealing interesting functional information. Copyright © 2015 Elsevier Ltd. All rights reserved.
The Significance of G Protein-Coupled Receptor Crystallography for Drug Discovery
Salon, John A.; Lodowski, David T.
2011-01-01
Crucial as molecular sensors for many vital physiological processes, seven-transmembrane domain G protein-coupled receptors (GPCRs) comprise the largest family of proteins targeted by drug discovery. Together with structures of the prototypical GPCR rhodopsin, solved structures of other liganded GPCRs promise to provide insights into the structural basis of the superfamily's biochemical functions and assist in the development of new therapeutic modalities and drugs. One of the greatest technical and theoretical challenges to elucidating and exploiting structure-function relationships in these systems is the emerging concept of GPCR conformational flexibility and its cause-effect relationship for receptor-receptor and receptor-effector interactions. Such conformational changes can be subtle and triggered by relatively small binding energy effects, leading to full or partial efficacy in the activation or inactivation of the receptor system at large. Pharmacological dogma generally dictates that these changes manifest themselves through kinetic modulation of the receptor's G protein partners. Atomic resolution information derived from increasingly available receptor structures provides an entrée to the understanding of these events and practically applying it to drug design. Supported by structure-activity relationship information arising from empirical screening, a unified structural model of GPCR activation/inactivation promises to both accelerate drug discovery in this field and improve our fundamental understanding of structure-based drug design in general. This review discusses fundamental problems that persist in drug design and GPCR structural determination. PMID:21969326
Preorganization of molecular binding sites in designed diiron proteins.
Maglio, Ornella; Nastri, Flavia; Pavone, Vincenzo; Lombardi, Angela; DeGrado, William F
2003-04-01
De novo protein design provides an attractive approach to critically test the features that are required for metalloprotein structure and function. Previously we designed and crystallographically characterized an idealized dimeric model for the four-helix bundle class of diiron and dimanganese proteins [Dueferri 1 (DF1)]. Although the protein bound metal ions in the expected manner, access to its active site was blocked by large bulky hydrophobic residues. Subsequently, a substrate-access channel was introduced proximal to the metal-binding center, resulting in a protein with properties more closely resembling those of natural enzymes. Here we delineate the energetic and structural consequences associated with the introduction of these binding sites. To determine the extent to which the binding site was preorganized in the absence of metal ions, the apo structure of DF1 in solution was solved by NMR and compared with the crystal structure of the di-Zn(II) derivative. The overall fold of the apo protein was highly similar to that of the di-Zn(II) derivative, although there was a rotation of one of the helices. We also examined the thermodynamic consequences associated with building a small molecule-binding site within the protein. The protein exists in an equilibrium between folded dimers and unfolded monomers. DF1 is a highly stable protein (K(diss) = 0.001 fM), but the dissociation constant increases to 0.6 nM (ΔΔG = 5.4 kcal/mol monomer) as the active-site cavity is increased to accommodate small molecules.
Novel Computational Approaches to Drug Discovery
NASA Astrophysics Data System (ADS)
Skolnick, Jeffrey; Brylinski, Michal
2010-01-01
New approaches to protein functional inference based on protein structure and evolution are described. First, FINDSITE, a threading based approach to protein function prediction, is summarized. Then, the results of large scale benchmarking of ligand binding site prediction, ligand screening, including applications to HIV protease, and GO molecular functional inference are presented. A key advantage of FINDSITE is its ability to use low resolution, predicted structures as well as high resolution experimental structures. Then, an extension of FINDSITE to ligand screening in GPCRs using predicted GPCR structures, FINDSITE/QDOCKX, is presented. This is a particularly difficult case as there are few experimentally solved GPCR structures. Thus, we first train on a subset of known binding ligands for a set of GPCRs; this is then followed by benchmarking against a large ligand library. For the virtual ligand screening of a number of Dopamine receptors, encouraging results are seen, with significant enrichment in identified ligands over those found in the training set. Thus, FINDSITE and its extensions represent a powerful approach to the successful prediction of a variety of molecular functions.
Zhang, Jian; Yang, Jianyi; Jang, Richard; Zhang, Yang
2015-08-04
Experimental structure determination remains difficult for G protein-coupled receptors (GPCRs). We propose a new hybrid protocol to construct GPCR structure models that integrates experimental mutagenesis data with ab initio transmembrane (TM) helix assembly simulations. The method was tested on 24 known GPCRs where the ab initio TM-helix assembly procedure constructed the correct fold for 20 cases. When combined with weak homology and sparse mutagenesis restraints, the method generated correct folds for all the tested cases with an average Cα root-mean-square deviation 2.4 Å in the TM regions. The new hybrid protocol was applied to model all 1,026 GPCRs in the human genome, where 923 have a high confidence score and are expected to have correct folds; these contain many pharmaceutically important families with no previously solved structures, including Trace amine, Prostanoids, Releasing hormones, Melanocortins, Vasopressin, and Neuropeptide Y receptors. The results demonstrate new progress on genome-wide structure modeling of TM proteins. Copyright © 2015 Elsevier Ltd. All rights reserved.
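The accuracy figure quoted in this abstract is a Cα root-mean-square deviation. As a reminder of what that number measures, the sketch below computes the RMSD between two equal-length lists of Cα coordinates. It assumes the two structures have already been optimally superposed and that residues are matched one to one, steps a real comparison would have to perform first; the function name is an assumption for this example.

```python
import math


def ca_rmsd(coords_a, coords_b):
    """RMSD (same units as the input, e.g. Å) between matched Cα positions.

    Both inputs are sequences of (x, y, z) tuples in the same order and are
    assumed to be superposed already; no fitting is performed here.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(coords_a))
```

Because each deviation is squared before averaging, a few badly placed residues can dominate the value, which is one reason such figures are often reported for a defined region such as the TM helices.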
Expression and crystallization of the plant alternative oxidase.
May, Benjamin; Elliott, Catherine; Iwata, Momi; Young, Luke; Shearman, Julia; Albury, Mary S; Moore, Anthony L
2015-01-01
The alternative oxidase (AOX) is an integral monotopic membrane protein located on the inner surface of the inner mitochondrial membrane. Branching from the traditional respiratory chain at the quinone pool, AOX is responsible for cyanide-resistant respiration in plants and fungi, heat generation in thermogenic plants, and survival of parasites, such as Trypanosoma brucei, in the human host. A recently solved AOX structure provides insight into its active site, thereby facilitating rational phytopathogenic and antiparasitic drug design. Here, we describe expression of recombinant AOX using two different expression systems. Purification protocols for the production of highly pure and stable AOX protein in sufficient quantities to facilitate further kinetic, biophysical, and structural analyses are also described.
DOE Office of Scientific and Technical Information (OSTI.GOV)
He, Hongxing; Fang, Hengrui; Miller, Mitchell D.
2016-07-15
An iterative transform algorithm is proposed to improve the conventional molecular-replacement method for solving the phase problem in X-ray crystallography. Several examples of successful trial calculations carried out with real diffraction data are presented. An iterative transform method proposed previously for direct phasing of high-solvent-content protein crystals is employed for enhancing the molecular-replacement (MR) algorithm in protein crystallography. Target structures that are resistant to conventional MR due to insufficient similarity between the template and target structures might be tractable with this modified phasing method. Trial calculations involving three different structures are described to test and illustrate the methodology. The relationship of the approach to PHENIX Phaser-MR and MR-Rosetta is discussed.
Protein Data Bank depositions from synchrotron sources.
Jiang, Jiansheng; Sweet, Robert M
2004-07-01
A survey and analysis of Protein Data Bank (PDB) depositions from international synchrotron radiation facilities, based on the latest released PDB entries, are reported. The results (http://asdp.bnl.gov/asda/Libraries/) show that worldwide, every year since 1999, more than 50% of the deposited X-ray structures have used synchrotron facilities, reaching 75% by 2003. In this web-based database, all PDB entries among individual synchrotron beamlines are archived, synchronized with the weekly PDB release. Statistics regarding the quality of experimental data and the refined model for all structures are presented, and these are analysed to reflect the impact of synchrotron sources. The results confirm the common impression that synchrotron sources extend the size of structures that can be solved with equivalent or better quality than home sources.
Epa, V. Chandana; Dolezal, Olan; Doughty, Larissa; Xiao, Xiaowen; Jost, Christian; Plückthun, Andreas; Adams, Timothy E.
2013-01-01
Designed Ankyrin Repeat Proteins are a class of novel binding proteins that can be selected and evolved to bind to targets with high affinity and specificity. We are interested in the DARPin H10-2-G3, which has been evolved to bind with very high affinity to the human epidermal growth factor receptor 2 (HER2). HER2 is found to be over-expressed in 30% of breast cancers, and is the target for the FDA-approved therapeutic monoclonal antibodies trastuzumab and pertuzumab and small molecule tyrosine kinase inhibitors. Here, we use computational macromolecular docking, coupled with several interface metrics such as shape complementarity, interaction energy, and electrostatic complementarity, to model the structure of the complex between the DARPin H10-2-G3 and HER2. We analyzed the interface between the two proteins and then validated the structural model by showing that selected HER2 point mutations at the putative interface with H10-2-G3 reduce the affinity of binding up to 100-fold without affecting the binding of trastuzumab. Comparisons made with a subsequently solved X-ray crystal structure of the complex yielded a backbone atom root mean square deviation of 0.84–1.14 Ångstroms. The study presented here demonstrates the capability of the computational techniques of structural bioinformatics in generating useful structural models of protein-protein interactions. PMID:23527120
DOE Office of Scientific and Technical Information (OSTI.GOV)
Geerds, Christina; Wohlmann, Jens; Haas, Albert
The structure of VapB, a member of the Vap protein family that is involved in virulence of the bacterial pathogen R. equi, was determined by SAD phasing and reveals an eight-stranded antiparallel β-barrel similar to avidin, suggestive of a binding function. Made up of two Greek-key motifs, the topology of VapB is unusual or even unique. Members of the virulence-associated protein (Vap) family from the pathogen Rhodococcus equi regulate virulence in an unknown manner. They do not share recognizable sequence homology with any protein of known structure. VapB and VapA are normally associated with isolates from pigs and horses, respectively. To contribute to a molecular understanding of Vap function, the crystal structure of a protease-resistant VapB fragment was determined at 1.4 Å resolution. The structure was solved by SAD phasing employing the anomalous signal of one endogenous S atom and two bound Co ions with low occupancy. VapB is an eight-stranded antiparallel β-barrel with a single helix. Structural similarity to avidins suggests a potential binding function. Unlike other eight- or ten-stranded β-barrels found in avidins, bacterial outer membrane proteins, fatty-acid-binding proteins and lysozyme inhibitors, Vaps do not have a next-neighbour arrangement but consist of two Greek-key motifs with strand order 41238567, suggesting an unusual or even unique topology.
Structural characterization of metal binding to a cold-adapted frataxin.
Noguera, Martín E; Roman, Ernesto A; Rigal, Juan B; Cousido-Siah, Alexandra; Mitschler, André; Podjarny, Alberto; Santos, Javier
2015-06-01
Frataxin is an evolutionary conserved protein that participates in iron metabolism. Deficiency of this small protein in humans causes a severe neurodegenerative disease known as Friedreich's ataxia. A number of studies indicate that frataxin binds iron and regulates Fe-S cluster biosynthesis. Previous structural studies showed that metal binding occurs mainly in a region of high density of negative charge. However, a comprehensive characterization of the binding sites is required to gain further insights into the mechanistic details of frataxin function. In this work, we have solved the X-ray crystal structures of a cold-adapted frataxin from a psychrophilic bacterium in the presence of cobalt or europium ions. We have identified a number of metal-binding sites, mainly solvent exposed, several of which had not been observed in previous studies on mesophilic homologues. No major structural changes were detected upon metal binding, although the structures exhibit significant changes in crystallographic B-factors. The analysis of these B-factors, in combination with crystal packing and RMSD among structures, suggests the existence of localized changes in the internal motions. Based on these results, we propose that bacterial frataxins possess binding sites of moderate affinity for a quick capture and transfer of iron to other proteins and for the regulation of Fe-S cluster biosynthesis, modulating interactions with partner proteins.
Kumar, Avishek; Campitelli, Paul; Thorpe, M F; Ozkan, S Banu
2015-12-01
The most successful protein structure prediction methods to date have been template-based modeling (TBM) or homology modeling, which predicts protein structure based on experimental structures. These high accuracy predictions sometimes retain structural errors due to incorrect templates or a lack of accurate templates in the case of low sequence similarity, making these structures inadequate in drug-design studies or molecular dynamics simulations. We have developed a new physics-based approach to the protein refinement problem by mimicking the mechanism of chaperones that rehabilitate misfolded proteins. The template structure is unfolded by selectively (targeted) pulling on different portions of the protein using the geometry-based technique FRODA, and then refolded using hierarchically restrained replica exchange molecular dynamics simulations (hr-REMD). FRODA unfolding is used to create a diverse set of topologies for surveying near native-like structures from a template and to provide a set of persistent contacts to be employed during re-folding. We have tested our approach on 13 previous CASP targets and observed that this method of folding an ensemble of partially unfolded structures, through the hierarchical addition of contact restraints (that is, first local and then nonlocal interactions), leads to a refolding of the structure along with refinement in most cases (12/13). Although this approach yields refined models through advancement in sampling, the task of blind selection of the best refined models still needs to be solved. Overall, the method can be useful for improved sampling for low resolution models where certain portions of the structure are incorrectly modeled. © 2015 Wiley Periodicals, Inc.
Structural alignment of protein descriptors - a combinatorial model.
Antczak, Maciej; Kasprzak, Marta; Lukasiak, Piotr; Blazewicz, Jacek
2016-09-17
Structural alignment of proteins is one of the most challenging problems in molecular biology. The tertiary structure of a protein strictly correlates with its function and computationally predicted structures are nowadays a main premise for understanding the latter. However, computationally derived 3D models often exhibit deviations from the native structure. A way to confirm a model is a comparison with other structures. The structural alignment of a pair of proteins can be defined with the use of a concept of protein descriptors. The protein descriptors are local substructures of protein molecules, which allow us to divide the original problem into a set of subproblems and, consequently, to propose a more efficient algorithmic solution. In the literature, one can find many applications of the descriptors concept that prove its usefulness for insight into protein 3D structures, but the proposed approaches are presented rather from the biological perspective than from the computational or algorithmic point of view. Efficient algorithms for identification and structural comparison of descriptors can become crucial components of methods for structural quality assessment as well as tertiary structure prediction. In this paper, we propose a new combinatorial model and new polynomial-time algorithms for the structural alignment of descriptors. The model is based on the maximum-size assignment problem, which we define here and prove that it can be solved in polynomial time. We demonstrate suitability of this approach by comparison with an exact backtracking algorithm. Besides a simplification coming from the combinatorial modeling, both on the conceptual and complexity level, we gain with this approach high quality of obtained results, in terms of 3D alignment accuracy and processing efficiency. 
All the proposed algorithms were developed and integrated in a computationally efficient tool descs-standalone, which allows the user to identify and structurally compare descriptors of biological molecules, such as proteins and RNAs. Both PDB (Protein Data Bank) and mmCIF (macromolecular Crystallographic Information File) formats are supported. The proposed tool is available as an open source project stored on GitHub ( https://github.com/mantczak/descs-standalone ).
Structural genomics reveals EVE as a new ASCH/PUA-related domain
Bertonati, Claudia; Punta, Marco; Fischer, Markus; Yachdav, Guy; Forouhar, Farhad; Zhou, Weihong; Kuzin, Alexander P.; Seetharaman, Jayaraman; Abashidze, Mariam; Ramelot, Theresa A.; Kennedy, Michael A.; Cort, John R.; Belachew, Adam; Hunt, John F.; Tong, Liang; Montelione, Gaetano T.; Rost, Burkhard
2014-01-01
We report on several proteins recently solved by structural genomics consortia, in particular by the Northeast Structural Genomics consortium (NESG). The proteins considered in this study differ substantially in their sequences but they share a similar structural core, characterized by a pseudobarrel five-stranded beta sheet. This core corresponds to the PUA domain-like architecture in the SCOP database. By connecting sequence information with structural knowledge, we characterize a new subgroup of these proteins that we propose to be distinctly different from previously described PUA domain-like domains such as PUA proper or ASCH. We refer to these newly defined domains as EVE. Although EVE may have retained the ability of PUA domains to bind RNA, the available experimental and computational data suggests that both the details of its molecular function and its cellular function differ from those of other PUA domain-like domains. This study of EVE and its relatives illustrates how the combination of structure and genomics creates new insights by connecting a cornucopia of structures that map to the same evolutionary potential. Primary sequence information alone would have not been sufficient to reveal these evolutionary links. PMID:19191354
Structural Genomics Reveals EVE as a New ASCH/PUA-Related Domain
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bertonati, C.; Punta, M; Fischer, M
2008-01-01
We report on several proteins recently solved by structural genomics consortia, in particular by the Northeast Structural Genomics consortium (NESG). The proteins considered in this study differ substantially in their sequences but they share a similar structural core, characterized by a pseudobarrel five-stranded beta sheet. This core corresponds to the PUA domain-like architecture in the SCOP database. By connecting sequence information with structural knowledge, we characterize a new subgroup of these proteins that we propose to be distinctly different from previously described PUA domain-like domains such as PUA proper or ASCH. We refer to these newly defined domains as EVE. Although EVE may have retained the ability of PUA domains to bind RNA, the available experimental and computational data suggests that both the details of its molecular function and its cellular function differ from those of other PUA domain-like domains. This study of EVE and its relatives illustrates how the combination of structure and genomics creates new insights by connecting a cornucopia of structures that map to the same evolutionary potential. Primary sequence information alone would have not been sufficient to reveal these evolutionary links.
Krojer, Tobias; Talon, Romain; Pearce, Nicholas; Collins, Patrick; Douangamath, Alice; Brandao-Neto, Jose; Dias, Alexandre; Marsden, Brian; von Delft, Frank
2017-03-01
XChemExplorer (XCE) is a data-management and workflow tool to support large-scale simultaneous analysis of protein-ligand complexes during structure-based ligand discovery (SBLD). The user interfaces of established crystallographic software packages such as CCP4 [Winn et al. (2011), Acta Cryst. D67, 235-242] or PHENIX [Adams et al. (2010), Acta Cryst. D66, 213-221] have entrenched the paradigm that a 'project' is concerned with solving one structure. This does not hold for SBLD, where many almost identical structures need to be solved and analysed quickly in one batch of work. Functionality to track progress and annotate structures is essential. XCE provides an intuitive graphical user interface which guides the user from data processing, initial map calculation, ligand identification and refinement up until data dissemination. It provides multiple entry points depending on the need of each project, enables batch processing of multiple data sets and records metadata, progress and annotations in an SQLite database. XCE is freely available and works on any Linux and Mac OS X system, and the only dependency is to have the latest version of CCP4 installed. The design and usage of this tool are described here, and its usefulness is demonstrated in the context of fragment-screening campaigns at the Diamond Light Source. It is routinely used to analyse projects comprising 1000 data sets or more, and therefore scales well to even very large ligand-design projects.
Schieferstein, Jeremy M.; Pawate, Ashtamurthy S.; Wan, Frank; Sheraden, Paige N.; Broecker, Jana; Ernst, Oliver P.; Gennis, Robert B.
2017-01-01
Elucidating and clarifying the function of membrane proteins ultimately requires atomic resolution structures as determined most commonly by X-ray crystallography. Many high impact membrane protein structures have resulted from advanced techniques such as in meso crystallization that present technical difficulties for the set-up and scale-out of high-throughput crystallization experiments. In prior work, we designed a novel, low-throughput X-ray transparent microfluidic device that automated the mixing of protein and lipid by diffusion for in meso crystallization trials. Here, we report X-ray transparent microfluidic devices for high-throughput crystallization screening and optimization that overcome the limitations of scale and demonstrate their application to the crystallization of several membrane proteins. Two complementary chips are presented: (1) a high-throughput screening chip to test 192 crystallization conditions in parallel using as little as 8 nl of membrane protein per well and (2) a crystallization optimization chip to rapidly optimize preliminary crystallization hits through fine-gradient re-screening. We screened three membrane proteins for new in meso crystallization conditions, identifying several preliminary hits that we tested for X-ray diffraction quality. Further, we identified and optimized the crystallization condition for a photosynthetic reaction center mutant and solved its structure to a resolution of 3.5 Å. PMID:28469762
Caffrey, Martin; Li, Dianfan; Dukkipati, Abhiram
2012-01-01
The crystal structure of the β2-adrenergic receptor in complex with an agonist and its cognate G protein has just recently been solved. It is now possible to explore in molecular detail the means by which this paradigmatic transmembrane receptor binds agonist, communicates the impulse or signalling event across the membrane and sets in motion a series of G protein-directed intracellular responses. The structure was determined using crystals of the ternary complex grown in a rationally designed lipidic mesophase by the so-called in meso method. The method is proving to be particularly useful in the G protein-coupled receptor field where the structures of thirteen distinct receptor types have been solved in the past five years. In addition to receptors, the method has proven useful with a wide variety of integral membrane protein classes that include bacterial and eukaryotic rhodopsins, a light harvesting complex II (LHII), photosynthetic reaction centers, cytochrome oxidases, β-barrels, an exchanger, and an integral membrane peptide. This attests to the versatility and range of the method and supports the view that the in meso method should be included in the arsenal of the serious membrane structural biologist. For this to happen however, the reluctance in adopting it attributable, in part, to the anticipated difficulties associated with handling the sticky, viscous cubic mesophase in which crystals grow must be overcome. Harvesting and collecting diffraction data with the mesophase-grown crystals is also viewed with some trepidation. It is acknowledged that there are challenges associated with the method. Over the years, we have endeavored to establish how the method works at a molecular level and to make it user-friendly. To these ends, tools for handling the mesophase in the pico- to nano-liter volume range have been developed for highly efficient crystallization screening in manual and robotic modes. 
Methods have been implemented for evaluating the functional activity of membrane proteins reconstituted into the bilayer of the cubic phase as a prelude to crystallogenesis. Glass crystallization plates have been built that provide unparalleled optical quality and sensitivity to nascent crystals. Lipid and precipitant screens have been designed for a more rational approach to crystallogenesis such that the method can now be applied to an even wider variety of membrane protein types. In this Current Topics article, these assorted advances are outlined along with a summary of the membrane proteins that have yielded to the method. The prospects for and the challenges that must be overcome to further develop the method are described. PMID:22783824
Mechanisms of protein-folding diseases at a glance.
Valastyan, Julie S; Lindquist, Susan
2014-01-01
For a protein to function appropriately, it must first achieve its proper conformation and location within the crowded environment inside the cell. Multiple chaperone systems are required to fold proteins correctly. In addition, degradation pathways participate by destroying improperly folded proteins. The intricacy of this multisystem process provides many opportunities for error. Furthermore, mutations cause misfolded, nonfunctional forms of proteins to accumulate. As a result, many pathological conditions are fundamentally rooted in the protein-folding problem that all cells must solve to maintain their function and integrity. Here, to illustrate the breadth of this phenomenon, we describe five examples of protein-misfolding events that can lead to disease: improper degradation, mislocalization, dominant-negative mutations, structural alterations that establish novel toxic functions, and amyloid accumulation. In each case, we will highlight current therapeutic options for battling such diseases.
2014-01-01
Background: The advent of the human genome sequencing project has led to a spurt in the number of protein sequences in the databanks. The success of structure-based drug discovery hinges critically on the availability of structures. Despite significant progress in experimental protein structure determination, the sequence-structure gap is continually widening. Data-driven, homology-based computational methods have proved successful in predicting tertiary structures for sequences sharing medium to high sequence similarity. As the similarity of query sequences to known structures dwindles, advanced homology/ab initio hybrid approaches are being explored to solve the structure prediction problem. Here we describe Bhageerath-H, a homology/ab initio hybrid software/server for predicting protein tertiary structures, with advancing drug design attempts as one of the goals. Results: The Bhageerath-H web server was validated on 75 CASP10 targets, giving TM-scores ≥0.5 in 91% of the cases and Cα RMSDs ≤5 Å from the native in 58% of the targets, which is well above the CASP10 water mark. Comparison with some leading servers demonstrated the uniqueness of the hybrid methodology in effectively sampling conformational space, scoring the best decoys and refining low-resolution models to high and medium resolution. Conclusion: The Bhageerath-H methodology is web enabled for the scientific community as a freely accessible web server. The methodology is fielded in the ongoing CASP11 experiment. PMID:25521245
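The TM-score threshold of 0.5 quoted in this abstract can be made concrete with the standard TM-score definition, in which each aligned residue pair at distance d contributes 1/(1 + (d/d0)²) and d0 = 1.24·(L − 15)^(1/3) − 1.8 Å depends on the target length L. The sketch below is only an illustration under a strong assumption: the per-residue distances are taken as given, for an already aligned and already superposed model. The real score is the maximum over superpositions, which this code does not search for.

```python
def tm_d0(target_length):
    """Length-dependent distance scale d0 (in angstroms) of the TM-score."""
    if target_length <= 15:
        raise ValueError("d0 formula is not meaningful for such short chains")
    d0 = 1.24 * (target_length - 15) ** (1.0 / 3.0) - 1.8
    if d0 <= 0:
        raise ValueError("d0 formula is not meaningful for such short chains")
    return d0


def tm_score(distances, target_length):
    """TM-score for one fixed superposition.

    distances: distances (angstroms) between aligned residue pairs.
    target_length: number of residues in the target (native) structure.
    Unaligned target residues contribute nothing to the sum.
    """
    d0 = tm_d0(target_length)
    total = sum(1.0 / (1.0 + (d / d0) ** 2) for d in distances)
    return total / target_length
```

Because the sum is divided by the full target length, a model that matches only half of the target perfectly scores 0.5, which is why that value is commonly used as a cut-off for a broadly correct fold.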
A global optimization algorithm for protein surface alignment
2010-01-01
Background: A relevant problem in drug design is the comparison and recognition of protein binding sites. Binding-site recognition is generally based on geometry, often combined with physico-chemical properties of the site, since the conformation, size and chemical composition of the protein surface are all relevant for the interaction with a specific ligand. Several matching strategies have been designed for the recognition of protein-ligand binding sites and of protein-protein interfaces, but the problem cannot be considered solved. Results: In this paper we propose a new method for local structural alignment of protein surfaces based on continuous global optimization techniques. Given the three-dimensional structures of two proteins, the method finds the isometric transformation (rotation plus translation) that best superimposes active regions of the two structures. We draw our inspiration from the well-known Iterative Closest Point (ICP) method for three-dimensional (3D) shape registration. Our main contribution is the adoption of a controlled random search as a more efficient global optimization approach, along with a new dissimilarity measure. The reported computational experience and comparison show the viability of the proposed approach. Conclusions: Our method performs well in detecting similarity in binding sites when it in fact exists. In the future we plan to do a more comprehensive evaluation of the method by considering large datasets of non-redundant proteins and applying a clustering technique to the results of all comparisons to classify binding sites. PMID:20920230
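The idea of searching for the rigid transformation that minimises a closest-point dissimilarity, as in ICP, can be illustrated with a deliberately simplified sketch. Everything here is an assumption made for illustration: the dissimilarity is the plain mean closest-point distance (not the authors' new measure), the transformation is restricted to a rotation about the z axis, and an exhaustive grid over angles stands in for the controlled random search described in the abstract.

```python
import math


def rotate_z_translate(points, angle, shift):
    """Apply a rotation about the z axis followed by a translation."""
    c, s = math.cos(angle), math.sin(angle)
    tx, ty, tz = shift
    return [(c * x - s * y + tx, s * x + c * y + ty, z + tz)
            for x, y, z in points]


def mean_closest_distance(moving, fixed):
    """ICP-style dissimilarity: average distance from each moving point
    to its nearest neighbour in the fixed point set."""
    total = 0.0
    for p in moving:
        total += min(math.dist(p, q) for q in fixed)
    return total / len(moving)


def best_rotation(moving, fixed, steps=360):
    """Grid search over z-rotation angles (a stand-in for a real global
    optimiser); returns the angle with the lowest dissimilarity."""
    candidates = (2.0 * math.pi * k / steps for k in range(steps))
    return min(
        candidates,
        key=lambda a: mean_closest_distance(
            rotate_z_translate(moving, a, (0.0, 0.0, 0.0)), fixed),
    )
```

A full method would optimise over all six rigid-body parameters at once, which is where a global strategy such as a controlled random search becomes preferable to a grid.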
Gross, David A.; Snapp, Erik L.; Silver, David L.
2010-01-01
Fat storage-Inducing Transmembrane proteins 1 & 2 (FIT1/FITM1 and FIT2/FITM2) belong to a unique family of evolutionarily conserved proteins localized to the endoplasmic reticulum that are involved in triglyceride lipid droplet formation. FIT proteins have been shown to mediate the partitioning of cellular triglyceride into lipid droplets, but not triglyceride biosynthesis. FIT proteins do not share primary sequence homology with known proteins and no structural information is available to inform on the mechanism by which FIT proteins function. Here, we present the experimentally-solved topological models for FIT1 and FIT2 using N-glycosylation site mapping and indirect immunofluorescence techniques. These methods indicate that both proteins have six-transmembrane-domains with both N- and C-termini localized to the cytosol. Utilizing this model for structure-function analysis, we identified and characterized a gain-of-function mutant of FIT2 (FLL(157-9)AAA) in transmembrane domain 4 that markedly augmented the total number and mean size of lipid droplets. Using limited-trypsin proteolysis we determined that the FLL(157-9)AAA mutant has enhanced trypsin cleavage at K86 relative to wild-type FIT2, indicating a conformational change. Taken together, these studies indicate that FIT2 is a 6 transmembrane domain-containing protein whose conformation likely regulates its activity in mediating lipid droplet formation. PMID:20520733
[Methods of quantitative proteomics].
Kopylov, A T; Zgoda, V G
2007-01-01
In modern science, proteomic analysis is inseparable from other fields of systems biology. Drawing on extensive resources, quantitative proteomics handles a vast amount of information on the molecular mechanisms of life. Advances in proteomics help researchers to solve complex problems of cell signaling, posttranslational modification, structural and functional homology of proteins, molecular diagnostics, etc. More than 40 different methods have been developed in proteomics for the quantitative analysis of proteins. Although each method is unique and has certain advantages and disadvantages, all of them use various isotope labels (tags). In this review we consider the most popular and effective methods, employing chemical modification of proteins as well as metabolic and enzymatic methods of isotope labeling.
Structure and function of POTRA domains of Omp85/TPS superfamily.
Simmerman, Richard F; Dave, Ashita M; Bruce, Barry D
2014-01-01
The Omp85/TPS (outer-membrane protein of 85 kDa/two-partner secretion) superfamily is a ubiquitous and major class of β-barrel proteins. This superfamily is restricted to the outer membranes of gram-negative bacteria, mitochondria, and chloroplasts. The common architecture, with an N-terminus consisting of repeats of soluble polypeptide-transport-associated (POTRA) domains and a C-terminal β-barrel pore is highly conserved. The structures of multiple POTRA domains and one full-length TPS protein have been solved, yet discovering roles of individual POTRA domains has been difficult. This review focuses on similarities and differences between POTRA structures, emphasizing POTRA domains in autotrophic organisms including plants and cyanobacteria. Unique roles, specific for certain POTRA domains, are examined in the context of POTRA location with respect to their attachment to the β-barrel pore, and their degree of biological dispensability. Finally, because many POTRA domains may have the ability to interact with thousands of partner proteins, possible modes of these interactions are also explored. © 2014 Elsevier Inc. All rights reserved.
Zhan, Xuanzhi; Gimenez, Luis E.; Gurevich, Vsevolod V.; Spiller, Benjamin W.
2011-01-01
Arrestins are multi-functional proteins that regulate signaling and trafficking of the majority of G protein-coupled receptors (GPCRs), as well as sub-cellular localization and activity of many other signaling proteins. Here we report the first crystal structure of arrestin-3, solved at 3.0 Å. Arrestin-3 is an elongated two-domain molecule with the overall fold and key inter-domain interactions that hold free protein in the basal conformation similar to the other subtypes. Arrestin-3 is the least selective member of the family, binding a wide variety of GPCRs with high affinity and demonstrating lower preference for active phosphorylated forms of the receptors. In contrast to the other three arrestins, part of the receptor-binding surface in the arrestin-3 C-domain does not form a contiguous β-sheet, consistent with increased flexibility. By swapping the corresponding elements between arrestin-2 and -3 we show that the presence of this loose structure correlates with reduced arrestin selectivity for activated receptor, consistent with a conformational change in this β-sheet upon receptor binding. PMID:21215759
Musyoki, Abednego Moki; Shi, Zhongyu; Xuan, Chunling; Lu, Guangwen; Qi, Jianxun; Gao, Feng; Zheng, Beiwen; Zhang, Qiangmin; Li, Yan; Haywood, Joel; Liu, Cuihua; Yan, Jinghua; Shi, Yi; Gao, George F
2016-11-29
The anchorless fibronectin-binding proteins (FnBPs) are a group of important virulence factors for which the structures are not available and the functions are not well defined. In this study we performed comprehensive studies on a prototypic member of this group: the fibronectin-/fibrinogen-binding protein from Streptococcus suis (FBPS). The structures of the N- and C-terminal halves (FBPS-N and FBPS-C), which together cover the full-length protein in sequence, were solved at resolutions of 2.1 and 2.6 Å, respectively, and each was found to be composed of two domains with unique folds. Furthermore, we have elucidated the organization of these domains by small-angle X-ray scattering. We further showed that the fibronectin-binding site is located in FBPS-C and that FBPS promotes the adherence of S. suis to host cells by attaching the bacteria via FBPS-N. Finally, we demonstrated that FBPS functions both as an adhesin, promoting S. suis attachment to host cells, and as a bacterial factor, activating signaling pathways via β1 integrin receptors to induce chemokine production.
Zebavidin - An Avidin-Like Protein from Zebrafish
Taskinen, Barbara; Zmurko, Joanna; Ojanen, Markus; Kukkurainen, Sampo; Parthiban, Marimuthu; Määttä, Juha A. E.; Leppiniemi, Jenni; Jänis, Janne; Parikka, Mataleena; Turpeinen, Hannu; Rämet, Mika; Pesu, Marko; Johnson, Mark S.; Kulomaa, Markku S.; Airenne, Tomi T.; Hytönen, Vesa P.
2013-01-01
The avidin protein family members are well known for their high affinity towards D-biotin and high structural stability. These properties make avidins valuable tools for a wide range of biotechnology applications. We have identified a new member of the avidin family in the zebrafish (Danio rerio) genome, hereafter called zebavidin. The protein is highly expressed in the gonads of both male and female zebrafish and in the gills of male fish, but our data suggest that zebavidin is not crucial for the developing embryo. Biophysical and structural characterisation of zebavidin revealed distinct properties not found in any previously characterised avidins. Gel filtration chromatography and native mass spectrometry suggest that the protein forms dimers in the absence of biotin at low ionic strength, but assembles into tetramers upon binding biotin. Ligand binding was analysed using radioactive and fluorescently labelled biotin and isothermal titration calorimetry. Moreover, the crystal structure of zebavidin in complex with biotin was solved at 2.4 Å resolution and unveiled unique ligand binding and subunit interface architectures; the atomic-level details support our physicochemical observations. PMID:24204770
Colletier, Jacques-Philippe; Sliwa, Michel; Gallat, François-Xavier; Sugahara, Michihiro; Guillon, Virginia; Schirò, Giorgio; Coquelle, Nicolas; Woodhouse, Joyce; Roux, Laure; Gotthard, Guillaume; Royant, Antoine; Uriarte, Lucas Martinez; Ruckebusch, Cyril; Joti, Yasumasa; Byrdin, Martin; Mizohata, Eiichi; Nango, Eriko; Tanaka, Tomoyuki; Tono, Kensuke; Yabashi, Makina; Adam, Virgile; Cammarata, Marco; Schlichting, Ilme; Bourgeois, Dominique; Weik, Martin
2016-03-03
Reversibly photoswitchable fluorescent proteins find growing applications in cell biology, yet mechanistic details, in particular on the ultrafast photochemical time scale, remain unknown. We employed time-resolved pump-probe absorption spectroscopy on the reversibly photoswitchable fluorescent protein IrisFP in solution to study photoswitching from the nonfluorescent (off) to the fluorescent (on) state. Evidence is provided for the existence of several intermediate states on the pico- and microsecond time scales that are attributed to chromophore isomerization and proton transfer, respectively. Kinetic modeling favors a sequential mechanism with the existence of two excited state intermediates with lifetimes of 2 and 15 ps, the second of which controls the photoswitching quantum yield. In order to support that IrisFP is suited for time-resolved experiments aiming at a structural characterization of these ps intermediates, we used serial femtosecond crystallography at an X-ray free electron laser and solved the structure of IrisFP in its on state. Sample consumption was minimized by embedding crystals in mineral grease, in which they remain photoswitchable. Our spectroscopic and structural results pave the way for time-resolved serial femtosecond crystallography aiming at characterizing the structure of ultrafast intermediates in reversibly photoswitchable fluorescent proteins.
Lee, Woonghee; Stark, Jaime L; Markley, John L
2014-11-01
Peak-picking Of Noe Data Enabled by Restriction Of Shift Assignments-Client Server (PONDEROSA-C/S) builds on the original PONDEROSA software (Lee et al. in Bioinformatics 27:1727-1728. doi: 10.1093/bioinformatics/btr200, 2011) and includes improved features for structure calculation and refinement. PONDEROSA-C/S consists of three programs: Ponderosa Server, Ponderosa Client, and Ponderosa Analyzer. PONDEROSA-C/S takes as input the protein sequence, a list of assigned chemical shifts, and nuclear Overhauser data sets (¹³C- and/or ¹⁵N-NOESY). The output is a set of assigned NOEs and 3D structural models for the protein. Ponderosa Analyzer supports the visualization, validation, and refinement of the results from Ponderosa Server. These tools enable semi-automated NMR-based structure determination of proteins in a rapid and robust fashion. We present examples showing the use of PONDEROSA-C/S in solving structures of four proteins: two that enable comparison with the original PONDEROSA package, and two from the Critical Assessment of automated Structure Determination by NMR (Rosato et al. in Nat Methods 6:625-626. doi: 10.1038/nmeth0909-625, 2009) competition. The software package can be downloaded freely in binary format from http://pine.nmrfam.wisc.edu/download_packages.html. Registered users of the National Magnetic Resonance Facility at Madison can submit jobs to the PONDEROSA-C/S server at http://ponderosa.nmrfam.wisc.edu, where instructions and tutorials can be found. Structures are normally returned within 1-2 days.
Structure of a CLC chloride ion channel by cryo-electron microscopy
Park, Eunyong; Campbell, Ernest B.; MacKinnon, Roderick
2017-01-01
CLC proteins transport chloride (Cl−) ions across cellular membranes to regulate muscle excitability, electrolyte movement across epithelia, and acidification of intracellular organelles. Some CLC proteins are channels that conduct Cl− ions passively, whereas others are secondary active transporters that exchange two Cl− ions for one H+. The structural basis underlying these distinctive transport mechanisms is puzzling because CLC channels and transporters are expected to share the same architecture based on sequence homology. To solve this puzzle we determined the structure of a mammalian CLC channel (CLC-K) using cryo-electron microscopy. A conserved loop in the Cl− transport pathway shows a structure markedly different from that of CLC transporters. Consequently, the cytosolic constriction for Cl− passage is widened in CLC-K such that the kinetic barrier previously postulated for Cl−/H+ transporter function would be reduced. Thus, reduction of a kinetic barrier in CLC channels enables fast flow of Cl− down its electrochemical gradient. PMID:28002411
Esteban-Torres, María; Alvarez, Yanaisis; Acebrón, Iván; de las Rivas, Blanca; Muñoz, Rosario; Kohring, Gert-Wieland; Roa, Ana María; Sobrino, Mónica; Mancheño, José M
2012-09-21
Endogenous galactitol-1-phosphate 5-dehydrogenase (GPDH) (EC 1.1.1.251) from Escherichia coli spontaneously interacts with Ni²⁺-NTA matrices, becoming a potential contaminant for recombinant, target His-tagged proteins. Purified recombinant, untagged GPDH (rGPDH) converted galactitol into tagatose, and d-tagatose-6-phosphate into galactitol-1-phosphate, in a Zn²⁺- and NAD(H)-dependent manner and readily crystallized, which made it possible to solve its crystal structure. In contrast, N-terminally His-tagged GPDH was marginally stable and readily aggregated. The structure of rGPDH revealed metal-binding sites characteristic of the medium-chain dehydrogenase/reductase protein superfamily, which may explain its ability to interact with immobilized metals. The structure also provides clues on the harmful effects of the N-terminal His-tag. Copyright © 2012 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Understanding pre-mRNA splicing through crystallography.
Espinosa, Sara; Zhang, Lingdi; Li, Xueni; Zhao, Rui
2017-08-01
Crystallography is a powerful tool to determine the atomic structures of proteins and RNAs. X-ray crystallography has been used to determine the structure of many splicing related proteins and RNAs, making major contributions to our understanding of the molecular mechanism and regulation of pre-mRNA splicing. Compared to other structural methods, crystallography has its own advantage in the high-resolution structural information it can provide and the unique biological questions it can answer. In addition, two new crystallographic methods - the serial femtosecond crystallography and 3D electron crystallography - were developed to overcome some of the limitations of traditional X-ray crystallography and broaden the range of biological problems that crystallography can solve. This review discusses the theoretical basis, instrument requirements, troubleshooting, and exciting potential of these crystallographic methods to further our understanding of pre-mRNA splicing, a critical event in gene expression of all eukaryotes. Copyright © 2017 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shetty, Nishant D.; Reddy, Manchi C.M.; Palaninathan, Satheesh K.
2010-10-11
PII constitutes a family of signal transduction proteins that act as nitrogen sensors in microorganisms and plants. Mycobacterium tuberculosis (Mtb) has a single homologue of PII whose precise role has as yet not been explored. We have solved the crystal structures of the Mtb PII protein in its apo and ATP bound forms to 1.4 and 2.4 Å resolutions, respectively. The protein forms a trimeric assembly in the crystal lattice and folds similarly to the other PII family proteins. The Mtb PII:ATP binary complex structure reveals three ATP molecules per trimer, each bound between the base of the T-loop of one subunit and the C-loop of the neighboring subunit. In contrast to the apo structure, at least one subunit of the binary complex structure contains a completely ordered T-loop indicating that ATP binding plays a role in orienting this loop region towards target proteins like the ammonium transporter, AmtB. Arg38 of the T-loop makes direct contact with the γ-phosphate of the ATP molecule replacing the Mg²⁺ position seen in the Methanococcus jannaschii GlnK1 structure. The C-loop of a neighboring subunit encloses the other side of the ATP molecule, placing the GlnK specific C-terminal 3₁₀ helix in the vicinity. Homology modeling studies with the E. coli GlnK:AmtB complex reveal that Mtb PII could form a complex similar to the complex in E. coli. The structural conservation and operon organization suggests that the Mtb PII gene encodes for a GlnK protein and might play a key role in the nitrogen regulatory pathway.
A new multi-scale method to reveal hierarchical modular structures in biological networks.
Jiao, Qing-Ju; Huang, Yan; Shen, Hong-Bin
2016-11-15
Biological networks are effective tools for studying molecular interactions. Modular structure, in which genes or proteins may tend to be associated with functional modules or protein complexes, is a remarkable feature of biological networks. Mining modular structure from biological networks enables us to focus on a set of potentially important nodes, which provides a reliable guide to future biological experiments. The first fundamental challenge in mining modular structure from biological networks is that the quality of the observed network data is usually low owing to noise and incompleteness in the obtained networks. The second problem that poses a challenge to existing approaches to the mining of modular structure is that the organization of both functional modules and protein complexes in networks is far more complicated than was ever thought. For instance, the sizes of different modules vary considerably from each other and they often form multi-scale hierarchical structures. To solve these problems, we propose a new multi-scale protocol for mining modular structure (named ISIMB) driven by a node similarity metric, which works in an iteratively converged space to reduce the effects of the low data quality of the observed network data. The multi-scale node similarity metric couples both the local and the global topology of the network with a resolution regulator. By varying this resolution regulator to give different weightings to the local and global terms in the metric, the ISIMB method is able to fit the shape of modules and to detect them on different scales. Experiments on protein-protein interaction and genetic interaction networks show that our method can not only mine functional modules and protein complexes successfully, but can also predict functional modules from specific to general and reveal the hierarchical organization of protein complexes.
X-ray Crystal Structures of the Type IVb Secretion System DotB ATPases.
Prevost, Marie S; Waksman, Gabriel
2018-05-17
Human infections by the intracellular bacterial pathogen Legionella pneumophila result in a severe form of pneumonia, Legionnaires' disease. L. pneumophila utilises a type IVb secretion (T4bS) system termed "dot/icm" to secrete protein effectors to the host cytoplasm. The dot/icm system is powered at least in part by a functionally critical AAA+ ATPase, a protein called DotB, thought to belong to the VirB11 family of proteins. Here we present the crystal structure of DotB at 3.19 Å resolution, in its hexameric form. We observe that DotB is in fact a structural intermediate between VirB11 and PilT family proteins, with a PAS-like N-terminal domain coupled to a RecA-like C-terminal domain. It also shares critical structural elements only found in PilT. The structure also reveals two conformers, termed α and β, with an αβαβαβ configuration. The existence of α and β conformers in this class of proteins was confirmed by solving the structure of DotB from another bacterial pathogen, Yersinia, where, intriguingly, we observed an ααβααβ configuration. The two conformers co-exist regardless of the nucleotide-bound states of the proteins. Our investigation therefore reveals that these ATPases can adopt a wider range of conformational states than was known before, shedding new light on the extraordinary spectrum of conformations these ATPases can access to carry out their function. Overall, the structure of DotB provides a template for further rational drug-design to develop more specific antibiotics to tackle Legionnaires' disease. This article is protected by copyright. All rights reserved. © 2018 The Protein Society.
Abhinand, P A; Shaikh, Faraz; Bhakat, Soumendranath; Radadiya, Ashish; Bhaskar, L V K S; Shah, Anamik; Ragunath, P K
2016-01-01
Methylenetetrahydrofolate reductase (MTHFR) protein catalyzes the only biochemical reaction which produces methyltetrahydrofolate, the active form of folic acid essential for several molecular functions. The Ala222Val polymorphism of human MTHFR encodes a thermolabile protein associated with increased risk of neural tube defects and cardiovascular disease. Experimental studies have shown that the mutation does not affect the kinetic properties of MTHFR, but inactivates the protein by increasing flavin adenine dinucleotide (FAD) loss. The lack of a completely solved crystal structure of MTHFR is an impediment to understanding the structural perturbations caused by the Ala222Val mutation; computational modeling provides a suitable alternative. The three-dimensional structure of human MTHFR protein was obtained through homology modeling, taking the MTHFR structures from Escherichia coli and Thermus thermophilus as templates. Subsequently, the modeled structure was docked with FAD using Glide, which revealed a very good binding affinity, authenticated by a Glide XP score of -10.3983 kcal mol⁻¹. The MTHFR was mutated by changing Alanine 222 to Valine. The wild-type MTHFR-FAD complex and the Ala222Val mutant MTHFR-FAD complex were subjected to molecular dynamics simulation over a 50 ns period. The average difference in backbone root mean square deviation (RMSD) between the wild-type and mutant variants was found to be ~0.11 Å. The greater degree of fluctuation in the mutant protein translates to decreased conformational stability as a result of the mutation. The FAD-binding ability of the mutant MTHFR was also found to be significantly lowered as a result of decreased protein grip caused by increased conformational flexibility. The study provides insights into the Ala222Val mutation of human MTHFR that induces major conformational changes in the tertiary structure, causing a significant reduction in the FAD-binding affinity.
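The backbone RMSD comparison in this abstract rests on a simple formula: the square root of the mean squared distance between corresponding atoms. A minimal sketch follows, assuming the two coordinate sets list the same atoms in the same order and have already been superposed on a common reference. The function name and the plain-tuple coordinate format are illustrative choices and are not taken from the study's software.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean square deviation (same units as the input coordinates)
    between two equally long lists of (x, y, z) tuples."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared = 0.0
    for (ax, ay, az), (bx, by, bz) in zip(coords_a, coords_b):
        squared += (ax - bx) ** 2 + (ay - by) ** 2 + (az - bz) ** 2
    return math.sqrt(squared / len(coords_a))
```

In a trajectory analysis this value would be computed for every frame against a reference structure and then averaged, so that the wild-type and mutant averages can be compared.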
Structural Insights into Ail-Mediated Adhesion in Yersinia pestis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yamashita, Satoshi; Lukacik, Petra; Barnard, Travis J.
2012-01-30
Ail is an outer membrane protein from Yersinia pestis that is highly expressed in a rodent model of bubonic plague, making it a good candidate for vaccine development. Ail is important for attaching to host cells and evading host immune responses, facilitating rapid progression of a plague infection. Binding to host cells is important for injection of cytotoxic Yersinia outer proteins. To learn more about how Ail mediates adhesion, we solved two high-resolution crystal structures of Ail, with no ligand bound and in complex with a heparin analog called sucrose octasulfate. We identified multiple adhesion targets, including laminin and heparin, and showed that a 40 kDa domain of laminin called LG4-5 specifically binds to Ail. We also evaluated the contribution of laminin to delivery of Yops to HEp-2 cells. This work constitutes a structural description of how a bacterial outer membrane protein uses a multivalent approach to bind host cells.
Alsarraf, Husam M. A. B.; Laroche, Fabrice; Spaink, Herman; Thirup, Søren; Blaise, Mickael
2011-01-01
Cell metabolic processes are constantly producing reactive oxygen species (ROS), which have deleterious effects by triggering, for example, DNA damage. Numerous enzymes such as catalase, and small compounds such as vitamin C, provide protection against ROS. The TLDc domain of the human oxidation resistance protein has been shown to be able to protect DNA from oxidative stress; however, its mechanism of action is still not understood and no structural information is available on this domain. Structural information on the TLDc domain may therefore help in understanding exactly how it works. Here, the purification, crystallization and preliminary crystallographic studies of the TLDc domain from zebrafish are reported. Crystals belonging to the orthorhombic space group P2₁2₁2 were obtained and diffracted to 0.97 Å resolution. Selenomethionine-substituted protein could also be crystallized; these crystals diffracted to 1.1 Å resolution and the structure could be solved by SAD/MAD methods. PMID:22102041
Hafsa, Noor E; Arndt, David; Wishart, David S
2015-07-01
The Chemical Shift Index or CSI 3.0 (http://csi3.wishartlab.com) is a web server designed to accurately identify the location of secondary and super-secondary structures in protein chains using only nuclear magnetic resonance (NMR) backbone chemical shifts and their corresponding protein sequence data. Unlike earlier versions of CSI, which only identified three types of secondary structure (helix, β-strand and coil), CSI 3.0 now identifies a total of 11 types of secondary and super-secondary structures, including helices, β-strands, coil regions, five common β-turns (type I, II, I', II' and VIII), β hairpins as well as interior and edge β-strands. CSI 3.0 accepts experimental NMR chemical shift data in multiple formats (NMR Star 2.1, NMR Star 3.1 and SHIFTY) and generates colorful CSI plots (bar graphs) and secondary/super-secondary structure assignments. The output can be readily used as constraints for structure determination and refinement or the images may be used for presentations and publications. CSI 3.0 uses a pipeline of several well-tested, previously published programs to identify the secondary and super-secondary structures in protein chains. Comparisons with secondary and super-secondary structure assignments made via standard coordinate analysis programs such as DSSP, STRIDE and VADAR on high-resolution protein structures solved by X-ray and NMR show >90% agreement with those made by CSI 3.0. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Lee, Jinwoo; Nyenhuis, David A; Nelson, Elizabeth A; Cafiso, David S; White, Judith M; Tamm, Lukas K
2017-09-19
Ebolavirus (EBOV), an enveloped filamentous RNA virus causing severe hemorrhagic fever, enters cells by macropinocytosis and membrane fusion in a late endosomal compartment. Fusion is mediated by the EBOV envelope glycoprotein GP, which consists of subunits GP1 and GP2. GP1 binds to cellular receptors, including Niemann-Pick C1 (NPC1) protein, and GP2 is responsible for low pH-induced membrane fusion. Proteolytic cleavage and NPC1 binding at endosomal pH lead to conformational rearrangements of GP2 that include exposing the hydrophobic fusion loop (FL) for insertion into the cellular target membrane and forming a six-helix bundle structure. Although major portions of the GP2 structure have been solved in pre- and postfusion states and although current models place the transmembrane (TM) and FL domains of GP2 in close proximity at critical steps of membrane fusion, their structures in membrane environments, and especially interactions between them, have not yet been characterized. Here, we present the structure of the membrane proximal external region (MPER) connected to the TM domain: i.e., the missing parts of the EBOV GP2 structure. The structure, solved by solution NMR and EPR spectroscopy in membrane-mimetic environments, consists of a helix-turn-helix architecture that is independent of pH. Moreover, the MPER region is shown to interact in the membrane interface with the previously determined structure of the EBOV FL through several critical aromatic residues. Mutation of aromatic and neighboring residues in both binding partners decreases fusion and viral entry, highlighting the functional importance of the MPER/TM-FL interaction in EBOV entry and fusion.
Masuda, Taro; Zhao, Guanghua; Mikami, Bunzo
2015-01-01
Chitinase hydrolyzes the β-1,4-glycosidic bond in chitin. In higher plants, this enzyme has been regarded as a pathogenesis-related protein. Recently, we identified a class III chitinase, which functions as a calcium storage protein in pomegranate (Punica granatum) seed (PSC, pomegranate seed chitinase). Here, we solved a crystal structure of PSC at 1.6 Å resolution. Although its overall structure, including the structure of the catalytic site and non-proline cis-peptides, was closely similar to those of other class III chitinases, PSC had some unique structural characteristics. First, there were some metal-binding sites with coordinated water molecules on the surface of PSC. Second, many unconserved aspartate residues were present in the PSC sequence, which rendered the surface of PSC negatively charged. This acidic electrostatic property is in contrast to that of hevamine, a well-characterized plant class III chitinase, which has a rather positively charged surface. Thus, the crystal structure provides a clue to the metal-association property of PSC.
Huang, Jianyun; Chen, Shuai; Zhang, J. Jillian; Huang, Xin-Yun
2013-01-01
G protein-coupled receptors (GPCRs) mediate transmembrane signaling. Before ligand binding, GPCRs exist in a basal state. Crystal structures of several GPCRs bound with antagonists or agonists have been solved. However, the crystal structure of the ligand-free basal state of a GPCR, the starting point of GPCR activation and function, has not been determined. Here we report the X-ray crystal structure of the first ligand-free basal state of a GPCR in a lipid membrane-like environment. Oligomeric turkey β1-adrenergic receptors display two alternating dimer interfaces. One interface involves the transmembrane domain (TM) 1, TM2, the C-terminal H8, and the extracellular loop 1. The other interface engages residues from TM4, TM5, the intracellular loop 2 and the extracellular loop 2. Structural comparisons show that this ligand-free state is in an inactive conformation. This provides the structural information regarding GPCR dimerization and oligomerization. PMID:23435379
Sayer, Christopher; Isupov, Michail N; Westlake, Aaron; Littlechild, Jennifer A
2013-04-01
The crystal structures and inhibitor complexes of two industrially important ω-aminotransferase enzymes from Pseudomonas aeruginosa and Chromobacterium violaceum have been determined in order to understand the differences in their substrate specificity. The two enzymes share 30% sequence identity and use the same amino acceptor, pyruvate; however, the Pseudomonas enzyme shows activity towards the amino donor β-alanine, whilst the Chromobacterium enzyme does not. Both enzymes show activity towards S-α-methylbenzylamine (MBA), with the Chromobacterium enzyme having a broader substrate range. The crystal structure of the P. aeruginosa enzyme has been solved in the holo form and with the inhibitor gabaculine bound. The C. violaceum enzyme has been solved in the apo and holo forms and with gabaculine bound. The structures of the holo forms of both enzymes are quite similar. There is little conformational difference observed between the inhibitor complex and the holoenzyme for the P. aeruginosa aminotransferase. In comparison, the crystal structure of the C. violaceum gabaculine complex shows significant structural rearrangements from the structures of both the apo and holo forms of the enzyme. It appears that the different rigidity of the protein scaffold contributes to the substrate specificity observed for the two ω-aminotransferases.
NASA Astrophysics Data System (ADS)
Kim, Duckhoe; Sahin, Ozgur
2015-03-01
Scanning probe microscopes can be used to image and chemically characterize surfaces down to the atomic scale. However, the localized tip-sample interactions in scanning probe microscopes limit high-resolution images to the topmost atomic layer of surfaces, and characterizing the inner structures of materials and biomolecules is a challenge for such instruments. Here, we show that an atomic force microscope can be used to image and three-dimensionally reconstruct chemical groups inside a protein complex. We use short single-stranded DNAs as imaging labels that are linked to target regions inside a protein complex, and T-shaped atomic force microscope cantilevers functionalized with complementary probe DNAs allow the labels to be located with sequence specificity and subnanometre resolution. After measuring pairwise distances between labels, we reconstruct the three-dimensional structure formed by the target chemical groups within the protein complex using simple geometric calculations. Experiments with the biotin-streptavidin complex show that the predicted three-dimensional loci of the carboxylic acid groups of biotins are within 2 Å of their respective loci in the corresponding crystal structure, suggesting that scanning probe microscopes could complement existing structural biological techniques in solving structures that are difficult to study due to their size and complexity.
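The abstract above says the three-dimensional arrangement of the labels is reconstructed from pairwise distances "using simple geometric calculations" but does not give them. The following is a minimal sketch of one such calculation, assuming exactly four labels and all six pairwise distances; the function name `place_four_labels` and the example coordinates are hypothetical and are not taken from the paper. The result is defined only up to rotation, translation and mirror image.

```python
import math
from itertools import combinations


def place_four_labels(d):
    """Place four points in 3D from their six pairwise distances.

    d maps (i, j) with i < j to the measured distance between labels i and j.
    Label 0 is put at the origin, label 1 on the x axis, label 2 in the
    xy plane and label 3 above that plane (z >= 0). The mirror image is an
    equally valid solution, and labels 0, 1 and 2 must not be collinear.
    """
    d01, d02, d03 = d[(0, 1)], d[(0, 2)], d[(0, 3)]
    d12, d13, d23 = d[(1, 2)], d[(1, 3)], d[(2, 3)]

    p0 = (0.0, 0.0, 0.0)
    p1 = (d01, 0.0, 0.0)

    # Law of cosines in the triangle formed by labels 0, 1 and 2.
    x2 = (d01**2 + d02**2 - d12**2) / (2 * d01)
    y2 = math.sqrt(max(d02**2 - x2**2, 0.0))
    p2 = (x2, y2, 0.0)

    # Intersect the three spheres centred on labels 0, 1 and 2.
    x3 = (d01**2 + d03**2 - d13**2) / (2 * d01)
    y3 = (d02**2 + d03**2 - d23**2 - 2 * x2 * x3) / (2 * y2)
    z3 = math.sqrt(max(d03**2 - x3**2 - y3**2, 0.0))
    return [p0, p1, p2, (x3, y3, z3)]


# Hypothetical label positions (arbitrary units), used only to generate distances.
true_points = [(1.0, 2.0, 3.0), (4.0, 0.0, 1.0), (0.0, 5.0, 2.0), (2.0, 2.0, 7.0)]
distances = {
    (i, j): math.dist(true_points[i], true_points[j])
    for i, j in combinations(range(4), 2)
}
placed = place_four_labels(distances)
```

With more than four labels, or with noisy distances, a least-squares method such as multidimensional scaling would be the more usual choice; that is a general observation, not a statement about what the authors did.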
Thakur, Manish Kumar; Kumar, Amit; Birudukota, Swarnakumari; Swaminathan, Srinivasan; Tyagi, Rajiv; Gosu, Ramachandraiah
2016-09-16
Human Protein tyrosine kinase 6 (PTK6) (EC:2.7.10.2), also known as the breast tumor kinase (BRK), is an intracellular non-receptor Src-related tyrosine kinase expressed in a majority of human breast tumors and breast cancer cell lines, but its expression is low or completely absent in normal mammary glands. In the recent past, several studies have suggested that PTK6 is a potential therapeutic target in cancer. To understand its structural and functional properties, the PTK6 kinase domain (PTK6-KD) gene was cloned, overexpressed in a baculo-insect cell system, purified and crystallized at room temperature. X-ray diffraction data to 2.33 Å resolution were collected on a single PTK6-KD crystal, which belonged to the triclinic space group P1. The Matthews coefficient calculation suggested the presence of four protein molecules per asymmetric unit, with a solvent content of ∼50%. The structure has been solved by molecular replacement and the crystal structure data have been submitted to the Protein Data Bank under the accession number 5D7V. This is the first report of an apo PTK6-KD structure crystallized in the DFG-in and αC-helix-out conformation. Copyright © 2016 Elsevier Inc. All rights reserved.
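The Matthews coefficient estimate mentioned above can be sketched as follows. The abstract gives neither the unit-cell parameters nor the molecular weight of the construct, so the numbers below are hypothetical placeholders chosen only to illustrate the arithmetic. The formulas themselves are the standard ones: V_M = V_cell / (Z × MW), and solvent fraction ≈ 1 − 1.23 / V_M.

```python
import math


def cell_volume(a, b, c, alpha, beta, gamma):
    """Unit-cell volume in Å^3 from cell edges (Å) and angles (degrees)."""
    ca, cb, cg = (math.cos(math.radians(x)) for x in (alpha, beta, gamma))
    return a * b * c * math.sqrt(1 - ca**2 - cb**2 - cg**2 + 2 * ca * cb * cg)


def matthews_coefficient(volume, mol_weight, molecules_per_cell):
    """V_M in Å^3 per dalton."""
    return volume / (mol_weight * molecules_per_cell)


def solvent_fraction(v_m):
    """Approximate solvent fraction, using 1.23 Å^3/Da for typical protein density."""
    return 1.0 - 1.23 / v_m


# Hypothetical values (NOT from the paper): a triclinic cell and a ~32 kDa domain.
# In P1 there is a single asymmetric unit per cell, so four molecules per
# asymmetric unit means four molecules per cell.
volume = cell_volume(60.0, 70.0, 80.0, 95.0, 100.0, 105.0)
v_m = matthews_coefficient(volume, 32000.0, 4)
print(f"V_M = {v_m:.2f} Å^3/Da, solvent = {100 * solvent_fraction(v_m):.0f}%")
```

In practice the calculation is repeated for each plausible number of molecules per asymmetric unit, and the count that gives a V_M in the commonly observed range is taken as the most likely one.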
He, Yan; Estephan, Rima; Yang, Xiaomin; Vela, Adriana; Wang, Hsin; Bernard, Cédric; Stark, Ruth E.
2011-01-01
Liver fatty acid-binding protein (LFABP) is a 14-kDa cytosolic polypeptide, differing from other family members in number of ligand binding sites, diversity of bound ligands, and transfer of fatty acid(s) to membranes primarily via aqueous diffusion rather than direct collisional interactions. Distinct two-dimensional 1H-15N NMR signals indicative of slowly exchanging LFABP assemblies formed during stepwise ligand titration were exploited, without solving the protein-ligand complex structures, to yield the stoichiometries for the bound ligands, their locations within the protein binding cavity, the sequence of ligand occupation, and the corresponding protein structural accommodations. Chemical shifts were monitored for wild-type LFABP and a R122L/S124A mutant in which electrostatic interactions viewed as essential to fatty acid binding were removed. For wild-type LFABP the results compared favorably with previous tertiary structures of oleate-bound wild-type LFABP in crystals and in solution: there are two oleates, one U-shaped ligand that positions the long hydrophobic chain deep within the cavity and another extended structure with the hydrophobic chain facing the cavity and the carboxylate group lying close to the protein surface. The NMR titration validated a prior hypothesis that the first oleate to enter the cavity occupies the internal protein site. In contrast, 1H/15N chemical shift changes supported only one liganded oleate for R122L/S124A LFABP, at an intermediate location within the protein cavity. A rationale based on protein sequence and electrostatics was developed to explain the stoichiometry and binding site trends for LFABPs and to put these findings into context within the larger protein family. PMID:21226535
Tsujino, Soichiro; Tomizaki, Takashi
2016-05-06
Increasing the data acquisition rate of X-ray diffraction images for macromolecular crystals at room temperature at synchrotrons has the potential to significantly accelerate both structural analysis of biomolecules and structure-based drug developments. Using lysozyme model crystals, we demonstrated the rapid acquisition of X-ray diffraction datasets by combining a high frame rate pixel array detector with ultrasonic acoustic levitation of protein crystals in liquid droplets. The rapid spinning of the crystal within a levitating droplet ensured an efficient sampling of the reciprocal space. The datasets were processed with a program suite developed for serial femtosecond crystallography (SFX). The structure, which was solved by molecular replacement, was found to be identical to the structure obtained by the conventional oscillation method for up to a 1.8-Å resolution limit. In particular, the absence of protein crystal damage resulting from the acoustic levitation was carefully established. These results represent a key step towards a fully automated sample handling and measurement pipeline, which has promising prospects for a high acquisition rate and high sample efficiency for room temperature X-ray crystallography.
Ultrasonic acoustic levitation for fast frame rate X-ray protein crystallography at room temperature
Tsujino, Soichiro; Tomizaki, Takashi
2016-01-01
Increasing the data acquisition rate of X-ray diffraction images for macromolecular crystals at room temperature at synchrotrons has the potential to significantly accelerate both structural analysis of biomolecules and structure-based drug developments. Using lysozyme model crystals, we demonstrated the rapid acquisition of X-ray diffraction datasets by combining a high frame rate pixel array detector with ultrasonic acoustic levitation of protein crystals in liquid droplets. The rapid spinning of the crystal within a levitating droplet ensured an efficient sampling of the reciprocal space. The datasets were processed with a program suite developed for serial femtosecond crystallography (SFX). The structure, which was solved by molecular replacement, was found to be identical to the structure obtained by the conventional oscillation method for up to a 1.8-Å resolution limit. In particular, the absence of protein crystal damage resulting from the acoustic levitation was carefully established. These results represent a key step towards a fully automated sample handling and measurement pipeline, which has promising prospects for a high acquisition rate and high sample efficiency for room temperature X-ray crystallography. PMID:27150272
Bugge, Katrine; Staby, Lasse; Kemplen, Katherine R; O'Shea, Charlotte; Bendsen, Sidsel K; Jensen, Mikael K; Olsen, Johan G; Skriver, Karen; Kragelund, Birthe B
2018-05-01
Communication within cells relies on a few protein nodes called hubs, which organize vast interactomes with many partners. Frequently, hub proteins are intrinsically disordered conferring multi-specificity and dynamic communication. Conversely, folded hub proteins may organize networks using disordered partners. In this work, the structure of the RST domain, a unique folded hub, is solved by nuclear magnetic resonance spectroscopy and small-angle X-ray scattering, and its complex with a region of the transcription factor DREB2A is provided through data-driven HADDOCK modeling and mutagenesis analysis. The RST fold is unique, but similar structures are identified in the PAH (paired amphipathic helix), TAFH (TATA-box-associated factor homology), and NCBD (nuclear coactivator binding domain) domains. We designate them as a group the αα hubs, as they share an αα-hairpin super-secondary motif, which serves as an organizing platform for malleable helices of varying topology. This allows for partner adaptation, exclusion, and selection. Our findings provide valuable insights into structural features enabling signaling fidelity. Copyright © 2018 Elsevier Ltd. All rights reserved.
Andhirka, Sai Krishna; Vignesh, Ravichandran; Aradhyam, Gopala Krishna
2017-08-01
Deciphering the mechanism of activation of heterotrimeric G proteins by their cognate receptors continues to be an intriguing area of research. The recently solved crystal structure of the ternary complex, which captured the receptor-bound α-subunit in an open conformation without bound nucleotide, has improved our understanding of the activation process. Despite these advancements, the mechanism by which the receptor causes GDP release from the α-subunit remains elusive. To elucidate the mechanism of activation, we studied guanine nucleotide-induced structural stability of the α-subunit (in response to thermal/chaotrope-mediated stress). Inherent stabilities of the inactive (GDP-bound) and active (GTP-bound) forms contribute antagonistically to the difference in conformational stability: whereas the GDP-bound protein is able to switch to a stable intermediate state, the GTP-bound protein loses this ability. Partial perturbation of the protein fold reveals the underlying influence of the bound nucleotide, providing an insight into the mechanism of activation. An extra stable, pretransition intermediate, 'empty pocket' state (conformationally active-state like) in the unfolding pathway of GDP-bound protein mimics a gating system - the activation process having to overcome this stable intermediate state. We demonstrate that a relatively more complex conformational fold of the GDP-bound protein is at the core of the gating system. We report capturing this threshold, 'metastable empty pocket' conformation (the gate) of the α-subunit of G protein and hypothesize that the receptor activates the G protein by enabling it to achieve this structure through mild structural perturbation. © 2017 Federation of European Biochemical Societies.
Structure of the human voltage-dependent anion channel
Bayrhuber, Monika; Meins, Thomas; Habeck, Michael; Becker, Stefan; Giller, Karin; Villinger, Saskia; Vonrhein, Clemens; Griesinger, Christian; Zweckstetter, Markus; Zeth, Kornelius
2008-01-01
The voltage-dependent anion channel (VDAC), also known as mitochondrial porin, is the most abundant protein in the mitochondrial outer membrane (MOM). VDAC is the channel known to guide the metabolic flux across the MOM and plays a key role in mitochondrially induced apoptosis. Here, we present the 3D structure of human VDAC1, which was solved conjointly by NMR spectroscopy and x-ray crystallography. Human VDAC1 (hVDAC1) adopts a β-barrel architecture composed of 19 β-strands with an α-helix located horizontally midway within the pore. Bioinformatic analysis indicates that this channel architecture is common to all VDAC proteins and is adopted by the general import pore TOM40 of mammals, which is also located in the MOM. PMID:18832158
Christensen, Signe; Horowitz, Scott; Bardwell, James C.A.; Olsen, Johan G.; Willemoës, Martin; Lindorff-Larsen, Kresten; Ferkinghoff-Borg, Jesper; Hamelryck, Thomas; Winther, Jakob R.
2017-01-01
Despite the development of powerful computational tools, the full-sequence design of proteins still remains a challenging task. To investigate the limits and capabilities of computational tools, we conducted a study of the ability of the program Rosetta to predict sequences that recreate the authentic fold of thioredoxin. Focusing on the influence of conformational details in the template structures, we based our study on 8 experimentally determined template structures and generated 120 designs from each. For experimental evaluation, we chose six sequences from each of the eight templates by objective criteria. The 48 selected sequences were evaluated based on their progressive ability to (1) produce soluble protein in Escherichia coli and (2) yield stable monomeric protein, and (3) on the ability of the stable, soluble proteins to adopt the target fold. Of the 48 designs, we were able to synthesize 32, 20 of which resulted in soluble protein. Of these, only two were sufficiently stable to be purified. An X-ray crystal structure was solved for one of the designs, revealing a close resemblance to the target structure. We found a significant difference among the eight template structures to realize the above three criteria despite their high structural similarity. Thus, in order to improve the success rate of computational full-sequence design methods, we recommend that multiple template structures are used. Furthermore, this study shows that special care should be taken when optimizing the geometry of a structure prior to computational design when using a method that is based on rigid conformations. PMID:27659562
Johansson, Kristoffer E; Tidemand Johansen, Nicolai; Christensen, Signe; Horowitz, Scott; Bardwell, James C A; Olsen, Johan G; Willemoës, Martin; Lindorff-Larsen, Kresten; Ferkinghoff-Borg, Jesper; Hamelryck, Thomas; Winther, Jakob R
2016-10-23
Despite the development of powerful computational tools, the full-sequence design of proteins still remains a challenging task. To investigate the limits and capabilities of computational tools, we conducted a study of the ability of the program Rosetta to predict sequences that recreate the authentic fold of thioredoxin. Focusing on the influence of conformational details in the template structures, we based our study on 8 experimentally determined template structures and generated 120 designs from each. For experimental evaluation, we chose six sequences from each of the eight templates by objective criteria. The 48 selected sequences were evaluated based on their progressive ability to (1) produce soluble protein in Escherichia coli and (2) yield stable monomeric protein, and (3) on the ability of the stable, soluble proteins to adopt the target fold. Of the 48 designs, we were able to synthesize 32, 20 of which resulted in soluble protein. Of these, only two were sufficiently stable to be purified. An X-ray crystal structure was solved for one of the designs, revealing a close resemblance to the target structure. We found a significant difference among the eight template structures to realize the above three criteria despite their high structural similarity. Thus, in order to improve the success rate of computational full-sequence design methods, we recommend that multiple template structures are used. Furthermore, this study shows that special care should be taken when optimizing the geometry of a structure prior to computational design when using a method that is based on rigid conformations. Copyright © 2016 Elsevier Ltd. All rights reserved.
Tan, Yen Hock; Huang, He; Kihara, Daisuke
2006-08-15
Aligning distantly related protein sequences is a long-standing problem in bioinformatics, and a key for successful protein structure prediction. Its importance is increasing recently in the context of structural genomics projects because more and more experimentally solved structures are available as templates for protein structure modeling. Toward this end, recent structure prediction methods employ profile-profile alignments, and various ways of aligning two profiles have been developed. More fundamentally, a better amino acid similarity matrix can improve a profile itself; thereby resulting in more accurate profile-profile alignments. Here we have developed novel amino acid similarity matrices from knowledge-based amino acid contact potentials. Contact potentials are used because the contact propensity to the other amino acids would be one of the most conserved features of each position of a protein structure. The derived amino acid similarity matrices are tested on benchmark alignments at three different levels, namely, the family, the superfamily, and the fold level. Compared to BLOSUM45 and the other existing matrices, the contact potential-based matrices perform comparably in the family level alignments, but clearly outperform in the fold level alignments. The contact potential-based matrices perform even better when suboptimal alignments are considered. Comparing the matrices themselves with each other revealed that the contact potential-based matrices are very different from BLOSUM45 and the other matrices, indicating that they are located in a different basin in the amino acid similarity matrix space.
Crystal structure of TBC1D15 GTPase-activating protein (GAP) domain and its activity on Rab GTPases.
Chen, Yan-Na; Gu, Xin; Zhou, X Edward; Wang, Weidong; Cheng, Dandan; Ge, Yinghua; Ye, Fei; Xu, H Eric; Lv, Zhengbing
2017-04-01
TBC1D15 belongs to the TBC (Tre-2/Bub2/Cdc16) domain family and functions as a GTPase-activating protein (GAP) for Rab GTPases. So far, the structure of TBC1D15 or the TBC1D15·Rab complex has not been determined; thus, its catalytic mechanism on Rab GTPases is still unclear. In this study, we solved the crystal structures of the Shark and Sus TBC1D15 GAP domains, to 2.8 Å and 2.5 Å resolution, respectively. Shark-TBC1D15 and Sus-TBC1D15 belong to the same subfamily of TBC domain-containing proteins, and their GAP-domain structures are highly similar. This demonstrates the evolutionary conservation of the TBC1D15 protein family. Meanwhile, the newly determined crystal structures display new variations compared to the structures of yeast Gyp1p Rab GAP domain and TBC1D1. GAP assays show that Shark and Sus GAPs both have higher catalytic activity on Rab11a·GTP than Rab7a·GTP, which differs from the previous study. We also demonstrated the importance of arginine and glutamine in the catalytic sites of Shark GAP and Sus GAP. When arginine and glutamine are changed to alanine or lysine, the activities of Shark GAP and Sus GAP are lost. © 2017 The Protein Society.
Quantum-mechanics-derived 13Cα chemical shift server (CheShift) for protein structure validation
Vila, Jorge A.; Arnautova, Yelena A.; Martin, Osvaldo A.; Scheraga, Harold A.
2009-01-01
A server (CheShift) has been developed to predict 13Cα chemical shifts of protein structures. It is based on the generation of 696,916 conformations as a function of the φ, ψ, ω, χ1 and χ2 torsional angles for all 20 naturally occurring amino acids. Their 13Cα chemical shifts were computed at the DFT level of theory with a small basis set and extrapolated, with an empirically-determined linear regression formula, to reproduce the values obtained with a larger basis set. Analysis of the accuracy and sensitivity of the CheShift predictions, in terms of both the correlation coefficient R and the conformational-averaged rmsd between the observed and predicted 13Cα chemical shifts, was carried out for 3 sets of conformations: (i) 36 x-ray-derived protein structures solved at 2.3 Å or better resolution, for which sets of 13Cα chemical shifts were available; (ii) 15 pairs of x-ray and NMR-derived sets of protein conformations; and (iii) a set of decoys for 3 proteins showing an rmsd with respect to the x-ray structure from which they were derived of up to 3 Å. Comparative analysis carried out with 4 popular servers, namely SHIFTS, SHIFTX, SPARTA, and PROSHIFT, for these 3 sets of conformations demonstrated that CheShift is the most sensitive server with which to detect subtle differences between protein models and, hence, to validate protein structures determined by either x-ray or NMR methods, if the observed 13Cα chemical shifts are available. CheShift is available as a web server. PMID:19805131
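The two agreement measures named above, the correlation coefficient R and the rmsd between observed and predicted 13Cα chemical shifts, can be illustrated with a short sketch. This is not CheShift code: the function names and the shift values are hypothetical, and the sketch compares a single list of predictions, whereas the abstract refers to a conformationally averaged rmsd over sets of conformations.

```python
import math


def pearson_r(observed, predicted):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_o * var_p)


def rmsd(observed, predicted):
    """Root-mean-square deviation between two equal-length lists."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)


# Hypothetical 13Cα shifts in ppm, for illustration only.
observed_shifts = [58.1, 55.3, 62.0, 52.7, 56.4]
predicted_shifts = [57.6, 55.9, 61.2, 53.5, 56.0]

print(f"R = {pearson_r(observed_shifts, predicted_shifts):.3f}")
print(f"rmsd = {rmsd(observed_shifts, predicted_shifts):.3f} ppm")
```

The two numbers answer different questions: a constant offset between prediction and observation leaves R at 1 but raises the rmsd, which is one reason to report both.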
Hao, Xiaohu; Zhang, Guijun; Zhou, Xiaogen
2018-04-01
Computing protein conformations, which is essential for associating structural and functional information with gene sequences, is challenging due to the high dimensionality and rugged energy surface of the protein conformational space. Consequently, the dimension of the protein conformational space should be reduced to a proper level, and an effective exploration algorithm is needed. In this paper, a plug-in method for guiding exploration in conformational feature space with Lipschitz underestimation (LUE) for ab initio protein structure prediction is proposed. The conformational space is first converted into ultrafast shape recognition (USR) feature space. Based on the USR feature space, the conformational space can be further converted into an underestimation space according to Lipschitz estimation theory for guiding exploration. As a consequence of the use of the underestimation model, the tight lower-bound estimate can be used for exploration guidance, invalid sampling areas can be eliminated in advance, and the number of energy function evaluations can be reduced. The proposed method provides a novel technique to solve the exploration problem of protein conformational space. LUE is applied to the differential evolution (DE) algorithm and to the Metropolis Monte Carlo (MMC) algorithm available in Rosetta. When LUE is applied to DE and MMC, each candidate conformation is screened by the underestimation method prior to energy calculation and selection. Further, LUE is compared with DE and MMC by testing on 15 small-to-medium structurally diverse proteins. Test results show that near-native protein structures with higher accuracy can be obtained more rapidly and efficiently with the use of LUE. Copyright © 2018 Elsevier Ltd. All rights reserved.
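The screening idea described above can be sketched in a few lines. This is a generic illustration of a Lipschitz lower bound, not the authors' implementation: the Lipschitz constant `k`, the feature vectors and the toy energy function are all assumptions made for the example. If an energy function f satisfies |f(x) − f(y)| ≤ k·‖x − y‖, then every already-evaluated sample (x_i, f_i) gives the bound f(x) ≥ f_i − k·‖x − x_i‖, and the largest of these bounds is the tightest.

```python
import math


def lipschitz_lower_bound(x, samples, k):
    """Tightest lower bound on f(x) implied by evaluated samples.

    samples is a list of (feature_vector, energy) pairs and k is an assumed
    Lipschitz constant of the energy in feature space.
    """
    return max(f_i - k * math.dist(x, x_i) for x_i, f_i in samples)


def worth_evaluating(x, samples, k, current_energy):
    """Screen a candidate before spending an expensive energy evaluation.

    If even the lower bound is no better than the energy to beat, the
    candidate cannot win a greedy selection step and can be skipped.
    """
    return lipschitz_lower_bound(x, samples, k) < current_energy


def toy_energy(x):
    """Toy one-dimensional 'energy' |x| with Lipschitz constant 1 (illustration only)."""
    return abs(x[0])


samples = [((-2.0,), toy_energy((-2.0,))), ((3.0,), toy_energy((3.0,)))]
current_energy = 0.5

for candidate in [(5.0,), (0.2,)]:
    decision = "evaluate" if worth_evaluating(candidate, samples, 1.0, current_energy) else "skip"
    print(candidate, decision)
```

The bound is only valid if `k` really is at least as large as the true Lipschitz constant; an underestimate of `k` can discard good candidates, which is presumably why the choice of feature space matters in the method described.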
Structural Basis for Activation of Fatty Acid-binding Protein 4
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gillilan, R.; Ayers, S.; Noy, N.
2007-01-01
Fatty acid-binding protein 4 (FABP4) delivers ligands from the cytosol to the nuclear receptor PPARγ in the nucleus, thereby enhancing the transcriptional activity of the receptor. Notably, FABP4 binds multiple ligands with a similar affinity but its nuclear translocation is activated only by specific compounds. To gain insight into the structural features that underlie the ligand-specificity in activation of the nuclear import of FABP4, we solved the crystal structures of the protein complexed with two compounds that induce its nuclear translocation, and compared these to the apo-protein and to FABP4 structures bound to non-activating ligands. Examination of these structures indicates that activation coincides with closure of a portal loop phenylalanine side-chain, contraction of the binding pocket, a subtle shift in a helical domain containing the nuclear localization signal of the protein, and a resultant change in oligomeric state that exposes the nuclear localization signal to the solution. Comparisons of backbone displacements induced by activating ligands with a measure of mobility derived from translation, libration, screw (TLS) refinement, and with a composite of slowest normal modes of the apo state suggest that the helical motion associated with the activation of the protein is part of the repertoire of the equilibrium motions of the apo-protein, i.e. that ligand binding does not induce the activated configuration but serves to stabilize it. Nuclear import of FABP4 can thus be understood in terms of the pre-existing equilibrium hypothesis of ligand binding.
Enz, Ralf
2012-01-01
Metabotropic glutamate receptors (mGluRs) regulate intracellular signal pathways that control several physiological tasks, including neuronal excitability, learning, and memory. This is achieved by the formation of synaptic signal complexes, in which mGluRs assemble with functionally related proteins such as enzymes, scaffolds, and cytoskeletal anchor proteins. Thus, mGluR associated proteins actively participate in the regulation of glutamatergic neurotransmission. Importantly, dysfunction of mGluRs and interacting proteins may lead to impaired signal transduction and finally result in neurological disorders, e.g., night blindness, addiction, epilepsy, schizophrenia, autism spectrum disorders and Parkinson's disease. In contrast to solved crystal structures of extracellular N-terminal domains of some mGluR types, only a few studies analyzed the conformation of intracellular receptor domains. Intracellular C-termini of most mGluR types are subject to alternative splicing and can be further modified by phosphorylation and SUMOylation. In this way, diverse interaction sites for intracellular proteins that bind to and regulate the glutamate receptors are generated. Indeed, most of the known mGluR binding partners interact with the receptors' C-terminal domains. Within the last years, different laboratories analyzed the structure of these domains and described the geometry of the contact surface between mGluR C-termini and interacting proteins. Here, I will review recent progress in the structure characterization of mGluR C-termini and provide an up-to-date summary of the geometry of these domains in contact with binding partners.
Characterization and assembly of a GFP-tagged cylindriform silk into hexameric complexes.
Öster, Carl; Svensson Bonde, Johan; Bülow, Leif; Dicko, Cedric
2014-04-01
Spider silk has been studied extensively for its attractive mechanical properties and potential applications in medicine and industry. The production of spider silk, however, has been lagging behind for lack of suitable systems. Our approach focuses on solving the production of spider silk by designing, expressing, purifying and characterizing the silk from cylindriform glands. We show that the cylindriform silk protein, in contrast to the commonly used dragline silk protein, is fully folded and stable in solution. With the help of GFP as a fusion tag we enhanced the expression of the silk protein in Escherichia coli and could optimize the downstream processing. Secondary-structure analysis by circular dichroism and FTIR shows that the GFP-silk fusion protein is predominantly α-helical, and that pH can trigger an α-to-β transition resulting in aggregation. Structural analysis by small-angle X-ray scattering suggests that the GFP-silk fusion exists in the form of a hexamer in solution. Copyright © 2013 Wiley Periodicals, Inc.
The Protein Micro-Crystallography Beamlines for Targeted Protein Research Program
NASA Astrophysics Data System (ADS)
Hirata, Kunio; Yamamoto, Masaki; Matsugaki, Naohiro; Wakatsuki, Soichi
In order to collect proper diffraction data from outstanding micro-crystals, a brand-new data collection system should be designed to provide a high signal-to-noise ratio in diffraction images. SPring-8 and KEK-PF are currently developing two micro-beam beamlines for the Targeted Proteins Research Program by MEXT of Japan. The program aims to reveal the structure and function of proteins that are difficult to solve but have great importance in both academic research and industrial application. At SPring-8, a new 1-micron beam beamline for protein micro-crystallography, RIKEN Targeted Proteins Beamline (BL32XU), is being developed. At KEK-PF, a new low-energy micro-beam beamline, BL-1A, is dedicated to SAD micro-crystallography. The two beamlines will start operation at the end of 2010. The present status of the research and development for protein micro-crystallography will be presented.
RBind: computational network method to predict RNA binding sites.
Wang, Kaili; Jian, Yiren; Wang, Huiwen; Zeng, Chen; Zhao, Yunjie
2018-04-26
Non-coding RNA molecules play essential roles by interacting with other molecules to perform various biological functions. However, it is difficult to determine RNA structures due to their flexibility. At present, the number of experimentally solved RNA-ligand and RNA-protein structures is still insufficient. Therefore, binding-site prediction for non-coding RNA is required to understand their functions. Current RNA binding-site prediction algorithms produce many false-positive nucleotides that are distant from the binding sites. Here, we present a network approach, RBind, to predict RNA binding sites. We benchmarked RBind on RNA-ligand and RNA-protein datasets. The average accuracy of 0.82 in RNA-ligand and 0.63 in RNA-protein testing showed that this network strategy has a reliable accuracy for binding-site prediction. The codes and datasets are available at https://zhaolab.com.cn/RBind. Contact: yjzhaowh@mail.ccnu.edu.cn. Supplementary data are available at Bioinformatics online.
SIMBAD : a sequence-independent molecular-replacement pipeline
Simpkin, Adam J.; Simkovic, Felix; Thomas, Jens M. H.; ...
2018-06-08
The conventional approach to finding structurally similar search models for use in molecular replacement (MR) is to use the sequence of the target to search against those of a set of known structures. Sequence similarity often correlates with structure similarity. Given sufficient similarity, a known structure correctly positioned in the target cell by the MR process can provide an approximation to the unknown phases of the target. An alternative approach to identifying homologous structures suitable for MR is to exploit the measured data directly, comparing the lattice parameters or the experimentally derived structure-factor amplitudes with those of known structures. Here, SIMBAD, a new sequence-independent MR pipeline which implements these approaches, is presented. SIMBAD can identify cases of contaminant crystallization and other mishaps such as mistaken identity (swapped crystallization trays), as well as solving unsequenced targets and providing a brute-force approach where sequence-dependent search-model identification may be nontrivial, for example because of conformational diversity among identifiable homologues. The program implements a three-step pipeline to efficiently identify a suitable search model in a database of known structures. The first step performs a lattice-parameter search against the entire Protein Data Bank (PDB), rapidly determining whether or not a homologue exists in the same crystal form. The second step is designed to screen the target data for the presence of a crystallized contaminant, a not uncommon occurrence in macromolecular crystallography. Solving structures with MR in such cases can remain problematic for many years, since the search models, which are assumed to be similar to the structure of interest, are not necessarily related to the structures that have actually crystallized. To cater for this eventuality, SIMBAD rapidly screens the data against a database of known contaminant structures. Where the first two steps fail to yield a solution, a final step in SIMBAD can be invoked to perform a brute-force search of a nonredundant PDB database provided by the MoRDa MR software. Through early-access usage of SIMBAD, this approach has solved novel cases that have otherwise proved difficult to solve.
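The first step of the pipeline described above, the lattice-parameter search, can be illustrated with a minimal sketch. This is not SIMBAD's implementation: the `Cell` class, the `cell_matches` and `lattice_screen` functions, the tolerance values, and the entry names are all hypothetical, and the sketch compares cell parameters directly, whereas a real search would first have to bring cells into a common reduced setting.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cell:
    """Unit-cell parameters: lengths in angstroms, angles in degrees."""
    a: float
    b: float
    c: float
    alpha: float
    beta: float
    gamma: float


def cell_matches(target, candidate, length_tol=0.05, angle_tol=2.0):
    """Return True if every length agrees within a relative tolerance
    and every angle agrees within an absolute tolerance (both assumed values)."""
    lengths_ok = all(
        abs(t - c) <= length_tol * t
        for t, c in zip((target.a, target.b, target.c),
                        (candidate.a, candidate.b, candidate.c))
    )
    angles_ok = all(
        abs(t - c) <= angle_tol
        for t, c in zip((target.alpha, target.beta, target.gamma),
                        (candidate.alpha, candidate.beta, candidate.gamma))
    )
    return lengths_ok and angles_ok


def lattice_screen(target, database, length_tol=0.05, angle_tol=2.0):
    """Return the identifiers of database entries whose cell resembles the target."""
    return [
        entry_id
        for entry_id, cell in database.items()
        if cell_matches(target, cell, length_tol, angle_tol)
    ]


if __name__ == "__main__":
    # Hypothetical target and database entries, for illustration only.
    target = Cell(126.1, 47.1, 68.8, 90.0, 100.5, 90.0)
    database = {
        "entry_A": Cell(125.0, 47.5, 69.0, 90.0, 100.9, 90.0),
        "entry_B": Cell(60.0, 60.0, 90.0, 90.0, 90.0, 120.0),
    }
    print(lattice_screen(target, database))
```

Because only six numbers per entry are compared, a screen of this kind is cheap enough to run against a whole structure database before any more expensive search is attempted, which is consistent with the abstract's description of the first step as rapid.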
Radakovics, Katharina; Smith, Terry K.; Bobik, Nina; Round, Adam; Djinović-Carugo, Kristina; Usón, Isabel
2016-01-01
Vaccinia virus interferes with early events of the activation pathway of the transcriptional factor NF-kB by binding to numerous host TIR-domain containing adaptor proteins. We have previously determined the X-ray structure of the A46 C-terminal domain; however, the structure and function of the A46 N-terminal domain and its relationship to the C-terminal domain have remained unclear. Here, we biophysically characterize residues 1–83 of the N-terminal domain of A46 and present the X-ray structure at 1.55 Å. Crystallographic phases were obtained by a recently developed ab initio method entitled ARCIMBOLDO_BORGES that employs tertiary structure libraries extracted from the Protein Data Bank; data analysis revealed an all β-sheet structure. This is the first such structure solved by this method which should be applicable to any protein composed entirely of β-sheets. The A46(1–83) structure itself is a β-sandwich containing a co-purified molecule of myristic acid inside a hydrophobic pocket and represents a previously unknown lipid-binding fold. Mass spectrometry analysis confirmed the presence of long-chain fatty acids in both N-terminal and full-length A46; mutation of the hydrophobic pocket reduced the lipid content. Using a combination of high resolution X-ray structures of the N- and C-terminal domains and SAXS analysis of full-length protein A46(1–240), we present here a structural model of A46 in a tetrameric assembly. Integrating affinity measurements and structural data, we propose how A46 simultaneously interferes with several TIR-domain containing proteins to inhibit NF-κB activation and postulate that A46 employs a bipartite binding arrangement to sequester the host immune adaptors TRAM and MyD88. PMID:27973613
DOE Office of Scientific and Technical Information (OSTI.GOV)
Muench, Stephen P.; Prigge, Sean T.; McLeod, Rima
2007-03-01
The crystal structures of T. gondii and P. falciparum ENR in complex with NAD⁺ and triclosan and of T. gondii ENR in an apo form have been solved to 2.6, 2.2 and 2.8 Å, respectively. Recent studies have demonstrated that submicromolar concentrations of the biocide triclosan arrest the growth of the apicomplexan parasites Plasmodium falciparum and Toxoplasma gondii and inhibit the activity of the apicomplexan enoyl acyl carrier protein reductase (ENR). The crystal structures of T. gondii and P. falciparum ENR in complex with NAD⁺ and triclosan and of T. gondii ENR in an apo form have been solved to 2.6, 2.2 and 2.8 Å, respectively. The structures of T. gondii ENR have revealed that, as in its bacterial and plant homologues, a loop region which flanks the active site becomes ordered upon inhibitor binding, resulting in the slow tight binding of triclosan. In addition, the T. gondii ENR–triclosan complex reveals the folding of a hydrophilic insert common to the apicomplexan family that flanks the substrate-binding domain and is disordered in all other reported apicomplexan ENR structures. Structural comparison of the apicomplexan ENR structures with their bacterial and plant counterparts has revealed that although the active sites of the parasite enzymes are broadly similar to those of their bacterial counterparts, there are a number of important differences within the drug-binding pocket that reduce the packing interactions formed with several inhibitors in the apicomplexan ENR enzymes. Together with other significant structural differences, this provides a possible explanation of the lower affinity of the parasite ENR enzyme family for aminopyridine-based inhibitors, suggesting that an effective antiparasitic agent may well be distinct from equivalent antimicrobials.
Modeling the assembly order of multimeric heteroprotein complexes.
Peterson, Lenna X; Togawa, Yoichiro; Esquivel-Rodriguez, Juan; Terashi, Genki; Christoffer, Charles; Roy, Amitava; Shin, Woong-Hee; Kihara, Daisuke
2018-01-01
Protein-protein interactions are the cornerstone of numerous biological processes. Although an increasing number of protein complex structures have been determined using experimental methods, relatively fewer studies have been performed to determine the assembly order of complexes. In addition to the insights into the molecular mechanisms of biological function provided by the structure of a complex, knowing the assembly order is important for understanding the process of complex formation. Assembly order is also practically useful for constructing subcomplexes as a step toward solving the entire complex experimentally, designing artificial protein complexes, and developing drugs that interrupt a critical step in the complex assembly. There are several experimental methods for determining the assembly order of complexes; however, these techniques are resource-intensive. Here, we present a computational method that predicts the assembly order of protein complexes by building the complex structure. The method, named Path-LZerD, uses a multimeric protein docking algorithm that assembles a protein complex structure from individual subunit structures and predicts assembly order by observing the simulated assembly process of the complex. Benchmarked on a dataset of complexes with experimental evidence of assembly order, Path-LZerD was successful in predicting the assembly pathway for the majority of the cases. Moreover, when compared with a simple approach that infers the assembly path from the buried surface area of subunits in the native complex, Path-LZerD has the strong advantage that it can be used for cases where the complex structure is not known. The path prediction accuracy decreased when starting from unbound monomers, particularly for larger complexes of five or more subunits, for which only a part of the assembly path was correctly identified. 
As the first method of its kind, Path-LZerD opens a new area of computational protein structure modeling and will be an indispensable approach for studying protein complexes.
Inhibition of Pancreatic Cancer Cell Proliferation by LRH-1 Inhibitors
2014-12-01
coordinates and structure factors have been deposited in the Protein Data Bank, www.pdb.org [ PDB ID codes 4QJR (SF-1/PIP3) and 4QK4 (SF-1/PIP2)]. 1To whom...with Rfree/Rcryst values of 23/19% (Table S2). The structure was deposited with the PDB ID code 4QJR. SF 1/PIP3 (Fig. 1C) adopts the classic NR LBD...PIP2) was solved by molecular replacement, using PDB ID code 1YOW as the search model, and compared with the SF 1/PIP3 structure (Table S2). The
Crystal structures of the methyltransferase and helicase from the ZIKA 1947 MR766 Uganda strain
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bukrejewska, Malgorzata; Derewenda, Urszula; Radwanska, Malwina
2017-08-15
Two nonstructural proteins encoded by Zika virus strain MR766 RNA, a methyltransferase and a helicase, were crystallized and their structures were solved and refined at 2.10 and 2.01 Å resolution, respectively. The NS5 methyltransferase contains a bound S-adenosyl-L-methionine (SAM) co-substrate. The NS3 helicase is in the apo form. Comparison with published crystal structures of the helicase in the apo, nucleotide-bound and single-stranded RNA (ssRNA)-bound states suggests that binding of ssRNA to the helicase may occur through conformational selection rather than induced fit.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rodríguez Guilbe, María M.; Protein Research and Development Center, University of Puerto Rico; Alfaro Malavé, Elisa C.
The genetically encoded fluorescent calcium-indicator protein GCaMP2 was crystallized in the calcium-saturated form. X-ray diffraction data were collected to 2.0 Å resolution and the structure was solved by molecular replacement. Fluorescent proteins and their engineered variants have played an important role in the study of biology. The genetically encoded calcium-indicator protein GCaMP2 comprises a circularly permuted fluorescent protein coupled to the calcium-binding protein calmodulin and a calmodulin target peptide, M13, derived from the intracellular calmodulin target myosin light-chain kinase and has been used to image calcium transients in vivo. To aid rational efforts to engineer improved variants of GCaMP2, this protein was crystallized in the calcium-saturated form. X-ray diffraction data were collected to 2.0 Å resolution. The crystals belong to space group C2, with unit-cell parameters a = 126.1, b = 47.1, c = 68.8 Å, β = 100.5° and one GCaMP2 molecule in the asymmetric unit. The structure was phased by molecular replacement and refinement is currently under way.
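As a worked illustration of the unit-cell parameters quoted above, the volume of a monoclinic cell (α = γ = 90°, as in space group C2) follows from the standard formula V = a·b·c·sin β. The short sketch below applies it to the reported values; the function name is an assumption made for this example and does not come from the original record.

```python
import math


def monoclinic_cell_volume(a, b, c, beta_deg):
    """Volume of a monoclinic unit cell: V = a * b * c * sin(beta).

    Lengths are in angstroms and beta is in degrees; alpha and gamma
    are fixed at 90 degrees in the monoclinic system.
    """
    return a * b * c * math.sin(math.radians(beta_deg))


if __name__ == "__main__":
    # Unit-cell parameters reported for the GCaMP2 crystals.
    volume = monoclinic_cell_volume(126.1, 47.1, 68.8, 100.5)
    print(f"Unit-cell volume: {volume:.0f} cubic angstroms")
```

With these parameters the volume comes to roughly 4.0 × 10^5 Å³. Going from there to a solvent-content estimate would additionally need the molecular mass of the crystallized construct, which the abstract does not give.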
iDBPs: a web server for the identification of DNA binding proteins.
Nimrod, Guy; Schushan, Maya; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir
2010-03-01
The iDBPs server uses the three-dimensional (3D) structure of a query protein to predict whether it binds DNA. First, the algorithm predicts the functional region of the protein based on its evolutionary profile; the assumption is that large clusters of conserved residues are good markers of functional regions. Next, various characteristics of the predicted functional region as well as global features of the protein are calculated, such as the average surface electrostatic potential, the dipole moment and cluster-based amino acid conservation patterns. Finally, a random forests classifier is used to predict whether the query protein is likely to bind DNA and to estimate the prediction confidence. We have trained and tested the classifier on various datasets and shown that it outperformed related methods. On a dataset that reflects the fraction of DNA binding proteins (DBPs) in a proteome, the area under the ROC curve was 0.90. The application of the server to an updated version of the N-Func database, which contains proteins of unknown function with solved 3D-structure, suggested new putative DBPs for experimental studies. http://idbps.tau.ac.il/
Pan, Ying H.; Bahnson, Brian J.
2010-01-01
The properties of three discrete premicellar complexes (E1#, E2#, E3#) of pig pancreatic group-IB secreted phospholipase A2 (sPLA2) with monodisperse alkyl sulfates have been characterized [Berg, O. G., et al., Biochemistry 43, 7999–8013, 2004]. Here we have solved the 2.7 Å crystal structure of group-IB sPLA2 complexed with 12 molecules of octyl sulfate (C8S) in a form consistent with a tetrameric oligomer that exists during the E1# phase of premicellar complexes. The alkyl tails of the C8S molecules are centered in the middle of the tetrameric cluster of sPLA2 subunits. Three of the four sPLA2 subunits also contain a C8S molecule in the active site pocket. The sulfate oxygen of a C8S ligand is complexed to the active site calcium in 3 of the 4 protein active sites. The interactions of the alkyl sulfate head group with Arg-6 and Lys-10, as well as the backbone amide of Met-20, are analogous to those observed in the previously solved sPLA2 crystal structures with bound phosphate and sulfate anions. The cluster of three anions found in the present structure is postulated to be the site for nucleating the binding of anionic amphiphiles to the interfacial surface of the protein, and therefore this binding interaction has implications for interfacial activation of the enzyme. PMID:20302975
Towards fully automated structure-based function prediction in structural genomics: a case study.
Watson, James D; Sanderson, Steve; Ezersky, Alexandra; Savchenko, Alexei; Edwards, Aled; Orengo, Christine; Joachimiak, Andrzej; Laskowski, Roman A; Thornton, Janet M
2007-04-13
As the global Structural Genomics projects have picked up pace, the number of structures annotated in the Protein Data Bank as hypothetical protein or unknown function has grown significantly. A major challenge now involves the development of computational methods to assign functions to these proteins accurately and automatically. As part of the Midwest Center for Structural Genomics (MCSG) we have developed a fully automated functional analysis server, ProFunc, which performs a battery of analyses on a submitted structure. The analyses combine a number of sequence-based and structure-based methods to identify functional clues. After the first stage of the Protein Structure Initiative (PSI), we review the success of the pipeline and the importance of structure-based function prediction. As a dataset, we have chosen all structures solved by the MCSG during the 5 years of the first PSI. Our analysis suggests that two of the structure-based methods are particularly successful and provide examples of local similarity that is difficult to identify using current sequence-based methods. No one method is successful in all cases, so, through the use of a number of complementary sequence and structural approaches, the ProFunc server increases the chances that at least one method will find a significant hit that can help elucidate function. Manual assessment of the results is a time-consuming process and subject to individual interpretation and human error. We present a method based on the Gene Ontology (GO) schema using GO-slims that can allow the automated assessment of hits with a success rate approaching that of expert manual assessment.
Crystal structure of TBC1D15 GTPase‐activating protein (GAP) domain and its activity on Rab GTPases
Chen, Yan‐Na; Gu, Xin; Zhou, X. Edward; Wang, Weidong; Cheng, Dandan; Ge, Yinghua; Ye, Fei
2017-01-01
TBC1D15 belongs to the TBC (Tre‐2/Bub2/Cdc16) domain family and functions as a GTPase‐activating protein (GAP) for Rab GTPases. So far, the structure of TBC1D15 or the TBC1D15·Rab complex has not been determined; thus, its catalytic mechanism on Rab GTPases is still unclear. In this study, we solved the crystal structures of the Shark and Sus TBC1D15 GAP domains, to 2.8 Å and 2.5 Å resolution, respectively. Shark‐TBC1D15 and Sus‐TBC1D15 belong to the same subfamily of TBC domain‐containing proteins, and their GAP‐domain structures are highly similar. This demonstrates the evolutionary conservation of the TBC1D15 protein family. Meanwhile, the newly determined crystal structures display new variations compared to the structures of yeast Gyp1p Rab GAP domain and TBC1D1. GAP assays show that Shark and Sus GAPs both have higher catalytic activity on Rab11a·GTP than Rab7a·GTP, which differs from the previous study. We also demonstrated the importance of arginine and glutamine on the catalytic sites of Shark GAP and Sus GAP. When arginine and glutamine are changed to alanine or lysine, the activities of Shark GAP and Sus GAP are lost. PMID:28168758
DOE Office of Scientific and Technical Information (OSTI.GOV)
Verardi, Raffaello; Kim, Jin-Sik; Ghirlando, Rodolfo
DHHC enzymes catalyze palmitoylation, a major post-translational modification that regulates a number of key cellular processes. There are up to 24 DHHCs in mammals and hundreds of substrate proteins that get palmitoylated. However, how DHHC enzymes engage with their substrates is still poorly understood. There is currently no structural information about the interaction between any DHHC enzyme and protein substrates. In this study we have investigated the structural and thermodynamic bases of interaction between the ankyrin repeat domain of human DHHC17 (ANK17) and Snap25b. We solved a high-resolution crystal structure of the complex between ANK17 and a peptide fragment of Snap25b. Through structure-guided mutagenesis, we discovered key residues in DHHC17 that are critically important for interaction with Snap25b. We further extended our finding by showing that the same residues are also crucial for the interaction of DHHC17 with Huntingtin, one of its most physiologically relevant substrates.
Structure of the full-length glucagon class B G protein-coupled receptor
Zhang, Haonan; Qiao, Anna; Yang, Dehua; Yang, Linlin; Dai, Antao; de Graaf, Chris; Reedtz-Runge, Steffen; Dharmarajan, Venkatasubramanian; Zhang, Hui; Han, Gye Won; Grant, Thomas D.; Sierra, Raymond G.; Weierstall, Uwe; Nelson, Garrett; Liu, Wei; Wu, Yanhong; Ma, Limin; Cai, Xiaoqing; Lin, Guangyao; Wu, Xiaoai; Geng, Zhi; Dong, Yuhui; Song, Gaojie; Griffin, Patrick R.; Lau, Jesper; Cherezov, Vadim; Yang, Huaiyu; Hanson, Michael A.; Stevens, Raymond C.; Zhao, Qiang; Jiang, Hualiang; Wang, Ming-Wei; Wu, Beili
2017-01-01
The human glucagon receptor (GCGR) belongs to the class B G protein-coupled receptor (GPCR) family and plays a key role in glucose homeostasis and the pathophysiology of type 2 diabetes. Here we report the 3.0 Å crystal structure of full-length GCGR containing both extracellular domain (ECD) and transmembrane domain (TMD) in an inactive conformation. The two domains are connected by a 12-residue segment termed the ‘stalk’, which adopts a β-strand conformation, instead of forming an α-helix as observed in the previously solved structure of GCGR-TMD. The first extracellular loop (ECL1) exhibits a β-hairpin conformation and interacts with the stalk to form a compact β-sheet structure. Hydrogen/deuterium exchange, disulfide cross-linking and molecular dynamics studies suggest that the stalk and ECL1 play critical roles in modulating peptide ligand binding and receptor activation. These insights into the full-length GCGR structure deepen our understanding about the signaling mechanisms of class B GPCRs. PMID:28514451
Bijelic, Aleksandar; Molitor, Christian; Mauracher, Stephan G; Al-Oweini, Rami; Kortz, Ulrich; Rompel, Annette
2015-01-19
As synchrotron radiation becomes more intense, detectors become faster and structure-solving software becomes more elaborate, obtaining single crystals suitable for data collection is now the bottleneck in macromolecular crystallography. Hence, there is a need for novel and advanced crystallisation agents with the ability to crystallise proteins that are otherwise challenging. Here, an Anderson-Evans-type polyoxometalate (POM), specifically Na6[TeW6O24]⋅22H2O (TEW), is employed as a crystallisation additive. Its effects on protein crystallisation are demonstrated with hen egg-white lysozyme (HEWL), which co-crystallises with TEW in the vicinity of (or within) the liquid-liquid phase separation (LLPS) region. The X-ray structure (PDB ID: 4PHI) determination revealed that TEW molecules are part of the crystal lattice, thus demonstrating specific binding to HEWL with electrostatic interactions and hydrogen bonds. The negatively charged TEW polyoxotungstate binds to sites with a positive electrostatic potential located between two (or more) symmetry-related protein chains. Thus, TEW facilitates the formation of protein-protein interfaces of otherwise repulsive surfaces, and thereby the realisation of a stable crystal lattice. In addition to retaining the isomorphicity of the protein structure, the anomalous scattering of the POMs was used for macromolecular phasing. The results suggest that hexatungstotellurate(VI) has great potential as a crystallisation additive to promote both protein crystallisation and structure elucidation. © 2014 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
Mercury increases water permeability of a plant aquaporin through a non-cysteine-related mechanism.
Frick, Anna; Järvå, Michael; Ekvall, Mikael; Uzdavinys, Povilas; Nyblom, Maria; Törnroth-Horsefield, Susanna
2013-09-15
Water transport across cellular membranes is mediated by a family of membrane proteins known as AQPs (aquaporins). AQPs were first discovered on the basis of their ability to be inhibited by mercurial compounds, an experiment which has followed the AQP field ever since. Although mercury inhibition is most common, many AQPs are mercury insensitive. In plants, regulation of AQPs is important in order to cope with environmental changes. Plant plasma membrane AQPs are known to be gated by phosphorylation, pH and Ca²⁺. We have previously solved the structure of the spinach AQP SoPIP2;1 (Spinacia oleracea plasma membrane intrinsic protein 2;1) in closed and open conformations and proposed a mechanism for how this gating can be achieved. To study the effect of mercury on SoPIP2;1 we solved the structure of the SoPIP2;1-mercury complex and characterized the water transport ability using proteoliposomes. The structure revealed mercury binding to three out of four cysteine residues. In contrast to what is normally seen for AQPs, mercury increased the water transport rate of SoPIP2;1, an effect which could not be attributed to any of the cysteine residues. This indicates that other factors might influence the effect of mercury on SoPIP2;1, one of which could be the properties of the lipid bilayer.
Cau, Ylenia; Fiorillo, Annarita; Mori, Mattia; Ilari, Andrea; Botta, Maurizo; Lalle, Marco
2015-12-28
Giardiasis is a gastrointestinal diarrheal illness caused by the protozoan parasite Giardia duodenalis, which affects annually over 200 million people worldwide. The limited antigiardial drug arsenal and the emergence of clinical cases refractory to standard treatments dictate the need for new chemotherapeutics. The 14-3-3 family of regulatory proteins, extensively involved in protein-protein interactions (PPIs) with pSer/pThr clients, represents a highly promising target. Despite homology with human counterparts, the single 14-3-3 of G. duodenalis (g14-3-3) is characterized by a constitutive phosphorylation in a region critical for target binding, thus affecting the function and the conformation of g14-3-3/clients interaction. However, to approach the design of specific small molecule modulators of g14-3-3 PPIs, structural elucidations are required. Here, we present a detailed computational and crystallographic study exploring the implications of g14-3-3 phosphorylation on protein structure and target binding. Self-Guided Langevin Dynamics and classical molecular dynamics simulations show that phosphorylation affects locally and globally g14-3-3 conformation, inducing a structural rearrangement more suitable for target binding. Profitable features for g14-3-3/clients interaction were highlighted using a hydrophobicity-based descriptor to characterize g14-3-3 client peptides. Finally, the X-ray structure of g14-3-3 in complex with a mode-1 prototype phosphopeptide was solved and combined with structure-based simulations to identify molecular features relevant for clients binding to g14-3-3. The data presented herein provide a further and structural understanding of g14-3-3 features and set the basis for drug design studies.
Improved method for predicting protein fold patterns with ensemble classifiers.
Chen, W; Liu, X; Huang, Y; Jiang, Y; Zou, Q; Lin, C
2012-01-27
Protein folding is recognized as a critical problem in the field of biophysics in the 21st century. Predicting protein-folding patterns is challenging due to the complex structure of proteins. In an attempt to solve this problem, we employed ensemble classifiers to improve prediction accuracy. In our experiments, 188-dimensional features were extracted based on the composition and physical-chemical property of proteins and 20-dimensional features were selected using a coupled position-specific scoring matrix. Compared with traditional prediction methods, these methods were superior in terms of prediction accuracy. The 188-dimensional feature-based method achieved 71.2% accuracy in five cross-validations. The accuracy rose to 77% when we used a 20-dimensional feature vector. These methods were used on recent data, with 54.2% accuracy. Source codes and dataset, together with web server and software tools for prediction, are available at: http://datamining.xmu.edu.cn/main/~cwc/ProteinPredict.html.
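The ensemble idea in the abstract above can be illustrated with a minimal sketch, assuming the simplest possible combination rule (plurality voting over the labels returned by several classifiers). The abstract does not say which combination rule the authors used, and the classifier outputs and fold labels below are invented purely for illustration.

```python
from collections import Counter


def majority_vote(predictions):
    """Combine per-classifier label lists into one label per sample by plurality vote."""
    combined = []
    for sample_votes in zip(*predictions):
        # Counter.most_common keeps first-seen order among equal counts,
        # so earlier classifiers win ties.
        combined.append(Counter(sample_votes).most_common(1)[0][0])
    return combined


def accuracy(predicted, actual):
    """Fraction of samples whose predicted label matches the true label."""
    return sum(p == a for p, a in zip(predicted, actual)) / len(actual)


# Hypothetical fold-class predictions from three base classifiers on four proteins.
clf_a = ["alpha", "beta", "beta", "beta"]
clf_b = ["alpha", "alpha", "alpha", "beta"]
clf_c = ["beta", "beta", "alpha", "alpha"]
truth = ["alpha", "beta", "alpha", "beta"]

ensemble = majority_vote([clf_a, clf_b, clf_c])
print(accuracy(clf_a, truth), accuracy(clf_b, truth), accuracy(clf_c, truth))
print(accuracy(ensemble, truth))
```

In this toy case each base classifier errs on different samples, so the vote corrects every individual mistake; real ensembles gain less when the base classifiers make correlated errors.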
DOE Office of Scientific and Technical Information (OSTI.GOV)
Germane, Katherine L.; Servinsky, Matthew D.; Gerlach, Elliot S.
2015-07-29
The crystal structure of the protein product of the C. acetobutylicum ATCC 824 gene CA-C0359 is structurally similar to YteR, an unsaturated rhamnogalacturonyl hydrolase from B. subtilis strain 168. Substrate modeling and electrostatic studies of the active site of the structure of CA-C0359 suggest that the protein can now be considered to be part of CAZy glycoside hydrolase family 105. Clostridium acetobutylicum ATCC 824 gene CA-C0359 encodes a putative unsaturated rhamnogalacturonyl hydrolase (URH) with distant amino-acid sequence homology to YteR of Bacillus subtilis strain 168. YteR, like other URHs, has core structural homology to unsaturated glucuronyl hydrolases, but hydrolyzes the unsaturated disaccharide derivative of rhamnogalacturonan I. The crystal structure of the recombinant CA-C0359 protein was solved to 1.6 Å resolution by molecular replacement using the phase information of the previously reported structure of YteR (PDB entry (http://scripts.iucr.org/cgi-bin/cr.cgi?rm)) from Bacillus subtilis strain 168. The YteR-like protein is a six-α-hairpin barrel with two β-sheet strands and a small helix overlaying the end of the hairpins next to the active site. The protein has low primary protein sequence identity to YteR but is structurally similar. The two tertiary structures align with a root-mean-square deviation of 1.4 Å and contain a highly conserved active pocket. There is a conserved aspartic acid residue in both structures, which has been shown to be important for hydration of the C=C bond during the release of unsaturated galacturonic acid by YteR. A surface electrostatic potential comparison of CA-C0359 and proteins from CAZy families GH88 and GH105 reveals the make-up of the active site to be a combination of the unsaturated rhamnogalacturonyl hydrolase and the unsaturated glucuronyl hydrolase from Bacillus subtilis strain 168. Structural and electrostatic comparisons suggest that the protein may have a slightly different substrate specificity from that of YteR.
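For readers unfamiliar with the root-mean-square deviation quoted in the abstract above, the following minimal sketch shows how the number is computed once two structures have been superposed and their atoms paired. It assumes the coordinates are already optimally superposed (the superposition step itself, for example a Kabsch fit, is not shown), and the coordinates are invented for illustration.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equally long lists of (x, y, z) tuples.

    Assumes the two coordinate sets are already superposed and that
    coords_a[i] and coords_b[i] are equivalent atoms.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared / len(coords_a))


# Hypothetical C-alpha positions (in Å): the second set is the first shifted by 1 Å along z.
model_a = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0)]
model_b = [(0.0, 0.0, 1.0), (3.8, 0.0, 1.0)]
print(rmsd(model_a, model_b))
```

Because every atom pair in this toy example is displaced by exactly 1 Å, the RMSD is 1 Å; in a real comparison the value depends on which atoms are paired and how the superposition was done.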
Crystallizing Membrane Proteins Using Lipidic Mesophases
Caffrey, Martin; Cherezov, Vadim
2009-01-01
A detailed protocol for crystallizing membrane proteins that makes use of lipidic mesophases is described. This has variously been referred to as the lipid cubic phase or in meso method. The method has been shown to be quite general in that it has been used to solve X-ray crystallographic structures of prokaryotic and eukaryotic proteins, proteins that are monomeric, homo- and hetero-multimeric, chromophore-containing and chromophore-free, and α-helical and β-barrel proteins. Its most recent successes are the human engineered β2-adrenergic and adenosine A2A G protein-coupled receptors. Protocols are provided for preparing and characterizing the lipidic mesophase, for reconstituting the protein into the monoolein-based mesophase, for functional assay of the protein in the mesophase, and for setting up crystallizations in manual mode. Methods for harvesting micro-crystals are also described. The time required to prepare the protein-loaded mesophase and to set up a crystallization plate manually is about one hour. PMID:19390528
Zhang, Min; Wei, Zhiyi; Chang, Shaojie; Teng, Maikun; Gong, Weimin
2006-04-21
A 31 kDa cysteine protease, SPE31, was isolated from the seeds of a legume plant, Pachyrhizus erosus. The protein was purified, crystallized and the 3D structure solved using molecular replacement. The cDNA was obtained by RT-PCR followed by amplification using mRNA isolated from the seeds of the legume plant as a template. Analysis of the cDNA sequence and the 3D structure indicated the protein to belong to the papain family. Detailed analysis of the structure revealed an unusual replacement of the conserved catalytic Cys with Gly. Replacement of another conserved residue Ala/Gly by a Phe sterically blocks the access of the substrate to the active site. A polyethyleneglycol molecule and a natural peptide fragment were bound to the surface of the active site. Asn159 was found to be glycosylated. The SPE31 cDNA sequence shares several features with P34, a protein found in soybeans, that is implicated in plant defense mechanisms as an elicitor receptor binding to syringolide. P34 has also been shown to interact with vegetative storage proteins and NADH-dependent hydroxypyruvate reductase. These roles suggest that SPE31 and P34 form a unique subfamily within the papain family. The crystal structure of SPE31 complexed with a natural peptide ligand reveals a unique active site architecture. In addition, the clear evidence of glycosylated Asn159 provides useful information towards understanding the functional mechanism of SPE31/P34.
PDBe: towards reusable data delivery infrastructure at protein data bank in Europe
Alhroub, Younes; Anyango, Stephen; Armstrong, David R; Berrisford, John M; Clark, Alice R; Conroy, Matthew J; Dana, Jose M; Gupta, Deepti; Gutmanas, Aleksandras; Haslam, Pauline; Mak, Lora; Mukhopadhyay, Abhik; Nadzirin, Nurul; Paysan-Lafosse, Typhaine; Sehnal, David; Sen, Sanchayita; Smart, Oliver S; Varadi, Mihaly; Kleywegt, Gerard J
2018-01-01
The Protein Data Bank in Europe (PDBe, pdbe.org) is actively engaged in the deposition, annotation, remediation, enrichment and dissemination of macromolecular structure data. This paper describes new developments and improvements at PDBe addressing three challenging areas: data enrichment, data dissemination and functional reusability. New features of the PDBe Web site are discussed, including a context dependent menu providing links to raw experimental data and improved presentation of structures solved by hybrid methods. The paper also summarizes the features of the LiteMol suite, which is a set of services enabling fast and interactive 3D visualization of structures, with associated experimental maps, annotations and quality assessment information. We introduce a library of Web components which can be easily reused to port data and functionality available at PDBe to other services. We also introduce updates to the SIFTS resource which maps PDB data to other bioinformatics resources, and the PDBe REST API. PMID:29126160
Dissecting the telomere-inner nuclear membrane interface formed in meiosis.
Pendlebury, Devon F; Fujiwara, Yasuhiro; Tesmer, Valerie M; Smith, Eric M; Shibuya, Hiroki; Watanabe, Yoshinori; Nandakumar, Jayakrishnan
2017-12-01
Tethering telomeres to the inner nuclear membrane (INM) allows homologous chromosome pairing during meiosis. The meiosis-specific protein TERB1 binds the telomeric protein TRF1 to establish telomere-INM connectivity and is essential for mouse fertility. Here we solve the structure of the human TRF1-TERB1 interface to reveal the structural basis for telomere-INM linkage. Disruption of this interface abrogates binding and compromises telomere-INM attachment in mice. An embedded CDK-phosphorylation site within the TRF1-binding region of TERB1 provides a mechanism for cap exchange, a late-pachytene phenomenon involving the dissociation of the TRF1-TERB1 complex. Indeed, further strengthening this interaction interferes with cap exchange. Finally, our biochemical analysis implicates distinct complexes for telomere-INM tethering and chromosome-end protection during meiosis. Our studies unravel the structure, stoichiometry, and physiological implications underlying telomere-INM tethering, thereby providing unprecedented insights into the unique function of telomeres in meiosis.
Structural and biochemical analyses of YvgN and YtbE from Bacillus subtilis
Lei, Jian; Zhou, Yan-Feng; Li, Lan-Fen; Su, Xiao-Dong
2009-01-01
Bacillus subtilis is one of the most studied gram-positive bacteria. In this work, YvgN and YtbE from B. subtilis, assigned as AKR5G1 and AKR5G2 of the aldo-keto reductase (AKR) superfamily, were studied by crystallographic and enzymatic analyses. AKRs catalyze the NADPH-dependent reduction of aldehyde or aldose substrates to alcohols. The apo structures of these proteins were determined by molecular replacement, and the structure of holoenzyme YvgN with NADPH was also solved, revealing the conformational changes upon cofactor binding. Our biochemical data suggest both YvgN and YtbE have preferential specificity for derivatives of benzaldehyde, such as nitryl or halogen group substitution at the 2 or 4 positions. These proteins also showed broad catalytic activity on many standard substrates of AKR, such as glyoxal, dihydroxyacetone, and DL-glyceraldehyde, suggesting a possible role in bacterial detoxification. PMID:19585557
Plant photosystem I design in the light of evolution.
Amunts, Alexey; Nelson, Nathan
2009-05-13
Photosystem I (PSI) is a membrane protein complex that catalyzes sunlight-driven transmembrane electron transfer as part of the photosynthetic machinery. Photosynthetic organisms appeared on the Earth about 3.5 billion years ago and provided an essential source of potential energy for the development of life. During the course of evolution, these primordial organisms were phagocytosed by more sophisticated eukaryotic cells, resulting in the evolvement of algae and plants. Despite the extended time interval between primordial cyanobacteria and plants, PSI has retained its fundamental mechanism of sunlight conversion. Being probably the most efficient photoelectric apparatus in nature, PSI operates with a quantum efficiency close to 100%. However, adapting to different ecological niches necessitated structural changes in the PSI design. Based on the recently solved structure of plant PSI, which revealed a complex of 17 protein subunits and 178 prosthetic groups, we analyze the evolutionary development of PSI. In addition, some aspects of PSI structure determination are discussed.
The early years of retroviral protease crystal structures.
Miller, Maria
2010-01-01
Soon after its discovery, the attempts to develop anti-AIDS therapeutics focused on the retroviral protease (PR)-an enzyme used by lentiviruses to process the precursor polypeptide into mature viral proteins. An urgent need for the three-dimensional structure of PR to guide rational drug design prompted efforts to produce milligram quantities of this enzyme. However, only minute amounts of PR were present in the HIV-1 and HIV-2 viruses, and initial attempts to express this protein in bacteria were not successful. This review describes X-ray crystallographic studies of the retroviral proteases carried out at NCI-Frederick in the late 1980s and early 1990s and puts into perspective the crucial role that the total protein chemical synthesis played in unraveling the structure, mechanism of action, and inhibition of HIV-1 PR. Notably, the first fully correct structure of HIV-1 PR and the first cocrystal structure of its complex with an inhibitor (a substrate-derived, reduced isostere hexapeptide MVT-101) were determined using chemically synthesized protein. Most importantly, these sets of coordinates were made freely available to the research community and were used worldwide to solve X-ray structures of HIV-1 PR complexes with an array of inhibitors and set in motion a variety of theoretical studies. Publication of the structure of chemically synthesized HIV-1 PR complexed with MVT-101 preceded only by six years the approval of the first PR inhibitor as an anti-AIDS drug. Copyright (c) 2010 Wiley Periodicals, Inc.
Structure of the N-terminal fragment of Escherichia coli Lon protease
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Mi; Gustchina, Alla; Rasulova, Fatima S.
2010-10-22
The structure of a recombinant construct consisting of residues 1-245 of Escherichia coli Lon protease, the prototypical member of the A-type Lon family, is reported. This construct encompasses all or most of the N-terminal domain of the enzyme. The structure was solved by SeMet SAD to 2.6 Å resolution utilizing trigonal crystals that contained one molecule in the asymmetric unit. The molecule consists of two compact subdomains and a very long C-terminal α-helix. The structure of the first subdomain (residues 1-117), which consists mostly of β-strands, is similar to that of the shorter fragment previously expressed and crystallized, whereas the second subdomain is almost entirely helical. The fold and spatial relationship of the two subdomains, with the exception of the C-terminal helix, closely resemble the structure of BPP1347, a 203-amino-acid protein of unknown function from Bordetella parapertussis, and more distantly several other proteins. It was not possible to refine the structure to satisfactory convergence; however, since almost all of the Se atoms could be located on the basis of their anomalous scattering the correctness of the overall structure is not in question. The structure reported here was also compared with the structures of the putative substrate-binding domains of several proteins, showing topological similarities that should help in defining the binding sites used by Lon substrates.
2011-01-01
Many bacterial species contain intracellular nano- and micro-compartments consisting of self-assembling proteins that form protein-only shells. These structures are built up by combinations of a reduced number of repeated elements, from 60 repeated copies of one unique structural element self-assembled in encapsulins of 24 nm to 10,000-20,000 copies of a few protein species assembled in an organelle of around 100-150 nm in cross-section. However, this apparent simplicity does not correspond to the structural and functional sophistication of some of these organelles. They package, by not yet definitely solved mechanisms, one or more enzymes involved in specific metabolic pathways, confining such reactions and sequestering or increasing the inner concentration of unstable, toxic or volatile intermediate metabolites. From a biotechnological point of view, we can use the self-assembling properties of these particles for directing shell assembly and enzyme packaging, mimicking nature to design new applications in biotechnology. Upon appropriate engineering of the building blocks, they could act as a new family of self-assembled, protein-based vehicles in Nanomedicine to encapsulate, target and deliver therapeutic cargoes to specific cell types and/or tissues. This would provide a new, intriguing platform of microbial origin for drug delivery. PMID:22046962
Structural biology of intrinsically disordered proteins: Revisiting unsolved mysteries.
Sigalov, Alexander B
2016-06-01
The emergence of intrinsically disordered proteins (IDPs) has challenged the classical protein structure-function paradigm by introducing a new paradigm of "coupled binding and folding". This paradigm suggests that IDPs fold upon binding to their partners. Further studies, however, revealed a novel and previously unrecognized phenomenon of "uncoupled binding and folding" suggesting that IDPs do not necessarily fold upon interaction with their lipid and protein partners. The complex and often unusual biophysics of IDPs makes structural characterization of these proteins and their complexes not only challenging but often resulting in opposite conclusions. For this reason, some crucial questions in this field remain unsolved for well over a decade. Considering an important role of IDPs in cellular regulation, signaling and control in health and disease, more efforts are needed to solve these mysteries. Here, I focus on two long-standing contradictions in the literature concerning dimerization and membrane-binding activities of IDPs. Molecular explanation of these discrepancies is provided. I also demonstrate how resolution of these critical issues in the field of IDPs results in our expanded understanding of cell function and has multiple applications in biology and medicine. Copyright © 2016 Elsevier B.V. and Société Française de Biochimie et Biologie Moléculaire (SFBBM). All rights reserved.
Rosetta:MSF: a modular framework for multi-state computational protein design.
Löffler, Patrick; Schmitz, Samuel; Hupfeld, Enrico; Sterner, Reinhard; Merkl, Rainer
2017-06-01
Computational protein design (CPD) is a powerful technique to engineer existing proteins or to design novel ones that display desired properties. Rosetta is a software suite including algorithms for computational modeling and analysis of protein structures and offers many elaborate protocols created to solve highly specific tasks of protein engineering. Most of Rosetta's protocols optimize sequences based on a single conformation (i.e. design state). However, challenging CPD objectives like multi-specificity design or the concurrent consideration of positive and negative design goals demand the simultaneous assessment of multiple states. This is why we have developed the multi-state framework MSF that facilitates the implementation of Rosetta's single-state protocols in a multi-state environment and made available two frequently used protocols. Utilizing MSF, we demonstrated for one of these protocols that multi-state design yields a 15% higher performance than single-state design on a ligand-binding benchmark consisting of structural conformations. With this protocol, we designed de novo nine retro-aldolases on a conformational ensemble deduced from a (βα)8-barrel protein. All variants displayed measurable catalytic activity, testifying to a high success rate for this concept of multi-state enzyme design.
Rosetta:MSF: a modular framework for multi-state computational protein design
Hupfeld, Enrico; Sterner, Reinhard
2017-01-01
Computational protein design (CPD) is a powerful technique to engineer existing proteins or to design novel ones that display desired properties. Rosetta is a software suite including algorithms for computational modeling and analysis of protein structures and offers many elaborate protocols created to solve highly specific tasks of protein engineering. Most of Rosetta’s protocols optimize sequences based on a single conformation (i.e. design state). However, challenging CPD objectives like multi-specificity design or the concurrent consideration of positive and negative design goals demand the simultaneous assessment of multiple states. This is why we have developed the multi-state framework MSF that facilitates the implementation of Rosetta’s single-state protocols in a multi-state environment and made available two frequently used protocols. Utilizing MSF, we demonstrated for one of these protocols that multi-state design yields a 15% higher performance than single-state design on a ligand-binding benchmark consisting of structural conformations. With this protocol, we designed de novo nine retro-aldolases on a conformational ensemble deduced from a (βα)8-barrel protein. All variants displayed measurable catalytic activity, testifying to a high success rate for this concept of multi-state enzyme design. PMID:28604768
The Quality and Validation of Structures from Structural Genomics
Domagalski, Marcin J.; Zheng, Heping; Zimmerman, Matthew D.; Dauter, Zbigniew; Wlodawer, Alexander; Minor, Wladek
2014-01-01
Quality control of three-dimensional structures of macromolecules is a critical step to ensure the integrity of structural biology data, especially those produced by structural genomics centers. Whereas the Protein Data Bank (PDB) has proven to be a remarkable success overall, the inconsistent quality of structures reveals a lack of universal standards for structure/deposit validation. Here, we review the state-of-the-art methods used in macromolecular structure validation, focusing on validation of structures determined by X-ray crystallography. We describe some general protocols used in the rebuilding and re-refinement of problematic structural models. We also briefly discuss some frontier areas of structure validation, including refinement of protein–ligand complexes, automation of structure redetermination, and the use of NMR structures and computational models to solve X-ray crystal structures by molecular replacement. PMID:24203341
Three dimensional electron microscopy and in silico tools for macromolecular structure determination
Borkotoky, Subhomoi; Meena, Chetan Kumar; Khan, Mohammad Wahab; Murali, Ayaluru
2013-01-01
Recently, electron microscopy has emerged in structural biology as a major tool for solving the structures of macromolecules, alongside the conventional techniques of X-ray crystallography and nuclear magnetic resonance (NMR). Three-dimensional transmission electron microscopy (3DTEM) is one of the most sophisticated techniques for structure determination of molecular machines. It yields three-dimensional structures of macromolecules in their native form with virtually no upper limit on size, and it does not require crystallization of the protein. Combining 3DTEM data with in silico tools can yield a better-refined structure of a desired complex. In this review we discuss recent advances in three-dimensional electron microscopy and the tools associated with it. PMID:27092033
Structure-based functional annotation: yeast ymr099c codes for a D-hexose-6-phosphate mutarotase.
Graille, Marc; Baltaze, Jean-Pierre; Leulliot, Nicolas; Liger, Dominique; Quevillon-Cheruel, Sophie; van Tilbeurgh, Herman
2006-10-06
Despite the generation of a large amount of sequence information over the last decade, more than 40% of well characterized enzymatic functions still lack associated protein sequences. Assigning protein sequences to documented biochemical functions is an interesting challenge. We illustrate here that structural genomics may be a reasonable approach in addressing these questions. We present the crystal structure of the Saccharomyces cerevisiae YMR099cp, a protein of unknown function. YMR099cp adopts the same fold as galactose mutarotase and shares the same catalytic machinery necessary for the interconversion of the alpha and beta anomers of galactose. The structure revealed the presence in the active site of a sulfate ion attached by an arginine clamp made by the side chain from two strictly conserved arginine residues. This sulfate is ideally positioned to mimic the phosphate group of hexose 6-phosphate. We have subsequently successfully demonstrated that YMR099cp is a hexose-6-phosphate mutarotase with broad substrate specificity. We solved high resolution structures of some substrate enzyme complexes, further confirming our functional hypothesis. The metabolic role of a hexose-6-phosphate mutarotase is discussed. This work illustrates that structural information has been crucial to assign YMR099cp to the orphan EC activity: hexose-phosphate mutarotase.
Chen, Jing-Hua; Yu, Long-Jiang; Boussac, Alain; Wang-Otomo, Zheng-Yu; Kuang, Tingyun; Shen, Jian-Ren
2018-04-24
The thermophilic purple sulfur bacterium Thermochromatium tepidum possesses four main water-soluble redox proteins involved in the electron transfer behavior. Crystal structures have been reported for three of them: a high-potential iron-sulfur protein, cytochrome c', and one of two low-potential cytochromes c552 (which is a flavocytochrome c). In this study, we purified another low-potential cytochrome c552 (LPC), determined its N-terminal amino acid sequence and the whole gene sequence, characterized it with absorption and electron paramagnetic spectroscopy, and solved its high-resolution crystal structure. This novel cytochrome was found to contain five c-type hemes. The overall fold of LPC consists of two distinct domains: one is the five-heme-containing domain and the other is an Ig-like domain. This provides a representative example for the structures of multiheme cytochromes containing an odd number of hemes, although the structures of multiheme cytochromes with an even number of hemes are frequently seen in the PDB database. Comparison of the sequence and structure of LPC with other proteins in the databases revealed several characteristic features which may be important for its functioning. Based on the results obtained, we discuss the possible intracellular function of this LPC in Tch. tepidum.
Verma, Anil Kumar; Goyal, Arun; Freire, Filipe; Bule, Pedro; Venditto, Immacolata; Brás, Joana L. A.; Santos, Helena; Cardoso, Vânia; Bonifácio, Cecília; Thompson, Andrew; Romão, Maria João; Prates, José A. M.; Ferreira, Luís M. A.; Fontes, Carlos M. G. A.; Najmudin, Shabir
2013-01-01
The modular carbohydrate-active enzyme belonging to glycoside hydrolase family 30 (GH30) from Clostridium thermocellum (CtXynGH30) is a cellulosomal protein which plays an important role in plant cell-wall degradation. The full-length CtXynGH30 contains an N-terminal catalytic module (Xyn30A) followed by a family 6 carbohydrate-binding module (CBM6) and a dockerin at the C-terminus. The recombinant protein has a molecular mass of 45 kDa. Preliminary structural characterization was carried out on Xyn30A crystallized in different conditions. All tested crystals belonged to space group P1 with one molecule in the asymmetric unit. Molecular replacement has been used to solve the Xyn30A structure. PMID:24316849
The origin and evolution of human glutaminases and their atypical C-terminal ankyrin repeats
Pasquali, Camila Cristina; Islam, Zeyaul; Adamoski, Douglas; ...
2017-05-19
On the basis of tissue-specific enzyme activity and inhibition by catalytic products, Hans Krebs first demonstrated the existence of multiple glutaminases in mammals. Currently, two human genes are known to encode at least four glutaminase isoforms. But, the phylogeny of these medically relevant enzymes remains unclear, prompting us to investigate their origin and evolution. Using prokaryotic and eukaryotic glutaminase sequences, we built a phylogenetic tree whose topology suggested that the multidomain architecture was inherited from bacterial ancestors, probably simultaneously with the hosting of the proto-mitochondrion endosymbiont. We propose an evolutionary model wherein the appearance of the most active enzyme isoform, glutaminase C (GAC), which is expressed in many cancers, was a late retrotransposition event that occurred in fishes from the Chondrichthyes class. The ankyrin (ANK) repeats in the glutaminases were acquired early in their evolution. In order to obtain information on ANK folding, we solved two high-resolution structures of the ANK repeat-containing C termini of both kidney-type glutaminase (KGA) and GLS2 isoforms (glutaminase B and liver-type glutaminase). We also found that the glutaminase ANK repeats form unique intramolecular contacts through two highly conserved motifs; curiously, this arrangement occludes a region usually involved in ANK-mediated protein-protein interactions. We also solved the crystal structure of full-length KGA and present a small-angle X-ray scattering model for full-length GLS2. These structures explain these proteins' compromised ability to assemble into catalytically active supra-tetrameric filaments, as previously shown for GAC. Collectively, these results provide information about glutaminases that may aid in the design of isoform-specific glutaminase inhibitors.
The origin and evolution of human glutaminases and their atypical C-terminal ankyrin repeats
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pasquali, Camila Cristina; Islam, Zeyaul; Adamoski, Douglas
On the basis of tissue-specific enzyme activity and inhibition by catalytic products, Hans Krebs first demonstrated the existence of multiple glutaminases in mammals. Currently, two human genes are known to encode at least four glutaminase isoforms. But, the phylogeny of these medically relevant enzymes remains unclear, prompting us to investigate their origin and evolution. Using prokaryotic and eukaryotic glutaminase sequences, we built a phylogenetic tree whose topology suggested that the multidomain architecture was inherited from bacterial ancestors, probably simultaneously with the hosting of the proto-mitochondrion endosymbiont. We propose an evolutionary model wherein the appearance of the most active enzyme isoform, glutaminase C (GAC), which is expressed in many cancers, was a late retrotransposition event that occurred in fishes from the Chondrichthyes class. The ankyrin (ANK) repeats in the glutaminases were acquired early in their evolution. In order to obtain information on ANK folding, we solved two high-resolution structures of the ANK repeat-containing C termini of both kidney-type glutaminase (KGA) and GLS2 isoforms (glutaminase B and liver-type glutaminase). We also found that the glutaminase ANK repeats form unique intramolecular contacts through two highly conserved motifs; curiously, this arrangement occludes a region usually involved in ANK-mediated protein-protein interactions. We also solved the crystal structure of full-length KGA and present a small-angle X-ray scattering model for full-length GLS2. These structures explain these proteins' compromised ability to assemble into catalytically active supra-tetrameric filaments, as previously shown for GAC. Collectively, these results provide information about glutaminases that may aid in the design of isoform-specific glutaminase inhibitors.
Rigden, Daniel J; Thomas, Jens M H; Simkovic, Felix; Simpkin, Adam; Winn, Martyn D; Mayans, Olga; Keegan, Ronan M
2018-03-01
Molecular replacement (MR) is the predominant route to solution of the phase problem in macromolecular crystallography. Although routine in many cases, it becomes more effortful and often impossible when the available experimental structures typically used as search models are only distantly homologous to the target. Nevertheless, with current powerful MR software, relatively small core structures shared between the target and known structure, of 20-40% of the overall structure for example, can succeed as search models where they can be isolated. Manual sculpting of such small structural cores is rarely attempted and is dependent on the crystallographer's expertise and understanding of the protein family in question. Automated search-model editing has previously been performed on the basis of sequence alignment, in order to eliminate, for example, side chains or loops that are not present in the target, or on the basis of structural features (e.g. solvent accessibility) or crystallographic parameters (e.g. B factors). Here, based on recent work demonstrating a correlation between evolutionary conservation and protein rigidity/packing, novel automated ways to derive edited search models from a given distant homologue over a range of sizes are presented. A variety of structure-based metrics, many readily obtained from online webservers, can be fed to the MR pipeline AMPLE to produce search models that succeed with a set of test cases where expertly manually edited comparators, further processed in diverse ways with MrBUMP, fail. Further significant performance gains result when the structure-based distance geometry method CONCOORD is used to generate ensembles from the distant homologue. To our knowledge, this is the first such approach whereby a single structure is meaningfully transformed into an ensemble for the purposes of MR. Additional cases further demonstrate the advantages of the approach. 
CONCOORD is freely available and computationally inexpensive, so these novel methods offer readily available new routes to solve difficult MR cases.
Simpkin, Adam; Mayans, Olga; Keegan, Ronan M.
2018-01-01
Molecular replacement (MR) is the predominant route to solution of the phase problem in macromolecular crystallography. Although routine in many cases, it becomes more effortful and often impossible when the available experimental structures typically used as search models are only distantly homologous to the target. Nevertheless, with current powerful MR software, relatively small core structures shared between the target and known structure, of 20–40% of the overall structure for example, can succeed as search models where they can be isolated. Manual sculpting of such small structural cores is rarely attempted and is dependent on the crystallographer’s expertise and understanding of the protein family in question. Automated search-model editing has previously been performed on the basis of sequence alignment, in order to eliminate, for example, side chains or loops that are not present in the target, or on the basis of structural features (e.g. solvent accessibility) or crystallographic parameters (e.g. B factors). Here, based on recent work demonstrating a correlation between evolutionary conservation and protein rigidity/packing, novel automated ways to derive edited search models from a given distant homologue over a range of sizes are presented. A variety of structure-based metrics, many readily obtained from online webservers, can be fed to the MR pipeline AMPLE to produce search models that succeed with a set of test cases where expertly manually edited comparators, further processed in diverse ways with MrBUMP, fail. Further significant performance gains result when the structure-based distance geometry method CONCOORD is used to generate ensembles from the distant homologue. To our knowledge, this is the first such approach whereby a single structure is meaningfully transformed into an ensemble for the purposes of MR. Additional cases further demonstrate the advantages of the approach. 
CONCOORD is freely available and computationally inexpensive, so these novel methods offer readily available new routes to solve difficult MR cases. PMID:29533226
Contact-assisted protein structure modeling by global optimization in CASP11.
Joo, Keehyoung; Joung, InSuk; Cheng, Qianyi; Lee, Sung Jong; Lee, Jooyoung
2016-09-01
We have applied the conformational space annealing method to the contact-assisted protein structure modeling in CASP11. For Tp targets, where predicted residue-residue contact information was provided, the contact energy term in the form of the Lorentzian function was implemented together with the physical energy terms used in our template-free modeling of proteins. Although we observed some structural improvement of Tp models over the models predicted without the Tp information, the improvement was not substantial on average. This is partly due to the inaccuracy of the provided contact information, where only about 18% of it was correct. For Ts targets, where the information of ambiguous NOE (Nuclear Overhauser Effect) restraints was provided, we formulated the modeling in terms of the two-tier optimization problem, which covers: (1) the assignment of NOE peaks and (2) the three-dimensional (3D) model generation based on the assigned NOEs. Although solving the problem in a direct manner appears to be intractable at first glance, we demonstrate through CASP11 that remarkably accurate protein 3D modeling is possible by brute force optimization of a relevant energy function. For 19 Ts targets of the average size of 224 residues, generated protein models were of about 3.6 Å Cα atom accuracy. Even greater structural improvement was observed when additional Tc contact information was provided. For 20 out of the total 24 Tc targets, we were able to generate protein structures which were better than the best model from the rest of the CASP11 groups in terms of GDT-TS. Proteins 2016; 84(Suppl 1):189-199. © 2015 Wiley Periodicals, Inc.
Strecker, Claas; Meyer, Bernd
2018-05-29
Protein flexibility poses a major challenge to docking of potential ligands in that the binding site can adopt different shapes. Docking algorithms usually keep the protein rigid and only allow the ligand to be treated as flexible. However, a wrong assessment of the shape of the binding pocket can prevent a ligand from adopting a correct pose. Ensemble docking is a simple yet promising method to solve this problem: ligands are docked into multiple structures, and the results are subsequently merged. Selection of protein structures is a significant factor for this approach. In this work we perform a comprehensive and comparative study evaluating the impact of structure selection on ensemble docking. We perform ensemble docking with several crystal structures and with structures derived from molecular dynamics simulations of renin, an attractive target for antihypertensive drugs. Here, 500 ns of MD simulations revealed binding site shapes not found in any available crystal structure. We evaluate the importance of structure selection for ensemble docking by comparing binding pose prediction, ability to rank actives above nonactives (screening utility), and scoring accuracy. As a result, for ensemble definition k-means clustering appears to be better suited than hierarchical clustering with average linkage. The best performing ensemble consists of four crystal structures and is able to reproduce the native ligand poses better than any individual crystal structure. Moreover, this ensemble outperforms 88% of all individual crystal structures in terms of screening utility as well as scoring accuracy. Similarly, ensembles of MD-derived structures perform on average better than 75% of the individual crystal structures in terms of scoring accuracy at all inspected ensemble sizes.
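The k-means ensemble definition mentioned in this abstract can be illustrated with a minimal sketch. It assumes each protein structure has already been reduced to a numeric feature vector (for example, binding-site descriptors), clusters those vectors with plain Lloyd's k-means, and keeps the structure closest to each centroid as an ensemble member. The function name, the feature representation, and the seeding are hypothetical choices for illustration; the abstract does not describe the authors' actual protocol.

```python
import math
import random


def kmeans_representatives(points, k, n_iter=50, seed=0):
    """Cluster feature vectors with Lloyd's k-means and return, for each
    non-empty cluster, the index of the point closest to its centroid.

    `points` is a sequence of equal-length numeric tuples (hypothetical
    per-structure descriptors); the returned indices define the ensemble.
    """
    if not 1 <= k <= len(points):
        raise ValueError("k must be between 1 and the number of points")
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    centroids = [list(points[i]) for i in rng.sample(range(len(points)), k)]
    labels = None
    for _ in range(max(1, n_iter)):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, new_labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
    # Pick the member nearest to each centroid as that cluster's representative.
    representatives = []
    for c in range(k):
        idx = [i for i, lab in enumerate(labels) if lab == c]
        if idx:
            representatives.append(
                min(idx, key=lambda i: math.dist(points[i], centroids[c]))
            )
    return sorted(representatives)


# Toy usage: two well-separated groups of three "structures" each.
features = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
ensemble = kmeans_representatives(features, k=2)
```

With such a selection, each ligand would be docked into every representative and the per-structure results merged, as the abstract describes for ensemble docking in general.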
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Suhkmann; Zhang, Ziming; Upchurch, Sean
2004-04-16
ARID is a homologous family of DNA-binding domains that occur in DNA-binding proteins from a wide variety of species, ranging from yeast to nematodes, insects, mammals and plants. SWI1, a member of the SWI/SNF protein complex that is involved in chromatin remodeling during transcription, contains the ARID motif. The ARID domain of human SWI1 (also known as p270) does not select for a specific DNA sequence from a random sequence pool. The lack of sequence specificity shown by the SWI1 ARID domain stands in contrast to the other characterized ARID domains, which recognize specific AT-rich sequences. We have solved the three-dimensional structure of human SWI1 ARID using solution NMR methods. In addition, we have characterized non-specific DNA-binding by the SWI1 ARID domain. Results from this study indicate that a flexible long internal loop in the ARID motif is likely to be important for sequence-specific DNA recognition. The structure of the human SWI1 ARID domain also represents a distinct structural subfamily. Studies of ARID indicate that the boundary of the DNA-binding structural and functional domains can extend beyond the sequence-homologous region in a homologous family of proteins. Structural studies of homologous domains such as the ARID family of DNA-binding domains should provide information to better predict the boundary of structural and functional domains in structural genomic studies. Key Words: ARID, SWI1, NMR, structural genomics, protein-DNA interaction.
Protein crystal structure from non-oriented, single-axis sparse X-ray data
Wierman, Jennifer L.; Lan, Ti-Yen; Tate, Mark W.; ...
2016-01-01
X-ray free-electron lasers (XFELs) have inspired the development of serial femtosecond crystallography (SFX) as a method to solve the structure of proteins. SFX datasets are collected from a sequence of protein microcrystals injected across ultrashort X-ray pulses. The idea behind SFX is that diffraction from the intense, ultrashort X-ray pulses leaves the crystal before the crystal is obliterated by the effects of the X-ray pulse. The success of SFX at XFELs has catalyzed interest in analogous experiments at synchrotron-radiation (SR) sources, where data are collected from many small crystals and the ultrashort pulses are replaced by exposure times that are kept short enough to avoid significant crystal damage. The diffraction signal from each short exposure is so 'sparse' in recorded photons that the process of recording the crystal intensity is itself a reconstruction problem. Using the EMC algorithm, a successful reconstruction is demonstrated here in a sparsity regime where there are no Bragg peaks that conventionally would serve to determine the orientation of the crystal in each exposure. In this proof-of-principle experiment, a hen egg-white lysozyme (HEWL) crystal rotating about a single axis was illuminated by an X-ray beam from an X-ray generator to simulate the diffraction patterns of microcrystals from synchrotron radiation. Millions of these sparse frames, typically containing only ~200 photons per frame, were recorded using a fast-framing detector. It is shown that reconstruction of three-dimensional diffraction intensity is possible using the EMC algorithm, even with these extremely sparse frames and without knowledge of the rotation angle. Further, the reconstructed intensity can be phased and refined to solve the protein structure using traditional crystallographic software. In conclusion, this suggests that synchrotron-based serial crystallography of micrometre-sized crystals can be practical with the aid of the EMC algorithm even in cases where the data are sparse.
Sayer, Christopher; Isupov, Michail N.; Westlake, Aaron; Littlechild, Jennifer A.
2013-01-01
The crystal structures and inhibitor complexes of two industrially important ω-aminotransferase enzymes from Pseudomonas aeruginosa and Chromobacterium violaceum have been determined in order to understand the differences in their substrate specificity. The two enzymes share 30% sequence identity and use the same amino acceptor, pyruvate; however, the Pseudomonas enzyme shows activity towards the amino donor β-alanine, whilst the Chromobacterium enzyme does not. Both enzymes show activity towards S-α-methylbenzylamine (MBA), with the Chromobacterium enzyme having a broader substrate range. The crystal structure of the P. aeruginosa enzyme has been solved in the holo form and with the inhibitor gabaculine bound. The C. violaceum enzyme has been solved in the apo and holo forms and with gabaculine bound. The structures of the holo forms of both enzymes are quite similar. There is little conformational difference observed between the inhibitor complex and the holoenzyme for the P. aeruginosa aminotransferase. In comparison, the crystal structure of the C. violaceum gabaculine complex shows significant structural rearrangements from the structures of both the apo and holo forms of the enzyme. It appears that the different rigidity of the protein scaffold contributes to the substrate specificity observed for the two ω-aminotransferases. PMID:23519665
Structure-based discovery and binding site analysis of histamine receptor ligands.
Kiss, Róbert; Keserű, György M
2016-12-01
The application of structure-based drug discovery in histamine receptor projects was previously hampered by the lack of experimental structures. The publication of the first X-ray structure of the histamine H1 receptor has been followed by several successful virtual screens and binding site analysis studies of H1-antihistamines. This structure together with several other recently solved aminergic G-protein coupled receptors (GPCRs) enabled the development of more realistic homology models for H2, H3 and H4 receptors. Areas covered: In this paper, the authors review the development of histamine receptor models and their application in drug discovery. Expert opinion: In the authors' opinion, the application of atomistic histamine receptor models has played a significant role in understanding key ligand-receptor interactions as well as in the discovery of novel chemical starting points. The recently solved H1 receptor structure is a major milestone in structure-based drug discovery; however, our analysis also demonstrates that for building H3 and H4 receptor homology models, other GPCRs may be more suitable as templates. For these receptors, the authors envisage that the development of higher quality homology models will significantly contribute to the discovery and optimization of novel H3 and H4 ligands.
Structure of a two-CAP-domain protein from the human hookworm parasite Necator americanus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Asojo, Oluwatoyin A., E-mail: oasojo@unmc.edu
2011-05-01
The first structure of a two-CAP-domain protein, Na-ASP-1, from the major human hookworm parasite N. americanus refined to a resolution limit of 2.2 Å is presented. Major proteins secreted by the infective larval stage hookworms upon host entry include Ancylostoma secreted proteins (ASPs), which are characterized by one or two CAP (cysteine-rich secretory protein/antigen 5/pathogenesis related-1) domains. The CAP domain has been reported in diverse phylogenetically unrelated proteins, but has no confirmed function. The first structure of a two-CAP-domain protein, Na-ASP-1, from the major human hookworm parasite Necator americanus was refined to a resolution limit of 2.2 Å. The structure was solved by molecular replacement (MR) using Na-ASP-2, a one-CAP-domain ASP, as the search model. The correct MR solution could only be obtained by truncating the polyalanine model of Na-ASP-2 and removing several loops. The structure reveals two CAP domains linked by an extended loop. Overall, the carboxyl-terminal CAP domain is more similar to Na-ASP-2 than to the amino-terminal CAP domain. A large central cavity extends from the amino-terminal CAP domain to the carboxyl-terminal CAP domain, encompassing the putative CAP-binding cavity. The putative CAP-binding cavity is a characteristic cavity in the carboxyl-terminal CAP domain that contains a His and Glu pair. These residues are conserved in all single-CAP-domain proteins, but are absent in the amino-terminal CAP domain. The conserved His residues are oriented such that they appear to be capable of directly coordinating a zinc ion as observed for CAP proteins from reptile venoms. This first structure of a two-CAP-domain ASP can serve as a template for homology modeling of other two-CAP-domain proteins.
Wei, Wei; Sun, Yang; Zhu, Mingli; Liu, Xiangzhi; Sun, Peiqing; Wang, Feng; Gui, Qiu; Meng, Wuyi; Cao, Yi; Zhao, Jing
2015-12-16
The coordination bond between gold and sulfur (Au-S) has been widely studied and utilized in many fields. However, detailed investigations on the basic nature of this bond are still lacking. A gold-specific binding protein, GolB, was recently identified, providing a unique opportunity for the study of the Au-S bond at the molecular level. We probed the mechanical strength of the gold-sulfur bond in GolB using single-molecule force spectroscopy. We measured the rupture force of the Au-S bond to be 165 pN, much lower than Au-S bonds measured on different gold surfaces (∼1000 pN). We further solved the structures of apo-GolB and Au(I)-GolB complex using X-ray crystallography. These structures showed that the average Au-S bond length in GolB is much longer than the reported average value of Au-S bonds. Our results highlight the dramatic influence of the unique biological environment on the stability and strength of metal coordination bonds in proteins.
Cherezov, Vadim; Hanson, Michael A.; Griffith, Mark T.; Hilgart, Mark C.; Sanishvili, Ruslan; Nagarajan, Venugopalan; Stepanov, Sergey; Fischetti, Robert F.; Kuhn, Peter; Stevens, Raymond C.
2009-01-01
Crystallization of human membrane proteins in lipidic cubic phase often results in very small but highly ordered crystals. Advent of the sub-10 µm minibeam at the APS GM/CA CAT has enabled the collection of high quality diffraction data from such microcrystals. Herein we describe the challenges and solutions related to growing, manipulating and collecting data from optically invisible microcrystals embedded in an opaque frozen in meso material. Of critical importance is the use of the intense and small synchrotron beam to raster through and locate the crystal sample in an efficient and reliable manner. The resulting diffraction patterns have a significant reduction in background, with strong intensity and improvement in diffraction resolution compared with larger beam sizes. Three high-resolution structures of human G protein-coupled receptors serve as evidence of the utility of these techniques that will likely be useful for future structural determination efforts. We anticipate that further innovations of the technologies applied to microcrystallography will enable the solving of structures of ever more challenging targets. PMID:19535414
Structure of Salmonella FlhE, conserved member of a flagellar Type III secretion operon
Lee, Jaemin; Monzingo, Arthur F.; Keatinge-Clay, Adrian T.; ...
2014-12-26
The bacterial flagellum is assembled by a multicomponent transport apparatus categorized as a type III secretion system. The secretion of proteins that assemble into the flagellum is driven by the proton motive force. The periplasmic protein FlhE is a member of the flhBAE operon in the majority of bacteria where FlhE is found. FlhA and FlhB are established components of the flagellar type III secretion system. The absence of FlhE results in a proton leak through the flagellar system, inappropriate secretion patterns, and cell death, indicating that FlhE regulates an important aspect of proper flagellar biosynthesis. We isolated FlhE from the periplasm of Salmonella and solved its structure to 1.5 Å resolution. The structure reveals a β-sandwich fold, with no close structural homologs. Finally, possible roles of FlhE, including that of a chaperone, are discussed.
Structure determination of helical filaments by solid-state NMR spectroscopy
Ahmed, Mumdooh; Spehr, Johannes; König, Renate; Lünsdorf, Heinrich; Rand, Ulfert; Lührs, Thorsten; Ritter, Christiane
2016-01-01
The controlled formation of filamentous protein complexes plays a crucial role in many biological systems and represents an emerging paradigm in signal transduction. The mitochondrial antiviral signaling protein (MAVS) is a central signal transduction hub in innate immunity that is activated by a receptor-induced conversion into helical superstructures (filaments) assembled from its globular caspase activation and recruitment domain. Solid-state NMR (ssNMR) spectroscopy has become one of the most powerful techniques for atomic resolution structures of protein fibrils. However, for helical filaments, the determination of the correct symmetry parameters has remained a significant hurdle for any structural technique and could thus far not be precisely derived from ssNMR data. Here, we solved the atomic resolution structure of helical MAVS CARD filaments exclusively from ssNMR data. We present a generally applicable approach that systematically explores the helical symmetry space by efficient modeling of the helical structure restrained by interprotomer ssNMR distance restraints. Together with classical automated NMR structure calculation, this allowed us to faithfully determine the symmetry that defines the entire assembly. To validate our structure, we probed the protomer arrangement by solvent paramagnetic resonance enhancement, analysis of chemical shift differences relative to the solution NMR structure of the monomer, and mutagenesis. We provide detailed information on the atomic contacts that determine filament stability and describe mechanistic details on the formation of signaling-competent MAVS filaments from inactive monomers. PMID:26733681
Structural Basis of Cyclic Nucleotide Selectivity in cGMP-dependent Protein Kinase II
Campbell, James C.; Kim, Jeong Joo; Li, Kevin Y.; ...
2016-01-14
Membrane-bound cGMP-dependent protein kinase (PKG) II is an important regulator of bone growth, renin secretion, and memory formation. Despite its crucial physiological roles, little is known about its cyclic nucleotide selectivity mechanism due to a lack of structural information. Here, we find that the C-terminal cyclic nucleotide binding (CNB-B) domain of PKGII binds cGMP with higher affinity and selectivity when compared with its N-terminal CNB (CNB-A) domain. To understand the structural basis of cGMP selectivity, we solved co-crystal structures of the CNB domains with cyclic nucleotides. Our structures combined with mutagenesis demonstrate that the guanine-specific contacts at Asp-412 and Arg-415 of the αC-helix of CNB-B are crucial for cGMP selectivity and activation of PKG II. Structural comparison with the cGMP selective CNB domains of human PKG I and Plasmodium falciparum PKG (PfPKG) shows different contacts with the guanine moiety, revealing a unique cGMP selectivity mechanism for PKG II.
A Pipeline Software Architecture for NMR Spectrum Data Translation
Ellis, Heidi J.C.; Weatherby, Gerard; Nowling, Ronald J.; Vyas, Jay; Fenwick, Matthew; Gryk, Michael R.
2012-01-01
The problem of formatting data so that it conforms to the required input for scientific data processing tools pervades scientific computing. The CONNecticut Joint University Research Group (CONNJUR) has developed a data translation tool based on a pipeline architecture that partially solves this problem. The CONNJUR Spectrum Translator supports data format translation for experiments that use Nuclear Magnetic Resonance to determine the structure of large protein molecules. PMID:24634607
Structural Insights into the Phospholipid Binding Specificity of Human Evectin-2
NASA Astrophysics Data System (ADS)
Okazaki, Seiji; Kato, Ryuichi; Wakatsuki, Soichi; Uchida, Yasunori; Taguchi, Tomohiko; Arai, Hiroyuki
Evectin-2 is a recycling endosomal protein that plays an essential role in retrograde transport from recycling endosomes to the trans-Golgi network. The pleckstrin homology (PH) domain of Evectin-2 specifically binds phosphatidylserine (PS), which is enriched in recycling endosomes. To elucidate the molecular mechanism by which it specifically binds PS, we solved the crystal structures of the human Evectin-2 PH domain in the apo and O-phospho-L-serine-complexed forms at 1.75 and 1.00 Å resolution, respectively. These structural analyses show that a PS-induced conformational change of the Evectin-2 PH domain explains the strict phospholipid binding specificity.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gao, Jianzhao; Wu, Zhonghua; Hu, Gang
Selection of proper targets for X-ray crystallography will benefit the biological research community immensely. Several computational models have been proposed to predict the propensity of successful protein production and diffraction-quality crystallization from protein sequences. We reviewed a comprehensive collection of 22 such predictors that were developed in the last decade. We found that almost all of these models are easily accessible as webservers and/or standalone software and we demonstrated that some of them are widely used by the research community. We empirically evaluated and compared the predictive performance of seven representative methods. The analysis suggests that these methods produce quite accurate propensities for diffraction-quality crystallization. We also summarized results of the first study of the relation between these predictive propensities and the resolution of the crystallizable proteins. We found that the propensities predicted by several methods are significantly higher for proteins that have high-resolution structures compared to those with low-resolution structures. Moreover, we tested a new meta-predictor, MetaXXC, which averages the propensities generated by the three most accurate predictors of diffraction-quality crystallization. MetaXXC generates putative values of resolution that have modest levels of correlation with the experimental resolutions and it offers the lowest mean absolute error when compared to the seven considered methods. We conclude that protein sequences can be used to fairly accurately predict whether their corresponding protein structures can be solved using X-ray crystallography. Moreover, we also ascertain that sequences can be used to predict reasonably well the resolution of the resulting protein crystals.
Pan, Ying H; Bahnson, Brian J
2010-07-01
The properties of three discrete premicellar complexes (E1#, E2#, E3#) of pig pancreatic group-IB secreted phospholipase A2 (sPLA2) with monodisperse alkyl sulfates have been characterized [Berg, O. G. et al., Biochemistry 43, 7999-8013, 2004]. Here we have solved the 2.7 Å crystal structure of group-IB sPLA2 complexed with 12 molecules of octyl sulfate (C8S) in a form consistent with a tetrameric oligomer that exists during the E1# phase of premicellar complexes. The alkyl tails of the C8S molecules are centered in the middle of the tetrameric cluster of sPLA2 subunits. Three of the four sPLA2 subunits also contain a C8S molecule in the active site pocket. The sulfate oxygen of a C8S ligand is complexed to the active site calcium in three of the four protein active sites. The interactions of the alkyl sulfate head group with Arg-6 and Lys-10, as well as the backbone amide of Met-20, are analogous to those observed in the previously solved sPLA2 crystal structures with bound phosphate and sulfate anions. The cluster of three anions found in the present structure is postulated to be the site for nucleating the binding of anionic amphiphiles to the interfacial surface of the protein, and therefore this binding interaction has implications for interfacial activation of the enzyme. Copyright (c) 2010 Elsevier B.V. All rights reserved.
Fort, Joana; de la Ballina, Laura R; Burghardt, Hans E; Ferrer-Costa, Carles; Turnay, Javier; Ferrer-Orta, Cristina; Usón, Isabel; Zorzano, Antonio; Fernández-Recio, Juan; Orozco, Modesto; Lizarbe, María Antonia; Fita, Ignacio; Palacín, Manuel
2007-10-26
4F2hc (CD98hc) is a multifunctional type II membrane glycoprotein involved in amino acid transport and cell fusion, adhesion, and transformation. The structure of the ectodomain of human 4F2hc has been solved using monoclinic (Protein Data Bank code 2DH2) and orthorhombic (Protein Data Bank code 2DH3) crystal forms at 2.1 and 2.8 Å, respectively. It is composed of a (βα)8 barrel and an antiparallel β8 sandwich related to bacterial α-glycosidases, although lacking key catalytic residues and consequently catalytic activity. 2DH3 is a dimer with Zn2+ coordination at the interface. Human 4F2hc expressed in several cell types resulted in cell surface and Cys109 disulfide bridge-linked homodimers with major architectural features of the crystal dimer, as demonstrated by cross-linking experiments. 4F2hc has no significant hydrophobic patches at the surface. Monomer and homodimer have a polarized charged surface. The N terminus of the solved structure, including the position of the Cys109 residue located four residues apart from the transmembrane domain, is adjacent to the positive face of the ectodomain. This location of the N terminus and the Cys109-intervening disulfide bridge imposes space restrictions sufficient to support a model for electrostatic interaction of the 4F2hc ectodomain with membrane phospholipids. These results provide the first crystal structure of heteromeric amino acid transporters and suggest a dynamic interaction of the 4F2hc ectodomain with the plasma membrane.
Integration of QUARK and I-TASSER for Ab Initio Protein Structure Prediction in CASP11.
Zhang, Wenxuan; Yang, Jianyi; He, Baoji; Walker, Sara Elizabeth; Zhang, Hongjiu; Govindarajoo, Brandon; Virtanen, Jouko; Xue, Zhidong; Shen, Hong-Bin; Zhang, Yang
2016-09-01
We tested two pipelines developed for template-free protein structure prediction in the CASP11 experiment. First, the QUARK pipeline constructs structure models by reassembling fragments of continuously distributed lengths excised from unrelated proteins. Five free-modeling (FM) targets have the model successfully constructed by QUARK with a TM-score above 0.4, including the first model of T0837-D1, which has a TM-score = 0.736 and RMSD = 2.9 Å to the native. Detailed analysis showed that the success is partly attributed to the high-resolution contact map prediction derived from fragment-based distance-profiles, which are mainly located between regular secondary structure elements and loops/turns and help guide the orientation of secondary structure assembly. In the Zhang-Server pipeline, weakly scoring threading templates are re-ordered by the structural similarity to the ab initio folding models, which are then reassembled by I-TASSER based structure assembly simulations; 60% more domains with length up to 204 residues, compared to the QUARK pipeline, were successfully modeled by the I-TASSER pipeline with a TM-score above 0.4. The robustness of the I-TASSER pipeline can stem from the composite fragment-assembly simulations that combine structures from both ab initio folding and threading template refinements. Despite the promising cases, challenges still exist in long-range beta-strand folding, domain parsing, and the uncertainty of secondary structure prediction; the latter of which was found to affect nearly all aspects of FM structure predictions, from fragment identification, target classification, structure assembly, to final model selection. Significant efforts are needed to solve these problems before real progress on FM could be made. Proteins 2016; 84(Suppl 1):76-86. © 2015 Wiley Periodicals, Inc.
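The TM-score threshold of 0.4 used in this abstract can be made concrete with a minimal sketch, assuming the commonly used definition of TM-score: the sum over aligned residue pairs of 1 / (1 + (d_i / d0)^2), divided by the target length L, with d0 = 1.24 (L - 15)^(1/3) - 1.8 Å. The sketch scores a single, fixed superposition, whereas the full TM-score is the maximum over superpositions. The function names and the 0.5 Å floor on d0 for short chains are assumptions for illustration and are not taken from QUARK or I-TASSER.

```python
def tm_d0(target_length):
    """Length-dependent distance scale d0 (in Å) used to normalize TM-score."""
    if target_length <= 15:
        # Assumption: the cube-root formula is not meaningful here, so use a floor.
        return 0.5
    return max(0.5, 1.24 * (target_length - 15) ** (1.0 / 3.0) - 1.8)


def tm_score(distances, target_length):
    """TM-score of one fixed superposition.

    `distances` holds the Cα-Cα distance (Å) for each aligned residue pair;
    target residues without an aligned partner simply contribute nothing.
    """
    if target_length <= 0:
        raise ValueError("target_length must be positive")
    d0 = tm_d0(target_length)
    return sum(1.0 / (1.0 + (d / d0) ** 2) for d in distances) / target_length


# Toy usage: a 100-residue target with every residue aligned at zero distance
# scores 1.0; aligning only half of the residues perfectly scores 0.5.
perfect = tm_score([0.0] * 100, 100)
half_aligned = tm_score([0.0] * 50, 100)
```

Because the score is normalized by the target length rather than the number of aligned pairs, partial alignments are penalized, which is why it is used here as a fold-level success criterion alongside RMSD.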
Boughton, Berin A; Dobson, Renwick C J; Hutton, Craig A
2012-08-01
The crystal structure of Escherichia coli dihydrodipicolinate synthase with pyruvate and substrate analogue succinic acid semialdehyde condensed with the active site lysine-161 was solved to a resolution of 2.3 Å. Comparative analysis to a previously reported structure both resolves the configuration at the aldol addition center, where the final addition product clearly displays the (S)-configuration, and the final conformation of the adduct within the active site. Direct comparison to two other crystal structures found in the Protein Data Bank, 1YXC, and 3DU0, demonstrates significant similarity between the active site residues of these structures. Copyright © 2012 Wiley Periodicals, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hamiaux, C.; Stanley, D.; Greenwood, D.R.
Takeout (To) proteins are found exclusively in insects and have been proposed to have important roles in various aspects of their physiology and behavior. Limited sequence similarity with juvenile hormone-binding proteins (JHBPs), which specifically bind and transport juvenile hormones in Lepidoptera, suggested a role for To proteins in binding hydrophobic ligands. We present the first crystal structure of a To protein, EpTo1 from the light brown apple moth Epiphyas postvittana, solved in-house by the single-wavelength anomalous diffraction technique using sulfur anomalous dispersion, and refined to 1.3 Å resolution. EpTo1 adopts the unusual α/β-wrap fold, seen only for JHBP and several mammalian lipid carrier proteins, a scaffold tailored for the binding and/or transport of hydrophobic ligands. EpTo1 has a 45 Å long, purely hydrophobic, internal tunnel that extends for the full length of the protein and accommodates a bound ligand. The latter was shown by mass spectrometry to be ubiquinone-8 and is probably derived from Escherichia coli. The structure provides the first direct experimental evidence that To proteins are ligand carriers; gives insights into the nature of endogenous ligand(s) of EpTo1; shows, by comparison with JHBP, a basis for different ligand specificities; and suggests a mechanism for the binding/release of ligands.
Shatabda, Swakkhar; Saha, Sanjay; Sharma, Alok; Dehzangi, Abdollah
2017-12-21
Bacteriophages are viruses that can significantly affect the functioning of bacteria and can be used in phage-based therapy. The functioning of a bacteriophage in the host bacteria depends on its location in those host cells. It is very important to know the subcellular location of the phage proteins in a host cell in order to understand their working mechanism. In this paper, we propose iPHLoc-ES, a prediction method for subcellular localization of bacteriophage proteins. We aim to solve two problems: discriminating between host-located and non-host-located phage proteins, and discriminating between the locations of host-located proteins in a host cell (membrane or cytoplasm). To do this, we extract sets of evolutionary and structural features of phage proteins and employ a Support Vector Machine (SVM) as our classifier. We also use recursive feature elimination (RFE) to reduce the number of features for effective prediction. On a standard dataset using standard evaluation criteria, our method significantly outperforms the state-of-the-art predictor. iPHLoc-ES is readily available to use as a standalone tool from: https://github.com/swakkhar/iPHLoc-ES/ and as a web application from: http://brl.uiu.ac.bd/iPHLoc-ES/. Copyright © 2017 Elsevier Ltd. All rights reserved.
iDBPs: a web server for the identification of DNA binding proteins
Nimrod, Guy; Schushan, Maya; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir
2010-01-01
Summary: The iDBPs server uses the three-dimensional (3D) structure of a query protein to predict whether it binds DNA. First, the algorithm predicts the functional region of the protein based on its evolutionary profile; the assumption is that large clusters of conserved residues are good markers of functional regions. Next, various characteristics of the predicted functional region as well as global features of the protein are calculated, such as the average surface electrostatic potential, the dipole moment and cluster-based amino acid conservation patterns. Finally, a random forests classifier is used to predict whether the query protein is likely to bind DNA and to estimate the prediction confidence. We have trained and tested the classifier on various datasets and shown that it outperformed related methods. On a dataset that reflects the fraction of DNA binding proteins (DBPs) in a proteome, the area under the ROC curve was 0.90. The application of the server to an updated version of the N-Func database, which contains proteins of unknown function with solved 3D-structure, suggested new putative DBPs for experimental studies. Availability: http://idbps.tau.ac.il/ Contact: NirB@tauex.tau.ac.il Supplementary information: Supplementary data are available at Bioinformatics online. PMID:20089514
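The area under the ROC curve reported in this abstract (0.90) can be read as the probability that a randomly chosen DNA-binding protein receives a higher score than a randomly chosen non-binding one. The sketch below computes that quantity by direct pair counting; the labels and scores are made-up illustrative values, not data from the iDBPs study, and the function name is hypothetical.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve by pairwise comparison (ties count one half).

    labels: 1 for positives (e.g. DNA-binding), 0 for negatives.
    scores: classifier confidence for the positive class.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical toy data: three binders, three non-binders.
toy_labels = [1, 1, 0, 1, 0, 0]
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
toy_auc = roc_auc(toy_labels, toy_scores)  # 8 of the 9 positive/negative pairs are ranked correctly
```

Pair counting is quadratic in the number of examples, which is fine for illustration; rank-based formulas give the same value more efficiently on large datasets.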
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cardarelli, Lia; Lam, Robert; Tuite, Ashleigh
2010-08-17
The final step in the morphogenesis of long-tailed double-stranded DNA bacteriophages is the joining of the DNA-filled head to the tail. The connector is a specialized structure of the head that serves as the interface for tail attachment and the point of egress for DNA from the head during infection. Here, we report the determination of a 2.1 Å crystal structure of gp6 of bacteriophage HK97. Through structural comparisons, functional studies, and bioinformatic analysis, gp6 has been determined to be a component of the connector of phage HK97 that is evolutionarily related to gp15, a well-characterized connector component of bacteriophage SPP1. Whereas the structure of gp15 was solved in a monomeric form, gp6 crystallized as an oligomeric ring with the dimensions expected for a connector protein. Although this ring is composed of 13 subunits, which does not match the symmetry of the connector within the phage, sequence conservation and modeling of this structure into the cryo-electron microscopy density of the SPP1 connector indicate that this oligomeric structure represents the arrangement of gp6 subunits within the mature phage particle. Through sequence searches and genomic position analysis, we determined that gp6 is a member of a large family of connector proteins that are present in long-tailed phages. We have also identified gp7 of HK97 as a homologue of gp16 of phage SPP1, which is the second component of the connector of this phage. These proteins are members of another large protein family involved in connector assembly.
Simkovic, Felix; Thomas, Jens M H; Keegan, Ronan M; Winn, Martyn D; Mayans, Olga; Rigden, Daniel J
2016-07-01
For many protein families, the deluge of new sequence information together with new statistical protocols now allow the accurate prediction of contacting residues from sequence information alone. This offers the possibility of more accurate ab initio (non-homology-based) structure prediction. Such models can be used in structure solution by molecular replacement (MR) where the target fold is novel or is only distantly related to known structures. Here, AMPLE, an MR pipeline that assembles search-model ensembles from ab initio structure predictions ('decoys'), is employed to assess the value of contact-assisted ab initio models to the crystallographer. It is demonstrated that evolutionary covariance-derived residue-residue contact predictions improve the quality of ab initio models and, consequently, the success rate of MR using search models derived from them. For targets containing β-structure, decoy quality and MR performance were further improved by the use of a β-strand contact-filtering protocol. Such contact-guided decoys achieved 14 structure solutions from 21 attempted protein targets, compared with nine for simple Rosetta decoys. Previously encountered limitations were superseded in two key respects. Firstly, much larger targets of up to 221 residues in length were solved, which is far larger than the previously benchmarked threshold of 120 residues. Secondly, contact-guided decoys significantly improved success with β-sheet-rich proteins. Overall, the improved performance of contact-guided decoys suggests that MR is now applicable to a significantly wider range of protein targets than were previously tractable, and points to a direct benefit to structural biology from the recent remarkable advances in sequencing.
Simkovic, Felix; Thomas, Jens M. H.; Keegan, Ronan M.; Winn, Martyn D.; Mayans, Olga; Rigden, Daniel J.
2016-01-01
For many protein families, the deluge of new sequence information together with new statistical protocols now allow the accurate prediction of contacting residues from sequence information alone. This offers the possibility of more accurate ab initio (non-homology-based) structure prediction. Such models can be used in structure solution by molecular replacement (MR) where the target fold is novel or is only distantly related to known structures. Here, AMPLE, an MR pipeline that assembles search-model ensembles from ab initio structure predictions (‘decoys’), is employed to assess the value of contact-assisted ab initio models to the crystallographer. It is demonstrated that evolutionary covariance-derived residue–residue contact predictions improve the quality of ab initio models and, consequently, the success rate of MR using search models derived from them. For targets containing β-structure, decoy quality and MR performance were further improved by the use of a β-strand contact-filtering protocol. Such contact-guided decoys achieved 14 structure solutions from 21 attempted protein targets, compared with nine for simple Rosetta decoys. Previously encountered limitations were superseded in two key respects. Firstly, much larger targets of up to 221 residues in length were solved, which is far larger than the previously benchmarked threshold of 120 residues. Secondly, contact-guided decoys significantly improved success with β-sheet-rich proteins. Overall, the improved performance of contact-guided decoys suggests that MR is now applicable to a significantly wider range of protein targets than were previously tractable, and points to a direct benefit to structural biology from the recent remarkable advances in sequencing. PMID:27437113
Structural basis of recognition of farnesylated and methylated KRAS4b by PDEδ.
Dharmaiah, Srisathiyanarayanan; Bindu, Lakshman; Tran, Timothy H; Gillette, William K; Frank, Peter H; Ghirlando, Rodolfo; Nissley, Dwight V; Esposito, Dominic; McCormick, Frank; Stephen, Andrew G; Simanshu, Dhirendra K
2016-11-01
Farnesylation and carboxymethylation of KRAS4b (Kirsten rat sarcoma isoform 4b) are essential for its interaction with the plasma membrane where KRAS-mediated signaling events occur. Phosphodiesterase-δ (PDEδ) binds to KRAS4b and plays an important role in targeting it to cellular membranes. We solved structures of human farnesylated-methylated KRAS4b in complex with PDEδ in two different crystal forms. In these structures, the interaction is driven by the C-terminal amino acids together with the farnesylated and methylated C185 of KRAS4b that binds tightly in the central hydrophobic pocket present in PDEδ. In crystal form II, we see the full-length structure of farnesylated-methylated KRAS4b, including the hypervariable region. Crystal form I reveals structural details of farnesylated-methylated KRAS4b binding to PDEδ, and crystal form II suggests the potential binding mode of geranylgeranylated-methylated KRAS4b to PDEδ. We identified a 5-aa-long sequence motif (Lys-Ser-Lys-Thr-Lys) in KRAS4b that may enable PDEδ to bind both forms of prenylated KRAS4b. Structure and sequence analysis of various prenylated proteins that have been previously tested for binding to PDEδ provides a rationale for why some prenylated proteins, such as KRAS4a, RalA, RalB, and Rac1, do not bind to PDEδ. Comparison of all four available structures of PDEδ complexed with various prenylated proteins/peptides shows the presence of additional interactions due to a larger protein-protein interaction interface in KRAS4b-PDEδ complex. This interface might be exploited for designing an inhibitor with minimal off-target effects.
Molecular mechanism of ligand recognition by membrane transport protein, Mhp1
Simmons, Katie J; Jackson, Scott M; Brueckner, Florian; Patching, Simon G; Beckstein, Oliver; Ivanova, Ekaterina; Geng, Tian; Weyand, Simone; Drew, David; Lanigan, Joseph; Sharples, David J; Sansom, Mark SP; Iwata, So; Fishwick, Colin WG; Johnson, A Peter; Cameron, Alexander D; Henderson, Peter JF
2014-01-01
The hydantoin transporter Mhp1 is a sodium-coupled secondary active transport protein of the nucleobase-cation-symport family and a member of the widespread 5-helix inverted repeat superfamily of transporters. The structure of Mhp1 was previously solved in three different conformations providing insight into the molecular basis of the alternating access mechanism. Here, we elucidate detailed events of substrate binding, through a combination of crystallography, molecular dynamics, site-directed mutagenesis, biochemical/biophysical assays, and the design and synthesis of novel ligands. We show precisely where 5-substituted hydantoin substrates bind in an extended configuration at the interface of the bundle and hash domains. They are recognised through hydrogen bonds to the hydantoin moiety and the complementarity of the 5-substituent for a hydrophobic pocket in the protein. Furthermore, we describe a novel structure of an intermediate state of the protein with the external thin gate locked open by an inhibitor, 5-(2-naphthylmethyl)-L-hydantoin, which becomes a substrate when leucine 363 is changed to an alanine. We deduce the molecular events that underlie acquisition and transport of a ligand by Mhp1. PMID:24952894
GPU-Based Point Cloud Superpositioning for Structural Comparisons of Protein Binding Sites.
Leinweber, Matthias; Fober, Thomas; Freisleben, Bernd
2018-01-01
In this paper, we present a novel approach to solve the labeled point cloud superpositioning problem for performing structural comparisons of protein binding sites. The solution is based on a parallel evolution strategy that operates on large populations and runs on GPU hardware. The proposed evolution strategy reduces the likelihood of getting stuck in a local optimum of the multimodal real-valued optimization problem represented by labeled point cloud superpositioning. The performance of the GPU-based parallel evolution strategy is compared to a previously proposed CPU-based sequential approach for labeled point cloud superpositioning, indicating that the GPU-based parallel evolution strategy leads to qualitatively better results and significantly shorter runtimes, with speed improvements of up to a factor of 1,500 for large populations. Binary classification tests based on the ATP, NADH, and FAD protein subsets of CavBase, a database containing putative binding sites, show average classification rate improvements from about 92 percent (CPU) to 96 percent (GPU). Further experiments indicate that the proposed GPU-based labeled point cloud superpositioning approach can be superior to traditional protein comparison approaches based on sequence alignments.
Detection of isolated protein-bound metal ions by single-particle cryo-STEM.
Elad, Nadav; Bellapadrona, Giuliano; Houben, Lothar; Sagi, Irit; Elbaum, Michael
2017-10-17
Metal ions play essential roles in many aspects of biological chemistry. Detecting their presence and location in proteins and cells is important for understanding biological function. Conventional structural methods such as X-ray crystallography and cryo-transmission electron microscopy can identify metal atoms on protein only if the protein structure is solved to atomic resolution. We demonstrate here the detection of isolated atoms of Zn and Fe on ferritin, using cryogenic annular dark-field scanning transmission electron microscopy (cryo-STEM) coupled with single-particle 3D reconstructions. Zn atoms are found in a pattern that matches precisely their location at the ferroxidase sites determined earlier by X-ray crystallography. By contrast, the Fe distribution is smeared along an arc corresponding to the proposed path from the ferroxidase sites to the mineral nucleation sites along the twofold axes. In this case the single-particle reconstruction is interpreted as a probability distribution function based on the average of individual locations. These results establish conditions for detection of isolated metal atoms in the broader context of electron cryo-microscopy and tomography.
NASA Astrophysics Data System (ADS)
Leone, Serena; Pica, Andrea; Merlino, Antonello; Sannino, Filomena; Temussi, Piero Andrea; Picone, Delia
2016-09-01
Sweet proteins are a family of proteins with no structure or sequence homology, able to elicit a sweet sensation in humans through their interaction with the dimeric T1R2-T1R3 sweet receptor. In particular, monellin and its single-chain derivative (MNEI) are among the sweetest proteins known. Starting from a careful analysis of the surface electrostatic potentials, we have designed new mutants of MNEI with enhanced sweetness. Then, we have included in the most promising variant the stabilising mutation E23Q, obtaining a construct with enhanced performance, which combines extreme sweetness with high, pH-independent thermal stability. The resulting mutant, with a sweetness threshold of only 0.28 mg/L (25 nM), is the strongest sweetener known to date. All the new proteins have been produced and purified, and the structures of the most powerful mutants have been solved by X-ray crystallography. Docking studies have then confirmed the rationale of their interaction with the human sweet receptor, hinting at a previously unpredicted role of plasticity in said interaction.
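The two forms of the sweetness threshold quoted in this abstract (0.28 mg/L and 25 nM) can be cross-checked with a one-line unit conversion, since their ratio is the molar mass of the protein. The sketch below does that arithmetic; the function name is illustrative, and the resulting mass is only what the two rounded figures imply, not a value stated in the abstract.

```python
def implied_molar_mass(mass_conc_mg_per_l: float, molar_conc_nm: float) -> float:
    """Molar mass (g/mol) implied by a mass concentration and a molar concentration."""
    grams_per_litre = mass_conc_mg_per_l * 1e-3   # mg/L -> g/L
    moles_per_litre = molar_conc_nm * 1e-9        # nM   -> mol/L
    return grams_per_litre / moles_per_litre


# Threshold figures from the abstract: 0.28 mg/L and 25 nM.
mnei_mass = implied_molar_mass(0.28, 25.0)  # about 11,200 g/mol, i.e. roughly 11 kDa
```

A mass of about 11 kDa is the right order of magnitude for a small single-chain protein, so the two figures are mutually consistent.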
Detection of isolated protein-bound metal ions by single-particle cryo-STEM
Elad, Nadav; Bellapadrona, Giuliano; Houben, Lothar; Sagi, Irit; Elbaum, Michael
2017-01-01
Metal ions play essential roles in many aspects of biological chemistry. Detecting their presence and location in proteins and cells is important for understanding biological function. Conventional structural methods such as X-ray crystallography and cryo-transmission electron microscopy can identify metal atoms on protein only if the protein structure is solved to atomic resolution. We demonstrate here the detection of isolated atoms of Zn and Fe on ferritin, using cryogenic annular dark-field scanning transmission electron microscopy (cryo-STEM) coupled with single-particle 3D reconstructions. Zn atoms are found in a pattern that matches precisely their location at the ferroxidase sites determined earlier by X-ray crystallography. By contrast, the Fe distribution is smeared along an arc corresponding to the proposed path from the ferroxidase sites to the mineral nucleation sites along the twofold axes. In this case the single-particle reconstruction is interpreted as a probability distribution function based on the average of individual locations. These results establish conditions for detection of isolated metal atoms in the broader context of electron cryo-microscopy and tomography. PMID:28973937
Willis, Charlene; Wang, Conan K.; Osman, Asiah; Simon, Anne; Pickering, Darren; Mulvenna, Jason; Riboldi-Tunicliffe, Alan; Jones, Malcolm K.; Loukas, Alex; Hofmann, Andreas
2011-01-01
Saposin-like proteins (SAPLIPs) from soil-transmitted helminths play pivotal roles in host-pathogen interactions and have a high potential as targets for vaccination against parasitic diseases. We have identified two non-orthologous SAPLIPs from human and dog hookworm, Na-SLP-1 and Ac-SLP-1, and solved their three-dimensional crystal structures. Both proteins share the property of membrane binding as monitored by liposome co-pelleting assays and monolayer adsorption. Neither SAPLIP displayed any significant haemolytic or bactericidal activity. Based on the structural information, as well as the results from monolayer adsorption, we propose models of membrane interactions for both SAPLIPs. Initial membrane contact of the monomeric Na-SLP-1 is most likely by electrostatic interactions between the membrane surface and a prominent basic surface patch. In case of the dimeric Ac-SLP-1, membrane interactions are most likely initiated by a unique tryptophan residue that has previously been implicated in membrane interactions in other SAPLIPs. PMID:21991310
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Kyung-Jin; Kim, Sujin; Lee, Sujin
2006-11-01
The Corynebacterium glutamicum NTA monooxygenase component A protein, which plays the central role in NTA biodegradation, was crystallized. The initial X-ray crystallographic characterization is reported. Safety and environmental concerns have recently dictated the proper disposal of nitrilotriacetate (NTA). Biodegradation of NTA is initiated by NTA monooxygenase, which is composed of two proteins: component A and component B. The NTA monooxygenase component A protein from Corynebacterium glutamicum was crystallized using the sitting-drop vapour-diffusion method in the presence of ammonium sulfate as the precipitant. X-ray diffraction data were collected to a maximum resolution of 2.5 Å on a synchrotron beamline. The crystal belongs to the monoclinic space group C2, with unit-cell parameters a = 111.04, b = 98.51, c = 171.61 Å, β = 101.94°. The asymmetric unit consists of four molecules, corresponding to a packing density of 2.3 Å³ Da⁻¹. The structure was solved by molecular replacement. Structure refinement is in progress.
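The packing density quoted in this abstract is the Matthews coefficient, the unit-cell volume divided by the total protein mass in the cell. As a rough consistency check, the sketch below computes the monoclinic cell volume from the reported parameters and backs out the molecular mass that the reported 2.3 Å³ Da⁻¹ would imply. The abstract does not state the protein mass, so the mass here is a derived estimate; the solvent fraction uses the usual approximation 1 − 1.23/V_M, and the function names are illustrative.

```python
import math


def monoclinic_cell_volume(a: float, b: float, c: float, beta_deg: float) -> float:
    """Unit-cell volume (Å^3) of a monoclinic cell: V = a * b * c * sin(beta)."""
    return a * b * c * math.sin(math.radians(beta_deg))


def implied_molecular_mass(cell_volume: float, matthews_coeff: float,
                           molecules_per_cell: int) -> float:
    """Mass per molecule (Da) from V_M = V / (N * M), rearranged for M."""
    return cell_volume / (matthews_coeff * molecules_per_cell)


def solvent_fraction(matthews_coeff: float) -> float:
    """Approximate solvent fraction, assuming a typical protein density."""
    return 1.0 - 1.23 / matthews_coeff


# Values from the abstract. Space group C2 has 4 asymmetric units per cell,
# and the asymmetric unit holds 4 molecules, giving 16 molecules per cell.
volume = monoclinic_cell_volume(111.04, 98.51, 171.61, 101.94)
mass = implied_molecular_mass(volume, 2.3, 4 * 4)
solvent = solvent_fraction(2.3)
```

With these inputs the cell volume is about 1.84 × 10⁶ Å³, the implied mass is close to 50 kDa per molecule, and the solvent fraction is a little under one half, all within the range commonly seen for protein crystals.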
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wernimont, Amy K; Artz, Jennifer D.; Finerty, Patrick, Jr
2010-09-21
Calcium-dependent protein kinases (CDPKs) have pivotal roles in the calcium-signaling pathway in plants, ciliates and apicomplexan parasites and comprise a calmodulin-dependent kinase (CaMK)-like kinase domain regulated by a calcium-binding domain in the C terminus. To understand this intramolecular mechanism of activation, we solved the structures of the autoinhibited (apo) and activated (calcium-bound) conformations of CDPKs from the apicomplexan parasites Toxoplasma gondii and Cryptosporidium parvum. In the apo form, the C-terminal CDPK activation domain (CAD) resembles a calmodulin protein with an unexpected long helix in the N terminus that inhibits the kinase domain in the same manner as CaMKII. Calcium binding triggers the reorganization of the CAD into a highly intricate fold, leading to its relocation around the base of the kinase domain to a site remote from the substrate binding site. This large conformational change constitutes a distinct mechanism in calcium signal-transduction pathways.
Native Mass Spectrometry: What is in the Name?
NASA Astrophysics Data System (ADS)
Leney, Aneika C.; Heck, Albert J. R.
2017-01-01
Electrospray ionization mass spectrometry (ESI-MS) is nowadays one of the cornerstones of biomolecular mass spectrometry and proteomics. Advances in sample preparation and mass analyzers have enabled researchers to extract much more information from biological samples than just the molecular weight. In particular, relevant for structural biology, noncovalent protein-protein and protein-ligand complexes can now also be analyzed by MS. For these types of analyses, assemblies need to be retained in their native quaternary state in the gas phase. This initial small niche of biomolecular mass spectrometry, nowadays often referred to as "native MS," has come to maturation over the last two decades, with dozens of laboratories using it to study mostly protein assemblies, but also DNA and RNA-protein assemblies, with the goal to define structure-function relationships. In this perspective, we describe the origins of and (re)define the term native MS, portraying in detail what we meant by "native MS," when the term was coined and also describing what it does (according to us) not entail. Additionally, we describe a few examples highlighting what native MS is, showing its successes to date while illustrating the wide scope this technology has in solving complex biological questions.
Fiorillo, Annarita; di Marino, Daniele; Bertuccini, Lucia; Via, Allegra; Pozio, Edoardo; Camerini, Serena; Ilari, Andrea; Lalle, Marco
2014-01-01
The 14-3-3s are a family of dimeric, evolutionarily conserved pSer/pThr binding proteins that play a key role in multiple biological processes by interacting with a plethora of client proteins. Giardia duodenalis is a flagellated protozoan that affects millions of people worldwide, causing acute and chronic diarrheal disease. The single giardial 14-3-3 isoform (g14-3-3), unique in the 14-3-3 family, needs the constitutive phosphorylation of Thr214 and the polyglycylation of its C-terminus to be fully functional in vivo. Alteration of the phosphorylation and polyglycylation status affects the parasite's differentiation into the cyst stage. To further investigate the role of these post-translational modifications, the crystal structure of the g14-3-3 was solved in the unmodified apo form. Oligomers of g14-3-3 were observed due to domain swapping events at the protein C-terminus. The formation of filaments was supported by TEM. Mutational analysis, in combination with native PAGE and chemical cross-linking, proved that polyglycylation prevents oligomerization. In silico phosphorylation and molecular dynamics simulations supported a structural role for the phosphorylation of Thr214 in promoting target binding. Our findings highlight unique structural features of g14-3-3, opening novel perspectives on the evolutionary history of this protein family and envisaging the possibility of developing anti-giardial drugs targeting g14-3-3. PMID:24658679
Vance, Tyler D R; Graham, Laurie A; Davies, Peter L
2018-04-01
Out of the dozen different ice-binding protein (IBP) structures known, the DUF3494 domain is the most widespread, having been passed many times between prokaryotic and eukaryotic microorganisms by horizontal gene transfer. This ~25-kDa β-solenoid domain with an adjacent parallel α-helix is most commonly associated with an N-terminal secretory signal peptide. However, examples of the DUF3494 domain preceded by tandem Bacterial Immunoglobulin-like (BIg) domains are sometimes found, though uncharacterized. Here, we present one such protein (SfIBP_1) from the Antarctic bacterium Shewanella frigidimarina. We have confirmed and characterized the ice-binding activity of its ice-binding domain using thermal hysteresis measurements, fluorescent ice plane affinity analysis, and ice recrystallization inhibition assays. X-ray crystallography was used to solve the structure of the SfIBP_1 ice-binding domain, to further characterize its ice-binding surface and unique method of stabilizing or 'capping' the ends of the solenoid structure. The latter is formed from the interaction of two loops mediated by a combination of tandem prolines and electrostatic interactions. Furthermore, given their domain architecture and membrane association, we propose that these BIg-containing DUF3494 IBPs serve as ice-binding adhesion proteins that are capable of adsorbing their host bacterium onto ice. Submitted new structure to the Protein Data Bank (PDB: 6BG8). © 2018 Federation of European Biochemical Societies.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gribenko, Alexey; Mosyak, Lidia; Ghosh, Sharmistha
MntC is a metal-binding protein component of the Mn²⁺-specific mntABC transporter from the pathogen Staphylococcus aureus. The protein is expressed during the early stages of infection and was proven to be effective at reducing both S. aureus and Staphylococcus epidermidis infections in a murine animal model when used as a vaccine antigen. MntC is currently being tested in human clinical trials as a component of a multiantigen vaccine for the prevention of S. aureus infections. To better understand the biological function of MntC, we are providing structural and biophysical characterization of the protein in this work. The three-dimensional structure of the protein was solved by X-ray crystallography at 2.2 Å resolution and suggests two potential metal binding modes, which may lead to reversible as well as irreversible metal binding. Precise Mn²⁺-binding affinity of the protein was determined from the isothermal titration calorimetry experiments using a competition approach. Differential scanning calorimetry experiments confirmed that divalent metals can indeed bind to MntC reversibly as well as irreversibly. Finally, Mn²⁺-induced structural and dynamics changes have been characterized using spectroscopic methods and deuterium–hydrogen exchange mass spectroscopy. Results of the experiments show that these changes are minimal and are largely restricted to the structural elements involved in metal coordination. Therefore, it is unlikely that antibody binding to this antigen will be affected by the occupancy of the metal-binding site by Mn²⁺.
Womack, James C; Anton, Lucian; Dziedzic, Jacek; Hasnip, Phil J; Probert, Matt I J; Skylaris, Chris-Kriton
2018-03-13
The solution of the Poisson equation is a crucial step in electronic structure calculations, yielding the electrostatic potential, a key component of the quantum mechanical Hamiltonian. In recent decades, theoretical advances and increases in computer performance have made it possible to simulate the electronic structure of extended systems in complex environments. This requires the solution of more complicated variants of the Poisson equation, featuring nonhomogeneous dielectric permittivities, ionic concentrations with nonlinear dependencies, and diverse boundary conditions. The analytic solutions generally used to solve the Poisson equation in vacuum (or with homogeneous permittivity) are not applicable in these circumstances, and numerical methods must be used. In this work, we present DL_MG, a flexible, scalable, and accurate solver library, developed specifically to tackle the challenges of solving the Poisson equation in modern large-scale electronic structure calculations on parallel computers. Our solver is based on the multigrid approach and uses an iterative high-order defect correction method to improve the accuracy of solutions. Using two chemically relevant model systems, we tested the accuracy and computational performance of DL_MG when solving the generalized Poisson and Poisson-Boltzmann equations, demonstrating excellent agreement with analytic solutions and efficient scaling to ∼10^9 unknowns and 100s of CPU cores. We also applied DL_MG in actual large-scale electronic structure calculations, using the ONETEP linear-scaling electronic structure package to study a 2615 atom protein-ligand complex with routinely available computational resources. In these calculations, the overall execution time with DL_MG was not significantly greater than the time required for calculations using a conventional FFT-based solver.
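As a rough illustration of what "solving the Poisson equation numerically and checking against an analytic solution" means, the sketch below solves the one-dimensional problem -u''(x) = f(x) on [0, 1] with zero boundary values using second-order finite differences and plain Gauss-Seidel sweeps. This is not DL_MG's algorithm (the abstract describes multigrid with high-order defect correction in three dimensions); the function name `solve_poisson_1d`, the grid size, and the sweep count are assumptions chosen only for this example.

```python
import math


def solve_poisson_1d(f, n, sweeps=2000):
    """Solve -u'' = f on [0, 1] with u(0) = u(1) = 0.

    Uses n interior grid points, the standard 3-point finite-difference
    stencil, and Gauss-Seidel relaxation (a simple smoother of the kind
    that multigrid methods accelerate). Returns the n + 2 grid values.
    """
    h = 1.0 / (n + 1)
    u = [0.0] * (n + 2)  # boundary entries u[0] and u[n + 1] stay at zero
    for _ in range(sweeps):
        for i in range(1, n + 1):
            # From (-u[i-1] + 2 u[i] - u[i+1]) / h^2 = f(x_i)
            u[i] = 0.5 * (u[i - 1] + u[i + 1] + h * h * f(i * h))
    return u


# Right-hand side chosen so that the analytic solution is u(x) = sin(pi x).
rhs = lambda x: math.pi ** 2 * math.sin(math.pi * x)
numeric = solve_poisson_1d(rhs, n=15)
max_error = max(abs(numeric[i] - math.sin(math.pi * i / 16)) for i in range(17))
```

Plain relaxation converges slowly as the grid is refined, which is the usual motivation for the multigrid approach mentioned in the abstract.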
Batra, Jyotica; Robinson, Jessica; Soares, Alexei S; Fields, Alan P; Radisky, Derek C; Radisky, Evette S
2012-05-04
Matrix metalloproteinase 10 (MMP-10, stromelysin-2) is a secreted metalloproteinase with functions in skeletal development, wound healing, and vascular remodeling; its overexpression is also implicated in lung tumorigenesis and tumor progression. To understand the regulation of MMP-10 by tissue inhibitors of metalloproteinases (TIMPs), we have assessed equilibrium inhibition constants (K(i)) of putative physiological inhibitors TIMP-1 and TIMP-2 for the active catalytic domain of human MMP-10 (MMP-10cd) using multiple kinetic approaches. We find that TIMP-1 inhibits the MMP-10cd with a K(i) of 1.1 × 10(-9) M; this interaction is 10-fold weaker than the inhibition of the similar MMP-3 (stromelysin-1) catalytic domain (MMP-3cd) by TIMP-1. TIMP-2 inhibits the MMP-10cd with a K(i) of 5.8 × 10(-9) M, which is again 10-fold weaker than the inhibition of MMP-3cd by this inhibitor (K(i) = 5.5 × 10(-10) M). We solved the x-ray crystal structure of TIMP-1 bound to the MMP-10cd at 1.9 Å resolution; the structure was solved by molecular replacement and refined with an R-factor of 0.215 (R(free) = 0.266). Comparing our structure of MMP-10cd·TIMP-1 with the previously solved structure of MMP-3cd·TIMP-1 (Protein Data Bank entry 1UEA), we see substantial differences at the binding interface that provide insight into the differential binding of stromelysin family members to TIMP-1. This structural information may ultimately assist in the design of more selective TIMP-based inhibitors tailored for specificity toward individual members of the stromelysin family, with potential therapeutic applications.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gustchina, Alla; Li, Mi; Wunschmann, Sabina
2010-07-19
The crystal structure of Bla g 2 was solved in order to investigate the structural basis for the allergenic properties of this unusual protein. This is the first structure of an aspartic protease in which conserved glycine residues, in two canonical DTG triads, are substituted by different amino acid residues. Another unprecedented feature revealed by the structure is the single phenylalanine residue insertion on the tip of the flap, with the side-chain occupying the S1 binding pocket. This and other important amino acid substitutions in the active site region of Bla g 2 modify the interactions in the vicinity of the catalytic aspartate residues, increasing the distance between them to ~4 Å and establishing unique direct contacts between the flap and the catalytic residues. We attribute the absence of substantial catalytic activity in Bla g 2 to these unusual features of the active site. Five disulfide bridges and a Zn-binding site confer stability to the protein, which may contribute to sensitization at lower levels of exposure than other allergens.
Inhibition of Fatty Acid Synthase in Prostate Cancer by Orlistat, a Novel Therapeutic
2006-11-01
previous crystallography studies by solving the crystal structure of FAS bound to a cleaved orlistat. These data will provide valuable insight into...timeline of XBP-1 15 processing following orlistat treatment (Figure 3A). Previous studies have demonstrated that inhibition of protein translation with...future drug discovery and design within the FAS pathway. In total, we have made great strides toward understanding the anti-tumor effects of orlistat
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shevtsov, M. B.; Streeter, S. D.; Thresh, S.-J.
2015-02-01
The structure of the new class of controller proteins (exemplified by C.Csp231I) in complex with its 21 bp DNA-recognition sequence is presented, and the molecular basis of sequence recognition in this class of proteins is discussed. An unusual extended spacer between the dimer binding sites suggests a novel interaction between the two C-protein dimers. In a wide variety of bacterial restriction–modification systems, a regulatory ‘controller’ protein (or C-protein) is required for effective transcription of its own gene and for transcription of the endonuclease gene found on the same operon. We have recently turned our attention to a new class of controller proteins (exemplified by C.Csp231I) that have quite novel features, including a much larger DNA-binding site with an 18 bp (∼60 Å) spacer between the two palindromic DNA-binding sequences and a very different recognition sequence from the canonical GACT/AGTC. Using X-ray crystallography, the structure of the protein in complex with its 21 bp DNA-recognition sequence was solved to 1.8 Å resolution, and the molecular basis of sequence recognition in this class of proteins was elucidated. An unusual aspect of the promoter sequence is the extended spacer between the dimer binding sites, suggesting a novel interaction between the two C-protein dimers when bound to both recognition sites correctly spaced on the DNA. A U-bend model is proposed for this tetrameric complex, based on the results of gel-mobility assays, hydrodynamic analysis and the observation of key contacts at the interface between dimers in the crystal.
Ronin, Céline; Costa, David Mendes; Tavares, Joana; Faria, Joana; Ciesielski, Fabrice; Ciapetti, Paola; Smith, Terry K; MacDougall, Jane; Cordeiro-da-Silva, Anabela; Pemberton, Iain K
2018-01-01
The de novo crystal structure of the Leishmania infantum Silent Information Regulator 2 related protein 1 (LiSir2rp1) has been solved at 1.99Å in complex with an acetyl-lysine peptide substrate. The structure is broadly commensurate with Hst2/SIRT2 proteins of yeast and human origin, reproducing many of the structural features common to these sirtuin deacetylases, including the characteristic small zinc-binding domain, and the larger Rossmann-fold domain involved in NAD+-binding interactions. The two domains are linked via a cofactor binding loop ordered in open conformation. The peptide substrate binds to the LiSir2rp1 protein via a cleft formed between the small and large domains, with the acetyl-lysine side chain inserting further into the resultant hydrophobic tunnel. Crystals were obtained only with recombinant LiSir2rp1 possessing an extensive internal deletion of a proteolytically-sensitive region unique to the sirtuins of kinetoplastid origin. Deletion of 51 internal amino acids (P253-E303) from LiSir2rp1 did not appear to alter peptide substrate interactions in deacetylation assays, but was indispensable to obtain crystals. Removal of this potentially flexible region, that otherwise extends from the classical structural elements of the Rossmann-fold, specifically the β8-β9 connector, appears to result in lower accumulation of the protein when expressed from episomal vectors in L. infantum SIR2rp1 single knockout promastigotes. The biological function of the large serine-rich insertion in kinetoplastid/trypanosomatid sirtuins, highlighted as a disordered region with strong potential for post-translational modification, remains unknown but may confer additional cellular functions that are distinct from their human counterparts. 
These unique molecular features, along with the resolution of the first kinetoplastid sirtuin deacetylase structure, present novel opportunities for drug design against a protein target previously established as essential to parasite survival and proliferation.
Problem-Solving Test: The Mechanism of Protein Synthesis
ERIC Educational Resources Information Center
Szeberenyi, Jozsef
2009-01-01
Terms to be familiar with before you start to solve the test: protein synthesis, ribosomes, amino acids, peptides, peptide bond, polypeptide chain, N- and C-terminus, hemoglobin, α- and β-globin chains, radioactive labeling, [3H] and [14C]leucine, cytosol, differential centrifugation, density…
Can You Solve the Crime? Using Agarose Electrophoresis To Identify an Unknown Colored Protein.
ERIC Educational Resources Information Center
Wiltfong, Cynthia L.; Chester, Emily; Albertin, Faith; Smith, Julia; Hall, Judith C.; Arth, Emily C.; Martin, Stephanie
2003-01-01
Describes a lab that introduces agarose electrophoresis techniques and basic information on proteins to middle school and high school students. Built around a scenario in which students must solve a crime, the lab has real-world applications that should spark student interest. (KHR)
3D Protein structure prediction with genetic tabu search algorithm
2010-01-01
Background: Protein structure prediction (PSP) has important applications in different fields, such as drug design and disease prediction. In protein structure prediction, there are two important issues. The first is the design of the structure model and the second is the design of the optimization technology. Because of the complexity of realistic protein structures, the structure model adopted in this paper is a simplified model, called the off-lattice AB model. Once the structure model is assumed, optimization technology is needed to search for the best conformation of a protein sequence under that model. However, PSP is an NP-hard problem even if the simplest model is assumed. Thus, many algorithms have been developed to solve the global optimization problem. In this paper, a hybrid algorithm, which combines a genetic algorithm (GA) and a tabu search (TS) algorithm, is developed to complete this task. Results: In order to develop an efficient optimization algorithm, several improved strategies are developed for the proposed genetic tabu search algorithm. The combined use of these strategies can improve the efficiency of the algorithm. In these strategies, tabu search introduced into the crossover and mutation operators can improve the local search capability, the adoption of a variable population size strategy can maintain the diversity of the population, and the ranking selection strategy can improve the possibility of an individual with a low energy value entering the next generation. Experiments are performed with Fibonacci sequences and real protein sequences. Experimental results show that the lowest energy obtained by the proposed GATS algorithm is lower than that obtained by previous methods. Conclusions: The hybrid algorithm combines the advantages of both the genetic algorithm and the tabu search algorithm. It makes use of the multiple search points of the genetic algorithm, and can overcome the poor hill-climbing capability of the conventional genetic algorithm by using the flexible memory functions of TS. Compared with some previous algorithms, the GATS algorithm has better performance in global optimization and can predict 3D protein structure more effectively. PMID:20522256
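To make the objective of the optimization concrete, here is a minimal sketch of an off-lattice AB model energy and of the Fibonacci test sequences. The abstract does not state the energy function, so the form below is an assumption: a commonly used Stillinger-type definition with a bending term (1/4)(1 - cos θ) per interior residue and a Lennard-Jones-like term 4(r^-12 - C r^-6) for non-adjacent pairs, with C = 1 for AA, 1/2 for BB and -1/2 for AB. The function names `fibonacci_sequence`, `coupling` and `ab_energy` are hypothetical, and the paper's GA and tabu search operators are not reproduced.

```python
import math


def fibonacci_sequence(n):
    """n-th Fibonacci AB string: S0 = 'A', S1 = 'B', Sn = S(n-2) + S(n-1)."""
    prev, curr = "A", "B"
    if n == 0:
        return prev
    for _ in range(n - 1):
        prev, curr = curr, prev + curr
    return curr


def coupling(a, b):
    """Assumed species-dependent coefficient C: AA attracts strongly,
    BB attracts weakly, AB pairs repel weakly."""
    if a == "A" and b == "A":
        return 1.0
    if a == "B" and b == "B":
        return 0.5
    return -0.5


def ab_energy(sequence, coords):
    """Energy of a chain given one 3D coordinate per residue.

    Bond lengths are not constrained here; the caller is assumed to
    supply coordinates with unit-length bonds.
    """
    n = len(sequence)
    bend = 0.0
    for i in range(1, n - 1):
        u = [coords[i][k] - coords[i - 1][k] for k in range(3)]
        v = [coords[i + 1][k] - coords[i][k] for k in range(3)]
        dot = sum(x * y for x, y in zip(u, v))
        norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
        bend += 0.25 * (1.0 - dot / norm)  # zero for a straight chain
    pair = 0.0
    for i in range(n - 2):
        for j in range(i + 2, n):  # skip bonded neighbours
            r = math.dist(coords[i], coords[j])
            pair += 4.0 * (r ** -12 - coupling(sequence[i], sequence[j]) * r ** -6)
    return bend + pair


straight = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
example_energy = ab_energy("AAA", straight)
```

An optimizer such as the GA and TS hybrid described above would search over bond and torsion angles to minimize a function of this kind.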
Knutson, Stacy T; Westwood, Brian M; Leuthaeuser, Janelle B; Turner, Brandon E; Nguyendac, Don; Shea, Gabrielle; Kumar, Kiran; Hayden, Julia D; Harper, Angela F; Brown, Shoshana D; Morris, John H; Ferrin, Thomas E; Babbitt, Patricia C; Fetrow, Jacquelyn S
2017-04-01
Protein function identification remains a significant problem. Solving this problem at the molecular functional level would allow identification of mechanistic determinants: the amino acids that distinguish details between functional families within a superfamily. Active site profiling was developed to identify mechanistic determinants. DASP and DASP2 were developed as tools to search sequence databases using active site profiling. Here, TuLIP (Two-Level Iterative clustering Process) is introduced as an iterative, divisive clustering process that utilizes active site profiling to separate structurally characterized superfamily members into functionally relevant clusters. Underlying TuLIP is the observation that functionally relevant families (curated by Structure-Function Linkage Database, SFLD) self-identify in DASP2 searches; clusters containing multiple functional families do not. Each TuLIP iteration produces candidate clusters, each evaluated to determine if it self-identifies using DASP2. If so, it is deemed a functionally relevant group. Divisive clustering continues until each structure is either a functionally relevant group member or a singlet. TuLIP is validated on enolase and glutathione transferase structures, superfamilies well-curated by SFLD. Correlation is strong; small numbers of structures prevent statistically significant analysis. TuLIP-identified enolase clusters are used in DASP2 GenBank searches to identify sequences sharing functional site features. Analysis shows a true positive rate of 96%, false negative rate of 4%, and maximum false positive rate of 4%. F-measure and performance analysis on the enolase search results and comparison to GEMMA and SCI-PHY demonstrate that TuLIP avoids the over-division problem of these methods. Mechanistic determinants for enolase families are evaluated and shown to correlate well with literature results. © 2017 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.
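The stopping rule described in this abstract (keep dividing until every structure belongs to a self-identifying group or is a singlet) can be sketched as a generic divisive loop. Everything below is a hypothetical stand-in: `split` plays the role of the clustering step and `self_identifies` plays the role of the DASP2 check, neither of which is specified in the abstract, and the toy data are invented for illustration only.

```python
def divisive_clustering(items, split, self_identifies):
    """Divide `items` until each cluster passes `self_identifies` or is a singlet.

    split(cluster) must return two or more non-empty sub-clusters.
    Returns (accepted_groups, singlets).
    """
    groups, singlets = [], []
    pending = [list(items)]
    while pending:
        cluster = pending.pop()
        if len(cluster) == 1:
            singlets.append(cluster[0])
        elif self_identifies(cluster):
            groups.append(cluster)  # accepted as a functionally relevant group
        else:
            parts = split(cluster)
            if len(parts) < 2:
                raise ValueError("split() must divide the cluster")
            pending.extend(parts)
    return groups, singlets


# Toy stand-ins: each item is (structure_name, family_label).
def halve(cluster):
    mid = len(cluster) // 2
    return [cluster[:mid], cluster[mid:]]


def same_family(cluster):
    return len({family for _, family in cluster}) == 1


toy = [("s1", "F1"), ("s2", "F1"), ("s3", "F2"), ("s4", "F3")]
groups, singlets = divisive_clustering(toy, halve, same_family)
```

In this toy run the two F1 structures form one accepted group, while the F2 and F3 structures end up as singlets.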
NASA Astrophysics Data System (ADS)
Zoete, V.; Michielin, O.; Karplus, M.
2003-12-01
A method is proposed for the estimation of absolute binding free energy of interaction between proteins and ligands. Conformational sampling of the protein-ligand complex is performed by molecular dynamics (MD) in vacuo and the solvent effect is calculated a posteriori by solving the Poisson or the Poisson-Boltzmann equation for selected frames of the trajectory. The binding free energy is written as a linear combination of the buried surface upon complexation, SAS_bur, the electrostatic interaction energy between the ligand and the protein, E_elec, and the difference of the solvation free energies of the complex and the isolated ligand and protein, ΔG_solv. The method uses the buried surface upon complexation to account for the non-polar contribution to the binding free energy because it is less sensitive to the details of the structure than the van der Waals interaction energy. The parameters of the method are developed for a training set of 16 HIV-1 protease-inhibitor complexes of known 3D structure. A correlation coefficient of 0.91 was obtained with an unsigned mean error of 0.8 kcal/mol. When applied to a set of 25 HIV-1 protease-inhibitor complexes of unknown 3D structures, the method provides a satisfactory correlation between the calculated binding free energy and the experimental pIC50 without reparametrization.
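Written out, the linear combination described above might take the following form. The coefficient symbols α, β and γ are assumptions introduced here for illustration (the abstract does not name the fitted parameters or say whether a constant term is included), and the angle brackets indicate averaging over the selected MD frames, which is one plausible reading of the abstract.

```latex
\Delta G_{\mathrm{bind}} \approx
  \alpha \,\langle \mathrm{SAS}_{\mathrm{bur}} \rangle
  + \beta \,\langle E_{\mathrm{elec}} \rangle
  + \gamma \,\langle \Delta G_{\mathrm{solv}} \rangle ,
\qquad
\Delta G_{\mathrm{solv}} =
  G_{\mathrm{solv}}^{\mathrm{complex}}
  - G_{\mathrm{solv}}^{\mathrm{protein}}
  - G_{\mathrm{solv}}^{\mathrm{ligand}} .
```

Under this reading, the parameters would be fitted on the 16 training complexes and then applied unchanged to the 25 test complexes.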
The Histone Database: an integrated resource for histones and histone fold-containing proteins
Mariño-Ramírez, Leonardo; Levine, Kevin M.; Morales, Mario; Zhang, Suiyuan; Moreland, R. Travis; Baxevanis, Andreas D.; Landsman, David
2011-01-01
Eukaryotic chromatin is composed of DNA and protein components—core histones—that act to compactly pack the DNA into nucleosomes, the fundamental building blocks of chromatin. These nucleosomes are connected to adjacent nucleosomes by linker histones. Nucleosomes are highly dynamic and, through various core histone post-translational modifications and incorporation of diverse histone variants, can serve as epigenetic marks to control processes such as gene expression and recombination. The Histone Sequence Database is a curated collection of sequences and structures of histones and non-histone proteins containing histone folds, assembled from major public databases. Here, we report a substantial increase in the number of sequences and taxonomic coverage for histone and histone fold-containing proteins available in the database. Additionally, the database now contains an expanded dataset that includes archaeal histone sequences. The database also provides comprehensive multiple sequence alignments for each of the four core histones (H2A, H2B, H3 and H4), the linker histones (H1/H5) and the archaeal histones. The database also includes current information on solved histone fold-containing structures. The Histone Sequence Database is an inclusive resource for the analysis of chromatin structure and function focused on histones and histone fold-containing proteins. Database URL: The Histone Sequence Database is freely available and can be accessed at http://research.nhgri.nih.gov/histones/. PMID:22025671
Structure and activity of the Pseudomonas aeruginosa hotdog-fold thioesterases PA5202 and PA2801
Gonzalez, Claudio F.; Tchigvintsev, Anatoli; Brown, Greg; Flick, Robert; Evdokimova, Elena; Xu, Xiaohui; Osipiuk, Jerzy; Cuff, Marianne E.; Lynch, Susan; Joachimiak, Andrzej; Savchenko, Alexei; Yakunin, Alexander F.
2013-01-01
The hotdog fold is one of the basic protein folds widely present in bacteria, archaea, and eukaryotes. Many of these proteins exhibit thioesterase activity against fatty acyl-CoAs and play important roles in lipid metabolism, cellular signaling, and degradation of xenobiotics. The genome of the opportunistic pathogen Pseudomonas aeruginosa contains over 20 genes encoding predicted hotdog-fold proteins, none of which have been experimentally characterized. We have found that two P. aeruginosa hotdog proteins display high thioesterase activity against 3-hydroxy-3-methylglutaryl-CoA and glutaryl-CoA (PA5202), and octanoyl-CoA (PA2801). Crystal structures of these proteins were solved (1.70 and 1.75 Å) and revealed a hotdog fold with a potential catalytic carboxylate residue located on the long alpha helix (Asp57 in PA5202 and Glu35 in PA2801). Alanine replacement mutagenesis of PA5202 identified four residues (Asn42, Arg43, Asp57, and Thr76), which are critical for activity and are located in the active site. A P. aeruginosa PA5202 deletion strain showed an increased secretion of the antimicrobial pigment pyocyanine and an increased expression of genes involved in pyocyanin biosynthesis suggesting a functional link between the PA5202 activity and pyocyanin production. Thus, the P. aeruginosa hotdog thioesterases PA5202 and PA2801 have similar structures, but exhibit different substrate preferences and functions. PMID:22439787
The Protein-DNA Interface database
2010-01-01
The Protein-DNA Interface database (PDIdb) is a repository containing relevant structural information of Protein-DNA complexes solved by X-ray crystallography and available at the Protein Data Bank. The database includes a simple functional classification of the protein-DNA complexes that consists of three hierarchical levels: Class, Type and Subtype. This classification has been defined and manually curated by humans based on the information gathered from several sources that include PDB, PubMed, CATH, SCOP and COPS. The current version of the database contains only structures with resolution of 2.5 Å or higher, accounting for a total of 922 entries. The major aim of this database is to contribute to the understanding of the main rules that underlie the molecular recognition process between DNA and proteins. To this end, the database is focused on each specific atomic interface rather than on the separated binding partners. Therefore, each entry in this database consists of a single and independent protein-DNA interface. We hope that PDIdb will be useful to many researchers working in fields such as the prediction of transcription factor binding sites in DNA, the study of specificity determinants that mediate enzyme recognition events, engineering and design of new DNA binding proteins with distinct binding specificity and affinity, among others. Finally, due to its friendly and easy-to-use web interface, we hope that PDIdb will also serve educational and teaching purposes. PMID:20482798
The Protein-DNA Interface database.
Norambuena, Tomás; Melo, Francisco
2010-05-18
The Protein-DNA Interface database (PDIdb) is a repository containing relevant structural information of Protein-DNA complexes solved by X-ray crystallography and available at the Protein Data Bank. The database includes a simple functional classification of the protein-DNA complexes that consists of three hierarchical levels: Class, Type and Subtype. This classification has been defined and manually curated by humans based on the information gathered from several sources that include PDB, PubMed, CATH, SCOP and COPS. The current version of the database contains only structures with resolution of 2.5 Å or higher, accounting for a total of 922 entries. The major aim of this database is to contribute to the understanding of the main rules that underlie the molecular recognition process between DNA and proteins. To this end, the database is focused on each specific atomic interface rather than on the separated binding partners. Therefore, each entry in this database consists of a single and independent protein-DNA interface. We hope that PDIdb will be useful to many researchers working in fields such as the prediction of transcription factor binding sites in DNA, the study of specificity determinants that mediate enzyme recognition events, engineering and design of new DNA binding proteins with distinct binding specificity and affinity, among others. Finally, due to its friendly and easy-to-use web interface, we hope that PDIdb will also serve educational and teaching purposes.
Crystal structure of the GTPase domain and the bundle signalling element of dynamin in the GDP state
DOE Office of Scientific and Technical Information (OSTI.GOV)
Anand, Roopsee; Eschenburg, Susanne; Reubold, Thomas F., E-mail: Reubold.Thomas@mh-hannover.de
Dynamin is the prototype of a family of large multi-domain GTPases. The 100 kDa protein is a key player in clathrin-mediated endocytosis, where it cleaves off vesicles from membranes using the energy from GTP hydrolysis. We have solved the high resolution crystal structure of a fusion protein of the GTPase domain and the bundle signalling element (BSE) of dynamin 1 liganded with GDP. The structure provides a hitherto missing snapshot of the GDP state of the hydrolytic cycle of dynamin and reveals how the switch I region moves away from the active site after GTP hydrolysis and release of inorganic phosphate. Comparing our structure of the GDP state with the known structures of the GTP state, the transition state and the nucleotide-free state of dynamin 1 we describe the structural changes through the hydrolytic cycle. - Highlights: • High resolution crystal structure of the GDP-state of a dynamin 1 GTPase-BSE fusion. • Visualizes one of the key states of the hydrolytic cycle of dynamin. • The dynamin-specific loop forms a helix as soon as a guanine base is present.
Natural triple beta-stranded fibrous folds.
Mitraki, Anna; Papanikolopoulou, Katerina; Van Raaij, Mark J
2006-01-01
A distinctive family of beta-structured folds has recently been described for fibrous proteins from viruses. Virus fibers are usually involved in specific host-cell recognition. They are asymmetric homotrimeric proteins consisting of an N-terminal virus-binding tail, a central shaft or stalk domain, and a C-terminal globular receptor-binding domain. Often they are entirely or nearly entirely composed of beta-structure. Apart from their biological relevance and possible gene therapy applications, their shape, stability, and rigidity suggest they may be useful as blueprints for biomechanical design. Folding and unfolding studies suggest their globular C-terminal domain may fold first, followed by a "zipping-up" of the shaft domains. The C-terminal domains appear to be important for registration because peptides corresponding to shaft domains alone aggregate into nonnative fibers and/or amyloid structures. C-terminal domains can be exchanged between different fibers and the resulting chimeric proteins are useful as a way to solve structures of unknown parts of the shaft domains. The following natural triple beta-stranded fibrous folds have been discovered by X-ray crystallography: the triple beta-spiral, triple beta-helix, and T4 short tail fiber fold. All have a central longitudinal hydrophobic core and extensive intermonomer polar and nonpolar interactions. Now that a reasonable body of structural and folding knowledge has been assembled about these fibrous proteins, the next challenge and opportunity is to start using this information in medical and industrial applications such as gene therapy and nanotechnology.
Hao, Xiao-Hu; Zhang, Gui-Jun; Zhou, Xiao-Gen; Yu, Xu-Feng
2016-01-01
To address the searching problem of protein conformational space in ab-initio protein structure prediction, a novel method using abstract convex underestimation (ACUE) based on the framework of evolutionary algorithm was proposed. Computing such conformations, essential to associate structural and functional information with gene sequences, is challenging due to the high-dimensionality and rugged energy surface of the protein conformational space. As a consequence, the dimension of protein conformational space should be reduced to a proper level. In this paper, the high-dimensionality original conformational space was converted into feature space whose dimension is considerably reduced by feature extraction technique. And, the underestimate space could be constructed according to abstract convex theory. Thus, the entropy effect caused by searching in the high-dimensionality conformational space could be avoided through such conversion. The tight lower bound estimate information was obtained to guide the searching direction, and the invalid searching area in which the global optimal solution is not located could be eliminated in advance. Moreover, instead of expensively calculating the energy of conformations in the original conformational space, the estimate value is employed to judge if the conformation is worth exploring to reduce the evaluation time, thereby making computational cost lower and the searching process more efficient. Additionally, fragment assembly and the Monte Carlo method are combined to generate a series of metastable conformations by sampling in the conformational space. The proposed method provides a novel technique to solve the searching problem of protein conformational space. Twenty small-to-medium structurally diverse proteins were tested, and the proposed ACUE method was compared with It Fix, HEA, Rosetta and the developed method LEDE without underestimate information. 
Test results show that the ACUE method can more rapidly and more efficiently obtain the near-native protein structure.
JAIL: a structure-based interface library for macromolecules.
Günther, Stefan; von Eichborn, Joachim; May, Patrick; Preissner, Robert
2009-01-01
The increasing number of solved macromolecules provides a solid number of 3D interfaces, if all types of molecular contacts are being considered. JAIL annotates three different kinds of macromolecular interfaces, those between interacting protein domains, interfaces of different protein chains and interfaces between proteins and nucleic acids. This results in a total number of about 184,000 database entries. All the interfaces can easily be identified by a detailed search form or by a hierarchical tree that describes the protein domain architectures classified by the SCOP database. Visual inspection of the interfaces is possible via an interactive protein viewer. Furthermore, large scale analyses are supported by an implemented sequential and by a structural clustering. Similar interfaces as well as non-redundant interfaces can be easily picked out. Additionally, the sequential conservation of binding sites was also included in the database and is retrievable via Jmol. A comprehensive download section allows the composition of representative data sets with user defined parameters. The huge data set in combination with various search options allow a comprehensive view on all interfaces between macromolecules included in the Protein Data Bank (PDB). The download of the data sets supports numerous further investigations in macromolecular recognition. JAIL is publicly available at http://bioinformatics.charite.de/jail.
Electrochemistry-Assisted Top-Down Characterization of Disulfide-Containing Proteins
Zhang, Yun; Cui, Weidong; Zhang, Hao; Dewald, Howard D.; Chen, Hao
2013-01-01
Covalent disulfide bond linkage in a protein represents an important challenge for mass spectrometry (MS)-based top-down protein structure analysis as it reduces the backbone cleavage efficiency for MS/MS dissociation. This study presents a strategy for solving this critical issue via integrating electrochemistry (EC) online with a top-down MS approach. In this approach, proteins undergo electrolytic reduction in an electrochemical cell to break disulfide bonds and are then ionized online into gaseous ions for analysis by electron-capture dissociation (ECD) and collision-induced dissociation (CID). The electrochemical reduction of proteins allows the removal of disulfide bond constraints and also leads to increased charge numbers of the resulting protein ions. As a result, sequence coverage was significantly enhanced, as exemplified by β-lactoglobulin A (24 vs. 73 backbone cleavages before and after electrolytic reduction, respectively) and lysozyme (5 vs. 66 backbone cleavages before and after electrolytic reduction, respectively). This methodology is fast and does not need chemical reductants, which would have an important impact in high-throughput proteomics research. PMID:22448817
Electrochemistry-assisted top-down characterization of disulfide-containing proteins.
Zhang, Yun; Cui, Weidong; Zhang, Hao; Dewald, Howard D; Chen, Hao
2012-04-17
Covalent disulfide bond linkage in a protein represents an important challenge for mass spectrometry (MS)-based top-down protein structure analysis as it reduces the backbone cleavage efficiency for MS/MS dissociation. This study presents a strategy for solving this critical issue via integrating electrochemistry (EC) online with a top-down MS approach. In this approach, proteins undergo electrolytic reduction in an electrochemical cell to break disulfide bonds and then undergo online ionization into gaseous ions for analysis by electron-capture dissociation (ECD) and collision-induced dissociation (CID). The electrochemical reduction of proteins allows one to remove disulfide bond constraints and also leads to increased charge numbers of the resulting protein ions. As a result, sequence coverage was significantly enhanced, as exemplified by β-lactoglobulin A (24 vs 75 backbone cleavages before and after electrolytic reduction, respectively) and lysozyme (5 vs 66 backbone cleavages before and after electrolytic reduction, respectively). This methodology is fast and does not need chemical reductants, which would have an important impact in high-throughput proteomics research.
Crystal structure of a macrophage migration inhibitory factor from Giardia lamblia
DOE Office of Scientific and Technical Information (OSTI.GOV)
Buchko, Garry W.; Abendroth, Jan; Robinson, Howard
2013-06-15
Macrophage migration inhibitory factor (MIF) is a eukaryotic cytokine that affects a broad spectrum of immune responses and its activation/inactivation is associated with numerous diseases. During protozoan infections MIF is not only expressed by the host but has also been observed to be expressed by some parasites and released into the host. To better understand the biological role of parasitic MIF proteins, the crystal structure of the MIF protein from Giardia lamblia (Gl-MIF), the etiological agent responsible for giardiasis, has been determined at 2.30 Å resolution. The 114-residue protein adopts an α/β fold consisting of a four-stranded β-sheet with two anti-parallel α-helices packed against a face of the β-sheet. An additional short β-strand aligns anti-parallel to β4 of the β-sheet in the adjacent protein unit to help stabilize a trimer, the biologically relevant unit observed in all solved MIF crystal structures to date, and form a discontinuous β-barrel. The structure of Gl-MIF is compared to the MIF structures from humans (Hs-MIF) and three Plasmodium species (falciparum, berghei, and yoelii). The structures of all five MIF proteins are generally similar with the exception of a channel that runs through the center of each trimer complex. Relative to Hs-MIF, there are differences in solvent accessibility and electrostatic potential distribution in the channel of Gl-MIF and the Plasmodium-MIFs due primarily to two “gate-keeper” residues in the parasitic MIFs. For the Plasmodium MIFs the gate-keeper residues are at positions 44 (Y⇒R) and 100 (V⇒D) and for Gl-MIF it is at position 100 (V⇒R). If these gate-keeper residues have a biological function and contribute to the progression of parasitemia they may also form the basis for structure-based drug design targeting parasitic MIF proteins.
Abendroth, Jan; Robinson, Howard; Zhang, Yanfeng; Hewitt, Stephen N.; Edwards, Thomas E.; Van Voorhis, Wesley C.; Myler, Peter J.
2013-01-01
Macrophage migration inhibitory factor (MIF) is a eukaryotic cytokine that affects a broad spectrum of immune responses and its activation/inactivation is associated with numerous diseases. During protozoan infections MIF is not only expressed by the host but has also been observed to be expressed by some parasites and released into the host. To better understand the biological role of parasitic MIF proteins, the crystal structure of the MIF protein from Giardia lamblia (Gl-MIF), the etiological agent responsible for giardiasis, has been determined at 2.30 Å resolution. The 114-residue protein adopts an α/β fold consisting of a four-stranded β-sheet with two anti-parallel α-helices packed against a face of the β-sheet. An additional short β-strand aligns anti-parallel to β4 of the β-sheet in the adjacent protein unit to help stabilize a trimer, the biologically relevant unit observed in all solved MIF crystal structures to date, and form a discontinuous β-barrel. The structure of Gl-MIF is compared to the MIF structures from humans (Hs-MIF) and three Plasmodium species (falciparum, berghei, and yoelii). The structures of all five MIF proteins are generally similar with the exception of a channel that runs through the center of each trimer complex. Relative to Hs-MIF, there are differences in solvent accessibility and electrostatic potential distribution in the channel of Gl-MIF and the Plasmodium-MIFs due primarily to two “gate-keeper” residues in the parasitic MIFs. For the Plasmodium MIFs the gate-keeper residues are at positions 44 (Y⇒R) and 100 (V⇒D) and for Gl-MIF it is at position 100 (V⇒R). If these gate-keeper residues have a biological function and contribute to the progression of parasitemia they may also form the basis for structure-based drug design targeting parasitic MIF proteins. PMID:23709284
Variability of Protein Structure Models from Electron Microscopy.
Monroe, Lyman; Terashi, Genki; Kihara, Daisuke
2017-04-04
An increasing number of biomolecular structures are solved by electron microscopy (EM). However, the quality of structure models determined from EM maps varies substantially. To understand to what extent structure models are supported by information embedded in EM maps, we used two computational structure refinement methods to examine how much structures can be refined using a dataset of 49 maps with accompanying structure models. The extent of structure modification as well as the disagreement between refinement models produced by the two computational methods scaled inversely with the global and the local map resolutions. A general quantitative estimation of deviations of structures for particular map resolutions is provided. Our results indicate that the observed discrepancy between the deposited map and the refined models is due to the lack of structural information present in EM maps and thus these annotations must be used with caution for further applications. Copyright © 2017 Elsevier Ltd. All rights reserved.
Romo, Tod D.; Grossfield, Alan; Pitman, Michael C.
2010-01-01
The recently solved crystallographic structures for the A2A adenosine receptor and the β1 and β2 adrenergic receptors have shown important differences between members of the class-A G-protein-coupled receptors and their archetypal model, rhodopsin, such as the apparent breaking of the ionic lock that stabilizes the inactive structure. Here, we characterize a 1.02 μs all-atom simulation of an apo-β2 adrenergic receptor that is missing the third intracellular loop to better understand the inactive structure. Although we find that the structure is remarkably rigid, there is a rapid influx of water into the core of the protein, as well as a slight expansion of the molecule relative to the crystal structure. In contrast to the X-ray crystal structures, the ionic lock rapidly reforms, although we see an activation-precursor-like event wherein the ionic lock opens for ∼200 ns, accompanied by movements in the transmembrane helices associated with activation. When the lock reforms, we see the structure return to its inactive conformation. We also find that the ionic lock exists in three states: closed (or locked), semi-open with a bridging water molecule, and open. The interconversion of these states involves the concerted motion of the entire protein. We characterize these states and the concerted motion underlying their interconversion. These findings may help elucidate the connection between key local events and the associated global structural changes during activation. PMID:20074514
Double-flow focused liquid injector for efficient serial femtosecond crystallography
DOE Office of Scientific and Technical Information (OSTI.GOV)
Oberthuer, Dominik; Knoška, Juraj; Wiedorn, Max O.
Serial femtosecond crystallography requires reliable and efficient delivery of fresh crystals across the beam of an X-ray free-electron laser over the course of an experiment. We introduce a double-flow focusing nozzle to meet this challenge, with significantly reduced sample consumption, while improving jet stability over previous generations of nozzles. We demonstrate its use to determine the first room-temperature structure of RNA polymerase II at high resolution, revealing new structural details. Furthermore, the double flow-focusing nozzles were successfully tested with three other protein samples and the first room temperature structure of an extradiol ring-cleaving dioxygenase was solved by utilizing the improved operation and characteristics of these devices.
Double-flow focused liquid injector for efficient serial femtosecond crystallography
Oberthuer, Dominik; Knoška, Juraj; Wiedorn, Max O.; Beyerlein, Kenneth R.; Bushnell, David A.; Kovaleva, Elena G.; Heymann, Michael; Gumprecht, Lars; Kirian, Richard A.; Barty, Anton; Mariani, Valerio; Tolstikova, Aleksandra; Adriano, Luigi; Awel, Salah; Barthelmess, Miriam; Dörner, Katerina; Xavier, P. Lourdu; Yefanov, Oleksandr; James, Daniel R.; Nelson, Garrett; Wang, Dingjie; Calvey, George; Chen, Yujie; Schmidt, Andrea; Szczepek, Michael; Frielingsdorf, Stefan; Lenz, Oliver; Snell, Edward; Robinson, Philip J.; Šarler, Božidar; Belšak, Grega; Maček, Marjan; Wilde, Fabian; Aquila, Andrew; Boutet, Sébastien; Liang, Mengning; Hunter, Mark S.; Scheerer, Patrick; Lipscomb, John D.; Weierstall, Uwe; Kornberg, Roger D.; Spence, John C. H.; Pollack, Lois; Chapman, Henry N.; Bajt, Saša
2017-01-01
Serial femtosecond crystallography requires reliable and efficient delivery of fresh crystals across the beam of an X-ray free-electron laser over the course of an experiment. We introduce a double-flow focusing nozzle to meet this challenge, with significantly reduced sample consumption, while improving jet stability over previous generations of nozzles. We demonstrate its use to determine the first room-temperature structure of RNA polymerase II at high resolution, revealing new structural details. Moreover, the double flow-focusing nozzles were successfully tested with three other protein samples and the first room temperature structure of an extradiol ring-cleaving dioxygenase was solved by utilizing the improved operation and characteristics of these devices. PMID:28300169
Double-flow focused liquid injector for efficient serial femtosecond crystallography
Oberthuer, Dominik; Knoška, Juraj; Wiedorn, Max O.; ...
2017-03-16
Serial femtosecond crystallography requires reliable and efficient delivery of fresh crystals across the beam of an X-ray free-electron laser over the course of an experiment. We introduce a double-flow focusing nozzle to meet this challenge, with significantly reduced sample consumption, while improving jet stability over previous generations of nozzles. We demonstrate its use to determine the first room-temperature structure of RNA polymerase II at high resolution, revealing new structural details. Furthermore, the double flow-focusing nozzles were successfully tested with three other protein samples and the first room temperature structure of an extradiol ring-cleaving dioxygenase was solved by utilizing the improved operation and characteristics of these devices.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lorenzini, Emily; Singer, Alexander; Singh, Bhag
2010-07-28
Comparative genomic studies have identified many proteins that are found only in various Chlamydiae species and exhibit no significant sequence similarity to any protein in organisms that do not belong to this group. The CT670 protein of Chlamydia trachomatis is one of the proteins whose genes are in one of the type III secretion gene clusters but whose cellular functions are not known. CT670 shares several characteristics with the YscO protein of Yersinia pestis, including the neighboring genes, size, charge, and secondary structure, but the structures and/or functions of these proteins remain to be determined. Although a BLAST search with CT670 did not identify YscO as a related protein, our analysis indicated that these two proteins exhibit significant sequence similarity. In this paper, we report that the CT670 crystal structure, solved at a resolution of 2 Å, consists of a single coiled coil containing just two long helices. Gel filtration and analytical ultracentrifugation studies showed that in solution CT670 exists in both monomeric and dimeric forms and that the monomer predominates at lower protein concentrations. We examined the interaction of CT670 with many type III secretion system-related proteins (viz., CT091, CT665, CT666, CT667, CT668, CT669, CT671, CT672, and CT673) by performing bacterial two-hybrid assays. In these experiments, CT670 was found to interact only with the CT671 protein (YscP homolog), whose gene is immediately downstream of ct670. A specific interaction between CT670 and CT671 was also observed when affinity chromatography pull-down experiments were performed. These results suggest that CT670 and CT671 are putative homologs of the YscO and YscP proteins, respectively, and that they likely form a chaperone-effector pair.
Crystal structure and functional characterization of SF216 from Shigella flexneri.
Kim, Ha-Neul; Seok, Seung-Hyeon; Lee, Yoo-Sup; Won, Hyung-Sik; Seo, Min-Duk
2017-11-01
Shigella flexneri is a Gram-negative anaerobic bacterium that causes highly infectious bacterial dysentery in humans. Here, we solved the crystal structure of SF216, a hypothetical protein from the S. flexneri 5a strain M90T, at 1.7 Å resolution. The crystal structure of SF216 represents a homotrimer stabilized by intersubunit interactions and ion-mediated electrostatic interactions. Each subunit consists of three β-strands and five α-helices with the β-β-β-α-α-α-α-α topology. Based on the structural information, we also demonstrate that SF216 shows weak ribonuclease activity by a fluorescence quenching assay. Furthermore, we identify potential druggable pockets (putative hot spots) on the surface of the SF216 structure by computational mapping. © 2017 Federation of European Biochemical Societies.
Pietrocola, Giampiero; Arciola, Carla Renata; Rindi, Simonetta; Montanaro, Lucio; Speziale, Pietro
2018-01-01
Group B Streptococcus (GBS) remains an important etiological agent of several infectious diseases including neonatal septicemia, pneumonia, meningitis, and orthopedic device infections. This pathogenicity is due to a variety of virulence factors expressed by Streptococcus agalactiae. Single virulence factors are not sufficient to provoke a streptococcal infection, which is instead promoted by the coordinated activity of several pathogenicity factors. Such determinants, mostly cell wall-associated and secreted proteins, include adhesins that mediate binding of the pathogen to host extracellular matrix/plasma ligands and cell surfaces, proteins that cooperate in the invasion of and survival within host cells and factors that neutralize phagocytosis and/or modulate the immune response. The genome-based approaches and bioinformatics tools and the extensive use of biophysical and biochemical methods and animal model studies have provided a great wealth of information on the molecular structure and function of these virulence factors. In fact, a number of new GBS surface-exposed or secreted proteins have been identified (GBS immunogenic bacterial adhesion protein, leucine-rich repeat of GBS, serine-rich repeat proteins), the three-dimensional structures of known streptococcal proteins (αC protein, C5a peptidase) have been solved and an understanding of the pathogenetic role of “old” and new determinants has been better defined in recent years. Herein, we provide an update of our current understanding of the major surface cell wall-anchored proteins from GBS, with emphasis on their biochemical and structural properties and the pathogenetic roles they may have in the onset and progression of host infection. We also focus on the antigenic profile of these compounds and discuss them as targets for therapeutic intervention. PMID:29686667
Bernardes, Natalia E; Takeda, Agnes A S; Dreyer, Thiago R; Freitas, Fernanda Z; Bertolini, Maria Célia; Fontes, Marcos R M
2015-01-01
Neurospora crassa is a filamentous fungus that has been extensively studied as a model organism for eukaryotic biology, providing fundamental insights into cellular processes such as cell signaling, growth and differentiation. To advance the study of this multicellular organism, an understanding of the specific mechanisms for protein transport into the cell nucleus is essential. Importin-α (Imp-α) is the receptor for cargo proteins that contain specific nuclear localization signals (NLSs) that play a key role in the classical nuclear import pathway. Structures of Imp-α from different organisms (yeast, rice, mouse, and human) have been determined, revealing that this receptor possesses a conserved structural scaffold. However, recent studies have demonstrated that the Imp-α mechanism of action may vary significantly for different organisms or for different isoforms from the same organism. Therefore, structural, functional, and biophysical characterization of different Imp-α proteins is necessary to understand the selectivity of nuclear transport. Here, we determined the first crystal structure of an Imp-α from a filamentous fungus, which is also the highest resolution Imp-α structure solved to date (1.75 Å). In addition, we performed calorimetric analysis to determine the affinity and thermodynamic parameters of the interaction between Imp-α and the classical SV40 NLS peptide. The comparison of these data with previous studies on Imp-α proteins led us to demonstrate that N. crassa Imp-α possesses specific features that are distinct from mammalian Imp-α but exhibits important similarities to rice Imp-α, particularly at the minor NLS binding site.
Ginn, Helen M.; Messerschmidt, Marc; Ji, Xiaoyun; ...
2015-03-09
The X-ray free-electron laser (XFEL) allows the analysis of small weakly diffracting protein crystals, but has required very many crystals to obtain good data. Here we use an XFEL to determine the room temperature atomic structure for the smallest cytoplasmic polyhedrosis virus polyhedra yet characterized, which we failed to solve at a synchrotron. These protein microcrystals, roughly a micron across, accrue within infected cells. We use a new physical model for XFEL diffraction, which better estimates the experimental signal, delivering a high-resolution XFEL structure (1.75 Å), using fewer crystals than previously required for this resolution. The crystal lattice and protein core are conserved compared with a polyhedrin with less than 10% sequence identity. We explain how the conserved biological phenotype, the crystal lattice, is maintained in the face of extreme environmental challenge and massive evolutionary divergence. Our improved methods should open up more challenging biological samples to XFEL analysis.
Dissecting the telomere–inner nuclear membrane interface formed in meiosis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pendlebury, Devon F.; Fujiwara, Yasuhiro; Tesmer, Valerie M.
Tethering telomeres to the inner nuclear membrane (INM) allows homologous chromosome pairing during meiosis. The meiosis-specific protein TERB1 binds the telomeric protein TRF1 to establish telomere–INM connectivity and is essential for mouse fertility. Here we solve the structure of the human TRF1–TERB1 interface to reveal the structural basis for telomere–INM linkage. Disruption of this interface abrogates binding and compromises telomere–INM attachment in mice. An embedded CDK-phosphorylation site within the TRF1-binding region of TERB1 provides a mechanism for cap exchange, a late-pachytene phenomenon involving the dissociation of the TRF1–TERB1 complex. Indeed, further strengthening this interaction interferes with cap exchange. Finally, our biochemical analysis implicates distinct complexes for telomere–INM tethering and chromosome-end protection during meiosis. Our studies unravel the structure, stoichiometry, and physiological implications underlying telomere–INM tethering, thereby providing unprecedented insights into the unique function of telomeres in meiosis.
Miller, Ona K; Potter, Jane A; Vijayakrishnan, Swetha; Bhella, David; Naismith, James H; Elliott, Richard M
2017-01-01
Rift Valley fever phlebovirus (RVFV) is a clinically and economically important pathogen increasingly likely to cause widespread epidemics. RVFV virulence depends on the interferon antagonist non-structural protein (NSs), which remains poorly characterized. We identified a stable core domain of RVFV NSs (residues 83–248), and solved its crystal structure, a novel all-helical fold organized into highly ordered fibrils. A hallmark of RVFV pathology is NSs filament formation in infected cell nuclei. Recombinant virus encoding the NSs core domain induced intranuclear filaments, suggesting it contains all essential determinants for nuclear translocation and filament formation. Mutations of key crystal fibril interface residues in viruses encoding full-length NSs completely abrogated intranuclear filament formation in infected cells. We propose the fibrillar arrangement of the NSs core domain in crystals reveals the molecular basis of assembly of this key virulence factor in cell nuclei. Our findings have important implications for fundamental understanding of RVFV virulence. PMID:28915104
Barski, Michal; Brennan, Benjamin; Miller, Ona K; Potter, Jane A; Vijayakrishnan, Swetha; Bhella, David; Naismith, James H; Elliott, Richard M; Schwarz-Linek, Ulrich
2017-09-15
Rift Valley fever phlebovirus (RVFV) is a clinically and economically important pathogen increasingly likely to cause widespread epidemics. RVFV virulence depends on the interferon antagonist non-structural protein (NSs), which remains poorly characterized. We identified a stable core domain of RVFV NSs (residues 83-248), and solved its crystal structure, a novel all-helical fold organized into highly ordered fibrils. A hallmark of RVFV pathology is NSs filament formation in infected cell nuclei. Recombinant virus encoding the NSs core domain induced intranuclear filaments, suggesting it contains all essential determinants for nuclear translocation and filament formation. Mutations of key crystal fibril interface residues in viruses encoding full-length NSs completely abrogated intranuclear filament formation in infected cells. We propose the fibrillar arrangement of the NSs core domain in crystals reveals the molecular basis of assembly of this key virulence factor in cell nuclei. Our findings have important implications for fundamental understanding of RVFV virulence.
PDBe: towards reusable data delivery infrastructure at protein data bank in Europe.
Mir, Saqib; Alhroub, Younes; Anyango, Stephen; Armstrong, David R; Berrisford, John M; Clark, Alice R; Conroy, Matthew J; Dana, Jose M; Deshpande, Mandar; Gupta, Deepti; Gutmanas, Aleksandras; Haslam, Pauline; Mak, Lora; Mukhopadhyay, Abhik; Nadzirin, Nurul; Paysan-Lafosse, Typhaine; Sehnal, David; Sen, Sanchayita; Smart, Oliver S; Varadi, Mihaly; Kleywegt, Gerard J; Velankar, Sameer
2018-01-04
The Protein Data Bank in Europe (PDBe, pdbe.org) is actively engaged in the deposition, annotation, remediation, enrichment and dissemination of macromolecular structure data. This paper describes new developments and improvements at PDBe addressing three challenging areas: data enrichment, data dissemination and functional reusability. New features of the PDBe Web site are discussed, including a context dependent menu providing links to raw experimental data and improved presentation of structures solved by hybrid methods. The paper also summarizes the features of the LiteMol suite, which is a set of services enabling fast and interactive 3D visualization of structures, with associated experimental maps, annotations and quality assessment information. We introduce a library of Web components which can be easily reused to port data and functionality available at PDBe to other services. We also introduce updates to the SIFTS resource which maps PDB data to other bioinformatics resources, and the PDBe REST API. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Active Site Sharing and Subterminal Hairpin Recognition in a New Class of DNA Transposases
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ronning, Donald R.; Guynet, Catherine; Ton-Hoang, Bao
2010-07-20
Many bacteria harbor simple transposable elements termed insertion sequences (IS). In Helicobacter pylori, the chimeric IS605 family elements are particularly interesting due to their proximity to genes encoding gastric epithelial invasion factors. Protein sequences of IS605 transposases do not bear the hallmarks of other well-characterized transposases. We have solved the crystal structure of full-length transposase (TnpA) of a representative member, ISHp608. Structurally, TnpA does not resemble any characterized transposase; rather, it is related to rolling circle replication (RCR) proteins. Consistent with RCR, Mg²⁺ and a conserved tyrosine, Tyr127, are essential for DNA nicking and the formation of a covalent intermediate between TnpA and DNA. TnpA is dimeric, contains two shared active sites, and binds two DNA stem loops representing the conserved inverted repeats near each end of ISHp608. The cocrystal structure with stem-loop DNA illustrates how this family of transposases specifically recognizes and pairs ends, necessary steps during transposition.
Strategies for carbohydrate model building, refinement and validation.
Agirre, Jon
2017-02-01
Sugars are the most stereochemically intricate family of biomolecules and present substantial challenges to anyone trying to understand their nomenclature, reactions or branched structures. Current crystallographic programs provide an abstraction layer allowing inexpert structural biologists to build complete protein or nucleic acid model components automatically either from scratch or with little manual intervention. This is, however, still not generally true for sugars. The need for carbohydrate-specific building and validation tools has been highlighted a number of times in the past, concomitantly with the introduction of a new generation of experimental methods that have been ramping up the production of protein-sugar complexes and glycoproteins for the past decade. While some incipient advances have been made to address these demands, correctly modelling and refining carbohydrates remains a challenge. This article will address many of the typical difficulties that a structural biologist may face when dealing with carbohydrates, with an emphasis on problem solving in the resolution range where X-ray crystallography and cryo-electron microscopy are expected to overlap in the next decade.
Structural engineering of a phage lysin that targets Gram-negative pathogens
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lukacik, Petra; Barnard, Travis J.; Keller, Paul W.
Bacterial pathogens are becoming increasingly resistant to antibiotics. As an alternative therapeutic strategy, phage therapy reagents containing purified viral lysins have been developed against Gram-positive organisms but not against Gram-negative organisms due to the inability of these types of drugs to cross the bacterial outer membrane. We solved the crystal structures of a Yersinia pestis outer membrane transporter called FyuA and a bacterial toxin called pesticin that targets this transporter. FyuA is a β-barrel membrane protein belonging to the family of TonB-dependent transporters, whereas pesticin is a soluble protein with two domains, one that binds to FyuA and another that is structurally similar to phage T4 lysozyme. The structure of pesticin allowed us to design a phage therapy reagent comprised of the FyuA binding domain of pesticin fused to the N-terminus of T4 lysozyme. This hybrid toxin kills specific Yersinia and pathogenic E. coli strains and, importantly, can evade the pesticin immunity protein (Pim), giving it a distinct advantage over pesticin. Furthermore, because FyuA is required for virulence and is more common in pathogenic bacteria, the hybrid toxin also has the advantage of targeting primarily disease-causing bacteria rather than indiscriminately eliminating natural gut flora.
Albert, Armando; Yunta, Cristina; Arranz, Rocío; Peña, Álvaro; Salido, Eduardo; Valpuesta, José María; Martín-Benito, Jaime
2010-01-01
Primary hyperoxaluria type 1 is a rare autosomal recessive disease caused by mutations in the alanine glyoxylate aminotransferase gene (AGXT). We have previously shown that the P11L and I340M polymorphisms together with the I244T mutation (AGXT-LTM) represent a conformational disease that could be amenable to pharmacological intervention. Thus, the study of the folding mechanism of AGXT is crucial to understand the molecular basis of the disease. Here, we provide biochemical and structural data showing that AGXT-LTM is able to form non-native folding intermediates. The three-dimensional structure of a complex between the bacterial chaperonin GroEL and a folding intermediate of the AGXT-LTM mutant has been solved by cryoelectron microscopy. The electron density map shows the protein substrate in a non-native extended conformation that crosses the GroEL central cavity. Addition of ATP to the complex induces conformational changes in the chaperonin and the internalization of the protein substrate into the folding cavity. The structure provides a three-dimensional picture of an in vivo early ATP-dependent step of the folding reaction cycle of the chaperonin and supports a GroEL functional model in which the chaperonin promotes folding of the AGXT-LTM mutant protein through a forced unfolding mechanism. PMID:20056599
Albert, Armando; Yunta, Cristina; Arranz, Rocío; Peña, Alvaro; Salido, Eduardo; Valpuesta, José María; Martín-Benito, Jaime
2010-02-26
Primary hyperoxaluria type 1 is a rare autosomal recessive disease caused by mutations in the alanine glyoxylate aminotransferase gene (AGXT). We have previously shown that the P11L and I340M polymorphisms together with the I244T mutation (AGXT-LTM) represent a conformational disease that could be amenable to pharmacological intervention. Thus, the study of the folding mechanism of AGXT is crucial to understand the molecular basis of the disease. Here, we provide biochemical and structural data showing that AGXT-LTM is able to form non-native folding intermediates. The three-dimensional structure of a complex between the bacterial chaperonin GroEL and a folding intermediate of the AGXT-LTM mutant has been solved by cryoelectron microscopy. The electron density map shows the protein substrate in a non-native extended conformation that crosses the GroEL central cavity. Addition of ATP to the complex induces conformational changes in the chaperonin and the internalization of the protein substrate into the folding cavity. The structure provides a three-dimensional picture of an in vivo early ATP-dependent step of the folding reaction cycle of the chaperonin and supports a GroEL functional model in which the chaperonin promotes folding of the AGXT-LTM mutant protein through a forced unfolding mechanism.
Kurpiewska, Katarzyna; Font, Josep; Ribó, Marc; Vilanova, Maria; Lewiński, Krzysztof
2009-11-15
To investigate the structural origin of decreased pressure and temperature stability, the crystal structures of the bovine pancreatic ribonuclease A variants V47A, V54A, V57A, I81A, I106A, and V108A were solved at 1.4-2.0 Å resolution and compared with the structure of the wild-type protein. The introduced mutations had only a minor influence on the global structure of ribonuclease A. The structural changes had an individual character that depended on the location of the mutated residue; however, they seemed to spread from the mutation site to the rest of the structure. Several different parameters were evaluated to find a correlation with the decrease in free energy of unfolding, ΔΔG(T), and the most significant correlation was found for the change in main cavity volume. Analysis of the difference distance matrices revealed that the ribonuclease A molecule is organized into five relatively rigid subdomains, each with an individual response to mutation. This behavior, consistent with the results of unfolding experiments, is an intrinsic feature of ribonuclease A that might be a surviving remnant of folding intermediates and reflects the dynamic nature of the molecule. © 2009 Wiley-Liss, Inc.
Solution structure of the Legionella pneumophila Mip-rapamycin complex.
Ceymann, Andreas; Horstmann, Martin; Ehses, Philipp; Schweimer, Kristian; Paschke, Anne-Katrin; Steinert, Michael; Faber, Cornelius
2008-03-17
Legionella pneumophila is the causative agent of Legionnaires' disease. A major virulence factor of the pathogen is the homodimeric surface protein Mip. It shows peptidyl-prolyl cis/trans isomerase activity and is a receptor of FK506 and rapamycin, which both inhibit its enzymatic function. Insight into the binding process may be used for the design of novel Mip inhibitors as potential drugs against Legionnaires' disease. We have solved the solution structure of free Mip77-213 and the Mip77-213-rapamycin complex by NMR spectroscopy. Mip77-213 showed the typical FKBP-fold and only minor rearrangements upon binding of rapamycin. Apart from the configuration of a flexible hairpin loop, which is partly stabilized upon binding, the solution structure confirms the crystal structure. Comparisons to the structures of free FKBP12 and the FKBP12-rapamycin complex suggested an identical binding mode for both proteins. The structural similarity of the Mip-rapamycin and FKBP12-rapamycin complexes suggests that FKBP12 ligands may be promising starting points for the design of novel Mip inhibitors. The search for a novel drug against Legionnaires' disease may therefore benefit from the large variety of known FKBP12 inhibitors.
Solution structure of the Legionella pneumophila Mip-rapamycin complex
Ceymann, Andreas; Horstmann, Martin; Ehses, Philipp; Schweimer, Kristian; Paschke, Anne-Katrin; Steinert, Michael; Faber, Cornelius
2008-01-01
Background Legionella pneumophila is the causative agent of Legionnaires' disease. A major virulence factor of the pathogen is the homodimeric surface protein Mip. It shows peptidyl-prolyl cis/trans isomerase activity and is a receptor of FK506 and rapamycin, which both inhibit its enzymatic function. Insight into the binding process may be used for the design of novel Mip inhibitors as potential drugs against Legionnaires' disease. Results We have solved the solution structure of free Mip77–213 and the Mip77–213-rapamycin complex by NMR spectroscopy. Mip77–213 showed the typical FKBP-fold and only minor rearrangements upon binding of rapamycin. Apart from the configuration of a flexible hairpin loop, which is partly stabilized upon binding, the solution structure confirms the crystal structure. Comparisons to the structures of free FKBP12 and the FKBP12-rapamycin complex suggested an identical binding mode for both proteins. Conclusion The structural similarity of the Mip-rapamycin and FKBP12-rapamycin complexes suggests that FKBP12 ligands may be promising starting points for the design of novel Mip inhibitors. The search for a novel drug against Legionnaires' disease may therefore benefit from the large variety of known FKBP12 inhibitors. PMID:18366641
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhao, Baoguang; Smallwood, Angela; Yang, Jingsong
2008-10-24
VX-680, also known as MK-0457, is an ATP-competitive small molecule inhibitor of the Aurora kinases that has entered phase II clinical trials for the treatment of cancer. We have solved the cocrystal structure of AurA/TPX2/VX-680 at 2.3 Å resolution. In the crystal structure, VX-680 binds to the active conformation of AurA. The glycine-rich loop in AurA adopts a unique bent conformation, forming a π-π interaction with the phenyl group of VX-680. In contrast, in the published AurA/VX-680 structure, VX-680 binds to AurA in the inactive conformation, interacting with a hydrophobic pocket only present in the inactive conformation. These data suggest that TPX2, a protein cofactor, can alter the binding mode of VX-680 with AurA. More generally, the presence of physiologically relevant cofactor proteins can alter the kinetics, binding interactions, and inhibition of enzymes, and studies with these multiprotein complexes may be beneficial to the discovery and optimization of enzyme inhibitors as therapeutic agents.
Systematic size study of an insect antifreeze protein and its interaction with ice.
Liu, Kai; Jia, Zongchao; Chen, Guangju; Tung, Chenho; Liu, Ruozhuang
2005-02-01
Because of their remarkable ability to depress the freezing point of aqueous solutions, antifreeze proteins (AFPs) play a critical role in helping many organisms survive subzero temperatures. The beta-helical insect AFP structures solved to date, consisting of multiple repeating circular loops or coils, are perhaps the most regular protein structures discovered thus far. Taking an exceptional advantage of the unusually high structural regularity of insect AFPs, we have employed both semiempirical and quantum mechanics computational approaches to systematically investigate the relationship between the number of AFP coils and the AFP-ice interaction energy, an indicator of antifreeze activity. We generated a series of AFP models with varying numbers of 12-residue coils (sequence TCTxSxxCxxAx) and calculated their interaction energies with ice. Using several independent computational methods, we found that the AFP-ice interaction energy increased as the number of coils increased, until an upper bound was reached. The increase of interaction energy was significant for each of the first five coils, and there was a clear synergism that gradually diminished and even decreased with further increase of the number of coils. Our results are in excellent agreement with the recently reported experimental observations.
Systematic Size Study of an Insect Antifreeze Protein and Its Interaction with Ice
Liu, Kai; Jia, Zongchao; Chen, Guangju; Tung, Chenho; Liu, Ruozhuang
2005-01-01
Because of their remarkable ability to depress the freezing point of aqueous solutions, antifreeze proteins (AFPs) play a critical role in helping many organisms survive subzero temperatures. The β-helical insect AFP structures solved to date, consisting of multiple repeating circular loops or coils, are perhaps the most regular protein structures discovered thus far. Taking an exceptional advantage of the unusually high structural regularity of insect AFPs, we have employed both semiempirical and quantum mechanics computational approaches to systematically investigate the relationship between the number of AFP coils and the AFP-ice interaction energy, an indicator of antifreeze activity. We generated a series of AFP models with varying numbers of 12-residue coils (sequence TCTxSxxCxxAx) and calculated their interaction energies with ice. Using several independent computational methods, we found that the AFP-ice interaction energy increased as the number of coils increased, until an upper bound was reached. The increase of interaction energy was significant for each of the first five coils, and there was a clear synergism that gradually diminished and even decreased with further increase of the number of coils. Our results are in excellent agreement with the recently reported experimental observations. PMID:15713600
Yoshida, Hisashi; Kawai, Fumihiro; Obayashi, Eiji; Akashi, Satoko; Roper, David I; Tame, Jeremy R H; Park, Sam-Yong
2012-10-26
Staphylococcus aureus is a widespread Gram-positive opportunistic pathogen, and a methicillin-resistant form (MRSA) is particularly difficult to treat clinically. We have solved two crystal structures of penicillin-binding protein (PBP) 3 (PBP3) from MRSA, the apo form and a complex with the β-lactam antibiotic cefotaxime, and used electrospray mass spectrometry to measure its sensitivity to a variety of penicillin derivatives. PBP3 is a class B PBP, possessing an N-terminal non-penicillin-binding domain, sometimes called a dimerization domain, and a C-terminal transpeptidase domain. The model shows a different orientation of its two domains compared to earlier models of other class B PBPs and a novel, larger N-domain. Consistent with the nomenclature of "dimerization domain", the N-terminal region forms an apparently tight interaction with a neighboring molecule related by a 2-fold symmetry axis in the crystal structure. This dimer form is predicted to be highly stable in solution by the PISA server, but mass spectrometry and analytical ultracentrifugation provide unequivocal evidence that the protein is a monomer in solution. Copyright © 2012 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sayer, Christopher; Isupov, Michail N.; Westlake, Aaron
2013-04-01
The X-ray structures of two ω-aminotransferases from P. aeruginosa and C. violaceum in complex with an inhibitor offer the first detailed insight into the structural basis of the substrate specificity of these industrially important enzymes. The crystal structures and inhibitor complexes of two industrially important ω-aminotransferase enzymes from Pseudomonas aeruginosa and Chromobacterium violaceum have been determined in order to understand the differences in their substrate specificity. The two enzymes share 30% sequence identity and use the same amino acceptor, pyruvate; however, the Pseudomonas enzyme shows activity towards the amino donor β-alanine, whilst the Chromobacterium enzyme does not. Both enzymes show activity towards S-α-methylbenzylamine (MBA), with the Chromobacterium enzyme having a broader substrate range. The crystal structure of the P. aeruginosa enzyme has been solved in the holo form and with the inhibitor gabaculine bound. The C. violaceum enzyme has been solved in the apo and holo forms and with gabaculine bound. The structures of the holo forms of both enzymes are quite similar. There is little conformational difference observed between the inhibitor complex and the holoenzyme for the P. aeruginosa aminotransferase. In comparison, the crystal structure of the C. violaceum gabaculine complex shows significant structural rearrangements from the structures of both the apo and holo forms of the enzyme. It appears that the different rigidity of the protein scaffold contributes to the substrate specificity observed for the two ω-aminotransferases.
Structure of the glucagon receptor in complex with a glucagon analogue.
Zhang, Haonan; Qiao, Anna; Yang, Linlin; Van Eps, Ned; Frederiksen, Klaus S; Yang, Dehua; Dai, Antao; Cai, Xiaoqing; Zhang, Hui; Yi, Cuiying; Cao, Can; He, Lingli; Yang, Huaiyu; Lau, Jesper; Ernst, Oliver P; Hanson, Michael A; Stevens, Raymond C; Wang, Ming-Wei; Reedtz-Runge, Steffen; Jiang, Hualiang; Zhao, Qiang; Wu, Beili
2018-01-03
Class B G-protein-coupled receptors (GPCRs), which consist of an extracellular domain (ECD) and a transmembrane domain (TMD), respond to secretin peptides to play a key part in hormonal homeostasis, and are important therapeutic targets for a variety of diseases. Previous work has suggested that peptide ligands bind to class B GPCRs according to a two-domain binding model, in which the C-terminal region of the peptide targets the ECD and the N-terminal region of the peptide binds to the TMD binding pocket. Recently, three structures of class B GPCRs in complex with peptide ligands have been solved. These structures provide essential insights into peptide ligand recognition by class B GPCRs. However, owing to resolution limitations, the specific molecular interactions for peptide binding to class B GPCRs remain ambiguous. Moreover, these previously solved structures have different ECD conformations relative to the TMD, which introduces questions regarding inter-domain conformational flexibility and the changes required for receptor activation. Here we report the 3.0 Å-resolution crystal structure of the full-length human glucagon receptor (GCGR) in complex with a glucagon analogue and partial agonist, NNC1702. This structure provides molecular details of the interactions between GCGR and the peptide ligand. It reveals a marked change in the relative orientation between the ECD and TMD of GCGR compared to the previously solved structure of the inactive GCGR-NNC0640-mAb1 complex. Notably, the stalk region and the first extracellular loop undergo major conformational changes in secondary structure during peptide binding, forming key interactions with the peptide. We further propose a dual-binding-site trigger model for GCGR activation (which requires conformational changes of the stalk, first extracellular loop and TMD) that extends our understanding of the previously established two-domain peptide-binding model of class B GPCRs.
Bargiello, Thaddeus A; Oh, Seunghoon; Tang, Qingxiu; Bargiello, Nicholas K; Dowd, Terry L; Kwon, Taekyung
2018-01-01
Voltage is an important physiologic regulator of channels formed by the connexin gene family. Connexins are unique among ion channels in that both plasma membrane inserted hemichannels (undocked hemichannels) and intercellular channels (aggregates of which form gap junctions) have important physiological roles. The hemichannel is the fundamental unit of gap junction voltage-gating. Each hemichannel displays two distinct voltage-gating mechanisms that are primarily sensitive to a voltage gradient formed along the length of the channel pore (the transjunctional voltage) rather than sensitivity to the absolute membrane potential (Vm or Vi-o). These transjunctional voltage dependent processes have been termed Vj- or fast-gating and loop- or slow-gating. Understanding the mechanism of voltage-gating, defined as the sequence of voltage-driven transitions that connect open and closed states, first and foremost requires atomic resolution models of the end states. Although ion channels formed by connexins were among the first to be characterized structurally by electron microscopy and x-ray diffraction in the early 1980s, subsequent progress has been slow. Much of the current understanding of the structure-function relations of connexin channels is based on two crystal structures of Cx26 gap junction channels. Refinement of crystal structure by all-atom molecular dynamics and incorporation of charge changing protein modifications has resulted in an atomic model of the open state that arguably corresponds to the physiologic open state. Obtaining validated atomic models of voltage-dependent closed states is more challenging, as there are currently no methods to solve protein structure while a stable voltage gradient is applied across the length of an oriented channel.
It is widely believed that the best approach to solve the atomic structure of a voltage-gated closed ion channel is to apply different but complementary experimental and computational methods and to use the resulting information to derive a consensus atomic structure that is then subjected to rigorous validation. In this paper, we summarize our efforts to obtain and validate atomic models of the open and voltage-driven closed states of undocked connexin hemichannels. This article is part of a Special Issue entitled: Gap Junction Proteins edited by Jean Claude Herve. Copyright © 2017 Elsevier B.V. All rights reserved.
Validating a Coarse-Grained Potential Energy Function through Protein Loop Modelling
MacDonald, James T.; Kelley, Lawrence A.; Freemont, Paul S.
2013-01-01
Coarse-grained (CG) methods for sampling protein conformational space have the potential to increase computational efficiency by reducing the degrees of freedom. The gain in computational efficiency of CG methods often comes at the expense of non-protein like local conformational features. This could cause problems when transitioning to full atom models in a hierarchical framework. Here, a CG potential energy function was validated by applying it to the problem of loop prediction. A novel method to sample the conformational space of backbone atoms was benchmarked using a standard test set consisting of 351 distinct loops. This method used a sequence-independent CG potential energy function representing the protein using α-carbon positions only and sampling conformations with a Monte Carlo simulated annealing based protocol. Backbone atoms were added using a method previously described and then gradient minimised in the Rosetta force field. Despite the CG potential energy function being sequence-independent, the method performed similarly to methods that explicitly use either fragments of known protein backbones with similar sequences or residue-specific φ/ψ-maps to restrict the search space. The method was also able to predict with sub-Angstrom accuracy two out of seven loops from recently solved crystal structures of proteins with low sequence and structure similarity to previously deposited structures in the PDB. The ability to sample realistic loop conformations directly from a potential energy function enables the incorporation of additional geometric restraints and the use of more advanced sampling methods in a way that is not possible to do easily with fragment replacement methods and also enables multi-scale simulations for protein design and protein structure prediction. These restraints could be derived from experimental data or could be design restraints in the case of computational protein design.
C++ source code is available for download from http://www.sbg.bio.ic.ac.uk/phyre2/PD2/. PMID:23824634
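The sampling idea summarised in the abstract above (a chain represented by α-carbon positions only, explored by Monte Carlo simulated annealing between two fixed anchor residues) can be illustrated with a toy Python sketch. This is not the authors' method or their potential: the energy function, the 3.8 Å bond target, the 4 Å soft-repulsion cutoff, the cooling schedule and every function name here are assumptions chosen only to make the Metropolis acceptance step concrete.

```python
import math
import random

CA_BOND = 3.8   # assumed target distance (Å) between consecutive alpha-carbons
CLASH = 4.0     # assumed soft-repulsion cutoff (Å) for non-adjacent beads


def energy(coords, start, end):
    """Toy energy: harmonic CA-CA bond terms plus a soft clash penalty."""
    chain = [start] + coords + [end]
    total = 0.0
    # Bond terms, including the bonds to the two fixed anchor beads.
    for a, b in zip(chain, chain[1:]):
        total += (math.dist(a, b) - CA_BOND) ** 2
    # Penalise non-adjacent beads that come closer than the cutoff.
    for i in range(len(chain)):
        for j in range(i + 2, len(chain)):
            d = math.dist(chain[i], chain[j])
            if d < CLASH:
                total += (CLASH - d) ** 2
    return total


def anneal(n_beads, start, end, steps=5000, t_start=5.0, t_end=0.01, seed=0):
    """Metropolis Monte Carlo with an exponential cooling schedule."""
    rng = random.Random(seed)
    # Start from beads spaced evenly on the line between the anchors.
    coords = [
        tuple(s + (e - s) * (k + 1) / (n_beads + 1) for s, e in zip(start, end))
        for k in range(n_beads)
    ]
    current = initial = energy(coords, start, end)
    best, best_coords = current, list(coords)
    for step in range(steps):
        temp = t_start * (t_end / t_start) ** (step / (steps - 1))
        # Propose a small Gaussian displacement of one randomly chosen bead.
        i = rng.randrange(n_beads)
        trial = list(coords)
        trial[i] = tuple(x + rng.gauss(0.0, 0.5) for x in coords[i])
        delta = energy(trial, start, end) - current
        # Accept downhill moves always, uphill moves with Boltzmann probability.
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            coords, current = trial, current + delta
            if current < best:
                best, best_coords = current, list(coords)
    return initial, best, best_coords


if __name__ == "__main__":
    e0, e_best, loop = anneal(6, (0.0, 0.0, 0.0), (10.0, 0.0, 0.0))
    print(f"initial energy {e0:.2f}, best energy {e_best:.2f}")
```

In a realistic protocol the toy energy would be replaced by a validated CG potential, and the resulting α-carbon trace would still need backbone reconstruction and all-atom minimisation, as the abstract describes.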
Structure of the N-terminal fragment of Escherichia coli Lon protease
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Mi; Basic Research Program, SAIC-Frederick, Frederick, MD 21702; Gustchina, Alla
2010-08-01
The medium-resolution structure of the N-terminal fragment of E. coli Lon protease shows that this part of the enzyme consists of two compact domains and a very long α-helix. The structure of a recombinant construct consisting of residues 1–245 of Escherichia coli Lon protease, the prototypical member of the A-type Lon family, is reported. This construct encompasses all or most of the N-terminal domain of the enzyme. The structure was solved by SeMet SAD to 2.6 Å resolution utilizing trigonal crystals that contained one molecule in the asymmetric unit. The molecule consists of two compact subdomains and a very long C-terminal α-helix. The structure of the first subdomain (residues 1–117), which consists mostly of β-strands, is similar to that of the shorter fragment previously expressed and crystallized, whereas the second subdomain is almost entirely helical. The fold and spatial relationship of the two subdomains, with the exception of the C-terminal helix, closely resemble the structure of BPP1347, a 203-amino-acid protein of unknown function from Bordetella parapertussis, and more distantly several other proteins. It was not possible to refine the structure to satisfactory convergence; however, since almost all of the Se atoms could be located on the basis of their anomalous scattering the correctness of the overall structure is not in question. The structure reported here was also compared with the structures of the putative substrate-binding domains of several proteins, showing topological similarities that should help in defining the binding sites used by Lon substrates.
Bae, Chanhyung; Anselmi, Claudio; Kalia, Jeet; Jara-Oseguera, Andres; Schwieters, Charles D; Krepkiy, Dmitriy; Won Lee, Chul; Kim, Eun-Hee; Kim, Jae Il; Faraldo-Gómez, José D; Swartz, Kenton J
2016-01-01
Venom toxins are invaluable tools for exploring the structure and mechanisms of ion channels. Here, we solve the structure of double-knot toxin (DkTx), a tarantula toxin that activates the heat-activated TRPV1 channel. We also provide improved structures of TRPV1 with and without the toxin bound, and investigate the interactions of DkTx with the channel and membranes. We find that DkTx binds to the outer edge of the external pore of TRPV1 in a counterclockwise configuration, using a limited protein-protein interface and inserting hydrophobic residues into the bilayer. We also show that DkTx partitions naturally into membranes, with the two lobes exhibiting opposing energetics for membrane partitioning and channel activation. Finally, we find that the toxin disrupts a cluster of hydrophobic residues behind the selectivity filter that are critical for channel activation. Collectively, our findings reveal a novel mode of toxin-channel recognition that has important implications for the mechanism of thermosensation. DOI: http://dx.doi.org/10.7554/eLife.11273.001 PMID:26880553
Chu, Byron C. H.; Otten, Renee; Krewulak, Karla D.; Mulder, Frans A. A.; Vogel, Hans J.
2014-01-01
The periplasmic binding protein (PBP) FepB plays a key role in transporting the catecholate siderophore ferric enterobactin from the outer to the inner membrane in Gram-negative bacteria. The solution structures of the 34-kDa apo- and holo-FepB from Escherichia coli, solved by NMR, represent the first solution structures determined for the type III class of PBPs. Unlike type I and II PBPs, which undergo large “Venus flytrap” conformational changes upon ligand binding, both forms of FepB maintain similar overall folds; however, binding of the ligand is accompanied by significant loop movements. Reverse methyl cross-saturation experiments corroborated chemical shift perturbation results and uniquely defined the binding pocket for gallium enterobactin (GaEnt). NMR relaxation experiments indicated that a flexible loop (residues 225–250) adopted a more rigid and extended conformation upon ligand binding, which positioned residues for optimal interactions with the ligand and the cytoplasmic membrane ABC transporter (FepCD), respectively. In conclusion, this work highlights the pivotal role that structural dynamics plays in ligand binding and transporter interactions in type III PBPs. PMID:25173704
Lee, Hyung Ho; Jung, Sang Taek
2013-02-01
β-N-acetylglucosaminidase (NagA) protein has chitin-degrading activity, and chitin is one of the most abundant polymers in nature. NagA contains a family 3 glycoside hydrolase (GH3)-type N-terminal domain and a unique C-terminal domain. The structurally uncharacterized C-terminal domain of NagA may be involved in substrate specificity. To provide a structural basis for the substrate specificity of NagA, structural analysis of NagA from Thermotoga maritima encoded by the Tm0809 gene was initiated. NagA from T. maritima has been overexpressed in Escherichia coli and crystallized at 296 K using ammonium sulfate as a precipitant. Crystals of T. maritima NagA diffracted to 3.80 Å resolution and belonged to the monoclinic space group C2, with unit-cell parameters a = 231.15, b = 133.62, c = 140.88 Å, β = 89.97°. The crystallization of selenomethionyl-substituted protein is in progress to solve the crystal structure of T. maritima NagA.
A Survey of Computational Intelligence Techniques in Protein Function Prediction
Tiwari, Arvind Kumar; Srivastava, Rajeev
2014-01-01
In recent years, knowledge of previously uncharacterized proteins has grown massively with the advancement of high-throughput microarray technologies. Protein function prediction is among the most challenging problems in bioinformatics. Homology-based approaches were traditionally used to predict protein function, but they fail when a new protein differs from previously characterized ones. Therefore, to alleviate the problems associated with traditional homology-based approaches, numerous computational intelligence techniques have been proposed in the recent past. This paper presents a state-of-the-art comprehensive review of various computational intelligence techniques for protein function predictions using sequence, structure, protein-protein interaction network, and gene expression data used in wide areas of applications such as prediction of DNA and RNA binding sites, subcellular localization, enzyme functions, signal peptides, catalytic residues, nuclear/G-protein coupled receptors, membrane proteins, and pathway analysis from gene expression datasets. This paper also summarizes the results obtained by many researchers to solve these problems by using computational intelligence techniques with appropriate datasets to improve the prediction performance. The summary shows that ensemble classifiers and integration of multiple heterogeneous data are useful for protein function prediction. PMID:25574395
DOE Office of Scientific and Technical Information (OSTI.GOV)
Struble, E. B., E-mail: evi.struble@nist.gov; Department of Biochemistry and Molecular Biology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205; Center for Advanced Research in Biotechnology/NIST, 9600 Gudelsky Drive, Rockville, MD 20850
2007-06-01
Crystallization and preliminary diffraction data of the N-terminal 19–139 fragment of the origin-binding domain of bacteriophage λ O replication initiator are reported. The bacteriophage λ O protein binds to the λ replication origin (oriλ) and serves as the primary replication initiator for the viral genome. The binding energy derived from the binding of O to oriλ is thought to help drive DNA opening to facilitate initiation of DNA replication. Detailed understanding of this process is severely limited by the lack of high-resolution structures of O protein or of any lambdoid phage-encoded paralogs either with or without DNA. The production of crystals of the origin-binding domain of λ O that diffract to 2.5 Å is reported. Anomalous dispersion methods will be used to solve this structure.
Hemond, Michael; Rothstein, Thomas L.; Wagner, Gerhard
2009-01-01
Summary Fas apoptosis inhibitory molecule (FAIM) is a soluble cytosolic protein inhibitor of programmed cell death and is found in organisms throughout the animal kingdom. A short isoform (FAIM-S) is expressed in all tissue types, while an alternatively spliced long isoform (FAIM-L) is specifically expressed in the brain. Here FAIM-S is shown to consist of two independently folding domains in contact with one another. The NMR solution structure of the C-terminal domain of murine FAIM is solved in isolation and revealed to be a novel protein fold, a noninterleaved seven-stranded beta sandwich. The structure and sequence reveal several residues that are likely to be involved in functionally significant interactions with the N-terminal domain or other binding partners. Chemical shift perturbation is used to elucidate contacts made between the N- and C-terminal domains. PMID:19168072
Modular architecture of eukaryotic RNase P and RNase MRP revealed by electron microscopy.
Hipp, Katharina; Galani, Kyriaki; Batisse, Claire; Prinz, Simone; Böttcher, Bettina
2012-04-01
Ribonuclease P (RNase P) and RNase MRP are closely related ribonucleoprotein enzymes, which process RNA substrates including tRNA precursors for RNase P and 5.8 S rRNA precursors, as well as some mRNAs, for RNase MRP. The structures of RNase P and RNase MRP have not yet been solved, so it is unclear how the proteins contribute to the structure of the complexes and how substrate specificity is determined. Using electron microscopy and image processing we show that eukaryotic RNase P and RNase MRP have a modular architecture, where proteins stabilize the RNA fold and contribute to cavities, channels and chambers between the modules. Such features are located at strategic positions for substrate recognition by shape and coordination of the cleaved-off sequence. These are also the sites of greatest difference between RNase P and RNase MRP, highlighting the importance of the adaptation of this region to the different substrates.
Unexpected fold in the circumsporozoite protein target of malaria vaccines
DOE Office of Scientific and Technical Information (OSTI.GOV)
Doud, Michael B.; Koksal, Adem C.; Mi, Li-Zhi
Circumsporozoite (CS) protein is the major surface component of Plasmodium falciparum sporozoites and is essential for host cell invasion. A vaccine containing tandem repeats, region III, and thrombospondin type-I repeat (TSR) of CS is efficacious in phase III trials but gives only a 35% reduction in severe malaria in the first year postimmunization. We solved crystal structures showing that region III and TSR fold into a single unit, an 'αTSR' domain. The αTSR domain possesses a hydrophobic pocket and core, missing in TSR domains. CS binds heparin, but αTSR does not. Interestingly, polymorphic T-cell epitopes map to specialized αTSR regions. The N and C termini are unexpectedly close, providing clues for sporozoite sheath organization. Elucidation of a unique structure of a domain within CS enables rational design of next-generation subunit vaccines and functional and medicinal chemical investigation of the conserved hydrophobic pocket.
Pegos, Vanessa R.; Santos, Rodrigo M. L.; Medrano, Francisco J.
2017-01-01
In Escherichia coli, the ATP-Binding Cassette transporter for phosphate is encoded by the pstSCAB operon. PstS is the periplasmic component responsible for affinity and specificity of the system and has also been related to a regulatory role and chemotaxis during depletion of phosphate. Xanthomonas citri has two phosphate-binding proteins: PstS and PhoX, which are differentially expressed under phosphate limitation. In this work, we focused on PhoX characterization and comparison with PstS. The PhoX three-dimensional structure was solved in a closed conformation with a phosphate engulfed in the binding site pocket between two domains. Comparison between PhoX and PstS revealed that they originated from gene duplication, but despite their similarities they show significant differences in the region that interacts with the permeases. PMID:28542513
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yanez, M.E.; Korotkov, K.V.; Abendroth, J.
2009-05-28
Type II secretion systems (T2SS) translocate virulence factors from the periplasmic space of many pathogenic bacteria into the extracellular environment. The T2SS of Vibrio cholerae and related species is called the extracellular protein secretion (Eps) system that consists of a core of multiple copies of 11 different proteins. The pseudopilins, EpsG, EpsH, EpsI, EpsJ and EpsK, are five T2SS proteins that are thought to assemble into a pseudopilus, which is assumed to interact with the outer membrane pore, and may actively participate in the export of proteins. We report here biochemical evidence that the minor pseudopilins EpsI and EpsJ from Vibrio species interact directly with one another. Moreover, the 2.3 Å resolution crystal structure of a complex of EpsI and EpsJ from Vibrio vulnificus represents the first atomic resolution structure of a complex of two different pseudopilin components from the T2SS. Both EpsI and EpsJ appear to be structural extremes within the family of type 4a pilin structures solved to date, with EpsI having the smallest, and EpsJ the largest, 'variable pilin segment' seen thus far. A high degree of sequence conservation in the EpsI:EpsJ interface indicates that this heterodimer occurs in the T2SS of a large number of bacteria. The arrangement of EpsI and EpsJ in the heterodimer would correspond to a right-handed helical character of proteins assembled into a pseudopilus.
Alvarez-Cabrera, Ana L.; Delgado, Sandra; Gil-Carton, David; Mortuza, Gulnahar B.; Montoya, Guillermo; Sorzano, Carlos O. S.; Tang, Tang K.; Carazo, Jose M.
2017-01-01
Centrosomal P4.1-associated protein (CPAP) is a cell cycle regulated protein fundamental for centrosome assembly and centriole elongation. In humans, the region between residues 897–1338 of CPAP mediates interactions with other proteins and includes a homodimerization domain. CPAP mutations cause primary autosomal recessive microcephaly and Seckel syndrome. Despite the biological and clinical relevance of CPAP, its mechanistic behavior remains unclear and its C-terminus (the G-box/TCP domain) is the only part whose structure has been solved. This situation is perhaps due in part to the challenges of obtaining the protein in a soluble, homogeneous state for structural studies. Our work constitutes a systematic structural analysis on multiple oligomers of HsCPAP897−1338, using single-particle electron microscopy (EM) of negatively stained (NS) samples. Based on image classification into clearly different regular 3D maps (putatively corresponding to dimers and tetramers) and direct observation of individual images representing other complexes of HsCPAP897−1338 (i.e., putative flexible monomers and higher-order multimers), we report a dynamic oligomeric behavior of this protein, where different homo-oligomers coexist in variable proportions. We propose that dimerization of the putative homodimer forms a putative tetramer which could be the structural unit for the scaffold that either tethers the pericentriolar material to centrioles or promotes procentriole elongation. A coarse fitting of atomic models into the NS 3D maps at resolutions around 20 Å is performed only to complement our experimental data, allowing us to hypothesize on the oligomeric composition of the different complexes. In this way, the current EM work represents an initial step toward the structural characterization of different oligomers of CPAP, suggesting further insights to understand how this protein works, contributing to the elucidation of control mechanisms for centriole biogenesis.
PMID:28396859
Yadav, Ravi P.; Gakhar, Lokesh; Yu, Liping
2017-01-01
FKBP-domain proteins (FKBPs) are pivotal modulators of cellular signaling, protein folding, and gene transcription. Aryl hydrocarbon receptor-interacting protein-like 1 (AIPL1) is a distinctive member of the FKBP superfamily in terms of its biochemical properties, and it plays an important biological role as a chaperone of phosphodiesterase 6 (PDE6), an effector enzyme of the visual transduction cascade. Malfunction of mutant AIPL1 proteins triggers a severe form of Leber congenital amaurosis and leads to blindness. The mechanism underlying the chaperone activity of AIPL1 is largely unknown, but involves the binding of isoprenyl groups on PDE6 to the FKBP domain of AIPL1. We solved the crystal structures of the AIPL1–FKBP domain and its pathogenic mutant V71F, both in the apo form and in complex with isoprenyl moieties. These structures reveal a module for lipid binding that is unparalleled within the FKBP superfamily. The prenyl binding is enabled by a unique “loop-out” conformation of the β4-α1 loop and a conformational “flip-out” switch of the key W72 residue. A second major conformation of apo AIPL1–FKBP was identified by NMR studies. This conformation, wherein W72 flips into the ligand-binding pocket and renders the protein incapable of prenyl binding, is supported by molecular dynamics simulations and appears to underlie the pathogenicity of the V71F mutant. Our findings offer critical insights into the mechanisms that underlie AIPL1 function in health and disease, and highlight the structural and functional diversity of the FKBPs. PMID:28739921
Structural and Biochemical Characterization of a Novel Aminopeptidase from Human Intestine
Tykvart, Jan; Bařinka, Cyril; Svoboda, Michal; ...
2015-03-09
N-acetylated α-linked acidic dipeptidase-like protein (NAALADase L), encoded by the NAALADL1 gene, is a close homolog of glutamate carboxypeptidase II, a metallopeptidase that has been intensively studied as a target for imaging and therapy of solid malignancies and neuropathologies. However, neither the physiological functions nor structural features of NAALADase L are known at present. In this paper, we report a thorough characterization of the protein product of the human NAALADL1 gene, including heterologous overexpression and purification, structural and biochemical characterization, and analysis of its expression profile. By solving the NAALADase L X-ray structure, we provide the first experimental evidence that it is a zinc-dependent metallopeptidase with a catalytic mechanism similar to that of glutamate carboxypeptidase II yet distinct substrate specificity. A proteome-based assay revealed that the NAALADL1 gene product possesses previously unrecognized aminopeptidase activity but no carboxy- or endopeptidase activity. These findings were corroborated by site-directed mutagenesis and identification of bestatin as a potent inhibitor of the enzyme. Analysis of NAALADL1 gene expression at both the mRNA and protein levels revealed the small intestine as the major site of protein expression and points toward extensive alternative splicing of the NAALADL1 gene transcript. Taken together, our data imply that the NAALADL1 gene product's primary physiological function is associated with the final stages of protein/peptide digestion and absorption in the human digestive system. Finally, based on these results, we suggest a new name for this enzyme: human ileal aminopeptidase (HILAP).
Conformation switching of AIM2 PYD domain revealed by NMR relaxation and MD simulation.
Wang, Haobo; Yang, Lijiang; Niu, Xiaogang
2016-04-29
Protein absent in melanoma 2 (AIM2) is a double-stranded DNA (dsDNA) sensor located mainly in the cytoplasm. It comprises an N-terminal PYD domain and a C-terminal HIN domain. When dsDNA, such as that from DNA viruses and bacteria, enters the cytoplasm, the HIN domain of AIM2 recognizes and binds the DNA, and the PYD domain binds the ASC protein, resulting in formation of the AIM2 inflammasome. Three AIM2 PYD domain structures have been solved, but each shows a unique conformation around the α3 helix region. To understand why different AIM2 PYD structures show different conformations in this region, we use NMR relaxation techniques to study the backbone dynamics of the mouse AIM2 PYD domain and perform molecular dynamics (MD) simulations on both mouse and human AIM2 PYD structures. Our results indicate that this region is highly flexible in both mouse and human AIM2 PYD domains, and that the PYD domain may exist as a conformational ensemble in solution. Different environments shift the populations of pre-existing conformational substates within the ensemble, which may explain why different AIM2 PYD structures were observed under different conditions. Further docking analysis reveals that the conformation switching may be important for the autoinhibition of the AIM2 protein. Copyright © 2016 Elsevier Inc. All rights reserved.
Functional diversity of the superfamily of K⁺ transporters to meet various requirements.
Diskowski, Marina; Mikusevic, Vedrana; Stock, Charlott; Hänelt, Inga
2015-09-01
The superfamily of K+ transporters unites proteins from plants, fungi, bacteria, and archaea that translocate K+ and/or Na+ across membranes. These proteins are key components in osmotic regulation, pH homeostasis, and resistance to high salinity and dryness. The members of the superfamily are closely related to K+ channels such as KcsA but also show several striking differences that are attributed to their altered functions. This review highlights these functional differences, focusing on the bacterial superfamily members KtrB, TrkH, and KdpA. The functional variations within the family and comparison to MPM-type K+ channels are discussed in light of the recently solved structures of the Ktr and Trk systems.
Problem-Solving Test: Analysis of DNA Damage Recognizing Proteins in Yeast and Human Cells
ERIC Educational Resources Information Center
Szeberenyi, Jozsef
2013-01-01
The experiment described in this test was aimed at identifying DNA repair proteins in human and yeast cells. Terms to be familiar with before you start to solve the test: DNA repair, germline mutation, somatic mutation, inherited disease, cancer, restriction endonuclease, radioactive labeling, [alpha-[superscript 32]P]ATP, [gamma-[superscript…
SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition
Melvin, Iain; Ie, Eugene; Kuang, Rui; Weston, Jason; Stafford, William Noble; Leslie, Christina
2007-01-01
Background Predicting a protein's structural class from its amino acid sequence is a fundamental problem in computational biology. Much recent work has focused on developing new representations for protein sequences, called string kernels, for use with support vector machine (SVM) classifiers. However, while some of these approaches exhibit state-of-the-art performance at the binary protein classification problem, i.e. discriminating between a particular protein class and all other classes, few of these studies have addressed the real problem of multi-class superfamily or fold recognition. Moreover, there are only limited software tools and systems for SVM-based protein classification available to the bioinformatics community. Results We present a new multi-class SVM-based protein fold and superfamily recognition system and web server called SVM-Fold, which can be found at . Our system uses an efficient implementation of a state-of-the-art string kernel for sequence profiles, called the profile kernel, where the underlying feature representation is a histogram of inexact matching k-mer frequencies. We also employ a novel machine learning approach to solve the difficult multi-class problem of classifying a sequence of amino acids into one of many known protein structural classes. Binary one-vs-the-rest SVM classifiers that are trained to recognize individual structural classes yield prediction scores that are not comparable, so that standard "one-vs-all" classification fails to perform well. Moreover, SVMs for classes at different levels of the protein structural hierarchy may make useful predictions, but one-vs-all does not try to combine these multiple predictions. To deal with these problems, our method learns relative weights between one-vs-the-rest classifiers and encodes information about the protein structural hierarchy for multi-class prediction. 
In large-scale benchmark results based on the SCOP database, our code weighting approach significantly improves on the standard one-vs-all method for both the superfamily and fold prediction in the remote homology setting and on the fold recognition problem. Moreover, our code weight learning algorithm strongly outperforms nearest-neighbor methods based on PSI-BLAST in terms of prediction accuracy on every structure classification problem we consider. Conclusion By combining state-of-the-art SVM kernel methods with a novel multi-class algorithm, the SVM-Fold system delivers efficient and accurate protein fold and superfamily recognition. PMID:17570145
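The idea of learning relative weights between one-vs-the-rest classifiers can be illustrated with a minimal sketch. It assumes each binary classifier has already produced a raw score per sequence, and it uses a simple perceptron-style update to rescale those scores. The update rule, the function names and the toy scores are illustrative assumptions, not the algorithm or data of SVM-Fold itself.

```python
def predict(scores, weights):
    """Return the index of the class with the largest weighted score."""
    weighted = [w * s for w, s in zip(weights, scores)]
    return max(range(len(weighted)), key=weighted.__getitem__)


def learn_weights(score_rows, labels, epochs=20, step=0.1):
    """Learn one scaling weight per one-vs-rest classifier.

    Perceptron-style update (an assumption for illustration): on a
    mistake, raise the weight of the true class and lower the weight
    of the wrongly predicted class, in proportion to their raw scores.
    """
    weights = [1.0] * len(score_rows[0])
    for _ in range(epochs):
        for scores, true_class in zip(score_rows, labels):
            guess = predict(scores, weights)
            if guess != true_class:
                weights[true_class] += step * scores[true_class]
                weights[guess] -= step * scores[guess]
    return weights


# Hypothetical raw scores from two binary classifiers whose outputs
# live on different scales, so a plain argmax misclassifies row 2.
toy_scores = [(2.0, 0.5), (1.0, 0.8), (3.0, 0.2), (0.5, 0.9)]
toy_labels = [0, 1, 0, 1]

unweighted = [predict(s, [1.0, 1.0]) for s in toy_scores]
learned = learn_weights(toy_scores, toy_labels)
reweighted = [predict(s, learned) for s in toy_scores]
```

On this toy set the plain argmax gets the second example wrong because the first classifier's scores are systematically larger, while the learned weights correct the scale mismatch.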
Revisiting the Roco G-protein cycle.
Terheyden, Susanne; Ho, Franz Y; Gilsbach, Bernd K; Wittinghofer, Alfred; Kortholt, Arjan
2015-01-01
Mutations in leucine-rich-repeat kinase 2 (LRRK2) are the most frequent cause of late-onset Parkinson's disease (PD). LRRK2 belongs to the Roco family of proteins, which share a conserved Ras-like G-domain (Roc) and a C-terminal of Roc (COR) domain tandem. The nucleotide state of small G-proteins is strictly controlled by guanine-nucleotide-exchange factors (GEFs) and GTPase-activating proteins (GAPs). Because of contradictory structural and biochemical data, the regulatory mechanism of the LRRK2 Roc G-domain and the RocCOR tandem is still under debate. In the present study, we solved the first nucleotide-bound Roc structure and used LRRK2 and bacterial Roco proteins to characterize the RocCOR function in more detail. Nucleotide binding induces a drastic structural change in the Roc/COR domain interface, a region strongly implicated in patients with an LRRK2 mutation. Our data confirm previous assumptions that the C-terminal subdomain of COR functions as a dimerization device. We show that dimer formation is independent of nucleotide. The affinity for GDP/GTP is in the micromolar range, which results in high dissociation rates in the s⁻¹ range. Thus Roco proteins are unlikely to need GEFs to achieve activation. Monomeric LRRK2 and Roco G-domains have a low GTPase activity similar to that of small G-proteins. We show that GTPase activity in bacterial Roco is stimulated by the nucleotide-dependent dimerization of the G-domain within the complex. We thus propose that the Roco proteins do not require GAPs to stimulate GTP hydrolysis but stimulate each other by one monomer completing the catalytic machinery of the other.
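The link between micromolar affinity and dissociation rates on the order of 1 s⁻¹ follows from the standard relation K_d = k_off / k_on. The sketch below works through the arithmetic. The association rate constant of 10⁶ M⁻¹ s⁻¹ is an assumed, typical order of magnitude and is not a value reported in the abstract.

```python
import math


def dissociation_rate(kd_molar, k_on_per_molar_per_s):
    """k_off = K_d * k_on, from the definition K_d = k_off / k_on."""
    return kd_molar * k_on_per_molar_per_s


def bound_half_life(k_off_per_s):
    """Half-life of the nucleotide-bound state for first-order dissociation."""
    return math.log(2) / k_off_per_s


# Assumed inputs (illustrative only): K_d = 1 micromolar, k_on = 1e6 M^-1 s^-1.
k_off = dissociation_rate(1e-6, 1e6)   # about 1 per second
t_half = bound_half_life(k_off)        # about 0.69 seconds
```

With these assumed numbers the nucleotide would leave within roughly a second without assistance, which is consistent with the argument that an exchange factor is unlikely to be needed.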
2013-01-01
Background The widespread protozoan parasite Toxoplasma gondii interferes with host cell functions by exporting the contents of a unique apical organelle, the rhoptry. Among the mix of secreted proteins are an expanded, lineage-specific family of protein kinases termed rhoptry kinases (ROPKs), several of which have been shown to be key virulence factors, including the pseudokinase ROP5. The extent and details of the diversification of this protein family are poorly understood. Results In this study, we comprehensively catalogued the ROPK family in the genomes of Toxoplasma gondii, Neospora caninum and Eimeria tenella, as well as portions of the unfinished genome of Sarcocystis neurona, and classified the identified genes into 42 distinct subfamilies. We systematically compared the rhoptry kinase protein sequences and structures to each other and to the broader superfamily of eukaryotic protein kinases to study the patterns of diversification and neofunctionalization in the ROPK family and its subfamilies. We identified three ROPK sub-clades of particular interest: those bearing a structurally conserved N-terminal extension to the kinase domain (NTE), an E. tenella-specific expansion, and a basal cluster including ROP35 and BPK1 that we term ROPKL. Structural analysis in light of the solved structures ROP2, ROP5, ROP8 and in comparison to typical eukaryotic protein kinases revealed ROPK-specific conservation patterns in two key regions of the kinase domain, surrounding a ROPK-conserved insert in the kinase hinge region and a disulfide bridge in the kinase substrate-binding lobe. We also examined conservation patterns specific to the NTE-bearing clade. We discuss the possible functional consequences of each. Conclusions Our work sheds light on several important but previously unrecognized features shared among rhoptry kinases, as well as the essential differences between active and degenerate protein kinases. 
We identify the most distinctive ROPK-specific features conserved across both active kinases and pseudokinases, and discuss these in terms of sequence motifs, evolutionary context, structural impact and potential functional relevance. By characterizing the proteins that enable these parasites to invade the host cell and co-opt its signaling mechanisms, we provide guidance on potential therapeutic targets for the diseases caused by coccidian parasites. PMID:23742205
Platania, Chiara Bianca Maria; Salomone, Salvatore; Leggio, Gian Marco; Drago, Filippo; Bucolo, Claudio
2012-01-01
Dopamine (DA) receptors, a class of G-protein coupled receptors (GPCRs), have been targeted for drug development for the treatment of neurological, psychiatric and ocular disorders. The lack of structural information about GPCRs and their ligand complexes has prompted the development of homology models of these proteins aimed at structure-based drug design. The crystal structure of the human dopamine D3 (hD3) receptor was recently solved. Based on the hD3 receptor crystal structure, we generated dopamine D2 and D3 receptor models and refined them with a molecular dynamics (MD) protocol. The refined structures, obtained from MD simulations in a membrane environment, were subsequently used in molecular docking studies to investigate potential sites of interaction. The structures of the hD3 and hD2L receptors were differentiated by means of MD simulations, and D3-selective ligands were discriminated, in terms of binding energy, by docking calculations. A robust correlation between computed and experimental Ki was obtained for hD3 and hD2L receptor ligands. In conclusion, the present computational approach seems suitable to build and refine structure models of homologous dopamine receptors that may be of value for structure-based drug discovery of selective dopaminergic ligands. PMID:22970199
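Comparing docking binding energies with experimental Ki values usually relies on the standard thermodynamic relation ΔG = RT ln(Ki). The sketch below shows that conversion in both directions. The temperature of 298.15 K and the function names are assumptions for illustration; the abstract does not state how the authors performed the comparison.

```python
import math

GAS_CONSTANT_KCAL = 1.987204e-3  # kcal / (mol * K)


def ki_to_binding_energy(ki_molar, temperature_k=298.15):
    """Binding free energy in kcal/mol from an inhibition constant in mol/L."""
    return GAS_CONSTANT_KCAL * temperature_k * math.log(ki_molar)


def binding_energy_to_ki(delta_g_kcal, temperature_k=298.15):
    """Inverse conversion: predicted Ki in mol/L from a binding energy."""
    return math.exp(delta_g_kcal / (GAS_CONSTANT_KCAL * temperature_k))


# A 1 nM ligand corresponds to roughly -12.3 kcal/mol at 298.15 K.
dg_nanomolar = ki_to_binding_energy(1e-9)
ki_roundtrip = binding_energy_to_ki(dg_nanomolar)
```

Because the relation is logarithmic, a tenfold change in Ki shifts the binding energy by only about 1.4 kcal/mol at room temperature, which is why modest scoring errors translate into large errors in predicted affinity.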
Zhou, P; Huang, J; Tian, F
2012-01-01
Specific noncovalent interactions that are indicative of attractive, directional intermolecular forces have always been of key interest to medicinal chemists in their search for the "glue" that holds drugs and their targets together. With the rapid increase in the number of solved biomolecular structures, as well as the improved performance of computer hardware and software in recent years, it is now possible to gain more comprehensive insight into the geometrical characteristics and energetic landscape of certain sophisticated noncovalent interactions present at the binding interface of protein receptors and small ligands, based on accumulated knowledge gained from the combination of two quite disparate but complementary approaches: crystallographic data analysis and quantum-mechanical ab initio calculation. In this perspective, we survey the large body of published work on the structural characterization and theoretical investigation of three kinds of strong, specific, direct, enthalpy-driven intermolecular forces (the hydrogen bond, halogen bond and salt bridge) involved in the formation of protein-ligand complex architecture, in order to characterize their biological functions in conferring affinity and specificity for ligand recognition by the host protein. In particular, the biomedical implications of this knowledge are discussed with respect to potential applications in rational drug design.
Yung, Yuk-Lin; Cheung, Ming-Yan; Miao, Rui; Fong, Yu-Hang; Li, Kwan-Pok; Yu, Mei-Hui; Chye, Mee-Len; Wong, Kam-Bo; Lam, Hon-Ming
2015-09-25
The C2 domain is one of the most diverse phospholipid-binding domains mediating cellular signaling. One group of C2-domain proteins is plant-specific and is characterized by small size and simple structure. We have previously reported that a member of this group, OsGAP1, is able to alleviate salt stress and stimulate defense responses, and binds to both phospholipids and an unconventional G-protein, OsYchF1. Here we solved the crystal structure of OsGAP1 to a resolution of 1.63 Å. Using site-directed mutagenesis, we successfully differentiated between the clusters of surface residues that are required for binding to phospholipids versus OsYchF1, which, in turn, is critical for its role in stimulating defense responses. On the other hand, the ability of OsGAP1 to alleviate salt stress depends only on its ability to bind OsYchF1 and is independent of its phospholipid-binding activity. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.