protein structure refinement: Topics by Science.gov

Sample records for protein structure refinement

Improved cryoEM-Guided Iterative Molecular Dynamics–Rosetta Protein Structure Refinement Protocol for High Precision Protein Structure Prediction

PubMed Central

2016-01-01

Many excellent methods exist that incorporate cryo-electron microscopy (cryoEM) data to constrain computational protein structure prediction and refinement. Previously, it was shown that iteration of two such orthogonal sampling and scoring methods – Rosetta and molecular dynamics (MD) simulations – facilitated exploration of conformational space in principle. Here, we go beyond a proof-of-concept study and address significant remaining limitations of the iterative MD–Rosetta protein structure refinement protocol. Specifically, all parts of the iterative refinement protocol are now guided by medium-resolution cryoEM density maps, and previous knowledge about the native structure of the protein is no longer necessary. Models are identified solely based on score or simulation time. All four benchmark proteins showed substantial improvement through three rounds of the iterative refinement protocol. The best-scoring final models of two proteins had sub-Ångstrom RMSD to the native structure over residues in secondary structure elements. Molecular dynamics was most efficient in refining secondary structure elements and was thus highly complementary to the Rosetta refinement which is most powerful in refining side chains and loop regions. PMID:25883538
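The "sub-Ångstrom RMSD" figure quoted in this abstract is a root-mean-square deviation over matched atoms. As a minimal sketch, assuming the model and native coordinates are already superimposed and listed in the same atom order (the function name and input layout are illustrative, not taken from the paper), it can be computed as:

```python
import math

def rmsd(model, native):
    """Root-mean-square deviation between two pre-superimposed coordinate sets.

    model, native: sequences of (x, y, z) tuples in the same atom order,
    for example the C-alpha atoms of residues in secondary structure elements.
    """
    if len(model) != len(native) or not model:
        raise ValueError("coordinate sets must be non-empty and of equal length")
    # Sum of squared per-atom displacements over all three axes.
    squared = sum(
        (a - b) ** 2
        for p, q in zip(model, native)
        for a, b in zip(p, q)
    )
    return math.sqrt(squared / len(model))

if __name__ == "__main__":
    model = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    native = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
    print(rmsd(model, native))
```

In practice an optimal rigid-body superposition would normally be applied first; that step is left out here to keep the sketch short.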
Structure Refinement of Protein Low Resolution Models Using the GNEIMO Constrained Dynamics Method

PubMed Central

Park, In-Hee; Gangupomu, Vamshi; Wagner, Jeffrey; Jain, Abhinandan; Vaidehi, Nagarajan

2012-01-01

The challenge in protein structure prediction using homology modeling is the lack of reliable methods to refine low-resolution homology models. Unconstrained all-atom molecular dynamics (MD) does not serve well for structure refinement because of its limited conformational search. We have developed and tested a constrained MD method, based on the Generalized Newton-Euler Inverse Mass Operator (GNEIMO) algorithm, for protein structure refinement. In this method, the high-frequency degrees of freedom are replaced with hard holonomic constraints and a protein is modeled as a collection of rigid body clusters connected by flexible torsional hinges. This allows larger integration time steps and enhances the conformational search. In this work, we have demonstrated the use of a constraint free GNEIMO method for protein structure refinement that starts from low-resolution decoy sets derived from homology methods. For eight proteins with three decoys each, we observed an improvement of ~2 Å in the RMSD to the known experimental structures of these proteins. The GNEIMO method also showed enrichment in the population density of native-like conformations. In addition, we demonstrated structural refinement using a “Freeze and Thaw” clustering scheme within the GNEIMO framework as a viable tool for enhancing localized conformational search. We have derived a robust protocol based on the GNEIMO replica exchange method for protein structure refinement that can be readily extended to other proteins and may be applicable to high-throughput protein structure refinement. PMID:22260550
Implementation of a Parallel Protein Structure Alignment Service on Cloud

PubMed Central

Hung, Che-Lun; Lin, Yaw-Ling

2013-01-01

Protein structure alignment has become an important strategy by which to identify evolutionary relationships between protein sequences. Several alignment tools are currently available for online comparison of protein structures. In this paper, we propose a parallel protein structure alignment service based on the Hadoop distribution framework. This service includes a protein structure alignment algorithm, a refinement algorithm, and a MapReduce programming model. The refinement algorithm refines the result of alignment. To process vast numbers of protein structures in parallel, the alignment and refinement algorithms are implemented using MapReduce. We analyzed and compared the structure alignments produced by different methods using a dataset randomly selected from the PDB database. The experimental results verify that the proposed algorithm refines the resulting alignments more accurately than existing algorithms. Meanwhile, the computational performance of the proposed service is proportional to the number of processors used in our cloud platform. PMID:23671842
GalaxyRefineComplex: Refinement of protein-protein complex model structures driven by interface repacking.

PubMed

Heo, Lim; Lee, Hasup; Seok, Chaok

2016-08-18

Protein-protein docking methods have been widely used to gain an atomic-level understanding of protein interactions. However, docking methods that employ low-resolution energy functions are popular because of computational efficiency. Low-resolution docking tends to generate protein complex structures that are not fully optimized. GalaxyRefineComplex takes such low-resolution docking structures and refines them to improve model accuracy in terms of both interface contact and inter-protein orientation. This refinement method allows flexibility at the protein interface and in the overall docking structure to capture conformational changes that occur upon binding. Symmetric refinement is also provided for symmetric homo-complexes. This method was validated by refining models produced by available docking programs, including ZDOCK and M-ZDOCK, and was successfully applied to CAPRI targets in a blind fashion. An example of using the refinement method with an existing docking method for ligand binding mode prediction of a drug target is also presented. A web server that implements the method is freely available at http://galaxy.seoklab.org/refinecomplex.
i3Drefine Software for Protein 3D Structure Refinement and Its Assessment in CASP10

PubMed Central

Bhattacharya, Debswapna; Cheng, Jianlin

2013-01-01

Protein structure refinement refers to the process of improving the quality of protein structures during structure modeling to bring them closer to their native states. Structure refinement has been drawing increasing attention in the community-wide Critical Assessment of techniques for Protein Structure prediction (CASP) experiments since its addition in the 8th CASP experiment. During the 9th and recently concluded 10th CASP experiments, consistent growth in the number of refinement targets and participating groups has been witnessed. Yet protein structure refinement remains a largely unsolved problem, with the majority of participating groups in the CASP refinement category failing to consistently improve the quality of the structures issued for refinement. To address this need, we developed a completely automated and computationally efficient protein 3D structure refinement method, i3Drefine, based on an iterative and highly convergent energy minimization algorithm with a powerful all-atom composite physics- and knowledge-based force field and a hydrogen bonding (HB) network optimization technique. In the recent community-wide blind experiment, CASP10, i3Drefine (as ‘MULTICOM-CONSTRUCT’) was ranked as the best method in the server section according to the official CASP10 assessment. Here we provide the community with free access to the i3Drefine software, systematically analyse its performance in strict blind mode on the refinement targets issued in the CASP10 refinement category, and compare it with other state-of-the-art refinement methods participating in CASP10. Our analysis demonstrates that i3Drefine is the only fully automated server participating in CASP10 that exhibits consistent improvement over the initial structures in both global and local structural quality metrics. An executable version of i3Drefine is freely available at http://protein.rnet.missouri.edu/i3drefine/. PMID:23894517
NMRe: a web server for NMR protein structure refinement with high-quality structure validation scores.

PubMed

Ryu, Hyojung; Lim, GyuTae; Sung, Bong Hyun; Lee, Jinhyuk

2016-02-15

Protein structure refinement is a necessary step for the study of protein function. In particular, some nuclear magnetic resonance (NMR) structures are of lower quality than X-ray crystallographic structures. Here, we present NMRe, a web-based server for NMR structure refinement. The previously developed knowledge-based energy function STAP (Statistical Torsion Angle Potential) was used for NMRe refinement. With STAP, NMRe provides two refinement protocols using two types of distance restraints. If a user provides NOE (Nuclear Overhauser Effect) data, the refinement is performed with the NOE distance restraints as a conventional NMR structure refinement. Additionally, NMRe generates NOE-like distance restraints based on the inter-hydrogen distances derived from the input structure. The efficiency of NMRe refinement was validated on 20 NMR structures. Most of the quality assessment scores of the refined NMR structures were better than those of the original structures. The refinement results are provided as a three-dimensional structure view, a secondary structure scheme, and numerical and graphical structure validation scores. NMRe is available at http://psb.kobic.re.kr/nmre/. © The Author 2015. Published by Oxford University Press.
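The idea of deriving NOE-like distance restraints from inter-hydrogen distances in the input structure can be sketched as follows. This is only an illustration under stated assumptions: the 5.0 Å cutoff, the ±0.5 Å tolerance, the function name and the tuple layout are all hypothetical choices, and the abstract does not say how NMRe actually builds its restraints.

```python
import math
from itertools import combinations

def noe_like_restraints(hydrogens, cutoff=5.0, tolerance=0.5):
    """Build distance restraints from hydrogen pairs that are close in the input model.

    hydrogens: list of (atom_label, (x, y, z)) entries.
    Returns (label_a, label_b, lower_bound, upper_bound) tuples in angstroms.
    cutoff and tolerance are assumed values, not NMRe parameters.
    """
    restraints = []
    for (label_a, xyz_a), (label_b, xyz_b) in combinations(hydrogens, 2):
        distance = math.dist(xyz_a, xyz_b)
        if distance <= cutoff:
            # Keep the observed distance inside a small window around its current value.
            lower = max(0.0, distance - tolerance)
            upper = distance + tolerance
            restraints.append((label_a, label_b, lower, upper))
    return restraints

if __name__ == "__main__":
    atoms = [
        ("A:10:HA", (0.0, 0.0, 0.0)),
        ("A:14:HB2", (3.0, 0.0, 0.0)),
        ("A:40:HG3", (10.0, 0.0, 0.0)),
    ]
    for restraint in noe_like_restraints(atoms):
        print(restraint)
```

Only the first two atoms are within the assumed cutoff of each other, so a single restraint is produced for this toy input.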
Structure refinement of membrane proteins via molecular dynamics simulations.

PubMed

Dutagaci, Bercem; Heo, Lim; Feig, Michael

2018-07-01

A refinement protocol based on physics-based techniques established for water soluble proteins is tested for membrane protein structures. Initial structures were generated by homology modeling and sampled via molecular dynamics simulations in explicit lipid bilayer and aqueous solvent systems. Snapshots from the simulations were selected based on scoring with either knowledge-based or implicit membrane-based scoring functions and averaged to obtain refined models. The protocol resulted in consistent and significant refinement of the membrane protein structures similar to the performance of refinement methods for soluble proteins. Refinement success was similar between sampling in the presence of lipid bilayers and aqueous solvent but the presence of lipid bilayers may benefit the improvement of lipid-facing residues. Scoring with knowledge-based functions (DFIRE and RWplus) was found to be as good as scoring using implicit membrane-based scoring functions suggesting that differences in internal packing is more important than orientations relative to the membrane during the refinement of membrane protein homology models. © 2018 Wiley Periodicals, Inc.
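The "select snapshots by score, then average" step described in this abstract can be sketched as below. It is a minimal illustration, assuming that lower scores are better, that all snapshots are already superimposed on a common frame, and that a fixed fraction of the best-scoring frames is kept; the function name and the `fraction` parameter are hypothetical and not taken from the paper.

```python
def average_top_snapshots(snapshots, scores, fraction=0.25):
    """Average the coordinates of the best-scoring MD snapshots.

    snapshots: list of frames, each a list of (x, y, z) tuples with identical atom order.
    scores: one score per frame; lower is assumed to be better.
    fraction: assumed share of frames to keep (at least one frame is always kept).
    """
    if not snapshots or len(snapshots) != len(scores):
        raise ValueError("need one score per snapshot and at least one snapshot")
    n_keep = max(1, int(len(snapshots) * fraction))
    # Indices of the n_keep frames with the lowest scores.
    selected = sorted(range(len(scores)), key=lambda i: scores[i])[:n_keep]
    n_atoms = len(snapshots[0])
    averaged = []
    for atom in range(n_atoms):
        averaged.append(tuple(
            sum(snapshots[i][atom][axis] for i in selected) / n_keep
            for axis in range(3)
        ))
    return averaged

if __name__ == "__main__":
    frames = [[(0.0, 0.0, 0.0)], [(2.0, 0.0, 0.0)], [(10.0, 0.0, 0.0)], [(4.0, 0.0, 0.0)]]
    frame_scores = [1.0, 2.0, 0.5, 3.0]
    print(average_top_snapshots(frames, frame_scores, fraction=0.5))
```

Averaged coordinates can have distorted local geometry, so a sketch like this would usually be followed by some form of structural clean-up before the model is used.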
Template-based modeling and ab initio refinement of protein oligomer structures using GALAXY in CAPRI round 30.

PubMed

Lee, Hasup; Baek, Minkyung; Lee, Gyu Rie; Park, Sangwoo; Seok, Chaok

2017-03-01

Many proteins function as homo- or hetero-oligomers; therefore, attempts to understand and regulate protein functions require knowledge of protein oligomer structures. The number of available experimental protein structures is increasing, and oligomer structures can be predicted using the experimental structures of related proteins as templates. However, template-based models may have errors due to sequence differences between the target and template proteins, which can lead to functional differences. Such structural differences may be predicted by loop modeling of local regions or refinement of the overall structure. In CAPRI (Critical Assessment of PRotein Interactions) round 30, we used recently developed features of the GALAXY protein modeling package, including template-based structure prediction, loop modeling, model refinement, and protein-protein docking to predict protein complex structures from amino acid sequences. Out of the 25 CAPRI targets, medium and acceptable quality models were obtained for 14 and 1 target(s), respectively, for which proper oligomer or monomer templates could be detected. Symmetric interface loop modeling on oligomer model structures successfully improved model quality, while loop modeling on monomer model structures failed. Overall refinement of the predicted oligomer structures consistently improved the model quality, in particular in interface contacts. Proteins 2017; 85:399-407. © 2016 Wiley Periodicals, Inc.
KoBaMIN: a knowledge-based minimization web server for protein structure refinement.

PubMed

Rodrigues, João P G L M; Levitt, Michael; Chopra, Gaurav

2012-07-01

The KoBaMIN web server provides an online interface to a simple, consistent and computationally efficient protein structure refinement protocol based on minimization of a knowledge-based potential of mean force. The server can be used to refine either a single protein structure or an ensemble of proteins starting from their unrefined coordinates in PDB format. The refinement method is particularly fast and accurate due to the underlying knowledge-based potential derived from structures deposited in the PDB; as such, the energy function implicitly includes the effects of solvent and the crystal environment. Our server allows for an optional but recommended step that optimizes stereochemistry using the MESHI software. The KoBaMIN server also allows comparison of the refined structures with a provided reference structure to assess the changes brought about by the refinement protocol. The performance of KoBaMIN has been benchmarked widely on a large set of decoys, all models generated at the seventh worldwide experiments on critical assessment of techniques for protein structure prediction (CASP7) and it was also shown to produce top-ranking predictions in the refinement category at both CASP8 and CASP9, yielding consistently good results across a broad range of model quality values. The web server is fully functional and freely available at http://csb.stanford.edu/kobamin.
3Drefine: an interactive web server for efficient protein structure refinement

PubMed Central

Bhattacharya, Debswapna; Nowotny, Jackson; Cao, Renzhi; Cheng, Jianlin

2016-01-01

3Drefine is an interactive web server for consistent and computationally efficient protein structure refinement with the capability to perform web-based statistical and visual analysis. The 3Drefine refinement protocol utilizes iterative optimization of the hydrogen bonding network combined with atomic-level energy minimization on the optimized model using a composite physics- and knowledge-based force field for efficient protein structure refinement. The method has been extensively evaluated in blind CASP experiments as well as on large-scale and diverse benchmark datasets and exhibits consistent improvement over the initial structure in both global and local structural quality measures. The 3Drefine web server allows for convenient protein structure refinement through text or file input submission, email notification, and a provided example submission, and is freely available without any registration requirement. The server also provides comprehensive analysis of submissions through various energy and statistical feedback and interactive visualization of multiple refined models through the JSmol applet, which is equipped with numerous protein model analysis tools. The web server has been extensively tested and used by many users. As a result, the 3Drefine web server conveniently provides a useful tool easily accessible to the community. The 3Drefine web server has been made publicly available at the URL: http://sysbio.rnet.missouri.edu/3Drefine/. PMID:27131371
3Drefine: an interactive web server for efficient protein structure refinement.

PubMed

Bhattacharya, Debswapna; Nowotny, Jackson; Cao, Renzhi; Cheng, Jianlin

2016-07-08

3Drefine is an interactive web server for consistent and computationally efficient protein structure refinement with the capability to perform web-based statistical and visual analysis. The 3Drefine refinement protocol utilizes iterative optimization of the hydrogen bonding network combined with atomic-level energy minimization on the optimized model using a composite physics- and knowledge-based force field for efficient protein structure refinement. The method has been extensively evaluated in blind CASP experiments as well as on large-scale and diverse benchmark datasets and exhibits consistent improvement over the initial structure in both global and local structural quality measures. The 3Drefine web server allows for convenient protein structure refinement through text or file input submission, email notification, and a provided example submission, and is freely available without any registration requirement. The server also provides comprehensive analysis of submissions through various energy and statistical feedback and interactive visualization of multiple refined models through the JSmol applet, which is equipped with numerous protein model analysis tools. The web server has been extensively tested and used by many users. As a result, the 3Drefine web server conveniently provides a useful tool easily accessible to the community. The 3Drefine web server has been made publicly available at the URL: http://sysbio.rnet.missouri.edu/3Drefine/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
SFESA: a web server for pairwise alignment refinement by secondary structure shifts.

PubMed

Tong, Jing; Pei, Jimin; Grishin, Nick V

2015-09-03

Protein sequence alignment is essential for a variety of tasks such as homology modeling and active site prediction. Alignment errors remain the main cause of low-quality structure models. A bioinformatics tool to refine alignments is needed to make protein alignments more accurate. We developed the SFESA web server to refine pairwise protein sequence alignments. Compared to the previous version of SFESA, which required a set of 3D coordinates for a protein, the new server will search a sequence database for the closest homolog with an available 3D structure to be used as a template. For each alignment block defined by secondary structure elements in the template, SFESA evaluates alignment variants generated by local shifts and selects the best-scoring alignment variant. A scoring function that combines the sequence score of profile-profile comparison and the structure score of template-derived contact energy is used for evaluation of alignments. PROMALS pairwise alignments refined by SFESA are more accurate than those produced by current advanced alignment methods such as HHpred and CNFpred. In addition, SFESA also improves alignments generated by other software. SFESA is a web-based tool for alignment refinement, designed for researchers to compute, refine, and evaluate pairwise alignments with a combined sequence and structure scoring of alignment blocks. To our knowledge, the SFESA web server is the only tool that refines alignments by evaluating local shifts of secondary structure elements. The SFESA web server is available at http://prodata.swmed.edu/sfesa.
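The "evaluate local shifts of an alignment block and keep the best-scoring variant" idea in this abstract can be sketched as a small search loop. Everything concrete here is an assumption for illustration: the two scoring callbacks, the equal weighting, the shift range and the function name are hypothetical, and SFESA's real scoring function (profile-profile sequence score plus template-derived contact energy) is not reproduced.

```python
def best_block_shift(block_start, block_len, query_len,
                     seq_score, struct_score, max_shift=4, weight=0.5):
    """Pick the shift of one alignment block that maximizes a combined score.

    block_start: query index currently aligned to the first template position of the block.
    block_len: number of positions in the block (one secondary structure element).
    seq_score, struct_score: callables taking a candidate query start index and
        returning a score where higher is better (hypothetical stand-ins).
    Returns (shift, combined_score), or None if no shift fits inside the query.
    """
    best = None
    for shift in range(-max_shift, max_shift + 1):
        start = block_start + shift
        # Skip variants that would run off either end of the query sequence.
        if start < 0 or start + block_len > query_len:
            continue
        combined = weight * seq_score(start) + (1.0 - weight) * struct_score(start)
        if best is None or combined > best[1]:
            best = (shift, combined)
    return best

if __name__ == "__main__":
    # Toy scores: the sequence term peaks when the block starts at query index 12.
    print(best_block_shift(10, 5, 100,
                           seq_score=lambda s: -abs(s - 12),
                           struct_score=lambda s: 0))
```

Treating each block independently keeps the search small, since only `2 * max_shift + 1` variants are scored per secondary structure element.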
Charge-density analysis of a protein structure at subatomic resolution: the human aldose reductase case.

PubMed

Guillot, Benoît; Jelsch, Christian; Podjarny, Alberto; Lecomte, Claude

2008-05-01

The valence electron density of the protein human aldose reductase was analyzed at 0.66 angstroms resolution. The methodological developments in the software MoPro to adapt standard charge-density techniques from small molecules to macromolecular structures are described. The deformation electron density visible in initial residual Fourier difference maps was significantly enhanced after high-order refinement. The protein structure was refined after transfer of the experimental library multipolar atom model (ELMAM). The effects on the crystallographic statistics, on the atomic thermal displacement parameters and on the structure stereochemistry are analyzed. Constrained refinements of the transferred valence populations Pval and multipoles Plm were performed against the X-ray diffraction data on a selected substructure of the protein with low thermal motion. The resulting charge densities are of good quality, especially for chemical groups with many copies present in the polypeptide chain. To check the effect of the starting point on the result of the constrained multipolar refinement, the same charge-density refinement strategy was applied but using an initial neutral spherical atom model, i.e. without transfer from the ELMAM library. The best starting point for a protein multipolar refinement is the structure with the electron density transferred from the database. This can be assessed by the crystallographic statistical indices, including Rfree, and the quality of the static deformation electron-density maps, notably on the oxygen electron lone pairs. The analysis of the main-chain bond lengths suggests that stereochemical dictionaries would benefit from a revision based on recently determined unrestrained atomic resolution protein structures.
Princeton_TIGRESS 2.0: High refinement consistency and net gains through support vector machines and molecular dynamics in double-blind predictions during the CASP11 experiment.

PubMed

Khoury, George A; Smadbeck, James; Kieslich, Chris A; Koskosidis, Alexandra J; Guzman, Yannis A; Tamamis, Phanourios; Floudas, Christodoulos A

2017-06-01

Protein structure refinement is the challenging problem of operating on any protein structure prediction to improve its accuracy with respect to the native structure in a blind fashion. Although many approaches have been developed and tested during the last four CASP experiments, a majority of the methods continue to degrade models rather than improve them. Princeton_TIGRESS (Khoury et al., Proteins 2014;82:794-814) was developed previously and utilizes separate sampling and selection stages involving Monte Carlo and molecular dynamics simulations and classification using an SVM predictor. The initial implementation was shown to consistently refine protein structures 76% of the time in our own internal benchmarking on CASP 7-10 targets. In this work, we improved the sampling and selection stages and tested the method in blind predictions during CASP11. We added a decomposition of physics-based and hybrid energy functions, as well as a coordinate-free representation of the protein structure through distance-binning Cα-Cα distances to capture fine-grained movements. We performed parameter estimation to optimize the adjustable SVM parameters to maximize precision while balancing sensitivity and specificity across all cross-validated data sets, finding enrichment in our ability to select models from the populations of similar decoys generated for targets in CASPs 7-10. The MD stage was enhanced such that larger structures could be further refined. Among refinement methods that are currently implemented as web-servers, Princeton_TIGRESS 2.0 demonstrated the most consistent and most substantial net refinement in blind predictions during CASP11. The enhanced refinement protocol Princeton_TIGRESS 2.0 is freely available as a web server at http://atlas.engr.tamu.edu/refinement/. Proteins 2017; 85:1078-1098. © 2017 Wiley Periodicals, Inc.
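A coordinate-free description built by binning Cα-Cα distances, as mentioned in this abstract, could look roughly like the histogram below. This is a sketch under assumptions: the 2 Å bin width, the 20 Å upper limit and the function name are hypothetical, and the abstract does not specify how Princeton_TIGRESS 2.0 actually defines its bins or feeds them to the SVM.

```python
import math
from itertools import combinations

def binned_ca_distances(ca_coords, bin_width=2.0, max_dist=20.0):
    """Histogram of pairwise C-alpha distances, independent of the coordinate frame.

    ca_coords: list of (x, y, z) tuples, one per residue.
    bin_width, max_dist: assumed binning parameters in angstroms.
    Returns a list of counts, one per distance bin; pairs at or beyond max_dist are ignored.
    """
    n_bins = int(math.ceil(max_dist / bin_width))
    counts = [0] * n_bins
    for a, b in combinations(ca_coords, 2):
        distance = math.dist(a, b)
        if distance < max_dist:
            # Guard against floating-point edge cases at the last bin boundary.
            index = min(int(distance // bin_width), n_bins - 1)
            counts[index] += 1
    return counts

if __name__ == "__main__":
    coords = [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0), (0.0, 4.0, 0.0)]
    print(binned_ca_distances(coords, bin_width=2.0, max_dist=10.0))
```

Because only inter-residue distances enter the histogram, rotating or translating the model leaves the feature vector unchanged, which is the sense in which the representation is coordinate-free.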
Correction of erroneously packed protein's side chains in the NMR structure based on ab initio chemical shift calculations.

PubMed

Zhu, Tong; Zhang, John Z H; He, Xiao

2014-09-14

In this work, protein side-chain ¹H chemical shifts are used as probes to detect and correct side-chain packing errors in protein NMR structures through structural refinement. By applying the automated fragmentation quantum mechanics/molecular mechanics (AF-QM/MM) method for ab initio calculation of chemical shifts, incorrect side-chain packing was detected in the NMR structures of the Pin1 WW domain. The NMR structure is then refined by using molecular dynamics simulation and the polarized protein-specific charge (PPC) model. The computationally refined structure of the Pin1 WW domain is in excellent agreement with the corresponding X-ray structure. In particular, the use of the PPC model yields a more accurate structure than that obtained with the standard (nonpolarizable) force field. By comparison, some of the widely used empirical models for chemical shift calculations are unable to correctly describe the relationship between a particular proton chemical shift and the protein structure. The AF-QM/MM method can be used as a powerful tool for protein NMR structure validation and structural flaw detection.
PREFMD: a web server for protein structure refinement via molecular dynamics simulations.

PubMed

Heo, Lim; Feig, Michael

2018-03-15

Refinement of protein structure models is a long-standing problem in structural bioinformatics. Molecular dynamics-based methods have emerged as an avenue to achieve consistent refinement. The PREFMD web server implements an optimized protocol based on the method successfully tested in CASP11. Validation with recent CASP refinement targets shows consistent and more significant improvement in global structure accuracy over other state-of-the-art servers. PREFMD is freely available as a web server at http://feiglab.org/prefmd. Scripts for running PREFMD as a stand-alone package are available at https://github.com/feiglab/prefmd.git. Contact: feig@msu.edu. Supplementary data are available at Bioinformatics online.
Refinement of protein termini in template-based modeling using conformational space annealing.

PubMed

Park, Hahnbeom; Ko, Junsu; Joo, Keehyoung; Lee, Julian; Seok, Chaok; Lee, Jooyoung

2011-09-01

The rapid increase in the number of experimentally determined protein structures in recent years enables us to obtain more reliable protein tertiary structure models than ever by template-based modeling. However, refinement of template-based models beyond the limit available from the best templates is still needed for understanding protein function in atomic detail. In this work, we develop a new method for protein terminus modeling that can be applied to refinement of models with unreliable terminus structures. The energy function for terminus modeling consists of both physics-based and knowledge-based potential terms with carefully optimized relative weights. Effective sampling of both the framework and terminus is performed using the conformational space annealing technique. This method has been tested on a set of termini derived from a nonredundant structure database and two sets of termini from the CASP8 targets. The performance of the terminus modeling method is significantly improved over our previous method that does not employ terminus refinement. It is also comparable or superior to the best server methods tested in CASP8. The success of the current approach suggests that similar strategy may be applied to other types of refinement problems such as loop modeling or secondary structure rearrangement. Copyright © 2011 Wiley-Liss, Inc.
Partial unfolding and refolding for structure refinement: A unified approach of geometric simulations and molecular dynamics.

PubMed

Kumar, Avishek; Campitelli, Paul; Thorpe, M F; Ozkan, S Banu

2015-12-01

The most successful protein structure prediction methods to date have been template-based modeling (TBM) or homology modeling, which predicts protein structure based on experimental structures. These high-accuracy predictions sometimes retain structural errors due to incorrect templates or a lack of accurate templates in the case of low sequence similarity, making these structures inadequate in drug-design studies or molecular dynamics simulations. We have developed a new physics-based approach to the protein refinement problem by mimicking the mechanism of chaperones that rehabilitate misfolded proteins. The template structure is unfolded by selectively (targeted) pulling on different portions of the protein using the geometry-based technique FRODA, and then refolded using hierarchically restrained replica exchange molecular dynamics simulations (hr-REMD). FRODA unfolding is used to create a diverse set of topologies for surveying near-native structures from a template and to provide a set of persistent contacts to be employed during refolding. We have tested our approach on 13 previous CASP targets and observed that this method of folding an ensemble of partially unfolded structures, through the hierarchical addition of contact restraints (that is, first local and then nonlocal interactions), leads to a refolding of the structure along with refinement in most cases (12/13). Although this approach yields refined models through advancement in sampling, the task of blind selection of the best refined models still needs to be solved. Overall, the method can be useful for improved sampling for low-resolution models where certain portions of the structure are incorrectly modeled. © 2015 Wiley Periodicals, Inc.

Refinement of NMR structures using implicit solvent and advanced sampling techniques.

PubMed

Chen, Jianhan; Im, Wonpil; Brooks, Charles L

2004-12-15

NMR biomolecular structure calculations exploit simulated annealing methods for conformational sampling and require a relatively high level of redundancy in the experimental restraints to determine quality three-dimensional structures. Recent advances in generalized Born (GB) implicit solvent models should make it possible to combine information from both experimental measurements and accurate empirical force fields to improve the quality of NMR-derived structures. In this paper, we study the influence of implicit solvent on the refinement of protein NMR structures and identify an optimal protocol for utilizing these improved force fields. To do so, we carry out structure refinement experiments for model proteins with published NMR structures using full NMR restraints and subsets of them. We also investigate the application of advanced sampling techniques to NMR structure refinement. Similar to the observations of Xia et al. (J. Biomol. NMR 2002, 22, 317-331), we find that the impact of implicit solvent is rather small when there is a sufficient number of experimental restraints (such as in the final stage of NMR structure determination), whether implicit solvent is used throughout the calculation or only in the final refinement step. The application of advanced sampling techniques also seems to have minimal impact in this case. However, when the experimental data are limited, we demonstrate that refinement with implicit solvent can substantially improve the quality of the structures. In particular, when combined with an advanced sampling technique, the replica exchange (REX) method, near-native structures can be rapidly moved toward the native basin. The REX method provides both enhanced sampling and automatic selection of the most native-like (lowest energy) structures.
An optimal protocol based on our studies first generates an ensemble of initial structures that maximally satisfy the available experimental data with conventional NMR software using a simplified force field and then refines these structures with implicit solvent using the REX method. We systematically examine the reliability and efficacy of this protocol using four proteins of various sizes ranging from the 56-residue B1 domain of Streptococcal protein G to the 370-residue Maltose-binding protein. Significant improvement in the structures was observed in all cases when refinement was based on low-redundancy restraint data. The proposed protocol is anticipated to be particularly useful in early stages of NMR structure determination where a reliable estimate of the native fold from limited data can significantly expedite the overall process. This refinement procedure is also expected to be useful when redundant experimental data are not readily available, such as for large multidomain biomolecules and in solid-state NMR structure determination.
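The replica exchange (REX) step mentioned above rests on a Metropolis test for swapping configurations between two temperatures. The sketch below shows the standard temperature-REX acceptance rule, P = min(1, exp[(β_i − β_j)(E_i − E_j)]); it is a generic illustration, not the authors' protocol. The function name, the choice of kcal/mol units, and the example energies and temperatures are assumptions.

```python
import math

K_B = 0.0019872041  # Boltzmann constant in kcal/(mol*K)


def swap_probability(e_i, e_j, t_i, t_j):
    """Acceptance probability for exchanging the configurations of two replicas.

    e_i, e_j: potential energies (kcal/mol) of the replicas currently at
    temperatures t_i and t_j (kelvin).
    """
    beta_i = 1.0 / (K_B * t_i)
    beta_j = 1.0 / (K_B * t_j)
    delta = (beta_i - beta_j) * (e_i - e_j)
    # Swaps that move the lower energy to the colder replica are always accepted.
    return 1.0 if delta >= 0 else math.exp(delta)


# Cold replica (300 K) holds the higher energy: the swap is always accepted.
print(swap_probability(-100.0, -110.0, 300.0, 350.0))
# Cold replica already holds the lower energy: the swap is accepted only sometimes.
print(swap_probability(-110.0, -100.0, 300.0, 350.0))
```

Because low-energy configurations tend to migrate to the coldest replica, the lowest-temperature ensemble is enriched in low-energy structures, which is consistent with the automatic selection described in the abstract.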
xMDFF: molecular dynamics flexible fitting of low-resolution X-ray structures.

PubMed

McGreevy, Ryan; Singharoy, Abhishek; Li, Qufei; Zhang, Jingfen; Xu, Dong; Perozo, Eduardo; Schulten, Klaus

2014-09-01

X-ray crystallography remains the most dominant method for solving atomic structures. However, for relatively large systems, the availability of only medium-to-low-resolution diffraction data often limits the determination of all-atom details. A new molecular dynamics flexible fitting (MDFF)-based approach, xMDFF, for determining structures from such low-resolution crystallographic data is reported. xMDFF employs a real-space refinement scheme that flexibly fits atomic models into an iteratively updating electron-density map. It addresses significant large-scale deformations of the initial model to fit the low-resolution density, as tested with synthetic low-resolution maps of D-ribose-binding protein. xMDFF has been successfully applied to re-refine six low-resolution protein structures of varying sizes that had already been submitted to the Protein Data Bank. Finally, via systematic refinement of a series of data from 3.6 to 7 Å resolution, xMDFF refinements together with electrophysiology experiments were used to validate the first all-atom structure of the voltage-sensing protein Ci-VSP.
Significance of structural changes in proteins: expected errors in refined protein structures.

PubMed Central

Stroud, R. M.; Fauman, E. B.

1995-01-01

A quantitative expression key to evaluating significant structural differences or induced shifts between any two protein structures is derived. Because crystallography leads to reports of a single (or sometimes dual) position for each atom, the significance of any structural change based on comparison of two structures depends critically on knowing the expected precision of each median atomic position reported, and on extracting it for each atom, from the information provided in the Protein Data Bank and in the publication. The differences between structures of protein molecules that should be identical, and that are normally distributed, indicating that they are not affected by crystal contacts, were analyzed with respect to many potential indicators of structure precision, so as to extract, essentially by "machine learning" principles, a generally applicable expression involving the highest correlates. Eighteen refined crystal structures from the Protein Data Bank, in which there are multiple molecules in the crystallographic asymmetric unit, were selected and compared. The thermal B factor, the connectivity of the atom, and the ratio of the number of reflections to the number of atoms used in refinement correlate best with the magnitude of the positional differences between regions of the structures that otherwise would be expected to be the same. These results are embodied in a six-parameter equation that can be applied to any crystallographically refined structure to estimate the expected uncertainty in position of each atom. Structure change in a macromolecule can thus be referenced to the expected uncertainty in atomic position as reflected in the variance between otherwise identical structures with the observed values of correlated parameters. PMID:8563637
Protein Structure Validation and Refinement Using Amide Proton Chemical Shifts Derived from Quantum Mechanics

PubMed Central

Christensen, Anders S.; Linnet, Troels E.; Borg, Mikael; Boomsma, Wouter; Lindorff-Larsen, Kresten; Hamelryck, Thomas; Jensen, Jan H.

2013-01-01

We present the ProCS method for the rapid and accurate prediction of protein backbone amide proton chemical shifts - sensitive probes of the geometry of key hydrogen bonds that determine protein structure. ProCS is parameterized against quantum mechanical (QM) calculations and reproduces high level QM results obtained for a small protein with an RMSD of 0.25 ppm (r = 0.94). ProCS is interfaced with the PHAISTOS protein simulation program and is used to infer statistical protein ensembles that reflect experimentally measured amide proton chemical shift values. Such chemical shift-based structural refinements, starting from high-resolution X-ray structures of Protein G, ubiquitin, and SMN Tudor Domain, result in average chemical shifts, hydrogen bond geometries, and trans-hydrogen bond (h3 JNC') spin-spin coupling constants that are in excellent agreement with experiment. We show that the structural sensitivity of the QM-based amide proton chemical shift predictions is needed to obtain this agreement. The ProCS method thus offers a powerful new tool for refining the structures of hydrogen bonding networks to high accuracy with many potential applications such as protein flexibility in ligand binding. PMID:24391900
Homology-based hydrogen bond information improves crystallographic structures in the PDB.

PubMed

van Beusekom, Bart; Touw, Wouter G; Tatineni, Mahidhar; Somani, Sandeep; Rajagopal, Gunaretnam; Luo, Jinquan; Gilliland, Gary L; Perrakis, Anastassis; Joosten, Robbie P

2018-03-01

The Protein Data Bank (PDB) is the global archive for structural information on macromolecules, and a popular resource for researchers, teachers, and students, amassing more than one million unique users each year. Crystallographic structure models in the PDB (more than 100,000 entries) are optimized against the crystal diffraction data and geometrical restraints. This process of crystallographic refinement typically ignored hydrogen bond (H-bond) distances as a source of information. However, H-bond restraints can improve structures at low resolution where diffraction data are limited. To improve low-resolution structure refinement, we present methods for deriving H-bond information either globally from well-refined high-resolution structures from the PDB-REDO databank, or specifically from on-the-fly constructed sets of homologous high-resolution structures. Refinement incorporating HOmology DErived Restraints (HODER), improves geometrical quality and the fit to the diffraction data for many low-resolution structures. To make these improvements readily available to the general public, we applied our new algorithms to all crystallographic structures in the PDB: using massively parallel computing, we constructed a new instance of the PDB-REDO databank (https://pdb-redo.eu). This resource is useful for researchers to gain insight on individual structures, on specific protein families (as we demonstrate with examples), and on general features of protein structure using data mining approaches on a uniformly treated dataset. © 2017 The Protein Society.
iATTRACT: simultaneous global and local interface optimization for protein-protein docking refinement.

PubMed

Schindler, Christina E M; de Vries, Sjoerd J; Zacharias, Martin

2015-02-01

Protein-protein interactions are abundant in the cell but to date structural data for a large number of complexes is lacking. Computational docking methods can complement experiments by providing structural models of complexes based on structures of the individual partners. A major caveat for docking success is accounting for protein flexibility. Especially, interface residues undergo significant conformational changes upon binding. This limits the performance of docking methods that keep partner structures rigid or allow limited flexibility. A new docking refinement approach, iATTRACT, has been developed which combines simultaneous full interface flexibility and rigid body optimizations during docking energy minimization. It employs an atomistic molecular mechanics force field for intermolecular interface interactions and a structure-based force field for intramolecular contributions. The approach was systematically evaluated on a large protein-protein docking benchmark, starting from an enriched decoy set of rigidly docked protein-protein complexes deviating by up to 15 Å from the native structure at the interface. Large improvements in sampling and slight but significant improvements in scoring/discrimination of near native docking solutions were observed. Complexes with initial deviations at the interface of up to 5.5 Å were refined to significantly better agreement with the native structure. Improvements in the fraction of native contacts were especially favorable, yielding increases of up to 70%. © 2014 Wiley Periodicals, Inc.
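The fraction of native contacts reported above can be computed as the share of interface residue pairs in the native complex that are also present in the model. The following is a minimal sketch under simplifying assumptions: each residue is represented by a single point (for example one representative atom), the 5 Å cutoff and the function names are illustrative choices, and this is not the evaluation code used in the study.

```python
import math


def interface_contacts(chain_a, chain_b, cutoff=5.0):
    """Return the set of residue pairs (res_a, res_b) closer than cutoff (in Å).

    chain_a, chain_b: dicts mapping a residue id to an (x, y, z) tuple for one
    representative atom. Using a single atom per residue is a simplification.
    """
    return {
        (res_a, res_b)
        for res_a, pos_a in chain_a.items()
        for res_b, pos_b in chain_b.items()
        if math.dist(pos_a, pos_b) <= cutoff
    }


def fraction_native_contacts(native_contacts, model_contacts):
    """Fraction of native interface contacts that the model reproduces."""
    if not native_contacts:
        raise ValueError("native complex has no interface contacts")
    return len(native_contacts & model_contacts) / len(native_contacts)


receptor = {1: (0.0, 0.0, 0.0), 2: (10.0, 0.0, 0.0)}
native_ligand = {101: (3.0, 0.0, 0.0), 102: (12.0, 0.0, 0.0)}
model_ligand = {101: (3.0, 0.0, 0.0), 102: (20.0, 0.0, 0.0)}

native = interface_contacts(receptor, native_ligand)
model = interface_contacts(receptor, model_ligand)
print(fraction_native_contacts(native, model))
```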
[Can the local energy minimization refine the PDB structures of different resolution universally?].

PubMed

Godzi, M G; Gromova, A P; Oferkin, I V; Mironov, P V

2009-01-01

The local energy minimization was statistically validated as the refinement strategy for PDB structure pairs of different resolution. Thirteen pairs of structures with the only difference in resolution were extracted from PDB, and the structures of 11 identical proteins obtained by different X-ray diffraction techniques were represented. The distribution of RMSD value was calculated for these pairs before and after the local energy minimization of each structure. The MMFF94 field was used for energy calculations, and the quasi-Newton method was used for local energy minimization. By comparison of these two RMSD distributions, the local energy minimization was proved to statistically increase the structural differences in pairs so that it cannot be used for refinement purposes. To explore the prospects of complex refinement strategies based on energy minimization, randomized structures were obtained by moving the initial PDB structures as far as the minimized structures had been moved in a multidimensional space of atomic coordinates. For these randomized structures, the RMSD distribution was calculated and compared with that for minimized structures. The significant differences in their mean values proved the energy surface of the protein to have only few minima near the conformations of different resolution obtained by X-ray diffraction for PDB. Some other results obtained by exploring the energy surface near these conformations are also presented. These results are expected to be very useful for the development of new protein refinement strategies based on energy minimization.
Improving virtual screening of G protein-coupled receptors via ligand-directed modeling

PubMed Central

Simms, John; Christopoulos, Arthur; Wootten, Denise

2017-01-01

G protein-coupled receptors (GPCRs) play crucial roles in cell physiology and pathophysiology. There is increasing interest in using structural information for virtual screening (VS) of libraries and for structure-based drug design to identify novel agonist or antagonist leads. However, the sparse availability of experimentally determined GPCR/ligand complex structures with diverse ligands impedes the application of structure-based drug design (SBDD) programs directed to identifying new molecules with a select pharmacology. In this study, we apply ligand-directed modeling (LDM) to available GPCR X-ray structures to improve VS performance and selectivity towards molecules of specific pharmacological profile. The described method refines a GPCR binding pocket conformation using a single known ligand for that GPCR. The LDM method is a computationally efficient, iterative workflow consisting of protein sampling and ligand docking. We developed an extensive benchmark comparing LDM-refined binding pockets to GPCR X-ray crystal structures across seven different GPCRs bound to a range of ligands of different chemotypes and pharmacological profiles. LDM-refined models showed improvement in VS performance over origin X-ray crystal structures in 21 out of 24 cases. In all cases, the LDM-refined models had superior performance in enriching for the chemotype of the refinement ligand. This likely contributes to the LDM success in all cases of inhibitor-bound to agonist-bound binding pocket refinement, a key task for GPCR SBDD programs. Indeed, agonist ligands are required for a plethora of GPCRs for therapeutic intervention, however GPCR X-ray structures are mostly restricted to their inactive inhibitor-bound state. PMID:29131821
Solution NMR Refinement of a Metal Ion Bound Protein Using Metal Ion Inclusive Restrained Molecular Dynamics Methods

PubMed Central

Chakravorty, Dhruva K.; Wang, Bing; Lee, Chul Won; Guerra, Alfredo J.; Giedroc, David P.; Merz, Kenneth M.

2013-01-01

Correctly calculating the structure of metal coordination sites in a protein during the process of nuclear magnetic resonance (NMR) structure determination and refinement continues to be a challenging task. In this study, we present an accurate and convenient means by which to include metal ions in the NMR structure determination process using molecular dynamics (MD) constrained by NMR-derived data to obtain a realistic and physically viable description of the metal binding site(s). This method provides the framework to accurately portray the metal ions and its binding residues in a pseudo-bond or dummy-cation like approach, and is validated by quantum mechanical/molecular mechanical (QM/MM) MD calculations constrained by NMR-derived data. To illustrate this approach, we refine the zinc coordination complex structure of the zinc sensing transcriptional repressor protein Staphylococcus aureus CzrA, generating over 130 ns of MD and QM/MM MD NMR-data compliant sampling. In addition to refining the first coordination shell structure of the Zn(II) ion, this protocol benefits from being performed in a periodically replicated solvation environment including long-range electrostatics. We determine that unrestrained (not based on NMR data) MD simulations correlated to the NMR data in a time-averaged ensemble. The accurate solution structure ensemble of the metal-bound protein accurately describes the role of conformational dynamics in allosteric regulation of DNA binding by zinc and serves to validate our previous unrestrained MD simulations of CzrA. This methodology has potentially broad applicability in the structure determination of metal ion bound proteins, protein folding and metal template protein-design studies. PMID:23609042
A conservation and biophysics guided stochastic approach to refining docked multimeric proteins.

PubMed

Akbal-Delibas, Bahar; Haspel, Nurit

2013-01-01

We introduce a protein docking refinement method that accepts complexes consisting of any number of monomeric units. The method uses a scoring function based on a tight coupling between evolutionary conservation, geometry and physico-chemical interactions. Understanding the role of protein complexes in the basic biology of organisms heavily relies on the detection of protein complexes and their structures. Different computational docking methods are developed for this purpose, however, these methods are often not accurate and their results need to be further refined to improve the geometry and the energy of the resulting complexes. Also, despite the fact that complexes in nature often have more than two monomers, most docking methods focus on dimers since the computational complexity increases exponentially due to the addition of monomeric units. Our results show that the refinement scheme can efficiently handle complexes with more than two monomers by biasing the results towards complexes with native interactions, filtering out false positive results. Our refined complexes have better IRMSDs with respect to the known complexes and lower energies than those initial docked structures. Evolutionary conservation information allows us to bias our results towards possible functional interfaces, and the probabilistic selection scheme helps us to escape local energy minima. We aim to incorporate our refinement method in a larger framework which also enables docking of multimeric complexes given only monomeric structures.
Using more than 801 296 small-molecule crystal structures to aid in protein structure refinement and analysis

PubMed Central

Cole, Jason C.

2017-01-01

The Cambridge Structural Database (CSD) is the worldwide resource for the dissemination of all published three-dimensional structures of small-molecule organic and metal–organic compounds. This paper briefly describes how this collection of crystal structures can be used en masse in the context of macromolecular crystallography. Examples highlight how the CSD and associated software aid protein–ligand complex validation, and show how the CSD could be further used in the generation of geometrical restraints for protein structure refinement. PMID:28291758
GRID: a high-resolution protein structure refinement algorithm.

PubMed

Chitsaz, Mohsen; Mayo, Stephen L

2013-03-05

The energy-based refinement of protein structures generated by fold prediction algorithms to atomic-level accuracy remains a major challenge in structural biology. Energy-based refinement is mainly dependent on two components: (1) sufficiently accurate force fields, and (2) efficient conformational space search algorithms. Focusing on the latter, we developed a high-resolution refinement algorithm called GRID. It takes a three-dimensional protein structure as input and, using an all-atom force field, attempts to improve the energy of the structure by systematically perturbing backbone dihedrals and side-chain rotamer conformations. We compare GRID to Backrub, a stochastic algorithm that has been shown to predict a significant fraction of the conformational changes that occur with point mutations. We applied GRID and Backrub to 10 high-resolution (≤ 2.8 Å) crystal structures from the Protein Data Bank and measured the energy improvements obtained and the computation times required to achieve them. GRID resulted in energy improvements that were significantly better than those attained by Backrub while expending about the same amount of computational resources. GRID resulted in relaxed structures that had slightly higher backbone RMSDs compared to Backrub relative to the starting crystal structures. The average RMSD was 0.25 ± 0.02 Å for GRID versus 0.14 ± 0.04 Å for Backrub. These relatively minor deviations indicate that both algorithms generate structures that retain their original topologies, as expected given the nature of the algorithms. Copyright © 2012 Wiley Periodicals, Inc.
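The backbone RMSD values quoted above measure how far a relaxed structure has moved from its starting coordinates. A minimal sketch of the calculation follows, assuming the two coordinate sets list the same atoms in the same order and are already superimposed; an optimal superposition step (such as the Kabsch algorithm) is usually applied first and is omitted here. The function name and the toy coordinates are illustrative.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equally long lists of (x, y, z).

    Assumes the structures are already superimposed and atom-matched.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equally long")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


start = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
relaxed = [(0.0, 0.0, 0.25), (1.0, 0.0, 0.25)]
print(rmsd(start, relaxed))
```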
Modelling dynamics in protein crystal structures by ensemble refinement

PubMed Central

Burnley, B Tom; Afonine, Pavel V; Adams, Paul D; Gros, Piet

2012-01-01

Single-structure models derived from X-ray data do not adequately account for the inherent, functionally important dynamics of protein molecules. We generated ensembles of structures by time-averaged refinement, where local molecular vibrations were sampled by molecular-dynamics (MD) simulation whilst global disorder was partitioned into an underlying overall translation–libration–screw (TLS) model. Modeling of 20 protein datasets at 1.1–3.1 Å resolution reduced cross-validated Rfree values by 0.3–4.9%, indicating that ensemble models fit the X-ray data better than single structures. The ensembles revealed that, while most proteins display a well-ordered core, some proteins exhibit a ‘molten core’ likely supporting functionally important dynamics in ligand binding, enzyme activity and protomer assembly. Order–disorder changes in HIV protease indicate a mechanism of entropy compensation for ordering the catalytic residues upon ligand binding by disordering specific core residues. Thus, ensemble refinement extracts dynamical details from the X-ray data that allow a more comprehensive understanding of structure–dynamics–function relationships. DOI: http://dx.doi.org/10.7554/eLife.00311.001 PMID:23251785
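The Rfree values cited above are crystallographic R-factors evaluated on a test set of reflections that is held out of refinement. The sketch below uses the standard definition R = Σ| |Fobs| − k|Fcalc| | / Σ|Fobs|, with a single overall scale factor k that is assumed to be known; the function names and the toy amplitudes are illustrative, and real refinement programs handle scaling and resolution binning in more detail.

```python
def r_factor(f_obs, f_calc, scale=1.0):
    """Crystallographic R-factor for matched lists of structure-factor amplitudes."""
    numerator = sum(abs(abs(fo) - scale * abs(fc)) for fo, fc in zip(f_obs, f_calc))
    denominator = sum(abs(fo) for fo in f_obs)
    if denominator == 0:
        raise ValueError("sum of observed amplitudes is zero")
    return numerator / denominator


def r_work_and_free(f_obs, f_calc, free_flags, scale=1.0):
    """Split reflections by free_flags and return (R_work, R_free)."""
    work = [(fo, fc) for fo, fc, is_free in zip(f_obs, f_calc, free_flags) if not is_free]
    free = [(fo, fc) for fo, fc, is_free in zip(f_obs, f_calc, free_flags) if is_free]
    r_work = r_factor([fo for fo, _ in work], [fc for _, fc in work], scale)
    r_free = r_factor([fo for fo, _ in free], [fc for _, fc in free], scale)
    return r_work, r_free


f_obs = [100.0, 50.0, 80.0, 40.0]
f_calc = [90.0, 55.0, 80.0, 50.0]
free_flags = [False, False, False, True]  # last reflection is in the test set
r_work, r_free = r_work_and_free(f_obs, f_calc, free_flags)
print(f"R_work = {r_work:.4f}, R_free = {r_free:.4f}")
```

A model that fits the held-out reflections better gives a lower Rfree, which is the sense in which the ensemble models above are said to fit the data better than single structures.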
Neutron protein crystallography: A complementary tool for locating hydrogens in proteins.

PubMed

O'Dell, William B; Bodenheimer, Annette M; Meilleur, Flora

2016-07-15

Neutron protein crystallography is a powerful tool for investigating protein chemistry because it directly locates hydrogen atom positions in a protein structure. The visibility of hydrogen and deuterium atoms arises from the strong interaction of neutrons with the nuclei of these isotopes. Positions can be unambiguously assigned from diffraction at resolutions typical of protein crystals. Neutrons have the additional benefit to structural biology of not inducing radiation damage in protein crystals. The same crystal could be measured multiple times for parametric studies. Here, we review the basic principles of neutron protein crystallography. The information that can be gained from a neutron structure is presented in balance with practical considerations. Methods to produce isotopically-substituted proteins and to grow large crystals are provided in the context of neutron structures reported in the literature. Available instruments for data collection and software for data processing and structure refinement are described along with technique-specific strategies including joint X-ray/neutron structure refinement. Examples are given to illustrate, ultimately, the unique scientific value of neutron protein crystal structures. Copyright © 2015 Elsevier Inc. All rights reserved.
Homology‐based hydrogen bond information improves crystallographic structures in the PDB

PubMed Central

van Beusekom, Bart; Touw, Wouter G.; Tatineni, Mahidhar; Somani, Sandeep; Rajagopal, Gunaretnam; Luo, Jinquan; Gilliland, Gary L.; Perrakis, Anastassis

2017-01-01

The Protein Data Bank (PDB) is the global archive for structural information on macromolecules, and a popular resource for researchers, teachers, and students, amassing more than one million unique users each year. Crystallographic structure models in the PDB (more than 100,000 entries) are optimized against the crystal diffraction data and geometrical restraints. This process of crystallographic refinement typically ignored hydrogen bond (H‐bond) distances as a source of information. However, H‐bond restraints can improve structures at low resolution where diffraction data are limited. To improve low‐resolution structure refinement, we present methods for deriving H‐bond information either globally from well‐refined high‐resolution structures from the PDB‐REDO databank, or specifically from on‐the‐fly constructed sets of homologous high‐resolution structures. Refinement incorporating HOmology DErived Restraints (HODER), improves geometrical quality and the fit to the diffraction data for many low‐resolution structures. To make these improvements readily available to the general public, we applied our new algorithms to all crystallographic structures in the PDB: using massively parallel computing, we constructed a new instance of the PDB‐REDO databank (https://pdb-redo.eu). This resource is useful for researchers to gain insight on individual structures, on specific protein families (as we demonstrate with examples), and on general features of protein structure using data mining approaches on a uniformly treated dataset. PMID:29168245
Advanced Computational Methods for High-accuracy Refinement of Protein Low-quality Models

NASA Astrophysics Data System (ADS)

Zang, Tianwu

Predicting the 3-dimensional structure of proteins has been a major interest in modern computational biology. While many successful methods can generate models with 3–5 Å root-mean-square deviation (RMSD) from the solution, progress in refining these models has been quite slow. It is therefore urgently needed to develop effective methods to bring low-quality models to higher-accuracy ranges (e.g., less than 2 Å RMSD). In this thesis, I present several novel computational methods to address the high-accuracy refinement problem. First, an enhanced sampling method, named parallel continuous simulated tempering (PCST), is developed to accelerate the molecular dynamics (MD) simulation. Second, two energy biasing methods, Structure-Based Model (SBM) and Ensemble-Based Model (EBM), are introduced to perform targeted sampling around important conformations. Third, a three-step method is developed to blindly select high-quality models along the MD simulation. These methods work together to achieve significant refinement of low-quality models without any knowledge of the solution. The effectiveness of these methods is examined in different applications. Using the PCST-SBM method, models with higher global distance test scores (GDT_TS) are generated and selected in the MD simulation of 18 targets from the refinement category of the 10th Critical Assessment of Structure Prediction (CASP10). In addition, the refinement test of two CASP10 targets using the PCST-EBM method indicates that EBM may bring the initial model to even higher-quality levels. Furthermore, a multi-round refinement protocol of PCST-SBM improves the model quality of a protein to a level that is sufficiently high for molecular replacement in X-ray crystallography. Our results justify the crucial position of enhanced sampling in protein structure prediction and demonstrate that a considerable improvement of low-accuracy structures is still achievable with current force fields.
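The GDT_TS score used above averages the percentage of Cα atoms that lie within 1, 2, 4 and 8 Å of their positions in the reference structure. The sketch below computes that average from per-residue Cα distances obtained after a single superposition. This is a simplifying assumption: the full GDT procedure searches for the best superposition at each cutoff, so the value here should be read as a lower-bound illustration. The function name and example distances are hypothetical.

```python
def gdt_ts(ca_distances, cutoffs=(1.0, 2.0, 4.0, 8.0)):
    """Approximate GDT_TS (0-100) from per-residue CA distances in Å.

    ca_distances: distances between matched CA atoms of model and reference
    after one fixed superposition (a simplification of the full GDT search).
    """
    if not ca_distances:
        raise ValueError("no distances given")
    total = len(ca_distances)
    percentages = [
        100.0 * sum(1 for d in ca_distances if d <= cutoff) / total
        for cutoff in cutoffs
    ]
    return sum(percentages) / len(percentages)


# Four residues: within 1 Å, within 2 Å, within 4 Å, and beyond 8 Å.
print(gdt_ts([0.5, 1.5, 3.0, 9.0]))
```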
Accurate macromolecular crystallographic refinement: incorporation of the linear scaling, semiempirical quantum-mechanics program DivCon into the PHENIX refinement package.

PubMed

Borbulevych, Oleg Y; Plumley, Joshua A; Martin, Roger I; Merz, Kenneth M; Westerhoff, Lance M

2014-05-01

Macromolecular crystallographic refinement relies on sometimes dubious stereochemical restraints and rudimentary energy functionals to ensure the correct geometry of the model of the macromolecule and any covalently bound ligand(s). The ligand stereochemical restraint file (CIF) requires a priori understanding of the ligand geometry within the active site, and creation of the CIF is often an error-prone process owing to the great variety of potential ligand chemistry and structure. Stereochemical restraints have been replaced with more robust functionals through the integration of the linear-scaling, semiempirical quantum-mechanics (SE-QM) program DivCon with the PHENIX X-ray refinement engine. The PHENIX/DivCon package has been thoroughly validated on a population of 50 protein-ligand Protein Data Bank (PDB) structures with a range of resolutions and chemistry. The PDB structures used for the validation were originally refined utilizing various refinement packages and were published within the past five years. PHENIX/DivCon does not utilize CIF(s), link restraints and other parameters for refinement and hence it does not make as many a priori assumptions about the model. Across the entire population, the method results in reasonable ligand geometries and low ligand strains, even when the original refinement exhibited difficulties, indicating that PHENIX/DivCon is applicable to both single-structure and high-throughput crystallography.
Discrete Molecular Dynamics Approach to the Study of Disordered and Aggregating Proteins.

PubMed

Emperador, Agustí; Orozco, Modesto

2017-03-14

We present a refinement of the coarse-grained PACSAB force field for Discrete Molecular Dynamics (DMD) simulations of proteins in aqueous conditions. Like the original version, the refined method provides a good representation of the structure and dynamics of folded proteins, but it provides much better representations of a variety of unfolded proteins, including some very large ones that are impossible to analyze by atomistic simulation methods. The PACSAB/DMD method also accurately reproduces aggregation properties, providing good pictures of the structural ensembles of proteins showing a folded core and an intrinsically disordered region. The combination of accuracy and speed makes the method presented here a good alternative for the exploration of unstructured protein systems.
Automated protein structure modeling in CASP9 by I-TASSER pipeline combined with QUARK-based ab initio folding and FG-MD-based structure refinement

PubMed Central

Xu, Dong; Zhang, Jian; Roy, Ambrish; Zhang, Yang

2011-01-01

I-TASSER is an automated pipeline for protein tertiary structure prediction using multiple threading alignments and iterative structure assembly simulations. In CASP9 experiments, two new algorithms, QUARK and FG-MD, were added to the I-TASSER pipeline for improving the structural modeling accuracy. QUARK is a de novo structure prediction algorithm used for structure modeling of proteins that lack detectable template structures. For distantly homologous targets, QUARK models are found useful as a reference structure for selecting good threading alignments and guiding the I-TASSER structure assembly simulations. FG-MD is an atomic-level structural refinement program that uses structural fragments collected from the PDB structures to guide molecular dynamics simulation and improve the local structure of predicted model, including hydrogen-bonding networks, torsion angles and steric clashes. Despite considerable progress in both the template-based and template-free structure modeling, significant improvements on protein target classification, domain parsing, model selection, and ab initio folding of beta-proteins are still needed to further improve the I-TASSER pipeline. PMID:22069036
Protein homology model refinement by large-scale energy optimization.

PubMed

Park, Hahnbeom; Ovchinnikov, Sergey; Kim, David E; DiMaio, Frank; Baker, David

2018-03-20

Proteins fold to their lowest free-energy structures, and hence the most straightforward way to increase the accuracy of a partially incorrect protein structure model is to search for the lowest-energy nearby structure. This direct approach has met with little success for two reasons: first, energy function inaccuracies can lead to false energy minima, resulting in model degradation rather than improvement; and second, even with an accurate energy function, the search problem is formidable because the energy only drops considerably in the immediate vicinity of the global minimum, and there are a very large number of degrees of freedom. Here we describe a large-scale energy optimization-based refinement method that incorporates advances in both search and energy function accuracy that can substantially improve the accuracy of low-resolution homology models. The method refined low-resolution homology models into correct folds for 50 of 84 diverse protein families and generated improved models in recent blind structure prediction experiments. Analyses of the basis for these improvements reveal contributions from both the improvements in conformational sampling techniques and the energy function.

Ultra-high-resolution X-ray structure of proteins.

PubMed

Lecomte, C; Guillot, B; Muzet, N; Pichon-Pesme, V; Jelsch, C

2004-04-01

The constant advances in synchrotron radiation sources and crystallogenesis methods and the impulse of structural genomics projects have brought biocrystallography to a context favorable to subatomic resolution protein and nucleic acid structures. Thus, as soon as such precision can be frequently obtained, the amount of information available in the precise electron density should also be easily and naturally exploited, similarly to the field of small molecule charge density studies. Indeed, the use of a nonspherical model for the atomic electron density in the refinement of subatomic resolution protein structures allows the experimental description of their electrostatic properties. Some methods we have developed and implemented in our multipolar refinement program MoPro for this purpose are presented. Examples of successful applications to several subatomic resolution protein structures, including the 0.66 angstrom resolution human aldose reductase, are described.
Improving consensus structure by eliminating averaging artifacts

PubMed Central

KC, Dukka B

2009-01-01

Background Common structural biology methods (i.e., NMR and molecular dynamics) often produce ensembles of molecular structures. Consequently, averaging of 3D coordinates of molecular structures (proteins and RNA) is a frequent approach to obtain a consensus structure that is representative of the ensemble. However, when the structures are averaged, artifacts can result in unrealistic local geometries, including unphysical bond lengths and angles. Results Herein, we describe a method to derive representative structures while limiting the number of artifacts. Our approach is based on a Monte Carlo simulation technique that drives a starting structure (an extended or a 'close-by' structure) towards the 'averaged structure' using a harmonic pseudo energy function. To assess the performance of the algorithm, we applied our approach to Cα models of 1364 proteins generated by the TASSER structure prediction algorithm. The average RMSD of the refined model from the native structure for the set becomes worse by a mere 0.08 Å compared to the average RMSD of the averaged structures from the native structure (3.28 Å for refined structures and 3.36 Å for the averaged structures). However, the percentage of atoms involved in clashes is greatly reduced (from 63% to 1%); in fact, the majority of the refined proteins had zero clashes. Moreover, a small number (38) of refined structures resulted in lower RMSD to the native protein versus the averaged structure. Finally, compared to PULCHRA [1], our approach produces representative structures of similar RMSD quality, but with far fewer clashes. Conclusion The benchmarking results demonstrate that our approach for removing averaging artifacts can be very beneficial for the structural biology community. Furthermore, the same approach can be applied to almost any problem where averaging of 3D coordinates is performed. Namely, structure averaging is also commonly performed in RNA secondary structure prediction [2], which could also benefit from our approach. PMID:19267905
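The Monte Carlo idea in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the bond-length restraint, the force constants, the step size, and the function names pseudo_energy and refine are all assumptions chosen for illustration. The sketch pulls a Cα trace toward averaged coordinates with a harmonic term while a second harmonic term keeps consecutive Cα distances near a typical 3.8 Å.

```python
import math
import random

CA_BOND = 3.8  # typical consecutive C-alpha distance in Å (assumed restraint target)


def pseudo_energy(coords, target, k_pull=1.0, k_bond=10.0):
    """Harmonic pull toward the averaged structure plus a bond-length penalty.

    Both force constants are illustrative assumptions, not published values.
    """
    pull = sum(k_pull * math.dist(c, t) ** 2 for c, t in zip(coords, target))
    bond = sum(
        k_bond * (math.dist(a, b) - CA_BOND) ** 2
        for a, b in zip(coords, coords[1:])
    )
    return pull + bond


def refine(start, target, steps=20000, step_size=0.3, temperature=0.1, seed=0):
    """Metropolis Monte Carlo: move one atom at a time, accept or reject."""
    rng = random.Random(seed)
    coords = [list(p) for p in start]
    energy = pseudo_energy(coords, target)
    for _ in range(steps):
        i = rng.randrange(len(coords))
        old = coords[i][:]
        # Propose a small random displacement of atom i.
        coords[i] = [x + rng.uniform(-step_size, step_size) for x in old]
        new_energy = pseudo_energy(coords, target)
        delta = new_energy - energy
        # Metropolis criterion: always accept downhill, sometimes accept uphill.
        if delta <= 0 or rng.random() < math.exp(-delta / temperature):
            energy = new_energy
        else:
            coords[i] = old  # reject and restore the previous position
    return coords, energy
```

For example, starting from an extended chain with 3.8 Å spacing and a target whose averaged coordinates are compressed to 3.0 Å spacing, refine(start, target) settles on a compromise that stays close to the target without collapsing the bond lengths. The full energy is recomputed at every step for clarity; a practical version would update only the terms that involve the moved atom.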
Protein structure refinement using a quantum mechanics-based chemical shielding predictor.

PubMed

Bratholm, Lars A; Jensen, Jan H

2017-03-01

The accurate prediction of protein chemical shifts using a quantum mechanics (QM)-based method has been the subject of intense research for more than 20 years but so far empirical methods for chemical shift prediction have proven more accurate. In this paper we show that a QM-based predictor of a protein backbone and CB chemical shifts (ProCS15, PeerJ, 2016, 3, e1344) is of comparable accuracy to empirical chemical shift predictors after chemical shift-based structural refinement that removes small structural errors. We present a method by which quantum chemistry based predictions of isotropic chemical shielding values (ProCS15) can be used to refine protein structures using Markov Chain Monte Carlo (MCMC) simulations, relating the chemical shielding values to the experimental chemical shifts probabilistically. Two kinds of MCMC structural refinement simulations were performed using force field geometry optimized X-ray structures as starting points: simulated annealing of the starting structure and constant temperature MCMC simulation followed by simulated annealing of a representative ensemble structure. Annealing of the CHARMM structure changes the CA-RMSD by an average of 0.4 Å but lowers the chemical shift RMSD by 1.0 and 0.7 ppm for CA and N. Conformational averaging has a relatively small effect (0.1-0.2 ppm) on the overall agreement with carbon chemical shifts but lowers the error for nitrogen chemical shifts by 0.4 ppm. If an amino acid specific offset is included the ProCS15 predicted chemical shifts have RMSD values relative to experiments that are comparable to popular empirical chemical shift predictors. The annealed representative ensemble structures differ in CA-RMSD relative to the initial structures by an average of 2.0 Å, with >2.0 Å difference for six proteins. In four of the cases, the largest structural differences arise in structurally flexible regions of the protein as determined by NMR, and in the remaining two cases, the large structural change may be due to force field deficiencies. The overall accuracy of the empirical methods is slightly improved by annealing the CHARMM structure with ProCS15, which may suggest that the minor structural changes introduced by ProCS15-based annealing improve the accuracy of the protein structures. Having established that QM-based chemical shift prediction can deliver the same accuracy as empirical shift predictors we hope this can help increase the accuracy of related approaches such as QM/MM or linear scaling approaches or interpreting protein structural dynamics from QM-derived chemical shift.
GIRAF: a method for fast search and flexible alignment of ligand binding interfaces in proteins at atomic resolution

PubMed Central

Kinjo, Akira R.; Nakamura, Haruki

2012-01-01

Comparison and classification of protein structures are fundamental means to understand protein functions. Due to the computational difficulty and the ever-increasing amount of structural data, however, it is in general not feasible to perform exhaustive all-against-all structure comparisons necessary for comprehensive classifications. To efficiently handle such situations, we have previously proposed a method, now called GIRAF. We herein describe further improvements in the GIRAF protein structure search and alignment method. The GIRAF method achieves extremely efficient search of similar structures of ligand binding sites of proteins by exploiting database indexing of structural features of local coordinate frames. In addition, it produces refined atom-wise alignments by iterative applications of the Hungarian method to the bipartite graph defined for a pair of superimposed structures. By combining the refined alignments based on different local coordinate frames, it is made possible to align structures involving domain movements. We provide detailed accounts for the database design, the search and alignment algorithms as well as some benchmark results. PMID:27493524
Solution NMR structure of a designed metalloprotein and complementary molecular dynamics refinement.

PubMed

Calhoun, Jennifer R; Liu, Weixia; Spiegel, Katrin; Dal Peraro, Matteo; Klein, Michael L; Valentine, Kathleen G; Wand, A Joshua; DeGrado, William F

2008-02-01

We report the solution NMR structure of a designed dimetal-binding protein, di-Zn(II) DFsc, along with a secondary refinement step employing molecular dynamics techniques. Calculation of the initial NMR structural ensemble by standard methods led to distortions in the metal-ligand geometries at the active site. Unrestrained molecular dynamics using a nonbonded force field for the metal shell, followed by quantum mechanical/molecular mechanical dynamics of DFsc, were used to relax local frustrations at the dimetal site that were apparent in the initial NMR structure and provide a more realistic description of the structure. The MD model is consistent with NMR restraints, and in good agreement with the structural and functional properties expected for DF proteins. This work demonstrates that NMR structures of metalloproteins can be further refined using classical and first-principles molecular dynamics methods in the presence of explicit solvent to provide otherwise unavailable insight into the geometry of the metal center.
Conformational Sampling of a Biomolecular Rugged Energy Landscape.

PubMed

Rydzewski, Jakub; Jakubowski, Rafal; Nicosia, Giuseppe; Nowak, Wieslaw

2018-01-01

Protein structure refinement using conformational sampling has been important in protein studies to date. In this paper, we examined protein structure refinement by means of potential energy minimization using immune computing as a method of sampling conformations. The method was tested on the x-ray structure and 30 decoys of the mutant of [Leu]Enkephalin, a paradigmatic example of the biomolecular multiple-minima problem. In order to score the refined conformations, we used a standard potential energy function with the OPLSAA force field. The effectiveness of the search was assessed using a variety of methods. The robustness of sampling was checked by the energy yield function, which quantitatively measures the number of peptide decoys residing in an energetic funnel. Furthermore, the potential energy-dependent Pareto fronts were calculated to elucidate dissimilarities between peptide conformations and the native state as observed by x-ray crystallography. Our results showed that the probed potential energy landscape of [Leu]Enkephalin is self-similar on different metric scales and that the local potential energy minima of the peptide decoys are metastable, so they can be refined to conformations whose potential energy is decreased by approximately 250 kJ/mol.
Variability of Protein Structure Models from Electron Microscopy.

PubMed

Monroe, Lyman; Terashi, Genki; Kihara, Daisuke

2017-04-04

An increasing number of biomolecular structures are solved by electron microscopy (EM). However, the quality of structure models determined from EM maps varies substantially. To understand to what extent structure models are supported by information embedded in EM maps, we used two computational structure refinement methods to examine how much structures can be refined using a dataset of 49 maps with accompanying structure models. The extent of structure modification as well as the disagreement between refinement models produced by the two computational methods scaled inversely with the global and the local map resolutions. A general quantitative estimation of deviations of structures for particular map resolutions is provided. Our results indicate that the observed discrepancy between the deposited map and the refined models is due to the lack of structural information present in EM maps and thus these annotations must be used with caution for further applications. Copyright © 2017 Elsevier Ltd. All rights reserved.
Evaluation of unrestrained replica-exchange simulations using dynamic walkers in temperature space for protein structure refinement.

PubMed

Olson, Mark A; Lee, Michael S

2014-01-01

A central problem of computational structural biology is the refinement of modeled protein structures taken from either comparative modeling or knowledge-based methods. Simulations are commonly used to achieve higher resolution of the structures at the all-atom level, yet methodologies that consistently yield accurate results remain elusive. In this work, we provide an assessment of an adaptive temperature-based replica exchange simulation method where the temperature clients dynamically walk in temperature space to enrich their population and exchanges near steep energetic barriers. This approach is compared to earlier work of applying the conventional method of static temperature clients to refine a dataset of conformational decoys. Our results show that, while an adaptive method has many theoretical advantages over a static distribution of client temperatures, only limited improvement was gained from this strategy in excursions of the downhill refinement regime leading to an increase in the fraction of native contacts. To illustrate the sampling differences between the two simulation methods, energy landscapes are presented along with their temperature client profiles.
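For orientation, the conventional static-temperature scheme that the adaptive method is compared against rests on the standard Metropolis exchange test between neighbouring temperatures. The sketch below shows only that baseline test, not the dynamic-walker scheme evaluated in the paper; the function names, the kcal/mol energy units, and the adjacent-slot sweep order are assumptions made for illustration.

```python
import math
import random

K_B = 0.0019872041  # Boltzmann constant in kcal/(mol·K)


def swap_probability(energy_i, temp_i, energy_j, temp_j):
    """Metropolis acceptance for exchanging two replicas.

    p = min(1, exp[(beta_i - beta_j) * (E_i - E_j)]), with beta = 1 / (k_B * T).
    """
    beta_i = 1.0 / (K_B * temp_i)
    beta_j = 1.0 / (K_B * temp_j)
    exponent = (beta_i - beta_j) * (energy_i - energy_j)
    return 1.0 if exponent >= 0 else math.exp(exponent)


def exchange_sweep(energies, temperatures, rng):
    """Attempt swaps between neighbouring temperature slots.

    energies[r] is the potential energy of replica r; replica r starts in
    slot r. Returns a list where entry k is the replica now at temperatures[k].
    """
    order = list(range(len(energies)))
    for k in range(len(temperatures) - 1):
        i, j = order[k], order[k + 1]
        p = swap_probability(
            energies[i], temperatures[k], energies[j], temperatures[k + 1]
        )
        if rng.random() < p:
            order[k], order[k + 1] = j, i
    return order
```

With this criterion a swap is always accepted when the colder slot currently holds the higher-energy configuration, and it is accepted with a Boltzmann-weighted probability otherwise. An adaptive scheme would additionally move the temperature values themselves between sweeps, which this sketch does not attempt.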
AssignFit: a program for simultaneous assignment and structure refinement from solid-state NMR spectra

PubMed Central

Tian, Ye; Schwieters, Charles D.; Opella, Stanley J.; Marassi, Francesca M.

2011-01-01

AssignFit is a computer program developed within the XPLOR-NIH package for the assignment of dipolar coupling (DC) and chemical shift anisotropy (CSA) restraints derived from the solid-state NMR spectra of protein samples with uniaxial order. The method is based on minimizing the difference between experimentally observed solid-state NMR spectra and the frequencies back calculated from a structural model. Starting with a structural model and a set of DC and CSA restraints grouped only by amino acid type, as would be obtained by selective isotopic labeling, AssignFit generates all of the possible assignment permutations and calculates the corresponding atomic coordinates oriented in the alignment frame, together with the associated set of NMR frequencies, which are then compared with the experimental data for best fit. Incorporation of AssignFit in a simulated annealing refinement cycle provides an approach for simultaneous assignment and structure refinement (SASR) of proteins from solid-state NMR orientation restraints. The methods are demonstrated with data from two integral membrane proteins, one α-helical and one β-barrel, embedded in phospholipid bilayer membranes. PMID:22036904
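The exhaustive permutation search described in this abstract can be sketched for a single amino-acid type as follows. This is an illustrative toy, not the XPLOR-NIH code: the function name best_assignment, the unweighted sum-of-squares score, and the example numbers are assumptions, and the real program also recalculates coordinates in the alignment frame, which is omitted here.

```python
from itertools import permutations


def best_assignment(observed, predicted):
    """Assign unlabelled peaks of one amino-acid type to residues.

    observed:  list of (dc, csa) frequency pairs with unknown residue identity
    predicted: dict mapping residue number -> back-calculated (dc, csa) pair
    Returns (assignment, score), where assignment maps residue -> observed peak
    and score is the summed squared deviation (unweighted, an assumption).
    """
    residues = sorted(predicted)
    best, best_score = None, float("inf")
    # Try every way of distributing the observed peaks over the residues.
    for perm in permutations(observed, len(residues)):
        score = sum(
            (obs[0] - predicted[r][0]) ** 2 + (obs[1] - predicted[r][1]) ** 2
            for r, obs in zip(residues, perm)
        )
        if score < best_score:
            best, best_score = dict(zip(residues, perm)), score
    return best, best_score


if __name__ == "__main__":
    # Hypothetical back-calculated and observed values for three residues.
    predicted = {4: (5.0, 100.0), 9: (2.0, 150.0), 15: (8.0, 80.0)}
    observed = [(7.8, 82.0), (5.1, 99.0), (2.2, 149.0)]
    print(best_assignment(observed, predicted))
```

The number of permutations grows factorially with the number of residues of a given type, which is why grouping restraints by amino-acid type, as selective isotopic labeling allows, keeps the search tractable.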
Rosetta Structure Prediction as a Tool for Solving Difficult Molecular Replacement Problems.

PubMed

DiMaio, Frank

2017-01-01

Molecular replacement (MR), a method for solving the crystallographic phase problem using phases derived from a model of the target structure, has proven extremely valuable, accounting for the vast majority of structures solved by X-ray crystallography. However, when the resolution of data is low, or the starting model is very dissimilar to the target protein, solving structures via molecular replacement may be very challenging. In recent years, protein structure prediction methodology has emerged as a powerful tool in model building and model refinement for difficult molecular replacement problems. This chapter describes some of the tools available in Rosetta for model building and model refinement specifically geared toward difficult molecular replacement cases.
Homology Modeling of Dopamine D2 and D3 Receptors: Molecular Dynamics Refinement and Docking Evaluation

PubMed Central

Platania, Chiara Bianca Maria; Salomone, Salvatore; Leggio, Gian Marco; Drago, Filippo; Bucolo, Claudio

2012-01-01

Dopamine (DA) receptors, a class of G-protein coupled receptors (GPCRs), have been targeted for drug development for the treatment of neurological, psychiatric and ocular disorders. The lack of structural information about GPCRs and their ligand complexes has prompted the development of homology models of these proteins aimed at structure-based drug design. The crystal structure of the human dopamine D3 (hD3) receptor has recently been solved. Based on the hD3 receptor crystal structure, we generated dopamine D2 and D3 receptor models and refined them with a molecular dynamics (MD) protocol. Refined structures, obtained from the MD simulations in a membrane environment, were subsequently used in molecular docking studies in order to investigate potential sites of interaction. The structures of the hD3 and hD2L receptors were differentiated by means of MD simulations, and D3 selective ligands were discriminated, in terms of binding energy, by docking calculations. Robust correlation of computed and experimental Ki was obtained for hD3 and hD2L receptor ligands. In conclusion, the present computational approach seems suitable to build and refine structure models of homologous dopamine receptors that may be of value for structure-based drug discovery of selective dopaminergic ligands. PMID:22970199
Refined views of multi-protein complexes in the erythrocyte membrane

PubMed Central

Mankelow, TJ; Satchwell, TJ; Burton, NM

2015-01-01

The erythrocyte membrane has been extensively studied, both as a model membrane system and to investigate its role in gas exchange and transport. Much is now known about the protein components of the membrane, how they are organised into large multi-protein complexes and how they interact with each other within these complexes. Many links between the membrane and the cytoskeleton have also been delineated and have been demonstrated to be crucial for maintaining the deformability and integrity of the erythrocyte. In this study we have refined previous, highly speculative molecular models of these complexes by including the available data pertaining to known protein-protein interactions. While the refined models remain highly speculative, they provide an evolving framework for visualisation of these important cellular structures at the atomic level. PMID:22465511
A CPU benchmark for protein crystallographic refinement.

PubMed

Bourne, P E; Hendrickson, W A

1990-01-01

The CPU time required to complete a cycle of restrained least-squares refinement of a protein structure from X-ray crystallographic data using the FORTRAN codes PROTIN and PROLSQ is reported for 48 different processors, ranging from single-user workstations to supercomputers. Sequential, vector, VLIW, multiprocessor, and RISC hardware architectures are compared using both a small and a large protein structure. Representative compile times for each hardware type are also given, and the improvement in run-time when coding for a specific hardware architecture is considered. The benchmarks involve scalar integer and vector floating point arithmetic and are representative of the calculations performed in many scientific disciplines.
In situ data collection and structure refinement from microcapillary protein crystallization

PubMed Central

Yadav, Maneesh K.; Gerdts, Cory J.; Sanishvili, Ruslan; Smith, Ward W.; Roach, L. Spencer; Ismagilov, Rustem F.; Kuhn, Peter; Stevens, Raymond C.

2007-01-01

In situ X-ray data collection has the potential to eliminate the challenging task of mounting and cryocooling often fragile protein crystals, reducing a major bottleneck in the structure determination process. An apparatus used to grow protein crystals in capillaries and to compare the background X-ray scattering of the components, including thin-walled glass capillaries against Teflon, and various fluorocarbon oils against each other, is described. Using thaumatin as a test case at 1.8 Å resolution, this study demonstrates that high-resolution electron density maps and refined models can be obtained from in situ diffraction of crystals grown in microcapillaries. PMID:17468785
Ab initio structure determination and refinement of a scorpion protein toxin.

PubMed

Smith, G D; Blessing, R H; Ealick, S E; Fontecilla-Camps, J C; Hauptman, H A; Housset, D; Langs, D A; Miller, R

1997-09-01

The structure of toxin II from the scorpion Androctonus australis Hector has been determined ab initio by direct methods using SnB at 0.96 Å resolution. For the purpose of this structure redetermination, undertaken as a test of the minimal function and the SnB program, the identity and sequence of the protein were withheld from part of the research team. A single solution obtained from 1619 random atom trials was clearly revealed by the bimodal distribution of the final value of the minimal function associated with each individual trial. Five peptide fragments were identified from a conservative analysis of the initial E-map, and following several refinement cycles with X-PLOR, a model was built of the complete structure. At the end of the X-PLOR refinement, the sequence was compared with the published sequence and 57 of the 64 residues had been correctly identified. Two errors in sequence resulted from side chains with similar size while the rest of the errors were a result of severe disorder or high thermal motion in the side chains. Given the amino-acid sequence, it is estimated that the initial E-map could have produced a model containing 99% of all main-chain and 81% of side-chain atoms. The structure refinement was completed with PROFFT, including the contributions of protein H atoms, and converged at a residual of 0.158 for 30,609 data with F ≥ 2σ(F) in the resolution range 8.0-0.964 Å. The final model consisted of 518 non-H protein atoms (36 disordered), 407 H atoms, and 129 water molecules (43 with occupancies less than unity). This total of 647 non-H atoms represents the largest light-atom structure solved to date.
Protein structure refinement using a quantum mechanics-based chemical shielding predictor (DOI: 10.1039/c6sc04344e)

PubMed Central

2017-01-01

The accurate prediction of protein chemical shifts using a quantum mechanics (QM)-based method has been the subject of intense research for more than 20 years but so far empirical methods for chemical shift prediction have proven more accurate. In this paper we show that a QM-based predictor of a protein backbone and CB chemical shifts (ProCS15, PeerJ, 2016, 3, e1344) is of comparable accuracy to empirical chemical shift predictors after chemical shift-based structural refinement that removes small structural errors. We present a method by which quantum chemistry based predictions of isotropic chemical shielding values (ProCS15) can be used to refine protein structures using Markov Chain Monte Carlo (MCMC) simulations, relating the chemical shielding values to the experimental chemical shifts probabilistically. Two kinds of MCMC structural refinement simulations were performed using force field geometry optimized X-ray structures as starting points: simulated annealing of the starting structure and constant temperature MCMC simulation followed by simulated annealing of a representative ensemble structure. Annealing of the CHARMM structure changes the CA-RMSD by an average of 0.4 Å but lowers the chemical shift RMSD by 1.0 and 0.7 ppm for CA and N. Conformational averaging has a relatively small effect (0.1–0.2 ppm) on the overall agreement with carbon chemical shifts but lowers the error for nitrogen chemical shifts by 0.4 ppm. If an amino acid specific offset is included the ProCS15 predicted chemical shifts have RMSD values relative to experiments that are comparable to popular empirical chemical shift predictors. The annealed representative ensemble structures differ in CA-RMSD relative to the initial structures by an average of 2.0 Å, with >2.0 Å difference for six proteins. In four of the cases, the largest structural differences arise in structurally flexible regions of the protein as determined by NMR, and in the remaining two cases, the large structural change may be due to force field deficiencies. The overall accuracy of the empirical methods is slightly improved by annealing the CHARMM structure with ProCS15, which may suggest that the minor structural changes introduced by ProCS15-based annealing improve the accuracy of the protein structures. Having established that QM-based chemical shift prediction can deliver the same accuracy as empirical shift predictors we hope this can help increase the accuracy of related approaches such as QM/MM or linear scaling approaches or interpreting protein structural dynamics from QM-derived chemical shift. PMID:28451325
Bayesian refinement of protein structures and ensembles against SAXS data using molecular dynamics

PubMed Central

Shevchuk, Roman; Hub, Jochen S.

2017-01-01

Small-angle X-ray scattering (SAXS) is an increasingly popular technique used to detect protein structures and ensembles in solution. However, the refinement of structures and ensembles against SAXS data is often ambiguous due to the low information content of SAXS data, unknown systematic errors, and unknown scattering contributions from the solvent. We offer a solution to such problems by combining Bayesian inference with all-atom molecular dynamics simulations and explicit-solvent SAXS calculations. The Bayesian formulation correctly weights the SAXS data versus prior physical knowledge, it quantifies the precision or ambiguity of fitted structures and ensembles, and it accounts for unknown systematic errors due to poor buffer matching. The method further provides a probabilistic criterion for identifying the number of states required to explain the SAXS data. The method is validated by refining ensembles of a periplasmic binding protein against calculated SAXS curves. Subsequently, we derive the solution ensembles of the eukaryotic chaperone heat shock protein 90 (Hsp90) against experimental SAXS data. We find that the SAXS data of the apo state of Hsp90 is compatible with a single wide-open conformation, whereas the SAXS data of Hsp90 bound to ATP or to an ATP-analogue strongly suggest heterogeneous ensembles of a closed and a wide-open state. PMID:29045407
Mimicking the action of folding chaperones by Hamiltonian replica-exchange molecular dynamics simulations: application in the refinement of de novo models.

PubMed

Fan, Hao; Periole, Xavier; Mark, Alan E

2012-07-01

The efficiency of using a variant of Hamiltonian replica-exchange molecular dynamics (Chaperone H-replica-exchange molecular dynamics [CH-REMD]) for the refinement of protein structural models generated de novo is investigated. In CH-REMD, the interaction between the protein and its environment, specifically, the electrostatic interaction between the protein and the solvating water, is varied, leading to cycles of partial unfolding and refolding mimicking some aspects of folding chaperones. In 10 of the 15 cases examined, the CH-REMD approach sampled structures in which the root-mean-square deviation (RMSD) of secondary structure elements (SSE-RMSD) with respect to the experimental structure was more than 1.0 Å lower than that of the initial de novo model. In 14 of the 15 cases, the improvement was more than 0.5 Å. The ability of three different statistical potentials to identify near-native conformations was also examined. Little correlation between the SSE-RMSD of the sampled structures with respect to the experimental structure and any of the scoring functions tested was found. The most effective scoring function tested was the DFIRE potential. Using the DFIRE potential, the SSE-RMSD of the best scoring structures was on average 0.3 Å lower than that of the initial model. Overall the work demonstrates that targeted enhanced-sampling techniques such as CH-REMD can lead to the systematic refinement of protein structural models generated de novo but that improved potentials for the identification of near-native structures are still needed. Copyright © 2012 Wiley Periodicals, Inc.
Interplay of I-TASSER and QUARK for template-based and ab initio protein structure prediction in CASP10

PubMed Central

Zhang, Yang

2014-01-01

We develop and test a new pipeline in CASP10 to predict protein structures based on an interplay of I-TASSER and QUARK for both free-modeling (FM) and template-based modeling (TBM) targets. The most noteworthy observation is that sorting through the threading template pool using the QUARK-based ab initio models as probes allows the detection of distant-homology templates which might be ignored by the traditional sequence profile-based threading alignment algorithms. Further template assembly refinement by I-TASSER resulted in successful folding of two medium-sized FM targets with >150 residues. For TBM, the multiple threading alignments from LOMETS are, for the first time, incorporated into the ab initio QUARK simulations, which were further refined by I-TASSER assembly refinement. Compared with the traditional threading assembly refinement procedures, the inclusion of the threading-constrained ab initio folding models can consistently improve the quality of the full-length models as assessed by the GDT-HA and hydrogen-bonding scores. Despite the success, significant challenges still exist in domain boundary prediction and consistent folding of medium-size proteins (especially beta-proteins) for nonhomologous targets. Further developments of sensitive fold-recognition and ab initio folding methods are critical for solving these problems. PMID:23760925
Interplay of I-TASSER and QUARK for template-based and ab initio protein structure prediction in CASP10.

PubMed

Zhang, Yang

2014-02-01

We develop and test a new pipeline in CASP10 to predict protein structures based on an interplay of I-TASSER and QUARK for both free-modeling (FM) and template-based modeling (TBM) targets. The most noteworthy observation is that sorting through the threading template pool using the QUARK-based ab initio models as probes allows the detection of distant-homology templates which might be ignored by the traditional sequence profile-based threading alignment algorithms. Further template assembly refinement by I-TASSER resulted in successful folding of two medium-sized FM targets with >150 residues. For TBM, the multiple threading alignments from LOMETS are, for the first time, incorporated into the ab initio QUARK simulations, which were further refined by I-TASSER assembly refinement. Compared with the traditional threading assembly refinement procedures, the inclusion of the threading-constrained ab initio folding models can consistently improve the quality of the full-length models as assessed by the GDT-HA and hydrogen-bonding scores. Despite the success, significant challenges still exist in domain boundary prediction and consistent folding of medium-size proteins (especially beta-proteins) for nonhomologous targets. Further developments of sensitive fold-recognition and ab initio folding methods are critical for solving these problems. Copyright © 2013 Wiley Periodicals, Inc.

Designing and benchmarking the MULTICOM protein structure prediction system

PubMed Central

2013-01-01

Background Predicting protein structure from sequence is one of the most significant and challenging problems in bioinformatics. Numerous bioinformatics techniques and tools have been developed to tackle almost every aspect of protein structure prediction ranging from structural feature prediction, template identification and query-template alignment to structure sampling, model quality assessment, and model refinement. How to synergistically select, integrate and improve the strengths of the complementary techniques at each prediction stage and build a high-performance system is becoming a critical issue for constructing a successful, competitive protein structure predictor. Results Over the past several years, we have constructed a standalone protein structure prediction system MULTICOM that combines multiple sources of information and complementary methods at all five stages of the protein structure prediction process including template identification, template combination, model generation, model assessment, and model refinement. The system was blindly tested during the ninth Critical Assessment of Techniques for Protein Structure Prediction (CASP9) in 2010 and yielded very good performance. In addition to studying the overall performance on the CASP9 benchmark, we thoroughly investigated the performance and contributions of each component at each stage of prediction. Conclusions Our comprehensive and comparative study not only provides useful and practical insights about how to select, improve, and integrate complementary methods to build a cutting-edge protein structure prediction system but also identifies a few new sources of information that may help improve the design of a protein structure prediction system. Several components used in the MULTICOM system are available at: http://sysbio.rnet.missouri.edu/multicom_toolbox/. PMID:23442819
Structure of Escherichia coli RutC, a member of the YjgF family and putative aminoacrylate peracid reductase of the rut operon

DOE Office of Scientific and Technical Information (OSTI.GOV)

Knapik, Aleksandra Alicja; Petkowski, Janusz Jurand; Otwinowski, Zbyszek

2014-10-02

RutC is the third enzyme in the Escherichia coli rut pathway of uracil degradation. RutC belongs to the highly conserved YjgF family of proteins. The structure of the RutC protein was determined and refined to 1.95 Å resolution. The crystal belonged to space group P2₁2₁2 and contained six molecules in the asymmetric unit. The structure was solved by SAD phasing and was refined to an Rwork of 19.3% (Rfree = 21.7%). Moreover, the final model revealed that this protein has a Bacillus chorismate mutase-like fold and forms a homotrimer with a hydrophobic cavity in the center of the structure and ligand-binding clefts between two subunits. A likely function for RutC is the reduction of peroxy-aminoacrylate to aminoacrylate as a part of a detoxification process.
Cry1A(b)16 toxin from Bacillus thuringiensis: Theoretical refinement of three-dimensional structure and prediction of peptides as molecular markers for detection of genetically modified organisms.

PubMed

Plácido, Alexandra; Coelho, Andreia; Abreu Nascimento, Lucas; Gomes Vasconcelos, Andreanne; Fátima Barroso, Maria; Ramos-Jesus, Joilson; Costa, Vladimir; das Chagas Alves Lima, Francisco; Delerue-Matos, Cristina; Martins Ramos, Ricardo; Marani, Mariela M; Roberto de Souza de Almeida Leite, José

2017-07-01

Transgenic maize produced by the insertion of the Cry transgene into its genome became the second most cultivated crop worldwide. The Cry gene from Bacillus thuringiensis kurstaki expresses protein derivatives of crystalline endotoxins which confer insect resistance onto the maize crop. Mandatory labeling of processed food containing or made by genetically modified organisms is in force in many countries, so there is an urgent need to develop fast and practical methods for GMO identification, for example, biosensors. In the absence of an available empirical structure of the Cry1A(b)16 protein, a theoretical model was effectively generated in this work by homology modeling and molecular dynamics simulations based on two available homologous protein structures. Molecular dynamics simulations were carried out to refine the selected model, and an analysis of its global structure was performed. The refined models of Cry1A(b)16 showed a standard fold and structural characteristics similar to those seen in Bacillus thuringiensis Cry1A(a) insecticidal toxin and Bacillus thuringiensis serovar kurstaki Cry1A(c) toxin. After in silico analysis of Cry1A(b)16, two immunoreactive candidate peptides were selected and specific polyclonal antibodies were produced resulting in antibody-peptide interaction. Biosensing devices are expected to be developed for detection of the Cry1A(b) protein as a marker of transgenic maize in food. Proteins 2017; 85:1248-1257. © 2017 Wiley Periodicals, Inc.
Template-based structure modeling of protein-protein interactions

PubMed Central

Szilagyi, Andras; Zhang, Yang

2014-01-01

The structure of protein-protein complexes can be constructed by using the known structure of other protein complexes as a template. The complex structure templates are generally detected either by homology-based sequence alignments or, given the structure of monomer components, by structure-based comparisons. Critical improvements have been made in recent years by utilizing interface recognition and by recombining monomer and complex template libraries. Encouraging progress has also been witnessed in genome-wide applications of template-based modeling, with modeling accuracy comparable to high-throughput experimental data. Nevertheless, bottlenecks exist due to the incompleteness of the protein-protein complex structure library and the lack of methods for distant homologous template identification and full-length complex structure refinement. PMID:24721449
Structural Refinement of Proteins by Restrained Molecular Dynamics Simulations with Non-interacting Molecular Fragments.

PubMed

Shen, Rong; Han, Wei; Fiorin, Giacomo; Islam, Shahidul M; Schulten, Klaus; Roux, Benoît

2015-10-01

The knowledge of multiple conformational states is a prerequisite to understand the function of membrane transport proteins. Unfortunately, the determination of detailed atomic structures for all these functionally important conformational states with conventional high-resolution approaches is often difficult and unsuccessful. In some cases, biophysical and biochemical approaches can provide important complementary structural information that can be exploited with the help of advanced computational methods to derive structural models of specific conformational states. In particular, functional and spectroscopic measurements in combination with site-directed mutations constitute one important source of information to obtain these mixed-resolution structural models. A very common problem with this strategy, however, is the difficulty to simultaneously integrate all the information from multiple independent experiments involving different mutations or chemical labels to derive a unique structural model consistent with the data. To resolve this issue, a novel restrained molecular dynamics structural refinement method is developed to simultaneously incorporate multiple experimentally determined constraints (e.g., engineered metal bridges or spin-labels), each treated as an individual molecular fragment with all atomic details. The internal structure of each of the molecular fragments is treated realistically, while there is no interaction between different molecular fragments to avoid unphysical steric clashes. The information from all the molecular fragments is exploited simultaneously to constrain the backbone to refine a three-dimensional model of the conformational state of the protein. The method is illustrated by refining the structure of the voltage-sensing domain (VSD) of the Kv1.2 potassium channel in the resting state and by exploring the distance histograms between spin-labels attached to T4 lysozyme. 
The resulting VSD structures are in good agreement with the consensus model of the resting state VSD and the spin-spin distance histograms from ESR/DEER experiments on T4 lysozyme are accurately reproduced.
Structural studies of human glioma pathogenesis-related protein 1

DOE Office of Scientific and Technical Information (OSTI.GOV)

Asojo, Oluwatoyin A.; Koski, Raymond A.; Bonafé, Nathalie

2011-10-01

Structural analysis of a truncated soluble domain of human glioma pathogenesis-related protein 1, a membrane protein implicated in the proliferation of aggressive brain cancer, is presented. Human glioma pathogenesis-related protein 1 (GLIPR1) is a membrane protein that is highly upregulated in brain cancers but is barely detectable in normal brain tissue. GLIPR1 is composed of a signal peptide that directs its secretion, a conserved cysteine-rich CAP (cysteine-rich secretory proteins, antigen 5 and pathogenesis-related 1 proteins) domain and a transmembrane domain. GLIPR1 is currently being investigated as a candidate for prostate cancer gene therapy and for glioblastoma targeted therapy. Crystal structures of a truncated soluble domain of the human GLIPR1 protein (sGLIPR1) solved by molecular replacement using a truncated polyalanine search model of the CAP domain of stecrisp, a snake-venom cysteine-rich secretory protein (CRISP), are presented. The correct molecular-replacement solution could only be obtained by removing all loops from the search model. The native structure was refined to 1.85 Å resolution and that of a Zn²⁺ complex was refined to 2.2 Å resolution. The latter structure revealed that the putative binding cavity coordinates Zn²⁺ similarly to snake-venom CRISPs, which are involved in Zn²⁺-dependent mechanisms of inflammatory modulation. Both sGLIPR1 structures have extensive flexible loop/turn regions and unique charge distributions that were not observed in any of the previously reported CAP protein structures. A model is also proposed for the structure of full-length membrane-bound GLIPR1.
Overview of refinement procedures within REFMAC5: utilizing data from different sources.

PubMed

Kovalevskiy, Oleg; Nicholls, Robert A; Long, Fei; Carlon, Azzurra; Murshudov, Garib N

2018-03-01

Refinement is a process that involves bringing into agreement the structural model, available prior knowledge and experimental data. To achieve this, the refinement procedure optimizes a posterior conditional probability distribution of model parameters, including atomic coordinates, atomic displacement parameters (B factors), scale factors, parameters of the solvent model and twin fractions in the case of twinned crystals, given observed data such as observed amplitudes or intensities of structure factors. A library of chemical restraints is typically used to ensure consistency between the model and the prior knowledge of stereochemistry. If the observation-to-parameter ratio is small, for example when diffraction data only extend to low resolution, the Bayesian framework implemented in REFMAC5 uses external restraints to inject additional information extracted from structures of homologous proteins, prior knowledge about secondary-structure formation and even data obtained using different experimental methods, for example NMR. The refinement procedure also generates the `best' weighted electron-density maps, which are useful for further model (re)building. Here, the refinement of macromolecular structures using REFMAC5 and related tools distributed as part of the CCP4 suite is discussed.
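In generic Bayesian notation, the posterior optimisation described in this abstract can be sketched as follows. The symbols are illustrative assumptions rather than REFMAC5's own notation: x stands for the vector of model parameters (coordinates, B factors, scale factors and so on), F_obs for the observed amplitudes or intensities, and the prior P(x) for the chemical restraint library together with any external restraints.

```latex
\begin{aligned}
P(x \mid F_{\mathrm{obs}}) &\propto P(F_{\mathrm{obs}} \mid x)\, P(x), \\
\hat{x} &= \arg\min_{x} \bigl[ -\log P(F_{\mathrm{obs}} \mid x) - \log P(x) \bigr].
\end{aligned}
```

Read this way, the first term measures agreement with the experimental data and the second penalises departures from prior knowledge, which is why extra restraints matter most when the observation-to-parameter ratio is small.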
SOV_refine: A further refined definition of segment overlap score and its significance for protein structure similarity.

PubMed

Liu, Tong; Wang, Zheng

2018-01-01

The segment overlap score (SOV) has been used to evaluate the predicted protein secondary structures, a sequence composed of helix (H), strand (E), and coil (C), by comparing it with the native or reference secondary structures, another sequence of H, E, and C. SOV's advantage is that it can consider the size of continuous overlapping segments and assign extra allowance to longer continuous overlapping segments instead of only judging from the percentage of overlapping individual positions as Q3 score does. However, we have found a drawback from its previous definition, that is, it cannot ensure increasing allowance assignment when more residues in a segment are further predicted accurately. A new way of assigning allowance has been designed, which keeps all the advantages of the previous SOV score definitions and ensures that the amount of allowance assigned is incremental when more elements in a segment are predicted accurately. Furthermore, our improved SOV has achieved a higher correlation with the quality of protein models measured by GDT-TS score and TM-score, indicating its better abilities to evaluate tertiary structure quality at the secondary structure level. We analyzed the statistical significance of SOV scores and found the threshold values for distinguishing two protein structures (SOV_refine > 0.19) and indicating whether two proteins are under the same CATH fold (SOV_refine > 0.94 and > 0.90 for three- and eight-state secondary structures respectively). We provided another two example applications, which are when used as a machine learning feature for protein model quality assessment and comparing different definitions of topologically associating domains. We proved that our newly defined SOV score resulted in better performance. The SOV score can be widely used in bioinformatics research and other fields that need to compare two sequences of letters in which continuous segments have important meanings. 
We also generalized the previous SOV definitions so that it can work for sequences composed of more than three states (e.g., it can work for the eight-state definition of protein secondary structures). A standalone software package has been implemented in Perl with source code released. The software can be downloaded from http://dna.cs.miami.edu/SOV/.
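The contrast drawn above between per-position scoring (Q3) and segment-based scoring can be illustrated with a minimal Python sketch. It does not implement SOV or SOV_refine, whose allowance rules are not given in the abstract; it only computes Q3 and lists overlapping same-state segments. The function names and the example strings are hypothetical.

```python
from itertools import groupby


def q3(pred: str, ref: str) -> float:
    """Fraction of positions whose predicted state matches the reference."""
    if len(pred) != len(ref) or not ref:
        raise ValueError("sequences must be non-empty and of equal length")
    return sum(p == r for p, r in zip(pred, ref)) / len(ref)


def segments(ss: str):
    """Split a secondary-structure string into (state, start, end) runs; end is exclusive."""
    out, pos = [], 0
    for state, run in groupby(ss):
        n = len(list(run))
        out.append((state, pos, pos + n))
        pos += n
    return out


def overlapping_pairs(pred: str, ref: str):
    """Pairs of same-state (reference, predicted) segments sharing at least one position."""
    pairs = []
    for s1, a1, b1 in segments(ref):
        for s2, a2, b2 in segments(pred):
            if s1 == s2 and max(a1, a2) < min(b1, b2):
                pairs.append(((s1, a1, b1), (s2, a2, b2)))
    return pairs


# Hypothetical example: both predictions contain exactly one wrong position.
ref = "CCHHHHHHCCEEEECC"
pred_a = "CCHHHHHCCCEEEECC"  # helix shortened by one residue at its end
pred_b = "CCHHHCHHCCEEEECC"  # helix broken into two pieces

for name, pred in (("pred_a", pred_a), ("pred_b", pred_b)):
    helix_pairs = [pr for pr in overlapping_pairs(pred, ref) if pr[0][0] == "H"]
    print(name, q3(pred, ref), len(helix_pairs))
```

Both predictions receive the same Q3, yet the reference helix is covered by one predicted segment in the first case and by two fragments in the second. A segment-based score is meant to separate exactly these two situations.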
Construction of a 3D model of nattokinase, a novel fibrinolytic enzyme from Bacillus natto. A novel nucleophilic catalytic mechanism for nattokinase.

PubMed

Zheng, Zhong-liang; Zuo, Zhen-yu; Liu, Zhi-gang; Tsai, Keng-chang; Liu, Ai-fu; Zou, Guo-lin

2005-01-01

A three-dimensional structural model of nattokinase (NK) from Bacillus natto was constructed by homology modeling. High-resolution X-ray structures of Subtilisin BPN' (SB), Subtilisin Carlsberg (SC), Subtilisin E (SE) and Subtilisin Savinase (SS), four proteins with sequential, structural and functional homology, were used as templates. Initial models of NK were built by MODELLER and analyzed by the PROCHECK programs. The best quality model was chosen for further refinement by constrained molecular dynamics simulations. The overall quality of the refined model was evaluated. The refined model NKC1 was analyzed by different protein analysis programs including PROCHECK for the evaluation of Ramachandran plot quality, PROSA for testing interaction energies and WHATIF for the calculation of packing quality. This structure was found to be satisfactory and also stable at room temperature, as demonstrated by a 300 ps unconstrained molecular dynamics (MD) simulation. Further docking analysis suggested a new nucleophilic catalytic mechanism for NK, induced by attack of the hydroxyl groups abundant in the catalytic environment and by the positioning of S221.
Applying an Empirical Hydropathic Forcefield in Refinement May Improve Low-Resolution Protein X-Ray Crystal Structures

PubMed Central

Koparde, Vishal N.; Scarsdale, J. Neel; Kellogg, Glen E.

2011-01-01

Background: The quality of X-ray crystallographic models for biomacromolecules refined from data obtained at high-resolution is assured by the data itself. However, at low-resolution, >3.0 Å, additional information is supplied by a forcefield coupled with an associated refinement protocol. These resulting structures are often of lower quality and thus unsuitable for downstream activities like structure-based drug discovery. Methodology: An X-ray crystallography refinement protocol that enhances standard methodology by incorporating energy terms from the HINT (Hydropathic INTeractions) empirical forcefield is described. This protocol was tested by refining synthetic low-resolution structural data derived from 25 diverse high-resolution structures, and referencing the resulting models to these structures. The models were also evaluated with global structural quality metrics, e.g., Ramachandran score and MolProbity clashscore. Three additional structures, for which only low-resolution data are available, were also re-refined with this methodology. Results: The enhanced refinement protocol is most beneficial for reflection data at resolutions of 3.0 Å or worse. At the low-resolution limit, ≥4.0 Å, the new protocol generated models with Cα positions that have RMSDs that are 0.18 Å more similar to the reference high-resolution structure, Ramachandran scores improved by 13%, and clashscores improved by 51%, all in comparison to models generated with the standard refinement protocol. The hydropathic forcefield terms are at least as effective as Coulombic electrostatic terms in maintaining polar interaction networks, and significantly more effective in maintaining hydrophobic networks, as synthetic resolution is decremented. Even at resolutions ≥4.0 Å, these latter networks are generally native-like, as measured with a hydropathic interactions scoring tool. PMID:21246043
Distance matrix-based approach to protein structure prediction.

PubMed

Kloczkowski, Andrzej; Jernigan, Robert L; Wu, Zhijun; Song, Guang; Yang, Lei; Kolinski, Andrzej; Pokarowski, Piotr

2009-03-01

Much structural information is encoded in the internal distances; a distance matrix-based approach can be used to predict protein structure and dynamics, and for structural refinement. Our approach is based on the square distance matrix $D = [r_{ij}^2]$ containing all square distances between residues in proteins. This distance matrix contains more information than the contact matrix C, which has elements of either 0 or 1 depending on whether the distance $r_{ij}$ is greater or less than a cutoff value $r_{\mathrm{cutoff}}$. We have performed spectral decomposition of the distance matrices, $D = \sum_k \lambda_k v_k v_k^T$, in terms of eigenvalues $\lambda_k$ and the corresponding eigenvectors $v_k$, and found that it contains at most five nonzero terms. A dominant eigenvector is proportional to $r^2$, the square distance of points from the center of mass, with the next three being the principal components of the system of points. By predicting $r^2$ from the sequence we can approximate a distance matrix of a protein with an expected RMSD value of about 7.3 Å, and by combining it with the prediction of the first principal component we can improve this approximation to 4.0 Å. We can also explain the role of hydrophobic interactions for the protein structure, because r is highly correlated with the hydrophobic profile of the sequence. Moreover, r is highly correlated with several sequence profiles which are useful in protein structure prediction, such as contact number, the residue-wise contact order (RWCO) or mean square fluctuations (i.e. crystallographic temperature factors). We have also shown that the next three components are related to spatial directionality of the secondary structure elements, and they may be also predicted from the sequence, improving overall structure prediction. 
We have also shown that the large number of available HIV-1 protease structures provides a remarkable sampling of conformations, which can be viewed as direct structural information about the dynamics. After structure matching, we apply principal component analysis (PCA) to obtain the important apparent motions for both bound and unbound structures. There are significant similarities between the first few key motions and the first few low-frequency normal modes calculated from a static representative structure with an elastic network model (ENM) that is based on the contact matrix C (related to D), strongly suggesting that the variations among the observed structures and the corresponding conformational changes are facilitated by the low-frequency, global motions intrinsic to the structure. Similarities are also found when the approach is applied to an NMR ensemble, as well as to atomic molecular dynamics (MD) trajectories. Thus, a sufficiently large number of experimental structures can directly provide important information about protein dynamics, but ENM can also provide a similar sampling of conformations. Finally, we use distance constraints from databases of known protein structures for structure refinement. We use the distributions of distances of various types in known protein structures to obtain the most probable ranges or the mean-force potentials for the distances. We then impose these constraints on structures to be refined or include the mean-force potentials directly in the energy minimization so that more plausible structural models can be built. This approach has been successfully used by us in 2006 in the CASPR structure refinement (http://predictioncenter.org/caspR).
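The statement that the spectral decomposition of the square distance matrix has at most five nonzero terms can be checked numerically. The sketch below is an illustration under stated assumptions, not the authors' code: it uses random integer points in three dimensions as a stand-in for residue positions and computes the matrix rank exactly with rational arithmetic. The bound follows because each entry expands as |x_i|² + |x_j|² − 2 x_i·x_j, a sum of two rank-one matrices and one matrix of rank at most three. All function names are hypothetical.

```python
import random
from fractions import Fraction


def squared_distance_matrix(points):
    """D[i][j] = squared Euclidean distance between points i and j."""
    return [[sum((a - b) ** 2 for a, b in zip(p, q)) for q in points] for p in points]


def rank(matrix):
    """Matrix rank by exact Gaussian elimination over the rationals."""
    m = [[Fraction(v) for v in row] for row in matrix]
    n_rows, n_cols = len(m), len(m[0])
    r = 0  # number of pivots found so far
    for col in range(n_cols):
        pivot = next((i for i in range(r, n_rows) if m[i][col] != 0), None)
        if pivot is None:
            continue  # no pivot in this column
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, n_rows):
            if m[i][col] != 0:
                factor = m[i][col] / m[r][col]
                m[i] = [x - factor * y for x, y in zip(m[i], m[r])]
        r += 1
        if r == n_rows:
            break
    return r


# Hypothetical data: 12 random points standing in for residue coordinates.
random.seed(0)
points = [tuple(random.randint(-50, 50) for _ in range(3)) for _ in range(12)]
D = squared_distance_matrix(points)
print("rank of the 12 x 12 squared distance matrix:", rank(D))
```

The printed rank never exceeds five, however many points are used, which is the same bound the abstract reports for the number of nonzero terms.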
Underestimated Halogen Bonds Forming with Protein Backbone in Protein Data Bank.

PubMed

Zhang, Qian; Xu, Zhijian; Shi, Jiye; Zhu, Weiliang

2017-07-24

Halogen bonds (XBs) are attracting increasing attention in biological systems. Protein Data Bank (PDB) archives experimentally determined XBs in biological macromolecules. However, no software for structure refinement in X-ray crystallography takes into account XBs, which might result in the weakening or even vanishing of experimentally determined XBs in PDB. In our previous study, we showed that side-chain XBs forming with protein side chains are underestimated in PDB on the basis of the phenomenon that the proportion of side-chain XBs to overall XBs decreases as structural resolution becomes lower and lower. However, whether the dominant backbone XBs forming with protein backbone are overlooked is still a mystery. Here, with the help of the ratio (R_F) of the observed XBs' frequency of occurrence to their frequency expected at random, we demonstrated that backbone XBs are largely overlooked in PDB, too. Furthermore, three cases were discovered possessing backbone XBs in high resolution structures while losing the XBs in low resolution structures. In the last two cases, even at 1.80 Å resolution, the backbone XBs were lost, manifesting the urgent need to consider XBs in the refinement process during X-ray crystallography study.
Reduced Fragment Diversity for Alpha and Alpha-Beta Protein Structure Prediction using Rosetta.

PubMed

Abbass, Jad; Nebel, Jean-Christophe

2017-01-01

Protein structure prediction is considered a main challenge in computational biology. The biennial international competition, Critical Assessment of protein Structure Prediction (CASP), has shown in its eleventh experiment that free modelling target predictions are still beyond reliable accuracy; therefore, much effort should be made to improve ab initio methods. Arguably, Rosetta is considered the most competitive method when it comes to targets with no homologues. Relying on fragments of length 9 and 3 from known structures, Rosetta creates putative structures by assembling candidate fragments. Generally, the structure with the lowest energy score, also known as first model, is chosen to be the "predicted one". A thorough study has been conducted on the role and diversity of 3-mers involved in Rosetta's model "refinement" phase. Usage of the standard number of 3-mers, i.e. 200, has been shown to degrade alpha and alpha-beta protein conformations initially achieved by assembling 9-mers. Therefore, a new prediction pipeline is proposed for Rosetta where the "refinement" phase is customised according to a target's structural class prediction. Over 8% improvement in terms of first model structure accuracy is reported for alpha and alpha-beta classes when decreasing the number of 3-mers. Copyright © Bentham Science Publishers.
Three-dimensional (3D) structure prediction of the American and African oil-palms β-ketoacyl-[ACP] synthase-II protein by comparative modelling

PubMed Central

Wang, Edina; Chinni, Suresh; Bhore, Subhash Janardhan

2014-01-01

Background: The fatty-acid profile of a vegetable oil determines its properties and nutritional value. Palm-oil obtained from the African oil-palm [Elaeis guineensis Jacq. (Tenera)] contains 44% palmitic acid (C16:0), but palm-oil obtained from the American oil-palm [Elaeis oleifera] contains only 25% C16:0. In part, the β-ketoacyl-[ACP] synthase II (KASII) [EC: 2.3.1.179] protein is responsible for the high level of C16:0 in palm-oil derived from the African oil-palm. To understand more about E. guineensis KASII (EgKASII) and E. oleifera KASII (EoKASII) proteins, it is essential to know their structures. Hence, this study was undertaken. Objective: The objective of this study was to predict the three-dimensional (3D) structure of EgKASII and EoKASII proteins using molecular modelling tools. Materials and Methods: The amino-acid sequences for KASII proteins were retrieved from the protein database of National Center for Biotechnology Information (NCBI), USA. The 3D structures were predicted for both proteins using homology modelling and ab-initio approaches to protein structure prediction. Molecular dynamics (MD) simulation was performed to refine the predicted structures. The predicted structure models were evaluated and root mean square deviation (RMSD) and root mean square fluctuation (RMSF) values were calculated. Results: The homology modelling showed that EgKASII and EoKASII proteins are 78% and 74% similar to Streptococcus pneumoniae KASII and Brucella melitensis KASII, respectively. The EgKASII and EoKASII structures predicted using the ab-initio approach show 6% and 9% deviation from the structures predicted by homology modelling, respectively. The structure refinement and validation confirmed that the predicted structures are accurate. Conclusion: The 3D structures for EgKASII and EoKASII proteins were predicted. However, further research is essential to understand the interaction of EgKASII and EoKASII proteins with their substrates. 
PMID:24748752
Three-dimensional (3D) structure prediction of the American and African oil-palms β-ketoacyl-[ACP] synthase-II protein by comparative modelling.

PubMed

Wang, Edina; Chinni, Suresh; Bhore, Subhash Janardhan

2014-01-01

The fatty-acid profile of a vegetable oil determines its properties and nutritional value. Palm-oil obtained from the African oil-palm [Elaeis guineensis Jacq. (Tenera)] contains 44% palmitic acid (C16:0), but palm-oil obtained from the American oil-palm [Elaeis oleifera] contains only 25% C16:0. In part, the β-ketoacyl-[ACP] synthase II (KASII) [EC: 2.3.1.179] protein is responsible for the high level of C16:0 in palm-oil derived from the African oil-palm. To understand more about E. guineensis KASII (EgKASII) and E. oleifera KASII (EoKASII) proteins, it is essential to know their structures. Hence, this study was undertaken. The objective of this study was to predict the three-dimensional (3D) structure of EgKASII and EoKASII proteins using molecular modelling tools. The amino-acid sequences for KASII proteins were retrieved from the protein database of National Center for Biotechnology Information (NCBI), USA. The 3D structures were predicted for both proteins using homology modelling and ab-initio approaches to protein structure prediction. Molecular dynamics (MD) simulation was performed to refine the predicted structures. The predicted structure models were evaluated and root mean square deviation (RMSD) and root mean square fluctuation (RMSF) values were calculated. The homology modelling showed that EgKASII and EoKASII proteins are 78% and 74% similar to Streptococcus pneumoniae KASII and Brucella melitensis KASII, respectively. The EgKASII and EoKASII structures predicted using the ab-initio approach show 6% and 9% deviation from the structures predicted by homology modelling, respectively. The structure refinement and validation confirmed that the predicted structures are accurate. The 3D structures for EgKASII and EoKASII proteins were predicted. However, further research is essential to understand the interaction of EgKASII and EoKASII proteins with their substrates.
Conformation-dependent backbone geometry restraints set a new standard for protein crystallographic refinement

DOE PAGES

Moriarty, Nigel W.; Tronrud, Dale E.; Adams, Paul D.; ...

2014-06-17

Ideal values of bond angles and lengths used as external restraints are crucial for the successful refinement of protein crystal structures at all but the highest of resolutions. The restraints in common usage today have been designed based on the assumption that each type of bond or angle has a single ideal value independent of context. However, recent work has shown that the ideal values are, in fact, sensitive to local conformation, and as a first step toward using such information to build more accurate models, ultra-high resolution protein crystal structures have been used to derive a conformation-dependent library (CDL) of restraints for the protein backbone (Berkholz et al. 2009. Structure. 17, 1316). Here, we report the introduction of this CDL into the Phenix package and the results of test refinements of thousands of structures across a wide range of resolutions. These tests show that use of the conformation dependent library yields models that have substantially better agreement with ideal main-chain bond angles and lengths and, on average, a slightly enhanced fit to the X-ray data. No disadvantages of using the backbone CDL are apparent. In Phenix, usage of the CDL can be selected by simply specifying the cdl=True option. This successful implementation paves the way for further aspects of the context-dependence of ideal geometry to be characterized and applied to improve experimental and predictive modelling accuracy.
Predicting protein-protein interactions on a proteome scale by matching evolutionary and structural similarities at interfaces using PRISM.

PubMed

Tuncbag, Nurcan; Gursoy, Attila; Nussinov, Ruth; Keskin, Ozlem

2011-08-11

Prediction of protein-protein interactions at the structural level on the proteome scale is important because it allows prediction of protein function, helps drug discovery and takes steps toward genome-wide structural systems biology. We provide a protocol (termed PRISM, protein interactions by structural matching) for large-scale prediction of protein-protein interactions and assembly of protein complex structures. The method consists of two components: rigid-body structural comparisons of target proteins to known template protein-protein interfaces and flexible refinement using a docking energy function. The PRISM rationale follows our observation that globally different protein structures can interact via similar architectural motifs. PRISM predicts binding residues by using structural similarity and evolutionary conservation of putative binding residue 'hot spots'. Ultimately, PRISM could help to construct cellular pathways and functional, proteome-scale annotation. PRISM is implemented in Python and runs in a UNIX environment. The program accepts Protein Data Bank-formatted protein structures and is available at http://prism.ccbb.ku.edu.tr/prism_protocol/.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Leimkuhler, B.; Hermans, J.; Skeel, R.D.

A workshop was held on algorithms and parallel implementations for macromolecular dynamics, protein folding, and structural refinement. This document contains abstracts and brief reports from that workshop.
Accurate macromolecular crystallographic refinement: incorporation of the linear scaling, semiempirical quantum-mechanics program DivCon into the PHENIX refinement package

DOE Office of Scientific and Technical Information (OSTI.GOV)

Borbulevych, Oleg Y.; Plumley, Joshua A.; Martin, Roger I.

2014-05-01

Semiempirical quantum-chemical X-ray macromolecular refinement using the program DivCon integrated with PHENIX is described. Macromolecular crystallographic refinement relies on sometimes dubious stereochemical restraints and rudimentary energy functionals to ensure the correct geometry of the model of the macromolecule and any covalently bound ligand(s). The ligand stereochemical restraint file (CIF) requires a priori understanding of the ligand geometry within the active site, and creation of the CIF is often an error-prone process owing to the great variety of potential ligand chemistry and structure. Stereochemical restraints have been replaced with more robust functionals through the integration of the linear-scaling, semiempirical quantum-mechanics (SE-QM) program DivCon with the PHENIX X-ray refinement engine. The PHENIX/DivCon package has been thoroughly validated on a population of 50 protein–ligand Protein Data Bank (PDB) structures with a range of resolutions and chemistry. The PDB structures used for the validation were originally refined utilizing various refinement packages and were published within the past five years. PHENIX/DivCon does not utilize CIF(s), link restraints and other parameters for refinement and hence it does not make as many a priori assumptions about the model. Across the entire population, the method results in reasonable ligand geometries and low ligand strains, even when the original refinement exhibited difficulties, indicating that PHENIX/DivCon is applicable to both single-structure and high-throughput crystallography.
XAS Characterization of the Zn Site of Non-structural Protein 3 (NS3) from Hepatitis C Virus

NASA Astrophysics Data System (ADS)

Ascone, I.; Nobili, G.; Benfatto, M.; Congiu-Castellano, A.

2007-02-01

XANES spectra of non-structural protein 3 (NS3) have been calculated using four Zn coordination models from three crystallographic structures in the Protein Data Bank (PDB): 1DY9 subunit B, 1CU1 subunits A and B, and 1JXP subunit B. Results indicate that XANES is an appropriate tool to distinguish among them. Experimental XANES spectra have been simulated by refining the crystallographic data. The model obtained by XAS is compared with the PDB models.

Similarity Measures for Protein Ensembles

PubMed Central

Lindorff-Larsen, Kresten; Ferkinghoff-Borg, Jesper

2009-01-01

Analyses of similarities and changes in protein conformation can provide important information regarding protein function and evolution. Many scores, including the commonly used root mean square deviation, have therefore been developed to quantify the similarities of different protein conformations. However, instead of examining individual conformations it is in many cases more relevant to analyse ensembles of conformations that have been obtained either through experiments or from methods such as molecular dynamics simulations. We here present three approaches that can be used to compare conformational ensembles in the same way as the root mean square deviation is used to compare individual pairs of structures. The methods are based on the estimation of the probability distributions underlying the ensembles and subsequent comparison of these distributions. We first validate the methods using a synthetic example from molecular dynamics simulations. We then apply the algorithms to revisit the problem of ensemble averaging during structure determination of proteins, and find that an ensemble refinement method is able to recover the correct distribution of conformations better than standard single-molecule refinement. PMID:19145244
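The idea of comparing ensembles through their underlying probability distributions, rather than structure by structure, can be illustrated with a small sketch. This is not the authors' implementation: it assumes each conformation has already been reduced to a single scalar descriptor (for example a radius of gyration), estimates each ensemble's distribution with a histogram, and compares the two with the Jensen-Shannon divergence. The function names `histogram` and `js_divergence` and all numbers are hypothetical.

```python
import math

def histogram(values, lo, hi, bins):
    """Return a normalized histogram (bin probabilities summing to 1)."""
    counts = [0] * bins
    width = (hi - lo) / bins
    for v in values:
        idx = int((v - lo) / width)
        idx = max(0, min(idx, bins - 1))  # clamp values at or beyond the edges
        counts[idx] += 1
    total = len(values)
    return [c / total for c in counts]

def js_divergence(p, q):
    """Jensen-Shannon divergence in bits: 0 for identical, 1 for disjoint."""
    def kl(a, b):
        # Terms with a == 0 contribute nothing to the sum.
        return sum(x * math.log2(x / y) for x, y in zip(a, b) if x > 0)
    m = [(x + y) / 2 for x, y in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

# Hypothetical scalar descriptors (e.g. radius of gyration in Å) for two ensembles.
ensemble_a = [14.1, 14.3, 14.2, 14.8, 15.0, 14.4]
ensemble_b = [14.2, 14.9, 15.1, 15.3, 15.2, 14.7]

p = histogram(ensemble_a, lo=14.0, hi=15.5, bins=3)
q = histogram(ensemble_b, lo=14.0, hi=15.5, bins=3)
print(js_divergence(p, q))
```

A bounded, symmetric score of this kind plays the role for ensembles that the root mean square deviation plays for pairs of structures; real applications would need a multidimensional description of conformation rather than one scalar.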
Reintroducing electrostatics into macromolecular crystallographic refinement: application to neutron crystallography and DNA hydration.

PubMed

Fenn, Timothy D; Schnieders, Michael J; Mustyakimov, Marat; Wu, Chuanjie; Langan, Paul; Pande, Vijay S; Brunger, Axel T

2011-04-13

Most current crystallographic structure refinements augment the diffraction data with a priori information consisting of bond, angle, dihedral, planarity restraints, and atomic repulsion based on the Pauli exclusion principle. Yet, electrostatics and van der Waals attraction are physical forces that provide additional a priori information. Here, we assess the inclusion of electrostatics for the force field used for all-atom (including hydrogen) joint neutron/X-ray refinement. Two DNA and a protein crystal structure were refined against joint neutron/X-ray diffraction data sets using force fields without electrostatics or with electrostatics. Hydrogen-bond orientation/geometry favors the inclusion of electrostatics. Refinement of Z-DNA with electrostatics leads to a hypothesis for the entropic stabilization of Z-DNA that may partly explain the thermodynamics of converting the B form of DNA to its Z form. Thus, inclusion of electrostatics assists joint neutron/X-ray refinements, especially for placing and orienting hydrogen atoms. Copyright © 2011 Elsevier Ltd. All rights reserved.
Reintroducing Electrostatics into Macromolecular Crystallographic Refinement: Application to Neutron Crystallography and DNA Hydration

PubMed Central

Fenn, Timothy D.; Schnieders, Michael J.; Mustyakimov, Marat; Wu, Chuanjie; Langan, Paul; Pande, Vijay S.; Brunger, Axel T.

2011-01-01

Summary: Most current crystallographic structure refinements augment the diffraction data with a priori information consisting of bond, angle, dihedral, planarity restraints and atomic repulsion based on the Pauli exclusion principle. Yet, electrostatics and van der Waals attraction are physical forces that provide additional a priori information. Here we assess the inclusion of electrostatics for the force field used for all-atom (including hydrogen) joint neutron/X-ray refinement. Two DNA and a protein crystal structure were refined against joint neutron/X-ray diffraction data sets using force fields without electrostatics or with electrostatics. Hydrogen bond orientation/geometry favors the inclusion of electrostatics. Refinement of Z-DNA with electrostatics leads to a hypothesis for the entropic stabilization of Z-DNA that may partly explain the thermodynamics of converting the B form of DNA to its Z form. Thus, inclusion of electrostatics assists joint neutron/X-ray refinements, especially for placing and orienting hydrogen atoms. PMID:21481775
Simultaneous use of solution NMR and X-ray data in REFMAC5 for joint refinement/detection of structural differences.

PubMed

Rinaldelli, Mauro; Ravera, Enrico; Calderone, Vito; Parigi, Giacomo; Murshudov, Garib N; Luchinat, Claudio

2014-04-01

The program REFMAC5 from CCP4 was modified to allow the simultaneous use of X-ray crystallographic data and paramagnetic NMR data (pseudocontact shifts and self-orientation residual dipolar couplings) and/or diamagnetic residual dipolar couplings. Incorporation of these long-range NMR restraints in REFMAC5 can reveal differences between solid-state and solution conformations of molecules or, in their absence, can be used together with X-ray crystallographic data for structural refinement. Since NMR and X-ray data are complementary, when a single structure is consistent with both sets of data and still maintains reasonably 'ideal' geometries, the reliability of the derived atomic model is expected to increase. The program was tested on five different proteins: the catalytic domain of matrix metalloproteinase 1, GB3, ubiquitin, free calmodulin and calmodulin complexed with a peptide. In some cases the joint refinement produced a single model consistent with both sets of observations, while in other cases it indicated, outside the experimental uncertainty, the presence of different protein conformations in solution and in the solid state.
Protein Structure and Function Prediction Using I-TASSER

PubMed Central

Yang, Jianyi; Zhang, Yang

2016-01-01

I-TASSER is a hierarchical protocol for automated protein structure prediction and structure-based function annotation. Starting from the amino acid sequence of target proteins, I-TASSER first generates full-length atomic structural models from multiple threading alignments and iterative structural assembly simulations followed by atomic-level structure refinement. The biological functions of the protein, including ligand-binding sites, enzyme commission number, and gene ontology terms, are then inferred from known protein function databases based on sequence and structure profile comparisons. I-TASSER is freely available as both an on-line server and a stand-alone package. This unit describes how to use the I-TASSER protocol to generate structure and function prediction and how to interpret the prediction results, as well as alternative approaches for further improving the I-TASSER modeling quality for distant-homologous and multi-domain protein targets. PMID:26678386
Dynamic New World: Refining Our View of Protein Structure, Function and Evolution

PubMed Central

Mannige, Ranjan V.

2014-01-01

Proteins are crucial to the functioning of all lifeforms. Traditional understanding posits that a single protein occupies a single structure (“fold”), which performs a single function. This view is radically challenged with the recognition that high structural dynamism—the capacity to be extra “floppy”—is more prevalent in functional proteins than previously assumed. As reviewed here, this dynamic take on proteins affects our understanding of protein “structure”, function, and evolution, and even gives us a glimpse into protein origination. Specifically, this review will discuss historical developments concerning protein structure, and important new relationships between dynamism and aspects of protein sequence, structure, binding modes, binding promiscuity, evolvability, and origination. Along the way, suggestions will be provided for how key parts of textbook definitions—that so far have excluded membership to intrinsically disordered proteins (IDPs)—could be modified to accommodate our more dynamic understanding of proteins. PMID:28250374
PDB_REDO: automated re-refinement of X-ray structure models in the PDB.

PubMed

Joosten, Robbie P; Salzemann, Jean; Bloch, Vincent; Stockinger, Heinz; Berglund, Ann-Charlott; Blanchet, Christophe; Bongcam-Rudloff, Erik; Combet, Christophe; Da Costa, Ana L; Deleage, Gilbert; Diarena, Matteo; Fabbretti, Roberto; Fettahi, Géraldine; Flegel, Volker; Gisel, Andreas; Kasam, Vinod; Kervinen, Timo; Korpelainen, Eija; Mattila, Kimmo; Pagni, Marco; Reichstadt, Matthieu; Breton, Vincent; Tickle, Ian J; Vriend, Gert

2009-06-01

Structural biology, homology modelling and rational drug design require accurate three-dimensional macromolecular coordinates. However, the coordinates in the Protein Data Bank (PDB) have not all been obtained using the latest experimental and computational methods. In this study a method is presented for automated re-refinement of existing structure models in the PDB. A large-scale benchmark with 16 807 PDB entries showed that they can be improved in terms of fit to the deposited experimental X-ray data as well as in terms of geometric quality. The re-refinement protocol uses TLS models to describe concerted atom movement. The resulting structure models are made available through the PDB_REDO databank (http://www.cmbi.ru.nl/pdb_redo/). Grid computing techniques were used to overcome the computational requirements of this endeavour.
PROGEN: An automated modelling algorithm for the generation of complete protein structures from the α-carbon atomic coordinates

NASA Astrophysics Data System (ADS)

Mandal, Chhabinath; Linthicum, D. Scott

1993-04-01

A modelling algorithm (PROGEN) for the generation of complete protein atomic coordinates from only the α-carbon coordinates is described. PROGEN utilizes an optimal geometry parameter (OGP) database for the positioning of atoms for each amino acid of the polypeptide model. The OGP database was established by examining the statistical correlations between 23 different intra-peptide and inter-peptide geometric parameters relative to the α-carbon distances for each amino acid in a library of 19 known proteins from the Brookhaven Protein Database (BPDB). The OGP files for specific amino acids and peptides were used to generate the atomic positions, with respect to α-carbons, for main-chain and side-chain atoms in the modelled structure. Refinement of the initial model was accomplished using energy minimization (EM) and molecular dynamics techniques. PROGEN was tested using 60 known proteins in the BPDB, representing a wide spectrum of primary and secondary structures. Comparison between PROGEN models and BPDB crystal reference structures gave r.m.s.d. values for peptide main-chain atoms between 0.29 and 0.76 Å, with a grand average of 0.53 Å for all 60 models. The r.m.s.d. for all non-hydrogen atoms ranged between 1.44 and 1.93 Å for the 60 polypeptide models. PROGEN was also able to make the correct assignment of cis- or trans-proline configurations in the protein structures examined. PROGEN offers a fully automatic building and refinement procedure and requires no special or specific structural considerations for the protein to be modelled.
Refined crystal structure of DsRed, a red fluorescent protein from coral, at 2.0-Å resolution.

PubMed

Yarbrough, D; Wachter, R M; Kallio, K; Matz, M V; Remington, S J

2001-01-16

The crystal structure of DsRed, a red fluorescent protein from a corallimorpharian, has been determined at 2.0-Å resolution by multiple-wavelength anomalous dispersion and crystallographic refinement. Crystals of the selenomethionine-substituted protein have space group P2(1) and contain a tetramer with 222 noncrystallographic symmetry in the asymmetric unit. The refined model has satisfactory stereochemistry and a final crystallographic R factor of 0.162. The protein, which forms an obligatory tetramer in solution and in the crystal, is a squat rectangular prism comprising four protomers whose fold is extremely similar to that of the Aequorea victoria green fluorescent protein despite low (approximately 23%) amino acid sequence homology. The monomer consists of an 11-stranded beta barrel with a coaxial helix. The chromophores, formed from the primary sequence -Gln-Tyr-Gly- (residues 66-68), are arranged in an approximately 27 x 34-Å rectangular array in two approximately antiparallel pairs. The geometry at the alpha carbon of Gln-66 (refined without stereochemical restraints) is consistent with an sp2-hybridized center, in accord with the proposal that red fluorescence is due to an additional oxidation step that forms an acylimine extension to the chromophore [Gross, L. A., Baird, G. S., Hoffman, R. C., Baldridge, K. K. & Tsien, R. Y. (2000) Proc. Natl. Acad. Sci. USA 97, 11990-11995]. The carbonyl oxygen of Phe-65 is almost 90 degrees out of the plane of the chromophore, consistent with theoretical calculations suggesting that this is the minimum energy conformation of this moiety despite the conjugation of this group with the rest of the chromophore.
PONDEROSA-C/S: client-server based software package for automated protein 3D structure determination.

PubMed

Lee, Woonghee; Stark, Jaime L; Markley, John L

2014-11-01

Peak-picking Of Noe Data Enabled by Restriction Of Shift Assignments-Client Server (PONDEROSA-C/S) builds on the original PONDEROSA software (Lee et al. in Bioinformatics 27:1727-1728. doi: 10.1093/bioinformatics/btr200, 2011) and includes improved features for structure calculation and refinement. PONDEROSA-C/S consists of three programs: Ponderosa Server, Ponderosa Client, and Ponderosa Analyzer. PONDEROSA-C/S takes as input the protein sequence, a list of assigned chemical shifts, and nuclear Overhauser data sets (13C- and/or 15N-NOESY). The output is a set of assigned NOEs and 3D structural models for the protein. Ponderosa Analyzer supports the visualization, validation, and refinement of the results from Ponderosa Server. These tools enable semi-automated NMR-based structure determination of proteins in a rapid and robust fashion. We present examples showing the use of PONDEROSA-C/S in solving structures of four proteins: two that enable comparison with the original PONDEROSA package, and two from the Critical Assessment of automated Structure Determination by NMR (Rosato et al. in Nat Methods 6:625-626. doi: 10.1038/nmeth0909-625, 2009) competition. The software package can be downloaded freely in binary format from http://pine.nmrfam.wisc.edu/download_packages.html. Registered users of the National Magnetic Resonance Facility at Madison can submit jobs to the PONDEROSA-C/S server at http://ponderosa.nmrfam.wisc.edu, where instructions and tutorials can be found. Structures are normally returned within 1-2 days.
Triclinic lysozyme at 0.65 angstrom resolution.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wang, J.; Dauter, M.; Alkire, R.

The crystal structure of triclinic hen egg-white lysozyme (HEWL) has been refined against diffraction data extending to 0.65 Å resolution measured at 100 K using synchrotron radiation. Refinement with anisotropic displacement parameters and with the removal of stereochemical restraints for the well ordered parts of the structure converged with a conventional R factor of 8.39% and an Rfree of 9.52%. The use of full-matrix refinement provided an estimate of the variances in the derived parameters. In addition to the 129-residue protein, a total of 170 water molecules, nine nitrate ions, one acetate ion and three ethylene glycol molecules were located in the electron-density map. Eight sections of the main chain and many side chains were modeled with alternate conformations. The occupancies of the water sites were refined and this step is meaningful when assessed by use of the free R factor. A detailed description and comparison of the structure are made with reference to the previously reported triclinic HEWL structures refined at 0.925 Å (at the low temperature of 120 K) and at 0.95 Å resolution (at room temperature).
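For readers unfamiliar with the two statistics quoted in this record, the sketch below shows how a conventional R factor and a free R factor are commonly defined: R = sum(| |Fobs| - |Fcalc| |) / sum(|Fobs|), evaluated over the reflections used in refinement (working set) or over a small set held out from refinement (free set). It is an illustration only, with made-up amplitudes; it assumes the calculated amplitudes are already on the same scale as the observed ones, and the names `r_factor` and `r_work_and_free` are hypothetical.

```python
def r_factor(f_obs, f_calc):
    """Conventional crystallographic R factor over one set of reflections.

    Assumes f_calc has already been scaled to f_obs.
    """
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator

def r_work_and_free(f_obs, f_calc, free_flags):
    """Split reflections by flag and return (R_work, R_free).

    free_flags[i] is True when reflection i was excluded from refinement.
    """
    work = [(o, c) for o, c, f in zip(f_obs, f_calc, free_flags) if not f]
    free = [(o, c) for o, c, f in zip(f_obs, f_calc, free_flags) if f]
    r_work = r_factor([o for o, _ in work], [c for _, c in work])
    r_free = r_factor([o for o, _ in free], [c for _, c in free])
    return r_work, r_free

# Made-up structure-factor amplitudes; the last reflection is in the free set.
f_obs = [100.0, 50.0, 25.0, 25.0]
f_calc = [90.0, 55.0, 25.0, 30.0]
flags = [False, False, False, True]

r_work, r_free = r_work_and_free(f_obs, f_calc, flags)
print(f"R_work = {r_work:.3f}, R_free = {r_free:.3f}")
```

Because the free reflections never influence the model, a refinement step that lowers the working R factor without lowering the free R factor is usually read as overfitting, which is why the abstract assesses the occupancy refinement against the free R factor.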
Real-space refinement in PHENIX for cryo-EM and crystallography

DOE PAGES

Afonine, Pavel V.; Poon, Billy K.; Read, Randy J.; ...

2018-06-01

This work describes the implementation of real-space refinement in the phenix.real_space_refine program from the PHENIX suite. The use of a simplified refinement target function enables very fast calculation, which in turn makes it possible to identify optimal data-restraint weights as part of routine refinements with little runtime cost. Refinement of atomic models against low-resolution data benefits from the inclusion of as much additional information as is available. In addition to standard restraints on covalent geometry, phenix.real_space_refine makes use of extra information such as secondary-structure and rotamer-specific restraints, as well as restraints or constraints on internal molecular symmetry. The re-refinement of 385 cryo-EM-derived models available in the Protein Data Bank at resolutions of 6 Å or better shows significant improvement of the models and of the fit of these models to the target maps.
Designing and evaluating the MULTICOM protein local and global model quality prediction methods in the CASP10 experiment

PubMed Central

2014-01-01

Background: Protein model quality assessment is an essential component of generating and using protein structural models. During the Tenth Critical Assessment of Techniques for Protein Structure Prediction (CASP10), we developed and tested four automated methods (MULTICOM-REFINE, MULTICOM-CLUSTER, MULTICOM-NOVEL, and MULTICOM-CONSTRUCT) that predicted both local and global quality of protein structural models. Results: MULTICOM-REFINE was a clustering approach that used the average pairwise structural similarity between models to measure the global quality and the average Euclidean distance between a model and several top ranked models to measure the local quality. MULTICOM-CLUSTER and MULTICOM-NOVEL were two new support vector machine-based methods of predicting both the local and global quality of a single protein model. MULTICOM-CONSTRUCT was a new weighted pairwise model comparison (clustering) method that used the weighted average similarity between models in a pool to measure the global model quality. Our experiments showed that the pairwise model assessment methods worked better when a large portion of models in the pool were of good quality, whereas single-model quality assessment methods performed better on some hard targets when only a small portion of models in the pool were of reasonable quality. Conclusions: Since digging out a few good models from a large pool of low-quality models is a major challenge in protein structure prediction, single model quality assessment methods appear to be poised to make important contributions to protein structure modeling. The other interesting finding was that single-model quality assessment scores could be used to weight the models by the consensus pairwise model comparison method to improve its accuracy. PMID:24731387
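The pairwise (consensus) idea described in this record, scoring each model by its average structural similarity to the other models in the pool, can be sketched as follows. This is not the MULTICOM code: the similarity matrix is made up, the abstract does not specify the similarity measure, and the names `consensus_scores` and `weighted_consensus_scores` are hypothetical. The weighted variant only illustrates the remark that single-model scores could weight the pairwise comparison.

```python
def consensus_scores(sim):
    """Global quality of each model as its mean similarity to all other models.

    sim[i][j] is a symmetric similarity in [0, 1] between models i and j.
    """
    n = len(sim)
    return [
        sum(sim[i][j] for j in range(n) if j != i) / (n - 1)
        for i in range(n)
    ]

def weighted_consensus_scores(sim, weights):
    """Same as above, but each partner model j contributes with weight weights[j]."""
    n = len(sim)
    scores = []
    for i in range(n):
        total_weight = sum(weights[j] for j in range(n) if j != i)
        weighted_sum = sum(weights[j] * sim[i][j] for j in range(n) if j != i)
        scores.append(weighted_sum / total_weight)
    return scores

# Made-up similarity matrix for a pool of three models.
similarity = [
    [1.0, 0.8, 0.2],
    [0.8, 1.0, 0.3],
    [0.2, 0.3, 1.0],
]

scores = consensus_scores(similarity)
best = max(range(len(scores)), key=scores.__getitem__)
print(scores, best)
```

The sketch also makes the reported limitation visible: when most models in the pool are poor, a good model has low similarity to its neighbours and is ranked down, which is the situation in which the abstract reports single-model methods doing better.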
Designing and evaluating the MULTICOM protein local and global model quality prediction methods in the CASP10 experiment.

PubMed

Cao, Renzhi; Wang, Zheng; Cheng, Jianlin

2014-04-15

Protein model quality assessment is an essential component of generating and using protein structural models. During the Tenth Critical Assessment of Techniques for Protein Structure Prediction (CASP10), we developed and tested four automated methods (MULTICOM-REFINE, MULTICOM-CLUSTER, MULTICOM-NOVEL, and MULTICOM-CONSTRUCT) that predicted both local and global quality of protein structural models. MULTICOM-REFINE was a clustering approach that used the average pairwise structural similarity between models to measure the global quality and the average Euclidean distance between a model and several top ranked models to measure the local quality. MULTICOM-CLUSTER and MULTICOM-NOVEL were two new support vector machine-based methods of predicting both the local and global quality of a single protein model. MULTICOM-CONSTRUCT was a new weighted pairwise model comparison (clustering) method that used the weighted average similarity between models in a pool to measure the global model quality. Our experiments showed that the pairwise model assessment methods worked better when a large portion of models in the pool were of good quality, whereas single-model quality assessment methods performed better on some hard targets when only a small portion of models in the pool were of reasonable quality. Since digging out a few good models from a large pool of low-quality models is a major challenge in protein structure prediction, single model quality assessment methods appear to be poised to make important contributions to protein structure modeling. The other interesting finding was that single-model quality assessment scores could be used to weight the models by the consensus pairwise model comparison method to improve its accuracy.
Modeling the Structure of Helical Assemblies with Experimental Constraints in Rosetta.

PubMed

André, Ingemar

2018-01-01

Determining high-resolution structures of proteins with helical symmetry can be challenging due to limitations in experimental data. In such instances, structure-based protein simulations driven by experimental data can provide a valuable approach for building models of helical assemblies. This chapter describes how the Rosetta macromolecular package can be used to model homomeric protein assemblies with helical symmetry in a range of modeling scenarios including energy refinement, symmetrical docking, comparative modeling, and de novo structure prediction. Data-guided structure modeling of helical assemblies with experimental information from electron density, X-ray fiber diffraction, solid-state NMR, and chemical cross-linking mass spectrometry is also described.
Molecular dynamics-based refinement and validation for sub-5 Å cryo-electron microscopy maps.

PubMed

Singharoy, Abhishek; Teo, Ivan; McGreevy, Ryan; Stone, John E; Zhao, Jianhua; Schulten, Klaus

2016-07-07

Two structure determination methods, based on the molecular dynamics flexible fitting (MDFF) paradigm, are presented that resolve sub-5 Å cryo-electron microscopy (EM) maps with either single structures or ensembles of such structures. The methods, denoted cascade MDFF and resolution exchange MDFF, sequentially re-refine a search model against a series of maps of progressively higher resolutions, which ends with the original experimental resolution. Application of sequential re-refinement enables MDFF to achieve a radius of convergence of ~25 Å demonstrated with the accurate modeling of β-galactosidase and TRPV1 proteins at 3.2 Å and 3.4 Å resolution, respectively. The MDFF refinements uniquely offer map-model validation and B-factor determination criteria based on the inherent dynamics of the macromolecules studied, captured by means of local root mean square fluctuations. The MDFF tools described are available to researchers through an easy-to-use and cost-effective cloud computing resource on Amazon Web Services.
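The local root mean square fluctuation (RMSF) mentioned in this record is a per-atom measure of how far an atom strays from its average position over a trajectory. The sketch below computes it for a toy trajectory and converts it to an isotropic B-factor with the standard textbook relation B = (8π²/3)·RMSF². It is an illustration under stated assumptions, not the MDFF implementation: frames are assumed to be already aligned, the coordinates are made up, and the names `rmsf_per_atom` and `b_factor_from_rmsf` are hypothetical.

```python
import math

def rmsf_per_atom(trajectory):
    """Per-atom RMSF from a list of frames; each frame is a list of (x, y, z).

    Frames are assumed to be superimposed on a common reference already.
    """
    n_frames = len(trajectory)
    n_atoms = len(trajectory[0])
    result = []
    for a in range(n_atoms):
        # Average position of atom a over all frames.
        mean = [sum(frame[a][k] for frame in trajectory) / n_frames for k in range(3)]
        # Mean squared displacement from that average position.
        msd = sum(
            sum((frame[a][k] - mean[k]) ** 2 for k in range(3))
            for frame in trajectory
        ) / n_frames
        result.append(math.sqrt(msd))
    return result

def b_factor_from_rmsf(rmsf):
    """Isotropic B-factor (Å^2) from an RMSF in Å: B = 8*pi^2/3 * RMSF^2."""
    return 8.0 * math.pi ** 2 / 3.0 * rmsf ** 2

# Toy two-atom trajectory: atom 0 oscillates along x, atom 1 does not move.
trajectory = [
    [(0.0, 0.0, 0.0), (5.0, 5.0, 5.0)],
    [(2.0, 0.0, 0.0), (5.0, 5.0, 5.0)],
]

fluctuations = rmsf_per_atom(trajectory)
print([b_factor_from_rmsf(r) for r in fluctuations])
```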
The Structure and Function of Non-Collagenous Bone Proteins

NASA Technical Reports Server (NTRS)

Hook, Magnus; McQuillan, David J.

1997-01-01

The research done under the cooperative research agreement for the project titled 'The structure and function of non-collagenous bone proteins' represented the first phase of an ongoing program to define the structural and functional relationships of the principal noncollagenous proteins in bone. An ultimate goal of this research is to enable design and execution of useful pharmacological compounds that will have a beneficial effect in treatment of osteoporosis, both land-based and induced by long-duration space travel. The goals of the now complete first phase were as follows: 1. Establish and/or develop powerful recombinant protein expression systems; 2. Develop and refine isolation and purification of recombinant proteins; 3. Express wild-type non-collagenous bone proteins; 4. Express site-specific mutant proteins and domains of wild-type proteins to enhance likelihood of crystal formation for subsequent solution of structure.
A new default restraint library for the protein backbone in Phenix: a conformation-dependent geometry goes mainstream

DOE PAGES

Moriarty, Nigel W.; Tronrud, Dale E.; Adams, Paul D.; ...

2016-01-01

Chemical restraints are a fundamental part of crystallographic protein structure refinement. In response to mounting evidence that conventional restraints have shortcomings, it has previously been documented that using backbone restraints that depend on the protein backbone conformation helps to address these shortcomings and improves the performance of refinements [Moriarty et al. (2014), FEBS J. 281, 4061–4071]. It is important that these improvements be made available to all in the protein crystallography community. Toward this end, a change in the default geometry library used by Phenix is described here. Tests are presented showing that this change will not generate increased numbers of outliers during validation, or deposition in the Protein Data Bank, during the transition period in which some validation tools still use the conventional restraint libraries.

Targeting Neuroblastoma Cell Surface Proteins: Recommendations for Homology Modeling of hNET, ALK, and TrkB.

PubMed

Haddad, Yazan; Heger, Zbyněk; Adam, Vojtech

2017-01-01

Targeted therapy is a promising approach for treatment of neuroblastoma, as is evident from the large number of targeting agents employed in clinical practice today. In the absence of known crystal structures, researchers rely on homology modeling to construct template-based theoretical structures for drug design and testing. Here, we discuss three candidate cell surface proteins that are suitable for homology modeling: human norepinephrine transporter (hNET), anaplastic lymphoma kinase (ALK), and neurotrophic tyrosine kinase receptor 2 (NTRK2 or TrkB). When choosing templates, both sequence identity and structure quality are important for homology modeling and pose the first of many challenges in the modeling process. Homology modeling of hNET can be improved using template models of dopamine and serotonin transporters instead of the leucine transporter (LeuT). The extracellular domains of ALK and TrkB are yet to be exploited by homology modeling. There are several idiosyncrasies that require direct attention throughout the process of model construction, evaluation and refinement. Shifts/gaps in the alignment between the template and target, backbone outliers and side-chain rotamer outliers are among the main sources of physical errors in the structures. Poorly conserved regions can be refined with loop modeling methods. Residue hydrophobicity, accessibility to bound metals or glycosylation can aid in model refinement. We recommend resolving these idiosyncrasies as part of "good modeling practice" to obtain the highest-quality model. Decreasing physical errors in protein structures plays a major role in the development of targeting agents and the understanding of chemical interactions at the molecular level.
A Message Passing Approach to Side Chain Positioning with Applications in Protein Docking Refinement

PubMed Central

Moghadasi, Mohammad; Kozakov, Dima; Mamonov, Artem B.; Vakili, Pirooz; Vajda, Sandor; Paschalidis, Ioannis Ch.

2013-01-01

We introduce a message-passing algorithm to solve the Side Chain Positioning (SCP) problem. SCP is a crucial component of protein docking refinement, which is a key step of an important class of problems in computational structural biology called protein docking. We model SCP as a combinatorial optimization problem and formulate it as a Maximum Weighted Independent Set (MWIS) problem. We then employ a modified and convergent belief-propagation algorithm to solve a relaxation of MWIS and develop randomized estimation heuristics that use the relaxed solution to obtain an effective MWIS feasible solution. Using a benchmark set of protein complexes we demonstrate that our approach leads to more accurate docking predictions compared to a baseline algorithm that does not solve the SCP. PMID:23515575
Solution structure of Syrian hamster prion protein rPrP(90-231).

PubMed

Liu, H; Farr-Jones, S; Ulyanov, N B; Llinas, M; Marqusee, S; Groth, D; Cohen, F E; Prusiner, S B; James, T L

1999-04-27

NMR has been used to refine the structure of Syrian hamster (SHa) prion protein rPrP(90-231), which is commensurate with the infectious protease-resistant core of the scrapie prion protein PrPSc. The structure of rPrP(90-231), refolded to resemble the normal cellular isoform PrPC spectroscopically and immunologically, has been studied using multidimensional NMR; initial results were published [James et al. (1997) Proc. Natl. Acad. Sci. U.S.A. 94, 10086-10091]. We now report refinement with better definition revealing important structural and dynamic features which can be related to biological observations pertinent to prion diseases. Structure refinement was based on 2778 unambiguously assigned nuclear Overhauser effect (NOE) connectivities, 297 ambiguous NOE restraints, and 63 scalar coupling constants (3JHNHα). The structure is represented by an ensemble of 25 best-scoring structures from 100 structures calculated using ARIA/X-PLOR and further refined with restrained molecular dynamics using the AMBER 4.1 force field with an explicit shell of water molecules. The rPrP(90-231) structure features a core domain (residues 125-228), with a backbone atomic root-mean-square deviation (RMSD) of 0.67 Å, consisting of three alpha-helices (residues 144-154, 172-193, and 200-227) and two short antiparallel beta-strands (residues 129-131 and 161-163). The N-terminus (residues 90-119) is largely unstructured despite some sparse and weak medium-range NOEs implying the existence of bends or turns. The transition region between the core domain and flexible N-terminus, i.e., residues 113-128, consists of hydrophobic residues or glycines and does not adopt any regular secondary structure in aqueous solution. There are about 30 medium- and long-range NOEs within this hydrophobic cluster, so it clearly manifests structure. Multiple discrete conformations are evident, implying the possible existence of one or more metastable states, which may feature in conversion of PrPC to PrPSc.
To obtain a more comprehensive picture of rPrP(90-231), dynamics have been studied using amide hydrogen-deuterium exchange and 15N NMR relaxation times (T1 and T2) and 15N{1H} NOE measurements. Comparison of the structure with previous reports suggests sequence-dependent features that may be reflected in a species barrier to prion disease transmission.
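The backbone RMSD quoted for the core domain summarizes how tightly the members of an NMR ensemble agree with one another. One common convention is the RMSD of each model to the ensemble mean, averaged over models; a minimal Python sketch of that convention follows. It assumes the models are already superimposed, and the function name and input layout are illustrative rather than taken from the study.

```python
import math

def ensemble_rmsd_to_mean(models):
    """Average RMSD of each model to the mean structure.

    models: list of models, each a list of (x, y, z) tuples for the same
    atoms in the same order. Prior superposition is assumed (an assumption
    of this sketch; no fitting is performed here).
    """
    n_models = len(models)
    n_atoms = len(models[0])
    # Mean position of every atom across the ensemble.
    mean = [
        tuple(sum(m[a][k] for m in models) / n_models for k in range(3))
        for a in range(n_atoms)
    ]
    rmsds = []
    for m in models:
        sq = sum(
            (m[a][k] - mean[a][k]) ** 2
            for a in range(n_atoms)
            for k in range(3)
        )
        rmsds.append(math.sqrt(sq / n_atoms))
    return sum(rmsds) / n_models
```

In practice the atom selection (for example, backbone atoms of residues 125-228) and the superposition step would be handled by a structure-analysis package before a calculation of this kind.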
Three-dimensional structure of Erwinia carotovora L-asparaginase

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kislitsyn, Yu. A.; Kravchenko, O. V.; Nikonov, S. V.

2006-10-15

The three-dimensional structure of Erwinia carotovora L-asparaginase, which has antitumor activity and is used for the treatment of acute lymphoblastic leukemia, was solved at 3 Å resolution and refined to R(cryst) = 20% and R(free) = 28%. Crystals of recombinant Erwinia carotovora L-asparaginase were grown by the hanging-drop vapor-diffusion method from protein solutions in a HEPES buffer (pH 6.5) and PEG MME 5000 solutions in a cacodylate buffer (pH 6.5) as the precipitant. Three-dimensional X-ray diffraction data were collected up to 3 Å resolution from one crystal at room temperature. The structure was solved by the molecular replacement method using the coordinates of Erwinia chrysanthemi L-asparaginase as the starting model. The coordinates refined with the use of the CNS program package were deposited in the Protein Data Bank (PDB code 1ZCF).
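The two refinement statistics quoted above follow the standard crystallographic residual, R = Σ| |Fobs| − |Fcalc| | / Σ|Fobs|, evaluated over the working reflections (R cryst) and over a held-out test set (R free). A minimal Python sketch is given below; it assumes the calculated amplitudes are already on the same scale as the observed ones, and the function names and the boolean free-flag layout are illustrative, not the CNS interface.

```python
def r_factor(f_obs, f_calc):
    """Crystallographic R: sum| |Fo| - |Fc| | / sum|Fo| (scale assumed 1)."""
    num = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    den = sum(abs(o) for o in f_obs)
    return num / den

def r_work_and_free(f_obs, f_calc, free_flags):
    """Split reflections by a boolean free flag and return (R_work, R_free)."""
    work = [(o, c) for o, c, f in zip(f_obs, f_calc, free_flags) if not f]
    free = [(o, c) for o, c, f in zip(f_obs, f_calc, free_flags) if f]
    r_work = r_factor([o for o, _ in work], [c for _, c in work])
    r_free = r_factor([o for o, _ in free], [c for _, c in free])
    return r_work, r_free
```

Because the test-set reflections are excluded from refinement, a gap between the two values, such as the 20% versus 28% reported here, gives a rough indication of how much the model has been fitted to noise.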
The Quality and Validation of Structures from Structural Genomics

PubMed Central

Domagalski, Marcin J.; Zheng, Heping; Zimmerman, Matthew D.; Dauter, Zbigniew; Wlodawer, Alexander; Minor, Wladek

2014-01-01

Quality control of three-dimensional structures of macromolecules is a critical step to ensure the integrity of structural biology data, especially those produced by structural genomics centers. Whereas the Protein Data Bank (PDB) has proven to be a remarkable success overall, the inconsistent quality of structures reveals a lack of universal standards for structure/deposit validation. Here, we review the state-of-the-art methods used in macromolecular structure validation, focusing on validation of structures determined by X-ray crystallography. We describe some general protocols used in the rebuilding and re-refinement of problematic structural models. We also briefly discuss some frontier areas of structure validation, including refinement of protein–ligand complexes, automation of structure redetermination, and the use of NMR structures and computational models to solve X-ray crystal structures by molecular replacement. PMID:24203341
Online interactive analysis of protein structure ensembles with Bio3D-web.

PubMed

Skjærven, Lars; Jariwala, Shashank; Yao, Xin-Qiu; Grant, Barry J

2016-11-15

Bio3D-web is an online application for analyzing the sequence, structure and conformational heterogeneity of protein families. Major functionality is provided for identifying protein structure sets for analysis, their alignment and refined structure superposition, sequence and structure conservation analysis, mapping and clustering of conformations and the quantitative comparison of their predicted structural dynamics. Bio3D-web is based on the Bio3D and Shiny R packages. All major browsers are supported and full source code is available under a GPL2 license from http://thegrantlab.org/bio3d-web CONTACT: bjgrant@umich.edu or lars.skjarven@uib.no. © The Author 2016. Published by Oxford University Press.
A series of PDB related databases for everyday needs.

PubMed

Joosten, Robbie P; te Beek, Tim A H; Krieger, Elmar; Hekkelman, Maarten L; Hooft, Rob W W; Schneider, Reinhard; Sander, Chris; Vriend, Gert

2011-01-01

The Protein Data Bank (PDB) is the world-wide repository of macromolecular structure information. We present a series of databases that run parallel to the PDB. Each database holds one entry, if possible, for each PDB entry. DSSP holds the secondary structure of the proteins. PDBREPORT holds reports on the structure quality and lists errors. HSSP holds a multiple sequence alignment for all proteins. The PDBFINDER holds easy to parse summaries of the PDB file content, augmented with essentials from the other systems. PDB_REDO holds re-refined, and often improved, copies of all structures solved by X-ray. WHY_NOT summarizes why certain files could not be produced. All these systems are updated weekly. The data sets can be used for the analysis of properties of protein structures in areas ranging from structural genomics, to cancer biology and protein design.
Atomistic structural ensemble refinement reveals non-native structure stabilizes a sub-millisecond folding intermediate of CheY

NASA Astrophysics Data System (ADS)

Shi, Jade; Nobrega, R. Paul; Schwantes, Christian; Kathuria, Sagar V.; Bilsel, Osman; Matthews, C. Robert; Lane, T. J.; Pande, Vijay S.

2017-03-01

The dynamics of globular proteins can be described in terms of transitions between a folded native state and less-populated intermediates, or excited states, which can play critical roles in both protein folding and function. Excited states are by definition transient species, and therefore are difficult to characterize using current experimental techniques. Here, we report an atomistic model of the excited state ensemble of a stabilized mutant of an extensively studied flavodoxin fold protein CheY. We employed a hybrid simulation and experimental approach in which an aggregate 42 milliseconds of all-atom molecular dynamics were used as an informative prior for the structure of the excited state ensemble. This prior was then refined against small-angle X-ray scattering (SAXS) data employing an established method (EROS). The most striking feature of the resulting excited state ensemble was an unstructured N-terminus stabilized by non-native contacts in a conformation that is topologically simpler than the native state. Using these results, we then predict incisive single molecule FRET experiments as a means of model validation. This study demonstrates the paradigm of uniting simulation and experiment in a statistical model to study the structure of protein excited states and rationally design validating experiments.
Purification, isolation, crystallization, and preliminary X-ray diffraction study of the BTB domain of the centrosomal protein 190 from Drosophila melanogaster

NASA Astrophysics Data System (ADS)

Boyko, K. M.; Nikolaeva, A. Yu.; Kachalova, G. S.; Bonchuk, A. N.; Popov, V. O.

2017-11-01

The spatial organization of the genome is controlled by a special class of architectural proteins, including proteins containing BTB domains that are able to dimerize or multimerize. The centrosomal protein 190 is one such architectural protein. The purification, crystallization, and preliminary X-ray diffraction study of the BTB domain of the centrosomal protein 190 are reported. The crystallization conditions were found by the vapor-diffusion technique. The crystals diffracted to 1.5 Å resolution and belonged to space group P3221. The structure was solved by the molecular replacement method. The structure refinement is currently underway.
AMMOS2: a web server for protein-ligand-water complexes refinement via molecular mechanics.

PubMed

Labbé, Céline M; Pencheva, Tania; Jereva, Dessislava; Desvillechabrol, Dimitri; Becot, Jérôme; Villoutreix, Bruno O; Pajeva, Ilza; Miteva, Maria A

2017-07-03

AMMOS2 is an interactive web server for efficient computational refinement of protein-small organic molecule complexes. The AMMOS2 protocol employs atomic-level energy minimization of a large number of experimental or modeled protein-ligand complexes. The web server is based on the previously developed standalone software AMMOS (Automatic Molecular Mechanics Optimization for in silico Screening). AMMOS utilizes the physics-based force field AMMP sp4 and performs optimization of protein-ligand interactions at five levels of flexibility of the protein receptor. The new version 2 of AMMOS implemented in the AMMOS2 web server allows the users to include explicit water molecules and individual metal ions in the protein-ligand complexes during minimization. The web server provides comprehensive analysis of computed energies and interactive visualization of refined protein-ligand complexes. The ligands are ranked by the minimized binding energies allowing the users to perform additional analysis for drug discovery or chemical biology projects. The web server has been extensively tested on 21 diverse protein-ligand complexes. AMMOS2 minimization shows consistent improvement over the initial complex structures in terms of minimized protein-ligand binding energies and water positions optimization. The AMMOS2 web server is freely available without any registration requirement at the URL: http://drugmod.rpbs.univ-paris-diderot.fr/ammosHome.php. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Re-refinement of the spliceosomal U4 snRNP core-domain structure

PubMed Central

Li, Jade; Leung, Adelaine K.; Kondo, Yasushi; Oubridge, Chris; Nagai, Kiyoshi

2016-01-01

The core domain of small nuclear ribonucleoprotein (snRNP), comprised of a ring of seven paralogous proteins bound around a single-stranded RNA sequence, functions as the assembly nucleus in the maturation of U1, U2, U4 and U5 spliceosomal snRNPs. The structure of the human U4 snRNP core domain was initially solved at 3.6 Å resolution by experimental phasing using data with tetartohedral twinning. Molecular replacement from this model followed by density modification using untwinned data recently led to a structure of the minimal U1 snRNP at 3.3 Å resolution. With the latter structure providing a search model for molecular replacement, the U4 core-domain structure has now been re-refined. The U4 Sm site-sequence AAUUUUU has been shown to bind to the seven Sm proteins SmF–SmE–SmG–SmD3–SmB–SmD1–SmD2 in an identical manner as the U1 Sm-site sequence AAUUUGU, except in SmD1 where the bound U replaces G. The progression from the initial to the re-refined structure exemplifies a tortuous route to accuracy: where well diffracting crystals of complex assemblies are initially unavailable, the early model errors are rectified by exploiting preliminary interpretations in further experiments involving homologous structures. New insights are obtained from the more accurate model. PMID:26894541
Super-resolution biomolecular crystallography with low-resolution data.

PubMed

Schröder, Gunnar F; Levitt, Michael; Brunger, Axel T

2010-04-22

X-ray diffraction plays a pivotal role in the understanding of biological systems by revealing atomic structures of proteins, nucleic acids and their complexes, with much recent interest in very large assemblies like the ribosome. As crystals of such large assemblies often diffract weakly (resolution worse than 4 Å), we need methods that work at such low resolution. In macromolecular assemblies, some of the components may be known at high resolution, whereas others are unknown: current refinement methods fail as they require a high-resolution starting structure for the entire complex. Determining the structure of such complexes, which are often of key biological importance, should be possible in principle as the number of independent diffraction intensities at a resolution better than 5 Å generally exceeds the number of degrees of freedom. Here we introduce a method that adds specific information from known homologous structures but allows global and local deformations of these homology models. Our approach uses the observation that local protein structure tends to be conserved as sequence and function evolve. Cross-validation with R(free) (the free R-factor) determines the optimum deformation and influence of the homology model. For test cases at 3.5-5 Å resolution with known structures at high resolution, our method gives significant improvements over conventional refinement in the model as monitored by coordinate accuracy, the definition of secondary structure and the quality of electron density maps. For re-refinements of a representative set of 19 low-resolution crystal structures from the Protein Data Bank, we find similar improvements. Thus, a structure derived from low-resolution diffraction data can have quality similar to a high-resolution structure. Our method is applicable to the study of weakly diffracting crystals using X-ray micro-diffraction as well as data from new X-ray light sources.
Use of homology information is not restricted to X-ray crystallography and cryo-electron microscopy: as optical imaging advances to subnanometre resolution, it can use similar tools.
Carbohydrate structure: the rocky road to automation.

PubMed

Agirre, Jon; Davies, Gideon J; Wilson, Keith S; Cowtan, Kevin D

2017-06-01

With the introduction of intuitive graphical software, structural biologists who are not experts in crystallography are now able to build complete protein or nucleic acid models rapidly. In contrast, carbohydrates are in a wholly different situation: scant automation exists, with manual building attempts being sometimes toppled by incorrect dictionaries or refinement problems. Sugars are the most stereochemically complex family of biomolecules and, as pyranose rings, have clear conformational preferences. Despite this, all refinement programs may produce high-energy conformations at medium to low resolution, without any support from the electron density. This problem renders the affected structures unusable in glyco-chemical terms. Bringing structural glycobiology up to 'protein standards' will require a total overhaul of the methodology. Time is of the essence, as the community is steadily increasing the production rate of glycoproteins, and electron cryo-microscopy has just started to image them in precisely that resolution range where crystallographic methods falter most. Copyright © 2016 Elsevier Ltd. All rights reserved.
Conformational Heterogeneity of Unbound Proteins Enhances Recognition in Protein-Protein Encounters.

PubMed

Pallara, Chiara; Rueda, Manuel; Abagyan, Ruben; Fernández-Recio, Juan

2016-07-12

To understand cellular processes at the molecular level we need to improve our knowledge of protein-protein interactions, from a structural, mechanistic, and energetic point of view. Current theoretical studies and computational docking simulations show that protein dynamics plays a key role in protein association and support the need for including protein flexibility in modeling protein interactions. Assuming the conformational selection binding mechanism, in which the unbound state can sample bound conformers, one possible strategy to include flexibility in docking predictions would be the use of conformational ensembles originated from unbound protein structures. Here we present an exhaustive computational study about the use of precomputed unbound ensembles in the context of protein docking, performed on a set of 124 cases of the Protein-Protein Docking Benchmark 3.0. Conformational ensembles were generated by conformational optimization and refinement with MODELLER and by short molecular dynamics trajectories with AMBER. We identified those conformers providing optimal binding and investigated the role of protein conformational heterogeneity in protein-protein recognition. Our results show that a restricted conformational refinement can generate conformers with better binding properties and improve docking encounters in medium-flexible cases. For more flexible cases, a more extended conformational sampling based on Normal Mode Analysis was proven helpful. We found that successful conformers provide better energetic complementarity to the docking partners, which is compatible with recent views of binding association. In addition to the mechanistic considerations, these findings could be exploited for practical docking predictions of improved efficiency.
Structural elucidation of transmembrane domain zero (TMD0) of EcdL: A multidrug resistance-associated protein (MRP) family of ATP-binding cassette transporter protein revealed by atomistic simulation.

PubMed

Bera, Krishnendu; Rani, Priyanka; Kishor, Gaurav; Agarwal, Shikha; Kumar, Antresh; Singh, Durg Vijay

2017-09-20

ATP-binding cassette (ABC) transporters play an extensive role in the translocation of diverse sets of biologically important molecules across membranes. EchinocandinB (an antifungal) and the EcdL protein of Aspergillus rugulosus are encoded by the same cluster of genes. Co-expression of EcdL and echinocandinB reflects tightly linked biological functions. EcdL belongs to the Multidrug Resistance-associated Protein (MRP) subfamily of ABC transporters, which has an extra transmembrane domain zero (TMD0). The complete structure of an MRP subfamily member comprising the TMD0 domain is not known at atomic resolution. We hypothesized that the transport of echinocandinB is mediated via the EcdL protein. Hence, it is pertinent to know the topological arrangement of TMD0 with respect to the other domains of the protein and its possible role in the transport of echinocandinB. The absence of an effective template for the TMD0 domain led us to model it with I-TASSER; the structure was then refined by multiple-template modelling using homologous templates of the remaining domains (TMD1, NBD1, TMD2, NBD2). The modelled structure has been validated for packing, folding and stereochemical properties. MD simulation for 0.1 μs has been carried out in the biphasic environment for refinement of the modelled protein. Non-redundant structures have been extracted by clustering of the MD trajectory. The structural alignment of the modelled structure has shown Z-score -37.9; 31.6, 31.5 with RMSD; 2.4, 4.2, 4.8 with ABC transporters; PDB ID 4F4C, 4M1M, 4M2T, respectively, reflecting the correctness of the structure. EchinocandinB has been docked to the modelled as well as to the clustered structures, which reveals interaction of echinocandinB with TMD0 and other TM helices in the translocation path built of TMDs.
Crystal Structure of Prunin-1, a Major Component of the Almond (Prunus dulcis) Allergen Amandin

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jin, Tengchuan; Albillos, Silvia M.; Guo, Feng

Seed storage proteins are accumulated during seed development and act as a reserve of nutrition for seed germination and young sprout growth. Plant seeds play an important role in human nutrition by providing a relatively inexpensive source of protein. However, many plant foods contain allergenic proteins, and the number of people suffering from food allergies has increased rapidly in recent years. The 11S globulins are the most widespread seed storage proteins, present in monocotyledonous and dicotyledonous seeds as well as in gymnosperms (conifers) and other spermatophytes. This family of proteins accounts for a number of known major food allergens. They are of interest to both the public and industry due to food safety concerns. Because of the interests in the structural basis of the allergenicity of food allergens, we sought to determine the crystal structure of Pru1, the major component of the 11S storage protein from almonds. The structure was refined to 2.4 Å, and the R/Rfree for the final refined structure is 17.2/22.9. Pru1 is a hexamer made of two trimers. Most of the back-to-back trimer-trimer association was contributed by monomer-monomer interactions. An α helix (helix 6) at the C-terminal end of the acidic domain of one of the interacting monomers lies at the cleft of the two protomers. The residues in this helix correspond to a flexible region in the peanut allergen Ara h 3 that encompasses a previously defined linear IgE epitope.
Crystal structure of prunin-1, a major component of the almond (Prunus dulcis) allergen amandin.

PubMed

Jin, Tengchuan; Albillos, Silvia M; Guo, Feng; Howard, Andrew; Fu, Tong-Jen; Kothary, Mahendra H; Zhang, Yu-Zhu

2009-09-23

Seed storage proteins are accumulated during seed development and act as a reserve of nutrition for seed germination and young sprout growth. Plant seeds play an important role in human nutrition by providing a relatively inexpensive source of protein. However, many plant foods contain allergenic proteins, and the number of people suffering from food allergies has increased rapidly in recent years. The 11S globulins are the most widespread seed storage proteins, present in monocotyledonous and dicotyledonous seeds as well as in gymnosperms (conifers) and other spermatophytes. This family of proteins accounts for a number of known major food allergens. They are of interest to both the public and industry due to food safety concerns. Because of the interests in the structural basis of the allergenicity of food allergens, we sought to determine the crystal structure of Pru1, the major component of the 11S storage protein from almonds. The structure was refined to 2.4 Å, and the R/Rfree for the final refined structure is 17.2/22.9. Pru1 is a hexamer made of two trimers. Most of the back-to-back trimer-trimer association was contributed by monomer-monomer interactions. An alpha helix (helix 6) at the C-terminal end of the acidic domain of one of the interacting monomers lies at the cleft of the two protomers. The residues in this helix correspond to a flexible region in the peanut allergen Ara h 3 that encompasses a previously defined linear IgE epitope.
Predicting X-ray diffuse scattering from translation–libration–screw structural ensembles

DOE Office of Scientific and Technical Information (OSTI.GOV)

Van Benschoten, Andrew H.; Afonine, Pavel V.; Terwilliger, Thomas C.

2015-07-28

A method of simulating X-ray diffuse scattering from multi-model PDB files is presented. Despite similar agreement with Bragg data, different translation–libration–screw refinement strategies produce unique diffuse intensity patterns. Identifying the intramolecular motions of proteins and nucleic acids is a major challenge in macromolecular X-ray crystallography. Because Bragg diffraction describes the average positional distribution of crystalline atoms with imperfect precision, the resulting electron density can be compatible with multiple models of motion. Diffuse X-ray scattering can reduce this degeneracy by reporting on correlated atomic displacements. Although recent technological advances are increasing the potential to accurately measure diffuse scattering, computational modeling and validation tools are still needed to quantify the agreement between experimental data and different parameterizations of crystalline disorder. A new tool, phenix.diffuse, addresses this need by employing Guinier’s equation to calculate diffuse scattering from Protein Data Bank (PDB)-formatted structural ensembles. As an example case, phenix.diffuse is applied to translation–libration–screw (TLS) refinement, which models rigid-body displacement for segments of the macromolecule. To enable the calculation of diffuse scattering from TLS-refined structures, phenix.tls-as-xyz builds multi-model PDB files that sample the underlying T, L and S tensors. In the glycerophosphodiesterase GpdQ, alternative TLS-group partitioning and different motional correlations between groups yield markedly dissimilar diffuse scattering maps with distinct implications for molecular mechanism and allostery. These methods demonstrate how, in principle, X-ray diffuse scattering could extend macromolecular structural refinement, validation and analysis.
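Guinier's equation, as commonly stated, gives the diffuse intensity at a scattering vector q as the ensemble variance of the structure factor: I_diffuse(q) = ⟨|F(q)|²⟩ − |⟨F(q)⟩|². The Python sketch below illustrates that relation on toy coordinates. It is not the phenix.diffuse implementation: unit atomic scattering factors are assumed, and the function names and input layout are hypothetical.

```python
import cmath

def structure_factor(coords, q):
    """F(q) = sum_j exp(2*pi*i * q.r_j); unit scattering factors assumed."""
    return sum(
        cmath.exp(2j * cmath.pi * (q[0] * x + q[1] * y + q[2] * z))
        for x, y, z in coords
    )

def diffuse_intensity(ensemble, q):
    """Variance of F(q) over the ensemble: <|F|^2> - |<F>|^2."""
    fs = [structure_factor(model, q) for model in ensemble]
    n = len(fs)
    mean_of_squares = sum(abs(f) ** 2 for f in fs) / n
    square_of_mean = abs(sum(fs) / n) ** 2
    return mean_of_squares - square_of_mean
```

An ensemble of identical models gives zero diffuse intensity under this relation, which is why different TLS partitionings that fit the Bragg data equally well can still be distinguished by the variance they imply.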
Statistical radii associated with amino acids to determine the contact map: fixing the structure of a type I cohesin domain in the Clostridium thermocellum cellulosome

NASA Astrophysics Data System (ADS)

Chwastyk, Mateusz; Poma Bernaola, Adolfo; Cieplak, Marek

2015-07-01

We propose to improve and simplify protein refinement procedures through consideration of which pairs of amino acid residues should form native contacts. We first consider 11 330 proteins from the CATH database to determine statistical distributions of contacts associated with a given type of amino acid. The distributions are set across the distances between the α-C atoms that are in contact. Based on this data, we determine typical radii of effective spheres that can be placed on the α-C atoms in order to reconstruct the distribution of the contact lengths. This is done by checking for overlaps with enlarged van der Waals spheres associated with heavy atoms on other amino acids. The resulting contacts can be used to identify non-native contacts that may arise during the time evolution of structure-based models. Here, the radii are used to guide reconstruction of nine missing side chains in a type I cohesin domain with the Protein Data Bank code 1AOH. We first identify the likely missing contacts and then sculpt the corresponding side chains by standard refinement tools to achieve consistency with the expected contact map. One ambiguity in refinement is resolved by determining all-atom conformational energies.
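The overlap test described above can be sketched in a few lines of Python: a residue's effective sphere, centred on its α-C atom, is checked against enlarged spheres on the heavy atoms of other residues. The radii passed in, the single enlarged heavy-atom radius and the minimum sequence separation are placeholders chosen for illustration; the statistically derived radii from the study are not reproduced here.

```python
import math

def find_contacts(residues, ca_radius, atom_radius, min_separation=3):
    """Return sorted residue-index pairs (i, j), i < j, judged in contact.

    residues: list of dicts with keys 'name', 'ca' (x, y, z) and
              'heavy' (list of (x, y, z) heavy-atom positions).
    ca_radius: dict mapping residue name to an effective alpha-C sphere
               radius (hypothetical values supplied by the caller).
    atom_radius: enlarged van der Waals radius used for every heavy atom
                 (a simplification of this sketch).
    min_separation: skip pairs closer than this along the sequence.
    """
    contacts = set()
    for i, ri in enumerate(residues):
        reach = ca_radius[ri["name"]] + atom_radius
        for j, rj in enumerate(residues):
            if abs(i - j) < min_separation:
                continue
            # Overlap: the alpha-C sphere of i touches a heavy-atom sphere of j.
            if any(math.dist(ri["ca"], atom) < reach for atom in rj["heavy"]):
                contacts.add((min(i, j), max(i, j)))
    return sorted(contacts)
```

Comparing the pairs returned for a model with an expected contact map is one way to flag missing or non-native contacts before rebuilding side chains.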
Automated main-chain model building by template matching and iterative fragment extension.

PubMed

Terwilliger, Thomas C

2003-01-01

An algorithm for the automated macromolecular model building of polypeptide backbones is described. The procedure is hierarchical. In the initial stages, many overlapping polypeptide fragments are built. In subsequent stages, the fragments are extended and then connected. Identification of the locations of helical and beta-strand regions is carried out by FFT-based template matching. Fragment libraries of helices and beta-strands from refined protein structures are then positioned at the potential locations of helices and strands and the longest segments that fit the electron-density map are chosen. The helices and strands are then extended using fragment libraries consisting of sequences three amino acids long derived from refined protein structures. The resulting segments of polypeptide chain are then connected by choosing those which overlap at two or more Cα positions. The fully automated procedure has been implemented in RESOLVE and is capable of model building at resolutions as low as 3.5 Å. The algorithm is useful for building a preliminary main-chain model that can serve as a basis for refinement and side-chain addition.
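FFT-based template matching rests on the correlation theorem: the cross-correlation of a map with a template at every offset can be obtained from one product in Fourier space. The Python sketch below shows the idea on a one-dimensional toy signal with a small radix-2 FFT. It is only an illustration of the principle; RESOLVE operates on three-dimensional electron-density maps and also searches over template orientations, and all names here are hypothetical.

```python
import cmath

def fft(x, inverse=False):
    """Recursive radix-2 FFT; len(x) must be a power of two (unnormalized)."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2], inverse)
    odd = fft(x[1::2], inverse)
    sign = 1 if inverse else -1
    out = [0j] * n
    for k in range(n // 2):
        tw = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + tw
        out[k + n // 2] = even[k] - tw
    return out

def circular_cross_correlation(density, template):
    """corr[s] = sum_m density[(m + s) % n] * template[m], computed via FFT."""
    n = len(density)
    padded = list(template) + [0.0] * (n - len(template))
    d_hat = fft([complex(v) for v in density])
    t_hat = fft([complex(v) for v in padded])
    product = [d * t.conjugate() for d, t in zip(d_hat, t_hat)]
    corr = fft(product, inverse=True)
    return [c.real / n for c in corr]  # divide by n to normalize the inverse

def best_offset(density, template):
    """Offset at which the template correlates most strongly with the map."""
    corr = circular_cross_correlation(density, template)
    return max(range(len(corr)), key=corr.__getitem__)
```

The benefit is in the scaling: all offsets are scored at a cost of order n log n rather than n times the template length, which matters when the search runs over a full three-dimensional map.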

Sampling Enrichment toward Target Structures Using Hybrid Molecular Dynamics-Monte Carlo Simulations

PubMed Central

Yang, Kecheng; Różycki, Bartosz; Cui, Fengchao; Shi, Ce; Chen, Wenduo; Li, Yunqi

2016-01-01

Sampling enrichment toward a target state, an analogue of the improvement of sampling efficiency (SE), is critical in both the refinement of protein structures and the generation of near-native structure ensembles for the exploration of structure-function relationships. We developed a hybrid molecular dynamics (MD)-Monte Carlo (MC) approach to enrich the sampling toward the target structures. In this approach, the higher SE is achieved by perturbing the conventional MD simulations with an MC structure-acceptance judgment, which is based on the degree of coincidence between the small-angle X-ray scattering (SAXS) intensity profiles of the simulation structures and that of the target structure. We found that the hybrid simulations could significantly improve SE by making the top-ranked models much closer to the target structures in both secondary and tertiary structure. Specifically, for the 20 mono-residue peptides, when the initial structures had a root-mean-squared deviation (RMSD) from the target structure smaller than 7 Å, the hybrid MD-MC simulations afforded, on average, models 0.83 Å and 1.73 Å closer in RMSD to the target than the parallel MD simulations at 310 K and 370 K, respectively. Meanwhile, the average SE values also increased by 13.2% and 15.7%. The enrichment of sampling becomes more significant when the target states are gradually detectable in the MD-MC simulations in comparison with the parallel MD simulations, providing >200% improvement in SE. We also tested the hybrid MD-MC approach on real protein systems; the results showed that the SE was improved for 3 out of 5 real proteins. Overall, this work presents an efficient way of utilizing solution SAXS to improve protein structure prediction and refinement, as well as the generation of near-native structures for function annotation. PMID:27227775
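The structure-acceptance step can be pictured as a Monte Carlo test applied to snapshots from the MD run. The abstract does not give the exact acceptance rule, so the Python sketch below assumes a standard Metropolis criterion on a mean-squared discrepancy between SAXS profiles; the discrepancy measure, the temperature-like parameter and all function names are assumptions made for illustration.

```python
import math
import random

def saxs_discrepancy(profile, target):
    """Mean squared difference between two intensity profiles sampled on
    the same q grid (a stand-in for the study's coincidence measure)."""
    return sum((a - b) ** 2 for a, b in zip(profile, target)) / len(target)

def accept_move(current, candidate, target, temperature=1.0, rng=random.random):
    """Metropolis-style judgment (assumed, not taken from the paper).

    Always accept a candidate whose profile is at least as close to the
    target; otherwise accept with probability exp(-delta / temperature).
    """
    d_current = saxs_discrepancy(current, target)
    d_candidate = saxs_discrepancy(candidate, target)
    if d_candidate <= d_current:
        return True
    return rng() < math.exp(-(d_candidate - d_current) / temperature)
```

In a hybrid loop of this kind, a rejected candidate would typically cause the simulation to restart from the last accepted structure, which is what biases sampling toward the target profile.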
Coarse Grained Model for Biological Simulations: Recent Refinements and Validation

PubMed Central

Vicatos, Spyridon; Rychkova, Anna; Mukherjee, Shayantani; Warshel, Arieh

2014-01-01

Exploring the free energy landscape of proteins and modeling the corresponding functional aspects presents a major challenge for computer simulation approaches. This challenge is due to the complexity of the landscape and the enormous computer time needed for converging simulations. The use of various simplified coarse grained (CG) models offers an effective way of sampling the landscape, but most current models are not expected to give a reliable description of protein stability and functional aspects. The main problem is associated with insufficient focus on the electrostatic features of the model. In this respect our recent CG model offers a significant advantage, as it has been refined while focusing on its electrostatic free energy. Here we review the current state of our model, describing recent refinement, extensions and validation studies while focusing on demonstrating key applications. These include studies of protein stability, extending the model to include membranes, electrolytes and electrodes, as well as studies of voltage-activated proteins, protein insertion through the translocon, the action of molecular motors and even the coupling of the stalled ribosome and the translocon. Our example illustrates the general potential of our approach in overcoming major challenges in studies of structure-function correlation in proteins and large macromolecular complexes. PMID:25050439
In-situ and real-time growth observation of high-quality protein crystals under quasi-microgravity on earth.

PubMed

Nakamura, Akira; Ohtsuka, Jun; Kashiwagi, Tatsuki; Numoto, Nobutaka; Hirota, Noriyuki; Ode, Takahiro; Okada, Hidehiko; Nagata, Koji; Kiyohara, Motosuke; Suzuki, Ei-Ichiro; Kita, Akiko; Wada, Hitoshi; Tanokura, Masaru

2016-02-26

Precise protein structure determination provides significant information on life science research, although high-quality crystals are not easily obtained. We developed a system for producing high-quality protein crystals with high throughput. Using this system, gravity-controlled crystallization is made possible by a magnetic microgravity environment. In addition, in-situ and real-time observation and time-lapse imaging of crystal growth are feasible for over 200 solution samples independently. In this paper, we also report results of crystallization experiments for two protein samples. Crystals grown in the system exhibited magnetic orientation and showed higher and more homogeneous quality compared with the control crystals. The structural analysis reveals that making use of the magnetic microgravity during the crystallization process helps us to build a well-refined protein structure model, which has no significant structural differences with a control structure. Therefore, the system contributes to improvement in efficiency of structural analysis for "difficult" proteins, such as membrane proteins and supermolecular complexes.
GPCR-ModSim: A comprehensive web based solution for modeling G-protein coupled receptors

PubMed Central

Esguerra, Mauricio; Siretskiy, Alexey; Bello, Xabier; Sallander, Jessica; Gutiérrez-de-Terán, Hugo

2016-01-01

GPCR-ModSim (http://open.gpcr-modsim.org) is a centralized and easy to use service dedicated to the structural modeling of G-protein Coupled Receptors (GPCRs). 3D molecular models can be generated from amino acid sequence by homology-modeling techniques, considering different receptor conformations. GPCR-ModSim includes a membrane insertion and molecular dynamics (MD) equilibration protocol, which can be used to refine the generated model or any GPCR structure uploaded to the server, including if desired non-protein elements such as orthosteric or allosteric ligands, structural waters or ions. We herein revise the main characteristics of GPCR-ModSim and present new functionalities. The templates used for homology modeling have been updated considering the latest structural data, with separate profile structural alignments built for inactive, partially-active and active groups of templates. We have also added the possibility to perform multiple-template homology modeling in a unique and flexible way. Finally, our new MD protocol considers a series of distance restraints derived from a recently identified conserved network of helical contacts, allowing for a smoother refinement of the generated models which is particularly advised when there is low homology to the available templates. GPCR-ModSim has been tested on the GPCR Dock 2013 competition with satisfactory results. PMID:27166369
Molecular dynamics-based refinement and validation for sub-5 Å cryo-electron microscopy maps

PubMed Central

Singharoy, Abhishek; Teo, Ivan; McGreevy, Ryan; Stone, John E; Zhao, Jianhua; Schulten, Klaus

2016-01-01

Two structure determination methods, based on the molecular dynamics flexible fitting (MDFF) paradigm, are presented that resolve sub-5 Å cryo-electron microscopy (EM) maps with either single structures or ensembles of such structures. The methods, denoted cascade MDFF and resolution exchange MDFF, sequentially re-refine a search model against a series of maps of progressively higher resolutions, which ends with the original experimental resolution. Application of sequential re-refinement enables MDFF to achieve a radius of convergence of ~25 Å demonstrated with the accurate modeling of β-galactosidase and TRPV1 proteins at 3.2 Å and 3.4 Å resolution, respectively. The MDFF refinements uniquely offer map-model validation and B-factor determination criteria based on the inherent dynamics of the macromolecules studied, captured by means of local root mean square fluctuations. The MDFF tools described are available to researchers through an easy-to-use and cost-effective cloud computing resource on Amazon Web Services. DOI: http://dx.doi.org/10.7554/eLife.16105.001 PMID:27383269
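The B-factor criterion mentioned in this abstract rests on local root mean square fluctuations (RMSF). As a rough illustration only, the sketch below computes per-atom RMSF from a set of pre-aligned frames and converts it with the standard isotropic relation B = (8π²/3)⟨u²⟩. The function names and the plain-list trajectory layout are assumptions for this example, and the paper's actual validation criteria are not reproduced here.

```python
import math


def rmsf_per_atom(frames):
    """Per-atom RMSF from a list of frames.

    Each frame is a list of (x, y, z) tuples, one per atom, and all frames
    are assumed to be already superimposed on a common reference.
    """
    n_frames = len(frames)
    n_atoms = len(frames[0])
    result = []
    for i in range(n_atoms):
        # Mean position of atom i over all frames.
        mean = [sum(f[i][k] for f in frames) / n_frames for k in range(3)]
        # Mean squared displacement from that mean position.
        msd = sum(sum((f[i][k] - mean[k]) ** 2 for k in range(3))
                  for f in frames) / n_frames
        result.append(math.sqrt(msd))
    return result


def b_factor_from_rmsf(rmsf):
    """Isotropic B-factor, B = (8 * pi^2 / 3) * rmsf^2, in the squared
    length unit of the coordinates (Å^2 for coordinates in Å)."""
    return 8.0 * math.pi ** 2 / 3.0 * rmsf ** 2


# One atom that alternates between two positions 2 Å apart has an RMSF of 1 Å.
toy_frames = [[(0.0, 0.0, 0.0)], [(2.0, 0.0, 0.0)]]
toy_rmsf = rmsf_per_atom(toy_frames)
toy_b = [b_factor_from_rmsf(r) for r in toy_rmsf]
```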
RosettaHoles: rapid assessment of protein core packing for structure prediction, refinement, design, and validation.

PubMed

Sheffler, Will; Baker, David

2009-01-01

We present a novel method called RosettaHoles for visual and quantitative assessment of underpacking in the protein core. RosettaHoles generates a set of spherical cavity balls that fill the empty volume between atoms in the protein interior. For visualization, the cavity balls are aggregated into contiguous overlapping clusters and small cavities are discarded, leaving an uncluttered representation of the unfilled regions of space in a structure. For quantitative analysis, the cavity ball data are used to estimate the probability of observing a given cavity in a high-resolution crystal structure. RosettaHoles provides excellent discrimination between real and computationally generated structures, is predictive of incorrect regions in models, identifies problematic structures in the Protein Data Bank, and promises to be a useful validation tool for newly solved experimental structures.
RosettaHoles: Rapid assessment of protein core packing for structure prediction, refinement, design, and validation

PubMed Central

Sheffler, Will; Baker, David

2009-01-01

We present a novel method called RosettaHoles for visual and quantitative assessment of underpacking in the protein core. RosettaHoles generates a set of spherical cavity balls that fill the empty volume between atoms in the protein interior. For visualization, the cavity balls are aggregated into contiguous overlapping clusters and small cavities are discarded, leaving an uncluttered representation of the unfilled regions of space in a structure. For quantitative analysis, the cavity ball data are used to estimate the probability of observing a given cavity in a high-resolution crystal structure. RosettaHoles provides excellent discrimination between real and computationally generated structures, is predictive of incorrect regions in models, identifies problematic structures in the Protein Data Bank, and promises to be a useful validation tool for newly solved experimental structures. PMID:19177366
Structure of a two-CAP-domain protein from the human hookworm parasite Necator americanus

DOE Office of Scientific and Technical Information (OSTI.GOV)

Asojo, Oluwatoyin A., E-mail: oasojo@unmc.edu

2011-05-01

The first structure of a two-CAP-domain protein, Na-ASP-1, from the major human hookworm parasite N. americanus refined to a resolution limit of 2.2 Å is presented. Major proteins secreted by the infective larval stage hookworms upon host entry include Ancylostoma secreted proteins (ASPs), which are characterized by one or two CAP (cysteine-rich secretory protein/antigen 5/pathogenesis related-1) domains. The CAP domain has been reported in diverse phylogenetically unrelated proteins, but has no confirmed function. The first structure of a two-CAP-domain protein, Na-ASP-1, from the major human hookworm parasite Necator americanus was refined to a resolution limit of 2.2 Å. The structure was solved by molecular replacement (MR) using Na-ASP-2, a one-CAP-domain ASP, as the search model. The correct MR solution could only be obtained by truncating the polyalanine model of Na-ASP-2 and removing several loops. The structure reveals two CAP domains linked by an extended loop. Overall, the carboxyl-terminal CAP domain is more similar to Na-ASP-2 than to the amino-terminal CAP domain. A large central cavity extends from the amino-terminal CAP domain to the carboxyl-terminal CAP domain, encompassing the putative CAP-binding cavity. The putative CAP-binding cavity is a characteristic cavity in the carboxyl-terminal CAP domain that contains a His and Glu pair. These residues are conserved in all single-CAP-domain proteins, but are absent in the amino-terminal CAP domain. The conserved His residues are oriented such that they appear to be capable of directly coordinating a zinc ion as observed for CAP proteins from reptile venoms. This first structure of a two-CAP-domain ASP can serve as a template for homology modeling of other two-CAP-domain proteins.
Predicting X-ray diffuse scattering from translation–libration–screw structural ensembles

PubMed Central

Van Benschoten, Andrew H.; Afonine, Pavel V.; Terwilliger, Thomas C.; Wall, Michael E.; Jackson, Colin J.; Sauter, Nicholas K.; Adams, Paul D.; Urzhumtsev, Alexandre; Fraser, James S.

2015-01-01

Identifying the intramolecular motions of proteins and nucleic acids is a major challenge in macromolecular X-ray crystallography. Because Bragg diffraction describes the average positional distribution of crystalline atoms with imperfect precision, the resulting electron density can be compatible with multiple models of motion. Diffuse X-ray scattering can reduce this degeneracy by reporting on correlated atomic displacements. Although recent technological advances are increasing the potential to accurately measure diffuse scattering, computational modeling and validation tools are still needed to quantify the agreement between experimental data and different parameterizations of crystalline disorder. A new tool, phenix.diffuse, addresses this need by employing Guinier’s equation to calculate diffuse scattering from Protein Data Bank (PDB)-formatted structural ensembles. As an example case, phenix.diffuse is applied to translation–libration–screw (TLS) refinement, which models rigid-body displacement for segments of the macromolecule. To enable the calculation of diffuse scattering from TLS-refined structures, phenix.tls_as_xyz builds multi-model PDB files that sample the underlying T, L and S tensors. In the glycerophosphodiesterase GpdQ, alternative TLS-group partitioning and different motional correlations between groups yield markedly dissimilar diffuse scattering maps with distinct implications for molecular mechanism and allostery. These methods demonstrate how, in principle, X-ray diffuse scattering could extend macromolecular structural refinement, validation and analysis. PMID:26249347
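The ensemble calculation described in this abstract can be sketched in a few lines. The sketch assumes the commonly quoted form of Guinier's equation, I_diffuse(q) = ⟨|F_n(q)|²⟩ − |⟨F_n(q)⟩|², where F_n is the structure factor of ensemble member n, and it uses a unit scattering factor for every atom. It is not the phenix.diffuse implementation, and the function names are hypothetical.

```python
import cmath


def structure_factor(coords, q):
    """F(q) = sum_j exp(i q . r_j), with a unit scattering factor per atom
    (a simplifying assumption; real calculations use atomic form factors)."""
    return sum(cmath.exp(1j * (q[0] * x + q[1] * y + q[2] * z))
               for x, y, z in coords)


def diffuse_intensity(ensemble, q):
    """Variance of the structure factor over the ensemble members:
    <|F_n|^2> - |<F_n>|^2."""
    fs = [structure_factor(model, q) for model in ensemble]
    n = len(fs)
    mean_sq = sum(abs(f) ** 2 for f in fs) / n
    mean_f = sum(fs) / n
    return mean_sq - abs(mean_f) ** 2


# Two-member toy ensemble: a single atom displaced by pi along x.
# At q = (1, 0, 0) the two structure factors are +1 and -1, so the
# average amplitude cancels and all intensity is diffuse.
toy_ensemble = [[(0.0, 0.0, 0.0)], [(cmath.pi, 0.0, 0.0)]]
toy_diffuse = diffuse_intensity(toy_ensemble, (1.0, 0.0, 0.0))
```

An ensemble of identical models gives zero diffuse intensity under this formula, which is why multi-model PDB files that sample the TLS tensors are needed as input.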
Predicting X-ray diffuse scattering from translation–libration–screw structural ensembles

DOE PAGES

Van Benschoten, Andrew H.; Afonine, Pavel V.; Terwilliger, Thomas C.; ...

2015-07-28

Identifying the intramolecular motions of proteins and nucleic acids is a major challenge in macromolecular X-ray crystallography. Because Bragg diffraction describes the average positional distribution of crystalline atoms with imperfect precision, the resulting electron density can be compatible with multiple models of motion. Diffuse X-ray scattering can reduce this degeneracy by reporting on correlated atomic displacements. Although recent technological advances are increasing the potential to accurately measure diffuse scattering, computational modeling and validation tools are still needed to quantify the agreement between experimental data and different parameterizations of crystalline disorder. A new tool, phenix.diffuse, addresses this need by employing Guinier's equation to calculate diffuse scattering from Protein Data Bank (PDB)-formatted structural ensembles. As an example case, phenix.diffuse is applied to translation–libration–screw (TLS) refinement, which models rigid-body displacement for segments of the macromolecule. To enable the calculation of diffuse scattering from TLS-refined structures, phenix.tls_as_xyz builds multi-model PDB files that sample the underlying T, L and S tensors. In the glycerophosphodiesterase GpdQ, alternative TLS-group partitioning and different motional correlations between groups yield markedly dissimilar diffuse scattering maps with distinct implications for molecular mechanism and allostery. In addition, these methods demonstrate how, in principle, X-ray diffuse scattering could extend macromolecular structural refinement, validation and analysis.
Identification of Potent Chloride Intracellular Channel Protein 1 Inhibitors from Traditional Chinese Medicine through Structure-Based Virtual Screening and Molecular Dynamics Analysis

PubMed Central

Wan, Minghui; Liao, Dongjiang; Peng, Guilin; Xu, Xin; Yin, Weiqiang; Guo, Guixin; Jiang, Funeng; Zhong, Weide

2017-01-01

Chloride intracellular channel 1 (CLIC1) is involved in the development of most aggressive human tumors, including gastric, colon, lung, liver, and glioblastoma cancers. It has become an attractive new therapeutic target for several types of cancer. In this work, we aim to identify natural products as potent CLIC1 inhibitors from Traditional Chinese Medicine (TCM) database using structure-based virtual screening and molecular dynamics (MD) simulation. First, structure-based docking was employed to screen the refined TCM database and the top 500 TCM compounds were obtained and reranked by X-Score. Then, 30 potent hits were achieved from the top 500 TCM compounds using cluster and ligand-protein interaction analysis. Finally, MD simulation was employed to validate the stability of interactions between each hit and CLIC1 protein from docking simulation, and Molecular Mechanics/Generalized Born Surface Area (MM-GBSA) analysis was used to refine the virtual hits. Six TCM compounds with top MM-GBSA scores and ideal-binding models were confirmed as the final hits. Our study provides information about the interaction between TCM compounds and CLIC1 protein, which may be helpful for further experimental investigations. In addition, the top 6 natural products structural scaffolds could serve as building blocks in designing drug-like molecules for CLIC1 inhibition. PMID:29147652
Atomistic structural ensemble refinement reveals non-native structure stabilizes a sub-millisecond folding intermediate of CheY

DOE PAGES

Shi, Jade; Nobrega, R. Paul; Schwantes, Christian; ...

2017-03-08

The dynamics of globular proteins can be described in terms of transitions between a folded native state and less-populated intermediates, or excited states, which can play critical roles in both protein folding and function. Excited states are by definition transient species, and therefore are difficult to characterize using current experimental techniques. We report an atomistic model of the excited state ensemble of a stabilized mutant of an extensively studied flavodoxin fold protein CheY. We employed a hybrid simulation and experimental approach in which an aggregate 42 milliseconds of all-atom molecular dynamics were used as an informative prior for the structure of the excited state ensemble. The resulting prior was then refined against small-angle X-ray scattering (SAXS) data employing an established method (EROS). The most striking feature of the resulting excited state ensemble was an unstructured N-terminus stabilized by non-native contacts in a conformation that is topologically simpler than the native state. We then predict incisive single molecule FRET experiments, using these results, as a means of model validation. Our study demonstrates the paradigm of uniting simulation and experiment in a statistical model to study the structure of protein excited states and rationally design validating experiments.
PaFlexPepDock: parallel ab-initio docking of peptides onto their receptors with full flexibility based on Rosetta.

PubMed

Li, Haiou; Lu, Liyao; Chen, Rong; Quan, Lijun; Xia, Xiaoyan; Lü, Qiang

2014-01-01

Structural information related to protein-peptide complexes can be very useful for novel drug discovery and design. The computational docking of protein and peptide can supplement the structural information on protein-peptide interactions obtained experimentally. Protein-peptide docking in this paper can be described as three processes that occur in parallel: ab-initio peptide folding, peptide docking with its receptor, and refinement of some flexible areas of the receptor as the peptide is approaching. Several existing methods have been used to sample the degrees of freedom in the three processes, which are usually triggered in an organized sequential scheme. In this paper, we proposed a parallel approach that combines all the three processes during the docking of a folding peptide with a flexible receptor. This approach mimics the actual protein-peptide docking process in a parallel way, and is expected to deliver better performance than sequential approaches. We used 22 unbound protein-peptide docking examples to evaluate our method. Our analysis of the results showed that the explicit refinement of the flexible areas of the receptor facilitated more accurate modeling of the interfaces of the complexes, while combining all of the moves in parallel helped the construction of energy funnels for predictions.
Strategies for carbohydrate model building, refinement and validation.

PubMed

Agirre, Jon

2017-02-01

Sugars are the most stereochemically intricate family of biomolecules and present substantial challenges to anyone trying to understand their nomenclature, reactions or branched structures. Current crystallographic programs provide an abstraction layer allowing inexpert structural biologists to build complete protein or nucleic acid model components automatically either from scratch or with little manual intervention. This is, however, still not generally true for sugars. The need for carbohydrate-specific building and validation tools has been highlighted a number of times in the past, concomitantly with the introduction of a new generation of experimental methods that have been ramping up the production of protein-sugar complexes and glycoproteins for the past decade. While some incipient advances have been made to address these demands, correctly modelling and refining carbohydrates remains a challenge. This article will address many of the typical difficulties that a structural biologist may face when dealing with carbohydrates, with an emphasis on problem solving in the resolution range where X-ray crystallography and cryo-electron microscopy are expected to overlap in the next decade.
The PDB_REDO server for macromolecular structure model optimization.

PubMed

Joosten, Robbie P; Long, Fei; Murshudov, Garib N; Perrakis, Anastassis

2014-07-01

The refinement and validation of a crystallographic structure model is the last step before the coordinates and the associated data are submitted to the Protein Data Bank (PDB). The success of the refinement procedure is typically assessed by validating the models against geometrical criteria and the diffraction data, and is an important step in ensuring the quality of the PDB public archive [Read et al. (2011), Structure, 19, 1395-1412]. The PDB_REDO procedure aims for 'constructive validation', aspiring to consistent and optimal refinement parameterization and pro-active model rebuilding, not only correcting errors but striving for optimal interpretation of the electron density. A web server for PDB_REDO has been implemented, allowing thorough, consistent and fully automated optimization of the refinement procedure in REFMAC and partial model rebuilding. The goal of the web server is to help practicing crystallographers to improve their model prior to submission to the PDB. For this, additional steps were implemented in the PDB_REDO pipeline, both in the refinement procedure, e.g. testing of resolution limits and k-fold cross-validation for small test sets, and as new validation criteria, e.g. the density-fit metrics implemented in EDSTATS and ligand validation as implemented in YASARA. Innovative ways to present the refinement and validation results to the user are also described, which together with auto-generated Coot scripts can guide users to subsequent model inspection and improvement. It is demonstrated that using the server can lead to substantial improvement of structure models before they are submitted to the PDB.
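The k-fold cross-validation step mentioned in this abstract can be pictured as splitting the reflections into k disjoint test sets, so that each reflection is held out exactly once and the free R-factor can be averaged over k refinements. The sketch below shows only the partitioning step, under that assumption. `k_fold_test_sets` is a hypothetical helper and not part of PDB_REDO or REFMAC.

```python
import random


def k_fold_test_sets(n_reflections, k, seed=0):
    """Split reflection indices 0..n_reflections-1 into k disjoint test sets
    whose sizes differ by at most one. The shuffle is seeded so the
    partition is reproducible."""
    if not 1 <= k <= n_reflections:
        raise ValueError("k must be between 1 and the number of reflections")
    indices = list(range(n_reflections))
    random.Random(seed).shuffle(indices)
    # Taking every k-th shuffled index gives k folds of near-equal size.
    return [sorted(indices[i::k]) for i in range(k)]


# For each fold, the listed reflections would be excluded from refinement
# and used only to compute that run's free R-factor.
folds = k_fold_test_sets(n_reflections=23, k=5)
```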
The PDB_REDO server for macromolecular structure model optimization

PubMed Central

Joosten, Robbie P.; Long, Fei; Murshudov, Garib N.; Perrakis, Anastassis

2014-01-01

The refinement and validation of a crystallographic structure model is the last step before the coordinates and the associated data are submitted to the Protein Data Bank (PDB). The success of the refinement procedure is typically assessed by validating the models against geometrical criteria and the diffraction data, and is an important step in ensuring the quality of the PDB public archive [Read et al. (2011), Structure, 19, 1395–1412]. The PDB_REDO procedure aims for ‘constructive validation’, aspiring to consistent and optimal refinement parameterization and pro-active model rebuilding, not only correcting errors but striving for optimal interpretation of the electron density. A web server for PDB_REDO has been implemented, allowing thorough, consistent and fully automated optimization of the refinement procedure in REFMAC and partial model rebuilding. The goal of the web server is to help practicing crystallographers to improve their model prior to submission to the PDB. For this, additional steps were implemented in the PDB_REDO pipeline, both in the refinement procedure, e.g. testing of resolution limits and k-fold cross-validation for small test sets, and as new validation criteria, e.g. the density-fit metrics implemented in EDSTATS and ligand validation as implemented in YASARA. Innovative ways to present the refinement and validation results to the user are also described, which together with auto-generated Coot scripts can guide users to subsequent model inspection and improvement. It is demonstrated that using the server can lead to substantial improvement of structure models before they are submitted to the PDB. PMID:25075342
Assessment of Detection and Refinement Strategies for de novo Protein Structures using Force Field and Statistical Potentials

DTIC Science & Technology

2007-01-01

energy landscape of real proteins. As such, real proteins may have a subtle free energy gradient toward the native that requires long folding times...some leaning, however slight, toward the lowest free-energy basin.9 One caveat in the connection between the scoring funnel and the folding funnel is... protein sets. The average DFIRE-AA scores from each cluster were ranked, and the lowest-energy conformers from each of the top 16 clusters
Improved in-cell structure determination of proteins at near-physiological concentration

PubMed Central

Ikeya, Teppei; Hanashima, Tomomi; Hosoya, Saori; Shimazaki, Manato; Ikeda, Shiro; Mishima, Masaki; Güntert, Peter; Ito, Yutaka

2016-01-01

Investigating three-dimensional (3D) structures of proteins in living cells by in-cell nuclear magnetic resonance (NMR) spectroscopy opens an avenue towards understanding the structural basis of their functions and physical properties under physiological conditions inside cells. In-cell NMR provides data at atomic resolution non-invasively, and has been used to detect protein-protein interactions, thermodynamics of protein stability, the behavior of intrinsically disordered proteins, etc. in cells. However, so far only a single de novo 3D protein structure could be determined based on data derived only from in-cell NMR. Here we introduce methods that enable in-cell NMR protein structure determination for a larger number of proteins at concentrations that approach physiological ones. The new methods comprise (1) advances in the processing of non-uniformly sampled NMR data, which reduces the measurement time for the intrinsically short-lived in-cell NMR samples, (2) automatic chemical shift assignment for obtaining an optimal resonance assignment, and (3) structure refinement with Bayesian inference, which makes it possible to calculate accurate 3D protein structures from sparse data sets of conformational restraints. As an example application we determined the structure of the B1 domain of protein G at about 250 μM concentration in living E. coli cells. PMID:27910948
RepeatsDB-lite: a web server for unit annotation of tandem repeat proteins.

PubMed

Hirsh, Layla; Paladin, Lisanna; Piovesan, Damiano; Tosatto, Silvio C E

2018-05-09

RepeatsDB-lite (http://protein.bio.unipd.it/repeatsdb-lite) is a web server for the prediction of repetitive structural elements and units in tandem repeat (TR) proteins. TRs are a widespread but poorly annotated class of non-globular proteins carrying heterogeneous functions. RepeatsDB-lite extends the prediction to all TR types and strongly improves the performance both in terms of computational time and accuracy over previous methods, with precision above 95% for solenoid structures. The algorithm exploits an improved TR unit library derived from the RepeatsDB database to perform an iterative structural search and assignment. The web interface provides tools for analyzing the evolutionary relationships between units and manually refining the prediction by changing unit positions and protein classification. An all-against-all structure-based sequence similarity matrix is calculated and visualized in real-time for every user edit. Reviewed predictions can be submitted to RepeatsDB for review and inclusion.

Molecular and Structural Characterization of the Tegumental 20.6-kDa Protein in Clonorchis sinensis as a Potential Druggable Target.

PubMed

Kim, Yu-Jung; Yoo, Won Gi; Lee, Myoung-Ro; Kang, Jung-Mi; Na, Byoung-Kuk; Cho, Shin-Hyeong; Park, Mi-Yeoun; Ju, Jung-Won

2017-03-04

The tegument, representing the membrane-bound outer surface of platyhelminth parasites, plays an important role for the regulation of the host immune response and parasite survival. A comprehensive understanding of tegumental proteins can provide drug candidates for use against helminth-associated diseases, such as clonorchiasis caused by the liver fluke Clonorchis sinensis. However, little is known regarding the physicochemical properties of C. sinensis teguments. In this study, a novel 20.6-kDa tegumental protein of the C. sinensis adult worm (CsTegu20.6) was identified and characterized by molecular and in silico methods. The complete coding sequence of 525 bp was derived from cDNA clones and encodes a protein of 175 amino acids. Homology search using BLASTX showed CsTegu20.6 identity ranging from 29% to 39% with previously-known tegumental proteins in C. sinensis. Domain analysis indicated the presence of a calcium-binding EF-hand domain containing a basic helix-loop-helix structure and a dynein light chain domain exhibiting a ferredoxin fold. We used a modified method to obtain the accurate tertiary structure of the CsTegu20.6 protein because of the unavailability of appropriate templates. The CsTegu20.6 protein sequence was split into two domains based on the disordered region, and then, the structure of each domain was modeled using I-TASSER. A final full-length structure was obtained by combining two structures and refining the whole structure. A refined CsTegu20.6 structure was used to identify a potential CsTegu20.6 inhibitor based on protein structure-compound interaction analysis. The recombinant proteins were expressed in Escherichia coli and purified by nickel-nitrilotriacetic acid affinity chromatography. In C. sinensis, CsTegu20.6 mRNAs were abundant in adult and metacercariae, but not in the egg. Immunohistochemistry revealed that CsTegu20.6 localized to the surface of the tegument in the adult fluke.
Collectively, our results contribute to a better understanding of the structural and functional characteristics of CsTegu20.6 and homologs of flukes. One compound is proposed as a putative inhibitor of CsTegu20.6 to facilitate further studies for anthelmintics.
High-resolution structures of adenylate kinase from yeast ligated with inhibitor Ap5A, showing the pathway of phosphoryl transfer.

PubMed Central

Abele, U.; Schulz, G. E.

1995-01-01

The structure of adenylate kinase from yeast ligated with the two-substrate-mimicking inhibitor Ap5A and Mg2+ has been refined to 1.96 Å resolution. In addition, the refined structure of the same complex with a bound imidazole molecule replacing Mg2+ has been determined at 1.63 Å. These structures indicate that replacing Mg2+ by imidazole disturbs the water structure and thus the complex. A comparison with the G-proteins shows that Mg2+ is exactly at the same position with respect to the phosphates. However, although the Mg2+ ligand sphere of the G-proteins is a regular octahedron containing peptide ligands, the reported adenylate kinase has no such ligands and an open octahedron leaving space for the Mg2+ to accompany the transferred phosphoryl group. A superposition of the known crystalline and therefore perturbed phosphoryl transfer geometries in the adenylate kinases demonstrates that all of them are close to the start of the forward reaction with bound ATP and AMP. Averaging all observed perturbed structures gives rise to a close approximation of the transition state, indicating in general how to establish an elusive transition state geometry. The average shows that the in-line phosphoryl transfer is associative, because there is no space for a dissociative metaphosphate intermediate. As a side result, the secondary dipole interaction in the alpha-helices of both protein structures has been quantified. PMID:7670369
High-throughput Crystallography for Structural Genomics

PubMed Central

Joachimiak, Andrzej

2009-01-01

Protein X-ray crystallography recently celebrated its 50th anniversary. The structures of myoglobin and hemoglobin determined by Kendrew and Perutz provided the first glimpses into the complex protein architecture and chemistry. Since then, the field of structural molecular biology has experienced extraordinary progress and now over 53,000 protein structures have been deposited into the Protein Data Bank. In the past decade many advances in macromolecular crystallography have been driven by world-wide structural genomics efforts. This was made possible because of third-generation synchrotron sources, structure phasing approaches using anomalous signal and cryo-crystallography. Complementary progress in molecular biology, proteomics, hardware and software for crystallographic data collection, structure determination and refinement, computer science, databases, robotics and automation improved and accelerated many processes. These advancements provide the robust foundation for structural molecular biology and assure strong contribution to science in the future. In this report we focus mainly on reviewing structural genomics high-throughput X-ray crystallography technologies and their impact. PMID:19765976
Automated determination of fibrillar structures by simultaneous model building and fiber diffraction refinement.

PubMed

Potrzebowski, Wojciech; André, Ingemar

2015-07-01

For highly oriented fibrillar molecules, three-dimensional structures can often be determined from X-ray fiber diffraction data. However, because of limited information content, structure determination and validation can be challenging. We demonstrate that automated structure determination of protein fibers can be achieved by guiding the building of macromolecular models with fiber diffraction data. We illustrate the power of our approach by determining the structures of six bacteriophage viruses de novo using fiber diffraction data alone and together with solid-state NMR data. Furthermore, we demonstrate the feasibility of molecular replacement from monomeric and fibrillar templates by solving the structure of a plant virus using homology modeling and protein-protein docking. The generated models explain the experimental data to the same degree as deposited reference structures but with improved structural quality. We also developed a cross-validation method for model selection. The results highlight the power of fiber diffraction data as structural constraints.
WeFold: A Coopetition for Protein Structure Prediction

PubMed Central

Khoury, George A.; Liwo, Adam; Khatib, Firas; Zhou, Hongyi; Chopra, Gaurav; Bacardit, Jaume; Bortot, Leandro O.; Faccioli, Rodrigo A.; Deng, Xin; He, Yi; Krupa, Pawel; Li, Jilong; Mozolewska, Magdalena A.; Sieradzan, Adam K.; Smadbeck, James; Wirecki, Tomasz; Cooper, Seth; Flatten, Jeff; Xu, Kefan; Baker, David; Cheng, Jianlin; Delbem, Alexandre C. B.; Floudas, Christodoulos A.; Keasar, Chen; Levitt, Michael; Popović, Zoran; Scheraga, Harold A.; Skolnick, Jeffrey; Crivelli, Silvia N.; Players, Foldit

2014-01-01

The protein structure prediction problem continues to elude scientists. Despite the introduction of many methods, only modest gains were made over the last decade for certain classes of prediction targets. To address this challenge, a social-media based worldwide collaborative effort, named WeFold, was undertaken by thirteen labs. During the collaboration, the labs were simultaneously competing with each other. Here, we present the first attempt at “coopetition” in scientific research applied to the protein structure prediction and refinement problems. The coopetition was made possible by allowing the participating labs to contribute different components of their protein structure prediction pipelines and create new hybrid pipelines that they tested during CASP10. This manuscript describes both successes and areas needing improvement as identified throughout the first WeFold experiment and discusses the efforts that are underway to advance this initiative. A footprint of all contributions and structures is publicly accessible at http://www.wefold.org. PMID:24677212
Computational Amide I Spectroscopy for Refinement of Disordered Peptide Ensembles: Maximum Entropy and Related Approaches

NASA Astrophysics Data System (ADS)

Reppert, Michael; Tokmakoff, Andrei

The structural characterization of intrinsically disordered peptides (IDPs) presents a challenging biophysical problem. Extreme heterogeneity and rapid conformational interconversion make traditional methods difficult to interpret. Due to its ultrafast (ps) shutter speed, Amide I vibrational spectroscopy has received considerable interest as a novel technique to probe IDP structure and dynamics. Historically, Amide I spectroscopy has been limited to delivering global secondary structural information. More recently, however, the method has been adapted to study structure at the local level through incorporation of isotope labels into the protein backbone at specific amide bonds. Thanks to the acute sensitivity of Amide I frequencies to local electrostatic interactions, particularly hydrogen bonds, spectroscopic data on isotope-labeled residues directly report on local peptide conformation. Quantitative information can be extracted using electrostatic frequency maps, which translate molecular dynamics trajectories into Amide I spectra for comparison with experiment. Here we present our recent efforts in the development of a rigorous approach to incorporating Amide I spectroscopic restraints into refined molecular dynamics structural ensembles using maximum entropy and related approaches. By combining force field predictions with experimental spectroscopic data, we construct refined structural ensembles for a family of short, strongly disordered, elastin-like peptides in aqueous solution.
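The maximum entropy idea mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a single scalar observable per conformer (for example, a predicted Amide I frequency) and a single experimental average to match. Under those assumptions the least-perturbed weights take the form w_i ∝ w0_i · exp(-λ f_i), and λ can be found by bisection. The function name and all numbers are hypothetical.

```python
import math

def reweight_max_entropy(prior, observable, target, lo=-50.0, hi=50.0, tol=1e-10):
    """Return weights w_i ~ prior_i * exp(-lam * observable_i) whose
    weighted average of `observable` equals `target`.

    The ensemble average decreases monotonically as lam grows, so a
    plain bisection on lam is enough for this toy case.
    """
    def weights(lam):
        # Subtract the largest exponent before exponentiating for stability.
        expo = [-lam * f for f in observable]
        m = max(expo)
        raw = [p * math.exp(e - m) for p, e in zip(prior, expo)]
        z = sum(raw)
        return [r / z for r in raw]

    def average(lam):
        return sum(w * f for w, f in zip(weights(lam), observable))

    if not (min(observable) < target < max(observable)):
        raise ValueError("target must lie strictly inside the observable range")

    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if average(mid) > target:
            lo = mid  # average still too high: a larger lam is needed
        else:
            hi = mid
        if hi - lo < tol:
            break
    return weights(0.5 * (lo + hi))


# Hypothetical three-conformer ensemble (frequencies in cm^-1).
prior = [0.5, 0.3, 0.2]
freqs = [1640.0, 1650.0, 1660.0]
refined = reweight_max_entropy(prior, freqs, target=1652.0)
```

Real applications restrain many observables at once and account for experimental error, which turns the one-dimensional root search into a multidimensional optimization.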
Conformational Analysis of Free and Bound Retinoic Acid

PubMed Central

Fu, Zheng; Li, Xue; Merz, Kenneth M.

2012-01-01

The conformational profiles of unbound all-trans and 9-cis retinoic acid (RA) have been determined using classical and quantum mechanical calculations. Sixty-six all-trans-RA (ATRA) and forty-eight 9-cis-RA energy minimum conformers were identified via HF/6-31G* geometry optimizations in vacuo. Their relative conformational energies were estimated utilizing the M06, M06-2x and MP2 methods combined with the 6-311+G(d,p), aug-cc-pVDZ and aug-cc-pVTZ basis sets, as well as complete basis set MP2 extrapolations using the latter two basis sets. Single-point energy calculations performed with the M06-2x density functional were found to yield similar results to MP2/CBS for the low-energy retinoic acid conformations. Not unexpectedly, the conformational propensities of retinoic acid were governed by the orientation and arrangement of the torsion angles associated with the polyene tail. We also used previously reported QM/MM X-ray refinement results on four ATRA-protein crystal structures plus one newly refined 9-cis-RA complex (PDB ID 1XDK) in order to investigate the conformational preferences of bound retinoic acid. In the re-refined RA conformers the conjugated double bonds are nearly coplanar, which is consistent with the global minimum identified by the Omega/QM method rather than the corresponding crystallographically determined conformations given in the PDB. Consequently, a 91.3% average reduction of the local strain energy in the gas phase, as well as 92.1% in PCM solvent, was observed using the QM/MM refined structures versus the PDB deposited RA conformations. These results thus demonstrate that our QM/MM X-ray refinement approach can significantly enhance the quality of X-ray crystal structures refined by conventional refinement protocols, thereby providing reliable drug-target structural information for use in structure-based drug discovery applications. PMID:22844234
Protein structure modeling and refinement by global optimization in CASP12.

PubMed

Hong, Seung Hwan; Joung, InSuk; Flores-Canales, Jose C; Manavalan, Balachandran; Cheng, Qianyi; Heo, Seungryong; Kim, Jong Yun; Lee, Sun Young; Nam, Mikyung; Joo, Keehyoung; Lee, In-Ho; Lee, Sung Jong; Lee, Jooyoung

2018-03-01

For protein structure modeling in the CASP12 experiment, we have developed a new protocol based on our previous CASP11 approach. The global optimization method of conformational space annealing (CSA) was applied to 3 stages of modeling: multiple sequence-structure alignment, three-dimensional (3D) chain building, and side-chain re-modeling. For better template selection and model selection, we updated our model quality assessment (QA) method with the newly developed SVMQA (support vector machine for quality assessment). For 3D chain building, we updated our energy function by including restraints generated from predicted residue-residue contacts. New energy terms for the predicted secondary structure and predicted solvent accessible surface area were also introduced. For difficult targets, we proposed a new method, LEEab, where the template term played a less significant role than it did in LEE, complemented by increased contributions from other terms such as the predicted contact term. For TBM (template-based modeling) targets, LEE performed better than LEEab, but for FM targets, LEEab was better. For model refinement, we modified our CASP11 molecular dynamics (MD) based protocol by using explicit solvents and tuning down restraint weights. Refinement results from MD simulations that used a new augmented statistical energy term in the force field were quite promising. Finally, when using inaccurate information (such as the predicted contacts), it was important to use the Lorentzian function for which the maximal penalty arising from wrong information is always bounded. © 2017 Wiley Periodicals, Inc.
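The closing remark about the Lorentzian function can be made concrete with a short sketch. The exact functional form used by the authors is not given in the abstract; the version below, x²/(x² + width²), is one common Lorentzian-style restraint and is an assumption, as are the function names and parameters. It shows the property the abstract relies on: a harmonic penalty grows without limit when a predicted contact is wrong, whereas the Lorentzian penalty never exceeds its weight.

```python
def harmonic_penalty(d, d0, weight=1.0):
    """Quadratic restraint: unbounded as |d - d0| grows."""
    return weight * (d - d0) ** 2

def lorentzian_penalty(d, d0, width=1.0, weight=1.0):
    """Lorentzian-style restraint (assumed form): approaches `weight`
    asymptotically, so one wrong restraint cannot dominate the energy."""
    x2 = (d - d0) ** 2
    return weight * x2 / (x2 + width ** 2)

# A predicted contact at 8 Å that is actually 20 Å apart in the model:
wrong_harmonic = harmonic_penalty(20.0, 8.0)      # 144.0
wrong_lorentzian = lorentzian_penalty(20.0, 8.0)  # stays below 1.0
```

The `width` parameter sets how quickly the penalty saturates; near d0 the function behaves like a harmonic well.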
Unraveling the meaning of chemical shifts in protein NMR.

PubMed

Berjanskii, Mark V; Wishart, David S

2017-11-01

Chemical shifts are among the most informative parameters in protein NMR. They provide a wealth of information about protein secondary and tertiary structure, protein flexibility, and protein-ligand binding. In this report, we review the progress in interpreting and utilizing protein chemical shifts that has occurred over the past 25 years, with a particular focus on the large body of work arising from our group and other Canadian NMR laboratories. More specifically, this review focuses on describing, assessing, and providing some historical context for various chemical shift-based methods to: (1) determine protein secondary and super-secondary structure; (2) derive protein torsion angles; (3) assess protein flexibility; (4) predict residue accessible surface area; (5) refine 3D protein structures; (6) determine 3D protein structures and (7) characterize intrinsically disordered proteins. This review also briefly covers some of the methods that we previously developed to predict chemical shifts from 3D protein structures and/or protein sequence data. It is hoped that this review will help to increase awareness of the considerable utility of NMR chemical shifts in structural biology and facilitate more widespread adoption of chemical-shift based methods by NMR spectroscopists, structural biologists, protein biophysicists, and biochemists worldwide. This article is part of a Special Issue entitled: Biophysics in Canada, edited by Lewis Kay, John Baenziger, Albert Berghuis and Peter Tieleman. Copyright © 2017 Elsevier B.V. All rights reserved.
Prediction of protein tertiary structure to low resolution: performance for a large and structurally diverse test set.

PubMed

Eyrich, V A; Standley, D M; Friesner, R A

1999-05-14

We report the tertiary structure predictions for 95 proteins ranging in size from 17 to 160 residues starting from known secondary structure. Predictions are obtained from global minimization of an empirical potential function followed by the application of a refined atomic overlap potential. The minimization strategy employed represents a variant of the Monte Carlo plus minimization scheme of Li and Scheraga applied to a reduced model of the protein chain. For all of the cases except beta-proteins larger than 75 residues, a native-like structure, usually 4-6 Å root-mean-square deviation from the native, is located. For beta-proteins larger than 75 residues, the energy gap between native-like structures and the lowest energy structures produced in the simulation is large, so that low RMSD structures are not generated starting from an unfolded state. This is attributed to the lack of an explicit hydrogen bond term in the potential function, which we hypothesize is necessary to stabilize large assemblies of beta-strands. Copyright 1999 Academic Press.
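The Monte Carlo plus minimization scheme named in this abstract can be sketched on a toy problem. The following is a generic illustration, not the variant used by the authors: each step perturbs the current state, relaxes it to the nearest local minimum, and applies the Metropolis criterion to the minimized energies. The one-dimensional energy function, the crude gradient-descent minimizer, and all parameter values are hypothetical stand-ins for a real potential and protein chain model.

```python
import math
import random

def local_minimize(f, x, step=0.01, iters=500, h=1e-5):
    """Crude gradient descent using a central-difference gradient."""
    for _ in range(iters):
        grad = (f(x + h) - f(x - h)) / (2 * h)
        x -= step * grad
    return x

def mc_plus_minimization(f, x0, n_steps=200, kick=2.0, kT=1.0, seed=0):
    """Perturb, minimize, then accept or reject with the Metropolis rule."""
    rng = random.Random(seed)
    x = local_minimize(f, x0)
    e = f(x)
    best_x, best_e = x, e
    for _ in range(n_steps):
        # Random kick followed by relaxation to a local minimum.
        trial = local_minimize(f, x + rng.uniform(-kick, kick))
        e_trial = f(trial)
        # Metropolis test on the *minimized* energies.
        if e_trial <= e or rng.random() < math.exp(-(e_trial - e) / kT):
            x, e = trial, e_trial
            if e < best_e:
                best_x, best_e = x, e
    return best_x, best_e

def rugged(x):
    """Toy energy landscape with many local minima."""
    return 0.1 * x * x + math.sin(3.0 * x)

best_x, best_e = mc_plus_minimization(rugged, x0=4.0)
```

Because moves are made between local minima rather than raw configurations, the search effectively walks on a staircase-like transformed landscape, which is what lets it cross barriers that trap plain minimization.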
Discriminative structural approaches for enzyme active-site prediction.

PubMed

Kato, Tsuyoshi; Nagano, Nozomi

2011-02-15

Predicting enzyme active-sites in proteins is an important issue not only for protein sciences but also for a variety of practical applications such as drug design. Because enzyme reaction mechanisms are based on the local structures of enzyme active-sites, various template-based methods that compare local structures in proteins have been developed to date. In comparing such local sites, a simple measurement, RMSD, has been used so far. This paper introduces new machine learning algorithms that refine the similarity/deviation for comparison of local structures. The similarity/deviation is applied to two types of applications, single template analysis and multiple template analysis. In the single template analysis, a single template is used as a query to search proteins for active sites, whereas a protein structure is examined as a query to discover the possible active-sites using a set of templates in the multiple template analysis. This paper experimentally illustrates that the machine learning algorithms effectively improve the similarity/deviation measurements for both the analyses.
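For reference, the baseline RMSD measure that this paper seeks to improve upon can be written in a few lines. The sketch below assumes the two atom sets are already matched one-to-one and already superimposed; optimal superposition (for example by the Kabsch algorithm) is deliberately left out, and the function name is illustrative.

```python
import math

def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equally sized lists of
    (x, y, z) tuples that are already paired and superimposed."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equally long")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))

# Hypothetical three-atom template and a copy shifted by (3, 4, 0) Å.
template = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (1.5, 1.5, 0.0)]
shifted = [(x + 3.0, y + 4.0, z) for x, y, z in template]
deviation = rmsd(template, shifted)  # 5.0
```

RMSD weights every atom equally, which is one reason a learned similarity measure can do better for active-site templates.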
Evaluating the quality of NMR structures by local density of protons.

PubMed

Ban, Yih-En Andrew; Rudolph, Johannes; Zhou, Pei; Edelsbrunner, Herbert

2006-03-01

Evaluating the quality of experimentally determined protein structural models is an essential step toward identifying potential errors and guiding further structural refinement. Herein, we report the use of proton local density as a sensitive measure to assess the quality of nuclear magnetic resonance (NMR) structures. Using 256 high-resolution crystal structures with protons added and optimized, we show that the local densities of different proton types display distinct distributions. These distributions can be characterized by statistical moments and are used to establish local density Z-scores for evaluating both global and local packing for individual protons. Analysis of 546 crystal structures at various resolutions shows that the local density Z-scores increase as the structural resolution decreases and correlate well with the ClashScore (Word et al. J Mol Biol 1999;285(4):1711-1733) generated by all atom contact analysis. Local density Z-scores for NMR structures exhibit a significantly wider range of values than for X-ray structures and demonstrate a combination of potentially problematic inflation and compression. Water-refined NMR structures show improved packing quality. Our analysis of a high-quality structural ensemble of ubiquitin refined against order parameters shows proton density distributions that correlate nearly perfectly with our standards derived from crystal structures, further validating our approach. We present an automated analysis and visualization tool for proton packing to evaluate the quality of NMR structures. © 2005 Wiley-Liss, Inc.
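The local density Z-score described here can be sketched as follows. This is a simplified illustration rather than the authors' tool: local density is taken to be the number of neighboring protons within a fixed radius, and the Z-score compares that count with a reference distribution. The radius, the reference values, and the function names are all hypothetical.

```python
import math
import statistics

def local_density(points, index, radius):
    """Count the other points within `radius` of points[index]."""
    center = points[index]
    return sum(
        1
        for j, p in enumerate(points)
        if j != index and math.dist(center, p) <= radius
    )

def z_score(value, reference_values):
    """Standard score of `value` against a reference sample."""
    mean = statistics.mean(reference_values)
    sd = statistics.stdev(reference_values)
    return (value - mean) / sd

# Hypothetical proton positions (Å) and a made-up reference distribution
# for one proton type, as would be derived from crystal structures.
protons = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (5.0, 5.0, 5.0)]
density = local_density(protons, 0, radius=1.5)   # 2 neighbors
score = z_score(12, reference_values=[8, 10, 12])  # 1.0
```

Strongly positive scores would flag over-compressed regions and strongly negative scores would flag inflated ones, matching the two failure modes noted in the abstract.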
Surface layer protein characterization by small angle x-ray scattering and a fractal mean force concept: from protein structure to nanodisk assemblies.

PubMed

Horejs, Christine; Pum, Dietmar; Sleytr, Uwe B; Peterlik, Herwig; Jungbauer, Alois; Tscheliessnig, Rupert

2010-11-07

Surface layers (S-layers) are the most commonly observed cell surface structure of prokaryotic organisms. They are made up of proteins that spontaneously self-assemble into functional crystalline lattices in solution, on various solid surfaces, and interfaces. While classical experimental techniques failed to recover a complete structural model of an unmodified S-layer protein, small angle x-ray scattering (SAXS) provides an opportunity to study the structure of S-layer monomers in solution and of self-assembled two-dimensional sheets. For the protein under investigation we recently suggested an atomistic structural model by the use of molecular dynamics simulations. This structural model is now refined on the basis of SAXS data together with a fractal assembly approach. Here we show that a nondiluted critical system of proteins, which crystallize into monomolecular structures, might be analyzed by SAXS if protein-protein interactions are taken into account by relating a fractal local density distribution to a fractal local mean potential, which has to fulfill the Poisson equation. The present work demonstrates an important step into the elucidation of the structure of S-layers and offers a tool to analyze the structure of self-assembling systems in solution by means of SAXS and computer simulations.
Surface layer protein characterization by small angle x-ray scattering and a fractal mean force concept: From protein structure to nanodisk assemblies

NASA Astrophysics Data System (ADS)

Horejs, Christine; Pum, Dietmar; Sleytr, Uwe B.; Peterlik, Herwig; Jungbauer, Alois; Tscheliessnig, Rupert

2010-11-01

Surface layers (S-layers) are the most commonly observed cell surface structure of prokaryotic organisms. They are made up of proteins that spontaneously self-assemble into functional crystalline lattices in solution, on various solid surfaces, and interfaces. While classical experimental techniques failed to recover a complete structural model of an unmodified S-layer protein, small angle x-ray scattering (SAXS) provides an opportunity to study the structure of S-layer monomers in solution and of self-assembled two-dimensional sheets. For the protein under investigation we recently suggested an atomistic structural model by the use of molecular dynamics simulations. This structural model is now refined on the basis of SAXS data together with a fractal assembly approach. Here we show that a nondiluted critical system of proteins, which crystallize into monomolecular structures, might be analyzed by SAXS if protein-protein interactions are taken into account by relating a fractal local density distribution to a fractal local mean potential, which has to fulfill the Poisson equation. The present work demonstrates an important step into the elucidation of the structure of S-layers and offers a tool to analyze the structure of self-assembling systems in solution by means of SAXS and computer simulations.
Surface layer protein characterization by small angle x-ray scattering and a fractal mean force concept: From protein structure to nanodisk assemblies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Horejs, Christine; Pum, Dietmar; Sleytr, Uwe B.

2010-11-07

Surface layers (S-layers) are the most commonly observed cell surface structure of prokaryotic organisms. They are made up of proteins that spontaneously self-assemble into functional crystalline lattices in solution, on various solid surfaces, and interfaces. While classical experimental techniques failed to recover a complete structural model of an unmodified S-layer protein, small angle x-ray scattering (SAXS) provides an opportunity to study the structure of S-layer monomers in solution and of self-assembled two-dimensional sheets. For the protein under investigation we recently suggested an atomistic structural model by the use of molecular dynamics simulations. This structural model is now refined on the basis of SAXS data together with a fractal assembly approach. Here we show that a nondiluted critical system of proteins, which crystallize into monomolecular structures, might be analyzed by SAXS if protein-protein interactions are taken into account by relating a fractal local density distribution to a fractal local mean potential, which has to fulfill the Poisson equation. The present work demonstrates an important step into the elucidation of the structure of S-layers and offers a tool to analyze the structure of self-assembling systems in solution by means of SAXS and computer simulations.
Energetically Unfavorable Amide Conformations for N6-Acetyllysine Side Chains in Refined Protein Structures

PubMed Central

Genshaft, Alexander; Moser, Joe-Ann S.; D'Antonio, Edward L.; Bowman, Christine M.; Christianson, David W.

2013-01-01

The reversible acetylation of lysine to form N6-acetyllysine in the regulation of protein function is a hallmark of epigenetics. Acetylation of the positively charged amino group of the lysine side chain generates a neutral N-alkylacetamide moiety that serves as a molecular “switch” for the modulation of protein function and protein-protein interactions. We now report the analysis of 381 N6-acetyllysine side chain amide conformations as found in 79 protein crystal structures and 11 protein NMR structures deposited in the Protein Data Bank (PDB) of the Research Collaboratory for Structural Bioinformatics. We find that only 74.3% of N6-acetyllysine residues in protein crystal structures and 46.5% in protein NMR structures contain amide groups with energetically preferred trans or generously trans conformations. Surprisingly, 17.6% of N6-acetyllysine residues in protein crystal structures and 5.3% in protein NMR structures contain amide groups with energetically unfavorable cis or generously cis conformations. Even more surprisingly, 8.1% of N6-acetyllysine residues in protein crystal structures and 48.2% in NMR structures contain amide groups with energetically prohibitive twisted conformations that approach the transition state structure for cis-trans isomerization. In contrast, 109 unique N-alkylacetamide groups contained in 84 highly-accurate small molecule crystal structures retrieved from the Cambridge Structural Database exclusively adopt energetically preferred trans conformations. Therefore, we conclude that cis and twisted N6-acetyllysine amides in protein structures deposited in the PDB are erroneously modeled due to their energetically unfavorable or prohibitive conformations. PMID:23401043
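The trans, cis, and twisted categories used in this analysis are defined by the amide torsion angle ω. A minimal sketch of such a classification is shown below. The dihedral formula is the standard four-point one; the cutoffs (|ω| ≥ 150° for trans, |ω| ≤ 30° for cis, twisted otherwise) are illustrative assumptions and are not the thresholds used in the paper, and the coordinates are made up.

```python
import math

def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def dihedral_deg(p0, p1, p2, p3):
    """Torsion angle in degrees defined by four points."""
    b1, b2, b3 = _sub(p1, p0), _sub(p2, p1), _sub(p3, p2)
    n1, n2 = _cross(b1, b2), _cross(b2, b3)
    y = math.sqrt(_dot(b2, b2)) * _dot(b1, n2)
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))

def classify_amide(omega_deg, trans_min=150.0, cis_max=30.0):
    """Label an amide by |omega|; cutoffs are assumed, not from the paper."""
    a = abs(omega_deg)
    if a >= trans_min:
        return "trans"
    if a <= cis_max:
        return "cis"
    return "twisted"

# Hypothetical planar arrangement with the outer atoms on opposite sides.
omega = dihedral_deg((-1.0, 1.0, 0.0), (0.0, 0.0, 0.0),
                     (1.0, 0.0, 0.0), (2.0, -1.0, 0.0))
label = classify_amide(omega)  # "trans"
```

For an N6-acetyllysine side chain the four atoms would be the ones spanning the amide C-N bond, read from the deposited coordinates.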
Structure of the catalytic domain of Plasmodium falciparum ARF GTPase-activating protein (ARFGAP)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cook, William J.; Senkovich, Olga; Chattopadhyay, Debasish

2012-03-26

The crystal structure of the catalytic domain of the ADP ribosylation factor GTPase-activating protein (ARFGAP) from Plasmodium falciparum has been determined and refined to 2.4 Å resolution. Multiwavelength anomalous diffraction (MAD) data were collected utilizing the Zn2+ ion bound at the zinc-finger domain and were used to solve the structure. The overall structure of the domain is similar to those of mammalian ARFGAPs. However, several amino-acid residues in the area where GAP interacts with ARF1 differ in P. falciparum ARFGAP. Moreover, a number of residues that form the dimer interface in the crystal structure are unique in P. falciparum ARFGAP.
Structural Basis for "Flip-Flop" Action of Human Pyruvate Dehydrogenase

NASA Technical Reports Server (NTRS)

Ciszak, Ewa; Korotchkina, Lioubov; Dominiak, Paulina; Sidhu, Sukhdeep; Patel, Mulchand

2003-01-01

The derivative of vitamin B1, thiamin pyrophosphate, is a cofactor of pyruvate dehydrogenase, a component enzyme of the mitochondrial pyruvate dehydrogenase multienzyme complex that plays a major role in directing energy metabolism in the cell. This cofactor is used to cleave the Cα-C(=O) bond of pyruvate followed by reductive acetyl transfer to lipoyl-dihydrolipoamide acetyltransferase. In α2β2-tetrameric human pyruvate dehydrogenase, there are two cofactor binding sites, each of them being a center of independently conducted, although highly coordinated enzymatic reactions. The dynamic nonequivalence of two, otherwise chemically equivalent, catalytic sites can now be understood based on the recently determined crystal structure of the holo-form of human pyruvate dehydrogenase at 1.95 Å resolution. The structure of pyruvate dehydrogenase was determined using a combination of MAD phasing and molecular replacement followed by rounds of torsion-angles molecular-dynamics simulated-annealing refinement. The final pyruvate dehydrogenase structure included coordinates for all protein amino acids, two cofactor molecules, two magnesium and two potassium ions, and 742 water molecules. The structure was refined to R = 0.202 and Rfree = 0.244. Our structural analysis of the enzyme folding and domain assembly identified a simple mechanism of this protein motion required for the conduct of catalytic action.
Crystal structure of the protein At3g01520, a eukaryotic universal stress protein-like protein from Arabidopsis thaliana in complex with AMP.

PubMed

Kim, Do Jin; Bitto, Eduard; Bingman, Craig A; Kim, Hyun-Jung; Han, Byung Woo; Phillips, George N

2015-07-01

Members of the universal stress protein (USP) family are conserved in a phylogenetically diverse range of prokaryotes, fungi, protists, and plants and confer abilities to respond to a wide range of environmental stresses. Arabidopsis thaliana contains 44 USP domain-containing proteins, and USP domain is found either in a small protein with unknown physiological function or in an N-terminal portion of a multi-domain protein, usually a protein kinase. Here, we report the first crystal structure of a eukaryotic USP-like protein encoded from the gene At3g01520. The crystal structure of the protein At3g01520 was determined by the single-wavelength anomalous dispersion method and refined to an R factor of 21.8% (Rfree = 26.1%) at 2.5 Å resolution. The crystal structure includes three At3g01520 protein dimers with one AMP molecule bound to each protomer, comprising a Rossmann-like α/β overall fold. The bound AMP and conservation of residues in the ATP-binding loop suggest that the protein At3g01520 also belongs to the ATP-binding USP subfamily members. © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc.
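The R factor and Rfree values quoted in crystallographic abstracts such as this one follow the standard definition R = Σ| |Fobs| − |Fcalc| | / Σ|Fobs|, with Rfree computed over a held-out set of reflections that is excluded from refinement. The sketch below illustrates that arithmetic only; it assumes the calculated amplitudes are already on the same scale as the observed ones, and the function names and amplitude values are hypothetical.

```python
def r_factor(f_obs, f_calc):
    """Crystallographic residual over paired structure-factor amplitudes."""
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator

def r_work_and_free(f_obs, f_calc, free_flags):
    """Split reflections by the free flag and return (R_work, R_free)."""
    work = [(o, c) for o, c, free in zip(f_obs, f_calc, free_flags) if not free]
    test = [(o, c) for o, c, free in zip(f_obs, f_calc, free_flags) if free]
    r_work = r_factor([o for o, _ in work], [c for _, c in work])
    r_free = r_factor([o for o, _ in test], [c for _, c in test])
    return r_work, r_free

# Made-up amplitudes; the last two reflections form the free set.
f_obs = [100.0, 200.0, 50.0, 150.0]
f_calc = [90.0, 220.0, 50.0, 120.0]
r_work, r_free = r_work_and_free(f_obs, f_calc, [False, False, True, True])
```

A gap between the two values that grows during refinement is the usual sign of overfitting, which is why both numbers are reported together.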
The flexible C-terminal arm of the Lassa arenavirus Z-protein mediates interactions with multiple binding partners.

PubMed

May, Eric R; Armen, Roger S; Mannan, Aristotle M; Brooks, Charles L

2010-08-01

The arenavirus genome encodes for a Z-protein, which contains a RING domain that coordinates two zinc ions, and has been identified as having several functional roles at various stages of the virus life cycle. Z-protein binds to multiple host proteins and has been directly implicated in the promotion of viral budding, repression of mRNA translation, and apoptosis of infected cells. Using homology models of the Z-protein from Lassa strain arenavirus, replica exchange molecular dynamics (MD) was used to refine the structures, which were then subsequently clustered. Population-weighted ensembles of low-energy cluster representatives were predicted based upon optimal agreement of the chemical shifts computed with the SPARTA program with the experimental NMR chemical shifts. A member of the refined ensemble was identified to be a potential binder of budding factor Tsg101 based on its correspondence to the structure of the HIV-1 Gag late domain when bound to Tsg101. Members of these ensembles were docked against the crystal structure of human eIF4E translation initiation factor. Two plausible binding modes emerged based upon their agreement with experimental observation, favorable interaction energies and stability during MD trajectories. Mutations to Z are proposed that would either inhibit both binding mechanisms or selectively inhibit only one mode. The C-terminal domain conformation of the most populated member of the representative ensemble shielded protein-binding recognition motifs for Tsg101 and eIF4E and represents the most populated state free in solution. We propose that C-terminal flexibility is key for mediating the different functional states of the Z-protein. (c) 2010 Wiley-Liss, Inc.

The Flexible C-terminal Arm of the Lassa Arenavirus Z-Protein Mediates Interactions with Multiple Binding Partners

PubMed Central

May, Eric R.; Armen, Roger S.; Mannan, Aristotle M.; Brooks, Charles L.

2010-01-01

The arenavirus genome encodes for a Z-protein, which contains a RING domain that coordinates two zinc ions, and has been identified as having several functional roles at various stages of the virus life cycle. Z-protein binds to multiple host proteins and has been directly implicated in the promotion of viral budding, repression of mRNA translation and apoptosis of infected cells. Using homology models of the Z-protein from Lassa strain arenavirus, replica exchange molecular dynamics were employed to refine the structures, which were then subsequently clustered. Population weighted ensembles of low energy cluster representatives were predicted based upon optimal agreement of the chemical shifts computed with the SPARTA program with the experimental NMR chemical shifts. A member of the refined ensemble was identified to be a potential binder of budding factor Tsg101 based on its correspondence to the structure of the HIV-1 Gag late domain when bound to Tsg101. Members of these ensembles were docked against the crystal structure of human eIF4E translation initiation factor. Two plausible binding modes emerged based upon their agreement with experimental observation, favorable interaction energies and stability during molecular dynamics trajectories. Mutations to Z are proposed that would either inhibit both binding mechanisms or selectively inhibit only one mode. The C-terminal domain conformation of the most populated member of the representative ensemble shielded protein binding recognition motifs for Tsg101 and eIF4E, and represents the most populated state free in solution. We propose that C-terminal flexibility is key for mediating the different functional states of the Z-protein. PMID:20544962
Three-dimensional structure of photosystem II from Thermosynechococcus elongatus in complex with terbutryn

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gabdulkhakov, A. G.; Dontsova, M. V.; Saenger, W.

Photosystem II is a key component of the photosynthetic pathway producing oxygen at the thylakoid membrane of cyanobacteria, green algae, and plants. The three-dimensional structure of photosystem II from the cyanobacterium Thermosynechococcus elongatus in a complex with herbicide terbutryn (a photosynthesis inhibitor) was determined for the first time by X-ray diffraction and refined at 3.2 Å resolution (Rfactor = 26.9%, Rfree = 29.9%, rmsd for bond lengths is 0.013 Å, and rmsd for bond angles is 2.2°). The terbutryn molecule was located in the binding pocket of the mobile plastoquinone. The atomic coordinates of the refined structure of photosystem II in a complex with terbutryn were deposited in the Protein Data Bank.
Structural Refinement of Membrane Proteins by Restrained Molecular Dynamics and Solvent Accessibility Data

PubMed Central

Sompornpisut, Pornthep; Roux, Benoît; Perozo, Eduardo

2008-01-01

We present an approach for incorporating solvent accessibility data from electron paramagnetic resonance experiments in the structural refinement of membrane proteins through restrained molecular dynamics simulations. The restraints have been parameterized from oxygen (ΠO2) and nickel-ethylenediaminediacetic acid (ΠNiEdda) collision frequencies, as indicators of lipid or aqueous exposed spin-label sites. These are enforced through interactions between a pseudoatom representation of the covalently attached nitroxide spin-label and virtual “solvent” particles corresponding to O2 and NiEdda in the surrounding environment. Interactions were computed using an empirical potential function, where the parameters have been optimized to account for the different accessibilities of the spin-label pseudoatoms to the surrounding environment. This approach, “pseudoatom-driven solvent accessibility refinement”, was validated by refolding distorted conformations of the Streptomyces lividans potassium channel (KcsA), corresponding to a range of 2–30 Å root mean-square deviations away from the native structure. Molecular dynamics simulations based on up to 58 electron paramagnetic resonance restraints derived from spin-label mutants were able to converge toward the native structure within 1–3 Å root mean-square deviations with minimal computational cost. The use of energy-based ranking and structure similarity clustering as selection criteria helped in the convergence and identification of correctly folded structures from a large number of simulations. This approach can be applied to a variety of integral membrane protein systems, regardless of oligomeric state, and should be particularly useful in calculating conformational changes from a known reference crystal structure. PMID:18676641
Physics-based method to validate and repair flaws in protein structures

PubMed Central

Martin, Osvaldo A.; Arnautova, Yelena A.; Icazatti, Alejandro A.; Scheraga, Harold A.; Vila, Jorge A.

2013-01-01

A method that makes use of information provided by the combination of 13Cα and 13Cβ chemical shifts, computed at the density functional level of theory, enables one to (i) validate, at the residue level, conformations of proteins and detect backbone or side-chain flaws by taking into account an ensemble average of chemical shifts over all of the conformations used to represent a protein, with a sensitivity of ∼90%; and (ii) provide a set of (χ1/χ2) torsional angles that leads to optimal agreement between the observed and computed 13Cα and 13Cβ chemical shifts. The method has been incorporated into the CheShift-2 protein validation Web server. To test the reliability of the provided set of (χ1/χ2) torsional angles, the side chains of all reported conformations of five NMR-determined protein models were refined by a simple routine, without using NOE-based distance restraints. The refinement of each of these five proteins leads to optimal agreement between the observed and computed 13Cα and 13Cβ chemical shifts for ∼94% of the flaws, on average, without introducing a significantly large number of violations of the NOE-based distance restraints for a distance range ≤ 0.5 Å, in which the largest number of distance violations occurs. The results of this work suggest that use of the provided set of (χ1/χ2) torsional angles together with other observables, such as NOEs, should lead to a fast and accurate refinement of the side-chain conformations of protein models. PMID:24082119
Physics-based method to validate and repair flaws in protein structures.

PubMed

Martin, Osvaldo A; Arnautova, Yelena A; Icazatti, Alejandro A; Scheraga, Harold A; Vila, Jorge A

2013-10-15

A method that makes use of information provided by the combination of 13Cα and 13Cβ chemical shifts, computed at the density functional level of theory, enables one to (i) validate, at the residue level, conformations of proteins and detect backbone or side-chain flaws by taking into account an ensemble average of chemical shifts over all of the conformations used to represent a protein, with a sensitivity of ∼90%; and (ii) provide a set of (χ1/χ2) torsional angles that leads to optimal agreement between the observed and computed 13Cα and 13Cβ chemical shifts. The method has been incorporated into the CheShift-2 protein validation Web server. To test the reliability of the provided set of (χ1/χ2) torsional angles, the side chains of all reported conformations of five NMR-determined protein models were refined by a simple routine, without using NOE-based distance restraints. The refinement of each of these five proteins leads to optimal agreement between the observed and computed 13Cα and 13Cβ chemical shifts for ∼94% of the flaws, on average, without introducing a significantly large number of violations of the NOE-based distance restraints for a distance range ≤ 0.5 Å, in which the largest number of distance violations occurs. The results of this work suggest that use of the provided set of (χ1/χ2) torsional angles together with other observables, such as NOEs, should lead to a fast and accurate refinement of the side-chain conformations of protein models.
Protein-protein structure prediction by scoring molecular dynamics trajectories of putative poses.

PubMed

Sarti, Edoardo; Gladich, Ivan; Zamuner, Stefano; Correia, Bruno E; Laio, Alessandro

2016-09-01

The prediction of protein-protein interactions and their structural configuration remains a largely unsolved problem. Most of the algorithms aimed at finding the native conformation of a protein complex starting from the structure of its monomers are based on searching the structure corresponding to the global minimum of a suitable scoring function. However, protein complexes are often highly flexible, with mobile side chains and transient contacts due to thermal fluctuations. Flexibility can be neglected if one aims at finding quickly the approximate structure of the native complex, but may play a role in structure refinement, and in discriminating solutions characterized by similar scores. We here benchmark the capability of some state-of-the-art scoring functions (BACH-SixthSense, PIE/PISA and Rosetta) in discriminating finite-temperature ensembles of structures corresponding to the native state and to non-native configurations. We produce the ensembles by running thousands of molecular dynamics simulations in explicit solvent starting from poses generated by rigid docking and optimized in vacuum. We find that while Rosetta outperformed the other two scoring functions in scoring the structures in vacuum, BACH-SixthSense and PIE/PISA perform better in distinguishing near-native ensembles of structures generated by molecular dynamics in explicit solvent. Proteins 2016; 84:1312-1320. © 2016 Wiley Periodicals, Inc.
Developing a Multiplexed Quantitative Cross-Linking Mass Spectrometry Platform for Comparative Structural Analysis of Protein Complexes.

PubMed

Yu, Clinton; Huszagh, Alexander; Viner, Rosa; Novitsky, Eric J; Rychnovsky, Scott D; Huang, Lan

2016-10-18

Cross-linking mass spectrometry (XL-MS) represents a recently popularized hybrid methodology for defining protein-protein interactions (PPIs) and analyzing structures of large protein assemblies. In particular, XL-MS strategies have been demonstrated to be effective in elucidating molecular details of PPIs at the peptide resolution, providing a complementary set of structural data that can be utilized to refine existing complex structures or direct de novo modeling of unknown protein structures. To study structural and interaction dynamics of protein complexes, quantitative cross-linking mass spectrometry (QXL-MS) strategies based on isotope-labeled cross-linkers have been developed. Although successful, these approaches are mostly limited to pairwise comparisons. In order to establish a robust workflow enabling comparative analysis of multiple cross-linked samples simultaneously, we have developed a multiplexed QXL-MS strategy, namely, QMIX (Quantitation of Multiplexed, Isobaric-labeled cross (X)-linked peptides) by integrating MS-cleavable cross-linkers with isobaric labeling reagents. This study has established a new analytical platform for quantitative analysis of cross-linked peptides, which can be directly applied for multiplexed comparisons of the conformational dynamics of protein complexes and PPIs at the proteome scale in future studies.
Scoring functions for protein-protein interactions.

PubMed

Moal, Iain H; Moretti, Rocco; Baker, David; Fernández-Recio, Juan

2013-12-01

The computational evaluation of protein-protein interactions will play an important role in organising the wealth of data being generated by high-throughput initiatives. Here we discuss future applications, report recent developments and identify areas requiring further investigation. Many functions have been developed to quantify the structural and energetic properties of interacting proteins, finding use in interrelated challenges revolving around the relationship between sequence, structure and binding free energy. These include loop modelling, side-chain refinement, docking, multimer assembly, affinity prediction, affinity change upon mutation, hotspots location and interface design. Information derived from models optimised for one of these challenges can be used to benefit the others, and can be unified within the theoretical frameworks of multi-task learning and Pareto-optimal multi-objective learning. Copyright © 2013 Elsevier Ltd. All rights reserved.
Replica Exchange Improves Sampling in Low-Resolution Docking Stage of RosettaDock

PubMed Central

Zhang, Zhe; Lange, Oliver F.

2013-01-01

Many protein-protein docking protocols are based on a shotgun approach, in which thousands of independent random-start trajectories minimize the rigid-body degrees of freedom. Another strategy is enumerative sampling as used in ZDOCK. Here, we introduce an alternative strategy, ReplicaDock, using a small number of long trajectories of temperature replica exchange. We compare replica exchange sampling as low-resolution stage of RosettaDock with RosettaDock's original shotgun sampling as well as with ZDOCK. A benchmark of 30 complexes starting from structures of the unbound binding partners shows improved performance for ReplicaDock and ZDOCK when compared to shotgun sampling at equal or less computational expense. ReplicaDock and ZDOCK consistently reach lower energies and generate significantly more near-native conformations than shotgun sampling. Accordingly, they both improve typical metrics of prediction quality of complex structures after refinement. Additionally, the refined ReplicaDock ensembles reach significantly lower interface energies and many previously hidden features of the docking energy landscape become visible when ReplicaDock is applied. PMID:24009670
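Temperature replica exchange, as named in the abstract above, runs several trajectories at different temperatures and periodically attempts to swap neighbouring replicas. The sketch below shows the standard Metropolis exchange criterion in generic form; it is not RosettaDock or ReplicaDock code, and the function names, the unit Boltzmann constant and the example energies are all assumptions made for illustration.

```python
import math
import random


def swap_probability(energy_i, energy_j, temp_i, temp_j, k_b=1.0):
    """Metropolis acceptance probability for exchanging replicas i and j.

    P = min(1, exp[(beta_i - beta_j) * (E_i - E_j)]), with beta = 1 / (k_b * T).
    k_b = 1.0 assumes energies and temperatures are in reduced units.
    """
    beta_i = 1.0 / (k_b * temp_i)
    beta_j = 1.0 / (k_b * temp_j)
    delta = (beta_i - beta_j) * (energy_i - energy_j)
    return 1.0 if delta >= 0.0 else math.exp(delta)


def attempt_swap(energy_i, energy_j, temp_i, temp_j, rng):
    """Return True if the exchange is accepted (rng is a random.Random)."""
    return rng.random() < swap_probability(energy_i, energy_j, temp_i, temp_j)


if __name__ == "__main__":
    rng = random.Random(0)
    # The cold replica (T=1) holds the higher energy, so the swap is always accepted.
    print(swap_probability(10.0, 5.0, 1.0, 2.0))
    # The cold replica already holds the lower energy: accepted only occasionally.
    print(swap_probability(5.0, 10.0, 1.0, 2.0))
    print(attempt_swap(10.0, 5.0, 1.0, 2.0, rng))
```

The criterion favours moving low-energy conformations to the colder replicas while still letting trapped conformations escape through the warmer ones, which is one plausible reading of why a few long trajectories can compete with many independent short ones.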
On the complexity of Engh and Huber refinement restraints: the angle τ as example

DOE Office of Scientific and Technical Information (OSTI.GOV)

Touw, Wouter G.; Vriend, Gert, E-mail: vriend@cmbi.ru.nl

2010-12-01

The angle τ (backbone N—Cα—C) is the most contested Engh and Huber refinement target parameter. It is shown that this parameter is ‘correct’ as a PDB-wide average, but can be improved by taking into account residue types, secondary structures and many other aspects of our knowledge of the biophysical relations between residue type and protein structure. The Engh and Huber parameters for bond lengths and bond angles have been used uncontested in macromolecular structure refinement from 1991 until very recently, despite critical discussion of their ubiquitous validity by many authors. An extensive analysis of the backbone angle τ (N—Cα—C) illustrates that the Engh and Huber parameters can indeed be improved and a recent study [Tronrud et al. (2010), Acta Cryst. D66, 834–842] confirms these ideas. However, the present study of τ shows that improving the Engh and Huber parameters will be considerably more complex than simply making the parameters a function of the backbone ϕ, ψ angles. Many other aspects, such as the cooperativity of hydrogen bonds, the bending of secondary-structure elements and a series of biophysical aspects of the 20 amino-acid types, will also need to be taken into account. Different sets of Engh and Huber parameters will be needed for conceptually different refinement programs.
PURY: a database of geometric restraints of hetero compounds for refinement in complexes with macromolecular structures.

PubMed

Andrejasic, Miha; Pražnikar, Jure; Turk, Dusan

2008-11-01

The number and variety of macromolecular structures in complex with 'hetero' ligands is growing. The need for rapid delivery of correct geometric parameters for their refinement, which is often crucial for understanding the biological relevance of the structure, is growing correspondingly. The current standard for describing protein structures is the Engh-Huber parameter set. It is an expert data set resulting from selection and analysis of the crystal structures gathered in the Cambridge Structural Database (CSD). Clearly, such a manual approach cannot be applied to the vast and ever-growing number of chemical compounds. Therefore, a database, named PURY, of geometric parameters of chemical compounds has been developed, together with a server that accesses it. PURY is a compilation of the whole CSD. It contains lists of atom classes and bonds connecting them, as well as angle, chirality, planarity and conformation parameters. The current compilation is based on CSD 5.28 and contains 1978 atom classes and 32,702 bonding, 237,068 angle, 201,860 dihedral and 64,193 improper geometric restraints. Analysis has confirmed that the restraints from the PURY database are suitable for use in macromolecular crystal structure refinement and should be of value to the crystallographic community. The database can be accessed through the web server http://pury.ijs.si/, which creates topology and parameter files from deposited coordinates in suitable forms for the refinement programs MAIN, CNS and REFMAC. In the near future, the server will move to the CSD website http://pury.ccdc.cam.ac.uk/.
Mining the protein data bank with CReF to predict approximate 3-D structures of polypeptides.

PubMed

Dorn, Márcio; de Souza, Osmar Norberto

2010-01-01

In this paper we describe CReF, a Central Residue Fragment-based method to predict approximate 3-D structures of polypeptides by mining the Protein Data Bank (PDB). The approximate predicted structures are good enough to be used as starting conformations in refinement procedures employing state-of-the-art molecular mechanics methods such as molecular dynamics simulations. CReF is very fast and we illustrate its efficacy in three case studies of polypeptides whose sizes vary from 34 to 70 amino acids. As indicated by the RMSD values, our initial results show that the predicted structures adopt the expected fold, similar to the experimental ones.
Refined structure of dimeric diphtheria toxin at 2.0 Å resolution.

PubMed Central

Bennett, M. J.; Choe, S.; Eisenberg, D.

1994-01-01

The refined structure of dimeric diphtheria toxin (DT) at 2.0 Å resolution, based on 37,727 unique reflections (F > 1 sigma (F)), yields a final R factor of 19.5% with a model obeying standard geometry. The refined model consists of 523 amino acid residues, 1 molecule of the bound dinucleotide inhibitor adenylyl 3'-5' uridine 3' monophosphate (ApUp), and 405 well-ordered water molecules. The 2.0-Å refined model reveals that the binding motif for ApUp includes residues in the catalytic and receptor-binding domains and is different from the Rossmann dinucleotide-binding fold. ApUp is bound in part by a long loop (residues 34-52) that crosses the active site. Several residues in the active site were previously identified as NAD-binding residues. Glu 148, previously identified as playing a catalytic role in ADP-ribosylation of elongation factor 2 by DT, is about 5 Å from uracil in ApUp. The trigger for insertion of the transmembrane domain of DT into the endosomal membrane at low pH may involve 3 intradomain and 4 interdomain salt bridges that will be weakened at low pH by protonation of their acidic residues. The refined model also reveals that each molecule in dimeric DT has an "open" structure unlike most globular proteins, which we call an open monomer. Two open monomers interact by "domain swapping" to form a compact, globular dimeric DT structure. The possibility that the open monomer resembles a membrane insertion intermediate is discussed. PMID:7833807
X-ray diffraction study of Penicillium vitale catalase in the complex with aminotriazole

DOE Office of Scientific and Technical Information (OSTI.GOV)

Borovik, A. A.; Grebenko, A. I.; Melik-Adamyan, V. R., E-mail: mawr@ns.crys.ras.ru

2011-07-15

The three-dimensional structure of the enzyme catalase from Penicillium vitale in a complex with the inhibitor aminotriazole was solved and refined by protein X-ray crystallography methods. An analysis of the three-dimensional structure of the complex showed that the inhibition of the enzyme occurs as a result of the covalent binding of aminotriazole to the amino-acid residue His64 in the active site of the enzyme. An investigation of the three-dimensional structure of the complex resulted in the amino-acid residues being more precisely identified. The binding sites of saccharide residues and calcium ions in the protein molecule were found.
Tertiary structure prediction and identification of druggable pocket in the cancer biomarker – Osteopontin-c

PubMed Central

2014-01-01

Background: Osteopontin (Eta, secreted sialoprotein 1, opn) is secreted from different cell types including cancer cells. Three splice variant forms namely osteopontin-a, osteopontin-b and osteopontin-c have been identified. The main astonishing feature is that osteopontin-c is found to be elevated in almost all types of cancer cells. This was the vital point to consider it for sequence analysis and structure predictions which provide ample chances for prognostic, therapeutic and preventive cancer research. Methods: Osteopontin-c gene sequence was determined from Breast Cancer sample and was translated to protein sequence. It was then analyzed using various software and web tools for binding pockets, docking and druggability analysis. Due to the lack of homologous templates, tertiary structure was predicted using ab-initio method server – I-TASSER and was evaluated after refinement using web tools. Refined structure was compared with known bone sialoprotein electron microscopic structure and docked with CD44 for binding analysis and binding pockets were identified for drug designing. Results: Signal sequence of about sixteen amino acid residues was identified using signal sequence prediction servers. Due to the absence of known structures of similar proteins, three dimensional structure of osteopontin-c was predicted using I-TASSER server. The predicted structure was refined with the help of SUMMA server and was validated using SAVES server. Molecular dynamics analysis was carried out using GROMACS software. The final model was built and was used for docking with CD44. Druggable pockets were identified using pocket energies. Conclusions: The tertiary structure of osteopontin-c was predicted successfully using the ab-initio method and the predictions showed that osteopontin-c is of fibrous nature comparable to fibronectin. Docking studies showed the significant similarities of QSAET motif in the interaction of CD44 and osteopontins between the normal and splice variant forms of osteopontins and binding pockets analyses revealed several pockets which paved the way to the identification of a druggable pocket. PMID:24401206
An approach for prominent enhancement of the quality of konjac flour: dimethyl sulfoxide as medium.

PubMed

Ye, Ting; Wang, Ling; Xu, Wei; Liu, Jinjin; Wang, Yuntao; Zhu, Kunkun; Wang, Sujuan; Li, Bin; Wang, Chao

2014-01-01

In this paper, an approach to improve several konjac flour (KF) qualities by dimethyl sulfoxide (DMSO) addition using various concentrations at different temperature levels was proposed. Also, various properties of native and refined KF, including transparency, chemical composition and rheological properties have been investigated. The results showed that the KF refined by 75% DMSO achieved 27.7% improvement in transparency, 99.7% removal of starch, 99.4% removal of soluble sugar, and 98.2% removal of protein as well as more satisfactory viscosity stability. In addition, the morphology structure of refined KF showed a significant difference compared with the native one as observed using the SEM, which is promising for further industrial application. Furthermore, the rheological properties of both native and refined konjac sols were studied and the results showed that DMSO refinement is an effective and alternative approach to improve the qualities of KF in many aspects. Copyright © 2013 Elsevier Ltd. All rights reserved.
Structure of Lmaj006129AAA, a hypothetical protein from Leishmania major

DOE Office of Scientific and Technical Information (OSTI.GOV)

Arakaki, Tracy; Le Trong, Isolde; Structural Genomics of Pathogenic Protozoa

2006-03-01

The crystal structure of a conserved hypothetical protein from L. major, Pfam sequence family PF04543, structural genomics target ID Lmaj006129AAA, has been determined at a resolution of 1.6 Å. The gene product of structural genomics target Lmaj006129 from Leishmania major codes for a 164-residue protein of unknown function. When SeMet expression of the full-length gene product failed, several truncation variants were created with the aid of Ginzu, a domain-prediction method. 11 truncations were selected for expression, purification and crystallization based upon secondary-structure elements and disorder. The structure of one of these variants, Lmaj006129AAH, was solved by multiple-wavelength anomalous diffraction (MAD) using ELVES, an automatic protein crystal structure-determination system. This model was then successfully used as a molecular-replacement probe for the parent full-length target, Lmaj006129AAA. The final structure of Lmaj006129AAA was refined to an R value of 0.185 (R_free = 0.229) at 1.60 Å resolution. Structure and sequence comparisons based on Lmaj006129AAA suggest that proteins belonging to Pfam sequence families PF04543 and PF01878 may share a common ligand-binding motif.
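The R value and R_free reported above follow the conventional crystallographic definition, R = Σ| |F_obs| − |F_calc| | / Σ|F_obs|, with R_free evaluated on reflections held out of refinement. A minimal sketch of that arithmetic is given below; the function name and the toy amplitudes are hypothetical, and the calculated amplitudes are assumed to be already scaled to the observed ones.

```python
def r_factor(f_obs, f_calc):
    """Crystallographic R factor over paired structure-factor amplitudes.

    Assumption: f_calc is already on the same scale as f_obs; real
    refinement programs fit a scale factor first.
    """
    if len(f_obs) != len(f_calc):
        raise ValueError("amplitude lists must have equal length")
    denominator = sum(abs(f) for f in f_obs)
    if denominator == 0:
        raise ValueError("observed amplitudes sum to zero")
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    return numerator / denominator


if __name__ == "__main__":
    # Toy amplitudes, invented for illustration only.
    working_obs, working_calc = [100.0, 200.0, 300.0], [90.0, 210.0, 300.0]
    print(r_factor(working_obs, working_calc))
```

Calling the same function on the held-out test reflections gives R_free; a gap between the two values, such as 0.185 versus 0.229 here, is the usual check against overfitting.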
Recent advances in automated protein design and its future challenges.

PubMed

Setiawan, Dani; Brender, Jeffrey; Zhang, Yang

2018-04-25

Protein function is determined by protein structure which is in turn determined by the corresponding protein sequence. If the rules that cause a protein to adopt a particular structure are understood, it should be possible to refine or even redefine the function of a protein by working backwards from the desired structure to the sequence. Automated protein design attempts to calculate the effects of mutations computationally with the goal of more radical or complex transformations than are accessible by experimental techniques. Areas covered: The authors give a brief overview of the recent methodological advances in computer-aided protein design, showing how methodological choices affect final design and how automated protein design can be used to address problems considered beyond traditional protein engineering, including the creation of novel protein scaffolds for drug development. Also, the authors address specifically the future challenges in the development of automated protein design. Expert opinion: Automated protein design holds potential as a protein engineering technique, particularly in cases where screening by combinatorial mutagenesis is problematic. Considering solubility and immunogenicity issues, automated protein design is initially more likely to make an impact as a research tool for exploring basic biology in drug discovery than in the design of protein biologics.
Determining crystal structures through crowdsourcing and coursework

NASA Astrophysics Data System (ADS)

Horowitz, Scott; Koepnick, Brian; Martin, Raoul; Tymieniecki, Agnes; Winburn, Amanda A.; Cooper, Seth; Flatten, Jeff; Rogawski, David S.; Koropatkin, Nicole M.; Hailu, Tsinatkeab T.; Jain, Neha; Koldewey, Philipp; Ahlstrom, Logan S.; Chapman, Matthew R.; Sikkema, Andrew P.; Skiba, Meredith A.; Maloney, Finn P.; Beinlich, Felix R. M.; Caglar, Ahmet; Coral, Alan; Jensen, Alice Elizabeth; Lubow, Allen; Boitano, Amanda; Lisle, Amy Elizabeth; Maxwell, Andrew T.; Failer, Barb; Kaszubowski, Bartosz; Hrytsiv, Bohdan; Vincenzo, Brancaccio; de Melo Cruz, Breno Renan; McManus, Brian Joseph; Kestemont, Bruno; Vardeman, Carl; Comisky, Casey; Neilson, Catherine; Landers, Catherine R.; Ince, Christopher; Buske, Daniel Jon; Totonjian, Daniel; Copeland, David Marshall; Murray, David; Jagieła, Dawid; Janz, Dietmar; Wheeler, Douglas C.; Cali, Elie; Croze, Emmanuel; Rezae, Farah; Martin, Floyd Orville; Beecher, Gil; de Jong, Guido Alexander; Ykman, Guy; Feldmann, Harald; Chan, Hugo Paul Perez; Kovanecz, Istvan; Vasilchenko, Ivan; Connellan, James C.; Borman, Jami Lynne; Norrgard, Jane; Kanfer, Jebbie; Canfield, Jeffrey M.; Slone, Jesse David; Oh, Jimmy; Mitchell, Joanne; Bishop, John; Kroeger, John Douglas; Schinkler, Jonas; McLaughlin, Joseph; Brownlee, June M.; Bell, Justin; Fellbaum, Karl Willem; Harper, Kathleen; Abbey, Kirk J.; Isaksson, Lennart E.; Wei, Linda; Cummins, Lisa N.; Miller, Lori Anne; Bain, Lyn; Carpenter, Lynn; Desnouck, Maarten; Sharma, Manasa G.; Belcastro, Marcus; Szew, Martin; Britton, Matthew; Gaebel, Matthias; Power, Max; Cassidy, Michael; Pfützenreuter, Michael; Minett, Michele; Wesselingh, Michiel; Yi, Minjune; Cameron, Neil Haydn Tormey; Bolibruch, Nicholas I.; Benevides, Noah; Kathleen Kerr, Norah; Barlow, Nova; Crevits, Nykole Krystyne; Dunn, Paul; Silveira Belo Nascimento Roque, Paulo Sergio; Riber, Peter; Pikkanen, Petri; Shehzad, Raafay; Viosca, Randy; James Fraser, Robert; Leduc, Robert; Madala, Roman; Shnider, Scott; de Boisblanc, Sharon; Butkovich, Slava; Bliven, Spencer; Hettler, Stephen; Telehany, Stephen; Schwegmann, Steven A.; Parkes, Steven; Kleinfelter, Susan C.; Michael Holst, Sven; van der Laan, T. J. A.; Bausewein, Thomas; Simon, Vera; Pulley, Warwick; Hull, William; Kim, Annes Yukyung; Lawton, Alexis; Ruesch, Amanda; Sundar, Anjali; Lawrence, Anna-Lisa; Afrin, Antara; Maheshwer, Bhargavi; Turfe, Bilal; Huebner, Christian; Killeen, Courtney Elizabeth; Antebi-Lerrman, Dalia; Luan, Danny; Wolfe, Derek; Pham, Duc; Michewicz, Elaina; Hull, Elizabeth; Pardington, Emily; Galal, Galal Osama; Sun, Grace; Chen, Grace; Anderson, Halie E.; Chang, Jane; Hewlett, Jeffrey Thomas; Sterbenz, Jennifer; Lim, Jiho; Morof, Joshua; Lee, Junho; Inn, Juyoung Samuel; Hahm, Kaitlin; Roth, Kaitlin; Nair, Karun; Markin, Katherine; Schramm, Katie; Toni Eid, Kevin; Gam, Kristina; Murphy, Lisha; Yuan, Lucy; Kana, Lulia; Daboul, Lynn; Shammas, Mario Karam; Chason, Max; Sinan, Moaz; Andrew Tooley, Nicholas; Korakavi, Nisha; Comer, Patrick; Magur, Pragya; Savliwala, Quresh; Davison, Reid Michael; Sankaran, Roshun Rajiv; Lewe, Sam; Tamkus, Saule; Chen, Shirley; Harvey, Sho; Hwang, Sin Ye; Vatsia, Sohrab; Withrow, Stefan; Luther, Tahra K.; Manett, Taylor; Johnson, Thomas James; Ryan Brash, Timothy; Kuhlman, Wyatt; Park, Yeonjung; Popović, Zoran; Baker, David; Khatib, Firas; Bardwell, James C. A.

2016-09-01

We show here that computer game players can build high-quality crystal structures. Introduction of a new feature into the computer game Foldit allows players to build and real-space refine structures into electron density maps. To assess the usefulness of this feature, we held a crystallographic model-building competition between trained crystallographers, undergraduate students, Foldit players and automatic model-building algorithms. After removal of disordered residues, a team of Foldit players achieved the most accurate structure. Analysing the target protein of the competition, YPL067C, uncovered a new family of histidine triad proteins apparently involved in the prevention of amyloid toxicity. From this study, we conclude that crystallographers can utilize crowdsourcing to interpret electron density information and to produce structure solutions of the highest quality.
Determining crystal structures through crowdsourcing and coursework.

PubMed

Horowitz, Scott; Koepnick, Brian; Martin, Raoul; Tymieniecki, Agnes; Winburn, Amanda A; Cooper, Seth; Flatten, Jeff; Rogawski, David S; Koropatkin, Nicole M; Hailu, Tsinatkeab T; Jain, Neha; Koldewey, Philipp; Ahlstrom, Logan S; Chapman, Matthew R; Sikkema, Andrew P; Skiba, Meredith A; Maloney, Finn P; Beinlich, Felix R M; Popović, Zoran; Baker, David; Khatib, Firas; Bardwell, James C A

2016-09-16

We show here that computer game players can build high-quality crystal structures. Introduction of a new feature into the computer game Foldit allows players to build and real-space refine structures into electron density maps. To assess the usefulness of this feature, we held a crystallographic model-building competition between trained crystallographers, undergraduate students, Foldit players and automatic model-building algorithms. After removal of disordered residues, a team of Foldit players achieved the most accurate structure. Analysing the target protein of the competition, YPL067C, uncovered a new family of histidine triad proteins apparently involved in the prevention of amyloid toxicity. From this study, we conclude that crystallographers can utilize crowdsourcing to interpret electron density information and to produce structure solutions of the highest quality.

Hydrogen atoms in protein structures: high-resolution X-ray diffraction structure of the DFPase

PubMed Central

2013-01-01

Background: Hydrogen atoms represent about half of the total number of atoms in proteins and are often involved in substrate recognition and catalysis. Unfortunately, X-ray protein crystallography at usual resolution fails to access directly their positioning, mainly because light atoms display weak contributions to diffraction. However, sub-Ångstrom diffraction data, careful modeling and a proper refinement strategy can allow the positioning of a significant part of the hydrogen atoms. Results: A comprehensive study on the X-ray structure of the diisopropyl-fluorophosphatase (DFPase) was performed, and the hydrogen atoms were modeled, including those of solvent molecules. This model was compared to the available neutron structure of DFPase, and differences in the protein and the active site solvation were noticed. Conclusions: A further examination of the DFPase X-ray structure provides substantial evidence for the presence of an activated water molecule, which may be an interesting piece of information with regard to the enzymatic hydrolysis mechanism. PMID:23915572
Application of Enhanced Sampling Monte Carlo Methods for High-Resolution Protein-Protein Docking in Rosetta

PubMed Central

Zhang, Zhe; Schindler, Christina E. M.; Lange, Oliver F.; Zacharias, Martin

2015-01-01

The high-resolution refinement of docked protein-protein complexes can provide valuable structural and mechanistic insight into protein complex formation complementing experiment. Monte Carlo (MC) based approaches are frequently applied to sample putative interaction geometries of proteins including also possible conformational changes of the binding partners. In order to explore efficiency improvements of the MC sampling, several enhanced sampling techniques, including temperature or Hamiltonian replica exchange and well-tempered ensemble approaches, have been combined with the MC method and were evaluated on 20 protein complexes using unbound partner structures. The well-tempered ensemble method combined with a 2-dimensional temperature and Hamiltonian replica exchange scheme (WTE-H-REMC) was identified as the most efficient search strategy. Comparison with prolonged MC searches indicates that the WTE-H-REMC approach requires approximately 5 times fewer MC steps to identify near native docking geometries compared to conventional MC searches. PMID:26053419
Structural Genomics of Bacterial Virulence Factors

DTIC Science & Technology

2006-05-01

positioned in the unit cell by Molecular Replacement (Protein Data Bank ( PDB ) ID code 1acc)6 using MOLREP, and refined with REFMAC version 5.0 (ref. 24...increase our understanding of the molecular mechanisms of pathogenicity, putting us in a stronger position to anticipate and react to emerging...term, the accumulated structural information will generate important and testable hypotheses that will increase our understanding of the molecular
Analysis of protein-protein docking decoys using interaction fingerprints: application to the reconstruction of CaM-ligand complexes.

PubMed

Uchikoga, Nobuyuki; Hirokawa, Takatsugu

2010-05-11

Protein-protein docking for proteins with large conformational changes was analyzed by using interaction fingerprints, one of the scales for measuring similarities among complex structures, utilized especially for searching near-native protein-ligand or protein-protein complex structures. Here, we have proposed a combined method for analyzing protein-protein docking by taking large conformational changes into consideration. This combined method consists of ensemble soft docking with multiple protein structures, refinement of complexes, and cluster analysis using interaction fingerprints and energy profiles. To test for the applicability of this combined method, various CaM-ligand complexes were reconstructed from the NMR structures of unbound CaM. For the purpose of reconstruction, we used three known CaM-ligands, namely, the CaM-binding peptides of cyclic nucleotide gateway (CNG), CaM kinase kinase (CaMKK) and the plasma membrane Ca2+ ATPase pump (PMCA), and thirty-one structurally diverse CaM conformations. For each ligand, 62000 CaM-ligand complexes were generated in the docking step and the relationship between their energy profiles and structural similarities to the native complex were analyzed using interaction fingerprint and RMSD. Near-native clusters were obtained in the case of CNG and CaMKK. The interaction fingerprint method discriminated near-native structures better than the RMSD method in cluster analysis. We showed that a combined method that includes the interaction fingerprint is very useful for protein-protein docking analysis of certain cases.
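An interaction fingerprint, as used above, encodes which contacts a docked complex makes, so two poses can be compared by how many contacts they share. The sketch below uses the Tanimoto coefficient on sets of contacts, a common choice for binary fingerprints; the abstract does not state which similarity measure the authors used, and the contact labels and function name here are hypothetical.

```python
def tanimoto(fingerprint_a, fingerprint_b):
    """Tanimoto similarity between two fingerprints given as sets of contacts.

    Each element is any hashable label for one interaction, for example a
    (receptor_residue, ligand_residue) pair. Returns a value in [0, 1].
    """
    a, b = set(fingerprint_a), set(fingerprint_b)
    union = a | b
    if not union:
        return 1.0  # two empty fingerprints are treated as identical
    return len(a & b) / len(union)


if __name__ == "__main__":
    # Hypothetical contacts, invented for illustration only.
    native = {("A12", "L3"), ("A15", "L3"), ("A80", "L7")}
    decoy = {("A15", "L3"), ("A80", "L7"), ("A91", "L9")}
    print(tanimoto(native, decoy))
```

Unlike RMSD, this score depends only on the interface contacts, which may help explain why the record reports better discrimination of near-native structures when large conformational changes occur away from the binding site.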
Integrating Solid State NMR and Computations in Membrane Protein Science

NASA Astrophysics Data System (ADS)

Cross, Timothy

2015-03-01

Helical membrane protein structures are influenced by their native environment. Therefore, the characterization of their structure in an environment that models their native environment as closely as possible is critical for achieving not only structural but also functional understanding of these proteins. Solid state NMR spectroscopy in liquid crystalline lipid bilayers provides an excellent tool for such characterizations. Two classes of restraints can be obtained: absolute restraints, which constrain the structure to a laboratory frame of reference when using uniformly oriented samples (approximately 1° of mosaic spread), and relative restraints, which restrain one part of the structure with respect to another part, such as torsional and distance restraints. Here, I will discuss unique restraints derived from uniformly oriented samples and the characterization of initial structures utilizing both restraint types, followed by restrained molecular dynamics refinement in the same lipid bilayer environment as that used for the experimental restraint collection. Protein examples will be taken from Influenza virus and Mycobacterium tuberculosis. When available, comparisons of structures to those obtained using different membrane mimetic environments will be shown and the causes for structural distortions explained based on an understanding of membrane biophysics and its sophisticated influence on membrane proteins.
Intrinsic disorder in pathogen effectors: protein flexibility as an evolutionary hallmark in a molecular arms race.

PubMed

Marín, Macarena; Uversky, Vladimir N; Ott, Thomas

2013-09-01

Effector proteins represent a refined mechanism of bacterial pathogens to overcome plants' innate immune systems. These modular proteins often manipulate host physiology by directly interfering with immune signaling of plant cells. Even if host cells have developed efficient strategies to perceive the presence of pathogenic microbes and to recognize intracellular effector activity, it remains an open question why only few effectors are recognized directly by plant resistance proteins. Based on in-silico genome-wide surveys and a reevaluation of published structural data, we estimated that bacterial effectors of phytopathogens are highly enriched in long-disordered regions (>50 residues). These structurally flexible segments have no secondary structure under physiological conditions but can fold in a stimulus-dependent manner (e.g., during protein-protein interactions). The high abundance of intrinsic disorder in effectors strongly suggests positive evolutionary selection of this structural feature and highlights the dynamic nature of these proteins. We postulate that such structural flexibility may be essential for (1) effector translocation, (2) evasion of the innate immune system, and (3) host function mimicry. The study of these dynamical regions will greatly complement current structural approaches to understand the molecular mechanisms of these proteins and may help in the prediction of new effectors.
A new crystal form of Aspergillus oryzae catechol oxidase and evaluation of copper site structures in coupled binuclear copper enzymes.

PubMed

Penttinen, Leena; Rutanen, Chiara; Saloheimo, Markku; Kruus, Kristiina; Rouvinen, Juha; Hakulinen, Nina

2018-01-01

Coupled binuclear copper (CBC) enzymes have a conserved type 3 copper site that binds molecular oxygen to oxidize various mono- and diphenolic compounds. In this study, we found a new crystal form of catechol oxidase from Aspergillus oryzae (AoCO4) and solved two new structures from two different crystals at 1.8-Å and at 2.5-Å resolutions. These structures showed different copper site forms (met/deoxy and deoxy) and also differed from the copper site observed in the previously solved structure of AoCO4. We also analysed the electron density maps of all of the 56 CBC enzyme structures available in the protein data bank (PDB) and found that many of the published structures have vague copper sites. Some of the copper sites were then re-refined to find a better fit to the observed electron density. General problems in the refinement of metalloproteins and metal centres are discussed.
Structural analysis of a set of proteins resulting from a bacterial genomics project.

PubMed

Badger, J; Sauder, J M; Adams, J M; Antonysamy, S; Bain, K; Bergseid, M G; Buchanan, S G; Buchanan, M D; Batiyenko, Y; Christopher, J A; Emtage, S; Eroshkina, A; Feil, I; Furlong, E B; Gajiwala, K S; Gao, X; He, D; Hendle, J; Huber, A; Hoda, K; Kearins, P; Kissinger, C; Laubert, B; Lewis, H A; Lin, J; Loomis, K; Lorimer, D; Louie, G; Maletic, M; Marsh, C D; Miller, I; Molinari, J; Muller-Dieckmann, H J; Newman, J M; Noland, B W; Pagarigan, B; Park, F; Peat, T S; Post, K W; Radojicic, S; Ramos, A; Romero, R; Rutter, M E; Sanderson, W E; Schwinn, K D; Tresser, J; Winhoven, J; Wright, T A; Wu, L; Xu, J; Harris, T J R

2005-09-01

The targets of the Structural GenomiX (SGX) bacterial genomics project were proteins conserved in multiple prokaryotic organisms with no obvious sequence homolog in the Protein Data Bank of known structures. The outcome of this work was 80 structures, covering 60 unique sequences and 49 different genes. Experimental phase determination from proteins incorporating Se-Met was carried out for 45 structures with most of the remainder solved by molecular replacement using members of the experimentally phased set as search models. An automated tool was developed to deposit these structures in the Protein Data Bank, along with the associated X-ray diffraction data (including refined experimental phases) and experimentally confirmed sequences. BLAST comparisons of the SGX structures with structures that had appeared in the Protein Data Bank over the intervening 3.5 years since the SGX target list had been compiled identified homologs for 49 of the 60 unique sequences represented by the SGX structures. This result indicates that, for bacterial structures that are relatively easy to express, purify, and crystallize, the structural coverage of gene space is proceeding rapidly. More distant sequence-structure relationships between the SGX and PDB structures were investigated using PDB-BLAST and Combinatorial Extension (CE). Only one structure, SufD, has a truly unique topology compared to all folds in the PDB. Copyright 2005 Wiley-Liss, Inc.
Evaluation of protein-protein docking model structures using all-atom molecular dynamics simulations combined with the solution theory in the energy representation

NASA Astrophysics Data System (ADS)

Takemura, Kazuhiro; Guo, Hao; Sakuraba, Shun; Matubayasi, Nobuyuki; Kitao, Akio

2012-12-01

We propose a method to evaluate binding free energy differences among distinct protein-protein complex model structures through all-atom molecular dynamics simulations in explicit water using the solution theory in the energy representation. Complex model structures are generated from a pair of monomeric structures using the rigid-body docking program ZDOCK. After structure refinement by side chain optimization and all-atom molecular dynamics simulations in explicit water, complex models are evaluated based on the sum of their conformational and solvation free energies, the latter calculated from the energy distribution functions obtained from relatively short molecular dynamics simulations of the complex in water and of pure water based on the solution theory in the energy representation. We examined protein-protein complex model structures of two protein-protein complex systems, bovine trypsin/CMTI-1 squash inhibitor (PDB ID: 1PPE) and RNase SA/barstar (PDB ID: 1AY7), for which both complex and monomer structures were determined experimentally. For each system, we calculated the energies for the crystal complex structure and twelve generated model structures including the model most similar to the crystal structure and very different from it. In both systems, the sum of the conformational and solvation free energies tended to be lower for the structure similar to the crystal. We concluded that our energy calculation method is useful for selecting low energy complex models similar to the crystal structure from among a set of generated models.
Accessing protein conformational ensembles using room-temperature X-ray crystallography

PubMed Central

Fraser, James S.; van den Bedem, Henry; Samelson, Avi J.; Lang, P. Therese; Holton, James M.; Echols, Nathaniel; Alber, Tom

2011-01-01

Modern protein crystal structures are based nearly exclusively on X-ray data collected at cryogenic temperatures (generally 100 K). The cooling process is thought to introduce little bias in the functional interpretation of structural results, because cryogenic temperatures minimally perturb the overall protein backbone fold. In contrast, here we show that flash cooling biases previously hidden structural ensembles in protein crystals. By analyzing available data for 30 different proteins using new computational tools for electron-density sampling, model refinement, and molecular packing analysis, we found that crystal cryocooling remodels the conformational distributions of more than 35% of side chains and eliminates packing defects necessary for functional motions. In the signaling switch protein, H-Ras, an allosteric network consistent with fluctuations detected in solution by NMR was uncovered in the room-temperature, but not the cryogenic, electron-density maps. These results expose a bias in structural databases toward smaller, overpacked, and unrealistically unique models. Monitoring room-temperature conformational ensembles by X-ray crystallography can reveal motions crucial for catalysis, ligand binding, and allosteric regulation. PMID:21918110
Principles of protein folding--a perspective from simple exact models.

PubMed Central

Dill, K. A.; Bromberg, S.; Yue, K.; Fiebig, K. M.; Yee, D. P.; Thomas, P. D.; Chan, H. S.

1995-01-01

General principles of protein structure, stability, and folding kinetics have recently been explored in computer simulations of simple exact lattice models. These models represent protein chains at a rudimentary level, but they involve few parameters, approximations, or implicit biases, and they allow complete explorations of conformational and sequence spaces. Such simulations have resulted in testable predictions that are sometimes unanticipated: The folding code is mainly binary and delocalized throughout the amino acid sequence. The secondary and tertiary structures of a protein are specified mainly by the sequence of polar and nonpolar monomers. More specific interactions may refine the structure, rather than dominate the folding code. Simple exact models can account for the properties that characterize protein folding: two-state cooperativity, secondary and tertiary structures, and multistage folding kinetics--fast hydrophobic collapse followed by slower annealing. These studies suggest the possibility of creating "foldable" chain molecules other than proteins. The encoding of a unique compact chain conformation may not require amino acids; it may require only the ability to synthesize specific monomer sequences in which at least one monomer type is solvent-averse. PMID:7613459
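The "simple exact models" described above can be illustrated with a two-letter hydrophobic/polar (HP) chain on a square lattice. The sketch below is an assumption-laden toy, not the authors' code: it takes a 2D lattice, an energy of -1 per non-bonded H-H contact, and a hypothetical helper named `hp_energy`.

```python
def hp_energy(sequence, coords):
    """Energy of an HP chain on a 2D square lattice.

    sequence: string of 'H' (nonpolar) and 'P' (polar) monomers.
    coords:   list of (x, y) lattice sites, one per monomer, in chain order.
    Each pair of H monomers on adjacent sites that are not chain
    neighbours contributes -1 (assumed contact energy).
    """
    if len(sequence) != len(coords):
        raise ValueError("sequence and coords must have the same length")
    if len(set(coords)) != len(coords):
        raise ValueError("conformation must be self-avoiding")
    index_at = {site: i for i, site in enumerate(coords)}
    energy = 0
    for i, (x, y) in enumerate(coords):
        if sequence[i] != "H":
            continue
        # Look only right and up so every lattice contact is counted once.
        for site in ((x + 1, y), (x, y + 1)):
            j = index_at.get(site)
            if j is not None and sequence[j] == "H" and abs(i - j) > 1:
                energy -= 1
    return energy


compact = [(0, 0), (1, 0), (1, 1), (0, 1)]   # chain folded into a square
extended = [(0, 0), (1, 0), (2, 0), (3, 0)]  # fully stretched chain
```

For the sequence `HPPH`, the compact conformation brings the two H monomers into contact while the extended one does not, which is the hydrophobic-collapse driving force in miniature. Because the lattice is discrete, every conformation of a short chain can be enumerated, which is what makes such models "exact".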
The Leptospiral Antigen Lp49 is a Two-Domain Protein with Putative Protein Binding Function

DOE Office of Scientific and Technical Information (OSTI.GOV)

Oliveira Giuseppe, P.; Oliveira Neves, F.; Nascimento, A.

2008-01-01

Pathogenic Leptospira is the etiological agent of leptospirosis, a life-threatening disease that affects populations worldwide. Currently available vaccines have limited effectiveness and therapeutic interventions are complicated by the difficulty in making an early diagnosis of leptospirosis. The genome of Leptospira interrogans was recently sequenced and comparative genomic analysis contributed to the identification of surface antigens, potential candidates for development of new vaccines and serodiagnosis. Lp49 is a membrane-associated protein recognized by antibodies present in sera from early and convalescent phases of leptospirosis patients. Its crystal structure was determined by single-wavelength anomalous diffraction using selenomethionine-labelled crystals and refined at 2.0 Å resolution. Lp49 is composed of two domains and belongs to the all-beta-proteins class. The N-terminal domain folds in an immunoglobulin-like beta-sandwich structure, whereas the C-terminal domain presents a seven-bladed beta-propeller fold. Structural analysis of Lp49 indicates putative protein-protein binding sites, suggesting a role in Leptospira-host interaction. This is the first crystal structure of a leptospiral antigen described to date.
Modeling of the structure of ribosomal protein L1 from the archaeon Haloarcula marismortui

NASA Astrophysics Data System (ADS)

Nevskaya, N. A.; Kljashtorny, V. G.; Vakhrusheva, A. V.; Garber, M. B.; Nikonov, S. V.

2017-07-01

The halophilic archaeon Haloarcula marismortui proliferates in the Dead Sea at extremely high salt concentrations (higher than 3 M). This is the only archaeon for which the crystal structure of the ribosomal 50S subunit was determined. However, the structure of the functionally important side protuberance containing the abnormally negatively charged protein L1 (HmaL1) was not visualized. Attempts to crystallize HmaL1 in the isolated state or as its complex with RNA using normal salt concentrations (≤500 mM) failed. A theoretical model of HmaL1 was built based on the structural data for homologs of the protein L1 from other organisms, and this model was refined by molecular dynamics methods. Analysis of this model showed that the protein HmaL1 can undergo aggregation due to the presence of a cluster of positive charges unique for proteins L1. This cluster is located at the RNA-protein interface, which interferes with the crystallization of HmaL1 and the binding of the latter to RNA.
A Generic Force Field for Protein Coarse-Grained Molecular Dynamics Simulation

PubMed Central

Gu, Junfeng; Bai, Fang; Li, Honglin; Wang, Xicheng

2012-01-01

Coarse-grained (CG) force fields have become promising tools for studies of protein behavior, but the balance of speed and accuracy is still a challenge in the research of protein coarse-graining methodology. In this work, 20 CG beads have been designed based on the structures of amino acid residues, with which an amino acid can be represented by one or two beads, and a CG solvent model with five water molecules was adopted to ensure consistency with the protein CG beads. The internal interactions in protein were classified according to the types of the interacting CG beads, and adequate potential functions were chosen and systematically parameterized to fit the energy distributions. The proposed CG force field has been tested on eight proteins, and each protein was simulated for 1000 ns. Even without any extra structure knowledge of the simulated proteins, the Cα root mean square deviations (RMSDs) with respect to their experimental structures are close to those of relatively short all-atom molecular dynamics simulations. However, our coarse-grained force field will require further refinement to improve agreement with and persistence of native-like structures. In addition, the root mean square fluctuations (RMSFs) relative to the average structures derived from the simulations show that the conformational fluctuations of the proteins can be sampled. PMID:23203075
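The RMSF relative to the average structure mentioned above can be sketched in a few lines. This is an illustrative helper (the name `rmsf` and the input layout are assumptions), and it presumes the trajectory frames have already been superimposed on a common reference, which a real analysis would have to do first.

```python
import math


def rmsf(frames):
    """Per-bead root mean square fluctuation about the average structure.

    frames: list of frames; each frame is a list of (x, y, z) tuples,
            one per bead, already aligned to a common reference.
    """
    n_frames = len(frames)
    n_beads = len(frames[0])
    result = []
    for i in range(n_beads):
        # Average position of bead i over the trajectory.
        mean = [sum(frame[i][k] for frame in frames) / n_frames for k in range(3)]
        # Mean squared displacement from that average position.
        msd = sum(
            sum((frame[i][k] - mean[k]) ** 2 for k in range(3)) for frame in frames
        ) / n_frames
        result.append(math.sqrt(msd))
    return result


# Two-frame toy trajectory: bead 0 is static, bead 1 moves along x.
toy = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0)],
]
fluctuations = rmsf(toy)
```

High RMSF values typically flag loops and termini, while low values mark the stable core, so the profile gives a quick check of whether a coarse-grained model samples plausible flexibility.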
Assessing food allergy risks from residual peanut protein in highly refined vegetable oil.

PubMed

Blom, W Marty; Kruizinga, Astrid G; Rubingh, Carina M; Remington, Ben C; Crevel, René W R; Houben, Geert F

2017-08-01

Refined vegetable oils including refined peanut oil are widely used in foods. Due to shared production processes, refined non-peanut vegetable oils can contain residual peanut proteins. We estimated the predicted number of allergic reactions to residual peanut proteins using probabilistic risk assessment applied to several scenarios involving food products made with vegetable oils. Variables considered were: a) the estimated production scale of refined peanut oil, b) estimated cross-contact between refined vegetable oils during production, c) the proportion of fat in representative food products and d) the peanut protein concentration in refined peanut oil. For all products examined the predicted risk of objective allergic reactions in peanut-allergic users of the food products was extremely low. The number of predicted reactions ranged, depending on the model, from a high of 3 per 1000 eating occasions (Weibull) to no reactions (LogNormal). Significantly, all reactions were predicted for allergen intakes well below the amounts reported for the most sensitive individual described in the clinical literature. We conclude that the health risk from cross-contact between vegetable oils and refined peanut oil is negligible. None of the food products would warrant precautionary labelling for peanut according to the VITAL® programme of the Allergen Bureau. Copyright © 2017 Elsevier Ltd. All rights reserved.
Comparative analysis of three-dimensional structures of homodimers of uridine phosphorylase from Salmonella typhimurium in the unligated state and in a complex with potassium ion

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lashkov, A. A.; Zhukhlistova, N. E.; Gabdulkhakov, A. G.

2009-03-15

The spatial organization of the homodimer of unligated uridine phosphorylase from Salmonella typhimurium (StUPh) was determined with high accuracy. The structure was refined at 1.80 Å resolution to Rwork = 16.1% and Rfree = 20.0%. The rms deviations for the bond lengths, bond angles, and chiral angles are 0.006 Å, 1.042°, and 0.071°, respectively. The coordinate error estimated by the Luzzati plot is 0.166 Å. The coordinate error based on the maximum likelihood is 0.199 Å. A comparative analysis of the spatial organization of the homodimer in two independently refined structures and the structure of the StUPh homodimer in the complex with a K+ ion was performed. The substrate-binding sites in the StUPh homodimers in the unligated state were found to act asynchronously. In the presence of a potassium ion, the three-dimensional structures of the subunits in the homodimer are virtually identical, which is apparently of importance for the synchronous action of both substrate-binding sites. The atomic coordinates of the refined structure of the homodimer and structure factors have been deposited in the Protein Data Bank (PDB ID code 3DPS).
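For readers unfamiliar with the R values quoted above, the sketch below shows the standard crystallographic residual, R = Σ| |Fobs| - |Fcalc| | / Σ|Fobs|. R_work is this quantity over the reflections used in refinement and R_free over a held-out test set. The amplitudes and the `r_factor` helper are invented for illustration, and the calculated amplitudes are assumed to be already on the same scale as the observed ones.

```python
def r_factor(f_obs, f_calc):
    """Crystallographic residual between observed and calculated amplitudes."""
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator


# Made-up amplitudes; 'free' marks reflections excluded from refinement.
reflections = [
    # (F_obs, F_calc, free)
    (100.0, 90.0, False),
    (50.0, 55.0, False),
    (25.0, 25.0, False),
    (40.0, 30.0, True),
    (60.0, 70.0, True),
]
work = [(o, c) for o, c, free in reflections if not free]
test = [(o, c) for o, c, free in reflections if free]

r_work = r_factor(*zip(*work))
r_free = r_factor(*zip(*test))
```

Because the test-set reflections never influence the model, a gap between the two values that grows during refinement is a common warning sign of overfitting.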
Rapid and reliable protein structure determination via chemical shift threading.

PubMed

Hafsa, Noor E; Berjanskii, Mark V; Arndt, David; Wishart, David S

2018-01-01

Protein structure determination using nuclear magnetic resonance (NMR) spectroscopy can be both time-consuming and labor intensive. Here we demonstrate how chemical shift threading can permit rapid, robust, and accurate protein structure determination using only chemical shift data. Threading is a relatively old bioinformatics technique that uses a combination of sequence information and predicted (or experimentally acquired) low-resolution structural data to generate high-resolution 3D protein structures. The key motivations behind using NMR chemical shifts for protein threading lie in the fact that they are easy to measure, they are available prior to 3D structure determination, and they contain vital structural information. The method we have developed uses not only sequence and chemical shift similarity but also chemical shift-derived secondary structure, shift-derived super-secondary structure, and shift-derived accessible surface area to generate a high quality protein structure regardless of the sequence similarity (or lack thereof) to a known structure already in the PDB. The method (called E-Thrifty) was found to be very fast (often < 10 min/structure) and to significantly outperform other shift-based or threading-based structure determination methods (in terms of top template model accuracy), with an average TM-score performance of 0.68 (vs. 0.50-0.62 for other methods). Coupled with recent developments in chemical shift refinement, these results suggest that protein structure determination, using only NMR chemical shifts, is becoming increasingly practical and reliable. E-Thrifty is available as a web server at http://ethrifty.ca.
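The TM-score used above as the accuracy measure can be sketched as follows. The snippet evaluates the score for one given superposition, using per-residue Cα distances supplied by the caller; the full TM-score takes the maximum over superpositions, which is not attempted here. The function names and the 0.5 Å floor on d0 for short chains are assumptions of this sketch.

```python
def tm_d0(target_length):
    """Length-dependent distance scale d0 (in Å) used by the TM-score."""
    if target_length <= 15:
        return 0.5  # assumed floor; the cube-root formula is undefined here
    return max(0.5, 1.24 * (target_length - 15) ** (1.0 / 3.0) - 1.8)


def tm_score(distances, target_length):
    """TM-score for a fixed superposition.

    distances:     Cα-Cα distances (Å) for the aligned residue pairs.
    target_length: number of residues in the target (native) structure.
    """
    d0 = tm_d0(target_length)
    return sum(1.0 / (1.0 + (d / d0) ** 2) for d in distances) / target_length


perfect = tm_score([0.0] * 100, 100)       # every residue superimposes exactly
half_aligned = tm_score([0.0] * 50, 100)   # only half of the target is covered
```

Normalising by the target length rather than by the number of aligned pairs is what penalises partial models, and the d0 scaling keeps scores comparable across proteins of different size.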
Crystallographic and Computational Analyses of AUUCU Repeating RNA That Causes Spinocerebellar Ataxia Type 10 (SCA10).

PubMed

Park, HaJeung; González, Àlex L; Yildirim, Ilyas; Tran, Tuan; Lohman, Jeremy R; Fang, Pengfei; Guo, Min; Disney, Matthew D

2015-06-23

Spinocerebellar ataxia type 10 (SCA10) is caused by a pentanucleotide repeat expansion of r(AUUCU) within intron 9 of the ATXN10 pre-mRNA. The RNA causes disease by a gain-of-function mechanism in which it inactivates proteins involved in RNA biogenesis. Spectroscopic studies showed that r(AUUCU) repeats form a hairpin structure; however, there were no high-resolution structural models prior to this work. Herein, we report the first crystal structure of model r(AUUCU) repeats refined to 2.8 Å and analysis of the structure via molecular dynamics simulations. The r(AUUCU) tracts adopt an overall A-form geometry in which 3 × 3 nucleotide (5')UCU(3')/(3')UCU(5') internal loops are closed by AU pairs. Helical parameters of the refined structure as well as the corresponding electron density map on the crystallographic model reflect dynamic features of the internal loop. The computational analyses captured dynamic motion of the loop closing pairs, which can form single-stranded conformations with relatively low energies. Overall, the results presented here suggest the possibility for r(AUUCU) repeats to form metastable A-form structures, which can rearrange into single-stranded conformations and attract proteins such as heterogeneous nuclear ribonucleoprotein K (hnRNP K). The information presented here may aid in the rational design of therapeutics targeting this RNA.
Crystallographic and Computational Analyses of AUUCU Repeating RNA That Causes Spinocerebellar Ataxia Type 10 (SCA10)

PubMed Central

Park, HaJeung; González, Àlex L.; Yildirim, Ilyas; Tran, Tuan; Lohman, Jeremy R.; Fang, Pengfei; Guo, Min; Disney, Matthew D.

2016-01-01

Spinocerebellar ataxia type 10 (SCA10) is caused by a pentanucleotide repeat expansion of r(AUUCU) within intron 9 of the ATXN10 pre-mRNA. The RNA causes disease by a gain-of-function mechanism in which it inactivates proteins involved in RNA biogenesis. Spectroscopic studies showed that r(AUUCU) repeats form a hairpin structure; however, there were no high-resolution structural models prior to this work. Herein, we report the first crystal structure of model r(AUUCU) repeats refined to 2.8 Å and analysis of the structure via molecular dynamics simulations. The r(AUUCU) tracts adopt an overall A-form geometry in which 3 × 3 nucleotide 5′UCU3′/3′UCU5′ internal loops are closed by AU pairs. Helical parameters of the refined structure as well as the corresponding electron density map on the crystallographic model reflect dynamic features of the internal loop. The computational analyses captured dynamic motion of the loop closing pairs, which can form single-stranded conformations with relatively low energies. Overall, the results presented here suggest the possibility for r(AUUCU) repeats to form metastable A-form structures, which can rearrange into single-stranded conformations and attract proteins such as heterogeneous nuclear ribonucleoprotein K (hnRNP K). The information presented here may aid in the rational design of therapeutics targeting this RNA. PMID:26039897

FoldGPCR: structure prediction protocol for the transmembrane domain of G protein-coupled receptors from class A.

PubMed

Michino, Mayako; Chen, Jianhan; Stevens, Raymond C; Brooks, Charles L

2010-08-01

Building reliable structural models of G protein-coupled receptors (GPCRs) is a difficult task because of the paucity of suitable templates, low sequence identity, and the wide variety of ligand specificities within the superfamily. Template-based modeling is known to be the most successful method for protein structure prediction. However, refinement of homology models within 1-3 Å Cα RMSD of the native structure remains a major challenge. Here, we address this problem by developing a novel protocol (foldGPCR) for modeling the transmembrane (TM) region of GPCRs in complex with a ligand, aimed to accurately model the structural divergence between the template and target in the TM helices. The protocol is based on predicted conserved inter-residue contacts between the template and target, and exploits an all-atom implicit membrane force field. The placement of the ligand in the binding pocket is guided by biochemical data. The foldGPCR protocol is implemented by a stepwise hierarchical approach, in which the TM helical bundle and the ligand are assembled by simulated annealing trials in the first step, and the receptor-ligand complex is refined with replica exchange sampling in the second step. The protocol is applied to model the human β2-adrenergic receptor (β2AR) bound to carazolol, using contacts derived from the template structure of bovine rhodopsin. Comparison with the X-ray crystal structure of the β2AR shows that our protocol is particularly successful in accurately capturing helix backbone irregularities and helix-helix packing interactions that distinguish rhodopsin from β2AR. © 2010 Wiley-Liss, Inc.
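The replica exchange step mentioned above relies on a Metropolis-style test for swapping configurations between neighbouring replicas. The sketch below shows that test for temperature replica exchange, which is an assumption here since the abstract does not say which exchange variable foldGPCR uses; the helper name, energies, and temperatures are illustrative.

```python
import math

K_B = 0.0019872041  # Boltzmann constant in kcal/(mol K)


def swap_probability(energy_i, temp_i, energy_j, temp_j):
    """Acceptance probability for exchanging two replicas.

    Uses min(1, exp[(beta_i - beta_j) * (E_i - E_j)]) with beta = 1 / (k_B T),
    where E_i is the potential energy currently held at temperature T_i.
    """
    beta_i = 1.0 / (K_B * temp_i)
    beta_j = 1.0 / (K_B * temp_j)
    delta = (beta_i - beta_j) * (energy_i - energy_j)
    return 1.0 if delta >= 0.0 else math.exp(delta)


# Cold replica (300 K) holds the higher energy: the swap is always accepted.
p_favourable = swap_probability(-100.0, 300.0, -110.0, 350.0)

# Cold replica already holds the lower energy: the swap is accepted only rarely.
p_unfavourable = swap_probability(-110.0, 300.0, -100.0, 350.0)
```

In a simulation the swap would be carried out when a uniform random number falls below this probability, letting low-energy configurations found at high temperature migrate down to the temperature of interest.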
Ensembler: Enabling High-Throughput Molecular Simulations at the Superfamily Scale.

PubMed

Parton, Daniel L; Grinaway, Patrick B; Hanson, Sonya M; Beauchamp, Kyle A; Chodera, John D

2016-06-01

The rapidly expanding body of available genomic and protein structural data provides a rich resource for understanding protein dynamics with biomolecular simulation. While computational infrastructure has grown rapidly, simulations on an omics scale are not yet widespread, primarily because software infrastructure to enable simulations at this scale has not kept pace. It should now be possible to study protein dynamics across entire (super)families, exploiting both available structural biology data and conformational similarities across homologous proteins. Here, we present a new tool for enabling high-throughput simulation in the genomics era. Ensembler takes any set of sequences, from a single sequence to an entire superfamily, and shepherds them through various stages of modeling and refinement to produce simulation-ready structures. This includes comparative modeling to all relevant PDB structures (which may span multiple conformational states of interest), reconstruction of missing loops, addition of missing atoms, culling of nearly identical structures, assignment of appropriate protonation states, solvation in explicit solvent, and refinement and filtering with molecular simulation to ensure stable simulation. The output of this pipeline is an ensemble of structures ready for subsequent molecular simulations using computer clusters, supercomputers, or distributed computing projects like Folding@home. Ensembler thus automates much of the time-consuming process of preparing protein models suitable for simulation, while allowing scalability up to entire superfamilies. A particular advantage of this approach can be found in the construction of kinetic models of conformational dynamics, such as Markov state models (MSMs), which benefit from a diverse array of initial configurations that span the accessible conformational states to aid sampling. We demonstrate the power of this approach by constructing models for all catalytic domains in the human tyrosine kinase family, using all available kinase catalytic domain structures from any organism as structural templates. Ensembler is free and open source software licensed under the GNU General Public License (GPL) v2. It is compatible with Linux and OS X. The latest release can be installed via the conda package manager, and the latest source can be downloaded from https://github.com/choderalab/ensembler.
Low-resolution structure of Drosophila translin

PubMed Central

Kumar, Vinay; Gupta, Gagan D.

2012-01-01

Crystals of native Drosophila melanogaster translin diffracted to 7 Å resolution. Reductive methylation of the protein improved crystal quality. The native and methylated proteins showed similar profiles in size-exclusion chromatography analyses but the methylated protein displayed reduced DNA-binding activity. Crystals of the methylated protein diffracted to 4.2 Å resolution at BM14 of the ESRF synchrotron. Crystals with 49% solvent content belonged to monoclinic space group P21 with eight protomers in the asymmetric unit. Only 2% of low-resolution structures with similar low percentage solvent content were found in the PDB. The crystal structure, solved by the molecular replacement method, refined to Rwork (Rfree) of 0.24 (0.29) with excellent stereochemistry. The crystal structure clearly shows that the Drosophila protein exists as an octamer, and not as a decamer as expected from gel-filtration elution profiles. The similar octameric quaternary fold in translin orthologs and in translin–TRAX complexes suggests an up-down dimer as the basic structural subunit of translin-like proteins. The Drosophila oligomer displays asymmetric assembly and increased radius of gyration that accounts for the observed differences between the elution profiles of human and Drosophila proteins on gel-filtration columns. This study demonstrates clearly that low-resolution X-ray structure can be useful in understanding complex biological oligomers. PMID:23650579
Characterization of member of DUF1888 protein family, self-cleaving and self-assembling endopeptidase.

PubMed

Osipiuk, Jerzy; Mulligan, Rory; Bargassa, Monireh; Hamilton, John E; Cunningham, Mark A; Joachimiak, Andrzej

2012-06-01

The crystal structure of SO1698 protein from Shewanella oneidensis was determined by a SAD method and refined to 1.57 Å. The structure is a β sandwich that unexpectedly consists of two polypeptides; the N-terminal fragment includes residues 1-116, and the C-terminal one includes residues 117-125. Electron density also displayed the Lys-98 side chain covalently linked to Asp-116. The putative active site residues involved in self-cleavage were identified; point mutants were produced and characterized structurally and in a biochemical assay. Numerical simulations utilizing molecular dynamics and hybrid quantum/classical calculations suggest a mechanism involving activation of a water molecule coordinated by a catalytic aspartic acid.
Application of the maximum entropy principle to determine ensembles of intrinsically disordered proteins from residual dipolar couplings.

PubMed

Sanchez-Martinez, M; Crehuet, R

2014-12-21

We present a method based on the maximum entropy principle that can re-weight an ensemble of protein structures based on data from residual dipolar couplings (RDCs). The RDCs of intrinsically disordered proteins (IDPs) provide information on the secondary structure elements present in an ensemble; however even two sets of RDCs are not enough to fully determine the distribution of conformations, and the force field used to generate the structures has a pervasive influence on the refined ensemble. Two physics-based coarse-grained force fields, Profasi and Campari, are able to predict the secondary structure elements present in an IDP, but even after including the RDC data, the re-weighted ensembles differ between both force fields. Thus the spread of IDP ensembles highlights the need for better force fields. We distribute our algorithm in an open-source Python code.
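The re-weighting idea can be sketched for the simplest case of a single ensemble-averaged observable. Under the maximum entropy principle with a uniform prior, the weights take the form w_i ∝ exp(-λ f_i), and λ is tuned until the weighted average matches the experimental value. This is not the authors' distributed code: the `reweight` function, the bisection search, and the toy values standing in for back-calculated RDCs are all assumptions of this illustration.

```python
import math


def reweight(values, target, lam_lo=-50.0, lam_hi=50.0, tol=1e-10):
    """Maximum-entropy weights so that sum(w_i * values[i]) equals target.

    values: back-calculated observable for each structure in the ensemble.
    target: experimental ensemble average to be reproduced.
    """
    def weights(lam):
        exponents = [-lam * v for v in values]
        shift = max(exponents)  # subtract the maximum for numerical stability
        raw = [math.exp(e - shift) for e in exponents]
        total = sum(raw)
        return [r / total for r in raw]

    def average(lam):
        return sum(w * v for w, v in zip(weights(lam), values))

    if not average(lam_hi) <= target <= average(lam_lo):
        raise ValueError("target lies outside the range reachable by reweighting")
    # The weighted average decreases monotonically with lambda, so bisect.
    while lam_hi - lam_lo > tol:
        mid = 0.5 * (lam_lo + lam_hi)
        if average(mid) > target:
            lam_lo = mid
        else:
            lam_hi = mid
    return weights(0.5 * (lam_lo + lam_hi))


predicted = [1.0, 2.0, 3.0, 4.0]   # toy per-structure observable
new_weights = reweight(predicted, target=2.0)
```

The result is the set of weights closest to the original (uniform) ensemble that still satisfies the restraint, which is also why the starting force field keeps a strong influence on the refined ensemble: structures it never generated cannot be weighted in.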
GalaxyHomomer: a web server for protein homo-oligomer structure prediction from a monomer sequence or structure.

PubMed

Baek, Minkyung; Park, Taeyong; Heo, Lim; Park, Chiwook; Seok, Chaok

2017-07-03

Homo-oligomerization of proteins is abundant in nature, and is often intimately related with the physiological functions of proteins, such as in metabolism, signal transduction or immunity. Information on the homo-oligomer structure is therefore important to obtain a molecular-level understanding of protein functions and their regulation. Currently available web servers predict protein homo-oligomer structures either by template-based modeling using homo-oligomer templates selected from the protein structure database or by ab initio docking of monomer structures resolved by experiment or predicted by computation. The GalaxyHomomer server, freely accessible at http://galaxy.seoklab.org/homomer, carries out template-based modeling, ab initio docking or both depending on the availability of proper oligomer templates. It also incorporates recently developed model refinement methods that can consistently improve model quality. Moreover, the server provides additional options that can be chosen by the user depending on the availability of information on the monomer structure, oligomeric state and locations of unreliable/flexible loops or termini. The performance of the server was better than or comparable to that of other available methods when tested on benchmark sets and in a recent CASP performed in a blind fashion. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Determining crystal structures through crowdsourcing and coursework

PubMed Central

Horowitz, Scott; Koepnick, Brian; Martin, Raoul; Tymieniecki, Agnes; Winburn, Amanda A.; Cooper, Seth; Flatten, Jeff; Rogawski, David S.; Koropatkin, Nicole M.; Hailu, Tsinatkeab T.; Jain, Neha; Koldewey, Philipp; Ahlstrom, Logan S.; Chapman, Matthew R.; Sikkema, Andrew P.; Skiba, Meredith A.; Maloney, Finn P.; Beinlich, Felix R. M.; Caglar, Ahmet; Coral, Alan; Jensen, Alice Elizabeth; Lubow, Allen; Boitano, Amanda; Lisle, Amy Elizabeth; Maxwell, Andrew T.; Failer, Barb; Kaszubowski, Bartosz; Hrytsiv, Bohdan; Vincenzo, Brancaccio; de Melo Cruz, Breno Renan; McManus, Brian Joseph; Kestemont, Bruno; Vardeman, Carl; Comisky, Casey; Neilson, Catherine; Landers, Catherine R.; Ince, Christopher; Buske, Daniel Jon; Totonjian, Daniel; Copeland, David Marshall; Murray, David; Jagieła, Dawid; Janz, Dietmar; Wheeler, Douglas C.; Cali, Elie; Croze, Emmanuel; Rezae, Farah; Martin, Floyd Orville; Beecher, Gil; de Jong, Guido Alexander; Ykman, Guy; Feldmann, Harald; Chan, Hugo Paul Perez; Kovanecz, Istvan; Vasilchenko, Ivan; Connellan, James C.; Borman, Jami Lynne; Norrgard, Jane; Kanfer, Jebbie; Canfield, Jeffrey M.; Slone, Jesse David; Oh, Jimmy; Mitchell, Joanne; Bishop, John; Kroeger, John Douglas; Schinkler, Jonas; McLaughlin, Joseph; Brownlee, June M.; Bell, Justin; Fellbaum, Karl Willem; Harper, Kathleen; Abbey, Kirk J.; Isaksson, Lennart E.; Wei, Linda; Cummins, Lisa N.; Miller, Lori Anne; Bain, Lyn; Carpenter, Lynn; Desnouck, Maarten; Sharma, Manasa G.; Belcastro, Marcus; Szew, Martin; Szew, Martin; Britton, Matthew; Gaebel, Matthias; Power, Max; Cassidy, Michael; Pfützenreuter, Michael; Minett, Michele; Wesselingh, Michiel; Yi, Minjune; Cameron, Neil Haydn Tormey; Bolibruch, Nicholas I.; Benevides, Noah; Kathleen Kerr, Norah; Barlow, Nova; Crevits, Nykole Krystyne; Dunn, Paul; Roque, Paulo Sergio Silveira Belo Nascimento; Riber, Peter; Pikkanen, Petri; Shehzad, Raafay; Viosca, Randy; James Fraser, Robert; Leduc, Robert; Madala, Roman; Shnider, Scott; de Boisblanc, 
Sharon; Butkovich, Slava; Bliven, Spencer; Hettler, Stephen; Telehany, Stephen; Schwegmann, Steven A.; Parkes, Steven; Kleinfelter, Susan C.; Michael Holst, Sven; van der Laan, T. J. A.; Bausewein, Thomas; Simon, Vera; Pulley, Warwick; Hull, William; Kim, Annes Yukyung; Lawton, Alexis; Ruesch, Amanda; Sundar, Anjali; Lawrence, Anna-Lisa; Afrin, Antara; Maheshwer, Bhargavi; Turfe, Bilal; Huebner, Christian; Killeen, Courtney Elizabeth; Antebi-Lerrman, Dalia; Luan, Danny; Wolfe, Derek; Pham, Duc; Michewicz, Elaina; Hull, Elizabeth; Pardington, Emily; Galal, Galal Osama; Sun, Grace; Chen, Grace; Anderson, Halie E.; Chang, Jane; Hewlett, Jeffrey Thomas; Sterbenz, Jennifer; Lim, Jiho; Morof, Joshua; Lee, Junho; Inn, Juyoung Samuel; Hahm, Kaitlin; Roth, Kaitlin; Nair, Karun; Markin, Katherine; Schramm, Katie; Toni Eid, Kevin; Gam, Kristina; Murphy, Lisha; Yuan, Lucy; Kana, Lulia; Daboul, Lynn; Shammas, Mario Karam; Chason, Max; Sinan, Moaz; Andrew Tooley, Nicholas; Korakavi, Nisha; Comer, Patrick; Magur, Pragya; Savliwala, Quresh; Davison, Reid Michael; Sankaran, Roshun Rajiv; Lewe, Sam; Tamkus, Saule; Chen, Shirley; Harvey, Sho; Hwang, Sin Ye; Vatsia, Sohrab; Withrow, Stefan; Luther, Tahra K; Manett, Taylor; Johnson, Thomas James; Ryan Brash, Timothy; Kuhlman, Wyatt; Park, Yeonjung; Popović, Zoran; Baker, David; Khatib, Firas; Bardwell, James C. A.

2016-01-01

We show here that computer game players can build high-quality crystal structures. Introduction of a new feature into the computer game Foldit allows players to build and real-space refine structures into electron density maps. To assess the usefulness of this feature, we held a crystallographic model-building competition between trained crystallographers, undergraduate students, Foldit players and automatic model-building algorithms. After removal of disordered residues, a team of Foldit players achieved the most accurate structure. Analysing the target protein of the competition, YPL067C, uncovered a new family of histidine triad proteins apparently involved in the prevention of amyloid toxicity. From this study, we conclude that crystallographers can utilize crowdsourcing to interpret electron density information and to produce structure solutions of the highest quality. PMID:27633552
Feasibility study: refinement of the TTC concept by additional rules based on in silico and experimental data.

PubMed

Hauge-Nilsen, Kristin; Keller, Detlef

2015-01-01

Starting from a single generic limit value, the threshold of toxicological concern (TTC) concept has been further developed over the years, e.g., by including differentiated structural classes according to the rules of Cramer et al. (Food Chem Toxicol 16: 255-276, 1978). In practice, the refined TTC concept of Munro et al. (Food Chem Toxicol 34: 829-867, 1996) is often applied. The purpose of this work was to explore the possibility of refining the concept by introducing additional structure-activity relationships and available toxicity data. Computer modeling was performed using the OECD Toolbox. No observed (adverse) effect level (NO(A)EL) data of 176 substances were collected in a basic data set. New subgroups were created applying the following criteria: extended Cramer rules, low bioavailability, low acute toxicity, no protein binding affinity, and consideration of predicted liver metabolism. The highest TTC limit value of 236 µg/kg/day was determined for a subgroup that combined the criteria "no protein binding affinity" and "predicted liver metabolism." This value was approximately eight times higher than the original Cramer class 1 limit value of 30 µg/kg/day. The results of this feasibility study indicate that inclusion of the proposed criteria may lead to improved TTC values. Thereby, the applicability of the TTC concept in risk assessment could be extended which could reduce the need to perform animal tests.
An Integrated Framework Advancing Membrane Protein Modeling and Design

PubMed Central

Weitzner, Brian D.; Duran, Amanda M.; Tilley, Drew C.; Elazar, Assaf; Gray, Jeffrey J.

2015-01-01

Membrane proteins are critical functional molecules in the human body, constituting more than 30% of open reading frames in the human genome. Unfortunately, a myriad of difficulties in overexpression and reconstitution into membrane mimetics severely limit our ability to determine their structures. Computational tools are therefore instrumental to membrane protein structure prediction, consequently increasing our understanding of membrane protein function and their role in disease. Here, we describe a general framework facilitating membrane protein modeling and design that combines the scientific principles for membrane protein modeling with the flexible software architecture of Rosetta3. This new framework, called RosettaMP, provides a general membrane representation that interfaces with scoring, conformational sampling, and mutation routines that can be easily combined to create new protocols. To demonstrate the capabilities of this implementation, we developed four proof-of-concept applications for (1) prediction of free energy changes upon mutation; (2) high-resolution structural refinement; (3) protein-protein docking; and (4) assembly of symmetric protein complexes, all in the membrane environment. Preliminary data show that these algorithms can produce meaningful scores and structures. The data also suggest needed improvements to both sampling routines and score functions. Importantly, the applications collectively demonstrate the potential of combining the flexible nature of RosettaMP with the power of Rosetta algorithms to facilitate membrane protein modeling and design. PMID:26325167
CASP10-BCL::Fold efficiently samples topologies of large proteins.

PubMed

Heinze, Sten; Putnam, Daniel K; Fischer, Axel W; Kohlmann, Tim; Weiner, Brian E; Meiler, Jens

2015-03-01

During CASP10 in summer 2012, we tested BCL::Fold for prediction of free modeling (FM) and template-based modeling (TBM) targets. BCL::Fold assembles the tertiary structure of a protein from predicted secondary structure elements (SSEs), omitting more flexible loop regions early on. This approach enables the sampling of conformational space for larger proteins with more complex topologies. In preparation for CASP11, we analyzed the quality of CASP10 models throughout the prediction pipeline to understand BCL::Fold's ability to sample the native topology, identify native-like models by scoring and/or clustering approaches, and our ability to add loop regions and side chains to initial SSE-only models. The standout observation is that BCL::Fold sampled topologies with a GDT_TS score > 33% for 12 of 18 and with a topology score > 0.8 for 11 of 18 test cases de novo. Despite the sampling success of BCL::Fold, significant challenges still exist in the clustering and loop generation stages of the pipeline. The clustering approach employed for model selection often failed to identify the most native-like assembly of SSEs for further refinement and submission. It was also observed that for some β-strand proteins, model refinement failed because β-strands were not properly aligned to form hydrogen bonds, removing otherwise accurate models from the pool. Further, BCL::Fold frequently samples non-natural topologies that require loop regions to pass through the center of the protein. © 2015 Wiley Periodicals, Inc.
Structure and Function of the Macrolide Biosensor Protein, MphR(A), with and without Erythromycin

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zheng, Jianting; Sagar, Vatsala; Smolinsky, Adam

2009-09-02

The regulatory protein MphR(A) has recently seen extensive use in synthetic biological applications, such as metabolite sensing and exogenous control of gene expression. This protein negatively regulates the expression of a macrolide 2′-phosphotransferase I resistance gene (mphA) via binding to a 35-bp DNA operator upstream of the start codon and is de-repressed by the presence of erythromycin. Here, we present the refined crystal structure of the MphR(A) protein free of erythromycin and that of the MphR(A) protein with bound erythromycin at 2.00- and 1.76-Å resolutions, respectively. We also studied the DNA binding properties of the protein and identified mutants of MphR(A) that are defective in gene repression and ligand binding in a cell-based reporter assay. The combination of these two structures illustrates the molecular basis of erythromycin-induced gene expression and provides a framework for additional applied uses of this protein in the isolation and engineered biosynthesis of polyketide natural products.
How large B-factors can be in protein crystal structures.

PubMed

Carugo, Oliviero

2018-02-23

Protein crystal structures are potentially over-interpreted since they are routinely refined without any restraint on the upper limit of atomic B-factors. Consequently, some of their atoms, undetected in the electron density maps, are allowed to reach extremely large B-factors, even above 100 Å², and their final positions are purely speculative and not based on any experimental evidence. A strategy to define B-factor upper limits is described here, based on the analysis of protein crystal structures deposited in the Protein Data Bank prior to 2008, when the tendency to allow B-factors to inflate arbitrarily was limited. This B-factor upper limit (B_max) is determined by extrapolating the relationship between crystal structure average B-factor and percentage of crystal volume occupied by solvent (pcVol) to pcVol = 100%, when, ab absurdo, the crystal contains only liquid solvent, the structure of which is, by definition, undetectable in electron density maps. It is thus possible to highlight structures with average B-factors larger than B_max, which should be considered with caution by the users of the information deposited in the Protein Data Bank, in order to avoid scientifically deleterious over-interpretations.
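The extrapolation described above can be sketched as follows. The abstract does not state the functional form of the relationship between average B-factor and pcVol, so this sketch assumes a straight line fitted by ordinary least squares. The function names and the data points are hypothetical and are not taken from the paper.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def b_max(pc_vol, avg_b):
    """Extrapolate the fitted line to pcVol = 100% (assumed linear relationship)."""
    slope, intercept = fit_line(pc_vol, avg_b)
    return intercept + slope * 100.0

def exceeds_limit(structure_avg_b, limit):
    """Flag a structure whose average B-factor is above the extrapolated limit."""
    return structure_avg_b > limit

# Hypothetical (pcVol %, average B-factor in Å²) pairs, for illustration only.
pc_vol = [30.0, 40.0, 50.0, 60.0]
avg_b = [10.0, 30.0, 50.0, 70.0]
limit = b_max(pc_vol, avg_b)
print(limit, exceeds_limit(180.0, limit))
```

If the real relationship is not linear, only `fit_line` needs to change; the extrapolation to pcVol = 100% and the flagging step stay the same.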
Structural and Functional Studies of a Newly Grouped Haloquadratum walsbyi Bacteriorhodopsin Reveal the Acid-resistant Light-driven Proton Pumping Activity.

PubMed

Hsu, Min-Feng; Fu, Hsu-Yuan; Cai, Chun-Jie; Yi, Hsiu-Pin; Yang, Chii-Shen; Wang, Andrew H-J

2015-12-04

Retinal-bound light-driven proton pumps are widespread in eukaryotic and prokaryotic organisms. Among these pumps, bacteriorhodopsin (BR) proteins cooperate with ATP synthase to convert captured solar energy into a biologically consumable form, ATP. In an acidic environment or when pumped-out protons accumulate in the extracellular region, the maximum absorbance of BR proteins shifts markedly to the longer wavelengths. These conditions also affect light-driven proton pumping activity. In this study, the wild-type crystal structure of a BR with optical stability over a wide pH range from a square halophilic archaeon, Haloquadratum walsbyi (HwBR), was solved in two crystal forms. One crystal form, refined to 1.85 Å resolution, contains a trimer in the asymmetric unit, whereas the other, which contains an antiparallel dimer, was refined at 2.58 Å. HwBR could not be classified into any existing subgroup of archaeal BR proteins based on the protein sequence phylogenetic tree, and it showed unique absorption spectral stability when exposed to low pH values. All structures showed a unique hydrogen-bonding network between Arg(82) and Thr(201), linking the BC and FG loops to shield the retinal-binding pocket in the interior from the extracellular environment. This result was supported by the R82E mutation, which attenuated the optical stability. The negatively charged cytoplasmic side and the Arg(82)-Thr(201) hydrogen bond may play an important role in the proton translocation trend in HwBR under acidic conditions. Our findings have unveiled a strategy adopted by BR proteins to solidify their defenses against unfavorable environments and maintain their optical properties associated with proton pumping. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Structural and Functional Studies of a Newly Grouped Haloquadratum walsbyi Bacteriorhodopsin Reveal the Acid-resistant Light-driven Proton Pumping Activity*

PubMed Central

Hsu, Min-Feng; Fu, Hsu-Yuan; Cai, Chun-Jie; Yi, Hsiu-Pin; Yang, Chii-Shen; Wang, Andrew H.-J.

2015-01-01

Retinal-bound light-driven proton pumps are widespread in eukaryotic and prokaryotic organisms. Among these pumps, bacteriorhodopsin (BR) proteins cooperate with ATP synthase to convert captured solar energy into a biologically consumable form, ATP. In an acidic environment or when pumped-out protons accumulate in the extracellular region, the maximum absorbance of BR proteins shifts markedly to the longer wavelengths. These conditions also affect light-driven proton pumping activity. In this study, the wild-type crystal structure of a BR with optical stability over a wide pH range from a square halophilic archaeon, Haloquadratum walsbyi (HwBR), was solved in two crystal forms. One crystal form, refined to 1.85 Å resolution, contains a trimer in the asymmetric unit, whereas the other, which contains an antiparallel dimer, was refined at 2.58 Å. HwBR could not be classified into any existing subgroup of archaeal BR proteins based on the protein sequence phylogenetic tree, and it showed unique absorption spectral stability when exposed to low pH values. All structures showed a unique hydrogen-bonding network between Arg82 and Thr201, linking the BC and FG loops to shield the retinal-binding pocket in the interior from the extracellular environment. This result was supported by the R82E mutation, which attenuated the optical stability. The negatively charged cytoplasmic side and the Arg82–Thr201 hydrogen bond may play an important role in the proton translocation trend in HwBR under acidic conditions. Our findings have unveiled a strategy adopted by BR proteins to solidify their defenses against unfavorable environments and maintain their optical properties associated with proton pumping. PMID:26483542
Extracting physicochemical features to predict protein secondary structure.

PubMed

Huang, Yin-Fu; Chen, Shu-Ying

2013-01-01

We propose a protein secondary structure prediction method based on position-specific scoring matrix (PSSM) profiles and four physicochemical features: conformation parameters, net charges, hydrophobicity, and side chain mass. First, the SVM with the optimal window size and the optimal parameters of the kernel function is found. Then, we train the SVM using the PSSM profiles generated from PSI-BLAST and the physicochemical features extracted from the CB513 data set. Finally, we use the filter to refine the predicted results from the trained SVM. For the performance measures of our method, Q3 reaches 79.52, SOV94 reaches 86.10, and SOV99 reaches 74.60; all the measures are higher than those of the SVMpsi method and the SVMfreq method. This indicates that including these physicochemical features improves the prediction of protein secondary structure.
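For readers unfamiliar with the Q3 measure quoted above, the following minimal sketch shows how it is conventionally computed: the percentage of residues whose predicted three-state label (H, E or C) matches the observed one. The function name and the example strings are illustrative and are not taken from the paper or from the CB513 data set.

```python
def q3(predicted, observed):
    """Percentage of residues whose three-state label (H/E/C) is predicted correctly."""
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed strings must have equal length")
    if not observed:
        raise ValueError("empty sequence")
    correct = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * correct / len(observed)

# Hypothetical 8-residue example: one residue (position 6) is mispredicted.
print(q3("HHHEECCC", "HHHEEECC"))  # 87.5
```

The SOV scores reported alongside Q3 are segment-overlap measures and need a different calculation that is not shown here.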
Extracting Physicochemical Features to Predict Protein Secondary Structure

PubMed Central

Chen, Shu-Ying

2013-01-01

We propose a protein secondary structure prediction method based on position-specific scoring matrix (PSSM) profiles and four physicochemical features: conformation parameters, net charges, hydrophobicity, and side chain mass. First, the SVM with the optimal window size and the optimal parameters of the kernel function is found. Then, we train the SVM using the PSSM profiles generated from PSI-BLAST and the physicochemical features extracted from the CB513 data set. Finally, we use the filter to refine the predicted results from the trained SVM. For the performance measures of our method, Q3 reaches 79.52, SOV94 reaches 86.10, and SOV99 reaches 74.60; all the measures are higher than those of the SVMpsi method and the SVMfreq method. This indicates that including these physicochemical features improves the prediction of protein secondary structure. PMID:23766688
Fitmunk: improving protein structures by accurate, automatic modeling of side-chain conformations.

PubMed

Porebski, Przemyslaw Jerzy; Cymborowski, Marcin; Pasenkiewicz-Gierula, Marta; Minor, Wladek

2016-02-01

Improvements in crystallographic hardware and software have allowed automated structure-solution pipelines to approach a near-'one-click' experience for the initial determination of macromolecular structures. However, in many cases the resulting initial model requires a laborious, iterative process of refinement and validation. A new method has been developed for the automatic modeling of side-chain conformations that takes advantage of rotamer-prediction methods in a crystallographic context. The algorithm, which is based on deterministic dead-end elimination (DEE) theory, uses new dense conformer libraries and a hybrid energy function derived from experimental data and prior information about rotamer frequencies to find the optimal conformation of each side chain. In contrast to existing methods, which incorporate the electron-density term into protein-modeling frameworks, the proposed algorithm is designed to take advantage of the highly discriminatory nature of electron-density maps. This method has been implemented in the program Fitmunk, which uses extensive conformational sampling. This improves the accuracy of the modeling and makes it a versatile tool for crystallographic model building, refinement and validation. Fitmunk was extensively tested on over 115 new structures, as well as a subset of 1100 structures from the PDB. It is demonstrated that the ability of Fitmunk to model more than 95% of side chains accurately is beneficial for improving the quality of crystallographic protein models, especially at medium and low resolutions. Fitmunk can be used for model validation of existing structures and as a tool to assess whether side chains are modeled optimally or could be better fitted into electron density. Fitmunk is available as a web service at http://kniahini.med.virginia.edu/fitmunk/server/ or at http://fitmunk.bitbucket.org/.
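As background on dead-end elimination, the sketch below applies the Goldstein elimination criterion to a toy problem. It is not Fitmunk's implementation, and the abstract does not say which DEE criterion Fitmunk uses. The energy tables stand in for the hybrid energy function and are entirely hypothetical. Under the Goldstein criterion, rotamer r at position i is removed when some competitor t at the same position satisfies E(i_r) − E(i_t) + Σ_j min_s [E(i_r, j_s) − E(i_t, j_s)] > 0, because r can then never belong to the global minimum.

```python
def dee_goldstein(self_e, pair_e):
    """Prune rotamers with the Goldstein dead-end elimination criterion.

    self_e[i][r]          : self energy of rotamer r at position i
    pair_e[(i, j)][r][s]  : pair energy of rotamer r at i and s at j, stored for i < j
    Returns one set of surviving rotamer indices per position.
    """
    def pair(i, r, j, s):
        return pair_e[(i, j)][r][s] if i < j else pair_e[(j, i)][s][r]

    n_pos = len(self_e)
    alive = [set(range(len(e))) for e in self_e]
    changed = True
    while changed:  # repeat until a full pass eliminates nothing
        changed = False
        for i in range(n_pos):
            for r in list(alive[i]):
                for t in alive[i] - {r}:
                    gap = self_e[i][r] - self_e[i][t]
                    for j in range(n_pos):
                        if j != i:
                            gap += min(pair(i, r, j, s) - pair(i, t, j, s)
                                       for s in alive[j])
                    if gap > 0:  # t beats r whatever the other positions adopt
                        alive[i].discard(r)
                        changed = True
                        break
    return alive

# Hypothetical two-residue problem with two rotamers each.
self_e = [[0.0, 5.0], [0.0, 1.0]]
pair_e = {(0, 1): [[0.0, 0.0], [1.0, -1.0]]}
print(dee_goldstein(self_e, pair_e))  # [{0}, {0}]
```

In this toy case elimination alone leaves a single rotamer per position. In general DEE only shrinks the search space, and the survivors still have to be enumerated or searched.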
Protein-based materials, toward a new level of structural control.

PubMed

van Hest, J C; Tirrell, D A

2001-10-07

Through billions of years of evolution nature has created and refined structural proteins for a wide variety of specific purposes. Amino acid sequences and their associated folding patterns combine to create elastic, rigid or tough materials. In many respects, nature's intricately designed products provide challenging examples for materials scientists, but translation of natural structural concepts into bio-inspired materials requires a level of control of macromolecular architecture far higher than that afforded by conventional polymerization processes. An increasingly important approach to this problem has been to use biological systems for production of materials. Through protein engineering, artificial genes can be developed that encode protein-based materials with desired features. Structural elements found in nature, such as beta-sheets and alpha-helices, can be combined with great flexibility, and can be outfitted with functional elements such as cell binding sites or enzymatic domains. The possibility of incorporating non-natural amino acids increases the versatility of protein engineering still further. It is expected that such methods will have large impact in the field of materials science, and especially in biomedical materials science, in the future.
Small Angle X-Ray Scattering from Lipid-Bound Myelin Basic Protein in Solution

PubMed Central

Haas, H.; Oliveira, C. L. P.; Torriani, I. L.; Polverini, E.; Fasano, A.; Carlone, G.; Cavatorta, P.; Riccio, P.

2004-01-01

The structure of myelin basic protein (MBP), purified from the myelin sheath in both lipid-free (LF-MBP) and lipid-bound (LB-MBP) forms, was investigated in solution by small angle x-ray scattering. The water-soluble LF-MBP, extracted at pH < 3.0 from defatted brain, is the classical preparation of MBP, commonly regarded as an intrinsically unfolded protein. LB-MBP is a lipoprotein-detergent complex extracted from myelin with its native lipidic environment at pH > 7.0. Under all conditions, the scattering from the two protein forms was different, indicating different molecular shapes. For the LB-MBP, well-defined scattering curves were obtained, suggesting that the protein had a unique, compact (but not globular) structure. Furthermore, these data were compatible with earlier results from molecular modeling calculations on the MBP structure which have been refined by us. In contrast, the LF-MBP data were in accordance with the expected open-coil conformation. The results represent the first direct structural information from x-ray scattering measurements on MBP in its native lipidic environment in solution. PMID:14695288
Experimental conformational energy maps of proteins and peptides.

PubMed

Balaji, Govardhan A; Nagendra, H G; Balaji, Vitukudi N; Rao, Shashidhar N

2017-06-01

We have presented an extensive analysis of the peptide backbone dihedral angles in the PDB structures and computed experimental Ramachandran plots for their distributions seen under various constraints on X-ray resolution, representativeness at different sequence identity percentages, and hydrogen bonding distances. These experimental distributions have been converted into isoenergy contour plots using the approach employed previously by F. M. Pohl. This has led to the identification of energetically favored minima in the Ramachandran (ϕ, ψ) plots in which global minima are predominantly observed either in the right-handed α-helical or the polyproline II regions. Further, we have identified low energy pathways for transitions between various minima in the (ϕ, ψ) plots. We have compared and presented the experimental plots with published theoretical plots obtained from both molecular mechanics and quantum mechanical approaches. In addition, we have developed and employed a root mean square deviation (RMSD) metric for isoenergy contours in various ranges, as a measure (in kcal·mol⁻¹) to compare any two plots and determine the extent of correlation and similarity between their isoenergy contours. In general, we observe a greater degree of compatibility with experimental plots for energy maps obtained from molecular mechanics methods compared to most quantum mechanical methods. The experimental energy plots we have investigated could be helpful in refining protein structures obtained from X-ray, NMR, and electron microscopy and in refining force field parameters to enable simulations of peptide and protein structures that have a higher degree of consistency with experiments. Proteins 2017; 85:979-1001. © 2017 Wiley Periodicals, Inc.
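One common way to turn an observed (ϕ, ψ) distribution into an energy map is Boltzmann inversion, E = −RT·ln(P/P_max). The sketch below assumes that form; the abstract refers to Pohl's approach without giving the formula, so this may differ in detail from what the authors did. The bin width, temperature, function name and angle data are illustrative assumptions.

```python
import math

R_KCAL = 0.0019872  # gas constant in kcal/(mol·K)

def energy_map(phi_psi, bin_width=30, temperature=300.0):
    """Boltzmann-invert a (phi, psi) histogram into relative energies (kcal/mol).

    Angles are in degrees in [-180, 180]. Empty bins get None because the
    logarithm of zero population is undefined.
    """
    nbins = 360 // bin_width
    counts = [[0] * nbins for _ in range(nbins)]
    for phi, psi in phi_psi:
        i = int((phi + 180.0) // bin_width) % nbins  # +180 wraps around to bin 0
        j = int((psi + 180.0) // bin_width) % nbins
        counts[i][j] += 1
    max_count = max(max(row) for row in counts)
    rt = R_KCAL * temperature
    return [[-rt * math.log(c / max_count) if c else None for c in row]
            for row in counts]

# Hypothetical sample: 10 residues near the alpha-helical region and
# 5 near the polyproline II region.
angles = [(-60.0, -45.0)] * 10 + [(-75.0, 150.0)] * 5
emap = energy_map(angles)
print(emap[4][4], round(emap[3][11], 3))
```

The most populated bin defines the zero of energy, so a bin with half that population sits RT·ln 2 (about 0.41 kcal/mol at 300 K) above it.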

Progressive structure-based alignment of homologous proteins: Adopting sequence comparison strategies.

PubMed

Joseph, Agnel Praveen; Srinivasan, Narayanaswamy; de Brevern, Alexandre G

2012-09-01

Comparison of multiple protein structures has a broad range of applications in the analysis of protein structure, function and evolution. Multiple structure alignment tools (MSTAs) are necessary to obtain a simultaneous comparison of a family of related folds. In this study, we have developed a method for multiple structure comparison largely based on sequence alignment techniques. A widely used Structural Alphabet named Protein Blocks (PBs) was used to transform the information on 3D protein backbone conformation into a 1D sequence string. A progressive alignment strategy similar to CLUSTALW was adopted for multiple PB sequence alignment (mulPBA). Highly similar stretches identified by the pairwise alignments are given higher weights during the alignment. The residue equivalences from PB-based alignments are used to obtain a three-dimensional fit of the structures followed by an iterative refinement of the structural superposition. Systematic comparisons using benchmark datasets of MSTAs underline that the alignment quality is better than MULTIPROT, MUSTANG and the alignments in HOMSTRAD, in more than 85% of the cases. Comparison with other rigid-body and flexible MSTAs also indicates that mulPBA alignments are superior to most of the rigid-body MSTAs and highly comparable to the flexible alignment methods. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
GeneSilico protein structure prediction meta-server.

PubMed

Kurowski, Michal A; Bujnicki, Janusz M

2003-07-01

Rigorous assessments of protein structure prediction have demonstrated that fold recognition methods can identify remote similarities between proteins when standard sequence search methods fail. It has been shown that the accuracy of predictions is improved when refined multiple sequence alignments are used instead of single sequences and if different methods are combined to generate a consensus model. There are several meta-servers available that integrate protein structure predictions performed by various methods, but they do not allow for submission of user-defined multiple sequence alignments and they seldom offer confidentiality of the results. We developed a novel WWW gateway for protein structure prediction, which combines the useful features of other meta-servers available, but with much greater flexibility of the input. The user may submit an amino acid sequence or a multiple sequence alignment to a set of methods for primary, secondary and tertiary structure prediction. Fold-recognition results (target-template alignments) are converted into full-atom 3D models and the quality of these models is uniformly assessed. A consensus between different FR methods is also inferred. The results are conveniently presented on-line on a single web page over a secure, password-protected connection. The GeneSilico protein structure prediction meta-server is freely available for academic users at http://genesilico.pl/meta.
GeneSilico protein structure prediction meta-server

PubMed Central

Kurowski, Michal A.; Bujnicki, Janusz M.

2003-01-01

Rigorous assessments of protein structure prediction have demonstrated that fold recognition methods can identify remote similarities between proteins when standard sequence search methods fail. It has been shown that the accuracy of predictions is improved when refined multiple sequence alignments are used instead of single sequences and if different methods are combined to generate a consensus model. There are several meta-servers available that integrate protein structure predictions performed by various methods, but they do not allow for submission of user-defined multiple sequence alignments and they seldom offer confidentiality of the results. We developed a novel WWW gateway for protein structure prediction, which combines the useful features of other meta-servers available, but with much greater flexibility of the input. The user may submit an amino acid sequence or a multiple sequence alignment to a set of methods for primary, secondary and tertiary structure prediction. Fold-recognition results (target-template alignments) are converted into full-atom 3D models and the quality of these models is uniformly assessed. A consensus between different FR methods is also inferred. The results are conveniently presented on-line on a single web page over a secure, password-protected connection. The GeneSilico protein structure prediction meta-server is freely available for academic users at http://genesilico.pl/meta. PMID:12824313
A scoring function based on solvation thermodynamics for protein structure prediction

PubMed Central

Du, Shiqiao; Harano, Yuichi; Kinoshita, Masahiro; Sakurai, Minoru

2012-01-01

We predict protein structure using our recently developed free energy function for describing protein stability, which is focused on solvation thermodynamics. The function is combined with the current most reliable sampling methods, i.e., fragment assembly (FA) and comparative modeling (CM). The prediction is tested using 11 small proteins for which high-resolution crystal structures are available. For 8 of these proteins, sequence similarities are found in the database, and the prediction is performed with CM. Fairly accurate models with average Cα root mean square deviation (RMSD) ∼ 2.0 Å are successfully obtained for all cases. For the rest of the target proteins, we perform the prediction following FA protocols. For 2 cases, we obtain predicted models with an RMSD ∼ 3.0 Å as the best-scored structures. For the other case, the RMSD remains larger than 7 Å. For all the 11 target proteins, our scoring function identifies the experimentally determined native structure as the best structure. Starting from the predicted structure, replica exchange molecular dynamics is performed to further refine the structures. However, we are unable to improve its RMSD toward the experimental structure. The exhaustive sampling by coarse-grained normal mode analysis around the native structures reveals that our function has a linear correlation with RMSDs < 3.0 Å. These results suggest that the function is quite reliable for the protein structure prediction while the sampling method remains one of the major limiting factors in it. The aspects through which the methodology could further be improved are discussed. PMID:27493529
Prediction of protein loop conformations using multiscale modeling methods with physical energy scoring functions.

PubMed

Olson, Mark A; Feig, Michael; Brooks, Charles L

2008-04-15

This article examines ab initio methods for the prediction of protein loops by a computational strategy of multiscale conformational sampling and physical energy scoring functions. Our approach consists of initial sampling of loop conformations from lattice-based low-resolution models followed by refinement using all-atom simulations. To allow enhanced conformational sampling, the replica exchange method was implemented. Physical energy functions based on CHARMM19 and CHARMM22 parameterizations with generalized Born (GB) solvent models were applied in scoring loop conformations extracted from the lattice simulations and, in the case of all-atom simulations, the ensemble of conformations was generated and scored with these models. Predictions are reported for 25 loop segments, each eight residues long and taken from a diverse set of 22 protein structures. We find that the simulations generally sampled conformations with low global root-mean-square-deviation (RMSD) for loop backbone coordinates from the known structures, whereas clustering conformations in RMSD space and scoring detected less favorable loop structures. Specifically, the lattice simulations sampled basins that exhibited an average global RMSD of 2.21 ± 1.42 Å, whereas clustering and scoring the loop conformations determined an RMSD of 3.72 ± 1.91 Å. Using CHARMM19/GB to refine the lattice conformations improved the sampling RMSD to 1.57 ± 0.98 Å and detection to 2.58 ± 1.48 Å. We found that further improvement could be gained from extending the upper temperature in the all-atom refinement from 400 to 800 K, where the results typically yield a reduction of approximately 1 Å or greater in the RMSD of the detected loop. Overall, CHARMM19 with a simple pairwise GB solvent model is more efficient at sampling low-RMSD loop basins than CHARMM22 with a higher-resolution modified analytical GB model; however, the latter simulation method provides a more accurate description of the all-atom energy surface, yet demands a much greater computational cost. © 2007 Wiley Periodicals, Inc.
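The replica exchange step mentioned in this abstract can be illustrated with the standard Metropolis swap criterion. The sketch below is not the authors' implementation: the geometric temperature ladder, the 300 K lower bound, the number of replicas, and the example energies are all assumptions chosen for illustration; only the 800 K upper temperature comes from the abstract.

```python
import math

K_B = 0.0019872041  # Boltzmann constant in kcal/(mol*K)


def geometric_ladder(t_min, t_max, n_replicas):
    """Return n_replicas temperatures spaced geometrically from t_min to t_max."""
    ratio = (t_max / t_min) ** (1.0 / (n_replicas - 1))
    return [t_min * ratio ** i for i in range(n_replicas)]


def swap_probability(e_i, t_i, e_j, t_j):
    """Metropolis probability of exchanging configurations between replicas i and j.

    e_i, e_j are potential energies (kcal/mol) of the configurations
    currently held at temperatures t_i and t_j (K).
    """
    beta_i = 1.0 / (K_B * t_i)
    beta_j = 1.0 / (K_B * t_j)
    delta = (beta_i - beta_j) * (e_i - e_j)
    return 1.0 if delta >= 0.0 else math.exp(delta)


# Hypothetical ladder: 8 replicas between an assumed 300 K and the 800 K upper bound.
ladder = geometric_ladder(300.0, 800.0, 8)

# A swap that moves the lower-energy configuration to the colder replica is always accepted.
p_favorable = swap_probability(-100.0, 300.0, -120.0, 400.0)
# The reverse situation is accepted only occasionally.
p_unfavorable = swap_probability(-120.0, 300.0, -100.0, 400.0)
```

Raising the top of the ladder, as the abstract describes, lets high-temperature replicas cross barriers more easily, while the swap rule keeps each temperature sampling its own Boltzmann distribution.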
Use of 13Cα Chemical-Shifts in Protein Structure Determination

PubMed Central

Vila, Jorge A.; Ripoll, Daniel R.; Scheraga, Harold A.

2008-01-01

A physics-based method, aimed at determining protein structures by using NOE-derived distances together with observed and computed 13C chemical shifts, is proposed. The approach makes use of 13Cα chemical shifts, computed at the density functional level of theory, to obtain torsional constraints for all backbone and side-chain torsional angles without making a priori use of the occupancy of any region of the Ramachandran map by the amino acid residues. The torsional constraints are not fixed but are changed dynamically in each step of the procedure, following an iterative self-consistent approach intended to identify a set of conformations for which the computed 13Cα chemical shifts match the experimental ones. A test is carried out on a 76-amino acid all-α-helical protein, namely the B. subtilis acyl carrier protein. It is shown that, starting from randomly generated conformations, the final protein models are more accurate than an existing NMR-derived structure model of this protein, in terms of both the agreement between predicted and observed 13Cα chemical shifts and some stereochemical quality indicators, and of similar accuracy as one of the protein models solved at a high level of resolution. The results provide evidence that this methodology can be used not only for structure determination but also for additional protein structure refinement of NMR-derived models deposited in the Protein Data Bank. PMID:17516673
The 15-K neutron structure of saccharide-free concanavalin A.

PubMed

Blakeley, M P; Kalb, A J; Helliwell, J R; Myles, D A A

2004-11-23

The positions of the ordered hydrogen isotopes of a protein and its bound solvent can be determined by using neutron crystallography. Furthermore, by collecting neutron data at cryo temperatures, the dynamic disorder within a protein crystal is reduced, which may lead to improved definition of the nuclear density. It has proved possible to cryo-cool very large Con A protein crystals (>1.5 mm³) suitable for high-resolution neutron and x-ray structure analysis. We can thereby report the neutron crystal structure of the saccharide-free form of Con A and its bound water, including 167 intact D2O molecules and 60 oxygen atoms at 15 K to 2.5-Å resolution, along with the 1.65-Å x-ray structure of an identical crystal at 100 K. Comparison with the 293-K neutron structure shows that the bound water molecules are better ordered and have lower average B factors than those at room temperature. Overall, twice as many bound waters (as D2O) are identified at 15 K as at 293 K. We note that alteration of bound water orientations occurs between 293 and 15 K; such changes, as illustrated here with this example, could be important more generally in protein crystal structure analysis and ligand design. Methodologically, this successful neutron cryo protein structure refinement opens up categories of neutron protein crystallography, including freeze-trapped structures and cryo to room temperature comparisons.
Investigating energy-based pool structure selection in the structure ensemble modeling with experimental distance constraints: The example from a multidomain protein Pub1.

PubMed

Zhu, Guanhua; Liu, Wei; Bao, Chenglong; Tong, Dudu; Ji, Hui; Shen, Zuowei; Yang, Daiwen; Lu, Lanyuan

2018-05-01

The structural variations of multidomain proteins with flexible parts mediate many biological processes, and a structure ensemble can be determined by selecting a weighted combination of representative structures from a simulated structure pool, producing the best fit to experimental constraints such as interatomic distance. In this study, a hybrid structure-based and physics-based atomistic force field with an efficient sampling strategy is adopted to simulate a model di-domain protein against experimental paramagnetic relaxation enhancement (PRE) data that correspond to distance constraints. The molecular dynamics simulations produce a wide range of conformations depicted on a protein energy landscape. Subsequently, a conformational ensemble recovered with low-energy structures and the minimum-size restraint is identified in good agreement with experimental PRE rates, and the result is also supported by chemical shift perturbations and small-angle X-ray scattering data. It is illustrated that the regularizations of energy and ensemble-size prevent an arbitrary interpretation of protein conformations. Moreover, energy is found to serve as a critical control to refine the structure pool and prevent data overfitting, because the absence of energy regularization exposes ensemble construction to the noise from high-energy structures and causes a more ambiguous representation of protein conformations. Finally, we perform structure-ensemble optimizations with a topology-based structure pool, to enhance the understanding on the ensemble results from different sources of pool candidates. © 2018 Wiley Periodicals, Inc.
High-resolution structure of the M14-type cytosolic carboxypeptidase from Burkholderia cenocepacia refined exploiting PDB-REDO strategies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rimsa, Vadim; Eadsforth, Thomas C.; Joosten, Robbie P.

2014-02-01

The structure of a bacterial M14-family carboxypeptidase determined exploiting microfocus synchrotron radiation and highly automated refinement protocols reveals its potential to act as a polyglutamylase. A potential cytosolic metallocarboxypeptidase from Burkholderia cenocepacia has been crystallized and a synchrotron-radiation microfocus beamline allowed the acquisition of diffraction data to 1.9 Å resolution. The asymmetric unit comprises a tetramer containing over 1500 amino acids, and the high-throughput automated protocols embedded in PDB-REDO were coupled with model–map inspections in refinement. This approach has highlighted the value of such protocols for efficient analyses. The subunit is constructed from two domains. The N-terminal domain has previously only been observed in cytosolic carboxypeptidase (CCP) proteins. The C-terminal domain, which carries the Zn²⁺-containing active site, serves to classify this protein as a member of the M14D subfamily of carboxypeptidases. Although eukaryotic CCPs possess deglutamylase activity and are implicated in processing modified tubulin, the function and substrates of the bacterial family members remain unknown. The B. cenocepacia protein did not display deglutamylase activity towards a furylacryloyl glutamate derivative, a potential substrate. Residues previously shown to coordinate the divalent cation and that contribute to peptide-bond cleavage in related enzymes such as bovine carboxypeptidase are conserved. The location of a conserved basic patch in the active site adjacent to the catalytic Zn²⁺, where an acetate ion is identified, suggests recognition of the carboxy-terminus in a similar fashion to other carboxypeptidases. However, there are significant differences that indicate the recognition of substrates with different properties. Of note is the presence of a lysine in the S1′ recognition subsite that suggests specificity towards an acidic substrate.
Crystal Structure of Cocosin, A Potential Food Allergen from Coconut (Cocos nucifera).

PubMed

Jin, Tengchuan; Wang, Cheng; Zhang, Caiying; Wang, Yang; Chen, Yu-Wei; Guo, Feng; Howard, Andrew; Cao, Min-Jie; Fu, Tong-Jen; McHugh, Tara H; Zhang, Yuzhu

2017-08-30

Coconut (Cocos nucifera) is an important palm tree. Coconut fruit is widely consumed. The most abundant storage protein in coconut fruit is cocosin (a likely food allergen), which belongs to the 11S globulin family. Cocosin was crystallized nearly a century ago, but its structure remains unknown. By optimizing crystallization conditions and cryoprotectant solutions, we were able to obtain cocosin crystals that diffracted to 1.85 Å. The cocosin gene was cloned from genomic DNA isolated from dry coconut tissue. The protein sequence deduced from the predicted cocosin coding sequence was used to guide model building and structure refinement. The structure of cocosin was determined for the first time, and it revealed a typical 11S globulin feature of a double layer doughnut-shaped hexamer.
Weak data do not make a free lunch, only a cheap meal

DOE Office of Scientific and Technical Information (OSTI.GOV)

Luo, Zhipu; Rajashankar, Kanagalaghatta; Dauter, Zbigniew, E-mail: dauter@anl.gov

2014-02-01

Refinement and analysis of four structures with various data resolution cutoffs suggests that at present there are no reliable criteria for judging the diffraction data resolution limit and the condition I/σ(I) = 2.0 is reasonable. However, extending the limit by about 0.2 Å beyond the resolution defined by this threshold does not deteriorate the quality of refined structures and in some cases may be beneficial. Four data sets were processed at resolutions significantly exceeding the criteria traditionally used for estimating the diffraction data resolution limit. The analysis of these data and the corresponding model-quality indicators suggests that the criteria of resolution limits widely adopted in the past may be somewhat conservative. Various parameters, such as R_merge and I/σ(I), optical resolution and the correlation coefficients CC_1/2 and CC*, can be used for judging the internal data quality, whereas the reliability factors R and R_free as well as the maximum-likelihood target values and real-space map correlation coefficients can be used to estimate the agreement between the data and the refined model. However, none of these criteria provide a reliable estimate of the data resolution cutoff limit. The analysis suggests that extension of the maximum resolution by about 0.2 Å beyond the currently adopted limit where the I/σ(I) value drops to 2.0 does not degrade the quality of the refined structural models, but may sometimes be advantageous. Such an extension may be particularly beneficial for significantly anisotropic diffraction. Extension of the maximum resolution at the stage of data collection and structure refinement is cheap in terms of the required effort and is definitely more advisable than accepting a too conservative resolution cutoff, which is unfortunately quite frequent among the crystal structures deposited in the Protein Data Bank.
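The cutoff rule discussed in this abstract can be written down in a few lines. The following is a minimal sketch rather than a crystallographic tool: the shell statistics are invented for illustration, the function names are hypothetical, and the R factor is the conventional unweighted form with the scale factor between observed and calculated amplitudes assumed to be already applied.

```python
def resolution_cutoff(shells, threshold=2.0, extension=0.2):
    """Return (conventional, extended) resolution limits in angstroms.

    shells: list of (d_min, mean_i_over_sigma) ordered from low to high
    resolution (i.e. decreasing d_min). The conventional limit is the last
    shell whose mean I/sigma(I) is still at or above the threshold; the
    extended limit pushes it by `extension` angstroms.
    """
    conventional = None
    for d_min, i_over_sigma in shells:
        if i_over_sigma >= threshold:
            conventional = d_min
        else:
            break
    if conventional is None:
        raise ValueError("no shell reaches the I/sigma(I) threshold")
    return conventional, round(conventional - extension, 2)


def r_factor(f_obs, f_calc):
    """Unweighted R = sum| |Fobs| - |Fcalc| | / sum|Fobs| over the given reflections."""
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator


# Illustrative (made-up) shell statistics: (d_min in angstroms, mean I/sigma(I)).
shells = [(3.0, 15.0), (2.5, 8.0), (2.2, 3.5), (2.0, 2.1), (1.9, 1.2), (1.8, 0.7)]
conventional, extended = resolution_cutoff(shells)
```

The same `r_factor` function gives R when applied to the working reflections and R_free when applied to the held-out test set, which is how one would check whether the extended cutoff degrades the refined model.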
The origin of consistent protein structure refinement from structural averaging.

PubMed

Park, Hahnbeom; DiMaio, Frank; Baker, David

2015-06-02

Recent studies have shown that explicit solvent molecular dynamics (MD) simulation followed by structural averaging can consistently improve protein structure models. We find that improvement upon averaging is not limited to explicit water MD simulation, as consistent improvements are also observed for more efficient implicit solvent MD or Monte Carlo minimization simulations. To determine the origin of these improvements, we examine the changes in model accuracy brought about by averaging at the individual residue level. We find that the improvement in model quality from averaging results from the superposition of two effects: a dampening of deviations from the correct structure in the least well modeled regions, and a reinforcement of consistent movements towards the correct structure in better modeled regions. These observations are consistent with an energy landscape model in which the magnitude of the energy gradient toward the native structure decreases with increasing distance from the native state. Copyright © 2015 Elsevier Ltd. All rights reserved.
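One of the two effects described here, the dampening of deviations by averaging, can be seen in a toy calculation. This sketch assumes that all frames are already superimposed on a common reference and that each frame deviates from the correct structure by independent zero-mean noise; real trajectories also carry systematic error that averaging cannot remove. The coordinates, noise level, and frame count are arbitrary.

```python
import math
import random


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two pre-superimposed coordinate lists."""
    squared = sum((x - y) ** 2
                  for atom_a, atom_b in zip(coords_a, coords_b)
                  for x, y in zip(atom_a, atom_b))
    return math.sqrt(squared / len(coords_a))


def average_structure(frames):
    """Per-atom arithmetic mean of the coordinates over all frames."""
    n_frames = len(frames)
    n_atoms = len(frames[0])
    return [tuple(sum(frame[i][k] for frame in frames) / n_frames for k in range(3))
            for i in range(n_atoms)]


rng = random.Random(0)  # fixed seed so the toy example is reproducible

# Hypothetical "native" structure: 20 atoms on a line, 1.5 angstroms apart.
native = [(1.5 * i, 0.0, 0.0) for i in range(20)]

# 50 frames, each the native structure plus Gaussian noise (sigma = 1 angstrom).
frames = [[tuple(c + rng.gauss(0.0, 1.0) for c in atom) for atom in native]
          for _ in range(50)]

mean_frame_rmsd = sum(rmsd(frame, native) for frame in frames) / len(frames)
averaged_rmsd = rmsd(average_structure(frames), native)
```

In this idealized setting the averaged structure is far closer to the reference than a typical single frame, because uncorrelated deviations cancel while any consistent displacement would survive the average.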
Atomic resolution view into the structure–function relationships of the human myelin peripheral membrane protein P2

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ruskamo, Salla; University of Oulu, Oulu; Yadav, Ravi P.

2014-01-01

The structure of the human myelin peripheral membrane protein P2 has been refined at 0.93 Å resolution. In combination with functional experiments in vitro, in vivo and in silico, the fine details of the structure–function relationships in P2 are emerging. P2 is a fatty acid-binding protein expressed in vertebrate peripheral nerve myelin, where it may function in bilayer stacking and lipid transport. P2 binds to phospholipid membranes through its positively charged surface and a hydrophobic tip, and accommodates fatty acids inside its barrel structure. The structure of human P2 refined at the ultrahigh resolution of 0.93 Å allows detailed structural analyses, including the full organization of an internal hydrogen-bonding network. The orientation of the bound fatty-acid carboxyl group is linked to the protonation states of two coordinating arginine residues. An anion-binding site in the portal region is suggested to be relevant for membrane interactions and conformational changes. When bound to membrane multilayers, P2 has a preferred orientation and is stabilized, and the repeat distance indicates a single layer of P2 between membranes. Simulations show the formation of a double bilayer in the presence of P2, and in cultured cells wild-type P2 induces membrane-domain formation. Here, the most accurate structural and functional view to date on P2, a major component of peripheral nerve myelin, is presented, showing how it can interact with two membranes simultaneously while going through conformational changes at its portal region enabling ligand transfer.
Measuring and modeling diffuse scattering in protein X-ray crystallography

PubMed Central

Van Benschoten, Andrew H.; Liu, Lin; Gonzalez, Ana; Brewster, Aaron S.; Sauter, Nicholas K.; Wall, Michael E.

2016-01-01

X-ray diffraction has the potential to provide rich information about the structural dynamics of macromolecules. To realize this potential, both Bragg scattering, which is currently used to derive macromolecular structures, and diffuse scattering, which reports on correlations in charge density variations, must be measured. Until now, measurement of diffuse scattering from protein crystals has been scarce because of the extra effort of collecting diffuse data. Here, we present 3D measurements of diffuse intensity collected from crystals of the enzymes cyclophilin A and trypsin. The measurements were obtained from the same X-ray diffraction images as the Bragg data, using best practices for standard data collection. To model the underlying dynamics in a practical way that could be used during structure refinement, we tested translation–libration–screw (TLS), liquid-like motions (LLM), and coarse-grained normal-modes (NM) models of protein motions. The LLM model provides a global picture of motions and was refined against the diffuse data, whereas the TLS and NM models provide more detailed and distinct descriptions of atom displacements, and only used information from the Bragg data. Whereas different TLS groupings yielded similar Bragg intensities, they yielded different diffuse intensities, none of which agreed well with the data. In contrast, both the LLM and NM models agreed substantially with the diffuse data. These results demonstrate a realistic path to increase the number of diffuse datasets available to the wider biosciences community and indicate that dynamics-inspired NM structural models can simultaneously agree with both Bragg and diffuse scattering. PMID:27035972
Measuring and modeling diffuse scattering in protein X-ray crystallography

DOE PAGES

Van Benschoten, Andrew H.; Liu, Lin; Gonzalez, Ana; ...

2016-03-28

X-ray diffraction has the potential to provide rich information about the structural dynamics of macromolecules. To realize this potential, both Bragg scattering, which is currently used to derive macromolecular structures, and diffuse scattering, which reports on correlations in charge density variations, must be measured. Until now, measurement of diffuse scattering from protein crystals has been scarce because of the extra effort of collecting diffuse data. Here, we present 3D measurements of diffuse intensity collected from crystals of the enzymes cyclophilin A and trypsin. The measurements were obtained from the same X-ray diffraction images as the Bragg data, using best practices for standard data collection. To model the underlying dynamics in a practical way that could be used during structure refinement, we tested translation–libration–screw (TLS), liquid-like motions (LLM), and coarse-grained normal-modes (NM) models of protein motions. The LLM model provides a global picture of motions and was refined against the diffuse data, whereas the TLS and NM models provide more detailed and distinct descriptions of atom displacements, and only used information from the Bragg data. Whereas different TLS groupings yielded similar Bragg intensities, they yielded different diffuse intensities, none of which agreed well with the data. In contrast, both the LLM and NM models agreed substantially with the diffuse data. In conclusion, these results demonstrate a realistic path to increase the number of diffuse datasets available to the wider biosciences community and indicate that dynamics-inspired NM structural models can simultaneously agree with both Bragg and diffuse scattering.
Atomic structure of unligated laccase from Cerrena maxima at 1.76 Å with molecular oxygen and hydrogen peroxide

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhukova, Yu. N., E-mail: amm@ns.crys.ras.ru; Lyashenko, A. V.; Lashkov, A. A.

2010-05-15

The three-dimensional structure of unligated laccase from Cerrena maxima was established by X-ray diffraction at 1.76-Å resolution; R_work = 18.07%, R_free = 21.71%, rmsd of bond lengths, bond angles, and chiral angles are 0.008 Å, 1.19°, and 0.077°, respectively. The coordinate error for the refined structure estimated from the Luzzati plot is 0.195 Å. The maximum average error in the atomic coordinates is 0.047 Å. A total of 99.4% of amino-acid residues of the polypeptide chain are in the most favorable, allowable, and accessible regions of the Ramachandran plot. The three-dimensional structures of the complexes of laccase from C. maxima with molecular oxygen and hydrogen peroxide were determined by the molecular simulation. These data provide insight into the structural aspect of the mechanism of the enzymatic cycle. The structure factors and the refined atomic coordinates were deposited in the Protein Data Bank (PDB-ID code is 3DIV).
Characterization of Member of DUF1888 Protein Family, Self-cleaving and Self-assembling Endopeptidase

PubMed Central

Osipiuk, Jerzy; Mulligan, Rory; Bargassa, Monireh; Hamilton, John E.; Cunningham, Mark A.; Joachimiak, Andrzej

2012-01-01

The crystal structure of SO1698 protein from Shewanella oneidensis was determined by a SAD method and refined to 1.57 Å. The structure is a β sandwich that unexpectedly consists of two polypeptides; the N-terminal fragment includes residues 1–116, and the C-terminal one includes residues 117–125. Electron density also displayed the Lys-98 side chain covalently linked to Asp-116. The putative active site residues involved in self-cleavage were identified; point mutants were produced and characterized structurally and in a biochemical assay. Numerical simulations utilizing molecular dynamics and hybrid quantum/classical calculations suggest a mechanism involving activation of a water molecule coordinated by a catalytic aspartic acid. PMID:22493430
DNA Nanotubes for NMR Structure Determination of Membrane Proteins

PubMed Central

Bellot, Gaëtan; McClintock, Mark A.; Chou, James J; Shih, William M.

2013-01-01

Structure determination of integral membrane proteins by solution NMR represents one of the most important challenges of structural biology. A Residual-Dipolar-Coupling-based refinement approach can be used to solve the structure of membrane proteins up to 40 kDa in size; however, a weak-alignment medium that is detergent-resistant is required. Previously, availability of media suitable for weak alignment of membrane proteins was severely limited. We describe here a protocol for robust, large-scale synthesis of detergent-resistant DNA nanotubes that can be assembled into dilute liquid crystals for application as weak-alignment media in solution NMR structure determination of membrane proteins in detergent micelles. The DNA nanotubes are heterodimers of 400-nm-long six-helix bundles each self-assembled from a M13-based p7308 scaffold strand and >170 short oligonucleotide staple strands. Compatibility with proteins bearing considerable positive charge as well as modulation of molecular alignment, towards collection of linearly independent restraints, can be introduced by reducing the negative charge of DNA nanotubes via counter ions and small DNA binding molecules. These detergent-resistant liquid-crystal media offer a number of properties conducive for membrane protein alignment, including high-yield production, thermal stability, buffer compatibility, and structural programmability. Production of sufficient nanotubes for 4–5 NMR experiments can be completed in one week by a single individual. PMID:23518667
Estimation of Uncertainties in the Global Distance Test (GDT_TS) for CASP Models.

PubMed

Li, Wenlin; Schaeffer, R Dustin; Otwinowski, Zbyszek; Grishin, Nick V

2016-01-01

The Critical Assessment of techniques for protein Structure Prediction (or CASP) is a community-wide blind test experiment to reveal the best accomplishments of structure modeling. Assessors have been using the Global Distance Test (GDT_TS) measure to quantify prediction performance since CASP3 in 1998. However, identifying significant score differences between close models is difficult because of the lack of uncertainty estimations for this measure. Here, we utilized the atomic fluctuations caused by structure flexibility to estimate the uncertainty of GDT_TS scores. Structures determined by nuclear magnetic resonance are deposited as ensembles of alternative conformers that reflect the structural flexibility, whereas standard X-ray refinement produces the static structure averaged over time and space for the dynamic ensembles. To recapitulate the structural heterogeneous ensemble in the crystal lattice, we performed time-averaged refinement for X-ray datasets to generate structural ensembles for our GDT_TS uncertainty analysis. Using those generated ensembles, our study demonstrates that the time-averaged refinements produced structure ensembles with better agreement with the experimental datasets than the averaged X-ray structures with B-factors. The uncertainty of the GDT_TS scores, quantified by their standard deviations (SDs), increases for scores lower than 50 and 70, with maximum SDs of 0.3 and 1.23 for X-ray and NMR structures, respectively. We also applied our procedure to the high accuracy version of GDT-based score and produced similar results with slightly higher SDs. To facilitate score comparisons by the community, we developed a user-friendly web server that produces structure ensembles for NMR and X-ray structures and is accessible at http://prodata.swmed.edu/SEnCS. Our work helps to identify the significance of GDT_TS score differences, as well as to provide structure ensembles for estimating SDs of any scores.
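The score and its spread over an ensemble can be sketched as follows. This is a simplified illustration, not the CASP or SEnCS implementation: real GDT_TS searches over many superpositions to maximize the number of residues within each cutoff, whereas here the per-residue Cα distances are assumed to come from one fixed superposition. The distance values are invented; the 1, 2, 4, 8 Å and 0.5, 1, 2, 4 Å cutoff sets are the usual definitions of GDT_TS and its high-accuracy variant.

```python
import statistics

GDT_TS_CUTOFFS = (1.0, 2.0, 4.0, 8.0)   # angstroms
GDT_HA_CUTOFFS = (0.5, 1.0, 2.0, 4.0)   # high-accuracy variant


def gdt_score(distances, cutoffs=GDT_TS_CUTOFFS):
    """Average percentage of residues within each distance cutoff.

    distances: per-residue CA-CA distances (angstroms) between model and
    reference after a single, fixed superposition (a simplifying assumption).
    """
    n_residues = len(distances)
    fractions = [sum(1 for d in distances if d <= cutoff) / n_residues
                 for cutoff in cutoffs]
    return 100.0 * sum(fractions) / len(cutoffs)


def score_uncertainty(distance_sets, cutoffs=GDT_TS_CUTOFFS):
    """Mean and sample standard deviation of the score over an ensemble.

    distance_sets: one list of per-residue distances per ensemble member.
    """
    scores = [gdt_score(distances, cutoffs) for distances in distance_sets]
    return statistics.mean(scores), statistics.stdev(scores)


# Hypothetical model compared against three conformers of a reference ensemble.
ensemble_distances = [
    [0.5, 1.5, 3.0, 9.0],
    [0.6, 1.2, 4.5, 9.0],
    [0.4, 2.1, 3.0, 7.5],
]
mean_score, sd_score = score_uncertainty(ensemble_distances)
```

Two models whose scores differ by less than the standard deviation obtained this way would, under these assumptions, not be distinguishable in quality.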
The three-dimensional structure of diaminopimelate decarboxylase from Mycobacterium tuberculosis reveals a tetrameric enzyme organisation.

PubMed

Weyand, Simone; Kefala, Georgia; Svergun, Dmitri I; Weiss, Manfred S

2009-09-01

The three-dimensional structure of the enzyme diaminopimelate decarboxylase from Mycobacterium tuberculosis has been determined in a new crystal form and refined to a resolution of 2.33 Å. The monoclinic crystals contain one tetramer exhibiting D2 symmetry in the asymmetric unit. The tetramer exhibits a donut-like structure with a hollow interior. All four active sites are accessible only from the interior of the tetrameric assembly. Small-angle X-ray scattering indicates that in solution the predominant oligomeric species of the protein is a dimer, but also that higher oligomers exist at higher protein concentrations. The observed scattering data are best explained by assuming a dimer-tetramer equilibrium with about 7% tetramers present in solution. Consequently, at the elevated protein concentrations in the crowded environment inside the cell the observed tetramer may constitute the biologically relevant functional unit of the enzyme.
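The concentration argument in this abstract follows from simple mass action. The sketch below assumes an ideal two-state equilibrium 2 D ⇌ T with association constant K = [T]/[D]², ignores crowding effects, and uses arbitrary units and a made-up K; it does not attempt to reproduce the reported 7% figure.

```python
import math


def tetramer_mass_fraction(total_dimer_conc, k_assoc):
    """Fraction of protein (by mass) present as tetramer for 2 D <-> T.

    total_dimer_conc: total protein expressed in dimer equivalents,
                      c = [D] + 2 [T]
    k_assoc:          association constant K = [T] / [D]^2 (must be > 0)

    Mass balance gives 2 K [D]^2 + [D] - c = 0, solved for the positive root.
    """
    free_dimer = (-1.0 + math.sqrt(1.0 + 8.0 * k_assoc * total_dimer_conc)) / (4.0 * k_assoc)
    tetramer = k_assoc * free_dimer ** 2
    return 2.0 * tetramer / total_dimer_conc


# Hypothetical association constant of 1 (inverse concentration units).
dilute = tetramer_mass_fraction(0.05, 1.0)
moderate = tetramer_mass_fraction(1.0, 1.0)
concentrated = tetramer_mass_fraction(10.0, 1.0)
```

Because the tetramer term scales with the square of the free dimer concentration, the tetramer fraction rises steadily as the total concentration increases, which is the reasoning behind treating the tetramer as a plausible functional unit at cellular concentrations.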

Purification, crystallization and initial crystallographic characterization of the Ginkgo biloba 11S seed globulin ginnacin

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jin, Tengchuan; Chen, Yu-Wei; Howard, Andrew

2008-07-01

The crystallization of ginnacin, the 11S seed storage protein from G. biloba, is reported. Ginkgo biloba, a well known ‘living fossil’ native to China, is grown worldwide as an ornamental shade plant. Medicinal and nutritional uses of G. biloba in Asia have a long history. However, ginkgo seed proteins have not been well studied at the biochemical and molecular level. In this study, the G. biloba 11S seed storage protein ginnacin was purified by sequential anion-exchange and gel-filtration chromatography. A crystallization screen was performed and well diffracting single crystals were obtained by the vapor-diffusion method. A molecular-replacement structural solution has been obtained. There are six protomers in an asymmetric unit. Structure refinement is currently in progress.
Meigo governs dendrite targeting specificity by modulating Ephrin level and N-glycosylation

PubMed Central

Sekine, Sayaka U; Haraguchi, Shuka; Chao, Kinhong; Kato, Tomoko; Luo, Liqun; Miura, Masayuki; Chihara, Takahiro

2016-01-01

Neural circuit assembly requires precise dendrite and axon targeting. We identified an evolutionarily conserved endoplasmic reticulum (ER) protein, Meigo, from a mosaic genetic screen in Drosophila melanogaster. Meigo was cell-autonomously required in olfactory receptor neurons and projection neurons to target their axons and dendrites to the lateral antennal lobe and to refine projection neuron dendrites into individual glomeruli. Loss of Meigo induced an unfolded protein response and reduced the amount of neuronal cell surface proteins, including Ephrin. Ephrin overexpression specifically suppressed the projection neuron dendrite refinement defect present in meigo mutant flies, and ephrin knockdown caused a similar projection neuron dendrite refinement defect. Meigo positively regulated the level of Ephrin N-glycosylation, which was required for its optimal function in vivo. Thus, Meigo, an ER-resident protein, governs neuronal targeting specificity by regulating ER folding capacity and protein N-glycosylation. Furthermore, Ephrin appears to be an important substrate that mediates Meigo’s function in refinement of glomerular targeting. PMID:23624514
Adapting Poisson-Boltzmann to the self-consistent mean field theory: Application to protein side-chain modeling

NASA Astrophysics Data System (ADS)

Koehl, Patrice; Orland, Henri; Delarue, Marc

2011-08-01

We present an extension of the self-consistent mean field theory for protein side-chain modeling in which solvation effects are included based on the Poisson-Boltzmann (PB) theory. In this approach, the protein is represented with multiple copies of its side chains. Each copy is assigned a weight that is refined iteratively based on the mean field energy generated by the rest of the protein, until self-consistency is reached. At each cycle, the variational free energy of the multi-copy system is computed; this free energy includes the internal energy of the protein that accounts for vdW and electrostatics interactions and a solvation free energy term that is computed using the PB equation. The method converges in only a few cycles and takes only minutes of central processing unit time on a commodity personal computer. The predicted conformation of each residue is then set to be its copy with the highest weight after convergence. We have tested this method on a database of one hundred highly refined NMR structures to circumvent the problems of crystal packing inherent to x-ray structures. The use of the PB-derived solvation free energy significantly improves prediction accuracy for surface side chains. For example, the prediction accuracies for χ1 for surface cysteine, serine, and threonine residues improve from 68%, 35%, and 43% to 80%, 53%, and 57%, respectively. A comparison with other side-chain prediction algorithms demonstrates that our approach is consistently better in predicting the conformations of exposed side chains.
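The iterative weight refinement described in this abstract can be sketched on a toy system. The code below is a generic self-consistent mean field loop, not the authors' program: the Poisson-Boltzmann solvation term is omitted (any per-copy contribution would simply be added to the self energy), and the two-residue energy table, the value of kT, and the damping factor are assumptions made for illustration.

```python
import math


def mean_field_weights(self_energy, pair_energy, kT=0.6, damping=0.5,
                       tol=1e-6, max_cycles=200):
    """Iterate side-chain copy weights to self-consistency.

    self_energy[i][k]        : energy of copy k of residue i on its own
    pair_energy[(i, j)][k][l]: interaction of copy k of residue i with
                               copy l of residue j, stored for i < j
    Returns (weights, cycles_used).
    """
    n_res = len(self_energy)
    weights = [[1.0 / len(e)] * len(e) for e in self_energy]  # uniform start
    for cycle in range(1, max_cycles + 1):
        new_weights = []
        for i in range(n_res):
            # Mean field energy felt by each copy of residue i.
            field = []
            for k in range(len(self_energy[i])):
                energy = self_energy[i][k]
                for j in range(n_res):
                    if j == i:
                        continue
                    for l, w in enumerate(weights[j]):
                        if i < j:
                            energy += w * pair_energy[(i, j)][k][l]
                        else:
                            energy += w * pair_energy[(j, i)][l][k]
                field.append(energy)
            # Boltzmann weights, shifted by the minimum for numerical stability.
            e_min = min(field)
            boltz = [math.exp(-(e - e_min) / kT) for e in field]
            z = sum(boltz)
            # Damped update to avoid oscillation between cycles.
            new_weights.append([(1.0 - damping) * old + damping * b / z
                                for old, b in zip(weights[i], boltz)])
        change = max(abs(a - b)
                     for old, new in zip(weights, new_weights)
                     for a, b in zip(old, new))
        weights = new_weights
        if change < tol:
            break
    return weights, cycle


# Toy system: two residues, two copies each; copies 0 and 0 attract, all else repel.
self_energy = [[0.0, 0.0], [0.0, 0.0]]
pair_energy = {(0, 1): [[-2.0, 1.0], [1.0, 1.0]]}
weights, cycles = mean_field_weights(self_energy, pair_energy)

# Predicted conformation: the copy with the highest converged weight per residue.
predicted = [w.index(max(w)) for w in weights]
```

Following the abstract, the prediction is read off as the highest-weight copy of each residue once the weights stop changing.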
Predictive Structure and Topology of Peroxisomal ATP-Binding Cassette (ABC) Transporters

PubMed Central

Andreoletti, Pierre; Raas, Quentin; Gondcaille, Catherine; Cherkaoui-Malki, Mustapha; Trompier, Doriane; Savary, Stéphane

2017-01-01

The peroxisomal ATP-binding Cassette (ABC) transporters, which are called ABCD1, ABCD2 and ABCD3, are transmembrane proteins involved in the transport of various lipids that allow their degradation inside the organelle. Defective ABCD1 leads to the accumulation of very long-chain fatty acids and is associated with a complex and severe neurodegenerative disorder called X-linked adrenoleukodystrophy (X-ALD). Although the nucleotide-binding domain is highly conserved and characterized within the ABC transporters family, solid data are missing for the transmembrane domain (TMD) of ABCD proteins. The lack of a clear consensus on the secondary and tertiary structure of the TMDs weakens any structure-function hypothesis based on the very diverse ABCD1 mutations found in X-ALD patients. Therefore, we first reinvestigated thoroughly the structure-function data available and performed refined alignments of ABCD protein sequences. Based on the 2.85 Å resolution crystal structure of the mitochondrial ABC transporter ABCB10, here we propose a structural model of peroxisomal ABCD proteins that specifies the position of the transmembrane and coupling helices, and highlight functional motifs and putative important amino acid residues. PMID:28737695
Refinement of Generalized Born Implicit Solvation Parameters for Nucleic Acids and their Complexes with Proteins

PubMed Central

Nguyen, Hai; Pérez, Alberto; Bermeo, Sherry; Simmerling, Carlos

2016-01-01

The Generalized Born (GB) implicit solvent model has undergone significant improvements in accuracy for modeling of proteins and small molecules. However, GB still remains a less widely explored option for nucleic acid simulations, in part because fast GB models are often unable to maintain stable nucleic acid structures, or they introduce structural bias in proteins, leading to difficulty in application of GB models in simulations of protein-nucleic acid complexes. Recently, GB-neck2 was developed to improve the behavior of protein simulations. In an effort to create a more accurate model for nucleic acids, a similar procedure to the development of GB-neck2 is described here for nucleic acids. The resulting parameter set significantly reduces absolute and relative energy error relative to Poisson Boltzmann for both nucleic acids and nucleic acid-protein complexes, when compared to its predecessor GB-neck model. This improvement in solvation energy calculation translates to increased structural stability for simulations of DNA and RNA duplexes, quadruplexes, and protein-nucleic acid complexes. The GB-neck2 model also enables successful folding of small DNA and RNA hairpins to near native structures as determined from comparison with experiment. The functional form and all required parameters are provided here and also implemented in the AMBER software. PMID:26574454
Mechanisms of amyloid formation revealed by solution NMR

PubMed Central

Karamanos, Theodoros K.; Kalverda, Arnout P.; Thompson, Gary S.; Radford, Sheena E.

2015-01-01

Amyloid fibrils are proteinaceous elongated aggregates involved in more than fifty human diseases. Recent advances in electron microscopy and solid state NMR have allowed the characterization of fibril structures to different extents of refinement. However, structural details about the mechanism of fibril formation remain relatively poorly defined. This is mainly due to the complex, heterogeneous and transient nature of the species responsible for assembly; properties that make them difficult to detect and characterize in structural detail using biophysical techniques. The ability of solution NMR spectroscopy to investigate exchange between multiple protein states, to characterize transient and low-population species, and to study high molecular weight assemblies, renders NMR an invaluable technique for studies of amyloid assembly. In this article we review state-of-the-art solution NMR methods for investigations of: (a) protein dynamics that lead to the formation of aggregation-prone species; (b) amyloidogenic intrinsically disordered proteins; and (c) protein–protein interactions on pathway to fibril formation. Together, these topics highlight the power and potential of NMR to provide atomic level information about the molecular mechanisms of one of the most fascinating problems in structural biology. PMID:26282197
Quantum.Ligand.Dock: protein-ligand docking with quantum entanglement refinement on a GPU system.

PubMed

Kantardjiev, Alexander A

2012-07-01

Quantum.Ligand.Dock (protein-ligand docking with graphic processing unit (GPU) quantum entanglement refinement on a GPU system) is an original modern method for in silico prediction of protein-ligand interactions via high-performance docking code. The main flavour of our approach is a combination of fast search with a special account for overlooked physical interactions. On the one hand, we take care of self-consistency and proton equilibria mutual effects of docking partners. On the other hand, Quantum.Ligand.Dock is the only docking server offering such a subtle supplement to protein docking algorithms as quantum entanglement contributions. The motivation for development and proposition of the method to the community hinges upon two arguments: the fundamental importance of quantum entanglement contribution in molecular interaction and the realistic possibility to implement it by the availability of supercomputing power. The implementation of sophisticated quantum methods is made possible by parallelization at several bottlenecks on a GPU supercomputer. The high-performance implementation will be of use for large-scale virtual screening projects, structural bioinformatics, systems biology and fundamental research in understanding protein-ligand recognition. The design of the interface is focused on feasibility and ease of use. Protein and ligand molecule structures are supposed to be submitted as atomic coordinate files in PDB format. A customization section is offered for addition of user-specified charges, extra ionogenic groups with intrinsic pK(a) values or fixed ions. Final predicted complexes are ranked according to obtained scores and provided in PDB format as well as interactive visualization in a molecular viewer. Quantum.Ligand.Dock server can be accessed at http://87.116.85.141/LigandDock.html.
NMR data-driven structure determination using NMR-I-TASSER in the CASD-NMR experiment

PubMed Central

Jang, Richard; Wang, Yan

2015-01-01

NMR-I-TASSER, an adaptation of the I-TASSER algorithm combining NMR data for protein structure determination, recently joined the second round of the CASD-NMR experiment. Unlike many molecular dynamics-based methods, NMR-I-TASSER takes a molecular replacement-like approach to the problem by first threading the target through the PDB to identify structural templates which are then used for iterative NOE assignments and fragment structure assembly refinements. The employment of multiple templates allows NMR-I-TASSER to sample different topologies while convergence to a single structure is not required. Retroactive and blind tests of the CASD-NMR targets from Rounds 1 and 2 demonstrate that even without using NOE peak lists I-TASSER can generate correct structure topology with 15 of 20 targets having a TM-score above 0.5. With the addition of NOE-based distance restraints, NMR-I-TASSER significantly improved the I-TASSER models with all models having a TM-score above 0.5. The average RMSD was reduced from 5.29 to 2.14 Å in Round 1 and 3.18 to 1.71 Å in Round 2. There is no obvious difference in the modeling results between using raw and refined peak lists, indicating robustness of the pipeline to NOE assignment errors. Overall, despite the low-resolution modeling the current NMR-I-TASSER pipeline provides a coarse-grained structure folding approach complementary to traditional molecular dynamics simulations, which can produce fast near-native frameworks for atomic-level structural refinement. PMID:25737244
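For readers unfamiliar with the TM-score threshold of 0.5 quoted in this abstract, the sketch below evaluates the standard TM-score sum for one fixed superposition. It is a simplified illustration, not the I-TASSER code: the function name is hypothetical, the per-residue distances are assumed to be supplied already, and the real TM-score additionally maximizes this sum over superpositions.

```python
def tm_score_from_distances(distances, target_length):
    """TM-score sum for one given superposition.

    distances     : distances (in Angstrom) between aligned residue pairs
    target_length : number of residues in the target (native) structure
    """
    # Length-dependent distance scale d0 = 1.24 * (L - 15)^(1/3) - 1.8.
    if target_length > 15:
        d0 = 1.24 * (target_length - 15) ** (1.0 / 3.0) - 1.8
    else:
        d0 = 0.5
    # Guard for short chains, where the formula gives very small values.
    d0 = max(d0, 0.5)
    return sum(1.0 / (1.0 + (d / d0) ** 2) for d in distances) / target_length
```

Because the sum is divided by the target length rather than the number of aligned pairs, unaligned residues lower the score, which is why a model covering only half of the target with perfect agreement scores 0.5.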
Large-scale model quality assessment for improving protein tertiary structure prediction.

PubMed

Cao, Renzhi; Bhattacharya, Debswapna; Adhikari, Badri; Li, Jilong; Cheng, Jianlin

2015-06-15

Sampling structural models and ranking them are the two major challenges of protein structure prediction. Traditional protein structure prediction methods generally use one or a few quality assessment (QA) methods to select the best-predicted models, which cannot consistently select relatively better models and rank a large number of models well. Here, we develop a novel large-scale model QA method in conjunction with model clustering to rank and select protein structural models. It unprecedentedly applied 14 model QA methods to generate consensus model rankings, followed by model refinement based on model combination (i.e. averaging). Our experiment demonstrates that the large-scale model QA approach is more consistent and robust in selecting models of better quality than any individual QA method. Our method was blindly tested during the 11th Critical Assessment of Techniques for Protein Structure Prediction (CASP11) as MULTICOM group. It was officially ranked third out of all 143 human and server predictors according to the total scores of the first models predicted for 78 CASP11 protein domains and second according to the total scores of the best of the five models predicted for these domains. MULTICOM's outstanding performance in the extremely competitive 2014 CASP11 experiment proves that our large-scale QA approach together with model clustering is a promising solution to one of the two major problems in protein structure modeling. The web server is available at: http://sysbio.rnet.missouri.edu/multicom_cluster/human/. © The Author 2015. Published by Oxford University Press.
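The idea of building a consensus ranking from many QA methods can be illustrated with a small Python sketch. This is only one simple way to combine rankings (averaging per-method ranks) and is an assumption made for illustration; the abstract does not specify how MULTICOM combines its 14 QA methods, and the function name and score layout are hypothetical.

```python
def consensus_ranking(scores):
    """Combine several QA methods by averaging their per-model ranks.

    scores[k][m] is the score method k gives to model m (higher is better).
    Returns (consensus_order, mean_ranks), where consensus_order lists model
    indices from best to worst.
    """
    n_models = len(scores[0])
    rank_sums = [0.0] * n_models
    for method_scores in scores:
        # Rank 0 is the best model according to this method.
        order = sorted(range(n_models), key=lambda m: -method_scores[m])
        for rank, model in enumerate(order):
            rank_sums[model] += rank
    mean_ranks = [s / len(scores) for s in rank_sums]
    consensus_order = sorted(range(n_models), key=lambda m: mean_ranks[m])
    return consensus_order, mean_ranks
```

Rank averaging is used here because it is insensitive to the different score scales of individual QA methods; other combination rules (for example, averaging normalized scores) would be equally plausible.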
Experimentally observed conformation-dependent geometry and hidden strain in proteins.

PubMed Central

Karplus, P. A.

1996-01-01

A database has been compiled documenting the peptide conformations and geometries from 70 diverse proteins refined at 1.75 Å or better. Analysis of the well-ordered residues within the database shows φ,ψ distributions that have more fine structure than is generally observed. Also, clear evidence is presented that the peptide covalent geometry depends on conformation, with the interpeptide N-Cα-C bond angle varying by nearly ±5° from its standard value. The observed deviations from standard peptide geometry are greatest near the edges of well-populated regions, consistent with strain occurring in these conformations. Minimization of such hidden strain could be an important factor in thermostability of proteins. These empirical data describing how equilibrium peptide geometry varies as a function of conformation confirm and extend quantum mechanics calculations, and have predictive value that will aid both theoretical and experimental analyses of protein structure. PMID:8819173
Only Five of 10 Strictly Conserved Disulfide Bonds Are Essential for Folding and Eight for Function of the HIV-1 Envelope Glycoprotein

PubMed Central

van Anken, Eelco; Sanders, Rogier W.; Liscaljet, I. Marije; Land, Aafke; Bontjer, Ilja; Tillemans, Sonja; Nabatov, Alexey A.; Paxton, William A.; Berkhout, Ben

2008-01-01

Protein folding in the endoplasmic reticulum goes hand in hand with disulfide bond formation, and disulfide bonds are considered key structural elements for a protein's folding and function. We used the HIV-1 Envelope glycoprotein to examine in detail the importance of its 10 completely conserved disulfide bonds. We systematically mutated the cysteines in its ectodomain, assayed the mutants for oxidative folding, transport, and incorporation into the virus, and tested fitness of mutant viruses. We found that the protein was remarkably tolerant toward manipulation of its disulfide-bonded structure. Five of 10 disulfide bonds were dispensable for folding. Two of these were even expendable for viral replication in cell culture, indicating that the relevance of these disulfide bonds becomes manifest only during natural infection. Our findings refine old paradigms on the importance of disulfide bonds for proteins. PMID:18653472
Targeting Common but Complex Proteoglycans on Breast Cancer Cells and Stem Cells Using Evolutionary Refined Malaria Proteins

DTIC Science & Technology

2014-09-01

protein VAR2CSA. We have extensive data demonstrating that this protein specifically targets sulfated chondroitin sulfate A proteoglycans present on all... chondroitin sulfate A on circulating tumor cells using a evolutionary refined malaria protein B) National Annual PhD meeting in Oncology, March 26-27...malaria protein VAR2CSA when sulfated on carbon 4 of the CS backbone. We have identified CSPG4 as a major protein in breast cancer cells, but also a
Cholesterol oxidase: ultrahigh-resolution crystal structure and multipolar atom model-based analysis.

PubMed

Zarychta, Bartosz; Lyubimov, Artem; Ahmed, Maqsood; Munshi, Parthapratim; Guillot, Benoît; Vrielink, Alice; Jelsch, Christian

2015-04-01

Examination of protein structure at the subatomic level is required to improve the understanding of enzymatic function. For this purpose, X-ray diffraction data have been collected at 100 K from cholesterol oxidase crystals using synchrotron radiation to an optical resolution of 0.94 Å. After refinement using the spherical atom model, nonmodelled bonding peaks were detected in the Fourier residual electron density on some of the individual bonds. Well defined bond density was observed in the peptide plane after averaging maps on the residues with the lowest thermal motion. The multipolar electron density of the protein-cofactor complex was modelled by transfer of the ELMAM2 charge-density database, and the topology of the intermolecular interactions between the protein and the flavin adenine dinucleotide (FAD) cofactor was subsequently investigated. Taking advantage of the high resolution of the structure, the stereochemistry of main-chain bond lengths and of C=O···H-N hydrogen bonds was analyzed with respect to the different secondary-structure elements.
Protein secondary structure determination by constrained single-particle cryo-electron tomography.

PubMed

Bartesaghi, Alberto; Lecumberry, Federico; Sapiro, Guillermo; Subramaniam, Sriram

2012-12-05

Cryo-electron microscopy (cryo-EM) is a powerful technique for 3D structure determination of protein complexes by averaging information from individual molecular images. The resolutions that can be achieved with single-particle cryo-EM are frequently limited by inaccuracies in assigning molecular orientations based solely on 2D projection images. Tomographic data collection schemes, however, provide powerful constraints that can be used to more accurately determine molecular orientations necessary for 3D reconstruction. Here, we propose "constrained single-particle tomography" as a general strategy for 3D structure determination in cryo-EM. A key component of our approach is the effective use of images recorded in tilt series to extract high-resolution information and correct for the contrast transfer function. By incorporating geometric constraints into the refinement to improve orientational accuracy of images, we reduce model bias and overrefinement artifacts and demonstrate that protein structures can be determined at resolutions of ∼8 Å starting from low-dose tomographic tilt series. Copyright © 2012 Elsevier Ltd. All rights reserved.
Automated side-chain model building and sequence assignment by template matching.

PubMed

Terwilliger, Thomas C

2003-01-01

An algorithm is described for automated building of side chains in an electron-density map once a main-chain model is built and for alignment of the protein sequence to the map. The procedure is based on a comparison of electron density at the expected side-chain positions with electron-density templates. The templates are constructed from average amino-acid side-chain densities in 574 refined protein structures. For each contiguous segment of main chain, a matrix with entries corresponding to an estimate of the probability that each of the 20 amino acids is located at each position of the main-chain model is obtained. The probability that this segment corresponds to each possible alignment with the sequence of the protein is estimated using a Bayesian approach and high-confidence matches are kept. Once side-chain identities are determined, the most probable rotamer for each side chain is built into the model. The automated procedure has been implemented in the RESOLVE software. Combined with automated main-chain model building, the procedure produces a preliminary model suitable for refinement and extension by an experienced crystallographer.
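The step in which a per-position amino-acid probability matrix is turned into probabilities for each possible alignment of a segment with the sequence can be sketched as follows. This is a simplified illustration, not the RESOLVE algorithm: it assumes a uniform prior over offsets, treats each matrix entry as an independent per-position likelihood, requires all entries to be positive, and uses hypothetical names throughout.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def alignment_posteriors(prob_matrix, sequence):
    """Posterior probability of each offset of a segment along a sequence.

    prob_matrix[pos][k] : estimated probability that amino acid
                          AMINO_ACIDS[k] occupies segment position pos
                          (all entries assumed > 0)
    sequence            : protein sequence as a one-letter string
    Returns a list with one posterior per possible offset.
    """
    seg_len = len(prob_matrix)
    n_offsets = len(sequence) - seg_len + 1
    if n_offsets < 1:
        raise ValueError("segment is longer than the sequence")
    log_like = []
    for offset in range(n_offsets):
        total = 0.0
        for pos in range(seg_len):
            aa_index = AMINO_ACIDS.index(sequence[offset + pos])
            total += math.log(prob_matrix[pos][aa_index])
        log_like.append(total)
    # Normalize in log space; a uniform prior over offsets cancels out.
    best = max(log_like)
    unnormalized = [math.exp(v - best) for v in log_like]
    norm = sum(unnormalized)
    return [u / norm for u in unnormalized]
```

A high-confidence match in the sense of the abstract would correspond to one offset receiving a posterior close to 1; the cutoff used to accept a match is not given in the abstract and would have to be chosen separately.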
Immunohistochemical Analysis in the Rat Central Nervous System and Peripheral Lymph Node Tissue Sections.

PubMed

Adzemovic, Milena Z; Zeitelhofer, Manuel; Leisser, Marianne; Köck, Ulricke; Kury, Angela; Olsson, Tomas

2016-11-14

Immunohistochemistry (IHC) provides highly specific, reliable and attractive protein visualization. Correct performance and interpretation of an IHC-based multicolor labeling is challenging, especially when utilized for assessing interrelations between target proteins in tissue with a high fat content such as the central nervous system (CNS). Our protocol represents a refinement of the standard immunolabeling technique particularly adjusted for detection of both structural and soluble proteins in the rat CNS and peripheral lymph nodes (LN) affected by neuroinflammation. Nonetheless, with or without further modifications, our protocol could likely be used for detection of other related protein targets, even in organs and species other than those presented here.
Crosslinking Constraints and Computational Models as Complementary Tools in Modeling the Extracellular Domain of the Glycine Receptor

PubMed Central

Liu, Zhenyu; Szarecka, Agnieszka; Yonkunas, Michael; Speranskiy, Kirill; Kurnikova, Maria; Cascio, Michael

2014-01-01

The glycine receptor (GlyR), a member of the pentameric ligand-gated ion channel superfamily, is the major inhibitory neurotransmitter-gated receptor in the spinal cord and brainstem. In these receptors, the extracellular domain binds agonists, antagonists and various other modulatory ligands that act allosterically to modulate receptor function. The structures of homologous receptors and binding proteins provide templates for modeling of the ligand-binding domain of GlyR, but limitations in sequence homology and structure resolution impact on modeling studies. The determination of distance constraints via chemical crosslinking studies coupled with mass spectrometry can provide additional structural information to aid in model refinement; however, it is critical to be able to distinguish between intra- and inter-subunit constraints. In this report we model the structure of GlyBP, a structural and functional homolog of the extracellular domain of human homomeric α1 GlyR. We then show that intra- and intersubunit Lys-Lys crosslinks in trypsinized samples of purified monomeric and oligomeric protein bands from SDS-polyacrylamide gels may be identified and differentiated by MALDI-TOF MS studies of limited resolution. Thus, broadly available MS platforms are capable of providing distance constraints that may be utilized in characterizing large complexes that may be less amenable to NMR and crystallographic studies. Systematic studies of state-dependent chemical crosslinking and mass spectrometric identification of crosslinked sites have the potential to complement computational modeling efforts by providing constraints that can validate and refine allosteric models. PMID:25025226
TAP score: torsion angle propensity normalization applied to local protein structure evaluation

PubMed Central

Tosatto, Silvio CE; Battistutta, Roberto

2007-01-01

Background: Experimentally determined protein structures may contain errors and require validation. Conformational criteria based on the Ramachandran plot are mainly used to distinguish between distorted and adequately refined models. While the readily available criteria are sufficient to detect totally wrong structures, establishing the more subtle differences between plausible structures remains more challenging. Results: A new criterion, called TAP score, measuring local sequence to structure fitness based on torsion angle propensities normalized against the global minimum and maximum is introduced. It is shown to be more accurate than previous methods at estimating the validity of a protein model in terms of commonly used experimental quality parameters on two test sets representing the full PDB database and a subset of obsolete PDB structures. Highly selective TAP thresholds are derived to recognize over 90% of the top experimental structures in the absence of experimental information. Both a web server and an executable version of the TAP score are available at . Conclusion: A novel procedure for energy normalization (TAP) has significantly improved the possibility to recognize the best experimental structures. It will allow the user to more reliably isolate problematic structures in the context of automated experimental structure determination. PMID:17504537
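The phrase "normalized against the global minimum and maximum" can be illustrated with a generic min-max normalization. The sketch below is one plausible reading only, not the published TAP definition: the function name, the per-residue inputs, and the way the bounds are summed over the sequence are assumptions made for illustration.

```python
def normalized_propensity_score(observed, minimum, maximum):
    """Min-max normalize a summed per-residue propensity.

    observed[i] : propensity of the torsion angles actually adopted by residue i
    minimum[i]  : lowest propensity available to that residue type (assumed input)
    maximum[i]  : highest propensity available to that residue type (assumed input)
    Returns a value between 0 (worst possible) and 1 (best possible).
    """
    lowest = sum(minimum)
    highest = sum(maximum)
    span = highest - lowest
    if span <= 0:
        raise ValueError("maximum total must exceed minimum total")
    return (sum(observed) - lowest) / span
```

Normalizing against sequence-specific bounds makes scores comparable between proteins of different length and composition, which is the practical motivation for a normalization of this kind.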
DNA nanotubes for NMR structure determination of membrane proteins.

PubMed

Bellot, Gaëtan; McClintock, Mark A; Chou, James J; Shih, William M

2013-04-01

Finding a way to determine the structures of integral membrane proteins using solution nuclear magnetic resonance (NMR) spectroscopy has proved to be challenging. A residual-dipolar-coupling-based refinement approach can be used to resolve the structure of membrane proteins up to 40 kDa in size, but this requires a weak-alignment medium that is detergent-resistant, and it has thus far been difficult to obtain such a medium suitable for weak alignment of membrane proteins. We describe here a protocol for robust, large-scale synthesis of detergent-resistant DNA nanotubes that can be assembled into dilute liquid crystals for application as weak-alignment media in solution NMR structure determination of membrane proteins in detergent micelles. The DNA nanotubes are heterodimers of 400-nm-long six-helix bundles, each self-assembled from an M13-based p7308 scaffold strand and >170 short oligonucleotide staple strands. Compatibility with proteins bearing considerable positive charge as well as modulation of molecular alignment, toward collection of linearly independent restraints, can be introduced by reducing the negative charge of DNA nanotubes using counter ions and small DNA-binding molecules. This detergent-resistant liquid-crystal medium offers a number of properties conducive to membrane protein alignment, including high-yield production, thermal stability, buffer compatibility and structural programmability. Production of sufficient nanotubes for four or five NMR experiments can be completed in 1 week by a single individual.
Using Local States To Drive the Sampling of Global Conformations in Proteins

PubMed Central

2016-01-01

Conformational changes associated with protein function often occur beyond the time scale currently accessible to unbiased molecular dynamics (MD) simulations, so that different approaches have been developed to accelerate their sampling. Here we investigate how the knowledge of backbone conformations preferentially adopted by protein fragments, as contained in precalculated libraries known as structural alphabets (SA), can be used to explore the landscape of protein conformations in MD simulations. We find that (a) enhancing the sampling of native local states in both metadynamics and steered MD simulations allows the recovery of global folded states in small proteins; (b) folded states can still be recovered when the amount of information on the native local states is reduced by using a low-resolution version of the SA, where states are clustered into macrostates; and (c) sequences of SA states derived from collections of structural motifs can be used to sample alternative conformations of preselected protein regions. The present findings have potential impact on several applications, ranging from protein model refinement to protein folding and design. PMID:26808351

Using Local States To Drive the Sampling of Global Conformations in Proteins.

PubMed

Pandini, Alessandro; Fornili, Arianna

2016-03-08

Conformational changes associated with protein function often occur beyond the time scale currently accessible to unbiased molecular dynamics (MD) simulations, so that different approaches have been developed to accelerate their sampling. Here we investigate how the knowledge of backbone conformations preferentially adopted by protein fragments, as contained in precalculated libraries known as structural alphabets (SA), can be used to explore the landscape of protein conformations in MD simulations. We find that (a) enhancing the sampling of native local states in both metadynamics and steered MD simulations allows the recovery of global folded states in small proteins; (b) folded states can still be recovered when the amount of information on the native local states is reduced by using a low-resolution version of the SA, where states are clustered into macrostates; and (c) sequences of SA states derived from collections of structural motifs can be used to sample alternative conformations of preselected protein regions. The present findings have potential impact on several applications, ranging from protein model refinement to protein folding and design.
Pharmacophore modeling, docking, and principal component analysis based clustering: combined computer-assisted approaches to identify new inhibitors of the human rhinovirus coat protein.

PubMed

Steindl, Theodora M; Crump, Carolyn E; Hayden, Frederick G; Langer, Thierry

2005-10-06

The development and application of a sophisticated virtual screening and selection protocol to identify potential, novel inhibitors of the human rhinovirus coat protein employing various computer-assisted strategies are described. A large commercially available database of compounds was screened using a highly selective, structure-based pharmacophore model generated with the program Catalyst. A docking study and a principal component analysis were carried out within the software package Cerius and served to validate and further refine the obtained results. These combined efforts led to the selection of six candidate structures, for which in vitro anti-rhinoviral activity could be shown in a biological assay.
Protein Data Bank depositions from synchrotron sources.

PubMed

Jiang, Jiansheng; Sweet, Robert M

2004-07-01

A survey and analysis of Protein Data Bank (PDB) depositions from international synchrotron radiation facilities, based on the latest released PDB entries, are reported. The results (http://asdp.bnl.gov/asda/Libraries/) show that worldwide, every year since 1999, more than 50% of the deposited X-ray structures have used synchrotron facilities, reaching 75% by 2003. In this web-based database, all PDB entries among individual synchrotron beamlines are archived, synchronized with the weekly PDB release. Statistics regarding the quality of experimental data and the refined model for all structures are presented, and these are analysed to reflect the impact of synchrotron sources. The results confirm the common impression that synchrotron sources extend the size of structures that can be solved with equivalent or better quality than home sources.
Combining functional and structural genomics to sample the essential Burkholderia structome.

PubMed

Baugh, Loren; Gallagher, Larry A; Patrapuvich, Rapatbhorn; Clifton, Matthew C; Gardberg, Anna S; Edwards, Thomas E; Armour, Brianna; Begley, Darren W; Dieterich, Shellie H; Dranow, David M; Abendroth, Jan; Fairman, James W; Fox, David; Staker, Bart L; Phan, Isabelle; Gillespie, Angela; Choi, Ryan; Nakazawa-Hewitt, Steve; Nguyen, Mary Trang; Napuli, Alberto; Barrett, Lynn; Buchko, Garry W; Stacy, Robin; Myler, Peter J; Stewart, Lance J; Manoil, Colin; Van Voorhis, Wesley C

2013-01-01

The genus Burkholderia includes pathogenic gram-negative bacteria that cause melioidosis, glanders, and pulmonary infections of patients with cancer and cystic fibrosis. Drug resistance has made development of new antimicrobials critical. Many approaches to discovering new antimicrobials, such as structure-based drug design and whole cell phenotypic screens followed by lead refinement, require high-resolution structures of proteins essential to the parasite. We experimentally identified 406 putative essential genes in B. thailandensis, a low-virulence species phylogenetically similar to B. pseudomallei, the causative agent of melioidosis, using saturation-level transposon mutagenesis and next-generation sequencing (Tn-seq). We selected 315 protein products of these genes based on structure-determination criteria, such as excluding very large and/or integral membrane proteins, and entered them into the Seattle Structural Genomics Center for Infectious Disease (SSGCID) structure determination pipeline. To maximize structural coverage of these targets, we applied an "ortholog rescue" strategy for those producing insoluble or difficult to crystallize proteins, resulting in the addition of 387 orthologs (or paralogs) from seven other Burkholderia species into the SSGCID pipeline. This structural genomics approach yielded structures from 31 putative essential targets from B. thailandensis, and 25 orthologs from other Burkholderia species, yielding an overall structural coverage for 49 of the 406 essential gene families, with a total of 88 depositions into the Protein Data Bank. Of these, 25 proteins have properties of a potential antimicrobial drug target, i.e., no close human homolog, part of an essential metabolic pathway, and a deep binding pocket. We describe the structures of several potential drug targets in detail. This collection of structures, solubility and experimental essentiality data provides a resource for development of drugs against infections and diseases caused by Burkholderia. All expression clones and proteins created in this study are freely available by request.
Assessing the Chemical Accuracy of Protein Structures via Peptide Acidity

PubMed Central

Anderson, Janet S.; Hernández, Griselda; LeMaster, David M.

2012-01-01

Although the protein native state is a Boltzmann conformational ensemble, practical applications often require a representative model from the most populated region of that distribution. The acidity of the backbone amides, as reflected in hydrogen exchange rates, is exquisitely sensitive to the surrounding charge and dielectric volume distribution. For each of four proteins, three independently determined X-ray structures of differing crystallographic resolution were used to predict exchange for the static solvent-exposed amide hydrogens. The average correlation coefficients range from 0.74 for ubiquitin to 0.93 for Pyrococcus furiosus rubredoxin, reflecting the larger range of experimental exchange rates exhibited by the latter protein. The exchange prediction errors modestly correlate with the crystallographic resolution. MODELLER 9v6-derived homology models at ~60% sequence identity (36% identity for chymotrypsin inhibitor CI2) yielded correlation coefficients that are ~0.1 smaller than for the cognate X-ray structures. The most recently deposited NOE-based ubiquitin structure and the original NMR structure of CI2 fail to provide statistically significant predictions of hydrogen exchange. However, the more recent RECOORD refinement study of CI2 yielded predictions comparable to the X-ray and homology model-based analyses. PMID:23182463
The La-related protein 1-specific domain repurposes HEAT-like repeats to directly bind a 5'TOP sequence.

PubMed

Lahr, Roni M; Mack, Seshat M; Héroux, Annie; Blagden, Sarah P; Bousquet-Antonelli, Cécile; Deragon, Jean-Marc; Berman, Andrea J

2015-09-18

La-related protein 1 (LARP1) regulates the stability of many mRNAs. These include 5'TOPs, mTOR-kinase responsive mRNAs with pyrimidine-rich 5' UTRs, which encode ribosomal proteins and translation factors. We determined that the highly conserved LARP1-specific C-terminal DM15 region of human LARP1 directly binds a 5'TOP sequence. The crystal structure of this DM15 region refined to 1.86 Å resolution has three structurally related and evolutionarily conserved helix-turn-helix modules within each monomer. These motifs resemble HEAT repeats, ubiquitous helical protein-binding structures, but their sequences are inconsistent with consensus sequences of known HEAT modules, suggesting this structure has been repurposed for RNA interactions. A putative mTORC1-recognition sequence sits within a flexible loop C-terminal to these repeats. We also present modelling of pyrimidine-rich single-stranded RNA onto the highly conserved surface of the DM15 region. These studies lay the foundation necessary for proceeding toward a structural mechanism by which LARP1 links mTOR signalling to ribosome biogenesis. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
The La-related protein 1-specific domain repurposes HEAT-like repeats to directly bind a 5'TOP sequence

DOE PAGES

Lahr, Roni M.; Mack, Seshat M.; Heroux, Annie; ...

2015-07-22

La-related protein 1 (LARP1) regulates the stability of many mRNAs. These include 5'TOPs, mTOR-kinase responsive mRNAs with pyrimidine-rich 5' UTRs, which encode ribosomal proteins and translation factors. We determined that the highly conserved LARP1-specific C-terminal DM15 region of human LARP1 directly binds a 5'TOP sequence. The crystal structure of this DM15 region refined to 1.86 Å resolution has three structurally related and evolutionarily conserved helix-turn-helix modules within each monomer. These motifs resemble HEAT repeats, ubiquitous helical protein-binding structures, but their sequences are inconsistent with consensus sequences of known HEAT modules, suggesting this structure has been repurposed for RNA interactions. A putative mTORC1-recognition sequence sits within a flexible loop C-terminal to these repeats. We also present modelling of pyrimidine-rich single-stranded RNA onto the highly conserved surface of the DM15 region. Ultimately, these studies lay the foundation necessary for proceeding toward a structural mechanism by which LARP1 links mTOR signalling to ribosome biogenesis.
FRODOCK 2.0: fast protein-protein docking server.

PubMed

Ramírez-Aportela, Erney; López-Blanco, José Ramón; Chacón, Pablo

2016-08-01

The prediction of protein-protein complexes from the structures of unbound components is a challenging and powerful strategy to decipher the mechanism of many essential biological processes. We present a user-friendly protein-protein docking server based on an improved version of FRODOCK that includes a complementary knowledge-based potential. The web interface provides a very effective tool to explore and select protein-protein models and interactively screen them against experimental distance constraints. The competitive success rates and efficiency achieved allow the retrieval of reliable potential protein-protein binding conformations that can be further refined with more computationally demanding strategies. The server is free and open to all users with no login requirement at http://frodock.chaconlab.org. Contact: pablo@chaconlab.org. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Strategies for carbohydrate model building, refinement and validation

PubMed Central

2017-01-01

Sugars are the most stereochemically intricate family of biomolecules and present substantial challenges to anyone trying to understand their nomenclature, reactions or branched structures. Current crystallographic programs provide an abstraction layer allowing inexpert structural biologists to build complete protein or nucleic acid model components automatically either from scratch or with little manual intervention. This is, however, still not generally true for sugars. The need for carbohydrate-specific building and validation tools has been highlighted a number of times in the past, concomitantly with the introduction of a new generation of experimental methods that have been ramping up the production of protein–sugar complexes and glycoproteins for the past decade. While some incipient advances have been made to address these demands, correctly modelling and refining carbohydrates remains a challenge. This article will address many of the typical difficulties that a structural biologist may face when dealing with carbohydrates, with an emphasis on problem solving in the resolution range where X-ray crystallography and cryo-electron microscopy are expected to overlap in the next decade. PMID:28177313
Insights from molecular modeling and dynamics simulation of pathogen resistance (R) protein from brinjal.

PubMed

Shrivastava, Dipty; Nain, Vikrant; Sahi, Shakti; Verma, Anju; Sharma, Priyanka; Sharma, Prakash Chand; Kumar, Polumetla Ananda

2011-01-22

Resistance (R) proteins recognize molecular signatures of pathogen infection and activate downstream hypersensitive response signalling in plants. R proteins work as molecular switches for pathogen defence signalling and represent one of the largest plant gene families. Hence, understanding the molecular structure and function of R proteins has been of paramount importance for plant biologists. The present study aims to predict the structure of the R protein signalling domains (CC-NBS) by creating a homology model, refining and optimising the model by molecular dynamics simulation, and comparing ADP and ATP binding. Based on sequence similarity with proteins of known structure, the CC-NBS domains were initially modelled using CED-4 (cell death abnormality protein) and APAF-1 (apoptotic protease activating factor) as multiple templates. The final CC-NBS structural model was built and optimized by molecular dynamics simulation for 5 nanoseconds (ns). Docking of ADP and ATP at the active site shows that both ligands bind specifically to the same residues, with a minor difference (1 kcal/mol) in binding energy. The sharing of the binding site by ADP and ATP and the small difference in their binding make CC-NBS suitable for working as a molecular switch. Furthermore, structural superimposition shows that CC-NBS and the CARD (caspase recruitment domain) of CED-4 have a low RMSD of 0.9 Å. The availability of a 3D structural model for both the CC and NBS domains will help in gaining deeper insight into these pathogen defence genes.
Crystal structure of MTCP-1: Implications for role of TCL-1 and MTCP-1 in T cell malignancies

PubMed Central

Fu, Zheng-Qing; Du Bois, Garrett C.; Song, Sherry P.; Kulikovskaya, Irina; Virgilio, Laura; Rothstein, Jay L.; Croce, Carlo M.; Weber, Irene T.; Harrison, Robert W.

1998-01-01

Two related oncogenes, TCL-1 and MTCP-1, are overexpressed in T cell prolymphocytic leukemias as a result of chromosomal rearrangements that involve the translocation of one T cell receptor gene to either chromosome 14q32 or Xq28. The crystal structure of human recombinant MTCP-1 protein has been determined at 2.0 Å resolution by using multiwavelength anomalous dispersion data from selenomethionine-enriched protein and refined to an R factor of 0.21. MTCP-1 folds into a compact eight-stranded β barrel structure with a short helix between the fourth and fifth strands. The topology is unique. The structure of TCL-1 has been predicted by molecular modeling based on 40% amino acid sequence identity with MTCP-1. The identical residues are clustered inside the barrel and on the surface at one side of the barrel. The overall structure of MTCP-1 superficially resembles the structures of proteins in the lipocalin family and calycin superfamily. These proteins have diverse functions, including transport of retinol, fatty acids, chromophores, pheromones, synthesis of prostaglandin, immune modulation, and cell regulation. However, MTCP-1 differs in the topology of the β strands. The structural similarity suggests that MTCP-1 and TCL-1 form a unique family of β barrel proteins that is predicted to bind small hydrophobic ligands and function in cell regulation. PMID:9520380
An asymmetric structure of the Bacillus subtilis replication terminator protein in complex with DNA.

PubMed

Vivian, J P; Porter, C J; Wilce, J A; Wilce, M C J

2007-07-13

In Bacillus subtilis, the termination of DNA replication via polar fork arrest is effected by a specific protein:DNA complex formed between the replication terminator protein (RTP) and DNA terminator sites. We report the crystal structure of a replication terminator protein homologue (RTP.C110S) of B. subtilis in complex with the high affinity component of one of its cognate DNA termination sites, known as the TerI B-site, refined at 2.5 Å resolution. The 21 bp RTP:DNA complex displays marked structural asymmetry in both the homodimeric protein and the DNA. This is in contrast to the previously reported complex formed with a symmetrical TerI B-site homologue. The induced asymmetry is consistent with the complex's solution properties as determined using NMR spectroscopy. Concomitant with this asymmetry is variation in the protein:DNA binding pattern for each of the subunits of the RTP homodimer. It is proposed that the asymmetric "wing" positions, as well as other asymmetrical features of the RTP:DNA complex, are critical for the cooperative binding that underlies the mechanism of polar fork arrest at the complete terminator site.
A General Safety Assessment for Purified Food Ingredients Derived From Biotechnology Crops: Case Study of Brazilian Sugar and Beverages Produced From Insect-Protected Sugarcane.

PubMed

Kennedy, Reese D; Cheavegatti-Gianotto, Adriana; de Oliveira, Wladecir S; Lirette, Ronald P; Hjelle, Jerry J

2018-01-01

Insect-protected sugarcane that expresses Cry1Ab has been developed in Brazil. Analysis of trade information has shown that effectively all the sugarcane-derived Brazilian exports are raw or refined sugar and ethanol. The fact that raw and refined sugar are highly purified food ingredients, with no detectable transgenic protein, provides an interesting case study of a generalized safety assessment approach. In this study, both the theoretical protein intakes and safety assessments of Cry1Ab, Cry1Ac, NPTII, and Bar proteins used in insect-protected biotechnology crops were examined. The potential consumption of these proteins was examined using local market research data of average added sugar intakes in eight diverse and representative Brazilian raw and refined sugar export markets (Brazil, Canada, China, Indonesia, India, Japan, Russia, and the USA). The average sugar intakes, which ranged from 5.1 g of added sugar/person/day (India) to 126 g sugar/person/day (USA), were used to calculate possible human exposure. The theoretical protein intake estimates were carried out for two scenarios: the "Worst-case" scenario assumed that 1 μg of newly-expressed protein is detected/g of raw or refined sugar, and the "Reasonable-case" scenario assumed 1 ng protein/g sugar. The "Worst-case" scenario was based on results of detailed studies of sugarcane processing in Brazil that showed that refined sugar contains less than 1 μg of total plant protein/g refined sugar. The "Reasonable-case" scenario was based on the assumption that the expression levels in stalk of newly-expressed proteins were less than 0.1% of total stalk protein. Using these calculated protein intake values from the consumption of sugar, along with the accepted NOAEL levels of the four representative proteins, we concluded that safety margins for the "Worst-case" scenario ranged from 6.9 × 10⁵ to 5.9 × 10⁷ and for the "Reasonable-case" scenario ranged from 6.9 × 10⁸ to 5.9 × 10¹⁰. These safety margins are very high due to the extremely low possible exposures and the high NOAELs for these non-toxic proteins. This generalized approach to the safety assessment of highly purified food ingredients like sugar illustrates that sugar processed from Brazilian GM varieties is safe for consumption in representative markets globally.
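The exposure arithmetic behind the two scenarios in this abstract can be sketched in a few lines of Python. The sugar intakes and per-gram protein levels are the figures quoted in the abstract; the function and constant names are hypothetical, and the NOAEL values are deliberately left out because the abstract does not state them. The sketch only shows why the two scenarios differ by a factor of 1000, the same factor that separates the reported safety margins.

```python
# Protein content per gram of sugar in the two scenarios (values from the abstract).
WORST_CASE_UG_PER_G = 1.0        # 1 μg protein per g sugar
REASONABLE_CASE_UG_PER_G = 1e-3  # 1 ng protein per g sugar, expressed in μg

# Lowest and highest average added-sugar intakes quoted in the abstract (g/person/day).
SUGAR_INTAKE_G_PER_DAY = {"India": 5.1, "USA": 126.0}


def protein_intake_ug_per_day(sugar_g_per_day, protein_ug_per_g):
    """Theoretical daily protein intake in μg/person/day."""
    return sugar_g_per_day * protein_ug_per_g


def scenario_table():
    """Return {market: (worst_case_ug_per_day, reasonable_case_ug_per_day)}."""
    return {
        market: (
            protein_intake_ug_per_day(grams, WORST_CASE_UG_PER_G),
            protein_intake_ug_per_day(grams, REASONABLE_CASE_UG_PER_G),
        )
        for market, grams in SUGAR_INTAKE_G_PER_DAY.items()
    }


if __name__ == "__main__":
    for market, (worst, reasonable) in scenario_table().items():
        print(f"{market}: worst-case {worst:g} μg/day, "
              f"reasonable-case {reasonable:g} μg/day")
```

A safety margin would then be a NOAEL divided by one of these intakes in matching units; that step is omitted here because it needs values the abstract does not give.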
DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Qibin; Monroe, Matthew E.; Schepmoes, Athena A.

Non-enzymatic glycation of proteins is implicated in diabetes mellitus and its related complications. In this report, we extend our previous development and refinement of proteomics-based methods for the analysis of non-enzymatically glycated proteins to comprehensively identify glycated proteins in normal and diabetic human plasma and erythrocytes. Using immunodepletion, enrichment, and fractionation strategies, we identified 7749 unique glycated peptides, corresponding to 3742 unique glycated proteins. Semi-quantitative comparisons revealed a number of proteins with glycation levels significantly increased in diabetes relative to control samples and that erythrocyte proteins are more extensively glycated than plasma proteins. A glycation motif analysis revealed amino acids that are favored more than others in the protein primary structures in the vicinity of the glycation sites in both sample types. The glycated peptides and corresponding proteins reported here provide a foundation for the potential identification of novel markers for diabetes, glycemia, or diabetic complications.
Sugar-binding and crystallographic studies of an arabinose-binding protein mutant (Met108Leu) that exhibits enhanced affinity and altered specificity.

PubMed

Vermersch, P S; Lemon, D D; Tesmer, J J; Quiocho, F A

1991-07-16

In addition to hydrogen bonds, van der Waals forces contribute to the affinity of protein-carbohydrate interactions. Nonpolar van der Waals contacts in the complexes of the L-arabinose-binding protein (ABP) with monosaccharides have been studied by means of site-directed mutagenesis, equilibrium and rapid kinetic binding techniques, and X-ray crystallography. ABP, a periplasmic transport receptor of Escherichia coli, binds L-arabinose, D-galactose, and D-fucose with preferential affinity in the order of Ara > Gal >> Fuc. Well-refined, high-resolution structures of ABP complexed with the three sugars revealed that the structural differences in the ABP-sugar complexes are localized around C5 of the sugars, where the equatorial H of Ara has been substituted for CH3 (Fuc) or CH2OH (Gal). The side chain of Met108 undergoes a sterically dictated, ligand-specific, conformational change to optimize nonpolar interactions between its methyl group and the sugar. We found that the Met108Leu ABP binds Gal tighter than wild-type ABP binds Ara and exhibits a preference for ligand in the order of Gal >> Fuc > Ara. The differences in affinity can be attributed to differences in the dissociation rates of the ABP-sugar complexes. We have refined at better than 1.7-Å resolution the crystal structures of the Met108Leu ABP complexed with each of the sugars and offer a molecular explanation for the altered binding properties.
Structural evidence for solvent-stabilisation by aspartic acid as a mechanism for halophilic protein stability in high salt concentrations.

PubMed

Lenton, Samuel; Walsh, Danielle L; Rhys, Natasha H; Soper, Alan K; Dougan, Lorna

2016-07-21

Halophilic organisms have adapted to survive in high salt environments, where mesophilic organisms would perish. One of the biggest challenges faced by halophilic proteins is the ability to maintain both the structure and function at molar concentrations of salt. A distinct adaptation of halophilic proteins, compared to mesophilic homologues, is the abundance of aspartic acid on the protein surface. Mutagenesis and crystallographic studies of halophilic proteins suggest an important role for solvent interactions with the surface aspartic acid residues. This interaction, between the regions of the acidic protein surface and the solvent, is thought to maintain a hydration layer around the protein at molar salt concentrations thereby allowing halophilic proteins to retain their functional state. Here we present neutron diffraction data of the monomeric zwitterionic form of aspartic acid solutions at physiological pH in 0.25 M and 2.5 M concentration of potassium chloride, to mimic mesophilic and halophilic-like environmental conditions. We have used isotopic substitution in combination with empirical potential structure refinement to extract atomic-scale information from the data. Our study provides structural insights that support the hypothesis that carboxyl groups on acidic residues bind water more tightly under high salt conditions, in support of the residue-ion interaction model of halophilic protein stabilisation. Furthermore our data show that in the presence of high salt the self-association between the zwitterionic form of aspartic acid molecules is reduced, suggesting a possible mechanism through which protein aggregation is prevented.
The determinants of bond angle variability in protein/peptide backbones: A comprehensive statistical/quantum mechanics analysis.

PubMed

Improta, Roberto; Vitagliano, Luigi; Esposito, Luciana

2015-11-01

The elucidation of the mutual influence between peptide bond geometry and local conformation has important implications for protein structure refinement, validation, and prediction. To gain insights into the structural determinants and the energetic contributions associated with protein/peptide backbone plasticity, we here report an extensive analysis of the variability of the peptide bond angles by combining statistical analyses of protein structures and quantum mechanics calculations on small model peptide systems. Our analyses demonstrate that all the backbone bond angles strongly depend on the peptide conformation and unveil the existence of regular trends as function of ψ and/or φ. The excellent agreement of the quantum mechanics calculations with the statistical surveys of protein structures validates the computational scheme here employed and demonstrates that the valence geometry of protein/peptide backbone is primarily dictated by local interactions. Notably, for the first time we show that the position of the H(α) hydrogen atom, which is an important parameter in NMR structural studies, is also dependent on the local conformation. Most of the trends observed may be satisfactorily explained by invoking steric repulsive interactions; in some specific cases the valence bond variability is also influenced by hydrogen-bond like interactions. Moreover, we can provide a reliable estimate of the energies involved in the interplay between geometry and conformations. © 2015 Wiley Periodicals, Inc.
Deformable complex network for refining low-resolution X-ray structures

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Chong; Wang, Qinghua; Ma, Jianpeng, E-mail: jpma@bcm.edu

2015-10-27

A new refinement algorithm called the deformable complex network that combines a novel angular network-based restraint with a deformable elastic network model in the target function has been developed to aid in structural refinement in macromolecular X-ray crystallography. In macromolecular X-ray crystallography, building more accurate atomic models based on lower resolution experimental diffraction data remains a great challenge. Previous studies have used a deformable elastic network (DEN) model to aid in low-resolution structural refinement. In this study, the development of a new refinement algorithm called the deformable complex network (DCN) is reported that combines a novel angular network-based restraint with the DEN model in the target function. Testing of DCN on a wide range of low-resolution structures demonstrated that it constantly leads to significantly improved structural models as judged by multiple refinement criteria, thus representing a new effective refinement tool for low-resolution structural determination.
CSI 3.0: a web server for identifying secondary and super-secondary structure in proteins using NMR chemical shifts

PubMed Central

Hafsa, Noor E.; Arndt, David; Wishart, David S.

2015-01-01

The Chemical Shift Index or CSI 3.0 (http://csi3.wishartlab.com) is a web server designed to accurately identify the location of secondary and super-secondary structures in protein chains using only nuclear magnetic resonance (NMR) backbone chemical shifts and their corresponding protein sequence data. Unlike earlier versions of CSI, which only identified three types of secondary structure (helix, β-strand and coil), CSI 3.0 now identifies a total of 11 types of secondary and super-secondary structures, including helices, β-strands, coil regions, five common β-turns (type I, II, I′, II′ and VIII), β hairpins as well as interior and edge β-strands. CSI 3.0 accepts experimental NMR chemical shift data in multiple formats (NMR Star 2.1, NMR Star 3.1 and SHIFTY) and generates colorful CSI plots (bar graphs) and secondary/super-secondary structure assignments. The output can be readily used as constraints for structure determination and refinement or the images may be used for presentations and publications. CSI 3.0 uses a pipeline of several well-tested, previously published programs to identify the secondary and super-secondary structures in protein chains. Comparisons with secondary and super-secondary structure assignments made via standard coordinate analysis programs such as DSSP, STRIDE and VADAR on high-resolution protein structures solved by X-ray and NMR show >90% agreement with those made with CSI 3.0. PMID:25979265
Conformationally selective multidimensional chemical shift ranges in proteins from a PACSY database purged using intrinsic quality criteria

PubMed Central

Hong, Mei

2016-01-01

We have determined refined multidimensional chemical shift ranges for intra-residue correlations (13C–13C, 15N–13C, etc.) in proteins, which can be used to gain type-assignment and/or secondary-structure information from experimental NMR spectra. The chemical-shift ranges are the result of a statistical analysis of the PACSY database of >3000 proteins with 3D structures (1,200,207 13C chemical shifts and >3 million chemical shifts in total); these data were originally derived from the Biological Magnetic Resonance Data Bank. Using relatively simple non-parametric statistics to find peak maxima in the distributions of helix, sheet, coil and turn chemical shifts, and without the use of limited “hand-picked” data sets, we show that ~94 % of the 13C NMR data and almost all 15N data are quite accurately referenced and assigned, with smaller standard deviations (0.2 and 0.8 ppm, respectively) than recognized previously. On the other hand, approximately 6 % of the 13C chemical shift data in the PACSY database are shown to be clearly misreferenced, mostly by ca. −2.4 ppm. The removal of the misreferenced data and other outliers by this purging by intrinsic quality criteria (PIQC) allows for reliable identification of secondary maxima in the two-dimensional chemical-shift distributions already pre-separated by secondary structure. We demonstrate that some of these correspond to specific regions in the Ramachandran plot, including left-handed helix dihedral angles, reflect unusual hydrogen bonding, or are due to the influence of a following proline residue. With appropriate smoothing, significantly more tightly defined chemical shift ranges are obtained for each amino acid type in the different secondary structures. These chemical shift ranges, which may be defined at any statistical threshold, can be used for amino-acid type assignment and secondary-structure analysis of chemical shifts from intra-residue cross peaks by inspection or by using a provided command-line Python script (PLUQin), which should be useful in protein structure determination. The refined chemical shift distributions are utilized in a simple quality test (SQAT) that should be applied to new protein NMR data before deposition in a databank, and they could benefit many other chemical-shift based tools. PMID:26787537

Probing the structure of Leishmania major DHFR TS and structure based virtual screening of peptide library for the identification of anti-leishmanial leads.

PubMed

Rajasekaran, Rajalakshmi; Chen, Yi-Ping Phoebe

2012-09-01

Leishmaniasis, a multi-faceted ethereal disease, is considered to be one of the world's major communicable diseases that demands exhaustive research and control measures. The substantial data on these protozoan parasites have not been utilized completely to develop potential therapeutic strategies against leishmaniasis. Dihydrofolate reductase-thymidylate synthase (DHFR-TS) plays a major role in the infective state of the parasite, and hence DHFR-TS-based drugs remain of much interest to researchers working on leishmaniasis. Although crystal structures of DHFR-TS from different species, including Plasmodium falciparum and Trypanosoma cruzi, are available, the experimentally determined structure of the Leishmania major DHFR-TS has not yet been reported in the Protein Data Bank. A high-quality three-dimensional structure of L. major DHFR-TS has been modeled through the homology modeling approach. The carefully refined and energy-minimized structure of the modeled protein was validated using a number of structure validation programs to confirm its structure quality. The modeled protein structure was used in structure-based virtual screening to identify a potential lead structure against DHFR-TS. The lead molecule identified has a binding affinity of 0.51 nM and clearly follows drug-like properties.
Membrane Fusion Proteins as Nanomachines

NASA Astrophysics Data System (ADS)

Tamm, Lukas

2009-03-01

Membrane fusion is key to fertilization, virus infection, and neurotransmission. Specific proteins work like nanomachines to stitch together fluid, yet highly ordered lipid bilayers. The energy gained from large exothermic conformational changes of these proteins is utilized to fuse lipid bilayers that do not fuse spontaneously. Structural studies using x-ray crystallography and NMR spectroscopy have yielded detailed information about architecture and inner workings of these molecular machines. The question now is: how is mechanical energy gained from such protein transformations harnessed to transform membrane topology? To answer this question, we have determined that a boomerang-shaped structure of the influenza fusion peptide is critical to generate a high-energy binding intermediate in the target membrane and to return the "boomerang" to its place of release near the viral membrane for completion of the fusion cycle. In presynaptic exocytosis, receptor and acceptor SNAREs are zippered to form a helical bundle that is arrested shortly before the membrane. Ca binding to interlocked synaptotagmin releases the fusion block. Structural NMR and single molecule fluorescence data are combined to arrive at and further refine this picture.
Crystal structures of the methyltransferase and helicase from the ZIKA 1947 MR766 Uganda strain

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bukrejewska, Malgorzata; Derewenda, Urszula; Radwanska, Malwina

2017-08-15

Two nonstructural proteins encoded by Zika virus strain MR766 RNA, a methyltransferase and a helicase, were crystallized and their structures were solved and refined at 2.10 and 2.01 Å resolution, respectively. The NS5 methyltransferase contains a bound S-adenosyl-L-methionine (SAM) co-substrate. The NS3 helicase is in the apo form. Comparison with published crystal structures of the helicase in the apo, nucleotide-bound and single-stranded RNA (ssRNA)-bound states suggests that binding of ssRNA to the helicase may occur through conformational selection rather than induced fit.
PaLaCe: A Coarse-Grain Protein Model for Studying Mechanical Properties.

PubMed

Pasi, Marco; Lavery, Richard; Ceres, Nicoletta

2013-01-08

We present a coarse-grain protein model PaLaCe (Pasi-Lavery-Ceres) that has been developed principally to allow fast computational studies of protein mechanics and to clarify the links between mechanics and function. PaLaCe uses a two-tier protein representation with one to three pseudoatoms representing each amino acid for the main nonbonded interactions, combined with atomic-scale peptide groups and some side chain atoms to allow the explicit representation of backbone hydrogen bonds and to simplify the treatment of bonded interactions. The PaLaCe force field is composed of physics-based terms, parametrized using Boltzmann inversion of conformational probability distributions derived from a protein structure data set, and iteratively refined to reproduce the experimental distributions. PaLaCe has been implemented in the MMTK simulation package and can be used for energy minimization, normal mode calculations, and molecular or stochastic dynamics. We present simulations with PaLaCe that test its ability to maintain stable structures for folded proteins, reproduce their dynamic fluctuations, and correctly model large-scale, force-induced conformational changes.
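As a rough illustration of the parametrization idea mentioned in this abstract, Boltzmann inversion turns an observed probability distribution P(x) into an effective potential U(x) = −k_B T ln P(x). The sketch below is a generic, minimal version and is not the PaLaCe code: the function names, the histogram input and the 300 K default are assumptions, and the iterative update shown is the textbook iterative-Boltzmann-inversion correction, which the abstract does not confirm as the authors' exact scheme.

```python
import math

K_B = 0.0019872041  # Boltzmann constant in kcal/(mol*K)


def boltzmann_inversion(counts, temperature=300.0):
    """Convert histogram counts into energies U = -kT ln(P / P_max).

    The most populated bin is the zero of energy; empty bins get +inf.
    """
    total = sum(counts)
    probs = [c / total for c in counts]
    p_max = max(probs)
    kt = K_B * temperature
    return [-kt * math.log(p / p_max) if p > 0 else math.inf for p in probs]


def iterative_update(potential, simulated_probs, target_probs, temperature=300.0):
    """One refinement step: U_new = U_old + kT ln(P_simulated / P_target).

    Bins that the simulation over-populates are raised in energy and
    under-populated bins are lowered. Bins where either probability is
    zero are left unchanged in this sketch.
    """
    kt = K_B * temperature
    updated = []
    for u, p_sim, p_ref in zip(potential, simulated_probs, target_probs):
        if p_sim > 0 and p_ref > 0:
            updated.append(u + kt * math.log(p_sim / p_ref))
        else:
            updated.append(u)
    return updated


if __name__ == "__main__":
    # Hypothetical histogram of a pseudo-bond angle from a structure data set.
    energies = boltzmann_inversion([10, 100, 10])
    print([round(e, 3) for e in energies])
```

In practice the histogram would come from a curated structure set and the update would be repeated until simulated and reference distributions agree within a chosen tolerance.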
NONUNIFORM FOURIER TRANSFORMS FOR RIGID-BODY AND MULTI-DIMENSIONAL ROTATIONAL CORRELATIONS

PubMed Central

BAJAJ, CHANDRAJIT; BAUER, BENEDIKT; BETTADAPURA, RADHAKRISHNA; VOLLRATH, ANTJE

2013-01-01

The task of evaluating correlations is central to computational structural biology. The rigid-body correlation problem seeks the rigid-body transformation (R, t), R ∈ SO(3), t ∈ ℝ³ that maximizes the correlation between a pair of input scalar-valued functions representing molecular structures. Exhaustive solutions to the rigid-body correlation problem take advantage of the fast Fourier transform to achieve a speedup either with respect to the sought translation or rotation. We present PFcorr, a new exhaustive solution, based on the non-equispaced SO(3) Fourier transform, to the rigid-body correlation problem; unlike previous solutions, ours achieves a combination of translational and rotational speedups without requiring equispaced grids. PFcorr can be straightforwardly applied to a variety of problems in protein structure prediction and refinement that involve correlations under rigid-body motions of the protein. Additionally, we show how it applies, along with an appropriate flexibility model, to analogs of the above problems in which the flexibility of the protein is relevant. PMID:24379643
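The translational half of this idea rests on the correlation theorem: the correlation of two functions over all shifts equals the inverse Fourier transform of one spectrum times the complex conjugate of the other. The following is a deliberately simplified 1D sketch on an equispaced periodic grid, not PFcorr itself (which works with non-equispaced SO(3) transforms). It uses a naive O(n²) DFT from the standard library in place of a real FFT so that it stays self-contained, and all names are hypothetical.

```python
import cmath


def dft(values, inverse=False):
    """Naive discrete Fourier transform (stand-in for an FFT routine)."""
    n = len(values)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for j, v in enumerate(values):
            acc += v * cmath.exp(sign * 2j * cmath.pi * j * k / n)
        out.append(acc / n if inverse else acc)
    return out


def circular_correlation(f, g):
    """Return c with c[t] = sum_x f[x] * g[(x - t) mod n], for real f and g."""
    spectrum_f = dft(f)
    spectrum_g = dft(g)
    product = [a * b.conjugate() for a, b in zip(spectrum_f, spectrum_g)]
    return [c.real for c in dft(product, inverse=True)]


def best_shift(f, g):
    """Shift t of g that maximizes its overlap with f."""
    corr = circular_correlation(f, g)
    return max(range(len(corr)), key=lambda t: corr[t])


if __name__ == "__main__":
    probe = [0, 1, 2, 1, 0, 0, 0, 0]   # hypothetical 1D density profile
    target = [0, 0, 0, 0, 1, 2, 1, 0]  # the same profile moved by three cells
    print(best_shift(target, probe))
```

With an FFT the cost of scoring every shift drops from O(n²) to O(n log n), which is the speedup that exhaustive translational searches exploit.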
First Protein Crystallization Experiments on The International Space Station: Sweet Success in Space With Thaumatin

NASA Technical Reports Server (NTRS)

Kundrot, Craig E.; Barnes, Cindy L.; Snell, Eddie H.; Achari, Aniruddha; Whitaker, Ann F. (Technical Monitor)

2001-01-01

We determined the room temperature 1.2 Å structure of thaumatin using a crystal grown in the first protein crystallization experiment conducted aboard the International Space Station (ISS). The crystals were grown in the Enhanced Gaseous Nitrogen Dewar (EGN) developed by Alexander McPherson and co-workers. EGN transports frozen solutions contained in tygon tubing in a liquid nitrogen Dewar to ISS where the tubes then thaw. Batch, free interface diffusion (FID), or vapor diffusion crystallization occurs after thawing. EGN was flown to the ISS on STS-106 on September 8, 2000. This was a "risk mitigation" flight that tested EGN performance and the process of conducting experiments on ISS. We focused on how to map a hanging drop crystallization recipe to the EGN FID method. Thaumatin was chosen as the test system. Three series of crystallization recipes were set up. Each series tested different volume ratios of protein-rich solution to precipitant-rich solution. The series differed from each other by fixing either the protein concentration or the amount of protein in the solutions. Upon return of the samples to Earth on October 24 by STS-92, bubbles that spanned the diameter of the tubing were observed in all tubes. Such bubbles interrupt liquid-liquid diffusion and force vapor diffusion equilibration to occur instead. Nonetheless, crystals grew in 9 of 30 tubes. Many large crystals were grown, the largest being 2.0 × 1.1 × 1.0 mm. The largest crystal was used to collect data at room temperature on beamline 7-1 of the Stanford Synchrotron Radiation Source to a maximum resolution of 1.2 Å. The structure was refined anisotropically using SHELX with a data to parameter ratio of 4.5 to give an R factor of 15.8% (R free = 18.2%) for all reflections without generated hydrogens. This refinement is proceeding. Comparisons of this 1.2 Å microgravity structure to previous reports of the thaumatin structure at 1.75 Å and to ground control crystals will be presented.
Bhageerath-H: A homology/ab initio hybrid server for predicting tertiary structures of monomeric soluble proteins

PubMed Central

2014-01-01

Background: The advent of the human genome sequencing project has led to a spurt in the number of protein sequences in the databanks. Success of structure-based drug discovery severely hinges on the availability of structures. Despite significant progress in the area of experimental protein structure determination, the sequence-structure gap is continually widening. Data-driven homology-based computational methods have proved successful in predicting tertiary structures for sequences sharing medium to high sequence similarities. With dwindling similarities of query sequences, advanced homology/ab initio hybrid approaches are being explored to solve the structure prediction problem. Here we describe Bhageerath-H, a homology/ab initio hybrid software/server for predicting protein tertiary structures with advancing drug design attempts as one of the goals. Results: The Bhageerath-H web server was validated on 75 CASP10 targets, which showed TM-scores ≥0.5 in 91% of the cases and Cα RMSDs ≤5 Å from the native in 58% of the targets, which is well above the CASP10 water mark. Comparison with some leading servers demonstrated the uniqueness of the hybrid methodology in effectively sampling conformational space, scoring best decoys and refining low resolution models to high and medium resolution. Conclusion: The Bhageerath-H methodology is web enabled for the scientific community as a freely accessible web server. The methodology is fielded in the on-going CASP11 experiment. PMID:25521245
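The two quality measures quoted in these results, Cα RMSD and TM-score, can be sketched as follows. This is a minimal illustration rather than the evaluation code used by the authors or by CASP: it assumes the model has already been superposed on the native structure and that residues are paired one to one, so it skips the superposition search that a real TM-score program performs. The d0 formula is the standard length-dependent scale, clamped to 0.5 Å for short chains, and the function names are hypothetical.

```python
import math


def ca_rmsd(model, native):
    """Root-mean-square deviation over paired Cα coordinates (in Å)."""
    squared = [math.dist(p, q) ** 2 for p, q in zip(model, native)]
    return math.sqrt(sum(squared) / len(squared))


def tm_score_d0(length):
    """Length-dependent distance scale d0 = 1.24 * (L - 15)^(1/3) - 1.8."""
    if length <= 15:
        return 0.5  # assumption: clamp for very short chains
    return max(1.24 * (length - 15) ** (1.0 / 3.0) - 1.8, 0.5)


def tm_score_fixed_superposition(model, native):
    """TM-score for one given superposition (no optimization over rotations)."""
    length = len(native)
    d0 = tm_score_d0(length)
    terms = [1.0 / (1.0 + (math.dist(p, q) / d0) ** 2)
             for p, q in zip(model, native)]
    return sum(terms) / length


if __name__ == "__main__":
    # Hypothetical 100-residue "native": Cα atoms spaced 3.8 Å apart on a line.
    native = [(3.8 * i, 0.0, 0.0) for i in range(100)]
    model = [(x, y + 3.0, z) for x, y, z in native]  # displaced by 3 Å
    print(round(ca_rmsd(model, native), 2),
          round(tm_score_fixed_superposition(model, native), 2))
```

Because the TM-score terms saturate for large distances, a few badly placed residues lower it far less than they raise the RMSD, which is one reason the two measures are reported together.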
CORAL: aligning conserved core regions across domain families.

PubMed

Fong, Jessica H; Marchler-Bauer, Aron

2009-08-01

Homologous protein families share highly conserved sequence and structure regions that are frequent targets for comparative analysis of related proteins and families. Many protein families, such as the curated domain families in the Conserved Domain Database (CDD), exhibit similar structural cores. To improve accuracy in aligning such protein families, we propose a profile-profile method CORAL that aligns individual core regions as gap-free units. CORAL computes optimal local alignment of two profiles with heuristics to preserve continuity within core regions. We benchmarked its performance on curated domains in CDD, which have pre-defined core regions, against COMPASS, HHalign and PSI-BLAST, using structure superpositions and comprehensive curator-optimized alignments as standards of truth. CORAL improves alignment accuracy on core regions over general profile methods, returning a balanced score of 0.57 for over 80% of all domain families in CDD, compared with the highest balanced score of 0.45 from other methods. Further, CORAL provides E-values to aid in detecting homologous protein families and, by respecting block boundaries, produces alignments with improved 'readability' that facilitate manual refinement. CORAL will be included in future versions of the NCBI Cn3D/CDTree software, which can be downloaded at http://www.ncbi.nlm.nih.gov/Structure/cdtree/cdtree.shtml. Supplementary data are available at Bioinformatics online.
Molecular dynamics study of the structural and dynamic characteristics of the polyextremophilic short-chain dehydrogenase from the Thermococcus sibiricus archaeon and its homologues

NASA Astrophysics Data System (ADS)

Popinako, Anna V.; Antonov, Mikhail Yu.; Bezsudnova, Ekaterina Yu.; Prokopiev, Georgiy A.; Popov, Vladimir O.

2017-11-01

The study of structural adaptations of proteins from polyextremophilic organisms using the computational molecular dynamics method is appealing because the knowledge obtained can be applied to the construction of synthetic proteins with high activity and stability in polyextreme media, which is useful for many industrial applications. To investigate molecular adaptations to high temperature, we have focused on a superthermostable short-chain dehydrogenase TsAdh319 from the Thermococcus sibiricus polyextremophilic archaeon and its closest structural homologues. The molecular dynamics method is widely used for molecular structure refinement, investigation of the motion of biological macromolecules, and, consequently, for interpreting the results of certain biophysical experiments. We performed molecular dynamics simulations of the proteins at different temperatures. Comparison of root mean square fluctuations (RMSF) of the atoms in thermophilic alcohol dehydrogenases (ADHs) at 300 K and 358 K revealed the existence of stable residues at 358 K. These residues surround the active site and form a "nucleus of rigidity" in thermophilic ADHs. The results of our studies suggest that the existence of the "nucleus of rigidity" is crucial for the stability of TsAdh319. Absence of the "nucleus of rigidity" in non-thermally stable proteins causes fluctuations throughout the protein, especially on the surface, triggering the process of denaturation at high temperatures.
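The RMSF comparison described in this record can be sketched as follows. The sketch assumes a trajectory that has already been superposed on a reference frame and is held in memory as a list of frames, each a list of (x, y, z) tuples; this data layout and the function name are assumptions, not details from the paper.

```python
import math

def rmsf(frames):
    """Per-atom root mean square fluctuation about the time-averaged position.

    frames: list of frames; each frame is a list of (x, y, z) tuples, one per atom.
    Frames are assumed to be pre-aligned, so no superposition is done here.
    """
    n_frames = len(frames)
    n_atoms = len(frames[0])
    result = []
    for atom in range(n_atoms):
        # Time-averaged position of this atom.
        mean = [sum(frame[atom][k] for frame in frames) / n_frames for k in range(3)]
        # Mean squared displacement from that average.
        msd = sum(
            sum((frame[atom][k] - mean[k]) ** 2 for k in range(3))
            for frame in frames
        ) / n_frames
        result.append(math.sqrt(msd))
    return result
```

Running this on trajectories at two temperatures and comparing the resulting profiles residue by residue is one way to look for residues that stay rigid at the higher temperature.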
Probing structures of large protein complexes using zero-length cross-linking.

PubMed

Rivera-Santiago, Roland F; Sriswasdi, Sira; Harper, Sandra L; Speicher, David W

2015-11-01

Structural mass spectrometry (MS) is a field with growing applicability for addressing complex biophysical questions regarding proteins and protein complexes. One of the major structural MS approaches involves the use of chemical cross-linking coupled with MS analysis (CX-MS) to identify proximal sites within macromolecules. Identified cross-linked sites can be used to probe novel protein-protein interactions or the derived distance constraints can be used to verify and refine molecular models. This review focuses on recent advances of "zero-length" cross-linking. Zero-length cross-linking reagents do not add any atoms to the cross-linked species due to the lack of a spacer arm. This provides a major advantage in the form of providing more precise distance constraints as the cross-linkable groups must be within salt bridge distances in order to react. However, identification of cross-linked peptides using these reagents presents unique challenges. We discuss recent efforts by our group to minimize these challenges by using multiple cycles of LC-MS/MS analysis and software specifically developed and optimized for identification of zero-length cross-linked peptides. Representative data utilizing our current protocol are presented and discussed. Copyright © 2015 Elsevier Inc. All rights reserved.
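Using cross-links to verify a molecular model amounts to checking each linked residue pair against a distance limit. The sketch below assumes Cα coordinates keyed by residue number and a caller-supplied limit; the abstract gives no numeric cutoff, so the value used in the usage note is a placeholder, not a recommendation.

```python
import math

def violated_crosslinks(ca_coords, crosslinks, max_distance):
    """Return (res_a, res_b, distance) for cross-links the model does not satisfy.

    ca_coords: dict mapping residue number -> (x, y, z) of its C-alpha atom.
    crosslinks: iterable of (res_a, res_b) pairs identified by MS.
    max_distance: largest C-alpha separation (Å) accepted for a linked pair.
    """
    violations = []
    for res_a, res_b in crosslinks:
        distance = math.dist(ca_coords[res_a], ca_coords[res_b])
        if distance > max_distance:
            violations.append((res_a, res_b, distance))
    return violations
```

For example, `violated_crosslinks(coords, links, max_distance=12.0)` lists the pairs a model places too far apart, with 12.0 Å standing in for whatever limit is appropriate for the reagent.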
Predicting the tolerated sequences for proteins and protein interfaces using RosettaBackrub flexible backbone design.

PubMed

Smith, Colin A; Kortemme, Tanja

2011-01-01

Predicting the set of sequences that are tolerated by a protein or protein interface, while maintaining a desired function, is useful for characterizing protein interaction specificity and for computationally designing sequence libraries to engineer proteins with new functions. Here we provide a general method, a detailed set of protocols, and several benchmarks and analyses for estimating tolerated sequences using flexible backbone protein design implemented in the Rosetta molecular modeling software suite. The input to the method is at least one experimentally determined three-dimensional protein structure or high-quality model. The starting structure(s) are expanded or refined into a conformational ensemble using Monte Carlo simulations consisting of backrub backbone and side chain moves in Rosetta. The method then uses a combination of simulated annealing and genetic algorithm optimization methods to enrich for low-energy sequences for the individual members of the ensemble. To emphasize certain functional requirements (e.g. forming a binding interface), interactions between and within parts of the structure (e.g. domains) can be reweighted in the scoring function. Results from each backbone structure are merged together to create a single estimate for the tolerated sequence space. We provide an extensive description of the protocol and its parameters, all source code, example analysis scripts and three tests applying this method to finding sequences predicted to stabilize proteins or protein interfaces. The generality of this method makes many other applications possible, for example stabilizing interactions with small molecules, DNA, or RNA. Through the use of within-domain reweighting and/or multistate design, it may also be possible to use this method to find sequences that stabilize particular protein conformations or binding interactions over others.
Objective identification of residue ranges for the superposition of protein structures

PubMed Central

2011-01-01

Background: The automation of objectively selecting amino acid residue ranges for structure superpositions is important for meaningful and consistent protein structure analyses. So far there is no widely-used standard for choosing these residue ranges for experimentally determined protein structures, where the manual selection of residue ranges or the use of suboptimal criteria remain commonplace. Results: We present an automated and objective method for finding amino acid residue ranges for the superposition and analysis of protein structures, in particular for structure bundles resulting from NMR structure calculations. The method is implemented in an algorithm, CYRANGE, that yields, without protein-specific parameter adjustment, appropriate residue ranges in most commonly occurring situations, including low-precision structure bundles, multi-domain proteins, symmetric multimers, and protein complexes. Residue ranges are chosen to comprise as many residues of a protein domain as possible, such that increasing their number further would lead to a steep rise in the RMSD value. Residue ranges are determined by first clustering residues into domains based on the distance variance matrix, and then refining for each domain the initial choice of residues by excluding residues one by one until the relative decrease of the RMSD value becomes insignificant. A penalty for the opening of gaps favours contiguous residue ranges in order to obtain a result that is as simple as possible, but not simpler. Results are given for a set of 37 proteins and compared with those of commonly used protein structure validation packages. We also provide residue ranges for 6351 NMR structures in the Protein Data Bank. Conclusions: The CYRANGE method is capable of automatically determining residue ranges for the superposition of protein structure bundles for a large variety of protein structures. The method correctly identifies ordered regions. Global structure superpositions based on the CYRANGE residue ranges allow a clear presentation of the structure, and unnecessary small gaps within the selected ranges are absent. In the majority of cases, the residue ranges from CYRANGE contain fewer gaps and cover considerably larger parts of the sequence than those from other methods without significantly increasing the RMSD values. CYRANGE thus provides an objective and automatic method for standardizing the choice of residue ranges for the superposition of protein structures. PMID:21592348
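The distance variance matrix used for the clustering step can be illustrated without any superposition, because inter-residue distances are invariant under rotation and translation. The sketch below assumes one representative atom per residue and uses the population variance; the exact definition and normalization used by CYRANGE may differ.

```python
import math

def distance_variance_matrix(bundle):
    """Variance, across conformers, of every inter-residue distance.

    bundle: list of conformers; each conformer is a list of (x, y, z) tuples,
    one representative atom (for example C-alpha) per residue.
    """
    n_conformers = len(bundle)
    n_residues = len(bundle[0])
    matrix = [[0.0] * n_residues for _ in range(n_residues)]
    for i in range(n_residues):
        for j in range(i + 1, n_residues):
            distances = [math.dist(conf[i], conf[j]) for conf in bundle]
            mean = sum(distances) / n_conformers
            variance = sum((d - mean) ** 2 for d in distances) / n_conformers
            matrix[i][j] = matrix[j][i] = variance
    return matrix
```

Pairs of residues in the same rigid domain give entries near zero, while pairs that move relative to each other across the bundle give large entries, which is what makes the matrix a natural input for clustering residues into domains.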
Replica exchange Monte-Carlo simulations of helix bundle membrane proteins: rotational parameters of helices

NASA Astrophysics Data System (ADS)

Wu, H.-H.; Chen, C.-C.; Chen, C.-M.

2012-03-01

We propose a united-residue model of membrane proteins to investigate the structures of helix bundle membrane proteins (HBMPs) using coarse-grained (CG) replica exchange Monte-Carlo (REMC) simulations. To demonstrate the method, it is used to identify the ground state of HBMPs in a CG model, including bacteriorhodopsin (BR), halorhodopsin (HR), and their subdomains. The rotational parameters of transmembrane helices (TMHs) are extracted directly from the simulations, which can be compared with their experimental measurements from site-directed dichroism. In particular, the effects of amphiphilic interaction among the surfaces of TMHs on the rotational angles of helices are discussed. The proposed CG model gives a reasonably good structure prediction of HBMPs, as well as a clear physical picture for the packing, tilting, orientation, and rotation of TMHs. The root mean square deviation (RMSD) in coordinates of Cα atoms of the ground state CG structure from the X-ray structure is 5.03 Å for BR and 6.70 Å for HR. The final structure of HBMPs is obtained from the all-atom molecular dynamics simulations by refining the predicted CG structure, whose RMSD is 4.38 Å for BR and 5.70 Å for HR.
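The exchange step at the heart of replica exchange Monte Carlo can be sketched with the standard Metropolis swap criterion, in which two replicas at different temperatures trade configurations with probability min(1, exp[(β_i − β_j)(E_i − E_j)]). The abstract does not describe the authors' temperature ladder or energy units, so the reduced units (k_B = 1) below are an assumption.

```python
import math
import random

def swap_probability(energy_i, energy_j, temp_i, temp_j, k_boltzmann=1.0):
    """Metropolis acceptance probability for exchanging two replicas."""
    beta_i = 1.0 / (k_boltzmann * temp_i)
    beta_j = 1.0 / (k_boltzmann * temp_j)
    exponent = (beta_i - beta_j) * (energy_i - energy_j)
    # Avoid overflow: any non-negative exponent means the swap is always accepted.
    return 1.0 if exponent >= 0 else math.exp(exponent)

def attempt_swap(energy_i, energy_j, temp_i, temp_j, rng=random):
    """Return True if the proposed exchange is accepted."""
    return rng.random() < swap_probability(energy_i, energy_j, temp_i, temp_j)
```

A swap that moves the lower-energy configuration to the colder replica is always accepted; the reverse move is accepted only occasionally, which is what lets high-temperature replicas feed new low-energy candidates down the ladder.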
Cellular Signaling Pathways and Posttranslational Modifications Mediated by Nematode Effector Proteins.

PubMed

Hewezi, Tarek

2015-10-01

Plant-parasitic cyst and root-knot nematodes synthesize and secrete a suite of effector proteins into infected host cells and tissues. These effectors are the major virulence determinants mediating the transformation of normal root cells into specialized feeding structures. Compelling evidence indicates that these effectors directly hijack or manipulate refined host physiological processes to promote the successful parasitism of host plants. Here, we provide an update on recent progress in elucidating the molecular functions of nematode effectors. In particular, we emphasize how nematode effectors modify plant cell wall structure, mimic the activity of host proteins, alter auxin signaling, and subvert defense signaling and immune responses. In addition, we discuss the emerging evidence suggesting that nematode effectors target and recruit various components of host posttranslational machinery in order to perturb the host signaling networks required for immunity and to regulate their own activity and subcellular localization. © 2015 American Society of Plant Biologists. All Rights Reserved.
A dynamic structural model of expanded RNA CAG repeats: A refined X-ray structure and computational investigations using molecular dynamics and umbrella sampling simulations

PubMed Central

Yildirim, Ilyas; Park, Hajeung; Disney, Matthew D.; Schatz, George C.

2013-01-01

One class of functionally important RNA is repeating transcripts that cause disease through various mechanisms. For example, expanded r(CAG) repeats can cause Huntington’s and other diseases through translation of toxic proteins. Herein, the crystal structure of r[5ʹUUGGGC(CAG)3GUCC]2, a model of CAG expanded transcripts, refined to 1.65 Å resolution, is disclosed; it shows both anti-anti and syn-anti orientations for 1×1 nucleotide AA internal loops. Molecular dynamics (MD) simulations using the Amber force field in explicit solvent were run for over 500 ns on model systems r(5ʹGCGCAGCGC)2 (MS1) and r(5ʹCCGCAGCGG)2 (MS2). In these MD simulations, both anti-anti and syn-anti AA base pairs appear to be stable. While anti-anti AA base pairs were dynamic and sampled multiple anti-anti conformations, no syn-anti↔anti-anti transformations were observed. Umbrella sampling simulations were run on MS2, and a 2D free energy surface was created to extract transformation pathways. In addition, an explicit solvent MD simulation of over 800 ns was run on r[5ʹGGGC(CAG)3GUCC]2, which closely represents the refined crystal structure. One of the terminal AA base pairs (syn-anti conformation) transformed to the anti-anti conformation. The pathway followed in this transformation was the one predicted by umbrella sampling simulations. Further analysis showed a binding pocket near AA base pairs in syn-anti conformations. Computational results combined with the refined crystal structure show that the global minimum conformation of 1×1 nucleotide AA internal loops in r(CAG) repeats is anti-anti, but that the loops can adopt syn-anti depending on the environment. These results are important for understanding RNA dynamic-function relationships and for developing small molecules that target RNA dynamic ensembles. PMID:23441937
Prediction of β-turns in proteins from multiple alignment using neural network

PubMed Central

Kaur, Harpreet; Raghava, Gajendra Pal Singh

2003-01-01

A neural network-based method has been developed for the prediction of β-turns in proteins by using multiple sequence alignment. Two feed-forward back-propagation networks with a single hidden layer are used where the first-sequence structure network is trained with the multiple sequence alignment in the form of PSI-BLAST–generated position-specific scoring matrices. The initial predictions from the first network and PSIPRED-predicted secondary structure are used as input to the second structure-structure network to refine the predictions obtained from the first net. A significant improvement in prediction accuracy has been achieved by using evolutionary information contained in the multiple sequence alignment. The final network yields an overall prediction accuracy of 75.5% when tested by sevenfold cross-validation on a set of 426 nonhomologous protein chains. The corresponding Qpred, Qobs, and Matthews correlation coefficient values are 49.8%, 72.3%, and 0.43, respectively, and are the best among all the previously published β-turn prediction methods. The Web server BetaTPred2 (http://www.imtech.res.in/raghava/betatpred2/) has been developed based on this approach. PMID:12592033
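The four figures of merit quoted in this record can all be derived from a per-residue confusion matrix. The sketch below uses the usual definitions (Qpred as the percentage of predicted turn residues that are correct, Qobs as the percentage of observed turn residues that are found); these are assumed to match the paper's usage, and the counts in the usage note are invented for illustration.

```python
import math

def turn_prediction_metrics(tp, tn, fp, fn):
    """Accuracy, Qpred, Qobs (all in percent) and Matthews correlation coefficient.

    tp: turn residues predicted as turn      tn: non-turn predicted as non-turn
    fp: non-turn predicted as turn           fn: turn predicted as non-turn
    """
    total = tp + tn + fp + fn
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "accuracy": 100.0 * (tp + tn) / total,
        "q_pred": 100.0 * tp / (tp + fp),
        "q_obs": 100.0 * tp / (tp + fn),
        # MCC is conventionally set to 0 when any marginal total is zero.
        "mcc": (tp * tn - fp * fn) / denominator if denominator else 0.0,
    }
```

Calling `turn_prediction_metrics(tp=50, tn=30, fp=10, fn=10)` gives an accuracy of 80% with an MCC of about 0.58, showing how MCC penalizes errors that plain accuracy hides when the classes are unbalanced.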
Docking and Virtual Screening Strategies for GPCR Drug Discovery.

PubMed

Beuming, Thijs; Lenselink, Bart; Pala, Daniele; McRobb, Fiona; Repasky, Matt; Sherman, Woody

2015-01-01

Progress in structure determination of G protein-coupled receptors (GPCRs) has made it possible to apply structure-based drug design (SBDD) methods to this pharmaceutically important target class. The quality of GPCR structures available for SBDD projects falls on a spectrum ranging from high resolution crystal structures (<2 Å), where all water molecules in the binding pocket are resolved, to lower resolution structures (>3 Å), where some protein residues are not resolved, and finally to homology models that are built using distantly related templates. Each GPCR project involves a distinct set of opportunities and challenges, and requires different approaches to model the interaction between the receptor and the ligands. In this review we will discuss docking and virtual screening to GPCRs, and highlight several refinement and post-processing steps that can be used to improve the accuracy of these calculations. Several examples are discussed that illustrate specific steps that can be taken to improve upon the docking and virtual screening accuracy. While GPCRs are a unique target class, many of the methods and strategies outlined in this review are general and therefore applicable to other protein families.
An automated method for modeling proteins on known templates using distance geometry.

PubMed

Srinivasan, S; March, C J; Sudarsanam, S

1993-02-01

We present an automated method incorporated into a software package, FOLDER, to fold a protein sequence on a given three-dimensional (3D) template. Starting with the sequence alignment of a family of homologous proteins, tertiary structures are modeled using the known 3D structure of one member of the family as a template. Homologous interatomic distances from the template are used as constraints. For nonhomologous regions in the model protein, the lower and the upper bounds for the interatomic distances are imposed by steric constraints and the globular dimensions of the template, respectively. Distance geometry is used to embed an ensemble of structures consistent with these distance bounds. Structures are selected from this ensemble based on minimal distance error criteria, after a penalty function optimization step. These structures are then refined using energy optimization methods. The method is tested by simulating the alpha-chain of horse hemoglobin using the alpha-chain of human hemoglobin as the template and by comparing the generated models with the crystal structure of the alpha-chain of horse hemoglobin. We also test the packing efficiency of this method by reconstructing the atomic positions of the interior side chains beyond C beta atoms of a protein domain from a known 3D structure. In both test cases, models retain the template constraints and any additionally imposed constraints while the packing of the interior residues is optimized with no short contacts or bond deformations. To demonstrate the use of this method in simulating structures of proteins with nonhomologous disulfides, we construct a model of murine interleukin (IL)-4 using the NMR structure of human IL-4 as the template. The resulting geometry of the nonhomologous disulfide in the model structure for murine IL-4 is consistent with standard disulfide geometry.
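Distance geometry works from lower and upper bounds on interatomic distances. A common preprocessing step in such methods, shown here only as an illustration and not confirmed by the abstract to be part of FOLDER, is triangle-inequality smoothing of the upper bounds: no upper bound u_ij may exceed u_ik + u_kj for any third atom k.

```python
def smooth_upper_bounds(upper):
    """Tighten upper distance bounds with the triangle inequality.

    upper: square symmetric matrix (list of lists) of upper bounds, zero diagonal.
    Returns a new matrix; the input is left unchanged.
    """
    n = len(upper)
    bounds = [row[:] for row in upper]
    # Floyd-Warshall style relaxation over every intermediate atom k.
    for k in range(n):
        for i in range(n):
            for j in range(n):
                via_k = bounds[i][k] + bounds[k][j]
                if via_k < bounds[i][j]:
                    bounds[i][j] = via_k
    return bounds
```

For three atoms with bounds of 1 Å between neighbours and a loose 5 Å bound between the ends, smoothing tightens the end-to-end bound to 2 Å, which narrows the space that the embedding step has to sample.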
Structural constraints on the three-dimensional geometry of simple viruses: case studies of a new predictive tool

PubMed Central

Keef, Thomas; Wardman, Jessica P.; Ranson, Neil A.; Stockley, Peter G.; Twarock, Reidun

2013-01-01

Understanding the fundamental principles of virus architecture is one of the most important challenges in biology and medicine. Crick and Watson were the first to propose that viruses exhibit symmetry in the organization of their protein containers for reasons of genetic economy. Based on this, Caspar and Klug introduced quasi-equivalence theory to predict the relative locations of the coat proteins within these containers and classified virus structure in terms of T-numbers. Here it is shown that quasi-equivalence is part of a wider set of structural constraints on virus structure. These constraints can be formulated using an extension of the underlying symmetry group and this is demonstrated with a number of case studies. This new concept in virus biology provides for the first time predictive information on the structural constraints on coat protein and genome topography, and reveals a previously unrecognized structural interdependence of the shapes and sizes of different viral components. It opens up the possibility of distinguishing the structures of different viruses with the same T-number, suggesting a refined viral structure classification scheme. It can moreover be used as a basis for models of virus function, e.g. to characterize the start and end configurations of a structural transition important for infection. PMID:23403965
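The T-numbers of Caspar and Klug mentioned in this record follow the rule T = h² + hk + k² for non-negative integers h and k, with an icosahedral capsid of triangulation number T built from 60T coat protein subunits. The helper names in this short sketch are illustrative.

```python
import math

def t_number(h, k):
    """Caspar-Klug triangulation number for lattice indices h and k."""
    return h * h + h * k + k * k

def allowed_t_numbers(limit):
    """All distinct T-numbers up to and including limit, in ascending order."""
    values = set()
    bound = math.isqrt(limit) + 1
    for h in range(bound):
        for k in range(bound):
            t = t_number(h, k)
            if 0 < t <= limit:
                values.add(t)
    return sorted(values)

def coat_protein_count(t):
    """Number of coat protein subunits in a quasi-equivalent capsid."""
    return 60 * t
```

The enumeration makes the gaps in the series visible (there is no T = 2, 5 or 6, for instance), and it also shows why T alone cannot distinguish every architecture, which is the limitation the extended symmetry approach in this record addresses.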

Weak data do not make a free lunch, only a cheap meal

DOE Office of Scientific and Technical Information (OSTI.GOV)

Luo, Zhipu; Rajashankar, Kanagalaghatta; Dauter, Zbigniew

2014-01-17

Four data sets were processed at resolutions significantly exceeding the criteria traditionally used for estimating the diffraction data resolution limit. The analysis of these data and the corresponding model-quality indicators suggests that the criteria of resolution limits widely adopted in the past may be somewhat conservative. Various parameters, such as Rmerge and I/σ(I), optical resolution and the correlation coefficients CC1/2 and CC*, can be used for judging the internal data quality, whereas the reliability factors R and Rfree as well as the maximum-likelihood target values and real-space map correlation coefficients can be used to estimate the agreement between the data and the refined model. However, none of these criteria provide a reliable estimate of the data resolution cutoff limit. The analysis suggests that extension of the maximum resolution by about 0.2 Å beyond the currently adopted limit where the I/σ(I) value drops to 2.0 does not degrade the quality of the refined structural models, but may sometimes be advantageous. Such an extension may be particularly beneficial for significantly anisotropic diffraction. Extension of the maximum resolution at the stage of data collection and structure refinement is cheap in terms of the required effort and is definitely more advisable than accepting a too conservative resolution cutoff, which is unfortunately quite frequent among the crystal structures deposited in the Protein Data Bank.
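Of the data-quality indicators listed in this record, CC* is derived directly from CC1/2 through the relation CC* = sqrt(2·CC1/2 / (1 + CC1/2)), which estimates the correlation of the merged data with the underlying true signal. The sketch below applies that relation; restricting the input to the range 0 to 1 is a simplification made here.

```python
import math

def cc_star(cc_half):
    """Estimate CC* from the half-data-set correlation coefficient CC1/2."""
    if not 0.0 <= cc_half <= 1.0:
        raise ValueError("cc_half is expected in [0, 1] for this sketch")
    return math.sqrt(2.0 * cc_half / (1.0 + cc_half))
```

Because CC* rises steeply for small CC1/2 (a CC1/2 of 0.5 already corresponds to a CC* of about 0.82), weak outer shells can still carry usable signal, which is consistent with the record's argument against overly conservative cutoffs.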
Evolution of intrinsic disorder in eukaryotic proteins.

PubMed

Ahrens, Joseph B; Nunez-Castilla, Janelle; Siltberg-Liberles, Jessica

2017-09-01

Conformational flexibility conferred through regions of intrinsic structural disorder allows proteins to behave as dynamic molecules. While it is well-known that intrinsically disordered regions can undergo disorder-to-order transitions in real-time as part of their function, we also are beginning to learn more about the dynamics of disorder-to-order transitions along evolutionary time-scales. Intrinsically disordered regions endow proteins with functional promiscuity, which is further enhanced by the ability of some of these regions to undergo real-time disorder-to-order transitions. Disorder content affects gene retention after whole genome duplication, but it is not necessarily conserved. Altered patterns of disorder resulting from evolutionary disorder-to-order transitions indicate that disorder evolves to modify function through refining stability, regulation, and interactions. Here, we review the evolution of intrinsically disordered regions in eukaryotic proteins. We discuss the interplay between secondary structure and disorder on evolutionary time-scales, the importance of disorder for eukaryotic proteome expansion and functional divergence, and the evolutionary dynamics of disorder.
Insights into channel dysfunction from modelling and molecular dynamics simulations.

PubMed

Musgaard, Maria; Paramo, Teresa; Domicevica, Laura; Andersen, Ole Juul; Biggin, Philip C

2018-04-01

Developments in structural biology mean that the number of different ion channel structures has increased significantly in recent years. Structures of ion channels enable us to rationalize how mutations may lead to channelopathies. However, determining the structures of ion channels is still not trivial, especially as they necessarily exist in many distinct functional states. Therefore, the use of computational modelling can provide complementary information that can refine working hypotheses of both wild type and mutant ion channels. The simplest but still powerful tool is homology modelling. Many structures are available now that can provide suitable templates for many different types of ion channels, allowing a full three-dimensional interpretation of mutational effects. These structural models, and indeed the structures themselves obtained by X-ray crystallography, and more recently cryo-electron microscopy, can be subjected to molecular dynamics simulations, either as a tool to help explore the conformational dynamics in detail or simply as a means to refine the models further. Here we review how these approaches have been used to improve our understanding of how diseases might be linked to specific mutations in ion channel proteins. This article is part of the Special Issue entitled 'Channelopathies.' Copyright © 2017 The Authors. Published by Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Lammert, Heiko; Noel, Jeffrey K.; Haglund, Ellinor

The diversity in a set of protein nuclear magnetic resonance (NMR) structures provides an estimate of native state fluctuations that can be used to refine and enrich structure-based protein models (SBMs). Dynamics are an essential part of a protein’s functional native state. The dynamics in the native state are controlled by the same funneled energy landscape that guides the entire folding process. SBMs apply the principle of minimal frustration, drawn from energy landscape theory, to construct a funneled folding landscape for a given protein using only information from the native structure. On an energy landscape smoothed by evolution towards minimal frustration, geometrical constraints, imposed by the native structure, control the folding mechanism and shape the native dynamics revealed by the model. Native-state fluctuations can alternatively be estimated directly from the diversity in the set of NMR structures for a protein. Based on this information, we identify a highly flexible loop in the ribosomal protein S6 and modify the contact map in a SBM to accommodate the inferred dynamics. By taking into account the probable native state dynamics, the experimental transition state is recovered in the model, and the correct order of folding events is restored. Our study highlights how the shared energy landscape connects folding and function by showing that a better description of the native basin improves the prediction of the folding mechanism.
Predicting protein structures with a multiplayer online game.

PubMed

Cooper, Seth; Khatib, Firas; Treuille, Adrien; Barbero, Janos; Lee, Jeehyung; Beenen, Michael; Leaver-Fay, Andrew; Baker, David; Popović, Zoran; Players, Foldit

2010-08-05

People exert large amounts of problem-solving effort playing computer games. Simple image- and text-recognition tasks have been successfully 'crowd-sourced' through games, but it is not clear if more complex scientific problems can be solved with human-directed computing. Protein structure prediction is one such problem: locating the biologically relevant native conformation of a protein is a formidable computational challenge given the very large size of the search space. Here we describe Foldit, a multiplayer online game that engages non-scientists in solving hard prediction problems. Foldit players interact with protein structures using direct manipulation tools and user-friendly versions of algorithms from the Rosetta structure prediction methodology, while they compete and collaborate to optimize the computed energy. We show that top-ranked Foldit players excel at solving challenging structure refinement problems in which substantial backbone rearrangements are necessary to achieve the burial of hydrophobic residues. Players working collaboratively develop a rich assortment of new strategies and algorithms; unlike computational approaches, they explore not only the conformational space but also the space of possible search strategies. The integration of human visual problem-solving and strategy development capabilities with traditional computational algorithms through interactive multiplayer games is a powerful new approach to solving computationally-limited scientific problems.
AFAL: a web service for profiling amino acids surrounding ligands in proteins

NASA Astrophysics Data System (ADS)

Arenas-Salinas, Mauricio; Ortega-Salazar, Samuel; Gonzales-Nilo, Fernando; Pohl, Ehmke; Holmes, David S.; Quatrini, Raquel

2014-11-01

With advancements in crystallographic technology and the increasing wealth of information populating structural databases, there is an increasing need for prediction tools based on spatial information that will support the characterization of proteins and protein-ligand interactions. Herein, a new web service termed amino acid frequency around ligand (AFAL) is presented for determining the types and frequencies of amino acids surrounding ligands within proteins deposited in the Protein Data Bank and for assessing the atoms and atom-ligand distances involved in each interaction (availability: http://structuralbio.utalca.cl/AFAL/index.html). AFAL allows the user to define a wide variety of filtering criteria (protein family, source organism, resolution, sequence redundancy and distance) in order to uncover trends and evolutionary differences in amino acid preferences that define interactions with particular ligands. Results obtained from AFAL provide valuable statistical information about amino acids that may be responsible for establishing particular ligand-protein interactions. The analysis will enable investigators to compare ligand-binding sites of different proteins and to uncover general as well as specific interaction patterns from existing data. Such patterns can be used subsequently to predict ligand binding in proteins that currently have no structural information and to refine the interpretation of existing protein models. The application of AFAL is illustrated by the analysis of proteins interacting with adenosine-5'-triphosphate.
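The kind of profile this service reports can be approximated locally with a simple distance filter. The sketch below is not AFAL's implementation: it assumes atoms have already been parsed into tuples, counts each residue once if any of its atoms lies within the cutoff of any ligand atom, and leaves the cutoff to the caller.

```python
import math
from collections import Counter

def residue_frequencies_near_ligand(protein_atoms, ligand_coords, cutoff):
    """Count residue types with at least one atom within cutoff (Å) of the ligand.

    protein_atoms: iterable of (chain, residue_number, residue_name, (x, y, z)).
    ligand_coords: list of (x, y, z) tuples for the ligand atoms.
    """
    nearby = {}
    for chain, residue_number, residue_name, xyz in protein_atoms:
        if any(math.dist(xyz, atom) <= cutoff for atom in ligand_coords):
            # Key on chain and residue number so each residue is counted once.
            nearby[(chain, residue_number)] = residue_name
    return Counter(nearby.values())
```

Summing such counters over many structures that bind the same ligand gives the frequency table from which residue preferences around that ligand can be read.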
Knowledge-based fragment binding prediction.

PubMed

Tang, Grace W; Altman, Russ B

2014-04-01

Target-based drug discovery must assess many drug-like compounds for potential activity. Focusing on low-molecular-weight compounds (fragments) can dramatically reduce the chemical search space. However, approaches for determining protein-fragment interactions have limitations. Experimental assays are time-consuming, expensive, and not always applicable. At the same time, computational approaches using physics-based methods have limited accuracy. With increasing high-resolution structural data for protein-ligand complexes, there is now an opportunity for data-driven approaches to fragment binding prediction. We present FragFEATURE, a machine learning approach to predict small molecule fragments preferred by a target protein structure. We first create a knowledge base of protein structural environments annotated with the small molecule substructures they bind. These substructures have low-molecular weight and serve as a proxy for fragments. FragFEATURE then compares the structural environments within a target protein to those in the knowledge base to retrieve statistically preferred fragments. It merges information across diverse ligands with shared substructures to generate predictions. Our results demonstrate FragFEATURE's ability to rediscover fragments corresponding to the ligand bound with 74% precision and 82% recall on average. For many protein targets, it identifies high scoring fragments that are substructures of known inhibitors. FragFEATURE thus predicts fragments that can serve as inputs to fragment-based drug design or serve as refinement criteria for creating target-specific compound libraries for experimental or computational screening.
Knowledge-based Fragment Binding Prediction

PubMed Central

Tang, Grace W.; Altman, Russ B.

2014-01-01

Target-based drug discovery must assess many drug-like compounds for potential activity. Focusing on low-molecular-weight compounds (fragments) can dramatically reduce the chemical search space. However, approaches for determining protein-fragment interactions have limitations. Experimental assays are time-consuming, expensive, and not always applicable. At the same time, computational approaches using physics-based methods have limited accuracy. With increasing high-resolution structural data for protein-ligand complexes, there is now an opportunity for data-driven approaches to fragment binding prediction. We present FragFEATURE, a machine learning approach to predict small molecule fragments preferred by a target protein structure. We first create a knowledge base of protein structural environments annotated with the small molecule substructures they bind. These substructures have low-molecular weight and serve as a proxy for fragments. FragFEATURE then compares the structural environments within a target protein to those in the knowledge base to retrieve statistically preferred fragments. It merges information across diverse ligands with shared substructures to generate predictions. Our results demonstrate FragFEATURE's ability to rediscover fragments corresponding to the ligand bound with 74% precision and 82% recall on average. For many protein targets, it identifies high scoring fragments that are substructures of known inhibitors. FragFEATURE thus predicts fragments that can serve as inputs to fragment-based drug design or serve as refinement criteria for creating target-specific compound libraries for experimental or computational screening. PMID:24762971
Solidification Based Grain Refinement in Steels

DTIC Science & Technology

2009-07-24

pearlite (See Figure 1). No evidence of the as-cast austenite dendrite structure was observed. The gating system for this sample resides at the thermal...possible nucleating compounds. 3) Extend grain refinement theory and solidification knowledge through experimental data. 4) Determine structure ...refine the structure of a casting through heat treatment. The energy required for grain refining via thermomechanical processes or heat treatment
Rapid Design of Knowledge-Based Scoring Potentials for Enrichment of Near-Native Geometries in Protein-Protein Docking.

PubMed

Sasse, Alexander; de Vries, Sjoerd J; Schindler, Christina E M; de Beauchêne, Isaure Chauvot; Zacharias, Martin

2017-01-01

Protein-protein docking protocols aim to predict the structures of protein-protein complexes based on the structure of individual partners. Docking protocols usually include several steps of sampling, clustering, refinement and re-scoring. The scoring step is one of the bottlenecks in the performance of many state-of-the-art protocols. The performance of scoring functions depends on the quality of the generated structures and its coupling to the sampling algorithm. A tool kit, GRADSCOPT (GRid Accelerated Directly SCoring OPTimizing), was designed to allow rapid development and optimization of different knowledge-based scoring potentials for specific objectives in protein-protein docking. Different atomistic and coarse-grained potentials can be created by a grid-accelerated directly scoring dependent Monte-Carlo annealing or by a linear regression optimization. We demonstrate that the scoring functions generated by our approach are similar to or even outperform state-of-the-art scoring functions for predicting near-native solutions. Of additional importance, we find that potentials specifically trained to identify the native bound complex perform rather poorly on identifying acceptable or medium quality (near-native) solutions. In contrast, atomistic long-range contact potentials can increase the average fraction of near-native poses by up to a factor 2.5 in the best scored 1% decoys (compared to existing scoring), emphasizing the need of specific docking potentials for different steps in the docking protocol.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Zask, Arie; Verheijen, Jeroen C.; Curran, Kevin

The mammalian target of rapamycin (mTOR), a central regulator of growth, survival, and metabolism, is a validated target for cancer therapy. Rapamycin and its analogues, allosteric inhibitors of mTOR, only partially inhibit one mTOR protein complex. ATP-competitive, global inhibitors of mTOR that have the potential for enhanced anticancer efficacy are described. Structural features leading to potency and selectivity were identified and refined leading to compounds with in vivo efficacy in tumor xenograft models.
Sequential Release of Proteins from Structured Multishell Microcapsules.

PubMed

Shimanovich, Ulyana; Michaels, Thomas C T; De Genst, Erwin; Matak-Vinkovic, Dijana; Dobson, Christopher M; Knowles, Tuomas P J

2017-10-09

In nature, a wide range of functional materials is based on proteins. Increasing attention is also turning to the use of proteins as artificial biomaterials in the form of films, gels, particles, and fibrils that offer great potential for applications in areas ranging from molecular medicine to materials science. To date, however, most such applications have been limited to single component materials despite the fact that their natural analogues are composed of multiple types of proteins with a variety of functionalities that are coassembled in a highly organized manner on the micrometer scale, a process that is currently challenging to achieve in the laboratory. Here, we demonstrate the fabrication of multicomponent protein microcapsules where the different components are positioned in a controlled manner. We use molecular self-assembly to generate multicomponent structures on the nanometer scale and droplet microfluidics to bring together the different components on the micrometer scale. Using this approach, we synthesize a wide range of multiprotein microcapsules containing three well-characterized proteins: glucagon, insulin, and lysozyme. The localization of each protein component in multishell microcapsules has been detected by labeling protein molecules with different fluorophores, and the final three-dimensional microcapsule structure has been resolved by using confocal microscopy together with image analysis techniques. In addition, we show that these structures can be used to tailor the release of such functional proteins in a sequential manner. Moreover, our observations demonstrate that the protein release mechanism from multishell capsules is driven by the kinetic control of mass transport of the cargo and by the dissolution of the shells. 
The ability to generate artificial materials that incorporate a variety of different proteins with distinct functionalities increases the breadth of the potential applications of artificial protein-based materials and provides opportunities to design more refined functional protein delivery systems.
Factors affecting the use of 13Cα chemical shifts to determine, refine, and validate protein structures

PubMed Central

Vila, Jorge A.; Scheraga, Harold A.

2008-01-01

Interest centers here on the analysis of two different, but related, phenomena that affect side-chain conformations and consequently 13Cα chemical shifts and their applications to determine, refine, and validate protein structures. The first is whether 13Cα chemical shifts computed at the DFT level of approximation with charged residues are a better approximation of observed 13Cα chemical shifts than those computed with neutral residues for proteins in solution. Accurate computation of 13Cα chemical shifts requires a proper representation of the charges, which might not take on integral values. For this analysis, the charges for 139 conformations of the protein ubiquitin were determined by explicit consideration of protein binding equilibria, at a given pH, that is, by exploring the 2^ξ possible ionization states of the whole molecule, with ξ being the number of ionizable groups. The results of this analysis, as revealed by the shielding/deshielding of the 13Cα nucleus, indicated that: (i) there is a significant difference in the computed 13Cα chemical shifts, between basic and acidic groups, as a function of the degree of charge of the side chain; (ii) this difference is attributed to the distance between the ionizable groups and the 13Cα nucleus, which is shorter for the acidic Asp and Glu groups as compared with that for the basic Lys and Arg groups; and (iii) the use of neutral, rather than charged, basic and acidic groups is a better approximation of the observed 13Cα chemical shifts of a protein in solution. The second is how side-chain flexibility influences computed 13Cα chemical shifts in an additional set of ubiquitin conformations, in which the side chains are generated from an NMR-derived structure with the backbone conformation assumed to be fixed. 
The 13Cα chemical shift of a given amino acid residue in a protein is determined, mainly, by its own backbone and side-chain torsional angles, independent of the neighboring residues; the conformation of a given residue itself, however, depends on the environment of this residue and, hence, on the whole protein structure. As a consequence, this analysis reveals the role and impact of an accurate side-chain computation in the determination and refinement of protein conformation. The results of this analysis are: (i) a lower error between computed and observed 13Cα chemical shifts (by up to 3.7 ppm) was found for ~68% and ~63% of all ionizable residues and all non-Ala/Pro/Gly residues, respectively, in the additional set of conformations, compared with results for the model from which the set was derived; and (ii) all the additional conformations exhibit a lower root-mean-square deviation (1.97 ppm ≤ rmsd ≤ 2.13 ppm), between computed and observed 13Cα chemical shifts, than the rmsd (2.32 ppm) computed for the starting conformation from which this additional set was derived. As a validation test, an analysis of the additional set of ubiquitin conformations, comparing computed and observed values of both 13Cα chemical shifts and χ1 torsional angles (given by the vicinal coupling constants 3JN–Cγ and 3JC′–Cγ), is discussed. PMID:17975838
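The two quantities named in the record above, the 2^ξ ionization states of a molecule with ξ ionizable groups and the rmsd between computed and observed 13Cα chemical shifts, can be illustrated with a minimal Python sketch. The function names `ionization_states` and `shift_rmsd` are hypothetical and do not come from the paper; the sketch only enumerates charged/neutral patterns and computes a plain root-mean-square deviation, without any of the DFT or pH-dependent weighting the authors describe.

```python
from itertools import product
from math import sqrt


def ionization_states(n_groups):
    """Enumerate all 2**n_groups charged/neutral patterns.

    Each state is a tuple with one entry per ionizable group:
    0 means neutral, 1 means charged (an illustrative encoding).
    """
    return product((0, 1), repeat=n_groups)


def shift_rmsd(computed, observed):
    """Root-mean-square deviation (ppm) between two equal-length shift lists."""
    if len(computed) != len(observed) or not computed:
        raise ValueError("need two non-empty lists of equal length")
    squared = sum((c - o) ** 2 for c, o in zip(computed, observed))
    return sqrt(squared / len(computed))
```

For example, `len(list(ionization_states(3)))` gives 8 states. The count doubles with every additional group, so exhaustive enumeration is only practical for a small number of groups.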
CSI 3.0: a web server for identifying secondary and super-secondary structure in proteins using NMR chemical shifts.

PubMed

Hafsa, Noor E; Arndt, David; Wishart, David S

2015-07-01

The Chemical Shift Index or CSI 3.0 (http://csi3.wishartlab.com) is a web server designed to accurately identify the location of secondary and super-secondary structures in protein chains using only nuclear magnetic resonance (NMR) backbone chemical shifts and their corresponding protein sequence data. Unlike earlier versions of CSI, which only identified three types of secondary structure (helix, β-strand and coil), CSI 3.0 now identifies a total of 11 types of secondary and super-secondary structures, including helices, β-strands, coil regions, five common β-turns (type I, II, I', II' and VIII), β hairpins as well as interior and edge β-strands. CSI 3.0 accepts experimental NMR chemical shift data in multiple formats (NMR Star 2.1, NMR Star 3.1 and SHIFTY) and generates colorful CSI plots (bar graphs) and secondary/super-secondary structure assignments. The output can be readily used as constraints for structure determination and refinement or the images may be used for presentations and publications. CSI 3.0 uses a pipeline of several well-tested, previously published programs to identify the secondary and super-secondary structures in protein chains. Comparisons with secondary and super-secondary structure assignments made via standard coordinate analysis programs such as DSSP, STRIDE and VADAR on high-resolution protein structures solved by X-ray and NMR show >90% agreement with those made with CSI 3.0. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Structural insight into arginine methylation by the mouse protein arginine methyltransferase 7: a zinc finger freezes the mimic of the dimeric state into a single active site.

PubMed

Cura, Vincent; Troffer-Charlier, Nathalie; Wurtz, Jean Marie; Bonnefond, Luc; Cavarelli, Jean

2014-09-01

Protein arginine methyltransferase 7 (PRMT7) is a type III arginine methyltransferase which has been implicated in several biological processes such as transcriptional regulation, DNA damage repair, RNA splicing, cell differentiation and metastasis. PRMT7 is a unique but less characterized member of the family of PRMTs. The crystal structure of full-length PRMT7 from Mus musculus refined at 1.7 Å resolution is described. The PRMT7 structure is composed of two catalytic modules in tandem forming a pseudo-dimer and contains only one AdoHcy molecule bound to the N-terminal module. The high-resolution crystal structure presented here revealed several structural features showing that the second active site is frozen in an inactive state by a conserved zinc finger located at the junction between the two PRMT modules and by the collapse of two degenerated AdoMet-binding loops.
Crystallization and preliminary X-ray characterization of the genetically encoded fluorescent calcium indicator protein GCaMP2

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rodríguez Guilbe, María M.; Protein Research and Development Center, University of Puerto Rico; Alfaro Malavé, Elisa C.

The genetically encoded fluorescent calcium-indicator protein GCaMP2 was crystallized in the calcium-saturated form. X-ray diffraction data were collected to 2.0 Å resolution and the structure was solved by molecular replacement. Fluorescent proteins and their engineered variants have played an important role in the study of biology. The genetically encoded calcium-indicator protein GCaMP2 comprises a circularly permuted fluorescent protein coupled to the calcium-binding protein calmodulin and a calmodulin target peptide, M13, derived from the intracellular calmodulin target myosin light-chain kinase and has been used to image calcium transients in vivo. To aid rational efforts to engineer improved variants of GCaMP2, this protein was crystallized in the calcium-saturated form. X-ray diffraction data were collected to 2.0 Å resolution. The crystals belong to space group C2, with unit-cell parameters a = 126.1, b = 47.1, c = 68.8 Å, β = 100.5° and one GCaMP2 molecule in the asymmetric unit. The structure was phased by molecular replacement and refinement is currently under way.
FoldMiner and LOCK 2: protein structure comparison and motif discovery on the web.

PubMed

Shapiro, Jessica; Brutlag, Douglas

2004-07-01

The FoldMiner web server (http://foldminer.stanford.edu/) provides remote access to methods for protein structure alignment and unsupervised motif discovery. FoldMiner is unique among such algorithms in that it improves both the motif definition and the sensitivity of a structural similarity search by combining the search and motif discovery methods and using information from each process to enhance the other. In a typical run, a query structure is aligned to all structures in one of several databases of single domain targets in order to identify its structural neighbors and to discover a motif that is the basis for the similarity among the query and statistically significant targets. This process is fully automated, but options for manual refinement of the results are available as well. The server uses the Chime plugin and customized controls to allow for visualization of the motif and of structural superpositions. In addition, we provide an interface to the LOCK 2 algorithm for rapid alignments of a query structure to smaller numbers of user-specified targets.
Computational study of aggregation mechanism in human lysozyme [D67H]

PubMed Central

Patel, Dharmeshkumar

2017-01-01

Aggregation of proteins is an undesired phenomenon that affects both human health and bioengineered products such as therapeutic proteins. Finding preventative measures could be facilitated by a molecular-level understanding of dimer formation, which is the first step in aggregation. Here we present a molecular dynamics (MD) study of dimer formation propensity in human lysozyme and its D67H variant. Because the latter protein aggregates while the former does not, they offer an ideal system for testing the feasibility of the proposed MD approach, which comprises three stages: i) partially unfolded conformers involved in dimer formation are generated via high-temperature MD simulations, ii) potential dimer structures are searched using docking and refined with MD, iii) free energy calculations are performed to find the most stable dimer structure. Our results provide a detailed explanation for how a single mutation (D67H) turns human lysozyme from a non-aggregating to an aggregating protein. Conversely, the proposed method can be used to identify the residues causing aggregation in a protein, which can be mutated to prevent it. PMID:28467454
Combining Functional and Structural Genomics to Sample the Essential Burkholderia Structome

PubMed Central

Baugh, Loren; Gallagher, Larry A.; Patrapuvich, Rapatbhorn; Clifton, Matthew C.; Gardberg, Anna S.; Edwards, Thomas E.; Armour, Brianna; Begley, Darren W.; Dieterich, Shellie H.; Dranow, David M.; Abendroth, Jan; Fairman, James W.; Fox, David; Staker, Bart L.; Phan, Isabelle; Gillespie, Angela; Choi, Ryan; Nakazawa-Hewitt, Steve; Nguyen, Mary Trang; Napuli, Alberto; Barrett, Lynn; Buchko, Garry W.; Stacy, Robin; Myler, Peter J.; Stewart, Lance J.; Manoil, Colin; Van Voorhis, Wesley C.

2013-01-01

Background: The genus Burkholderia includes pathogenic gram-negative bacteria that cause melioidosis, glanders, and pulmonary infections of patients with cancer and cystic fibrosis. Drug resistance has made development of new antimicrobials critical. Many approaches to discovering new antimicrobials, such as structure-based drug design and whole cell phenotypic screens followed by lead refinement, require high-resolution structures of proteins essential to the parasite. Methodology/Principal Findings: We experimentally identified 406 putative essential genes in B. thailandensis, a low-virulence species phylogenetically similar to B. pseudomallei, the causative agent of melioidosis, using saturation-level transposon mutagenesis and next-generation sequencing (Tn-seq). We selected 315 protein products of these genes based on structure-determination criteria, such as excluding very large and/or integral membrane proteins, and entered them into the Seattle Structural Genomics Center for Infectious Disease (SSGCID) structure determination pipeline. To maximize structural coverage of these targets, we applied an “ortholog rescue” strategy for those producing insoluble or difficult to crystallize proteins, resulting in the addition of 387 orthologs (or paralogs) from seven other Burkholderia species into the SSGCID pipeline. This structural genomics approach yielded structures from 31 putative essential targets from B. thailandensis, and 25 orthologs from other Burkholderia species, yielding an overall structural coverage for 49 of the 406 essential gene families, with a total of 88 depositions into the Protein Data Bank. Of these, 25 proteins have properties of a potential antimicrobial drug target, i.e., no close human homolog, part of an essential metabolic pathway, and a deep binding pocket. We describe the structures of several potential drug targets in detail. 
Conclusions/Significance: This collection of structures, solubility and experimental essentiality data provides a resource for development of drugs against infections and diseases caused by Burkholderia. All expression clones and proteins created in this study are freely available by request. PMID:23382856

Three dimensional electron microscopy and in silico tools for macromolecular structure determination

PubMed Central

Borkotoky, Subhomoi; Meena, Chetan Kumar; Khan, Mohammad Wahab; Murali, Ayaluru

2013-01-01

Recently, structural biology has gained a major tool, electron microscopy, for solving the structures of macromolecules, in addition to the conventional techniques of X-ray crystallography and nuclear magnetic resonance (NMR). Three-dimensional transmission electron microscopy (3DTEM) is one of the most sophisticated techniques for structure determination of molecular machines. Known to give the three-dimensional structure of a macromolecule in its native form with virtually no upper limit on its size, this tool does not require crystallization of the protein. By combining 3DTEM data with in silico tools, one can obtain a better-refined structure of a desired complex. In this review we discuss recent advances in three-dimensional electron microscopy and the tools associated with it. PMID:27092033
High quality NMR structures: a new force field with implicit water and membrane solvation for Xplor-NIH.

PubMed

Tian, Ye; Schwieters, Charles D; Opella, Stanley J; Marassi, Francesca M

2017-01-01

Structure determination of proteins by NMR is unique in its ability to measure restraints, very accurately, in environments and under conditions that closely mimic those encountered in vivo. For example, advances in solid-state NMR methods enable structure determination of membrane proteins in detergent-free lipid bilayers, and of large soluble proteins prepared by sedimentation, while parallel advances in solution NMR methods and optimization of detergent-free lipid nanodiscs are rapidly pushing the envelope of the size limit for both soluble and membrane proteins. These experimental advantages, however, are partially squandered during structure calculation, because the commonly used force fields are purely repulsive and neglect solvation, Van der Waals forces and electrostatic energy. Here we describe a new force field, and updated energy functions, for protein structure calculations with EEFx implicit solvation, electrostatics, and Van der Waals Lennard-Jones forces, in the widely used program Xplor-NIH. The new force field is based primarily on CHARMM22, facilitating calculations with a wider range of biomolecules. The new EEFx energy function has been rewritten to enable OpenMP parallelism, and optimized to enhance computation efficiency. It implements solvation, electrostatics, and Van der Waals energy terms together, thus ensuring more consistent and efficient computation of the complete nonbonded energy lists. Updates in the related python module allow detailed analysis of the interaction energies and associated parameters. The new force field and energy function work with both soluble proteins and membrane proteins, including those with cofactors or engineered tags, and are very effective in situations where there are sparse experimental restraints. 
Results obtained for NMR-restrained calculations with a set of five soluble proteins and five membrane proteins show that structures calculated with EEFx have significant improvements in accuracy, precision, and conformation, and that structure refinement can be obtained by short relaxation with EEFx to obtain improvements in these key metrics. These developments broaden the range of biomolecular structures that can be calculated with high fidelity from NMR restraints.
Schistosoma mansoni venom allergen-like protein 4 (SmVAL4) is a novel lipid-binding SCP/TAPS protein that lacks the prototypical CAP motifs

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kelleher, Alan; Darwiche, Rabih; Rezende, Wanderson C.

2014-08-01

The first structure of an S. mansoni venom allergen-like protein is presented. Schistosomiasis is a parasitic disease that affects over 200 million people. Vaccine candidates have been identified, including Schistosoma mansoni venom allergen-like proteins (SmVALs) from the SCP/TAPS (sperm-coating protein/Tpx/antigen 5/pathogenesis related-1/Sc7) superfamily. The first SmVAL structure, SmVAL4, was refined to a resolution limit of 2.16 Å. SmVAL4 has a unique structure that could not be predicted from homologous structures, with longer loops and an unusual C-terminal extension. SmVAL4 has the characteristic α/β-sandwich and central SCP/TAPS cavity. Furthermore, SmVAL4 has only one of the signature CAP cavity tetrad amino-acid residues and is missing the histidines that coordinate divalent cations such as Zn²⁺ in other SCP/TAPS proteins. SmVAL4 has a cavity between α-helices 1 and 4 that was observed to bind lipids in tablysin-15, suggesting the ability to bind lipids. Subsequently, SmVAL4 was shown to bind cholesterol in vitro. Additionally, SmVAL4 was shown to complement the in vivo sterol-export phenotype of yeast mutants lacking their endogenous CAP proteins. Expression of SmVAL4 in yeast cells lacking endogenous CAP function restores the block in sterol export. These studies suggest an evolutionarily conserved lipid-binding function shared by CAP proteins such as SmVAL4 and yeast CAP proteins such as Pry1.
GPCRdb: an information system for G protein-coupled receptors

PubMed Central

Isberg, Vignir; Mordalski, Stefan; Munk, Christian; Rataj, Krzysztof; Harpsøe, Kasper; Hauser, Alexander S.; Vroling, Bas; Bojarski, Andrzej J.; Vriend, Gert; Gloriam, David E.

2016-01-01

Recent developments in G protein-coupled receptor (GPCR) structural biology and pharmacology have greatly enhanced our knowledge of receptor structure-function relations, and have helped improve the scientific foundation for drug design studies. The GPCR database, GPCRdb, serves a dual role in disseminating and enabling new scientific developments by providing reference data, analysis tools and interactive diagrams. This paper highlights new features in the fifth major GPCRdb release: (i) GPCR crystal structure browsing, superposition and display of ligand interactions; (ii) direct deposition by users of point mutations and their effects on ligand binding; (iii) refined snake and helix box residue diagram looks; and (iv) phylogenetic trees with receptor classification colour schemes. Under the hood, the entire GPCRdb front- and back-ends have been re-coded within one infrastructure, ensuring a smooth browsing experience and development. GPCRdb is available at http://www.gpcrdb.org/ and its open source code at https://bitbucket.org/gpcr/protwis. PMID:26582914
Structural Basis for Activation of Fatty Acid-binding Protein 4

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gillilan, R.; Ayers, S.; Noy, N.

2007-01-01

Fatty acid-binding protein 4 (FABP4) delivers ligands from the cytosol to the nuclear receptor PPARγ in the nucleus, thereby enhancing the transcriptional activity of the receptor. Notably, FABP4 binds multiple ligands with a similar affinity but its nuclear translocation is activated only by specific compounds. To gain insight into the structural features that underlie the ligand-specificity in activation of the nuclear import of FABP4, we solved the crystal structures of the protein complexed with two compounds that induce its nuclear translocation, and compared these to the apo-protein and to FABP4 structures bound to non-activating ligands. Examination of these structures indicates that activation coincides with closure of a portal loop phenylalanine side-chain, contraction of the binding pocket, a subtle shift in a helical domain containing the nuclear localization signal of the protein, and a resultant change in oligomeric state that exposes the nuclear localization signal to the solution. Comparisons of backbone displacements induced by activating ligands with a measure of mobility derived from translation, libration, screw (TLS) refinement, and with a composite of slowest normal modes of the apo state suggest that the helical motion associated with the activation of the protein is part of the repertoire of the equilibrium motions of the apo-protein, i.e. that ligand binding does not induce the activated configuration but serves to stabilize it. Nuclear import of FABP4 can thus be understood in terms of the pre-existing equilibrium hypothesis of ligand binding.
REFMAC5 for the refinement of macromolecular crystal structures

DOE Office of Scientific and Technical Information (OSTI.GOV)

Murshudov, Garib N.; Skubák, Pavol; Lebedev, Andrey A.

The general principles behind the macromolecular crystal structure refinement program REFMAC5 are described. This paper describes various components of the macromolecular crystallographic refinement program REFMAC5, which is distributed as part of the CCP4 suite. REFMAC5 utilizes different likelihood functions depending on the diffraction data employed (amplitudes or intensities), the presence of twinning and the availability of SAD/SIRAS experimental diffraction data. To ensure chemical and structural integrity of the refined model, REFMAC5 offers several classes of restraints and choices of model parameterization. Reliable models at resolutions at least as low as 4 Å can be achieved thanks to low-resolution refinement tools such as secondary-structure restraints, restraints to known homologous structures, automatic global and local NCS restraints, ‘jelly-body’ restraints and the use of novel long-range restraints on atomic displacement parameters (ADPs) based on the Kullback–Leibler divergence. REFMAC5 additionally offers TLS parameterization and, when high-resolution data are available, fast refinement of anisotropic ADPs. Refinement in the presence of twinning is performed in a fully automated fashion. REFMAC5 is a flexible and highly optimized refinement package that is ideally suited for refinement across the entire resolution spectrum encountered in macromolecular crystallography.
Amino acid pair- and triplet-wise groupings in the interior of α-helical segments in proteins.

PubMed

de Sousa, Miguel M; Munteanu, Cristian R; Pazos, Alejandro; Fonseca, Nuno A; Camacho, Rui; Magalhães, A L

2011-02-21

A statistical approach has been applied to analyse primary structure patterns at inner positions of α-helices in proteins. A systematic survey was carried out in a recent sample of non-redundant proteins selected from the Protein Data Bank, which were used to analyse α-helix structures for amino acid pairing patterns. Only residues more than three positions apart from both termini of the α-helix were considered as inner. Amino acid pairings i, i+k (k=1, 2, 3, 4, 5), were analysed and the corresponding 20×20 matrices of relative global propensities were constructed. An analysis of (i, i+4, i+8) and (i, i+3, i+4) triplet patterns was also performed. These analyses yielded information on a series of amino acid patterns (pairings and triplets) showing either high or low preference for α-helical motifs and suggested a novel approach to protein alphabet reduction. In addition, it has been shown that the individual amino acid propensities are not enough to define the statistical distribution of these patterns. Global pair propensities also depend on the type of pattern, its composition and orientation in the protein sequence. The data presented should prove useful to obtain and refine predictive rules which can further the development and fine-tuning of protein structure prediction algorithms and tools. Copyright © 2010 Elsevier Ltd. All rights reserved.
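As a rough illustration of the pairing statistic described in this abstract, the sketch below counts i, i+k residue pairs in helix-interior segments and divides each observed pair frequency by the product of the single-residue frequencies. The abstract does not give the exact definition of its relative global propensities, so this normalization, the function name `pair_propensities`, and the toy segments are assumptions made only for illustration.

```python
from collections import Counter


def pair_propensities(segments, k):
    """Observed/expected ratio for residue pairs at positions i and i+k.

    Assumption: the expected pair frequency is the product of the
    single-residue frequencies in the same segments. The published
    method may normalize differently.
    """
    singles = Counter()
    pairs = Counter()
    for seg in segments:
        singles.update(seg)
        for i in range(len(seg) - k):
            pairs[(seg[i], seg[i + k])] += 1

    n_single = sum(singles.values())
    n_pair = sum(pairs.values())
    result = {}
    for (a, b), count in pairs.items():
        expected = (singles[a] / n_single) * (singles[b] / n_single)
        result[(a, b)] = (count / n_pair) / expected
    return result


# Hypothetical helix-interior segments (one-letter codes), not real data.
toy_segments = ["AEAAK", "AEAAR"]
props = pair_propensities(toy_segments, k=1)
```

A ratio above 1 would mark a pair seen more often than the single-residue composition predicts, and a ratio below 1 a pair seen less often. Repeating the call for k = 1 to 5 would give one table per spacing, analogous to the 20×20 matrices mentioned above.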
Structural basis for the mechanism of inhibition of uridine phosphorylase from Salmonella typhimurium

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lashkov, A. A.; Zhukhlistova, N. E.; Sotnichenko, S. E.

2010-01-15

The three-dimensional structures of three complexes of Salmonella typhimurium uridine phosphorylase with the inhibitor 2,2'-anhydrouridine, the substrate PO₄, and with both the inhibitor 2,2'-anhydrouridine and the substrate PO₄ (a binary complex) were studied in detail by X-ray diffraction. The structures of the complexes were refined at 2.38, 1.5, and 1.75 Å resolution, respectively. Changes in the three-dimensional structure of the subunits in different crystal structures are considered depending on the presence or absence of the inhibitor molecule and (or) the phosphate ion in the active site of the enzyme. The presence of the phosphate ion in the phosphate-binding site was found to substantially change the orientations of the side chains of the amino-acid residues Arg30, Arg91, and Arg48 coordinated to this ion. A comparison showed that the highly flexible loop L9 is unstable. The atomic coordinates of the refined structures of the complexes and the corresponding structure factors were deposited in the Protein Data Bank (their PDB ID codes are 3DD0 and 3C74). The experimental data on the spatial reorganization of the active site caused by changes in its functional state from the unligated to the completely inhibited state suggest the structural basis for the mechanism of inhibition of Salmonella typhimurium uridine phosphorylase.
X-ray structure determination at low resolution

DOE Office of Scientific and Technical Information (OSTI.GOV)

Brunger, Axel T., E-mail: brunger@stanford.edu; Department of Molecular and Cellular Physiology, Stanford University; Department of Neurology and Neurological Sciences, Stanford University

2009-02-01

Refinement is meaningful even at 4 Å or lower, but with present methodologies it should start from high-resolution crystal structures whenever possible. As an example of structure determination in the 3.5–4.5 Å resolution range, crystal structures of the ATPase p97/VCP, consisting of an N-terminal domain followed by a tandem pair of ATPase domains (D1 and D2), are discussed. The structures were originally solved by molecular replacement with the high-resolution structure of the N-D1 fragment of p97/VCP, whereas the D2 domain was manually built using its homology to the D1 domain as a guide. The structure of the D2 domain alone was subsequently solved at 3 Å resolution. The refined model of D2 and the high-resolution structure of the N-D1 fragment were then used as starting models for re-refinement against the low-resolution diffraction data for full-length p97. The re-refined full-length models showed significant improvement in both secondary structure and R values. The free R values dropped by as much as 5% compared with the original structure refinements, indicating that refinement is meaningful at low resolution and that there is information in the diffraction data even at ∼4 Å resolution that objectively assesses the quality of the model. It is concluded that de novo model building is problematic at low resolution and refinement should start from high-resolution crystal structures whenever possible.
Refining the treatment of membrane proteins by coarse-grained models.

PubMed

Vorobyov, Igor; Kim, Ilsoo; Chu, Zhen T; Warshel, Arieh

2016-01-01

Obtaining a quantitative description of membrane protein stability is crucial for understanding many biological processes. However, progress in this direction has remained a major challenge for both experimental studies and molecular modeling. One possible direction is the use of coarse-grained models, but such models must be carefully calibrated and validated. Here we use recent progress in benchmark studies on the energetics of amino acid residue and peptide membrane insertion and membrane protein stability to refine our previously developed coarse-grained model (Vicatos et al., Proteins 2014;82:1168). Our refined model parameters were fitted and/or tested to reproduce water/membrane partitioning energetics of amino acid side chains and a couple of model peptides. This new model provides reasonable agreement with experiment for absolute folding free energies of several β-barrel membrane proteins as well as the effects of point mutations on the relative stability of one of those proteins, OmpLA. The consideration and ranking of different rotameric states for a mutated residue was found to be essential to achieve satisfactory agreement with the reference data. © 2015 Wiley Periodicals, Inc.
Investigation of non-corrin cobalt(II)-containing sites in protein structures of the Protein Data Bank.

PubMed

Abriata, Luciano Andres

2013-04-01

Protein X-ray structures with non-corrin cobalt(II)-containing sites, either natural or substituting another native ion, were downloaded from the Protein Data Bank and explored to (i) describe which amino acids are involved in their first ligand shells and (ii) analyze cobalt(II)-donor bond lengths in comparison with previously reported target distances, CSD data and EXAFS data. The set of amino acids involved in Co(II) binding is similar to that observed for catalytic Zn(II) sites, i.e. with a large fraction of carboxylate O atoms from aspartate and glutamate and aromatic N atoms from histidine. The computed Co(II)-donor bond lengths were found to depend strongly on structure resolution, an artifact previously detected for other metal-donor distances. Small corrections are suggested for the target bond lengths to the aromatic N atoms of histidines and the O atoms of water and hydroxide. The available target distance for cysteine (Scys) is confirmed; those for backbone O and other donors remain uncertain and should be handled with caution in refinement and modeling protocols. Finally, a relationship between both Co(II)-O bond lengths in bidentate carboxylates is quantified.
Prediction of Protein Configurational Entropy (Popcoen).

PubMed

Goethe, Martin; Gleixner, Jan; Fita, Ignacio; Rubi, J Miguel

2018-03-13

A knowledge-based method for configurational entropy prediction of proteins is presented; this methodology is extremely fast, compared to previous approaches, because it does not involve any type of configurational sampling. Instead, the configurational entropy of a query fold is estimated by evaluating an artificial neural network, which was trained on molecular-dynamics simulations of ∼1000 proteins. The predicted entropy can be incorporated into a large class of protein software based on cost-function minimization/evaluation, in which configurational entropy is currently neglected for performance reasons. Software of this type is used for all major protein tasks such as structure predictions, protein design, NMR and X-ray refinement, docking, and mutation effect predictions. Integrating the predicted entropy can yield a significant accuracy increase as we show exemplarily for native-state identification with the prominent protein software FoldX. The method has been termed Popcoen for Prediction of Protein Configurational Entropy. An implementation is freely available at http://fmc.ub.edu/popcoen/.
Assembly of the outermost spore layer: pieces of the puzzle are coming together.

PubMed

Stewart, George C

2017-05-01

Certain endospore-forming soil dwelling bacteria are important human, animal or insect pathogens. These organisms produce spores containing an outer layer, the exosporium. The exosporium is the site of interactions between the spore and the soil environment and between the spore and the infected host during the initial stages of infection. The composition and assembly process of the exosporium are poorly understood. This is partly due to the extreme stability of the exosporium that has proven to be refractive to existing methods to deconstruct the intact structure into its component parts. Although more than 20 proteins have been identified as exosporium-associated, their abundance, relationship to other proteins and the processes by which they are assembled to create the exosporium are largely unknown. In this issue of Molecular Microbiology, Terry, Jiang, and colleagues in Per Bullough's laboratory show that the ExsY protein is a major structural protein of the exosporium basal layer of B. cereus family spores and that it can self-assemble into complex structures that possess many of the structural features characteristic of the exosporium basal layer. The authors refined a model for exosporium assembly. Their findings may have implications for exosporium formation in other spore forming bacteria, including Clostridium species. © 2017 John Wiley & Sons Ltd.
Improving the accuracy of macromolecular structure refinement at 7 Å resolution.

PubMed

Brunger, Axel T; Adams, Paul D; Fromme, Petra; Fromme, Raimund; Levitt, Michael; Schröder, Gunnar F

2012-06-06

In X-ray crystallography, molecular replacement and subsequent refinement is challenging at low resolution. We compared refinement methods using synchrotron diffraction data of photosystem I at 7.4 Å resolution, starting from different initial models with increasing deviations from the known high-resolution structure. Standard refinement spoiled the initial models, moving them further away from the true structure and leading to high R(free)-values. In contrast, DEN refinement improved even the most distant starting model as judged by R(free), atomic root-mean-square differences to the true structure, significance of features not included in the initial model, and connectivity of electron density. The best protocol was DEN refinement with initial segmented rigid-body refinement. For the most distant initial model, the fraction of atoms within 2 Å of the true structure improved from 24% to 60%. We also found a significant correlation between R(free) values and the accuracy of the model, suggesting that R(free) is useful even at low resolution. Copyright © 2012 Elsevier Ltd. All rights reserved.
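One of the accuracy measures quoted in this abstract, the fraction of atoms within 2 Å of the true structure, can be sketched as below. This is a minimal illustration, assuming the model and reference coordinates are already in the same frame and listed in the same atom order; the function name `fraction_within`, the inclusive cutoff, and the toy coordinates are hypothetical and not taken from the paper.

```python
import math


def fraction_within(model, reference, cutoff=2.0):
    """Fraction of atoms whose model position lies within `cutoff`
    (in Å) of the matching reference position.

    Assumes both coordinate lists share the same frame and atom order.
    """
    if len(model) != len(reference) or not model:
        raise ValueError("coordinate lists must be non-empty and equal length")
    close = sum(
        1 for p, q in zip(model, reference) if math.dist(p, q) <= cutoff
    )
    return close / len(model)


# Hypothetical coordinates in Å, not real data.
reference = [(0.0, 0.0, 0.0)] * 4
model = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (5.0, 0.0, 0.0), (0.0, 3.0, 0.0)]
frac = fraction_within(model, reference)
```

Unlike a root-mean-square difference, this fraction is not dominated by a few badly misplaced atoms, which may be why the abstract reports both measures.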
Structure of the membrane channel porin from Rhodopseudomonas blastica at 2.0 Å resolution.

PubMed Central

Kreusch, A.; Neubüser, A.; Schiltz, E.; Weckesser, J.; Schulz, G. E.

1994-01-01

The crystal structure of a membrane channel, homotrimeric porin from Rhodopseudomonas blastica has been determined at 2.0 Å resolution by multiple isomorphous replacement and structural refinement. The current model has an R-factor of 16.5% and consists of 289 amino acids, 238 water molecules, and 3 detergent molecules per subunit. The partial protein sequence and subsequently the complete DNA sequence were determined. The general architecture is similar to those of the structurally known porins. As a particular feature there are 3 adjacent binding sites for n-alkyl chains at the molecular 3-fold axis. The side chain arrangement in the channel indicates a transverse electric field across each of the 3 pore eyelets, which may explain the discrimination against nonpolar solutes. Moreover, there are 2 significantly ordered girdles of aromatic residues at the nonpolar/polar borderlines of the interface between protein and membrane. Possibly, these residues shield the polypeptide conformation against adverse membrane fluctuations. PMID:8142898
Beyond basins: φ,ψ preferences of a residue depend heavily on the φ,ψ values of its neighbors.

PubMed

Hollingsworth, Scott A; Lewis, Matthew C; Karplus, P Andrew

2016-09-01

The Ramachandran plot distributions of nonglycine residues from experimentally determined structures are routinely described as grouping into one of six major basins: β, PII, α, αL, ξ and γ'. Recent work describing the most common conformations adopted by pairs of residues in folded proteins [i.e., (φ,ψ)2-motifs] showed that commonly described major basins are not true single thermodynamic basins, but are composed of distinct subregions that are associated with various conformations of either the preceding or following neighbor residue. Here, as documentation of the extent to which the conformational preferences of a central residue are influenced by the conformations of its two neighbors, we present a set of φ,ψ-plots that are delimited simultaneously by the φ,ψ-angles of its neighboring residues on both sides. The level of influence seen here is typically greater than the influence associated with considering the identities of neighboring residues, implying that the use of this heretofore untapped information can improve the accuracy of structure prediction algorithms and low resolution protein structure refinement. © 2016 The Protein Society.
Raster-scanning serial protein crystallography using micro- and nano-focused synchrotron beams

PubMed Central

Coquelle, Nicolas; Brewster, Aaron S.; Kapp, Ulrike; Shilova, Anastasya; Weinhausen, Britta; Burghammer, Manfred; Colletier, Jacques-Philippe

2015-01-01

High-resolution structural information was obtained from lysozyme microcrystals (20 µm in the largest dimension) using raster-scanning serial protein crystallography on micro- and nano-focused beamlines at the ESRF. Data were collected at room temperature (RT) from crystals sandwiched between two silicon nitride wafers, thereby preventing their drying, while limiting background scattering and sample consumption. In order to identify crystal hits, new multi-processing and GUI-driven Python-based pre-analysis software was developed, named NanoPeakCell, that was able to read data from a variety of crystallographic image formats. Further data processing was carried out using CrystFEL, and the resultant structures were refined to 1.7 Å resolution. The data demonstrate the feasibility of RT raster-scanning serial micro- and nano-protein crystallography at synchrotrons and validate it as an alternative approach for the collection of high-resolution structural data from micro-sized crystals. Advantages of the proposed approach are its thriftiness, its handling-free nature, the reduced amount of sample required, the adjustable hit rate, the high indexing rate and the minimization of background scattering. PMID:25945583
Identification and Characterization of a New Pecan [Carya illinoinensis (Wangenh.) K. Koch] Allergen, Car i 2.

PubMed

Zhang, Yuzhu; Lee, BoRam; Du, Wen-Xian; Lyu, Shu-Chen; Nadeau, Kari C; Grauke, Larry J; Zhang, Yan; Wang, Shuo; Fan, Yuting; Yi, Jiang; McHugh, Tara H

2016-05-25

The 7S vicilin and 11S legumin seed storage globulins belong to the cupin protein superfamily and are major food allergens in many foods from the "big eight" food allergen groups. Here, for the first time, pecan vicilin was found to be a food allergen. Western blot experiments revealed that 30% of 27 sera used in this study and 24% of the sera from 25 patients with double-blind, placebo controlled clinical pecan allergy contained IgE antibodies specific to pecan vicilin. This allergen consists of a low-complexity region at its N-terminal and a structured domain at the C-terminal that contains two cupin motifs and forms homotrimers. The crystal structure of recombinant pecan vicilin was determined. The refined structure gave R/Rfree values of 0.218/0.262 for all data to 2.65 Å. There were two trimeric biological units in the crystallographic asymmetric unit. Pecan vicilin is also a copper protein. These data may facilitate the understanding of the nutritional value and the allergenicity relevance of the copper binding property of seed storage proteins in tree nuts.
Raster-scanning serial protein crystallography using micro- and nano-focused synchrotron beams

DOE Office of Scientific and Technical Information (OSTI.GOV)

Coquelle, Nicolas; Brewster, Aaron S.; Kapp, Ulrike

High-resolution structural information was obtained from lysozyme microcrystals (20 µm in the largest dimension) using raster-scanning serial protein crystallography on micro- and nano-focused beamlines at the ESRF. Data were collected at room temperature (RT) from crystals sandwiched between two silicon nitride wafers, thereby preventing their drying, while limiting background scattering and sample consumption. In order to identify crystal hits, new multi-processing and GUI-driven Python-based pre-analysis software was developed, named NanoPeakCell, that was able to read data from a variety of crystallographic image formats. Further data processing was carried out using CrystFEL, and the resultant structures were refined to 1.7 Å resolution. The data demonstrate the feasibility of RT raster-scanning serial micro- and nano-protein crystallography at synchrotrons and validate it as an alternative approach for the collection of high-resolution structural data from micro-sized crystals. Advantages of the proposed approach are its thriftiness, its handling-free nature, the reduced amount of sample required, the adjustable hit rate, the high indexing rate and the minimization of background scattering.
Raster-scanning serial protein crystallography using micro- and nano-focused synchrotron beams.

PubMed

Coquelle, Nicolas; Brewster, Aaron S; Kapp, Ulrike; Shilova, Anastasya; Weinhausen, Britta; Burghammer, Manfred; Colletier, Jacques Philippe

2015-05-01

High-resolution structural information was obtained from lysozyme microcrystals (20 µm in the largest dimension) using raster-scanning serial protein crystallography on micro- and nano-focused beamlines at the ESRF. Data were collected at room temperature (RT) from crystals sandwiched between two silicon nitride wafers, thereby preventing their drying, while limiting background scattering and sample consumption. In order to identify crystal hits, new multi-processing and GUI-driven Python-based pre-analysis software was developed, named NanoPeakCell, that was able to read data from a variety of crystallographic image formats. Further data processing was carried out using CrystFEL, and the resultant structures were refined to 1.7 Å resolution. The data demonstrate the feasibility of RT raster-scanning serial micro- and nano-protein crystallography at synchrotrons and validate it as an alternative approach for the collection of high-resolution structural data from micro-sized crystals. Advantages of the proposed approach are its thriftiness, its handling-free nature, the reduced amount of sample required, the adjustable hit rate, the high indexing rate and the minimization of background scattering.

Raster-scanning serial protein crystallography using micro- and nano-focused synchrotron beams

DOE PAGES

Coquelle, Nicolas; Brewster, Aaron S.; Kapp, Ulrike; ...

2015-04-25

High-resolution structural information was obtained from lysozyme microcrystals (20 µm in the largest dimension) using raster-scanning serial protein crystallography on micro- and nano-focused beamlines at the ESRF. Data were collected at room temperature (RT) from crystals sandwiched between two silicon nitride wafers, thereby preventing their drying, while limiting background scattering and sample consumption. In order to identify crystal hits, new multi-processing and GUI-driven Python-based pre-analysis software was developed, named NanoPeakCell, that was able to read data from a variety of crystallographic image formats. Further data processing was carried out using CrystFEL, and the resultant structures were refined to 1.7 Å resolution. The data demonstrate the feasibility of RT raster-scanning serial micro- and nano-protein crystallography at synchrotrons and validate it as an alternative approach for the collection of high-resolution structural data from micro-sized crystals. Advantages of the proposed approach are its thriftiness, its handling-free nature, the reduced amount of sample required, the adjustable hit rate, the high indexing rate and the minimization of background scattering.
Model-based high-throughput design of ion exchange protein chromatography.

PubMed

Khalaf, Rushd; Heymann, Julia; LeSaout, Xavier; Monard, Florence; Costioli, Matteo; Morbidelli, Massimo

2016-08-12

This work describes the development of a model-based high-throughput design (MHD) tool for the operating space determination of a chromatographic cation-exchange protein purification process. Based on a previously developed thermodynamic mechanistic model, the MHD tool generates a large amount of system knowledge and thereby permits minimizing the required experimental workload. In particular, each new experiment is designed to generate information needed to help refine and improve the model. Unnecessary experiments that do not increase system knowledge are avoided. Instead of aspiring to a perfectly parameterized model, the goal of this design tool is to use early model parameter estimates to find interesting experimental spaces, and to refine the model parameter estimates with each new experiment until a satisfactory set of process parameters is found. The MHD tool is split into four sections: (1) prediction, high throughput experimentation using experiments in (2) diluted conditions and (3) robotic automated liquid handling workstations (robotic workstation), and (4) operating space determination and validation. (1) Protein and resin information, in conjunction with the thermodynamic model, is used to predict protein resin capacity. (2) The predicted model parameters are refined based on gradient experiments in diluted conditions. (3) Experiments on the robotic workstation are used to further refine the model parameters. (4) The refined model is used to determine operating parameter space that allows for satisfactory purification of the protein of interest on the HPLC scale. Each section of the MHD tool is used to define the adequate experimental procedures for the next section, thus avoiding any unnecessary experimental work. 
We used the MHD tool to design a polishing step for two proteins, a monoclonal antibody and a fusion protein, on two chromatographic resins, in order to demonstrate it has the ability to strongly accelerate the early phases of process development. Copyright © 2016 Elsevier B.V. All rights reserved.
Novel proteases from the genome of the carnivorous plant Drosera capensis: structural prediction and comparative analysis

PubMed Central

Butts, Carter T.; Bierma, Jan C.; Martin, Rachel W.

2016-01-01

In his 1875 monograph on insectivorous plants, Darwin described the feeding reactions of Drosera flypaper traps and predicted that their secretions contained a “ferment” similar to mammalian pepsin, an aspartic protease. Here we report a high-quality draft genome sequence for the cape sundew, Drosera capensis, the first genome of a carnivorous plant from order Caryophyllales, which also includes the Venus flytrap (Dionaea) and the tropical pitcher plants (Nepenthes). This species was selected in part for its hardiness and ease of cultivation, making it an excellent model organism for further investigations of plant carnivory. Analysis of predicted protein sequences yields genes encoding proteases homologous to those found in other plants, some of which display sequence and structural features that suggest novel functionalities. Because the sequence similarity to proteins of known structure is in most cases too low for traditional homology modeling, 3D structures of representative proteases are predicted using comparative modeling with all-atom refinement. Although the overall folds and active residues for these proteins are conserved, we find structural and sequence differences consistent with a diversity of substrate recognition patterns. Finally, we predict differences in substrate specificities using in silico experiments, providing targets for structure/function studies of novel enzymes with biological and technological significance. PMID:27353064
Validation of nuclear magnetic resonance structures of proteins and nucleic acids: hydrogen geometry and nomenclature.

PubMed

Doreleijers, J F; Vriend, G; Raves, M L; Kaptein, R

1999-11-15

A statistical analysis is reported of 1,200 of the 1,404 nuclear magnetic resonance (NMR)-derived protein and nucleic acid structures deposited in the Protein Data Bank (PDB) before 1999. Excluded from this analysis were the entries not yet fully validated by the PDB and the more than 100 entries that contained < 95% of the expected hydrogens. The aim was to assess the geometry of the hydrogens in the remaining structures and to provide a check on their nomenclature. Deviations in bond lengths, bond angles, improper dihedral angles, and planarity with respect to estimated values were checked. More than 100 entries showed anomalous protonation states for some of their amino acids. Approximately 250,000 (1.7%) atom names differed from the consensus PDB nomenclature. Most of the inconsistencies are due to swapped prochiral labeling. Large deviations from the expected geometry exist for a considerable number of entries, many of which are average structures. The most common causes for these deviations seem to be poor minimization of average structures and an improper balance between force-field constraints for experimental and holonomic data. Some specific geometric outliers are related to the refinement programs used. A number of recommendations for biomolecular databases, modeling programs, and authors submitting biomolecular structures are given.
Probing the structure of Leishmania donovani chagasi DHFR-TS: comparative protein modeling and protein-ligand interaction studies.

PubMed

Maganti, Lakshmi; Manoharan, Prabu; Ghoshal, Nanda

2010-09-01

Dihydrofolate reductase (DHFR) has been used successfully as a drug target in the area of anti-bacterial, anti-cancer and anti-malarial therapy. It also acts as a drug target for Leishmaniasis. Inhibition of DHFR leads to cell death through lack of thymine (nucleotide metabolism). Although the crystal structures of Leishmania major and Trypanosoma cruzi DHFR-thymidylate synthase (TS) have been resolved, to date there is no three-dimensional (3D)-structural information on DHFR-TS of Leishmania donovani chagasi, which causes visceral leishmaniasis. Our aim in this study was to model the 3D structure of L. donovani chagasi DHFR-TS, and to investigate the structural requirements for its inhibition. In this paper we describe a highly refined homology model of L. donovani chagasi DHFR-TS based on available crystallographic structures by using the Homology module of Insight II. Structural refinement and minimization of the generated L. donovani chagasi DHFR-TS model employed the Discover 3 module of Insight II and molecular dynamic simulations. The model was further validated through use of the PROCHECK, Verify_3D, PROSA, PSQS and ERRAT programs, which confirm that the model is reliable. Superimposition of the model structure with the templates L. major A chain, L. major B chain and T. cruzi A chain showed root mean square deviations of 0.69 Å, 0.71 Å and 1.11 Å, respectively. Docking analysis of the L. donovani chagasi DHFR-TS model with methotrexate enabled us to identify specific residues, viz. Val156, Val30, Lys95, Lys75 and Arg97, within the L. donovani chagasi DHFR-TS binding pocket, that play an important role in ligand or substrate binding. Docking studies clearly indicated that these five residues are important determinants for binding as they have strong hydrogen bonding interactions with the ligand.
Structure of the N-terminal fragment of Escherichia coli Lon protease

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, Mi; Gustchina, Alla; Rasulova, Fatima S.

2010-10-22

The structure of a recombinant construct consisting of residues 1-245 of Escherichia coli Lon protease, the prototypical member of the A-type Lon family, is reported. This construct encompasses all or most of the N-terminal domain of the enzyme. The structure was solved by SeMet SAD to 2.6 Å resolution utilizing trigonal crystals that contained one molecule in the asymmetric unit. The molecule consists of two compact subdomains and a very long C-terminal α-helix. The structure of the first subdomain (residues 1-117), which consists mostly of β-strands, is similar to that of the shorter fragment previously expressed and crystallized, whereas the second subdomain is almost entirely helical. The fold and spatial relationship of the two subdomains, with the exception of the C-terminal helix, closely resemble the structure of BPP1347, a 203-amino-acid protein of unknown function from Bordetella parapertussis, and more distantly several other proteins. It was not possible to refine the structure to satisfactory convergence; however, since almost all of the Se atoms could be located on the basis of their anomalous scattering the correctness of the overall structure is not in question. The structure reported here was also compared with the structures of the putative substrate-binding domains of several proteins, showing topological similarities that should help in defining the binding sites used by Lon substrates.
Crystal Structure of Allophycocyanin from Marine Cyanobacterium Phormidium sp. A09DM

PubMed Central

Gupta, Gagan Deep; Madamwar, Datta

2015-01-01

Isolated phycobilisome (PBS) sub-assemblies have been widely subjected to X-ray crystallography analysis to obtain greater insights into the structure-function relationship of this light harvesting complex. Allophycocyanin (APC) is the phycobiliprotein always found in the PBS core complex. Phycocyanobilin (PCB) chromophores, covalently bound to conserved Cys residues of the α- and β-subunits of APC, are responsible for solar energy absorption from phycocyanin and for transfer to the photosynthetic apparatus. In the known APC structures, heterodimers of α- and β-subunits (known as αβ monomers) assemble as trimers or hexamers. Here we report, for the first time, the crystal structure of APC isolated from a marine cyanobacterium (Phormidium sp. A09DM). The crystal structure has been refined against all the observed data to a resolution of 2.51 Å, to Rwork (Rfree) of 0.158 (0.229), with good stereochemistry of the atomic model. The Phormidium protein exists as a trimer of αβ monomers in solution and in the crystal lattice. The overall tertiary structures of the α- and β-subunits, and the trimeric quaternary fold of the Phormidium protein, resemble the other known APC structures. Also, the configuration and conformation of the two covalently bound PCB chromophores in the marine APC are the same as those observed in fresh water cyanobacteria and marine red algae. More hydrophobic residues, however, constitute the environment of the chromophore bound to the α-subunit of the Phormidium protein, owing mainly to amino acid substitutions in the marine protein. PMID:25923120
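For readers unfamiliar with the Rwork (Rfree) values quoted in this record, the sketch below shows the conventional crystallographic R-factor, R = Σ| |Fobs| − |Fcalc| | / Σ|Fobs|. It assumes the observed and calculated amplitudes are already on a common scale; the function name and the amplitudes are hypothetical and are not data from the APC study. Rfree is the same quantity evaluated over a test set of reflections held out of refinement.

```python
def r_factor(f_obs, f_calc):
    """Conventional R-factor over paired structure-factor amplitudes.

    Assumes both lists are on the same scale (no scale factor is fitted here).
    """
    if len(f_obs) != len(f_calc) or not f_obs:
        raise ValueError("need two non-empty amplitude lists of equal length")
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator


# Hypothetical amplitudes, for illustration only.
example_obs = [100.0, 80.0, 60.0, 40.0]
example_calc = [90.0, 85.0, 55.0, 45.0]

if __name__ == "__main__":
    print(f"R = {r_factor(example_obs, example_calc):.3f}")
```

Computing the same function separately over the working set and the held-out set gives the pair of numbers reported as Rwork (Rfree).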
CNA web server: rigidity theory-based thermal unfolding simulations of proteins for linking structure, (thermo-)stability, and function.

PubMed

Krüger, Dennis M; Rathi, Prakash Chandra; Pfleger, Christopher; Gohlke, Holger

2013-07-01

The Constraint Network Analysis (CNA) web server provides a user-friendly interface to the CNA approach developed in our laboratory for linking results from rigidity analyses to biologically relevant characteristics of a biomolecular structure. The CNA web server provides a refined modeling of thermal unfolding simulations that considers the temperature dependence of hydrophobic tethers and computes a set of global and local indices for quantifying biomacromolecular stability. From the global indices, phase transition points are identified where the structure switches from a rigid to a floppy state; these phase transition points can be related to a protein's (thermo-)stability. Structural weak spots (unfolding nuclei) are automatically identified, too; this knowledge can be exploited in data-driven protein engineering. The local indices are useful in linking flexibility and function and to understand the impact of ligand binding on protein flexibility. The CNA web server robustly handles small-molecule ligands in general. To overcome issues of sensitivity with respect to the input structure, the CNA web server allows performing two ensemble-based variants of thermal unfolding simulations. The web server output is provided as raw data, plots and/or Jmol representations. The CNA web server, accessible at http://cpclab.uni-duesseldorf.de/cna or http://www.cnanalysis.de, is free and open to all users with no login requirement.
Structural studies of P-type ATPase–ligand complexes using an X-ray free-electron laser

DOE PAGES

Bublitz, Maike; Nass, Karol; Drachmann, Nikolaj D.; ...

2015-06-11

Membrane proteins are key players in biological systems, mediating signalling events and the specific transport of, e.g., ions and metabolites. Consequently, membrane proteins are targeted by a large number of currently approved drugs. Understanding their functions and molecular mechanisms is greatly dependent on structural information, not least on complexes with functionally or medically important ligands. Structure determination, however, is hampered by the difficulty of obtaining well diffracting, macroscopic crystals. Here, the feasibility of X-ray free-electron-laser-based serial femtosecond crystallography (SFX) for the structure determination of membrane protein–ligand complexes using microcrystals of various native-source and recombinant P-type ATPase complexes is demonstrated. The data reveal the binding sites of a variety of ligands, including lipids and inhibitors such as the hallmark P-type ATPase inhibitor orthovanadate. By analyzing the resolution dependence of ligand densities and overall model qualities, SFX data quality metrics as well as suitable refinement procedures are discussed. Even at relatively low resolution and multiplicity, the identification of ligands can be demonstrated. This makes SFX a useful tool for ligand screening and thus for unravelling the molecular mechanisms of biologically active proteins.
Computational prediction of atomic structures of helical membrane proteins aided by EM maps.

PubMed

Kovacs, Julio A; Yeager, Mark; Abagyan, Ruben

2007-09-15

Integral membrane proteins pose a major challenge for protein-structure prediction because only approximately 100 high-resolution structures are available currently, thereby impeding the development of rules or empirical potentials to predict the packing of transmembrane alpha-helices. However, when an intermediate-resolution electron microscopy (EM) map is available, it can be used to provide restraints which, in combination with a suitable computational protocol, make structure prediction feasible. In this work we present such a protocol, which proceeds in three stages: 1), generation of an ensemble of alpha-helices by flexible fitting into each of the density rods in the low-resolution EM map, spanning a range of rotational angles around the main helical axes and translational shifts along the density rods; 2), fast optimization of side chains and scoring of the resulting conformations; and 3), refinement of the lowest-scoring conformations with internal coordinate mechanics, by optimizing the van der Waals, electrostatics, hydrogen bonding, torsional, and solvation energy contributions. In addition, our method implements a penalty term through a so-called tethering map, derived from the EM map, which restrains the positions of the alpha-helices. The protocol was validated on three test cases: GpA, KcsA, and MscL.
Blocking Protein kinase C signaling pathway: mechanistic insights into the anti-leishmanial activity of prospective herbal drugs from Withania somnifera

PubMed Central

2012-01-01

Background Leishmaniasis is caused by several species of Leishmania protozoa and is one of the major vector-borne diseases after malaria and sleeping sickness. The toxicity of available drugs and the development of drug resistance by the protozoa in recent years have made leishmaniasis difficult to cure. This creates an urgent need to discover new antileishmanial drug targets and to develop new antileishmanial drugs. Results The tertiary structure of leishmanial protein kinase C was predicted and found stable with an RMSD of 5.8 Å during MD simulations. The natural compound withaferin A inhibited the predicted protein at its active site with a binding free energy of -28.47 kcal/mol. Withanone was also found to inhibit LPKC with a good binding affinity of -22.57 kcal/mol. Both withaferin A and withanone were found to be stable within the binding pocket of the predicted protein when MD simulations of the ligand-bound protein complexes were carried out to examine the consistency of interactions between the two. Conclusions Leishmanial protein kinase C (LPKC) has been identified as a potential target to develop drugs against Leishmaniasis. We modelled and refined the tertiary structure of LPKC using computational methods such as homology modelling and molecular dynamics simulations. This structure of LPKC was used to reveal the mode of inhibition of two natural compounds from Withania somnifera previously reported experimentally, withaferin A and withanone. PMID:23281834
3-Dimensional Protein Structure of Influenza

NASA Technical Reports Server (NTRS)

2004-01-01

The loss of productivity due to flu is staggering. Costs range as much as $20 billion a year. High mutation rates of the flu virus have hindered development of new drugs or vaccines. The secret lies in a small molecule which is attached to the host cell's surface. Each flu virus, no matter what strain, must remove this small molecule to escape the host cell to spread infection. Using data from space and earth grown crystals, researchers from the Center of Macromolecular Crystallography (CMC) are designing drugs to bind with this protein's active site. This lock and key fit reduces the spread of flu in the body by blocking its escape route. In collaboration with its corporate partner, the CMC has refined drug structure in preparation for clinical trials. Tested and approved relief is expected to reach drugstores by year 2004.
On the possibility of using polycrystalline material in the development of structure-based generic assays

DOE Office of Scientific and Technical Information (OSTI.GOV)

Allaire, Marc, E-mail: allaire@bnl.gov; Moiseeva, Natalia; Botez, Cristian E.

The correlation coefficients calculated between raw powder diffraction profiles can be used to identify ligand-bound/unbound states of lysozyme. The discovery of ligands that bind specifically to a targeted protein benefits from the development of generic assays for high-throughput screening of a library of chemicals. Protein powder diffraction (PPD) has been proposed as a potential method for use as a structure-based assay for high-throughput screening applications. Building on this effort, powder samples of bound/unbound states of soluble hen-egg white lysozyme precipitated with sodium chloride were compared. The correlation coefficients calculated between the raw diffraction profiles were consistent with the known binding properties of the ligands and suggested that the PPD approach can be used even prior to a full description using stereochemically restrained Rietveld refinement.
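The comparison described in this record rests on a correlation coefficient between two raw diffraction profiles. The abstract does not say which coefficient was used, so the sketch below assumes the ordinary Pearson coefficient and assumes both profiles are intensity lists sampled at the same scattering angles; the function name and the toy profiles are hypothetical.

```python
import math


def pearson_correlation(profile_a, profile_b):
    """Pearson correlation between two equally sampled intensity profiles."""
    if len(profile_a) != len(profile_b) or len(profile_a) < 2:
        raise ValueError("profiles must have the same length (at least 2 points)")
    mean_a = sum(profile_a) / len(profile_a)
    mean_b = sum(profile_b) / len(profile_b)
    dev_a = [x - mean_a for x in profile_a]
    dev_b = [y - mean_b for y in profile_b]
    covariance = sum(x * y for x, y in zip(dev_a, dev_b))
    norm = math.sqrt(sum(x * x for x in dev_a) * sum(y * y for y in dev_b))
    if norm == 0.0:
        raise ValueError("a constant profile has no defined correlation")
    return covariance / norm


# Toy profiles (hypothetical intensities, not measured data).
unbound = [10.0, 50.0, 20.0, 80.0, 15.0]
bound = [12.0, 30.0, 45.0, 60.0, 18.0]

if __name__ == "__main__":
    print(f"r(unbound, bound) = {pearson_correlation(unbound, bound):.3f}")
```

In a screening setting, a coefficient close to 1 against the unbound reference would suggest an unchanged lattice, while a lower value would flag a candidate binding event for follow-up.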
Modeling Cytoskeletal Active Matter Systems

NASA Astrophysics Data System (ADS)

Blackwell, Robert

Active networks of filamentous proteins and crosslinking motor proteins play a critical role in many important cellular processes. One of the most important microtubule-motor protein assemblies is the mitotic spindle, a self-organized active liquid-crystalline structure that forms during cell division and that ultimately separates chromosomes into two daughter cells. Although the spindle has been intensively studied for decades, the physical principles that govern its self-organization and function remain mysterious. To evolve a better understanding of spindle formation, structure, and dynamics, I investigate coarse-grained models of active liquid-crystalline networks composed of microtubules, modeled as hard spherocylinders, in diffusive equilibrium with a reservoir of active crosslinks, modeled as Hookean springs that can adsorb to microtubules and translocate at finite velocity along the microtubule axis. This model is investigated using a combination of Brownian dynamics and kinetic Monte Carlo simulation. I have further refined this model to simulate spindle formation and kinetochore capture in the fission yeast S. pombe. I then make predictions for experimentally realizable perturbations in motor protein presence and function in S. pombe.
The moving junction of apicomplexan parasites: a key structure for invasion.

PubMed

Besteiro, Sébastien; Dubremetz, Jean-François; Lebrun, Maryse

2011-06-01

Most Apicomplexa are obligate intracellular parasites and many are important pathogens of humans and domestic animals. For a successful cell invasion, they rely on their own motility and on a firm anchorage to their host cell, depending on the secretion of proteins and the establishment of a structure called the moving junction (MJ). The MJ moves from the apical to the posterior end of the parasite, leading to the internalization of the parasite into a parasitophorous vacuole. Based on recent data obtained in Plasmodium and Toxoplasma, an emerging model emphasizes a cooperative role of secreted parasitic proteins in building the MJ and driving this crucial invasive process. More precisely, the parasite exports the microneme protein AMA1 to its own surface and the rhoptry neck RON2 protein as a receptor inserted into the host cell together with other RON partners. Ongoing and future research will certainly help refine the model by characterizing the molecular organization within the MJ and its interactions with both host and parasite cytoskeleton for anchoring of the complex. © 2011 Blackwell Publishing Ltd.
Fast and automated functional classification with MED-SuMo: an application on purine-binding proteins.

PubMed

Doppelt-Azeroual, Olivia; Delfaud, François; Moriaud, Fabrice; de Brevern, Alexandre G

2010-04-01

Ligand-protein interactions are essential for biological processes, and precise characterization of protein binding sites is crucial to understand protein functions. MED-SuMo is a powerful technology to localize similar local regions on protein surfaces. Its heuristic is based on a 3D representation of macromolecules using specific surface chemical features associating chemical characteristics with geometrical properties. MED-SMA is an automated and fast method to classify binding sites. It is based on MED-SuMo technology, which builds a similarity graph, and it uses the Markov Clustering algorithm. Purine binding sites are well studied as drug targets. Here, purine binding sites of the Protein DataBank (PDB) are classified. Proteins potentially inhibited or activated through the same mechanism are gathered. Results are analyzed according to PROSITE annotations and to carefully refined functional annotations extracted from the PDB. As expected, binding sites associated with related mechanisms are gathered, for example, the Small GTPases. Nevertheless, protein kinases from different Kinome families are also found together, for example, Aurora-A and CDK2 proteins which are inhibited by the same drugs. Representative examples of different clusters are presented. The effectiveness of the MED-SMA approach is demonstrated as it gathers binding sites of proteins with similar structure-activity relationships. Moreover, an efficient new protocol associates structures absent of cocrystallized ligands to the purine clusters enabling those structures to be associated with a specific binding mechanism. Applications of this classification by binding mode similarity include target-based drug design and prediction of cross-reactivity and therefore potential toxic side effects.
Fast and automated functional classification with MED-SuMo: An application on purine-binding proteins

PubMed Central

Doppelt-Azeroual, Olivia; Delfaud, François; Moriaud, Fabrice; de Brevern, Alexandre G

2010-01-01

Ligand–protein interactions are essential for biological processes, and precise characterization of protein binding sites is crucial to understand protein functions. MED-SuMo is a powerful technology to localize similar local regions on protein surfaces. Its heuristic is based on a 3D representation of macromolecules using specific surface chemical features associating chemical characteristics with geometrical properties. MED-SMA is an automated and fast method to classify binding sites. It is based on MED-SuMo technology, which builds a similarity graph, and it uses the Markov Clustering algorithm. Purine binding sites are well studied as drug targets. Here, purine binding sites of the Protein DataBank (PDB) are classified. Proteins potentially inhibited or activated through the same mechanism are gathered. Results are analyzed according to PROSITE annotations and to carefully refined functional annotations extracted from the PDB. As expected, binding sites associated with related mechanisms are gathered, for example, the Small GTPases. Nevertheless, protein kinases from different Kinome families are also found together, for example, Aurora-A and CDK2 proteins which are inhibited by the same drugs. Representative examples of different clusters are presented. The effectiveness of the MED-SMA approach is demonstrated as it gathers binding sites of proteins with similar structure-activity relationships. Moreover, an efficient new protocol associates structures absent of cocrystallized ligands to the purine clusters enabling those structures to be associated with a specific binding mechanism. Applications of this classification by binding mode similarity include target-based drug design and prediction of cross-reactivity and therefore potential toxic side effects. PMID:20162627
Structure and Dynamics of Type III Secretion Effector Protein ExoU As determined by SDSL-EPR Spectroscopy in Conjunction with De Novo Protein Folding

PubMed Central

2017-01-01

ExoU is a 74 kDa cytotoxin that undergoes substantial conformational changes as part of its function, that is, it has multiple thermodynamically stable conformations that interchange depending on its environment. Such flexible proteins pose unique challenges to structural biology: (1) not only is it often difficult to determine structures by X-ray crystallography for all biologically relevant conformations because of the flat energy landscape (2) but also experimental conditions can easily perturb the biologically relevant conformation. The first challenge can be overcome by applying orthogonal structural biology techniques that are capable of observing alternative, biologically relevant conformations. The second challenge can be addressed by determining the structure in the same biological state with two independent techniques under different experimental conditions. If both techniques converge to the same structural model, the confidence that an unperturbed biologically relevant conformation is observed increases. To this end, we determine the structure of the C-terminal domain of the effector protein, ExoU, from data obtained by electron paramagnetic resonance spectroscopy in conjunction with site-directed spin labeling and in silico de novo structure determination. Our protocol encompasses a multimodule approach, consisting of low-resolution topology sampling, clustering, and high-resolution refinement. The resulting model was compared with an ExoU model in complex with its chaperone SpcU obtained previously by X-ray crystallography. The two models converged to a minimal RMSD100 of 3.2 Å, providing evidence that the unbound structure of ExoU matches the fold observed in complex with SpcU. PMID:28691114
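The RMSD100 value quoted in this record is a length-normalized RMSD. The abstract does not give the formula; the sketch below assumes the commonly used Carugo and Pongor normalization, RMSD100 = RMSD / (1 + ln sqrt(N/100)), where N is the number of aligned residues. The function name and the example numbers are hypothetical.

```python
import math


def rmsd100(rmsd, n_residues):
    """Normalize an RMSD to the value expected for a 100-residue protein.

    Assumes the Carugo-Pongor form; it is only defined while the
    denominator stays positive (roughly N >= 14).
    """
    denominator = 1.0 + math.log(math.sqrt(n_residues / 100.0))
    if denominator <= 0.0:
        raise ValueError("normalization undefined for so few residues")
    return rmsd / denominator


if __name__ == "__main__":
    # Hypothetical values: the same raw RMSD reads as a better match
    # when it is achieved over a longer aligned stretch.
    for n in (50, 100, 300):
        print(n, round(rmsd100(4.0, n), 2))
```

The normalization leaves a 100-residue comparison unchanged, lowers the value for longer proteins and raises it for shorter ones, which makes models of different sizes easier to compare.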
Using Entropy Maximization to Understand the Determinants of Structural Dynamics beyond Native Contact Topology

PubMed Central

Lezon, Timothy R.; Bahar, Ivet

2010-01-01

Comparison of elastic network model predictions with experimental data has provided important insights on the dominant role of the network of inter-residue contacts in defining the global dynamics of proteins. Most of these studies have focused on interpreting the mean-square fluctuations of residues, or deriving the most collective, or softest, modes of motions that are known to be insensitive to structural and energetic details. However, with increasing structural data, we are in a position to perform a more critical assessment of the structure-dynamics relations in proteins, and gain a deeper understanding of the major determinants of not only the mean-square fluctuations and lowest frequency modes, but the covariance or the cross-correlations between residue fluctuations and the shapes of higher modes. A systematic study of a large set of NMR-determined proteins is analyzed using a novel method based on entropy maximization to demonstrate that the next level of refinement in the elastic network model description of proteins ought to take into consideration properties such as contact order (or sequential separation between contacting residues) and the secondary structure types of the interacting residues, whereas the types of amino acids do not play a critical role. Most importantly, an optimal description of observed cross-correlations requires the inclusion of destabilizing, as opposed to exclusively stabilizing, interactions, stipulating the functional significance of local frustration in imparting native-like dynamics. This study provides us with a deeper understanding of the structural basis of experimentally observed behavior, and opens the way to the development of more accurate models for exploring protein dynamics. PMID:20585542
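The abstract names contact order, the sequential separation between contacting residues, as a property the refined elastic network model should take into account. The sketch below shows one simple way to extract contacts and their sequence separations from Cα coordinates. The 8 Å cutoff, the function names, and the toy chain are assumptions made for illustration and are not taken from the paper.

```python
import math


def contacts(ca_coords, cutoff=8.0, min_separation=1):
    """Return residue index pairs (i, j), i < j, whose C-alpha atoms lie within cutoff."""
    pairs = []
    n = len(ca_coords)
    for i in range(n):
        for j in range(i + min_separation, n):
            if math.dist(ca_coords[i], ca_coords[j]) <= cutoff:
                pairs.append((i, j))
    return pairs


def relative_contact_order(ca_coords, cutoff=8.0, min_separation=1):
    """Mean sequence separation of contacts, divided by chain length."""
    pairs = contacts(ca_coords, cutoff, min_separation)
    if not pairs:
        return 0.0
    total_separation = sum(j - i for i, j in pairs)
    return total_separation / (len(ca_coords) * len(pairs))


# Toy chain: four C-alpha atoms on a straight line, 3.8 Å apart (hypothetical).
toy_chain = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]

if __name__ == "__main__":
    print(contacts(toy_chain))
    print(relative_contact_order(toy_chain))
```

In an elastic network setting, the separation j − i of each contact could then be used to assign a different spring constant to sequence-local and sequence-distant pairs.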
Using entropy maximization to understand the determinants of structural dynamics beyond native contact topology.

PubMed

Lezon, Timothy R; Bahar, Ivet

2010-06-17

Comparison of elastic network model predictions with experimental data has provided important insights on the dominant role of the network of inter-residue contacts in defining the global dynamics of proteins. Most of these studies have focused on interpreting the mean-square fluctuations of residues, or deriving the most collective, or softest, modes of motions that are known to be insensitive to structural and energetic details. However, with increasing structural data, we are in a position to perform a more critical assessment of the structure-dynamics relations in proteins, and gain a deeper understanding of the major determinants of not only the mean-square fluctuations and lowest frequency modes, but the covariance or the cross-correlations between residue fluctuations and the shapes of higher modes. A systematic study of a large set of NMR-determined proteins is analyzed using a novel method based on entropy maximization to demonstrate that the next level of refinement in the elastic network model description of proteins ought to take into consideration properties such as contact order (or sequential separation between contacting residues) and the secondary structure types of the interacting residues, whereas the types of amino acids do not play a critical role. Most importantly, an optimal description of observed cross-correlations requires the inclusion of destabilizing, as opposed to exclusively stabilizing, interactions, stipulating the functional significance of local frustration in imparting native-like dynamics. This study provides us with a deeper understanding of the structural basis of experimentally observed behavior, and opens the way to the development of more accurate models for exploring protein dynamics.

SMOQ: a tool for predicting the absolute residue-specific quality of a single protein model with support vector machines

PubMed Central

2014-01-01

Background It is important to predict the quality of a protein structural model before its native structure is known. Methods that can predict the absolute local quality of individual residues in a single protein model are rare, yet particularly needed for using, ranking and refining protein models. Results We developed a machine learning tool (SMOQ) that can predict the distance deviation of each residue in a single protein model. SMOQ uses support vector machines (SVM) with protein sequence and structural features (i.e. basic feature set), including amino acid sequence, secondary structures, solvent accessibilities, and residue-residue contacts to make predictions. We also trained an SVM model with two additional features (profiles and SOV scores) on 20 CASP8 targets and found that including them improves performance only when the real deviations between native and model are higher than 5 Å. The released SMOQ tool uses the basic feature set trained on 85 CASP8 targets. Moreover, SMOQ implements a way to convert predicted local quality scores into a global quality score. SMOQ was tested on the 84 CASP9 single-domain targets. The average difference between the residue-specific distance deviation predicted by our method and the actual distance deviation on the test data is 2.637 Å. The global quality prediction accuracy of the tool is comparable to other good tools on the same benchmark. Conclusion SMOQ is a useful tool for protein single model quality assessment. Its source code and executable are available at: http://sysbio.rnet.missouri.edu/multicom_toolbox/. PMID:24776231
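Two quantities in this record lend themselves to a short illustration: the average difference between predicted and actual per-residue deviations, and the conversion of local scores into a global score. The abstract does not state the conversion formula, so the sketch below assumes an S-score-style transform, 1 / (1 + (d/T)²) averaged over residues, with a hypothetical threshold T = 5 Å. All function names and numbers are illustrative and not taken from SMOQ itself.

```python
def mean_absolute_error(predicted, actual):
    """Average absolute difference between predicted and actual deviations (Å)."""
    if len(predicted) != len(actual) or not predicted:
        raise ValueError("need two non-empty lists of equal length")
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(predicted)


def global_quality(predicted_deviations, threshold=5.0):
    """Map per-residue deviations (Å) to a global score in (0, 1].

    Assumed transform: each residue contributes 1 / (1 + (d / T)^2),
    so a perfectly placed residue scores 1 and a residue at d = T scores 0.5.
    """
    if not predicted_deviations:
        raise ValueError("no residues given")
    scores = [1.0 / (1.0 + (d / threshold) ** 2) for d in predicted_deviations]
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Hypothetical per-residue deviations in Å.
    predicted = [0.8, 1.5, 4.0, 9.0]
    actual = [1.0, 1.0, 6.0, 7.5]
    print(mean_absolute_error(predicted, actual))
    print(global_quality(predicted))
```

A bounded transform of this kind keeps a few badly placed residues from dominating the global score, which a plain mean of deviations would not.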
SMOQ: a tool for predicting the absolute residue-specific quality of a single protein model with support vector machines.

PubMed

Cao, Renzhi; Wang, Zheng; Wang, Yiheng; Cheng, Jianlin

2014-04-28

It is important to predict the quality of a protein structural model before its native structure is known. Methods that can predict the absolute local quality of individual residues in a single protein model are rare, yet particularly needed for using, ranking and refining protein models. We developed a machine learning tool (SMOQ) that can predict the distance deviation of each residue in a single protein model. SMOQ uses support vector machines (SVM) with protein sequence and structural features (i.e. basic feature set), including amino acid sequence, secondary structures, solvent accessibilities, and residue-residue contacts to make predictions. We also trained an SVM model with two additional features (profiles and SOV scores) on 20 CASP8 targets and found that including them improves performance only when the real deviations between native and model are higher than 5 Å. The released SMOQ tool uses the basic feature set trained on 85 CASP8 targets. Moreover, SMOQ implements a way to convert predicted local quality scores into a global quality score. SMOQ was tested on the 84 CASP9 single-domain targets. The average difference between the residue-specific distance deviation predicted by our method and the actual distance deviation on the test data is 2.637 Å. The global quality prediction accuracy of the tool is comparable to other good tools on the same benchmark. SMOQ is a useful tool for protein single model quality assessment. Its source code and executable are available at: http://sysbio.rnet.missouri.edu/multicom_toolbox/.
Crystallization and initial crystallographic characterization of the Corynebacterium glutamicum nitrilotriacetate monooxygenase component A

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kim, Kyung-Jin, E-mail: kkj@postech.ac.kr; Kim, Sujin; Lee, Sujin

2006-11-01

The Corynebacterium glutamicum NTA monooxygenase component A protein, which plays the central role in NTA biodegradation, was crystallized. The initial X-ray crystallographic characterization is reported. Safety and environmental concerns have recently dictated the proper disposal of nitrilotriacetate (NTA). Biodegradation of NTA is initiated by NTA monooxygenase, which is composed of two proteins: component A and component B. The NTA monooxygenase component A protein from Corynebacterium glutamicum was crystallized using the sitting-drop vapour-diffusion method in the presence of ammonium sulfate as the precipitant. X-ray diffraction data were collected to a maximum resolution of 2.5 Å on a synchrotron beamline. The crystal belongs to the monoclinic space group C2, with unit-cell parameters a = 111.04, b = 98.51, c = 171.61 Å, β = 101.94°. The asymmetric unit consists of four molecules, corresponding to a packing density of 2.3 Å³ Da⁻¹. The structure was solved by molecular replacement. Structure refinement is in progress.
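The packing density quoted in this record (the Matthews coefficient) follows from the unit-cell volume divided by the total protein mass in the cell. The sketch below reproduces that arithmetic from the reported cell parameters. The molecular mass of about 50 kDa per molecule is an assumption chosen for illustration, since the abstract does not state it; the function names are likewise hypothetical.

```python
import math


def monoclinic_cell_volume(a, b, c, beta_deg):
    """Volume (Å^3) of a monoclinic unit cell: V = a * b * c * sin(beta)."""
    return a * b * c * math.sin(math.radians(beta_deg))


def matthews_coefficient(cell_volume, n_symmetry_ops, n_molecules_per_asu, molecular_mass_da):
    """Matthews coefficient V_M (Å^3 per Da) = V / (Z * n * M)."""
    return cell_volume / (n_symmetry_ops * n_molecules_per_asu * molecular_mass_da)


# Cell parameters as reported in the abstract.
volume = monoclinic_cell_volume(111.04, 98.51, 171.61, 101.94)

# Space group C2 has four symmetry-equivalent general positions;
# the abstract reports four molecules per asymmetric unit.
# ASSUMPTION: ~50 kDa per molecule (not stated in the abstract).
v_m = matthews_coefficient(volume, 4, 4, 50_000.0)

if __name__ == "__main__":
    print(f"V = {volume:.0f} Å^3, V_M = {v_m:.2f} Å^3/Da")
```

With the assumed mass, the result lands close to the reported 2.3 Å³ Da⁻¹; a different true mass would shift the value proportionally.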
The Smooth Muscle of the Artery

DTIC Science & Technology

1975-01-01

membrane structure (343). Certainly, the transported proteins may serve as a source of amino acids (lysine-rich and/or proline-rich proteins?) for bio...into Long-Chain Fatty Acids by Cellular Fractions from Normal Rabbit Aorts Supernatant fraction METABOLIC CHARACTERISTICS OF SMOOTH MUSCLE I shall...acetate into fatty acids in a homogenate of rat aorta. Lipid Synthesis It took many more years and much refinement of techniques to analyze more by
ILP-2 modeling and virtual screening of an FDA-approved library: a possible anticancer therapy.

PubMed

Khalili, Saeed; Mohammadpour, Hemn; Shokrollahi Barough, Mahideh; Kokhaei, Parviz

2016-06-23

The members of the inhibitors of apoptosis protein (IAP) family inhibit diverse components of the caspase signaling pathway, notably caspases 3, 7, and 9. ILP-2 (BIRC-8) is the most recently identified member of the IAPs, mainly interacting with caspase 9. This interaction would eventually lead to death resistance in the case of cancerous cells. Therefore, structural modeling of ILP-2 and finding applicable inhibitors of its interaction with caspase 9 are a compelling challenge. Three main protein modeling approaches along with various model refinement measures were harnessed to achieve a reliable 3D model, using state-of-the-art software. Thereafter, the selected model was employed to perform virtual screening of an FDA-approved library. A model built by a combinatorial approach (homology and ab initio approaches) was chosen as the best model. Model refinement processes successfully bolstered the model quality. Virtual screening of the compound library introduced several high affinity inhibitor candidates that interact with functional residues of ILP-2. Given the 3D structure of the ILP-2 molecule, we found promising inhibitory molecules. In addition to high affinity towards the ILP-2 molecule, these molecules interact with residues that play pivotal roles in the ILP-2-caspase interaction. These molecules would inhibit the ILP-2-caspase interaction and consequently would lead to reactivated cell apoptosis through the caspase pathway.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Hamiaux, C.; Stanley, D.; Greenwood, D.R.

Takeout (To) proteins are found exclusively in insects and have been proposed to have important roles in various aspects of their physiology and behavior. Limited sequence similarity with juvenile hormone-binding proteins (JHBPs), which specifically bind and transport juvenile hormones in Lepidoptera, suggested a role for To proteins in binding hydrophobic ligands. We present the first crystal structure of a To protein, EpTo1 from the light brown apple moth Epiphyas postvittana, solved in-house by the single-wavelength anomalous diffraction technique using sulfur anomalous dispersion, and refined to 1.3 Å resolution. EpTo1 adopts the unusual α/β-wrap fold, seen only for JHBP and several mammalian lipid carrier proteins, a scaffold tailored for the binding and/or transport of hydrophobic ligands. EpTo1 has a 45 Å long, purely hydrophobic, internal tunnel that extends for the full length of the protein and accommodates a bound ligand. The latter was shown by mass spectrometry to be ubiquinone-8 and is probably derived from Escherichia coli. The structure provides the first direct experimental evidence that To proteins are ligand carriers; gives insights into the nature of endogenous ligand(s) of EpTo1; shows, by comparison with JHBP, a basis for different ligand specificities; and suggests a mechanism for the binding/release of ligands.
Homology-based Modeling of Rhodopsin-like Family Members in the Inactive State: Structural Analysis and Deduction of Tips for Modeling and Optimization.

PubMed

Pappalardo, Matteo; Rayan, Mahmoud; Abu-Lafi, Saleh; Leonardi, Martha E; Milardi, Danilo; Guccione, Salvatore; Rayan, Anwar

2017-08-01

Modeling G-Protein Coupled Receptors (GPCRs) is an emergent field of research, since the use of high-quality models in receptor structure-based strategies might facilitate the discovery of interesting drug candidates. The findings from a quantitative analysis of eighteen resolved structures of rhodopsin family "A" receptors crystallized with antagonists and 153 pairs of structures are described. A strategy termed endeca-amino acids fragmentation was used to analyze the structural models, aiming to detect the relationship between sequence identity and Root Mean Square Deviation (RMSD) at each trans-membrane domain. Moreover, we have applied the leave-one-out strategy to study the shiftiness likelihood of the helices. The type of correlation between sequence identity and RMSD was studied using the aforementioned set of receptors as representatives of membrane proteins and 98 serine proteases with 4753 pairs of structures as representatives of globular proteins. Data analysis using the fragmentation strategy revealed that there is some extent of correlation between sequence identity and global RMSD of 11AA width windows. However, spatial conservation is not always close to the endoplasmic side as was reported before. A comparative study with globular proteins shows that GPCRs have a higher standard deviation and a higher slope in the graph correlating sequence identity and RMSD. The extracted information disclosed in this paper could be incorporated into modeling protocols when using techniques for model optimization and refinement. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
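The pair counts in this record follow directly from all-against-all comparison (n structures give n(n − 1)/2 pairs), and the 11-residue fragmentation can be pictured as a sliding window over an alignment. The sketch below checks the counts and computes per-window sequence identity. It assumes two pre-aligned, gap-free sequences of equal length; the function names and the toy sequences are hypothetical, and the paper's actual windowing details may differ.

```python
from math import comb


def n_pairs(n_structures):
    """Number of unordered structure pairs in an all-against-all comparison."""
    return comb(n_structures, 2)


def window_identities(seq_a, seq_b, width=11):
    """Fractional sequence identity in each sliding window of the given width.

    Assumes seq_a and seq_b are already aligned, gap-free and equally long.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    identities = []
    for start in range(len(seq_a) - width + 1):
        window_a = seq_a[start:start + width]
        window_b = seq_b[start:start + width]
        matches = sum(x == y for x, y in zip(window_a, window_b))
        identities.append(matches / width)
    return identities


if __name__ == "__main__":
    print(n_pairs(18), n_pairs(98))
    # Toy 12-residue sequences differing at one position (hypothetical).
    print(window_identities("ACDEFGHIKLMN", "ACDEFGHIKLMW"))
```

Pairing each window's identity with the RMSD of the same window in two superposed structures would give the kind of identity-versus-RMSD scatter that the abstract describes.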
Customizing G Protein-coupled receptor models for structure-based virtual screening.

PubMed

de Graaf, Chris; Rognan, Didier

2009-01-01

This review will focus on the construction, refinement, and validation of G Protein-coupled receptor models for the purpose of structure-based virtual screening. Practical tips and tricks derived from concrete modeling and virtual screening exercises to overcome the problems and pitfalls associated with the different steps of the receptor modeling workflow will be presented. These examples will not only include rhodopsin-like (class A), but also secretin-like (class B), and glutamate-like (class C) receptors. In addition, the review will present a careful comparative analysis of current crystal structures and their implications for homology modeling. The following themes will be discussed: i) the use of experimental anchors in guiding the modeling procedure; ii) amino acid sequence alignments; iii) ligand binding mode accommodation and binding cavity expansion; iv) proline-induced kinks in transmembrane helices; v) binding mode prediction and virtual screening by receptor-ligand interaction fingerprint scoring; vi) extracellular loop modeling; vii) virtual filtering schemes. Finally, an overview of several successful structure-based screening studies shows that receptor models, despite structural inaccuracies, can be efficiently used to find novel ligands.
Formation of Polyphenol-Denatured Protein Flocs in Alcohol Beverages Sweetened with Refined Cane Sugars.

PubMed

Eggleston, Gillian; Triplett, Alexa

2017-11-08

The sporadic appearance of floc from refined, white cane sugars in alcohol beverages remains a technical problem for both beverage manufacturers and sugar refiners. Cane invert sugars mixed with 60% pure alcohol and water increased light scattering by up to ∼1000-fold. Insoluble and soluble starch, fat, inorganic ash, oligosaccharides, Brix, and pH were not involved in the prevailing floc-formation mechanism. Strong polynomial correlations existed between the haze floc and indicator values (IVs) (color at 420 nm pH 9.0/color at pH 4.0, an indirect measure of polyphenolic and flavonoid colorants) (R² = 0.815) and protein (R² = 0.819) content of the invert sugars. Ethanol-induced denaturation of the protein exposed hydrophobic polyphenol-binding sites that were further exposed when heated to 80 °C. A tentative mechanism for floc formation was advanced by molecular probing with a haze (floc) active protein and polyphenol as well as polar, nonpolar, and ionic solvents.
GCView: the genomic context viewer for protein homology searches

PubMed Central

Grin, Iwan; Linke, Dirk

2011-01-01

Genomic neighborhood can provide important insights into evolution and function of a protein or gene. When looking at operons, changes in operon structure and composition can only be revealed by looking at the operon as a whole. To facilitate the analysis of the genomic context of a query in multiple organisms we have developed Genomic Context Viewer (GCView). GCView accepts results from one or multiple protein homology searches such as BLASTp as input. For each hit, the neighboring protein-coding genes are extracted, the regions of homology are labeled for each input and the results are presented as a clear, interactive graphical output. It is also possible to add more searches to iteratively refine the output. GCView groups outputs by the hits for different proteins. This allows for easy comparison of different operon compositions and structures. The tool is embedded in the framework of the Bioinformatics Toolkit of the Max-Planck Institute for Developmental Biology (MPI Toolkit). Job results from the homology search tools inside the MPI Toolkit can be forwarded to GCView and results can be subsequently analyzed by sequence analysis tools. Results are stored online, allowing for later reinspection. GCView is freely available at http://toolkit.tuebingen.mpg.de/gcview. PMID:21609955
Crystallographic studies of the complex of human HINT1 protein with a non-hydrolyzable analog of Ap4A.

PubMed

Dolot, Rafał; Kaczmarek, Renata; Sęda, Aleksandra; Krakowiak, Agnieszka; Baraniak, Janina; Nawrot, Barbara

2016-06-01

Histidine triad nucleotide-binding protein 1 (HINT1) represents the most ancient and widespread branch in the histidine triad proteins superfamily. HINT1 plays an important role in various biological processes, and it has been found in many species. Here, we report the first structure (at a 2.34Å resolution) of a complex of human HINT1 with a non-hydrolyzable analog of an Ap4A dinucleotide, containing bis-phosphorothioated glycerol mimicking a polyphosphate chain, obtained from a primitive monoclinic space group P21 crystal. In addition, the apo form of hHINT1 at the space group P21 refined to 1.92Å is reported for comparative studies. Copyright © 2016 Elsevier B.V. All rights reserved.
Iterative model building, structure refinement and density modification with the PHENIX AutoBuild wizard.

PubMed

Terwilliger, Thomas C; Grosse-Kunstleve, Ralf W; Afonine, Pavel V; Moriarty, Nigel W; Zwart, Peter H; Hung, Li Wei; Read, Randy J; Adams, Paul D

2008-01-01

The PHENIX AutoBuild wizard is a highly automated tool for iterative model building, structure refinement and density modification using RESOLVE model building, RESOLVE statistical density modification and phenix.refine structure refinement. Recent advances in the AutoBuild wizard and phenix.refine include automated detection and application of NCS from models as they are built, extensive model-completion algorithms and automated solvent-molecule picking. Model-completion algorithms in the AutoBuild wizard include loop building, crossovers between chains in different models of a structure and side-chain optimization. The AutoBuild wizard has been applied to a set of 48 structures at resolutions ranging from 1.1 to 3.2 Å, resulting in a mean R factor of 0.24 and a mean free R factor of 0.29. The R factor of the final model is dependent on the quality of the starting electron density and is relatively independent of resolution.
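For readers unfamiliar with the R factor quoted in this abstract, the following Python sketch computes the conventional crystallographic residual, R = Σ| |Fobs| − |Fcalc| | / Σ|Fobs|. It is an illustration only and not PHENIX code; the function name is hypothetical, and the amplitudes are assumed to be already on a common scale. The free R factor uses the same formula, evaluated over the reflections held out of refinement.

```python
def r_factor(f_obs, f_calc):
    """Crystallographic R factor for paired structure-factor amplitudes.

    Assumes f_obs and f_calc are already on the same scale
    (no scale factor is fitted here).
    """
    if len(f_obs) != len(f_calc):
        raise ValueError("f_obs and f_calc must have the same length")
    denominator = sum(abs(fo) for fo in f_obs)
    if denominator == 0:
        raise ValueError("sum of observed amplitudes is zero")
    numerator = sum(abs(abs(fo) - abs(fc)) for fo, fc in zip(f_obs, f_calc))
    return numerator / denominator
```

Calling the same function on the working set and on the held-out test set gives the R and free R values, respectively.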
Accounting for epistatic interactions improves the functional analysis of protein structures.

PubMed

Wilkins, Angela D; Venner, Eric; Marciano, David C; Erdin, Serkan; Atri, Benu; Lua, Rhonald C; Lichtarge, Olivier

2013-11-01

The constraints under which sequence, structure and function coevolve are not fully understood. Bringing this mutual relationship to light can reveal the molecular basis of binding, catalysis and allostery, thereby identifying function and rationally guiding protein redesign. Underlying these relationships are the epistatic interactions that occur when the consequences of a mutation to a protein are determined by the genetic background in which it occurs. Based on prior data, we hypothesize that epistatic forces operate most strongly between residues nearby in the structure, resulting in smooth evolutionary importance across the structure. We find that when residue scores of evolutionary importance are distributed smoothly between nearby residues, functional site prediction accuracy improves. Accordingly, we designed a novel measure of evolutionary importance that focuses on the interaction between pairs of structurally neighboring residues. This measure that we term pair-interaction Evolutionary Trace yields greater functional site overlap and better structure-based proteome-wide functional predictions. Our data show that the structural smoothness of evolutionary importance is a fundamental feature of the coevolution of sequence, structure and function. Mutations operate on individual residues, but selective pressure depends in part on the extent to which a mutation perturbs interactions with neighboring residues. In practice, this principle led us to redefine the importance of a residue in terms of the importance of its epistatic interactions with neighbors, yielding better annotation of functional residues, motivating experimental validation of a novel functional site in LexA and refining protein function prediction. Contact: lichtarge@bcm.edu. Supplementary data are available at Bioinformatics online.
Accounting for epistatic interactions improves the functional analysis of protein structures

PubMed Central

Wilkins, Angela D.; Venner, Eric; Marciano, David C.; Erdin, Serkan; Atri, Benu; Lua, Rhonald C.; Lichtarge, Olivier

2013-01-01

Motivation: The constraints under which sequence, structure and function coevolve are not fully understood. Bringing this mutual relationship to light can reveal the molecular basis of binding, catalysis and allostery, thereby identifying function and rationally guiding protein redesign. Underlying these relationships are the epistatic interactions that occur when the consequences of a mutation to a protein are determined by the genetic background in which it occurs. Based on prior data, we hypothesize that epistatic forces operate most strongly between residues nearby in the structure, resulting in smooth evolutionary importance across the structure. Methods and Results: We find that when residue scores of evolutionary importance are distributed smoothly between nearby residues, functional site prediction accuracy improves. Accordingly, we designed a novel measure of evolutionary importance that focuses on the interaction between pairs of structurally neighboring residues. This measure that we term pair-interaction Evolutionary Trace yields greater functional site overlap and better structure-based proteome-wide functional predictions. Conclusions: Our data show that the structural smoothness of evolutionary importance is a fundamental feature of the coevolution of sequence, structure and function. Mutations operate on individual residues, but selective pressure depends in part on the extent to which a mutation perturbs interactions with neighboring residues. In practice, this principle led us to redefine the importance of a residue in terms of the importance of its epistatic interactions with neighbors, yielding better annotation of functional residues, motivating experimental validation of a novel functional site in LexA and refining protein function prediction. Contact: lichtarge@bcm.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24021383
Integration of QUARK and I-TASSER for Ab Initio Protein Structure Prediction in CASP11.

PubMed

Zhang, Wenxuan; Yang, Jianyi; He, Baoji; Walker, Sara Elizabeth; Zhang, Hongjiu; Govindarajoo, Brandon; Virtanen, Jouko; Xue, Zhidong; Shen, Hong-Bin; Zhang, Yang

2016-09-01

We tested two pipelines developed for template-free protein structure prediction in the CASP11 experiment. First, the QUARK pipeline constructs structure models by reassembling fragments of continuously distributed lengths excised from unrelated proteins. Five free-modeling (FM) targets have the model successfully constructed by QUARK with a TM-score above 0.4, including the first model of T0837-D1, which has a TM-score = 0.736 and RMSD = 2.9 Å to the native. Detailed analysis showed that the success is partly attributed to the high-resolution contact map prediction derived from fragment-based distance-profiles, which are mainly located between regular secondary structure elements and loops/turns and help guide the orientation of secondary structure assembly. In the Zhang-Server pipeline, weakly scoring threading templates are re-ordered by the structural similarity to the ab initio folding models, which are then reassembled by I-TASSER based structure assembly simulations; 60% more domains with length up to 204 residues, compared to the QUARK pipeline, were successfully modeled by the I-TASSER pipeline with a TM-score above 0.4. The robustness of the I-TASSER pipeline can stem from the composite fragment-assembly simulations that combine structures from both ab initio folding and threading template refinements. Despite the promising cases, challenges still exist in long-range beta-strand folding, domain parsing, and the uncertainty of secondary structure prediction; the latter of which was found to affect nearly all aspects of FM structure predictions, from fragment identification, target classification, structure assembly, to final model selection. Significant efforts are needed to solve these problems before real progress on FM could be made. Proteins 2016; 84(Suppl 1):76-86. © 2015 Wiley Periodicals, Inc.
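The TM-score thresholds cited in this abstract refer to a length-normalized similarity measure. The Python sketch below evaluates the standard TM-score expression for one fixed superposition, using the usual length-dependent scale d0 = 1.24 (L − 15)^(1/3) − 1.8. It is a simplified illustration: the full measure maximizes this sum over superpositions, which is not attempted here, and the function names are hypothetical.

```python
def tm_d0(target_length):
    """Length-dependent distance scale d0 (in Å) of the TM-score."""
    d0 = 1.24 * (target_length - 15) ** (1.0 / 3.0) - 1.8
    if d0 <= 0:
        raise ValueError("target too short for this simplified sketch")
    return d0

def tm_score_fixed(distances, target_length):
    """TM-score for one given superposition.

    distances: Cα-Cα distances (Å) of the aligned residue pairs.
    Unaligned target residues contribute zero, so the list may be
    shorter than target_length.
    """
    d0 = tm_d0(target_length)
    return sum(1.0 / (1.0 + (d / d0) ** 2) for d in distances) / target_length
```

A perfect match of every residue gives a score of 1, and every aligned pair sitting exactly at distance d0 gives 0.5.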
Crystallization and preliminary X-ray characterization of the genetically encoded fluorescent calcium indicator protein GCaMP2

PubMed Central

Rodríguez Guilbe, María M.; Alfaro Malavé, Elisa C.; Akerboom, Jasper; Marvin, Jonathan S.; Looger, Loren L.; Schreiter, Eric R.

2008-01-01

Fluorescent proteins and their engineered variants have played an important role in the study of biology. The genetically encoded calcium-indicator protein GCaMP2 comprises a circularly permuted fluorescent protein coupled to the calcium-binding protein calmodulin and a calmodulin target peptide, M13, derived from the intracellular calmodulin target myosin light-chain kinase and has been used to image calcium transients in vivo. To aid rational efforts to engineer improved variants of GCaMP2, this protein was crystallized in the calcium-saturated form. X-ray diffraction data were collected to 2.0 Å resolution. The crystals belong to space group C2, with unit-cell parameters a = 126.1, b = 47.1, c = 68.8 Å, β = 100.5° and one GCaMP2 molecule in the asymmetric unit. The structure was phased by molecular replacement and refinement is currently under way. PMID:18607093
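As a worked illustration of the unit-cell parameters reported in this abstract, the following Python sketch computes the volume of a monoclinic cell, V = a·b·c·sin β. The function name is illustrative; only the cell parameters come from the abstract.

```python
import math

def monoclinic_cell_volume(a, b, c, beta_deg):
    """Volume of a monoclinic unit cell (alpha = gamma = 90 degrees)."""
    return a * b * c * math.sin(math.radians(beta_deg))

# Cell parameters quoted for the GCaMP2 crystals (lengths in Å, angle in degrees).
volume = monoclinic_cell_volume(126.1, 47.1, 68.8, 100.5)
print(f"Unit-cell volume: {volume:.0f} cubic Å")
```

Dividing this volume by the number of asymmetric units in the cell and by the molecular mass would give the Matthews coefficient; the molecular mass is not stated in the abstract, so that step is left out.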
MOCASSIN-prot: a multi-objective clustering approach for protein similarity networks.

PubMed

Keel, Brittney N; Deng, Bo; Moriyama, Etsuko N

2018-04-15

Proteins often include multiple conserved domains. Various evolutionary events including duplication and loss of domains, domain shuffling, as well as sequence divergence contribute to generating complexities in protein structures, and consequently, in their functions. The evolutionary history of proteins is hence best modeled through networks that incorporate information both from the sequence divergence and the domain content. Here, a game-theoretic approach proposed for protein network construction is adapted into the framework of multi-objective optimization, and extended to incorporate a clustering refinement procedure. The new method, MOCASSIN-prot, was applied to cluster multi-domain proteins from ten genomes. The performance of MOCASSIN-prot was compared against two protein clustering methods, Markov clustering (TRIBE-MCL) and spectral clustering (SCPS). We showed that compared to these two methods, MOCASSIN-prot, which uses both domain composition and quantitative sequence similarity information, generates fewer false positives. It achieves more functionally coherent protein clusters and better differentiates protein families. MOCASSIN-prot, implemented in Perl and Matlab, is freely available at http://bioinfolab.unl.edu/emlab/MOCASSINprot. Contact: emoriyama2@unl.edu. Supplementary data are available at Bioinformatics online.
Rosetta FlexPepDock ab-initio: simultaneous folding, docking and refinement of peptides onto their receptors.

PubMed

Raveh, Barak; London, Nir; Zimmerman, Lior; Schueler-Furman, Ora

2011-04-29

Flexible peptides that fold upon binding to another protein molecule mediate a large number of regulatory interactions in the living cell and may provide highly specific recognition modules. We present Rosetta FlexPepDock ab-initio, a protocol for simultaneous docking and de-novo folding of peptides, starting from an approximate specification of the peptide binding site. Using the Rosetta fragments library and a coarse-grained structural representation of the peptide and the receptor, FlexPepDock ab-initio samples efficiently and simultaneously the space of possible peptide backbone conformations and rigid-body orientations over the receptor surface of a given binding site. The subsequent all-atom refinement of the coarse-grained models includes full side-chain modeling of both the receptor and the peptide, resulting in high-resolution models in which key side-chain interactions are recapitulated. The protocol was applied to a benchmark in which peptides were modeled over receptors in either their bound backbone conformations or in their free, unbound form. Near-native peptide conformations were identified in 18/26 of the bound cases and 7/14 of the unbound cases. The protocol performs well on peptides from various classes of secondary structures, including coiled peptides with unusual turns and kinks. The results presented here significantly extend the scope of state-of-the-art methods for high-resolution peptide modeling, which can now be applied to a wide variety of peptide-protein interactions where no prior information about the peptide backbone conformation is available, enabling detailed structure-based studies and manipulation of those interactions. © 2011 Raveh et al.
Rosetta FlexPepDock ab-initio: Simultaneous Folding, Docking and Refinement of Peptides onto Their Receptors

PubMed Central

Raveh, Barak; London, Nir; Zimmerman, Lior; Schueler-Furman, Ora

2011-01-01

Flexible peptides that fold upon binding to another protein molecule mediate a large number of regulatory interactions in the living cell and may provide highly specific recognition modules. We present Rosetta FlexPepDock ab-initio, a protocol for simultaneous docking and de-novo folding of peptides, starting from an approximate specification of the peptide binding site. Using the Rosetta fragments library and a coarse-grained structural representation of the peptide and the receptor, FlexPepDock ab-initio samples efficiently and simultaneously the space of possible peptide backbone conformations and rigid-body orientations over the receptor surface of a given binding site. The subsequent all-atom refinement of the coarse-grained models includes full side-chain modeling of both the receptor and the peptide, resulting in high-resolution models in which key side-chain interactions are recapitulated. The protocol was applied to a benchmark in which peptides were modeled over receptors in either their bound backbone conformations or in their free, unbound form. Near-native peptide conformations were identified in 18/26 of the bound cases and 7/14 of the unbound cases. The protocol performs well on peptides from various classes of secondary structures, including coiled peptides with unusual turns and kinks. The results presented here significantly extend the scope of state-of-the-art methods for high-resolution peptide modeling, which can now be applied to a wide variety of peptide-protein interactions where no prior information about the peptide backbone conformation is available, enabling detailed structure-based studies and manipulation of those interactions. PMID:21572516
Advanced Structural Analyses by Third Generation Synchrotron Radiation Powder Diffraction

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sakata, M.; Aoyagi, S.; Ogura, T.

2007-01-19

Since the advent of the 3rd generation Synchrotron Radiation (SR) sources, such as SPring-8, the capabilities of SR powder diffraction have increased greatly, not only in accurate structure refinement but also in ab initio structure determination. In this study, advanced structural analyses by 3rd generation SR powder diffraction based on the Large Debye-Scherrer camera installed at BL02B2, SPring-8 are described. Because of the high angular resolution and high counting statistics of powder data collected at BL02B2, SPring-8, ab initio structure determination can cope with molecular crystals with 65 atoms including H atoms. For the structure refinements, it is found that a kind of Maximum Entropy Method in which several atoms are omitted in phase calculation becomes very important for refining structural details of a fairly large molecule in a crystal. It should be emphasized that until an unknown structure obtained by a Genetic Algorithm (GA), or by some other ab initio structure determination method using real-space structural knowledge, is refined very precisely, it is not possible to tell whether that structure is correct. In order to determine and/or refine the crystal structures of rather complicated molecules, we cannot overemphasize the importance of the 3rd generation SR sources.

MCL-CAw: a refinement of MCL for detecting yeast complexes from weighted PPI networks by incorporating core-attachment structure

PubMed Central

2010-01-01

Background The reconstruction of protein complexes from the physical interactome of organisms serves as a building block towards understanding the higher level organization of the cell. Over the past few years, several independent high-throughput experiments have helped to catalogue enormous amount of physical protein interaction data from organisms such as yeast. However, these individual datasets show lack of correlation with each other and also contain substantial number of false positives (noise). Over these years, several affinity scoring schemes have also been devised to improve the qualities of these datasets. Therefore, the challenge now is to detect meaningful as well as novel complexes from protein interaction (PPI) networks derived by combining datasets from multiple sources and by making use of these affinity scoring schemes. In the attempt towards tackling this challenge, the Markov Clustering algorithm (MCL) has proved to be a popular and reasonably successful method, mainly due to its scalability, robustness, and ability to work on scored (weighted) networks. However, MCL produces many noisy clusters, which either do not match known complexes or have additional proteins that reduce the accuracies of correctly predicted complexes. Results Inspired by recent experimental observations by Gavin and colleagues on the modularity structure in yeast complexes and the distinctive properties of "core" and "attachment" proteins, we develop a core-attachment based refinement method coupled to MCL for reconstruction of yeast complexes from scored (weighted) PPI networks. We combine physical interactions from two recent "pull-down" experiments to generate an unscored PPI network. We then score this network using available affinity scoring schemes to generate multiple scored PPI networks. 
The evaluation of our method (called MCL-CAw) on these networks shows that: (i) MCL-CAw derives larger number of yeast complexes and with better accuracies than MCL, particularly in the presence of natural noise; (ii) Affinity scoring can effectively reduce the impact of noise on MCL-CAw and thereby improve the quality (precision and recall) of its predicted complexes; (iii) MCL-CAw responds well to most available scoring schemes. We discuss several instances where MCL-CAw was successful in deriving meaningful complexes, and where it missed a few proteins or whole complexes due to affinity scoring of the networks. We compare MCL-CAw with several recent complex detection algorithms on unscored and scored networks, and assess the relative performance of the algorithms on these networks. Further, we study the impact of augmenting physical datasets with computationally inferred interactions for complex detection. Finally, we analyse the essentiality of proteins within predicted complexes to understand a possible correlation between protein essentiality and their ability to form complexes. Conclusions We demonstrate that core-attachment based refinement in MCL-CAw improves the predictions of MCL on yeast PPI networks. We show that affinity scoring improves the performance of MCL-CAw. PMID:20939868
COMPUTATIONAL METHODOLOGIES for REAL-SPACE STRUCTURAL REFINEMENT of LARGE MACROMOLECULAR COMPLEXES

PubMed Central

Goh, Boon Chong; Hadden, Jodi A.; Bernardi, Rafael C.; Singharoy, Abhishek; McGreevy, Ryan; Rudack, Till; Cassidy, C. Keith; Schulten, Klaus

2017-01-01

The rise of the computer as a powerful tool for model building and refinement has revolutionized the field of structure determination for large biomolecular systems. Despite the wide availability of robust experimental methods capable of resolving structural details across a range of spatiotemporal resolutions, computational hybrid methods have the unique ability to integrate the diverse data from multimodal techniques such as X-ray crystallography and electron microscopy into consistent, fully atomistic structures. Here, commonly employed strategies for computational real-space structural refinement are reviewed, and their specific applications are illustrated for several large macromolecular complexes: ribosome, virus capsids, chemosensory array, and photosynthetic chromatophore. The increasingly important role of computational methods in large-scale structural refinement, along with current and future challenges, is discussed. PMID:27145875
Prediction of binding poses to FXR using multi-targeted docking combined with molecular dynamics and enhanced sampling

NASA Astrophysics Data System (ADS)

Bhakat, Soumendranath; Åberg, Emil; Söderhjelm, Pär

2018-01-01

Advanced molecular docking methods often aim at capturing the flexibility of the protein upon binding to the ligand. In this study, we investigate whether instead a simple rigid docking method can be applied, if combined with multiple target structures to model the backbone flexibility and molecular dynamics simulations to model the sidechain and ligand flexibility. The methods are tested for the binding of 35 ligands to FXR as part of the first stage of the Drug Design Data Resource (D3R) Grand Challenge 2 blind challenge. The results show that the multiple-target docking protocol performs surprisingly well, with correct poses found for 21 of the ligands. MD simulations started on the docked structures are remarkably stable, but show almost no tendency of refining the structure closer to the experimentally found binding pose. Reconnaissance metadynamics enhances the exploration of new binding poses, but additional collective variables involving the protein are needed to exploit the full potential of the method.
Prediction of binding poses to FXR using multi-targeted docking combined with molecular dynamics and enhanced sampling.

PubMed

Bhakat, Soumendranath; Åberg, Emil; Söderhjelm, Pär

2018-01-01

Advanced molecular docking methods often aim at capturing the flexibility of the protein upon binding to the ligand. In this study, we investigate whether instead a simple rigid docking method can be applied, if combined with multiple target structures to model the backbone flexibility and molecular dynamics simulations to model the sidechain and ligand flexibility. The methods are tested for the binding of 35 ligands to FXR as part of the first stage of the Drug Design Data Resource (D3R) Grand Challenge 2 blind challenge. The results show that the multiple-target docking protocol performs surprisingly well, with correct poses found for 21 of the ligands. MD simulations started on the docked structures are remarkably stable, but show almost no tendency of refining the structure closer to the experimentally found binding pose. Reconnaissance metadynamics enhances the exploration of new binding poses, but additional collective variables involving the protein are needed to exploit the full potential of the method.
Backbone amide 15N chemical shift tensors report on hydrogen bonding interactions in proteins: A magic angle spinning NMR study.

PubMed

Paramasivam, Sivakumar; Gronenborn, Angela M; Polenova, Tatyana

2018-08-01

Chemical shift tensors (CSTs) are an exquisite probe of local geometric and electronic structure. 15N CSTs are very sensitive to hydrogen bonding, yet they have been reported for very few proteins to date. Here we present experimental results and statistical analysis of backbone amide 15N CSTs for 100 residues of four proteins, two E. coli thioredoxin reassemblies (1-73-(U-13C,15N)/74-108-(U-15N) and 1-73-(U-15N)/74-108-(U-13C,15N)), dynein light chain 8 LC8, and CAP-Gly domain of the mammalian dynactin. The 15N CSTs were measured by a symmetry-based CSA recoupling method, ROCSA. Our results show that the principal component δ11 is very sensitive to the presence of hydrogen bonding interactions due to its unique orientation in the molecular frame. The downfield chemical shift change of backbone amide nitrogen nuclei with increasing hydrogen bond strength is manifested in the negative correlation of the principal components with hydrogen bond distance for both α-helical and β-sheet secondary structure elements. Our findings highlight the potential for the use of 15N CSTs in protein structure refinement. Copyright © 2018 Elsevier Inc. All rights reserved.
A Deep Learning Network Approach to ab initio Protein Secondary Structure Prediction

PubMed Central

Spencer, Matt; Eickholt, Jesse; Cheng, Jianlin

2014-01-01

Ab initio protein secondary structure (SS) predictions are utilized to generate tertiary structure predictions, which are increasingly demanded due to the rapid discovery of proteins. Although recent developments have slightly exceeded previous methods of SS prediction, accuracy has stagnated around 80% and many wonder if prediction cannot be advanced beyond this ceiling. Disciplines that have traditionally employed neural networks are experimenting with novel deep learning techniques in attempts to stimulate progress. Since neural networks have historically played an important role in SS prediction, we wanted to determine whether deep learning could contribute to the advancement of this field as well. We developed an SS predictor that makes use of the position-specific scoring matrix generated by PSI-BLAST and deep learning network architectures, which we call DNSS. Graphical processing units and CUDA software optimize the deep network architecture and efficiently train the deep networks. Optimal parameters for the training process were determined, and a workflow comprising three separately trained deep networks was constructed in order to make refined predictions. This deep learning network approach was used to predict SS for a fully independent test data set of 198 proteins, achieving a Q3 accuracy of 80.7% and a Sov accuracy of 74.2%. PMID:25750595
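The Q3 accuracy reported in this abstract is the fraction of residues whose predicted three-state label (helix, strand or coil) matches the reference assignment. A minimal Python sketch follows; it assumes both assignments are already reduced to the single-letter states H, E and C, and the function name is illustrative rather than part of DNSS.

```python
def q3_accuracy(predicted, observed):
    """Fraction of residues with a correctly predicted three-state label."""
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")
    if not observed:
        raise ValueError("empty assignment")
    allowed = set("HEC")
    if not (set(predicted) <= allowed and set(observed) <= allowed):
        raise ValueError("labels must be H, E or C")
    correct = sum(1 for p, o in zip(predicted, observed) if p == o)
    return correct / len(observed)
```

The Sov score quoted alongside Q3 is segment-based rather than per-residue and is not covered by this sketch.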
A Deep Learning Network Approach to ab initio Protein Secondary Structure Prediction.

PubMed

Spencer, Matt; Eickholt, Jesse; Jianlin Cheng

2015-01-01

Ab initio protein secondary structure (SS) predictions are utilized to generate tertiary structure predictions, which are increasingly demanded due to the rapid discovery of proteins. Although recent developments have slightly exceeded previous methods of SS prediction, accuracy has stagnated around 80 percent and many wonder if prediction cannot be advanced beyond this ceiling. Disciplines that have traditionally employed neural networks are experimenting with novel deep learning techniques in attempts to stimulate progress. Since neural networks have historically played an important role in SS prediction, we wanted to determine whether deep learning could contribute to the advancement of this field as well. We developed an SS predictor that makes use of the position-specific scoring matrix generated by PSI-BLAST and deep learning network architectures, which we call DNSS. Graphical processing units and CUDA software optimize the deep network architecture and efficiently train the deep networks. Optimal parameters for the training process were determined, and a workflow comprising three separately trained deep networks was constructed in order to make refined predictions. This deep learning network approach was used to predict SS for a fully independent test dataset of 198 proteins, achieving a Q3 accuracy of 80.7 percent and a Sov accuracy of 74.2 percent.
Raster-scanning serial protein crystallography using micro- and nano-focused synchrotron beams

DOE Office of Scientific and Technical Information (OSTI.GOV)

Coquelle, Nicolas; CNRS, IBS, 38044 Grenoble; CEA, IBS, 38044 Grenoble

A raster scanning serial protein crystallography approach is presented that consumes as little as ∼200–700 nl of sedimented crystals. New serial data pre-analysis software, NanoPeakCell, is introduced. High-resolution structural information was obtained from lysozyme microcrystals (20 µm in the largest dimension) using raster-scanning serial protein crystallography on micro- and nano-focused beamlines at the ESRF. Data were collected at room temperature (RT) from crystals sandwiched between two silicon nitride wafers, thereby preventing their drying, while limiting background scattering and sample consumption. In order to identify crystal hits, new multi-processing and GUI-driven Python-based pre-analysis software was developed, named NanoPeakCell, that was able to read data from a variety of crystallographic image formats. Further data processing was carried out using CrystFEL, and the resultant structures were refined to 1.7 Å resolution. The data demonstrate the feasibility of RT raster-scanning serial micro- and nano-protein crystallography at synchrotrons and validate it as an alternative approach for the collection of high-resolution structural data from micro-sized crystals. Advantages of the proposed approach are its thriftiness, its handling-free nature, the reduced amount of sample required, the adjustable hit rate, the high indexing rate and the minimization of background scattering.
Structural insights into the neuroprotective-acting carbonyl reductase Sniffer of Drosophila melanogaster.

PubMed

Sgraja, Tanja; Ulschmid, Julia; Becker, Katja; Schneuwly, Stephan; Klebe, Gerhard; Reuter, Klaus; Heine, Andreas

2004-10-01

In vivo studies with the fruit-fly Drosophila melanogaster have shown that the Sniffer protein prevents age-dependent and oxidative stress-induced neurodegenerative processes. Sniffer is a NADPH-dependent carbonyl reductase belonging to the enzyme family of short-chain dehydrogenases/reductases (SDRs). The crystal structure of the homodimeric Sniffer protein from Drosophila melanogaster in complex with NADP+ has been determined by multiple-wavelength anomalous dispersion and refined to a resolution of 1.75 Å. The observed fold represents a typical dinucleotide-binding domain as detected for other SDRs. With respect to the cofactor-binding site and the region referred to as substrate-binding loop, the Sniffer protein shows a striking similarity to the porcine carbonyl reductase (PTCR). This loop, in both Sniffer and PTCR, is substantially shortened compared to other SDRs. In most enzymes of the SDR family this loop adopts a well-defined conformation only after substrate binding and remains disordered in the absence of any bound ligands or even if only the dinucleotide cofactor is bound. In the structure of the Sniffer protein, however, the conformation of this loop is well defined, although no substrate is present. Molecular modeling studies provide an idea of how binding of substrate molecules to Sniffer could possibly occur.
Structural pierce into molecular mechanism underlying Clostridium perfringens Epsilon toxin function.

PubMed

Khalili, Saeed; Jahangiri, Abolfazl; Hashemi, Zahra Sadat; Khalesi, Bahman; Mard-Soltani, Maysam; Amani, Jafar

2017-03-01

Epsilon toxin of Clostridium perfringens has garnered a lot of attention due to its potential for toxicity in humans, its extreme potency for cytotoxicity in mice and the lack of any approved therapeutics for humans. However, the intricacies of the Epsilon toxin action mechanism are yet to be understood. In this regard, various in silico tools have been exploited to model and refine the 3D structure of the toxin and its two receptors. The receptor proteins were embedded into designed lipid membranes within an aqueous and ionized environment. Thereafter, the modeled structures were subjected to a series of consecutive molecular dynamics runs to achieve the most natural-like coordination for each model. Ultimately, protein-protein interaction analyses were performed to understand the probable action mechanism. The obtained results successfully confirmed the accuracy of the employed methods in achieving high-quality models for the toxin and its receptors within their lipid bilayers. Molecular dynamics analyses led the structures to a more native-like coordination. Moreover, the results of previous empirical studies were confirmed, while new insights into the action mechanism, including the detailed roles of Hepatitis A virus cellular receptor 1 (HAVCR1) and Myelin and lymphocyte protein (MAL) proteins, were achieved. In light of previous observations and our own, we suggested novel models which elucidated the existing interplay between potential players of the Epsilon toxin action mechanism with detailed structural evidence. These models would pave the way to a more robust understanding of Epsilon toxin biology, more precise vaccine construction and more successful drug (inhibitor) design. Copyright © 2017 Elsevier Ltd. All rights reserved.
Comparison Between Self-Guided Langevin Dynamics and Molecular Dynamics Simulations for Structure Refinement of Protein Loop Conformations

DTIC Science & Technology

2011-01-01

...sampling is based on...atom distance-scaled ideal-gas reference state (DFIRE-AA) statistical potential function.[18] The third approach is the Rosetta all-atom energy func
Structure of catalase determined by MicroED

PubMed Central

Nannenga, Brent L; Shi, Dan; Hattne, Johan; Reyes, Francis E; Gonen, Tamir

2014-01-01

MicroED is a recently developed method that uses electron diffraction for structure determination from very small three-dimensional crystals of biological material. Previously we used a series of still diffraction patterns to determine the structure of lysozyme at 2.9 Å resolution with MicroED (Shi et al., 2013). Here we present the structure of bovine liver catalase determined from a single crystal at 3.2 Å resolution by MicroED. The data were collected by continuous rotation of the sample under constant exposure and were processed and refined using standard programs for X-ray crystallography. The ability of MicroED to determine the structure of bovine liver catalase, a protein that has long resisted atomic analysis by traditional electron crystallography, demonstrates the potential of this method for structure determination. DOI: http://dx.doi.org/10.7554/eLife.03600.001 PMID:25303172
Validating metal binding sites in macromolecule structures using the CheckMyMetal web server

PubMed Central

Zheng, Heping; Chordia, Mahendra D.; Cooper, David R.; Chruszcz, Maksymilian; Müller, Peter; Sheldrick, George M.

2015-01-01

Metals play vital roles in both the mechanism and architecture of biological macromolecules. Yet structures of metal-containing macromolecules where metals are misidentified and/or suboptimally modeled are abundant in the Protein Data Bank (PDB). This shows the need for a diagnostic tool to identify and correct such modeling problems with metal binding environments. The "CheckMyMetal" (CMM) web server (http://csgid.org/csgid/metal_sites/) is a sophisticated, user-friendly web-based method to evaluate metal binding sites in macromolecular structures in respect to 7350 metal binding sites observed in a benchmark dataset of 2304 high resolution crystal structures. The protocol outlines how the CMM server can be used to detect geometric and other irregularities in the structures of metal binding sites and alert researchers to potential errors in metal assignment. The protocol also gives practical guidelines for correcting problematic sites by modifying the metal binding environment and/or redefining metal identity in the PDB file. Several examples where this has led to meaningful results are described in the anticipated results section. CMM was designed for a broad audience—biomedical researchers studying metal-containing proteins and nucleic acids—but is equally well suited for structural biologists to validate new structures during modeling or refinement. The CMM server takes the coordinates of a metal-containing macromolecule structure in the PDB format as input and responds within a few seconds for a typical protein structure modeled with a few hundred amino acids. PMID:24356774
DOE Office of Scientific and Technical Information (OSTI.GOV)

Guo, Feng; Jin, Tengchuan; Howard, Andrew

The crystallization of the brazil nut allergen Ber e 2 is reported. Peanut and tree-nut allergies have attracted considerable attention because of their frequency and their lifelong persistence. Brazil-nut (Bertholletia excelsa) allergies have been well documented and the 11S legumin-like seed storage protein Ber e 2 (excelsin) is one of the two known brazil-nut allergens. In this study, Ber e 2 was extracted from brazil-nut kernels and purified to high purity by crystalline precipitation and gel-filtration chromatography. Well diffracting single crystals were obtained using the hanging-drop vapour-diffusion method. A molecular-replacement structural solution has been obtained. Refinement of the structure is currently under way.
Discovery of an Unexplored Protein Structural Scaffold of Serine Protease from Big Blue Octopus (Octopus cyanea): A New Prospective Lead Molecule.

PubMed

Panda, Subhamay; Kumari, Leena

2017-01-01

Serine proteases are a group of enzymes that hydrolyze the peptide bonds in proteins. In mammals, these enzymes help in the regulation of several major physiological functions such as digestion, blood clotting, immune responses, reproductive functions and the complement system. Serine proteases obtained from the venom of the Octopodidae family are a relatively unexplored area of research. In the present work, we applied a comparative composite molecular modeling technique. Our key aim was to propose the first molecular model structure of the unexplored serine protease 5 derived from big blue octopus. The other objective of this study was to analyze the distribution of negatively and positively charged amino acids over the modeled structure, the distribution of secondary structural elements, the hydrophobicity of the molecular surface and the electrostatic potential with the aid of different bioinformatic tools. In the present study, the molecular model was generated with the help of the I-TASSER suite. Afterwards, the refined structural model was validated with standard methods. For functional annotation of the protein molecule we used the Protein Information Resource (PIR) database. Serine protease 5 of big blue octopus was analyzed with different bioinformatic algorithms for the distribution of negatively and positively charged amino acids over the modeled structure, the distribution of secondary structural elements, the hydrophobicity of the molecular surface and the electrostatic potential. The functionally critical amino acids and ligand-binding site (LBS) of the modeled protein were determined using the COACH program. The molecular model data, together with other pertinent post-model analysis data, provide molecular insight into the proteolytic activity of serine protease 5, which helps in the clear understanding of the procoagulant and anticoagulant characteristics of this natural lead molecule.
Our approach was to investigate the octopus venom protein, as a whole or as a part of its structure, that may result in the development of a new lead molecule. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Resolution of structural heterogeneity in dynamic crystallography

PubMed Central

Ren, Zhong; Chan, Peter W. Y.; Moffat, Keith; Pai, Emil F.; Royer, William E.; Šrajer, Vukica; Yang, Xiaojing

2013-01-01

Dynamic behavior of proteins is critical to their function. X-ray crystallography, a powerful yet mostly static technique, faces inherent challenges in acquiring dynamic information despite decades of effort. Dynamic ‘structural changes’ are often indirectly inferred from ‘structural differences’ by comparing related static structures. In contrast, the direct observation of dynamic structural changes requires the initiation of a biochemical reaction or process in a crystal. Both the direct and the indirect approaches share a common challenge in analysis: how to interpret the structural heterogeneity intrinsic to all dynamic processes. This paper presents a real-space approach to this challenge, in which a suite of analytical methods and tools to identify and refine the mixed structural species present in multiple crystallographic data sets have been developed. These methods have been applied to representative scenarios in dynamic crystallography, and reveal structural information that is otherwise difficult to interpret or inaccessible using conventional methods. PMID:23695239
Resolution of structural heterogeneity in dynamic crystallography.

PubMed

Ren, Zhong; Chan, Peter W Y; Moffat, Keith; Pai, Emil F; Royer, William E; Šrajer, Vukica; Yang, Xiaojing

2013-06-01

Dynamic behavior of proteins is critical to their function. X-ray crystallography, a powerful yet mostly static technique, faces inherent challenges in acquiring dynamic information despite decades of effort. Dynamic ‘structural changes’ are often indirectly inferred from ‘structural differences’ by comparing related static structures. In contrast, the direct observation of dynamic structural changes requires the initiation of a biochemical reaction or process in a crystal. Both the direct and the indirect approaches share a common challenge in analysis: how to interpret the structural heterogeneity intrinsic to all dynamic processes. This paper presents a real-space approach to this challenge, in which a suite of analytical methods and tools to identify and refine the mixed structural species present in multiple crystallographic data sets have been developed. These methods have been applied to representative scenarios in dynamic crystallography, and reveal structural information that is otherwise difficult to interpret or inaccessible using conventional methods.
Insights into the nature of DNA binding of AbrB-like transcription factors

PubMed Central

Sullivan, Daniel M.; Bobay, Benjamin G.; Kojetin, Douglas J.; Thompson, Richele J.; Rance, Mark; Strauch, Mark A.; Cavanagh, John

2008-01-01

Summary Understanding the DNA recognition and binding by the AbrB-like family of transcriptional regulators is of significant interest since these proteins enable bacteria to elicit the appropriate response to diverse environmental stimuli. Although these ‘transition-state regulator’ proteins have been well characterized at the genetic level, the general and specific mechanisms of DNA binding remain elusive. We present RDC-refined NMR solution structures and dynamic properties of the DNA-binding domains of three Bacillus subtilis transition-state regulators AbrB, Abh, and SpoVT. We combined previously investigated DNase I footprinting, DNA methylation, gel shift assays, mutagenic and NMR studies to generate a structural model of the complex between AbrBN55 and its cognate promoter, abrB8. These investigations have enabled us to generate the first model for the specific nature of the transition-state regulator-DNA interaction. PMID:19000822
Sequence-dependent DNA flexibility mediates DNase I cleavage.

PubMed

Heddi, Brahim; Abi-Ghanem, Josephine; Lavigne, Marc; Hartmann, Brigitte

2010-01-08

Understanding the preference of nonspecific proteins for certain DNA structural features requires an accurate description of the properties of free DNA, especially regarding their possible predisposition to adopt a conformation that favors the formation of a complex. Exploiting previous exhaustive NMR studies performed on free DNA oligomers, we investigated the molecular basis of DNase I sensitivity under conditions where DNase I binding limits the probability of cleavage. We showed that cleavage intensity was correlated with adjacent 3' phosphate linkage flexibility, monitored by (31)P chemical shifts. Examining NMR-refined DNA structures highlighted that sequence-dependent flexible phosphates were associated with large minor groove variations that may promote the affinity of DNase I, according to relevant DNA-protein complexes. In sum, this work demonstrates that specificity in DNA-DNase I interaction is mediated by DNA flexibility, which influences the induced-fit transitions required to form productive complexes.
A grid-enabled web service for low-resolution crystal structure refinement.

PubMed

O'Donovan, Daniel J; Stokes-Rees, Ian; Nam, Yunsun; Blacklow, Stephen C; Schröder, Gunnar F; Brunger, Axel T; Sliz, Piotr

2012-03-01

Deformable elastic network (DEN) restraints have proved to be a powerful tool for refining structures from low-resolution X-ray crystallographic data sets. Unfortunately, optimal refinement using DEN restraints requires extensive calculations and is often hindered by a lack of access to sufficient computational resources. The DEN web service presented here intends to provide structural biologists with access to resources for running computationally intensive DEN refinements in parallel on the Open Science Grid, the US cyberinfrastructure. Access to the grid is provided through a simple and intuitive web interface integrated into the SBGrid Science Portal. Using this portal, refinements combined with full parameter optimization that would take many thousands of hours on standard computational resources can now be completed in several hours. An example of the successful application of DEN restraints to the human Notch1 transcriptional complex using the grid resource, and summaries of all submitted refinements, are presented as justification.

Towards solution and refinement of organic crystal structures by fitting to the atomic pair distribution function.

PubMed

Prill, Dragica; Juhás, Pavol; Billinge, Simon J L; Schmidt, Martin U

2016-01-01

A method towards the solution and refinement of organic crystal structures by fitting to the atomic pair distribution function (PDF) is developed. Approximate lattice parameters and molecular geometry must be given as input. The molecule is generally treated as a rigid body. The positions and orientations of the molecules inside the unit cell are optimized starting from random values. The PDF is obtained from carefully measured X-ray powder diffraction data. The method resembles ‘real-space’ methods for structure solution from powder data, but works with PDF data instead of the diffraction pattern itself. As such it may be used in situations where the organic compounds are not long-range-ordered, are poorly crystalline, or nanocrystalline. The procedure was applied to solve and refine the crystal structures of quinacridone (β phase), naphthalene and allopurinol. In the case of allopurinol it was even possible to successfully solve and refine the structure in P1 with four independent molecules. As an example of a flexible molecule, the crystal structure of paracetamol was refined using restraints for bond lengths, bond angles and selected torsion angles. In all cases, the resulting structures are in excellent agreement with structures from single-crystal data.
Iterative model building, structure refinement and density modification with the PHENIX AutoBuild wizard

PubMed Central

Terwilliger, Thomas C.; Grosse-Kunstleve, Ralf W.; Afonine, Pavel V.; Moriarty, Nigel W.; Zwart, Peter H.; Hung, Li-Wei; Read, Randy J.; Adams, Paul D.

2008-01-01

The PHENIX AutoBuild wizard is a highly automated tool for iterative model building, structure refinement and density modification using RESOLVE model building, RESOLVE statistical density modification and phenix.refine structure refinement. Recent advances in the AutoBuild wizard and phenix.refine include automated detection and application of NCS from models as they are built, extensive model-completion algorithms and automated solvent-molecule picking. Model-completion algorithms in the AutoBuild wizard include loop building, crossovers between chains in different models of a structure and side-chain optimization. The AutoBuild wizard has been applied to a set of 48 structures at resolutions ranging from 1.1 to 3.2 Å, resulting in a mean R factor of 0.24 and a mean free R factor of 0.29. The R factor of the final model is dependent on the quality of the starting electron density and is relatively independent of resolution. PMID:18094468
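The R factor and free R factor quoted above follow the standard crystallographic definition, R = Σ| |Fobs| − |Fcalc| | / Σ|Fobs|, with the free R factor evaluated on reflections held out of refinement. The sketch below illustrates that formula only. It assumes the observed and calculated amplitudes are already on a common scale, the function name is hypothetical, and the numbers are made up. It does not call or reproduce any PHENIX code.

```python
from typing import Sequence


def r_factor(f_obs: Sequence[float], f_calc: Sequence[float]) -> float:
    """Crystallographic R factor for paired structure-factor amplitudes.

    Assumes f_obs and f_calc are on the same scale (no scale factor is fitted).
    """
    if len(f_obs) != len(f_calc):
        raise ValueError("f_obs and f_calc must have the same length")
    denominator = sum(abs(o) for o in f_obs)
    if denominator == 0:
        raise ValueError("sum of observed amplitudes is zero")
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    return numerator / denominator


# Made-up amplitudes: a "work" set used in refinement and a held-out "free" set.
work_obs, work_calc = [100.0, 50.0, 25.0, 25.0], [90.0, 60.0, 20.0, 30.0]
free_obs, free_calc = [80.0, 20.0], [60.0, 25.0]

r_work = r_factor(work_obs, work_calc)
r_free = r_factor(free_obs, free_calc)
print(f"R = {r_work:.2f}, free R = {r_free:.2f}")
```

A free R factor noticeably higher than the working R factor, as in the mean values reported in the abstract (0.29 versus 0.24), is the usual pattern because the free set is not used to fit the model.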
Structure of the Trypanosoma cruzi protein tyrosine phosphatase TcPTP1, a potential therapeutic target for Chagas' disease

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lountos, George T.; Tropea, Joseph E.; Waugh, David S.

2013-06-05

Chagas’ disease, a neglected tropical affliction transmitted by the flagellated protozoan Trypanosoma cruzi, is prevalent in Latin America and affects nearly 18 million people worldwide, yet few approved drugs are available to treat the disease. Moreover, the currently available drugs exhibit severe toxicity or are poorly effective in the chronic phase of the disease. This limitation, along with the large population at risk, underscores the urgent need to discover new molecular targets and novel therapeutic agents. Recently, the T. cruzi protein tyrosine phosphatase TcPTP1 has been implicated in the cellular differentiation and infectivity of the parasite and is therefore a promising target for the design of novel anti-parasitic drugs. Here, we report the X-ray crystal structure of TcPTP1 refined to a resolution of 2.18 Å, which provides structural insights into the active site environment that can be used to initiate structure-based drug design efforts to develop specific TcPTP1 inhibitors. Potential strategies to develop such inhibitors are also discussed.
Predicting binding modes of reversible peptide-based inhibitors of falcipain-2 consistent with structure-activity relationships.

PubMed

Hernández González, Jorge Enrique; Hernández Alvarez, Lilian; Pascutti, Pedro Geraldo; Valiente, Pedro A

2017-09-01

Falcipain-2 (FP-2) is a major hemoglobinase of Plasmodium falciparum, considered an important drug target for the development of antimalarials. A previous study reported a novel series of 20 reversible peptide-based inhibitors of FP-2. However, the lack of tridimensional structures of the complexes hinders further optimization strategies to enhance the inhibitory activity of the compounds. Here we report the prediction of the binding modes of the aforementioned inhibitors to FP-2. A computational approach combining previous knowledge on the determinants of binding to the enzyme, docking, and postdocking refinement steps, is employed. The latter steps comprise molecular dynamics simulations and free energy calculations. Remarkably, this approach leads to the identification of near-native ligand conformations when applied to a validation set of protein-ligand structures. Overall, we proposed substrate-like binding modes of the studied compounds fulfilling the structural requirements for FP-2 binding and yielding free energy values that correlated well with the experimental data. Proteins 2017; 85:1666-1683. © 2017 Wiley Periodicals, Inc.
IMAGINE: first neutron protein structure and new capabilities for neutron macromolecular crystallography

DOE Office of Scientific and Technical Information (OSTI.GOV)

Munshi, Parthapratim; Myles, Dean A A; Robertson, Lee

2013-01-01

We report the first high resolution neutron protein structure of perdeuterated rubredoxin from Pyrococcus furiosus (PfRd) determined using the new IMAGINE macromolecular neutron crystallography instrument at the Oak Ridge National Laboratory. Neutron diffraction data extending to 1.65 Å resolution were collected from a relatively small 0.7 mm³ PfRd crystal using 2.5 days (60 h) of beam time. The refined structure contains 371 out of 391, or 95%, of the deuterium atoms of the protein, and 58 solvent molecules. The IMAGINE instrument is designed to provide neutron data at or near atomic resolutions (1.5 Å) from crystals with volume < 1.0 mm³ and with unit cell edges < 100 Å. Beam line features include elliptical focusing mirrors that deliver 3 × 10⁷ n s⁻¹ cm⁻² into a 3.5 × 2.0 mm² focal spot at the sample position, and variable short and long wavelength cutoff optics that provide automated exchange between multiple wavelength configurations (λmin = 2.0, 2.8, 3.3 Å; λmax = 3.0, 4.0, 4.5, ~20 Å). Notably, the crystal used to collect this PfRd data is 5-10 times smaller than has been previously reported.
DiffPy-CMI-Python libraries for Complex Modeling Initiative

DOE Office of Scientific and Technical Information (OSTI.GOV)

Billinge, Simon; Juhas, Pavol; Farrow, Christopher

2014-02-01

Software to manipulate and describe crystal and molecular structures and set up structural refinements from multiple experimental inputs. Calculation and simulation of structure derived physical quantities. Library for creating customized refinements of atomic structures from available experimental and theoretical inputs.
Sensitivity and Limitations of Structures from X-ray and Neutron-Based Diffraction Analyses of Transition Metal Oxide Lithium-Battery Electrodes

DOE PAGES

Liu, Hao; Liu, Haodong; Lapidus, Saul H.; ...

2017-06-21

Lithium transition metal oxides are an important class of electrode materials for lithium-ion batteries. Binary or ternary (transition) metal doping brings about new opportunities to improve the electrode’s performance and often leads to more complex stoichiometries and atomic structures than the archetypal LiCoO2. Rietveld structural analysis of X-ray and neutron diffraction data is a widely used approach for structural characterization of crystalline materials. However, different structural models and refinement approaches can lead to differing results, and some parameters can be difficult to quantify due to the inherent limitations of the data. Here, through the example of LiNi0.8Co0.15Al0.05O2 (NCA), we demonstrated the sensitivity of various structural parameters in Rietveld structural analysis to different refinement approaches and structural models, and proposed an approach to reduce refinement uncertainties due to the inexact X-ray scattering factors of the constituent atoms within the lattice. Furthermore, this refinement approach was implemented for electrochemically-cycled NCA samples and yielded accurate structural parameters using only X-ray diffraction data. The present work provides the best practices for performing structural refinement of lithium transition metal oxides.
Sensitivity and Limitations of Structures from X-ray and Neutron-Based Diffraction Analyses of Transition Metal Oxide Lithium-Battery Electrodes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Liu, Hao; Liu, Haodong; Lapidus, Saul H.

Lithium transition metal oxides are an important class of electrode materials for lithium-ion batteries. Binary or ternary (transition) metal doping brings about new opportunities to improve the electrode’s performance and often leads to more complex stoichiometries and atomic structures than the archetypal LiCoO2. Rietveld structural analysis of X-ray and neutron diffraction data is a widely used approach for structural characterization of crystalline materials. However, different structural models and refinement approaches can lead to differing results, and some parameters can be difficult to quantify due to the inherent limitations of the data. Here, through the example of LiNi0.8Co0.15Al0.05O2 (NCA), we demonstrated the sensitivity of various structural parameters in Rietveld structural analysis to different refinement approaches and structural models, and proposed an approach to reduce refinement uncertainties due to the inexact X-ray scattering factors of the constituent atoms within the lattice. Furthermore, this refinement approach was implemented for electrochemically-cycled NCA samples and yielded accurate structural parameters using only X-ray diffraction data. The present work provides the best practices for performing structural refinement of lithium transition metal oxides.
Computational Prediction of Atomic Structures of Helical Membrane Proteins Aided by EM Maps

PubMed Central

Kovacs, Julio A.; Yeager, Mark; Abagyan, Ruben

2007-01-01

Integral membrane proteins pose a major challenge for protein-structure prediction because only ≈100 high-resolution structures are available currently, thereby impeding the development of rules or empirical potentials to predict the packing of transmembrane α-helices. However, when an intermediate-resolution electron microscopy (EM) map is available, it can be used to provide restraints which, in combination with a suitable computational protocol, make structure prediction feasible. In this work we present such a protocol, which proceeds in three stages: 1), generation of an ensemble of α-helices by flexible fitting into each of the density rods in the low-resolution EM map, spanning a range of rotational angles around the main helical axes and translational shifts along the density rods; 2), fast optimization of side chains and scoring of the resulting conformations; and 3), refinement of the lowest-scoring conformations with internal coordinate mechanics, by optimizing the van der Waals, electrostatics, hydrogen bonding, torsional, and solvation energy contributions. In addition, our method implements a penalty term through a so-called tethering map, derived from the EM map, which restrains the positions of the α-helices. The protocol was validated on three test cases: GpA, KcsA, and MscL. PMID:17496035
Exploiting structure similarity in refinement: automated NCS and target-structure restraints in BUSTER

DOE Office of Scientific and Technical Information (OSTI.GOV)

Smart, Oliver S., E-mail: osmart@globalphasing.com; Womack, Thomas O.; Flensburg, Claus

2012-04-01

Local structural similarity restraints (LSSR) provide a novel method for exploiting NCS or structural similarity to an external target structure. Two examples are given where BUSTER re-refinement of PDB entries with LSSR produces marked improvements, enabling further structural features to be modelled. Maximum-likelihood X-ray macromolecular structure refinement in BUSTER has been extended with restraints facilitating the exploitation of structural similarity. The similarity can be between two or more chains within the structure being refined, thus favouring NCS, or to a distinct ‘target’ structure that remains fixed during refinement. The local structural similarity restraints (LSSR) approach considers all distances less than 5.5 Å between pairs of atoms in the chain to be restrained. For each, the difference from the distance between the corresponding atoms in the related chain is found. LSSR applies a restraint penalty on each difference. A functional form that reaches a plateau for large differences is used to avoid the restraints distorting parts of the structure that are not similar. Because LSSR are local, there is no need to separate out domains. Some restraint pruning is still necessary, but this has been automated. LSSR have been available to academic users of BUSTER since 2009 with the easy-to-use -autoncs and -target target.pdb options. The use of LSSR is illustrated in the re-refinement of PDB entries http://scripts.iucr.org/cgi-bin/cr.cgi?rm, where -target enables the correct ligand-binding structure to be found, and http://scripts.iucr.org/cgi-bin/cr.cgi?rm, where -autoncs contributes to the location of an additional copy of the cyclic peptide ligand.
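The LSSR idea described above (restrain every interatomic distance below 5.5 Å to the corresponding distance in a related chain, using a penalty that levels off for large differences) can be sketched as follows. This is an illustration under stated assumptions, not BUSTER's implementation: the abstract does not give the functional form, so a Geman-McClure-style function is swapped in here as one example of a plateauing penalty, and the function names, weight and scale parameters are hypothetical.

```python
import math
from itertools import combinations

CUTOFF = 5.5  # Å; distance cutoff taken from the abstract


def plateau_penalty(diff: float, weight: float = 1.0, scale: float = 1.0) -> float:
    """Assumed penalty: quadratic for small |diff|, approaching weight*scale**2 for large |diff|."""
    x = (diff / scale) ** 2
    return weight * scale ** 2 * x / (1.0 + x)


def lssr_like_penalty(chain, reference, cutoff=CUTOFF, weight=1.0, scale=1.0) -> float:
    """Sum the penalty over atom pairs of `chain` closer than `cutoff`.

    `chain` and `reference` are equal-length lists of (x, y, z) tuples with
    corresponding atoms at the same index (an assumption of this sketch).
    """
    if len(chain) != len(reference):
        raise ValueError("chains must contain the same number of atoms")
    total = 0.0
    for i, j in combinations(range(len(chain)), 2):
        d = math.dist(chain[i], chain[j])
        if d < cutoff:
            # Compare with the same atom pair in the related (NCS or target) chain.
            d_ref = math.dist(reference[i], reference[j])
            total += plateau_penalty(d - d_ref, weight, scale)
    return total


# Made-up coordinates: the third atom of the reference is displaced by 1 Å.
chain = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0)]
reference = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 1.0, 0.0)]
total = lssr_like_penalty(chain, reference)
print(f"penalty = {total:.4f}")
```

Because the penalty saturates, a pair whose distances differ greatly between the two chains contributes at most a fixed amount, which is the property the abstract cites for avoiding distortion of regions that are genuinely dissimilar.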
Role of the visual experience-dependent nascent proteome in neuronal plasticity

PubMed Central

Liu, Han-Hsuan; McClatchy, Daniel B; Schiapparelli, Lucio; Shen, Wanhua; Yates, John R

2018-01-01

Experience-dependent synaptic plasticity refines brain circuits during development. To identify novel protein synthesis-dependent mechanisms contributing to experience-dependent plasticity, we conducted a quantitative proteomic screen of the nascent proteome in response to visual experience in Xenopus optic tectum using bio-orthogonal metabolic labeling (BONCAT). We identified 83 differentially synthesized candidate plasticity proteins (CPPs). The CPPs form strongly interconnected networks and are annotated to a variety of biological functions, including RNA splicing, protein translation, and chromatin remodeling. Functional analysis of select CPPs revealed the requirement for eukaryotic initiation factor three subunit A (eIF3A), fused in sarcoma (FUS), and ribosomal protein s17 (RPS17) in experience-dependent structural plasticity in tectal neurons and behavioral plasticity in tadpoles. These results demonstrate that the nascent proteome is dynamic in response to visual experience and that de novo synthesis of machinery that regulates RNA splicing and protein translation is required for experience-dependent plasticity. PMID:29412139
Stabilization of the dimeric birch pollen allergen Bet v 1 impacts its immunological properties.

PubMed

Kofler, Stefan; Ackaert, Chloé; Samonig, Martin; Asam, Claudia; Briza, Peter; Horejs-Hoeck, Jutta; Cabrele, Chiara; Ferreira, Fatima; Duschl, Albert; Huber, Christian; Brandstetter, Hans

2014-01-03

Many allergens share several biophysical characteristics, including the capability to undergo oligomerization. The dimerization mechanism in Bet v 1 and its allergenic properties are so far poorly understood. Here, we report crystal structures of dimeric Bet v 1, revealing a noncanonical incorporation of cysteine at position 5 instead of genetically encoded tyrosine. Cysteine polysulfide bridging stabilized different dimeric assemblies, depending on the polysulfide linker length. These dimers represent quaternary arrangements that are frequently observed in related proteins, reflecting their prevalence in unmodified Bet v 1. These conclusions were corroborated by characteristic immunologic properties of monomeric and dimeric allergen variants. Hereby, residue 5 could be identified as an allergenic hot spot in Bet v 1. The presented results refine fundamental principles in protein chemistry and emphasize the importance of protein modifications in understanding the molecular basis of allergenicity.
Re-refinement from deposited X-ray data can deliver improved models for most PDB entries

DOE Office of Scientific and Technical Information (OSTI.GOV)

Joosten, Robbie P.; Womack, Thomas; Vriend, Gert, E-mail: vriend@cmbi.ru.nl

2009-02-01

An evaluation of validation and real-space intervention possibilities for improving existing automated (re-)refinement methods. The deposition of X-ray data along with the customary structural models defining PDB entries makes it possible to apply large-scale re-refinement protocols to these entries, thus giving users the benefit of improvements in X-ray methods that have occurred since the structure was deposited. Automated gradient refinement is an effective method to achieve this goal, but real-space intervention is most often required in order to adequately address problems detected by structure-validation software. In order to improve the existing protocol, automated re-refinement was combined with structure validation and difference-density peak analysis to produce a catalogue of problems in PDB entries that are amenable to automatic correction. It is shown that re-refinement can be effective in producing improvements, which are often associated with the systematic use of the TLS parameterization of B factors, even for relatively new and high-resolution PDB entries, while the accompanying manual or semi-manual map analysis and fitting steps show good prospects for eventual automation. It is proposed that the potential for simultaneous improvements in methods and in re-refinement results be further encouraged by broadening the scope of depositions to include refinement metadata and ultimately primary rather than reduced X-ray data.
The new program OPAL for molecular dynamics simulations and energy refinements of biological macromolecules.

PubMed

Luginbühl, P; Güntert, P; Billeter, M; Wüthrich, K

1996-09-01

A new program for molecular dynamics (MD) simulation and energy refinement of biological macromolecules, OPAL, is introduced. Combined with the supporting program TRAJEC for the analysis of MD trajectories, OPAL affords high efficiency and flexibility for work with different force fields, and offers a user-friendly interface and extensive trajectory analysis capabilities. Salient features are computational speeds of up to 1.5 GFlops on vector supercomputers such as the NEC SX-3, ellipsoidal boundaries to reduce the system size for studies in explicit solvents, and natural treatment of the hydrostatic pressure. Practical applications of OPAL are illustrated with MD simulations of pure water, energy minimization of the NMR structure of the mixed disulfide of a mutant E. coli glutaredoxin with glutathione in different solvent models, and MD simulations of a small protein, pheromone Er-2, using either instantaneous or time-averaged NMR restraints, or no restraints.
GeneBuilder: interactive in silico prediction of gene structure.

PubMed

Milanesi, L; D'Angelo, D; Rogozin, I B

1999-01-01

Prediction of gene structure in newly sequenced DNA becomes very important in large genome sequencing projects. This problem is complicated due to the exon-intron structure of eukaryotic genes and because gene expression is regulated by many different short nucleotide domains. In order to be able to analyse the full gene structure in different organisms, it is necessary to combine information about potential functional signals (promoter region, splice sites, start and stop codons, 3' untranslated region) together with the statistical properties of coding sequences (coding potential), information about homologous proteins, ESTs and repeated elements. We have developed the GeneBuilder system which is based on prediction of functional signals and coding regions by different approaches in combination with similarity searches in proteins and EST databases. The potential gene structure models are obtained by using a dynamic programming method. The program permits the use of several parameters for gene structure prediction and refinement. During gene model construction, selecting different exon homology levels with a protein sequence selected from a list of homologous proteins can improve the accuracy of the gene structure prediction. In the case of low homology, GeneBuilder is still able to predict the gene structure. The GeneBuilder system has been tested by using the standard set (Burset and Guigo, Genomics, 34, 353-367, 1996) and the performances are: 0.89 sensitivity and 0.91 specificity at the nucleotide level. The total correlation coefficient is 0.88. The GeneBuilder system is implemented as a part of the WebGene at the URL: http://www.itba.mi.cnr.it/webgene and TRADAT (TRAnscription Database and Analysis Tools) launcher URL: http://www.itba.mi.cnr.it/tradat.
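The nucleotide-level figures quoted above (sensitivity, specificity, correlation coefficient) can be illustrated with a minimal sketch. It assumes the definitions commonly used in gene-prediction benchmarks, where "specificity" is TP/(TP+FP); the abstract does not spell out the formulas, and the function name and toy masks below are hypothetical.

```python
import math

def nucleotide_accuracy(actual, predicted):
    """Compare two per-nucleotide coding masks (1 = coding, 0 = non-coding).

    Returns (sensitivity, specificity, correlation coefficient), using the
    gene-prediction convention Sp = TP / (TP + FP). This convention is an
    assumption; it is not stated in the abstract.
    """
    tp = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 1)
    tn = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 0)
    fp = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 1)
    fn = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 0)
    sn = tp / (tp + fn)
    sp = tp / (tp + fp)
    cc = (tp * tn - fn * fp) / math.sqrt(
        (tp + fn) * (tn + fp) * (tp + fp) * (tn + fn)
    )
    return sn, sp, cc

# Toy masks for illustration only (not data from the paper).
actual_mask = [1, 1, 1, 0, 0, 0, 1, 0]
predicted_mask = [1, 1, 0, 0, 0, 1, 1, 0]
sn, sp, cc = nucleotide_accuracy(actual_mask, predicted_mask)
```

The sketch does not guard against empty classes (for example, a prediction with no coding nucleotides), which would need handling on real sequences.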
Exploiting structure similarity in refinement: automated NCS and target-structure restraints in BUSTER.

PubMed

Smart, Oliver S; Womack, Thomas O; Flensburg, Claus; Keller, Peter; Paciorek, Włodek; Sharff, Andrew; Vonrhein, Clemens; Bricogne, Gérard

2012-04-01

Maximum-likelihood X-ray macromolecular structure refinement in BUSTER has been extended with restraints facilitating the exploitation of structural similarity. The similarity can be between two or more chains within the structure being refined, thus favouring NCS, or to a distinct 'target' structure that remains fixed during refinement. The local structural similarity restraints (LSSR) approach considers all distances less than 5.5 Å between pairs of atoms in the chain to be restrained. For each, the difference from the distance between the corresponding atoms in the related chain is found. LSSR applies a restraint penalty on each difference. A functional form that reaches a plateau for large differences is used to avoid the restraints distorting parts of the structure that are not similar. Because LSSR are local, there is no need to separate out domains. Some restraint pruning is still necessary, but this has been automated. LSSR have been available to academic users of BUSTER since 2009 with the easy-to-use -autoncs and -target target.pdb options. The use of LSSR is illustrated in the re-refinement of PDB entries 5rnt, where -target enables the correct ligand-binding structure to be found, and 1osg, where -autoncs contributes to the location of an additional copy of the cyclic peptide ligand.
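The restraint scheme described above (intra-chain distances under 5.5 Å, a penalty on each distance difference, and a functional form that plateaus) can be sketched as follows. The Geman-McClure form is used here as one example of a plateau function; the scale parameter, the function names and the assumption of identically ordered atom lists are illustrative and are not taken from the BUSTER implementation.

```python
import math

def plateau_penalty(delta, scale=1.0):
    """Plateau-type penalty (Geman-McClure form).

    Behaves like delta**2 for small |delta| and tends to scale**2 for
    large |delta|, so grossly dissimilar regions are not pulled together.
    """
    return (delta * delta) / (1.0 + (delta / scale) ** 2)

def lssr_like_penalty(chain, reference, cutoff=5.5, scale=1.0):
    """Sum penalties over atom pairs closer than `cutoff` (in Å) in `chain`.

    `chain` and `reference` are lists of (x, y, z) tuples whose atoms are
    assumed to correspond by index (a simplifying assumption).
    """
    total = 0.0
    for i in range(len(chain)):
        for j in range(i + 1, len(chain)):
            d = math.dist(chain[i], chain[j])
            if d < cutoff:
                d_ref = math.dist(reference[i], reference[j])
                total += plateau_penalty(d - d_ref, scale)
    return total

# Toy coordinates for illustration only.
chain_a = [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0)]
chain_b = [(0.0, 0.0, 0.0), (4.0, 0.0, 0.0)]
penalty = lssr_like_penalty(chain_a, chain_b)
```

Because only short distances are restrained, the penalty is insensitive to rigid-body differences between domains, which is consistent with the abstract's remark that domains need not be separated out.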
Improving Protocols for Protein Mapping through Proper Comparison to Crystallography Data

PubMed Central

Lexa, Katrina W.; Carlson, Heather A.

2013-01-01

Computational approaches to fragment-based drug design (FBDD) can complement experiments and facilitate the identification of potential hot spots along the protein surface. However, the evaluation of computational methods for mapping binding sites frequently focuses upon the ability to reproduce crystallographic coordinates to within a low RMSD threshold. This dependency on the deposited coordinate data overlooks the original electron density from the experiment, thus techniques may be developed based upon subjective - or even erroneous - atomic coordinates. This can become a significant drawback in applications to systems where the location of hot spots is unknown. Based on comparison to crystallographic density, we previously showed that mixed-solvent molecular dynamics (MixMD) accurately identifies the active site for HEWL, with acetonitrile as an organic solvent. Here, we concentrated on the influence of protic solvent on simulation and refined the optimal MixMD approach for extrapolation of the method to systems without established sites. Our results establish an accurate approach for comparing simulations to experiment. We have outlined the most efficient strategy for MixMD, based on simulation length and number of runs. The development outlined here makes MixMD a robust method which should prove useful across a broad range of target structures. Lastly, our results with MixMD match experimental data so well that consistency between simulations and density may be a useful way to aid the identification of probes vs waters during the refinement of future MSCS crystallographic structures. PMID:23327200
Macromolecular powder diffraction : structure solution via molecular.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Doebbler, J.; Von Dreele, R.; X-Ray Science Division

Macromolecular powder diffraction is a burgeoning technique for protein structure solution - ideally suited for cases where no suitable single crystals are available. Over the past seven years, pioneering work by Von Dreele et al. [1,2] and Margiolaki et al. [3,4] has demonstrated the viability of this approach for several protein structures. Among these initial powder studies, molecular replacement solutions of insulin and turkey lysozyme into alternate space groups were accomplished. Pressing the technique further, Margiolaki et al. [5] executed the first molecular replacement of an unknown protein structure: the SH3 domain of ponsin, using data from a multianalyzer diffractometer. To demonstrate that cross-species molecular replacement using image plate data is also possible, we present the solution of hen egg white lysozyme using the 60% identical human lysozyme (PDB code: 1LZ1) as the search model. Due to the high incidence of overlaps in powder patterns, especially in more complex structures, we have used extracted intensities from five data sets taken at different salt concentrations in a multi-pattern Pawley refinement. The use of image plates severely increases the overlap problem due to lower detector resolution, but radiation damage effects are minimized with shorter exposure times and the fact that the entire pattern is obtained in a single exposure. This image plate solution establishes the robustness of powder molecular replacement resulting from different data collection techniques.
CNA web server: rigidity theory-based thermal unfolding simulations of proteins for linking structure, (thermo-)stability, and function

PubMed Central

Krüger, Dennis M.; Rathi, Prakash Chandra; Pfleger, Christopher; Gohlke, Holger

2013-01-01

The Constraint Network Analysis (CNA) web server provides a user-friendly interface to the CNA approach developed in our laboratory for linking results from rigidity analyses to biologically relevant characteristics of a biomolecular structure. The CNA web server provides a refined modeling of thermal unfolding simulations that considers the temperature dependence of hydrophobic tethers and computes a set of global and local indices for quantifying biomacromolecular stability. From the global indices, phase transition points are identified where the structure switches from a rigid to a floppy state; these phase transition points can be related to a protein’s (thermo-)stability. Structural weak spots (unfolding nuclei) are automatically identified, too; this knowledge can be exploited in data-driven protein engineering. The local indices are useful in linking flexibility and function and to understand the impact of ligand binding on protein flexibility. The CNA web server robustly handles small-molecule ligands in general. To overcome issues of sensitivity with respect to the input structure, the CNA web server allows performing two ensemble-based variants of thermal unfolding simulations. The web server output is provided as raw data, plots and/or Jmol representations. The CNA web server, accessible at http://cpclab.uni-duesseldorf.de/cna or http://www.cnanalysis.de, is free and open to all users with no login requirement. PMID:23609541
How to tackle protein structural data from solution and solid state: An integrated approach.

PubMed

Carlon, Azzurra; Ravera, Enrico; Andrałojć, Witold; Parigi, Giacomo; Murshudov, Garib N; Luchinat, Claudio

2016-02-01

Long-range NMR restraints, such as diamagnetic residual dipolar couplings and paramagnetic data, can be used to determine 3D structures of macromolecules. They are also used to monitor, and potentially to improve, the accuracy of a macromolecular structure in solution by validating or "correcting" a crystal model. Since crystal structures suffer from crystal packing forces they may not be accurate models for the macromolecular structures in solution. However, the presence of real differences should be tested for by simultaneous refinement of the structure using both crystal and solution NMR data. To achieve this, the program REFMAC5 from CCP4 was modified to allow the simultaneous use of X-ray crystallographic and paramagnetic NMR data and/or diamagnetic residual dipolar couplings. Inconsistencies between crystal structures and solution NMR data, if any, may be due either to structural rearrangements occurring on passing from the solution to solid state, or to a greater degree of conformational heterogeneity in solution with respect to the crystal. In the case of multidomain proteins, paramagnetic restraints can provide the correct mutual orientations and positions of domains in solution, as well as information on the conformational variability experienced by the macromolecule. Copyright © 2016 Elsevier B.V. All rights reserved.

STRUM: structure-based prediction of protein stability changes upon single-point mutation.

PubMed

Quan, Lijun; Lv, Qiang; Zhang, Yang

2016-10-01

Mutations in human genome are mainly through single nucleotide polymorphism, some of which can affect stability and function of proteins, causing human diseases. Several methods have been proposed to predict the effect of mutations on protein stability; but most require features from experimental structure. Given the fast progress in protein structure prediction, this work explores the possibility to improve the mutation-induced stability change prediction using low-resolution structure modeling. We developed a new method (STRUM) for predicting stability change caused by single-point mutations. Starting from wild-type sequences, 3D models are constructed by the iterative threading assembly refinement (I-TASSER) simulations, where physics- and knowledge-based energy functions are derived on the I-TASSER models and used to train STRUM models through gradient boosting regression. STRUM was assessed by 5-fold cross validation on 3421 experimentally determined mutations from 150 proteins. The Pearson correlation coefficient (PCC) between predicted and measured changes of Gibbs free-energy gap, ΔΔG, upon mutation reaches 0.79 with a root-mean-square error 1.2 kcal/mol in the mutation-based cross-validations. The PCC reduces if separating training and test mutations from non-homologous proteins, which reflects inherent correlations in the current mutation sample. Nevertheless, the results significantly outperform other state-of-the-art methods, including those built on experimental protein structures. Detailed analyses show that the most sensitive features in STRUM are the physics-based energy terms on I-TASSER models and the conservation scores from multiple-threading template alignments. However, the ΔΔG prediction accuracy has only a marginal dependence on the accuracy of protein structure models as long as the global fold is correct. 
These data demonstrate the feasibility to use low-resolution structure modeling for high-accuracy stability change prediction upon point mutations. Availability and implementation: http://zhanglab.ccmb.med.umich.edu/STRUM/ Contact: qiang@suda.edu.cn and zhng@umich.edu Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
STRUM: structure-based prediction of protein stability changes upon single-point mutation

PubMed Central

Quan, Lijun; Lv, Qiang; Zhang, Yang

2016-01-01

Motivation: Mutations in human genome are mainly through single nucleotide polymorphism, some of which can affect stability and function of proteins, causing human diseases. Several methods have been proposed to predict the effect of mutations on protein stability; but most require features from experimental structure. Given the fast progress in protein structure prediction, this work explores the possibility to improve the mutation-induced stability change prediction using low-resolution structure modeling. Results: We developed a new method (STRUM) for predicting stability change caused by single-point mutations. Starting from wild-type sequences, 3D models are constructed by the iterative threading assembly refinement (I-TASSER) simulations, where physics- and knowledge-based energy functions are derived on the I-TASSER models and used to train STRUM models through gradient boosting regression. STRUM was assessed by 5-fold cross validation on 3421 experimentally determined mutations from 150 proteins. The Pearson correlation coefficient (PCC) between predicted and measured changes of Gibbs free-energy gap, ΔΔG, upon mutation reaches 0.79 with a root-mean-square error 1.2 kcal/mol in the mutation-based cross-validations. The PCC reduces if separating training and test mutations from non-homologous proteins, which reflects inherent correlations in the current mutation sample. Nevertheless, the results significantly outperform other state-of-the-art methods, including those built on experimental protein structures. Detailed analyses show that the most sensitive features in STRUM are the physics-based energy terms on I-TASSER models and the conservation scores from multiple-threading template alignments. However, the ΔΔG prediction accuracy has only a marginal dependence on the accuracy of protein structure models as long as the global fold is correct. 
These data demonstrate the feasibility to use low-resolution structure modeling for high-accuracy stability change prediction upon point mutations. Availability and Implementation: http://zhanglab.ccmb.med.umich.edu/STRUM/ Contact: qiang@suda.edu.cn and zhng@umich.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27318206
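The two evaluation measures reported above, the Pearson correlation coefficient (PCC) and the root-mean-square error between predicted and measured ΔΔG, can be computed as in the following minimal sketch. The function names and the toy values are hypothetical and are not STRUM data.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

def rmse(predicted, measured):
    """Root-mean-square error, in the units of the inputs (e.g. kcal/mol)."""
    n = len(predicted)
    return math.sqrt(sum((p - m) ** 2 for p, m in zip(predicted, measured)) / n)

# Toy ddG values (kcal/mol) for illustration only.
measured_ddg = [-1.0, 0.5, -2.5, 0.0]
predicted_ddg = [-0.8, 0.1, -2.0, -0.3]
pcc = pearson(predicted_ddg, measured_ddg)
err = rmse(predicted_ddg, measured_ddg)
```

In a cross-validation setting such as the 5-fold scheme mentioned in the abstract, these measures would be computed on held-out mutations only; how the folds are split (by mutation or by protein) changes the result, as the abstract notes.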
Three-dimensional structure of the human immunodeficiency virus type 1 matrix protein.

PubMed

Massiah, M A; Starich, M R; Paschall, C; Summers, M F; Christensen, A M; Sundquist, W I

1994-11-25

The HIV-1 matrix protein forms an icosahedral shell associated with the inner membrane of the mature virus. Genetic analyses have indicated that the protein performs important functions throughout the viral life-cycle, including anchoring the transmembrane envelope protein on the surface of the virus, assisting in viral penetration, transporting the proviral integration complex across the nuclear envelope, and localizing the assembling virion to the cell membrane. We now report the three-dimensional structure of recombinant HIV-1 matrix protein, determined at high resolution by nuclear magnetic resonance (NMR) methods. The HIV-1 matrix protein is the first retroviral matrix protein to be characterized structurally and only the fourth HIV-1 protein of known structure. NMR signal assignments required recently developed triple-resonance (1H, 13C, 15N) NMR methodologies because signals for 91% of 132 assigned H alpha protons and 74% of the 129 assignable backbone amide protons resonate within chemical shift ranges of 0.8 p.p.m. and 1 p.p.m., respectively. A total of 636 nuclear Overhauser effect-derived distance restraints were employed for distance geometry-based structure calculations, affording an average of 13.0 NMR-derived distance restraints per residue for the experimentally constrained amino acids. An ensemble of 25 refined distance geometry structures with penalties (sum of the squares of the distance violations) of 0.32 Å² or less and individual distance violations under 0.06 Å was generated; best-fit superposition of ordered backbone heavy atoms relative to mean atom positions afforded root-mean-square deviations of 0.50 (± 0.08) Å. The folded HIV-1 matrix protein structure is composed of five alpha-helices, a short 3(10) helical stretch, and a three-strand mixed beta-sheet. Helices I to III and the 3(10) helix pack about a central helix (IV) to form a compact globular domain that is capped by the beta-sheet. 
The C-terminal helix (helix V) projects away from the beta-sheet to expose carboxyl-terminal residues essential for early steps in the HIV-1 infectious cycle. Basic residues implicated in membrane binding and nuclear localization functions cluster about an extruded cationic loop that connects beta-strands 1 and 2. The structure suggests that both membrane binding and nuclear localization may be mediated by complex tertiary structures rather than simple linear determinants.
Re-evaluation of low-resolution crystal structures via interactive molecular-dynamics flexible fitting (iMDFF): a case study in complement C4.

PubMed

Croll, Tristan Ian; Andersen, Gregers Rom

2016-09-01

While the rapid proliferation of high-resolution structures in the Protein Data Bank provides a rich set of templates for starting models, it remains the case that a great many structures both past and present are built at least in part by hand-threading through low-resolution and/or weak electron density. With current model-building tools this task can be challenging, and the de facto standard for acceptable error rates (in the form of atomic clashes and unfavourable backbone and side-chain conformations) in structures based on data with dmax not exceeding 3.5 Å reflects this. When combined with other factors such as model bias, these residual errors can conspire to make more serious errors in the protein fold difficult or impossible to detect. The three recently published 3.6-4.2 Å resolution structures of complement C4 (PDB entries 4fxg, 4fxk and 4xam) rank in the top quartile of structures of comparable resolution both in terms of Rfree and MolProbity score, yet, as shown here, contain register errors in six β-strands. By applying a molecular-dynamics force field that explicitly models interatomic forces and hence excludes most physically impossible conformations, the recently developed interactive molecular-dynamics flexible fitting (iMDFF) approach significantly reduces the complexity of the conformational space to be searched during manual rebuilding. This substantially improves the rate of detection and correction of register errors, and allows user-guided model building in maps with a resolution lower than 3.5 Å to converge to solutions with a stereochemical quality comparable to atomic resolution structures. Here, iMDFF has been used to individually correct and re-refine these three structures to MolProbity scores of <1.7, and strategies for working with such challenging data sets are suggested. Notably, the improved model allowed the resolution for complement C4b to be extended from 4.2 to 3.5 Å as demonstrated by paired refinement.
Structure of the N-terminal fragment of Escherichia coli Lon protease

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, Mi; Basic Research Program, SAIC-Frederick, Frederick, MD 21702; Gustchina, Alla

2010-08-01

The medium-resolution structure of the N-terminal fragment of E. coli Lon protease shows that this part of the enzyme consists of two compact domains and a very long α-helix. The structure of a recombinant construct consisting of residues 1–245 of Escherichia coli Lon protease, the prototypical member of the A-type Lon family, is reported. This construct encompasses all or most of the N-terminal domain of the enzyme. The structure was solved by SeMet SAD to 2.6 Å resolution utilizing trigonal crystals that contained one molecule in the asymmetric unit. The molecule consists of two compact subdomains and a very long C-terminal α-helix. The structure of the first subdomain (residues 1–117), which consists mostly of β-strands, is similar to that of the shorter fragment previously expressed and crystallized, whereas the second subdomain is almost entirely helical. The fold and spatial relationship of the two subdomains, with the exception of the C-terminal helix, closely resemble the structure of BPP1347, a 203-amino-acid protein of unknown function from Bordetella parapertussis, and more distantly several other proteins. It was not possible to refine the structure to satisfactory convergence; however, since almost all of the Se atoms could be located on the basis of their anomalous scattering the correctness of the overall structure is not in question. The structure reported here was also compared with the structures of the putative substrate-binding domains of several proteins, showing topological similarities that should help in defining the binding sites used by Lon substrates.
Re-refinement from deposited X-ray data can deliver improved models for most PDB entries.

PubMed

Joosten, Robbie P; Womack, Thomas; Vriend, Gert; Bricogne, Gérard

2009-02-01

The deposition of X-ray data along with the customary structural models defining PDB entries makes it possible to apply large-scale re-refinement protocols to these entries, thus giving users the benefit of improvements in X-ray methods that have occurred since the structure was deposited. Automated gradient refinement is an effective method to achieve this goal, but real-space intervention is most often required in order to adequately address problems detected by structure-validation software. In order to improve the existing protocol, automated re-refinement was combined with structure validation and difference-density peak analysis to produce a catalogue of problems in PDB entries that are amenable to automatic correction. It is shown that re-refinement can be effective in producing improvements, which are often associated with the systematic use of the TLS parameterization of B factors, even for relatively new and high-resolution PDB entries, while the accompanying manual or semi-manual map analysis and fitting steps show good prospects for eventual automation. It is proposed that the potential for simultaneous improvements in methods and in re-refinement results be further encouraged by broadening the scope of depositions to include refinement metadata and ultimately primary rather than reduced X-ray data.
Detailed analysis of grid-based molecular docking: A case study of CDOCKER-A CHARMm-based MD docking algorithm.

PubMed

Wu, Guosheng; Robertson, Daniel H; Brooks, Charles L; Vieth, Michal

2003-10-01

The influence of various factors on the accuracy of protein-ligand docking is examined. The factors investigated include the role of a grid representation of protein-ligand interactions, the initial ligand conformation and orientation, the sampling rate of the energy hyper-surface, and the final minimization. A representative docking method is used to study these factors, namely, CDOCKER, a molecular dynamics (MD) simulated-annealing-based algorithm. A major emphasis in these studies is to compare the relative performance and accuracy of various grid-based approximations to explicit all-atom force field calculations. In these docking studies, the protein is kept rigid while the ligands are treated as fully flexible and a final minimization step is used to refine the docked poses. A docking success rate of 74% is observed when an explicit all-atom representation of the protein (full force field) is used, while a lower accuracy of 66-76% is observed for grid-based methods. All docking experiments considered a 41-member protein-ligand validation set. A significant improvement in accuracy (76 vs. 66%) for the grid-based docking is achieved if the explicit all-atom force field is used in a final minimization step to refine the docking poses. Statistical analysis shows that even lower-accuracy grid-based energy representations can be effectively used when followed with full force field minimization. The results of these grid-based protocols are statistically indistinguishable from the detailed atomic dockings and provide up to a sixfold reduction in computation time. For the test case examined here, improving the docking accuracy did not necessarily enhance the ability to estimate binding affinities using the docked structures. Copyright 2003 Wiley Periodicals, Inc.
Flexible CDOCKER: Development and application of a pseudo-explicit structure-based docking method within CHARMM

PubMed Central

Gagnon, Jessica K.; Law, Sean M.; Brooks, Charles L.

2016-01-01

Protein-ligand docking is a commonly used method for lead identification and refinement. While traditional structure-based docking methods represent the receptor as a rigid body, recent developments have been moving toward the inclusion of protein flexibility. Proteins exist in an inter-converting ensemble of conformational states, but effectively and efficiently searching the conformational space available to both the receptor and ligand remains a well-appreciated computational challenge. To this end, we have developed the Flexible CDOCKER method as an extension of the family of complete docking solutions available within CHARMM. This method integrates atomically detailed side chain flexibility with grid-based docking methods, maintaining efficiency while allowing the protein and ligand configurations to explore their conformational space simultaneously. This is in contrast to existing approaches that use induced-fit like sampling, such as Glide or Autodock, where the protein or the ligand space is sampled independently in an iterative fashion. Presented here are developments to the CHARMM docking methodology to incorporate receptor flexibility and improvements to the sampling protocol as demonstrated with re-docking trials on a subset of the CCDC/Astex set. These developments within CDOCKER achieve docking accuracy competitive with or exceeding the performance of other widely utilized docking programs. PMID:26691274
Flexible CDOCKER: Development and application of a pseudo-explicit structure-based docking method within CHARMM.

PubMed

Gagnon, Jessica K; Law, Sean M; Brooks, Charles L

2016-03-30

Protein-ligand docking is a commonly used method for lead identification and refinement. While traditional structure-based docking methods represent the receptor as a rigid body, recent developments have been moving toward the inclusion of protein flexibility. Proteins exist in an interconverting ensemble of conformational states, but effectively and efficiently searching the conformational space available to both the receptor and ligand remains a well-appreciated computational challenge. To this end, we have developed the Flexible CDOCKER method as an extension of the family of complete docking solutions available within CHARMM. This method integrates atomically detailed side chain flexibility with grid-based docking methods, maintaining efficiency while allowing the protein and ligand configurations to explore their conformational space simultaneously. This is in contrast to existing approaches that use induced-fit like sampling, such as Glide or Autodock, where the protein or the ligand space is sampled independently in an iterative fashion. Presented here are developments to the CHARMM docking methodology to incorporate receptor flexibility and improvements to the sampling protocol as demonstrated with re-docking trials on a subset of the CCDC/Astex set. These developments within CDOCKER achieve docking accuracy competitive with or exceeding the performance of other widely utilized docking programs. © 2015 Wiley Periodicals, Inc.
Cloning, overexpression, purification and preliminary crystallographic studies of a mitochondrial type II peroxiredoxin from Pisum sativum.

PubMed

Barranco-Medina, Sergio; López-Jaramillo, Francisco Javier; Bernier-Villamor, Laura; Sevilla, Francisca; Lázaro, Juan José

2006-07-01

A cDNA encoding an open reading frame of 199 amino acids corresponding to a type II peroxiredoxin from Pisum sativum with its transit peptide was isolated by RT-PCR. The 171-amino-acid mature protein (estimated molecular weight 18.6 kDa) was cloned into the pET3d vector and overexpressed in Escherichia coli. The recombinant protein was purified and crystallized by the hanging-drop vapour-diffusion technique. A full data set (98.2% completeness) was collected using a rotating-anode generator to a resolution of 2.8 angstroms from a single crystal flash-cooled at 100 K. X-ray data revealed that the protein crystallizes in space group P1, with unit-cell parameters a = 61.88, b = 66.40, c = 77.23 angstroms, alpha = 102.90, beta = 104.40, gamma = 99.07 degrees, and molecular replacement using a theoretical model predicted from the primary structure as a search model confirmed the presence of six molecules in the unit cell as expected from the Matthews coefficient. Refinement of the structure is in progress.
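The Matthews-coefficient check mentioned in this abstract can be reproduced from the numbers it reports. The sketch below is illustrative only and is not taken from the paper: it uses the standard triclinic cell-volume formula, the reported cell parameters, six molecules of 18.6 kDa per P1 cell, and the common approximation that the solvent fraction is about 1 - 1.23/V_M. The function names are hypothetical.

```python
import math

def triclinic_volume(a, b, c, alpha_deg, beta_deg, gamma_deg):
    """Unit-cell volume (cubic angstroms) for a general triclinic cell."""
    ca, cb, cg = (math.cos(math.radians(x))
                  for x in (alpha_deg, beta_deg, gamma_deg))
    return a * b * c * math.sqrt(1.0 - ca**2 - cb**2 - cg**2 + 2.0 * ca * cb * cg)

def matthews_coefficient(cell_volume, n_molecules, mol_weight_da):
    """V_M in cubic angstroms per dalton (cell volume per unit of protein mass)."""
    return cell_volume / (n_molecules * mol_weight_da)

def solvent_fraction(v_m):
    """Approximate solvent fraction, assuming a typical protein density."""
    return 1.0 - 1.23 / v_m

# Values reported in the abstract (space group P1, so one asymmetric unit per cell).
volume = triclinic_volume(61.88, 66.40, 77.23, 102.90, 104.40, 99.07)
v_m = matthews_coefficient(volume, n_molecules=6, mol_weight_da=18600.0)
print(f"V = {volume:.0f} A^3, V_M = {v_m:.2f} A^3/Da, "
      f"solvent ~ {100 * solvent_fraction(v_m):.0f}%")
```

A V_M in the range usually seen for protein crystals (roughly 1.7 to 3.5 Å^3/Da) is what makes six copies per cell a plausible packing.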
Mapping the temperature-dependent conformational landscapes of the dynamic enzymes cyclophilin A and urease

NASA Astrophysics Data System (ADS)

Thorne, Robert; Keedy, Daniel; Warkentin, Matthew; Fraser, James; Moreau, David; Atakisi, Hakan; Rau, Peter

Proteins populate complex, temperature-dependent ensembles of conformations that enable their function. Yet in X-ray crystallographic studies, roughly 98% of structures have been determined at 100 K, and most refined to only a single conformation. A combination of experimental methods enabled by studies of ice formation and computational methods for mining low-density features in electron density maps have been applied to determine the evolution of the conformational landscapes of the enzymes cyclophilin A and urease between 300 K and 100 K. Minority conformations of most side chains depopulate on cooling from 300 to ~200 K, below which subsequent conformational evolution is quenched. The characteristic temperatures for this depopulation are highly heterogeneous throughout each enzyme. The temperature-dependent ensemble of the active site flap in urease has also been mapped. These all-atom, site-resolved measurements and analyses rule out one interpretation of the protein-solvent glass transition, and give an alternative interpretation of a dynamical transition identified in site-averaged experiments. They demonstrate a powerful approach to structural characterization of the dynamic underpinnings of protein function. Supported by NSF MCB-1330685.
Structural characterization of acylimine-containing blue and red chromophores in mTagBFP and TagRFP fluorescent proteins.

PubMed

Subach, Oksana M; Malashkevich, Vladimir N; Zencheck, Wendy D; Morozova, Kateryna S; Piatkevich, Kiryl D; Almo, Steven C; Verkhusha, Vladislav V

2010-04-23

We determined the 2.2 Å crystal structures of the red fluorescent protein TagRFP and its derivative, the blue fluorescent protein mTagBFP. The crystallographic analysis is consistent with a model in which TagRFP has the trans coplanar anionic chromophore with the conjugated pi-electron system, similar to that of DsRed-like chromophores. The refined conformation of mTagBFP suggests the presence of an N-acylimine functionality in its chromophore and a single C(alpha)-C(beta) bond in the Tyr64 side chain. The mass spectrum of the mTagBFP chromophore-bearing peptide indicates a loss of 20 Da upon maturation, whereas tandem mass spectrometry reveals that the C(alpha)-N bond in Leu63 is oxidized. These data indicate that mTagBFP has a new type of chromophore, N-[(5-hydroxy-1H-imidazole-2-yl)methylidene]acetamide. We propose a chemical mechanism in which the DsRed-like chromophore is formed via the mTagBFP-like blue intermediate. (c) 2010 Elsevier Ltd. All rights reserved.
Water-refined solution structure of the human Grb7-SH2 domain in complex with the erbB2 receptor peptide pY1139.

PubMed

Pias, Sally C; Johnson, Dennis L; Smith, David E; Lyons, Barbara A

2012-08-01

We report a refinement in implicit water of the previously published solution structure of the Grb7-SH2 domain bound to the erbB2 receptor peptide pY1139. Structure quality measures indicate substantial improvement, with residues in the most favored regions of the Ramachandran plot increasing by 14 % and with WHAT IF statistics (Vriend, G. J. Mol. Graph., 1990, 8(1), 52-56) falling closer to expected values for well-refined structures.
Towards solution and refinement of organic crystal structures by fitting to the atomic pair distribution function

DOE Office of Scientific and Technical Information (OSTI.GOV)

Prill, Dragica; Juhas, Pavol; Billinge, Simon J. L.

2016-01-01

In this study, a method towards the solution and refinement of organic crystal structures by fitting to the atomic pair distribution function (PDF) is developed. Approximate lattice parameters and molecular geometry must be given as input. The molecule is generally treated as a rigid body. The positions and orientations of the molecules inside the unit cell are optimized starting from random values. The PDF is obtained from carefully measured X-ray powder diffraction data. The method resembles 'real-space' methods for structure solution from powder data, but works with PDF data instead of the diffraction pattern itself. As such it may be used in situations where the organic compounds are not long-range-ordered, are poorly crystalline, or nanocrystalline. The procedure was applied to solve and refine the crystal structures of quinacridone (β phase), naphthalene and allopurinol. In the case of allopurinol it was even possible to successfully solve and refine the structure in P1 with four independent molecules. As an example of a flexible molecule, the crystal structure of paracetamol was refined using restraints for bond lengths, bond angles and selected torsion angles. In all cases, the resulting structures are in excellent agreement with structures from single-crystal data.
Microfocus diffraction from different regions of a protein crystal: structural variations and unit-cell polymorphism

DOE Office of Scientific and Technical Information (OSTI.GOV)

Thompson, Michael C.; Cascio, Duilio; Yeates, Todd O.

Real macromolecular crystals can be non-ideal in a myriad of ways. This often creates challenges for structure determination, while also offering opportunities for greater insight into the crystalline state and the dynamic behavior of macromolecules. To evaluate whether different parts of a single crystal of a dynamic protein, EutL, might be informative about crystal and protein polymorphism, a microfocus X-ray synchrotron beam was used to collect a series of 18 separate data sets from non-overlapping regions of the same crystal specimen. A principal component analysis (PCA) approach was employed to compare the structure factors and unit cells across the data sets, and it was found that the 18 data sets separated into two distinct groups, with large R values (in the 40% range) and significant unit-cell variations between the members of the two groups. This categorization mapped the different data-set types to distinct regions of the crystal specimen. Atomic models of EutL were then refined against two different data sets obtained by separately merging data from the two distinct groups. A comparison of the two resulting models revealed minor but discernable differences in certain segments of the protein structure, and regions of higher deviation were found to correlate with regions where larger dynamic motions were predicted to occur by normal-mode molecular-dynamics simulations. The findings emphasize that large spatially dependent variations may be present across individual macromolecular crystals. This information can be uncovered by simultaneous analysis of multiple partial data sets and can be exploited to reveal new insights about protein dynamics, while also improving the accuracy of the structure-factor data ultimately obtained in X-ray diffraction experiments.
Insight into the Intermolecular Recognition Mechanism between Keap1 and IKKβ Combining Homology Modelling, Protein-Protein Docking, Molecular Dynamics Simulations and Virtual Alanine Mutation

PubMed Central

Jiang, Zheng-Yu; Chu, Hong-Xi; Xi, Mei-Yang; Yang, Ting-Ting; Jia, Jian-Min; Huang, Jing-Jie; Guo, Xiao-Ke; Zhang, Xiao-Jin; You, Qi-Dong; Sun, Hao-Peng

2013-01-01

Degradation of certain proteins through the ubiquitin-proteasome pathway is a common strategy taken by the key modulators responsible for stress responses. Kelch-like ECH-associated protein-1 (Keap1), a substrate adaptor component of the Cullin3 (Cul3)-based ubiquitin E3 ligase complex, mediates the ubiquitination of two key modulators, NF-E2-related factor 2 (Nrf2) and IκB kinase β (IKKβ), which are involved in the redox control of gene transcription. However, compared to the Keap1-Nrf2 protein-protein interaction (PPI), the intermolecular recognition mechanism of Keap1 and IKKβ has been poorly investigated. In order to explore the binding pattern between Keap1 and IKKβ, the PPI model of Keap1 and IKKβ was investigated. The structure of human IKKβ was constructed by homology modeling, using the reported crystal structure of Xenopus laevis IKKβ as the template. A protein-protein docking method was applied to develop the Keap1-IKKβ complex model. After refinement and visual analysis of the docked proteins, the chosen pose was further optimized through molecular dynamics simulations. The resulting structure was used to conduct virtual alanine mutation to explore the hot-spots significant for the intermolecular interaction. Overall, our results provide structural insights into the PPI model of Keap1-IKKβ and suggest that the substrate specificity of Keap1 depends on the interaction with the key tyrosines, namely Tyr525, Tyr574 and Tyr334. The study presented in the current project may be useful for designing molecules that selectively modulate Keap1. The selective recognition mechanism of Keap1 with IKKβ or Nrf2 will be helpful for further understanding the crosstalk between NF-κB and Nrf2 signaling. PMID:24066166
Insight into the intermolecular recognition mechanism between Keap1 and IKKβ combining homology modelling, protein-protein docking, molecular dynamics simulations and virtual alanine mutation.

PubMed

Jiang, Zheng-Yu; Chu, Hong-Xi; Xi, Mei-Yang; Yang, Ting-Ting; Jia, Jian-Min; Huang, Jing-Jie; Guo, Xiao-Ke; Zhang, Xiao-Jin; You, Qi-Dong; Sun, Hao-Peng

2013-01-01

Degradation of certain proteins through the ubiquitin-proteasome pathway is a common strategy taken by the key modulators responsible for stress responses. Kelch-like ECH-associated protein-1 (Keap1), a substrate adaptor component of the Cullin3 (Cul3)-based ubiquitin E3 ligase complex, mediates the ubiquitination of two key modulators, NF-E2-related factor 2 (Nrf2) and IκB kinase β (IKKβ), which are involved in the redox control of gene transcription. However, compared to the Keap1-Nrf2 protein-protein interaction (PPI), the intermolecular recognition mechanism of Keap1 and IKKβ has been poorly investigated. In order to explore the binding pattern between Keap1 and IKKβ, the PPI model of Keap1 and IKKβ was investigated. The structure of human IKKβ was constructed by homology modeling, using the reported crystal structure of Xenopus laevis IKKβ as the template. A protein-protein docking method was applied to develop the Keap1-IKKβ complex model. After refinement and visual analysis of the docked proteins, the chosen pose was further optimized through molecular dynamics simulations. The resulting structure was used to conduct virtual alanine mutation to explore the hot-spots significant for the intermolecular interaction. Overall, our results provide structural insights into the PPI model of Keap1-IKKβ and suggest that the substrate specificity of Keap1 depends on the interaction with the key tyrosines, namely Tyr525, Tyr574 and Tyr334. The study presented in the current project may be useful for designing molecules that selectively modulate Keap1. The selective recognition mechanism of Keap1 with IKKβ or Nrf2 will be helpful for further understanding the crosstalk between NF-κB and Nrf2 signaling.
Optimization of Melt Treatment for Austenitic Steel Grain Refinement

NASA Astrophysics Data System (ADS)

Lekakh, Simon N.; Ge, Jun; Richards, Von; O'Malley, Ron; TerBush, Jessica R.

2017-02-01

Refinement of the as-cast grain structure of austenitic steels requires the presence of active solid nuclei during solidification. These nuclei can be formed in situ in the liquid alloy by promoting reactions between transition metals (Ti, Zr, Nb, and Hf) and metalloid elements (C, S, O, and N) dissolved in the melt. Using thermodynamic simulations, experiments were designed to evaluate the effectiveness of a predicted sequence of reactions targeted to form precipitates that could act as active nuclei for grain refinement in austenitic steel castings. Melt additions performed to promote the sequential precipitation of titanium nitride (TiN) onto previously formed spinel (Al2MgO4) inclusions in the melt resulted in a significant refinement of the as-cast grain structure in heavy section Cr-Ni-Mo stainless steel castings. A refined as-cast structure consisting of an inner fine-equiaxed grain structure and outer columnar dendrite zone structure of limited length was achieved in experimental castings. The sequential precipitation of TiN onto Al2MgO4 was confirmed using automated SEM/EDX and TEM analyses.
Ab Initio Protein Structure Prediction Using Chunk-TASSER

PubMed Central

Zhou, Hongyi; Skolnick, Jeffrey

2007-01-01

We have developed an ab initio protein structure prediction method called chunk-TASSER that uses ab initio folded supersecondary structure chunks of a given target as well as threading templates for obtaining contact potentials and distance restraints. The predicted chunks, selected on the basis of a new fragment comparison method, are folded by a fragment insertion method. Full-length models are built and refined by the TASSER methodology, which searches conformational space via parallel hyperbolic Monte Carlo. We employ an optimized reduced force field that includes knowledge-based statistical potentials and restraints derived from the chunks as well as threading templates. The method is tested on a dataset of 425 hard target proteins ≤250 amino acids in length. The average TM-scores of the best of top five models per target are 0.266, 0.336, and 0.362 by the threading algorithm SP3, original TASSER and chunk-TASSER, respectively. For a subset of 80 proteins with predicted α-helix content ≥50%, these averages are 0.284, 0.356, and 0.403, respectively. The percentages of proteins with the best of top five models having TM-score ≥0.4 (a statistically significant threshold for structural similarity) are 3.76, 20.94, and 28.94% by SP3, TASSER, and chunk-TASSER, respectively, overall, while for the subset of 80 predominantly helical proteins, these percentages are 2.50, 23.75, and 41.25%. Thus, chunk-TASSER shows a significant improvement over TASSER for modeling hard targets where no good template can be identified. We also tested chunk-TASSER on 21 medium/hard targets <200 amino-acids-long from CASP7. Chunk-TASSER is ∼11% (10%) better than TASSER for the total TM-score of the first (best of top five) models. Chunk-TASSER is fully automated and can be used in proteome scale protein structure prediction. PMID:17496016
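For readers unfamiliar with the TM-score thresholds quoted in this abstract, the following sketch shows how the score is computed from per-residue distances. It is an illustration under stated assumptions, not code from chunk-TASSER: it assumes the model has already been superimposed on the native structure and that `distances` holds the Cα-Cα distances (in Å) of the aligned residue pairs. The full TM-score additionally searches for the superposition that maximizes the sum, which is omitted here. The function names and the 0.5 Å floor on d0 are assumptions of this sketch.

```python
def tm_d0(target_length):
    """Length-dependent distance scale d0 (angstroms) used by the TM-score."""
    if target_length <= 15:
        raise ValueError("d0 formula is only meaningful for targets longer than 15 residues")
    # Assumed floor of 0.5 A so that very short targets still give a positive scale.
    return max(1.24 * (target_length - 15.0) ** (1.0 / 3.0) - 1.8, 0.5)

def tm_score(distances, target_length):
    """TM-score for one fixed superposition.

    distances: distances (angstroms) between aligned residue pairs.
    target_length: number of residues in the native (target) structure;
    unaligned target residues contribute zero to the sum.
    """
    d0 = tm_d0(target_length)
    return sum(1.0 / (1.0 + (d / d0) ** 2) for d in distances) / target_length

# Toy example: a 100-residue target with 60 well-placed and 40 poorly placed residues.
example = [1.0] * 60 + [8.0] * 40
print(f"TM-score = {tm_score(example, 100):.3f}")
```

Because each term is bounded by 1 and the sum is normalized by the target length, the score lies between 0 and 1 and is less sensitive to a few badly placed residues than RMSD.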
High-Resolution Crystal Structures of Protein Helices Reconciled with Three-Centered Hydrogen Bonds and Multipole Electrostatics

PubMed Central

Kuster, Daniel J.; Liu, Chengyu; Fang, Zheng; Ponder, Jay W.; Marshall, Garland R.

2015-01-01

Theoretical and experimental evidence for non-linear hydrogen bonds in protein helices is ubiquitous. In particular, amide three-centered hydrogen bonds are common features of helices in high-resolution crystal structures of proteins. These high-resolution structures (1.0 to 1.5 Å nominal crystallographic resolution) position backbone atoms without significant bias from modeling constraints and identify Φ = -62°, Ψ = -43° as the consensus backbone torsional angles of protein helices. These torsional angles preserve the atomic positions of α-β carbons of the classic Pauling α-helix while allowing the amide carbonyls to form bifurcated hydrogen bonds as first suggested by Némethy et al. in 1967. Molecular dynamics simulations of a capped 12-residue oligoalanine in water with AMOEBA (Atomic Multipole Optimized Energetics for Biomolecular Applications), a second-generation force field that includes multipole electrostatics and polarizability, reproduce the experimentally observed high-resolution helical conformation and correctly reorient the amide-bond carbonyls into bifurcated hydrogen bonds. This simple modification of backbone torsional angles reconciles experimental and theoretical views to provide a unified view of amide three-centered hydrogen bonds as crucial components of protein helices. The reason why they have been overlooked by structural biologists depends on the small crankshaft-like changes in orientation of the amide bond that allow maintenance of the overall helical parameters (helix pitch (p) and residues per turn (n)). The Pauling 3.6(13) α-helix fits the high-resolution experimental data with the minor exception of the amide-carbonyl electron density, but the previously associated backbone torsional angles (Φ, Ψ) needed slight modification to be reconciled with three-atom centered H-bonds and multipole electrostatics. Thus, a new standard helix, the 3.6(13/10)-, Némethy- or N-helix, is proposed.
Due to the use of constraints from monopole force fields and assumed secondary structures used in low-resolution refinement of electron density of proteins, such structures in the PDB often show linear hydrogen bonding. PMID:25894612

High-resolution crystal structures of protein helices reconciled with three-centered hydrogen bonds and multipole electrostatics.

PubMed

Kuster, Daniel J; Liu, Chengyu; Fang, Zheng; Ponder, Jay W; Marshall, Garland R

2015-01-01

Theoretical and experimental evidence for non-linear hydrogen bonds in protein helices is ubiquitous. In particular, amide three-centered hydrogen bonds are common features of helices in high-resolution crystal structures of proteins. These high-resolution structures (1.0 to 1.5 Å nominal crystallographic resolution) position backbone atoms without significant bias from modeling constraints and identify Φ = -62°, Ψ = -43° as the consensus backbone torsional angles of protein helices. These torsional angles preserve the atomic positions of α-β carbons of the classic Pauling α-helix while allowing the amide carbonyls to form bifurcated hydrogen bonds as first suggested by Némethy et al. in 1967. Molecular dynamics simulations of a capped 12-residue oligoalanine in water with AMOEBA (Atomic Multipole Optimized Energetics for Biomolecular Applications), a second-generation force field that includes multipole electrostatics and polarizability, reproduce the experimentally observed high-resolution helical conformation and correctly reorient the amide-bond carbonyls into bifurcated hydrogen bonds. This simple modification of backbone torsional angles reconciles experimental and theoretical views to provide a unified view of amide three-centered hydrogen bonds as crucial components of protein helices. The reason why they have been overlooked by structural biologists depends on the small crankshaft-like changes in orientation of the amide bond that allow maintenance of the overall helical parameters (helix pitch (p) and residues per turn (n)). The Pauling 3.6(13) α-helix fits the high-resolution experimental data with the minor exception of the amide-carbonyl electron density, but the previously associated backbone torsional angles (Φ, Ψ) needed slight modification to be reconciled with three-atom centered H-bonds and multipole electrostatics. Thus, a new standard helix, the 3.6(13/10)-, Némethy- or N-helix, is proposed.
Due to the use of constraints from monopole force fields and assumed secondary structures used in low-resolution refinement of electron density of proteins, such structures in the PDB often show linear hydrogen bonding.
Structure of the lutein-binding domain of human StARD3 at 1.74 Å resolution and model of a complex with lutein

DOE Office of Scientific and Technical Information (OSTI.GOV)

Horvath, Martin P., E-mail: martin.horvath@utah.edu; George, Evan W.; Tran, Quang T.

The structure of a START-domain protein known to bind lutein in the human retina is reported to an improved resolution limit. Rigid-body docking demonstrates that at least a portion of lutein must protrude from the large tunnel-like cavity characteristic of this helix-grip protein and suggests a mechanism for lutein binding specificity. A crystal structure of the lutein-binding domain of human StARD3 (StAR-related lipid-transfer protein 3; also known as MLN64) has been refined to 1.74 Å resolution. A previous structure of the same protein determined to 2.2 Å resolution highlighted homology with StARD1 and shared cholesterol-binding character. StARD3 has since been recognized as a carotenoid-binding protein in the primate retina, where its biochemical function of binding lutein with specificity appears to be well suited to recruit this photoprotective molecule. The current and previous structures correspond closely to each other (r.m.s.d. of 0.25 Å), especially in terms of the helix-grip fold constructed around a solvent-filled cavity. Regions of interest were defined with alternate conformations in the current higher-resolution structure, including Arg351 found within the cavity and Ω1, a loop of four residues found just outside the cavity entrance. Models of the complex with lutein generated by rigid-body docking indicate that one of the ionone rings must protrude outside the cavity, and this insight has implications for molecular interactions with transport proteins and enzymes that act on lutein. Interestingly, models with the ε-ionone ring characteristic of lutein pointing towards the bottom of the cavity were associated with fewer steric clashes, suggesting that steric complementarity and ligand asymmetry may play a role in discriminating lutein from the other ocular carotenoids zeaxanthin and meso-zeaxanthin, which only have β-ionone rings.
WatAA: Atlas of Protein Hydration. Exploring synergies between data mining and ab initio calculations.

PubMed

Černý, Jiří; Schneider, Bohdan; Biedermannová, Lada

2017-07-14

Water molecules represent an integral part of proteins and a key determinant of protein structure, dynamics and function. WatAA is a newly developed, web-based atlas of amino-acid hydration in proteins. The atlas provides information about the ordered first hydration shell of the most populated amino-acid conformers in proteins. The data presented in the atlas are drawn from two sources: experimental data and ab initio quantum-mechanics calculations. The experimental part is based on a data-mining study of a large set of high-resolution protein crystal structures. The crystal-derived data include 3D maps of water distribution around amino-acids and probability of occurrence of each of the identified hydration sites. The quantum mechanics calculations validate and extend this primary description by optimizing the water position for each hydration site, by providing hydrogen atom positions and by quantifying the interaction energy that stabilizes the water molecule at the particular hydration site position. The calculations show that the majority of experimentally derived hydration sites are positioned near local energy minima for water, and the calculated interaction energies help to assess the preference of water for the individual hydration sites. We propose that the atlas can be used to validate water placement in electron density maps in crystallographic refinement, to locate water molecules mediating protein-ligand interactions in drug design, and to prepare and evaluate molecular dynamics simulations. WatAA: Atlas of Protein Hydration is freely available without login at .
New generation of elastic network models.

PubMed

López-Blanco, José Ramón; Chacón, Pablo

2016-04-01

The intrinsic flexibility of proteins and nucleic acids can be grasped from remarkably simple mechanical models of particles connected by springs. In recent decades, Elastic Network Models (ENMs) combined with Normal Mode Analysis have widely confirmed their ability to predict biologically relevant motions of biomolecules and soon became a popular methodology to reveal large-scale dynamics in multiple structural biology scenarios. The simplicity, robustness, low computational cost, and relatively high accuracy are the reasons behind the success of ENMs. This review focuses on recent advances in the development and application of ENMs, paying particular attention to combinations with experimental data. Successful application scenarios include large macromolecular machines, structural refinement, docking, and evolutionary conservation. Copyright © 2015 Elsevier Ltd. All rights reserved.
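The "particles connected by springs" picture can be made concrete with one of the simplest ENM variants, a Gaussian network model. The sketch below is a minimal illustration and is not drawn from the review: it assumes one node per residue, identical springs between all nodes closer than a cutoff (7 Å here, an assumed value), and uses NumPy for the eigen-decomposition. The function names are hypothetical.

```python
import numpy as np

def gnm_modes(coords, cutoff=7.0):
    """Build the Kirchhoff (connectivity) matrix and return its eigen-decomposition.

    coords: (N, 3) array of node positions in angstroms.
    Returns eigenvalues in ascending order and the matching eigenvectors (columns).
    """
    coords = np.asarray(coords, dtype=float)
    dist = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)
    contact = (dist <= cutoff) & ~np.eye(len(coords), dtype=bool)
    kirchhoff = -contact.astype(float)            # -1 for each spring
    kirchhoff[np.diag_indices_from(kirchhoff)] = contact.sum(axis=1)  # node degree
    return np.linalg.eigh(kirchhoff)

def gnm_fluctuations(eigvals, eigvecs, tol=1e-8):
    """Relative mean-square fluctuation per node from the non-zero modes."""
    keep = eigvals > tol                          # drop the trivial zero mode(s)
    return (eigvecs[:, keep] ** 2 / eigvals[keep]).sum(axis=1)

# Toy system: ten beads on a line, 3.8 A apart (only nearest neighbours in contact).
chain = np.array([[3.8 * i, 0.0, 0.0] for i in range(10)])
eigvals, eigvecs = gnm_modes(chain)
msf = gnm_fluctuations(eigvals, eigvecs)
print("slowest non-zero eigenvalue:", eigvals[1])
print("end bead vs. middle bead fluctuation:", msf[0], msf[5])
```

The low-eigenvalue modes correspond to the collective, large-scale motions that ENM studies typically analyze; in this toy chain the free ends fluctuate more than the middle.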
Chiral pathways in DNA dinucleotides using gradient optimized refinement along metastable borders

NASA Astrophysics Data System (ADS)

Romano, Pablo; Guenza, Marina

We present a study of DNA breathing fluctuations using Markov state models (MSM) with our novel refinement procedure. MSM have become a favored method of building kinetic models; however, their accuracy has always depended on using a significant number of microstates, making the method costly. We present a method which optimizes macrostates by refining borders with respect to the gradient along the free energy surface. As the separation between macrostates contains the highest discretization errors, this method corrects for errors produced by limited microstate sampling. Using our refined MSM methods, we investigate DNA breathing fluctuations, which are thermally induced conformational changes in native B-form DNA. We ran several microsecond MD simulations of DNA dinucleotides of varying sequences, to include sequence and polarity effects, and analyzed them with our refined MSM to investigate the conformational pathways inherent in the unstacking of DNA bases. Our kinetic analysis has shown preferential chirality in unstacking pathways that may be critical in how proteins interact with single-stranded regions of DNA. These breathing dynamics can help elucidate the connection between conformational changes and key mechanisms within protein-DNA recognition. Supported by the NSF Chemistry Division (Theoretical Chemistry), the Division of Physics (Condensed Matter: Material Theory), and XSEDE.
Exploiting structure similarity in refinement: automated NCS and target-structure restraints in BUSTER

PubMed Central

Smart, Oliver S.; Womack, Thomas O.; Flensburg, Claus; Keller, Peter; Paciorek, Włodek; Sharff, Andrew; Vonrhein, Clemens; Bricogne, Gérard

2012-01-01

Maximum-likelihood X-ray macromolecular structure refinement in BUSTER has been extended with restraints facilitating the exploitation of structural similarity. The similarity can be between two or more chains within the structure being refined, thus favouring NCS, or to a distinct ‘target’ structure that remains fixed during refinement. The local structural similarity restraints (LSSR) approach considers all distances less than 5.5 Å between pairs of atoms in the chain to be restrained. For each, the difference from the distance between the corresponding atoms in the related chain is found. LSSR applies a restraint penalty on each difference. A functional form that reaches a plateau for large differences is used to avoid the restraints distorting parts of the structure that are not similar. Because LSSR are local, there is no need to separate out domains. Some restraint pruning is still necessary, but this has been automated. LSSR have been available to academic users of BUSTER since 2009 with the easy-to-use -autoncs and -target target.pdb options. The use of LSSR is illustrated in the re-refinement of PDB entries 5rnt, where -target enables the correct ligand-binding structure to be found, and 1osg, where -autoncs contributes to the location of an additional copy of the cyclic peptide ligand. PMID:22505257
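The LSSR idea described in this abstract can be sketched in a few lines. The code below is an illustration only and does not reproduce BUSTER's implementation: the abstract states that the penalty reaches a plateau for large differences but does not give its functional form, so the saturating form `delta**2 / (1 + (delta/sigma)**2)` and the `sigma` and `weight` parameters are assumptions of this sketch. The 5.5 Å cutoff is taken from the abstract. Atoms are assumed to be listed in the same order in both chains.

```python
import math
from itertools import combinations

def lssr_penalty(chain, related_chain, cutoff=5.5, sigma=0.5, weight=1.0):
    """Sum of plateauing penalties on differences between corresponding distances.

    chain, related_chain: lists of (x, y, z) tuples with matching atom order.
    Returns (total_penalty, number_of_restrained_pairs).
    """
    total, n_pairs = 0.0, 0
    for i, j in combinations(range(len(chain)), 2):
        d = math.dist(chain[i], chain[j])
        if d >= cutoff:                      # only short distances are restrained
            continue
        d_ref = math.dist(related_chain[i], related_chain[j])
        delta = d - d_ref
        # Assumed saturating form: quadratic for small delta,
        # levelling off at weight * sigma**2 for large delta.
        total += weight * delta**2 / (1.0 + (delta / sigma) ** 2)
        n_pairs += 1
    return total, n_pairs

# Toy example: four atoms on a line; the related copy has one atom displaced far away.
chain_a = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0), (4.5, 0.0, 0.0)]
chain_b = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0), (100.0, 0.0, 0.0)]
print(lssr_penalty(chain_a, chain_a))   # identical chains give zero penalty
print(lssr_penalty(chain_a, chain_b))   # the outlier contribution stays bounded
```

The bounded penalty is what lets genuinely different regions escape the restraint instead of being pulled toward the related chain.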
Homology modeling of Homo sapiens lipoic acid synthase: Substrate docking and insights on its binding mode.

PubMed

Krishnamoorthy, Ezhilarasi; Hassan, Sameer; Hanna, Luke Elizabeth; Padmalayam, Indira; Rajaram, Rama; Viswanathan, Vijay

2017-05-07

Lipoic acid synthase (LIAS) is an iron-sulfur cluster mitochondrial enzyme which catalyzes the final step in the de novo pathway for the biosynthesis of lipoic acid, a potent antioxidant. Recently there has been significant interest in its role in metabolic diseases, and deficiency in LIAS expression has been linked to conditions such as diabetes, atherosclerosis and neonatal-onset epilepsy, suggesting a strong inverse correlation between LIAS reduction and disease status. In this study we use a bioinformatics approach to predict its structure, which would be helpful for understanding its role. A homology model for LIAS protein was generated using the X-ray crystallographic structure from Thermosynechococcus elongatus BP-1 (PDB ID: 4U0P). The predicted structure has 93% of the residues in the most favoured region of the Ramachandran plot. The active site of LIAS protein was mapped and docked with S-Adenosyl Methionine (SAM) using GOLD software. The LIAS-SAM complex was further refined using molecular dynamics simulation within subsite 1 and subsite 3 of the active site. To the best of our knowledge, this is the first study to report a reliable homology model of LIAS protein. This study will facilitate a better understanding of the mode of action of the enzyme-substrate complex for future studies in designing drugs that can target LIAS protein. Copyright © 2017 Elsevier Ltd. All rights reserved.
Structural model of the hUbA1-UbcH10 quaternary complex: in silico and experimental analysis of the protein-protein interactions between E1, E2 and ubiquitin.

PubMed

Correale, Stefania; de Paola, Ivan; Morgillo, Carmine Marco; Federico, Antonella; Zaccaro, Laura; Pallante, Pierlorenzo; Galeone, Aldo; Fusco, Alfredo; Pedone, Emilia; Luque, F Javier; Catalanotti, Bruno

2014-01-01

UbcH10 is a component of the Ubiquitin Conjugation Enzymes (Ubc; E2) involved in the ubiquitination cascade controlling the cell cycle progression, whereby ubiquitin, activated by E1, is transferred through E2 to the target protein with the involvement of E3 enzymes. In this work we propose the first three dimensional model of the tetrameric complex formed by the human UbA1 (E1), two ubiquitin molecules and UbcH10 (E2), leading to the transthiolation reaction. The 3D model was built up by using an experimentally guided incremental docking strategy that combined homology modeling, protein-protein docking and refinement by means of molecular dynamics simulations. The structural features of the in silico model allowed us to identify the regions that mediate the recognition between the interacting proteins, revealing the active role of the ubiquitin crosslinked to E1 in the complex formation. Finally, the role of these regions involved in the E1-E2 binding was validated by designing short peptides that specifically interfere with the binding of UbcH10, thus supporting the reliability of the proposed model and representing valuable scaffolds for the design of peptidomimetic compounds that can bind selectively to Ubcs and inhibit the ubiquitylation process in pathological disorders.
Adaptive mesh refinement and load balancing based on multi-level block-structured Cartesian mesh

NASA Astrophysics Data System (ADS)

Misaka, Takashi; Sasaki, Daisuke; Obayashi, Shigeru

2017-11-01

We developed a framework for a distributed-memory parallel computer that enables dynamic data management for adaptive mesh refinement and load balancing. We employed the simple data structure of the building cube method (BCM), where a computational domain is divided into multi-level cubic domains and each cube has the same number of grid points inside, realising a multi-level block-structured Cartesian mesh. Solution adaptive mesh refinement, which works efficiently with the help of the dynamic load balancing, was implemented by dividing cubes based on mesh refinement criteria. The framework was investigated with the Laplace equation in terms of adaptive mesh refinement, load balancing and the parallel efficiency. It was then applied to the incompressible Navier-Stokes equations to simulate a turbulent flow around a sphere. We considered wall-adaptive cube refinement where a non-dimensional wall distance y+ near the sphere is used as a criterion of mesh refinement. The result showed the load imbalance due to y+ adaptive mesh refinement was corrected by the present approach. To utilise the BCM framework more effectively, we also tested a cube-wise algorithm switching where explicit and implicit time integration schemes are switched depending on the local Courant-Friedrichs-Lewy (CFL) condition in each cube.
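The cube-wise switching mentioned at the end of this abstract can be illustrated with a minimal sketch. It assumes the usual one-dimensional definition of the CFL number (speed × time step / grid spacing) and a hypothetical threshold of 1.0. The abstract does not state the actual criterion or data layout, so the dictionary keys and function names below are illustrative only.

```python
def cfl_number(speed, dt, dx):
    """Local CFL number for a characteristic speed, time step and grid spacing."""
    return abs(speed) * dt / dx


def choose_schemes(cubes, dt, cfl_limit=1.0):
    """Pick a time-integration scheme per cube.

    Each cube is a dict with hypothetical keys "speed" and "dx". Cubes whose
    local CFL number stays within `cfl_limit` use the explicit scheme, and the
    finer (or faster) cubes fall back to the implicit one.
    """
    return [
        "explicit" if cfl_number(c["speed"], dt, c["dx"]) <= cfl_limit else "implicit"
        for c in cubes
    ]
```

With a shared time step, refined cubes near the wall have a smaller `dx` and hence a larger CFL number, which is why a per-cube choice can be attractive.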
NMR Studies on Structure and Dynamics of the Monomeric Derivative of BS-RNase: New Insights for 3D Domain Swapping

PubMed Central

Spadaccini, Roberta; Ercole, Carmine; Gentile, Maria A.; Sanfelice, Domenico; Boelens, Rolf; Wechselberger, Rainer; Batta, Gyula; Bernini, Andrea; Niccolai, Neri; Picone, Delia

2012-01-01

Three-dimensional domain swapping is a common phenomenon in pancreatic-like ribonucleases. In the aggregated state, these proteins acquire new biological functions, including selective cytotoxicity against tumour cells. RNase A is able to dislocate both N- and C-termini, but usually this process requires denaturing conditions. In contrast, bovine seminal ribonuclease (BS-RNase), which is a homo-dimeric protein sharing 80% sequence identity with RNase A, occurs natively as a mixture of swapped and unswapped isoforms. The presence of two disulfides bridging the subunits, indeed, ensures a dimeric structure also to the unswapped molecule. In vitro, the two BS-RNase isoforms interconvert under physiological conditions. Since the tendency to swap is often related to the instability of the monomeric proteins, in this paper we have analysed in detail the stability in solution of the monomeric derivative of BS-RNase (mBS) by a combination of NMR studies and Molecular Dynamics Simulations. The refinement of NMR structure and relaxation data indicate a close similarity with RNase A, without any evidence of aggregation or partial opening. The high compactness of mBS structure is confirmed also by H/D exchange, urea denaturation, and TEMPOL mapping of the protein surface. The present extensive structural and dynamic investigation of (monomeric) mBS did not show any experimental evidence that could explain the known differences in swapping between BS-RNase and RNase A. Hence, we conclude that the swapping in BS-RNase must be influenced by the distinct features of the dimers, suggesting a prominent role for the interchain disulfide bridges. PMID:22253705
Hydrogens detected by subatomic resolution protein crystallography in a [NiFe] hydrogenase.

PubMed

Ogata, Hideaki; Nishikawa, Koji; Lubitz, Wolfgang

2015-04-23

The enzyme hydrogenase reversibly converts dihydrogen to protons and electrons at a metal catalyst. The location of the abundant hydrogens is of key importance for understanding structure and function of the protein. However, in protein X-ray crystallography the detection of hydrogen atoms is one of the major problems, since they display only weak contributions to diffraction and the quality of the single crystals is often insufficient to obtain sub-ångström resolution. Here we report the crystal structure of a standard [NiFe] hydrogenase (∼91.3 kDa molecular mass) at 0.89 Å resolution. The strictly anoxically isolated hydrogenase has been obtained in a specific spectroscopic state, the active reduced Ni-R (subform Ni-R1) state. The high resolution, proper refinement strategy and careful modelling allow the positioning of a large part of the hydrogen atoms in the structure. This has led to the direct detection of the products of the heterolytic splitting of dihydrogen into a hydride (H(-)) bridging the Ni and Fe and a proton (H(+)) attached to the sulphur of a cysteine ligand. The Ni-H(-) and Fe-H(-) bond lengths are 1.58 Å and 1.78 Å, respectively. Furthermore, we can assign the Fe-CO and Fe-CN(-) ligands at the active site, and can obtain the hydrogen-bond networks and the preferred proton transfer pathway in the hydrogenase. Our results demonstrate the precise comprehensive information available from ultra-high-resolution structures of proteins as an alternative to neutron diffraction and other methods such as NMR structural analysis.
GalaxyTBM: template-based modeling by building a reliable core and refining unreliable local regions.

PubMed

Ko, Junsu; Park, Hahnbeom; Seok, Chaok

2012-08-10

Protein structures can be reliably predicted by template-based modeling (TBM) when experimental structures of homologous proteins are available. However, it is challenging to obtain structures more accurate than the single best templates by either combining information from multiple templates or by modeling regions that vary among templates or are not covered by any templates. We introduce GalaxyTBM, a new TBM method in which the more reliable core region is modeled first from multiple templates and less reliable, variable local regions, such as loops or termini, are then detected and re-modeled by an ab initio method. This TBM method is based on "Seok-server," which was tested in CASP9 and assessed to be amongst the top TBM servers. The accuracy of the initial core modeling is enhanced by focusing on more conserved regions in the multiple-template selection and multiple sequence alignment stages. Additional improvement is achieved by ab initio modeling of up to 3 unreliable local regions in the fixed framework of the core structure. Overall, GalaxyTBM reproduced the performance of Seok-server, with GalaxyTBM and Seok-server resulting in average GDT-TS of 68.1 and 68.4, respectively, when tested on 68 single-domain CASP9 TBM targets. For application to multi-domain proteins, GalaxyTBM must be combined with domain-splitting methods. Application of GalaxyTBM to CASP9 targets demonstrates that accurate protein structure prediction is possible by use of a multiple-template-based approach, and ab initio modeling of variable regions can further enhance the model quality.
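For readers unfamiliar with the GDT-TS values quoted above, the following sketch shows how the score is usually defined: the average, over cutoffs of 1, 2, 4 and 8 Å, of the percentage of residues whose Cα atoms lie within that cutoff of the reference structure. The sketch assumes the per-residue distances are already available from a single fixed superposition. The full GDT procedure searches over many superpositions for each cutoff, which is not reproduced here, and the function name is hypothetical.

```python
def gdt_ts(distances, cutoffs=(1.0, 2.0, 4.0, 8.0)):
    """Simplified GDT-TS (0 to 100) from per-residue Cα deviations in Å.

    Assumes `distances` comes from one fixed superposition of model and
    reference; the real measure maximises each fraction separately.
    """
    n = len(distances)
    fractions = [sum(1 for d in distances if d <= c) / n for c in cutoffs]
    return 100.0 * sum(fractions) / len(cutoffs)
```

For example, deviations of 0.5, 1.5, 3.0, 7.0 and 10.0 Å give fractions of 0.2, 0.4, 0.6 and 0.8 at the four cutoffs, so the simplified score is 50.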
The short-lived signaling state of the photoactive yellow protein photoreceptor revealed by combined structural probes.

PubMed

Ramachandran, Pradeep L; Lovett, Janet E; Carl, Patrick J; Cammarata, Marco; Lee, Jae Hyuk; Jung, Yang Ouk; Ihee, Hyotcherl; Timmel, Christiane R; van Thor, Jasper J

2011-06-22

The signaling state of the photoactive yellow protein (PYP) photoreceptor is transiently developed via isomerization of its blue-light-absorbing chromophore. The associated structural rearrangements have large amplitude but, due to its transient nature and chemical exchange reactions that complicate NMR detection, its accurate three-dimensional structure in solution has been elusive. Here we report on direct structural observation of the transient signaling state by combining double electron electron resonance spectroscopy (DEER), NMR, and time-resolved pump-probe X-ray solution scattering (TR-SAXS/WAXS). Measurement of distance distributions for doubly spin-labeled photoreceptor constructs using DEER spectroscopy suggests that the signaling state is well ordered and shows that interspin-label distances change reversibly up to 19 Å upon illumination. The SAXS/WAXS difference signal for the signaling state relative to the ground state indicates the transient formation of an ordered and rearranged conformation, which has an increased radius of gyration, an increased maximum dimension, and a reduced excluded volume. Dynamical annealing calculations using the DEER derived long-range distance restraints in combination with short-range distance information from (1)H-(15)N HSQC perturbation spectroscopy give strong indication for a rearrangement that places part of the N-terminal domain in contact with the exposed chromophore binding cleft while the terminal residues extend away from the core. Time-resolved global structural information from pump-probe TR-SAXS/WAXS data supports this conformation and allows subsequent structural refinement that includes the combined energy terms from DEER, NMR, and SAXS/WAXS together. The resulting ensemble simultaneously satisfies all restraints, and the inclusion of TR-SAXS/WAXS effectively reduces the uncertainty arising from the possible spin-label orientations. 
The observations are essentially compatible with reduced folding of the I(2)' state (also referred to as the 'pB' state) that is widely reported, but indicate it to be relatively ordered and rearranged. Furthermore, there is direct evidence for the repositioning of the N-terminal region in the I(2)' state, which is structurally modeled by dynamical annealing and refinement calculations.
Interolog interfaces in protein–protein docking

PubMed Central

Alsop, James D.

2015-01-01

Proteins are essential elements of biological systems, and their function typically relies on their ability to successfully bind to specific partners. Recently, an emphasis of study into protein interactions has been on hot spots, or residues in the binding interface that make a significant contribution to the binding energetics. In this study, we investigate how conservation of hot spots can be used to guide docking prediction. We show that the use of evolutionary data combined with hot spot prediction highlights near‐native structures across a range of benchmark examples. Our approach explores various strategies for using hot spots and evolutionary data to score protein complexes, using both absolute and chemical definitions of conservation along with refinements to these strategies that look at windowed conservation and filtering to ensure a minimum number of hot spots in each binding partner. Finally, structure‐based models of orthologs were generated for comparison with sequence‐based scoring. Using two data sets of 22 and 85 examples, a high rate of top 10 and top 1 predictions is observed, with up to 82% of examples returning a top 10 hit and 35% returning a top 1 hit depending on the data set and strategy applied; upon inclusion of the native structure among the decoys, up to 55% of examples yielded a top 1 hit. The 20 common examples between data sets show that more carefully curated interolog data yields better predictions, particularly in achieving top 1 hits. Proteins 2015; 83:1940–1946. © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc. PMID:25740680
Three-dimensional crystal structure of recombinant murine interferon-beta.

PubMed Central

Senda, T; Shimazu, T; Matsuda, S; Kawano, G; Shimizu, H; Nakamura, K T; Mitsui, Y

1992-01-01

The crystal structure of recombinant murine interferon-beta (IFN-beta) has been solved by the multiple isomorphous replacement method and refined to an R-factor of 20.5% against 2.6 Å X-ray diffraction data. The structure shows a variant of the alpha-helix bundle with a new chain-folding topology, which seems to represent a basic structural framework of all the IFN-alpha and IFN-beta molecules belonging to the type I family. Functionally important segments of the polypeptide chain, as implied through numerous gene manipulation studies carried out so far, are spatially clustered indicating the binding site(s) to the receptor(s). Comparison of the present structure with those of other alpha-helical cytokine proteins, including porcine growth hormone, interleukin 2 and interferon gamma, indicated either a topological similarity in chain folding or a similar spatial arrangement of the alpha-helices. PMID:1505514
Structure of the SH3 Domain of Rat Endophilin A2

DOE Office of Scientific and Technical Information (OSTI.GOV)

Loll, P.; Swain, E.; Chen, Y.

2008-01-01

The crystal structure of the SH3 domain of rat endophilin A2 has been determined by the multiwavelength anomalous dispersion method and refined at a resolution of 1.70 Å to R and Rfree values of 0.196 and 0.217, respectively. The structure adheres to the canonical SH3-domain fold and is highly similar to those of the corresponding domains of endophilins A1 and A3. An intermolecular packing interaction between two molecules in the lattice exploits features that are commonly observed in SH3-domain ligand recognition, including the insertion of a proline side chain into the ligand-binding groove of the protein and the recognition of a basic residue by a cluster of acidic side chains on the RT loop.
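The R and Rfree values reported in records such as this one follow the standard crystallographic residual, R = Σ| |Fobs| − |Fcalc| | / Σ|Fobs|. A minimal sketch is given below. It assumes the calculated amplitudes are already on the same scale as the observed ones (real refinement programs apply a scale factor first), and the function names and the way the free set is passed in are hypothetical.

```python
def r_factor(f_obs, f_calc):
    """Crystallographic R-factor from paired structure-factor amplitudes.

    Assumes f_calc has already been scaled to f_obs.
    """
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator


def r_work_and_free(f_obs, f_calc, free_indices):
    """Split reflections into working and free sets and report both residuals.

    `free_indices` marks the reflections excluded from refinement.
    """
    free = set(free_indices)
    work = [i for i in range(len(f_obs)) if i not in free]
    pick = lambda values, idx: [values[i] for i in idx]
    r_work = r_factor(pick(f_obs, work), pick(f_calc, work))
    r_free = r_factor(pick(f_obs, sorted(free)), pick(f_calc, sorted(free)))
    return r_work, r_free
```

Rfree is computed with the same formula as R but only over reflections that were held out of refinement, which is why it is typically somewhat higher, as in the 0.196 and 0.217 reported here.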
Microstructural characteristics of adiabatic shear localization in a metastable beta titanium alloy deformed at high strain rate and elevated temperatures

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhan, Hongyi; Zeng, Weidong; Wang, Gui

2015-04-15

The microstructural evolution and grain refinement within adiabatic shear bands in the Ti6554 alloy deformed at high strain rates and elevated temperatures have been characterized using transmission electron microscopy. No stress drops were observed in the corresponding stress–strain curve, indicating that the initiation of adiabatic shear bands does not lead to the loss of load capacity for the Ti6554 alloy. The outer region of the shear bands mainly consists of cell structures bounded by dislocation clusters. Equiaxed subgrains in the core area of the shear band can be evolved from the subdivision of cell structures or reconstruction and transverse segmentation of dislocation clusters. It is proposed that dislocation activity dominates the grain refinement process. The rotational recrystallization mechanism may operate as the kinetic requirements for it are fulfilled. The coexistence of different substructures across the shear bands implies that the microstructural evolution inside the shear bands is not homogeneous and different grain refinement mechanisms may operate simultaneously to refine the structure. Highlights: • The microstructure within the adiabatic shear band was characterized by TEM. • No stress drops were observed in the corresponding stress–strain curve. • Dislocation activity dominated the grain refinement process. • The kinetic requirements for rotational recrystallization mechanism were fulfilled. • Different grain refinement mechanisms operated simultaneously to refine the structure.
ADAPTIVE TETRAHEDRAL GRID REFINEMENT AND COARSENING IN MESSAGE-PASSING ENVIRONMENTS

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hallberg, J.; Stagg, A.

2000-10-01

A grid refinement and coarsening scheme has been developed for tetrahedral and triangular grid-based calculations in message-passing environments. The element adaption scheme is based on an edge bisection of elements marked for refinement by an appropriate error indicator. Hash-table/linked-list data structures are used to store nodal and element information. The grid along inter-processor boundaries is refined and coarsened consistently with the update of these data structures via MPI calls. The parallel adaption scheme has been applied to the solution of a transient, three-dimensional, nonlinear, groundwater flow problem. Timings indicate efficiency of the grid refinement process relative to the flow solver calculations.
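The edge-bisection step and the role of a hash table can be illustrated in serial for a single triangle. This is a sketch under assumptions: the abstract does not say which edge is bisected, so the longest edge is chosen here, a Python dict stands in for the hash table, and the MPI exchange along inter-processor boundaries is omitted entirely. All names are hypothetical.

```python
import math


def bisect_longest_edge(nodes, triangle, midpoint_cache):
    """Split one triangle into two by bisecting its longest edge.

    nodes: list of (x, y) coordinates, extended in place with new midpoints.
    triangle: tuple of three node indices in counter-clockwise order.
    midpoint_cache: dict mapping a sorted edge (i, j) to its midpoint node,
        so that neighbouring elements sharing the edge reuse the same node.
    """
    edges = [(triangle[k], triangle[(k + 1) % 3]) for k in range(3)]
    a, b = max(edges, key=lambda e: math.dist(nodes[e[0]], nodes[e[1]]))
    opposite = next(n for n in triangle if n not in (a, b))

    key = (min(a, b), max(a, b))
    if key not in midpoint_cache:
        nodes.append(tuple((p + q) / 2.0 for p, q in zip(nodes[a], nodes[b])))
        midpoint_cache[key] = len(nodes) - 1
    m = midpoint_cache[key]

    # Both children keep the orientation of the parent triangle.
    return [(a, m, opposite), (m, b, opposite)]
```

Keying the cache on the sorted edge means that when the neighbouring element across that edge is refined later, it finds the existing midpoint instead of creating a duplicate node, which keeps the grid conforming.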
MAIN software for density averaging, model building, structure refinement and validation

PubMed Central

Turk, Dušan

2013-01-01

MAIN is software that has been designed to interactively perform the complex tasks of macromolecular crystal structure determination and validation. Using MAIN, it is possible to perform density modification, manual and semi-automated or automated model building and rebuilding, real- and reciprocal-space structure optimization and refinement, map calculations and various types of molecular structure validation. The prompt availability of various analytical tools and the immediate visualization of molecular and map objects allow a user to efficiently progress towards the completed refined structure. The extraordinary depth perception of molecular objects in three dimensions that is provided by MAIN is achieved by the clarity and contrast of colours and the smooth rotation of the displayed objects. MAIN allows simultaneous work on several molecular models and various crystal forms. The strength of MAIN lies in its manipulation of averaged density maps and molecular models when noncrystallographic symmetry (NCS) is present. Using MAIN, it is possible to optimize NCS parameters and envelopes and to refine the structure in single or multiple crystal forms. PMID:23897458
Integration of QUARK and I-TASSER for ab initio protein structure prediction in CASP11

PubMed Central

Zhang, Wenxuan; Yang, Jianyi; He, Baoji; Walker, Sara Elizabeth; Zhang, Hongjiu; Govindarajoo, Brandon; Virtanen, Jouko; Xue, Zhidong; Shen, Hong-Bin; Zhang, Yang

2015-01-01

We tested two pipelines developed for template-free protein structure prediction in the CASP11 experiment. First, the QUARK pipeline constructs structure models by reassembling fragments of continuously distributed lengths excised from unrelated proteins. Five free-modeling (FM) targets have the model successfully constructed by QUARK with a TM-score above 0.4, including the first model of T0837-D1, which has a TM-score=0.736 and RMSD=2.9 Å to the native. Detailed analysis showed that the success is partly attributed to the high-resolution contact map prediction derived from fragment-based distance-profiles, which are mainly located between regular secondary structure elements and loops/turns and help guide the orientation of secondary structure assembly. In the Zhang-Server pipeline, weakly scoring threading templates are re-ordered by the structural similarity to the ab initio folding models, which are then reassembled by I-TASSER based structure assembly simulations; 60% more domains with length up to 204 residues, compared to the QUARK pipeline, were successfully modeled by the I-TASSER pipeline with a TM-score above 0.4. The robustness of the I-TASSER pipeline can stem from the composite fragment-assembly simulations that combine structures from both ab initio folding and threading template refinements. Despite the promising cases, challenges still exist in long-range beta-strand folding, domain parsing, and the uncertainty of secondary structure prediction; the latter of which was found to affect nearly all aspects of FM structure predictions, from fragment identification, target classification, structure assembly, to final model selection. Significant efforts are needed to solve these problems before real progress on FM could be made. PMID:26370505
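The TM-score thresholds quoted in this abstract refer to the standard TM-score definition, TM = (1/L) Σ 1 / (1 + (d_i/d0)²) with d0 = 1.24 (L − 15)^(1/3) − 1.8, where L is the target length and d_i are distances between aligned residue pairs. The sketch below evaluates that formula for a given set of distances. It assumes a fixed superposition (the real score is maximised over superpositions), and the 0.5 Å floor on d0 for short chains is an assumption of this sketch rather than something stated in the record.

```python
def tm_score(distances, target_length):
    """TM-score for one fixed superposition.

    distances: deviations (Å) of the aligned residue pairs.
    target_length: number of residues in the target (normalisation length).
    """
    d0 = 0.5  # assumed floor for short chains
    if target_length > 15:
        d0 = max(0.5, 1.24 * (target_length - 15) ** (1.0 / 3.0) - 1.8)
    return sum(1.0 / (1.0 + (d / d0) ** 2) for d in distances) / target_length
```

Because the sum is divided by the target length rather than the number of aligned pairs, a model that covers only half the target perfectly scores 0.5, which is why the score penalises incomplete models.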

PICKY: a novel SVD-based NMR spectra peak picking method.

PubMed

Alipanahi, Babak; Gao, Xin; Karakoc, Emre; Donaldson, Logan; Li, Ming

2009-06-15

Picking peaks from experimental NMR spectra is a key unsolved problem for automated NMR protein structure determination. Such a process is a prerequisite for resonance assignment, nuclear Overhauser enhancement (NOE) distance restraint assignment, and structure calculation tasks. Manual or semi-automatic peak picking, which is currently the predominant approach used in NMR labs, is tedious, time consuming and costly. We introduce new ideas, including noise-level estimation, component forming and sub-division, singular value decomposition (SVD)-based peak picking and peak pruning and refinement. PICKY is developed as an automated peak picking method. Different from the previous research on peak picking, we provide a systematic study of the proposed method. PICKY is tested on 32 real 2D and 3D spectra of eight target proteins, and achieves an average of 88% recall and 74% precision. PICKY is efficient. It takes PICKY on average 15.7 s to process an NMR spectrum. More important than these numbers, PICKY actually works in practice. We feed peak lists generated by PICKY to IPASS for resonance assignment, feed IPASS assignment to SPARTA for fragments generation, and feed SPARTA fragments to FALCON for structure calculation. This results in high-resolution structures of several proteins, for example, TM1112, at 1.25 Å. PICKY is available upon request. The peak lists of PICKY can be easily loaded by SPARKY to enable a better interactive strategy for rapid peak picking.
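The abstract does not spell out PICKY's algorithm, so the following is only a rough illustration of how a singular value decomposition can locate a peak in a small 2D spectral component. The leading singular vectors approximate the two one-dimensional line shapes, and their maxima give the peak position. Power iteration is used here to obtain the leading singular triplet without external libraries. The function names are hypothetical, and the sketch assumes a non-zero, non-negative component containing a single dominant peak.

```python
import math


def leading_singular_triplet(matrix, iterations=100):
    """Leading singular vectors and value of a 2D list via power iteration.

    Assumes the matrix is non-zero and non-negative (as for a spectral
    component above the noise floor), so the all-ones start vector is not
    orthogonal to the leading right singular vector.
    """
    rows, cols = len(matrix), len(matrix[0])
    v = [1.0] * cols
    u, sigma = [0.0] * rows, 0.0
    for _ in range(iterations):
        u = [sum(matrix[i][j] * v[j] for j in range(cols)) for i in range(rows)]
        norm_u = math.sqrt(sum(x * x for x in u))
        u = [x / norm_u for x in u]
        v = [sum(matrix[i][j] * u[i] for i in range(rows)) for j in range(cols)]
        sigma = math.sqrt(sum(x * x for x in v))
        v = [x / sigma for x in v]
    return u, sigma, v


def rank1_peak(matrix):
    """Return (row, column) of the peak implied by the rank-1 approximation."""
    u, _, v = leading_singular_triplet(matrix)
    row = max(range(len(u)), key=lambda i: abs(u[i]))
    col = max(range(len(v)), key=lambda j: abs(v[j]))
    return row, col
```

For a component that is close to an outer product of two line shapes, the rank-1 term captures almost all of the signal, so the maxima of the two vectors coincide with the peak centre.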
Structures of endothiapepsin-fragment complexes from crystallographic fragment screening using a novel, diverse and affordable 96-compound fragment library.

PubMed

Huschmann, Franziska U; Linnik, Janina; Sparta, Karine; Ühlein, Monika; Wang, Xiaojie; Metz, Alexander; Schiebel, Johannes; Heine, Andreas; Klebe, Gerhard; Weiss, Manfred S; Mueller, Uwe

2016-05-01

Crystallographic screening of the binding of small organic compounds (termed fragments) to proteins is increasingly important for medicinal chemistry-oriented drug discovery. To enable such experiments in a widespread manner, an affordable 96-compound library has been assembled for fragment screening in both academia and industry. The library is selected from already existing protein-ligand structures and is characterized by a broad ligand diversity, including buffer ingredients, carbohydrates, nucleotides, amino acids, peptide-like fragments and various drug-like organic compounds. When applied to the model protease endothiapepsin in a crystallographic screening experiment, a hit rate of nearly 10% was obtained. In comparison to other fragment libraries and considering that no pre-screening was performed, this hit rate is remarkably high. This demonstrates the general suitability of the selected compounds for an initial fragment-screening campaign. The library composition, experimental considerations and time requirements for a complete crystallographic fragment-screening campaign are discussed, as well as the nine fully refined endothiapepsin-fragment structures that were obtained. While most of the fragments bind close to the catalytic centre of endothiapepsin in poses that have been observed previously, two fragments address new sites on the protein surface. ITC measurements show that the fragments bind to endothiapepsin with millimolar affinity.
Structural insights into the adaptation of proliferating cell nuclear antigen (PCNA) from Haloferax volcanii to a high-salt environment

DOE Office of Scientific and Technical Information (OSTI.GOV)

Morgunova, Ekaterina, E-mail: ekaterina.morgunova@ki.se; Gray, Fiona C.; MacNeill, Stuart A.

2009-10-01

The crystal structure of PCNA from the halophilic archaeon H. volcanii reveals specific features of the charge distribution on the protein surface that reflect adaptation to a high-salt environment and suggests a different type of interaction with DNA in halophilic PCNAs. The sliding clamp proliferating cell nuclear antigen (PCNA) plays vital roles in many aspects of DNA replication and repair in eukaryotic cells and in archaea. Realising the full potential of archaea as a model for PCNA function requires a combination of biochemical and genetic approaches. In order to provide a platform for subsequent reverse genetic analysis, PCNA from the halophilic archaeon Haloferax volcanii was subjected to crystallographic analysis. The gene was cloned and expressed in Escherichia coli and the protein was purified by affinity chromatography and crystallized by the vapour-diffusion technique. The structure was determined by molecular replacement and refined at 3.5 Å resolution to a final R factor of 23.7% (Rfree = 25%). PCNA from H. volcanii was found to be homotrimeric and to resemble other homotrimeric PCNA clamps but with several differences that appear to be associated with adaptation of the protein to the high intracellular salt concentrations found in H. volcanii cells.
Multidimensional oriented solid-state NMR experiments enable the sequential assignment of uniformly 15N labeled integral membrane proteins in magnetically aligned lipid bilayers.

PubMed

Mote, Kaustubh R; Gopinath, T; Traaseth, Nathaniel J; Kitchen, Jason; Gor'kov, Peter L; Brey, William W; Veglia, Gianluigi

2011-11-01

Oriented solid-state NMR is the most direct methodology to obtain the orientation of membrane proteins with respect to the lipid bilayer. The method consists of measuring (1)H-(15)N dipolar couplings (DC) and (15)N anisotropic chemical shifts (CSA) for membrane proteins that are uniformly aligned with respect to the membrane bilayer. A significant advantage of this approach is that tilt and azimuthal (rotational) angles of the protein domains can be directly derived from analytical expression of DC and CSA values, or, alternatively, obtained by refining protein structures using these values as harmonic restraints in simulated annealing calculations. The Achilles' heel of this approach is the lack of suitable experiments for sequential assignment of the amide resonances. In this Article, we present a new pulse sequence that integrates proton driven spin diffusion (PDSD) with sensitivity-enhanced PISEMA in a 3D experiment ([(1)H,(15)N]-SE-PISEMA-PDSD). The incorporation of 2D (15)N/(15)N spin diffusion experiments into this new 3D experiment leads to the complete and unambiguous assignment of the (15)N resonances. The feasibility of this approach is demonstrated for the membrane protein sarcolipin reconstituted in magnetically aligned lipid bicelles. Taken with low electric field probe technology, this approach will propel the determination of sequential assignment as well as structure and topology of larger integral membrane proteins in aligned lipid bilayers. © Springer Science+Business Media B.V. 2011
Lessons from (co-)evolution in the docking of proteins and peptides for CAPRI Rounds 28-35.

PubMed

Yu, Jinchao; Andreani, Jessica; Ochsenbein, Françoise; Guerois, Raphaël

2017-03-01

Computational protein-protein docking is of great importance for understanding protein interactions at the structural level. Critical assessment of prediction of interactions (CAPRI) experiments provide the protein docking community with a unique opportunity to blindly test methods based on real-life cases and help accelerate methodology development. For CAPRI Rounds 28-35, we used an automatic docking pipeline integrating the coarse-grained co-evolution-based potential InterEvScore. This score was developed to exploit the information contained in the multiple sequence alignments of binding partners and selectively recognize co-evolved interfaces. Together with Zdock/Frodock for rigid-body docking, SOAP-PP for atomic potential and Rosetta applications for structural refinement, this pipeline reached high performance on a majority of targets. For protein-peptide docking and interfacial water position predictions, we also explored different means of taking evolutionary information into account. Overall, our group ranked 1st by correctly predicting 10 targets, composed of 1 High, 7 Medium and 2 Acceptable predictions. Excellent and Outstanding levels of accuracy were reached for each of the two water prediction targets, respectively. Altogether, in 15 out of 18 targets in total, evolutionary information, either through co-evolution or conservation analyses, could provide key constraints to guide modeling towards the most likely assemblies. These results open promising perspectives regarding the way evolutionary information can be valuable to improve docking prediction accuracy. Proteins 2017; 85:378-390. © 2016 Wiley Periodicals, Inc.
High-resolution Structures of Protein-Membrane Complexes by Neutron Reflection and MD Simulation: Membrane Association of the PTEN Tumor Suppressor

NASA Astrophysics Data System (ADS)

Lösche, Matthias

2012-02-01

The lipid matrix of biomembranes is an in-plane fluid, thermally and compositionally disordered leaflet of 5 nm thickness and notoriously difficult to characterize in structural terms. Yet, biomembranes are ubiquitous in the cell, and membrane-bound proteins are implicated in a variety of signaling pathways and intra-cellular transport. We developed methodology to study proteins associated with model membranes using neutron reflection measurements and showed recently that this approach can resolve the penetration depth and orientation of membrane proteins with ångström resolution if their crystal or NMR structure is known. Here we apply this technology to determine the membrane binding and unravel functional details of the PTEN phosphatase, a key player in the PI3K apoptosis pathway. PTEN is an important regulatory protein and tumor suppressor that performs its phosphatase activity as an interfacial enzyme at the plasma membrane-cytoplasm boundary. Acting as an antagonist to phosphoinositide-3-kinase (PI3K) in cell signaling, it is deleted in many human cancers. Despite its importance in regulating the levels of the phosphoinositoltriphosphate PI(3,4,5)P3, there is little understanding of how PTEN binds to membranes, is activated and then acts as a phosphatase. We investigated the structure and function of PTEN by studying its membrane affinity and localization on in-plane fluid, thermally disordered synthetic membrane models. The membrane association of the protein depends strongly on membrane composition, where phosphatidylserine (PS) and phosphatidylinositol diphosphate (PI(4,5)P2) act synergetically in attracting the enzyme to the membrane surface. Membrane affinities depend strongly on membrane fluidity, which suggests multiple binding sites on the protein for PI(4,5)P2.
Neutron reflection measurements show that the PTEN phosphatase "scoots" along the membrane surface (penetration < 5 Å) but binds the membrane tightly with its two major domains, the C2 and phosphatase domains. In the bound state, PTEN's regulatory C-terminal tail is displaced from the membrane and organized on the far side of the protein, ~60 Å away from the bilayer surface, in a rather compact structure. The combination of binding studies and neutron reflection allows us to distinguish between PTEN mutant proteins and ultimately may identify the structural features required for membrane binding and activation of PTEN. Molecular dynamics simulations, currently in progress, refine this structural picture further.
Solution structure of the catalytic domain of RICH protein from goldfish.

PubMed

Kozlov, Guennadi; Denisov, Alexey Y; Pomerantseva, Ekaterina; Gravel, Michel; Braun, Peter E; Gehring, Kalle

2007-03-01

Regeneration-induced CNPase homolog (RICH) is an axonal growth-associated protein, which is induced in teleost fish upon optical nerve injury. RICH consists of a highly acidic N-terminal domain, a catalytic domain with 2',3'-cyclic nucleotide 3'-phosphodiesterase (CNPase) activity and a C-terminal isoprenylation site. In vitro RICH and mammalian brain CNPase specifically catalyze the hydrolysis of 2',3'-cyclic nucleotides to produce 2'-nucleotides, but the physiologically relevant in vivo substrate remains unknown. Here, we report the NMR structure of the catalytic domain of goldfish RICH and describe its binding to CNPase inhibitors. The structure consists of a twisted nine-stranded antiparallel beta-sheet surrounded by alpha-helices on both sides. Despite significant local differences mostly arising from a seven-residue insert in the RICH sequence, the active site region is highly similar to that of human CNPase. Likewise, refinement of the catalytic domain of rat CNPase using residual dipolar couplings gave improved agreement with the published crystal structure. NMR titrations of RICH with inhibitors point to a similar catalytic mechanism for RICH and CNPase. The results suggest a functional importance for the evolutionarily conserved phosphodiesterase activity and hint of a link with pre-tRNA splicing.
Iterative refinement of structure-based sequence alignments by Seed Extension

PubMed Central

Kim, Changhoon; Tai, Chin-Hsien; Lee, Byungkook

2009-01-01

Background Accurate sequence alignment is required in many bioinformatics applications but, when sequence similarity is low, it is difficult to obtain accurate alignments based on sequence similarity alone. The accuracy improves when the structures are available, but current structure-based sequence alignment procedures still mis-align substantial numbers of residues. In order to correct such errors, we previously explored the possibility of replacing the residue-based dynamic programming algorithm in structure alignment procedures with the Seed Extension algorithm, which does not use a gap penalty. Here, we describe a new procedure called RSE (Refinement with Seed Extension) that iteratively refines a structure-based sequence alignment. Results RSE uses SE (Seed Extension) in its core, which is an algorithm that we reported recently for obtaining a sequence alignment from two superimposed structures. The RSE procedure was evaluated by comparing the correctly aligned fractions of residues before and after the refinement of the structure-based sequence alignments produced by popular programs. CE, DaliLite, FAST, LOCK2, MATRAS, MATT, TM-align, SHEBA and VAST were included in this analysis and the NCBI's CDD root node set was used as the reference alignments. RSE improved the average accuracy of sequence alignments for all programs tested when no shift error was allowed. The amount of improvement varied depending on the program. The average improvements were small for DaliLite and MATRAS but about 5% for CE and VAST. More substantial improvements have been seen in many individual cases. The additional computation times required for the refinements were negligible compared to the times taken by the structure alignment programs. Conclusion RSE is a computationally inexpensive way of improving the accuracy of a structure-based sequence alignment. 
It can be used as a standalone procedure following a regular structure-based sequence alignment or to replace the traditional iterative refinement procedures based on residue-level dynamic programming algorithm in many structure alignment programs. PMID:19589133
Structural consequences of metallothionein dimerization: solution structure of the isolated Cd4-alpha-domain and comparison with the holoprotein dimer.

PubMed

Ejnik, John W; Muñoz, Amalia; DeRose, Eugene; Shaw, C Frank; Petering, David H

2003-07-22

The NMR determination of the structure of Cd(7)-metallothionein was done previously using a relatively large protein concentration that favors dimer formation. The reactivity of the protein is also affected under this condition. To examine the influence of protein concentration on metallothionein conformation, the isolated Cd(4)-alpha-domain was prepared from rabbit metallothionein-2 (MT 2), and its three-dimensional structure was determined by heteronuclear, (1)H-(111)Cd, and homonuclear, (1)H-(1)H NMR, correlation experiments. The three-dimensional structure was refined using distance and angle constraints derived from these two-dimensional NMR data sets and a distance geometry/simulated annealing protocol. The backbone superposition of the alpha-domain from rabbit holoprotein Cd(7)-MT 2 and the isolated rabbit Cd(4)-alpha was measured at an RMSD of 2.0 Å. Nevertheless, the conformations of the two Cd-thiolate clusters were distinctly different at two of the cadmium centers. In addition, solvent access to the sulfhydryl ligands of the isolated Cd(4)-alpha cluster was 130% larger due to this small change in cluster geometry. To probe whether these differences were an artifact of the structure calculation, the Cd(4)-alpha-domain structure in rabbit Cd(7)-MT 2 was redetermined, using the previously defined set of NOEs and the present calculation protocol. All calculations employed the same ionic radius for Cd(2+) and same cadmium-thiolate bond distance. The newly calculated structure matched the original with an RMSD of 1.24 Å. It is hypothesized that differences in the two alpha-domain structures result from a perturbation of the holoprotein structure because of head-to-tail dimerization under the conditions of the NMR experiments.
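The backbone RMSD values quoted in this record come from superimposing two coordinate sets before measuring their deviation. A minimal sketch of one common way to do that, the Kabsch algorithm, is given below. It assumes two equally sized arrays of matched backbone atoms; the function name `kabsch_rmsd` is hypothetical, and the abstract does not state which superposition routine the authors actually used.

```python
import numpy as np

def kabsch_rmsd(P, Q):
    """RMSD between two (N, 3) coordinate sets after optimal rigid superposition."""
    P = np.asarray(P, dtype=float)
    Q = np.asarray(Q, dtype=float)
    # Remove the translational part by centering both sets on their centroids.
    Pc = P - P.mean(axis=0)
    Qc = Q - Q.mean(axis=0)
    # Covariance matrix and its singular value decomposition.
    H = Pc.T @ Qc
    U, _, Vt = np.linalg.svd(H)
    # Guard against an improper rotation (reflection).
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    # Rotate P onto Q and measure the residual deviation.
    diff = (R @ Pc.T).T - Qc
    return float(np.sqrt((diff ** 2).sum() / len(P)))
```

Two copies of the same structure that differ only by a rotation and a translation should give an RMSD of essentially zero, which is a convenient sanity check before comparing real models.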
Elucidation of sulfadoxine resistance with structural models of the bifunctional Plasmodium falciparum dihydropterin pyrophosphokinase-dihydropteroate synthase.

PubMed

de Beer, Tjaart A P; Louw, Abraham I; Joubert, Fourie

2006-07-01

Resistance of the most virulent human malaria parasite, Plasmodium falciparum, to antifolates is spreading with increasing speed, especially in Africa. Antifolate resistance is mainly caused by point mutations in the P. falciparum dihydropteroate synthase (DHPS) and dihydrofolate reductase (DHFR) target proteins. Homology models of the bifunctional P. falciparum dihydropterin pyrophosphokinase-dihydropteroate synthase (PPPK-DHPS) enzyme as well as the separate domains complete with bound substrates were constructed using the crystal structures of Saccharomyces cerevisiae (PPPK-DHPS), Mycobacterium tuberculosis (DHPS), Bacillus anthracis (DHPS), and Escherichia coli (PPPK) as templates. The resulting structures were subsequently solvated and refined using molecular dynamics. The active site residues of DHPS are highly conserved in S. cerevisiae, M. tuberculosis, E. coli, S. aureus, and B. anthracis, an attribute also shared by P. falciparum DHPS. Sulfadoxine was superimposed into the equivalent position of the p-aminobenzoic acid substrate and its binding parameters were refined using minimization and molecular dynamics. Sulfadoxine appears to interact with P. falciparum DHPS mainly through hydrophobic interactions. Rational explanations are provided by the model for the sulfadoxine resistance-causing effects of four of the five known mutations in P. falciparum DHPS. A possible structure for the bifunctional PPPK-DHPS was derived from the structure of the S. cerevisiae bifunctional enzyme. The active site residues of P. falciparum PPPK are also conserved when compared to S. cerevisiae, Haemophilus influenzae, and E. coli. The informative nature of these models opens up avenues for structure-based drug design approaches toward the development of alternative and more effective inhibitors of P. falciparum PPPK-DHPS.
Structure determination of a major facilitator peptide transporter: Inward facing PepTSt from Streptococcus thermophilus crystallized in space group P3121

PubMed Central

Quistgaard, Esben M.; Martinez Molledo, Maria

2017-01-01

Major facilitator superfamily (MFS) peptide transporters (typically referred to as PepT, POT or PTR transporters) mediate the uptake of di- and tripeptides, and so play an important dietary role in many organisms. In recent years, a better understanding of the molecular basis for this process has emerged, which is in large part due to a steep increase in structural information. Yet, the conformational transitions underlying the transport mechanism are still not fully understood, and additional data is therefore needed. Here we report in detail the detergent screening, crystallization, experimental MIRAS phasing, and refinement of the peptide transporter PepTSt from Streptococcus thermophilus. The space group is P3121, and the protein is crystallized in a monomeric inward facing form. The binding site is likely to be somewhat occluded, as the lobe encompassing transmembrane helices 10 and 11 is markedly bent towards the central pore of the protein, but the extent of this potential occlusion could not be determined due to disorder at the apex of the lobe. Based on structural comparisons with the seven previously determined P212121 and C2221 structures of inward facing PepTSt, the structural flexibility as well as the conformational changes mediating transition between the inward open and inward facing occluded states are discussed. In conclusion, this report improves our understanding of the structure and conformational cycle of PepTSt, and can furthermore serve as a case study, which may aid in supporting future structure determinations of additional MFS transporters or other integral membrane proteins. PMID:28264013
An object-oriented approach for parallel self adaptive mesh refinement on block structured grids

NASA Technical Reports Server (NTRS)

Lemke, Max; Witsch, Kristian; Quinlan, Daniel

1993-01-01

Self-adaptive mesh refinement dynamically matches the computational demands of a solver for partial differential equations to the activity in the application's domain. In this paper we present two C++ class libraries, P++ and AMR++, which significantly simplify the development of sophisticated adaptive mesh refinement codes on (massively) parallel distributed memory architectures. The development is based on our previous research in this area. The C++ class libraries provide abstractions to separate the issues of developing parallel adaptive mesh refinement applications into those of parallelism, abstracted by P++, and adaptive mesh refinement, abstracted by AMR++. P++ is a parallel array class library to permit efficient development of architecture independent codes for structured grid applications, and AMR++ provides support for self-adaptive mesh refinement on block-structured grids of rectangular non-overlapping blocks. Using these libraries, the application programmers' work is greatly simplified to primarily specifying the serial single grid application and obtaining the parallel and self-adaptive mesh refinement code with minimal effort. Initial results for simple singular perturbation problems solved by self-adaptive multilevel techniques (FAC, AFAC), being implemented on the basis of prototypes of the P++/AMR++ environment, are presented. Singular perturbation problems frequently arise in large applications, e.g. in the area of computational fluid dynamics. They usually have solutions with layers which require adaptive mesh refinement and fast basic solvers in order to be resolved efficiently.
Molecular modeling of calmodulin: a comparison with crystallographic data

NASA Technical Reports Server (NTRS)

McDonald, J. J.; Rein, R.

1989-01-01

Two methods of side-chain placement on a modeled protein have been examined. Two molecular models of calmodulin were constructed that differ in the treatment of side chains prior to optimization of the molecule. A virtual bond analysis program developed by Purisima and Scheraga was used to determine the backbone conformation based on 2.2 angstroms resolution C alpha coordinates for the molecules. In the first model, side chains were initially constructed in an extended conformation. In the second model, a conformational grid search technique was employed. Calcium ions were treated explicitly during energy optimization using CHARMM. The models are compared to a recently published refined crystal structure of calmodulin. The results indicate that the initial choices for side chains affect not only the side chains themselves, but also have significant effects on the main-chain conformation and supersecondary structure. The conformational differences are discussed. Analysis of these and other methods makes possible the formulation of a methodology for more appropriate side-chain placement in modeled proteins.
The effects of rigid motions on elastic network model force constants

PubMed Central

Lezon, Timothy R.

2012-01-01

Elastic network models provide an efficient way to quickly calculate protein global dynamics from experimentally determined structures. The model’s single parameter, its force constant, determines the physical extent of equilibrium fluctuations. The values of force constants can be calculated by fitting to experimental data, but the results depend on the type of experimental data used. Here we investigate the differences between calculated values of force constants fit to data from NMR and X-ray structures. We find that X-ray B factors carry the signature of rigid-body motions, to the extent that B factors can be almost entirely accounted for by rigid motions alone. When fitting to more refined anisotropic temperature factors, the contributions of rigid motions are significantly reduced, indicating that the large contribution of rigid motions to B factors is a result of over-fitting. No correlation is found between force constants fit to NMR data and those fit to X-ray data, possibly due to the inability of NMR data to accurately capture protein dynamics. PMID:22228562
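To make the idea of fitting a force constant to B factors concrete, the sketch below uses the Gaussian network model (GNM), one simple elastic network variant. This choice is an assumption for illustration: the abstract does not say which elastic network formulation was used. In the GNM the isotropic B factor of residue i is B_i = (8π² k_B T / γ) [Γ⁻¹]_ii, where Γ is the contact (Kirchhoff) matrix and γ the force constant. The 7 Å cutoff, the 300 K temperature, and the function names are hypothetical choices.

```python
import numpy as np

KB_KCAL = 0.0019872041  # Boltzmann constant in kcal/(mol K)

def gnm_inverse_diagonal(coords, cutoff=7.0):
    """Diagonal of the pseudo-inverse of the GNM Kirchhoff matrix."""
    coords = np.asarray(coords, dtype=float)
    dist = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)
    # Off-diagonal entries are -1 for node pairs within the cutoff.
    kirchhoff = -(dist < cutoff).astype(float)
    np.fill_diagonal(kirchhoff, 0.0)
    # Diagonal entries hold the number of contacts of each node.
    np.fill_diagonal(kirchhoff, -kirchhoff.sum(axis=1))
    evals, evecs = np.linalg.eigh(kirchhoff)
    keep = evals > 1e-8  # drop the zero mode (uniform translation)
    return ((evecs[:, keep] ** 2) / evals[keep]).sum(axis=1)

def fit_force_constant(coords, b_factors, cutoff=7.0, temperature=300.0):
    """Least-squares force constant (kcal/mol/A^2) matching isotropic B factors."""
    inv_diag = gnm_inverse_diagonal(coords, cutoff)
    b = np.asarray(b_factors, dtype=float)
    # Fit B_i = slope * inv_diag_i with a line through the origin.
    slope = (inv_diag @ b) / (inv_diag @ inv_diag)
    return 8.0 * np.pi ** 2 * KB_KCAL * temperature / slope
```

Because the fit only rescales the predicted fluctuation profile, any rigid-body contribution present in the experimental B factors is absorbed into γ, which is the sensitivity the abstract describes.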
Multidataset Refinement Resonant Diffraction, and Magnetic Structures

PubMed Central

Attfield, J. Paul

2004-01-01

The scope of Rietveld and other powder diffraction refinements continues to expand, driven by improvements in instrumentation, methodology and software. This will be illustrated by examples from our research in recent years. Multidataset refinement is now commonplace; the datasets may be from different detectors, e.g., in a time-of-flight experiment, or from separate experiments, such as at several x-ray energies giving resonant information. The complementary use of x rays and neutrons is exemplified by a recent combined refinement of the monoclinic superstructure of magnetite, Fe3O4, below the 122 K Verwey transition, which reveals evidence for Fe2+/Fe3+ charge ordering. Powder neutron diffraction data continue to be used for the solution and Rietveld refinement of magnetic structures. Time-of-flight instruments on cold neutron sources can produce data that have a high intensity and good resolution at high d-spacings. Such profiles have been used to study incommensurate magnetic structures such as FeAsO4 and β–CrPO4. A multiphase, multidataset refinement of the phase-separated perovskite (Pr0.35Y0.07Th0.04Ca0.04Sr0.5)MnO3 has been used to fit three components with different crystal and magnetic structures at low temperatures. PMID:27366599
Disease-Associated Mutations Disrupt Functionally Important Regions of Intrinsic Protein Disorder

PubMed Central

Vacic, Vladimir; Markwick, Phineus R. L.; Oldfield, Christopher J.; Zhao, Xiaoyue; Haynes, Chad; Uversky, Vladimir N.; Iakoucheva, Lilia M.

2012-01-01

The effects of disease mutations on protein structure and function have been extensively investigated, and many predictors of the functional impact of single amino acid substitutions are publicly available. The majority of these predictors are based on protein structure and evolutionary conservation, following the assumption that disease mutations predominantly affect folded and conserved protein regions. However, the prevalence of the intrinsically disordered proteins (IDPs) and regions (IDRs) in the human proteome together with their lack of fixed structure and low sequence conservation raise a question about the impact of disease mutations in IDRs. Here, we investigate annotated missense disease mutations and show that 21.7% of them are located within such intrinsically disordered regions. We further demonstrate that 20% of disease mutations in IDRs cause local disorder-to-order transitions, which represents a 1.7–2.7 fold increase compared to annotated polymorphisms and neutral evolutionary substitutions, respectively. Secondary structure predictions show elevated rates of transition from helices and strands into loops and vice versa in the disease mutations dataset. Disease disorder-to-order mutations also influence predicted molecular recognition features (MoRFs) more often than the control mutations. The repertoire of disorder-to-order transition mutations is limited, with five most frequent mutations (R→W, R→C, E→K, R→H, R→Q) collectively accounting for 44% of all deleterious disorder-to-order transitions. As a proof of concept, we performed accelerated molecular dynamics simulations on a deleterious disorder-to-order transition mutation of tumor protein p63 and, in agreement with our predictions, observed an increased α-helical propensity of the region harboring the mutation. Our findings highlight the importance of mutations in IDRs and refine the traditional structure-centric view of disease mutations. 
The results of this study offer a new perspective on the role of mutations in disease, with implications for improving predictors of the functional impact of missense mutations. PMID:23055912
APPRIS 2017: principal isoforms for multiple gene sets

PubMed Central

Rodriguez-Rivas, Juan; Di Domenico, Tomás; Vázquez, Jesús; Valencia, Alfonso

2018-01-01

Abstract The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the ‘principal’ isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein coding genes and APPRIS principal isoforms are the best predictors of these main protein isoforms. Here, we present the updates to the database, new developments that include the addition of three new species (chimpanzee, Drosophila melanogaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species and refinements in the core methods that make up the annotation pipeline. In addition APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants. PMID:29069475
DOE Office of Scientific and Technical Information (OSTI.GOV)

Los Alamos National Laboratory, Mailstop M888, Los Alamos, NM 87545, USA; Lawrence Berkeley National Laboratory, One Cyclotron Road, Building 64R0121, Berkeley, CA 94720, USA; Department of Haematology, University of Cambridge, Cambridge CB2 0XY, England

The PHENIX AutoBuild Wizard is a highly automated tool for iterative model-building, structure refinement and density modification using RESOLVE or TEXTAL model-building, RESOLVE statistical density modification, and phenix.refine structure refinement. Recent advances in the AutoBuild Wizard and phenix.refine include automated detection and application of NCS from models as they are built, extensive model completion algorithms, and automated solvent molecule picking. Model completion algorithms in the AutoBuild Wizard include loop-building, crossovers between chains in different models of a structure, and side-chain optimization. The AutoBuild Wizard has been applied to a set of 48 structures at resolutions ranging from 1.1 Å to 3.2 Å, resulting in a mean R-factor of 0.24 and a mean free R factor of 0.29. The R-factor of the final model is dependent on the quality of the starting electron density, and relatively independent of resolution.
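The R-factor and free R factor reported in this record measure how well structure-factor amplitudes calculated from the model agree with the observed ones. The sketch below uses the standard definition R = Σ| |F_obs| − |F_calc| | / Σ|F_obs|. It assumes both amplitude sets are already on a common scale and that a boolean mask marks the reflections held out of refinement. The function names are hypothetical and are not part of phenix.refine.

```python
import numpy as np

def r_factor(f_obs, f_calc):
    """Crystallographic R-factor for amplitudes assumed to share one scale."""
    f_obs = np.abs(np.asarray(f_obs, dtype=float))
    f_calc = np.abs(np.asarray(f_calc, dtype=float))
    return float(np.abs(f_obs - f_calc).sum() / f_obs.sum())

def r_work_and_free(f_obs, f_calc, free_mask):
    """Return (R_work, R_free); free_mask is True for held-out reflections."""
    f_obs = np.asarray(f_obs, dtype=float)
    f_calc = np.asarray(f_calc, dtype=float)
    free_mask = np.asarray(free_mask, dtype=bool)
    r_work = r_factor(f_obs[~free_mask], f_calc[~free_mask])
    r_free = r_factor(f_obs[free_mask], f_calc[free_mask])
    return r_work, r_free
```

The free set is excluded from refinement, so a free R that sits well above the working R is a common warning sign of over-fitting.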
Electrostatic interactions and binding orientation of HIV-1 matrix studied by neutron reflectivity.

PubMed

Nanda, Hirsh; Datta, Siddhartha A K; Heinrich, Frank; Lösche, Mathias; Rein, Alan; Krueger, Susan; Curtis, Joseph E

2010-10-20

The N-terminal matrix (MA) domain of the HIV-1 Gag protein is responsible for binding to the plasma membrane of host cells during viral assembly. The putative membrane-binding interface of MA was previously mapped by means of mutagenesis and analysis of its trimeric crystal structure. However, the orientation of MA on membranes has not been directly determined by experimental measurements. We present neutron reflectivity measurements that resolve the one-dimensional scattering length density profile of MA bound to a biomimetic of the native viral membrane. A molecular refinement procedure was developed using atomic structures of MA to determine the orientation of the protein on the membrane. The orientation defines a lipid-binding interface consistent with previous mutagenesis results. The MA protein maintains this orientation without the presence of a myristate group, driven only by electrostatic interactions. Furthermore, MA is found to penetrate the membrane headgroup region peripherally such that only the side chains of specific Lys and Arg residues interact with the surface. The results suggest that electrostatic interactions are sufficient to favorably orient MA on viral membrane mimics. The spatial determination of the membrane-bound protein demonstrates the ability of neutron reflectivity to discern orientation and penetration under physiologically relevant conditions. Copyright © 2010 Biophysical Society. Published by Elsevier Inc. All rights reserved.
A structural perspective on the interactions of TRAF6 and Basigin during the onset of melanoma: A molecular dynamics simulation study.

PubMed

Biswas, Ria; Ghosh, Semanti; Bagchi, Angshuman

2017-11-01

Metastatic melanoma is the most fatal type of skin cancer. The roles of matrix metalloproteinases (MMPs) have been well established in the onset of melanoma. Basigin (BSG) belongs to the immunoglobulin superfamily and is critical for induction of extracellular MMPs during the onset of various cancers including melanoma. Tumor necrosis factor receptor-associated factor 6 (TRAF6) is an E3-ligase that interacts with BSG and mediates its membrane localization, which leads to MMP expression in melanoma cells. This makes TRAF6 a potential therapeutic target in melanoma. We here conducted protein-protein interaction studies on TRAF6 and BSG to gain molecular-level insights into their interaction. The structure of human BSG was constructed by protein threading. A molecular docking method was applied to build the TRAF6-BSG complex. The refined docked complex was further optimized by molecular dynamics simulations. Results from binding free energy, surface properties, and electrostatic interaction analysis indicate that Lys340 and Glu417 of TRAF6 act as the anchor residues in the protein interaction interface. The current study will be helpful in designing specific modulators of TRAF6 to control melanoma metastasis. Copyright © 2017 John Wiley & Sons, Ltd.

Structure of Hordeum vulgare NADPH-dependent thioredoxin reductase 2. Unwinding the reaction mechanism

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kirkensgaard, Kristine G.; Enzyme and Protein Chemistry, Department of Systems Biology, Technical University of Denmark; Hägglund, Per

2009-09-01

The first crystal structure of a cereal NTR, a protein involved in seed development and germination, has been determined. The structure is in a conformation that excludes NADPH binding and indicates that a domain reorientation facilitated by Trx binding precedes NADPH binding in the reaction mechanism. Thioredoxins (Trxs) are protein disulfide reductases that regulate the intracellular redox environment and are important for seed germination in plants. Trxs are in turn regulated by NADPH-dependent thioredoxin reductases (NTRs), which provide reducing equivalents to Trx using NADPH to recycle Trxs to the active form. Here, the first crystal structure of a cereal NTR, HvNTR2 from Hordeum vulgare (barley), is presented, which is also the first structure of a monocot plant NTR. The structure was determined at 2.6 Å resolution and refined to an R_cryst of 19.0% and an R_free of 23.8%. The dimeric protein is structurally similar to the structures of AtNTR-B from Arabidopsis thaliana and other known low-molecular-weight NTRs. However, the relative position of the two NTR cofactor-binding domains, the FAD and the NADPH domains, is not the same. The NADPH domain is rotated by 25° and bent by a 38% closure relative to the FAD domain in comparison with AtNTR-B. The structure may represent an intermediate between the two conformations described previously: the flavin-oxidizing (FO) and the flavin-reducing (FR) conformations. Here, analysis of interdomain contacts as well as phylogenetic studies lead to the proposal of a new reaction scheme in which NTR–Trx interactions mediate the FO to FR transformation.
MAMPs and MIMPs: proposed classifications for inducers of innate immunity.

PubMed

Mackey, David; McFall, Aidan J

2006-09-01

Plants encode a sophisticated innate immune system. Resistance against potential pathogens often relies on active responses. Prerequisite to the induction of defences is recognition of the pathogenic threat. Significant advances have been made in our understanding of the non-self molecules that are recognized by plants and the means by which plants perceive them. Established terms describing these recognition events, including microbe-associated molecular pattern (MAMP), MAMP-receptor, effector, and resistance (R) protein, need clarification to represent our current knowledge adequately. In this review we propose criteria to classify inducers of plant defence as either MAMPs or microbe-induced molecular patterns (MIMPs). We refine the definition of MAMP to mean a molecular sequence or structure in ANY pathogen-derived molecule that is perceived via direct interaction with a host defence receptor. MIMPs are modifications of host-derived molecules that are induced by an intrinsic activity of a pathogen-derived effector and are perceived by a host defence receptor. MAMP-receptors have previously been classified separately from R-proteins as a discrete class of surveillance molecules. However, MAMP-receptors and R-proteins cannot be distinguished on the basis of their protein structures or their induced responses. We propose that MAMP-receptors and MIMP-receptors are each a subset of R-proteins. Although our review is based on examples from plant pathogens and plants, the principles discussed might prove applicable to other organisms.
A template-based approach for parallel hexahedral two-refinement

DOE PAGES

Owen, Steven J.; Shih, Ryan M.; Ernst, Corey D.

2016-10-17

Here, we provide a template-based approach for generating locally refined all-hex meshes. We focus specifically on refinement of initially structured grids utilizing a 2-refinement approach where uniformly refined hexes are subdivided into eight child elements. The refinement algorithm consists of identifying marked nodes that are used as the basis for a set of four simple refinement templates. The target application for 2-refinement is a parallel grid-based all-hex meshing tool for high performance computing in a distributed environment. The result is a parallel consistent locally refined mesh requiring minimal communication and where minimum mesh quality is greater than scaled Jacobian 0.3 prior to smoothing.
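The uniform case mentioned in this record, where a hex is subdivided into eight child elements, can be sketched with trilinear interpolation over the parent's eight corner nodes. The corner ordering and the function names below are assumptions made for illustration. The sketch covers only the uniform split and does not attempt the transition templates the authors describe for the boundary between refined and unrefined regions.

```python
import numpy as np

# Assumed corner ordering: parametric (xi, eta, zeta) position of each hex node.
CORNERS = np.array([
    [0, 0, 0], [1, 0, 0], [1, 1, 0], [0, 1, 0],
    [0, 0, 1], [1, 0, 1], [1, 1, 1], [0, 1, 1],
], dtype=float)

def trilinear(hex_nodes, p):
    """Map a parametric point p in [0, 1]^3 to physical space."""
    p = np.asarray(p, dtype=float)
    # Weight of each corner: product over axes of p or (1 - p).
    weights = np.prod(np.where(CORNERS == 1.0, p, 1.0 - p), axis=1)
    return weights @ np.asarray(hex_nodes, dtype=float)

def subdivide_hex(hex_nodes):
    """Split one hex, given as an (8, 3) array, into eight children: (8, 8, 3)."""
    children = []
    for offset in CORNERS:  # each child occupies one octant of the parent
        params = (offset + CORNERS) / 2.0
        children.append([trilinear(hex_nodes, q) for q in params])
    return np.array(children)
```

Each child reuses the parent's corner ordering, so the same routine can be applied again to a child for a further level of uniform refinement.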
Bulk Nanolaminated Nickel: Preparation, Microstructure, Mechanical Property, and Thermal Stability

NASA Astrophysics Data System (ADS)

Liu, Fan; Yuan, Hao; Goel, Sunkulp; Liu, Ying; Wang, Jing Tao

2018-02-01

A bulk nanolaminated (NL) structure with distinctive fractions of low- and high-angle grain boundaries (f_LAGBs and f_HAGBs) is produced in pure nickel, through a two-step process of primary grain refinement by equal-channel angular pressing (ECAP), followed by a secondary geometrical refinement via liquid nitrogen rolling (LNR). The lamellar boundary spacings of 2N and 4N nickel are refined to 40 and 70 nm, respectively, and the yield strength of the NL structure in 2N nickel reaches 1.5 GPa. The impacts of the deformation path, material purity, grain boundary (GB) misorientation, and energy on the microstructure, refinement ability, mechanical strength, and thermal stability are investigated to understand the inherent governing mechanisms. GB migration is the main restoration mechanism limiting the refinement of an NL structure in 4N nickel, while in 2N nickel, shear banding occurs and mediates one-fifth of the total true normal rolling strain at the mesoscale, restricting further refinement. Three typical structures [ultrafine grained (UFG), NL with low f_LAGBs, and NL with high f_LAGBs] obtained through three different combinations of ECAP and LNR were studied by isochronal annealing for 1 hour at temperatures ranging from 433 K to 973 K (160 °C to 700 °C). Higher thermal stability in the NL structure with high f_LAGBs is shown by a 50 K (50 °C) delay in the initiation temperature of recrystallization. Based on calculations and analyses of the stored energies of deformed structures from strain distribution, as characterized by kernel average misorientation (KAM), and from GB misorientations, higher thermal stability is attributed to high f_LAGBs in this type of NL structure. This is confirmed by a slower change in the microstructure, as revealed by characterizing its annealing kinetics using KAM maps.
Room temperature structures beyond 1.5 Å by serial femtosecond crystallography

PubMed Central

Schmidt, Marius; Pande, Kanupriya; Basu, Shibom; Tenboer, Jason

2015-01-01

About 2.5 × 10^6 snapshots on microcrystals of photoactive yellow protein (PYP) from a recent serial femtosecond crystallographic (SFX) experiment were reanalyzed to maximum resolution. The resolution is pushed to 1.46 Å, and a PYP structural model is refined at that resolution. The result is compared to other PYP models determined at atomic resolution around 1 Å and better at the synchrotron. By comparing subtleties such as individual isotropic temperature factors and hydrogen bond lengths, we were able to assess the quality of the SFX data at that resolution. We also show that the determination of anisotropic temperature factor ellipsoids starts to become feasible with the SFX data at resolutions better than 1.5 Å. PMID:26798807
Structural Health Monitoring of Large Structures

NASA Technical Reports Server (NTRS)

Kim, Hyoung M.; Bartkowicz, Theodore J.; Smith, Suzanne Weaver; Zimmerman, David C.

1994-01-01

This paper describes a damage detection and health monitoring method that was developed for large space structures using on-orbit modal identification. After evaluating several existing model refinement and model reduction/expansion techniques, a new approach was developed to identify the location and extent of structural damage with a limited number of measurements. A general area of structural damage is first identified and, subsequently, a specific damaged structural component is located. This approach takes advantage of two different model refinement methods (optimal-update and design sensitivity) and two different model size matching methods (model reduction and eigenvector expansion). Performance of the proposed damage detection approach was demonstrated with test data from two different laboratory truss structures. This space technology can also be applied to structural inspection of aircraft, offshore platforms, oil tankers, bridges, and buildings. In addition, its applications to model refinement will improve the design of structural systems such as automobiles and electronic packaging.
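To make the terms "model reduction" and "eigenvector expansion" concrete, one classical pairing is Guyan (static) reduction with its matching static expansion. The block below is a textbook sketch only: it assumes the finite-element stiffness and mass matrices K and M are partitioned into measured (m) and unmeasured (s) degrees of freedom, and the abstract does not say which specific techniques the authors actually used.

```latex
% Partition into measured (m) and unmeasured (s) degrees of freedom (assumed notation)
K = \begin{bmatrix} K_{mm} & K_{ms} \\ K_{sm} & K_{ss} \end{bmatrix},
\qquad
T = \begin{bmatrix} I \\ -K_{ss}^{-1} K_{sm} \end{bmatrix}

% Model reduction: shrink the analytical model to the measured set
K_r = T^{\mathsf{T}} K T = K_{mm} - K_{ms} K_{ss}^{-1} K_{sm},
\qquad
M_r = T^{\mathsf{T}} M T

% Eigenvector expansion: estimate the unmeasured part of a measured mode shape
\phi_s \approx -K_{ss}^{-1} K_{sm}\, \phi_m
```

Either direction makes the test and analysis models the same size, which is what allows a measured mode shape to be compared with the model degree of freedom by degree of freedom.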
Knowledge-Building Activity Structures in Japanese Elementary Science Pedagogy

ERIC Educational Resources Information Center

Oshima, Jun; Oshima, Ritsuko; Murayama, Isao; Inagaki, Shigenori; Takenaka, Makiko; Yamamoto, Tomokazu; Yamaguchi, Etsuji; Nakayama, Hayashi

2006-01-01

The purpose of this study is to refine Japanese elementary science activity structures by using a CSCL approach to transform the classroom into a knowledge-building community. We report design studies on two science lessons in two consecutive years and describe the progressive refinement of the activity structures. Through comparisons of student…
Toward Mycobacterium tuberculosis DXR inhibitor design: homology modeling and molecular dynamics simulations

NASA Astrophysics Data System (ADS)

Singh, Nidhi; Avery, Mitchell A.; McCurdy, Christopher R.

2007-09-01

Mycobacterium tuberculosis 1-deoxy-D-xylulose-5-phosphate reductoisomerase (MtDXR) is a potential target for antitubercular chemotherapy. In the absence of its crystallographic structure, our aim was to develop a structural model of MtDXR. This will allow us to gain early insight into the structure and function of the enzyme and its likely binding to ligands and cofactors and thus facilitate structure-based inhibitor design. To achieve this goal, initial models of MtDXR were generated using MODELER. The best quality model was refined using a series of minimizations and molecular dynamics simulations. A protein-ligand complex was also developed from the initial homology model of the target protein by including information about the known ligand as spatial restraints and optimizing the mutual interactions between the ligand and the binding site. The final model was evaluated on the basis of its ability to explain data from several site-directed mutagenesis studies. Furthermore, a comparison of the homology model with the X-ray structure published in the final stages of the project shows excellent agreement and validates the approach. The knowledge gained from the current study should prove useful in the design and development of inhibitors as potential novel therapeutic agents against tuberculosis by either de novo drug design or virtual screening of large chemical databases.
Enabling X-ray free electron laser crystallography for challenging biological systems from a limited number of crystals

DOE PAGES

Uervirojnangkoorn, Monarin; Zeldin, Oliver B.; Lyubimov, Artem Y.; ...

2015-03-17

There is considerable potential for X-ray free electron lasers (XFELs) to enable determination of macromolecular crystal structures that are difficult to solve using current synchrotron sources. Prior XFEL studies often involved the collection of thousands to millions of diffraction images, in part due to limitations of data processing methods. We implemented a data processing system based on classical post-refinement techniques, adapted to specific properties of XFEL diffraction data. When applied to XFEL data from three different proteins collected using various sample delivery systems and XFEL beam parameters, our method improved the quality of the diffraction data as well as the resulting refined atomic models and electron density maps. Moreover, the number of observations for a reflection necessary to assemble an accurate data set could be reduced to a few observations. In conclusion, these developments will help expand the applicability of XFEL crystallography to challenging biological systems, including cases where sample is limited.
Enabling X-ray free electron laser crystallography for challenging biological systems from a limited number of crystals

PubMed Central

Uervirojnangkoorn, Monarin; Zeldin, Oliver B; Lyubimov, Artem Y; Hattne, Johan; Brewster, Aaron S; Sauter, Nicholas K; Brunger, Axel T; Weis, William I

2015-01-01

There is considerable potential for X-ray free electron lasers (XFELs) to enable determination of macromolecular crystal structures that are difficult to solve using current synchrotron sources. Prior XFEL studies often involved the collection of thousands to millions of diffraction images, in part due to limitations of data processing methods. We implemented a data processing system based on classical post-refinement techniques, adapted to specific properties of XFEL diffraction data. When applied to XFEL data from three different proteins collected using various sample delivery systems and XFEL beam parameters, our method improved the quality of the diffraction data as well as the resulting refined atomic models and electron density maps. Moreover, the number of observations for a reflection necessary to assemble an accurate data set could be reduced to a few observations. These developments will help expand the applicability of XFEL crystallography to challenging biological systems, including cases where sample is limited. DOI: http://dx.doi.org/10.7554/eLife.05421.001 PMID:25781634
The 1.1 Å resolution structure of a periplasmic phosphate-binding protein from Stenotrophomonas maltophilia: a crystallization contaminant identified by molecular replacement using the entire Protein Data Bank.

PubMed

Keegan, Ronan; Waterman, David G; Hopper, David J; Coates, Leighton; Taylor, Graham; Guo, Jingxu; Coker, Alun R; Erskine, Peter T; Wood, Steve P; Cooper, Jonathan B

2016-08-01

During efforts to crystallize the enzyme 2,4-dihydroxyacetophenone dioxygenase (DAD) from Alcaligenes sp. 4HAP, a small number of strongly diffracting protein crystals were obtained after two years of crystal growth in one condition. The crystals diffracted synchrotron radiation to almost 1.0 Å resolution and were, until recently, assumed to be formed by the DAD protein. However, when another crystal form of this enzyme was eventually solved at lower resolution, molecular replacement using this new structure as the search model did not give a convincing solution with the original atomic resolution data set. Hence, it was considered that these crystals might have arisen from a protein impurity, although molecular replacement using the structures of common crystallization contaminants as search models again failed. A script to perform molecular replacement using MOLREP in which the first chain of every structure in the PDB was used as a search model was run on a multi-core cluster. This identified a number of prokaryotic phosphate-binding proteins as scoring highly in the MOLREP peak lists. Calculation of an electron-density map at 1.1 Å resolution based on the solution obtained with PDB entry 2q9t allowed most of the amino acids to be identified visually and built into the model. A BLAST search then indicated that the molecule was most probably a phosphate-binding protein from Stenotrophomonas maltophilia (UniProt ID B4SL31; gene ID Smal_2208), and fitting of the corresponding sequence to the atomic resolution map fully corroborated this. Proteins in this family have been linked to the virulence of antibiotic-resistant strains of pathogenic bacteria and with biofilm formation. The structure of the S. maltophilia protein has been refined to an R factor of 10.15% and an Rfree of 12.46% at 1.1 Å resolution. The molecule adopts the type II periplasmic binding protein (PBP) fold with a number of extensively elaborated loop regions. 
A fully dehydrated phosphate anion is bound tightly between the two domains of the protein and interacts with conserved residues and a number of helix dipoles.
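The R factor and Rfree quoted above are residuals between observed and calculated structure-factor amplitudes, with Rfree computed only over a test set of reflections held out of refinement. The following is a minimal sketch of that definition, not the refinement program's implementation: the function names are hypothetical, the amplitudes in the usage example are made up, and the calculated amplitudes are assumed to be already on the same scale as the observed ones (real programs also fit a scale factor).

```python
def r_factor(f_obs, f_calc):
    """Return sum(||Fobs| - |Fcalc||) / sum(|Fobs|) for paired amplitudes.

    Assumes f_calc is already scaled to f_obs (no scale factor is fitted here).
    """
    if len(f_obs) != len(f_calc):
        raise ValueError("f_obs and f_calc must have the same length")
    denominator = sum(abs(fo) for fo in f_obs)
    if denominator == 0:
        raise ValueError("sum of observed amplitudes is zero")
    numerator = sum(abs(abs(fo) - abs(fc)) for fo, fc in zip(f_obs, f_calc))
    return numerator / denominator


def r_work_and_free(f_obs, f_calc, free_flags):
    """Split reflections by free_flags and return (R_work, R_free).

    free_flags[i] is True for reflections excluded from refinement.
    """
    work = [(fo, fc) for fo, fc, free in zip(f_obs, f_calc, free_flags) if not free]
    test = [(fo, fc) for fo, fc, free in zip(f_obs, f_calc, free_flags) if free]
    r_work = r_factor([fo for fo, _ in work], [fc for _, fc in work])
    r_free = r_factor([fo for fo, _ in test], [fc for _, fc in test])
    return r_work, r_free


if __name__ == "__main__":
    # Made-up amplitudes purely for illustration.
    observed = [100.0, 50.0, 25.0, 80.0]
    calculated = [90.0, 55.0, 25.0, 100.0]
    flags = [False, False, False, True]
    print(r_work_and_free(observed, calculated, flags))
```

Because the test-set reflections never influence the model, a gap between the two values is commonly read as a sign of overfitting; the values reported in the abstract differ by only a little over two percentage points.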
On-column refolding of recombinant human interleukin-4 from inclusion bodies.

PubMed

Razeghifard, M Reza

2004-09-01

Interleukin-4 (IL4) is a multifunctional cytokine which plays a key role in the immune system. Several antagonists/agonists of IL4 are reported through mutagenesis studies, but their solution structural studies using nuclear magnetic resonance (NMR) spectroscopy are hindered as milligram quantities of isotopically labeled protein are required for structural refinements. In this work, a His-tagged recombinant form of human IL4 was overexpressed in Escherichia coli under the control of a T7 promoter. The resulting inclusion bodies were separated from cellular debris by centrifugation and solubilized by 6 M guanidine-HCl in the presence of reducing agents. The denatured IL4 was immobilized on Ni2+-fractogel beads and refolded in a single chromatographic step by gradual removal of denaturant. This protocol yielded 15-20 mg of isotope-enriched protein from 1 L of culture grown in minimal medium. The refolded protein was highly pure and was correctly folded as judged by its two-dimensional NMR spectrum. To show the successful application of this refolding protocol to IL4 variants, 15N-labeled Y124D-IL4 was also prepared and its first two-dimensional NMR spectrum is presented.
Advanced techniques in placental biology -- workshop report.

PubMed

Nelson, D M; Sadovsky, Y; Robinson, J M; Croy, B A; Rice, G; Kniss, D A

2006-04-01

Major advances in placental biology have been realized as new technologies have been developed and existing methods have been refined in many areas of biological research. Classical anatomy and whole-organ physiology tools once used to analyze placental structure and function have been supplanted by more sophisticated techniques adapted from molecular biology, proteomics, and computational biology and bioinformatics. In addition, significant refinements in morphological study of the placenta and its constituent cell types have improved our ability to assess form and function in a highly integrated manner. To offer an overview of modern technologies used by investigators to study the placenta, this workshop, "Advanced techniques in placental biology", assembled experts who discussed fundamental principles and real-time examples of four separate methodologies. Y. Sadovsky presented the principles of microRNA function as an endogenous mechanism of gene regulation. J. Robinson demonstrated the utility of correlative microscopy, in which light-level and transmission electron microscopy are combined to provide cellular and subcellular views of placental cells. A. Croy provided a lecture on the use of microdissection techniques, which are invaluable for isolating very small subsets of cell types for molecular analysis. Finally, G. Rice presented an overview of methods for profiling complex protein mixtures within tissue and/or fluid samples that, when refined, will offer databases that will underpin a systems approach to modern trophoblast biology.
FF12MC: A revised AMBER forcefield and new protein simulation protocol

PubMed Central

2016-01-01

Specialized to simulate proteins in molecular dynamics (MD) simulations with explicit solvation, FF12MC is a combination of a new protein simulation protocol employing uniformly reduced atomic masses by tenfold and a revised AMBER forcefield FF99 with (i) shortened C—H bonds, (ii) removal of torsions involving a nonperipheral sp3 atom, and (iii) reduced 1–4 interaction scaling factors of torsions ϕ and ψ. This article reports that in multiple, distinct, independent, unrestricted, unbiased, isobaric–isothermal, and classical MD simulations FF12MC can (i) simulate the experimentally observed flipping between left‐ and right‐handed configurations for C14–C38 of BPTI in solution, (ii) autonomously fold chignolin, CLN025, and Trp‐cage with folding times that agree with the experimental values, (iii) simulate subsequent unfolding and refolding of these miniproteins, and (iv) achieve a robust Z score of 1.33 for refining protein models TMR01, TMR04, and TMR07. By comparison, the latest general‐purpose AMBER forcefield FF14SB locks the C14–C38 bond to the right‐handed configuration in solution under the same protein simulation conditions. Statistical survival analysis shows that FF12MC folds chignolin and CLN025 in isobaric–isothermal MD simulations 2–4 times faster than FF14SB under the same protein simulation conditions. These results suggest that FF12MC may be used for protein simulations to study kinetics and thermodynamics of miniprotein folding as well as protein structure and dynamics. Proteins 2016; 84:1490–1516. © 2016 The Authors Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc. PMID:27348292
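One way to see why uniformly reducing atomic masses speeds up sampling: in plain Newtonian dynamics on a fixed potential, dividing every mass by a factor λ compresses the time axis by √λ, so a tenfold reduction makes the same motion happen about √10 ≈ 3.16 times faster. The toy sketch below checks this on a one-dimensional harmonic oscillator integrated with velocity Verlet. It is an illustration under that simplifying assumption only; it is not the FF12MC protocol, it ignores thermostats, barostats, and solvent, and the function name and parameter values are hypothetical.

```python
import math


def half_period(mass, force_constant=1.0, dt=1e-3):
    """Integrate x'' = -(k/m) x from x=1, v=0 with velocity Verlet.

    Returns the time at which the velocity first returns to zero from
    below, which is half of the oscillation period.
    """
    x, v = 1.0, 0.0
    a = -force_constant * x / mass
    t = 0.0
    while True:
        v_half = v + 0.5 * dt * a          # half-step velocity update
        x = x + dt * v_half                # full-step position update
        a = -force_constant * x / mass     # force at the new position
        v_new = v_half + 0.5 * dt * a      # complete the velocity update
        t += dt
        if v < 0.0 <= v_new:
            # Linearly interpolate the zero crossing inside the last step.
            return (t - dt) + dt * (-v) / (v_new - v)
        v = v_new


if __name__ == "__main__":
    normal = half_period(mass=1.0)
    light = half_period(mass=0.1)  # tenfold mass reduction
    print("speed-up:", normal / light, "expected:", math.sqrt(10.0))
```

The measured ratio of half-periods matches √10 to within integration error, which is the pure mass-scaling effect; any additional speed-up or slowdown in a real simulation depends on the forcefield and simulation conditions.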
Exploring protein interiors: the role of a buried histidine in the KH module fold.

PubMed

Fraternali, F; Amodeo, P; Musco, G; Nilges, M; Pastore, A

1999-03-01

The K-homology (KH) module is a novel RNA-binding motif. The structures of a representative KH motif from vigilin (vig-KH6) and of the first KH domain of fmr1 have been recently solved by nuclear magnetic resonance (NMR) and automated assignment-refinement techniques (ARIA). While a hydrophobic residue is found at position 21 in most of the KH modules, a buried His is conserved in all the 15 KH repeats of vigilin. This position must therefore have a key structural role in stabilizing the hydrophobic core. In the present work, we have addressed the following questions in order to obtain a detailed description of the role of His 21: i) what is the exact role of the histidine in the hydrophobic core of vig-KH6? ii) can we define the interactions that allow a conserved buried position to be occupied by a histidine both in vig-KH6 and in the whole vigilin KH sub-family? iii) how are the structure and stability of vig-KH6 influenced by the state of protonation of this histidine? To answer these questions, we have carried out an extensive refinement of the vig-KH6 structure using both an improved ARIA protocol starting from different initial structures and successively running restrained and unrestrained trajectories in water. An analysis of the stability of secondary structural elements, solvent accessibility, and hydrogen bonding patterns allows hypotheses to be formed about the structural role of residue His 21 and the interactions that this residue forms with the environment. The importance of the protonation state of His 21 on the stability of the KH fold was addressed and validated by experimental results.
Familial temporal lobe epilepsy with psychic auras associated with a novel LGI1 mutation.

PubMed

Striano, P; Busolin, G; Santulli, L; Leonardi, E; Coppola, A; Vitiello, L; Rigon, L; Michelucci, R; Tosatto, S C E; Striano, S; Nobile, C

2011-03-29

Autosomal dominant lateral temporal epilepsy (ADLTE) is characterized by focal seizures with auditory features or aphasia. Mutations in the LGI1 gene have been reported in up to 50% of ADLTE pedigrees. We report a family with temporal lobe epilepsy characterized by psychic symptoms associated with a novel LGI1 mutation. All participants were personally interviewed and underwent neurologic examination and video-EEG recordings. LGI1 exons were sequenced by standard methods. Mutant cDNA was transfected into human embryonic kidney 293 cells; both cell lysates and media were analyzed by Western blot. In silico modeling of the Lgi1 protein EPTP domain was carried out using the structure of WD repeat protein and manually refined. Three affected family members were ascertained, 2 of whom had temporal epilepsy with psychic symptoms (déjà vu, fear) but no auditory or aphasic phenomena, while the third had complex partial seizures without any aura. In all patients, we found a novel LGI1 mutation, Arg407Cys, which did not hamper protein secretion in vitro. Mapping of the mutation on a 3-dimensional protein model showed that this mutation does not induce large structural rearrangements but could destabilize interactions of Lgi1 with target proteins. The Arg407Cys is the first mutation with no effect on Lgi1 protein secretion. The uncommon, isolated psychic symptoms associated with it suggest that ADLTE encompasses a wider range of auras of temporal origin than hitherto reported.
Conformational variability of the stationary phase survival protein E from Xylella fastidiosa revealed by X-ray crystallography, small-angle X-ray scattering studies, and normal mode analysis.

PubMed

Machado, Agnes Thiane Pereira; Fonseca, Emanuella Maria Barreto; Reis, Marcelo Augusto Dos; Saraiva, Antonio Marcos; Santos, Clelton Aparecido Dos; de Toledo, Marcelo Augusto Szymanski; Polikarpov, Igor; de Souza, Anete Pereira; Aparicio, Ricardo; Iulek, Jorge

2017-10-01

Xylella fastidiosa is a xylem-limited bacterium that infects a wide variety of plants. Stationary phase survival protein E is classified as a nucleotidase, which is expressed when bacterial cells are in the stationary growth phase and subjected to environmental stresses. Here, we report four refined X-ray structures of this protein from X. fastidiosa in four different crystal forms in the presence and/or absence of the substrate 3'-AMP. In all chains, the conserved loop verified in family members assumes a closed conformation in either condition. Therefore, the enzymatic mechanism for the target protein might be different from that of its homologs. Two crystal forms exhibit two monomers whereas the other two show four monomers in the asymmetric unit. While the biological unit has been characterized as a tetramer, differences in their sizes and symmetry are remarkable. Four conformers identified by Small-Angle X-ray Scattering (SAXS) in a ligand-free solution are related to the low frequency normal modes of the crystallographic structures associated with rigid body-like protomer arrangements responsible for the longitudinal and symmetric adjustments between tetramers. When the substrate is present in solution, only two conformers are selected. The most prominent conformer for each case is associated with a normal mode able to elongate the protein by moving apart two dimers. To our knowledge, this work was the first investigation based on the normal modes that analyzed the quaternary structure variability for an enzyme of the SurE family followed by crystallography and SAXS validation. The combined results raise new directions to study allosteric features of XfSurE protein. © 2017 Wiley Periodicals, Inc.
Structure refinement of Zn and Pr-doped Y-Ba-Cu-oxides

NASA Astrophysics Data System (ADS)

Naik, M. S.; Sarode, P. R.; Priolkar, K. R.; Prabhu, R. B.

2018-05-01

Superconducting compounds of composition Y0.9Pr0.1Ba2[Cu1-yZny]3O7-δ (0 ≤ y ≤ 0.10) have been synthesized. The structure of these materials has been studied using powder X-ray diffraction, and the structures were refined using the Rietveld procedure. It has been shown that all these compounds crystallize in an orthorhombic structure with a slight change in the c parameter. An increase in the O(2) parameter and a decrease in the O(3) parameter suggest changes in the Cu-O2 plane of these orthorhombic materials on Zn substitution.
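A small change in the c parameter of an orthorhombic cell shows up in a powder pattern as a shift of reflections with a nonzero l index. The sketch below computes interplanar spacings from 1/d² = h²/a² + k²/b² + l²/c² and converts them to 2θ with Bragg's law. It is an illustration only, not a Rietveld refinement: the function names are hypothetical, the lattice parameters are rough illustrative values rather than results from this study, and Cu Kα1 radiation is assumed because the abstract does not state the wavelength.

```python
import math

CU_K_ALPHA1 = 1.5406  # wavelength in Å; assumed, not stated in the abstract


def d_spacing(h, k, l, a, b, c):
    """Interplanar spacing (Å) of reflection (h k l) in an orthorhombic cell."""
    inv_d_squared = (h / a) ** 2 + (k / b) ** 2 + (l / c) ** 2
    if inv_d_squared == 0:
        raise ValueError("(0 0 0) is not a reflection")
    return 1.0 / math.sqrt(inv_d_squared)


def two_theta_deg(h, k, l, a, b, c, wavelength=CU_K_ALPHA1):
    """Bragg angle 2θ in degrees, from λ = 2 d sin θ."""
    sin_theta = wavelength / (2.0 * d_spacing(h, k, l, a, b, c))
    if sin_theta > 1.0:
        raise ValueError("reflection is not reachable at this wavelength")
    return 2.0 * math.degrees(math.asin(sin_theta))


if __name__ == "__main__":
    # Illustrative cell only (Å); not values refined in the study.
    a, b, c = 3.82, 3.89, 11.68
    for c_value in (c, c + 0.02):
        print(c_value, round(two_theta_deg(0, 0, 6, a, b, c_value), 3))
```

Lengthening c moves the (0 0 6) peak to a lower angle while leaving (h k 0) reflections untouched, which is why l-dependent peak shifts are a convenient check on a refined c parameter.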
