protein folding problem: Topics by Science.gov

Sample records for protein folding problem

SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition

PubMed Central

Melvin, Iain; Ie, Eugene; Kuang, Rui; Weston, Jason; Stafford, William Noble; Leslie, Christina

2007-01-01

Background Predicting a protein's structural class from its amino acid sequence is a fundamental problem in computational biology. Much recent work has focused on developing new representations for protein sequences, called string kernels, for use with support vector machine (SVM) classifiers. However, while some of these approaches exhibit state-of-the-art performance at the binary protein classification problem, i.e. discriminating between a particular protein class and all other classes, few of these studies have addressed the real problem of multi-class superfamily or fold recognition. Moreover, there are only limited software tools and systems for SVM-based protein classification available to the bioinformatics community. Results We present a new multi-class SVM-based protein fold and superfamily recognition system and web server called SVM-Fold, which can be found at . Our system uses an efficient implementation of a state-of-the-art string kernel for sequence profiles, called the profile kernel, where the underlying feature representation is a histogram of inexact matching k-mer frequencies. We also employ a novel machine learning approach to solve the difficult multi-class problem of classifying a sequence of amino acids into one of many known protein structural classes. Binary one-vs-the-rest SVM classifiers that are trained to recognize individual structural classes yield prediction scores that are not comparable, so that standard "one-vs-all" classification fails to perform well. Moreover, SVMs for classes at different levels of the protein structural hierarchy may make useful predictions, but one-vs-all does not try to combine these multiple predictions. To deal with these problems, our method learns relative weights between one-vs-the-rest classifiers and encodes information about the protein structural hierarchy for multi-class prediction. In large-scale benchmark results based on the SCOP database, our code weighting approach significantly improves on the standard one-vs-all method for both the superfamily and fold prediction in the remote homology setting and on the fold recognition problem. Moreover, our code weight learning algorithm strongly outperforms nearest-neighbor methods based on PSI-BLAST in terms of prediction accuracy on every structure classification problem we consider. Conclusion By combining state-of-the-art SVM kernel methods with a novel multi-class algorithm, the SVM-Fold system delivers efficient and accurate protein fold and superfamily recognition. PMID:17570145
Protein folding, protein structure and the origin of life: Theoretical methods and solutions of dynamical problems

NASA Technical Reports Server (NTRS)

Weaver, D. L.

1982-01-01

Theoretical methods and solutions of the dynamics of protein folding, protein aggregation, protein structure, and the origin of life are discussed. The elements of a dynamic model representing the initial stages of protein folding are presented. The calculation and experimental determination of the model parameters are discussed. The use of computer simulation for modeling protein folding is considered.
Evolution, Energy Landscapes and the Paradoxes of Protein Folding

PubMed Central

Wolynes, Peter G.

2014-01-01

Protein folding has been viewed as a difficult problem of molecular self-organization. The search problem involved in folding however has been simplified through the evolution of folding energy landscapes that are funneled. The funnel hypothesis can be quantified using energy landscape theory based on the minimal frustration principle. Strong quantitative predictions that follow from energy landscape theory have been widely confirmed both through laboratory folding experiments and from detailed simulations. Energy landscape ideas also have allowed successful protein structure prediction algorithms to be developed. The selection constraint of having funneled folding landscapes has left its imprint on the sequences of existing protein structural families. Quantitative analysis of co-evolution patterns allows us to infer the statistical characteristics of the folding landscape. These turn out to be consistent with what has been obtained from laboratory physicochemical folding experiments signalling a beautiful confluence of genomics and chemical physics. PMID:25530262
Folding and Stabilization of Native-Sequence-Reversed Proteins

PubMed Central

Zhang, Yuanzhao; Weber, Jeffrey K; Zhou, Ruhong

2016-01-01

Though the problem of sequence-reversed protein folding is largely unexplored, one might speculate that reversed native protein sequences should be significantly more foldable than purely random heteropolymer sequences. In this article, we investigate how the reverse-sequences of native proteins might fold by examining a series of small proteins of increasing structural complexity (α-helix, β-hairpin, α-helix bundle, and α/β-protein). Employing a tandem protein structure prediction algorithmic and molecular dynamics simulation approach, we find that the ability of reverse sequences to adopt native-like folds is strongly influenced by protein size and the flexibility of the native hydrophobic core. For β-hairpins with reverse-sequences that fail to fold, we employ a simple mutational strategy for guiding stable hairpin formation that involves the insertion of amino acids into the β-turn region. This systematic look at reverse sequence duality sheds new light on the problem of protein sequence-structure mapping and may serve to inspire new protein design and protein structure prediction protocols. PMID:27113844
Folding and Stabilization of Native-Sequence-Reversed Proteins

NASA Astrophysics Data System (ADS)

Zhang, Yuanzhao; Weber, Jeffrey K.; Zhou, Ruhong

2016-04-01

Though the problem of sequence-reversed protein folding is largely unexplored, one might speculate that reversed native protein sequences should be significantly more foldable than purely random heteropolymer sequences. In this article, we investigate how the reverse-sequences of native proteins might fold by examining a series of small proteins of increasing structural complexity (α-helix, β-hairpin, α-helix bundle, and α/β-protein). Employing a tandem protein structure prediction algorithmic and molecular dynamics simulation approach, we find that the ability of reverse sequences to adopt native-like folds is strongly influenced by protein size and the flexibility of the native hydrophobic core. For β-hairpins with reverse-sequences that fail to fold, we employ a simple mutational strategy for guiding stable hairpin formation that involves the insertion of amino acids into the β-turn region. This systematic look at reverse sequence duality sheds new light on the problem of protein sequence-structure mapping and may serve to inspire new protein design and protein structure prediction protocols.
Recent developments in the theory of protein folding: searching for the global energy minimum.

PubMed

Scheraga, H A

1996-04-16

Statistical mechanical theories and computer simulation are being used to gain an understanding of the fundamental features of protein folding. A major obstacle in the computation of protein structures is the multiple-minima problem arising from the existence of many local minima in the multidimensional energy landscape of the protein. This problem has been surmounted for small open-chain and cyclic peptides, and for regular-repeating sequences of models of fibrous proteins. Progress is being made in resolving this problem for globular proteins.
A stoichiometry driven universal spatial organization of backbones of folded proteins: are there Chargaff's rules for protein folding?

PubMed

Mittal, A; Jayaram, B; Shenoy, Sandhya; Bawa, Tejdeep Singh

2010-10-01

Protein folding is at least a six decade old problem, since the times of Pauling and Anfinsen. However, rules of protein folding remain elusive till date. In this work, rigorous analyses of several thousand crystal structures of folded proteins reveal a surprisingly simple unifying principle of backbone organization in protein folding. We find that protein folding is a direct consequence of a narrow band of stoichiometric occurrences of amino-acids in primary sequences, regardless of the size and the fold of a protein. We observe that "preferential interactions" between amino-acids do not drive protein folding, contrary to all prevalent views. We dedicate our discovery to the seminal contribution of Chargaff which was one of the major keys to elucidation of the stoichiometry-driven spatially organized double helical structure of DNA.
Soft Computing Techniques for the Protein Folding Problem on High Performance Computing Architectures.

PubMed

Llanes, Antonio; Muñoz, Andrés; Bueno-Crespo, Andrés; García-Valverde, Teresa; Sánchez, Antonia; Arcas-Túnez, Francisco; Pérez-Sánchez, Horacio; Cecilia, José M

2016-01-01

The protein-folding problem has been extensively studied during the last fifty years. The understanding of the dynamics of global shape of a protein and the influence on its biological function can help us to discover new and more effective drugs to deal with diseases of pharmacological relevance. Different computational approaches have been developed by different researchers in order to foresee the threedimensional arrangement of atoms of proteins from their sequences. However, the computational complexity of this problem makes mandatory the search for new models, novel algorithmic strategies and hardware platforms that provide solutions in a reasonable time frame. We present in this revision work the past and last tendencies regarding protein folding simulations from both perspectives; hardware and software. Of particular interest to us are both the use of inexact solutions to this computationally hard problem as well as which hardware platforms have been used for running this kind of Soft Computing techniques.
Computational Modeling of Proteins based on Cellular Automata: A Method of HP Folding Approximation.

PubMed

Madain, Alia; Abu Dalhoum, Abdel Latif; Sleit, Azzam

2018-06-01

The design of a protein folding approximation algorithm is not straightforward even when a simplified model is used. The folding problem is a combinatorial problem, where approximation and heuristic algorithms are usually used to find near optimal folds of proteins primary structures. Approximation algorithms provide guarantees on the distance to the optimal solution. The folding approximation approach proposed here depends on two-dimensional cellular automata to fold proteins presented in a well-studied simplified model called the hydrophobic-hydrophilic model. Cellular automata are discrete computational models that rely on local rules to produce some overall global behavior. One-third and one-fourth approximation algorithms choose a subset of the hydrophobic amino acids to form H-H contacts. Those algorithms start with finding a point to fold the protein sequence into two sides where one side ignores H's at even positions and the other side ignores H's at odd positions. In addition, blocks or groups of amino acids fold the same way according to a predefined normal form. We intend to improve approximation algorithms by considering all hydrophobic amino acids and folding based on the local neighborhood instead of using normal forms. The CA does not assume a fixed folding point. The proposed approach guarantees one half approximation minus the H-H endpoints. This lower bound guaranteed applies to short sequences only. This is proved as the core and the folds of the protein will have two identical sides for all short sequences.
Dynamics of protein folding: probing the kinetic network of folding-unfolding transitions with experiment and theory.

PubMed

Buchner, Ginka S; Murphy, Ronan D; Buchete, Nicolae-Viorel; Kubelka, Jan

2011-08-01

The problem of spontaneous folding of amino acid chains into highly organized, biologically functional three-dimensional protein structures continues to challenge the modern science. Understanding how proteins fold requires characterization of the underlying energy landscapes as well as the dynamics of the polypeptide chains in all stages of the folding process. In recent years, important advances toward these goals have been achieved owing to the rapidly growing interdisciplinary interest and significant progress in both experimental techniques and theoretical methods. Improvements in the experimental time resolution led to determination of the timescales of the important elementary events in folding, such as formation of secondary structure and tertiary contacts. Sensitive single molecule methods made possible probing the distributions of the unfolded and folded states and following the folding reaction of individual protein molecules. Discovery of proteins that fold in microseconds opened the possibility of atomic-level theoretical simulations of folding and their direct comparisons with experimental data, as well as of direct experimental observation of the barrier-less folding transition. The ultra-fast folding also brought new questions, concerning the intrinsic limits of the folding rates and experimental signatures of barrier-less "downhill" folding. These problems will require novel approaches for even more detailed experimental investigations of the folding dynamics as well as for the analysis of the folding kinetic data. For theoretical simulations of folding, a main challenge is how to extract the relevant information from overwhelmingly detailed atomistic trajectories. New theoretical methods have been devised to allow a systematic approach towards a quantitative analysis of the kinetic network of folding-unfolding transitions between various configuration states of a protein, revealing the transition states and the associated folding pathways at multiple levels, from atomistic to coarse-grained representations. This article is part of a Special Issue entitled: Protein Dynamics: Experimental and Computational Approaches. Copyright © 2010 Elsevier B.V. All rights reserved.
Protein classification using sequential pattern mining.

PubMed

Exarchos, Themis P; Papaloukas, Costas; Lampros, Christos; Fotiadis, Dimitrios I

2006-01-01

Protein classification in terms of fold recognition can be employed to determine the structural and functional properties of a newly discovered protein. In this work sequential pattern mining (SPM) is utilized for sequence-based fold recognition. One of the most efficient SPM algorithms, cSPADE, is employed for protein primary structure analysis. Then a classifier uses the extracted sequential patterns for classifying proteins of unknown structure in the appropriate fold category. The proposed methodology exhibited an overall accuracy of 36% in a multi-class problem of 17 candidate categories. The classification performance reaches up to 65% when the three most probable protein folds are considered.
Improving Protein Fold Recognition by Deep Learning Networks.

PubMed

Jo, Taeho; Hou, Jie; Eickholt, Jesse; Cheng, Jianlin

2015-12-04

For accurate recognition of protein folds, a deep learning network method (DN-Fold) was developed to predict if a given query-template protein pair belongs to the same structural fold. The input used stemmed from the protein sequence and structural features extracted from the protein pair. We evaluated the performance of DN-Fold along with 18 different methods on Lindahl's benchmark dataset and on a large benchmark set extracted from SCOP 1.75 consisting of about one million protein pairs, at three different levels of fold recognition (i.e., protein family, superfamily, and fold) depending on the evolutionary distance between protein sequences. The correct recognition rate of ensembled DN-Fold for Top 1 predictions is 84.5%, 61.5%, and 33.6% and for Top 5 is 91.2%, 76.5%, and 60.7% at family, superfamily, and fold levels, respectively. We also evaluated the performance of single DN-Fold (DN-FoldS), which showed the comparable results at the level of family and superfamily, compared to ensemble DN-Fold. Finally, we extended the binary classification problem of fold recognition to real-value regression task, which also show a promising performance. DN-Fold is freely available through a web server at http://iris.rnet.missouri.edu/dnfold.
Improving Protein Fold Recognition by Deep Learning Networks

NASA Astrophysics Data System (ADS)

Jo, Taeho; Hou, Jie; Eickholt, Jesse; Cheng, Jianlin

2015-12-01

For accurate recognition of protein folds, a deep learning network method (DN-Fold) was developed to predict if a given query-template protein pair belongs to the same structural fold. The input used stemmed from the protein sequence and structural features extracted from the protein pair. We evaluated the performance of DN-Fold along with 18 different methods on Lindahl’s benchmark dataset and on a large benchmark set extracted from SCOP 1.75 consisting of about one million protein pairs, at three different levels of fold recognition (i.e., protein family, superfamily, and fold) depending on the evolutionary distance between protein sequences. The correct recognition rate of ensembled DN-Fold for Top 1 predictions is 84.5%, 61.5%, and 33.6% and for Top 5 is 91.2%, 76.5%, and 60.7% at family, superfamily, and fold levels, respectively. We also evaluated the performance of single DN-Fold (DN-FoldS), which showed the comparable results at the level of family and superfamily, compared to ensemble DN-Fold. Finally, we extended the binary classification problem of fold recognition to real-value regression task, which also show a promising performance. DN-Fold is freely available through a web server at http://iris.rnet.missouri.edu/dnfold.
Shaping up the protein folding funnel by local interaction: lesson from a structure prediction study.

PubMed

Chikenji, George; Fujitsuka, Yoshimi; Takada, Shoji

2006-02-28

Predicting protein tertiary structure by folding-like simulations is one of the most stringent tests of how much we understand the principle of protein folding. Currently, the most successful method for folding-based structure prediction is the fragment assembly (FA) method. Here, we address why the FA method is so successful and its lesson for the folding problem. To do so, using the FA method, we designed a structure prediction test of "chimera proteins." In the chimera proteins, local structural preference is specific to the target sequences, whereas nonlocal interactions are only sequence-independent compaction forces. We find that these chimera proteins can find the native folds of the intact sequences with high probability indicating dominant roles of the local interactions. We further explore roles of local structural preference by exact calculation of the HP lattice model of proteins. From these results, we suggest principles of protein folding: For small proteins, compact structures that are fully compatible with local structural preference are few, one of which is the native fold. These local biases shape up the funnel-like energy landscape.
Shaping up the protein folding funnel by local interaction: Lesson from a structure prediction study

PubMed Central

Chikenji, George; Fujitsuka, Yoshimi; Takada, Shoji

2006-01-01

Predicting protein tertiary structure by folding-like simulations is one of the most stringent tests of how much we understand the principle of protein folding. Currently, the most successful method for folding-based structure prediction is the fragment assembly (FA) method. Here, we address why the FA method is so successful and its lesson for the folding problem. To do so, using the FA method, we designed a structure prediction test of “chimera proteins.” In the chimera proteins, local structural preference is specific to the target sequences, whereas nonlocal interactions are only sequence-independent compaction forces. We find that these chimera proteins can find the native folds of the intact sequences with high probability indicating dominant roles of the local interactions. We further explore roles of local structural preference by exact calculation of the HP lattice model of proteins. From these results, we suggest principles of protein folding: For small proteins, compact structures that are fully compatible with local structural preference are few, one of which is the native fold. These local biases shape up the funnel-like energy landscape. PMID:16488978
Mining sequential patterns for protein fold recognition.

PubMed

Exarchos, Themis P; Papaloukas, Costas; Lampros, Christos; Fotiadis, Dimitrios I

2008-02-01

Protein data contain discriminative patterns that can be used in many beneficial applications if they are defined correctly. In this work sequential pattern mining (SPM) is utilized for sequence-based fold recognition. Protein classification in terms of fold recognition plays an important role in computational protein analysis, since it can contribute to the determination of the function of a protein whose structure is unknown. Specifically, one of the most efficient SPM algorithms, cSPADE, is employed for the analysis of protein sequence. A classifier uses the extracted sequential patterns to classify proteins in the appropriate fold category. For training and evaluating the proposed method we used the protein sequences from the Protein Data Bank and the annotation of the SCOP database. The method exhibited an overall accuracy of 25% in a classification problem with 36 candidate categories. The classification performance reaches up to 56% when the five most probable protein folds are considered.
PREFACE Protein folding: lessons learned and new frontiers Protein folding: lessons learned and new frontiers

NASA Astrophysics Data System (ADS)

Pappu, Rohit V.; Nussinov, Ruth

2009-03-01

In appropriate physiological milieux proteins spontaneously fold into their functional three-dimensional structures. The amino acid sequences of functional proteins contain all the information necessary to specify the folds. This remarkable observation has spawned research aimed at answering two major questions. (1) Of all the conceivable structures that a protein can adopt, why is the ensemble of native-like structures the most favorable? (2) What are the paths by which proteins manage to robustly and reproducibly fold into their native structures? Anfinsen's thermodynamic hypothesis has guided the pursuit of answers to the first question whereas Levinthal's paradox has influenced the development of models for protein folding dynamics. Decades of work have led to significant advances in the folding problem. Mean-field models have been developed to capture our current, coarse grain understanding of the driving forces for protein folding. These models are being used to predict three-dimensional protein structures from sequence and stability profiles as a function of thermodynamic and chemical perturbations. Impressive strides have also been made in the field of protein design, also known as the inverse folding problem, thereby testing our understanding of the determinants of the fold specificities of different sequences. Early work on protein folding pathways focused on the specific sequence of events that could lead to a simplification of the search process. However, unifying principles proved to be elusive. Proteins that show reversible two-state folding-unfolding transitions turned out to be a gift of natural selection. Focusing on these simple systems helped researchers to uncover general principles regarding the origins of cooperativity in protein folding thermodynamics and kinetics. On the theoretical front, concepts borrowed from polymer physics and the physics of spin glasses led to the development of a framework based on energy landscape theories. These theories predict that evolved sequences (functional proteins as opposed to random sequences) find their native folds by minimizing geometric (topological) frustration (i.e. avoiding entropic bottlenecks/kinetic traps). In some cases, following a dominant pathway is the optimal way to minimize frustration, whereas in extreme cases, proteins may fold without encountering bottlenecks. Experimental studies of two-state proteins led in turn to the development of quantitative descriptors that have allowed specific testing of theoretical predictions. These include methods such as phi value analysis to characterize transition state ensembles and descriptors that measure the effects of geometry/topology on folding rates. Interestingly, there exists a striking inverse correlation between the relative contact order (the distance in sequence space between spatially proximal contacts made in the native state) and the folding rates of several two-state proteins. The relative contact order provides a rough estimate of the net entropic cost associated with realizing the folded state, and theories have been developed to explain the observed correlation between the contact order and folding rates. Despite its maturity as a field, there are several areas that come under the rubric of protein folding that are just beginning to receive attention. For example, how do complications in vivo such as macromolecular crowding, confinement, the presence of cosolutes, membrane anchoring, and tethering to surfaces influence protein stabilities and folding dynamics? While we are accustomed to studying proteins at concentrations that are amenable to investigation via probes whose signal intensities grow with protein concentration, this does not make these readouts relevant to the in vivo setting. In cells, protein concentrations are tightly regulated and are likely to be orders of magnitude lower than what we are accustomed to using within in vitro experimental setups. Protein folding in vivo is a complex multi-scale dynamical problem when one considers the synergies between protein expression, spontaneous folding, chaperonin-assisted folding, protein targeting, the kinetics of post-translational modifications, protein degradation, and of course the drive to avoid aggregation. Further, there is growing recognition that cells not only tolerate but select for proteins that are intrinsically disordered. These proteins are essential for many crucial activities, and yet their inability to fold in isolation makes them prone to proteolytic processing and aggregation. In the series of papers that make up this special focus on protein folding in physical biology, leading researchers provide insights into diverse cross-sections of problems in protein folding. Barrick provides a concise review of what we have learned from the study of two-state folders and draws attention to how several unanswered questions are being approached using studies on large repeat proteins. Dissecting the contribution of hydration-mediated interactions to driving forces for protein folding and assembly has been extremely challenging. There is renewed interest in using hydrostatic pressure as a tool to access folding intermediates and decipher the role of partially hydrated states in folding, misfolding, and aggregation. Silva and Foguel review many of the nuances that have been uncovered by perturbing hydrostatic pressure as a thermodynamic parameter. As noted above, protein folding in vivo is expected to be considerably more complex than the folding of two-state proteins in dilute solutions. Lucent et al review the state-of-the-art in the development of quantitative theories to explain chaperonin-assisted folding in vivo. Additionally, they highlight unanswered questions pertaining to the processing of unfolded/misfolded proteins by the chaperone machinery. Zhuang et al present results that focus on the effects of surface tethering on transition state ensembles and folding mechanisms of a model two-state protein. Their results are important because several proteins in vivo fold while being anchored to membranes. Finally, several neurodegenerative and systemic diseases are associated with the aggregation of intrinsically disordered polypeptides. The search for cures in these debilitating and fatal diseases has focused attention on shared attributes in aggregation mechanisms of different proteins and the possibility of identifying druggable targets from mechanistic studies. Abedini and Raleigh review common features gleaned from mechanistic studies of the aggregation of several intrinsically disordered proteins. They propose that the population of helical intermediates and their stabilization via interactions with membranes might be an important route by which the process of aggregation leads to toxicity. The five papers that form this protein folding focus cover specific sub-topics within the larger field of protein folding. They address current questions and emphasize the importance of the growing and productive interface between the physical sciences and biology. We hope that these papers will stimulate much discussion and more importantly advances in the areas highlighted by the contributors.
Improved method for predicting protein fold patterns with ensemble classifiers.

PubMed

Chen, W; Liu, X; Huang, Y; Jiang, Y; Zou, Q; Lin, C

2012-01-27

Protein folding is recognized as a critical problem in the field of biophysics in the 21st century. Predicting protein-folding patterns is challenging due to the complex structure of proteins. In an attempt to solve this problem, we employed ensemble classifiers to improve prediction accuracy. In our experiments, 188-dimensional features were extracted based on the composition and physical-chemical property of proteins and 20-dimensional features were selected using a coupled position-specific scoring matrix. Compared with traditional prediction methods, these methods were superior in terms of prediction accuracy. The 188-dimensional feature-based method achieved 71.2% accuracy in five cross-validations. The accuracy rose to 77% when we used a 20-dimensional feature vector. These methods were used on recent data, with 54.2% accuracy. Source codes and dataset, together with web server and software tools for prediction, are available at: http://datamining.xmu.edu.cn/main/~cwc/ProteinPredict.html.
Solitons and protein folding: An In Silico experiment

NASA Astrophysics Data System (ADS)

Ilieva, N.; Dai, J.; Sieradzan, A.; Niemi, A.

2015-10-01

Protein folding [1] is the process of formation of a functional 3D structure from a random coil — the shape in which amino-acid chains leave the ribosome. Anfinsen's dogma states that the native 3D shape of a protein is completely determined by protein's amino acid sequence. Despite the progress in understanding the process rate and the success in folding prediction for some small proteins, with presently available physics-based methods it is not yet possible to reliably deduce the shape of a biologically active protein from its amino acid sequence. The protein-folding problem endures as one of the most important unresolved problems in science; it addresses the origin of life itself. Furthermore, a wrong fold is a common cause for a protein to lose its function or even endanger the living organism. Soliton solutions of a generalized discrete non-linear Schrödinger equation (GDNLSE) obtained from the energy function in terms of bond and torsion angles κ and τ provide a constructive theoretical framework for describing protein folds and folding patterns [2]. Here we study the dynamics of this process by means of molecular-dynamics simulations. The soliton manifestation is the pattern helix-loop-helix in the secondary structure of the protein, which explains the importance of understanding loop formation in helical proteins. We performed in silico experiments for unfolding one subunit of the core structure of gp41 from the HIV envelope glycoprotein (PDB ID: 1AIK [3]) by molecular-dynamics simulations with the MD package GROMACS. We analyzed 80 ns trajectories, obtained with one united-atom and two different all-atom force fields, to justify the side-chain orientation quantification scheme adopted in the studies and to eliminate force-field based artifacts. Our results are compatible with the soliton model of protein folding and provide first insight into soliton-formation dynamics.
Solvent viscosity and friction in protein folding dynamics.

PubMed

Hagen, Stephen J

2010-08-01

The famous Kramers rate theory for diffusion-controlled reactions has been extended in numerous ways and successfully applied to many types of reactions. Its application to protein folding reactions has been of particular interest in recent years, as many researchers have performed experiments and simulations to test whether folding reactions are diffusion-controlled, whether the solvent is the source of the reaction friction, and whether the friction-dependence of folding rates generally can provide insight into folding dynamics. These experiments involve many practical difficulties, however. They have also produced some unexpected results. Here we briefly review the Kramers theory for reactions in the presence of strong friction and summarize some of the subtle problems that arise in the application of the theory to protein folding. We discuss how the results of these experiments ultimately point to a significant role for internal friction in protein folding dynamics. Studies of friction in protein folding, far from revealing any weakness in Kramers theory, may actually lead to new approaches for probing diffusional dynamics and energy landscapes in protein folding.

Visualizing chaperone-assisted protein folding

DOE PAGES

Horowitz, Scott; Salmon, Loïc; Koldewey, Philipp; ...

2016-05-30

We present that challenges in determining the structures of heterogeneous and dynamic protein complexes have greatly hampered past efforts to obtain a mechanistic understanding of many important biological processes. One such process is chaperone-assisted protein folding. Obtaining structural ensembles of chaperone–substrate complexes would ultimately reveal how chaperones help proteins fold into their native state. To address this problem, we devised a new structural biology approach based on X-ray crystallography, termed residual electron and anomalous density (READ). READ enabled us to visualize even sparsely populated conformations of the substrate protein immunity protein 7 (Im7) in complex with the Escherichia coli chaperonemore » Spy, and to capture a series of snapshots depicting the various folding states of Im7 bound to Spy. The ensemble shows that Spy-associated Im7 samples conformations ranging from unfolded to partially folded to native-like states and reveals how a substrate can explore its folding landscape while being bound to a chaperone.« less
Protein Aggregation/Folding: The Role of Deterministic Singularities of Sequence Hydrophobicity as Determined by Nonlinear Signal Analysis of Acylphosphatase and Aβ(1–40)

PubMed Central

Zbilut, Joseph P.; Colosimo, Alfredo; Conti, Filippo; Colafranceschi, Mauro; Manetti, Cesare; Valerio, MariaCristina; Webber, Charles L.; Giuliani, Alessandro

2003-01-01

The problem of protein folding vs. aggregation was investigated in acylphosphatase and the amyloid protein Aβ(1–40) by means of nonlinear signal analysis of their chain hydrophobicity. Numerical descriptors of recurrence patterns provided the basis for statistical evaluation of folding/aggregation distinctive features. Static and dynamic approaches were used to elucidate conditions coincident with folding vs. aggregation using comparisons with known protein secondary structure classifications, site-directed mutagenesis studies of acylphosphatase, and molecular dynamics simulations of amyloid protein, Aβ(1–40). The results suggest that a feature derived from principal component space characterized by the smoothness of singular, deterministic hydrophobicity patches plays a significant role in the conditions governing protein aggregation. PMID:14645049
Molecular simulation of surfactant-assisted protein refolding

NASA Astrophysics Data System (ADS)

Lu, Diannan; Liu, Zheng; Liu, Zhixia; Zhang, Minlian; Ouyang, Pingkai

2005-04-01

Protein refolding to its native state in vitro is a challenging problem in biotechnology, i.e., in the biomedical, pharmaceutical, and food industry. Protein aggregation and misfolding usually inhibit the recovery of proteins with their native states. These problems can be partially solved by adding a surfactant into a suitable solution environment. However, the process of this surfactant-assisted protein refolding is not well understood. In this paper, we wish to report on the first-ever simulations of surfactant-assisted protein refolding. For these studies, we defined a simple model for the protein and the surfactant and investigated how a surfactant affected the folding behavior of a two-dimensional lattice protein molecule. The model protein and model surfactant were chosen such that we could capture the important features of the folding process and the interaction between the protein and the surfactant, namely, the hydrophobic interaction. It was shown that, in the absence of surfactants, a protein in an "energy trap" conformation, i.e., a local energy minima, could not fold into the native form, which was characterized by a global energy minimum. The addition of surfactants created folding pathways via the formation of protein-surfactant complexes and thus enabled the conformations that fell into energy trap states to escape from these traps and to form the native proteins. The simulation results also showed that it was necessary to match the hydrophobicity of surfactant to the concentration of denaturant, which was added to control the folding or unfolding of a protein. The surfactants with different hydrophobicity had their own concentration range on assisting protein refolding. All of these simulations agreed well with experimental results reported elsewhere, indicating both the validity of the simulations presented here and the potential application of the simulations for the design of a surfactant on assisting protein refolding.
Mapping the distribution of packing topologies within protein interiors shows predominant preference for specific packing motifs

PubMed Central

2011-01-01

Background Mapping protein primary sequences to their three dimensional folds referred to as the 'second genetic code' remains an unsolved scientific problem. A crucial part of the problem concerns the geometrical specificity in side chain association leading to densely packed protein cores, a hallmark of correctly folded native structures. Thus, any model of packing within proteins should constitute an indispensable component of protein folding and design. Results In this study an attempt has been made to find, characterize and classify recurring patterns in the packing of side chain atoms within a protein which sustains its native fold. The interaction of side chain atoms within the protein core has been represented as a contact network based on the surface complementarity and overlap between associating side chain surfaces. Some network topologies definitely appear to be preferred and they have been termed 'packing motifs', analogous to super secondary structures in proteins. Study of the distribution of these motifs reveals the ubiquitous presence of typical smaller graphs, which appear to get linked or coalesce to give larger graphs, reminiscent of the nucleation-condensation model in protein folding. One such frequently occurring motif, also envisaged as the unit of clustering, the three residue clique was invariably found in regions of dense packing. Finally, topological measures based on surface contact networks appeared to be effective in discriminating sequences native to a specific fold amongst a set of decoys. Conclusions Out of innumerable topological possibilities, only a finite number of specific packing motifs are actually realized in proteins. This small number of motifs could serve as a basis set in the construction of larger networks. Of these, the triplet clique exhibits distinct preference both in terms of composition and geometry. PMID:21605466
Mechanisms of protein-folding diseases at a glance.

PubMed

Valastyan, Julie S; Lindquist, Susan

2014-01-01

For a protein to function appropriately, it must first achieve its proper conformation and location within the crowded environment inside the cell. Multiple chaperone systems are required to fold proteins correctly. In addition, degradation pathways participate by destroying improperly folded proteins. The intricacy of this multisystem process provides many opportunities for error. Furthermore, mutations cause misfolded, nonfunctional forms of proteins to accumulate. As a result, many pathological conditions are fundamentally rooted in the protein-folding problem that all cells must solve to maintain their function and integrity. Here, to illustrate the breadth of this phenomenon, we describe five examples of protein-misfolding events that can lead to disease: improper degradation, mislocalization, dominant-negative mutations, structural alterations that establish novel toxic functions, and amyloid accumulation. In each case, we will highlight current therapeutic options for battling such diseases.
Extracting features from protein sequences to improve deep extreme learning machine for protein fold recognition.

PubMed

Ibrahim, Wisam; Abadeh, Mohammad Saniee

2017-05-21

Protein fold recognition is an important problem in bioinformatics to predict three-dimensional structure of a protein. One of the most challenging tasks in protein fold recognition problem is the extraction of efficient features from the amino-acid sequences to obtain better classifiers. In this paper, we have proposed six descriptors to extract features from protein sequences. These descriptors are applied in the first stage of a three-stage framework PCA-DELM-LDA to extract feature vectors from the amino-acid sequences. Principal Component Analysis PCA has been implemented to reduce the number of extracted features. The extracted feature vectors have been used with original features to improve the performance of the Deep Extreme Learning Machine DELM in the second stage. Four new features have been extracted from the second stage and used in the third stage by Linear Discriminant Analysis LDA to classify the instances into 27 folds. The proposed framework is implemented on the independent and combined feature sets in SCOP datasets. The experimental results show that extracted feature vectors in the first stage could improve the performance of DELM in extracting new useful features in second stage. Copyright © 2017 Elsevier Ltd. All rights reserved.
Principal component analysis for protein folding dynamics.

PubMed

Maisuradze, Gia G; Liwo, Adam; Scheraga, Harold A

2009-01-09

Protein folding is considered here by studying the dynamics of the folding of the triple beta-strand WW domain from the Formin-binding protein 28. Starting from the unfolded state and ending either in the native or nonnative conformational states, trajectories are generated with the coarse-grained united residue (UNRES) force field. The effectiveness of principal components analysis (PCA), an already established mathematical technique for finding global, correlated motions in atomic simulations of proteins, is evaluated here for coarse-grained trajectories. The problems related to PCA and their solutions are discussed. The folding and nonfolding of proteins are examined with free-energy landscapes. Detailed analyses of many folding and nonfolding trajectories at different temperatures show that PCA is very efficient for characterizing the general folding and nonfolding features of proteins. It is shown that the first principal component captures and describes in detail the dynamics of a system. Anomalous diffusion in the folding/nonfolding dynamics is examined by the mean-square displacement (MSD) and the fractional diffusion and fractional kinetic equations. The collisionless (or ballistic) behavior of a polypeptide undergoing Brownian motion along the first few principal components is accounted for.
Coarse-grained sequences for protein folding and design.

PubMed

Brown, Scott; Fawzi, Nicolas J; Head-Gordon, Teresa

2003-09-16

We present the results of sequence design on our off-lattice minimalist model in which no specification of native-state tertiary contacts is needed. We start with a sequence that adopts a target topology and build on it through sequence mutation to produce new sequences that comprise distinct members within a target fold class. In this work, we use the alpha/beta ubiquitin fold class and design two new sequences that, when characterized through folding simulations, reproduce the differences in folding mechanism seen experimentally for proteins L and G. The primary implication of this work is that patterning of hydrophobic and hydrophilic residues is the physical origin for the success of relative contact-order descriptions of folding, and that these physics-based potentials provide a predictive connection between free energy landscapes and amino acid sequence (the original protein folding problem). We present results of the sequence mapping from a 20- to the three-letter code for determining a sequence that folds into the WW domain topology to illustrate future extensions to protein design.
Coarse-grained sequences for protein folding and design

PubMed Central

Brown, Scott; Fawzi, Nicolas J.; Head-Gordon, Teresa

2003-01-01

We present the results of sequence design on our off-lattice minimalist model in which no specification of native-state tertiary contacts is needed. We start with a sequence that adopts a target topology and build on it through sequence mutation to produce new sequences that comprise distinct members within a target fold class. In this work, we use the α/β ubiquitin fold class and design two new sequences that, when characterized through folding simulations, reproduce the differences in folding mechanism seen experimentally for proteins L and G. The primary implication of this work is that patterning of hydrophobic and hydrophilic residues is the physical origin for the success of relative contact-order descriptions of folding, and that these physics-based potentials provide a predictive connection between free energy landscapes and amino acid sequence (the original protein folding problem). We present results of the sequence mapping from a 20- to the three-letter code for determining a sequence that folds into the WW domain topology to illustrate future extensions to protein design. PMID:12963815
Energetic frustrations in protein folding at residue resolution: a homologous simulation study of Im9 proteins.

PubMed

Sun, Yunxiang; Ming, Dengming

2014-01-01

Energetic frustration is becoming an important topic for understanding the mechanisms of protein folding, which is a long-standing big biological problem usually investigated by the free energy landscape theory. Despite the significant advances in probing the effects of folding frustrations on the overall features of protein folding pathways and folding intermediates, detailed characterizations of folding frustrations at an atomic or residue level are still lacking. In addition, how and to what extent folding frustrations interact with protein topology in determining folding mechanisms remains unclear. In this paper, we tried to understand energetic frustrations in the context of protein topology structures or native-contact networks by comparing the energetic frustrations of five homologous Im9 alpha-helix proteins that share very similar topology structures but have a single hydrophilic-to-hydrophobic mutual mutation. The folding simulations were performed using a coarse-grained Gō-like model, while non-native hydrophobic interactions were introduced as energetic frustrations using a Lennard-Jones potential function. Energetic frustrations were then examined at residue level based on φ-value analyses of the transition state ensemble structures and mapped back to native-contact networks. Our calculations show that energetic frustrations have highly heterogeneous influences on the folding of the four helices of the examined structures depending on the local environment of the frustration centers. Also, the closer the introduced frustration is to the center of the native-contact network, the larger the changes in the protein folding. Our findings add a new dimension to the understanding of protein folding the topology determination in that energetic frustrations works closely with native-contact networks to affect the protein folding.
Meta-structure correlation in protein space unveils different selection rules for folded and intrinsically disordered proteins.

PubMed

Naranjo, Yandi; Pons, Miquel; Konrat, Robert

2012-01-01

The number of existing protein sequences spans a very small fraction of sequence space. Natural proteins have overcome a strong negative selective pressure to avoid the formation of insoluble aggregates. Stably folded globular proteins and intrinsically disordered proteins (IDPs) use alternative solutions to the aggregation problem. While in globular proteins folding minimizes the access to aggregation prone regions, IDPs on average display large exposed contact areas. Here, we introduce the concept of average meta-structure correlation maps to analyze sequence space. Using this novel conceptual view we show that representative ensembles of folded and ID proteins show distinct characteristics and respond differently to sequence randomization. By studying the way evolutionary constraints act on IDPs to disable a negative function (aggregation) we might gain insight into the mechanisms by which function-enabling information is encoded in IDPs.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Ilieva, N., E-mail: nevena.ilieva@parallel.bas.bg; Dai, J., E-mail: daijing491@gmail.com; Sieradzan, A., E-mail: adams86@wp.pl

Protein folding [1] is the process of formation of a functional 3D structure from a random coil — the shape in which amino-acid chains leave the ribosome. Anfinsen’s dogma states that the native 3D shape of a protein is completely determined by protein’s amino acid sequence. Despite the progress in understanding the process rate and the success in folding prediction for some small proteins, with presently available physics-based methods it is not yet possible to reliably deduce the shape of a biologically active protein from its amino acid sequence. The protein-folding problem endures as one of the most important unresolvedmore » problems in science; it addresses the origin of life itself. Furthermore, a wrong fold is a common cause for a protein to lose its function or even endanger the living organism. Soliton solutions of a generalized discrete non-linear Schrödinger equation (GDNLSE) obtained from the energy function in terms of bond and torsion angles κ and τ provide a constructive theoretical framework for describing protein folds and folding patterns [2]. Here we study the dynamics of this process by means of molecular-dynamics simulations. The soliton manifestation is the pattern helix–loop–helix in the secondary structure of the protein, which explains the importance of understanding loop formation in helical proteins. We performed in silico experiments for unfolding one subunit of the core structure of gp41 from the HIV envelope glycoprotein (PDB ID: 1AIK [3]) by molecular-dynamics simulations with the MD package GROMACS. We analyzed 80 ns trajectories, obtained with one united-atom and two different all-atom force fields, to justify the side-chain orientation quantification scheme adopted in the studies and to eliminate force-field based artifacts. Our results are compatible with the soliton model of protein folding and provide first insight into soliton-formation dynamics.« less
WeFold: A Coopetition for Protein Structure Prediction

PubMed Central

Khoury, George A.; Liwo, Adam; Khatib, Firas; Zhou, Hongyi; Chopra, Gaurav; Bacardit, Jaume; Bortot, Leandro O.; Faccioli, Rodrigo A.; Deng, Xin; He, Yi; Krupa, Pawel; Li, Jilong; Mozolewska, Magdalena A.; Sieradzan, Adam K.; Smadbeck, James; Wirecki, Tomasz; Cooper, Seth; Flatten, Jeff; Xu, Kefan; Baker, David; Cheng, Jianlin; Delbem, Alexandre C. B.; Floudas, Christodoulos A.; Keasar, Chen; Levitt, Michael; Popović, Zoran; Scheraga, Harold A.; Skolnick, Jeffrey; Crivelli, Silvia N.; Players, Foldit

2014-01-01

The protein structure prediction problem continues to elude scientists. Despite the introduction of many methods, only modest gains were made over the last decade for certain classes of prediction targets. To address this challenge, a social-media based worldwide collaborative effort, named WeFold, was undertaken by thirteen labs. During the collaboration, the labs were simultaneously competing with each other. Here, we present the first attempt at “coopetition” in scientific research applied to the protein structure prediction and refinement problems. The coopetition was possible by allowing the participating labs to contribute different components of their protein structure prediction pipelines and create new hybrid pipelines that they tested during CASP10. This manuscript describes both successes and areas needing improvement as identified throughout the first WeFold experiment and discusses the efforts that are underway to advance this initiative. A footprint of all contributions and structures are publicly accessible at http://www.wefold.org. PMID:24677212
DOE Office of Scientific and Technical Information (OSTI.GOV)

Horowitz, Scott; Salmon, Loïc; Koldewey, Philipp

We present that challenges in determining the structures of heterogeneous and dynamic protein complexes have greatly hampered past efforts to obtain a mechanistic understanding of many important biological processes. One such process is chaperone-assisted protein folding. Obtaining structural ensembles of chaperone–substrate complexes would ultimately reveal how chaperones help proteins fold into their native state. To address this problem, we devised a new structural biology approach based on X-ray crystallography, termed residual electron and anomalous density (READ). READ enabled us to visualize even sparsely populated conformations of the substrate protein immunity protein 7 (Im7) in complex with the Escherichia coli chaperonemore » Spy, and to capture a series of snapshots depicting the various folding states of Im7 bound to Spy. The ensemble shows that Spy-associated Im7 samples conformations ranging from unfolded to partially folded to native-like states and reveals how a substrate can explore its folding landscape while being bound to a chaperone.« less
Protein folding optimization based on 3D off-lattice model via an improved artificial bee colony algorithm.

PubMed

Li, Bai; Lin, Mu; Liu, Qiao; Li, Ya; Zhou, Changjun

2015-10-01

Protein folding is a fundamental topic in molecular biology. Conventional experimental techniques for protein structure identification or protein folding recognition require strict laboratory requirements and heavy operating burdens, which have largely limited their applications. Alternatively, computer-aided techniques have been developed to optimize protein structures or to predict the protein folding process. In this paper, we utilize a 3D off-lattice model to describe the original protein folding scheme as a simplified energy-optimal numerical problem, where all types of amino acid residues are binarized into hydrophobic and hydrophilic ones. We apply a balance-evolution artificial bee colony (BE-ABC) algorithm as the minimization solver, which is featured by the adaptive adjustment of search intensity to cater for the varying needs during the entire optimization process. In this work, we establish a benchmark case set with 13 real protein sequences from the Protein Data Bank database and evaluate the convergence performance of BE-ABC algorithm through strict comparisons with several state-of-the-art ABC variants in short-term numerical experiments. Besides that, our obtained best-so-far protein structures are compared to the ones in comprehensive previous literature. This study also provides preliminary insights into how artificial intelligence techniques can be applied to reveal the dynamics of protein folding. Graphical Abstract Protein folding optimization using 3D off-lattice model and advanced optimization techniques.
Statistical mechanics of simple models of protein folding and design.

PubMed Central

Pande, V S; Grosberg, A Y; Tanaka, T

1997-01-01

It is now believed that the primary equilibrium aspects of simple models of protein folding are understood theoretically. However, current theories often resort to rather heavy mathematics to overcome some technical difficulties inherent in the problem or start from a phenomenological model. To this end, we take a new approach in this pedagogical review of the statistical mechanics of protein folding. The benefit of our approach is a drastic mathematical simplification of the theory, without resort to any new approximations or phenomenological prescriptions. Indeed, the results we obtain agree precisely with previous calculations. Because of this simplification, we are able to present here a thorough and self contained treatment of the problem. Topics discussed include the statistical mechanics of the random energy model (REM), tests of the validity of REM as a model for heteropolymer freezing, freezing transition of random sequences, phase diagram of designed ("minimally frustrated") sequences, and the degree to which errors in the interactions employed in simulations of either folding and design can still lead to correct folding behavior. Images FIGURE 2 FIGURE 3 FIGURE 4 FIGURE 6 PMID:9414231
Protein folding: Over half a century lasting quest. Comment on "There and back again: Two views on the protein folding puzzle" by Alexei V. Finkelstein et al.

NASA Astrophysics Data System (ADS)

Krokhotin, Andrey; Dokholyan, Nikolay V.

2017-07-01

Most proteins fold into unique three-dimensional (3D) structures that determine their biological functions, such as catalytic activity or macromolecular binding. Misfolded proteins can pose a threat through aberrant interactions with other proteins leading to a number of diseases including Alzheimer's disease, Parkinson's disease, and amyotrophic lateral sclerosis [1,2]. What does determine 3D structure of proteins? The first clue to this question came more than fifty years ago when Anfinsen demonstrated that unfolded proteins can spontaneously fold to their native 3D structures [3,4]. Anfinsen's experiments lead to the conclusion that proteins fold to unique native structure corresponding to the stable and kinetically accessible free energy minimum, and protein native structure is solely determined by its amino acid sequence. The question of how exactly proteins find their free energy minimum proved to be a difficult problem. One of the puzzles, initially pointed out by Levinthal, was an inconsistency between observed protein folding times and theoretical estimates. A self-avoiding polymer model of a globular protein of 100-residues length on a cubic lattice can sample at least 1047 states. Based on the assumption that conformational sampling occurs at the highest vibrational mode of proteins (∼picoseconds), predicted folding time by searching among all the possible conformations leads to ∼1027 years (much larger than the age of the universe) [5]. In contrast, observed protein folding time range from microseconds to minutes. Due to tremendous theoretical progress in protein folding field that has been achieved in past decades, the source of this inconsistency is currently understood that is thoroughly described in the review by Finkelstein et al. [6].
Theoretical aspects of pressure and solute denaturation of proteins: A Kirkwood-buff-theory approach.

PubMed

Ben-Naim, Arieh

2012-12-21

A new approach to the problem of pressure-denaturation (PD) and solute-denaturation (SD) of proteins is presented. The problem is formulated in terms of Le Chatelier principle, and a solution is sought in terms of the Kirkwood-Buff theory of solutions. It is found that both problems have one factor in common; the excluded volumes of the folded and the unfolded forms with respect to the solvent molecules. It is shown that solvent-induced effects operating on hydrophilic groups along the protein are probably the main reason for PD. On the other hand, the SD depends on the preferential solvation of the folded and the unfolded forms with respect to solvent and co-solvent molecules.
Theoretical aspects of pressure and solute denaturation of proteins: A Kirkwood-buff-theory approach

NASA Astrophysics Data System (ADS)

Ben-Naim, Arieh

2012-12-01

A new approach to the problem of pressure-denaturation (PD) and solute-denaturation (SD) of proteins is presented. The problem is formulated in terms of Le Chatelier principle, and a solution is sought in terms of the Kirkwood-Buff theory of solutions. It is found that both problems have one factor in common; the excluded volumes of the folded and the unfolded forms with respect to the solvent molecules. It is shown that solvent-induced effects operating on hydrophilic groups along the protein are probably the main reason for PD. On the other hand, the SD depends on the preferential solvation of the folded and the unfolded forms with respect to solvent and co-solvent molecules.
Three key residues form a critical contact network in a protein folding transition state

NASA Astrophysics Data System (ADS)

Vendruscolo, Michele; Paci, Emanuele; Dobson, Christopher M.; Karplus, Martin

2001-02-01

Determining how a protein folds is a central problem in structural biology. The rate of folding of many proteins is determined by the transition state, so that a knowledge of its structure is essential for understanding the protein folding reaction. Here we use mutation measurements-which determine the role of individual residues in stabilizing the transition state-as restraints in a Monte Carlo sampling procedure to determine the ensemble of structures that make up the transition state. We apply this approach to the experimental data for the 98-residue protein acylphosphatase, and obtain a transition-state ensemble with the native-state topology and an average root-mean-square deviation of 6Å from the native structure. Although about 20 residues with small positional fluctuations form the structural core of this transition state, the native-like contact network of only three of these residues is sufficient to determine the overall fold of the protein. This result reveals how a nucleation mechanism involving a small number of key residues can lead to folding of a polypeptide chain to its unique native-state structure.

New Protein Mimetics: The Zinc Finger Motif as a Locked-In Tertiary Fold.

PubMed

Tuchscherer, Gabriele; Lehmann, Christian; Mathieu, Marc

1998-11-16

The principle of a molecular kit is used for the covalent assembly of secondary structure forming peptide blocks to predetermined packing topologies. The resulting locked-in folds (LIFs; depicted schematically) are readily accessible and bypass the intriguing folding problem of linear peptide chains. This strategy allows, for example, mimicking of the essential structural and functional features of zinc finger proteins. © 1998 WILEY-VCH Verlag GmbH, Weinheim, Fed. Rep. of Germany.
Protein folding simulations: from coarse-grained model to all-atom model.

PubMed

Zhang, Jian; Li, Wenfei; Wang, Jun; Qin, Meng; Wu, Lei; Yan, Zhiqiang; Xu, Weixin; Zuo, Guanghong; Wang, Wei

2009-06-01

Protein folding is an important and challenging problem in molecular biology. During the last two decades, molecular dynamics (MD) simulation has proved to be a paramount tool and was widely used to study protein structures, folding kinetics and thermodynamics, and structure-stability-function relationship. It was also used to help engineering and designing new proteins, and to answer even more general questions such as the minimal number of amino acid or the evolution principle of protein families. Nowadays, the MD simulation is still undergoing rapid developments. The first trend is to toward developing new coarse-grained models and studying larger and more complex molecular systems such as protein-protein complex and their assembling process, amyloid related aggregations, and structure and motion of chaperons, motors, channels and virus capsides; the second trend is toward building high resolution models and explore more detailed and accurate pictures of protein folding and the associated processes, such as the coordination bond or disulfide bond involved folding, the polarization, charge transfer and protonate/deprotonate process involved in metal coupled folding, and the ion permeation and its coupling with the kinetics of channels. On these new territories, MD simulations have given many promising results and will continue to offer exciting views. Here, we review several new subjects investigated by using MD simulations as well as the corresponding developments of appropriate protein models. These include but are not limited to the attempt to go beyond the topology based Gō-like model and characterize the energetic factors in protein structures and dynamics, the study of the thermodynamics and kinetics of disulfide bond involved protein folding, the modeling of the interactions between chaperonin and the encapsulated protein and the protein folding under this circumstance, the effort to clarify the important yet still elusive folding mechanism of protein BBL, the development of discrete MD and its application in studying the alpha-beta conformational conversion and oligomer assembling process, and the modeling of metal ion involved protein folding. (c) 2009 IUBMB.
Influence of the native topology on the folding barrier for small proteins

NASA Astrophysics Data System (ADS)

Prieto, Lidia; Rey, Antonio

2007-11-01

The possibility of downhill instead of two-state folding for proteins has been a very controversial topic which arose from recent experimental studies. From the theoretical side, this question has also been accomplished in different ways. Given the experimental observation that a relationship exists between the native structure topology of a protein and the kinetic and thermodynamic properties of its folding process, Gō-type potentials are an appropriate way to approach this problem. In this work, we employ an interaction potential from this family to get a better insight on the topological characteristics of the native state that may somehow determine the presence of a thermodynamic barrier in the folding pathway. The results presented here show that, indeed, the native topology of a small protein has a great influence on its folding behavior, mostly depending on the proportion of local and long range contacts the protein has in its native structure. Furthermore, when all the interactions present contribute in a balanced way, the transition results to be cooperative. Otherwise, the tendency to a downhill folding behavior increases.
Learning To Fold Proteins Using Energy Landscape Theory

PubMed Central

Schafer, N.P.; Kim, B.L.; Zheng, W.; Wolynes, P.G.

2014-01-01

This review is a tutorial for scientists interested in the problem of protein structure prediction, particularly those interested in using coarse-grained molecular dynamics models that are optimized using lessons learned from the energy landscape theory of protein folding. We also present a review of the results of the AMH/AMC/AMW/AWSEM family of coarse-grained molecular dynamics protein folding models to illustrate the points covered in the first part of the article. Accurate coarse-grained structure prediction models can be used to investigate a wide range of conceptual and mechanistic issues outside of protein structure prediction; specifically, the paper concludes by reviewing how AWSEM has in recent years been able to elucidate questions related to the unusual kinetic behavior of artificially designed proteins, multidomain protein misfolding, and the initial stages of protein aggregation. PMID:25308991
RNA folding: structure prediction, folding kinetics and ion electrostatics.

PubMed

Tan, Zhijie; Zhang, Wenbing; Shi, Yazhou; Wang, Fenghua

2015-01-01

Beyond the "traditional" functions such as gene storage, transport and protein synthesis, recent discoveries reveal that RNAs have important "new" biological functions including the RNA silence and gene regulation of riboswitch. Such functions of noncoding RNAs are strongly coupled to the RNA structures and proper structure change, which naturally leads to the RNA folding problem including structure prediction and folding kinetics. Due to the polyanionic nature of RNAs, RNA folding structure, stability and kinetics are strongly coupled to the ion condition of solution. The main focus of this chapter is to review the recent progress in the three major aspects in RNA folding problem: structure prediction, folding kinetics and ion electrostatics. This chapter will introduce both the recent experimental and theoretical progress, while emphasize the theoretical modelling on the three aspects in RNA folding.
Negative Charge Neutralization in the Loops and Turns of Outer Membrane Phospholipase A Impacts Folding Hysteresis at Neutral pH.

PubMed

McDonald, Sarah K; Fleming, Karen G

2016-11-08

Hysteresis in equilibrium protein folding titrations is an experimental barrier that must be overcome to extract meaningful thermodynamic quantities. Traditional approaches to solving this problem involve testing a spectrum of solution conditions to find ones that achieve path independence. Through this procedure, a specific pH of 3.8 was required to achieve path independence for the water-to-bilayer equilibrium folding of outer membrane protein OmpLA. We hypothesized that the neutralization of negatively charged side chains (Asp and Glu) at pH 3.8 could be the physical basis for path-independent folding at this pH. To test this idea, we engineered variants of OmpLA with Asp → Asn and Glu → Gln mutations to neutralize the negative charges within various regions of the protein and tested for reversible folding at neutral pH. Although not fully resolved, our results show that these mutations in the periplasmic turns and extracellular loops are responsible for 60% of the hysteresis in wild-type folding. Overall, our study suggests that negative charges impact the folding hysteresis in outer membrane proteins and their neutralization may aid in protein engineering applications.
Support Vector Machine-based classification of protein folds using the structural properties of amino acid residues and amino acid residue pairs.

PubMed

Shamim, Mohammad Tabrez Anwar; Anwaruddin, Mohammad; Nagarajaram, H A

2007-12-15

Fold recognition is a key step in the protein structure discovery process, especially when traditional sequence comparison methods fail to yield convincing structural homologies. Although many methods have been developed for protein fold recognition, their accuracies remain low. This can be attributed to insufficient exploitation of fold discriminatory features. We have developed a new method for protein fold recognition using structural information of amino acid residues and amino acid residue pairs. Since protein fold recognition can be treated as a protein fold classification problem, we have developed a Support Vector Machine (SVM) based classifier approach that uses secondary structural state and solvent accessibility state frequencies of amino acids and amino acid pairs as feature vectors. Among the individual properties examined secondary structural state frequencies of amino acids gave an overall accuracy of 65.2% for fold discrimination, which is better than the accuracy by any method reported so far in the literature. Combination of secondary structural state frequencies with solvent accessibility state frequencies of amino acids and amino acid pairs further improved the fold discrimination accuracy to more than 70%, which is approximately 8% higher than the best available method. In this study we have also tested, for the first time, an all-together multi-class method known as Crammer and Singer method for protein fold classification. Our studies reveal that the three multi-class classification methods, namely one versus all, one versus one and Crammer and Singer method, yield similar predictions. Dataset and stand-alone program are available upon request.
Collapse kinetics and chevron plots from simulations of denaturant-dependent folding of globular proteins

PubMed Central

Liu, Zhenxing; Reddy, Govardhan; O’Brien, Edward P.; Thirumalai, D.

2011-01-01

Quantitative description of how proteins fold under experimental conditions remains a challenging problem. Experiments often use urea and guanidinium chloride to study folding whereas the natural variable in simulations is temperature. To bridge the gap, we use the molecular transfer model that combines measured denaturant-dependent transfer free energies for the peptide group and amino acid residues, and a coarse-grained Cα-side chain model for polypeptide chains to simulate the folding of src SH3 domain. Stability of the native state decreases linearly as [C] (the concentration of guanidinium chloride) increases with the slope, m, that is in excellent agreement with experiments. Remarkably, the calculated folding rate at [C] = 0 is only 16-fold larger than the measured value. Most importantly ln kobs (kobs is the sum of folding and unfolding rates) as a function of [C] has the characteristic V (chevron) shape. In every folding trajectory, the times for reaching the native state, interactions stabilizing all the substructures, and global collapse coincide. The value of (mf is the slope of the folding arm of the chevron plot) is identical to the fraction of buried solvent accessible surface area in the structures of the transition state ensemble. In the dominant transition state, which does not vary significantly at low [C], the core of the protein and certain loops are structured. Besides solving the long-standing problem of computing the chevron plot, our work lays the foundation for incorporating denaturant effects in a physically transparent manner either in all-atom or coarse-grained simulations. PMID:21512127
Collapse kinetics and chevron plots from simulations of denaturant-dependent folding of globular proteins.

PubMed

Liu, Zhenxing; Reddy, Govardhan; O'Brien, Edward P; Thirumalai, D

2011-05-10

Quantitative description of how proteins fold under experimental conditions remains a challenging problem. Experiments often use urea and guanidinium chloride to study folding whereas the natural variable in simulations is temperature. To bridge the gap, we use the molecular transfer model that combines measured denaturant-dependent transfer free energies for the peptide group and amino acid residues, and a coarse-grained C(α)-side chain model for polypeptide chains to simulate the folding of src SH(3) domain. Stability of the native state decreases linearly as [C] (the concentration of guanidinium chloride) increases with the slope, m, that is in excellent agreement with experiments. Remarkably, the calculated folding rate at [C] = 0 is only 16-fold larger than the measured value. Most importantly ln k(obs) (k(obs) is the sum of folding and unfolding rates) as a function of [C] has the characteristic V (chevron) shape. In every folding trajectory, the times for reaching the native state, interactions stabilizing all the substructures, and global collapse coincide. The value of (m(f) is the slope of the folding arm of the chevron plot) is identical to the fraction of buried solvent accessible surface area in the structures of the transition state ensemble. In the dominant transition state, which does not vary significantly at low [C], the core of the protein and certain loops are structured. Besides solving the long-standing problem of computing the chevron plot, our work lays the foundation for incorporating denaturant effects in a physically transparent manner either in all-atom or coarse-grained simulations.
The Complexity of Folding Self-Folding Origami

NASA Astrophysics Data System (ADS)

Stern, Menachem; Pinson, Matthew B.; Murugan, Arvind

2017-10-01

Why is it difficult to refold a previously folded sheet of paper? We show that even crease patterns with only one designed folding motion inevitably contain an exponential number of "distractor" folding branches accessible from a bifurcation at the flat state. Consequently, refolding a sheet requires finding the ground state in a glassy energy landscape with an exponential number of other attractors of higher energy, much like in models of protein folding (Levinthal's paradox) and other NP-hard satisfiability (SAT) problems. As in these problems, we find that refolding a sheet requires actuation at multiple carefully chosen creases. We show that seeding successful folding in this way can be understood in terms of subpatterns that fold when cut out ("folding islands"). Besides providing guidelines for the placement of active hinges in origami applications, our results point to fundamental limits on the programmability of energy landscapes in sheets.
Balancing Force Field Protein–Lipid Interactions To Capture Transmembrane Helix–Helix Association

PubMed Central

2018-01-01

Atomistic simulations have recently been shown to be sufficiently accurate to reversibly fold globular proteins and have provided insights into folding mechanisms. Gaining similar understanding from simulations of membrane protein folding and association would be of great medical interest. All-atom simulations of the folding and assembly of transmembrane protein domains are much more challenging, not least due to very slow diffusion within the lipid bilayer membrane. Here, we focus on a simple and well-characterized prototype of membrane protein folding and assembly, namely the dimerization of glycophorin A, a homodimer of single transmembrane helices. We have determined the free energy landscape for association of the dimer using the CHARMM36 force field. We find that the native structure is a metastable state, but not stable as expected from experimental estimates of the dissociation constant and numerous experimental structures obtained under a variety of conditions. We explore two straightforward approaches to address this problem and demonstrate that they result in stable dimers with dissociation constants consistent with experimental data. PMID:29424543
Hill-Climbing search and diversification within an evolutionary approach to protein structure prediction.

PubMed

Chira, Camelia; Horvath, Dragos; Dumitrescu, D

2011-07-30

Proteins are complex structures made of amino acids having a fundamental role in the correct functioning of living cells. The structure of a protein is the result of the protein folding process. However, the general principles that govern the folding of natural proteins into a native structure are unknown. The problem of predicting a protein structure with minimum-energy starting from the unfolded amino acid sequence is a highly complex and important task in molecular and computational biology. Protein structure prediction has important applications in fields such as drug design and disease prediction. The protein structure prediction problem is NP-hard even in simplified lattice protein models. An evolutionary model based on hill-climbing genetic operators is proposed for protein structure prediction in the hydrophobic - polar (HP) model. Problem-specific search operators are implemented and applied using a steepest-ascent hill-climbing approach. Furthermore, the proposed model enforces an explicit diversification stage during the evolution in order to avoid local optimum. The main features of the resulting evolutionary algorithm - hill-climbing mechanism and diversification strategy - are evaluated in a set of numerical experiments for the protein structure prediction problem to assess their impact to the efficiency of the search process. Furthermore, the emerging consolidated model is compared to relevant algorithms from the literature for a set of difficult bidimensional instances from lattice protein models. The results obtained by the proposed algorithm are promising and competitive with those of related methods.
A discriminative method for protein remote homology detection and fold recognition combining Top-n-grams and latent semantic analysis.

PubMed

Liu, Bin; Wang, Xiaolong; Lin, Lei; Dong, Qiwen; Wang, Xuan

2008-12-01

Protein remote homology detection and fold recognition are central problems in bioinformatics. Currently, discriminative methods based on support vector machine (SVM) are the most effective and accurate methods for solving these problems. A key step to improve the performance of the SVM-based methods is to find a suitable representation of protein sequences. In this paper, a novel building block of proteins called Top-n-grams is presented, which contains the evolutionary information extracted from the protein sequence frequency profiles. The protein sequence frequency profiles are calculated from the multiple sequence alignments outputted by PSI-BLAST and converted into Top-n-grams. The protein sequences are transformed into fixed-dimension feature vectors by the occurrence times of each Top-n-gram. The training vectors are evaluated by SVM to train classifiers which are then used to classify the test protein sequences. We demonstrate that the prediction performance of remote homology detection and fold recognition can be improved by combining Top-n-grams and latent semantic analysis (LSA), which is an efficient feature extraction technique from natural language processing. When tested on superfamily and fold benchmarks, the method combining Top-n-grams and LSA gives significantly better results compared to related methods. The method based on Top-n-grams significantly outperforms the methods based on many other building blocks including N-grams, patterns, motifs and binary profiles. Therefore, Top-n-gram is a good building block of the protein sequences and can be widely used in many tasks of the computational biology, such as the sequence alignment, the prediction of domain boundary, the designation of knowledge-based potentials and the prediction of protein binding sites.
Unfolding of a ClC chloride transporter retains memory of its evolutionary history.

PubMed

Min, Duyoung; Jefferson, Robert E; Qi, Yifei; Wang, Jing Yang; Arbing, Mark A; Im, Wonpil; Bowie, James U

2018-05-01

ClC chloride channels and transporters are important for chloride homeostasis in species from bacteria to human. Mutations in ClC proteins cause genetically inherited diseases, some of which are likely to involve folding defects. The ClC proteins present a challenging and unusual biological folding problem because they are large membrane proteins possessing a complex architecture, with many reentrant helices that go only partway through membrane and loop back out. Here we were able to examine the unfolding of the Escherichia coli ClC transporter, ClC-ec1, using single-molecule forced unfolding methods. We found that the protein could be separated into two stable halves that unfolded independently. The independence of the two domains is consistent with an evolutionary model in which the two halves arose from independently folding subunits that later fused together. Maintaining smaller folding domains of lesser complexity within large membrane proteins may be an advantageous strategy to avoid misfolding traps.
Repetitive Protein Unfolding by the trans Ring of the GroEL-GroES Chaperonin Complex Stimulates Folding*

PubMed Central

Lin, Zong; Puchalla, Jason; Shoup, Daniel; Rye, Hays S.

2013-01-01

A key constraint on the growth of most organisms is the slow and inefficient folding of many essential proteins. To deal with this problem, several diverse families of protein folding machines, known collectively as molecular chaperones, developed early in evolutionary history. The functional role and operational steps of these remarkably complex nanomachines remain subjects of active debate. Here we present evidence that, for the GroEL-GroES chaperonin system, the non-native substrate protein enters the folding cycle on the trans ring of the double-ring GroEL-ATP-GroES complex rather than the ADP-bound complex. The properties of this ATP complex are designed to ensure that non-native substrate protein binds first, followed by ATP and finally GroES. This binding order ensures efficient occupancy of the open GroEL ring and allows for disruption of misfolded structures through two phases of multiaxis unfolding. In this model, repeated cycles of partial unfolding, followed by confinement within the GroEL-GroES chamber, provide the most effective overall mechanism for facilitating the folding of the most stringently dependent GroEL substrate proteins. PMID:24022487
Design of HIV-1-PR inhibitors that do not create resistance: blocking the folding of single monomers.

PubMed

Broglia, Ricardo A; Tiana, Guido; Sutto, Ludovico; Provasi, Davide; Simona, Fabio

2005-10-01

The main problems found in designing drugs are those of optimizing the drug-target interaction and of avoiding the insurgence of resistance. We suggest a scheme for the design of inhibitors that can be used as leads for the development of a drug and that do not face either of these problems, and then apply it to the case of HIV-1-PR. It is based on the knowledge that the folding of single-domain proteins, such as each of the monomers forming the HIV-1-PR homodimer, is controlled by local elementary structures (LES), stabilized by local contacts among hydrophobic, strongly interacting, and highly conserved amino acids that play a central role in the folding process. Because LES have evolved over many generations to recognize and strongly interact with each other so as to make the protein fold fast and avoid aggregation with other proteins, highly specific (and thus little toxic) as well as effective folding-inhibitor molecules suggest themselves: short peptides (or eventually their mimetic molecules) displaying the same amino acid sequence of that of LES (p-LES). Aside from being specific and efficient, these inhibitors are expected not to induce resistance; in fact, mutations in HIV-1-PR that successfully avoid the action of p-LES imply the destabilization of one or more LES and thus should lead to protein denaturation. Making use of Monte Carlo simulations, we first identify the LES of the HIV-1-PR and then show that the corresponding p-LES peptides act as effective inhibitors of the folding of the protease.
On the Role of Entropy in the Protein Folding Process

NASA Astrophysics Data System (ADS)

Hoppe, Travis

2011-12-01

A protein's ultimate function and activity is determined by the unique three-dimensional structure taken by the folding process. Protein malfunction due to misfolding is the culprit of many clinical disorders, such as abnormal protein aggregations. This leads to neurodegenerative disorders like Huntington's and Alzheimer's disease. We focus on a subset of the folding problem, exploring the role and effects of entropy on the process of protein folding. Four major concepts and models are developed and each pertains to a specific aspect of the folding process: entropic forces, conformational states under crowding, aggregation, and macrostate kinetics from microstate trajectories. The exclusive focus on entropy is well-suited for crowding studies, as many interactions are nonspecific. We show how a stabilizing entropic force can arise purely from the motion of crowders in solution. In addition we are able to make a a quantitative prediction of the crowding effect with an implicit crowding approximation using an aspherical scaled-particle theory. In order to investigate the effects of aggregation, we derive a new operator expansion method to solve the Ising/Potts model with external fields over an arbitrary graph. Here the external fields are representative of the entropic forces. We show that this method reduces the problem of calculating the partition function to the solution of recursion relations. Many of the methods employed are coarse-grained approximations. As such, it is useful to have a viable method for extracting macrostate information from time series data. We develop a method to cluster the microstates into physically meaningful macrostates by grouping similar relaxation times from a transition matrix. Overall, the studied topics allow us to understand deeper the complicated process involving proteins.
Extreme Folding

NASA Astrophysics Data System (ADS)

Demaine, Erik

2012-02-01

Our understanding of the mathematics and algorithms behind paper folding, and geometric folding in general, has increased dramatically over the past several years. These developments have found a surprisingly broad range of applications. In the art of origami, it has helped spur the technical origami revolution. In engineering and science, it has helped solve problems in areas such as manufacturing, robotics, graphics, and protein folding. On the recreational side, it has led to new kinds of folding puzzles and magic. I will give an overview of the mathematics and algorithms of folding, with a focus on new mathematics and sculpture.
Protein structure recognition: From eigenvector analysis to structural threading method

NASA Astrophysics Data System (ADS)

Cao, Haibo

In this work, we try to understand the protein folding problem using pair-wise hydrophobic interaction as the dominant interaction for the protein folding process. We found a strong correlation between amino acid sequence and the corresponding native structure of the protein. Some applications of this correlation were discussed in this dissertation include the domain partition and a new structural threading method as well as the performance of this method in the CASP5 competition. In the first part, we give a brief introduction to the protein folding problem. Some essential knowledge and progress from other research groups was discussed. This part include discussions of interactions among amino acids residues, lattice HP model, and the designablity principle. In the second part, we try to establish the correlation between amino acid sequence and the corresponding native structure of the protein. This correlation was observed in our eigenvector study of protein contact matrix. We believe the correlation is universal, thus it can be used in automatic partition of protein structures into folding domains. In the third part, we discuss a threading method based on the correlation between amino acid sequence and ominant eigenvector of the structure contact-matrix. A mathematically straightforward iteration scheme provides a self-consistent optimum global sequence-structure alignment. The computational efficiency of this method makes it possible to search whole protein structure databases for structural homology without relying on sequence similarity. The sensitivity and specificity of this method is discussed, along with a case of blind test prediction. In the appendix, we list the overall performance of this threading method in CASP5 blind test in comparison with other existing approaches.
FK506-Binding Protein 22 from a Psychrophilic Bacterium, a Cold Shock-Inducible Peptidyl Prolyl Isomerase with the Ability to Assist in Protein Folding

PubMed Central

Budiman, Cahyo; Koga, Yuichi; Takano, Kazufumi; Kanaya, Shigenori

2011-01-01

Adaptation of microorganisms to low temperatures remains to be fully elucidated. It has been previously reported that peptidyl prolyl cis-trans isomerases (PPIases) are involved in cold adaptation of various microorganisms whether they are hyperthermophiles, mesophiles or phsycrophiles. The rate of cis-trans isomerization at low temperatures is much slower than that at higher temperatures and may cause problems in protein folding. However, the mechanisms by which PPIases are involved in cold adaptation remain unclear. Here we used FK506-binding protein 22, a cold shock protein from the psychrophilic bacterium Shewanella sp. SIB1 (SIB1 FKBP22) as a model protein to decipher the involvement of PPIases in cold adaptation. SIB1 FKBP22 is homodimer that assumes a V-shaped structure based on a tertiary model. Each monomer consists of an N-domain responsible for dimerization and a C-catalytic domain. SIB1 FKBP22 is a typical cold-adapted enzyme as indicated by the increase of catalytic efficiency at low temperatures, the downward shift in optimal temperature of activity and the reduction in the conformational stability. SIB1 FKBP22 is considered as foldase and chaperone based on its ability to catalyze refolding of a cis-proline containing protein and bind to a folding intermediate protein, respectively. The foldase and chaperone activites of SIB1 FKBP22 are thought to be important for cold adaptation of Shewanella sp. SIB1. These activities are also employed by other PPIases for being involved in cold adaptation of various microorganisms. Despite other biological roles of PPIases, we proposed that foldase and chaperone activities of PPIases are the main requirement for overcoming the cold-stress problem in microorganisms due to folding of proteins. PMID:21954357

Protein structure prediction with local adjust tabu search algorithm

PubMed Central

2014-01-01

Background Protein folding structure prediction is one of the most challenging problems in the bioinformatics domain. Because of the complexity of the realistic protein structure, the simplified structure model and the computational method should be adopted in the research. The AB off-lattice model is one of the simplification models, which only considers two classes of amino acids, hydrophobic (A) residues and hydrophilic (B) residues. Results The main work of this paper is to discuss how to optimize the lowest energy configurations in 2D off-lattice model and 3D off-lattice model by using Fibonacci sequences and real protein sequences. In order to avoid falling into local minimum and faster convergence to the global minimum, we introduce a novel method (SATS) to the protein structure problem, which combines simulated annealing algorithm and tabu search algorithm. Various strategies, such as the new encoding strategy, the adaptive neighborhood generation strategy and the local adjustment strategy, are adopted successfully for high-speed searching the optimal conformation corresponds to the lowest energy of the protein sequences. Experimental results show that some of the results obtained by the improved SATS are better than those reported in previous literatures, and we can sure that the lowest energy folding state for short Fibonacci sequences have been found. Conclusions Although the off-lattice models is not very realistic, they can reflect some important characteristics of the realistic protein. It can be found that 3D off-lattice model is more like native folding structure of the realistic protein than 2D off-lattice model. In addition, compared with some previous researches, the proposed hybrid algorithm can more effectively and more quickly search the spatial folding structure of a protein chain. PMID:25474708
A rate distortion approach to protein symmetry.

PubMed

Wallace, Rodrick

2010-08-01

A spontaneous symmetry breaking argument is applied to the problem of protein folding, via a rate distortion analysis of the relation between genome coding and the final condensation of the protein molten globule that is, in spirit, analogous to Tlusty's (2007) exploration of the evolution of the genetic code. In the 'energy' picture, the average distortion between codon message and final protein structure, under constraints driven by evolutionary selection, serves as a temperature analog, so that low values limit the possible distribution of protein forms, producing the canonical folding funnel. A dual 'developmental' perspective sees the rate distortion function itself as the temperature analog, and permits incorporation of chaperons or toxic exposures as catalysts, driving the system to different possible outcomes or affecting the rate of convergence. The rate distortion function appears constrained by the availability of metabolic free energy, with implications for prebiotic evolution, and a nonequilibrium empirical Onsager treatment provides an adaptable statistical model that can be fitted to data, in the same manner as a regression equation. In sum, mechanistic models of protein folding fail to account for the observed spectrum of protein folding and aggregation disorders, suggesting that a biologically based cognitive paradigm describing folding will be needed for understanding the etiology, prevention, and treatment of these diseases. The developmental formalism introduced here may contribute substantially to such a paradigm.
Protein Structure Prediction by Protein Threading

NASA Astrophysics Data System (ADS)

Xu, Ying; Liu, Zhijie; Cai, Liming; Xu, Dong

The seminal work of Bowie, Lüthy, and Eisenberg (Bowie et al., 1991) on "the inverse protein folding problem" laid the foundation of protein structure prediction by protein threading. By using simple measures for fitness of different amino acid types to local structural environments defined in terms of solvent accessibility and protein secondary structure, the authors derived a simple and yet profoundly novel approach to assessing if a protein sequence fits well with a given protein structural fold. Their follow-up work (Elofsson et al., 1996; Fischer and Eisenberg, 1996; Fischer et al., 1996a,b) and the work by Jones, Taylor, and Thornton (Jones et al., 1992) on protein fold recognition led to the development of a new brand of powerful tools for protein structure prediction, which we now term "protein threading." These computational tools have played a key role in extending the utility of all the experimentally solved structures by X-ray crystallography and nuclear magnetic resonance (NMR), providing structural models and functional predictions for many of the proteins encoded in the hundreds of genomes that have been sequenced up to now.
Exploring the Energy Landscapes of Protein Folding Simulations with Bayesian Computation

PubMed Central

Burkoff, Nikolas S.; Várnai, Csilla; Wells, Stephen A.; Wild, David L.

2012-01-01

Nested sampling is a Bayesian sampling technique developed to explore probability distributions localized in an exponentially small area of the parameter space. The algorithm provides both posterior samples and an estimate of the evidence (marginal likelihood) of the model. The nested sampling algorithm also provides an efficient way to calculate free energies and the expectation value of thermodynamic observables at any temperature, through a simple post processing of the output. Previous applications of the algorithm have yielded large efficiency gains over other sampling techniques, including parallel tempering. In this article, we describe a parallel implementation of the nested sampling algorithm and its application to the problem of protein folding in a Gō-like force field of empirical potentials that were designed to stabilize secondary structure elements in room-temperature simulations. We demonstrate the method by conducting folding simulations on a number of small proteins that are commonly used for testing protein-folding procedures. A topological analysis of the posterior samples is performed to produce energy landscape charts, which give a high-level description of the potential energy surface for the protein folding simulations. These charts provide qualitative insights into both the folding process and the nature of the model and force field used. PMID:22385859
Exploring the energy landscapes of protein folding simulations with Bayesian computation.

PubMed

Burkoff, Nikolas S; Várnai, Csilla; Wells, Stephen A; Wild, David L

2012-02-22

Nested sampling is a Bayesian sampling technique developed to explore probability distributions localized in an exponentially small area of the parameter space. The algorithm provides both posterior samples and an estimate of the evidence (marginal likelihood) of the model. The nested sampling algorithm also provides an efficient way to calculate free energies and the expectation value of thermodynamic observables at any temperature, through a simple post processing of the output. Previous applications of the algorithm have yielded large efficiency gains over other sampling techniques, including parallel tempering. In this article, we describe a parallel implementation of the nested sampling algorithm and its application to the problem of protein folding in a Gō-like force field of empirical potentials that were designed to stabilize secondary structure elements in room-temperature simulations. We demonstrate the method by conducting folding simulations on a number of small proteins that are commonly used for testing protein-folding procedures. A topological analysis of the posterior samples is performed to produce energy landscape charts, which give a high-level description of the potential energy surface for the protein folding simulations. These charts provide qualitative insights into both the folding process and the nature of the model and force field used. Copyright Â© 2012 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Dissecting Protein Configurational Entropy into Conformational and Vibrational Contributions.

PubMed

Chong, Song-Ho; Ham, Sihyun

2015-10-01

Quantifying how the rugged nature of the underlying free-energy landscape determines the entropic cost a protein must incur upon folding and ligand binding is a challenging problem. Here, we present a novel computational approach that dissects the protein configurational entropy on the basis of the classification of protein dynamics on the landscape into two separate components: short-term vibrational dynamics related to individual free-energy wells and long-term conformational dynamics associated with transitions between wells. We apply this method to separate the configurational entropy of the protein villin headpiece subdomain into its conformational and vibrational components. We find that the change in configurational entropy upon folding is dominated by the conformational entropy despite the fact that the magnitude of the vibrational entropy is the significantly larger component in each of the folded and unfolded states, which is in accord with the previous empirical estimations. The straightforward applicability of our method to unfolded proteins promises a wide range of applications, including those related to intrinsically disordered proteins.
Hydrophobic folding units derived from dissimilar monomer structures and their interactions.

PubMed

Tsai, C J; Nussinov, R

1997-01-01

We have designed an automated procedure to cut a protein into compact hydrophobic folding units. The hydrophobic units are large enough to contain tertiary non-local interactions, reflecting potential nucleation sites during protein folding. The quality of a hydrophobic folding unit is evaluated by four criteria. The first two correspond to visual characterization of a structural domain, namely, compactness and extent of isolation. We use the definition of Zehfus and Rose (Zehfus MH, Rose GD, 1986, Biochemistry 25:35-340) to calculate the compactness of a cut protein unit. The isolation of a unit is based on the solvent accessible surface area (ASA) originally buried in the interior and exposed to the solvent after cutting. The third quantity is the hydrophobicity, equivalent to the fraction of the buried non-polar ASA with respect to the total non-polar ASA. The last criterion in the evaluation of a folding unit is the number of segments it includes. To conform with the rationale of obtaining hydrophobic units, which may relate to early folding events, the hydrophobic interactions are implicitly and explicitly applied in their generation and assessment. We follow Holm and Sander (Holm L, Sander C, 1994, Proteins 19:256-268) to reduce the multiple cutting-point problem to a one-dimensional search for all reasonable trial cuts. However, as here we focus on the hydrophobic cores, the contact matrix used to obtain the first non-trivial eigenvector contains only hydrophobic contracts, rather than all, hydrophobic and hydrophilic, interactions. This dataset of hydrophobic folding units, derived from structurally dissimilar single chain monomers, is particularly useful for investigations of the mechanism of protein folding. For cases where there are kinetic data, the one or more hydrophobic folding units generated for a protein correlate with the two or with the three-state folding process observed. We carry out extensive amino acid sequence order independent structural comparisons to generate a structurally non-redundant set of hydrophobic folding units for fold recognition and for statistical purposes.
Protein functional landscapes, dynamics, allostery: a tortuous path towards a universal theoretical framework.

PubMed

Zhuravlev, Pavel I; Papoian, Garegin A

2010-08-01

Energy landscape theories have provided a common ground for understanding the protein folding problem, which once seemed to be overwhelmingly complicated. At the same time, the native state was found to be an ensemble of interconverting states with frustration playing a more important role compared to the folding problem. The landscape of the folded protein - the native landscape - is glassier than the folding landscape; hence, a general description analogous to the folding theories is difficult to achieve. On the other hand, the native basin phase volume is much smaller, allowing a protein to fully sample its native energy landscape on the biological timescales. Current computational resources may also be used to perform this sampling for smaller proteins, to build a 'topographical map' of the native landscape that can be used for subsequent analysis. Several major approaches to representing this topographical map are highlighted in this review, including the construction of kinetic networks, hierarchical trees and free energy surfaces with subsequent structural and kinetic analyses. In this review, we extensively discuss the important question of choosing proper collective coordinates characterizing functional motions. In many cases, the substates on the native energy landscape, which represent different functional states, can be used to obtain variables that are well suited for building free energy surfaces and analyzing the protein's functional dynamics. Normal mode analysis can provide such variables in cases where functional motions are dictated by the molecule's architecture. Principal component analysis is a more expensive way of inferring the essential variables from the protein's motions, one that requires a long molecular dynamics simulation. Finally, the two popular models for the allosteric switching mechanism, 'preexisting equilibrium' and 'induced fit', are interpreted within the energy landscape paradigm as extreme points of a continuum of transition mechanisms. Some experimental evidence illustrating each of these two models, as well as intermediate mechanisms, is presented and discussed.
The Energy Landscapes of Repeat-Containing Proteins: Topology, Cooperativity, and the Folding Funnels of One-Dimensional Architectures

PubMed Central

Komives, Elizabeth A.; Wolynes, Peter G.

2008-01-01

Repeat-proteins are made up of near repetitions of 20– to 40–amino acid stretches. These polypeptides usually fold up into non-globular, elongated architectures that are stabilized by the interactions within each repeat and those between adjacent repeats, but that lack contacts between residues distant in sequence. The inherent symmetries both in primary sequence and three-dimensional structure are reflected in a folding landscape that may be analyzed as a quasi–one-dimensional problem. We present a general description of repeat-protein energy landscapes based on a formal Ising-like treatment of the elementary interaction energetics in and between foldons, whose collective ensemble are treated as spin variables. The overall folding properties of a complete “domain” (the stability and cooperativity of the repeating array) can be derived from this microscopic description. The one-dimensional nature of the model implies there are simple relations for the experimental observables: folding free-energy (ΔGwater) and the cooperativity of denaturation (m-value), which do not ordinarily apply for globular proteins. We show how the parameters for the “coarse-grained” description in terms of foldon spin variables can be extracted from more detailed folding simulations on perfectly funneled landscapes. To illustrate the ideas, we present a case-study of a family of tetratricopeptide (TPR) repeat proteins and quantitatively relate the results to the experimentally observed folding transitions. Based on the dramatic effect that single point mutations exert on the experimentally observed folding behavior, we speculate that natural repeat proteins are “poised” at particular ratios of inter- and intra-element interaction energetics that allow them to readily undergo structural transitions in physiologically relevant conditions, which may be intrinsically related to their biological functions. PMID:18483553
NIAS-Server: Neighbors Influence of Amino acids and Secondary Structures in Proteins.

PubMed

Borguesan, Bruno; Inostroza-Ponta, Mario; Dorn, Márcio

2017-03-01

The exponential growth in the number of experimentally determined three-dimensional protein structures provide a new and relevant knowledge about the conformation of amino acids in proteins. Only a few of probability densities of amino acids are publicly available for use in structure validation and prediction methods. NIAS (Neighbors Influence of Amino acids and Secondary structures) is a web-based tool used to extract information about conformational preferences of amino acid residues and secondary structures in experimental-determined protein templates. This information is useful, for example, to characterize folds and local motifs in proteins, molecular folding, and can help the solution of complex problems such as protein structure prediction, protein design, among others. The NIAS-Server and supplementary data are available at http://sbcb.inf.ufrgs.br/nias .
THE DELICATE BALANCE BETWEEN SECRETED PROTEIN FOLDING AND ENDOPLASMIC RETICULUM-ASSOCIATED DEGRADATION IN HUMAN PHYSIOLOGY

PubMed Central

Guerriero, Christopher J.; Brodsky, Jeffrey L.

2014-01-01

Protein folding is a complex, error-prone process that often results in an irreparable protein by-product. These by-products can be recognized by cellular quality control machineries and targeted for proteasome-dependent degradation. The folding of proteins in the secretory pathway adds another layer to the protein folding “problem,” as the endoplasmic reticulum maintains a unique chemical environment within the cell. In fact, a growing number of diseases are attributed to defects in secretory protein folding, and many of these by-products are targeted for a process known as endoplasmic reticulum-associated degradation (ERAD). Since its discovery, research on the mechanisms underlying the ERAD pathway has provided new insights into how ERAD contributes to human health during both normal and diseases states. Links between ERAD and disease are evidenced from the loss of protein function as a result of degradation, chronic cellular stress when ERAD fails to keep up with misfolded protein production, and the ability of some pathogens to coopt the ERAD pathway. The growing number of ERAD substrates has also illuminated the differences in the machineries used to recognize and degrade a vast array of potential clients for this pathway. Despite all that is known about ERAD, many questions remain, and new paradigms will likely emerge. Clearly, the key to successful disease treatment lies within defining the molecular details of the ERAD pathway and in understanding how this conserved pathway selects and degrades an innumerable cast of substrates. PMID:22535891
Identification of a key structural element for protein folding within beta-hairpin turns.

PubMed

Kim, Jaewon; Brych, Stephen R; Lee, Jihun; Logan, Timothy M; Blaber, Michael

2003-05-09

Specific residues in a polypeptide may be key contributors to the stability and foldability of the unique native structure. Identification and prediction of such residues is, therefore, an important area of investigation in solving the protein folding problem. Atypical main-chain conformations can help identify strains within a folded protein, and by inference, positions where unique amino acids may have a naturally high frequency of occurrence due to favorable contributions to stability and folding. Non-Gly residues located near the left-handed alpha-helical region (L-alpha) of the Ramachandran plot are a potential indicator of structural strain. Although many investigators have studied mutations at such positions, no consistent energetic or kinetic contributions to stability or folding have been elucidated. Here we report a study of the effects of Gly, Ala and Asn substitutions found within the L-alpha region at a characteristic position in defined beta-hairpin turns within human acidic fibroblast growth factor, and demonstrate consistent effects upon stability and folding kinetics. The thermodynamic and kinetic data are compared to available data for similar mutations in other proteins, with excellent agreement. The results have identified that Gly at the i+3 position within a subset of beta-hairpin turns is a key contributor towards increasing the rate of folding to the native state of the polypeptide while leaving the rate of unfolding largely unchanged.
Ab initio protein structure assembly using continuous structure fragments and optimized knowledge-based force field.

PubMed

Xu, Dong; Zhang, Yang

2012-07-01

Ab initio protein folding is one of the major unsolved problems in computational biology owing to the difficulties in force field design and conformational search. We developed a novel program, QUARK, for template-free protein structure prediction. Query sequences are first broken into fragments of 1-20 residues where multiple fragment structures are retrieved at each position from unrelated experimental structures. Full-length structure models are then assembled from fragments using replica-exchange Monte Carlo simulations, which are guided by a composite knowledge-based force field. A number of novel energy terms and Monte Carlo movements are introduced and the particular contributions to enhancing the efficiency of both force field and search engine are analyzed in detail. QUARK prediction procedure is depicted and tested on the structure modeling of 145 nonhomologous proteins. Although no global templates are used and all fragments from experimental structures with template modeling score >0.5 are excluded, QUARK can successfully construct 3D models of correct folds in one-third cases of short proteins up to 100 residues. In the ninth community-wide Critical Assessment of protein Structure Prediction experiment, QUARK server outperformed the second and third best servers by 18 and 47% based on the cumulative Z-score of global distance test-total scores in the FM category. Although ab initio protein folding remains a significant challenge, these data demonstrate new progress toward the solution of the most important problem in the field. Copyright © 2012 Wiley Periodicals, Inc.
FRAN and RBF-PSO as two components of a hyper framework to recognize protein folds.

PubMed

Abbasi, Elham; Ghatee, Mehdi; Shiri, M E

2013-09-01

In this paper, an intelligent hyper framework is proposed to recognize protein folds from its amino acid sequence which is a fundamental problem in bioinformatics. This framework includes some statistical and intelligent algorithms for proteins classification. The main components of the proposed framework are the Fuzzy Resource-Allocating Network (FRAN) and the Radial Bases Function based on Particle Swarm Optimization (RBF-PSO). FRAN applies a dynamic method to tune up the RBF network parameters. Due to the patterns complexity captured in protein dataset, FRAN classifies the proteins under fuzzy conditions. Also, RBF-PSO applies PSO to tune up the RBF classifier. Experimental results demonstrate that FRAN improves prediction accuracy up to 51% and achieves acceptable multi-class results for protein fold prediction. Although RBF-PSO provides reasonable results for protein fold recognition up to 48%, it is weaker than FRAN in some cases. However the proposed hyper framework provides an opportunity to use a great range of intelligent methods and can learn from previous experiences. Thus it can avoid the weakness of some intelligent methods in terms of memory, computational time and static structure. Furthermore, the performance of this system can be enhanced throughout the system life-cycle. Copyright © 2013 Elsevier Ltd. All rights reserved.
Approaching the thermodynamic view of protein folding through the reproduction of Anfinsen's experiment by undergraduate physical biochemistry students.

PubMed

Fernandez-Reche, Andres; Cobos, Eva S; Luque, Irene; Ruiz-Sanz, Javier; Martinez, Jose C

2018-01-04

In 1972 Christian B. Anfinsen received the Nobel Prize in Chemistry for "…his work on ribonuclease, especially concerning the connection between the amino acid sequence and the biologically active conformation." The understanding of this principle is crucial for physical biochemistry students, since protein folding studies, bio-computing sciences and protein design approaches are founded on such a well-demonstrated connection. Herein, we describe a detailed and easy-to-follow experiment to reproduce the most relevant assays carried out at Anfinsen's laboratory in the 60s. This experiment provides students with a platform to interpret by themselves the structural and kinetic experiments conceived to understand the protein folding problem. In addition, this three-day experiment brings students a nice opportunity for protein manipulation as well as for the setting up of spectroscopic and chromatographic techniques. © 2018 by The International Union of Biochemistry and Molecular Biology, 2018. © 2018 The International Union of Biochemistry and Molecular Biology.
Effect of the geometry of confining media on the stability and folding rate of α -helix proteins

NASA Astrophysics Data System (ADS)

Wang, Congyue; Piroozan, Nariman; Javidpour, Leili; Sahimi, Muhammad

2018-05-01

Protein folding in confined media has attracted wide attention over the past 15 years due to its importance to both in vivo and in vitro applications. It is generally believed that protein stability increases by decreasing the size of the confining medium, if the medium's walls are repulsive, and that the maximum folding temperature in confinement is in a pore whose size D0 is only slightly larger than the smallest dimension of a protein's folded state. Until recently, the stability of proteins in pores with a size very close to that of the folded state has not received the attention it deserves. In a previous paper [L. Javidpour and M. Sahimi, J. Chem. Phys. 135, 125101 (2011)], we showed that, contrary to the current theoretical predictions, the maximum folding temperature occurs in larger pores for smaller α-helices. Moreover, in very tight pores, the free energy surface becomes rough, giving rise to a new barrier for protein folding close to the unfolded state. In contrast to unbounded domains, in small nanopores proteins with an α-helical native state that contain the β structures are entropically stabilized implying that folding rates decrease notably and that the free energy surface becomes rougher. In view of the potential significance of such results to interpretation of many sets of experimental data that could not be explained by the current theories, particularly the reported anomalously low rates of folding and the importance of entropic effects on proteins' misfolded states in highly confined environments, we address the following question in the present paper: To what extent the geometry of a confined medium affects the stability and folding rates of proteins? Using millisecond-long molecular dynamics simulations, we study the problem in three types of confining media, namely, cylindrical and slit pores and spherical cavities. Most importantly, we find that the prediction of the previous theories that the dependence of the maximum folding temperature Tf on the size D of a confined medium occurs in larger media for larger proteins is correct only in spherical geometry, whereas the opposite is true in the two other geometries that we study. Also studied is the effect of the strength of the interaction between the confined media's walls and the proteins. If the walls are only weakly or moderately attractive, a complex behavior emerges that depends on the size of the confining medium.
Protein fold recognition using geometric kernel data fusion.

PubMed

Zakeri, Pooya; Jeuris, Ben; Vandebril, Raf; Moreau, Yves

2014-07-01

Various approaches based on features extracted from protein sequences and often machine learning methods have been used in the prediction of protein folds. Finding an efficient technique for integrating these different protein features has received increasing attention. In particular, kernel methods are an interesting class of techniques for integrating heterogeneous data. Various methods have been proposed to fuse multiple kernels. Most techniques for multiple kernel learning focus on learning a convex linear combination of base kernels. In addition to the limitation of linear combinations, working with such approaches could cause a loss of potentially useful information. We design several techniques to combine kernel matrices by taking more involved, geometry inspired means of these matrices instead of convex linear combinations. We consider various sequence-based protein features including information extracted directly from position-specific scoring matrices and local sequence alignment. We evaluate our methods for classification on the SCOP PDB-40D benchmark dataset for protein fold recognition. The best overall accuracy on the protein fold recognition test set obtained by our methods is ∼ 86.7%. This is an improvement over the results of the best existing approach. Moreover, our computational model has been developed by incorporating the functional domain composition of proteins through a hybridization model. It is observed that by using our proposed hybridization model, the protein fold recognition accuracy is further improved to 89.30%. Furthermore, we investigate the performance of our approach on the protein remote homology detection problem by fusing multiple string kernels. The MATLAB code used for our proposed geometric kernel fusion frameworks are publicly available at http://people.cs.kuleuven.be/∼raf.vandebril/homepage/software/geomean.php?menu=5/. © The Author 2014. Published by Oxford University Press.
SIMPLE estimate of the free energy change due to aliphatic mutations: superior predictions based on first principles.

PubMed

Bueno, Marta; Camacho, Carlos J; Sancho, Javier

2007-09-01

The bioinformatics revolution of the last decade has been instrumental in the development of empirical potentials to quantitatively estimate protein interactions for modeling and design. Although computationally efficient, these potentials hide most of the relevant thermodynamics in 5-to-40 parameters that are fitted against a large experimental database. Here, we revisit this longstanding problem and show that a careful consideration of the change in hydrophobicity, electrostatics, and configurational entropy between the folded and unfolded state of aliphatic point mutations predicts 20-30% less false positives and yields more accurate predictions than any published empirical energy function. This significant improvement is achieved with essentially no free parameters, validating past theoretical and experimental efforts to understand the thermodynamics of protein folding. Our first principle analysis strongly suggests that both the solute-solute van der Waals interactions in the folded state and the electrostatics free energy change of exposed aliphatic mutations are almost completely compensated by similar interactions operating in the unfolded ensemble. Not surprisingly, the problem of properly accounting for the solvent contribution to the free energy of polar and charged group mutations, as well as of mutations that disrupt the protein backbone remains open. 2007 Wiley-Liss, Inc.
Exploration of the relationship between topology and designability of conformations

NASA Astrophysics Data System (ADS)

Leelananda, Sumudu P.; Towfic, Fadi; Jernigan, Robert L.; Kloczkowski, Andrzej

2011-06-01

Protein structures are evolutionarily more conserved than sequences, and sequences with very low sequence identity frequently share the same fold. This leads to the concept of protein designability. Some folds are more designable and lots of sequences can assume that fold. Elucidating the relationship between protein sequence and the three-dimensional (3D) structure that the sequence folds into is an important problem in computational structural biology. Lattice models have been utilized in numerous studies to model protein folds and predict the designability of certain folds. In this study, all possible compact conformations within a set of two-dimensional and 3D lattice spaces are explored. Complementary interaction graphs are then generated for each conformation and are described using a set of graph features. The full HP sequence space for each lattice model is generated and contact energies are calculated by threading each sequence onto all the possible conformations. Unique conformation giving minimum energy is identified for each sequence and the number of sequences folding to each conformation (designability) is obtained. Machine learning algorithms are used to predict the designability of each conformation. We find that the highly designable structures can be distinguished from other non-designable conformations based on certain graphical geometric features of the interactions. This finding confirms the fact that the topology of a conformation is an important determinant of the extent of its designability and suggests that the interactions themselves are important for determining the designability.
Ab Initio Protein Structure Assembly Using Continuous Structure Fragments and Optimized Knowledge-based Force Field

PubMed Central

Xu, Dong; Zhang, Yang

2012-01-01

Ab initio protein folding is one of the major unsolved problems in computational biology due to the difficulties in force field design and conformational search. We developed a novel program, QUARK, for template-free protein structure prediction. Query sequences are first broken into fragments of 1–20 residues where multiple fragment structures are retrieved at each position from unrelated experimental structures. Full-length structure models are then assembled from fragments using replica-exchange Monte Carlo simulations, which are guided by a composite knowledge-based force field. A number of novel energy terms and Monte Carlo movements are introduced and the particular contributions to enhancing the efficiency of both force field and search engine are analyzed in detail. QUARK prediction procedure is depicted and tested on the structure modeling of 145 non-homologous proteins. Although no global templates are used and all fragments from experimental structures with template modeling score (TM-score) >0.5 are excluded, QUARK can successfully construct 3D models of correct folds in 1/3 cases of short proteins up to 100 residues. In the ninth community-wide Critical Assessment of protein Structure Prediction (CASP9) experiment, QUARK server outperformed the second and third best servers by 18% and 47% based on the cumulative Z-score of global distance test-total (GDT-TS) scores in the free modeling (FM) category. Although ab initio protein folding remains a significant challenge, these data demonstrate new progress towards the solution of the most important problem in the field. PMID:22411565

Conservation of the C-type lectin fold for massive sequence variation in a Treponema diversity-generating retroelement

DOE Office of Scientific and Technical Information (OSTI.GOV)

Le Coq, Johanne; Ghosh, Partho

2012-06-19

Anticipatory ligand binding through massive protein sequence variation is rare in biological systems, having been observed only in the vertebrate adaptive immune response and in a phage diversity-generating retroelement (DGR). Earlier work has demonstrated that the prototypical DGR variable protein, major tropism determinant (Mtd), meets the demands of anticipatory ligand binding by novel means through the C-type lectin (CLec) fold. However, because of the low sequence identity among DGR variable proteins, it has remained unclear whether the CLec fold is a general solution for DGRs. We have addressed this problem by determining the structure of a second DGR variable protein,more » TvpA, from the pathogenic oral spirochete Treponema denticola. Despite its weak sequence identity to Mtd ({approx}16%), TvpA was found to also have a CLec fold, with predicted variable residues exposed in a ligand-binding site. However, this site in TvpA was markedly more variable than the one in Mtd, reflecting the unprecedented approximate 10{sup 20} potential variability of TvpA. In addition, similarity between TvpA and Mtd with formylglycine-generating enzymes was detected. These results provide strong evidence for the conservation of the formylglycine-generating enzyme-type CLec fold among DGRs as a means of accommodating massive sequence variation.« less
Conservation of the C-type lectin fold for massive sequence variation in a Treponema diversity-generating retroelement

PubMed Central

Le Coq, Johanne; Ghosh, Partho

2011-01-01

Anticipatory ligand binding through massive protein sequence variation is rare in biological systems, having been observed only in the vertebrate adaptive immune response and in a phage diversity-generating retroelement (DGR). Earlier work has demonstrated that the prototypical DGR variable protein, major tropism determinant (Mtd), meets the demands of anticipatory ligand binding by novel means through the C-type lectin (CLec) fold. However, because of the low sequence identity among DGR variable proteins, it has remained unclear whether the CLec fold is a general solution for DGRs. We have addressed this problem by determining the structure of a second DGR variable protein, TvpA, from the pathogenic oral spirochete Treponema denticola. Despite its weak sequence identity to Mtd (∼16%), TvpA was found to also have a CLec fold, with predicted variable residues exposed in a ligand-binding site. However, this site in TvpA was markedly more variable than the one in Mtd, reflecting the unprecedented approximate 1020 potential variability of TvpA. In addition, similarity between TvpA and Mtd with formylglycine-generating enzymes was detected. These results provide strong evidence for the conservation of the formylglycine-generating enzyme-type CLec fold among DGRs as a means of accommodating massive sequence variation. PMID:21873231
Conservation of the C-type lectin fold for massive sequence variation in a Treponema diversity-generating retroelement.

PubMed

Le Coq, Johanne; Ghosh, Partho

2011-08-30

Anticipatory ligand binding through massive protein sequence variation is rare in biological systems, having been observed only in the vertebrate adaptive immune response and in a phage diversity-generating retroelement (DGR). Earlier work has demonstrated that the prototypical DGR variable protein, major tropism determinant (Mtd), meets the demands of anticipatory ligand binding by novel means through the C-type lectin (CLec) fold. However, because of the low sequence identity among DGR variable proteins, it has remained unclear whether the CLec fold is a general solution for DGRs. We have addressed this problem by determining the structure of a second DGR variable protein, TvpA, from the pathogenic oral spirochete Treponema denticola. Despite its weak sequence identity to Mtd (∼16%), TvpA was found to also have a CLec fold, with predicted variable residues exposed in a ligand-binding site. However, this site in TvpA was markedly more variable than the one in Mtd, reflecting the unprecedented approximate 10(20) potential variability of TvpA. In addition, similarity between TvpA and Mtd with formylglycine-generating enzymes was detected. These results provide strong evidence for the conservation of the formylglycine-generating enzyme-type CLec fold among DGRs as a means of accommodating massive sequence variation.
Interplay of I-TASSER and QUARK for template-based and ab initio protein structure prediction in CASP10

PubMed Central

Zhang, Yang

2014-01-01

We develop and test a new pipeline in CASP10 to predict protein structures based on an interplay of I-TASSER and QUARK for both free-modeling (FM) and template-based modeling (TBM) targets. The most noteworthy observation is that sorting through the threading template pool using the QUARK-based ab initio models as probes allows the detection of distant-homology templates which might be ignored by the traditional sequence profile-based threading alignment algorithms. Further template assembly refinement by I-TASSER resulted in successful folding of two medium-sized FM targets with >150 residues. For TBM, the multiple threading alignments from LOMETS are, for the first time, incorporated into the ab initio QUARK simulations, which were further refined by I-TASSER assembly refinement. Compared with the traditional threading assembly refinement procedures, the inclusion of the threading-constrained ab initio folding models can consistently improve the quality of the full-length models as assessed by the GDT-HA and hydrogen-bonding scores. Despite the success, significant challenges still exist in domain boundary prediction and consistent folding of medium-size proteins (especially beta-proteins) for nonhomologous targets. Further developments of sensitive fold-recognition and ab initio folding methods are critical for solving these problems. PMID:23760925
Interplay of I-TASSER and QUARK for template-based and ab initio protein structure prediction in CASP10.

PubMed

Zhang, Yang

2014-02-01

We develop and test a new pipeline in CASP10 to predict protein structures based on an interplay of I-TASSER and QUARK for both free-modeling (FM) and template-based modeling (TBM) targets. The most noteworthy observation is that sorting through the threading template pool using the QUARK-based ab initio models as probes allows the detection of distant-homology templates which might be ignored by the traditional sequence profile-based threading alignment algorithms. Further template assembly refinement by I-TASSER resulted in successful folding of two medium-sized FM targets with >150 residues. For TBM, the multiple threading alignments from LOMETS are, for the first time, incorporated into the ab initio QUARK simulations, which were further refined by I-TASSER assembly refinement. Compared with the traditional threading assembly refinement procedures, the inclusion of the threading-constrained ab initio folding models can consistently improve the quality of the full-length models as assessed by the GDT-HA and hydrogen-bonding scores. Despite the success, significant challenges still exist in domain boundary prediction and consistent folding of medium-size proteins (especially beta-proteins) for nonhomologous targets. Further developments of sensitive fold-recognition and ab initio folding methods are critical for solving these problems. Copyright © 2013 Wiley Periodicals, Inc.
Protein folding: the optically induced electronic excitations model

NASA Astrophysics Data System (ADS)

Jeknić-Dugić, J.

2009-07-01

The large-molecules conformational transitions problem (the 'protein folding problem') is an open issue of vivid current science research work of fundamental importance for a number of modern science disciplines as well as for nanotechnology. Here, we elaborate the recently proposed quantum-decoherence-based approach to the issue. First, we emphasize a need for detecting the elementary quantum mechanical processes (whose combinations may give a proper description of the realistic experimental situations) and then we design such a model. As distinct from the standard approach that deals with the conformation system, we investigate the optically induced transitions in the molecule electrons system that, in effect, may give rise to a conformation change in the molecule. Our conclusion is that such a model may describe the comparatively slow conformational transitions.
Application of Time-Resolved Tryptophan Phosphorescence Spectroscopy to Protein Folding Studies.

NASA Astrophysics Data System (ADS)

Subramaniam, Vinod

This thesis presents studies of the protein folding problem, one of the most significant questions in contemporary biophysics. Sensitive biophysical techniques, including room temperature tryptophan phosphorescence, which reports on the local environment of the residue, and the lability of proteins to denaturation, a global parameter, were used to assess the validity of the traditional assumption that the biologically active state of a protein is the 'native' state, and to determine whether the pathways of folding in vitro lead to the folded state achieved in vivo. Phosphorescence techniques have also been extended to study, for the first time, emission from tryptophan residues engineered into specific positions as reporters of protein structure. During in vitro refolding of E. coli alkaline phosphatase and bovine 13-lactoglobulin, significant differences were found between the refolded proteins and the native conformations, which have no apparent effect on the biological functions. Slow conformational transitions, termed 'annealing,' that occur long after the return of enzyme activity of alkaline phosphatase are manifested in the retarded recovery of phosphorescence intensity, lifetime, and protein lability. While 'annealing' is not observed for beta -lactoglobulin, both phosphorescence and lability experiments reveal changes in the structure of the refolded protein, even though its biological activity, retinol binding, is fully recovered. This result suggests that the pathways of folding in vitro need not lead to the structure formed in vivo. We have used phosphorescence techniques to study the refolding of ribonuclease T1, which exhibits slow kinetics characteristic of proline isomerization. Furthermore, the ability to extract structural information from phosphorescent tryptophan probes engineered into selected regions represents an important advance in studying protein structure; we have reported the first such results from a mutant staphylococcal nuclease. The refolding data have been interpreted in the context of recent theoretical work on rugged energy landscape models of protein folding. Our results suggest that the barriers to folding can be as large as ~ 20 kcal-mol^{-1}, and imply that the conventional definition of the 'native' state as the biologically active conformation may need revision to acknowledge that the active state may represent a long-lived intermediate on the pathway to the native structure.
Synthetic beta-solenoid proteins with the fragment-free computational design of a beta-hairpin extension

PubMed Central

MacDonald, James T.; Kabasakal, Burak V.; Godding, David; Kraatz, Sebastian; Henderson, Louie; Barber, James; Freemont, Paul S.; Murray, James W.

2016-01-01

The ability to design and construct structures with atomic level precision is one of the key goals of nanotechnology. Proteins offer an attractive target for atomic design because they can be synthesized chemically or biologically and can self-assemble. However, the generalized protein folding and design problem is unsolved. One approach to simplifying the problem is to use a repetitive protein as a scaffold. Repeat proteins are intrinsically modular, and their folding and structures are better understood than large globular domains. Here, we have developed a class of synthetic repeat proteins based on the pentapeptide repeat family of beta-solenoid proteins. We have constructed length variants of the basic scaffold and computationally designed de novo loops projecting from the scaffold core. The experimentally solved 3.56-Å resolution crystal structure of one designed loop matches closely the designed hairpin structure, showing the computational design of a backbone extension onto a synthetic protein core without the use of backbone fragments from known structures. Two other loop designs were not clearly resolved in the crystal structures, and one loop appeared to be in an incorrect conformation. We have also shown that the repeat unit can accommodate whole-domain insertions by inserting a domain into one of the designed loops. PMID:27573845
Anomalous diffusion in neutral evolution of model proteins.

PubMed

Nelson, Erik D; Grishin, Nick V

2015-06-01

Protein evolution is frequently explored using minimalist polymer models, however, little attention has been given to the problem of structural drift, or diffusion. Here, we study neutral evolution of small protein motifs using an off-lattice heteropolymer model in which individual monomers interact as low-resolution amino acids. In contrast to most earlier models, both the length and folded structure of the polymers are permitted to change. To describe structural change, we compute the mean-square distance (MSD) between monomers in homologous folds separated by n neutral mutations. We find that structural change is episodic, and, averaged over lineages (for example, those extending from a single sequence), exhibits a power-law dependence on n. We show that this exponent depends on the alignment method used, and we analyze the distribution of waiting times between neutral mutations. The latter are more disperse than for models required to maintain a specific fold, but exhibit a similar power-law tail.
Anomalous diffusion in neutral evolution of model proteins

NASA Astrophysics Data System (ADS)

Nelson, Erik D.; Grishin, Nick V.

2015-06-01

Protein evolution is frequently explored using minimalist polymer models, however, little attention has been given to the problem of structural drift, or diffusion. Here, we study neutral evolution of small protein motifs using an off-lattice heteropolymer model in which individual monomers interact as low-resolution amino acids. In contrast to most earlier models, both the length and folded structure of the polymers are permitted to change. To describe structural change, we compute the mean-square distance (MSD) between monomers in homologous folds separated by n neutral mutations. We find that structural change is episodic, and, averaged over lineages (for example, those extending from a single sequence), exhibits a power-law dependence on n . We show that this exponent depends on the alignment method used, and we analyze the distribution of waiting times between neutral mutations. The latter are more disperse than for models required to maintain a specific fold, but exhibit a similar power-law tail.
An overview on molecular chaperones enhancing solubility of expressed recombinant proteins with correct folding.

PubMed

Mamipour, Mina; Yousefi, Mohammadreza; Hasanzadeh, Mohammad

2017-09-01

The majority of research topics declared that most of the recombinant proteins have been expressed by Escherichia coli in basic investigations. But the majority of high expressed proteins formed as inactive recombinant proteins that are called inclusion body. To overcome this problem, several methods have been used including suitable promoter, environmental factors, ladder tag to secretion of proteins into the periplasm, gene protein optimization, chemical chaperones and molecular chaperones sets. Co-expression of the interest protein with molecular chaperones is one of the common methods The chaperones are a group of proteins, which are involved in making correct folding of recombinant proteins. Chaperones are divided two groups including; cytoplasmic and periplasmic chaperones. Moreover, periplasmic chaperones and proteases can be manipulated to increase the yields of secreted proteins. In this article, we attempted to review cytoplasmic chaperones such as Hsp families and periplasmic chaperones including; generic chaperones, specialized chaperones, PPIases, and proteins involved in disulfide bond formation. Copyright © 2017 Elsevier B.V. All rights reserved.
The molecular matching problem

NASA Technical Reports Server (NTRS)

Kincaid, Rex K.

1993-01-01

Molecular chemistry contains many difficult optimization problems that have begun to attract the attention of optimizers in the Operations Research community. Problems including protein folding, molecular conformation, molecular similarity, and molecular matching have been addressed. Minimum energy conformations for simple molecular structures such as water clusters, Lennard-Jones microclusters, and short polypeptides have dominated the literature to date. However, a variety of interesting problems exist and we focus here on a molecular structure matching (MSM) problem.
Efficient molecular mechanics simulations of the folding, orientation, and assembly of peptides in lipid bilayers using an implicit atomic solvation model

NASA Astrophysics Data System (ADS)

Bordner, Andrew J.; Zorman, Barry; Abagyan, Ruben

2011-10-01

Membrane proteins comprise a significant fraction of the proteomes of sequenced organisms and are the targets of approximately half of marketed drugs. However, in spite of their prevalence and biomedical importance, relatively few experimental structures are available due to technical challenges. Computational simulations can potentially address this deficit by providing structural models of membrane proteins. Solvation within the spatially heterogeneous membrane/solvent environment provides a major component of the energetics driving protein folding and association within the membrane. We have developed an implicit solvation model for membranes that is both computationally efficient and accurate enough to enable molecular mechanics predictions for the folding and association of peptides within the membrane. We derived the new atomic solvation model parameters using an unbiased fitting procedure to experimental data and have applied it to diverse problems in order to test its accuracy and to gain insight into membrane protein folding. First, we predicted the positions and orientations of peptides and complexes within the lipid bilayer and compared the simulation results with solid-state NMR structures. Additionally, we performed folding simulations for a series of host-guest peptides with varying propensities to form alpha helices in a hydrophobic environment and compared the structures with experimental measurements. We were also able to successfully predict the structures of amphipathic peptides as well as the structures for dimeric complexes of short hexapeptides that have experimentally characterized propensities to form beta sheets within the membrane. Finally, we compared calculated relative transfer energies with data from experiments measuring the effects of mutations on the free energies of translocon-mediated insertion of proteins into lipid bilayers and of combined folding and membrane insertion of a beta barrel protein.
Combinatorial pattern discovery approach for the folding trajectory analysis of a beta-hairpin.

PubMed

Parida, Laxmi; Zhou, Ruhong

2005-06-01

The study of protein folding mechanisms continues to be one of the most challenging problems in computational biology. Currently, the protein folding mechanism is often characterized by calculating the free energy landscape versus various reaction coordinates, such as the fraction of native contacts, the radius of gyration, RMSD from the native structure, and so on. In this paper, we present a combinatorial pattern discovery approach toward understanding the global state changes during the folding process. This is a first step toward an unsupervised (and perhaps eventually automated) approach toward identification of global states. The approach is based on computing biclusters (or patterned clusters)-each cluster is a combination of various reaction coordinates, and its signature pattern facilitates the computation of the Z-score for the cluster. For this discovery process, we present an algorithm of time complexity c in RO((N + nm) log n), where N is the size of the output patterns and (n x m) is the size of the input with n time frames and m reaction coordinates. To date, this is the best time complexity for this problem. We next apply this to a beta-hairpin folding trajectory and demonstrate that this approach extracts crucial information about protein folding intermediate states and mechanism. We make three observations about the approach: (1) The method recovers states previously obtained by visually analyzing free energy surfaces. (2) It also succeeds in extracting meaningful patterns and structures that had been overlooked in previous works, which provides a better understanding of the folding mechanism of the beta-hairpin. These new patterns also interconnect various states in existing free energy surfaces versus different reaction coordinates. (3) The approach does not require calculating the free energy values, yet it offers an analysis comparable to, and sometimes better than, the methods that use free energy landscapes, thus validating the choice of reaction coordinates. (An abstract version of this work was presented at the 2005 Asia Pacific Bioinformatics Conference [1].).
Improved genetic algorithm for the protein folding problem by use of a Cartesian combination operator.

PubMed Central

Rabow, A. A.; Scheraga, H. A.

1996-01-01

We have devised a Cartesian combination operator and coding scheme for improving the performance of genetic algorithms applied to the protein folding problem. The genetic coding consists of the C alpha Cartesian coordinates of the protein chain. The recombination of the genes of the parents is accomplished by: (1) a rigid superposition of one parent chain on the other, to make the relation of Cartesian coordinates meaningful, then, (2) the chains of the children are formed through a linear combination of the coordinates of their parents. The children produced with this Cartesian combination operator scheme have similar topology and retain the long-range contacts of their parents. The new scheme is significantly more efficient than the standard genetic algorithm methods for locating low-energy conformations of proteins. The considerable superiority of genetic algorithms over Monte Carlo optimization methods is also demonstrated. We have also devised a new dynamic programming lattice fitting procedure for use with the Cartesian combination operator method. The procedure finds excellent fits of real-space chains to the lattice while satisfying bond-length, bond-angle, and overlap constraints. PMID:8880904
The protein structure prediction problem could be solved using the current PDB library

PubMed Central

Zhang, Yang; Skolnick, Jeffrey

2005-01-01

For single-domain proteins, we examine the completeness of the structures in the current Protein Data Bank (PDB) library for use in full-length model construction of unknown sequences. To address this issue, we employ a comprehensive benchmark set of 1,489 medium-size proteins that cover the PDB at the level of 35% sequence identity and identify templates by structure alignment. With homologous proteins excluded, we can always find similar folds to native with an average rms deviation (RMSD) from native of 2.5 Å with ≈82% alignment coverage. These template structures often contain a significant number of insertions/deletions. The tasser algorithm was applied to build full-length models, where continuous fragments are excised from the top-scoring templates and reassembled under the guide of an optimized force field, which includes consensus restraints taken from the templates and knowledge-based statistical potentials. For almost all targets (except for 2/1,489), the resultant full-length models have an RMSD to native below 6 Å (97% of them below 4 Å). On average, the RMSD of full-length models is 2.25 Å, with aligned regions improved from 2.5 Å to 1.88 Å, comparable with the accuracy of low-resolution experimental structures. Furthermore, starting from state-of-the-art structural alignments, we demonstrate a methodology that can consistently bring template-based alignments closer to native. These results are highly suggestive that the protein-folding problem can in principle be solved based on the current PDB library by developing efficient fold recognition algorithms that can recover such initial alignments. PMID:15653774
Unrelated solubility-enhancing fusion partners MBP and NusA utilize a similar mode of action

PubMed Central

Raran-Kurussi, Sreejith; Waugh, David S.

2014-01-01

The tendency of recombinant proteins to accumulate in the form of insoluble aggregates in Escherichia coli is a major hindrance to their overproduction. One of the more effective approaches to circumvent this problem is to use translation fusion partners (solubility-enhancers, SEs). E. coli maltose binding protein (MBP) and N-utilization substance A (NusA) are arguably the most effective solubilizing agents that have been discovered so far. Here, we show that although these two proteins are structurally, functionally, and physiochemically distinct, they influence the solubility and folding of their fusion partners in a very similar manner. These SEs act as “holdases” that prevent the aggregation of their fusion partners. Subsequent folding of the passenger proteins, when it occurs, is either spontaneous or chaperone-mediated. PMID:24942647
RNAslider: a faster engine for consecutive windows folding and its application to the analysis of genomic folding asymmetry.

PubMed

Horesh, Yair; Wexler, Ydo; Lebenthal, Ilana; Ziv-Ukelson, Michal; Unger, Ron

2009-03-04

Scanning large genomes with a sliding window in search of locally stable RNA structures is a well motivated problem in bioinformatics. Given a predefined window size L and an RNA sequence S of size N (L < N), the consecutive windows folding problem is to compute the minimal free energy (MFE) for the folding of each of the L-sized substrings of S. The consecutive windows folding problem can be naively solved in O(NL3) by applying any of the classical cubic-time RNA folding algorithms to each of the N-L windows of size L. Recently an O(NL2) solution for this problem has been described. Here, we describe and implement an O(NLpsi(L)) engine for the consecutive windows folding problem, where psi(L) is shown to converge to O(1) under the assumption of a standard probabilistic polymer folding model, yielding an O(L) speedup which is experimentally confirmed. Using this tool, we note an intriguing directionality (5'-3' vs. 3'-5') folding bias, i.e. that the minimal free energy (MFE) of folding is higher in the native direction of the DNA than in the reverse direction of various genomic regions in several organisms including regions of the genomes that do not encode proteins or ncRNA. This bias largely emerges from the genomic dinucleotide bias which affects the MFE, however we see some variations in the folding bias in the different genomic regions when normalized to the dinucleotide bias. We also present results from calculating the MFE landscape of a mouse chromosome 1, characterizing the MFE of the long ncRNA molecules that reside in this chromosome. The efficient consecutive windows folding engine described in this paper allows for genome wide scans for ncRNA molecules as well as large-scale statistics. This is implemented here as a software tool, called RNAslider, and applied to the scanning of long chromosomes, leading to the observation of features that are visible only on a large scale.
The Multiple-Minima Problem in Protein Folding

NASA Astrophysics Data System (ADS)

Scheraga, Harold A.

1991-10-01

The conformational energy surface of a polypeptide or protein has many local minima, and conventional energy minimization procedures reach only a local minimum (near the starting point of the optimization algorithm) instead of the global minimum (the multiple-minima problem). Several procedures have been developed to surmount this problem, the most promising of which are: (a) build up procedure, (b) optimization of electrostatics, (c) Monte Carlo-plus-energy minimization, (d) electrostatically-driven Monte Carlo, (e) inclusion of distance restraints, (f) adaptive importance-sampling Monte Carlo, (g) relaxation of dimensionality, (h) pattern-recognition, and (i) diffusion equation method. These procedures have been applied to a variety of polypeptide structural problems, and the results of such computations are presented. These include the computation of the structures of open-chain and cyclic peptides, fibrous proteins and globular proteins. Present efforts are being devoted to scaling up these procedures from small polypeptides to proteins, to try to compute the three-dimensional structure of a protein from its amino sequence.
CCBuilder: an interactive web-based tool for building, designing and assessing coiled-coil protein assemblies.

PubMed

Wood, Christopher W; Bruning, Marc; Ibarra, Amaurys Á; Bartlett, Gail J; Thomson, Andrew R; Sessions, Richard B; Brady, R Leo; Woolfson, Derek N

2014-11-01

The ability to accurately model protein structures at the atomistic level underpins efforts to understand protein folding, to engineer natural proteins predictably and to design proteins de novo. Homology-based methods are well established and produce impressive results. However, these are limited to structures presented by and resolved for natural proteins. Addressing this problem more widely and deriving truly ab initio models requires mathematical descriptions for protein folds; the means to decorate these with natural, engineered or de novo sequences; and methods to score the resulting models. We present CCBuilder, a web-based application that tackles the problem for a defined but large class of protein structure, the α-helical coiled coils. CCBuilder generates coiled-coil backbones, builds side chains onto these frameworks and provides a range of metrics to measure the quality of the models. Its straightforward graphical user interface provides broad functionality that allows users to build and assess models, in which helix geometry, coiled-coil architecture and topology and protein sequence can be varied rapidly. We demonstrate the utility of CCBuilder by assembling models for 653 coiled-coil structures from the PDB, which cover >96% of the known coiled-coil types, and by generating models for rarer and de novo coiled-coil structures. CCBuilder is freely available, without registration, at http://coiledcoils.chm.bris.ac.uk/app/cc_builder/. © The Author 2014. Published by Oxford University Press.

Revisiting the NMR structure of the ultrafast downhill folding protein gpW from bacteriophage λ.

PubMed

Sborgi, Lorenzo; Verma, Abhinav; Muñoz, Victor; de Alba, Eva

2011-01-01

GpW is a 68-residue protein from bacteriophage λ that participates in virus head morphogenesis. Previous NMR studies revealed a novel α+β fold for this protein. Recent experiments have shown that gpW folds in microseconds by crossing a marginal free energy barrier (i.e., downhill folding). These features make gpW a highly desirable target for further experimental and computational folding studies. As a step in that direction, we have re-determined the high-resolution structure of gpW by multidimensional NMR on a construct that eliminates the purification tags and unstructured C-terminal tail present in the prior study. In contrast to the previous work, we have obtained a full manual assignment and calculated the structure using only unambiguous distance restraints. This new structure confirms the α+β topology, but reveals important differences in tertiary packing. Namely, the two α-helices are rotated along their main axis to form a leucine zipper. The β-hairpin is orthogonal to the helical interface rather than parallel, displaying most tertiary contacts through strand 1. There also are differences in secondary structure: longer and less curved helices and a hairpin that now shows the typical right-hand twist. Molecular dynamics simulations starting from both gpW structures, and calculations with CS-Rosetta, all converge to our gpW structure. This confirms that the original structure has strange tertiary packing and strained secondary structure. A comparison of NMR datasets suggests that the problems were mainly caused by incomplete chemical shift assignments, mistakes in NOE assignment and the inclusion of ambiguous distance restraints during the automated procedure used in the original study. The new gpW corrects these problems, providing the appropriate structural reference for future work. Furthermore, our results are a cautionary tale against the inclusion of ambiguous experimental information in the determination of protein structures.
The transcriptional response of Escherichia coli to recombinant protein insolubility.

PubMed

Smith, Harold E

2007-03-01

Bacterial production of recombinant proteins offers several advantages over alternative expression methods and remains the system of choice for many structural genomics projects. However, a large percentage of targets accumulate as insoluble inclusion bodies rather than soluble protein, creating a significant bottleneck in the protein production pipeline. Numerous strategies have been reported that can improve in vivo protein solubility, but most do not scale easily for high-throughput expression screening. To understand better the host cell response to the accumulation of insoluble protein, we determined genome-wide changes in bacterial gene expression upon induction of either soluble or insoluble target proteins. By comparing transcriptional profiles for multiple examples from the soluble or insoluble class, we identified a pattern of gene expression that correlates strongly with protein solubility. Direct targets of the sigma32 heat shock sigma factor, which includes genes involved in protein folding and degradation, were highly expressed in response to induction of insoluble protein. This same group of genes was also upregulated by insoluble protein accumulation under a different growth regime, indicating that sigma32-mediated gene expression is a general response to protein insolubility. This knowledge provides a starting point for the rational design of growth parameters and host strains with improved protein solubility characteristics. Summary Problems with protein solubility are frequently encountered when recombinant proteins are expressed in E. coli. The bacterial host responds to this problem by increasing expression of the protein folding machinery via the heat shock sigma factor sigma32. Manipulation of the sigma32 regulon might provide a general mechanism for improving recombinant protein solubility.
Genetic Algorithms and Their Application to the Protein Folding Problem

DTIC Science & Technology

1993-12-01

and symbolic methods, random methods such as Monte Carlo simulation and simulated annealing, distance geometry, and molecular dynamics. Many of these...calculated energies with those obtained using the molecular simulation software package called CHARMm. 10 9) Test both the simple and parallel simpie genetic...homology-based, and simplification techniques. 3.21 Molecular Dynamics. Perhaps the most natural approach is to actually simulate the folding process. This
Computational design of water-soluble α-helical barrels.

PubMed

Thomson, Andrew R; Wood, Christopher W; Burton, Antony J; Bartlett, Gail J; Sessions, Richard B; Brady, R Leo; Woolfson, Derek N

2014-10-24

The design of protein sequences that fold into prescribed de novo structures is challenging. General solutions to this problem require geometric descriptions of protein folds and methods to fit sequences to these. The α-helical coiled coils present a promising class of protein for this and offer considerable scope for exploring hitherto unseen structures. For α-helical barrels, which have more than four helices and accessible central channels, many of the possible structures remain unobserved. Here, we combine geometrical considerations, knowledge-based scoring, and atomistic modeling to facilitate the design of new channel-containing α-helical barrels. X-ray crystal structures of the resulting designs match predicted in silico models. Furthermore, the observed channels are chemically defined and have diameters related to oligomer state, which present routes to design protein function. Copyright © 2014, American Association for the Advancement of Science.
Protein Folding Free Energy Landscape along the Committor - the Optimal Folding Coordinate.

PubMed

Krivov, Sergei V

2018-06-06

Recent advances in simulation and experiment have led to dramatic increases in the quantity and complexity of produced data, which makes the development of automated analysis tools very important. A powerful approach to analyze dynamics contained in such data sets is to describe/approximate it by diffusion on a free energy landscape - free energy as a function of reaction coordinates (RC). For the description to be quantitatively accurate, RCs should be chosen in an optimal way. Recent theoretical results show that such an optimal RC exists; however, determining it for practical systems is a very difficult unsolved problem. Here we describe a solution to this problem. We describe an adaptive nonparametric approach to accurately determine the optimal RC (the committor) for an equilibrium trajectory of a realistic system. In contrast to alternative approaches, which require a functional form with many parameters to approximate an RC and thus extensive expertise with the system, the suggested approach is nonparametric and can approximate any RC with high accuracy without system specific information. To avoid overfitting for a realistically sampled system, the approach performs RC optimization in an adaptive manner by focusing optimization on less optimized spatiotemporal regions of the RC. The power of the approach is illustrated on a long equilibrium atomistic folding simulation of HP35 protein. We have determined the optimal folding RC - the committor, which was confirmed by passing a stringent committor validation test. It allowed us to determine a first quantitatively accurate protein folding free energy landscape. We have confirmed the recent theoretical results that diffusion on such a free energy profile can be used to compute exactly the equilibrium flux, the mean first passage times, and the mean transition path times between any two points on the profile. We have shown that the mean squared displacement along the optimal RC grows linear with time as for simple diffusion. The free energy profile allowed us to obtain a direct rigorous estimate of the pre-exponential factor for the folding dynamics.
Time-resolved distance determination by tryptophan fluorescence quenching: probing intermediates in membrane protein folding.

PubMed

Kleinschmidt, J H; Tamm, L K

1999-04-20

The mechanism of insertion and folding of an integral membrane protein has been investigated with the beta-barrel forming outer membrane protein A (OmpA) of Escherichia coli. This work describes a new approach to this problem by combining structural information obtained from tryptophan fluorescence quenching at different depths in the lipid bilayer with the kinetics of the refolding process. Experiments carried out over a temperature range between 2 and 40 degrees C allowed us to detect, trap, and characterize previously unidentified folding intermediates on the pathway of OmpA insertion and folding into lipid bilayers. Three membrane-bound intermediates were found in which the average distances of the Trps were 14-16, 10-11, and 0-5 A, respectively, from the bilayer center. The first folding intermediate is stable at 2 degrees C for at least 1 h. A second intermediate has been isolated at temperatures between 7 and 20 degrees C. The Trps move 4-5 A closer to the center of the bilayer at this stage. Subsequently, in an intermediate that is observable at 26-28 degrees C, the Trps move another 5-10 A closer to the center of the bilayer. The final (native) structure is observed at higher temperatures of refolding. In this structure, the Trps are located on average about 9-10 A from the bilayer center. Monitoring the evolution of Trp fluorescence quenching by a set of brominated lipids during refolding at various temperatures therefore allowed us to identify and characterize intermediate states in the folding process of an integral membrane protein.
My 65 years in protein chemistry.

PubMed

Scheraga, Harold A

2015-05-01

This is a tour of a physical chemist through 65 years of protein chemistry from the time when emphasis was placed on the determination of the size and shape of the protein molecule as a colloidal particle, with an early breakthrough by James Sumner, followed by Linus Pauling and Fred Sanger, that a protein was a real molecule, albeit a macromolecule. It deals with the recognition of the nature and importance of hydrogen bonds and hydrophobic interactions in determining the structure, properties, and biological function of proteins until the present acquisition of an understanding of the structure, thermodynamics, and folding pathways from a linear array of amino acids to a biological entity. Along the way, with a combination of experiment and theoretical interpretation, a mechanism was elucidated for the thrombin-induced conversion of fibrinogen to a fibrin blood clot and for the oxidative-folding pathways of ribonuclease A. Before the atomic structure of a protein molecule was determined by x-ray diffraction or nuclear magnetic resonance spectroscopy, experimental studies of the fundamental interactions underlying protein structure led to several distance constraints which motivated the theoretical approach to determine protein structure, and culminated in the Empirical Conformational Energy Program for Peptides (ECEPP), an all-atom force field, with which the structures of fibrous collagen-like proteins and the 46-residue globular staphylococcal protein A were determined. To undertake the study of larger globular proteins, a physics-based coarse-grained UNited-RESidue (UNRES) force field was developed, and applied to the protein-folding problem in terms of structure, thermodynamics, dynamics, and folding pathways. Initially, single-chain and, ultimately, multiple-chain proteins were examined, and the methodology was extended to protein-protein interactions and to nucleic acids and to protein-nucleic acid interactions. The ultimate results led to an understanding of a variety of biological processes underlying natural and disease phenomena.
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.

PubMed

Wang, Sheng; Sun, Siqi; Li, Zhen; Zhang, Renyu; Xu, Jinbo

2017-01-01

Protein contacts contain key information for the understanding of protein structure and function and thus, contact prediction from sequence is an important problem. Recently exciting progress has been made on this problem, but the predicted contacts for proteins without many sequence homologs is still of low quality and not very useful for de novo structure prediction. This paper presents a new deep learning method that predicts contacts by integrating both evolutionary coupling (EC) and sequence conservation information through an ultra-deep neural network formed by two deep residual neural networks. The first residual network conducts a series of 1-dimensional convolutional transformation of sequential features; the second residual network conducts a series of 2-dimensional convolutional transformation of pairwise information including output of the first residual network, EC information and pairwise potential. By using very deep residual networks, we can accurately model contact occurrence patterns and complex sequence-structure relationship and thus, obtain higher-quality contact prediction regardless of how many sequence homologs are available for proteins in question. Our method greatly outperforms existing methods and leads to much more accurate contact-assisted folding. Tested on 105 CASP11 targets, 76 past CAMEO hard targets, and 398 membrane proteins, the average top L long-range prediction accuracy obtained by our method, one representative EC method CCMpred and the CASP11 winner MetaPSICOV is 0.47, 0.21 and 0.30, respectively; the average top L/10 long-range accuracy of our method, CCMpred and MetaPSICOV is 0.77, 0.47 and 0.59, respectively. Ab initio folding using our predicted contacts as restraints but without any force fields can yield correct folds (i.e., TMscore>0.6) for 203 of the 579 test proteins, while that using MetaPSICOV- and CCMpred-predicted contacts can do so for only 79 and 62 of them, respectively. Our contact-assisted models also have much better quality than template-based models especially for membrane proteins. The 3D models built from our contact prediction have TMscore>0.5 for 208 of the 398 membrane proteins, while those from homology modeling have TMscore>0.5 for only 10 of them. Further, even if trained mostly by soluble proteins, our deep learning method works very well on membrane proteins. In the recent blind CAMEO benchmark, our fully-automated web server implementing this method successfully folded 6 targets with a new fold and only 0.3L-2.3L effective sequence homologs, including one β protein of 182 residues, one α+β protein of 125 residues, one α protein of 140 residues, one α protein of 217 residues, one α/β of 260 residues and one α protein of 462 residues. Our method also achieved the highest F1 score on free-modeling targets in the latest CASP (Critical Assessment of Structure Prediction), although it was not fully implemented back then. http://raptorx.uchicago.edu/ContactMap/.
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model

PubMed Central

Li, Zhen; Zhang, Renyu

2017-01-01

Motivation Protein contacts contain key information for the understanding of protein structure and function and thus, contact prediction from sequence is an important problem. Recently exciting progress has been made on this problem, but the predicted contacts for proteins without many sequence homologs is still of low quality and not very useful for de novo structure prediction. Method This paper presents a new deep learning method that predicts contacts by integrating both evolutionary coupling (EC) and sequence conservation information through an ultra-deep neural network formed by two deep residual neural networks. The first residual network conducts a series of 1-dimensional convolutional transformation of sequential features; the second residual network conducts a series of 2-dimensional convolutional transformation of pairwise information including output of the first residual network, EC information and pairwise potential. By using very deep residual networks, we can accurately model contact occurrence patterns and complex sequence-structure relationship and thus, obtain higher-quality contact prediction regardless of how many sequence homologs are available for proteins in question. Results Our method greatly outperforms existing methods and leads to much more accurate contact-assisted folding. Tested on 105 CASP11 targets, 76 past CAMEO hard targets, and 398 membrane proteins, the average top L long-range prediction accuracy obtained by our method, one representative EC method CCMpred and the CASP11 winner MetaPSICOV is 0.47, 0.21 and 0.30, respectively; the average top L/10 long-range accuracy of our method, CCMpred and MetaPSICOV is 0.77, 0.47 and 0.59, respectively. Ab initio folding using our predicted contacts as restraints but without any force fields can yield correct folds (i.e., TMscore>0.6) for 203 of the 579 test proteins, while that using MetaPSICOV- and CCMpred-predicted contacts can do so for only 79 and 62 of them, respectively. Our contact-assisted models also have much better quality than template-based models especially for membrane proteins. The 3D models built from our contact prediction have TMscore>0.5 for 208 of the 398 membrane proteins, while those from homology modeling have TMscore>0.5 for only 10 of them. Further, even if trained mostly by soluble proteins, our deep learning method works very well on membrane proteins. In the recent blind CAMEO benchmark, our fully-automated web server implementing this method successfully folded 6 targets with a new fold and only 0.3L-2.3L effective sequence homologs, including one β protein of 182 residues, one α+β protein of 125 residues, one α protein of 140 residues, one α protein of 217 residues, one α/β of 260 residues and one α protein of 462 residues. Our method also achieved the highest F1 score on free-modeling targets in the latest CASP (Critical Assessment of Structure Prediction), although it was not fully implemented back then. Availability http://raptorx.uchicago.edu/ContactMap/ PMID:28056090
A simple and fast heuristic for protein structure comparison.

PubMed

Pelta, David A; González, Juan R; Moreno Vega, Marcos

2008-03-25

Protein structure comparison is a key problem in bioinformatics. There exist several methods for doing protein comparison, being the solution of the Maximum Contact Map Overlap problem (MAX-CMO) one of the alternatives available. Although this problem may be solved using exact algorithms, researchers require approximate algorithms that obtain good quality solutions using less computational resources than the formers. We propose a variable neighborhood search metaheuristic for solving MAX-CMO. We analyze this strategy in two aspects: 1) from an optimization point of view the strategy is tested on two different datasets, obtaining an error of 3.5%(over 2702 pairs) and 1.7% (over 161 pairs) with respect to optimal values; thus leading to high accurate solutions in a simpler and less expensive way than exact algorithms; 2) in terms of protein structure classification, we conduct experiments on three datasets and show that is feasible to detect structural similarities at SCOP's family and CATH's architecture levels using normalized overlap values. Some limitations and the role of normalization are outlined for doing classification at SCOP's fold level. We designed, implemented and tested.a new tool for solving MAX-CMO, based on a well-known metaheuristic technique. The good balance between solution's quality and computational effort makes it a valuable tool. Moreover, to the best of our knowledge, this is the first time the MAX-CMO measure is tested at SCOP's fold and CATH's architecture levels with encouraging results.
My 65 years in protein chemistry

PubMed Central

Scheraga, Harold A.

2015-01-01

This is a tour of a physical chemist through 65 years of protein chemistry from the time when emphasis was placed on the determination of the size and shape of the protein molecule as a colloidal particle, with an early breakthrough by James Sumner, followed by Linus Pauling and Fred Sanger, that a protein was a real molecule, albeit a macromolecule. It deals with the recognition of the nature and importance of hydrogen bonds and hydrophobic interactions in determining the structure, properties, and biological function of proteins until the present acquisition of an understanding of the structure, thermodynamics, and folding pathways from a linear array of amino acids to a biological entity. Along the way, with a combination of experiment and theoretical interpretation, a mechanism was elucidated for the thrombin-induced conversion of fibrinogen to a fibrin blood clot and for the oxidative-folding pathways of ribonuclease A. Before the atomic structure of a protein molecule was determined by x-ray diffraction or nuclear magnetic resonance spectroscopy, experimental studies of the fundamental interactions underlying protein structure led to several distance constraints which motivated the theoretical approach to determine protein structure, and culminated in the Empirical Conformational Energy Program for Peptides (ECEPP), an all-atom force field, with which the structures of fibrous collagen-like proteins and the 46-residue globular staphylococcal protein A were determined. To undertake the study of larger globular proteins, a physics-based coarse-grained UNited-RESidue (UNRES) force field was developed, and applied to the protein-folding problem in terms of structure, thermodynamics, dynamics, and folding pathways. Initially, single-chain and, ultimately, multiple-chain proteins were examined, and the methodology was extended to protein–protein interactions and to nucleic acids and to protein–nucleic acid interactions. The ultimate results led to an understanding of a variety of biological processes underlying natural and disease phenomena. PMID:25850343
SeqRate: sequence-based protein folding type classification and rates prediction

PubMed Central

2010-01-01

Background Protein folding rate is an important property of a protein. Predicting protein folding rate is useful for understanding protein folding process and guiding protein design. Most previous methods of predicting protein folding rate require the tertiary structure of a protein as an input. And most methods do not distinguish the different kinetic nature (two-state folding or multi-state folding) of the proteins. Here we developed a method, SeqRate, to predict both protein folding kinetic type (two-state versus multi-state) and real-value folding rate using sequence length, amino acid composition, contact order, contact number, and secondary structure information predicted from only protein sequence with support vector machines. Results We systematically studied the contributions of individual features to folding rate prediction. On a standard benchmark dataset, the accuracy of folding kinetic type classification is 80%. The Pearson correlation coefficient and the mean absolute difference between predicted and experimental folding rates (sec-1) in the base-10 logarithmic scale are 0.81 and 0.79 for two-state protein folders, and 0.80 and 0.68 for three-state protein folders. SeqRate is the first sequence-based method for protein folding type classification and its accuracy of fold rate prediction is improved over previous sequence-based methods. Its performance can be further enhanced with additional information, such as structure-based geometric contacts, as inputs. Conclusions Both the web server and software of predicting folding rate are publicly available at http://casp.rnet.missouri.edu/fold_rate/index.html. PMID:20438647
Competition between protein folding and aggregation: A three-dimensional lattice-model simulation

NASA Astrophysics Data System (ADS)

Bratko, D.; Blanch, H. W.

2001-01-01

Aggregation of protein molecules resulting in the loss of biological activity and the formation of insoluble deposits represents a serious problem for the biotechnology and pharmaceutical industries and in medicine. Considerable experimental and theoretical efforts are being made in order to improve our understanding of, and ability to control, the process. In the present work, we describe a Monte Carlo study of a multichain system of coarse-grained model proteins akin to lattice models developed for simulations of protein folding. The model is designed to examine the competition between intramolecular interactions leading to the native protein structure, and intermolecular association, resulting in the formation of aggregates of misfolded chains. Interactions between the segments are described by a variation of the Go potential [N. Go and H. Abe, Biopolymers 20, 1013 (1981)] that extends the recognition between attracting types of segments to pairs on distinct chains. For the particular model we adopt, the global free energy minimum of a pair of protein molecules corresponds to a dimer of native proteins. When three or more molecules interact, clusters of misfolded chains can be more stable than aggregates of native folds. A considerable fraction of native structure, however, is preserved in these cases. Rates of conformational changes rapidly decrease with the size of the protein cluster. Within the timescale accessible to computer simulations, the folding-aggregation balance is strongly affected by kinetic considerations. Both the native form and aggregates can persist in metastable states, even if conditions such as temperature or concentration favor a transition to an alternative form. Refolding yield can be affected by the presence of an additional polymer species mimicking the function of a molecular chaperone.
Folding superfunnel to describe cooperative folding of interacting proteins.

PubMed

Smeller, László

2016-07-01

This paper proposes a generalization of the well-known folding funnel concept of proteins. In the funnel model the polypeptide chain is treated as an individual object not interacting with other proteins. Since biological systems are considerably crowded, protein-protein interaction is a fundamental feature during the life cycle of proteins. The folding superfunnel proposed here describes the folding process of interacting proteins in various situations. The first example discussed is the folding of the freshly synthesized protein with the aid of chaperones. Another important aspect of protein-protein interactions is the folding of the recently characterized intrinsically disordered proteins, where binding to target proteins plays a crucial role in the completion of the folding process. The third scenario where the folding superfunnel is used is the formation of aggregates from destabilized proteins, which is an important factor in case of several conformational diseases. The folding superfunnel constructed here with the minimal assumption about the interaction potential explains all three cases mentioned above. Proteins 2016; 84:1009-1016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Impact of hydrodynamic interactions on protein folding rates depends on temperature

NASA Astrophysics Data System (ADS)

Zegarra, Fabio C.; Homouz, Dirar; Eliaz, Yossi; Gasic, Andrei G.; Cheung, Margaret S.

2018-03-01

We investigated the impact of hydrodynamic interactions (HI) on protein folding using a coarse-grained model. The extent of the impact of hydrodynamic interactions, whether it accelerates, retards, or has no effect on protein folding, has been controversial. Together with a theoretical framework of the energy landscape theory (ELT) for protein folding that describes the dynamics of the collective motion with a single reaction coordinate across a folding barrier, we compared the kinetic effects of HI on the folding rates of two protein models that use a chain of single beads with distinctive topologies: a 64-residue α /β chymotrypsin inhibitor 2 (CI2) protein, and a 57-residue β -barrel α -spectrin Src-homology 3 domain (SH3) protein. When comparing the protein folding kinetics simulated with Brownian dynamics in the presence of HI to that in the absence of HI, we find that the effect of HI on protein folding appears to have a "crossover" behavior about the folding temperature. This means that at a temperature greater than the folding temperature, the enhanced friction from the hydrodynamic solvents between the beads in an unfolded configuration results in lowered folding rate; conversely, at a temperature lower than the folding temperature, HI accelerates folding by the backflow of solvent toward the folded configuration of a protein. Additionally, the extent of acceleration depends on the topology of a protein: for a protein like CI2, where its folding nucleus is rather diffuse in a transition state, HI channels the formation of contacts by favoring a major folding pathway in a complex free energy landscape, thus accelerating folding. For a protein like SH3, where its folding nucleus is already specific and less diffuse, HI matters less at a temperature lower than the folding temperature. Our findings provide further theoretical insight to protein folding kinetic experiments and simulations.
Electrostatics, structure prediction, and the energy landscapes for protein folding and binding.

PubMed

Tsai, Min-Yeh; Zheng, Weihua; Balamurugan, D; Schafer, Nicholas P; Kim, Bobby L; Cheung, Margaret S; Wolynes, Peter G

2016-01-01

While being long in range and therefore weakly specific, electrostatic interactions are able to modulate the stability and folding landscapes of some proteins. The relevance of electrostatic forces for steering the docking of proteins to each other is widely acknowledged, however, the role of electrostatics in establishing specifically funneled landscapes and their relevance for protein structure prediction are still not clear. By introducing Debye-Hückel potentials that mimic long-range electrostatic forces into the Associative memory, Water mediated, Structure, and Energy Model (AWSEM), a transferable protein model capable of predicting tertiary structures, we assess the effects of electrostatics on the landscapes of thirteen monomeric proteins and four dimers. For the monomers, we find that adding electrostatic interactions does not improve structure prediction. Simulations of ribosomal protein S6 show, however, that folding stability depends monotonically on electrostatic strength. The trend in predicted melting temperatures of the S6 variants agrees with experimental observations. Electrostatic effects can play a range of roles in binding. The binding of the protein complex KIX-pKID is largely assisted by electrostatic interactions, which provide direct charge-charge stabilization of the native state and contribute to the funneling of the binding landscape. In contrast, for several other proteins, including the DNA-binding protein FIS, electrostatics causes frustration in the DNA-binding region, which favors its binding with DNA but not with its protein partner. This study highlights the importance of long-range electrostatics in functional responses to problems where proteins interact with their charged partners, such as DNA, RNA, as well as membranes. © 2015 The Protein Society.
The Proteome Folding Project: Proteome-scale prediction of structure and function

PubMed Central

Drew, Kevin; Winters, Patrick; Butterfoss, Glenn L.; Berstis, Viktors; Uplinger, Keith; Armstrong, Jonathan; Riffle, Michael; Schweighofer, Erik; Bovermann, Bill; Goodlett, David R.; Davis, Trisha N.; Shasha, Dennis; Malmström, Lars; Bonneau, Richard

2011-01-01

The incompleteness of proteome structure and function annotation is a critical problem for biologists and, in particular, severely limits interpretation of high-throughput and next-generation experiments. We have developed a proteome annotation pipeline based on structure prediction, where function and structure annotations are generated using an integration of sequence comparison, fold recognition, and grid-computing-enabled de novo structure prediction. We predict protein domain boundaries and three-dimensional (3D) structures for protein domains from 94 genomes (including human, Arabidopsis, rice, mouse, fly, yeast, Escherichia coli, and worm). De novo structure predictions were distributed on a grid of more than 1.5 million CPUs worldwide (World Community Grid). We generated significant numbers of new confident fold annotations (9% of domains that are otherwise unannotated in these genomes). We demonstrate that predicted structures can be combined with annotations from the Gene Ontology database to predict new and more specific molecular functions. PMID:21824995
Modularity of Protein Folds as a Tool for Template-Free Modeling of Structures.

PubMed

Vallat, Brinda; Madrid-Aliste, Carlos; Fiser, Andras

2015-08-01

Predicting the three-dimensional structure of proteins from their amino acid sequences remains a challenging problem in molecular biology. While the current structural coverage of proteins is almost exclusively provided by template-based techniques, the modeling of the rest of the protein sequences increasingly require template-free methods. However, template-free modeling methods are much less reliable and are usually applicable for smaller proteins, leaving much space for improvement. We present here a novel computational method that uses a library of supersecondary structure fragments, known as Smotifs, to model protein structures. The library of Smotifs has saturated over time, providing a theoretical foundation for efficient modeling. The method relies on weak sequence signals from remotely related protein structures to create a library of Smotif fragments specific to the target protein sequence. This Smotif library is exploited in a fragment assembly protocol to sample decoys, which are assessed by a composite scoring function. Since the Smotif fragments are larger in size compared to the ones used in other fragment-based methods, the proposed modeling algorithm, SmotifTF, can employ an exhaustive sampling during decoy assembly. SmotifTF successfully predicts the overall fold of the target proteins in about 50% of the test cases and performs competitively when compared to other state of the art prediction methods, especially when sequence signal to remote homologs is diminishing. Smotif-based modeling is complementary to current prediction methods and provides a promising direction in addressing the structure prediction problem, especially when targeting larger proteins for modeling.
Modeling Loop Entropy

PubMed Central

Chirikjian, Gregory S.

2011-01-01

Proteins fold from a highly disordered state into a highly ordered one. Traditionally, the folding problem has been stated as one of predicting ‘the’ tertiary structure from sequential information. However, new evidence suggests that the ensemble of unfolded forms may not be as disordered as once believed, and that the native form of many proteins may not be described by a single conformation, but rather an ensemble of its own. Quantifying the relative disorder in the folded and unfolded ensembles as an entropy difference may therefore shed light on the folding process. One issue that clouds discussions of ‘entropy’ is that many different kinds of entropy can be defined: entropy associated with overall translational and rotational Brownian motion, configurational entropy, vibrational entropy, conformational entropy computed in internal or Cartesian coordinates (which can even be different from each other), conformational entropy computed on a lattice; each of the above with different solvation and solvent models; thermodynamic entropy measured experimentally, etc. The focus of this work is the conformational entropy of coil/loop regions in proteins. New mathematical modeling tools for the approximation of changes in conformational entropy during transition from unfolded to folded ensembles are introduced. In particular, models for computing lower and upper bounds on entropy for polymer models of polypeptide coils both with and without end constraints are presented. The methods reviewed here include kinematics (the mathematics of rigid-body motions), classical statistical mechanics and information theory. PMID:21187223
``Sequence space soup'' of proteins and copolymers

NASA Astrophysics Data System (ADS)

Chan, Hue Sun; Dill, Ken A.

1991-09-01

To study the protein folding problem, we use exhaustive computer enumeration to explore ``sequence space soup,'' an imaginary solution containing the ``native'' conformations (i.e., of lowest free energy) under folding conditions, of every possible copolymer sequence. The model is of short self-avoiding chains of hydrophobic (H) and polar (P) monomers configured on the two-dimensional square lattice. By exhaustive enumeration, we identify all native structures for every possible sequence. We find that random sequences of H/P copolymers will bear striking resemblance to known proteins: Most sequences under folding conditions will be approximately as compact as known proteins, will have considerable amounts of secondary structure, and it is most probable that an arbitrary sequence will fold to a number of lowest free energy conformations that is of order one. In these respects, this simple model shows that proteinlike behavior should arise simply in copolymers in which one monomer type is highly solvent averse. It suggests that the structures and uniquenesses of native proteins are not consequences of having 20 different monomer types, or of unique properties of amino acid monomers with regard to special packing or interactions, and thus that simple copolymers might be designable to collapse to proteinlike structures and properties. A good strategy for designing a sequence to have a minimum possible number of native states is to strategically insert many P monomers. Thus known proteins may be marginally stable due to a balance: More H residues stabilize the desired native state, but more P residues prevent simultaneous stabilization of undesired native states.

Characterization of protein folding by a Φ-value calculation with a statistical-mechanical model.

PubMed

Wako, Hiroshi; Abe, Haruo

2016-01-01

The Φ-value analysis approach provides information about transition-state structures along the folding pathway of a protein by measuring the effects of an amino acid mutation on folding kinetics. Here we compared the theoretically calculated Φ values of 27 proteins with their experimentally observed Φ values; the theoretical values were calculated using a simple statistical-mechanical model of protein folding. The theoretically calculated Φ values reflected the corresponding experimentally observed Φ values with reasonable accuracy for many of the proteins, but not for all. The correlation between the theoretically calculated and experimentally observed Φ values strongly depends on whether the protein-folding mechanism assumed in the model holds true in real proteins. In other words, the correlation coefficient can be expected to illuminate the folding mechanisms of proteins, providing the answer to the question of which model more accurately describes protein folding: the framework model or the nucleation-condensation model. In addition, we tried to characterize protein folding with respect to various properties of each protein apart from the size and fold class, such as the free-energy profile, contact-order profile, and sensitivity to the parameters used in the Φ-value calculation. The results showed that any one of these properties alone was not enough to explain protein folding, although each one played a significant role in it. We have confirmed the importance of characterizing protein folding from various perspectives. Our findings have also highlighted that protein folding is highly variable and unique across different proteins, and this should be considered while pursuing a unified theory of protein folding.
Characterization of protein folding by a Φ-value calculation with a statistical-mechanical model

PubMed Central

Wako, Hiroshi; Abe, Haruo

2016-01-01

The Φ-value analysis approach provides information about transition-state structures along the folding pathway of a protein by measuring the effects of an amino acid mutation on folding kinetics. Here we compared the theoretically calculated Φ values of 27 proteins with their experimentally observed Φ values; the theoretical values were calculated using a simple statistical-mechanical model of protein folding. The theoretically calculated Φ values reflected the corresponding experimentally observed Φ values with reasonable accuracy for many of the proteins, but not for all. The correlation between the theoretically calculated and experimentally observed Φ values strongly depends on whether the protein-folding mechanism assumed in the model holds true in real proteins. In other words, the correlation coefficient can be expected to illuminate the folding mechanisms of proteins, providing the answer to the question of which model more accurately describes protein folding: the framework model or the nucleation-condensation model. In addition, we tried to characterize protein folding with respect to various properties of each protein apart from the size and fold class, such as the free-energy profile, contact-order profile, and sensitivity to the parameters used in the Φ-value calculation. The results showed that any one of these properties alone was not enough to explain protein folding, although each one played a significant role in it. We have confirmed the importance of characterizing protein folding from various perspectives. Our findings have also highlighted that protein folding is highly variable and unique across different proteins, and this should be considered while pursuing a unified theory of protein folding. PMID:28409079
Analysis of Protein Thermostability Enhancing Factors in Industrially Important Thermus Bacteria Species

PubMed Central

Kumwenda, Benjamin; Litthauer, Derek; Bishop, Özlem Tastan; Reva, Oleg

2013-01-01

Elucidation of evolutionary factors that enhance protein thermostability is a critical problem and was the focus of this work on Thermus species. Pairs of orthologous sequences of T. scotoductus SA-01 and T. thermophilus HB27, with the largest negative minimum folding energy (MFE) as predicted by the UNAFold algorithm, were statistically analyzed. Favored substitutions of amino acids residues and their properties were determined. Substitutions were analyzed in modeled protein structures to determine their locations and contribution to energy differences using PyMOL and FoldX programs respectively. Dominant trends in amino acid substitutions consistent with differences in thermostability between orthologous sequences were observed. T. thermophilus thermophilic proteins showed an increase in non-polar, tiny, and charged amino acids. An abundance of alanine substituted by serine and threonine, as well as arginine substituted by glutamine and lysine was observed in T. thermophilus HB27. Structural comparison showed that stabilizing mutations occurred on surfaces and loops in protein structures. PMID:24023508
When fast is better: protein folding fundamentals and mechanisms from ultrafast approaches

PubMed Central

Muñoz, Victor; Cerminara, Michele

2016-01-01

Protein folding research stalled for decades because conventional experiments indicated that proteins fold slowly and in single strokes, whereas theory predicted a complex interplay between dynamics and energetics resulting in myriad microscopic pathways. Ultrafast kinetic methods turned the field upside down by providing the means to probe fundamental aspects of folding, test theoretical predictions and benchmark simulations. Accordingly, experimentalists could measure the timescales for all relevant folding motions, determine the folding speed limit and confirm that folding barriers are entropic bottlenecks. Moreover, a catalogue of proteins that fold extremely fast (microseconds) could be identified. Such fast-folding proteins cross shallow free energy barriers or fold downhill, and thus unfold with minimal co-operativity (gradually). A new generation of thermodynamic methods has exploited this property to map folding landscapes, interaction networks and mechanisms at nearly atomic resolution. In parallel, modern molecular dynamics simulations have finally reached the timescales required to watch fast-folding proteins fold and unfold in silico. All of these findings have buttressed the fundamentals of protein folding predicted by theory, and are now offering the first glimpses at the underlying mechanisms. Fast folding appears to also have functional implications as recent results connect downhill folding with intrinsically disordered proteins, their complex binding modes and ability to moonlight. These connections suggest that the coupling between downhill (un)folding and binding enables such protein domains to operate analogically as conformational rheostats. PMID:27574021
Predicting helix orientation for coiled-coil dimers

PubMed Central

Apgar, James R.; Gutwin, Karl N.; Keating, Amy E.

2008-01-01

The alpha-helical coiled coil is a structurally simple protein oligomerization or interaction motif consisting of two or more alpha helices twisted into a supercoiled bundle. Coiled coils can differ in their stoichiometry, helix orientation and axial alignment. Because of the near degeneracy of many of these variants, coiled coils pose a challenge to fold recognition methods for structure prediction. Whereas distinctions between some protein folds can be discriminated on the basis of hydrophobic/polar patterning or secondary structure propensities, the sequence differences that encode important details of coiled-coil structure can be subtle. This is emblematic of a larger problem in the field of protein structure and interaction prediction: that of establishing specificity between closely similar structures. We tested the behavior of different computational models on the problem of recognizing the correct orientation - parallel vs. antiparallel - of pairs of alpha helices that can form a dimeric coiled coil. For each of 131 examples of known structure, we constructed a large number of both parallel and antiparallel structural models and used these to asses the ability of five energy functions to recognize the correct fold. We also developed and tested three sequenced-based approaches that make use of varying degrees of implicit structural information. The best structural methods performed similarly to the best sequence methods, correctly categorizing ∼81% of dimers. Steric compatibility with the fold was important for some coiled coils we investigated. For many examples, the correct orientation was determined by smaller energy differences between parallel and antiparallel structures distributed over many residues and energy components. Prediction methods that used structure but incorporated varying approximations and assumptions showed quite different behaviors when used to investigate energetic contributions to orientation preference. Sequence based methods were sensitive to the choice of residue-pair interactions scored. PMID:18506779
Energy landscape of knotted protein folding

PubMed Central

Sułkowska, Joanna I.; Noel, Jeffrey K.; Onuchic, Jose N.

2012-01-01

Recent experiments have conclusively shown that proteins are able to fold from an unknotted, denatured polypeptide to the knotted, native state without the aid of chaperones. These experiments are consistent with a growing body of theoretical work showing that a funneled, minimally frustrated energy landscape is sufficient to fold small proteins with complex topologies. Here, we present a theoretical investigation of the folding of a knotted protein, 2ouf, engineered in the laboratory by a domain fusion that mimics an evolutionary pathway for knotted proteins. Unlike a previously studied knotted protein of similar length, we see reversible folding/knotting and a surprising lack of deep topological traps with a coarse-grained structure-based model. Our main interest is to investigate how evolution might further select the geometry and stiffness of the threading region of the newly fused protein. We compare the folding of the wild-type protein to several mutants. Similarly to the wild-type protein, all mutants show robust and reversible folding, and knotting coincides with the transition state ensemble. As observed experimentally, our simulations show that the knotted protein folds about ten times slower than an unknotted construct with an identical contact map. Simulated folding kinetics reflect the experimentally observed rollover in the folding limbs of chevron plots. Successful folding of the knotted protein is restricted to a narrow range of temperature as compared to the unknotted protein and fits of the kinetic folding data below folding temperature suggest slow, nondiffusive dynamics for the knotted protein. PMID:22891304
Protein purification and crystallization artifacts: The tale usually not told.

PubMed

Niedzialkowska, Ewa; Gasiorowska, Olga; Handing, Katarzyna B; Majorek, Karolina A; Porebski, Przemyslaw J; Shabalin, Ivan G; Zasadzinska, Ewelina; Cymborowski, Marcin; Minor, Wladek

2016-03-01

The misidentification of a protein sample, or contamination of a sample with the wrong protein, may be a potential reason for the non-reproducibility of experiments. This problem may occur in the process of heterologous overexpression and purification of recombinant proteins, as well as purification of proteins from natural sources. If the contaminated or misidentified sample is used for crystallization, in many cases the problem may not be detected until structures are determined. In the case of functional studies, the problem may not be detected for years. Here several procedures that can be successfully used for the identification of crystallized protein contaminants, including: (i) a lattice parameter search against known structures, (ii) sequence or fold identification from partially built models, and (iii) molecular replacement with common contaminants as search templates have been presented. A list of common contaminant structures to be used as alternative search models was provided. These methods were used to identify four cases of purification and crystallization artifacts. This report provides troubleshooting pointers for researchers facing difficulties in phasing or model building. © 2016 The Protein Society.
[Pichia pastoris as an expression system for recombinant protein production].

PubMed

Ciarkowska, Anna; Jakubowska, Anna

2013-01-01

Pichia pastoris has become increasingly popular as a host for recombinant protein production in recent years. P. pastoris is more cost effective and allows achieving higher expression levels than insect and mammalian cells. It also offers some significant advantages over E. coli expression systems, such as avoiding problems with proper protein folding. Also, P. pastoris as an eukaryotic organism can carry out posttranslational modifications of produced proteins. Additionally, P. pastoris can produce high levels of recombinant proteins in extracellular medium which simplifies protein purification. Having many advantages over other expression systems makes P. pastoris an organism of choice for industrial protein production.
A simple and fast heuristic for protein structure comparison

PubMed Central

Pelta, David A; González, Juan R; Moreno Vega, Marcos

2008-01-01

Background Protein structure comparison is a key problem in bioinformatics. There exist several methods for doing protein comparison, being the solution of the Maximum Contact Map Overlap problem (MAX-CMO) one of the alternatives available. Although this problem may be solved using exact algorithms, researchers require approximate algorithms that obtain good quality solutions using less computational resources than the formers. Results We propose a variable neighborhood search metaheuristic for solving MAX-CMO. We analyze this strategy in two aspects: 1) from an optimization point of view the strategy is tested on two different datasets, obtaining an error of 3.5%(over 2702 pairs) and 1.7% (over 161 pairs) with respect to optimal values; thus leading to high accurate solutions in a simpler and less expensive way than exact algorithms; 2) in terms of protein structure classification, we conduct experiments on three datasets and show that is feasible to detect structural similarities at SCOP's family and CATH's architecture levels using normalized overlap values. Some limitations and the role of normalization are outlined for doing classification at SCOP's fold level. Conclusion We designed, implemented and tested.a new tool for solving MAX-CMO, based on a well-known metaheuristic technique. The good balance between solution's quality and computational effort makes it a valuable tool. Moreover, to the best of our knowledge, this is the first time the MAX-CMO measure is tested at SCOP's fold and CATH's architecture levels with encouraging results. Software is available for download at . PMID:18366735
Atomic-level description of ubiquitin folding

PubMed Central

Piana, Stefano; Lindorff-Larsen, Kresten; Shaw, David E.

2013-01-01

Equilibrium molecular dynamics simulations, in which proteins spontaneously and repeatedly fold and unfold, have recently been used to help elucidate the mechanistic principles that underlie the folding of fast-folding proteins. The extent to which the conclusions drawn from the analysis of such proteins, which fold on the microsecond timescale, apply to the millisecond or slower folding of naturally occurring proteins is, however, unclear. As a first attempt to address this outstanding issue, we examine here the folding of ubiquitin, a 76-residue-long protein found in all eukaryotes that is known experimentally to fold on a millisecond timescale. Ubiquitin folding has been the subject of many experimental studies, but its slow folding rate has made it difficult to observe and characterize the folding process through all-atom molecular dynamics simulations. Here we determine the mechanism, thermodynamics, and kinetics of ubiquitin folding through equilibrium atomistic simulations. The picture emerging from the simulations is in agreement with a view of ubiquitin folding suggested from previous experiments. Our findings related to the folding of ubiquitin are also consistent, for the most part, with the folding principles derived from the simulation of fast-folding proteins, suggesting that these principles may be applicable to a wider range of proteins. PMID:23503848
When fast is better: protein folding fundamentals and mechanisms from ultrafast approaches.

PubMed

Muñoz, Victor; Cerminara, Michele

2016-09-01

Protein folding research stalled for decades because conventional experiments indicated that proteins fold slowly and in single strokes, whereas theory predicted a complex interplay between dynamics and energetics resulting in myriad microscopic pathways. Ultrafast kinetic methods turned the field upside down by providing the means to probe fundamental aspects of folding, test theoretical predictions and benchmark simulations. Accordingly, experimentalists could measure the timescales for all relevant folding motions, determine the folding speed limit and confirm that folding barriers are entropic bottlenecks. Moreover, a catalogue of proteins that fold extremely fast (microseconds) could be identified. Such fast-folding proteins cross shallow free energy barriers or fold downhill, and thus unfold with minimal co-operativity (gradually). A new generation of thermodynamic methods has exploited this property to map folding landscapes, interaction networks and mechanisms at nearly atomic resolution. In parallel, modern molecular dynamics simulations have finally reached the timescales required to watch fast-folding proteins fold and unfold in silico All of these findings have buttressed the fundamentals of protein folding predicted by theory, and are now offering the first glimpses at the underlying mechanisms. Fast folding appears to also have functional implications as recent results connect downhill folding with intrinsically disordered proteins, their complex binding modes and ability to moonlight. These connections suggest that the coupling between downhill (un)folding and binding enables such protein domains to operate analogically as conformational rheostats. © 2016 The Author(s).
On the Origin of Protein Superfamilies and Superfolds

NASA Astrophysics Data System (ADS)

Magner, Abram; Szpankowski, Wojciech; Kihara, Daisuke

2015-02-01

Distributions of protein families and folds in genomes are highly skewed, having a small number of prevalent superfamiles/superfolds and a large number of families/folds of a small size. Why are the distributions of protein families and folds skewed? Why are there only a limited number of protein families? Here, we employ an information theoretic approach to investigate the protein sequence-structure relationship that leads to the skewed distributions. We consider that protein sequences and folds constitute an information theoretic channel and computed the most efficient distribution of sequences that code all protein folds. The identified distributions of sequences and folds are found to follow a power law, consistent with those observed for proteins in nature. Importantly, the skewed distributions of sequences and folds are suggested to have different origins: the skewed distribution of sequences is due to evolutionary pressure to achieve efficient coding of necessary folds, whereas that of folds is based on the thermodynamic stability of folds. The current study provides a new information theoretic framework for proteins that could be widely applied for understanding protein sequences, structures, functions, and interactions.
Replica exchange molecular dynamics simulation of structure variation from α/4β-fold to 3α-fold protein.

PubMed

Lazim, Raudah; Mei, Ye; Zhang, Dawei

2012-03-01

Replica exchange molecular dynamics (REMD) simulation provides an efficient conformational sampling tool for the study of protein folding. In this study, we explore the mechanism directing the structure variation from α/4β-fold protein to 3α-fold protein after mutation by conducting REMD simulation on 42 replicas with temperatures ranging from 270 K to 710 K. The simulation began from a protein possessing the primary structure of GA88 but the tertiary structure of GB88, two G proteins with "high sequence identity." Albeit the large Cα-root mean square deviation (RMSD) of the folded protein (4.34 Å at 270 K and 4.75 Å at 304 K), a variation in tertiary structure was observed. Together with the analysis of secondary structure assignment, cluster analysis and principal component, it provides insights to the folding and unfolding pathway of 3α-fold protein and α/4β-fold protein respectively paving the way toward the understanding of the ongoings during conformational variation.
Folding of a single domain protein entering the endoplasmic reticulum precedes disulfide formation.

PubMed

Robinson, Philip J; Pringle, Marie Anne; Woolhead, Cheryl A; Bulleid, Neil J

2017-04-28

The relationship between protein synthesis, folding, and disulfide formation within the endoplasmic reticulum (ER) is poorly understood. Previous studies have suggested that pre-existing disulfide links are absolutely required to allow protein folding and, conversely, that protein folding occurs prior to disulfide formation. To address the question of what happens first within the ER, that is, protein folding or disulfide formation, we studied folding events at the early stages of polypeptide chain translocation into the mammalian ER using stalled translation intermediates. Our results demonstrate that polypeptide folding can occur without complete domain translocation. Protein disulfide isomerase (PDI) interacts with these early intermediates, but disulfide formation does not occur unless the entire sequence of the protein domain is translocated. This is the first evidence that folding of the polypeptide chain precedes disulfide formation within a cellular context and highlights key differences between protein folding in the ER and refolding of purified proteins. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Asymmetric scoring functions for proteins

NASA Astrophysics Data System (ADS)

Lezon, Timothy; Holter, Neal; Maritan, Amos; Banavar, Jayanth

2003-03-01

The protein folding problem entails the prediction of the native state structure of a protein given the sequence of amino acids. In a coarse-grained description of a protein, an important ingredient for attempting this task is the determination of the effective energies of interaction between amino acids. We will discuss a simple approach for determining such interaction potentials from a training set of protein sequences and their experimentally determined native state structures. The key new ingredient in our study is the incorporation of the lack of symmetry in the effective interactions between amino acids. Our results, obtained using a set of 513 proteins, and their implications will be discussed.
Rationally designed, heterologous S. cerevisiae transcripts expose novel expression determinants

PubMed Central

Ben-Yehezkel, Tuval; Atar, Shimshi; Zur, Hadas; Diament, Alon; Goz, Eli; Marx, Tzipy; Cohen, Rafael; Dana, Alexandra; Feldman, Anna; Shapiro, Ehud; Tuller, Tamir

2015-01-01

Deducing generic causal relations between RNA transcript features and protein expression profiles from endogenous gene expression data remains a major unsolved problem in biology. The analysis of gene expression from heterologous genes contributes significantly to solving this problem, but has been heavily biased toward the study of the effect of 5′ transcript regions and to prokaryotes. Here, we employ a synthetic biology driven approach that systematically differentiates the effect of different regions of the transcript on gene expression up to 240 nucleotides into the ORF. This enabled us to discover new causal effects between features in previously unexplored regions of transcripts, and gene expression in natural regimes. We rationally designed, constructed, and analyzed 383 gene variants of the viral HRSVgp04 gene ORF, with multiple synonymous mutations at key positions along the transcript in the eukaryote S. cerevisiae. Our results show that a few silent mutations at the 5′UTR can have a dramatic effect of up to 15 fold change on protein levels, and that even synonymous mutations in positions more than 120 nucleotides downstream from the ORF 5′end can modulate protein levels up to 160%–300%. We demonstrate that the correlation between protein levels and folding energy increases with the significance of the level of selection of the latter in endogenous genes, reinforcing the notion that selection for folding strength in different parts of the ORF is related to translation regulation. Our measured protein abundance correlates notably(correlation up to r = 0.62 (p=0.0013)) with mean relative codon decoding times, based on ribosomal densities (Ribo-Seq) in endogenous genes, supporting the conjecture that translation elongation and adaptation to the tRNA pool can modify protein levels in a causal/direct manner. This report provides an improved understanding of transcript evolution, design principles of gene expression regulation, and suggests simple rules for engineering synthetic gene expression in eukaryotes. PMID:26176266
Rationally designed, heterologous S. cerevisiae transcripts expose novel expression determinants.

PubMed

Ben-Yehezkel, Tuval; Atar, Shimshi; Zur, Hadas; Diament, Alon; Goz, Eli; Marx, Tzipy; Cohen, Rafael; Dana, Alexandra; Feldman, Anna; Shapiro, Ehud; Tuller, Tamir

2015-01-01

Deducing generic causal relations between RNA transcript features and protein expression profiles from endogenous gene expression data remains a major unsolved problem in biology. The analysis of gene expression from heterologous genes contributes significantly to solving this problem, but has been heavily biased toward the study of the effect of 5' transcript regions and to prokaryotes. Here, we employ a synthetic biology driven approach that systematically differentiates the effect of different regions of the transcript on gene expression up to 240 nucleotides into the ORF. This enabled us to discover new causal effects between features in previously unexplored regions of transcripts, and gene expression in natural regimes. We rationally designed, constructed, and analyzed 383 gene variants of the viral HRSVgp04 gene ORF, with multiple synonymous mutations at key positions along the transcript in the eukaryote S. cerevisiae. Our results show that a few silent mutations at the 5'UTR can have a dramatic effect of up to 15 fold change on protein levels, and that even synonymous mutations in positions more than 120 nucleotides downstream from the ORF 5'end can modulate protein levels up to 160%-300%. We demonstrate that the correlation between protein levels and folding energy increases with the significance of the level of selection of the latter in endogenous genes, reinforcing the notion that selection for folding strength in different parts of the ORF is related to translation regulation. Our measured protein abundance correlates notably(correlation up to r = 0.62 (p=0.0013)) with mean relative codon decoding times, based on ribosomal densities (Ribo-Seq) in endogenous genes, supporting the conjecture that translation elongation and adaptation to the tRNA pool can modify protein levels in a causal/direct manner. This report provides an improved understanding of transcript evolution, design principles of gene expression regulation, and suggests simple rules for engineering synthetic gene expression in eukaryotes.
Unique Features of Halophilic Proteins.

PubMed

Arakawa, Tsutomu; Yamaguchi, Rui; Tokunaga, Hiroko; Tokunaga, Masao

2017-01-01

Proteins from moderate and extreme halophiles have unique characteristics. They are highly acidic and hydrophilic, similar to intrinsically disordered proteins. These characteristics make the halophilic proteins soluble in water and fold reversibly. In addition to reversible folding, the rate of refolding of halophilic proteins from denatured structure is generally slow, often taking several days, for example, for extremely halophilic proteins. This slow folding rate makes the halophilic proteins a novel model system for folding mechanism analysis. High solubility and reversible folding also make the halophilic proteins excellent fusion partners for soluble expression of recombinant proteins.
A simple quantitative model of macromolecular crowding effects on protein folding: Application to the murine prion protein(121-231)

NASA Astrophysics Data System (ADS)

Bergasa-Caceres, Fernando; Rabitz, Herschel A.

2013-06-01

A model of protein folding kinetics is applied to study the effects of macromolecular crowding on protein folding rate and stability. Macromolecular crowding is found to promote a decrease of the entropic cost of folding of proteins that produces an increase of both the stability and the folding rate. The acceleration of the folding rate due to macromolecular crowding is shown to be a topology-dependent effect. The model is applied to the folding dynamics of the murine prion protein (121-231). The differential effect of macromolecular crowding as a function of protein topology suffices to make non-native configurations relatively more accessible.
Accurate secondary structure prediction and fold recognition for circular dichroism spectroscopy

PubMed Central

Micsonai, András; Wien, Frank; Kernya, Linda; Lee, Young-Ho; Goto, Yuji; Réfrégiers, Matthieu; Kardos, József

2015-01-01

Circular dichroism (CD) spectroscopy is a widely used technique for the study of protein structure. Numerous algorithms have been developed for the estimation of the secondary structure composition from the CD spectra. These methods often fail to provide acceptable results on α/β-mixed or β-structure–rich proteins. The problem arises from the spectral diversity of β-structures, which has hitherto been considered as an intrinsic limitation of the technique. The predictions are less reliable for proteins of unusual β-structures such as membrane proteins, protein aggregates, and amyloid fibrils. Here, we show that the parallel/antiparallel orientation and the twisting of the β-sheets account for the observed spectral diversity. We have developed a method called β-structure selection (BeStSel) for the secondary structure estimation that takes into account the twist of β-structures. This method can reliably distinguish parallel and antiparallel β-sheets and accurately estimates the secondary structure for a broad range of proteins. Moreover, the secondary structure components applied by the method are characteristic to the protein fold, and thus the fold can be predicted to the level of topology in the CATH classification from a single CD spectrum. By constructing a web server, we offer a general tool for a quick and reliable structure analysis using conventional CD or synchrotron radiation CD (SRCD) spectroscopy for the protein science research community. The method is especially useful when X-ray or NMR techniques fail. Using BeStSel on data collected by SRCD spectroscopy, we investigated the structure of amyloid fibrils of various disease-related proteins and peptides. PMID:26038575

Interstitial protein alterations in rabbit vocal fold with scar.

PubMed

Thibeault, Susan L; Bless, Diane M; Gray, Steven D

2003-09-01

Fibrous and interstitial proteins compose the extracellular matrix of the vocal fold lamina propria and account for its biomechanic properties. Vocal fold scarring is characterized by altered biomechanical properties, which create dysphonia. Although alterations of the fibrous proteins have been confirmed in the rabbit vocal fold scar, interstitial proteins, which are known to be important in wound repair, have not been investigated to date. Using a rabbit model, interstitial proteins decorin, fibromodulin, and fibronectin were examined immunohistologically, two months postinduction of vocal fold scar by means of forcep biopsy. Significantly decreased decorin and fibromodulin with significantly increased fibronectin characterized scarred vocal fold tissue. The implications of altered interstitial proteins levels and their affect on the fibrous proteins will be discussed in relation to increased vocal fold stiffness and viscosity, which characterizes vocal fold scar.
Protein folding by NMR.

PubMed

Zhuravleva, Anastasia; Korzhnev, Dmitry M

2017-05-01

Protein folding is a highly complex process proceeding through a number of disordered and partially folded nonnative states with various degrees of structural organization. These transiently and sparsely populated species on the protein folding energy landscape play crucial roles in driving folding toward the native conformation, yet some of these nonnative states may also serve as precursors for protein misfolding and aggregation associated with a range of devastating diseases, including neuro-degeneration, diabetes and cancer. Therefore, in vivo protein folding is often reshaped co- and post-translationally through interactions with the ribosome, molecular chaperones and/or other cellular components. Owing to developments in instrumentation and methodology, solution NMR spectroscopy has emerged as the central experimental approach for the detailed characterization of the complex protein folding processes in vitro and in vivo. NMR relaxation dispersion and saturation transfer methods provide the means for a detailed characterization of protein folding kinetics and thermodynamics under native-like conditions, as well as modeling high-resolution structures of weakly populated short-lived conformational states on the protein folding energy landscape. Continuing development of isotope labeling strategies and NMR methods to probe high molecular weight protein assemblies, along with advances of in-cell NMR, have recently allowed protein folding to be studied in the context of ribosome-nascent chain complexes and molecular chaperones, and even inside living cells. Here we review solution NMR approaches to investigate the protein folding energy landscape, and discuss selected applications of NMR methodology to studying protein folding in vitro and in vivo. Together, these examples highlight a vast potential of solution NMR in providing atomistic insights into molecular mechanisms of protein folding and homeostasis in health and disease. Copyright © 2016 Elsevier B.V. All rights reserved.
How the folding rates of two- and multistate proteins depend on the amino acid properties.

PubMed

Huang, Jitao T; Huang, Wei; Huang, Shanran R; Li, Xin

2014-10-01

Proteins fold by either two-state or multistate kinetic mechanism. We observe that amino acids play different roles in different mechanism. Many residues that are easy to form regular secondary structures (α helices, β sheets and turns) can promote the two-state folding reactions of small proteins. Most of hydrophilic residues can speed up the multistate folding reactions of large proteins. Folding rates of large proteins are equally responsive to the flexibility of partial amino acids. Other properties of amino acids (including volume, polarity, accessible surface, exposure degree, isoelectric point, and phase transfer energy) have contributed little to folding kinetics of the proteins. Cysteine is a special residue, it triggers two-state folding reaction and but inhibits multistate folding reaction. These findings not only provide a new insight into protein structure prediction, but also could be used to direct the point mutations that can change folding rate. © 2014 Wiley Periodicals, Inc.
Course 12: Proteins: Structural, Thermodynamic and Kinetic Aspects

NASA Astrophysics Data System (ADS)

Finkelstein, A. V.

1 Introduction 2 Overview of protein architectures and discussion of physical background of their natural selection 2.1 Protein structures 2.2 Physical selection of protein structures 3 Thermodynamic aspects of protein folding 3.1 Reversible denaturation of protein structures 3.2 What do denatured proteins look like? 3.3 Why denaturation of a globular protein is the first-order phase transition 3.4 "Gap" in energy spectrum: The main characteristic that distinguishes protein chains from random polymers 4 Kinetic aspects of protein folding 4.1 Protein folding in vivo 4.2 Protein folding in vitro (in the test-tube) 4.3 Theory of protein folding rates and solution of the Levinthal paradox
Congenital hypothyroidism mutations affect common folding and trafficking in the α/β-hydrolase fold proteins

PubMed Central

De Jaco, Antonella; Dubi, Noga; Camp, Shelley; Taylor, Palmer

2017-01-01

The α/β-hydrolase fold superfamily of proteins is composed of structurally related members that, despite great diversity in their catalytic, recognition, adhesion and chaperone functions, share a common fold governed by homologous residues and conserved disulfide bridges. Non-synonymous single nucleotide polymorphisms within the α/β-hydrolase fold domain in various family members have been found for congenital endocrine, metabolic and nervous system disorders. By examining the amino acid sequence from the various proteins, mutations were found to be prevalent in conserved residues within the α/β-hydrolase fold of the homologous proteins. This is the case for the thyroglobulin mutations linked to congenital hypothyroidism. To address whether correct folding of the common domain is required for protein export, we inserted the thyroglobulin mutations at homologous positions in two correlated but simpler α/β-hydrolase fold proteins known to be exported to the cell surface: neuroligin3 and acetylcholinesterase. Here we show that these mutations in the cholinesterase homologous region alter the folding properties of the α/β-hydrolase fold domain, which are reflected in defects in protein trafficking, folding and function, and ultimately result in retention of the partially processed proteins in the endoplasmic reticulum. Accordingly, mutations at conserved residues may be transferred amongst homologous proteins to produce common processing defects despite disparate functions, protein complexity and tissue-specific expression of the homologous proteins. More importantly, a similar assembly of the α/β-hydrolase fold domain tertiary structure among homologous members of the superfamily is required for correct trafficking of the proteins to their final destination. PMID:23035660
Extant fold-switching proteins are widespread.

PubMed

Porter, Lauren L; Looger, Loren L

2018-06-05

A central tenet of biology is that globular proteins have a unique 3D structure under physiological conditions. Recent work has challenged this notion by demonstrating that some proteins switch folds, a process that involves remodeling of secondary structure in response to a few mutations (evolved fold switchers) or cellular stimuli (extant fold switchers). To date, extant fold switchers have been viewed as rare byproducts of evolution, but their frequency has been neither quantified nor estimated. By systematically and exhaustively searching the Protein Data Bank (PDB), we found ∼100 extant fold-switching proteins. Furthermore, we gathered multiple lines of evidence suggesting that these proteins are widespread in nature. Based on these lines of evidence, we hypothesized that the frequency of extant fold-switching proteins may be underrepresented by the structures in the PDB. Thus, we sought to identify other putative extant fold switchers with only one solved conformation. To do this, we identified two characteristic features of our ∼100 extant fold-switching proteins, incorrect secondary structure predictions and likely independent folding cooperativity, and searched the PDB for other proteins with similar features. Reassuringly, this method identified dozens of other proteins in the literature with indication of a structural change but only one solved conformation in the PDB. Thus, we used it to estimate that 0.5-4% of PDB proteins switch folds. These results demonstrate that extant fold-switching proteins are likely more common than the PDB reflects, which has implications for cell biology, genomics, and human health. Copyright © 2018 the Author(s). Published by PNAS.
Flexibility damps macromolecular crowding effects on protein folding dynamics: Application to the murine prion protein (121-231)

NASA Astrophysics Data System (ADS)

Bergasa-Caceres, Fernando; Rabitz, Herschel A.

2014-01-01

A model of protein folding kinetics is applied to study the combined effects of protein flexibility and macromolecular crowding on protein folding rate and stability. It is found that the increase in stability and folding rate promoted by macromolecular crowding is damped for proteins with highly flexible native structures. The model is applied to the folding dynamics of the murine prion protein (121-231). It is found that the high flexibility of the native isoform of the murine prion protein (121-231) reduces the effects of macromolecular crowding on its folding dynamics. The relevance of these findings for the pathogenic mechanism are discussed.
Protein Folding Using a Vortex Fluidic Device.

PubMed

Britton, Joshua; Smith, Joshua N; Raston, Colin L; Weiss, Gregory A

2017-01-01

Essentially all biochemistry and most molecular biology experiments require recombinant proteins. However, large, hydrophobic proteins typically aggregate into insoluble and misfolded species, and are directed into inclusion bodies. Current techniques to fold proteins recovered from inclusion bodies rely on denaturation followed by dialysis or rapid dilution. Such approaches can be time consuming, wasteful, and inefficient. Here, we describe rapid protein folding using a vortex fluidic device (VFD). This process uses mechanical energy introduced into thin films to rapidly and efficiently fold proteins. With the VFD in continuous flow mode, large volumes of protein solution can be processed per day with 100-fold reductions in both folding times and buffer volumes.
Deterministic folding: The role of entropic forces and steric specificities

NASA Astrophysics Data System (ADS)

da Silva, Roosevelt A.; da Silva, M. A. A.; Caliri, A.

2001-03-01

The inverse folding problem of proteinlike macromolecules is studied by using a lattice Monte Carlo (MC) model in which steric specificities (nearest-neighbors constraints) are included and the hydrophobic effect is treated explicitly by considering interactions between the chain and solvent molecules. Chemical attributes and steric peculiarities of the residues are encoded in a 10-letter alphabet and a correspondent "syntax" is provided in order to write suitable sequences for the specified target structures; twenty-four target configurations, chosen in order to cover all possible values of the average contact order χ (0.2381⩽χ⩽0.4947 for this system), were encoded and analyzed. The results, obtained by MC simulations, are strongly influenced by geometrical properties of the native configuration, namely χ and the relative number φ of crankshafts-type structures: For χ<0.35 the folding is deterministic, that is, the syntax is able to encode successful sequences: The system presents larger encodability, minimum sequence-target degeneracies and smaller characteristic folding time τf. For χ⩾0.35 the above results are not reproduced any more: The folding success is severely reduced, showing strong correlation with φ. Additionally, the existence of distinct characteristic folding times suggests that different mechanisms are acting at the same time in the folding process. The results (all obtained from the same single model, under the same "physiological conditions") resemble some general features of the folding problem, supporting the premise that the steric specificities, in association with the entropic forces (hydrophobic effect), are basic ingredients in the protein folding process.
Concerted dihedral rotations give rise to internal friction in unfolded proteins.

PubMed

Echeverria, Ignacia; Makarov, Dmitrii E; Papoian, Garegin A

2014-06-18

Protein chains undergo conformational diffusion during folding and dynamics, experiencing both thermal kicks and viscous drag. Recent experiments have shown that the corresponding friction can be separated into wet friction, which is determined by the solvent viscosity, and dry friction, where frictional effects arise due to the interactions within the protein chain. Despite important advances, the molecular origins underlying dry friction in proteins have remained unclear. To address this problem, we studied the dynamics of the unfolded cold-shock protein at different solvent viscosities and denaturant concentrations. Using extensive all-atom molecular dynamics simulations we estimated the internal friction time scales and found them to agree well with the corresponding experimental measurements (Soranno et al. Proc. Natl. Acad. Sci. U.S.A. 2012, 109, 17800-17806). Analysis of the reconfiguration dynamics of the unfolded chain further revealed that hops in the dihedral space provide the dominant mechanism of internal friction. Furthermore, the increased number of concerted dihedral moves at physiological conditions suggest that, in such conditions, the concerted motions result in higher frictional forces. These findings have important implications for understanding the folding kinetics of proteins as well as the dynamics of intrinsically disordered proteins.
Characterization of the Protein Unfolding Processes Induced by Urea and Temperature

PubMed Central

Rocco, Alessandro Guerini; Mollica, Luca; Ricchiuto, Piero; Baptista, António M.; Gianazza, Elisabetta; Eberini, Ivano

2008-01-01

Correct folding is critical for the biological activities of proteins. As a contribution to a better understanding of the protein (un)folding problem, we studied the effect of temperature and of urea on peptostreptococcal Protein L destructuration. We performed standard molecular dynamics simulations at 300 K, 350 K, 400 K, and 480 K, both in 10 M urea and in water. Protein L followed at least two alternative unfolding pathways. Urea caused the loss of secondary structure acting preferentially on the β-sheets, while leaving the α-helices almost intact; on the contrary, high temperature preserved the β-sheets and led to a complete loss of the α-helices. These data suggest that urea and high temperature act through different unfolding mechanisms, and protein secondary motives reveal a differential sensitivity to various denaturant treatments. As further validation of our results, replica-exchange molecular dynamics simulations of the temperature-induced unfolding process in the presence of urea were performed. This set of simulations allowed us to compute the thermodynamical parameters of the process and confirmed that, in the configurational space of Protein L unfolding, both of the above pathways are accessible, although to a different relative extent. PMID:18065481
TopologyNet: Topology based deep convolutional and multi-task neural networks for biomolecular property predictions

PubMed Central

2017-01-01

Although deep learning approaches have had tremendous success in image, video and audio processing, computer vision, and speech recognition, their applications to three-dimensional (3D) biomolecular structural data sets have been hindered by the geometric and biological complexity. To address this problem we introduce the element-specific persistent homology (ESPH) method. ESPH represents 3D complex geometry by one-dimensional (1D) topological invariants and retains important biological information via a multichannel image-like representation. This representation reveals hidden structure-function relationships in biomolecules. We further integrate ESPH and deep convolutional neural networks to construct a multichannel topological neural network (TopologyNet) for the predictions of protein-ligand binding affinities and protein stability changes upon mutation. To overcome the deep learning limitations from small and noisy training sets, we propose a multi-task multichannel topological convolutional neural network (MM-TCNN). We demonstrate that TopologyNet outperforms the latest methods in the prediction of protein-ligand binding affinities, mutation induced globular protein folding free energy changes, and mutation induced membrane protein folding free energy changes. Availability: weilab.math.msu.edu/TDL/ PMID:28749969
Progress towards mapping the universe of protein folds

PubMed Central

Grant, Alastair; Lee, David; Orengo, Christine

2004-01-01

Although the precise aims differ between the various international structural genomics initiatives currently aiming to illuminate the universe of protein folds, many selectively target protein families for which the fold is unknown. How well can the current set of known protein families and folds be used to estimate the total number of folds in nature, and will structural genomics initiatives yield representatives for all the major protein families within a reasonable time scale? PMID:15128436
An ensemble approach to protein fold classification by integration of template-based assignment and support vector machine classifier.

PubMed

Xia, Jiaqi; Peng, Zhenling; Qi, Dawei; Mu, Hongbo; Yang, Jianyi

2017-03-15

Protein fold classification is a critical step in protein structure prediction. There are two possible ways to classify protein folds. One is through template-based fold assignment and the other is ab-initio prediction using machine learning algorithms. Combination of both solutions to improve the prediction accuracy was never explored before. We developed two algorithms, HH-fold and SVM-fold for protein fold classification. HH-fold is a template-based fold assignment algorithm using the HHsearch program. SVM-fold is a support vector machine-based ab-initio classification algorithm, in which a comprehensive set of features are extracted from three complementary sequence profiles. These two algorithms are then combined, resulting to the ensemble approach TA-fold. We performed a comprehensive assessment for the proposed methods by comparing with ab-initio methods and template-based threading methods on six benchmark datasets. An accuracy of 0.799 was achieved by TA-fold on the DD dataset that consists of proteins from 27 folds. This represents improvement of 5.4-11.7% over ab-initio methods. After updating this dataset to include more proteins in the same folds, the accuracy increased to 0.971. In addition, TA-fold achieved >0.9 accuracy on a large dataset consisting of 6451 proteins from 184 folds. Experiments on the LE dataset show that TA-fold consistently outperforms other threading methods at the family, superfamily and fold levels. The success of TA-fold is attributed to the combination of template-based fold assignment and ab-initio classification using features from complementary sequence profiles that contain rich evolution information. http://yanglab.nankai.edu.cn/TA-fold/. yangjy@nankai.edu.cn or mhb-506@163.com. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Dependence of Internal Friction on Folding Mechanism

PubMed Central

2016-01-01

An outstanding challenge in protein folding is understanding the origin of “internal friction” in folding dynamics, experimentally identified from the dependence of folding rates on solvent viscosity. A possible origin suggested by simulation is the crossing of local torsion barriers. However, it was unclear why internal friction varied from protein to protein or for different folding barriers of the same protein. Using all-atom simulations with variable solvent viscosity, in conjunction with transition-path sampling to obtain reaction rates and analysis via Markov state models, we are able to determine the internal friction in the folding of several peptides and miniproteins. In agreement with experiment, we find that the folding events with greatest internal friction are those that mainly involve helix formation, while hairpin formation exhibits little or no evidence of friction. Via a careful analysis of folding transition paths, we show that internal friction arises when torsion angle changes are an important part of the folding mechanism near the folding free energy barrier. These results suggest an explanation for the variation of internal friction effects from protein to protein and across the energy landscape of the same protein. PMID:25721133
Dependence of internal friction on folding mechanism.

PubMed

Zheng, Wenwei; De Sancho, David; Hoppe, Travis; Best, Robert B

2015-03-11

An outstanding challenge in protein folding is understanding the origin of "internal friction" in folding dynamics, experimentally identified from the dependence of folding rates on solvent viscosity. A possible origin suggested by simulation is the crossing of local torsion barriers. However, it was unclear why internal friction varied from protein to protein or for different folding barriers of the same protein. Using all-atom simulations with variable solvent viscosity, in conjunction with transition-path sampling to obtain reaction rates and analysis via Markov state models, we are able to determine the internal friction in the folding of several peptides and miniproteins. In agreement with experiment, we find that the folding events with greatest internal friction are those that mainly involve helix formation, while hairpin formation exhibits little or no evidence of friction. Via a careful analysis of folding transition paths, we show that internal friction arises when torsion angle changes are an important part of the folding mechanism near the folding free energy barrier. These results suggest an explanation for the variation of internal friction effects from protein to protein and across the energy landscape of the same protein.
Fast protein folding kinetics

PubMed Central

Gelman, Hannah; Gruebele, Martin

2014-01-01

Fast folding proteins have been a major focus of computational and experimental study because they are accessible to both techniques: they are small and fast enough to be reasonably simulated with current computational power, but have dynamics slow enough to be observed with specially developed experimental techniques. This coupled study of fast folding proteins has provided insight into the mechanisms which allow some proteins to find their native conformation well less than 1 ms and has uncovered examples of theoretically predicted phenomena such as downhill folding. The study of fast folders also informs our understanding of even “slow” folding processes: fast folders are small, relatively simple protein domains and the principles that govern their folding also govern the folding of more complex systems. This review summarizes the major theoretical and experimental techniques used to study fast folding proteins and provides an overview of the major findings of fast folding research. Finally, we examine the themes that have emerged from studying fast folders and briefly summarize their application to protein folding in general as well as some work that is left to do. PMID:24641816
Directed evolution methods for improving polypeptide folding and solubility and superfolder fluorescent proteins generated thereby

DOEpatents

Waldo, Geoffrey S.

2007-09-18

The current invention provides methods of improving folding of polypeptides using a poorly folding domain as a component of a fusion protein comprising the poorly folding domain and a polypeptide of interest to be improved. The invention also provides novel green fluorescent proteins (GFPs) and red fluorescent proteins that have enhanced folding properties.
Accurate prediction of cellular co-translational folding indicates proteins can switch from post- to co-translational folding

PubMed Central

Nissley, Daniel A.; Sharma, Ajeet K.; Ahmed, Nabeel; Friedrich, Ulrike A.; Kramer, Günter; Bukau, Bernd; O'Brien, Edward P.

2016-01-01

The rates at which domains fold and codons are translated are important factors in determining whether a nascent protein will co-translationally fold and function or misfold and malfunction. Here we develop a chemical kinetic model that calculates a protein domain's co-translational folding curve during synthesis using only the domain's bulk folding and unfolding rates and codon translation rates. We show that this model accurately predicts the course of co-translational folding measured in vivo for four different protein molecules. We then make predictions for a number of different proteins in yeast and find that synonymous codon substitutions, which change translation-elongation rates, can switch some protein domains from folding post-translationally to folding co-translationally—a result consistent with previous experimental studies. Our approach explains essential features of co-translational folding curves and predicts how varying the translation rate at different codon positions along a transcript's coding sequence affects this self-assembly process. PMID:26887592
Right- and left-handed three-helix proteins. I. Experimental and simulation analysis of differences in folding and structure.

PubMed

Glyakina, Anna V; Pereyaslavets, Leonid B; Galzitskaya, Oxana V

2013-09-01

Despite the large number of publications on three-helix protein folding, there is no study devoted to the influence of handedness on the rate of three-helix protein folding. From the experimental studies, we make a conclusion that the left-handed three-helix proteins fold faster than the right-handed ones. What may explain this difference? An important question arising in this paper is whether the modeling of protein folding can catch the difference between the protein folding rates of proteins with similar structures but with different folding mechanisms. To answer this question, the folding of eight three-helix proteins (four right-handed and four left-handed), which are similar in size, was modeled using the Monte Carlo and dynamic programming methods. The studies allowed us to determine the orders of folding of the secondary-structure elements in these domains and amino acid residues which are important for the folding. The obtained data are in good correlation with each other and with the experimental data. Structural analysis of these proteins demonstrated that the left-handed domains have a lesser number of contacts per residue and a smaller radius of cross section than the right-handed domains. This may be one of the explanations of the observed fact. The same tendency is observed for the large dataset consisting of 332 three-helix proteins (238 right- and 94 left-handed). From our analysis, we found that the left-handed three-helix proteins have some less-dense packing that should result in faster folding for some proteins as compared to the case of right-handed proteins. Copyright © 2013 Wiley Periodicals, Inc.

Accelerated molecular dynamics simulations of protein folding.

PubMed

Miao, Yinglong; Feixas, Ferran; Eun, Changsun; McCammon, J Andrew

2015-07-30

Folding of four fast-folding proteins, including chignolin, Trp-cage, villin headpiece and WW domain, was simulated via accelerated molecular dynamics (aMD). In comparison with hundred-of-microsecond timescale conventional molecular dynamics (cMD) simulations performed on the Anton supercomputer, aMD captured complete folding of the four proteins in significantly shorter simulation time. The folded protein conformations were found within 0.2-2.1 Å of the native NMR or X-ray crystal structures. Free energy profiles calculated through improved reweighting of the aMD simulations using cumulant expansion to the second-order are in good agreement with those obtained from cMD simulations. This allows us to identify distinct conformational states (e.g., unfolded and intermediate) other than the native structure and the protein folding energy barriers. Detailed analysis of protein secondary structures and local key residue interactions provided important insights into the protein folding pathways. Furthermore, the selections of force fields and aMD simulation parameters are discussed in detail. Our work shows usefulness and accuracy of aMD in studying protein folding, providing basic references in using aMD in future protein-folding studies. © 2015 Wiley Periodicals, Inc.
Probing sequence dependence of folding pathway of α-helix bundle proteins through free energy landscape analysis.

PubMed

Shao, Qiang

2014-06-05

A comparative study on the folding of multiple three-α-helix bundle proteins including α3D, α3W, and the B domain of protein A (BdpA) is presented. The use of integrated-tempering-sampling molecular dynamics simulations achieves reversible folding and unfolding events in individual short trajectories, which thus provides an efficient approach to sufficiently sample the configuration space of protein and delineate the folding pathway of α-helix bundle. The detailed free energy landscape analyses indicate that the folding mechanism of α-helix bundle is not uniform but sequence dependent. A simple model is then proposed to predict folding mechanism of α-helix bundle on the basis of amino acid composition: α-helical proteins containing higher percentage of hydrophobic residues than charged ones fold via nucleation-condensation mechanism (e.g., α3D and BdpA) whereas proteins having opposite tendency in amino acid composition more likely fold via the framework mechanism (e.g., α3W). The model is tested on various α-helix bundle proteins, and the predicted mechanism is similar to the most approved one for each protein. In addition, the common features in the folding pathway of α-helix bundle protein are also deduced. In summary, the present study provides comprehensive, atomic-level picture of the folding of α-helix bundle proteins.
Direct Detection of Biotinylated Proteins by Mass Spectrometry

PubMed Central

2015-01-01

Mass spectrometric strategies to identify protein subpopulations involved in specific biological functions rely on covalently tagging biotin to proteins using various chemical modification methods. The biotin tag is primarily used for enrichment of the targeted subpopulation for subsequent mass spectrometry (MS) analysis. A limitation of these strategies is that MS analysis does not easily discriminate unlabeled contaminants from the labeled protein subpopulation under study. To solve this problem, we developed a flexible method that only relies on direct MS detection of biotin-tagged proteins called “Direct Detection of Biotin-containing Tags” (DiDBiT). Compared with conventional targeted proteomic strategies, DiDBiT improves direct detection of biotinylated proteins ∼200 fold. We show that DiDBiT is applicable to several protein labeling protocols in cell culture and in vivo using cell permeable NHS-biotin and incorporation of the noncanonical amino acid, azidohomoalanine (AHA), into newly synthesized proteins, followed by click chemistry tagging with biotin. We demonstrate that DiDBiT improves the direct detection of biotin-tagged newly synthesized peptides more than 20-fold compared to conventional methods. With the increased sensitivity afforded by DiDBiT, we demonstrate the MS detection of newly synthesized proteins labeled in vivo in the rodent nervous system with unprecedented temporal resolution as short as 3 h. PMID:25117199
Inclusion bodies and purification of proteins in biologically active forms.

PubMed

Mukhopadhyay, A

1997-01-01

Even though recombinant DNA technology has made possible the production of valuable therapeutic proteins, its accumulation in the host cell as inclusion body poses serious problems in the recovery of functionally active proteins. In the last twenty years, alternative techniques have been evolved to purify biologically active proteins from inclusion bodies. Most of these remain only as inventions and very few are commercially exploited. This review summarizes the developments in isolation, refolding and purification of proteins from inclusion bodies that could be used for vaccine and non-vaccine applications. The second section involves a discussion on inclusion bodies, how they are formed, and their physicochemical properties. In vivo protein folding in Escherichia coli and kinetics of in vitro protein folding are the subjects of the third and fourth sections respectively. The next section covers the recovery of bioactive protein from inclusion bodies: it includes isolation of inclusion body from host cell debris, purification in denatured state alternate refolding techniques, and final purification of active molecules. Since purity and safety are two important issues in therapeutic grade proteins, the following three sections are devoted to immunological and biological characterization of biomolecules, nature, and type of impurities normally encountered, and their detection. Lastly, two case studies are discussed to demonstrate the sequence of process steps involved.
General mechanism of two-state protein folding kinetics.

PubMed

Rollins, Geoffrey C; Dill, Ken A

2014-08-13

We describe here a general model of the kinetic mechanism of protein folding. In the Foldon Funnel Model, proteins fold in units of secondary structures, which form sequentially along the folding pathway, stabilized by tertiary interactions. The model predicts that the free energy landscape has a volcano shape, rather than a simple funnel, that folding is two-state (single-exponential) when secondary structures are intrinsically unstable, and that each structure along the folding path is a transition state for the previous structure. It shows how sequential pathways are consistent with multiple stochastic routes on funnel landscapes, and it gives good agreement with the 9 order of magnitude dependence of folding rates on protein size for a set of 93 proteins, at the same time it is consistent with the near independence of folding equilibrium constant on size. This model gives estimates of folding rates of proteomes, leading to a median folding time in Escherichia coli of about 5 s.
Gaussian Accelerated Molecular Dynamics in NAMD

PubMed Central

2016-01-01

Gaussian accelerated molecular dynamics (GaMD) is a recently developed enhanced sampling technique that provides efficient free energy calculations of biomolecules. Like the previous accelerated molecular dynamics (aMD), GaMD allows for “unconstrained” enhanced sampling without the need to set predefined collective variables and so is useful for studying complex biomolecular conformational changes such as protein folding and ligand binding. Furthermore, because the boost potential is constructed using a harmonic function that follows Gaussian distribution in GaMD, cumulant expansion to the second order can be applied to recover the original free energy profiles of proteins and other large biomolecules, which solves a long-standing energetic reweighting problem of the previous aMD method. Taken together, GaMD offers major advantages for both unconstrained enhanced sampling and free energy calculations of large biomolecules. Here, we have implemented GaMD in the NAMD package on top of the existing aMD feature and validated it on three model systems: alanine dipeptide, the chignolin fast-folding protein, and the M3 muscarinic G protein-coupled receptor (GPCR). For alanine dipeptide, while conventional molecular dynamics (cMD) simulations performed for 30 ns are poorly converged, GaMD simulations of the same length yield free energy profiles that agree quantitatively with those of 1000 ns cMD simulation. Further GaMD simulations have captured folding of the chignolin and binding of the acetylcholine (ACh) endogenous agonist to the M3 muscarinic receptor. The reweighted free energy profiles are used to characterize the protein folding and ligand binding pathways quantitatively. GaMD implemented in the scalable NAMD is widely applicable to enhanced sampling and free energy calculations of large biomolecules. PMID:28034310
Gaussian Accelerated Molecular Dynamics in NAMD.

PubMed

Pang, Yui Tik; Miao, Yinglong; Wang, Yi; McCammon, J Andrew

2017-01-10

Gaussian accelerated molecular dynamics (GaMD) is a recently developed enhanced sampling technique that provides efficient free energy calculations of biomolecules. Like the previous accelerated molecular dynamics (aMD), GaMD allows for "unconstrained" enhanced sampling without the need to set predefined collective variables and so is useful for studying complex biomolecular conformational changes such as protein folding and ligand binding. Furthermore, because the boost potential is constructed using a harmonic function that follows Gaussian distribution in GaMD, cumulant expansion to the second order can be applied to recover the original free energy profiles of proteins and other large biomolecules, which solves a long-standing energetic reweighting problem of the previous aMD method. Taken together, GaMD offers major advantages for both unconstrained enhanced sampling and free energy calculations of large biomolecules. Here, we have implemented GaMD in the NAMD package on top of the existing aMD feature and validated it on three model systems: alanine dipeptide, the chignolin fast-folding protein, and the M 3 muscarinic G protein-coupled receptor (GPCR). For alanine dipeptide, while conventional molecular dynamics (cMD) simulations performed for 30 ns are poorly converged, GaMD simulations of the same length yield free energy profiles that agree quantitatively with those of 1000 ns cMD simulation. Further GaMD simulations have captured folding of the chignolin and binding of the acetylcholine (ACh) endogenous agonist to the M 3 muscarinic receptor. The reweighted free energy profiles are used to characterize the protein folding and ligand binding pathways quantitatively. GaMD implemented in the scalable NAMD is widely applicable to enhanced sampling and free energy calculations of large biomolecules.
Display of disulfide-rich proteins by complementary DNA display and disulfide shuffling assisted by protein disulfide isomerase.

PubMed

Naimuddin, Mohammed; Kubo, Tai

2011-12-01

We report an efficient system to produce and display properly folded disulfide-rich proteins facilitated by coupled complementary DNA (cDNA) display and protein disulfide isomerase-assisted folding. The results show that a neurotoxin protein containing four disulfide linkages can be displayed in the folded state. Furthermore, it can be refolded on a solid support that binds efficiently to its natural acetylcholine receptor. Probing the efficiency of the display proteins prepared by these methods provided up to 8-fold higher enrichment by the selective enrichment method compared with cDNA display alone, more than 10-fold higher binding to its receptor by the binding assays, and more than 10-fold higher affinities by affinity measurements. Cotranslational folding was found to have better efficiency than posttranslational refolding between the two investigated methods. We discuss the utilities of efficient display of such proteins in the preparation of superior quality proteins and protein libraries for directed evolution leading to ligand discovery. Copyright © 2011 Elsevier Inc. All rights reserved.
Small protein domains fold inside the ribosome exit tunnel.

PubMed

Marino, Jacopo; von Heijne, Gunnar; Beckmann, Roland

2016-03-01

Cotranslational folding of small protein domains within the ribosome exit tunnel may be an important cellular strategy to avoid protein misfolding. However, the pathway of cotranslational folding has so far been described only for a few proteins, and therefore, it is unclear whether folding in the ribosome exit tunnel is a common feature for small protein domains. Here, we have analyzed nine small protein domains and determined at which point during translation their folding generates sufficient force on the nascent chain to release translational arrest by the SecM arrest peptide, both in vitro and in live E. coli cells. We find that all nine protein domains initiate folding while still located well within the ribosome exit tunnel. © 2016 Federation of European Biochemical Societies.
Composition-related structural transition of random peptides: insight into the boundary between intrinsically disordered proteins and folded proteins.

PubMed

Kang, Wen-Bin; He, Chuan; Liu, Zhen-Xing; Wang, Jun; Wang, Wei

2018-05-16

Previous studies based on bioinformatics showed that there is a sharp distinction of structural features and residue composition between the intrinsically disordered proteins and the folded proteins. What induces such a composition-related structural transition? How do various kinds of interactions work in such processes? In this work, we investigate these problems based on a survey on peptides randomly composed of charged residues (including glutamic acids and lysines) and the residues with different hydrophobicity, such as alanines, glycines, or phenylalanines. Based on simulations using all-atom model and replica-exchange Monte Carlo method, a coil-globule transition is observed for each peptide. The corresponding transition temperature is found to be dependent on the contents of the hydrophobic and charged residues. For several cases, when the mean hydrophobicity is larger than a certain threshold, the transition temperature is higher than the room temperature, and vise versa. These thresholds of hydrophobicity and net charge are quantitatively consistent with the border line observed from the study of bioinformatics. These results outline the basic physical reasons for the compositional distinction between the intrinsically disordered proteins and the folded proteins. Furthermore, the contributions of various interactions to the structural variation of peptides are analyzed based on the contact statistics and the charge-pattern dependence of the gyration radii of the peptides. Our observations imply that the hydrophobicity contributes essentially to such composition-related transitions. Thus, we achieve a better understanding on composition-structure relation of the natural proteins and the underlying physics.
Improvement on a simplified model for protein folding simulation.

PubMed

Zhang, Ming; Chen, Changjun; He, Yi; Xiao, Yi

2005-11-01

Improvements were made on a simplified protein model--the Ramachandran model-to achieve better computer simulation of protein folding. To check the validity of such improvements, we chose the ultrafast folding protein Engrailed Homeodomain as an example and explored several aspects of its folding. The engrailed homeodomain is a mainly alpha-helical protein of 61 residues from Drosophila melanogaster. We found that the simplified model of Engrailed Homeodomain can fold into a global minimum state with a tertiary structure in good agreement with its native structure.
Amyloid Polymorphism in the Protein Folding and Aggregation Energy Landscape.

PubMed

Adamcik, Jozef; Mezzenga, Raffaele

2018-02-15

Protein folding involves a large number of steps and conformations in which the folding protein samples different thermodynamic states characterized by local minima. Kinetically trapped on- or off-pathway intermediates are metastable folding intermediates towards the lowest absolute energy minima, which have been postulated to be the natively folded state where intramolecular interactions dominate, and the amyloid state where intermolecular interactions dominate. However, this view largely neglects the rich polymorphism found within amyloid species. We review the protein folding energy landscape in view of recent findings identifying specific transition routes among different amyloid polymorphs. Observed transitions such as twisted ribbon→crystal or helical ribbon→nanotube, and forbidden transitions such helical ribbon↛crystal, are discussed and positioned within the protein folding and aggregation energy landscape. Finally, amyloid crystals are identified as the ground state of the protein folding and aggregation energy landscape. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins.

PubMed

Raimondi, Daniele; Orlando, Gabriele; Pancsa, Rita; Khan, Taushif; Vranken, Wim F

2017-08-18

Protein folding is a complex process that can lead to disease when it fails. Especially poorly understood are the very early stages of protein folding, which are likely defined by intrinsic local interactions between amino acids close to each other in the protein sequence. We here present EFoldMine, a method that predicts, from the primary amino acid sequence of a protein, which amino acids are likely involved in early folding events. The method is based on early folding data from hydrogen deuterium exchange (HDX) data from NMR pulsed labelling experiments, and uses backbone and sidechain dynamics as well as secondary structure propensities as features. The EFoldMine predictions give insights into the folding process, as illustrated by a qualitative comparison with independent experimental observations. Furthermore, on a quantitative proteome scale, the predicted early folding residues tend to become the residues that interact the most in the folded structure, and they are often residues that display evolutionary covariation. The connection of the EFoldMine predictions with both folding pathway data and the folded protein structure suggests that the initial statistical behavior of the protein chain with respect to local structure formation has a lasting effect on its subsequent states.
Guiding the folding pathway of DNA origami

NASA Astrophysics Data System (ADS)

Dunn, Katherine E.; Dannenberg, Frits; Ouldridge, Thomas E.; Kwiatkowska, Marta; Turberfield, Andrew J.; Bath, Jonathan

2015-09-01

DNA origami is a robust assembly technique that folds a single-stranded DNA template into a target structure by annealing it with hundreds of short `staple' strands. Its guiding design principle is that the target structure is the single most stable configuration. The folding transition is cooperative and, as in the case of proteins, is governed by information encoded in the polymer sequence. A typical origami folds primarily into the desired shape, but misfolded structures can kinetically trap the system and reduce the yield. Although adjusting assembly conditions or following empirical design rules can improve yield, well-folded origami often need to be separated from misfolded structures. The problem could in principle be avoided if assembly pathway and kinetics were fully understood and then rationally optimized. To this end, here we present a DNA origami system with the unusual property of being able to form a small set of distinguishable and well-folded shapes that represent discrete and approximately degenerate energy minima in a vast folding landscape, thus allowing us to probe the assembly process. The obtained high yield of well-folded origami structures confirms the existence of efficient folding pathways, while the shape distribution provides information about individual trajectories through the folding landscape. We find that, similarly to protein folding, the assembly of DNA origami is highly cooperative; that reversible bond formation is important in recovering from transient misfoldings; and that the early formation of long-range connections can very effectively enforce particular folds. We use these insights to inform the design of the system so as to steer assembly towards desired structures. Expanding the rational design process to include the assembly pathway should thus enable more reproducible synthesis, particularly when targeting more complex structures. We anticipate that this expansion will be essential if DNA origami is to continue its rapid development and become a reliable manufacturing technology.
Guiding the folding pathway of DNA origami.

PubMed

Dunn, Katherine E; Dannenberg, Frits; Ouldridge, Thomas E; Kwiatkowska, Marta; Turberfield, Andrew J; Bath, Jonathan

2015-09-03

DNA origami is a robust assembly technique that folds a single-stranded DNA template into a target structure by annealing it with hundreds of short 'staple' strands. Its guiding design principle is that the target structure is the single most stable configuration. The folding transition is cooperative and, as in the case of proteins, is governed by information encoded in the polymer sequence. A typical origami folds primarily into the desired shape, but misfolded structures can kinetically trap the system and reduce the yield. Although adjusting assembly conditions or following empirical design rules can improve yield, well-folded origami often need to be separated from misfolded structures. The problem could in principle be avoided if assembly pathway and kinetics were fully understood and then rationally optimized. To this end, here we present a DNA origami system with the unusual property of being able to form a small set of distinguishable and well-folded shapes that represent discrete and approximately degenerate energy minima in a vast folding landscape, thus allowing us to probe the assembly process. The obtained high yield of well-folded origami structures confirms the existence of efficient folding pathways, while the shape distribution provides information about individual trajectories through the folding landscape. We find that, similarly to protein folding, the assembly of DNA origami is highly cooperative; that reversible bond formation is important in recovering from transient misfoldings; and that the early formation of long-range connections can very effectively enforce particular folds. We use these insights to inform the design of the system so as to steer assembly towards desired structures. Expanding the rational design process to include the assembly pathway should thus enable more reproducible synthesis, particularly when targeting more complex structures. We anticipate that this expansion will be essential if DNA origami is to continue its rapid development and become a reliable manufacturing technology.
How a Spatial Arrangement of Secondary Structure Elements Is Dispersed in the Universe of Protein Folds

PubMed Central

Minami, Shintaro; Sawada, Kengo; Chikenji, George

2014-01-01

It has been known that topologically different proteins of the same class sometimes share the same spatial arrangement of secondary structure elements (SSEs). However, the frequency by which topologically different structures share the same spatial arrangement of SSEs is unclear. It is important to estimate this frequency because it provides both a deeper understanding of the geometry of protein folds and a valuable suggestion for predicting protein structures with novel folds. Here we clarified the frequency with which protein folds share the same SSE packing arrangement with other folds, the types of spatial arrangement of SSEs that are frequently observed across different folds, and the diversity of protein folds that share the same spatial arrangement of SSEs with a given fold, using a protein structure alignment program MICAN, which we have been developing. By performing comprehensive structural comparison of SCOP fold representatives, we found that approximately 80% of protein folds share the same spatial arrangement of SSEs with other folds. We also observed that many protein pairs that share the same spatial arrangement of SSEs belong to the different classes, often with an opposing N- to C-terminal direction of the polypeptide chain. The most frequently observed spatial arrangement of SSEs was the 2-layer α/β packing arrangement and it was dispersed among as many as 27% of SCOP fold representatives. These results suggest that the same spatial arrangements of SSEs are adopted by a wide variety of different folds and that the spatial arrangement of SSEs is highly robust against the N- to C-terminal direction of the polypeptide chain. PMID:25243952
GroEL-GroES assisted folding of multiple recombinant proteins simultaneously over-expressed in Escherichia coli.

PubMed

Goyal, Megha; Chaudhuri, Tapan K

2015-07-01

Folding of aggregation prone recombinant proteins through co-expression of chaperonin GroEL and GroES has been a popular practice in the effort to optimize preparation of functional protein in Escherichia coli. Considering the demand for functional recombinant protein products, it is desirable to apply the chaperone assisted protein folding strategy for enhancing the yield of properly folded protein. Toward the same direction, it is also worth attempting folding of multiple recombinant proteins simultaneously over-expressed in E. coli through the assistance of co-expressed GroEL-ES. The genesis of this thinking was originated from the fact that cellular GroEL and GroES assist in the folding of several endogenous proteins expressed in the bacterial cell. Here we present the experimental findings from our study on co-expressed GroEL-GroES assisted folding of simultaneously over-expressed proteins maltodextrin glucosidase (MalZ) and yeast mitochondrial aconitase (mAco). Both proteins mentioned here are relatively larger and aggregation prone, mostly form inclusion bodies, and undergo GroEL-ES assisted folding in E. coli cells during over-expression. It has been reported that the relative yield of properly folded functional forms of MalZ and mAco with the exogenous GroEL-ES assistance were comparable with the results when these proteins were overexpressed alone. This observation is quite promising and highlights the fact that GroEL and GroES can assist in the folding of multiple substrate proteins simultaneously when over-expressed in E. coli. This method might be a potential tool for enhanced production of multiple functional recombinant proteins simultaneously in E. coli. Copyright © 2015 Elsevier Ltd. All rights reserved.
An overlapping region between the two terminal folding units of the outer surface protein A (OspA) controls its folding behavior.

PubMed

Makabe, Koki; Nakamura, Takashi; Dhar, Debanjan; Ikura, Teikichi; Koide, Shohei; Kuwajima, Kunihiro

2018-04-27

Although many naturally occurring proteins consist of multiple domains, most studies on protein folding to date deal with single-domain proteins or isolated domains of multi-domain proteins. Studies of multi-domain protein folding are required for further advancing our understanding of protein folding mechanisms. Borrelia outer surface protein A (OspA) is a β-rich two-domain protein, in which two globular domains are connected by a rigid and stable single-layer β-sheet. Thus, OspA is particularly suited as a model system for studying the interplays of domains in protein folding. Here, we studied the equilibria and kinetics of the urea-induced folding-unfolding reactions of OspA probed with tryptophan fluorescence and ultraviolet circular dichroism. Global analysis of the experimental data revealed compelling lines of evidence for accumulation of an on-pathway intermediate during kinetic refolding and for the identity between the kinetic intermediate and a previously described equilibrium unfolding intermediate. The results suggest that the intermediate has the fully native structure in the N-terminal domain and the single layer β-sheet, with the C-terminal domain still unfolded. The observation of the productive on-pathway folding intermediate clearly indicates substantial interactions between the two domains mediated by the single-layer β-sheet. We propose that a rigid and stable intervening region between two domains creates an overlap between two folding units and can energetically couple their folding reactions. Copyright © 2018. Published by Elsevier Ltd.
Method of generating ploynucleotides encoding enhanced folding variants

DOEpatents

Bradbury, Andrew M.; Kiss, Csaba; Waldo, Geoffrey S.

2017-05-02

The invention provides directed evolution methods for improving the folding, solubility and stability (including thermostability) characteristics of polypeptides. In one aspect, the invention provides a method for generating folding and stability-enhanced variants of proteins, including but not limited to fluorescent proteins, chromophoric proteins and enzymes. In another aspect, the invention provides methods for generating thermostable variants of a target protein or polypeptide via an internal destabilization baiting strategy. Internally destabilization a protein of interest is achieved by inserting a heterologous, folding-destabilizing sequence (folding interference domain) within DNA encoding the protein of interest, evolving the protein sequences adjacent to the heterologous insertion to overcome the destabilization (using any number of mutagenesis methods), thereby creating a library of variants. The variants in the library are expressed, and those with enhanced folding characteristics selected.
Folding of the four-helix bundle FF domain from a compact on-pathway intermediate state is governed predominantly by water motion.

PubMed

Sekhar, Ashok; Vallurupalli, Pramodh; Kay, Lewis E

2012-11-20

Friction plays a critical role in protein folding. Frictional forces originating from random solvent and protein fluctuations both retard motion along the folding pathway and activate protein molecules to cross free energy barriers. Studies of friction thus may provide insights into the driving forces underlying protein conformational dynamics. However, the molecular origin of friction in protein folding remains poorly understood because, with the exception of the native conformer, there generally is little detailed structural information on the other states participating in the folding process. Here, we study the folding of the four-helix bundle FF domain that proceeds via a transiently formed, sparsely populated compact on-pathway folding intermediate whose structure was elucidated previously. Because the intermediate is stabilized by both native and nonnative interactions, friction in the folding transition between intermediate and folded states is expected to arise from intrachain reorganization in the protein. However, the viscosity dependencies of rates of folding from or unfolding to the intermediate, as established by relaxation dispersion NMR spectroscopy, clearly indicate that contributions from internal friction are small relative to those from solvent, so solvent frictional forces drive the folding process. Our results emphasize the importance of solvent dynamics in mediating the interconversion between protein configurations, even those that are highly compact, and in equilibrium folding/unfolding fluctuations in general.

A consensus view of fold space: Combining SCOP, CATH, and the Dali Domain Dictionary

PubMed Central

Day, Ryan; Beck, David A.C.; Armen, Roger S.; Daggett, Valerie

2003-01-01

We have determined consensus protein-fold classifications on the basis of three classification methods, SCOP, CATH, and Dali. These classifications make use of different methods of defining and categorizing protein folds that lead to different views of protein-fold space. Pairwise comparisons of domains on the basis of their fold classifications show that much of the disagreement between the classification systems is due to differing domain definitions rather than assigning the same domain to different folds. However, there are significant differences in the fold assignments between the three systems. These remaining differences can be explained primarily in terms of the breadth of the fold classifications. Many structures may be defined as having one fold in one system, whereas far fewer are defined as having the analogous fold in another system. By comparing these folds for a nonredundant set of proteins, the consensus method breaks up broad fold classifications and combines restrictive fold classifications into metafolds, creating, in effect, an averaged view of fold space. This averaged view requires that the structural similarities between proteins having the same metafold be recognized by multiple classification systems. Thus, the consensus map is useful for researchers looking for fold similarities that are relatively independent of the method used to compare proteins. The 30 most populated metafolds, representing the folds of about half of a nonredundant subset of the PDB, are presented here. The full list of metafolds is presented on the Web. PMID:14500873
A consensus view of fold space: combining SCOP, CATH, and the Dali Domain Dictionary.

PubMed

Day, Ryan; Beck, David A C; Armen, Roger S; Daggett, Valerie

2003-10-01

We have determined consensus protein-fold classifications on the basis of three classification methods, SCOP, CATH, and Dali. These classifications make use of different methods of defining and categorizing protein folds that lead to different views of protein-fold space. Pairwise comparisons of domains on the basis of their fold classifications show that much of the disagreement between the classification systems is due to differing domain definitions rather than assigning the same domain to different folds. However, there are significant differences in the fold assignments between the three systems. These remaining differences can be explained primarily in terms of the breadth of the fold classifications. Many structures may be defined as having one fold in one system, whereas far fewer are defined as having the analogous fold in another system. By comparing these folds for a nonredundant set of proteins, the consensus method breaks up broad fold classifications and combines restrictive fold classifications into metafolds, creating, in effect, an averaged view of fold space. This averaged view requires that the structural similarities between proteins having the same metafold be recognized by multiple classification systems. Thus, the consensus map is useful for researchers looking for fold similarities that are relatively independent of the method used to compare proteins. The 30 most populated metafolds, representing the folds of about half of a nonredundant subset of the PDB, are presented here. The full list of metafolds is presented on the Web.
A novel Multi-Agent Ada-Boost algorithm for predicting protein structural class with the information of protein secondary structure.

PubMed

Fan, Ming; Zheng, Bin; Li, Lihua

2015-10-01

Knowledge of the structural class of a given protein is important for understanding its folding patterns. Although a lot of efforts have been made, it still remains a challenging problem for prediction of protein structural class solely from protein sequences. The feature extraction and classification of proteins are the main problems in prediction. In this research, we extended our earlier work regarding these two aspects. In protein feature extraction, we proposed a scheme by calculating the word frequency and word position from sequences of amino acid, reduced amino acid, and secondary structure. For an accurate classification of the structural class of protein, we developed a novel Multi-Agent Ada-Boost (MA-Ada) method by integrating the features of Multi-Agent system into Ada-Boost algorithm. Extensive experiments were taken to test and compare the proposed method using four benchmark datasets in low homology. The results showed classification accuracies of 88.5%, 96.0%, 88.4%, and 85.5%, respectively, which are much better compared with the existing methods. The source code and dataset are available on request.
Statistical mechanics of protein structural transitions: Insights from the island model

PubMed Central

Kobayashi, Yukio

2016-01-01

The so-called island model of protein structural transition holds that hydrophobic interactions are the key to both the folding and function of proteins. Herein, the genesis and statistical mechanical basis of the island model of transitions are reviewed, by presenting the results of simulations of such transitions. Elucidating the physicochemical mechanism of protein structural formation is the foundation for understanding the hierarchical structure of life at the microscopic level. Based on the results obtained to date using the island model, remaining problems and future work in the field of protein structures are discussed, referencing Professor Saitô’s views on the hierarchic structure of science. PMID:28409078
Multiphase Simulated Annealing Based on Boltzmann and Bose-Einstein Distribution Applied to Protein Folding Problem.

PubMed

Frausto-Solis, Juan; Liñán-García, Ernesto; Sánchez-Hernández, Juan Paulo; González-Barbosa, J Javier; González-Flores, Carlos; Castilla-Valdez, Guadalupe

2016-01-01

A new hybrid Multiphase Simulated Annealing Algorithm using Boltzmann and Bose-Einstein distributions (MPSABBE) is proposed. MPSABBE was designed for solving the Protein Folding Problem (PFP) instances. This new approach has four phases: (i) Multiquenching Phase (MQP), (ii) Boltzmann Annealing Phase (BAP), (iii) Bose-Einstein Annealing Phase (BEAP), and (iv) Dynamical Equilibrium Phase (DEP). BAP and BEAP are simulated annealing searching procedures based on Boltzmann and Bose-Einstein distributions, respectively. DEP is also a simulated annealing search procedure, which is applied at the final temperature of the fourth phase, which can be seen as a second Bose-Einstein phase. MQP is a search process that ranges from extremely high to high temperatures, applying a very fast cooling process, and is not very restrictive to accept new solutions. However, BAP and BEAP range from high to low and from low to very low temperatures, respectively. They are more restrictive for accepting new solutions. DEP uses a particular heuristic to detect the stochastic equilibrium by applying a least squares method during its execution. MPSABBE parameters are tuned with an analytical method, which considers the maximal and minimal deterioration of problem instances. MPSABBE was tested with several instances of PFP, showing that the use of both distributions is better than using only the Boltzmann distribution on the classical SA.
Amino Acid Distribution Rules Predict Protein Fold: Protein Grammar for Beta-Strand Sandwich-Like Structures

PubMed Central

Kister, Alexander

2015-01-01

We present an alternative approach to protein 3D folding prediction based on determination of rules that specify distribution of “favorable” residues, that are mainly responsible for a given fold formation, and “unfavorable” residues, that are incompatible with that fold, in polypeptide sequences. The process of determining favorable and unfavorable residues is iterative. The starting assumptions are based on the general principles of protein structure formation as well as structural features peculiar to a protein fold under investigation. The initial assumptions are tested one-by-one for a set of all known proteins with a given structure. The assumption is accepted as a “rule of amino acid distribution” for the protein fold if it holds true for all, or near all, structures. If the assumption is not accepted as a rule, it can be modified to better fit the data and then tested again in the next step of the iterative search algorithm, or rejected. We determined the set of amino acid distribution rules for a large group of beta sandwich-like proteins characterized by a specific arrangement of strands in two beta sheets. It was shown that this set of rules is highly sensitive (~90%) and very specific (~99%) for identifying sequences of proteins with specified beta sandwich fold structure. The advantage of the proposed approach is that it does not require that query proteins have a high degree of homology to proteins with known structure. So long as the query protein satisfies residue distribution rules, it can be confidently assigned to its respective protein fold. Another advantage of our approach is that it allows for a better understanding of which residues play an essential role in protein fold formation. It may, therefore, facilitate rational protein engineering design. PMID:25625198
Developing a molecular dynamics force field for both folded and disordered protein states.

PubMed

Robustelli, Paul; Piana, Stefano; Shaw, David E

2018-05-07

Molecular dynamics (MD) simulation is a valuable tool for characterizing the structural dynamics of folded proteins and should be similarly applicable to disordered proteins and proteins with both folded and disordered regions. It has been unclear, however, whether any physical model (force field) used in MD simulations accurately describes both folded and disordered proteins. Here, we select a benchmark set of 21 systems, including folded and disordered proteins, simulate these systems with six state-of-the-art force fields, and compare the results to over 9,000 available experimental data points. We find that none of the tested force fields simultaneously provided accurate descriptions of folded proteins, of the dimensions of disordered proteins, and of the secondary structure propensities of disordered proteins. Guided by simulation results on a subset of our benchmark, however, we modified parameters of one force field, achieving excellent agreement with experiment for disordered proteins, while maintaining state-of-the-art accuracy for folded proteins. The resulting force field, a99SB- disp , should thus greatly expand the range of biological systems amenable to MD simulation. A similar approach could be taken to improve other force fields. Copyright © 2018 the Author(s). Published by PNAS.
A Simple and Effective Protein Folding Activity Suitable for Large Lectures

ERIC Educational Resources Information Center

White, Brian

2006-01-01

This article describes a simple and inexpensive hands-on simulation of protein folding suitable for use in large lecture classes. This activity uses a minimum of parts, tools, and skill to simulate some of the fundamental principles of protein folding. The major concepts targeted are that proteins begin as linear polypeptides and fold to…
General Mechanism of Two-State Protein Folding Kinetics

PubMed Central

Rollins, Geoffrey C.; Dill, Ken A.

2016-01-01

We describe here a general model of the kinetic mechanism of protein folding. In the Foldon Funnel Model, proteins fold in units of secondary structures, which form sequentially along the folding pathway, stabilized by tertiary interactions. The model predicts that the free energy landscape has a volcano shape, rather than a simple funnel, that folding is two-state (single-exponential) when secondary structures are intrinsically unstable, and that each structure along the folding path is a transition state for the previous structure. It shows how sequential pathways are consistent with multiple stochastic routes on funnel landscapes, and it gives good agreement with the 9 order of magnitude dependence of folding rates on protein size for a set of 93 proteins, at the same time it is consistent with the near independence of folding equilibrium constant on size. This model gives estimates of folding rates of proteomes, leading to a median folding time in Escherichia coli of about 5 s. PMID:25056406
There and back again: Two views on the protein folding puzzle.

PubMed

Finkelstein, Alexei V; Badretdin, Azat J; Galzitskaya, Oxana V; Ivankov, Dmitry N; Bogatyreva, Natalya S; Garbuzynskiy, Sergiy O

2017-07-01

The ability of protein chains to spontaneously form their spatial structures is a long-standing puzzle in molecular biology. Experimentally measured folding times of single-domain globular proteins range from microseconds to hours: the difference (10-11 orders of magnitude) is the same as that between the life span of a mosquito and the age of the universe. This review describes physical theories of rates of overcoming the free-energy barrier separating the natively folded (N) and unfolded (U) states of protein chains in both directions: "U-to-N" and "N-to-U". In the theory of protein folding rates a special role is played by the point of thermodynamic (and kinetic) equilibrium between the native and unfolded state of the chain; here, the theory obtains the simplest form. Paradoxically, a theoretical estimate of the folding time is easier to get from consideration of protein unfolding (the "N-to-U" transition) rather than folding, because it is easier to outline a good unfolding pathway of any structure than a good folding pathway that leads to the stable fold, which is yet unknown to the folding protein chain. And since the rates of direct and reverse reactions are equal at the equilibrium point (as follows from the physical "detailed balance" principle), the estimated folding time can be derived from the estimated unfolding time. Theoretical analysis of the "N-to-U" transition outlines the range of protein folding rates in a good agreement with experiment. Theoretical analysis of folding (the "U-to-N" transition), performed at the level of formation and assembly of protein secondary structures, outlines the upper limit of protein folding times (i.e., of the time of search for the most stable fold). Both theories come to essentially the same results; this is not a surprise, because they describe overcoming one and the same free-energy barrier, although the way to the top of this barrier from the side of the unfolded state is very different from the way from the side of the native state; and both theories agree with experiment. In addition, they predict the maximal size of protein domains that fold under solely thermodynamic (rather than kinetic) control and explain the observed maximal size of the "foldable" protein domains. Copyright © 2017 Elsevier B.V. All rights reserved.
There and back again: Two views on the protein folding puzzle

NASA Astrophysics Data System (ADS)

Finkelstein, Alexei V.; Badretdin, Azat J.; Galzitskaya, Oxana V.; Ivankov, Dmitry N.; Bogatyreva, Natalya S.; Garbuzynskiy, Sergiy O.

2017-07-01

The ability of protein chains to spontaneously form their spatial structures is a long-standing puzzle in molecular biology. Experimentally measured folding times of single-domain globular proteins range from microseconds to hours: the difference (10-11 orders of magnitude) is the same as that between the life span of a mosquito and the age of the universe. This review describes physical theories of rates of overcoming the free-energy barrier separating the natively folded (N) and unfolded (U) states of protein chains in both directions: ;U-to-N; and ;N-to-U;. In the theory of protein folding rates a special role is played by the point of thermodynamic (and kinetic) equilibrium between the native and unfolded state of the chain; here, the theory obtains the simplest form. Paradoxically, a theoretical estimate of the folding time is easier to get from consideration of protein unfolding (the ;N-to-U; transition) rather than folding, because it is easier to outline a good unfolding pathway of any structure than a good folding pathway that leads to the stable fold, which is yet unknown to the folding protein chain. And since the rates of direct and reverse reactions are equal at the equilibrium point (as follows from the physical ;detailed balance; principle), the estimated folding time can be derived from the estimated unfolding time. Theoretical analysis of the ;N-to-U; transition outlines the range of protein folding rates in a good agreement with experiment. Theoretical analysis of folding (the ;U-to-N; transition), performed at the level of formation and assembly of protein secondary structures, outlines the upper limit of protein folding times (i.e., of the time of search for the most stable fold). Both theories come to essentially the same results; this is not a surprise, because they describe overcoming one and the same free-energy barrier, although the way to the top of this barrier from the side of the unfolded state is very different from the way from the side of the native state; and both theories agree with experiment. In addition, they predict the maximal size of protein domains that fold under solely thermodynamic (rather than kinetic) control and explain the observed maximal size of the ;foldable; protein domains.
Complete fold annotation of the human proteome using a novel structural feature space.

PubMed

Middleton, Sarah A; Illuminati, Joseph; Kim, Junhyong

2017-04-13

Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this method by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families.
Complete fold annotation of the human proteome using a novel structural feature space

PubMed Central

Middleton, Sarah A.; Illuminati, Joseph; Kim, Junhyong

2017-01-01

Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this method by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families. PMID:28406174
Direct Observation of Parallel Folding Pathways Revealed Using a Symmetric Repeat Protein System

PubMed Central

Aksel, Tural; Barrick, Doug

2014-01-01

Although progress has been made to determine the native fold of a polypeptide from its primary structure, the diversity of pathways that connect the unfolded and folded states has not been adequately explored. Theoretical and computational studies predict that proteins fold through parallel pathways on funneled energy landscapes, although experimental detection of pathway diversity has been challenging. Here, we exploit the high translational symmetry and the direct length variation afforded by linear repeat proteins to directly detect folding through parallel pathways. By comparing folding rates of consensus ankyrin repeat proteins (CARPs), we find a clear increase in folding rates with increasing size and repeat number, although the size of the transition states (estimated from denaturant sensitivity) remains unchanged. The increase in folding rate with chain length, as opposed to a decrease expected from typical models for globular proteins, is a clear demonstration of parallel pathways. This conclusion is not dependent on extensive curve-fitting or structural perturbation of protein structure. By globally fitting a simple parallel-Ising pathway model, we have directly measured nucleation and propagation rates in protein folding, and have quantified the fluxes along each path, providing a detailed energy landscape for folding. This finding of parallel pathways differs from results from kinetic studies of repeat-proteins composed of sequence-variable repeats, where modest repeat-to-repeat energy variation coalesces folding into a single, dominant channel. Thus, for globular proteins, which have much higher variation in local structure and topology, parallel pathways are expected to be the exception rather than the rule. PMID:24988356
Can a pairwise contact potential stabilize native protein folds against decoys obtained by threading?

PubMed

Vendruscolo, M; Najmanovich, R; Domany, E

2000-02-01

We present a method to derive contact energy parameters from large sets of proteins. The basic requirement on which our method is based is that for each protein in the database the native contact map has lower energy than all its decoy conformations that are obtained by threading. Only when this condition is satisfied one can use the proposed energy function for fold identification. Such a set of parameters can be found (by perceptron learning) if Mp, the number of proteins in the database, is not too large. Other aspects that influence the existence of such a solution are the exact definition of contact and the value of the critical distance Rc, below which two residues are considered to be in contact. Another important novel feature of our approach is its ability to determine whether an energy function of some suitable proposed form can or cannot be parameterized in a way that satisfies our basic requirement. As a demonstration of this, we determine the region in the (Rc, Mp) plane in which the problem is solvable, i.e., we can find a set of contact parameters that stabilize simultaneously all the native conformations. We show that for large enough databases the contact approximation to the energy cannot stabilize all the native folds even against the decoys obtained by gapless threading.
Protein folding on Biosensor tips: Folding of Maltodextrin glucosidase monitored by its interactions with GroEL

PubMed Central

Pastor, Ashutosh; Singh, Amit K.; Fisher, Mark T.; Chaudhuri, Tapan K.

2016-01-01

Protein folding has been extensively studied for past four decades by employing solution based experiments such as solubility, enzymatic activity, secondary structure analysis, and analytical methods like FRET, NMR and HD exchange. However, for rapid analysis of the folding process, solution based approaches are often plagued with aggregation side reactions resulting in poor yields. In this work we demonstrate that a Bio-Layer Interferometry (BLI) chaperonin detection system can be potentially applied to identify superior refolding conditions for denatured proteins. The degree of immobilized protein folding as a function of time can be detected by monitoring the binding of the high-affinity nucleotide-free form of the chaperonin GroEL. GroEL preferentially interacts with proteins that have hydrophobic surfaces exposed in their unfolded or partially folded form so a decrease in GroEL binding can be correlated with burial of hydrophobic surfaces as folding progresses. The magnitude of GroEL binding to the protein immobilized on Bio-layer interferometry biosensor inversely reflects the extent of protein folding and hydrophobic residue burial. We demonstrate conditions where accelerated folding can be observed for the aggregation prone protein Maltodextrin glucosidase (MalZ). Superior immobilized folding conditions identified on the Bio-layer interferometry biosensor surface were reproduced on Ni-NTA sepharose bead surfaces and resulted in significant improvement in folding yields of released MalZ (measured by enzymatic activity) compared to bulk refolding conditions in solution. PMID:27367928
Protein homology model refinement by large-scale energy optimization.

PubMed

Park, Hahnbeom; Ovchinnikov, Sergey; Kim, David E; DiMaio, Frank; Baker, David

2018-03-20

Proteins fold to their lowest free-energy structures, and hence the most straightforward way to increase the accuracy of a partially incorrect protein structure model is to search for the lowest-energy nearby structure. This direct approach has met with little success for two reasons: first, energy function inaccuracies can lead to false energy minima, resulting in model degradation rather than improvement; and second, even with an accurate energy function, the search problem is formidable because the energy only drops considerably in the immediate vicinity of the global minimum, and there are a very large number of degrees of freedom. Here we describe a large-scale energy optimization-based refinement method that incorporates advances in both search and energy function accuracy that can substantially improve the accuracy of low-resolution homology models. The method refined low-resolution homology models into correct folds for 50 of 84 diverse protein families and generated improved models in recent blind structure prediction experiments. Analyses of the basis for these improvements reveal contributions from both the improvements in conformational sampling techniques and the energy function.
The dimerization equilibrium of a ClC Cl−/H+ antiporter in lipid bilayers

PubMed Central

Chadda, Rahul; Krishnamani, Venkatramanan; Mersch, Kacey; Wong, Jason; Brimberry, Marley; Chadda, Ankita; Kolmakova-Partensky, Ludmila; Friedman, Larry J; Gelles, Jeff; Robertson, Janice L

2016-01-01

Interactions between membrane protein interfaces in lipid bilayers play an important role in membrane protein folding but quantification of the strength of these interactions has been challenging. Studying dimerization of ClC-type transporters offers a new approach to the problem, as individual subunits adopt a stable and functionally verifiable fold that constrains the system to two states – monomer or dimer. Here, we use single-molecule photobleaching analysis to measure the probability of ClC-ec1 subunit capture into liposomes during extrusion of large, multilamellar membranes. The capture statistics describe a monomer to dimer transition that is dependent on the subunit/lipid mole fraction density and follows an equilibrium dimerization isotherm. This allows for the measurement of the free energy of ClC-ec1 dimerization in lipid bilayers, revealing that it is one of the strongest membrane protein complexes measured so far, and introduces it as new type of dimerization model to investigate the physical forces that drive membrane protein association in membranes. DOI: http://dx.doi.org/10.7554/eLife.17438.001 PMID:27484630
DOE Office of Scientific and Technical Information (OSTI.GOV)

Wołek, Karol; Cieplak, Marek, E-mail: mc@ifpan.edu.pl

In structure-based models of proteins, one often assumes that folding is accomplished when all contacts are established. This assumption may frequently lead to a conceptual problem that folding takes place in a temperature region of very low thermodynamic stability, especially when the contact map used is too sparse. We consider six different structure-based models and show that allowing for a small, but model-dependent, percentage of the native contacts not being established boosts the folding temperature substantially while affecting the time scales of folding only in a minor way. We also compare other properties of the six models. We show thatmore » the choice of the description of the backbone stiffness has a substantial effect on the values of characteristic temperatures that relate both to equilibrium and kinetic properties. Models without any backbone stiffness (like the self-organized polymer) are found to perform similar to those with the stiffness, including in the studies of stretching.« less
Protein Export by the Mycobacterial SecA2 System Is Determined by the Preprotein Mature Domain

PubMed Central

Feltcher, Meghan E.; Gibbons, Henry S.; Ligon, Lauren S.

2013-01-01

At the core of the bacterial general secretion (Sec) pathway is the SecA ATPase, which powers translocation of unfolded preproteins containing Sec signal sequences through the SecYEG membrane channel. Mycobacteria have two nonredundant SecA homologs: SecA1 and SecA2. While the essential SecA1 handles “housekeeping” export, the nonessential SecA2 exports a subset of proteins and is required for Mycobacterium tuberculosis virulence. Currently, it is not understood how SecA2 contributes to Sec export in mycobacteria. In this study, we focused on identifying the features of two SecA2 substrates that target them to SecA2 for export, the Ms1704 and Ms1712 lipoproteins of the model organism Mycobacterium smegmatis. We found that the mature domains of Ms1704 and Ms1712, not the N-terminal signal sequences, confer SecA2-dependent export. We also demonstrated that the lipid modification and the extreme N terminus of the mature protein do not impart the requirement for SecA2 in export. We further showed that the Ms1704 mature domain can be efficiently exported by the twin-arginine translocation (Tat) pathway. Because the Tat system exports only folded proteins, this result implies that SecA2 substrates can fold in the cytoplasm and suggests a putative role of SecA2 in enabling export of such proteins. Thus, the mycobacterial SecA2 system may represent another way that bacteria solve the problem of exporting proteins that can fold in the cytoplasm. PMID:23204463

Non-homologous isofunctional enzymes: a systematic analysis of alternative solutions in enzyme evolution.

PubMed

Omelchenko, Marina V; Galperin, Michael Y; Wolf, Yuri I; Koonin, Eugene V

2010-04-30

Evolutionarily unrelated proteins that catalyze the same biochemical reactions are often referred to as analogous - as opposed to homologous - enzymes. The existence of numerous alternative, non-homologous enzyme isoforms presents an interesting evolutionary problem; it also complicates genome-based reconstruction of the metabolic pathways in a variety of organisms. In 1998, a systematic search for analogous enzymes resulted in the identification of 105 Enzyme Commission (EC) numbers that included two or more proteins without detectable sequence similarity to each other, including 34 EC nodes where proteins were known (or predicted) to have distinct structural folds, indicating independent evolutionary origins. In the past 12 years, many putative non-homologous isofunctional enzymes were identified in newly sequenced genomes. In addition, efforts in structural genomics resulted in a vastly improved structural coverage of proteomes, providing for definitive assessment of (non)homologous relationships between proteins. We report the results of a comprehensive search for non-homologous isofunctional enzymes (NISE) that yielded 185 EC nodes with two or more experimentally characterized - or predicted - structurally unrelated proteins. Of these NISE sets, only 74 were from the original 1998 list. Structural assignments of the NISE show over-representation of proteins with the TIM barrel fold and the nucleotide-binding Rossmann fold. From the functional perspective, the set of NISE is enriched in hydrolases, particularly carbohydrate hydrolases, and in enzymes involved in defense against oxidative stress. These results indicate that at least some of the non-homologous isofunctional enzymes were recruited relatively recently from enzyme families that are active against related substrates and are sufficiently flexible to accommodate changes in substrate specificity.
Effects of tethering a multistate folding protein to a surface

NASA Astrophysics Data System (ADS)

Wei, Shuai; Knotts, Thomas A.

2011-05-01

Protein/surface interactions are important in a variety of fields and devices, yet fundamental understanding of the relevant phenomena remains fragmented due to resolution limitations of experimental techniques. Molecular simulation has provided useful answers, but such studies have focused on proteins that fold through a two-state process. This study uses simulation to show how surfaces can affect proteins which fold through a multistate process by investigating the folding mechanism of lysozyme (PDB ID: 7LZM). The results demonstrate that in the bulk 7LZM folds through a process with four stable states: the folded state, the unfolded state, and two stable intermediates. The folding mechanism remains the same when the protein is tethered to a surface at most residues; however, in one case the folding mechanism changes in such a way as to eliminate one of the intermediates. An analysis of the molecular configurations shows that tethering at this site is advantageous for protein arrays because the active site is both presented to the bulk phase and stabilized. Taken as a whole, the results offer hope that rational design of protein arrays is possible once the behavior of the protein on the surface is ascertained.
Using Chou's general PseAAC to analyze the evolutionary relationship of receptor associated proteins (RAP) with various folding patterns of protein domains.

PubMed

Muthu Krishnan, S

2018-05-14

The receptor-associated protein (RAP) is an inhibitor of endocytic receptors that belong to the lipoprotein receptor gene family. In this study, a computational approach was tried to find the evolutionarily related fold of the RAP proteins. Through the structural and sequence-based analysis, found various protein folds that are very close to the RAP folds. Remote homolog datasets were used potentially to develop a different support vector machine (SVM) methods to recognize the homologous RAP fold. This study helps in understanding the relationship of RAP homologs folds based on the structure, function and evolutionary history. Copyright © 2018 Elsevier Ltd. All rights reserved.
Improving protein fold recognition by extracting fold-specific features from predicted residue-residue contacts.

PubMed

Zhu, Jianwei; Zhang, Haicang; Li, Shuai Cheng; Wang, Chao; Kong, Lupeng; Sun, Shiwei; Zheng, Wei-Mou; Bu, Dongbo

2017-12-01

Accurate recognition of protein fold types is a key step for template-based prediction of protein structures. The existing approaches to fold recognition mainly exploit the features derived from alignments of query protein against templates. These approaches have been shown to be successful for fold recognition at family level, but usually failed at superfamily/fold levels. To overcome this limitation, one of the key points is to explore more structurally informative features of proteins. Although residue-residue contacts carry abundant structural information, how to thoroughly exploit these information for fold recognition still remains a challenge. In this study, we present an approach (called DeepFR) to improve fold recognition at superfamily/fold levels. The basic idea of our approach is to extract fold-specific features from predicted residue-residue contacts of proteins using deep convolutional neural network (DCNN) technique. Based on these fold-specific features, we calculated similarity between query protein and templates, and then assigned query protein with fold type of the most similar template. DCNN has showed excellent performance in image feature extraction and image recognition; the rational underlying the application of DCNN for fold recognition is that contact likelihood maps are essentially analogy to images, as they both display compositional hierarchy. Experimental results on the LINDAHL dataset suggest that even using the extracted fold-specific features alone, our approach achieved success rate comparable to the state-of-the-art approaches. When further combining these features with traditional alignment-related features, the success rate of our approach increased to 92.3%, 82.5% and 78.8% at family, superfamily and fold levels, respectively, which is about 18% higher than the state-of-the-art approach at fold level, 6% higher at superfamily level and 1% higher at family level. An independent assessment on SCOP_TEST dataset showed consistent performance improvement, indicating robustness of our approach. Furthermore, bi-clustering results of the extracted features are compatible with fold hierarchy of proteins, implying that these features are fold-specific. Together, these results suggest that the features extracted from predicted contacts are orthogonal to alignment-related features, and the combination of them could greatly facilitate fold recognition at superfamily/fold levels and template-based prediction of protein structures. Source code of DeepFR is freely available through https://github.com/zhujianwei31415/deepfr, and a web server is available through http://protein.ict.ac.cn/deepfr. zheng@itp.ac.cn or dbu@ict.ac.cn. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
The Folding of a Family of Three-Helix Bundle Proteins: Spectrin R15 Has a Robust Folding Nucleus, Unlike Its Homologous Neighbours☆

PubMed Central

Kwa, Lee Gyan; Wensley, Beth G.; Alexander, Crispin G.; Browning, Stuart J.; Lichman, Benjamin R.; Clarke, Jane

2014-01-01

Three homologous spectrin domains have remarkably different folding characteristics. We have previously shown that the slow-folding R16 and R17 spectrin domains can be altered to resemble the fast folding R15, in terms of speed of folding (and unfolding), landscape roughness and folding mechanism, simply by substituting five residues in the core. Here we show that, by contrast, R15 cannot be engineered to resemble R16 and R17. It is possible to engineer a slow-folding version of R15, but our analysis shows that this protein neither has a rougher energy landscape nor does change its folding mechanism. Quite remarkably, R15 appears to be a rare example of a protein with a folding nucleus that does not change in position or in size when its folding nucleus is disrupted. Thus, while two members of this protein family are remarkably plastic, the third has apparently a restricted folding landscape. PMID:24373753
PROTERAN: animated terrain evolution for visual analysis of patterns in protein folding trajectory.

PubMed

Zhou, Ruhong; Parida, Laxmi; Kapila, Kush; Mudur, Sudhir

2007-01-01

The mechanism of protein folding remains largely a mystery in molecular biology, despite the enormous effort from many groups in the past decades. Currently, the protein folding mechanism is often characterized by calculating the free energy landscape versus various reaction coordinates such as the fraction of native contacts, the radius of gyration and so on. In this paper, we present an integrated approach towards understanding the folding process via visual analysis of patterns of these reaction coordinates. The three disparate processes (1) protein folding simulation, (2) pattern elicitation and (3) visualization of patterns, work in tandem. Thus as the protein folds, the changing landscape in the pattern space can be viewed via the visualization tool, PROTERAN, a program we developed for this purpose. We first present an incremental (on-line) trie-based pattern discovery algorithm to elicit the patterns and then describe the terrain metaphor based visualization tool. Using two example small proteins, a beta-hairpin and a designed protein Trp-cage, we next demonstrate that this combined pattern discovery and visualization approach extracts crucial information about protein folding intermediates and mechanism.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Middleton, Sarah A.; Illuminati, Joseph; Kim, Junhyong

Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this methodmore » by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Finally, our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families.« less
Molecular chaperone function of Mia40 triggers consecutive induced folding steps of the substrate in mitochondrial protein import

PubMed Central

Banci, Lucia; Bertini, Ivano; Cefaro, Chiara; Cenacchi, Lucia; Ciofi-Baffoni, Simone; Felli, Isabella Caterina; Gallo, Angelo; Gonnelli, Leonardo; Luchinat, Enrico; Sideris, Dionisia; Tokatlidis, Kostas

2010-01-01

Several proteins of the mitochondrial intermembrane space are targeted by internal targeting signals. A class of such proteins with α-helical hairpin structure bridged by two intramolecular disulfides is trapped by a Mia40-dependent oxidative process. Here, we describe the oxidative folding mechanism underpinning this process by an exhaustive structural characterization of the protein in all stages and as a complex with Mia40. Two consecutive induced folding steps are at the basis of the protein-trapping process. In the first one, Mia40 functions as a molecular chaperone assisting α-helical folding of the internal targeting signal of the substrate. Subsequently, in a Mia40-independent manner, folding of the second substrate helix is induced by the folded targeting signal functioning as a folding scaffold. The Mia40-induced folding pathway provides a proof of principle for the general concept that internal targeting signals may operate as a folding nucleus upon compartment-specific activation. PMID:21059946
Structure of a Reptilian Adenovirus Reveals a Phage Tailspike Fold Stabilizing a Vertebrate Virus Capsid.

PubMed

Menéndez-Conejero, Rosa; Nguyen, Thanh H; Singh, Abhimanyu K; Condezo, Gabriela N; Marschang, Rachel E; van Raaij, Mark J; San Martín, Carmen

2017-10-03

Although non-human adenoviruses (AdVs) might offer solutions to problems posed by human AdVs as therapeutic vectors, little is known about their basic biology. In particular, there are no structural studies on the complete virion of any AdV with a non-mammalian host. We combine mass spectrometry, cryo-electron microscopy, and protein crystallography to characterize the composition and structure of a snake AdV (SnAdV-1, Atadenovirus genus). SnAdV-1 particles contain the genus-specific proteins LH3, p32k, and LH2, a previously unrecognized structural component. Remarkably, the cementing protein LH3 has a trimeric β helix fold typical of bacteriophage host attachment proteins. The organization of minor coat proteins differs from that in human AdVs, correlating with higher thermostability in SnAdV-1. These findings add a new piece to the intriguing puzzle of virus evolution, hint at the use of cell entry pathways different from those in human AdVs, and will help development of new, thermostable SnAdV-1-based vectors. Copyright © 2017 Elsevier Ltd. All rights reserved.
How protein chemists learned about the hydrophobic factor.

PubMed Central

Tanford, C.

1997-01-01

It is generally accepted today that the hydrophobic force is the dominant energetic factor that leads to the folding of polypeptide chains into compact globular entities. This principle was first explicitly introduced to protein chemists in 1938 by Irving Langmuir, past master in the application of hydrophobicity to other problems, and was enthusiastically endorsed by J.D. Bernal. But both proposal and endorsement came in the course of a debate about a quite different structural principle, the so-called "cyclol hypothesis" proposed by D. Wrinch, which soon proved to be theoretically and experimentally unsupportable. Being a more tangible idea, directly expressed in structural terms, the cyclol hypothesis received more attention than the hydrophobic principle and the latter never actually entered the mainstream of protein science until 1959, when it was thrust into the limelight in a lucid review by W. Kauzmann. A theoretical paper by H.S. Frank and M. Evans, not itself related to protein folding, probably played a major role in the acceptance of the hydrophobicity concept by protein chemists because it provided a crude but tangible picture of the origin of hydrophobicity per se in terms of water structure. PMID:9194199
A Particle Swarm Optimization-Based Approach with Local Search for Predicting Protein Folding.

PubMed

Yang, Cheng-Hong; Lin, Yu-Shiun; Chuang, Li-Yeh; Chang, Hsueh-Wei

2017-10-01

The hydrophobic-polar (HP) model is commonly used for predicting protein folding structures and hydrophobic interactions. This study developed a particle swarm optimization (PSO)-based algorithm combined with local search algorithms; specifically, the high exploration PSO (HEPSO) algorithm (which can execute global search processes) was combined with three local search algorithms (hill-climbing algorithm, greedy algorithm, and Tabu table), yielding the proposed HE-L-PSO algorithm. By using 20 known protein structures, we evaluated the performance of the HE-L-PSO algorithm in predicting protein folding in the HP model. The proposed HE-L-PSO algorithm exhibited favorable performance in predicting both short and long amino acid sequences with high reproducibility and stability, compared with seven reported algorithms. The HE-L-PSO algorithm yielded optimal solutions for all predicted protein folding structures. All HE-L-PSO-predicted protein folding structures possessed a hydrophobic core that is similar to normal protein folding.
Confinement in nanopores can destabilize α-helix folding proteins and stabilize the β structures

NASA Astrophysics Data System (ADS)

Javidpour, Leili; Sahimi, Muhammad

2011-09-01

Protein folding in confined media has attracted wide attention over the past decade due to its importance in both in vivo and in vitro applications. Currently, it is generally believed that protein stability increases by decreasing the size of the confining medium, if its interaction with the confining walls is repulsive, and that the maximum folding temperature in confinement occurs for a pore size only slightly larger than the smallest dimension of the folded state of a protein. Protein stability in pore sizes, very close to the size of the folded state, has not however received the attention that it deserves. Using detailed, 0.3-ms-long molecular dynamics simulations, we show that proteins with an α-helix native state can have an optimal folding temperature in pore sizes that do not affect the folded-state structure. In contradiction to the current theoretical explanations, we find that the maximum folding temperature occurs in larger pores for smaller α-helices. In highly confined pores the free energy surface becomes rough, and a new barrier for protein folding may appear close to the unfolded state. In addition, in small nanopores the protein states that contain the β structures are entropically stabilized, in contrast to the bulk. As a consequence, folding rates decrease notably and the free energy surface becomes rougher. The results shed light on many recent experimental observations that cannot be explained by the current theories, and demonstrate the importance of entropic effects on proteins' misfolded states in highly confined environments. They also support the concept of passive effect of chaperonin GroEL on protein folding by preventing it from aggregation in crowded environment of biological cells, and provide deeper clues to the α → β conformational transition, believed to contribute to Alzheimer's and Parkinson's diseases. The strategy of protein and enzyme stabilization in confined media may also have to be revisited in the case of tight confinement. For in silico studies of protein folding in confined media, use of non-Go potentials may be more appropriate.
How Does Your Protein Fold? Elucidating the Apomyoglobin Folding Pathway

PubMed Central

Dyson, H. Jane; Wright, Peter E.

2017-01-01

Conspectus Although each type of protein fold and in some cases individual proteins within a fold classification can have very different mechanisms of folding, the underlying biophysical and biochemical principles that operate to cause a linear polypeptide chain to fold into a globular structure must be the same. In an aqueous solution, the protein takes up the thermodynamically most stable structure, but the pathway along which the polypeptide proceeds in order to reach that structure is a function of the amino acid sequence, which must be the final determining factor, not only in shaping the final folded structure, but in dictating the folding pathway. A number of groups have focused on a single protein or group of proteins, to determine in detail the factors that influence the rate and mechanism of folding in a defined system, with the hope that hypothesis-driven experiments can elucidate the underlying principles governing the folding process. Our research group has focused on the folding of the globin family of proteins, and in particular on the monomeric protein apomyoglobin. Apomyoglobin (apoMb) folds relatively slowly (~2 seconds) via an ensemble of obligatory intermediates that form rapidly after the initiation of folding. The folding pathway can be dissected using rapid-mixing techniques, which can probe processes in the millisecond time range. Stopped-flow measurements detected by circular dichroism (CD) or fluorescence spectroscopy give information on the rates of folding events. Quench-flow experiments utilize the differential rates of hydrogen-deuterium exchange of amide protons protected in parts of the structure that are folded early; protection of amides can be detected by mass spectrometry or proton nuclear magnetic resonance spectroscopy (NMR). In addition, apoMb forms an intermediate at equilibrium at pH ~ 4, which is sufficiently stable for it to be structurally characterized by solution methods such as CD, fluorescence and NMR spectroscopies, and the conformational ensembles formed in the presence of denaturing agents and low pH can be characterized as models for the unfolded states of the protein. Newer NMR techniques such as measurement of residual dipolar couplings in the various partly folded states, and relaxation dispersion measurements to probe invisible states present at low concentrations, have contributed to providing a detailed picture of the apomyoglobin folding pathway. The research summarized in this review was aimed at characterizing and comparing the equilibrium and kinetic intermediates both structurally and dynamically, as well as delineating the complete folding pathway at a residue-specific level, in order to answer the question “What is it about the amino acid sequence that causes each molecule in the unfolded protein ensemble to start folding, and, once started, to proceed towards the formation of the correctly folded three-dimensional structure?” PMID:28032989
Genetic algorithms for protein threading.

PubMed

Yadgari, J; Amir, A; Unger, R

1998-01-01

Despite many years of efforts, a direct prediction of protein structure from sequence is still not possible. As a result, in the last few years researchers have started to address the "inverse folding problem": Identifying and aligning a sequence to the fold with which it is most compatible, a process known as "threading". In two meetings in which protein folding predictions were objectively evaluated, it became clear that threading as a concept promises a real breakthrough, but that much improvement is still needed in the technique itself. Threading is a NP-hard problem, and thus no general polynomial solution can be expected. Still a practical approach with demonstrated ability to find optimal solutions in many cases, and acceptable solutions in other cases, is needed. We applied the technique of Genetic Algorithms in order to significantly improve the ability of threading algorithms to find the optimal alignment of a sequence to a structure, i.e. the alignment with the minimum free energy. A major progress reported here is the design of a representation of the threading alignment as a string of fixed length. With this representation validation of alignments and genetic operators are effectively implemented. Appropriate data structure and parameters have been selected. It is shown that Genetic Algorithm threading is effective and is able to find the optimal alignment in a few test cases. Furthermore, the described algorithm is shown to perform well even without pre-definition of core elements. Existing threading methods are dependent on such constraints to make their calculations feasible. But the concept of core elements is inherently arbitrary and should be avoided if possible. While a rigorous proof is hard to submit yet an, we present indications that indeed Genetic Algorithm threading is capable of finding consistently good solutions of full alignments in search spaces of size up to 10(70).
Ab initio folding of proteins using all-atom discrete molecular dynamics

PubMed Central

Ding, Feng; Tsao, Douglas; Nie, Huifen; Dokholyan, Nikolay V.

2008-01-01

Summary Discrete molecular dynamics (DMD) is a rapid sampling method used in protein folding and aggregation studies. Until now, DMD was used to perform simulations of simplified protein models in conjunction with structure-based force fields. Here, we develop an all-atom protein model and a transferable force field featuring packing, solvation, and environment-dependent hydrogen bond interactions. Using the replica exchange method, we perform folding simulations of six small proteins (20–60 residues) with distinct native structures. In all cases, native or near-native states are reached in simulations. For three small proteins, multiple folding transitions are observed and the computationally-characterized thermodynamics are in quantitative agreement with experiments. The predictive power of all-atom DMD highlights the importance of environment-dependent hydrogen bond interactions in modeling protein folding. The developed approach can be used for accurate and rapid sampling of conformational spaces of proteins and protein-protein complexes, and applied to protein engineering and design of protein-protein interactions. PMID:18611374
Electrostatically Accelerated Encounter and Folding for Facile Recognition of Intrinsically Disordered Proteins

PubMed Central

Ganguly, Debabani; Zhang, Weihong; Chen, Jianhan

2013-01-01

Achieving facile specific recognition is essential for intrinsically disordered proteins (IDPs) that are involved in cellular signaling and regulation. Consideration of the physical time scales of protein folding and diffusion-limited protein-protein encounter has suggested that the frequent requirement of protein folding for specific IDP recognition could lead to kinetic bottlenecks. How IDPs overcome such potential kinetic bottlenecks to viably function in signaling and regulation in general is poorly understood. Our recent computational and experimental study of cell-cycle regulator p27 (Ganguly et al., J. Mol. Biol. (2012)) demonstrated that long-range electrostatic forces exerted on enriched charges of IDPs could accelerate protein-protein encounter via “electrostatic steering” and at the same time promote “folding-competent” encounter topologies to enhance the efficiency of IDP folding upon encounter. Here, we further investigated the coupled binding and folding mechanisms and the roles of electrostatic forces in the formation of three IDP complexes with more complex folded topologies. The surface electrostatic potentials of these complexes lack prominent features like those observed for the p27/Cdk2/cyclin A complex to directly suggest the ability of electrostatic forces to facilitate folding upon encounter. Nonetheless, similar electrostatically accelerated encounter and folding mechanisms were consistently predicted for all three complexes using topology-based coarse-grained simulations. Together with our previous analysis of charge distributions in known IDP complexes, our results support a prevalent role of electrostatic interactions in promoting efficient coupled binding and folding for facile specific recognition. These results also suggest that there is likely a co-evolution of IDP folded topology, charge characteristics, and coupled binding and folding mechanisms, driven at least partially by the need to achieve fast association kinetics for cellular signaling and regulation. PMID:24278008
Structure optimisation by thermal cycling for the hydrophobic-polar lattice model of protein folding

NASA Astrophysics Data System (ADS)

Günther, Florian; Möbius, Arnulf; Schreiber, Michael

2017-03-01

The function of a protein depends strongly on its spatial structure. Therefore the transition from an unfolded stage to the functional fold is one of the most important problems in computational molecular biology. Since the corresponding free energy landscapes exhibit huge numbers of local minima, the search for the lowest-energy configurations is very demanding. Because of that, efficient heuristic algorithms are of high value. In the present work, we investigate whether and how the thermal cycling (TC) approach can be applied to the hydrophobic-polar (HP) lattice model of protein folding. Evaluating the efficiency of TC for a set of two- and three-dimensional examples, we compare the performance of this strategy with that of multi-start local search (MSLS) procedures and that of simulated annealing (SA). For this aim, we incorporated several simple but rather efficient modifications into the standard procedures: in particular, a strong improvement was achieved by also allowing energy conserving state modifications. Furthermore, the consideration of ensembles instead of single samples was found to greatly improve the efficiency of TC. In the framework of different benchmarks, for all considered HP sequences, we found TC to be far superior to SA, and to be faster than Wang-Landau sampling.
Hydrogen bonds are a primary driving force for de novo protein folding

DOE PAGES

Lee, Schuyler; Wang, Chao; Liu, Haolin; ...

2017-11-10

The protein-folding mechanism remains a major puzzle in life science. Purified soluble activation-induced cytidine deaminase (AID) is one of the most difficult proteins to obtain. Starting from inclusion bodies containing a C-terminally truncated version of AID (residues 1–153; AID 153 ), an optimized in vitro folding procedure was derived to obtain large amounts of AID 153 , which led to crystals with good quality and to final structural determination. Interestingly, it was found that the final refolding yield of the protein is proline residue-dependent. The difference in the distribution of cis and trans configurations of proline residues in the proteinmore » after complete denaturation is a major determining factor of the final yield. A point mutation of one of four proline residues to an asparagine led to a near-doubling of the yield of refolded protein after complete denaturation. It was concluded that the driving force behind protein folding could not overcome the cis -to- trans proline isomerization, or vice versa , during the protein-folding process. Furthermore, it was found that successful refolding of proteins optimally occurs at high pH values, which may mimic protein folding in vivo . It was found that high pH values could induce the polarization of peptide bonds, which may trigger the formation of protein secondary structures through hydrogen bonds. It is proposed that a hydrophobic environment coupled with negative charges is essential for protein folding. Combined with our earlier discoveries on protein-unfolding mechanisms, it is proposed that hydrogen bonds are a primary driving force for de novo protein folding.« less
Hydrogen bonds are a primary driving force for de novo protein folding

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lee, Schuyler; Wang, Chao; Liu, Haolin

The protein-folding mechanism remains a major puzzle in life science. Purified soluble activation-induced cytidine deaminase (AID) is one of the most difficult proteins to obtain. Starting from inclusion bodies containing a C-terminally truncated version of AID (residues 1–153; AID 153 ), an optimized in vitro folding procedure was derived to obtain large amounts of AID 153 , which led to crystals with good quality and to final structural determination. Interestingly, it was found that the final refolding yield of the protein is proline residue-dependent. The difference in the distribution of cis and trans configurations of proline residues in the proteinmore » after complete denaturation is a major determining factor of the final yield. A point mutation of one of four proline residues to an asparagine led to a near-doubling of the yield of refolded protein after complete denaturation. It was concluded that the driving force behind protein folding could not overcome the cis -to- trans proline isomerization, or vice versa , during the protein-folding process. Furthermore, it was found that successful refolding of proteins optimally occurs at high pH values, which may mimic protein folding in vivo . It was found that high pH values could induce the polarization of peptide bonds, which may trigger the formation of protein secondary structures through hydrogen bonds. It is proposed that a hydrophobic environment coupled with negative charges is essential for protein folding. Combined with our earlier discoveries on protein-unfolding mechanisms, it is proposed that hydrogen bonds are a primary driving force for de novo protein folding.« less
The Movable Type Method Applied to Protein-Ligand Binding.

PubMed

Zheng, Zheng; Ucisik, Melek N; Merz, Kenneth M

2013-12-10

Accurately computing the free energy for biological processes like protein folding or protein-ligand association remains a challenging problem. Both describing the complex intermolecular forces involved and sampling the requisite configuration space make understanding these processes innately difficult. Herein, we address the sampling problem using a novel methodology we term "movable type". Conceptually it can be understood by analogy with the evolution of printing and, hence, the name movable type. For example, a common approach to the study of protein-ligand complexation involves taking a database of intact drug-like molecules and exhaustively docking them into a binding pocket. This is reminiscent of early woodblock printing where each page had to be laboriously created prior to printing a book. However, printing evolved to an approach where a database of symbols (letters, numerals, etc.) was created and then assembled using a movable type system, which allowed for the creation of all possible combinations of symbols on a given page, thereby, revolutionizing the dissemination of knowledge. Our movable type (MT) method involves the identification of all atom pairs seen in protein-ligand complexes and then creating two databases: one with their associated pairwise distant dependent energies and another associated with the probability of how these pairs can combine in terms of bonds, angles, dihedrals and non-bonded interactions. Combining these two databases coupled with the principles of statistical mechanics allows us to accurately estimate binding free energies as well as the pose of a ligand in a receptor. This method, by its mathematical construction, samples all of configuration space of a selected region (the protein active site here) in one shot without resorting to brute force sampling schemes involving Monte Carlo, genetic algorithms or molecular dynamics simulations making the methodology extremely efficient. Importantly, this method explores the free energy surface eliminating the need to estimate the enthalpy and entropy components individually. Finally, low free energy structures can be obtained via a free energy minimization procedure yielding all low free energy poses on a given free energy surface. Besides revolutionizing the protein-ligand docking and scoring problem this approach can be utilized in a wide range of applications in computational biology which involve the computation of free energies for systems with extensive phase spaces including protein folding, protein-protein docking and protein design.

The Folding of de Novo Designed Protein DS119 via Molecular Dynamics Simulations.

PubMed

Wang, Moye; Hu, Jie; Zhang, Zhuqing

2016-04-26

As they are not subjected to natural selection process, de novo designed proteins usually fold in a manner different from natural proteins. Recently, a de novo designed mini-protein DS119, with a βαβ motif and 36 amino acids, has folded unusually slowly in experiments, and transient dimers have been detected in the folding process. Here, by means of all-atom replica exchange molecular dynamics (REMD) simulations, several comparably stable intermediate states were observed on the folding free-energy landscape of DS119. Conventional molecular dynamics (CMD) simulations showed that when two unfolded DS119 proteins bound together, most binding sites of dimeric aggregates were located at the N-terminal segment, especially residues 5-10, which were supposed to form β-sheet with its own C-terminal segment. Furthermore, a large percentage of individual proteins in the dimeric aggregates adopted conformations similar to those in the intermediate states observed in REMD simulations. These results indicate that, during the folding process, DS119 can easily become trapped in intermediate states. Then, with diffusion, a transient dimer would be formed and stabilized with the binding interface located at N-terminals. This means that it could not quickly fold to the native structure. The complicated folding manner of DS119 implies the important influence of natural selection on protein-folding kinetics, and more improvement should be achieved in rational protein design.
The Folding of de Novo Designed Protein DS119 via Molecular Dynamics Simulations

PubMed Central

Wang, Moye; Hu, Jie; Zhang, Zhuqing

2016-01-01

As they are not subjected to natural selection process, de novo designed proteins usually fold in a manner different from natural proteins. Recently, a de novo designed mini-protein DS119, with a βαβ motif and 36 amino acids, has folded unusually slowly in experiments, and transient dimers have been detected in the folding process. Here, by means of all-atom replica exchange molecular dynamics (REMD) simulations, several comparably stable intermediate states were observed on the folding free-energy landscape of DS119. Conventional molecular dynamics (CMD) simulations showed that when two unfolded DS119 proteins bound together, most binding sites of dimeric aggregates were located at the N-terminal segment, especially residues 5–10, which were supposed to form β-sheet with its own C-terminal segment. Furthermore, a large percentage of individual proteins in the dimeric aggregates adopted conformations similar to those in the intermediate states observed in REMD simulations. These results indicate that, during the folding process, DS119 can easily become trapped in intermediate states. Then, with diffusion, a transient dimer would be formed and stabilized with the binding interface located at N-terminals. This means that it could not quickly fold to the native structure. The complicated folding manner of DS119 implies the important influence of natural selection on protein-folding kinetics, and more improvement should be achieved in rational protein design. PMID:27128902
Light-activated control of protein channel assembly mediated by membrane mechanics

NASA Astrophysics Data System (ADS)

Miller, David M.; Findlay, Heather E.; Ces, Oscar; Templer, Richard H.; Booth, Paula J.

2016-12-01

Photochemical processes provide versatile triggers of chemical reactions. Here, we use a photoactivated lipid switch to modulate the folding and assembly of a protein channel within a model biological membrane. In contrast to the information rich field of water-soluble protein folding, there is only a limited understanding of the assembly of proteins that are integral to biological membranes. It is however possible to exploit the foreboding hydrophobic lipid environment and control membrane protein folding via lipid bilayer mechanics. Mechanical properties such as lipid chain lateral pressure influence the insertion and folding of proteins in membranes, with different stages of folding having contrasting sensitivities to the bilayer properties. Studies to date have relied on altering bilayer properties through lipid compositional changes made at equilibrium, and thus can only be made before or after folding. We show that light-activation of photoisomerisable di-(5-[[4-(4-butylphenyl)azo]phenoxy]pentyl)phosphate (4-Azo-5P) lipids influences the folding and assembly of the pentameric bacterial mechanosensitive channel MscL. The use of a photochemical reaction enables the bilayer properties to be altered during folding, which is unprecedented. This mechanical manipulation during folding, allows for optimisation of different stages of the component insertion, folding and assembly steps within the same lipid system. The photochemical approach offers the potential to control channel assembly when generating synthetic devices that exploit the mechanosensitive protein as a nanovalve.
Revealing the global map of protein folding space by large-scale simulations

NASA Astrophysics Data System (ADS)

Sinner, Claude; Lutz, Benjamin; Verma, Abhinav; Schug, Alexander

2015-12-01

The full characterization of protein folding is a remarkable long-standing challenge both for experiment and simulation. Working towards a complete understanding of this process, one needs to cover the full diversity of existing folds and identify the general principles driving the process. Here, we want to understand and quantify the diversity in folding routes for a large and representative set of protein topologies covering the full range from all alpha helical topologies towards beta barrels guided by the key question: Does the majority of the observed routes contribute to the folding process or only a particular route? We identified a set of two-state folders among non-homologous proteins with a sequence length of 40-120 residues. For each of these proteins, we ran native-structure based simulations both with homogeneous and heterogeneous contact potentials. For each protein, we simulated dozens of folding transitions in continuous uninterrupted simulations and constructed a large database of kinetic parameters. We investigate folding routes by tracking the formation of tertiary structure interfaces and discuss whether a single specific route exists for a topology or if all routes are equiprobable. These results permit us to characterize the complete folding space for small proteins in terms of folding barrier ΔG‡, number of routes, and the route specificity RT.
Unfolding the chaperone story

PubMed Central

Hartl, F. Ulrich

2017-01-01

Protein folding in the cell was originally assumed to be a spontaneous process, based on Anfinsen’s discovery that purified proteins can fold on their own after removal from denaturant. Consequently cell biologists showed little interest in the protein folding process. This changed only in the mid and late 1980s, when the chaperone story began to unfold. As a result, we now know that in vivo, protein folding requires assistance by a complex machinery of molecular chaperones. To ensure efficient folding, members of different chaperone classes receive the nascent protein chain emerging from the ribosome and guide it along an ordered pathway toward the native state. I was fortunate to contribute to these developments early on. In this short essay, I will describe some of the critical steps leading to the current concept of protein folding as a highly organized cellular process. PMID:29084909
Conformational dynamics of a protein in the folded and the unfolded state

NASA Astrophysics Data System (ADS)

Fitter, Jörg

2003-08-01

In a quasielastic neutron scattering experiment, the picosecond dynamics of α-amylase was investigated for the folded and the unfolded state of the protein. In order to ensure a reasonable interpretation of the internal protein dynamics, the protein was measured in D 2O-buffer solution. The much higher structural flexibility of the pH induced unfolded state as compared to the native folded state was quantified using a simple analytical model, describing a local diffusion inside a sphere. In terms of this model the conformational volume, which is explored mainly by confined protein side-chain movements, is parameterized by the radius of a sphere (folded state, r=1.2 Å; unfolded state, 1.8 Å). Differences in conformational dynamics between the folded and the unfolded state of a protein are of fundamental interest in the field of protein science, because they are assumed to play an important role for the thermodynamics of folding/unfolding transition and for protein stability.
High Pressure ZZ-Exchange NMR Reveals Key Features of Protein Folding Transition States.

PubMed

Zhang, Yi; Kitazawa, Soichiro; Peran, Ivan; Stenzoski, Natalie; McCallum, Scott A; Raleigh, Daniel P; Royer, Catherine A

2016-11-23

Understanding protein folding mechanisms and their sequence dependence requires the determination of residue-specific apparent kinetic rate constants for the folding and unfolding reactions. Conventional two-dimensional NMR, such as HSQC experiments, can provide residue-specific information for proteins. However, folding is generally too fast for such experiments. ZZ-exchange NMR spectroscopy allows determination of folding and unfolding rates on much faster time scales, yet even this regime is not fast enough for many protein folding reactions. The application of high hydrostatic pressure slows folding by orders of magnitude due to positive activation volumes for the folding reaction. We combined high pressure perturbation with ZZ-exchange spectroscopy on two autonomously folding protein domains derived from the ribosomal protein, L9. We obtained residue-specific apparent rates at 2500 bar for the N-terminal domain of L9 (NTL9), and rates at atmospheric pressure for a mutant of the C-terminal domain (CTL9) from pressure dependent ZZ-exchange measurements. Our results revealed that NTL9 folding is almost perfectly two-state, while small deviations from two-state behavior were observed for CTL9. Both domains exhibited large positive activation volumes for folding. The volumetric properties of these domains reveal that their transition states contain most of the internal solvent excluded voids that are found in the hydrophobic cores of the respective native states. These results demonstrate that by coupling it with high pressure, ZZ-exchange can be extended to investigate a large number of protein conformational transitions.
Study of protein folding under native conditions by rapidly switching the hydrostatic pressure inside an NMR sample cell

PubMed Central

Charlier, Cyril; Alderson, T. Reid; Courtney, Joseph M.; Ying, Jinfa; Anfinrud, Philip

2018-01-01

In general, small proteins rapidly fold on the timescale of milliseconds or less. For proteins with a substantial volume difference between the folded and unfolded states, their thermodynamic equilibrium can be altered by varying the hydrostatic pressure. Using a pressure-sensitized mutant of ubiquitin, we demonstrate that rapidly switching the pressure within an NMR sample cell enables study of the unfolded protein under native conditions and, vice versa, study of the native protein under denaturing conditions. This approach makes it possible to record 2D and 3D NMR spectra of the unfolded protein at atmospheric pressure, providing residue-specific information on the folding process. 15N and 13C chemical shifts measured immediately after dropping the pressure from 2.5 kbar (favoring unfolding) to 1 bar (native) are close to the random-coil chemical shifts observed for a large, disordered peptide fragment of the protein. However, 15N relaxation data show evidence for rapid exchange, on a ∼100-μs timescale, between the unfolded state and unstable, structured states that can be considered as failed folding events. The NMR data also provide direct evidence for parallel folding pathways, with approximately one-half of the protein molecules efficiently folding through an on-pathway kinetic intermediate, whereas the other half fold in a single step. At protein concentrations above ∼300 μM, oligomeric off-pathway intermediates compete with folding of the native state. PMID:29666248
GroEL actively stimulates folding of the endogenous substrate protein PepQ.

PubMed

Weaver, Jeremy; Jiang, Mengqiu; Roth, Andrew; Puchalla, Jason; Zhang, Junjie; Rye, Hays S

2017-06-30

Many essential proteins cannot fold without help from chaperonins, like the GroELS system of Escherichia coli. How chaperonins accelerate protein folding remains controversial. Here we test key predictions of both passive and active models of GroELS-stimulated folding, using the endogenous E. coli metalloprotease PepQ. While GroELS increases the folding rate of PepQ by over 15-fold, we demonstrate that slow spontaneous folding of PepQ is not caused by aggregation. Fluorescence measurements suggest that, when folding inside the GroEL-GroES cavity, PepQ populates conformations not observed during spontaneous folding in free solution. Using cryo-electron microscopy, we show that the GroEL C-termini make physical contact with the PepQ folding intermediate and help retain it deep within the GroEL cavity, resulting in reduced compactness of the PepQ monomer. Our findings strongly support an active model of chaperonin-mediated protein folding, where partial unfolding of misfolded intermediates plays a key role.
Complete fold annotation of the human proteome using a novel structural feature space

DOE PAGES

Middleton, Sarah A.; Illuminati, Joseph; Kim, Junhyong

2017-04-13

Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this methodmore » by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Finally, our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families.« less
Folding and Function of a T4 Lysozyme Containing 10 Consecutive Alanines Illustrate the Redundancy of Information in an Amino Acid Sequence

NASA Astrophysics Data System (ADS)

Heinz, Dirk W.; Baase, Walt A.; Matthews, Brian W.

1992-05-01

Single and multiple Xaa -> Ala substitutions were constructed in the α-helix comprising residues 39-50 in bacteriophage T4 lysozyme. The variant with alanines at 10 consecutive positions (A40-49) folds normally and has activity essentially the same as wild type, although it is less stable. The crystal structure of this polyalanine mutant displays no significant change in the main-chain atoms of the helix when compared with the wild-type structure. The individual substitutions of the solvent-exposed residues Asn-40, Ser-44, and Glu-45 with alanine tend to increase the thermostability of the protein, whereas replacements of the buried or partially buried residues Lys-43 and Leu-46 are destabilizing. The melting temperature of the lysozyme in which Lys-43 and Leu-46 are retained and positions 40, 44, 45, 47, and 48 are substituted with alanine (i.e., A40-42/44-45/47-49) is increased by 3.1^circC relative to wild type at pH 3.0, but reduced by 1.6^circC at pH 6.7. In the case of the charged amino acids Glu-45 and Lys-48, the changes in melting temperature indicate that the putative salt bridge between these two residues contributes essentially nothing to the stability of the protein. The results clearly demonstrate that there is considerable redundancy in the sequence information in the polypeptide chain; not every amino acid is essential for folding. Also, further evidence is provided that the replacement of fully solvent-exposed residues within α-helices with alanines may be a general way to increase protein stability. The general approach may permit a simplification of the protein folding problem by retaining only amino acids proven to be essential for folding and replacing the remainder with alanine.
Molecular dynamics studies of protein folding and aggregation

NASA Astrophysics Data System (ADS)

Ding, Feng

This thesis applies molecular dynamics simulations and statistical mechanics to study: (i) protein folding; and (ii) protein aggregation. Most small proteins fold into their native states via a first-order-like phase transition with a major free energy barrier between the folded and unfolded states. A set of protein conformations corresponding to the free energy barrier, Delta G >> kBT, are the folding transition state ensemble (TSE). Due to their evasive nature, TSE conformations are hard to capture (probability ∝ exp(-DeltaG/k BT)) and characterize. A coarse-grained discrete molecular dynamics model with realistic steric constraints is constructed to reproduce the experimentally observed two-state folding thermodynamics. A kinetic approach is proposed to identify the folding TSE. A specific set of contacts, common to the TSE conformations, is identified as the folding nuclei which are necessary to be formed in order for the protein to fold. Interestingly, the amino acids at the site of the identified folding nuclei are highly conserved for homologous proteins sharing the same structures. Such conservation suggests that amino acids that are important for folding kinetics are under selective pressure to be preserved during the course of molecular evolution. In addition, studies of the conformations close to the transition states uncover the importance of topology in the construction of order parameter for protein folding transition. Misfolded proteins often form insoluble aggregates, amyloid fibrils, that deposit in the extracellular space and lead to a type of disease known as amyloidosis. Due to its insoluble and non-crystalline nature, the aggregation structure and, thus the aggregation mechanism, has yet to be uncovered. Discrete molecular dynamics studies reveal an aggregate structure with the same structural signatures as in experimental observations and show a nucleation aggregation scenario. The simulations also suggest a generic aggregation mechanism that globular proteins under a denaturing environment partially unfold and aggregate by forming stabilizing hydrogen bonds between the backbones of the partial folded substructures. Proteins or peptides rich in alpha-helices also aggregate into beta-rich amyloid fibrils. Upon aggregation, the protein or peptide undergoes a conformational transition from alpha-helices to beta-sheets. The transition of alpha-helix to beta-hairpin (two-stranded beta-sheet) is studied in an all-heavy-atom discrete molecular dynamics model of a polyalanine chain. An entropical driving scenario for the alpha-helix to beta-hairpin transition is discovered.
Generation of a consensus protein domain dictionary

PubMed Central

Schaeffer, R. Dustin; Jonsson, Amanda L.; Simms, Andrew M.; Daggett, Valerie

2011-01-01

Motivation: The discovery of new protein folds is a relatively rare occurrence even as the rate of protein structure determination increases. This rarity reinforces the concept of folds as reusable units of structure and function shared by diverse proteins. If the folding mechanism of proteins is largely determined by their topology, then the folding pathways of members of existing folds could encompass the full set used by globular protein domains. Results: We have used recent versions of three common protein domain dictionaries (SCOP, CATH and Dali) to generate a consensus domain dictionary (CDD). Surprisingly, 40% of the metafolds in the CDD are not composed of autonomous structural domains, i.e. they are not plausible independent folding units. This finding has serious ramifications for bioinformatics studies mining these domain dictionaries for globular protein properties. However, our main purpose in deriving this CDD was to generate an updated CDD to choose targets for MD simulation as part of our dynameomics effort, which aims to simulate the native and unfolding pathways of representatives of all globular protein consensus folds (metafolds). Consequently, we also compiled a list of representative protein targets of each metafold in the CDD. Availability and implementation: This domain dictionary is available at www.dynameomics.org. Contact: daggett@u.washington.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:21068000
Structural Characteristic of the Initial Unfolded State on Refolding Determines Catalytic Efficiency of the Folded Protein in Presence of Osmolytes

PubMed Central

Warepam, Marina; Sharma, Gurumayum Suraj; Dar, Tanveer Ali; Khan, Md. Khurshid Alam; Singh, Laishram Rajendrakumar

2014-01-01

Osmolytes are low molecular weight organic molecules accumulated by organisms to assist proper protein folding, and to provide protection to the structural integrity of proteins under denaturing stress conditions. It is known that osmolyte-induced protein folding is brought by unfavorable interaction of osmolytes with the denatured/unfolded states. The interaction of osmolyte with the native state does not significantly contribute to the osmolyte-induced protein folding. We have therefore investigated if different denatured states of a protein (generated by different denaturing agents) interact differently with the osmolytes to induce protein folding. We observed that osmolyte-assisted refolding of protein obtained from heat-induced denatured state produces native molecules with higher enzyme activity than those initiated from GdmCl- or urea-induced denatured state indicating that the structural property of the initial denatured state during refolding by osmolytes determines the catalytic efficiency of the folded protein molecule. These conclusions have been reached from the systematic measurements of enzymatic kinetic parameters (K m and k cat), thermodynamic stability (T m and ΔH m) and secondary and tertiary structures of the folded native proteins obtained from refolding of various denatured states (due to heat-, urea- and GdmCl-induced denaturation) of RNase-A in the presence of various osmolytes. PMID:25313668
Selection of stably folded proteins by phage-display with proteolysis.

PubMed

Bai, Yawen; Feng, Hanqiao

2004-05-01

To facilitate the process of protein design and learn the basic rules that control the structure and stability of proteins, combinatorial methods have been developed to select or screen proteins with desired properties from libraries of mutants. One such method uses phage-display and proteolysis to select stably folded proteins. This method does not rely on specific properties of proteins for selection. Therefore, in principle it can be applied to any protein. Since its first demonstration in 1998, the method has been used to create hyperthermophilic proteins, to evolve novel folded domains from a library generated by combinatorial shuffling of polypeptide segments and to convert a partially unfolded structure to a fully folded protein.
A semi-analytical description of protein folding that incorporates detailed geometrical information

PubMed Central

Suzuki, Yoko; Noel, Jeffrey K.; Onuchic, José N.

2011-01-01

Much has been done to study the interplay between geometric and energetic effects on the protein folding energy landscape. Numerical techniques such as molecular dynamics simulations are able to maintain a precise geometrical representation of the protein. Analytical approaches, however, often focus on the energetic aspects of folding, including geometrical information only in an average way. Here, we investigate a semi-analytical expression of folding that explicitly includes geometrical effects. We consider a Hamiltonian corresponding to a Gaussian filament with structure-based interactions. The model captures local features of protein folding often averaged over by mean-field theories, for example, loop contact formation and excluded volume. We explore the thermodynamics and folding mechanisms of beta-hairpin and alpha-helical structures as functions of temperature and Q, the fraction of native contacts formed. Excluded volume is shown to be an important component of a protein Hamiltonian, since it both dominates the cooperativity of the folding transition and alters folding mechanisms. Understanding geometrical effects in analytical formulae will help illuminate the consequences of the approximations required for the study of larger proteins. PMID:21721664
Protein Folding and Self-Organized Criticality

NASA Astrophysics Data System (ADS)

Bajracharya, Arun; Murray, Joelle

Proteins are known to fold into tertiary structures that determine their functionality in living organisms. However, the complex dynamics of protein folding and the way they consistently fold into the same structures is not fully understood. Self-organized criticality (SOC) has provided a framework for understanding complex systems in various systems (earthquakes, forest fires, financial markets, and epidemics) through scale invariance and the associated power law behavior. In this research, we use a simple hydrophobic-polar lattice-bound computational model to investigate self-organized criticality as a possible mechanism for generating complexity in protein folding.
Atomic interaction networks in the core of protein domains and their native folds.

PubMed

Soundararajan, Venkataramanan; Raman, Rahul; Raguram, S; Sasisekharan, V; Sasisekharan, Ram

2010-02-23

Vastly divergent sequences populate a majority of protein folds. In the quest to identify features that are conserved within protein domains belonging to the same fold, we set out to examine the entire protein universe on a fold-by-fold basis. We report that the atomic interaction network in the solvent-unexposed core of protein domains are fold-conserved, extraordinary sequence divergence notwithstanding. Further, we find that this feature, termed protein core atomic interaction network (or PCAIN) is significantly distinguishable across different folds, thus appearing to be "signature" of a domain's native fold. As part of this study, we computed the PCAINs for 8698 representative protein domains from families across the 1018 known protein folds to construct our seed database and an automated framework was developed for PCAIN-based characterization of the protein fold universe. A test set of randomly selected domains that are not in the seed database was classified with over 97% accuracy, independent of sequence divergence. As an application of this novel fold signature, a PCAIN-based scoring scheme was developed for comparative (homology-based) structure prediction, with 1-2 angstroms (mean 1.61A) C(alpha) RMSD generally observed between computed structures and reference crystal structures. Our results are consistent across the full spectrum of test domains including those from recent CASP experiments and most notably in the 'twilight' and 'midnight' zones wherein <30% and <10% target-template sequence identity prevails (mean twilight RMSD of 1.69A). We further demonstrate the utility of the PCAIN protocol to derive biological insight into protein structure-function relationships, by modeling the structure of the YopM effector novel E3 ligase (NEL) domain from plague-causative bacterium Yersinia Pestis and discussing its implications for host adaptive and innate immune modulation by the pathogen. Considering the several high-throughput, sequence-identity-independent applications demonstrated in this work, we suggest that the PCAIN is a fundamental fold feature that could be a valuable addition to the arsenal of protein modeling and analysis tools.
Atomic Interaction Networks in the Core of Protein Domains and Their Native Folds

PubMed Central

Soundararajan, Venkataramanan; Raman, Rahul; Raguram, S.; Sasisekharan, V.; Sasisekharan, Ram

2010-01-01

Vastly divergent sequences populate a majority of protein folds. In the quest to identify features that are conserved within protein domains belonging to the same fold, we set out to examine the entire protein universe on a fold-by-fold basis. We report that the atomic interaction network in the solvent-unexposed core of protein domains are fold-conserved, extraordinary sequence divergence notwithstanding. Further, we find that this feature, termed protein core atomic interaction network (or PCAIN) is significantly distinguishable across different folds, thus appearing to be “signature” of a domain's native fold. As part of this study, we computed the PCAINs for 8698 representative protein domains from families across the 1018 known protein folds to construct our seed database and an automated framework was developed for PCAIN-based characterization of the protein fold universe. A test set of randomly selected domains that are not in the seed database was classified with over 97% accuracy, independent of sequence divergence. As an application of this novel fold signature, a PCAIN-based scoring scheme was developed for comparative (homology-based) structure prediction, with 1–2 angstroms (mean 1.61A) Cα RMSD generally observed between computed structures and reference crystal structures. Our results are consistent across the full spectrum of test domains including those from recent CASP experiments and most notably in the ‘twilight’ and ‘midnight’ zones wherein <30% and <10% target-template sequence identity prevails (mean twilight RMSD of 1.69A). We further demonstrate the utility of the PCAIN protocol to derive biological insight into protein structure-function relationships, by modeling the structure of the YopM effector novel E3 ligase (NEL) domain from plague-causative bacterium Yersinia Pestis and discussing its implications for host adaptive and innate immune modulation by the pathogen. Considering the several high-throughput, sequence-identity-independent applications demonstrated in this work, we suggest that the PCAIN is a fundamental fold feature that could be a valuable addition to the arsenal of protein modeling and analysis tools. PMID:20186337
Membrane protein serendipity

PubMed Central

von Heijne, Gunnar

2018-01-01

My scientific career has taken me from chemistry, via theoretical physics and bioinformatics, to molecular biology and even structural biology. Along the way, serendipity led me to work on problems such as the identification of signal peptides that direct protein trafficking, membrane protein biogenesis, and cotranslational protein folding. I've had some great collaborations that came about because of a stray conversation or from following up on an interesting paper. And I've had the good fortune to be asked to sit on the Nobel Committee for Chemistry, where I am constantly reminded of the amazing pace and often intricate history of scientific discovery. Could I have planned this? No way! I just went with the flow … PMID:29523692

Mathematics, thermodynamics, and modeling to address ten common misconceptions about protein structure, folding, and stability.

PubMed

Robic, Srebrenka

2010-01-01

To fully understand the roles proteins play in cellular processes, students need to grasp complex ideas about protein structure, folding, and stability. Our current understanding of these topics is based on mathematical models and experimental data. However, protein structure, folding, and stability are often introduced as descriptive, qualitative phenomena in undergraduate classes. In the process of learning about these topics, students often form incorrect ideas. For example, by learning about protein folding in the context of protein synthesis, students may come to an incorrect conclusion that once synthesized on the ribosome, a protein spends its entire cellular life time in its fully folded native confirmation. This is clearly not true; proteins are dynamic structures that undergo both local fluctuations and global unfolding events. To prevent and address such misconceptions, basic concepts of protein science can be introduced in the context of simple mathematical models and hands-on explorations of publicly available data sets. Ten common misconceptions about proteins are presented, along with suggestions for using equations, models, sequence, structure, and thermodynamic data to help students gain a deeper understanding of basic concepts relating to protein structure, folding, and stability.
Molecular Dynamics Simulations on Gas-Phase Proteins with Mobile Protons: Inclusion of All-Atom Charge Solvation.

PubMed

Konermann, Lars

2017-08-31

Molecular dynamics (MD) simulations have become a key tool for examining the properties of electrosprayed protein ions. Traditional force fields employ static charges on titratable sites, whereas in reality, protons are highly mobile in gas-phase proteins. Earlier studies tackled this problem by adjusting charge patterns during MD runs. Within those algorithms, proton redistribution was subject to energy minimization, taking into account electrostatic and proton affinity contributions. However, those earlier approaches described (de)protonated moieties as point charges, neglecting charge solvation, which is highly prevalent in the gas phase. Here, we describe a mobile proton algorithm that considers the electrostatic contributions from all atoms, such that charge solvation is explicitly included. MD runs were broken down into 50 ps fixed-charge segments. After each segment, the electrostatics was reanalyzed and protons were redistributed. Challenges associated with computational cost were overcome by devising a streamlined method for electrostatic calculations. Avidin (a 504-residue protein complex) maintained a nativelike fold over 200 ns. Proton transfer and side chain rearrangements produced extensive salt bridge networks at the protein surface. The mobile proton technique introduced here should pave the way toward future studies on protein folding, unfolding, collapse, and subunit dissociation in the gas phase.
Conformational stability as a design target to control protein aggregation.

PubMed

Costanzo, Joseph A; O'Brien, Christopher J; Tiller, Kathryn; Tamargo, Erin; Robinson, Anne Skaja; Roberts, Christopher J; Fernandez, Erik J

2014-05-01

Non-native protein aggregation is a prevalent problem occurring in many biotechnological manufacturing processes and can compromise the biological activity of the target molecule or induce an undesired immune response. Additionally, some non-native aggregation mechanisms lead to amyloid fibril formation, which can be associated with debilitating diseases. For natively folded proteins, partial or complete unfolding is often required to populate aggregation-prone conformational states, and therefore one proposed strategy to mitigate aggregation is to increase the free energy for unfolding (ΔGunf) prior to aggregation. A computational design approach was tested using human γD crystallin (γD-crys) as a model multi-domain protein. Two mutational strategies were tested for their ability to reduce/increase aggregation rates by increasing/decreasing ΔGunf: stabilizing the less stable domain and stabilizing the domain-domain interface. The computational protein design algorithm, RosettaDesign, was implemented to identify point variants. The results showed that although the predicted free energies were only weakly correlated with the experimental ΔGunf values, increased/decreased aggregation rates for γD-crys correlated reasonably well with decreases/increases in experimental ΔGunf, illustrating improved conformational stability as a possible design target to mitigate aggregation. However, the results also illustrate that conformational stability is not the sole design factor controlling aggregation rates of natively folded proteins.
Analyzing the effect of homogeneous frustration in protein folding.

PubMed

Contessoto, Vinícius G; Lima, Debora T; Oliveira, Ronaldo J; Bruni, Aline T; Chahine, Jorge; Leite, Vitor B P

2013-10-01

The energy landscape theory has been an invaluable theoretical framework in the understanding of biological processes such as protein folding, oligomerization, and functional transitions. According to the theory, the energy landscape of protein folding is funneled toward the native state, a conformational state that is consistent with the principle of minimal frustration. It has been accepted that real proteins are selected through natural evolution, satisfying the minimum frustration criterion. However, there is evidence that a low degree of frustration accelerates folding. We examined the interplay between topological and energetic protein frustration. We employed a Cα structure-based model for simulations with a controlled nonspecific energetic frustration added to the potential energy function. Thermodynamics and kinetics of a group of 19 proteins are completely characterized as a function of increasing level of energetic frustration. We observed two well-separated groups of proteins: one group where a little frustration enhances folding rates to an optimal value and another where any energetic frustration slows down folding. Protein energetic frustration regimes and their mechanisms are explained by the role of non-native contact interactions in different folding scenarios. These findings strongly correlate with the protein free-energy folding barrier and the absolute contact order parameters. These computational results are corroborated by principal component analysis and partial least square techniques. One simple theoretical model is proposed as a useful tool for experimentalists to predict the limits of improvements in real proteins. Copyright © 2013 Wiley Periodicals, Inc.
Production of membrane proteins without cells or detergents.

PubMed

Rajesh, Sundaresan; Knowles, Timothy; Overduin, Michael

2011-04-30

The production of membrane proteins in cellular systems is besieged by several problems due to their hydrophobic nature which often causes misfolding, protein aggregation and cytotoxicity, resulting in poor yields of stable proteins. Cell-free expression has emerged as one of the most versatile alternatives for circumventing these obstacles by producing membrane proteins directly into designed hydrophobic environments. Efficient optimisation of expression and solubilisation conditions using a variety of detergents, membrane mimetics and lipids has yielded structurally and functionally intact membrane proteins, with yields several fold above the levels possible from cell-based systems. Here we review recently developed techniques available to produce functional membrane proteins, and discuss amphipols, nanodisc and styrene maleic acid lipid particle (SMALP) technologies that can be exploited alongside cell-free expression of membrane proteins. Copyright © 2010 Elsevier B.V. All rights reserved.
Cooperativity and modularity in protein folding

PubMed Central

Sasai, Masaki; Chikenji, George; Terada, Tomoki P.

2016-01-01

A simple statistical mechanical model proposed by Wako and Saitô has explained the aspects of protein folding surprisingly well. This model was systematically applied to multiple proteins by Muñoz and Eaton and has since been referred to as the Wako-Saitô-Muñoz-Eaton (WSME) model. The success of the WSME model in explaining the folding of many proteins has verified the hypothesis that the folding is dominated by native interactions, which makes the energy landscape globally biased toward native conformation. Using the WSME and other related models, Saitô emphasized the importance of the hierarchical pathway in protein folding; folding starts with the creation of contiguous segments having a native-like configuration and proceeds as growth and coalescence of these segments. The Φ-values calculated for barnase with the WSME model suggested that segments contributing to the folding nucleus are similar to the structural modules defined by the pattern of native atomic contacts. The WSME model was extended to explain folding of multi-domain proteins having a complex topology, which opened the way to comprehensively understanding the folding process of multi-domain proteins. The WSME model was also extended to describe allosteric transitions, indicating that the allosteric structural movement does not occur as a deterministic sequential change between two conformations but as a stochastic diffusive motion over the dynamically changing energy landscape. Statistical mechanical viewpoint on folding, as highlighted by the WSME model, has been renovated in the context of modern methods and ideas, and will continue to provide insights on equilibrium and dynamical features of proteins. PMID:28409080
Proteomics of Skin Proteins in Psoriasis: From Discovery and Verification in a Mouse Model to Confirmation in Humans*

PubMed Central

Lundberg, Kathleen C.; Fritz, Yi; Johnston, Andrew; Foster, Alexander M.; Baliwag, Jaymie; Gudjonsson, Johann E.; Schlatzer, Daniela; Gokulrangan, Giridharan; McCormick, Thomas S.; Chance, Mark R.; Ward, Nicole L.

2015-01-01

Herein, we demonstrate the efficacy of an unbiased proteomics screening approach for studying protein expression changes in the KC-Tie2 psoriasis mouse model, identifying multiple protein expression changes in the mouse and validating these changes in human psoriasis. KC-Tie2 mouse skin samples (n = 3) were compared with littermate controls (n = 3) using gel-based fractionation followed by label-free protein expression analysis. 5482 peptides mapping to 1281 proteins were identified and quantitated: 105 proteins exhibited fold-changes ≥2.0 including: stefin A1 (average fold change of 342.4 and an average p = 0.0082; cystatin A, human ortholog); slc25a5 (average fold change of 46.2 and an average p = 0.0318); serpinb3b (average fold change of 35.6 and an average p = 0.0345; serpinB1, human ortholog); and kallikrein related peptidase 6 (average fold change of 4.7 and an average p = 0.2474; KLK6). We independently confirmed mouse gene expression-based increases of selected genes including serpinb3b (17.4-fold, p < 0.0001), KLK6 (9-fold, p = 0.002), stefin A1 (7.3-fold; p < 0.001), and slc25A5 (1.5-fold; p = 0.05) using qRT-PCR on a second cohort of animals (n = 8). Parallel LC/MS/MS analyses on these same samples verified protein-level increases of 1.3-fold (slc25a5; p < 0.05), 29,000-fold (stefinA1; p < 0.01), 322-fold (KLK6; p < 0.0001) between KC-Tie2 and control mice. To underscore the utility and translatability of our combined approach, we analyzed gene and protein expression levels in psoriasis patient skin and primary keratinocytes versus healthy controls. Increases in gene expression for slc25a5 (1.8-fold), cystatin A (3-fold), KLK6 (5.8-fold), and serpinB1 (76-fold; all p < 0.05) were observed between healthy controls and involved lesional psoriasis skin and primary psoriasis keratinocytes. Moreover, slc25a5, cystatin A, KLK6, and serpinB1 protein were all increased in lesional psoriasis skin compared with normal skin. These results highlight the usefulness of preclinical disease models using readily-available mouse skin and demonstrate the utility of proteomic approaches for identifying novel peptides/proteins that are differentially regulated in psoriasis that could serve as sources of auto-antigens or provide novel therapeutic targets for the development of new anti-psoriatic treatments. PMID:25351201
Frustration in Condensed Matter and Protein Folding

NASA Astrophysics Data System (ADS)

Li, Z.; Tanner, S.; Conroy, B.; Owens, F.; Tran, M. M.; Boekema, C.

2014-03-01

By means of computer modeling, we are studying frustration in condensed matter and protein folding, including the influence of temperature and Thomson-figure formation. Frustration is due to competing interactions in a disordered state. The key issue is how the particles interact to reach the lowest frustration. The relaxation for frustration is mostly a power function (randomly assigned pattern) or an exponential function (regular patterns like Thomson figures). For the atomic Thomson model, frustration is predicted to decrease with the formation of Thomson figures at zero kelvin. We attempt to apply our frustration modeling to protein folding and dynamics. We investigate the homogeneous protein frustration that would cause the speed of the protein folding to increase. Increase of protein frustration (where frustration and hydrophobicity interplay with protein folding) may lead to a protein mutation. Research is supported by WiSE@SJSU and AFC San Jose.
Single-molecule chemo-mechanical unfolding reveals multiple transition state barriers in a small single-domain protein

NASA Astrophysics Data System (ADS)

Guinn, Emily J.; Jagannathan, Bharat; Marqusee, Susan

2015-04-01

A fundamental question in protein folding is whether proteins fold through one or multiple trajectories. While most experiments indicate a single pathway, simulations suggest proteins can fold through many parallel pathways. Here, we use a combination of chemical denaturant, mechanical force and site-directed mutations to demonstrate the presence of multiple unfolding pathways in a simple, two-state folding protein. We show that these multiple pathways have structurally different transition states, and that seemingly small changes in protein sequence and environment can strongly modulate the flux between the pathways. These results suggest that in vivo, the crowded cellular environment could strongly influence the mechanisms of protein folding and unfolding. Our study resolves the apparent dichotomy between experimental and theoretical studies, and highlights the advantage of using a multipronged approach to reveal the complexities of a protein's free-energy landscape.
Role of Tryptophan Side Chain Dynamics on the Trp-Cage Mini-Protein Folding Studied by Molecular Dynamics Simulations

PubMed Central

Kannan, Srinivasaraghavan; Zacharias, Martin

2014-01-01

The 20 residue Trp-cage mini-protein is one of smallest proteins that adopt a stable folded structure containing also well-defined secondary structure elements. The hydrophobic core is arranged around a single central Trp residue. Despite several experimental and simulation studies the detailed folding mechanism of the Trp-cage protein is still not completely understood. Starting from fully extended as well as from partially folded Trp-cage structures a series of molecular dynamics simulations in explicit solvent and using four different force fields was performed. All simulations resulted in rapid collapse of the protein to on average relatively compact states. The simulations indicate a significant dependence of the speed of folding to near-native states on the side chain rotamer state of the central Trp residue. Whereas the majority of intermediate start structures with the central Trp side chain in a near-native rotameric state folded successfully within less than 100 ns only a fraction of start structures reached near-native folded states with an initially non-native Trp side chain rotamer state. Weak restraining of the Trp side chain dihedral angles to the state in the folded protein resulted in significant acceleration of the folding both starting from fully extended or intermediate conformations. The results indicate that the side chain conformation of the central Trp residue can create a significant barrier for controlling transitions to a near native folded structure. Similar mechanisms might be of importance for the folding of other protein structures. PMID:24563686
High-Resolution Mapping of a Repeat Protein Folding Free Energy Landscape.

PubMed

Fossat, Martin J; Dao, Thuy P; Jenkins, Kelly; Dellarole, Mariano; Yang, Yinshan; McCallum, Scott A; Garcia, Angel E; Barrick, Doug; Roumestand, Christian; Royer, Catherine A

2016-12-06

A complete description of the pathways and mechanisms of protein folding requires a detailed structural and energetic characterization of the conformational ensemble along the entire folding reaction coordinate. Simulations can provide this level of insight for small proteins. In contrast, with the exception of hydrogen exchange, which does not monitor folding directly, experimental studies of protein folding have not yielded such structural and energetic detail. NMR can provide residue specific atomic level structural information, but its implementation in protein folding studies using chemical or temperature perturbation is problematic. Here we present a highly detailed structural and energetic map of the entire folding landscape of the leucine-rich repeat protein, pp32 (Anp32), obtained by combining pressure-dependent site-specific 1 H- 15 N HSQC data with coarse-grained molecular dynamics simulations. The results obtained using this equilibrium approach demonstrate that the main barrier to folding of pp32 is quite broad and lies near the unfolded state, with structure apparent only in the C-terminal region. Significant deviation from two-state unfolding under pressure reveals an intermediate on the folded side of the main barrier in which the N-terminal region is disordered. A nonlinear temperature dependence of the population of this intermediate suggests a large heat capacity change associated with its formation. The combination of pressure, which favors the population of folding intermediates relative to chemical denaturants; NMR, which allows their observation; and constrained structure-based simulations yield unparalleled insight into protein folding mechanisms. Copyright Â© 2016 Biophysical Society. Published by Elsevier Inc. All rights reserved.
FROM FOLDING THEORIES TO FOLDING PROTEINS: A Review and Assessment of Simulation Studies of Protein Folding and Unfolding

NASA Astrophysics Data System (ADS)

Shea, Joan-Emma; Brooks, Charles L., III

2001-10-01

Beginning with simplified lattice and continuum "minimalist" models and progressing to detailed atomic models, simulation studies have augmented and directed development of the modern landscape perspective of protein folding. In this review we discuss aspects of detailed atomic simulation methods applied to studies of protein folding free energy surfaces, using biased-sampling free energy methods and temperature-induced protein unfolding. We review studies from each on systems of particular experimental interest and assess the strengths and weaknesses of each approach in the context of "exact" results for both free energies and kinetics of a minimalist model for a beta-barrel protein. We illustrate in detail how each approach is implemented and discuss analysis methods that have been developed as components of these studies. We describe key insights into the relationship between protein topology and the folding mechanism emerging from folding free energy surface calculations. We further describe the determination of detailed "pathways" and models of folding transition states that have resulted from unfolding studies. Our assessment of the two methods suggests that both can provide, often complementary, details of folding mechanism and thermodynamics, but this success relies on (a) adequate sampling of diverse conformational regions for the biased-sampling free energy approach and (b) many trajectories at multiple temperatures for unfolding studies. Furthermore, we find that temperature-induced unfolding provides representatives of folding trajectories only when the topology and sequence (energy) provide a relatively funneled landscape and "off-pathway" intermediates do not exist.
Atomic-level characterization of the structural dynamics of proteins.

PubMed

Shaw, David E; Maragakis, Paul; Lindorff-Larsen, Kresten; Piana, Stefano; Dror, Ron O; Eastwood, Michael P; Bank, Joseph A; Jumper, John M; Salmon, John K; Shan, Yibing; Wriggers, Willy

2010-10-15

Molecular dynamics (MD) simulations are widely used to study protein motions at an atomic level of detail, but they have been limited to time scales shorter than those of many biologically critical conformational changes. We examined two fundamental processes in protein dynamics--protein folding and conformational change within the folded state--by means of extremely long all-atom MD simulations conducted on a special-purpose machine. Equilibrium simulations of a WW protein domain captured multiple folding and unfolding events that consistently follow a well-defined folding pathway; separate simulations of the protein's constituent substructures shed light on possible determinants of this pathway. A 1-millisecond simulation of the folded protein BPTI reveals a small number of structurally distinct conformational states whose reversible interconversion is slower than local relaxations within those states by a factor of more than 1000.
Protein folding and misfolding: mechanism and principles

PubMed Central

Englander, S. Walter; Mayne, Leland; Krishna, Mallela M. G.

2012-01-01

Two fundamentally different views of how proteins fold are now being debated. Do proteins fold through multiple unpredictable routes directed only by the energetically downhill nature of the folding landscape or do they fold through specific intermediates in a defined pathway that systematically puts predetermined pieces of the target native protein into place? It has now become possible to determine the structure of protein folding intermediates, evaluate their equilibrium and kinetic parameters, and establish their pathway relationships. Results obtained for many proteins have serendipitously revealed a new dimension of protein structure. Cooperative structural units of the native protein, called foldons, unfold and refold repeatedly even under native conditions. Much evidence obtained by hydrogen exchange and other methods now indicates that cooperative foldon units and not individual amino acids account for the unit steps in protein folding pathways. The formation of foldons and their ordered pathway assembly systematically puts native-like foldon building blocks into place, guided by a sequential stabilization mechanism in which prior native-like structure templates the formation of incoming foldons with complementary structure. Thus the same propensities and interactions that specify the final native state, encoded in the amino-acid sequence of every protein, determine the pathway for getting there. Experimental observations that have been interpreted differently, in terms of multiple independent pathways, appear to be due to chance misfolding errors that cause different population fractions to block at different pathway points, populate different pathway intermediates, and fold at different rates. This paper summarizes the experimental basis for these three determining principles and their consequences. Cooperative native-like foldon units and the sequential stabilization process together generate predetermined stepwise pathways. Optional misfolding errors are responsible for 3-state and heterogeneous kinetic folding. PMID:18405419
Proteomic Alterations in Aqueous Humor From Patients With Primary Open Angle Glaucoma.

PubMed

Sharma, Shruti; Bollinger, Kathryn E; Kodeboyina, Sai Karthik; Zhi, Wenbo; Patton, Jordan; Bai, Shan; Edwards, Blake; Ulrich, Lane; Bogorad, David; Sharma, Ashok

2018-05-01

Primary open angle glaucoma (POAG) is the most prevalent form of glaucoma, accounting for approximately 90% of all cases. The aqueous humor (AH), a biological fluid in the anterior and posterior chambers of the eye, is involved in a multitude of functions including the maintenance of IOP and ocular homeostasis. This fluid is very close to the pathologic site and is also known to have a significant role in glaucoma pathogenesis. The purpose of this study was to identify proteomic alterations in AH from patients with POAG. AH samples were extracted from 47 patients undergoing cataract surgery (controls: n = 32; POAG: n = 15). Proteomic analysis of the digested samples was accomplished by liquid-chromatography-mass spectrometry. The identified proteins were evaluated using a variety of statistical and bioinformatics methods. A total of 33 proteins were significantly altered in POAG subjects compared with the controls. The most abundant proteins in POAG subjects are IGKC (13.56-fold), ITIH4 (4.1-fold), APOC3 (3.36-fold), IDH3A (3.11-fold), LOC105369216 (2.98-fold). SERPINF2 (2.94-fold), NPC2 (2.88-fold), SUCLG2 (2.70-fold), KIAA0100 (2.29-fold), CNOT4 (2.23-fold), AQP4 (2.11-fold), COL18A1 (2.08-fold), NWD1 (2.07-fold), and TMEM120B (2.06-fold). A significant increasing trend in the odds ratios of having POAG was observed with increased levels of these proteins. Proteins identified in this study are implicated in signaling, glycosylation, immune response, molecular transport, and lipid metabolism. The identified candidate proteins may be potential biomarkers associated with POAG development and may lead to more insight in understanding the mechanisms underlying the pathogenesis of this disease.
Studying the unfolding process of protein G and protein L under physical property space

PubMed Central

Zhao, Liling; Wang, Jihua; Dou, Xianghua; Cao, Zanxia

2009-01-01

Background The studies on protein folding/unfolding indicate that the native state topology is an important determinant of protein folding mechanism. The folding/unfolding behaviors of proteins which have similar topologies have been studied under Cartesian space and the results indicate that some proteins share the similar folding/unfolding characters. Results We construct physical property space with twelve different physical properties. By studying the unfolding process of the protein G and protein L under the property space, we find that the two proteins have the similar unfolding pathways that can be divided into three types and the one which with the umbrella-shape represents the preferred pathway. Moreover, the unfolding simulation time of the two proteins is different and protein L unfolding faster than protein G. Additionally, the distributing area of unfolded state ensemble of protein L is larger than that of protein G. Conclusion Under the physical property space, the protein G and protein L have the similar folding/unfolding behaviors, which agree with the previous results obtained from the studies under Cartesian coordinate space. At the same time, some different unfolding properties can be detected easily, which can not be analyzed under Cartesian coordinate space. PMID:19208146
Heterochiral Knottin Protein: Folding and Solution Structure.

PubMed

Mong, Surin K; Cochran, Frank V; Yu, Hongtao; Graziano, Zachary; Lin, Yu-Shan; Cochran, Jennifer R; Pentelute, Bradley L

2017-10-31

Homochirality is a general feature of biological macromolecules, and Nature includes few examples of heterochiral proteins. Herein, we report on the design, chemical synthesis, and structural characterization of heterochiral proteins possessing loops of amino acids of chirality opposite to that of the rest of a protein scaffold. Using the protein Ecballium elaterium trypsin inhibitor II, we discover that selective β-alanine substitution favors the efficient folding of our heterochiral constructs. Solution nuclear magnetic resonance spectroscopy of one such heterochiral protein reveals a homogeneous global fold. Additionally, steered molecular dynamics simulation indicate β-alanine reduces the free energy required to fold the protein. We also find these heterochiral proteins to be more resistant to proteolysis than homochiral l-proteins. This work informs the design of heterochiral protein architectures containing stretches of both d- and l-amino acids.
Processing of Cholinesterase-like α/β-Hydrolase Fold Proteins: Alterations Associated with Congenital Disorders

PubMed Central

De Jaco, Antonella; Comoletti, Davide; Dubi, Noga; Camp, Shelley; Taylor, Palmer

2016-01-01

The α/β hydrolase fold family is perhaps the largest group of proteins presenting significant structural homology with divergent functions, ranging from catalytic hydrolysis to heterophilic cell adhesive interactions to chaperones in hormone production. All the proteins of the family share a common three-dimensional core structure containing the α/β-hydrolase fold domain that is crucial for proper protein function. Several mutations associated with congenital diseases or disorders have been reported in conserved residues within the α/β-hydrolase fold domain of cholinesterase-like proteins, neuroligins, butyrylcholinesterase and thyroglobulin. These mutations are known to disrupt the architecture of the common structural domain either globally or locally. Characterization of the natural mutations affecting the α/β-hydrolase fold domain in these proteins has shown that they mainly impair processing and trafficking along the secretory pathway causing retention of the mutant protein in the endoplasmic reticulum. Studying the processing of α/β-hydrolase fold mutant proteins should uncover new functions for this domain, that in some cases require structural integrity for both export of the protein from the ER and for facilitating subunit dimerization. A comparative study of homologous mutations in proteins that are closely related family members, along with the definition of new three-dimensional crystal structures, will identify critical residues for the assembly of the α/β-hydrolase fold. PMID:21933121
Effect of interactions with the chaperonin cavity on protein folding and misfolding†

PubMed Central

Sirur, Anshul; Knott, Michael; Best, Robert B.

2015-01-01

Recent experimental and computational results have suggested that attractive interactions between a chaperonin and an enclosed substrate can have an important effect on the protein folding rate: it appears that folding may even be slower inside the cavity than under unconfined conditions, in contrast to what we would expect from excluded volume effects on the unfolded state. Here we examine systematically the dependence of the protein stability and folding rate on the strength of such attractive interactions between the chaperonin and substrate, by using molecular simulations of model protein systems in an idealised attractive cavity. Interestingly, we find a maximum in stability, and a rate which indeed slows down at high attraction strengths. We have developed a simple phenomenological model which can explain the variations in folding rate and stability due to differing effects on the free energies of the unfolded state, folded state, and transition state; changes in the diffusion coefficient along the folding coordinate are relatively small, at least for our simplified model. In order to investigate a possible role for these attractive interactions in folding, we have studied a recently developed model for misfolding in multidomain proteins. We find that, while encapsulation in repulsive cavities greatly increases the fraction of misfolded protein, sufficiently strong attractive protein-cavity interactions can strongly reduce the fraction of proteins reaching misfolded traps. PMID:24077053
The porous borders of the protein world.

PubMed

Cordes, Matthew H J; Stewart, Katie L

2012-02-08

Fold switching may play a role in the evolution of new protein folds and functions. He et al., in this issue of Structure, use protein design to illustrate that the same drastic change in a protein fold can occur via multiple different mutational pathways. Copyright © 2012 Elsevier Ltd. All rights reserved.

Mathematics, Thermodynamics, and Modeling to Address Ten Common Misconceptions about Protein Structure, Folding, and Stability

ERIC Educational Resources Information Center

Robic, Srebrenka

2010-01-01

To fully understand the roles proteins play in cellular processes, students need to grasp complex ideas about protein structure, folding, and stability. Our current understanding of these topics is based on mathematical models and experimental data. However, protein structure, folding, and stability are often introduced as descriptive, qualitative…
Mutational analysis of the folding transition state of the C-terminal domain of ribosomal protein L9: a protein with an unusual beta-sheet topology.

PubMed

Li, Ying; Gupta, Ruchi; Cho, Jae-Hyun; Raleigh, Daniel P

2007-01-30

The C-terminal domain of ribosomal protein L9 (CTL9) is a 92-residue alpha-beta protein which contains an unusual three-stranded mixed parallel and antiparallel beta-sheet. The protein folds in a two-state fashion, and the folding rate is slow. It is thought that the slow folding may be caused by the necessity of forming this unusual beta-sheet architecture in the transition state for folding. This hypothesis makes CTL9 an interesting target for folding studies. The transition state for the folding of CTL9 was characterized by phi-value analysis. The folding of a set of hydrophobic core mutants was analyzed together with a set of truncation mutants. The results revealed a few positions with high phi-values (> or = 0.5), notably, V131, L133, H134, V137, and L141. All of these residues were found in the beta-hairpin region, indicating that the formation of this structure is likely to be the rate-limiting step in the folding of CTL9. One face of the beta-hairpin docks against the N-terminal helix. Analysis of truncation mutants of this helix confirmed its importance in folding. Mutations at other sites in the protein gave small phi-values, despite the fact that some of them had major effects on stability. The analysis indicates that formation of the antiparallel hairpin is critical and its interactions with the first helix are also important. Thus, the slow folding is not a consequence of the need to fully form the unusual three-stranded beta-sheet in the transition state. Analysis of the urea dependence of the folding rates indicates that mutations modulate the unfolded state. The folding of CTL9 is broadly consistent with the nucleation-condensation model of protein folding.
Intermediates and the folding of proteins L and G

DOE Office of Scientific and Technical Information (OSTI.GOV)

Brown, Scott; Head-Gordon, Teresa

We use a minimalist protein model, in combination with a sequence design strategy, to determine differences in primary structure for proteins L and G that are responsible for the two proteins folding through distinctly different folding mechanisms. We find that the folding of proteins L and G are consistent with a nucleation-condensation mechanism, each of which is described as helix-assisted {beta}-1 and {beta}-2 hairpin formation, respectively. We determine that the model for protein G exhibits an early intermediate that precedes the rate-limiting barrier of folding and which draws together misaligned secondary structure elements that are stabilized by hydrophobic core contactsmore » involving the third {beta}-strand, and presages the later transition state in which the correct strand alignment of these same secondary structure elements is restored. Finally the validity of the targeted intermediate ensemble for protein G was analyzed by fitting the kinetic data to a two-step first order reversible reaction, proving that protein G folding involves an on-pathway early intermediate, and should be populated and therefore observable by experiment.« less
Intermediates and the folding of proteins L and G

PubMed Central

Brown, Scott; Head-Gordon, Teresa

2004-01-01

We use a minimalist protein model, in combination with a sequence design strategy, to determine differences in primary structure for proteins L and G, which are responsible for the two proteins folding through distinctly different folding mechanisms. We find that the folding of proteins L and G are consistent with a nucleation-condensation mechanism, each of which is described as helix-assisted β-1 and β-2 hairpin formation, respectively. We determine that the model for protein G exhibits an early intermediate that precedes the rate-limiting barrier of folding, and which draws together misaligned secondary structure elements that are stabilized by hydrophobic core contacts involving the third β-strand, and presages the later transition state in which the correct strand alignment of these same secondary structure elements is restored. Finally, the validity of the targeted intermediate ensemble for protein G was analyzed by fitting the kinetic data to a two-step first-order reversible reaction, proving that protein G folding involves an on-pathway early intermediate, and should be populated and therefore observable by experiment. PMID:15044729
Designing pH induced fold switch in proteins

NASA Astrophysics Data System (ADS)

Baruah, Anupaul; Biswas, Parbati

2015-05-01

This work investigates the computational design of a pH induced protein fold switch based on a self-consistent mean-field approach by identifying the ensemble averaged characteristics of sequences that encode a fold switch. The primary challenge to balance the alternative sets of interactions present in both target structures is overcome by simultaneously optimizing two foldability criteria corresponding to two target structures. The change in pH is modeled by altering the residual charge on the amino acids. The energy landscape of the fold switch protein is found to be double funneled. The fold switch sequences stabilize the interactions of the sites with similar relative surface accessibility in both target structures. Fold switch sequences have low sequence complexity and hence lower sequence entropy. The pH induced fold switch is mediated by attractive electrostatic interactions rather than hydrophobic-hydrophobic contacts. This study may provide valuable insights to the design of fold switch proteins.
Computer Folding of RNA Tetraloops: Identification of Key Force Field Deficiencies.

PubMed

Kührová, Petra; Best, Robert B; Bottaro, Sandro; Bussi, Giovanni; Šponer, Jiří; Otyepka, Michal; Banáš, Pavel

2016-09-13

The computer-aided folding of biomolecules, particularly RNAs, is one of the most difficult challenges in computational structural biology. RNA tetraloops are fundamental RNA motifs playing key roles in RNA folding and RNA-RNA and RNA-protein interactions. Although state-of-the-art Molecular Dynamics (MD) force fields correctly describe the native state of these tetraloops as a stable free-energy basin on the microsecond time scale, enhanced sampling techniques reveal that the native state is not the global free energy minimum, suggesting yet unidentified significant imbalances in the force fields. Here, we tested our ability to fold the RNA tetraloops in various force fields and simulation settings. We employed three different enhanced sampling techniques, namely, temperature replica exchange MD (T-REMD), replica exchange with solute tempering (REST2), and well-tempered metadynamics (WT-MetaD). We aimed to separate problems caused by limited sampling from those due to force-field inaccuracies. We found that none of the contemporary force fields is able to correctly describe folding of the 5'-GAGA-3' tetraloop over a range of simulation conditions. We thus aimed to identify which terms of the force field are responsible for this poor description of TL folding. We showed that at least two different imbalances contribute to this behavior, namely, overstabilization of base-phosphate and/or sugar-phosphate interactions and underestimated stability of the hydrogen bonding interaction in base pairing. The first artifact stabilizes the unfolded ensemble, while the second one destabilizes the folded state. The former problem might be partially alleviated by reparametrization of the van der Waals parameters of the phosphate oxygens suggested by Case et al., while in order to overcome the latter effect we suggest local potentials to better capture hydrogen bonding interactions.
Mitochondrial metabolic regulation by GRP78

PubMed Central

Prasad, Manoj; Pawlak, Kevin J.; Burak, William E.; Perry, Elizabeth E.; Marshall, Brendan; Whittal, Randy M.; Bose, Himangshu S.

2017-01-01

Steroids, essential for mammalian survival, are initiated by cholesterol transport by steroidogenic acute regulatory protein (StAR). Appropriate protein folding is an essential requirement of activity. Endoplasmic reticulum (ER) chaperones assist in folding of cytoplasmic proteins, whereas mitochondrial chaperones fold only mitochondrial proteins. We show that glucose regulatory protein 78 (GRP78), a master ER chaperone, is also present at the mitochondria-associated ER membrane (MAM), where it folds StAR for delivery to the outer mitochondrial membrane. StAR expression and activity are drastically reduced following GRP78 knockdown. StAR folding starts at the MAM region; thus, its cholesterol fostering capacity is regulated by GRP78 long before StAR reaches the mitochondria. In summary, GRP78 is an acute regulator of steroidogenesis at the MAM, regulating the intermediate folding of StAR that is crucial for its activity. PMID:28275724
GroEL stimulates protein folding through forced unfolding

PubMed Central

Lin, Zong; Madan, Damian; Rye, Hays S

2013-01-01

Many proteins cannot fold without the assistance of chaperonin machines like GroEL and GroES. The nature of this assistance, however, remains poorly understood. Here we demonstrate that unfolding of a substrate protein by GroEL enhances protein folding. We first show that capture of a protein on the open ring of a GroEL–ADP–GroES complex, GroEL’s physiological acceptor state for non-native proteins in vivo, leaves the substrate protein in an unexpectedly compact state. Subsequent binding of ATP to the same GroEL ring causes rapid, forced unfolding of the substrate protein. Notably, the fraction of the substrate protein that commits to the native state following GroES binding and protein release into the GroEL–GroES cavity is proportional to the extent of substrate-protein unfolding. Forced protein unfolding is thus a central component of the multilayered stimulatory mechanism used by GroEL to drive protein folding. PMID:18311152
Characterization of protein-folding pathways by reduced-space modeling.

PubMed

Kmiecik, Sebastian; Kolinski, Andrzej

2007-07-24

Ab initio simulations of the folding pathways are currently limited to very small proteins. For larger proteins, some approximations or simplifications in protein models need to be introduced. Protein folding and unfolding are among the basic processes in the cell and are very difficult to characterize in detail by experiment or simulation. Chymotrypsin inhibitor 2 (CI2) and barnase are probably the best characterized experimentally in this respect. For these model systems, initial folding stages were simulated by using CA-CB-side chain (CABS), a reduced-space protein-modeling tool. CABS employs knowledge-based potentials that proved to be very successful in protein structure prediction. With the use of isothermal Monte Carlo (MC) dynamics, initiation sites with a residual structure and weak tertiary interactions were identified. Such structures are essential for the initiation of the folding process through a sequential reduction of the protein conformational space, overcoming the Levinthal paradox in this manner. Furthermore, nucleation sites that initiate a tertiary interactions network were located. The MC simulations correspond perfectly to the results of experimental and theoretical research and bring insights into CI2 folding mechanism: unambiguous sequence of folding events was reported as well as cooperative substructures compatible with those obtained in recent molecular dynamics unfolding studies. The correspondence between the simulation and experiment shows that knowledge-based potentials are not only useful in protein structure predictions but are also capable of reproducing the folding pathways. Thus, the results of this work significantly extend the applicability range of reduced models in the theoretical study of proteins.
In vitro folding of inclusion body proteins.

PubMed

Rudolph, R; Lilie, H

1996-01-01

Insoluble, inactive inclusion bodies are frequently formed upon recombinant protein production in transformed microorganisms. These inclusion bodies, which contain the recombinant protein in an highly enriched form, can be isolated by solid/liquid separation. After solubilization, native proteins can be generated from the inactive material by using in vitro folding techniques. New folding procedures have been developed for efficient in vitro reconstitution of complex hydrophobic, multidomain, oligomeric, or highly disulfide-bonded proteins. These protocols take into account process parameters such as protein concentration, catalysis of disulfide bond formation, temperature, pH, and ionic strength, as well as specific solvent ingredients that reduce unproductive side reactions. Modification of the protein sequence has been exploited to improve in vitro folding.
Endoplasmic Reticulum Stress and Oxidative Stress: A Vicious Nexus Implicated in Bowel Disease Pathophysiology

PubMed Central

Chong, Wai Chin; Shastri, Madhur D.; Eri, Rajaraman

2017-01-01

The endoplasmic reticulum (ER) is a complex protein folding and trafficking organelle. Alteration and discrepancy in the endoplasmic reticulum environment can affect the protein folding process and hence, can result in the production of misfolded proteins. The accumulation of misfolded proteins causes cellular damage and elicits endoplasmic reticulum stress. Under such stress conditions, cells exhibit reduced functional synthesis, and will undergo apoptosis if the stress is prolonged. To resolve the ER stress, cells trigger an intrinsic mechanism called an unfolded protein response (UPR). UPR is an adaptive signaling process that triggers multiple pathways through the endoplasmic reticulum transmembrane transducers, to reduce and remove misfolded proteins and improve the protein folding mechanism, in order to improve and maintain endoplasmic reticulum homeostasis. An increasing number of studies support the view that oxidative stress has a strong connection with ER stress. During the protein folding process, reactive oxygen species are produced as by-products, leading to impaired reduction-oxidation (redox) balance conferring oxidative stress. As the protein folding process is dependent on redox homeostasis, the oxidative stress can disrupt the protein folding mechanism and enhance the production of misfolded proteins, causing further ER stress. It is proposed that endoplasmic reticulum stress and oxidative stress together play significant roles in the pathophysiology of bowel diseases. PMID:28379196
Endoplasmic Reticulum Stress and Oxidative Stress: A Vicious Nexus Implicated in Bowel Disease Pathophysiology.

PubMed

Chong, Wai Chin; Shastri, Madhur D; Eri, Rajaraman

2017-04-05

The endoplasmic reticulum (ER) is a complex protein folding and trafficking organelle. Alteration and discrepancy in the endoplasmic reticulum environment can affect the protein folding process and hence, can result in the production of misfolded proteins. The accumulation of misfolded proteins causes cellular damage and elicits endoplasmic reticulum stress. Under such stress conditions, cells exhibit reduced functional synthesis, and will undergo apoptosis if the stress is prolonged. To resolve the ER stress, cells trigger an intrinsic mechanism called an unfolded protein response (UPR). UPR is an adaptive signaling process that triggers multiple pathways through the endoplasmic reticulum transmembrane transducers, to reduce and remove misfolded proteins and improve the protein folding mechanism, in order to improve and maintain endoplasmic reticulum homeostasis. An increasing number of studies support the view that oxidative stress has a strong connection with ER stress. During the protein folding process, reactive oxygen species are produced as by-products, leading to impaired reduction-oxidation (redox) balance conferring oxidative stress. As the protein folding process is dependent on redox homeostasis, the oxidative stress can disrupt the protein folding mechanism and enhance the production of misfolded proteins, causing further ER stress. It is proposed that endoplasmic reticulum stress and oxidative stress together play significant roles in the pathophysiology of bowel diseases.
Steady-state structural fluctuation is a predictor of the necessity of pausing-mediated co-translational folding for small proteins.

PubMed

Huang, Wenxi; Liu, Wanting; Jin, Jingjie; Xiao, Qilan; Lu, Ruibin; Chen, Wei; Xiong, Sheng; Zhang, Gong

2018-03-25

Translational pausing coordinates protein synthesis and co-translational folding. It is a common factor that facilitates the correct folding of large, multi-domain proteins. For small proteins, pausing sites rarely occurs in the gene body, and the 3'-end pausing sites are only essential for the folding of a fraction of proteins. The determinant of the necessity of the pausings remains obscure. In this study, we demonstrated that the steady-state structural fluctuation is a predictor of the necessity of pausing-mediated co-translational folding for small proteins. Validated by experiments with 5 model proteins, we found that the rigid protein structures do not, while the flexible structures do need 3'-end pausings to fold correctly. Therefore, rational optimization of translational pausing can improve soluble expression of small proteins with flexible structures, but not the rigid ones. The rigidity of the structure can be quantitatively estimated in silico using molecular dynamic simulation. Nevertheless, we also found that the translational pausing optimization increases the fitness of the expression host, and thus benefits the recombinant protein production, independent from the soluble expression. These results shed light on the structural basis of the translational pausing and provided a practical tool for industrial protein fermentation. Copyright © 2017. Published by Elsevier Inc.
Predictors of natively unfolded proteins: unanimous consensus score to detect a twilight zone between order and disorder in generic datasets.

PubMed

Deiana, Antonio; Giansanti, Andrea

2010-04-21

Natively unfolded proteins lack a well defined three dimensional structure but have important biological functions, suggesting a re-assignment of the structure-function paradigm. To assess that a given protein is natively unfolded requires laborious experimental investigations, then reliable sequence-only methods for predicting whether a sequence corresponds to a folded or to an unfolded protein are of interest in fundamental and applicative studies. Many proteins have amino acidic compositions compatible both with the folded and unfolded status, and belong to a twilight zone between order and disorder. This makes difficult a dichotomic classification of protein sequences into folded and natively unfolded ones. In this work we propose an operational method to identify proteins belonging to the twilight zone by combining into a consensus score good performing single predictors of folding. In this methodological paper dichotomic folding indexes are considered: hydrophobicity-charge, mean packing, mean pairwise energy, Poodle-W and a new global index, that is called here gVSL2, based on the local disorder predictor VSL2. The performance of these indexes is evaluated on different datasets, in particular on a new dataset composed by 2369 folded and 81 natively unfolded proteins. Poodle-W, gVSL2 and mean pairwise energy have good performance and stability in all the datasets considered and are combined into a strictly unanimous combination score SSU, that leaves proteins unclassified when the consensus of all combined indexes is not reached. The unclassified proteins: i) belong to an overlap region in the vector space of amino acidic compositions occupied by both folded and unfolded proteins; ii) are composed by approximately the same number of order-promoting and disorder-promoting amino acids; iii) have a mean flexibility intermediate between that of folded and that of unfolded proteins. Our results show that proteins unclassified by SSU belong to a twilight zone. Proteins left unclassified by the consensus score SSU have physical properties intermediate between those of folded and those of natively unfolded proteins and their structural properties and evolutionary history are worth to be investigated.
Predictors of natively unfolded proteins: unanimous consensus score to detect a twilight zone between order and disorder in generic datasets

PubMed Central

2010-01-01

Background Natively unfolded proteins lack a well defined three dimensional structure but have important biological functions, suggesting a re-assignment of the structure-function paradigm. To assess that a given protein is natively unfolded requires laborious experimental investigations, then reliable sequence-only methods for predicting whether a sequence corresponds to a folded or to an unfolded protein are of interest in fundamental and applicative studies. Many proteins have amino acidic compositions compatible both with the folded and unfolded status, and belong to a twilight zone between order and disorder. This makes difficult a dichotomic classification of protein sequences into folded and natively unfolded ones. In this work we propose an operational method to identify proteins belonging to the twilight zone by combining into a consensus score good performing single predictors of folding. Results In this methodological paper dichotomic folding indexes are considered: hydrophobicity-charge, mean packing, mean pairwise energy, Poodle-W and a new global index, that is called here gVSL2, based on the local disorder predictor VSL2. The performance of these indexes is evaluated on different datasets, in particular on a new dataset composed by 2369 folded and 81 natively unfolded proteins. Poodle-W, gVSL2 and mean pairwise energy have good performance and stability in all the datasets considered and are combined into a strictly unanimous combination score SSU, that leaves proteins unclassified when the consensus of all combined indexes is not reached. The unclassified proteins: i) belong to an overlap region in the vector space of amino acidic compositions occupied by both folded and unfolded proteins; ii) are composed by approximately the same number of order-promoting and disorder-promoting amino acids; iii) have a mean flexibility intermediate between that of folded and that of unfolded proteins. Conclusions Our results show that proteins unclassified by SSU belong to a twilight zone. Proteins left unclassified by the consensus score SSU have physical properties intermediate between those of folded and those of natively unfolded proteins and their structural properties and evolutionary history are worth to be investigated. PMID:20409339
A Method for WD40 Repeat Detection and Secondary Structure Prediction

PubMed Central

Wang, Yang; Jiang, Fan; Zhuo, Zhu; Wu, Xian-Hui; Wu, Yun-Dong

2013-01-01

WD40-repeat proteins (WD40s), as one of the largest protein families in eukaryotes, play vital roles in assembling protein-protein/DNA/RNA complexes. WD40s fold into similar β-propeller structures despite diversified sequences. A program WDSP (WD40 repeat protein Structure Predictor) has been developed to accurately identify WD40 repeats and predict their secondary structures. The method is designed specifically for WD40 proteins by incorporating both local residue information and non-local family-specific structural features. It overcomes the problem of highly diversified protein sequences and variable loops. In addition, WDSP achieves a better prediction in identifying multiple WD40-domain proteins by taking the global combination of repeats into consideration. In secondary structure prediction, the average Q3 accuracy of WDSP in jack-knife test reaches 93.7%. A disease related protein LRRK2 was used as a representive example to demonstrate the structure prediction. PMID:23776530
AnchorDock for Blind Flexible Docking of Peptides to Proteins.

PubMed

Slutzki, Michal; Ben-Shimon, Avraham; Niv, Masha Y

2017-01-01

Due to increasing interest in peptides as signaling modulators and drug candidates, several methods for peptide docking to their target proteins are under active development. The "blind" docking problem, where the peptide-binding site on the protein surface is unknown, presents one of the current challenges in the field. AnchorDock protocol was developed by Ben-Shimon and Niv to address this challenge.This protocol narrows the docking search to the most relevant parts of the conformational space. This is achieved by pre-folding the free peptide and by computationally detecting anchoring spots on the surface of the unbound protein. Multiple flexible simulated annealing molecular dynamics (SAMD) simulations are subsequently carried out, starting from pre-folded peptide conformations, constrained to the various precomputed anchoring spots.Here, AnchorDock is demonstrated using two known protein-peptide complexes. A PDZ-peptide complex provides a relatively easy case due to the relatively small size of the protein, and a typical peptide conformation and binding region; a more challenging example is a complex between USP7 N-term and a p53-derived peptide, where the protein is larger, and the peptide conformation and a binding site are generally assumed to be unknown. AnchorDock returned native-like solutions ranked first and third for the PDZ and USP7 complexes, respectively. We describe the procedure step by step and discuss possible modifications where applicable.
Retarded protein folding of deficient human α1-antitrypsin D256V and L41P variants

PubMed Central

Jung, Chan-Hun; Na, Yu-Ran; Im, Hana

2004-01-01

α1-Antitrypsin is the most abundant protease inhibitor in plasma and is the archetype of the serine protease inhibitor superfamily. Genetic variants of human α1-antitrypsin are associated with early-onset emphysema and liver cirrhosis. However, the detailed molecular mechanism for the pathogenicity of most variant α1-antitrypsin molecules is not known. Here we examined the structural basis of a dozen deficient α1-antitrypsin variants. Unlike most α1-antitrypsin variants, which were unstable, D256V and L41P variants exhibited extremely retarded protein folding as compared with the wild-type molecule. Once folded, however, the stability and inhibitory activity of these variant proteins were comparable to those of the wild-type molecule. Retarded protein folding may promote protein aggregation by allowing the accumulation of aggregation-prone folding intermediates. Repeated observations of retarded protein folding indicate that it is an important mechanism causing α1-antitrypsin deficiency by variant molecules, which have to fold into the metastable native form to be functional. PMID:14767073
The Dominant Folding Route Minimizes Backbone Distortion in SH3

PubMed Central

Lammert, Heiko; Noel, Jeffrey K.; Onuchic, José N.

2012-01-01

Energetic frustration in protein folding is minimized by evolution to create a smooth and robust energy landscape. As a result the geometry of the native structure provides key constraints that shape protein folding mechanisms. Chain connectivity in particular has been identified as an essential component for realistic behavior of protein folding models. We study the quantitative balance of energetic and geometrical influences on the folding of SH3 in a structure-based model with minimal energetic frustration. A decomposition of the two-dimensional free energy landscape for the folding reaction into relevant energy and entropy contributions reveals that the entropy of the chain is not responsible for the folding mechanism. Instead the preferred folding route through the transition state arises from a cooperative energetic effect. Off-pathway structures are penalized by excess distortion in local backbone configurations and contact pair distances. This energy cost is a new ingredient in the malleable balance of interactions that controls the choice of routes during protein folding. PMID:23166485
Evidence for the principle of minimal frustration in the evolution of protein folding landscapes.

PubMed

Tzul, Franco O; Vasilchuk, Daniel; Makhatadze, George I

2017-02-28

Theoretical and experimental studies have firmly established that protein folding can be described by a funneled energy landscape. This funneled energy landscape is the result of foldable protein sequences evolving following the principle of minimal frustration, which allows proteins to rapidly fold to their native biologically functional conformations. For a protein family with a given functional fold, the principle of minimal frustration suggests that, independent of sequence, all proteins within this family should fold with similar rates. However, depending on the optimal living temperature of the organism, proteins also need to modulate their thermodynamic stability. Consequently, the difference in thermodynamic stability should be primarily caused by differences in the unfolding rates. To test this hypothesis experimentally, we performed comprehensive thermodynamic and kinetic analyses of 15 different proteins from the thioredoxin family. Eight of these thioredoxins were extant proteins from psychrophilic, mesophilic, or thermophilic organisms. The other seven protein sequences were obtained using ancestral sequence reconstruction and can be dated back over 4 billion years. We found that all studied proteins fold with very similar rates but unfold with rates that differ up to three orders of magnitude. The unfolding rates correlate well with the thermodynamic stability of the proteins. Moreover, proteins that unfold slower are more resistant to proteolysis. These results provide direct experimental support to the principle of minimal frustration hypothesis.

The Endoplasmic Reticulum and the Unfolded Protein Response

PubMed Central

Malhotra, Jyoti D.; Kaufman, Randal J.

2009-01-01

The endoplasmic reticulum (ER) is the site where proteins enter the secretory pathway. Proteins are translocated into the ER lumen in an unfolded state and require protein chaperones and catalysts of protein folding to attain their final appropriate conformation. A sensitive surveillance mechanism exists to prevent misfolded proteins from transiting the secretory pathway and ensures that persistently misfolded proteins are directed towards a degradative pathway. In addition, those processes that prevent accumulation of unfolded proteins in the ER lumen are highly regulated by an intracellular signaling pathway known as the unfolded protein response (UPR). The UPR provides a mechanism by which cells can rapidly adapt to alterations in client protein-folding load in the ER lumen by expanding the capacity for protein folding. In addition, a variety of insults that disrupt protein folding in the ER lumen also activate the UPR. These include changes in intralumenal calcium, altered glycosylation, nutrient deprivation, pathogen infection, expression of folding-defective proteins, and changes in redox status. Persistent protein misfolding initiates apoptotic cascades that are now known to play fundamental roles in the pathogenesis of multiple human diseases including diabetes, atherosclerosis and neurodegenerative diseases. PMID:18023214
Ligand-promoted protein folding by biased kinetic partitioning.

PubMed

Hingorani, Karan S; Metcalf, Matthew C; Deming, Derrick T; Garman, Scott C; Powers, Evan T; Gierasch, Lila M

2017-04-01

Protein folding in cells occurs in the presence of high concentrations of endogenous binding partners, and exogenous binding partners have been exploited as pharmacological chaperones. A combined mathematical modeling and experimental approach shows that a ligand improves the folding of a destabilized protein by biasing the kinetic partitioning between folding and alternative fates (aggregation or degradation). Computationally predicted inhibition of test protein aggregation and degradation as a function of ligand concentration are validated by experiments in two disparate cellular systems.
Ligand-Promoted Protein Folding by Biased Kinetic Partitioning

PubMed Central

Hingorani, Karan S.; Metcalf, Matthew C.; Deming, Derrick T.; Garman, Scott C.; Powers, Evan T.; Gierasch, Lila M.

2017-01-01

Protein folding in cells occurs in the presence of high concentrations of endogenous binding partners, and exogenous binding partners have been exploited as pharmacological chaperones. A combined mathematical modeling and experimental approach shows that a ligand improves the folding of a destabilized protein by biasing the kinetic partitioning between folding and alternative fates (aggregation or degradation). Computationally predicted inhibition of test protein aggregation and degradation as a function of ligand concentration are validated by experiments in two disparate cellular systems. PMID:28218913
Redesigning the type II' β-turn in green fluorescent protein to type I': implications for folding kinetics and stability.

PubMed

Madan, Bharat; Sokalingam, Sriram; Raghunathan, Govindan; Lee, Sun-Gu

2014-10-01

Both Type I' and Type II' β-turns have the same sense of the β-turn twist that is compatible with the β-sheet twist. They occur predominantly in two residue β-hairpins, but the occurrence of Type I' β-turns is two times higher than Type II' β-turns. This suggests that Type I' β-turns may be more stable than Type II' β-turns, and Type I' β-turn sequence and structure can be more favorable for protein folding than Type II' β-turns. Here, we redesigned the native Type II' β-turn in GFP to Type I' β-turn, and investigated its effect on protein folding and stability. The Type I' β-turns were designed based on the statistical analysis of residues in natural Type I' β-turns. The substitution of the native "GD" sequence of i+1 and i+2 residues with Type I' preferred "(N/D)G" sequence motif increased the folding rate by 50% and slightly improved the thermodynamic stability. Despite the enhancement of in vitro refolding kinetics and stability of the redesigned mutants, they showed poor soluble expression level compared to wild type. To overcome this problem, i and i + 3 residues of the designed Type I' β-turn were further engineered. The mutation of Thr to Lys at i + 3 could restore the in vivo soluble expression of the Type I' mutant. This study indicates that Type II' β-turns in natural β-hairpins can be further optimized by converting the sequence to Type I'. © 2014 Wiley Periodicals, Inc.
Structural Transitions of Confined Model Proteins: Molecular Dynamics Simulation and Experimental Validation

PubMed Central

Lu, Diannan; Liu, Zheng; Wu, Jianzhong

2006-01-01

Proteins fold in a confined space not only in vivo, i.e., folding assisted by molecular chaperons and chaperonins in a crowded cellular medium, but also in vitro as in production of recombinant proteins. Despite extensive work on protein folding in bulk, little is known about how and to what extent the thermodynamics and kinetics of protein folding are altered by confinement. In this work, we use a Gō-like off-lattice model to investigate the folding and stability of an all β-sheet protein in spherical cages of different sizes and surface hydrophobicity. We find whereas extreme confinement inhibits correct folding, a hydrophilic cage stabilizes the protein due to restriction of the unfolded configurations. In a hydrophobic cage, however, strong attraction from the cage surface destabilizes the confined protein because of competition between self-aggregation and adsorption of hydrophobic residues. We show that the kinetics of protein collapse and folding is strongly correlated with both the cage size and the surface hydrophobicity. It is demonstrated that a cage of moderate size and hydrophobicity optimizes both the folding yield and kinetics of structural transitions. To support the simulation results, we have also investigated the refolding of hen-egg lysozyme in the presence of cetyltrimethylammoniumbromide (CTAB) surfactants that provide an effective confinement of the proteins by micellization. The influence of the surfactant hydrophobicity on the structural and biological activity of the protein is determined with circular dichroism spectrum, fluorescence emission spectrum, and biological activity assay. It is shown that, as predicted by coarse-grained simulations, CTAB micelles facilitate the collapse of denatured lysozyme, whereas the addition of β-cyclodextrin-grafted-PNIPAAm, a weakly hydrophobic stripper, dissociates CTAB micelles and promotes the conformational rearrangement and thereby gives an improved recovery of lysozyme activity. PMID:16461405
Absolute comparison of simulated and experimental protein-folding dynamics

NASA Astrophysics Data System (ADS)

Snow, Christopher D.; Nguyen, Houbi; Pande, Vijay S.; Gruebele, Martin

2002-11-01

Protein folding is difficult to simulate with classical molecular dynamics. Secondary structure motifs such as α-helices and β-hairpins can form in 0.1-10µs (ref. 1), whereas small proteins have been shown to fold completely in tens of microseconds. The longest folding simulation to date is a single 1-µs simulation of the villin headpiece; however, such single runs may miss many features of the folding process as it is a heterogeneous reaction involving an ensemble of transition states. Here, we have used a distributed computing implementation to produce tens of thousands of 5-20-ns trajectories (700µs) to simulate mutants of the designed mini-protein BBA5. The fast relaxation dynamics these predict were compared with the results of laser temperature-jump experiments. Our computational predictions are in excellent agreement with the experimentally determined mean folding times and equilibrium constants. The rapid folding of BBA5 is due to the swift formation of secondary structure. The convergence of experimentally and computationally accessible timescales will allow the comparison of absolute quantities characterizing in vitro and in silico (computed) protein folding.
Solvent effect on the folding dynamics and structure of E6-associated protein characterized from ab initio protein folding simulations

NASA Astrophysics Data System (ADS)

Xu, Zhijun; Lazim, Raudah; Sun, Tiedong; Mei, Ye; Zhang, Dawei

2012-04-01

Solvent effect on protein conformation and folding mechanism of E6-associated protein (E6ap) peptide are investigated using a recently developed charge update scheme termed as adaptive hydrogen bond-specific charge (AHBC). On the basis of the close agreement between the calculated helix contents from AHBC simulations and experimental results, we observed based on the presented simulations that the two ends of the peptide may simultaneously take part in the formation of the helical structure at the early stage of folding and finally merge to form a helix with lowest backbone RMSD of about 0.9 Å in 40% 2,2,2-trifluoroethanol solution. However, in pure water, the folding may start at the center of the peptide sequence instead of at the two opposite ends. The analysis of the free energy landscape indicates that the solvent may determine the folding clusters of E6ap, which subsequently leads to the different final folded structure. The current study demonstrates new insight to the role of solvent in the determination of protein structure and folding dynamics.
Structural test of the parameterized-backbone method for protein design.

PubMed

Plecs, Joseph J; Harbury, Pehr B; Kim, Peter S; Alber, Tom

2004-09-03

Designing new protein folds requires a method for simultaneously optimizing the conformation of the backbone and the side-chains. One approach to this problem is the use of a parameterized backbone, which allows the systematic exploration of families of structures. We report the crystal structure of RH3, a right-handed, three-helix coiled coil that was designed using a parameterized backbone and detailed modeling of core packing. This crystal structure was determined using another rationally designed feature, a metal-binding site that permitted experimental phasing of the X-ray data. RH3 adopted the intended fold, which has not been observed previously in biological proteins. Unanticipated structural asymmetry in the trimer was a principal source of variation within the RH3 structure. The sequence of RH3 differs from that of a previously characterized right-handed tetramer, RH4, at only one position in each 11 amino acid sequence repeat. This close similarity indicates that the design method is sensitive to the core packing interactions that specify the protein structure. Comparison of the structures of RH3 and RH4 indicates that both steric overlap and cavity formation provide strong driving forces for oligomer specificity.
Molecular Dynamics Simulation of Tau Peptides for the Investigation of Conformational Changes Induced by Specific Phosphorylation Patterns.

PubMed

Gandhi, Neha S; Kukic, Predrag; Lippens, Guy; Mancera, Ricardo L

2017-01-01

The Tau protein plays an important role due to its biomolecular interactions in neurodegenerative diseases. The lack of stable structure and various posttranslational modifications such as phosphorylation at various sites in the Tau protein pose a challenge for many experimental methods that are traditionally used to study protein folding and aggregation. Atomistic molecular dynamics (MD) simulations can help around deciphering relationship between phosphorylation and various intermediate and stable conformations of the Tau protein which occur on longer timescales. This chapter outlines protocols for the preparation, execution, and analysis of all-atom MD simulations of a 21-amino acid-long phosphorylated Tau peptide with the aim of generating biologically relevant structural and dynamic information. The simulations are done in explicit solvent and starting from nearly extended configurations of the peptide. The scaled MD method implemented in AMBER14 was chosen to achieve enhanced conformational sampling in addition to a conventional MD approach, thereby allowing the characterization of folding for such an intrinsically disordered peptide at 293 K. Emphasis is placed on the analysis of the simulation trajectories to establish correlations with NMR data (i.e., chemical shifts and NOEs). Finally, in-depth discussions are provided for commonly encountered problems.
Dodging the crisis of folding proteins with knots

NASA Astrophysics Data System (ADS)

Sulkowska, Joanna

2009-03-01

Proteins with nontrivial topology, containing knots and slipknots, have the ability to fold to their native states without any additional external forces invoked. A mechanism is suggested for folding of these proteins, such as YibK and YbeA, which involves an intermediate configuration with a slipknot. It elucidates the role of topological barriers and backtracking during the folding event. It also illustrates that native contacts are sufficient to guarantee folding in around 1-2% of the simulations, and how slipknot intermediates are needed to reduce the topological bottlenecks. As expected, simulations of proteins with similar structure but with knot removed fold much more efficiently, clearly demonstrating the origin of these topological barriers. Although these studies are based on a simple coarse-grained model, they are already able to extract some of the underlying principles governing folding in such complex topologies.
A DsbA-Deficient Periplasm Enables Functional Display of a Protein with Redox-Sensitive Folding on M13 Phage.

PubMed

Chen, Minyong; Samuelson, James C

2016-06-14

The requirements for target protein folding in M13 phage display are largely underappreciated. Here we chose Fbs1, a carbohydrate binding protein, as a model to address this issue. Importantly, folding of Fbs1 is impaired in an oxidative environment. Fbs1 can be displayed on M13 phage using the SRP or Sec pathway. However, the displayed Fbs1 protein is properly folded only when Fbs1 is translocated via the SRP pathway and displayed using Escherichia coli cells with a DsbA-negative periplasm. This study indicates M13 phage display may be improved using a system specifically designed according to the folding requirements of each target protein.
Roles of beta-turns in protein folding: from peptide models to protein engineering.

PubMed

Marcelino, Anna Marie C; Gierasch, Lila M

2008-05-01

Reverse turns are a major class of protein secondary structure; they represent sites of chain reversal and thus sites where the globular character of a protein is created. It has been speculated for many years that turns may nucleate the formation of structure in protein folding, as their propensity to occur will favor the approximation of their flanking regions and their general tendency to be hydrophilic will favor their disposition at the solvent-accessible surface. Reverse turns are local features, and it is therefore not surprising that their structural properties have been extensively studied using peptide models. In this article, we review research on peptide models of turns to test the hypothesis that the propensities of turns to form in short peptides will relate to the roles of corresponding sequences in protein folding. Turns with significant stability as isolated entities should actively promote the folding of a protein, and by contrast, turn sequences that merely allow the chain to adopt conformations required for chain reversal are predicted to be passive in the folding mechanism. We discuss results of protein engineering studies of the roles of turn residues in folding mechanisms. Factors that correlate with the importance of turns in folding indeed include their intrinsic stability, as well as their topological context and their participation in hydrophobic networks within the protein's structure.
Protein domain definition should allow for conditional disorder

PubMed Central

Yegambaram, Kavestri; Bulloch, Esther MM; Kingston, Richard L

2013-01-01

Abstract: Proteins are often classified in a binary fashion as either structured or disordered. However this approach has several deficits. Firstly, protein folding is always conditional on the physiochemical environment. A protein which is structured in some circumstances will be disordered in others. Secondly, it hides a fundamental asymmetry in behavior. While all structured proteins can be unfolded through a change in environment, not all disordered proteins have the capacity for folding. Failure to accommodate these complexities confuses the definition of both protein structural domains and intrinsically disordered regions. We illustrate these points with an experimental study of a family of small binding domains, drawn from the RNA polymerase of mumps virus and its closest relatives. Assessed at face value the domains fall on a structural continuum, with folded, partially folded, and near unstructured members. Yet the disorder present in the family is conditional, and these closely related polypeptides can access the same folded state under appropriate conditions. Any heuristic definition of the protein domain emphasizing conformational stability divides this domain family in two, in a way that makes no biological sense. Structural domains would be better defined by their ability to adopt a specific tertiary structure: a structure that may or may not be realized, dependent on the circumstances. This explicitly allows for the conditional nature of protein folding, and more clearly demarcates structural domains from intrinsically disordered regions that may function without folding. PMID:23963781
Benzyl isothiocyanate alters the gene expression with cell cycle regulation and cell death in human brain glioblastoma GBM 8401 cells.

PubMed

Tang, Nou-Ying; Chueh, Fu-Shin; Yu, Chien-Chih; Liao, Ching-Lung; Lin, Jen-Jyh; Hsia, Te-Chun; Wu, King-Chuen; Liu, Hsin-Chung; Lu, Kung-Wen; Chung, Jing-Gung

2016-04-01

Glioblastoma multiforme (GBM) is a highly malignant devastating brain tumor in adults. Benzyl isothiocyanate (BITC) is one of the isothiocyanates that have been shown to induce human cancer cell apoptosis and cell cycle arrest. Herein, the effect of BITC on cell viability and apoptotic cell death and the genetic levels of human brain glioblastoma GBM 8401 cells in vitro were investigated. We found that BITC induced cell morphological changes, decreased cell viability and the induction of cell apoptosis in GBM 8401 cells was time-dependent. cDNA microarray was used to examine the effects of BITC on GBM 8401 cells and we found that numerous genes associated with cell death and cell cycle regulation in GBM 8401 cells were altered after BITC treatment. The results show that expression of 317 genes was upregulated, and two genes were associated with DNA damage, the DNA-damage-inducible transcript 3 (DDIT3) was increased 3.66-fold and the growth arrest and DNA-damage-inducible α (GADD45A) was increased 2.34-fold. We also found that expression of 182 genes was downregulated and two genes were associated with receptor for cell responses to stimuli, the EGF containing fibulin-like extracellular matrix protein 1 (EFEMP1) was inhibited 2.01-fold and the TNF receptor-associated protein 1 (TRAP1) was inhibited 2.08-fold. BITC inhibited seven mitochondria ribosomal genes, the mitochondrial ribosomal protein; tumor protein D52 (MRPS28) was inhibited 2.06-fold, the mitochondria ribosomal protein S2 (MRPS2) decreased 2.07-fold, the mitochondria ribosomal protein L23 (MRPL23) decreased 2.08-fold, the mitochondria ribosomal protein S2 (MRPS2) decreased 2.07-fold, the mitochondria ribosomal protein S12 (MRPS12) decreased 2.08-fold, the mitochondria ribosomal protein L12 (MRPL12) decreased 2.25-fold and the mitochondria ribosomal protein S34 (MRPS34) was decreased 2.30-fold in GBM 8401 cells. These changes of gene expression can provide the effects of BITC on the genetic level and are potential biomarkers for glioblastoma therapy.
Folding free energy surfaces of three small proteins under crowding: validation of the postprocessing method by direct simulation

NASA Astrophysics Data System (ADS)

Qin, Sanbo; Mittal, Jeetain; Zhou, Huan-Xiang

2013-08-01

We have developed a ‘postprocessing’ method for modeling biochemical processes such as protein folding under crowded conditions (Qin and Zhou 2009 Biophys. J. 97 12-19). In contrast to the direct simulation approach, in which the protein undergoing folding is simulated along with crowders, the postprocessing method requires only the folding simulation without crowders. The influence of the crowders is then obtained by taking conformations from the crowder-free simulation and calculating the free energies of transferring to the crowders. This postprocessing yields the folding free energy surface of the protein under crowding. Here the postprocessing results for the folding of three small proteins under ‘repulsive’ crowding are validated by those obtained previously by the direct simulation approach (Mittal and Best 2010 Biophys. J. 98 315-20). This validation confirms the accuracy of the postprocessing approach and highlights its distinct advantages in modeling biochemical processes under cell-like crowded conditions, such as enabling an atomistic representation of the test proteins.
Statistical Mechanical Foundation for the Two-State Transition in Protein Folding of Small Globular Proteins

NASA Astrophysics Data System (ADS)

Iguchi, Kazumoto

We discuss the statistical mechanical foundation for the two-state transition in the protein folding of small globular proteins. In the standard arguments of protein folding, the statistical search for the ground state is carried out from astronomically many conformations in the configuration space. This leads us to the famous Levinthal's paradox. To resolve the paradox, Gō first postulated that the two-state transition - all-or-none type transition - is very crucial for the protein folding of small globular proteins and used the Gō's lattice model to show the two-state transition nature. Recently, there have been accumulated many experimental results that support the two-state transition for small globular proteins. Stimulated by such recent experiments, Zwanzig has introduced a minimal statistical mechanical model that exhibits the two-state transition. Also, Finkelstein and coworkers have discussed the solution of the paradox by considering the sequential folding of a small globular protein. On the other hand, recently Iguchi have introduced a toy model of protein folding using the Rubik's magic snake model, in which all folded structures are exactly known and mathematically represented in terms of the four types of conformations: cis-, trans-, left and right gauche-configurations between the unit polyhedrons. In this paper, we study the relationship between the Gō's two-state transition, the Zwanzig's statistical mechanics model and the Finkelsteinapos;s sequential folding model by applying them to the Rubik's magic snake models. We show that the foundation of the Gō's two-state transition model relies on the search within the equienergy surface that is labeled by the contact order of the hydrophobic condensation. This idea reproduces the Zwanzig's statistical model as a special case, realizes the Finkelstein's sequential folding model and fits together to understand the nature of the two-state transition of a small globular protein by calculating the physical quantities such as the free energy, the contact order and the specific heat. We point out the similarity between the liquid-gas transition in statistical mechanics and the two-state transition of protein folding. We also study morphology of the Rubik's magic snake models to give a prototype model for understanding the differences between α-helices proteins and β-sheets proteins.
Global analysis of protein folding using massively parallel design, synthesis and testing

PubMed Central

Rocklin, Gabriel J.; Chidyausiku, Tamuka M.; Goreshnik, Inna; Ford, Alex; Houliston, Scott; Lemak, Alexander; Carter, Lauren; Ravichandran, Rashmi; Mulligan, Vikram K.; Chevalier, Aaron; Arrowsmith, Cheryl H.; Baker, David

2017-01-01

Proteins fold into unique native structures stabilized by thousands of weak interactions that collectively overcome the entropic cost of folding. Though these forces are “encoded” in the thousands of known protein structures, “decoding” them is challenging due to the complexity of natural proteins that have evolved for function, not stability. Here we combine computational protein design, next-generation gene synthesis, and a high-throughput protease susceptibility assay to measure folding and stability for over 15,000 de novo designed miniproteins, 1,000 natural proteins, 10,000 point-mutants, and 30,000 negative control sequences, identifying over 2,500 new stable designed proteins in four basic folds. This scale—three orders of magnitude greater than that of previous studies of design or folding—enabled us to systematically examine how sequence determines folding and stability in uncharted protein space. Iteration between design and experiment increased the design success rate from 6% to 47%, produced stable proteins unlike those found in nature for topologies where design was initially unsuccessful, and revealed subtle contributions to stability as designs became increasingly optimized. Our approach achieves the long-standing goal of a tight feedback cycle between computation and experiment, and promises to transform computational protein design into a data-driven science. PMID:28706065
Engineered fluorescent proteins illuminate the bacterial periplasm

PubMed Central

Dammeyer, Thorben; Tinnefeld, Philip

2012-01-01

The bacterial periplasm is of special interest whenever cell factories are designed and engineered. Recombinantely produced proteins are targeted to the periplasmic space of Gram negative bacteria to take advantage of the authentic N-termini, disulfide bridge formation and easy accessibility for purification with less contaminating cellular proteins. The oxidizing environment of the periplasm promotes disulfide bridge formation - a prerequisite for proper folding of many proteins into their active conformation. In contrast, the most popular reporter protein in all of cell biology, Green Fluorescent Protein (GFP), remains inactive if translocated to the periplasmic space prior to folding. Here, the self-catalyzed chromophore maturation is blocked by formation of covalent oligomers via interchain disulfide bonds in the oxidizing environment. However, different protein engineering approaches addressing folding and stability of GFP resulted in improved proteins with enhanced folding properties. Recent studies describe GFP variants that are not only active if translocated in their folded form via the twin-arginine translocation (Tat) pathway, but actively fold in the periplasm following general secretory pathway (Sec) and signal recognition particle (SRP) mediated secretion. This mini-review highlights the progress that enables new insights into bacterial export and periplasmic protein organization, as well as new biotechnological applications combining the advantages of the periplasmic production and the Aequorea-based fluorescent reporter proteins. PMID:24688673
A combination of feature extraction methods with an ensemble of different classifiers for protein structural class prediction problem.

PubMed

Dehzangi, Abdollah; Paliwal, Kuldip; Sharma, Alok; Dehzangi, Omid; Sattar, Abdul

2013-01-01

Better understanding of structural class of a given protein reveals important information about its overall folding type and its domain. It can also be directly used to provide critical information on general tertiary structure of a protein which has a profound impact on protein function determination and drug design. Despite tremendous enhancements made by pattern recognition-based approaches to solve this problem, it still remains as an unsolved issue for bioinformatics that demands more attention and exploration. In this study, we propose a novel feature extraction model that incorporates physicochemical and evolutionary-based information simultaneously. We also propose overlapped segmented distribution and autocorrelation-based feature extraction methods to provide more local and global discriminatory information. The proposed feature extraction methods are explored for 15 most promising attributes that are selected from a wide range of physicochemical-based attributes. Finally, by applying an ensemble of different classifiers namely, Adaboost.M1, LogitBoost, naive Bayes, multilayer perceptron (MLP), and support vector machine (SVM) we show enhancement of the protein structural class prediction accuracy for four popular benchmarks.
Structural perturbations on huntingtin N17 domain during its folding on 2D-nanomaterials

NASA Astrophysics Data System (ADS)

Zhang, Leili; Feng, Mei; Zhou, Ruhong; Luan, Binquan

2017-09-01

A globular protein’s folded structure in its physiological environment is largely determined by its amino acid sequence. Recently, newly discovered transformer proteins as well as intrinsically disordered proteins may adopt the folding-upon-binding mechanism where their secondary structures are highly dependent on their binding partners. Due to the various applications of nanomaterials in biological sensors and potential wearable devices, it is important to discover possible conformational changes of proteins on nanomaterials. Here, through molecular dynamics simulations, we show that the first 17 residues of the huntingtin protein (HTT-N17) exhibit appreciable differences during its folding on 2D-nanomaterials, such as graphene and MoS2 nanosheets. Namely, the protein is disordered on the graphene surface but is helical on the MoS2 surface. Despite that the amphiphilic environment at the nanosheet-water interface promotes the folding of the amphipathic proteins (such as HTT-N17), competitions between protein-nanosheet and intra-protein interactions yield very different protein conformations. Therefore, as engineered binding partners, nanomaterials might significantly affect the structures of adsorbed proteins.

DOE Office of Scientific and Technical Information (OSTI.GOV)

FLANAGAN,J.M.; BEWLEY,M.C.

It is generally accepted that the information necessary to specify the native, functional, three-dimensional structure of a protein is encoded entirely within its amino acid sequence; however, efficient reversible folding and unfolding is observed only with a subset of small single-domain proteins. Refolding experiments often lead to the formation of kinetically-trapped, misfolded species that aggregate, even in dilute solution. In the cellular environment, the barriers to efficient protein folding and maintenance of native structure are even larger due to the nature of this process. First, nascent polypeptides must fold in an extremely crowded environment where the concentration of macromolecules approachesmore » 300-400 mg/mL and on average, each ribosome is within its own diameter of another ribosome (1-3). These conditions of severe molecular crowding, coupled with high concentrations of nascent polypeptide chains, favor nonspecific aggregation over productive folding (3). Second, folding of newly-translated polypeptides occurs in the context of their vehtorial synthesis process. Amino acids are added to a growing nascent chain at the rate of -5 residues per set, which means that for a 300 residue protein its N-terminus will be exposed to the cytosol {approx}1 min before its C-terminus and be free to begin the folding process. However, because protein folding is highly cooperative, the nascent polypeptide cannot reach its native state until a complete folding domain (50-250 residues) has emerged from the ribosome. Thus, for a single-domain protein, the final steps in folding are only completed post-translationally since {approx}40 residues of a nascent chain are sequestered within the exit channel of the ribosome and are not available for folding (4). A direct consequence of this limitation in cellular folding is that during translation incomplete domains will exist in partially-folded states that tend to expose hydrophobic residues that are prone to aggregation and/or misfolding. Thus it is not surprising that, in cells, the protein folding process is error prone and organisms have evolved ''editing'' or quality control (QC) systems to assist in the folding, maintenance and, when necessary, selective removal of damaged proteins. In fact, there is growing evidence that failure of these QC-systems contributes to a number of disease states (5-8). This chapter describes our current understanding of the nature and mechanisms of the protein quality control systems in the cytosol of bacteria. Parallel systems are exploited in the cytosol and mitochondria of eukaryotes to prevent the accumulation of misfolded proteins.« less
Deciphering Cryptic Binding Sites on Proteins by Mixed-Solvent Molecular Dynamics.

PubMed

Kimura, S Roy; Hu, Hai Peng; Ruvinsky, Anatoly M; Sherman, Woody; Favia, Angelo D

2017-06-26

In recent years, molecular dynamics simulations of proteins in explicit mixed solvents have been applied to various problems in protein biophysics and drug discovery, including protein folding, protein surface characterization, fragment screening, allostery, and druggability assessment. In this study, we perform a systematic study on how mixtures of organic solvent probes in water can reveal cryptic ligand binding pockets that are not evident in crystal structures of apo proteins. We examine a diverse set of eight PDB proteins that show pocket opening induced by ligand binding and investigate whether solvent MD simulations on the apo structures can induce the binding site observed in the holo structures. The cosolvent simulations were found to induce conformational changes on the protein surface, which were characterized and compared with the holo structures. Analyses of the biological systems, choice of probes and concentrations, druggability of the resulting induced pockets, and application to drug discovery are discussed here.
BiP clustering facilitates protein folding in the endoplasmic reticulum.

PubMed

Griesemer, Marc; Young, Carissa; Robinson, Anne S; Petzold, Linda

2014-07-01

The chaperone BiP participates in several regulatory processes within the endoplasmic reticulum (ER): translocation, protein folding, and ER-associated degradation. To facilitate protein folding, a cooperative mechanism known as entropic pulling has been proposed to demonstrate the molecular-level understanding of how multiple BiP molecules bind to nascent and unfolded proteins. Recently, experimental evidence revealed the spatial heterogeneity of BiP within the nuclear and peripheral ER of S. cerevisiae (commonly referred to as 'clusters'). Here, we developed a model to evaluate the potential advantages of accounting for multiple BiP molecules binding to peptides, while proposing that BiP's spatial heterogeneity may enhance protein folding and maturation. Scenarios were simulated to gauge the effectiveness of binding multiple chaperone molecules to peptides. Using two metrics: folding efficiency and chaperone cost, we determined that the single binding site model achieves a higher efficiency than models characterized by multiple binding sites, in the absence of cooperativity. Due to entropic pulling, however, multiple chaperones perform in concert to facilitate the resolubilization and ultimate yield of folded proteins. As a result of cooperativity, multiple binding site models used fewer BiP molecules and maintained a higher folding efficiency than the single binding site model. These insilico investigations reveal that clusters of BiP molecules bound to unfolded proteins may enhance folding efficiency through cooperative action via entropic pulling.
A protein block based fold recognition method for the annotation of twilight zone sequences.

PubMed

Suresh, V; Ganesan, K; Parthasarathy, S

2013-03-01

The description of protein backbone was recently improved with a group of structural fragments called Structural Alphabets instead of the regular three states (Helix, Sheet and Coil) secondary structure description. Protein Blocks is one of the Structural Alphabets used to describe each and every region of protein backbone including the coil. According to de Brevern (2000) the Protein Blocks has 16 structural fragments and each one has 5 residues in length. Protein Blocks fragments are highly informative among the available Structural Alphabets and it has been used for many applications. Here, we present a protein fold recognition method based on Protein Blocks for the annotation of twilight zone sequences. In our method, we align the predicted Protein Blocks of a query amino acid sequence with a library of assigned Protein Blocks of 953 known folds using the local pair-wise alignment. The alignment results with z-value ≥ 2.5 and P-value ≤ 0.08 are predicted as possible folds. Our method is able to recognize the possible folds for nearly 35.5% of the twilight zone sequences with their predicted Protein Block sequence obtained by pb_prediction, which is available at Protein Block Export server.
Solvent Effects on Protein Folding/Unfolding

NASA Astrophysics Data System (ADS)

García, A. E.; Hillson, N.; Onuchic, J. N.

Pressure effects on the hydrophobic potential of mean force led Hummer et al. to postulate a model for pressure denaturation of proteins in which denaturation occurs by means of water penetration into the protein interior, rather than by exposing the protein hydrophobic core to the solvent --- commonly used to describe temperature denaturation. We study the effects of pressure in protein folding/unfolding kinetics in an off-lattice minimalist model of a protein in which pressure effects have been incorporated by means of the pair-wise potential of mean force of hydrophobic groups in water. We show that pressure slows down the kinetics of folding by decreasing the reconfigurational diffusion coefficient and moves the location of the folding transition state.
Oxidative Folding and N-terminal Cyclization of Onconase+

PubMed Central

Welker, Ervin; Hathaway, Laura; Xu, Guoqiang; Narayan, Mahesh; Pradeep, Lovy; Shin, Hang-Cheol; Scheraga, Harold A.

2008-01-01

Cyclization of the N-terminal glutamine residue to pyroglutamic acid in onconase, an anti-cancer chemotherapeutic agent, increases the activity and stability of the protein. Here, we examine the correlated effects of the folding/unfolding process and the formation of this N-terminal pyroglutamic acid. The results in this study indicate that cyclization of the N-terminal glutamine has no significant effect on the rate of either reductive unfolding or oxidative folding of the protein. Both the cyclized and uncyclized proteins seem to follow the same oxidative folding pathways; however, cyclization altered the relative flux of the protein in these two pathways by increasing the rate of formation of a kinetically trapped intermediate. Glutaminyl cyclase (QC) catalyzed the cyclization of the unfolded, reduced protein, but had no effect on the disulfide-intact, uncyclized, folded protein. The structured intermediates of uncyclized onconase were also resistant to QC-catalysis, consistent with their having a native-like fold. These observations suggest that, in vivo, cyclization takes place during the initial stages of oxidative folding, specifically, before the formation of structured intermediates. The competition between oxidative folding and QC-mediated cyclization suggests that QC-catalyzed cyclization of the N-terminal glutamine in onconase occurs in the endoplasmic reticulum, probably co-translationally. PMID:17439243
Hierarchy of folding and unfolding events of protein G, CI2, and ACBP from explicit-solvent simulations

NASA Astrophysics Data System (ADS)

Camilloni, Carlo; Broglia, Ricardo A.; Tiana, Guido

2011-01-01

The study of the mechanism which is at the basis of the phenomenon of protein folding requires the knowledge of multiple folding trajectories under biological conditions. Using a biasing molecular-dynamics algorithm based on the physics of the ratchet-and-pawl system, we carry out all-atom, explicit solvent simulations of the sequence of folding events which proteins G, CI2, and ACBP undergo in evolving from the denatured to the folded state. Starting from highly disordered conformations, the algorithm allows the proteins to reach, at the price of a modest computational effort, nativelike conformations, within a root mean square deviation (RMSD) of approximately 1 Å. A scheme is developed to extract, from the myriad of events, information concerning the sequence of native contact formation and of their eventual correlation. Such an analysis indicates that all the studied proteins fold hierarchically, through pathways which, although not deterministic, are well-defined with respect to the order of contact formation. The algorithm also allows one to study unfolding, a process which looks, to a large extent, like the reverse of the major folding pathway. This is also true in situations in which many pathways contribute to the folding process, like in the case of protein G.
Robustness of atomistic Gō models in predicting native-like folding intermediates

NASA Astrophysics Data System (ADS)

Estácio, S. G.; Fernandes, C. S.; Krobath, H.; Faísca, P. F. N.; Shakhnovich, E. I.

2012-08-01

Gō models are exceedingly popular tools in computer simulations of protein folding. These models are native-centric, i.e., they are directly constructed from the protein's native structure. Therefore, it is important to understand up to which extent the atomistic details of the native structure dictate the folding behavior exhibited by Gō models. Here we address this challenge by performing exhaustive discrete molecular dynamics simulations of a Gō potential combined with a full atomistic protein representation. In particular, we investigate the robustness of this particular type of Gō models in predicting the existence of intermediate states in protein folding. We focus on the N47G mutational form of the Spc-SH3 folding domain (x-ray structure) and compare its folding pathway with that of alternative native structures produced in silico. Our methodological strategy comprises equilibrium folding simulations, structural clustering, and principal component analysis.
Ubiquitin-dependent Protein Degradation at the Yeast Endoplasmic Reticulum and Nuclear Envelope

PubMed Central

Zattas, Dimitrios; Hochstrasser, Mark

2014-01-01

The endoplasmic reticulum (ER) is the primary organelle in eukaryotic cells where membrane and secreted proteins are inserted into or across cell membranes. Its membrane bilayer and luminal compartments provide a favorable environment for the folding and assembly of thousands of newly synthesized proteins. However, protein folding is intrinsically error-prone, and various stress conditions can further increase levels of protein misfolding and damage, particularly in the ER, which can lead to cellular dysfunction and disease. The ubiquitin-proteasome system (UPS) is responsible for the selective destruction of a vast array of protein substrates, either for protein quality control or to allow rapid changes in the levels of specific regulatory proteins. In this review, we will focus on the components and mechanisms of ER-associated protein degradation (ERAD), an important branch of the UPS. ER membranes extend from subcortical regions of the cell to the nuclear envelope, with its continuous outer and inner membranes; the nuclear envelope is a specialized subdomain of the ER. ERAD presents additional challenges to the UPS beyond those faced with soluble substrates of the cytoplasm and nucleus. These include recognition of sugar modifications that occur in the ER, retrotranslocation of proteins across the membrane bilayer, and transfer of substrates from the ER extraction machinery to the proteasome. Here we review characteristics of ERAD substrate degradation signals (degrons), mechanisms underlying substrate recognition and processing by the ERAD machinery, and ideas on the still unresolved problem of how substrate proteins are moved across and extracted from the ER membrane. PMID:25231236
Effective Potentials for Folding Proteins

NASA Astrophysics Data System (ADS)

Chen, Nan-Yow; Su, Zheng-Yao; Mou, Chung-Yu

2006-02-01

A coarse-grained off-lattice model that is not biased in any way to the native state is proposed to fold proteins. To predict the native structure in a reasonable time, the model has included the essential effects of water in an effective potential. Two new ingredients, the dipole-dipole interaction and the local hydrophobic interaction, are introduced and are shown to be as crucial as the hydrogen bonding. The model allows successful folding of the wild-type sequence of protein G and may have provided important hints to the study of protein folding.
From laws of inference to protein folding dynamics.

PubMed

Tseng, Chih-Yuan; Yu, Chun-Ping; Lee, H C

2010-08-01

Protein folding dynamics is one of major issues constantly investigated in the study of protein functions. The molecular dynamic (MD) simulation with the replica exchange method (REM) is a common theoretical approach considered. Yet a trade-off in applying the REM is that the dynamics toward the native configuration in the simulations seems lost. In this work, we show that given REM-MD simulation results, protein folding dynamics can be directly derived from laws of inference. The applicability of the resulting approach, the entropic folding dynamics, is illustrated by investigating a well-studied Trp-cage peptide. Our results are qualitatively comparable with those from other studies. The current studies suggest that the incorporation of laws of inference and physics brings in a comprehensive perspective on exploring the protein folding dynamics.
Universality and diversity of folding mechanics for three-helix bundle proteins.

PubMed

Yang, Jae Shick; Wallin, Stefan; Shakhnovich, Eugene I

2008-01-22

In this study we evaluate, at full atomic detail, the folding processes of two small helical proteins, the B domain of protein A and the Villin headpiece. Folding kinetics are studied by performing a large number of ab initio Monte Carlo folding simulations using a single transferable all-atom potential. Using these trajectories, we examine the relaxation behavior, secondary structure formation, and transition-state ensembles (TSEs) of the two proteins and compare our results with experimental data and previous computational studies. To obtain a detailed structural information on the folding dynamics viewed as an ensemble process, we perform a clustering analysis procedure based on graph theory. Moreover, rigorous p(fold) analysis is used to obtain representative samples of the TSEs and a good quantitative agreement between experimental and simulated Phi values is obtained for protein A. Phi values for Villin also are obtained and left as predictions to be tested by future experiments. Our analysis shows that the two-helix hairpin is a common partially stable structural motif that gets formed before entering the TSE in the studied proteins. These results together with our earlier study of Engrailed Homeodomain and recent experimental studies provide a comprehensive, atomic-level picture of folding mechanics of three-helix bundle proteins.
Analysis of the Free-Energy Surface of Proteins from Reversible Folding Simulations

PubMed Central

Allen, Lucy R.; Krivov, Sergei V.; Paci, Emanuele

2009-01-01

Computer generated trajectories can, in principle, reveal the folding pathways of a protein at atomic resolution and possibly suggest general and simple rules for predicting the folded structure of a given sequence. While such reversible folding trajectories can only be determined ab initio using all-atom transferable force-fields for a few small proteins, they can be determined for a large number of proteins using coarse-grained and structure-based force-fields, in which a known folded structure is by construction the absolute energy and free-energy minimum. Here we use a model of the fast folding helical λ-repressor protein to generate trajectories in which native and non-native states are in equilibrium and transitions are accurately sampled. Yet, representation of the free-energy surface, which underlies the thermodynamic and dynamic properties of the protein model, from such a trajectory remains a challenge. Projections over one or a small number of arbitrarily chosen progress variables often hide the most important features of such surfaces. The results unequivocally show that an unprojected representation of the free-energy surface provides important and unbiased information and allows a simple and meaningful description of many-dimensional, heterogeneous trajectories, providing new insight into the possible mechanisms of fast-folding proteins. PMID:19593364
Analysis of the free-energy surface of proteins from reversible folding simulations.

PubMed

Allen, Lucy R; Krivov, Sergei V; Paci, Emanuele

2009-07-01

Computer generated trajectories can, in principle, reveal the folding pathways of a protein at atomic resolution and possibly suggest general and simple rules for predicting the folded structure of a given sequence. While such reversible folding trajectories can only be determined ab initio using all-atom transferable force-fields for a few small proteins, they can be determined for a large number of proteins using coarse-grained and structure-based force-fields, in which a known folded structure is by construction the absolute energy and free-energy minimum. Here we use a model of the fast folding helical lambda-repressor protein to generate trajectories in which native and non-native states are in equilibrium and transitions are accurately sampled. Yet, representation of the free-energy surface, which underlies the thermodynamic and dynamic properties of the protein model, from such a trajectory remains a challenge. Projections over one or a small number of arbitrarily chosen progress variables often hide the most important features of such surfaces. The results unequivocally show that an unprojected representation of the free-energy surface provides important and unbiased information and allows a simple and meaningful description of many-dimensional, heterogeneous trajectories, providing new insight into the possible mechanisms of fast-folding proteins.
Roles of β-Turns in Protein Folding: From Peptide Models to Protein Engineering

PubMed Central

Marcelino, Anna Marie C.; Gierasch, Lila M.

2010-01-01

Reverse turns are a major class of protein secondary structure; they represent sites of chain reversal and thus sites where the globular character of a protein is created. It has been speculated for many years that turns may nucleate the formation of structure in protein folding, as their propensity to occur will favor the approximation of their flanking regions and their general tendency to be hydrophilic will favor their disposition at the solvent-accessible surface. Reverse turns are local features, and it is therefore not surprising that their structural properties have been extensively studied using peptide models. In this article, we review research on peptide models of turns to test the hypothesis that the propensities of turns to form in short peptides will relate to the roles of corresponding sequences in protein folding. Turns with significant stability as isolated entities should actively promote the folding of a protein, and by contrast, turn sequences that merely allow the chain to adopt conformations required for chain reversal are predicted to be passive in the folding mechanism. We discuss results of protein engineering studies of the roles of turn residues in folding mechanisms. Factors that correlate with the importance of turns in folding indeed include their intrinsic stability, as well as their topological context and their participation in hydrophobic networks within the protein’s structure. PMID:18275088
How cooperative are protein folding and unfolding transitions?

PubMed Central

Malhotra, Pooja

2016-01-01

Abstract A thermodynamically and kinetically simple picture of protein folding envisages only two states, native (N) and unfolded (U), separated by a single activation free energy barrier, and interconverting by cooperative two‐state transitions. The folding/unfolding transitions of many proteins occur, however, in multiple discrete steps associated with the formation of intermediates, which is indicative of reduced cooperativity. Furthermore, much advancement in experimental and computational approaches has demonstrated entirely non‐cooperative (gradual) transitions via a continuum of states and a multitude of small energetic barriers between the N and U states of some proteins. These findings have been instrumental towards providing a structural rationale for cooperative versus noncooperative transitions, based on the coupling between interaction networks in proteins. The cooperativity inherent in a folding/unfolding reaction appears to be context dependent, and can be tuned via experimental conditions which change the stabilities of N and U. The evolution of cooperativity in protein folding transitions is linked closely to the evolution of function as well as the aggregation propensity of the protein. A large activation energy barrier in a fully cooperative transition can provide the kinetic control required to prevent the accumulation of partially unfolded forms, which may promote aggregation. Nevertheless, increasing evidence for barrier‐less “downhill” folding, as well as for continuous “uphill” unfolding transitions, indicate that gradual non‐cooperative processes may be ubiquitous features on the free energy landscape of protein folding. PMID:27522064
Studies of protein-protein and protein-water interactions by small angle x-ray scattering, terahertz spectroscopy, ASMOS, and computer simulation

NASA Astrophysics Data System (ADS)

Kim, Seung Joong

The protein folding problem has been one of the most challenging subjects in biological physics due to its complexity. Energy landscape theory based on statistical mechanics provides a thermodynamic interpretation of the protein folding process. We have been working to answer fundamental questions about protein-protein and protein-water interactions, which are very important for describing the energy landscape surface of proteins correctly. At first, we present a new method for computing protein-protein interaction potentials of solvated proteins directly from SAXS data. An ensemble of proteins was modeled by Metropolis Monte Carlo and Molecular Dynamics simulations, and the global X-ray scattering of the whole model ensemble was computed at each snapshot of the simulation. The interaction potential model was optimized and iterated by a Levenberg-Marquardt algorithm. Secondly, we report that terahertz spectroscopy directly probes hydration dynamics around proteins and determines the size of the dynamical hydration shell. We also present the sequence and pH-dependence of the hydration shell and the effect of the hydrophobicity. On the other hand, kinetic terahertz absorption (KITA) spectroscopy is introduced to study the refolding kinetics of ubiquitin and its mutants. KITA results are compared to small angle X-ray scattering, tryptophan fluorescence, and circular dichroism results. We propose that KITA monitors the rearrangement of hydrogen bonding during secondary structure formation. Finally, we present development of the automated single molecule operating system (ASMOS) for a high throughput single molecule detector, which levitates a single protein molecule in a 10 microm diameter droplet by the laser guidance. I also have performed supporting calculations and simulations with my own program codes.
Achieving Rigorous Accelerated Conformational Sampling in Explicit Solvent.

PubMed

Doshi, Urmi; Hamelberg, Donald

2014-04-03

Molecular dynamics simulations can provide valuable atomistic insights into biomolecular function. However, the accuracy of molecular simulations on general-purpose computers depends on the time scale of the events of interest. Advanced simulation methods, such as accelerated molecular dynamics, have shown tremendous promise in sampling the conformational dynamics of biomolecules, where standard molecular dynamics simulations are nonergodic. Here we present a sampling method based on accelerated molecular dynamics in which rotatable dihedral angles and nonbonded interactions are boosted separately. This method (RaMD-db) is a different implementation of the dual-boost accelerated molecular dynamics, introduced earlier. The advantage is that this method speeds up sampling of the conformational space of biomolecules in explicit solvent, as the degrees of freedom most relevant for conformational transitions are accelerated. We tested RaMD-db on one of the most difficult sampling problems - protein folding. Starting from fully extended polypeptide chains, two fast folding α-helical proteins (Trpcage and the double mutant of C-terminal fragment of Villin headpiece) and a designed β-hairpin (Chignolin) were completely folded to their native structures in very short simulation time. Multiple folding/unfolding transitions could be observed in a single trajectory. Our results show that RaMD-db is a promisingly fast and efficient sampling method for conformational transitions in explicit solvent. RaMD-db thus opens new avenues for understanding biomolecular self-assembly and functional dynamics occurring on long time and length scales.
Role of naturally occurring osmolytes in protein folding and stability.

PubMed

Kumar, Raj

2009-11-01

Osmolytes are typically accumulated in the intracellular environment at relatively high concentrations when cells/tissues are subjected to stress conditions. Osmolytes are common in a variety of organisms, including microorganisms, plants, and animals. They enhance thermodynamic stability of proteins by providing natively folded conformations without perturbing other cellular processes. By burying the backbone into the core of folded proteins, osmolytes can provide significant stability to proteins. Two properties of osmolytes are particularly important: (i) their ability to impart increased thermodynamic stability to folded proteins; and (ii) their compatibility in the intracellular environment at high concentrations. Under physiological conditions, the cellular compositions of osmolytes may vary significantly. This may lead to different protein folding pathways utilized in cells depending upon the intracellular environment. Proper understanding of the role of osmolytes in cell regulation should allow predicting the action of osmolytes on macromolecular interactions in stressed and crowded environments typical of cellular conditions.
The "Transport Specificity Ratio": a structure-function tool to search the protein fold for loci that control transition state stability in membrane transport catalysis

PubMed Central

King, Steven C

2004-01-01

Background In establishing structure-function relationships for membrane transport proteins, the interpretation of phenotypic changes can be problematic, owing to uncertainties in protein expression levels, sub-cellular localization, and protein-folding fidelity. A dual-label competitive transport assay called "Transport Specificity Ratio" (TSR) analysis has been developed that is simple to perform, and circumvents the "expression problem," providing a reliable TSR phenotype (a constant) for comparison to other transporters. Results Using the Escherichia coli GABA (4-aminobutyrate) permease (GabP) as a model carrier, it is demonstrated that the TSR phenotype is largely independent of assay conditions, exhibiting: (i) indifference to the particular substrate concentrations used, (ii) indifference to extreme changes (40-fold) in transporter expression level, and within broad limits (iii) indifference to assay duration. The theoretical underpinnings of TSR analysis predict all of the above observations, supporting that TSR has (i) applicability in the analysis of membrane transport, and (ii) particular utility in the face of incomplete information on protein expression levels and initial reaction rate intervals (e.g., in high-throughput screening situations). The TSR was used to identify gab permease (GabP) variants that exhibit relative changes in catalytic specificity (kcat/Km) for [14C]GABA (4-aminobutyrate) versus [3H]NA (nipecotic acid). Conclusions The TSR phenotype is an easily measured constant that reflects innate molecular properties of the transition state, and provides a reliable index of the difference in catalytic specificity that a carrier exhibits toward a particular pair of substrates. A change in the TSR phenotype, called a Δ(TSR), represents a specificity shift attributable to underlying changes in the intrinsic substrate binding energy (ΔGb) that translocation catalysts rely upon to decrease activation energy (). TSR analysis is therefore a structure-function tool that enables parsimonious scanning for positions in the protein fold that couple to the transition state, creating stability and thereby serving as functional determinants of catalytic power (efficiency, or specificity). PMID:15548327

Molecular chaperones and protein folding as therapeutic targets in Parkinson's disease and other synucleinopathies.

PubMed

Ebrahimi-Fakhari, Darius; Saidi, Laiq-Jan; Wahlster, Lara

2013-12-05

Changes in protein metabolism are key to disease onset and progression in many neurodegenerative diseases. As a prime example, in Parkinson's disease, folding, post-translational modification and recycling of the synaptic protein α-synuclein are clearly altered, leading to a progressive accumulation of pathogenic protein species and the formation of intracellular inclusion bodies. Altered protein folding is one of the first steps of an increasingly understood cascade in which α-synuclein forms complex oligomers and finally distinct protein aggregates, termed Lewy bodies and Lewy neurites. In neurons, an elaborated network of chaperone and co-chaperone proteins is instrumental in mediating protein folding and re-folding. In addition to their direct influence on client proteins, chaperones interact with protein degradation pathways such as the ubiquitin-proteasome-system or autophagy in order to ensure the effective removal of irreversibly misfolded and potentially pathogenic proteins. Because of the vital role of proper protein folding for protein homeostasis, a growing number of studies have evaluated the contribution of chaperone proteins to neurodegeneration. We herein review our current understanding of the involvement of chaperones, co-chaperones and chaperone-mediated autophagy in synucleinopathies with a focus on the Hsp90 and Hsp70 chaperone system. We discuss genetic and pathological studies in Parkinson's disease as well as experimental studies in models of synucleinopathies that explore molecular chaperones and protein degradation pathways as a novel therapeutic target. To this end, we examine the capacity of chaperones to prevent or modulate neurodegeneration and summarize the current progress in models of Parkinson's disease and related neurodegenerative disorders.
Picosecond to nanosecond dynamics provide a source of conformational entropy for protein folding.

PubMed

Stadler, Andreas M; Demmel, Franz; Ollivier, Jacques; Seydel, Tilo

2016-08-03

Myoglobin can be trapped in fully folded structures, partially folded molten globules, and unfolded states under stable equilibrium conditions. Here, we report an experimental study on the conformational dynamics of different folded conformational states of apo- and holomyoglobin in solution. Global protein diffusion and internal molecular motions were probed by neutron time-of-flight and neutron backscattering spectroscopy on the picosecond and nanosecond time scales. Global protein diffusion was found to depend on the α-helical content of the protein suggesting that charges on the macromolecule increase the short-time diffusion of protein. With regard to the molten globules, a gel-like phase due to protein entanglement and interactions with neighbouring macromolecules was visible due to a reduction of the global diffusion coefficients on the nanosecond time scale. Diffusion coefficients, residence and relaxation times of internal protein dynamics and root mean square displacements of localised internal motions were determined for the investigated structural states. The difference in conformational entropy ΔSconf of the protein between the unfolded and the partially or fully folded conformations was extracted from the measured root mean square displacements. Using thermodynamic parameters from the literature and the experimentally determined ΔSconf values we could identify the entropic contribution of the hydration shell ΔShydr of the different folded states. Our results point out the relevance of conformational entropy of the protein and the hydration shell for stability and folding of myoglobin.
Broadly Neutralizing Immune Responses against Hepatitis C Virus Induced by Vectored Measles Viruses and a Recombinant Envelope Protein Booster

PubMed Central

Reyes-del Valle, Jorge; de la Fuente, Cynthia; Turner, Mallory A.; Springfeld, Christoph; Apte-Sengupta, Swapna; Frenzke, Marie E.; Forest, Amelie; Whidby, Jillian; Marcotrigiano, Joseph; Rice, Charles M.

2012-01-01

Hepatitis C virus (HCV) infection remains a serious public health problem worldwide. Treatments are limited, and no preventive vaccine is available. Toward developing an HCV vaccine, we engineered two recombinant measles viruses (MVs) expressing structural proteins from the prototypic HCV subtype 1a strain H77. One virus directs the synthesis of the HCV capsid (C) protein and envelope glycoproteins (E1 and E2), which fold properly and form a heterodimer. The other virus expresses the E1 and E2 glycoproteins separately, with each one fused to the cytoplasmic tail of the MV fusion protein. Although these hybrid glycoproteins were transported to the plasma membrane, they were not incorporated into MV particles. Immunization of MV-susceptible, genetically modified mice with either vector induced neutralizing antibodies to MV and HCV. A boost with soluble E2 protein enhanced titers of neutralizing antibody against the homologous HCV envelope. In animals primed with MV expressing properly folded HCV C-E1-E2, boosting also induced cross-neutralizating antibodies against two heterologous HCV strains. These results show that recombinant MVs retain the ability to induce MV-specific humoral immunity while also eliciting HCV neutralizing antibodies, and that anti-HCV immunity can be boosted with a single dose of purified E2 protein. The use of MV vectors could have advantages for pediatric HCV vaccination. PMID:22896607
Comparison of successive transition states for folding reveals alternative early folding pathways of two homologous proteins

PubMed Central

Calosci, Nicoletta; Chi, Celestine N.; Richter, Barbara; Camilloni, Carlo; Engström, Åke; Eklund, Lars; Travaglini-Allocatelli, Carlo; Gianni, Stefano; Vendruscolo, Michele; Jemth, Per

2008-01-01

The energy landscape theory provides a general framework for describing protein folding reactions. Because a large number of studies, however, have focused on two-state proteins with single well-defined folding pathways and without detectable intermediates, the extent to which free energy landscapes are shaped up by the native topology at the early stages of the folding process has not been fully characterized experimentally. To this end, we have investigated the folding mechanisms of two homologous three-state proteins, PTP-BL PDZ2 and PSD-95 PDZ3, and compared the early and late transition states on their folding pathways. Through a combination of Φ value analysis and molecular dynamics simulations we obtained atomic-level structures of the transition states of these homologous three-state proteins and found that the late transition states are much more structurally similar than the early ones. Our findings thus reveal that, while the native state topology defines essentially in a unique way the late stages of folding, it leaves significant freedom to the early events, a result that reflects the funneling of the free energy landscape toward the native state. PMID:19033470
Inversion of the Balance between Hydrophobic and Hydrogen Bonding Interactions in Protein Folding and Aggregation

PubMed Central

Fitzpatrick, Anthony W.; Knowles, Tuomas P. J.; Waudby, Christopher A.; Vendruscolo, Michele; Dobson, Christopher M.

2011-01-01

Identifying the forces that drive proteins to misfold and aggregate, rather than to fold into their functional states, is fundamental to our understanding of living systems and to our ability to combat protein deposition disorders such as Alzheimer's disease and the spongiform encephalopathies. We report here the finding that the balance between hydrophobic and hydrogen bonding interactions is different for proteins in the processes of folding to their native states and misfolding to the alternative amyloid structures. We find that the minima of the protein free energy landscape for folding and misfolding tend to be respectively dominated by hydrophobic and by hydrogen bonding interactions. These results characterise the nature of the interactions that determine the competition between folding and misfolding of proteins by revealing that the stability of native proteins is primarily determined by hydrophobic interactions between side-chains, while the stability of amyloid fibrils depends more on backbone intermolecular hydrogen bonding interactions. PMID:22022239
Acceleration through passive destabilization: protein folding in a weak hydrophobic environment

NASA Astrophysics Data System (ADS)

Jewett, Andrew; Baumketner, Andrij; Shea, Joan-Emma

2004-03-01

The GroEL chaperonin is a biomolecule which assists the folding of an extremely diverse range of proteins in Eubacteria. Some proteins undergo many rounds of ATP-regulated binding and dissociation from GroEL/ES before folding. It has been proposed that transient stress from ATP-regulated binding and release from GroEL/ES frees frustrated proteins from misfolded conformations. However recent evidence suggests that chaperonin-accelerated protein folding can take place entirely within a mutated GroEL+ES cavity that is unable to open and release the protein. Using molecular dynamics, we demonstrate that static confinement within a weakly hydrophobic (attractive) cavity (similar to the interior of the cavity formed by the GroEL+ES complex) is sufficient to significantly accelerate the folding of a highly frustrated protein-like heteropolymer. Our frustrated molecule benifits kinetically from a static hydrophobic environment that destabilizes misfolded conformations. This may shed light on the mechanisms used by other chaperones which do not depend on ATP.
Folding energy landscape and network dynamics of small globular proteins

PubMed Central

Hori, Naoto; Chikenji, George; Berry, R. Stephen; Takada, Shoji

2009-01-01

The folding energy landscape of proteins has been suggested to be funnel-like with some degree of ruggedness on the slope. How complex the landscape, however, is still rather unclear. Many experiments for globular proteins suggested relative simplicity, whereas molecular simulations of shorter peptides implied more complexity. Here, by using complete conformational sampling of 2 globular proteins, protein G and src SH3 domain and 2 related random peptides, we investigated their energy landscapes, topological properties of folding networks, and folding dynamics. The projected energy surfaces of globular proteins were funneled in the vicinity of the native but also have other quite deep, accessible minima, whereas the randomized peptides have many local basins, including some leading to seriously misfolded forms. Dynamics in the denatured part of the network exhibited basin-hopping itinerancy among many conformations, whereas the protein reached relatively well-defined final stages that led to their native states. We also found that the folding network has the hierarchic nature characterized by the scale-free and the small-world properties. PMID:19114654
Folding energy landscape and network dynamics of small globular proteins.

PubMed

Hori, Naoto; Chikenji, George; Berry, R Stephen; Takada, Shoji

2009-01-06

The folding energy landscape of proteins has been suggested to be funnel-like with some degree of ruggedness on the slope. How complex the landscape, however, is still rather unclear. Many experiments for globular proteins suggested relative simplicity, whereas molecular simulations of shorter peptides implied more complexity. Here, by using complete conformational sampling of 2 globular proteins, protein G and src SH3 domain and 2 related random peptides, we investigated their energy landscapes, topological properties of folding networks, and folding dynamics. The projected energy surfaces of globular proteins were funneled in the vicinity of the native but also have other quite deep, accessible minima, whereas the randomized peptides have many local basins, including some leading to seriously misfolded forms. Dynamics in the denatured part of the network exhibited basin-hopping itinerancy among many conformations, whereas the protein reached relatively well-defined final stages that led to their native states. We also found that the folding network has the hierarchic nature characterized by the scale-free and the small-world properties.
Swellix: a computational tool to explore RNA conformational space.

PubMed

Sloat, Nathan; Liu, Jui-Wen; Schroeder, Susan J

2017-11-21

The sequence of nucleotides in an RNA determines the possible base pairs for an RNA fold and thus also determines the overall shape and function of an RNA. The Swellix program presented here combines a helix abstraction with a combinatorial approach to the RNA folding problem in order to compute all possible non-pseudoknotted RNA structures for RNA sequences. The Swellix program builds on the Crumple program and can include experimental constraints on global RNA structures such as the minimum number and lengths of helices from crystallography, cryoelectron microscopy, or in vivo crosslinking and chemical probing methods. The conceptual advance in Swellix is to count helices and generate all possible combinations of helices rather than counting and combining base pairs. Swellix bundles similar helices and includes improvements in memory use and efficient parallelization. Biological applications of Swellix are demonstrated by computing the reduction in conformational space and entropy due to naturally modified nucleotides in tRNA sequences and by motif searches in Human Endogenous Retroviral (HERV) RNA sequences. The Swellix motif search reveals occurrences of protein and drug binding motifs in the HERV RNA ensemble that do not occur in minimum free energy or centroid predicted structures. Swellix presents significant improvements over Crumple in terms of efficiency and memory use. The efficient parallelization of Swellix enables the computation of sequences as long as 418 nucleotides with sufficient experimental constraints. Thus, Swellix provides a practical alternative to free energy minimization tools when multiple structures, kinetically determined structures, or complex RNA-RNA and RNA-protein interactions are present in an RNA folding problem.
Hidden complexity of free energy surfaces for peptide (protein) folding.

PubMed

Krivov, Sergei V; Karplus, Martin

2004-10-12

An understanding of the thermodynamics and kinetics of protein folding requires a knowledge of the free energy surface governing the motion of the polypeptide chain. Because of the many degrees of freedom involved, surfaces projected on only one or two progress variables are generally used in descriptions of the folding reaction. Such projections result in relatively smooth surfaces, but they could mask the complexity of the unprojected surface. Here we introduce an approach to determine the actual (unprojected) free energy surface and apply it to the second beta-hairpin of protein G, which has been used as a model system for protein folding. The surface is represented by a disconnectivity graph calculated from a long equilibrium folding-unfolding trajectory. The denatured state is found to have multiple low free energy basins. Nevertheless, the peptide shows exponential kinetics in folding to the native basin. Projected surfaces obtained from the present analysis have a simple form in agreement with other studies of the beta-hairpin. The hidden complexity found for the beta-hairpin surface suggests that the standard funnel picture of protein folding should be revisited.
Evolutionary trend toward kinetic stability in the folding trajectory of RNases H

PubMed Central

Lim, Shion A.; Hart, Kathryn M.; Marqusee, Susan

2016-01-01

Proper folding of proteins is critical to producing the biological machinery essential for cellular function. The rates and energetics of a protein’s folding process, which is described by its energy landscape, are encoded in the amino acid sequence. Over the course of evolution, this landscape must be maintained such that the protein folds and remains folded over a biologically relevant time scale. How exactly a protein’s energy landscape is maintained or altered throughout evolution is unclear. To study how a protein’s energy landscape changed over time, we characterized the folding trajectories of ancestral proteins of the ribonuclease H (RNase H) family using ancestral sequence reconstruction to access the evolutionary history between RNases H from mesophilic and thermophilic bacteria. We found that despite large sequence divergence, the overall folding pathway is conserved over billions of years of evolution. There are robust trends in the rates of protein folding and unfolding; both modern RNases H evolved to be more kinetically stable than their most recent common ancestor. Finally, our study demonstrates how a partially folded intermediate provides a readily adaptable folding landscape by allowing the independent tuning of kinetics and thermodynamics. PMID:27799545
Detecting Selection on Protein Stability through Statistical Mechanical Models of Folding and Evolution

PubMed Central

Bastolla, Ugo

2014-01-01

The properties of biomolecules depend both on physics and on the evolutionary process that formed them. These two points of view produce a powerful synergism. Physics sets the stage and the constraints that molecular evolution has to obey, and evolutionary theory helps in rationalizing the physical properties of biomolecules, including protein folding thermodynamics. To complete the parallelism, protein thermodynamics is founded on the statistical mechanics in the space of protein structures, and molecular evolution can be viewed as statistical mechanics in the space of protein sequences. In this review, we will integrate both points of view, applying them to detecting selection on the stability of the folded state of proteins. We will start discussing positive design, which strengthens the stability of the folded against the unfolded state of proteins. Positive design justifies why statistical potentials for protein folding can be obtained from the frequencies of structural motifs. Stability against unfolding is easier to achieve for longer proteins. On the contrary, negative design, which consists in destabilizing frequently formed misfolded conformations, is more difficult to achieve for longer proteins. The folding rate can be enhanced by strengthening short-range native interactions, but this requirement contrasts with negative design, and evolution has to trade-off between them. Finally, selection can accelerate functional movements by favoring low frequency normal modes of the dynamics of the native state that strongly correlate with the functional conformation change. PMID:24970217
Coordinating Subdomains of Ferritin Protein Cages with Catalysis and Biomineralization viewed from the C4 Cage Axes

PubMed Central

Theil, Elizabeth C.; Turano, Paola; Ghini, Veronica; Allegrozzi, Marco; Bernacchioni, Caterina

2014-01-01

Integrated ferritin protein cage function is the reversible synthesis of protein-caged, solid Fe2O3•H2O minerals from Fe2+, for metabolic iron concentrates and oxidant protection; biomineral order varies in different ferritin proteins. The conserved 4, 3, 2 geometric symmetry of ferritin protein cages, parallels subunit dimer, trimer and tetramer interfaces, and coincides with function at several cage axes. Multiple subdomains distributed in the self- assembling ferritin nanocages have functional relationships to cage symmetry such as Fe2+ transport though ion channels (3-fold symmetry), biomineral nucleation/order (4-fold symmetry) and mineral dissolution (3-fold symmetry) studied in ferritin variants. Cage subunit dimers (2-fold symmetry) influence iron oxidation and mineral dissolution, based on effects of natural or synthetic subunit dimer crosslinks. 2Fe2+/O2 catalysis in ferritin occurs in single subunits, but with cooperativity (n=3) that is possibly related to the structure/function of the ion channels, which are constructed from segments of 3 subunits. Here, we study 2Fe2+ + O2 protein catalysis (diferric peroxo formation) and dissolution of ferritin Fe2O3•H2O biominerals in variants with altered subunit interfaces for trimers (ion channels), E130I, and external dimer surfaces (E88A) as controls, and altered tetramer subunit interfaces (L165I and H169F). The results extend observations on the functional importance of structure at ferritin protein 2-fold and 3-fold cage axes to show function at ferritin 4-fold cage axes. Here, conserved amino acids facilitate dissolution of ferritin protein-caged iron biominerals. Biological and nanotechnological uses of ferritin protein cage 4-fold symmetry and solid state mineral properties remain largely unexplored. PMID:24504941
Calculating Free Energies Using Scaled-Force Molecular Dynamics Algorithm

NASA Technical Reports Server (NTRS)

Darve, Eric; Wilson, Micahel A.; Pohorille, Andrew

2000-01-01

One common objective of molecular simulations in chemistry and biology is to calculate the free energy difference between different states of the system of interest. Examples of problems that have such an objective are calculations of receptor-ligand or protein-drug interactions, associations of molecules in response to hydrophobic, and electrostatic interactions or partition of molecules between immiscible liquids. Another common objective is to describe evolution of the system towards a low energy (possibly the global minimum energy), 'native' state. Perhaps the best example of such a problem is folding of proteins or short RNA molecules. Both types of problems share the same difficulty. Often, different states of the system are separated by high energy barriers, which implies that transitions between these states are rare events. This, in turn, can greatly impede exploration of phase space. In some instances this can lead to 'quasi non-ergodicity', whereby a part of phase space is inaccessible on timescales of the simulation. A host of strategies has been developed to improve efficiency of sampling the phase space. For example, some Monte Carlo techniques involve large steps which move the system between low-energy regions in phase space without the need for sampling the configurations corresponding to energy barriers (J-walking). Most strategies, however, rely on modifying probabilities of sampling low and high-energy regions in phase space such that transitions between states of interest are encouraged. Perhaps the simplest implementation of this strategy is to increase the temperature of the system. This approach was successfully used to identify denaturation pathways in several proteins, but it is clearly not applicable to protein folding. It is also not a successful method for determining free energy differences. Finally, the approach is likely to fail for systems with co-existing phases, such as water-membrane systems, because it may lead to spontaneous mixing. A similar difficulty may be encountered in any method relying on global modifications of phase space.
Computational intelligence techniques for biological data mining: An overview

NASA Astrophysics Data System (ADS)

Faye, Ibrahima; Iqbal, Muhammad Javed; Said, Abas Md; Samir, Brahim Belhaouari

2014-10-01

Computational techniques have been successfully utilized for a highly accurate analysis and modeling of multifaceted and raw biological data gathered from various genome sequencing projects. These techniques are proving much more effective to overcome the limitations of the traditional in-vitro experiments on the constantly increasing sequence data. However, most critical problems that caught the attention of the researchers may include, but not limited to these: accurate structure and function prediction of unknown proteins, protein subcellular localization prediction, finding protein-protein interactions, protein fold recognition, analysis of microarray gene expression data, etc. To solve these problems, various classification and clustering techniques using machine learning have been extensively used in the published literature. These techniques include neural network algorithms, genetic algorithms, fuzzy ARTMAP, K-Means, K-NN, SVM, Rough set classifiers, decision tree and HMM based algorithms. Major difficulties in applying the above algorithms include the limitations found in the previous feature encoding and selection methods while extracting the best features, increasing classification accuracy and decreasing the running time overheads of the learning algorithms. The application of this research would be potentially useful in the drug design and in the diagnosis of some diseases. This paper presents a concise overview of the well-known protein classification techniques.
Site-directed protein recombination as a shortest-path problem.

PubMed

Endelman, Jeffrey B; Silberg, Jonathan J; Wang, Zhen-Gang; Arnold, Frances H

2004-07-01

Protein function can be tuned using laboratory evolution, in which one rapidly searches through a library of proteins for the properties of interest. In site-directed recombination, n crossovers are chosen in an alignment of p parents to define a set of p(n + 1) peptide fragments. These fragments are then assembled combinatorially to create a library of p(n+1) proteins. We have developed a computational algorithm to enrich these libraries in folded proteins while maintaining an appropriate level of diversity for evolution. For a given set of parents, our algorithm selects crossovers that minimize the average energy of the library, subject to constraints on the length of each fragment. This problem is equivalent to finding the shortest path between nodes in a network, for which the global minimum can be found efficiently. Our algorithm has a running time of O(N(3)p(2) + N(2)n) for a protein of length N. Adjusting the constraints on fragment length generates a set of optimized libraries with varying degrees of diversity. By comparing these optima for different sets of parents, we rapidly determine which parents yield the lowest energy libraries.
The IRE1/bZIP60 pathway are activated by potexvirus and potyvirus small membrane binding proteins

USDA-ARS?s Scientific Manuscript database

The endoplasmic reticulum provides an environment for protein synthesis, folding and distribution to all corners of the cell. With respect to protein synthesis and folding, quality production is central to maintaining homeostasis. When conditions occur that disrupt the folding capacity of the ER cau...
Principles of protein folding--a perspective from simple exact models.

PubMed Central

Dill, K. A.; Bromberg, S.; Yue, K.; Fiebig, K. M.; Yee, D. P.; Thomas, P. D.; Chan, H. S.

1995-01-01

General principles of protein structure, stability, and folding kinetics have recently been explored in computer simulations of simple exact lattice models. These models represent protein chains at a rudimentary level, but they involve few parameters, approximations, or implicit biases, and they allow complete explorations of conformational and sequence spaces. Such simulations have resulted in testable predictions that are sometimes unanticipated: The folding code is mainly binary and delocalized throughout the amino acid sequence. The secondary and tertiary structures of a protein are specified mainly by the sequence of polar and nonpolar monomers. More specific interactions may refine the structure, rather than dominate the folding code. Simple exact models can account for the properties that characterize protein folding: two-state cooperativity, secondary and tertiary structures, and multistage folding kinetics--fast hydrophobic collapse followed by slower annealing. These studies suggest the possibility of creating "foldable" chain molecules other than proteins. The encoding of a unique compact chain conformation may not require amino acids; it may require only the ability to synthesize specific monomer sequences in which at least one monomer type is solvent-averse. PMID:7613459
Unfolding the chaperone story.

PubMed

Hartl, F Ulrich

2017-11-01

Protein folding in the cell was originally assumed to be a spontaneous process, based on Anfinsen's discovery that purified proteins can fold on their own after removal from denaturant. Consequently cell biologists showed little interest in the protein folding process. This changed only in the mid and late 1980s, when the chaperone story began to unfold. As a result, we now know that in vivo, protein folding requires assistance by a complex machinery of molecular chaperones. To ensure efficient folding, members of different chaperone classes receive the nascent protein chain emerging from the ribosome and guide it along an ordered pathway toward the native state. I was fortunate to contribute to these developments early on. In this short essay, I will describe some of the critical steps leading to the current concept of protein folding as a highly organized cellular process. © 2017 Hartl. This article is distributed by The American Society for Cell Biology under license from the author(s). Two months after publication it is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).
Transiently disordered tails accelerate folding of globular proteins.

PubMed

Mallik, Saurav; Ray, Tanaya; Kundu, Sudip

2017-07-01

Numerous biological proteins exhibit intrinsic disorder at their termini, which are associated with multifarious functional roles. Here, we show the surprising result that an increased percentage of terminal short transiently disordered regions with enhanced flexibility (TstDREF) is associated with accelerated folding rates of globular proteins. Evolutionary conservation of predicted disorder at TstDREFs and drastic alteration of folding rates upon point-mutations suggest critical regulatory role(s) of TstDREFs in shaping the folding kinetics. TstDREFs are associated with long-range intramolecular interactions and the percentage of native secondary structural elements physically contacted by TstDREFs exhibit another surprising positive correlation with folding kinetics. These results allow us to infer probable molecular mechanisms behind the TstDREF-mediated regulation of folding kinetics that challenge protein biochemists to assess by direct experimental testing. © 2017 Federation of European Biochemical Societies.

Evolution of the arginase fold and functional diversity

PubMed Central

Dowling, Daniel P.; Costanzo, Luigi Di; Gennadios, Heather A.; Christianson, David W.

2009-01-01

The large number of protein structures deposited in the Protein Data Bank allows for the identification of novel structural superfamilies based on conservation of fold in addition to conservation of amino acid sequence. Since sequence diverges more rapidly than fold in protein evolution, proteins with little or no significant sequence identity are occasionally observed to adopt similar folds, thereby reflecting unanticipated evolutionary relationships. Here, we review the unique α/β fold first observed in the manganese metalloenzyme rat liver arginase, consisting of a parallel 8 stranded β-sheet surrounded by several helices, and its evolutionary relationship with the zinc-requiring and/or iron-requiring histone deacetylases and acetylpolyamine amidohydrolases. Structural comparisons reveal key features of the core α/β fold that contribute to the divergent metal ion specificity and stoichiometry required for the chemical and biological functions of these enzymes. PMID:18360740
Transient intermediates are populated in the folding pathways of single-domain two-state folding protein L

NASA Astrophysics Data System (ADS)

Maity, Hiranmay; Reddy, Govardhan

2018-04-01

Small single-domain globular proteins, which are believed to be dominantly two-state folders, played an important role in elucidating various aspects of the protein folding mechanism. However, recent single molecule fluorescence resonance energy transfer experiments [H. Y. Aviram et al. J. Chem. Phys. 148, 123303 (2018)] on a single-domain two-state folding protein L showed evidence for the population of an intermediate state and it was suggested that in this state, a β-hairpin present near the C-terminal of the native protein state is unfolded. We performed molecular dynamics simulations using a coarse-grained self-organized-polymer model with side chains to study the folding pathways of protein L. In agreement with the experiments, an intermediate is populated in the simulation folding pathways where the C-terminal β-hairpin detaches from the rest of the protein structure. The lifetime of this intermediate structure increased with the decrease in temperature. In low temperature conditions, we also observed a second intermediate state, which is globular with a significant fraction of the native-like tertiary contacts satisfying the features of a dry molten globule.
Engineering of protein folding and secretion-strategies to overcome bottlenecks for efficient production of recombinant proteins.

PubMed

Delic, Marizela; Göngrich, Rebecca; Mattanovich, Diethard; Gasser, Brigitte

2014-07-20

Recombinant protein production has developed into a huge market with enormous positive implications for human health and for the future direction of a biobased economy. Limitations in the economic and technical feasibility of production processes are often related to bottlenecks of in vivo protein folding. Based on cell biological knowledge, some major bottlenecks have been overcome by the overexpression of molecular chaperones and other folding related proteins, or by the deletion of deleterious pathways that may lead to misfolding, mistargeting, or degradation. While important success could be achieved by this strategy, the list of reported unsuccessful cases is disappointingly long and obviously dependent on the recombinant protein to be produced. Singular engineering of protein folding steps may not lead to desired results if the pathway suffers from several limitations. In particular, the connection between folding quality control and proteolytic degradation needs further attention. Based on recent understanding that multiple steps in the folding and secretion pathways limit productivity, synergistic combinations of the cell engineering approaches mentioned earlier need to be explored. In addition, systems biology-based whole cell analysis that also takes energy and redox metabolism into consideration will broaden the knowledge base for future rational engineering strategies.
Characterization of Folding Mechanisms of Trp-cage and WW-domain by Network Analysis of Simulations with a Hybrid-resolution Model

PubMed Central

Han, Wei; Schulten, Klaus

2013-01-01

In this study, we apply a hybrid-resolution model, namely PACE, to characterize the free energy surfaces (FESs) of trp-cage and a WW domain variant along with the respective folding mechanisms. Unbiased, independent simulations with PACE are found to achieve together multiple folding and unfolding events for both proteins, allowing us to perform network analysis of the FESs to identify folding pathways. PACE reproduces for both proteins expected complexity hidden in the folding FESs, in particular, meta-stable non-native intermediates. Pathway analysis shows that some of these intermediates are, actually, on-pathway folding intermediates and that intermediates kinetically closest to the native states can be either critical on-pathway or off-pathway intermediates, depending on the protein. Apart from general insights into folding, specific folding mechanisms of the proteins are resolved. We find that trp-cage folds via a dominant pathway in which hydrophobic collapse occurs before the N-terminal helix forms; full incorporation of Trp6 into the hydrophobic core takes place as the last step of folding, which, however, may not be the rate-limiting step. For the WW domain variant studied we observe two main folding pathways with opposite orders of formation of the two hairpins involved in the structure; for either pathway, formation of hairpin 1 is more likely to be the rate-limiting step. Altogether, our results suggest that PACE combined with network analysis is a computationally efficient and valuable tool for the study of protein folding. PMID:23915394
A hybrid MD-kMC algorithm for folding proteins in explicit solvent.

PubMed

Peter, Emanuel Karl; Shea, Joan-Emma

2014-04-14

We present a novel hybrid MD-kMC algorithm that is capable of efficiently folding proteins in explicit solvent. We apply this algorithm to the folding of a small protein, Trp-Cage. Different kMC move sets that capture different possible rate limiting steps are implemented. The first uses secondary structure formation as a relevant rate event (a combination of dihedral rotations and hydrogen-bonding formation and breakage). The second uses tertiary structure formation events through formation of contacts via translational moves. Both methods fold the protein, but via different mechanisms and with different folding kinetics. The first method leads to folding via a structured helical state, with kinetics fit by a single exponential. The second method leads to folding via a collapsed loop, with kinetics poorly fit by single or double exponentials. In both cases, folding times are faster than experimentally reported values, The secondary and tertiary move sets are integrated in a third MD-kMC implementation, which now leads to folding of the protein via both pathways, with single and double-exponential fits to the rates, and to folding rates in good agreement with experimental values. The competition between secondary and tertiary structure leads to a longer search for the helix-rich intermediate in the case of the first pathway, and to the emergence of a kinetically trapped long-lived molten-globule collapsed state in the case of the second pathway. The algorithm presented not only captures experimentally observed folding intermediates and kinetics, but yields insights into the relative roles of local and global interactions in determining folding mechanisms and rates.
Prediction of protein mutant stability using classification and regression tool.

PubMed

Huang, Liang-Tsung; Saraboji, K; Ho, Shinn-Ying; Hwang, Shiow-Fen; Ponnuswamy, M N; Gromiha, M Michael

2007-02-01

Prediction of protein stability upon amino acid substitutions is an important problem in molecular biology and the solving of which would help for designing stable mutants. In this work, we have analyzed the stability of protein mutants using two different datasets of 1396 and 2204 mutants obtained from ProTherm database, respectively for free energy change due to thermal (DeltaDeltaG) and denaturant denaturations (DeltaDeltaG(H(2)O)). We have used a set of 48 physical, chemical energetic and conformational properties of amino acid residues and computed the difference of amino acid properties for each mutant in both sets of data. These differences in amino acid properties have been related to protein stability (DeltaDeltaG and DeltaDeltaG(H(2)O)) and are used to train with classification and regression tool for predicting the stability of protein mutants. Further, we have tested the method with 4 fold, 5 fold and 10 fold cross validation procedures. We found that the physical properties, shape and flexibility are important determinants of protein stability. The classification of mutants based on secondary structure (helix, strand, turn and coil) and solvent accessibility (buried, partially buried, partially exposed and exposed) distinguished the stabilizing/destabilizing mutants at an average accuracy of 81% and 80%, respectively for DeltaDeltaG and DeltaDeltaG(H(2)O). The correlation between the experimental and predicted stability change is 0.61 for DeltaDeltaG and 0.44 for DeltaDeltaG(H(2)O). Further, the free energy change due to the replacement of amino acid residue has been predicted within an average error of 1.08 kcal/mol and 1.37 kcal/mol for thermal and chemical denaturation, respectively. The relative importance of secondary structure and solvent accessibility, and the influence of the dataset on prediction of protein mutant stability have been discussed.
Homochiral stereochemistry: the missing link of structure to energetics in protein folding.

PubMed

Kumar, Anil; Ramakrishnan, Vibin; Ranbhor, Ranjit; Patel, Kirti; Durani, Susheel

2009-12-24

The notion is tested that homochiral stereochemistry being ubiquitous to protein structure could be critical to protein folding as well, causing it to become frustrated energetically providing the basis for its solvent- and sequence-mediated control. The proof in support of the notion is found in a consensus of experiment and computation according to which suitable oligopeptides are in their folding-unfolding equilibria, at both macrostate and microstate levels, susceptible to dielectric because of the conflict of peptide-chain electrostatics with interpeptide hydrogen bonds when the structure is poly-L but not when it is alternating-L,D. The argument is thus made that homochiral stereochemistry may in protein folding provide the unifying basis for its solvent- and sequence-mediated control based on screening of peptide-chain electrostatics under conflict with folding of the chain due to homochiral stereochemistry. Dielectric is brought into spotlight as the effect comparatively obscure but presumably critical to the folding in protein structure for its control.
Balancing energy and entropy: A minimalist model for the characterization of protein folding landscapes

PubMed Central

Das, Payel; Matysiak, Silvina; Clementi, Cecilia

2005-01-01

Coarse-grained models have been extremely valuable in promoting our understanding of protein folding. However, the quantitative accuracy of existing simplified models is strongly hindered either from the complete removal of frustration (as in the widely used Gō-like models) or from the compromise with the minimal frustration principle and/or realistic protein geometry (as in the simple on-lattice models). We present a coarse-grained model that “naturally” incorporates sequence details and energetic frustration into an overall minimally frustrated folding landscape. The model is coupled with an optimization procedure to design the parameters of the protein Hamiltonian to fold into a desired native structure. The application to the study of src-Src homology 3 domain shows that this coarse-grained model contains the main physical-chemical ingredients that are responsible for shaping the folding landscape of this protein. The results illustrate the importance of nonnative interactions and energetic heterogeneity for a quantitative characterization of folding mechanisms. PMID:16006532
Prokaryotic Ubiquitin-Like Protein Modification

PubMed Central

Maupin-Furlow, Julie A.

2016-01-01

Prokaryotes form ubiquitin (Ub)-like isopeptide bonds on the lysine residues of proteins by at least two distinct pathways that are reversible and regulated. In mycobacteria, the C-terminal Gln of Pup (prokaryotic ubiquitin-like protein) is deamidated and isopeptide linked to proteins by a mechanism distinct from ubiquitylation in enzymology yet analogous to ubiquitylation in targeting proteins for destruction by proteasomes. Ub-fold proteins of archaea (SAMPs, small archaeal modifier proteins) and Thermus (TtuB, tRNA-two-thiouridine B) that differ from Ub in amino acid sequence, yet share a common β-grasp fold, also form isopeptide bonds by a mechanism that appears streamlined compared with ubiquitylation. SAMPs and TtuB are found to be members of a small group of Ub-fold proteins that function not only in protein modification but also in sulfur-transfer pathways associated with tRNA thiolation and molybdopterin biosynthesis. These multifunctional Ub-fold proteins are thought to be some of the most ancient of Ub-like protein modifiers. PMID:24995873
Local energetic frustration affects the dependence of green fluorescent protein folding on the chaperonin GroEL.

PubMed

Bandyopadhyay, Boudhayan; Goldenzweig, Adi; Unger, Tamar; Adato, Orit; Fleishman, Sarel J; Unger, Ron; Horovitz, Amnon

2017-12-15

The GroE chaperonin system in Escherichia coli comprises GroEL and GroES and facilitates ATP-dependent protein folding in vivo and in vitro Proteins with very similar sequences and structures can differ in their dependence on GroEL for efficient folding. One potential but unverified source for GroEL dependence is frustration, wherein not all interactions in the native state are optimized energetically, thereby potentiating slow folding and misfolding. Here, we chose enhanced green fluorescent protein as a model system and subjected it to random mutagenesis, followed by screening for variants whose in vivo folding displays increased or decreased GroEL dependence. We confirmed the altered GroEL dependence of these variants with in vitro folding assays. Strikingly, mutations at positions predicted to be highly frustrated were found to correlate with decreased GroEL dependence. Conversely, mutations at positions with low frustration were found to correlate with increased GroEL dependence. Further support for this finding was obtained by showing that folding of an enhanced green fluorescent protein variant designed computationally to have reduced frustration is indeed less GroEL-dependent. Our results indicate that changes in local frustration also affect partitioning in vivo between spontaneous and chaperonin-mediated folding. Hence, the design of minimally frustrated sequences can reduce chaperonin dependence and improve protein expression levels. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Comparison of the Folding Mechanism of Highly Homologous Proteins in the Lipid-binding Protein Family

EPA Science Inventory

The folding mechanism of two closely related proteins in the intracellular lipid binding protein family, human bile acid binding protein (hBABP) and rat bile acid binding protein (rBABP) were examined. These proteins are 77% identical (93% similar) in sequence Both of these singl...
Resolution of the unfolded state.

NASA Astrophysics Data System (ADS)

Beaucage, Gregory

2008-03-01

The unfolded states in proteins and nucleic acids remain weakly understood despite their importance to protein folding; misfolding diseases (Parkinson's & Alzheimer's); natively unfolded proteins (˜ 30% of eukaryotic proteins); and to understanding ribozymes. Research has been hindered by the inability to quantify the residual (native) structure present in an unfolded protein or nucleic acid. Here, a scaling model is proposed to quantify the degree of folding and the unfolded state (Beaucage, 2004, 2007). The model takes a global view of protein structure and can be applied to a number of analytic methods and to simulations. Three examples are given of application to small-angle scattering from pressure induced unfolding of SNase (Panick, 1998), from acid unfolded Cyt c (Kataoka, 1993) and from folding of Azoarcus ribozyme (Perez-Salas, 2004). These examples quantitatively show 3 characteristic unfolded states for proteins, the statistical nature of a folding pathway and the relationship between extent of folding and chain size during folding for charge driven folding in RNA. Beaucage, G., Biophys. J., in press (2007). Beaucage, G., Phys. Rev. E. 70, 031401 (2004). Kataoka, M., Y. Hagihara, K. Mihara, Y. Goto J. Mol. Biol. 229, 591 (1993). Panick, G., R. Malessa, R. Winter, G. Rapp, K. J. Frye, C. A. Royer J. Mol. Biol. 275, 389 (1998). Perez-Salas U. A., P. Rangan, S. Krueger, R. M. Briber, D. Thirumalai, S. A. Woodson, Biochemistry 43 1746 (2004).
Search for Functional Flexible Regions in the G-protein Family: New Reading of the FoldUnfold Program.

PubMed

Galzitskaya, Oxana; Deryusheva, Eugenia; Machulin, Andrey; Nemashkalova, Ekaterina; Glyakina, Anna

2018-06-21

High prediction accuracy of flexible loops in different protein families is a challenge because of the crucial functions associated with these regions. Results of the currently available programs for prediction of loops vary from protein to protein. For prediction of flexible regions in the G-domain for 23 representatives of G-proteins with the known 3D structure we have used eight programs. The results of predictions demonstrate that the FoldUnfold program predicts better loop positions than the PONDR, RОNN, DisEMBL, IUPred, GlobPlot 2, FoldIndex, and MobiDB programs. When classifying the predicted loops (rigid/flexible) according to the Debye-Waller fluctuation factors, our data reveal the existing weak correlation between the B-factors and the average number of closed residues according to the FoldUnfold program; the percentage of overlapping characteristics (residue fold/unfold status) of the protein residues from the two methods is about 60-70%. According to the FoldUnfold program, for G-proteins with the posttranslational modifications, the surrounding binding site residues by disordered-promoting glycine and alanine residues conduces to a more flexible position of the binding sites for fatty acid, while methionine, cysteine and isoleucine residues provide more rigid binding sites. Thus, our research demonstrates additional possibilities of the FoldUnfold program for prediction of flexible regions and characteristics of individual residues in a different protein family. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Solvent friction changes the folding pathway of the tryptophan zipper TZ2.

PubMed

Narayanan, Ranjani; Pelakh, Leslie; Hagen, Stephen J

2009-07-17

Because the rate of a diffusional process such as protein folding is controlled by friction encountered along the reaction pathway, the speed of folding is readily tunable through adjustment of solvent viscosity. The precise relationship between solvent viscosity and the rate of diffusion is complex and even conformation-dependent, however, because both solvent friction and protein internal friction contribute to the total reaction friction. The heterogeneity of the reaction friction along the folding pathway may have subtle consequences. For proteins that fold on a multidimensional free-energy surface, an increase in solvent friction may drive a qualitative change in folding trajectory. Our time-resolved experiments on the rapidly and heterogeneously folding beta-hairpin TZ2 show a shift in the folding pathway as viscosity increases, even though the energetics of folding is unaltered. We also observe a nonlinear or saturating behavior of the folding relaxation time with rising solvent viscosity, potentially an experimental signature of the shifting pathway for unfolding. Our results show that manipulations of solvent viscosity in folding experiments and simulations may have subtle and unexpected consequences on the folding dynamics being studied.
Protein kinesis: The dynamics of protein trafficking and stability

DOE Office of Scientific and Technical Information (OSTI.GOV)

NONE

The purpose of this conference is to provide a multidisciplinary forum for exchange of state-of-the-art information on protein kinesis. This volume contains abstracts of papers in the following areas: protein folding and modification in the endoplasmic reticulum; protein trafficking; protein translocation and folding; protein degradation; polarity; nuclear trafficking; membrane dynamics; and protein import into organelles.
Exploring the protein folding free energy landscape: coupling replica exchange method with P3ME/RESPA algorithm.

PubMed

Zhou, Ruhong

2004-05-01

A highly parallel replica exchange method (REM) that couples with a newly developed molecular dynamics algorithm particle-particle particle-mesh Ewald (P3ME)/RESPA has been proposed for efficient sampling of protein folding free energy landscape. The algorithm is then applied to two separate protein systems, beta-hairpin and a designed protein Trp-cage. The all-atom OPLSAA force field with an explicit solvent model is used for both protein folding simulations. Up to 64 replicas of solvated protein systems are simulated in parallel over a wide range of temperatures. The combined trajectories in temperature and configurational space allow a replica to overcome free energy barriers present at low temperatures. These large scale simulations reveal detailed results on folding mechanisms, intermediate state structures, thermodynamic properties and the temperature dependences for both protein systems.
How the Sequence of a Gene Specifies Structural Symmetry in Proteins

PubMed Central

Shen, Xiaojuan; Huang, Tongcheng; Wang, Guanyu; Li, Guanglin

2015-01-01

Internal symmetry is commonly observed in the majority of fundamental protein folds. Meanwhile, sufficient evidence suggests that nascent polypeptide chains of proteins have the potential to start the co-translational folding process and this process allows mRNA to contain additional information on protein structure. In this paper, we study the relationship between gene sequences and protein structures from the viewpoint of symmetry to explore how gene sequences code for structural symmetry in proteins. We found that, for a set of two-fold symmetric proteins from left-handed beta-helix fold, intragenic symmetry always exists in their corresponding gene sequences. Meanwhile, codon usage bias and local mRNA structure might be involved in modulating translation speed for the formation of structural symmetry: a major decrease of local codon usage bias in the middle of the codon sequence can be identified as a common feature; and major or consecutive decreases in local mRNA folding energy near the boundaries of the symmetric substructures can also be observed. The results suggest that gene duplication and fusion may be an evolutionarily conserved process for this protein fold. In addition, the usage of rare codons and the formation of higher order of secondary structure near the boundaries of symmetric substructures might have coevolved as conserved mechanisms to slow down translation elongation and to facilitate effective folding of symmetric substructures. These findings provide valuable insights into our understanding of the mechanisms of translation and its evolution, as well as the design of proteins via symmetric modules. PMID:26641668
Localizing internal friction along the reaction coordinate of protein folding by combining ensemble and single-molecule fluorescence spectroscopy

PubMed Central

Borgia, Alessandro; Wensley, Beth G.; Soranno, Andrea; Nettels, Daniel; Borgia, Madeleine B.; Hoffmann, Armin; Pfeil, Shawn H.; Lipman, Everett A.; Clarke, Jane; Schuler, Benjamin

2012-01-01

Theory, simulations and experimental results have suggested an important role of internal friction in the kinetics of protein folding. Recent experiments on spectrin domains provided the first evidence for a pronounced contribution of internal friction in proteins that fold on the millisecond timescale. However, it has remained unclear how this contribution is distributed along the reaction and what influence it has on the folding dynamics. Here we use a combination of single-molecule Förster resonance energy transfer, nanosecond fluorescence correlation spectroscopy, microfluidic mixing and denaturant- and viscosity-dependent protein-folding kinetics to probe internal friction in the unfolded state and at the early and late transition states of slow- and fast-folding spectrin domains. We find that the internal friction affecting the folding rates of spectrin domains is highly localized to the early transition state, suggesting an important role of rather specific interactions in the rate-limiting conformational changes. PMID:23149740
Localizing internal friction along the reaction coordinate of protein folding by combining ensemble and single-molecule fluorescence spectroscopy.

PubMed

Borgia, Alessandro; Wensley, Beth G; Soranno, Andrea; Nettels, Daniel; Borgia, Madeleine B; Hoffmann, Armin; Pfeil, Shawn H; Lipman, Everett A; Clarke, Jane; Schuler, Benjamin

2012-01-01

Theory, simulations and experimental results have suggested an important role of internal friction in the kinetics of protein folding. Recent experiments on spectrin domains provided the first evidence for a pronounced contribution of internal friction in proteins that fold on the millisecond timescale. However, it has remained unclear how this contribution is distributed along the reaction and what influence it has on the folding dynamics. Here we use a combination of single-molecule Förster resonance energy transfer, nanosecond fluorescence correlation spectroscopy, microfluidic mixing and denaturant- and viscosity-dependent protein-folding kinetics to probe internal friction in the unfolded state and at the early and late transition states of slow- and fast-folding spectrin domains. We find that the internal friction affecting the folding rates of spectrin domains is highly localized to the early transition state, suggesting an important role of rather specific interactions in the rate-limiting conformational changes.
ortho- and meta-substituted aromatic thiols are efficient redox buffers that increase the folding rate of a disulfide-containing protein.

PubMed

Gough, Jonathan D; Barrett, Elvis J; Silva, Yenia; Lees, Watson J

2006-08-20

Thiol based redox buffers are used to enhance the folding rates of disulfide-containing proteins in vitro. Traditionally, small molecule aliphatic thiols such as glutathione are employed. Recently, we have demonstrated that aromatic thiols can further enhance protein-folding rates. In the presence of para-substituted aromatic thiols the folding rate of a disulfide-containing protein was increased by 4-23 times over that measured for glutathione. However, several important practical issues remain to be addressed. Aromatic thiols have never been tested in the presence of denaturants such as guanidine hydrochloride. Only two of the para-substituted aromatic thiols previously examined are commercially available. To expand the number of aromatic thiols for protein folding, several commercially available meta- and ortho-substituted aromatic thiols were studied. Furthermore, an ortho-substituted aromatic thiol, easily obtained from inexpensive starting materials, was investigated. Folding rates of scrambled ribonuclease A at pH 6.0, 7.0 and 7.7, with ortho- and meta-substituted aromatic thiols, were up to 10 times greater than those with glutathione. In the presence of the common denaturant guanidine hydrochloride (0.5M) aromatic thiols provided 100% yield of active protein while maintaining equivalent folding rates.

Protein Folding and the Challenges of Maintaining Endoplasmic Reticulum Proteostasis in Idiopathic Pulmonary Fibrosis.

PubMed

Romero, Freddy; Summer, Ross

2017-11-01

Alveolar epithelial type II (AEII) cells are "professional" secretory cells that synthesize and secrete massive quantities of proteins to produce pulmonary surfactant and maintain airway immune defenses. To facilitate this high level of protein synthesis, AEII cells are equipped with an elaborate endoplasmic reticulum (ER) structure and possess an abundance of the machinery needed to fold, assemble, and secrete proteins. However, conditions that suddenly increase the quantity of new proteins entering the ER or that impede the capacity of the ER to fold proteins can cause misfolded or unfolded proteins to accumulate in the ER lumen, also called ER stress. To minimize this stress, AEII cells adapt by (1) reducing the quantity of proteins entering the ER, (2) increasing the amount of protein-folding machinery, and (3) removing misfolded proteins when they accumulate. Although these adaptive responses, aptly named the unfolded protein response, are usually effective in reducing ER stress, chronic aggregation of misfolded proteins is recognized as a hallmark feature of AEII cells in patients with idiopathic pulmonary fibrosis (IPF). Although mutations in surfactant proteins are linked to the development of ER stress in some rare IPF cases, the mechanisms causing protein misfolding in most cases are unknown. In this article, we review the mechanisms regulating ER proteostasis and highlight specific aspects of protein folding and the unfolded protein response that are most vulnerable to failure. Then, we postulate mechanisms other than genetic mutations that might contribute to protein aggregation in the alveolar epithelium of IPF lung.
Metamorphic Proteins: Emergence of Dual Protein Folds from One Primary Sequence.

PubMed

Lella, Muralikrishna; Mahalakshmi, Radhakrishnan

2017-06-20

Every amino acid exhibits a different propensity for distinct structural conformations. Hence, decoding how the primary amino acid sequence undergoes the transition to a defined secondary structure and its final three-dimensional fold is presently considered predictable with reasonable certainty. However, protein sequences that defy the first principles of secondary structure prediction (they attain two different folds) have recently been discovered. Such proteins, aptly named metamorphic proteins, decrease the conformational constraint by increasing flexibility in the secondary structure and thereby result in efficient functionality. In this review, we discuss the major factors driving the conformational switch related both to protein sequence and to structure using illustrative examples. We discuss the concept of an evolutionary transition in sequence and structure, the functional impact of the tertiary fold, and the pressure of intrinsic and external factors that give rise to metamorphic proteins. We mainly focus on the major components of protein architecture, namely, the α-helix and β-sheet segments, which are involved in conformational switching within the same or highly similar sequences. These chameleonic sequences are widespread in both cytosolic and membrane proteins, and these folds are equally important for protein structure and function. We discuss the implications of metamorphic proteins and chameleonic peptide sequences in de novo peptide design.
Effect of manuka honey on the expression of universal stress protein A in meticillin-resistant Staphylococcus aureus.

PubMed

Jenkins, Rowena; Burton, Neil; Cooper, Rose

2011-04-01

Staphylococcus aureus is an important pathogen that can cause many problems, from impetigo to endocarditis. With its continued resistance to multiple antibiotics, S. aureus remains a serious health threat. Honey has been used to eradicate meticillin-resistant S. aureus (MRSA) strains from wounds, but its mode of action is not yet understood. Proteomics provides a potent group of techniques that can be used to analyse differences in protein expression between untreated bacterial cells and those treated with inhibitory concentrations of manuka honey. In this study, two-dimensional (2D) electrophoresis was combined with matrix-assisted laser desorption/ionisation time-of-flight mass spectrometry (MALDI-TOF MS) to determine the identities of proteins whose levels of expression were changed at least two-fold following treatment with manuka honey. Protein extracts were obtained from cells grown in tryptone soy broth (with or without manuka honey) by mechanical disruption and were separated on 2D polyacrylamide gels. A protein was isolated in gels prepared from untreated cell extract that was absent from gels made using honey-treated cell extract. Using MALDI-TOF MS, the protein was identified as universal stress protein A (UspA). Downregulation of this protein was confirmed by real-time polymerase chain reaction (PCR), which showed a 16-fold downregulation in honey-treated cells compared with untreated samples. This protein is involved in the stress stamina response and its downregulation could help to explain the inhibition of MRSA by manuka honey. Copyright © 2011 Elsevier B.V. and the International Society of Chemotherapy. All rights reserved.
Amyloidogenesis of Natively Unfolded Proteins

PubMed Central

Uversky, Vladimir N.

2009-01-01

Aggregation and subsequent development of protein deposition diseases originate from conformational changes in corresponding amyloidogenic proteins. The accumulated data support the model where protein fibrillogenesis proceeds via the formation of a relatively unfolded amyloidogenic conformation, which shares many structural properties with the pre-molten globule state, a partially folded intermediate first found during the equilibrium and kinetic (un)folding studies of several globular proteins and later described as one of the structural forms of natively unfolded proteins. The flexibility of this structural form is essential for the conformational rearrangements driving the formation of the core cross-beta structure of the amyloid fibril. Obviously, molecular mechanisms describing amyloidogenesis of ordered and natively unfolded proteins are different. For ordered protein to fibrillate, its unique and rigid structure has to be destabilized and partially unfolded. On the other hand, fibrillogenesis of a natively unfolded protein involves the formation of partially folded conformation; i.e., partial folding rather than unfolding. In this review recent findings are surveyed to illustrate some unique features of the natively unfolded proteins amyloidogenesis. PMID:18537543
Proteins improving recombinant antibody production in mammalian cells.

PubMed

Nishimiya, Daisuke

2014-02-01

Mammalian cells have been successfully used for the industrial manufacture of antibodies due to their ability to synthesize antibodies correctly. Nascent polypeptides must be subjected to protein folding and assembly in the ER and the Golgi to be secreted as mature proteins. If these reactions do not proceed appropriately, unfolded or misfolded proteins are degraded by the ER-associated degradation (ERAD) pathway. The accumulation of unfolded proteins or intracellular antibody crystals accompanied by this failure triggers the unfolded protein response (UPR), which can considerably attenuate the levels of translation, folding, assembly, and secretion, resulting in reduction of antibody productivity. Accumulating studies by omics-based analysis of recombinant mammalian cells suggest that not only protein secretion processes including protein folding and assembly but also translation are likely to be the rate-limiting factors for increasing antibody production. Here, this review describes the mechanism of antibody folding and assembly and recent advantages which could improve recombinant antibody production in mammalian cells by utilizing proteins such as ER chaperones or UPR-related proteins.
Towards NV-based magnetic sensing in the time domain

NASA Astrophysics Data System (ADS)

Urbach, Elana; Sumarac, Tamara; Lovchinsky, Igor; Landig, Renate; Sanchez-Yamagishi, Javier; Andersen, Trond; Park, Hongkun; Lukin, Mikhail

2017-04-01

The study of protein folding dynamics is an outstanding problem in the biological sciences. We show that nitrogen-vacancy (NV) centers in diamond can be used to dynamically sense the conformational states of individual proteins under ambient conditions. We present preliminary data on time-domain detection of electronic spin labels which were chemically attached to the proteins, as well as label-free detection of native hydrogen nuclear spins within the protein. In addition, we discuss work towards polarizing boron-11 spins in atomically-thin hexagonal boron nitride using Hartmann-Hahn double resonance, with the ultimate goal of studying many-body spin dynamics and performing quantum simulation. This material is based upon work supported by the National Science Foundation Graduate Research Fellowship Program under Grant No. DGE1144152.
Cold denaturation as a tool to measure protein stability

PubMed Central

Sanfelice, Domenico; Temussi, Piero Andrea

2016-01-01

Protein stability is an important issue for the interpretation of a wide variety of biological problems but its assessment is at times difficult. The most common parameter employed to describe protein stability is the temperature of melting, at which the populations of folded and unfolded species are identical. This parameter may yield ambiguous results. It would always be preferable to measure the whole stability curve. The calculation of this curve is greatly facilitated whenever it is possible to observe cold denaturation. Using Yfh1, one of the few proteins whose cold denaturation occurs at neutral pH and low ionic strength, we could measure the variation of its full stability curve under several environmental conditions. Here we show the advantages of gauging stability as a function of external variables using stability curves. PMID:26026885
A galaxy of folds.

PubMed

Alva, Vikram; Remmert, Michael; Biegert, Andreas; Lupas, Andrei N; Söding, Johannes

2010-01-01

Many protein classification systems capture homologous relationships by grouping domains into families and superfamilies on the basis of sequence similarity. Superfamilies with similar 3D structures are further grouped into folds. In the absence of discernable sequence similarity, these structural similarities were long thought to have originated independently, by convergent evolution. However, the growth of databases and advances in sequence comparison methods have led to the discovery of many distant evolutionary relationships that transcend the boundaries of superfamilies and folds. To investigate the contributions of convergent versus divergent evolution in the origin of protein folds, we clustered representative domains of known structure by their sequence similarity, treating them as point masses in a virtual 2D space which attract or repel each other depending on their pairwise sequence similarities. As expected, families in the same superfamily form tight clusters. But often, superfamilies of the same fold are linked with each other, suggesting that the entire fold evolved from an ancient prototype. Strikingly, some links connect superfamilies with different folds. They arise from modular peptide fragments of between 20 and 40 residues that co-occur in the connected folds in disparate structural contexts. These may be descendants of an ancestral pool of peptide modules that evolved as cofactors in the RNA world and from which the first folded proteins arose by amplification and recombination. Our galaxy of folds summarizes, in a single image, most known and many yet undescribed homologous relationships between protein superfamilies, providing new insights into the evolution of protein domains.
Folding pathway of a multidomain protein depends on its topology of domain connectivity

PubMed Central

Inanami, Takashi; Terada, Tomoki P.; Sasai, Masaki

2014-01-01

How do the folding mechanisms of multidomain proteins depend on protein topology? We addressed this question by developing an Ising-like structure-based model and applying it for the analysis of free-energy landscapes and folding kinetics of an example protein, Escherichia coli dihydrofolate reductase (DHFR). DHFR has two domains, one comprising discontinuous N- and C-terminal parts and the other comprising a continuous middle part of the chain. The simulated folding pathway of DHFR is a sequential process during which the continuous domain folds first, followed by the discontinuous domain, thereby avoiding the rapid decrease in conformation entropy caused by the association of the N- and C-terminal parts during the early phase of folding. Our simulated results consistently explain the observed experimental data on folding kinetics and predict an off-pathway structural fluctuation at equilibrium. For a circular permutant for which the topological complexity of wild-type DHFR is resolved, the balance between energy and entropy is modulated, resulting in the coexistence of the two folding pathways. This coexistence of pathways should account for the experimentally observed complex folding behavior of the circular permutant. PMID:25267632
The Energy Landscape, Folding Pathways and the Kinetics of a Knotted Protein

PubMed Central

Prentiss, Michael C.; Wales, David J.; Wolynes, Peter G.

2010-01-01

The folding pathway and rate coefficients of the folding of a knotted protein are calculated for a potential energy function with minimal energetic frustration. A kinetic transition network is constructed using the discrete path sampling approach, and the resulting potential energy surface is visualized by constructing disconnectivity graphs. Owing to topological constraints, the low-lying portion of the landscape consists of three distinct regions, corresponding to the native knotted state and to configurations where either the N or C terminus is not yet folded into the knot. The fastest folding pathways from denatured states exhibit early formation of the N terminus portion of the knot and a rate-determining step where the C terminus is incorporated. The low-lying minima with the N terminus knotted and the C terminus free therefore constitute an off-pathway intermediate for this model. The insertion of both the N and C termini into the knot occurs late in the folding process, creating large energy barriers that are the rate limiting steps in the folding process. When compared to other protein folding proteins of a similar length, this system folds over six orders of magnitude more slowly. PMID:20617197
Microsecond protein dynamics observed at the single-molecule level

NASA Astrophysics Data System (ADS)

Otosu, Takuhiro; Ishii, Kunihiko; Tahara, Tahei

2015-07-01

How polypeptide chains acquire specific conformations to realize unique biological functions is a central problem of protein science. Single-molecule spectroscopy, combined with fluorescence resonance energy transfer, is utilized to study the conformational heterogeneity and the state-to-state transition dynamics of proteins on the submillisecond to second timescales. However, observation of the dynamics on the microsecond timescale is still very challenging. This timescale is important because the elementary processes of protein dynamics take place and direct comparison between experiment and simulation is possible. Here we report a new single-molecule technique to reveal the microsecond structural dynamics of proteins through correlation of the fluorescence lifetime. This method, two-dimensional fluorescence lifetime correlation spectroscopy, is applied to clarify the conformational dynamics of cytochrome c. Three conformational ensembles and the microsecond transitions in each ensemble are indicated from the correlation signal, demonstrating the importance of quantifying microsecond dynamics of proteins on the folding free energy landscape.
Microsecond protein dynamics observed at the single-molecule level

PubMed Central

Otosu, Takuhiro; Ishii, Kunihiko; Tahara, Tahei

2015-01-01

How polypeptide chains acquire specific conformations to realize unique biological functions is a central problem of protein science. Single-molecule spectroscopy, combined with fluorescence resonance energy transfer, is utilized to study the conformational heterogeneity and the state-to-state transition dynamics of proteins on the submillisecond to second timescales. However, observation of the dynamics on the microsecond timescale is still very challenging. This timescale is important because the elementary processes of protein dynamics take place and direct comparison between experiment and simulation is possible. Here we report a new single-molecule technique to reveal the microsecond structural dynamics of proteins through correlation of the fluorescence lifetime. This method, two-dimensional fluorescence lifetime correlation spectroscopy, is applied to clarify the conformational dynamics of cytochrome c. Three conformational ensembles and the microsecond transitions in each ensemble are indicated from the correlation signal, demonstrating the importance of quantifying microsecond dynamics of proteins on the folding free energy landscape. PMID:26151767
Matching multiple rigid domain decompositions of proteins

PubMed Central

Flynn, Emily; Streinu, Ileana

2017-01-01

We describe efficient methods for consistently coloring and visualizing collections of rigid cluster decompositions obtained from variations of a protein structure, and lay the foundation for more complex setups that may involve different computational and experimental methods. The focus here is on three biological applications: the conceptually simpler problems of visualizing results of dilution and mutation analyses, and the more complex task of matching decompositions of multiple NMR models of the same protein. Implemented into the KINARI web server application, the improved visualization techniques give useful information about protein folding cores, help examining the effect of mutations on protein flexibility and function, and provide insights into the structural motions of PDB proteins solved with solution NMR. These tools have been developed with the goal of improving and validating rigidity analysis as a credible coarse-grained model capturing essential information about a protein’s slow motions near the native state. PMID:28141528
Forces Driving Chaperone Action

PubMed Central

Koldewey, Philipp; Stull, Frederick; Horowitz, Scott; Martin, Raoul; Bardwell, James C. A.

2016-01-01

SUMMARY It is still unclear what molecular forces drive chaperone-mediated protein folding. Here, we obtain a detailed mechanistic understanding of the forces that dictate the four key steps of chaperone-client interaction: initial binding, complex stabilization, folding, and release. Contrary to the common belief that chaperones recognize unfolding intermediates by their hydrophobic nature, we discover that the model chaperone Spy uses long-range electrostatic interactions to rapidly bind to its unfolded client protein Im7. Short-range hydrophobic interactions follow, which serve to stabilize the complex. Hydrophobic collapse of the client protein then drives its folding. By burying hydrophobic residues in its core, the client’s affinity to Spy decreases, which causes client release. By allowing the client to fold itself, Spy circumvents the need for client-specific folding instructions. This mechanism might help explain how chaperones can facilitate the folding of various unrelated proteins. PMID:27293188
Non-detergent sulphobetaines: a new class of molecules that facilitate in vitro protein renaturation.

PubMed

Goldberg, M E; Expert-Bezançon, N; Vuillard, L; Rabilloud, T

1996-01-01

Attempts to renature proteins often yield aggregates rather than native protein. To minimize aggregation, low protein concentrations and/or solubilizing agents are used. Here, we test new solubilizing molecules, non-detergent sulphobetaines, to improve the renaturation of two very different enzymes, hen egg white lysozyme and bacterial beta-D-galactosidase. The renaturation was conducted in the presence of five different sulphobetaines and the yield of active enzyme was measured. The five sulphobetaines improved the yield of native lysozyme up to 12-fold. Some sulphobetaines improved the yield of galactosidase up to 80-fold, but one reduced it 100-fold. Non-detergent sulphobetaines strongly affect the balance between aggregation and folding. Their effect depends on their structure and on their interactions with folding intermediates. These results should serve as a basis for designing more efficient sulphobetaines; for designing improved renaturation protocols using existing sulphobetaines; and for characterizing folding intermediates that interact with sulphobetaines.
Folding and stability of helical bundle proteins from coarse-grained models.

PubMed

Kapoor, Abhijeet; Travesset, Alex

2013-07-01

We develop a coarse-grained model where solvent is considered implicitly, electrostatics are included as short-range interactions, and side-chains are coarse-grained to a single bead. The model depends on three main parameters: hydrophobic, electrostatic, and side-chain hydrogen bond strength. The parameters are determined by considering three level of approximations and characterizing the folding for three selected proteins (training set). Nine additional proteins (containing up to 126 residues) as well as mutated versions (test set) are folded with the given parameters. In all folding simulations, the initial state is a random coil configuration. Besides the native state, some proteins fold into an additional state differing in the topology (structure of the helical bundle). We discuss the stability of the native states, and compare the dynamics of our model to all atom molecular dynamics simulations as well as some general properties on the interactions governing folding dynamics. Copyright © 2013 Wiley Periodicals, Inc.
Precursory signatures of protein folding/unfolding: From time series correlation analysis to atomistic mechanisms

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hsu, P. J.; Lai, S. K., E-mail: sklai@coll.phy.ncu.edu.tw; Molecular Science and Technology Program, Taiwan International Graduate Program, Academia Sinica, Taipei 115, Taiwan

Folded conformations of proteins in thermodynamically stable states have long lifetimes. Before it folds into a stable conformation, or after unfolding from a stable conformation, the protein will generally stray from one random conformation to another leading thus to rapid fluctuations. Brief structural changes therefore occur before folding and unfolding events. These short-lived movements are easily overlooked in studies of folding/unfolding for they represent momentary excursions of the protein to explore conformations in the neighborhood of the stable conformation. The present study looks for precursory signatures of protein folding/unfolding within these rapid fluctuations through a combination of three techniques: (1)more » ultrafast shape recognition, (2) time series segmentation, and (3) time series correlation analysis. The first procedure measures the differences between statistical distance distributions of atoms in different conformations by calculating shape similarity indices from molecular dynamics simulation trajectories. The second procedure is used to discover the times at which the protein makes transitions from one conformation to another. Finally, we employ the third technique to exploit spatial fingerprints of the stable conformations; this procedure is to map out the sequences of changes preceding the actual folding and unfolding events, since strongly correlated atoms in different conformations are different due to bond and steric constraints. The aforementioned high-frequency fluctuations are therefore characterized by distinct correlational and structural changes that are associated with rate-limiting precursors that translate into brief segments. Guided by these technical procedures, we choose a model system, a fragment of the protein transthyretin, for identifying in this system not only the precursory signatures of transitions associated with α helix and β hairpin, but also the important role played by weaker correlations in such protein folding dynamics.« less
Precursory signatures of protein folding/unfolding: From time series correlation analysis to atomistic mechanisms

NASA Astrophysics Data System (ADS)

Hsu, P. J.; Cheong, S. A.; Lai, S. K.

2014-05-01

Folded conformations of proteins in thermodynamically stable states have long lifetimes. Before it folds into a stable conformation, or after unfolding from a stable conformation, the protein will generally stray from one random conformation to another leading thus to rapid fluctuations. Brief structural changes therefore occur before folding and unfolding events. These short-lived movements are easily overlooked in studies of folding/unfolding for they represent momentary excursions of the protein to explore conformations in the neighborhood of the stable conformation. The present study looks for precursory signatures of protein folding/unfolding within these rapid fluctuations through a combination of three techniques: (1) ultrafast shape recognition, (2) time series segmentation, and (3) time series correlation analysis. The first procedure measures the differences between statistical distance distributions of atoms in different conformations by calculating shape similarity indices from molecular dynamics simulation trajectories. The second procedure is used to discover the times at which the protein makes transitions from one conformation to another. Finally, we employ the third technique to exploit spatial fingerprints of the stable conformations; this procedure is to map out the sequences of changes preceding the actual folding and unfolding events, since strongly correlated atoms in different conformations are different due to bond and steric constraints. The aforementioned high-frequency fluctuations are therefore characterized by distinct correlational and structural changes that are associated with rate-limiting precursors that translate into brief segments. Guided by these technical procedures, we choose a model system, a fragment of the protein transthyretin, for identifying in this system not only the precursory signatures of transitions associated with α helix and β hairpin, but also the important role played by weaker correlations in such protein folding dynamics.
Polymer Uncrossing and Knotting in Protein Folding, and Their Role in Minimal Folding Pathways

PubMed Central

Mohazab, Ali R.; Plotkin, Steven S.

2013-01-01

We introduce a method for calculating the extent to which chain non-crossing is important in the most efficient, optimal trajectories or pathways for a protein to fold. This involves recording all unphysical crossing events of a ghost chain, and calculating the minimal uncrossing cost that would have been required to avoid such events. A depth-first tree search algorithm is applied to find minimal transformations to fold , , , and knotted proteins. In all cases, the extra uncrossing/non-crossing distance is a small fraction of the total distance travelled by a ghost chain. Different structural classes may be distinguished by the amount of extra uncrossing distance, and the effectiveness of such discrimination is compared with other order parameters. It was seen that non-crossing distance over chain length provided the best discrimination between structural and kinetic classes. The scaling of non-crossing distance with chain length implies an inevitable crossover to entanglement-dominated folding mechanisms for sufficiently long chains. We further quantify the minimal folding pathways by collecting the sequence of uncrossing moves, which generally involve leg, loop, and elbow-like uncrossing moves, and rendering the collection of these moves over the unfolded ensemble as a multiple-transformation “alignment”. The consensus minimal pathway is constructed and shown schematically for representative cases of an , , and knotted protein. An overlap parameter is defined between pathways; we find that proteins have minimal overlap indicating diverse folding pathways, knotted proteins are highly constrained to follow a dominant pathway, and proteins are somewhere in between. Thus we have shown how topological chain constraints can induce dominant pathway mechanisms in protein folding. PMID:23365638
A strategy for detecting the conservation of folding-nucleus residues in protein superfamilies.

PubMed

Michnick, S W; Shakhnovich, E

1998-01-01

Nucleation-growth theory predicts that fast-folding peptide sequences fold to their native structure via structures in a transition-state ensemble that share a small number of native contacts (the folding nucleus). Experimental and theoretical studies of proteins suggest that residues participating in folding nuclei are conserved among homologs. We attempted to determine if this is true in proteins with highly diverged sequences but identical folds (superfamilies). We describe a strategy based on comparisons of residue conservation in natural superfamily sequences with simulated sequences (generated with a Monte-Carlo sequence design strategy) for the same proteins. The basic assumptions of the strategy were that natural sequences will conserve residues needed for folding and stability plus function, the simulated sequences contain no functional conservation, and nucleus residues make native contacts with each other. Based on these assumptions, we identified seven potential nucleus residues in ubiquitin superfamily members. Non-nucleus conserved residues were also identified; these are proposed to be involved in stabilizing native interactions. We found that all superfamily members conserved the same potential nucleus residue positions, except those for which the structural topology is significantly different. Our results suggest that the conservation of the nucleus of a specific fold can be predicted by comparing designed simulated sequences with natural highly diverged sequences that fold to the same structure. We suggest that such a strategy could be used to help plan protein folding and design experiments, to identify new superfamily members, and to subdivide superfamilies further into classes having a similar folding mechanism.

A thermodynamic definition of protein domains.

PubMed

Porter, Lauren L; Rose, George D

2012-06-12

Protein domains are conspicuous structural units in globular proteins, and their identification has been a topic of intense biochemical interest dating back to the earliest crystal structures. Numerous disparate domain identification algorithms have been proposed, all involving some combination of visual intuition and/or structure-based decomposition. Instead, we present a rigorous, thermodynamically-based approach that redefines domains as cooperative chain segments. In greater detail, most small proteins fold with high cooperativity, meaning that the equilibrium population is dominated by completely folded and completely unfolded molecules, with a negligible subpopulation of partially folded intermediates. Here, we redefine structural domains in thermodynamic terms as cooperative folding units, based on m-values, which measure the cooperativity of a protein or its substructures. In our analysis, a domain is equated to a contiguous segment of the folded protein whose m-value is largely unaffected when that segment is excised from its parent structure. Defined in this way, a domain is a self-contained cooperative unit; i.e., its cooperativity depends primarily upon intrasegment interactions, not intersegment interactions. Implementing this concept computationally, the domains in a large representative set of proteins were identified; all exhibit consistency with experimental findings. Specifically, our domain divisions correspond to the experimentally determined equilibrium folding intermediates in a set of nine proteins. The approach was also proofed against a representative set of 71 additional proteins, again with confirmatory results. Our reframed interpretation of a protein domain transforms an indeterminate structural phenomenon into a quantifiable molecular property grounded in solution thermodynamics.
Complete Reversible Refolding of a G-Protein Coupled Receptor on a Solid Support

PubMed Central

Di Bartolo, Natalie; Compton, Emma L. R.; Warne, Tony; Edwards, Patricia C.; Tate, Christopher G.; Schertler, Gebhard F. X.; Booth, Paula J.

2016-01-01

The factors defining the correct folding and stability of integral membrane proteins are poorly understood. Folding of only a few select membrane proteins has been scrutinised, leaving considerable deficiencies in knowledge for large protein families, such as G protein coupled receptors (GPCRs). Complete reversible folding, which is problematic for any membrane protein, has eluded this dominant receptor family. Moreover, attempts to recover receptors from denatured states are inefficient, yielding at best 40–70% functional protein. We present a method for the reversible unfolding of an archetypal family member, the β1-adrenergic receptor, and attain 100% recovery of the folded, functional state, in terms of ligand binding, compared to receptor which has not been subject to any unfolding and retains its original, folded structure. We exploit refolding on a solid support, which could avoid unwanted interactions and aggregation that occur in bulk solution. We determine the changes in structure and function upon unfolding and refolding. Additionally, we employ a method that is relatively new to membrane protein folding; pulse proteolysis. Complete refolding of β1-adrenergic receptor occurs in n-decyl-β-D-maltoside (DM) micelles from a urea-denatured state, as shown by regain of its original helical structure, ligand binding and protein fluorescence. The successful refolding strategy on a solid support offers a defined method for the controlled refolding and recovery of functional GPCRs and other membrane proteins that suffer from instability and irreversible denaturation once isolated from their native membranes. PMID:26982879
Frustration in Condensed Matter and Protein Folding

NASA Astrophysics Data System (ADS)

Lorelli, S.; Cabot, A.; Sundarprasad, N.; Boekema, C.

Using computer modeling we study frustration in condensed matter and protein folding. Frustration is due to random and/or competing interactions. One definition of frustration is the sum of squares of the differences between actual and expected distances between characters. If this sum is non-zero, then the system is said to have frustration. A simulation tracks the movement of characters to lower their frustration. Our research is conducted on frustration as a function of temperature using a logarithmic scale. At absolute zero, the relaxation for frustration is a power function for randomly assigned patterns or an exponential function for regular patterns like Thomson figures. These findings have implications for protein folding; we attempt to apply our frustration modeling to protein folding and dynamics. We use coding in Python to simulate different ways a protein can fold. An algorithm is being developed to find the lowest frustration (and thus energy) states possible. Research supported by SJSU & AFC.
Enhanced Wang Landau sampling of adsorbed protein conformations.

PubMed

Radhakrishna, Mithun; Sharma, Sumit; Kumar, Sanat K

2012-03-21

Using computer simulations to model the folding of proteins into their native states is computationally expensive due to the extraordinarily low degeneracy of the ground state. In this paper, we develop an efficient way to sample these folded conformations using Wang Landau sampling coupled with the configurational bias method (which uses an unphysical "temperature" that lies between the collapse and folding transition temperatures of the protein). This method speeds up the folding process by roughly an order of magnitude over existing algorithms for the sequences studied. We apply this method to study the adsorption of intrinsically disordered hydrophobic polar protein fragments on a hydrophobic surface. We find that these fragments, which are unstructured in the bulk, acquire secondary structure upon adsorption onto a strong hydrophobic surface. Apparently, the presence of a hydrophobic surface allows these random coil fragments to fold by providing hydrophobic contacts that were lost in protein fragmentation. © 2012 American Institute of Physics
Molecular Origins of Internal Friction Effects on Protein Folding Rates

PubMed Central

Sirur, Anshul

2014-01-01

Recent experiments on protein folding dynamics have revealed strong evidence for internal friction effects. That is, observed relaxation times are not simply proportional to the solvent viscosity as might be expected if the solvent were the only source of friction. However, a molecular interpretation of this remarkable phenomenon is currently lacking. Here, we use all-atom simulations of peptide and protein folding in explicit solvent, to probe the origin of the unusual viscosity dependence. We find that an important contribution to this effect, explaining the viscosity dependence of helix formation and the folding of a helix-containing protein, is the insensitivity of torsion angle isomerization to solvent friction. The influence of this landscape roughness can, in turn, be quantitatively explained by a rate theory including memory friction. This insensitivity of local barrier crossing to solvent friction is expected to contribute to the viscosity dependence of folding rates in larger proteins. PMID:24986114
Predicting protein folding rate change upon point mutation using residue-level coevolutionary information.

PubMed

Mallik, Saurav; Das, Smita; Kundu, Sudip

2016-01-01

Change in folding kinetics of globular proteins upon point mutation is crucial to a wide spectrum of biological research, such as protein misfolding, toxicity, and aggregations. Here we seek to address whether residue-level coevolutionary information of globular proteins can be informative to folding rate changes upon point mutations. Generating residue-level coevolutionary networks of globular proteins, we analyze three parameters: relative coevolution order (rCEO), network density (ND), and characteristic path length (CPL). A point mutation is considered to be equivalent to a node deletion of this network and respective percentage changes in rCEO, ND, CPL are found linearly correlated (0.84, 0.73, and -0.61, respectively) with experimental folding rate changes. The three parameters predict the folding rate change upon a point mutation with 0.031, 0.045, and 0.059 standard errors, respectively. © 2015 Wiley Periodicals, Inc.
Molecular origins of internal friction effects on protein-folding rates.

PubMed

de Sancho, David; Sirur, Anshul; Best, Robert B

2014-07-02

Recent experiments on protein-folding dynamics have revealed strong evidence for internal friction effects. That is, observed relaxation times are not simply proportional to the solvent viscosity as might be expected if the solvent were the only source of friction. However, a molecular interpretation of this remarkable phenomenon is currently lacking. Here, we use all-atom simulations of peptide and protein folding in explicit solvent, to probe the origin of the unusual viscosity dependence. We find that an important contribution to this effect, explaining the viscosity dependence of helix formation and the folding of a helix-containing protein, is the insensitivity of torsion angle isomerization to solvent friction. The influence of this landscape roughness can, in turn, be quantitatively explained by a rate theory including memory friction. This insensitivity of local barrier crossing to solvent friction is expected to contribute to the viscosity dependence of folding rates in larger proteins.
Structure of a Trypanosoma Brucei Alpha/Beta--Hydrolase Fold Protein With Unknown Function

DOE Office of Scientific and Technical Information (OSTI.GOV)

Merritt, E.A.; Holmes, M.; Buckner, F.S.

2009-05-26

The structure of a structural genomics target protein, Tbru020260AAA from Trypanosoma brucei, has been determined to a resolution of 2.2 {angstrom} using multiple-wavelength anomalous diffraction at the Se K edge. This protein belongs to Pfam sequence family PF08538 and is only distantly related to previously studied members of the {alpha}/{beta}-hydrolase fold family. Structural superposition onto representative {alpha}/{beta}-hydrolase fold proteins of known function indicates that a possible catalytic nucleophile, Ser116 in the T. brucei protein, lies at the expected location. However, the present structure and by extension the other trypanosomatid members of this sequence family have neither sequence nor structural similaritymore » at the location of other active-site residues typical for proteins with this fold. Together with the presence of an additional domain between strands {beta}6 and {beta}7 that is conserved in trypanosomatid genomes, this suggests that the function of these homologs has diverged from other members of the fold family.« less
STN1 OB Fold Mutation Alters DNA Binding and Affects Selective Aspects of CST Function

PubMed Central

Bhattacharjee, Anukana; Stewart, Jason; Chaiken, Mary; Price, Carolyn M.

2016-01-01

Mammalian CST (CTC1-STN1-TEN1) participates in multiple aspects of telomere replication and genome-wide recovery from replication stress. CST resembles Replication Protein A (RPA) in that it binds ssDNA and STN1 and TEN1 are structurally similar to RPA2 and RPA3. Conservation between CTC1 and RPA1 is less apparent. Currently the mechanism underlying CST action is largely unknown. Here we address CST mechanism by using a DNA-binding mutant, (STN1 OB-fold mutant, STN1-OBM) to examine the relationship between DNA binding and CST function. In vivo, STN1-OBM affects resolution of endogenous replication stress and telomere duplex replication but telomeric C-strand fill-in and new origin firing after exogenous replication stress are unaffected. These selective effects indicate mechanistic differences in CST action during resolution of different replication problems. In vitro binding studies show that STN1 directly engages both short and long ssDNA oligonucleotides, however STN1-OBM preferentially destabilizes binding to short substrates. The finding that STN1-OBM affects binding to only certain substrates starts to explain the in vivo separation of function observed in STN1-OBM expressing cells. CST is expected to engage DNA substrates of varied length and structure as it acts to resolve different replication problems. Since STN1-OBM will alter CST binding to only some of these substrates, the mutant should affect resolution of only a subset of replication problems, as was observed in the STN1-OBM cells. The in vitro studies also provide insight into CST binding mechanism. Like RPA, CST likely contacts DNA via multiple OB folds. However, the importance of STN1 for binding short substrates indicates differences in the architecture of CST and RPA DNA-protein complexes. Based on our results, we propose a dynamic DNA binding model that provides a general mechanism for CST action at diverse forms of replication stress. PMID:27690379
Crystallization of isoelectrically homogeneous cholera toxin

DOE Office of Scientific and Technical Information (OSTI.GOV)

Spangler, B.D.; Westbrook, E.M.

1989-02-07

Past difficulty in growing good crystals of cholera toxin has prevented the study of the crystal structure of this important protein. The authors have determined that failure of cholera toxin to crystallize well has been due to its heterogeneity. They have now succeeded in overcoming the problem by isolating a single isoelectric variant of this oligomeric protein (one A subunit and five B subunits). Cholera toxin purified by their procedure readily forms large single crystals. The crystal form has been described previously. They have recorded data from native crystals of cholera toxin to 3.0-{angstrom} resolution with our electronic area detectors.more » With these data, they have found the orientation of a 5-fold symmetry axis within these crystals, perpendicular to the screw dyad of the crystal. They are now determining the crystal structure of cholera toxin by a combination of multiple heavy-atom isomorphous replacement and density modification techniques, making use of rotational 5-fold averaging of the B subunits.« less
Robustness of multidimensional Brownian ratchets as directed transport mechanisms.

PubMed

González-Candela, Ernesto; Romero-Rochín, Víctor; Del Río, Fernando

2011-08-07

Brownian ratchets have recently been considered as models to describe the ability of certain systems to locate very specific states in multidimensional configuration spaces. This directional process has particularly been proposed as an alternative explanation for the protein folding problem, in which the polypeptide is driven toward the native state by a multidimensional Brownian ratchet. Recognizing the relevance of robustness in biological systems, in this work we analyze such a property of Brownian ratchets by pushing to the limits all the properties considered essential to produce directed transport. Based on the results presented here, we can state that Brownian ratchets are able to deliver current and locate funnel structures under a wide range of conditions. As a result, they represent a simple model that solves the Levinthal's paradox with great robustness and flexibility and without requiring any ad hoc biased transition probability. The behavior of Brownian ratchets shown in this article considerably enhances the plausibility of the model for at least part of the structural mechanism behind protein folding process.
Chevron Behavior and Isostable Enthalpic Barriers in Protein Folding: Successes and Limitations of Simple Gō-like Modeling

PubMed Central

Kaya, Hüseyin; Liu, Zhirong; Chan, Hue Sun

2005-01-01

It has been demonstrated that a “near-Levinthal” cooperative mechanism, whereby the common Gō interaction scheme is augmented by an extra favorability for the native state as a whole, can lead to apparent two-state folding/unfolding kinetics over a broad range of native stabilities in lattice models of proteins. Here such a mechanism is shown to be generalizable to a simplified continuum (off-lattice) Langevin dynamics model with a Cα protein chain representation, with the resulting chevron plots exhibiting an extended quasilinear regime reminiscent of that of apparent two-state real proteins. Similarly high degrees of cooperativity are possible in Gō-like continuum models with rudimentary pairwise desolvation barriers as well. In these models, cooperativity increases with increasing desolvation barrier height, suggesting strongly that two-state-like folding/unfolding kinetics would be achievable when the pairwise desolvation barrier becomes sufficiently high. Besides cooperativity, another generic folding property of interest that has emerged from published experiments on several apparent two-state proteins is that their folding relaxation under constant native stability (isostability) conditions is essentially Arrhenius, entailing high intrinsic enthalpic folding barriers of ∼17–30 kcal/mol. Based on a new analysis of published data on barnase, here we propose that a similar property should also apply to a certain class of non-two-state proteins that fold with chevron rollovers. However, several continuum Gō-like constructs considered here fail to predict any significant intrinsic enthalpic folding barrier under isostability conditions; thus the physical origin of such barriers in real proteins remains to be elucidated. PMID:15863486
Equilibrium and kinetic folding of rabbit muscle triosephosphate isomerase by hydrogen exchange mass spectrometry.

PubMed

Pan, Hai; Raza, Ashraf S; Smith, David L

2004-03-05

Unfolding and refolding of rabbit muscle triosephosphate isomerase (TIM), a model for (betaalpha)8-barrel proteins, has been studied by amide hydrogen exchange/mass spectrometry. Unfolding was studied by destabilizing the protein in guanidine hydrochloride (GdHCl) or urea, pulse-labeling with 2H2O and analyzing the intact protein by HPLC electrospray ionization mass spectrometry. Bimodal isotope patterns were found in the mass spectra of the labeled protein, indicating two-state unfolding behavior. Refolding experiments were performed by diluting solutions of TIM unfolded in GdHCl or urea and pulse-labeling with 2H2O at different times. Mass spectra of the intact protein labeled after one to two minutes had three envelopes of isotope peaks, indicating population of an intermediate. Kinetic modeling indicates that the stability of the folding intermediate in water is only 1.5 kcal/mol. Failure to detect the intermediate in the unfolding experiments was attributed to its low stability and the high concentrations of denaturant required for unfolding experiments. The folding status of each segment of the polypeptide backbone was determined from the deuterium levels found in peptic fragments of the labeled protein. Analysis of these spectra showed that the C-terminal half folds to form the intermediate, which then forms native TIM with folding of the N-terminal half. These results show that TIM folding fits the (4+4) model for folding of (betaalpha)8-barrel proteins. Results of a double-jump experiment indicate that proline isomerization does not contribute to the rate-limiting step in the folding of TIM.
On the origins of the weak folding cooperativity of a designed ββα ultrafast protein FSD-1.

PubMed

Wu, Chun; Shea, Joan-Emma

2010-11-18

FSD-1, a designed small ultrafast folder with a ββα fold, has been actively studied in the last few years as a model system for studying protein folding mechanisms and for testing of the accuracy of computational models. The suitability of this protein to describe the folding of naturally occurring α/β proteins has recently been challenged based on the observation that the melting transition is very broad, with ill-resolved baselines. Using molecular dynamics simulations with the AMBER protein force field (ff96) coupled with the implicit solvent model (IGB = 5), we shed new light into the nature of this transition and resolve the experimental controversies. We show that the melting transition corresponds to the melting of the protein as a whole, and not solely to the helix-coil transition. The breadth of the folding transition arises from the spread in the melting temperatures (from ∼325 K to ∼302 K) of the individual transitions: formation of the hydrophobic core, β-hairpin and tertiary fold, with the helix formed earlier. Our simulations initiated from an extended chain accurately predict the native structure, provide a reasonable estimate of the transition barrier height, and explicitly demonstrate the existence of multiple pathways and multiple transition states for folding. Our exhaustive sampling enables us to assess the quality of the Amber ff96/igb5 combination and reveals that while this force field can predict the correct native fold, it nonetheless overstabilizes the α-helix portion of the protein (Tm = ∼387K) as well as the denatured structures.
Protein structure-structure alignment with discrete Fréchet distance.

PubMed

Jiang, Minghui; Xu, Ying; Zhu, Binhai

2008-02-01

Matching two geometric objects in two-dimensional (2D) and three-dimensional (3D) spaces is a central problem in computer vision, pattern recognition, and protein structure prediction. In particular, the problem of aligning two polygonal chains under translation and rotation to minimize their distance has been studied using various distance measures. It is well known that the Hausdorff distance is useful for matching two point sets, and that the Fréchet distance is a superior measure for matching two polygonal chains. The discrete Fréchet distance closely approximates the (continuous) Fréchet distance, and is a natural measure for the geometric similarity of the folded 3D structures of biomolecules such as proteins. In this paper, we present new algorithms for matching two polygonal chains in two dimensions to minimize their discrete Fréchet distance under translation and rotation, and an effective heuristic for matching two polygonal chains in three dimensions. We also describe our empirical results on the application of the discrete Fréchet distance to protein structure-structure alignment.
Folding anomalies of neuroligin3 caused by a mutation in the alpha/beta-hydrolase fold domain.

PubMed

De Jaco, Antonella; Dubi, Noga; Comoletti, Davide; Taylor, Palmer

2010-09-06

Proteins of the alpha/beta-hydrolase fold family share a common structural fold, but perform a diverse set of functions. We have been studying natural mutations occurring in association with congenital disorders in the alpha/beta-hydrolase fold domain of neuroligin (NLGN), butyrylcholinesterase (BChE), acetylcholinesterase (AChE). Starting from the autism-related R451C mutation in the alpha/beta-hydrolase fold domain of NLGN3, we had previously shown that the Arg to Cys substitution is responsible for endoplasmic reticulum (ER) retention of the mutant protein and that a similar trafficking defect is observed when the mutation is inserted at the homologous positions in AChE and BChE. Herein we show further characterization of the R451C mutation in NLGN3 when expressed in HEK-293, and by protease digestion sensitivity, we reveal that the phenotype results from protein misfolding. However, the presence of an extra Cys does not interfere with the formation of disulfide bonds as shown by reaction with PEG-maleimide and estimation of the molecular mass changes. These findings highlight the role of proper protein folding in protein processing and localization. Copyright (c) 2010 Elsevier Ireland Ltd. All rights reserved.
FOLDING ANOMALIES OF NEUROLIGIN3 CAUSED BY A MUTATION IN THE α/β-HYDROLASE FOLD DOMAIN

PubMed Central

De Jaco, Antonella; Dubi, Noga; Comoletti, Davide; Taylor, Palmer

2017-01-01

Proteins of the α/β-hydrolase fold family share a common structural fold, but perform a diverse set of functions. We have been studying natural mutations occurring in association with congenital disorders in the α/β-hydrolase fold domain of neuroligin (NLGN), butyrylcholinesterase (BChE), acetylcholinesterase (AChE). Starting from the autism-related R451C mutation in the α/β-hydrolase fold domain of NLGN3, we had previously shown that the Arg to Cys substitution is responsible for endoplasmic reticulum (ER) retention of the mutant protein and that a similar trafficking defect is observed when the mutation is inserted at the homologous positions in AChE and BChE. Herein we show further characterization of the R451C mutation in NLGN3 when expressed in HEK-293, and by protease digestion sensitivity, we reveal that the phenotype results from protein misfolding. However, the presence of an extra Cys doesn’t interfere with the formation of disulfide bonds as shown by reaction with PEG-maleimide and estimation of the molecular mass changes. These findings highlight the role of proper protein folding in protein processing and localization. PMID:20227402
Topography of funneled landscapes determines the thermodynamics and kinetics of protein folding

PubMed Central

Wang, Jin; Oliveira, Ronaldo J.; Chu, Xiakun; Whitford, Paul C.; Chahine, Jorge; Han, Wei; Wang, Erkang; Onuchic, José N.; Leite, Vitor B.P.

2012-01-01

The energy landscape approach has played a fundamental role in advancing our understanding of protein folding. Here, we quantify protein folding energy landscapes by exploring the underlying density of states. We identify three quantities essential for characterizing landscape topography: the stabilizing energy gap between the native and nonnative ensembles δE, the energetic roughness ΔE, and the scale of landscape measured by the entropy S. We show that the dimensionless ratio between the gap, roughness, and entropy of the system accurately predicts the thermodynamics, as well as the kinetics of folding. Large Λ implies that the energy gap (or landscape slope towards the native state) is dominant, leading to more funneled landscapes. We investigate the role of topological and energetic roughness for proteins of different sizes and for proteins of the same size, but with different structural topologies. The landscape topography ratio Λ is shown to be monotonically correlated with the thermodynamic stability against trapping, as characterized by the ratio of folding temperature versus trapping temperature. Furthermore, Λ also monotonically correlates with the folding kinetic rates. These results provide the quantitative bridge between the landscape topography and experimental folding measurements. PMID:23019359
Disulfide bonds in ER protein folding and homeostasis

PubMed Central

Feige, Matthias J.; Hendershot, Linda M.

2010-01-01

Proteins that are expressed outside the cell must be synthesized, folded and assembled in a way that ensures they can function in their designate location. Accordingly these proteins are primarily synthesized in the endoplasmic reticulum (ER), which has developed a chemical environment more similar to that outside the cell. This organelle is equipped with a variety of molecular chaperones and folding enzymes that both assist the folding process, while at the same time exerting tight quality control measures that are largely absent outside the cell. A major post-translational modification of ER-synthesized proteins is disulfide bridge formation, which is catalyzed by the family of protein disulfide isomerases. As this covalent modification provides unique structural advantages to extracellular proteins, multiple pathways to their formation have evolved. However, the advantages that disulfide bonds impart to these proteins come at a high cost to the cell. Very recent reports have shed light on how the cell can deal with or even exploit the side reactions of disulfide bond formation to maintain homeostasis of the ER and its folding machinery. PMID:21144725
Rational design of p53, an intrinsically unstructured protein, for the fabrication of novel molecular sensors.

PubMed

Geddie, Melissa L; O'Loughlin, Taryn L; Woods, Kristen K; Matsumura, Ichiro

2005-10-21

The dominant paradigm of protein engineering is structure-based site-directed mutagenesis. This rational approach is generally more effective for the engineering of local properties, such as substrate specificity, than global ones such as allostery. Previous workers have modified normally unregulated reporter enzymes, including beta-galactosidase, alkaline phosphatase, and beta-lactamase, so that the engineered versions are activated (up to 4-fold) by monoclonal antibodies. A reporter that could easily be "reprogrammed" for the facile detection of novel effectors (binding or modifying activities) would be useful in high throughput screens for directed evolution or drug discovery. Here we describe a straightforward and general solution to this potentially difficult design problem. The transcription factor p53 is normally regulated by a variety of post-translational modifications. The insertion of peptides into intrinsically unstructured domains of p53 generated variants that were activated up to 100-fold by novel effectors (proteases or antibodies). An engineered p53 was incorporated into an existing high throughput screen for the detection of human immunodeficiency virus protease, an arbitrarily chosen novel effector. These results suggest that the molecular recognition properties of intrinsically unstructured proteins are relatively easy to engineer and that the absence of crystal structures should not deter the rational engineering of this class of proteins.

Multiscale Simulations of Protein Landscapes: Using Coarse Grained Models as Reference Potentials to Full Explicit Models

PubMed Central

Messer, Benjamin M.; Roca, Maite; Chu, Zhen T.; Vicatos, Spyridon; Kilshtain, Alexandra Vardi; Warshel, Arieh

2009-01-01

Evaluating the free energy landscape of proteins and the corresponding functional aspects presents a major challenge for computer simulation approaches. This challenge is due to the complexity of the landscape and the enormous computer time needed for converging simulations. The use of simplified coarse grained (CG) folding models offers an effective way of sampling the landscape but such a treatment, however, may not give the correct description of the effect of the actual protein residues. A general way around this problem that has been put forward in our early work (Fan et al, Theor Chem Acc (1999) 103:77-80) uses the CG model as a reference potential for free energy calculations of different properties of the explicit model. This method is refined and extended here, focusing on improving the electrostatic treatment and on demonstrating key applications. This application includes: evaluation of changes of folding energy upon mutations, calculations of transition states binding free energies (which are crucial for rational enzyme design), evaluation of catalytic landscape and simulation of the time dependent responses to pH changes. Furthermore, the general potential of our approach in overcoming major challenges in studies of structure function correlation in proteins is discussed. PMID:20052756
Frnakenstein: multiple target inverse RNA folding.

PubMed

Lyngsø, Rune B; Anderson, James W J; Sizikova, Elena; Badugu, Amarendra; Hyland, Tomas; Hein, Jotun

2012-10-09

RNA secondary structure prediction, or folding, is a classic problem in bioinformatics: given a sequence of nucleotides, the aim is to predict the base pairs formed in its three dimensional conformation. The inverse problem of designing a sequence folding into a particular target structure has only more recently received notable interest. With a growing appreciation and understanding of the functional and structural properties of RNA motifs, and a growing interest in utilising biomolecules in nano-scale designs, the interest in the inverse RNA folding problem is bound to increase. However, whereas the RNA folding problem from an algorithmic viewpoint has an elegant and efficient solution, the inverse RNA folding problem appears to be hard. In this paper we present a genetic algorithm approach to solve the inverse folding problem. The main aims of the development was to address the hitherto mostly ignored extension of solving the inverse folding problem, the multi-target inverse folding problem, while simultaneously designing a method with superior performance when measured on the quality of designed sequences. The genetic algorithm has been implemented as a Python program called Frnakenstein. It was benchmarked against four existing methods and several data sets totalling 769 real and predicted single structure targets, and on 292 two structure targets. It performed as well as or better at finding sequences which folded in silico into the target structure than all existing methods, without the heavy bias towards CG base pairs that was observed for all other top performing methods. On the two structure targets it also performed well, generating a perfect design for about 80% of the targets. Our method illustrates that successful designs for the inverse RNA folding problem does not necessarily have to rely on heavy biases in base pair and unpaired base distributions. The design problem seems to become more difficult on larger structures when the target structures are real structures, while no deterioration was observed for predicted structures. Design for two structure targets is considerably more difficult, but far from impossible, demonstrating the feasibility of automated design of artificial riboswitches. The Python implementation is available at http://www.stats.ox.ac.uk/research/genome/software/frnakenstein.
Frnakenstein: multiple target inverse RNA folding

PubMed Central

2012-01-01

Background RNA secondary structure prediction, or folding, is a classic problem in bioinformatics: given a sequence of nucleotides, the aim is to predict the base pairs formed in its three dimensional conformation. The inverse problem of designing a sequence folding into a particular target structure has only more recently received notable interest. With a growing appreciation and understanding of the functional and structural properties of RNA motifs, and a growing interest in utilising biomolecules in nano-scale designs, the interest in the inverse RNA folding problem is bound to increase. However, whereas the RNA folding problem from an algorithmic viewpoint has an elegant and efficient solution, the inverse RNA folding problem appears to be hard. Results In this paper we present a genetic algorithm approach to solve the inverse folding problem. The main aims of the development was to address the hitherto mostly ignored extension of solving the inverse folding problem, the multi-target inverse folding problem, while simultaneously designing a method with superior performance when measured on the quality of designed sequences. The genetic algorithm has been implemented as a Python program called Frnakenstein. It was benchmarked against four existing methods and several data sets totalling 769 real and predicted single structure targets, and on 292 two structure targets. It performed as well as or better at finding sequences which folded in silico into the target structure than all existing methods, without the heavy bias towards CG base pairs that was observed for all other top performing methods. On the two structure targets it also performed well, generating a perfect design for about 80% of the targets. Conclusions Our method illustrates that successful designs for the inverse RNA folding problem does not necessarily have to rely on heavy biases in base pair and unpaired base distributions. The design problem seems to become more difficult on larger structures when the target structures are real structures, while no deterioration was observed for predicted structures. Design for two structure targets is considerably more difficult, but far from impossible, demonstrating the feasibility of automated design of artificial riboswitches. The Python implementation is available at http://www.stats.ox.ac.uk/research/genome/software/frnakenstein. PMID:23043260
Dynamics of partially folded and unfolded proteins investigated with quasielastic neutron spectroscopy

NASA Astrophysics Data System (ADS)

Stadler, Andreas M.

2018-05-01

Molecular dynamics in proteins animate and play a vital role for biologically relevant processes of these biomacromolecules. Quasielastic incoherent neutron scattering (QENS) is a well-suited experimental method to study protein dynamics from the picosecond to several nanoseconds and in the Ångström length-scale. In QENS experiments of protein solutions hydrogens act as reporters for the motions of methyl groups or amino acids to which they are bound. Neutron Spin-Echo spectroscopy (NSE) offers the highest energy resolution in the field of neutron spectroscopy and allows the study of slow collective motions in proteins up to several hundred nanoseconds and in the nanometer length-scale. In the following manuscript I will review recent studies that stress the relevance of molecular dynamics for protein folding and for conformational transitions of intrinsically disordered proteins (IDPs). During the folding collapse the protein is exploring its accessible conformational space via molecular motions. A large flexibility of partially folded and unfolded proteins, therefore, is mandatory for rapid protein folding. IDPs are a special case as they are largely unstructured under physiological conditions. A large flexibility is a characteristic property of IDPs as it allows, for example, the interaction with various binding partners or the rapid response to different conditions.
Isolation, folding and structural investigations of the amino acid transporter OEP16

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ni, Da Qun; Zook, James; Klewer, Douglas A.

2011-12-01

Membrane proteins compose more than 30% of all proteins in the living cell. However, many membrane proteins have low abundance in the cell and cannot be isolated from natural sources in concentrations suitable for structure analysis. The overexpression, reconstitution, and stabilization of membrane proteins are complex and remain a formidable challenge in membrane protein characterization. Here we describe a novel, in vitro folding procedure for a cation-selective channel protein, the outer envelope membrane protein 16 (OEP16) of pea chloroplast, overexpressed in Escherichia coli in the form of inclusion bodies. The protein is purified and then folded with detergent on amore » Ni-NTA affinity column. Final concentrations of reconstituted OEP16 of up to 24 mg/ml have been achieved, which provides samples that are sufficient for structural studies by NMR and crystallography. Reconstitution of OEP16 in detergent micelles was monitored by circular dichroism, fluorescence, and NMR spectroscopy. Tryptophan fluorescence spectra of heterologous expressed OEP16 in micelles are similar to spectra of functionally active OEP16 in liposomes, which indicates folding of the membrane protein in detergent micelles. CD spectroscopy studies demonstrate a folded protein consisting primarily of a-helices. 15N-HSQC NMR spectra also provide evidence for a folded protein. We present here a convenient, effective and quantitative method to screen large numbers of conditions for optimal protein stability by using microdialysis chambers in combination with fluorescence spectroscopy. Recent collection of multidimensional NMR data at 500, 600 and 800 MHz demonstrated that the protein is suitable for structure determination by NMR and stable for weeks during data collection.« less
Isolation, folding and structural investigations of the amino acid transporter OEP16.

PubMed

Ni, Da Qun; Zook, James; Klewer, Douglas A; Nieman, Ronald A; Soll, J; Fromme, Petra

2011-12-01

Membrane proteins compose more than 30% of all proteins in the living cell. However, many membrane proteins have low abundance in the cell and cannot be isolated from natural sources in concentrations suitable for structure analysis. The overexpression, reconstitution, and stabilization of membrane proteins are complex and remain a formidable challenge in membrane protein characterization. Here we describe a novel, in vitro folding procedure for a cation-selective channel protein, the outer envelope membrane protein 16 (OEP16) of pea chloroplast, overexpressed in Escherichia coli in the form of inclusion bodies. The protein is purified and then folded with detergent on a Ni-NTA affinity column. Final concentrations of reconstituted OEP16 of up to 24 mg/ml have been achieved, which provides samples that are sufficient for structural studies by NMR and crystallography. Reconstitution of OEP16 in detergent micelles was monitored by circular dichroism, fluorescence, and NMR spectroscopy. Tryptophan fluorescence spectra of heterologous expressed OEP16 in micelles are similar to spectra of functionally active OEP16 in liposomes, which indicates folding of the membrane protein in detergent micelles. CD spectroscopy studies demonstrate a folded protein consisting primarily of α-helices. ¹⁵N-HSQC NMR spectra also provide evidence for a folded protein. We present here a convenient, effective and quantitative method to screen large numbers of conditions for optimal protein stability by using microdialysis chambers in combination with fluorescence spectroscopy. Recent collection of multidimensional NMR data at 500, 600 and 800 MHz demonstrated that the protein is suitable for structure determination by NMR and stable for weeks during data collection. Copyright © 2011. Published by Elsevier Inc.
M1 RNA is important for the in-cell solubility of its cognate C5 protein: Implications for RNA-mediated protein folding

PubMed Central

Son, Ahyun; Choi, Seong Il; Han, Gyoonhee; Seong, Baik L

2015-01-01

It is one of the fundamental questions in biology how proteins efficiently fold into their native conformations despite off-pathway events such as misfolding and aggregation in living cells. Although molecular chaperones have been known to assist the de novo folding of certain types of proteins, the role of a binding partner (or a ligand) in the folding and in-cell solubility of its interacting protein still remains poorly defined. RNase P is responsible for the maturation of tRNAs as adaptor molecules of amino acids in ribosomal protein synthesis. The RNase P from Escherichia coli, composed of M1 RNA and C5 protein, is a prototypical ribozyme in which the RNA subunit contains the catalytic activity. Using E. coli RNase P, we demonstrate that M1 RNA plays a pivotal role in the in-cell solubility of C5 protein both in vitro and in vivo. Mutations in either the C5 protein or M1 RNA that affect their interactions significantly abolished the folding of C5 protein. Moreover, we find that M1 RNA provides quality insurance of interacting C5 protein, either by promoting the degradation of C5 mutants in the presence of functional proteolytic machinery, or by abolishing their solubility if the machinery is non-functional. Our results describe a crucial role of M1 RNA in the folding, in-cell solubility, and, consequently, the proteostasis of the client C5 protein, giving new insight into the biological role of RNAs as chaperones and mediators that ensure the quality of interacting proteins. PMID:26517763
Practical Approaches to Protein Folding and Assembly

PubMed Central

Walters, Jad; Milam, Sara L.; Clark, A. Clay

2009-01-01

We describe here the use of several spectroscopies, such as fluorescence emission, circular dichroism, and differential quenching by acrylamide, in examining the equilibrium and kinetic folding of proteins. The first section regarding equilibrium techniques provides practical information for determining the conformational stability of a protein. In addition, several equilibrium-folding models are discussed, from two-state monomer to four-state homodimer, providing a comprehensive protocol for interpretation of folding curves. The second section focuses on the experimental design and interpretation of kinetic data, such as burst-phase analysis and exponential fits, used in elucidating kinetic folding pathways. In addition, simulation programs are used routinely to support folding models generated by kinetic experiments, and the fundamentals of simulations are covered. PMID:19289201
Probabilistic analysis for identifying the driving force of protein folding

NASA Astrophysics Data System (ADS)

Tokunaga, Yoshihiko; Yamamori, Yu; Matubayasi, Nobuyuki

2018-03-01

Toward identifying the driving force of protein folding, energetics was analyzed in water for Trp-cage (20 residues), protein G (56 residues), and ubiquitin (76 residues) at their native (folded) and heat-denatured (unfolded) states. All-atom molecular dynamics simulation was conducted, and the hydration effect was quantified by the solvation free energy. The free-energy calculation was done by employing the solution theory in the energy representation, and it was seen that the sum of the protein intramolecular (structural) energy and the solvation free energy is more favorable for a folded structure than for an unfolded one generated by heat. Probabilistic arguments were then developed to determine which of the electrostatic, van der Waals, and excluded-volume components of the interactions in the protein-water system governs the relative stabilities between the folded and unfolded structures. It was found that the electrostatic interaction does not correspond to the preference order of the two structures. The van der Waals and excluded-volume components were shown, on the other hand, to provide the right order of preference at probabilities of almost unity, and it is argued that a useful modeling of protein folding is possible on the basis of the excluded-volume effect.
Transition Pathway and Its Free-Energy Profile: A Protocol for Protein Folding Simulations

PubMed Central

Lee, In-Ho; Kim, Seung-Yeon; Lee, Jooyoung

2013-01-01

We propose a protocol that provides a systematic definition of reaction coordinate and related free-energy profile as the function of temperature for the protein-folding simulation. First, using action-derived molecular dynamics (ADMD), we investigate the dynamic folding pathway model of a protein between a fixed extended conformation and a compact conformation. We choose the pathway model to be the reaction coordinate, and the folding and unfolding processes are characterized by the ADMD step index, in contrast to the common a priori reaction coordinate as used in conventional studies. Second, we calculate free-energy profile as the function of temperature, by employing the replica-exchange molecular dynamics (REMD) method. The current method provides efficient exploration of conformational space and proper characterization of protein folding/unfolding dynamics from/to an arbitrary extended conformation. We demonstrate that combination of the two simulation methods, ADMD and REMD, provides understanding on molecular conformational changes in proteins. The protocol is tested on a small protein, penta-peptide of met-enkephalin. For the neuropeptide met-enkephalin system, folded, extended, and intermediate sates are well-defined through the free-energy profile over the reaction coordinate. Results are consistent with those in the literature. PMID:23917881
Effect of temperature on the conformation of natively unfolded protein 4E-BP1 in aqueous and mixed solutions containing trifluoroethanol and hexafluoroisopropanol.

PubMed

Hackl, Ellen V

2015-02-01

Natively unfolded (intrinsically disordered) proteins have attracted growing attention due to their high abundance in nature, involvement in various signalling and regulatory pathways and direct association with many diseases. In the present work the combined effect of temperature and alcohols, trifluoroethanol (TFE) and hexafluoroisopropanol (HFIP), on the natively unfolded 4E-BP1 protein was studied to elucidate the balance between temperature-induced folding and unfolding in intrinsically disordered proteins. It was shown that elevated temperatures induce reversible partial folding of 4E-BP1 both in buffer and in the mixed solutions containing denaturants. In the mixed solutions containing TFE (HFIP) 4E-BP1 adopts a partially folded helical conformation. As the temperature increases, the initial temperature-induced protein folding is replaced by irreversible unfolding/melting only after a certain level of the protein helicity has been reached. Onset unfolding temperature decreases with TFE (HFIP) concentration in solution. It was shown that an increase in the temperature induces two divergent processes in a natively unfolded protein--hydrophobicity-driven folding and unfolding. Balance between these two processes determines thermal behaviour of a protein. The correlation between heat-induced protein unfolding and the amount of helical content in a protein is revealed. Heat-induced secondary structure formation can be a valuable test to characterise minor changes in the conformations of natively unfolded proteins as a result of site-directed mutagenesis. Mutants with an increased propensity to fold into a structured form reveal different temperature behaviour.
Experimental support for the foldability-function tradeoff hypothesis: segregation of the folding nucleus and functional regions in fibroblast growth factor-1.

PubMed

Longo, Liam; Lee, Jihun; Blaber, Michael

2012-12-01

The acquisition of function is often associated with destabilizing mutations, giving rise to the stability-function tradeoff hypothesis. To test whether function is also accommodated at the expense of foldability, fibroblast growth factor-1 (FGF-1) was subjected to a comprehensive φ-value analysis at each of the 11 turn regions. FGF-1, a β-trefoil fold, represents an excellent model system with which to evaluate the influence of function on foldability: because of its threefold symmetric structure, analysis of FGF-1 allows for direct comparisons between symmetry-related regions of the protein that are associated with function to those that are not; thus, a structural basis for regions of foldability can potentially be identified. The resulting φ-value distribution of FGF-1 is highly polarized, with the majority of positions described as either folded-like or denatured-like in the folding transition state. Regions important for folding are shown to be asymmetrically distributed within the protein architecture; furthermore, regions associated with function (i.e., heparin-binding affinity and receptor-binding affinity) are localized to regions of the protein that fold after barrier crossing (late in the folding pathway). These results provide experimental support for the foldability-function tradeoff hypothesis in the evolution of FGF-1. Notably, the results identify the potential for folding redundancy in symmetric protein architecture with important implications for protein evolution and design. Copyright © 2012 The Protein Society.
Water promotes the sealing of nanoscale packing defects in folding proteins.

PubMed

Fernández, Ariel

2014-05-21

A net dipole moment is shown to arise from a non-Debye component of water polarization created by nanoscale packing defects on the protein surface. Accordingly, the protein electrostatic field exerts a torque on the induced dipole, locally impeding the nucleation of ice at the protein-water interface. We evaluate the solvent orientation steering (SOS) as the reversible work needed to align the induced dipoles with the Debye electrostatic field and computed the SOS for the variable interface of a folding protein. The minimization of the SOS is shown to drive protein folding as evidenced by the entrainment of the total free energy by the SOS energy along trajectories that approach a Debye limit state where no torque arises. This result suggests that the minimization of anomalous water polarization at the interface promotes the sealing of packing defects, thereby maintaining structural integrity and committing the protein chain to fold.
Enhanced conformational sampling via novel variable transformations and very large time-step molecular dynamics

NASA Astrophysics Data System (ADS)

Tuckerman, Mark

2006-03-01

One of the computational grand challenge problems is to develop methodology capable of sampling conformational equilibria in systems with rough energy landscapes. If met, many important problems, most notably protein folding, could be significantly impacted. In this talk, two new approaches for addressing this problem will be presented. First, it will be shown how molecular dynamics can be combined with a novel variable transformation designed to warp configuration space in such a way that barriers are reduced and attractive basins stretched. This method rigorously preserves equilibrium properties while leading to very large enhancements in sampling efficiency. Extensions of this approach to the calculation/exploration of free energy surfaces will be discussed. Next, a new very large time-step molecular dynamics method will be introduced that overcomes the resonances which plague many molecular dynamics algorithms. The performance of the methods is demonstrated on a variety of systems including liquid water, long polymer chains simple protein models, and oligopeptides.
Poison Domains Block Transit of Translocated Substrates via the Legionella pneumophila Icm/Dot System

PubMed Central

Amyot, Whitney M.; deJesus, Dennise

2013-01-01

Legionella pneumophila uses the Icm/Dot type 4B secretion system (T4BSS) to deliver translocated protein substrates to the host cell, promoting replication vacuole formation. The conformational state of the translocated substrates within the bacterial cell is unknown, so we sought to determine if folded substrates could be translocated via this system. Fusions of L. pneumophila Icm/Dot-translocated substrates (IDTS) to dihydrofolate reductase (DHFR) or ubiquitin (Ub), small proteins known to fold rapidly, resulted in proteins with low translocation efficiencies. The folded moieties did not cause increased aggregation of the IDTS and did not impede interaction with the adaptor protein complex IcmS/IcmW, which is thought to form a soluble complex that promotes translocation. The translocation defect was alleviated with a Ub moiety harboring mutations known to destabilize its structure, indicating that unfolded proteins are preferred substrates. Real-time analysis of translocation, following movement during the first 30 min after bacterial contact with host cells, revealed that the folded moiety caused a kinetic defect in IDTS translocation. Expression of an IDTS fused to a folded moiety interfered with the translocation of other IDTS, consistent with it causing a blockage of the translocation channel. Furthermore, the folded protein fusions also interfered with intracellular growth, consistent with inefficient or impaired translocation of proteins critical for L. pneumophila intracellular growth. These studies indicate that substrates of the Icm/Dot T4SS are translocated to the host cytosol in an unfolded conformation and that folded proteins are stalled within the translocation channel, impairing the function of the secretion system. PMID:23798536
[L-arginine metabolism enzyme activities in rat liver subcellular fractions under condition of protein deprivation].

PubMed

Kopyl'chuk, G P; Buchkovskaia, I M

2014-01-01

The features of arginase and NO-synthase pathways of arginine's metabolism have been studied in rat liver subcellular fractions under condition of protein deprivation. During the experimental period (28 days) albino male rats were kept on semi synthetic casein diet AIN-93. The protein deprivation conditions were designed as total absence of protein in the diet and consumption of the diet partially deprived with 1/2 of the casein amount compared to in the regular diet. Daily diet consumption was regulated according to the pair feeding approach. It has been shown that the changes of enzyme activities, involved in L-arginine metabolism, were characterized by 1.4-1.7 fold decrease in arginase activity, accompanied with unchanged NO-synthase activity in cytosol. In mitochondrial fraction the unchanged arginase activity was accompanied by 3-5 fold increase of NO-synthase activity. At the terminal stages of the experiment the monodirectional dynamics in the studied activities have been observed in the mitochondrial and cytosolfractions in both experimental groups. In the studied subcellular fractions arginase activity decreased (2.4-2.7 fold with no protein in the diet and 1.5 fold with partly supplied protein) and was accompanied by NO-synthase activity increase by 3.8 fold in cytosole fraction, by 7.2 fold in mitochondrial fraction in the group with no protein in the diet and by 2.2 and 3.5 fold in the group partialy supplied with protein respectively. The observed tendency is presumably caused by the switch of L-arginine metabolism from arginase into oxidizing NO-synthase parthway.
Modulation of the multistate folding of designed TPR proteins through intrinsic and extrinsic factors

PubMed Central

Phillips, J J; Javadi, Y; Millership, C; Main, E R G

2012-01-01

Tetratricopeptide repeats (TPRs) are a class of all alpha-helical repeat proteins that are comprised of 34-aa helix-turn-helix motifs. These stack together to form nonglobular structures that are stabilized by short-range interactions from residues close in primary sequence. Unlike globular proteins, they have few, if any, long-range nonlocal stabilizing interactions. Several studies on designed TPR proteins have shown that this modular structure is reflected in their folding, that is, modular multistate folding is observed as opposed to two-state folding. Here we show that TPR multistate folding can be suppressed to approximate two-state folding through modulation of intrinsic stability or extrinsic environmental variables. This modulation was investigated by comparing the thermodynamic unfolding under differing buffer regimes of two distinct series of consensus-designed TPR proteins, which possess different intrinsic stabilities. A total of nine proteins of differing sizes and differing consensus TPR motifs were each thermally and chemically denatured and their unfolding monitored using differential scanning calorimetry (DSC) and CD/fluorescence, respectively. Analyses of both the DSC and chemical denaturation data show that reducing the total stability of each protein and repeat units leads to observable two-state unfolding. These data highlight the intimate link between global and intrinsic repeat stability that governs whether folding proceeds by an observably two-state mechanism, or whether partial unfolding yields stable intermediate structures which retain sufficient stability to be populated at equilibrium. PMID:22170589
Direct folding simulation of helical proteins using an effective polarizable bond force field.

PubMed

Duan, Lili; Zhu, Tong; Ji, Changge; Zhang, Qinggang; Zhang, John Z H

2017-06-14

We report a direct folding study of seven helical proteins (, Trpcage, , C34, N36, , ) ranging from 17 to 53 amino acids through standard molecular dynamics simulations using a recently developed polarizable force field-Effective Polarizable Bond (EPB) method. The backbone RMSDs, radius of gyrations, native contacts and native helix content are in good agreement with the experimental results. Cluster analysis has also verified that these folded structures with the highest population are in good agreement with their corresponding native structures for these proteins. In addition, the free energy landscape of seven proteins in the two dimensional space comprised of RMSD and radius of gyration proved that these folded structures are indeed of the lowest energy conformations. However, when the corresponding simulations were performed using the standard (nonpolarizable) AMBER force fields, no stable folded structures were observed for these proteins. Comparison of the simulation results based on a polarizable EPB force field and a nonpolarizable AMBER force field clearly demonstrates the importance of polarization in the folding of stable helical structures.
Contribution of Charged Groups to the Enthalpic Stabilization of the Folded States of Globular Proteins

PubMed Central

Dadarlat, Voichita M.; Post, Carol Beth

2016-01-01

In this paper we use the results from all atom MD simulations of proteins and peptides to assess individual contribution of charged atomic groups to the enthalpic stability of the native state of globular proteins and investigate how the distribution of charged atomic groups in terms of solvent accessibility relates to protein enthalpic stability. The contributions of charged groups is calculated using a comparison of nonbonded interaction energy terms from equilibrium simulations of charged amino acid dipeptides in water (the “unfolded state”) and charged amino acids in globular proteins (the “folded state”). Contrary to expectation, the analysis shows that many buried, charged atomic groups contribute favorably to protein enthalpic stability. The strongest enthalpic contributions favoring the folded state come from the carboxylate (COO−) groups of either Glu or Asp. The contributions from Arg guanidinium groups are generally somewhat stabilizing, while NH3+ groups from Lys contribute little toward stabilizing the folded state. The average enthalpic gain due to the transfer of a methyl group in an apolar amino acid from solution to the protein interior is described for comparison. Notably, charged groups that are less exposed to solvent contribute more favorably to protein native-state enthalpic stability than charged groups that are solvent exposed. While solvent reorganization/release has favorable contributions to folding for all charged atomic groups, the variation in folded state stability among proteins comes mainly from the change in the nonbonded interaction energy of charged groups between the unfolded and folded states. A key outcome is that the calculated enthalpic stabilization is found to be inversely proportional to the excess charge density on the surface, in support of an hypothesis proposed previously. PMID:18303881
Problem Solving through Paper Folding

ERIC Educational Resources Information Center

Wares, Arsalan

2014-01-01

The purpose of this article is to describe a couple of challenging mathematical problems that involve paper folding. These problem-solving tasks can be used to foster geometric and algebraic thinking among students. The context of paper folding makes some of the abstract mathematical ideas involved relatively concrete. When implemented…

Chaperonin-based biolayer interferometry to assess the kinetic stability of metastable, aggregation-prone proteins

PubMed Central

Lea, Wendy A.; Naik, Subhashchandra; Chaudhri, Tapan; Machen, Alexandra J.; O’Neil, Pierce T.; McGinn-Straub, Wesley; Tischer, Alexander; Auton, Matthew T.; Burns, Joshua R.; Baldwin, Michael R.; Khar, Karen R.; Karanicolas, John; Fisher, Mark T.

2017-01-01

Stabilizing the folded state of metastable and/or aggregation-prone proteins through exogenous ligand binding is an appealing strategy to decrease disease pathologies brought on by protein folding defects or deleterious kinetic transitions. Current methods of examining ligand binding to these marginally stable native states are limited, because protein aggregation typically interferes with analysis. Here, we describe a rapid method for assessing the kinetic stability of folded proteins and monitoring the effects of ligand stabilization for both intrinsically stable proteins (monomers, oligomers, multi-domain) and metastable proteins (e.g. low Tm) that uses a new GroEL chaperonin-based biolayer interferometry (BLI) denaturant-pulse platform. A kinetically controlled denaturation isotherm is generated by exposing a target protein immobilized on a BLI biosensor to increasing denaturant concentrations (urea or GnHCl) in a pulsatile manner to induce partial or complete unfolding of the attached protein population. Following the rapid removal of the denaturant, the extent of hydrophobic unfolded/partially folded species that remain is detected by increased GroEL binding. Since this kinetic denaturant pulse is brief, the amplitude of the GroEL binding to the immobilized protein depends on the duration of exposure to denaturant, the concentration of denaturant, wash times, and the underlying protein unfolding/refolding kinetics; fixing all other parameters and plotting GroEL binding amplitude versus denaturant pulse concentration results in a kinetically controlled denaturation isotherm. When folding osmolytes or stabilizing ligands are added to the immobilized target proteins before and during the denaturant pulse, the diminished population of unfolded/partially folded protein is manifested by a decreased GroEL binding and/or a marked shift in these kinetically controlled denaturation profiles to higher denaturant concentrations. This particular platform approach can be used to identify small molecules/solution conditions that can stabilize or destabilize thermally stable proteins, multi-domain proteins, oligomeric proteins, and most importantly, aggregation prone metastable proteins. PMID:27505032
Lattice model simulation of interchain protein interactions and the folding dynamics and dimerization of the GCN4 Leucine zipper

NASA Astrophysics Data System (ADS)

Liu, Yanxin; Chapagain, Prem P.; Parra, Jose L.; Gerstman, Bernard S.

2008-01-01

The highest level in the hierarchy of protein structure and folding is the formation of protein complexes through protein-protein interactions. We have made modifications to a well established computer lattice model to expand its applicability to two-protein dimerization and aggregation. Based on Brownian dynamics, we implement translation and rotation moves of two peptide chains relative to each other, in addition to the intrachain motions already present in the model. We use this two-chain model to study the folding dynamics of the yeast transcription factor GCN4 leucine zipper. The calculated heat capacity curves agree well with experimental measurements. Free energy landscapes and median first passage times for the folding process are calculated and elucidate experimentally measured characteristics such as the multistate nature of the dimerization process.
Minimal model for the secondary structures and conformational conversions in proteins

NASA Astrophysics Data System (ADS)

Imamura, Hideo

Better understanding of protein folding process can provide physical insights on the function of proteins and makes it possible to benefit from genetic information accumulated so far. Protein folding process normally takes place in less than seconds but even seconds are beyond reach of current computational power for simulations on a system of all-atom detail. Hence, to model and explore protein folding process it is crucial to construct a proper model that can adequately describe the physical process and mechanism for the relevant time scale. We discuss the reduced off-lattice model that can express _-helix and ?-hairpin conformations defined solely by a given sequence in order to investigate a protein folding mechanism of conformations such as a ?-hairpin and also to investigate conformational conversions in proteins. The first two chapters introduce and review essential concepts in protein folding modelling physical interaction in proteins, various simple models, and also review computational methods, in particular, the Metropolis Monte Carlo method, its dynamic interpretation and thermodynamic Monte Carlo algorithms. Chapter 3 describes the minimalist model that represents both _-helix and ?-sheet conformations using simple potentials. The native conformation can be specified by the sequence without particular conformational biases to a reference state. In Chapter 4, the model is used to investigate the folding mechanism of ?-hairpins exhaustively using the dynamic Monte Carlo and a thermodynamic Monte Carlo method an effcient combination of the multicanonical Monte Carlo and the weighted histogram analysis method. We show that the major folding pathways and folding rate depend on the location of a hydrophobic. The conformational conversions between _-helix and ?-sheet conformations are examined in Chapter 5 and 6. First, the conformational conversion due to mutation in a non-hydrophobic system and then the conformational conversion due to mutation with a hydrophobic pair at a different position at various temperatures are examined.
Photocrosslinking approaches to interactome mapping

PubMed Central

Pham, Nam D.; Parker, Randy B.; Kohler, Jennifer J.

2012-01-01

Photocrosslinking approaches can be used to map interactome networks within the context of living cells. Photocrosslinking methods rely on use of metabolic engineering or genetic code expansion to incorporate photocrosslinking analogs of amino acids or sugars into cellular biomolecules. Immunological and mass spectrometry techniques are used to analyze crosslinked complexes, thereby defining specific interactomes. Because photocrosslinking can be conducted in native, cellular settings, it can be used to define context-dependent interactions. Photocrosslinking methods are also ideally suited for determining interactome dynamics, mapping interaction interfaces, and identifying transient interactions in which intrinsically disordered proteins and glycoproteins engage. Here we discuss the application of cell-based photocrosslinking to the study of specific problems in immune cell signaling, transcription, membrane protein dynamics, nucleocytoplasmic transport, and chaperone-assisted protein folding. PMID:23149092
DNA vaccine encoding myristoylated membrane protein (MMP) of rock bream iridovirus (RBIV) induces protective immunity in rock bream (Oplegnathus fasciatus).

PubMed

Jung, Myung-Hwa; Nikapitiya, Chamilani; Jung, Sung-Ju

2018-02-01

Rock bream iridovirus (RBIV) causes severe mass mortalities in rock bream (Oplegnathus fasciatus) in Korea. In this study, we investigated the potential of viral membrane protein to induce antiviral status protecting rock bream against RBIV infection. We found that fish administered with ORF008L (myristoylated membrane protein, MMP) vaccine exhibited significantly higher levels of survival compared to ORF007L (major capsid protein, MCP). Moreover, ORF008L-based DNA vaccinated fish showed significant protection at 4 and 8 weeks post vaccination (wpv) than non-vaccinated fish after infected with RBIV (6.7 × 10 5 ) at 23 °C, with relative percent survival (RPS) of 73.36% and 46.72%, respectively. All of the survivors from the first RBIV infection were strongly protected (100% RPS) from re-infected with RBIV (1.1 × 10 7 ) at 100 dpi. In addition, the MMP (ORF008L)-based DNA vaccine significantly induced the gene expression of TLR3 (14.2-fold), MyD88 (11.6-fold), Mx (84.7-fold), ISG15 (8.7-fold), PKR (25.6-fold), MHC class I (13.3-fold), Fas (6.7-fold), Fas ligand (6.7-fold), caspase9 (17.0-fold) and caspase3 (15.3-fold) at 7 days post vaccination in the muscle (vaccine injection site). Our results showed the induction of immune responses and suggest the possibility of developing preventive measures against RBIV using myristoylated membrane protein-based DNA vaccine. Copyright © 2018 Elsevier Ltd. All rights reserved.
Discrete Haar transform and protein structure.

PubMed

Morosetti, S

1997-12-01

The discrete Haar transform of the sequence of the backbone dihedral angles (phi and psi) was performed over a set of X-ray protein structures of high resolution from the Brookhaven Protein Data Bank. Afterwards, the new dihedral angles were calculated by the inverse transform, using a growing number of Haar functions, from the lower to the higher degree. New structures were obtained using these dihedral angles, with standard values for bond lengths and angles, and with omega = 0 degree. The reconstructed structures were compared with the experimental ones, and analyzed by visual inspection and statistical analysis. When half of the Haar coefficients were used, all the reconstructed structures were not yet collapsed to a tertiary folding, but they showed yet realized most of the secondary motifs. These results indicate a substantial separation of structural information in the space of Haar transform, with the secondary structural information mainly present in the Haar coefficients of lower degrees, and the tertiary one present in the higher degree coefficients. Because of this separation, the representation of the folded structures in the space of Haar transform seems a promising candidate to encompass the problem of premature convergence in genetic algorithms.
Augmenting the Efficacy of Immunotoxins and Other Targeted Protein Toxins by Endosomal Escape Enhancers.

PubMed

Fuchs, Hendrik; Weng, Alexander; Gilabert-Oriol, Roger

2016-07-01

The toxic moiety of almost all protein-based targeted toxins must enter the cytosol of the target cell to mediate its fatal effect. Although more than 500 targeted toxins have been investigated in the past decades, no antibody-targeted protein toxin has been approved for tumor therapeutic applications by the authorities to date. Missing efficacy can be attributed in many cases to insufficient endosomal escape and therefore subsequent lysosomal degradation of the endocytosed toxins. To overcome this drawback, many strategies have been described to weaken the membrane integrity of endosomes. This comprises the use of lysosomotropic amines, carboxylic ionophores, calcium channel antagonists, various cell-penetrating peptides of viral, bacterial, plant, animal, human and synthetic origin, other organic molecules and light-induced techniques. Although the efficacy of the targeted toxins was typically augmented in cell culture hundred or thousand fold, in exceptional cases more than million fold, the combination of several substances harbors new problems including additional side effects, loss of target specificity, difficulties to determine the therapeutic window and cell type-dependent variations. This review critically scrutinizes the chances and challenges of endosomal escape enhancers and their potential role in future developments.
Augmenting the Efficacy of Immunotoxins and Other Targeted Protein Toxins by Endosomal Escape Enhancers

PubMed Central

Fuchs, Hendrik; Weng, Alexander; Gilabert-Oriol, Roger

2016-01-01

The toxic moiety of almost all protein-based targeted toxins must enter the cytosol of the target cell to mediate its fatal effect. Although more than 500 targeted toxins have been investigated in the past decades, no antibody-targeted protein toxin has been approved for tumor therapeutic applications by the authorities to date. Missing efficacy can be attributed in many cases to insufficient endosomal escape and therefore subsequent lysosomal degradation of the endocytosed toxins. To overcome this drawback, many strategies have been described to weaken the membrane integrity of endosomes. This comprises the use of lysosomotropic amines, carboxylic ionophores, calcium channel antagonists, various cell-penetrating peptides of viral, bacterial, plant, animal, human and synthetic origin, other organic molecules and light-induced techniques. Although the efficacy of the targeted toxins was typically augmented in cell culture hundred or thousand fold, in exceptional cases more than million fold, the combination of several substances harbors new problems including additional side effects, loss of target specificity, difficulties to determine the therapeutic window and cell type-dependent variations. This review critically scrutinizes the chances and challenges of endosomal escape enhancers and their potential role in future developments. PMID:27376327
A new method to improve network topological similarity search: applied to fold recognition

PubMed Central

Lhota, John; Hauptman, Ruth; Hart, Thomas; Ng, Clara; Xie, Lei

2015-01-01

Motivation: Similarity search is the foundation of bioinformatics. It plays a key role in establishing structural, functional and evolutionary relationships between biological sequences. Although the power of the similarity search has increased steadily in recent years, a high percentage of sequences remain uncharacterized in the protein universe. Thus, new similarity search strategies are needed to efficiently and reliably infer the structure and function of new sequences. The existing paradigm for studying protein sequence, structure, function and evolution has been established based on the assumption that the protein universe is discrete and hierarchical. Cumulative evidence suggests that the protein universe is continuous. As a result, conventional sequence homology search methods may be not able to detect novel structural, functional and evolutionary relationships between proteins from weak and noisy sequence signals. To overcome the limitations in existing similarity search methods, we propose a new algorithmic framework—Enrichment of Network Topological Similarity (ENTS)—to improve the performance of large scale similarity searches in bioinformatics. Results: We apply ENTS to a challenging unsolved problem: protein fold recognition. Our rigorous benchmark studies demonstrate that ENTS considerably outperforms state-of-the-art methods. As the concept of ENTS can be applied to any similarity metric, it may provide a general framework for similarity search on any set of biological entities, given their representation as a network. Availability and implementation: Source code freely available upon request Contact: lxie@iscb.org PMID:25717198
An Evolution-Based Approach to De Novo Protein Design and Case Study on Mycobacterium tuberculosis

PubMed Central

Brender, Jeffrey R.; Czajka, Jeff; Marsh, David; Gray, Felicia; Cierpicki, Tomasz; Zhang, Yang

2013-01-01

Computational protein design is a reverse procedure of protein folding and structure prediction, where constructing structures from evolutionarily related proteins has been demonstrated to be the most reliable method for protein 3-dimensional structure prediction. Following this spirit, we developed a novel method to design new protein sequences based on evolutionarily related protein families. For a given target structure, a set of proteins having similar fold are identified from the PDB library by structural alignments. A structural profile is then constructed from the protein templates and used to guide the conformational search of amino acid sequence space, where physicochemical packing is accommodated by single-sequence based solvation, torsion angle, and secondary structure predictions. The method was tested on a computational folding experiment based on a large set of 87 protein structures covering different fold classes, which showed that the evolution-based design significantly enhances the foldability and biological functionality of the designed sequences compared to the traditional physics-based force field methods. Without using homologous proteins, the designed sequences can be folded with an average root-mean-square-deviation of 2.1 Å to the target. As a case study, the method is extended to redesign all 243 structurally resolved proteins in the pathogenic bacteria Mycobacterium tuberculosis, which is the second leading cause of death from infectious disease. On a smaller scale, five sequences were randomly selected from the design pool and subjected to experimental validation. The results showed that all the designed proteins are soluble with distinct secondary structure and three have well ordered tertiary structure, as demonstrated by circular dichroism and NMR spectroscopy. Together, these results demonstrate a new avenue in computational protein design that uses knowledge of evolutionary conservation from protein structural families to engineer new protein molecules of improved fold stability and biological functionality. PMID:24204234
Strategies for achieving high-level expression of genes in Escherichia coli.

PubMed Central

Makrides, S C

1996-01-01

Progress in our understanding of several biological processes promises to broaden the usefulness of Escherichia coli as a tool for gene expression. There is an expanding choice of tightly regulated prokaryotic promoters suitable for achieving high-level gene expression. New host strains facilitate the formation of disulfide bonds in the reducing environment of the cytoplasm and offer higher protein yields by minimizing proteolytic degradation. Insights into the process of protein translocation across the bacterial membranes may eventually make it possible to achieve robust secretion of specific proteins into the culture medium. Studies involving molecular chaperones have shown that in specific cases, chaperones can be very effective for improved protein folding, solubility, and membrane transport. Negative results derived from such studies are also instructive in formulating different strategies. The remarkable increase in the availability of fusion partners offers a wide range of tools for improved protein folding, solubility, protection from proteases, yield, and secretion into the culture medium, as well as for detection and purification of recombinant proteins. Codon usage is known to present a potential impediment to high-level gene expression in E. coli. Although we still do not understand all the rules governing this phenomenon, it is apparent that "rare" codons, depending on their frequency and context, can have an adverse effect on protein levels. Usually, this problem can be alleviated by modification of the relevant codons or by coexpression of the cognate tRNA genes. Finally, the elucidation of specific determinants of protein degradation, a plethora of protease-deficient host strains, and methods to stabilize proteins afford new strategies to minimize proteolytic susceptibility of recombinant proteins in E. coli. PMID:8840785
When a domain isn’t a domain, and why it’s important to properly filter proteins in databases

PubMed Central

Towse, Clare-Louise; Daggett, Valerie

2013-01-01

Summary Membership in a protein domain database does not a domain make; a feature we realized when generating a consensus view of protein fold space with our Consensus Domain Dictionary (CDD). This dictionary was used to select representative structures for characterization of the protein dynameome: the Dynameomics initiative. Through this endeavor we rejected a surprising 40% of the 1695 folds in the CDD as being non-autonomous folding units. Although some of this was due to the challenges of grouping similar fold topologies, the dissonance between the cataloguing and structural qualification of protein domains remains surprising. Another potential factor is previously overlooked intrinsic disorder; predicted estimates suggest 40% of proteins to have either local or global disorder. One thing is clear, filtering a structural database and ensuring a consistent definition for protein domains is crucial, and caution is prescribed when generalizations of globular domains are drawn from unfiltered protein domain datasets. PMID:23108912
PyFolding: Open-Source Graphing, Simulation, and Analysis of the Biophysical Properties of Proteins.

PubMed

Lowe, Alan R; Perez-Riba, Albert; Itzhaki, Laura S; Main, Ewan R G

2018-02-06

For many years, curve-fitting software has been heavily utilized to fit simple models to various types of biophysical data. Although such software packages are easy to use for simple functions, they are often expensive and present substantial impediments to applying more complex models or for the analysis of large data sets. One field that is reliant on such data analysis is the thermodynamics and kinetics of protein folding. Over the past decade, increasingly sophisticated analytical models have been generated, but without simple tools to enable routine analysis. Consequently, users have needed to generate their own tools or otherwise find willing collaborators. Here we present PyFolding, a free, open-source, and extensible Python framework for graphing, analysis, and simulation of the biophysical properties of proteins. To demonstrate the utility of PyFolding, we have used it to analyze and model experimental protein folding and thermodynamic data. Examples include: 1) multiphase kinetic folding fitted to linked equations, 2) global fitting of multiple data sets, and 3) analysis of repeat protein thermodynamics with Ising model variants. Moreover, we demonstrate how PyFolding is easily extensible to novel functionality beyond applications in protein folding via the addition of new models. Example scripts to perform these and other operations are supplied with the software, and we encourage users to contribute notebooks and models to create a community resource. Finally, we show that PyFolding can be used in conjunction with Jupyter notebooks as an easy way to share methods and analysis for publication and among research teams. Copyright © 2017 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Physics of protein folding

NASA Astrophysics Data System (ADS)

Finkelstein, A. V.; Galzitskaya, O. V.

2004-04-01

Protein physics is grounded on three fundamental experimental facts: protein, this long heteropolymer, has a well defined compact three-dimensional structure; this structure can spontaneously arise from the unfolded protein chain in appropriate environment; and this structure is separated from the unfolded state of the chain by the “all-or-none” phase transition, which ensures robustness of protein structure and therefore of its action. The aim of this review is to consider modern understanding of physical principles of self-organization of protein structures and to overview such important features of this process, as finding out the unique protein structure among zillions alternatives, nucleation of the folding process and metastable folding intermediates. Towards this end we will consider the main experimental facts and simple, mostly phenomenological theoretical models. We will concentrate on relatively small (single-domain) water-soluble globular proteins (whose structure and especially folding are much better studied and understood than those of large or membrane and fibrous proteins) and consider kinetic and structural aspects of transition of initially unfolded protein chains into their final solid (“native”) 3D structures.
Proteome-level interplay between folding and aggregation propensities of proteins.

PubMed

Tartaglia, Gian Gaetano; Vendruscolo, Michele

2010-10-08

With the advent of proteomics, there is an increasing need of tools for predicting the properties of large numbers of proteins by using the information provided by their amino acid sequences, even in the absence of the knowledge of their structures. One of the most important types of predictions concerns whether proteins will fold or aggregate. Here, we study the competition between these two processes by analyzing the relationship between the folding and aggregation propensity profiles for the human and Escherichia coli proteomes. These profiles are calculated, respectively, using the CamFold method, which we introduce in this work, and the Zyggregator method. Our results indicate that the kinetic behavior of proteins is, to a large extent, determined by the interplay between regions of low folding and high aggregation propensities. Copyright © 2010. Published by Elsevier Ltd.
Protein vivisection reveals elusive intermediates in folding

PubMed Central

Zheng, Zhongzhou; Sosnick, Tobin R.

2010-01-01

Although most folding intermediates escape detection, their characterization is crucial to the elucidation of folding mechanisms. Here we outline a powerful strategy to populate partially unfolded intermediates: A buried aliphatic residue is substituted with a charged residue (e.g., Leu→Glu−) to destabilize and unfold a specific region of the protein. We apply this strategy to Ubiquitin, reversibly trapping a folding intermediate in which the β5 strand is unfolded. The intermediate refolds to a native-like structure upon charge neutralization under mildly acidic conditions. Characterization of the trapped intermediate using NMR and hydrogen exchange methods identifies a second folding intermediate and reveals the order and free energies of the two major folding events on the native side of the rate-limiting step. This general strategy may be combined with other methods and have broad applications in the study of protein folding and other reactions that require trapping of high energy states. PMID:20144618
Characterization of the amino acid contribution to the folding degree of proteins.

PubMed

Estrada, Ernesto

2004-03-01

The folding degree index (Estrada, Bioinformatics 2002;18:697-704) is extended to account for the contribution of amino acids to folding. First, the mathematical formalism for extending the folding degree index is presented. Then, the amino acid contributions to folding degree of several proteins are used to analyze its relation to secondary structure. The possibilities of using these contributions in helping or checking the assignation of secondary structure to amino acids are also introduced. The influence of external factors to the amino acids contribution to folding degree is studied through the temperature effect on ribonuclease A. Finally, the analysis of 3D protein similarity through the use of amino acid contributions to folding degree is studied by selecting a series of lysozymes. These results are compared to that obtained by sequence alignment (2D similarity) and 3D superposition of the structures, showing the uniqueness of the current approach. Copyright 2004 Wiley-Liss, Inc.
Protein folding: complex potential for the driving force in a two-dimensional space of collective variables.

PubMed

Chekmarev, Sergei F

2013-10-14

Using the Helmholtz decomposition of the vector field of folding fluxes in a two-dimensional space of collective variables, a potential of the driving force for protein folding is introduced. The potential has two components. One component is responsible for the source and sink of the folding flows, which represent respectively, the unfolded states and the native state of the protein, and the other, which accounts for the flow vorticity inherently generated at the periphery of the flow field, is responsible for the canalization of the flow between the source and sink. The theoretical consideration is illustrated by calculations for a model β-hairpin protein.
Using NMR chemical shifts to calculate the propensity for structural order and disorder in proteins.

PubMed

Tamiola, Kamil; Mulder, Frans A A

2012-10-01

NMR spectroscopy offers the unique possibility to relate the structural propensities of disordered proteins and loop segments of folded peptides to biological function and aggregation behaviour. Backbone chemical shifts are ideally suited for this task, provided that appropriate reference data are available and idiosyncratic sensitivity of backbone chemical shifts to structural information is treated in a sensible manner. In the present paper, we describe methods to detect structural protein changes from chemical shifts, and present an online tool [ncSPC (neighbour-corrected Structural Propensity Calculator)], which unites aspects of several current approaches. Examples of structural propensity calculations are given for two well-characterized systems, namely the binding of α-synuclein to micelles and light activation of photoactive yellow protein. These examples spotlight the great power of NMR chemical shift analysis for the quantitative assessment of protein disorder at the atomic level, and further our understanding of biologically important problems.
On the role of conformational geometry in protein folding

NASA Astrophysics Data System (ADS)

Du, Rose; Pande, Vijay S.; Grosberg, Alexander Yu.; Tanaka, Toyoichi; Shakhnovich, Eugene

1999-12-01

Using a lattice model of protein folding, we find that once certain native contacts have been formed, folding to the native state is inevitable, even if the only energetic bias in the system is nonspecific, homopolymeric attraction to a collapsed state. These conformations can be quite geometrically unrelated to the native state (with as low as only 53% of the native contacts formed). We demonstrate these results by examining the Monte Carlo kinetics of both heteropolymers under Go interactions and homopolymers, with the folding of both types of polymers to the native state of the heteropolymer. Although we only consider a 48-mer lattice model, our findings shed light on the effects of geometrical restrictions, including those of chain connectivity and steric excluded volume, on protein folding. These effects play a complementary role to that of the rugged energy landscape. In addition, the results of this work can aid in the interpretation of experiments and computer simulations of protein folding performed at elevated temperatures.

Integration of QUARK and I-TASSER for Ab Initio Protein Structure Prediction in CASP11.

PubMed

Zhang, Wenxuan; Yang, Jianyi; He, Baoji; Walker, Sara Elizabeth; Zhang, Hongjiu; Govindarajoo, Brandon; Virtanen, Jouko; Xue, Zhidong; Shen, Hong-Bin; Zhang, Yang

2016-09-01

We tested two pipelines developed for template-free protein structure prediction in the CASP11 experiment. First, the QUARK pipeline constructs structure models by reassembling fragments of continuously distributed lengths excised from unrelated proteins. Five free-modeling (FM) targets have the model successfully constructed by QUARK with a TM-score above 0.4, including the first model of T0837-D1, which has a TM-score = 0.736 and RMSD = 2.9 Å to the native. Detailed analysis showed that the success is partly attributed to the high-resolution contact map prediction derived from fragment-based distance-profiles, which are mainly located between regular secondary structure elements and loops/turns and help guide the orientation of secondary structure assembly. In the Zhang-Server pipeline, weakly scoring threading templates are re-ordered by the structural similarity to the ab initio folding models, which are then reassembled by I-TASSER based structure assembly simulations; 60% more domains with length up to 204 residues, compared to the QUARK pipeline, were successfully modeled by the I-TASSER pipeline with a TM-score above 0.4. The robustness of the I-TASSER pipeline can stem from the composite fragment-assembly simulations that combine structures from both ab initio folding and threading template refinements. Despite the promising cases, challenges still exist in long-range beta-strand folding, domain parsing, and the uncertainty of secondary structure prediction; the latter of which was found to affect nearly all aspects of FM structure predictions, from fragment identification, target classification, structure assembly, to final model selection. Significant efforts are needed to solve these problems before real progress on FM could be made. Proteins 2016; 84(Suppl 1):76-86. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.
Mapping the energy landscape for second-stage folding of a single membrane protein

PubMed Central

Min, Duyoung; Jefferson, Robert E; Bowie, James U; Yoon, Tae-Young

2016-01-01

Membrane proteins are designed to fold and function in a lipid membrane, yet folding experiments within a native membrane environment are challenging to design. Here we show that single-molecule forced unfolding experiments can be adapted to study helical membrane protein folding under native-like bicelle conditions. Applying force using magnetic tweezers, we find that a transmembrane helix protein, Escherichia coli rhomboid protease GlpG, unfolds in a highly cooperative manner, largely unraveling as one physical unit in response to mechanical tension above 25 pN. Considerable hysteresis is observed, with refolding occurring only at forces below 5 pN. Characterizing the energy landscape reveals only modest thermodynamic stability (ΔG = 6.5 kBT) but a large unfolding barrier (21.3 kBT) that can maintain the protein in a folded state for long periods of time (t1/2 ~3.5 h). The observed energy landscape may have evolved to limit the existence of troublesome partially unfolded states and impart rigidity to the structure. PMID:26479439
Development and Application of a High Throughput Protein Unfolding Kinetic Assay

PubMed Central

Wang, Qiang; Waterhouse, Nicklas; Feyijinmi, Olusegun; Dominguez, Matthew J.; Martinez, Lisa M.; Sharp, Zoey; Service, Rachel; Bothe, Jameson R.; Stollar, Elliott J.

2016-01-01

The kinetics of folding and unfolding underlie protein stability and quantification of these rates provides important insights into the folding process. Here, we present a simple high throughput protein unfolding kinetic assay using a plate reader that is applicable to the studies of the majority of 2-state folding proteins. We validate the assay by measuring kinetic unfolding data for the SH3 (Src Homology 3) domain from Actin Binding Protein 1 (AbpSH3) and its stabilized mutants. The results of our approach are in excellent agreement with published values. We further combine our kinetic assay with a plate reader equilibrium assay, to obtain indirect estimates of folding rates and use these approaches to characterize an AbpSH3-peptide hybrid. Our high throughput protein unfolding kinetic assays allow accurate screening of libraries of mutants by providing both kinetic and equilibrium measurements and provide a means for in-depth ϕ-value analyses. PMID:26745729
Experimental investigation of protein folding and misfolding.

PubMed

Dobson, Christopher M

2004-09-01

Newly synthesised proteins need to fold, often to intricate and close-packed structures, in order to function. The underlying mechanism by which this complex process takes place both in vitro and in vivo is now becoming understood, at least in general terms, as a result of the application of a wide range of biophysical and computational methods used in combination with the techniques of biochemistry and protein engineering. It is increasingly apparent, however, that folding is not only crucial for generating biological activity, but that it is also coupled to a wide range of processes within the cell, ranging from the trafficking of proteins to specific organelles to the regulation of cell growth and differentiation. Not surprisingly, therefore, the failure of proteins to fold appropriately, or to remain correctly folded, is associated with a large number of cellular malfunctions that give rise to disease. Misfolding, and its consequences such as aggregation, can be investigated by extending the types of techniques used to study the normal folding process. Application of these techniques is enabling the development of a unified description of the interconversion and regulation of the different conformational states available to proteins in living systems. Such a description proves a generic basis for understanding the fundamental links between protein misfolding and its associated clinical disorders, such as Alzheimer's disease and Type II diabetes, and for exploring novel therapeutic strategies directed at their prevention and treatment on a rational basis.
Refolding of urea-denatured α-chymotrypsin by protein-folding liquid chromatography.

PubMed

Congyu, Ke; Wujuan, Sun; Qunzheng, Zhang; Xindu, Geng

2013-04-01

An approach for re-folding denatured proteins during proteome research by protein folding liquid chromatography (PFLC) is presented. Standard protein, α-chymotrypsin (α-Chy), was selected as a model protein and hydrophobic interaction chromatography was performed as a typical PFLC; the three different α-Chy states - urea-denatured (U state), its folded intermediates (M state) and nature state (N state) - were studied during protein folding. Based on the test by matrix-assisted laser desorption/ionization time of flight mass spectrometry and bioactivity, only one stable M state of the α-Chy was identified and then it was prepared for further investigation. The specific bioactivity of the refolded α-Chy was found to be higher than that of commercial α-Chy as the urea concentration in the sample solution ranged from 1.0 to 3.0 m; the highest specific bioactivity at urea concentration was 1.0 m, indicating the possibility for re-folding some proteins that have partially or completely lost their bioactivity, as a dilute urea solution was employed for dissolving the sample. The experiment showed that the peak height of its M state increased with increasing urea concentration, and correspondingly decreased in the amount of the refolded α-Chy. When the urea concentration reached 6.0 m, the unfolded α-Chy could not be refolded at all. Copyright © 2012 John Wiley & Sons, Ltd.
The PYRIN domain: A member of the death domain-fold superfamily

PubMed Central

Fairbrother, Wayne J.; Gordon, Nathaniel C.; Humke, Eric W.; O'Rourke, Karen M.; Starovasnik, Melissa A.; Yin, Jian-Ping; Dixit, Vishva M.

2001-01-01

PYRIN domains were identified recently as putative protein–protein interaction domains at the N-termini of several proteins thought to function in apoptotic and inflammatory signaling pathways. The ∼95 residue PYRIN domains have no statistically significant sequence homology to proteins with known three-dimensional structure. Using secondary structure prediction and potential-based fold recognition methods, however, the PYRIN domain is predicted to be a member of the six-helix bundle death domain-fold superfamily that includes death domains (DDs), death effector domains (DEDs), and caspase recruitment domains (CARDs). Members of the death domain-fold superfamily are well established mediators of protein–protein interactions found in many proteins involved in apoptosis and inflammation, indicating further that the PYRIN domains serve a similar function. An homology model of the PYRIN domain of CARD7/DEFCAP/NAC/NALP1, a member of the Apaf-1/Ced-4 family of proteins, was constructed using the three-dimensional structures of the FADD and p75 neurotrophin receptor DDs, and of the Apaf-1 and caspase-9 CARDs, as templates. Validation of the model using a variety of computational techniques indicates that the fold prediction is consistent with the sequence. Comparison of a circular dichroism spectrum of the PYRIN domain of CARD7/DEFCAP/NAC/NALP1 with spectra of several proteins known to adopt the death domain-fold provides experimental support for the structure prediction. PMID:11514682
Games that Enlist Collective Intelligence to Solve Complex Scientific Problems.

PubMed

Burnett, Stephen; Furlong, Michelle; Melvin, Paul Guy; Singiser, Richard

2016-03-01

There is great value in employing the collective problem-solving power of large groups of people. Technological advances have allowed computer games to be utilized by a diverse population to solve problems. Science games are becoming more popular and cover various areas such as sequence alignments, DNA base-pairing, and protein and RNA folding. While these tools have been developed for the general population, they can also be used effectively in the classroom to teach students about various topics. Many games also employ a social component that entices students to continue playing and thereby to continue learning. The basic functions of game play and the potential of game play as a tool in the classroom are discussed in this article.
Games that Enlist Collective Intelligence to Solve Complex Scientific Problems

PubMed Central

Burnett, Stephen; Furlong, Michelle; Melvin, Paul Guy; Singiser, Richard

2016-01-01

There is great value in employing the collective problem-solving power of large groups of people. Technological advances have allowed computer games to be utilized by a diverse population to solve problems. Science games are becoming more popular and cover various areas such as sequence alignments, DNA base-pairing, and protein and RNA folding. While these tools have been developed for the general population, they can also be used effectively in the classroom to teach students about various topics. Many games also employ a social component that entices students to continue playing and thereby to continue learning. The basic functions of game play and the potential of game play as a tool in the classroom are discussed in this article. PMID:27047610
Modulation of Folding Internal Friction by Local and Global Barrier Heights.

PubMed

Zheng, Wenwei; de Sancho, David; Best, Robert B

2016-03-17

Recent experiments have revealed an unexpected deviation from a first power dependence of protein relaxation times on solvent viscosity, an effect that has been attributed to "internal friction". One clear source of internal friction in protein dynamics is the isomerization of dihedral angles. A key outstanding question is whether the global folding barrier height influences the measured internal friction, based on the observation that the folding rates of fast-folding proteins, with smaller folding free energy barriers, tend to exhibit larger internal friction. Here, by studying two alanine-based peptides, we find that systematic variation of global folding barrier heights has little effect on the internal friction for folding rates. On the other hand, increasing local torsion angle barriers leads to increased internal friction, which is consistent with solvent memory effects being the origin of the viscosity dependence. Thus, it appears that local torsion transitions determine the viscosity dependence of the diffusion coefficient on the global coordinate and, in turn, internal friction effects on the folding rate.
How Does Chronic Cigarette Smoke Exposure Affect Human Skin? A Global Proteomics Study in Primary Human Keratinocytes.

PubMed

Rajagopalan, Pavithra; Nanjappa, Vishalakshi; Raja, Remya; Jain, Ankit P; Mangalaparthi, Kiran K; Sathe, Gajanan J; Babu, Niraj; Patel, Krishna; Cavusoglu, Nükhet; Soeur, Jeremie; Pandey, Akhilesh; Roy, Nita; Breton, Lionel; Chatterjee, Aditi; Misra, Namita; Gowda, Harsha

2016-11-01

Cigarette smoking has been associated with multiple negative effects on human skin. Long-term physiological effects of cigarette smoke are through chronic and not acute exposure. Molecular alterations due to chronic exposure to cigarette smoke remain unclear. Primary human skin keratinocytes chronically exposed to cigarette smoke condensate (CSC) showed a decreased wound-healing capacity with an increased expression of NRF2 and MMP9. Using quantitative proteomics, we identified 4728 proteins, of which 105 proteins were overexpressed (≥2-fold) and 41 proteins were downregulated (≤2-fold) in primary skin keratinocytes chronically exposed to CSC. We observed an alteration in the expression of several proteins involved in maintenance of epithelial barrier integrity, including keratin 80 (5.3 fold, p value 2.5 × 10 -7 ), cystatin A (3.6-fold, p value 3.2 × 10 -3 ), and periplakin (2.4-fold, p value 1.2 × 10 -8 ). Increased expression of proteins associated with skin hydration, including caspase 14 (2.2-fold, p value 4.7 × 10 -2 ) and filaggrin (3.6-fold, p value 5.4 × 10 -7 ), was also observed. In addition, we report differential expression of several proteins, including adipogenesis regulatory factor (2.5-fold, p value 1.3 × 10 -3 ) and histone H1.0 (2.5-fold, p value 6.3 × 10 -3 ) that have not been reported earlier. Bioinformatics analyses demonstrated that proteins differentially expressed in response to CSC are largely related to oxidative stress, maintenance of skin integrity, and anti-inflammatory responses. Importantly, treatment with vitamin E, a widely used antioxidant, could partially rescue adverse effects of CSC exposure in primary skin keratinocytes. The utility of antioxidant-based new dermatological formulations in delaying or preventing skin aging and oxidative damages caused by chronic cigarette smoke exposure warrants further clinical investigations and multi-omics research.
Prolonged Fasting Identifies Heat Shock Protein 10 as a Sirtuin 3 Substrate

PubMed Central

Lu, Zhongping; Chen, Yong; Aponte, Angel M.; Battaglia, Valentina; Gucek, Marjan; Sack, Michael N.

2015-01-01

Although Sirtuin 3 (SIRT3), a mitochondrially enriched deacetylase and activator of fat oxidation, is down-regulated in response to high fat feeding, the rate of fatty acid oxidation and mitochondrial protein acetylation are invariably enhanced in this dietary milieu. These paradoxical data implicate that additional acetylation modification-dependent levels of regulation may be operational under nutrient excess conditions. Because the heat shock protein (Hsp) Hsp10-Hsp60 chaperone complex mediates folding of the fatty acid oxidation enzyme medium-chain acyl-CoA dehydrogenase, we tested whether acetylation-dependent mitochondrial protein folding contributes to this regulatory discrepancy. We demonstrate that Hsp10 is a functional SIRT3 substrate and that, in response to prolonged fasting, SIRT3 levels modulate mitochondrial protein folding. Acetyl mutagenesis of Hsp10 lysine 56 alters Hsp10-Hsp60 binding, conformation, and protein folding. Consistent with Hsp10-Hsp60 regulation of fatty acid oxidation enzyme integrity, medium-chain acyl-CoA dehydrogenase activity and fat oxidation are elevated by Hsp10 acetylation. These data identify acetyl modification of Hsp10 as a nutrient-sensing regulatory node controlling mitochondrial protein folding and metabolic function. PMID:25505263
Multiple functional roles of the accessory I-domain of bacteriophage P22 coat protein revealed by NMR structure and cryoEM modeling

PubMed Central

Rizzo, Alessandro A.; Suhanovsky, Margaret M.; Baker, Matthew L.; Fraser, LaTasha C.R.; Jones, Lisa M.; Rempel, Don L.; Gross, Michael L.; Chiu, Wah; Alexandrescu, Andrei T.; Teschke, Carolyn M.

2014-01-01

SUMMARY Some capsid proteins built on the ubiquitous HK97-fold have accessory domains that impart specific functions. Bacteriophage P22 coat protein has a unique inserted I-domain. Two prior I-domain models from sub-nanometer cryoEM reconstructions differed substantially. Therefore, the NMR structure of the I-domain was determined, which also was used to improve cryoEM models of coat protein. The I-domain has an anti-parallel 6-stranded β-barrel fold, previously not observed in HK97-fold accessory domains. The D-loop, which is dynamic both in the isolated I-domain and intact monomeric coat protein, forms stabilizing salt bridges between adjacent capsomers in procapsids. A newly described S-loop is important for capsid size determination, likely through intra-subunit interactions. Ten of eighteen coat protein temperature-sensitive-folding substitutions are in the I-domain, indicating its importance in folding and stability. Several are found on a positively charged face of the β-barrel that anchors the I-domain to a negatively charged surface of the coat protein HK97-core. PMID:24836025
Multiple functional roles of the accessory I-domain of bacteriophage P22 coat protein revealed by NMR structure and CryoEM modeling.

PubMed

Rizzo, Alessandro A; Suhanovsky, Margaret M; Baker, Matthew L; Fraser, LaTasha C R; Jones, Lisa M; Rempel, Don L; Gross, Michael L; Chiu, Wah; Alexandrescu, Andrei T; Teschke, Carolyn M

2014-06-10

Some capsid proteins built on the ubiquitous HK97-fold have accessory domains imparting specific functions. Bacteriophage P22 coat protein has a unique insertion domain (I-domain). Two prior I-domain models from subnanometer cryoelectron microscopy (cryoEM) reconstructions differed substantially. Therefore, the I-domain's nuclear magnetic resonance structure was determined and also used to improve cryoEM models of coat protein. The I-domain has an antiparallel six-stranded β-barrel fold, not previously observed in HK97-fold accessory domains. The D-loop, which is dynamic in the isolated I-domain and intact monomeric coat protein, forms stabilizing salt bridges between adjacent capsomers in procapsids. The S-loop is important for capsid size determination, likely through intrasubunit interactions. Ten of 18 coat protein temperature-sensitive-folding substitutions are in the I-domain, indicating its importance in folding and stability. Several are found on a positively charged face of the β-barrel that anchors the I-domain to a negatively charged surface of the coat protein HK97-core. Copyright © 2014 Elsevier Ltd. All rights reserved.
Even with nonnative interactions, the updated folding transition states of the homologs Proteins G & L are extensive and similar

PubMed Central

Baxa, Michael C.; Yu, Wookyung; Adhikari, Aashish N.; Ge, Liang; Xia, Zhen; Zhou, Ruhong; Freed, Karl F.; Sosnick, Tobin R.

2015-01-01

Experimental and computational folding studies of Proteins L & G and NuG2 typically find that sequence differences determine which of the two hairpins is formed in the transition state ensemble (TSE). However, our recent work on Protein L finds that its TSE contains both hairpins, compelling a reassessment of the influence of sequence on the folding behavior of the other two homologs. We characterize the TSEs for Protein G and NuG2b, a triple mutant of NuG2, using ψ analysis, a method for identifying contacts in the TSE. All three homologs are found to share a common and near-native TSE topology with interactions between all four strands. However, the helical content varies in the TSE, being largely absent in Proteins G & L but partially present in NuG2b. The variability likely arises from competing propensities for the formation of nonnative β turns in the naturally occurring proteins, as observed in our TerItFix folding algorithm. All-atom folding simulations of NuG2b recapitulate the observed TSEs with four strands for 5 of 27 transition paths [Lindorff-Larsen K, Piana S, Dror RO, Shaw DE (2011) Science 334(6055):517–520]. Our data support the view that homologous proteins have similar folding mechanisms, even when nonnative interactions are present in the transition state. These findings emphasize the ongoing challenge of accurately characterizing and predicting TSEs, even for relatively simple proteins. PMID:26100906
Protein Folding Simulations Combining Self-Guided Langevin Dynamics and Temperature-Based Replica Exchange

DTIC Science & Technology

2010-01-01

formulations of molecular dynamics (MD) and Langevin dynamics (LD) simulations for the prediction of thermodynamic folding observables of the Trp-cage...ad hoc force term in the SGLD model. Introduction Molecular dynamics (MD) simulations of small proteins provide insight into the mechanisms and... molecular dynamics (MD) and Langevin dynamics (LD) simulations for the prediction of thermodynamic folding observables of the Trp-cage mini-protein. All
Exploring the Universe of Protein Structures beyond the Protein Data Bank

PubMed Central

Cossio, Pilar; Trovato, Antonio; Pietrucci, Fabio; Seno, Flavio; Maritan, Amos; Laio, Alessandro

2010-01-01

It is currently believed that the atlas of existing protein structures is faithfully represented in the Protein Data Bank. However, whether this atlas covers the full universe of all possible protein structures is still a highly debated issue. By using a sophisticated numerical approach, we performed an exhaustive exploration of the conformational space of a 60 amino acid polypeptide chain described with an accurate all-atom interaction potential. We generated a database of around 30,000 compact folds with at least of secondary structure corresponding to local minima of the potential energy. This ensemble plausibly represents the universe of protein folds of similar length; indeed, all the known folds are represented in the set with good accuracy. However, we discover that the known folds form a rather small subset, which cannot be reproduced by choosing random structures in the database. Rather, natural and possible folds differ by the contact order, on average significantly smaller in the former. This suggests the presence of an evolutionary bias, possibly related to kinetic accessibility, towards structures with shorter loops between contacting residues. Beside their conceptual relevance, the new structures open a range of practical applications such as the development of accurate structure prediction strategies, the optimization of force fields, and the identification and design of novel folds. PMID:21079678
Specificity in substrate binding by protein folding catalysts: tyrosine and tryptophan residues are the recognition motifs for the binding of peptides to the pancreas-specific protein disulfide isomerase PDIp.

PubMed Central

Ruddock, L. W.; Freedman, R. B.; Klappa, P.

2000-01-01

Using a cross-linking approach, we recently demonstrated that radiolabeled peptides or misfolded proteins specifically interact in vitro with two luminal proteins in crude extracts from pancreas microsomes. The proteins were the folding catalysts protein disulfide isomerase (PDI) and PDIp, a glycosylated, PDI-related protein, expressed exclusively in the pancreas. In this study, we explore the specificity of these proteins in binding peptides and related ligands and show that tyrosine and tryptophan residues in peptides are the recognition motifs for their binding by PDIp. This peptide-binding specificity may reflect the selectivity of PDIp in binding regions of unfolded polypeptide during catalysis of protein folding. PMID:10794419
Protein collapse is encoded in the folded state architecture.

PubMed

Samanta, Himadri S; Zhuravlev, Pavel I; Hinczewski, Michael; Hori, Naoto; Chakrabarti, Shaon; Thirumalai, D

2017-05-21

Folded states of single domain globular proteins are compact with high packing density. The radius of gyration, R g , of both the folded and unfolded states increase as N ν where N is the number of amino acids in the protein. The values of the Flory exponent ν are, respectively, ≈⅓ and ≈0.6 in the folded and unfolded states, coinciding with those for homopolymers. However, the extent of compaction of the unfolded state of a protein under low denaturant concentration (collapsibility), conditions favoring the formation of the folded state, is unknown. We develop a theory that uses the contact map of proteins as input to quantitatively assess collapsibility of proteins. Although collapsibility is universal, the propensity to be compact depends on the protein architecture. Application of the theory to over two thousand proteins shows that collapsibility depends not only on N but also on the contact map reflecting the native structure. A major prediction of the theory is that β-sheet proteins are far more collapsible than structures dominated by α-helices. The theory and the accompanying simulations, validating the theoretical predictions, provide insights into the differing conclusions reached using different experimental probes assessing the extent of compaction of proteins. By calculating the criterion for collapsibility as a function of protein length we provide quantitative insights into the reasons why single domain proteins are small and the physical reasons for the origin of multi-domain proteins. Collapsibility of non-coding RNA molecules is similar β-sheet proteins structures adding support to "Compactness Selection Hypothesis".
Neural Influences on Sonic Hedgehog and Apoptosis in the Rat Penis1

PubMed Central

Bond, Christopher; Tang, Yi; Podlasek, Carol A.

2010-01-01

The role of sonic hedgehog (SHH) in maintaining corpora cavernosal morphology in the adult penis has been established; however, the mechanism of how SHH itself is regulated remains unclear. Since decreased SHH protein is a cause of smooth muscle apoptosis and erectile dysfunction (ED) in the penis, and SHH treatment can suppress cavernous nerve (CN) injury-induced apoptosis, the question of how SHH signaling is regulated is significant. It is likely that neural input is involved in this process since two models of neuropathy-induced ED exhibit decreased SHH protein and increased apoptosis in the penis. We propose the hypothesis that SHH abundance in the corpora cavernosa is regulated by SHH signaling in the pelvic ganglia, neural activity, or neural transport of a trophic factor from the pelvic ganglia to the corpora. We have examined each of these potential mechanisms. SHH inhibition in the penis shows a 12-fold increase in smooth muscle apoptosis. SHH inhibition in the pelvic ganglia causes significantly increased apoptosis (1.3-fold) and decreased SHH protein (1.1-fold) in the corpora cavernosa. SHH protein is not transported by the CN. Colchicine treatment of the CN resulted in significantly increased smooth muscle apoptosis (1.2-fold) and decreased SHH protein (1.3-fold) in the penis. Lidocaine treatment of the CN caused a similar increase in apoptosis (1.6-fold) and decrease in SHH protein (1.3-fold) in the penis. These results show that neural activity and a trophic factor from the pelvic ganglia/CN are necessary to regulate SHH protein and smooth muscle abundance in the penis. PMID:18256331
Predicting protein-binding RNA nucleotides with consideration of binding partners.

PubMed

Tuvshinjargal, Narankhuu; Lee, Wook; Park, Byungkyu; Han, Kyungsook

2015-06-01

In recent years several computational methods have been developed to predict RNA-binding sites in protein. Most of these methods do not consider interacting partners of a protein, so they predict the same RNA-binding sites for a given protein sequence even if the protein binds to different RNAs. Unlike the problem of predicting RNA-binding sites in protein, the problem of predicting protein-binding sites in RNA has received little attention mainly because it is much more difficult and shows a lower accuracy on average. In our previous study, we developed a method that predicts protein-binding nucleotides from an RNA sequence. In an effort to improve the prediction accuracy and usefulness of the previous method, we developed a new method that uses both RNA and protein sequence data. In this study, we identified effective features of RNA and protein molecules and developed a new support vector machine (SVM) model to predict protein-binding nucleotides from RNA and protein sequence data. The new model that used both protein and RNA sequence data achieved a sensitivity of 86.5%, a specificity of 86.2%, a positive predictive value (PPV) of 72.6%, a negative predictive value (NPV) of 93.8% and Matthews correlation coefficient (MCC) of 0.69 in a 10-fold cross validation; it achieved a sensitivity of 58.8%, a specificity of 87.4%, a PPV of 65.1%, a NPV of 84.2% and MCC of 0.48 in independent testing. For comparative purpose, we built another prediction model that used RNA sequence data alone and ran it on the same dataset. In a 10 fold-cross validation it achieved a sensitivity of 85.7%, a specificity of 80.5%, a PPV of 67.7%, a NPV of 92.2% and MCC of 0.63; in independent testing it achieved a sensitivity of 67.7%, a specificity of 78.8%, a PPV of 57.6%, a NPV of 85.2% and MCC of 0.45. In both cross-validations and independent testing, the new model that used both RNA and protein sequences showed a better performance than the model that used RNA sequence data alone in most performance measures. To the best of our knowledge, this is the first sequence-based prediction of protein-binding nucleotides in RNA which considers the binding partner of RNA. The new model will provide valuable information for designing biochemical experiments to find putative protein-binding sites in RNA with unknown structure. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

Dynamical Coupling of Intrinsically Disordered Proteins and Their Hydration Water: Comparison with Folded Soluble and Membrane Proteins

PubMed Central

Gallat, F.-X.; Laganowsky, A.; Wood, K.; Gabel, F.; van Eijck, L.; Wuttke, J.; Moulin, M.; Härtlein, M.; Eisenberg, D.; Colletier, J.-P.; Zaccai, G.; Weik, M.

2012-01-01

Hydration water is vital for various macromolecular biological activities, such as specific ligand recognition, enzyme activity, response to receptor binding, and energy transduction. Without hydration water, proteins would not fold correctly and would lack the conformational flexibility that animates their three-dimensional structures. Motions in globular, soluble proteins are thought to be governed to a certain extent by hydration-water dynamics, yet it is not known whether this relationship holds true for other protein classes in general and whether, in turn, the structural nature of a protein also influences water motions. Here, we provide insight into the coupling between hydration-water dynamics and atomic motions in intrinsically disordered proteins (IDP), a largely unexplored class of proteins that, in contrast to folded proteins, lack a well-defined three-dimensional structure. We investigated the human IDP tau, which is involved in the pathogenic processes accompanying Alzheimer disease. Combining neutron scattering and protein perdeuteration, we found similar atomic mean-square displacements over a large temperature range for the tau protein and its hydration water, indicating intimate coupling between them. This is in contrast to the behavior of folded proteins of similar molecular weight, such as the globular, soluble maltose-binding protein and the membrane protein bacteriorhodopsin, which display moderate to weak coupling, respectively. The extracted mean square displacements also reveal a greater motional flexibility of IDP compared with globular, folded proteins and more restricted water motions on the IDP surface. The results provide evidence that protein and hydration-water motions mutually affect and shape each other, and that there is a gradient of coupling across different protein classes that may play a functional role in macromolecular activity in a cellular context. PMID:22828339
Interferences of Silica Nanoparticles in Green Fluorescent Protein Folding Processes.

PubMed

Klein, Géraldine; Devineau, Stéphanie; Aude, Jean Christophe; Boulard, Yves; Pasquier, Hélène; Labarre, Jean; Pin, Serge; Renault, Jean Philippe

2016-01-12

We investigated the relationship between unfolded proteins, silica nanoparticles and chaperonin to determine whether unfolded proteins could stick to silica surfaces and how this process could impair heat shock protein activity. The HSP60 catalyzed green fluorescent protein (GFP) folding was used as a model system. The adsorption isotherms and adsorption kinetics of denatured GFP were measured, showing that denaturation increases GFP affinity for silica surfaces. This affinity is maintained even if the surfaces are covered by a protein corona and allows silica NPs to interfere directly with GFP folding by trapping it in its unstructured state. We determined also the adsorption isotherms of HSP60 and its chaperonin activity once adsorbed, showing that SiO2 NP can interfere also indirectly with protein folding through chaperonin trapping and inhibition. This inhibition is specifically efficient when NPs are covered first with a layer of unfolded proteins. These results highlight for the first time the antichaperonin activity of silica NPs and ask new questions about the toxicity of such misfolded proteins/nanoparticles assembly toward cells.
Context-dependent effects of asparagine glycosylation on Pin WW folding kinetics and thermodynamics.

PubMed

Price, Joshua L; Shental-Bechor, Dalit; Dhar, Apratim; Turner, Maurice J; Powers, Evan T; Gruebele, Martin; Levy, Yaakov; Kelly, Jeffery W

2010-11-03

Asparagine glycosylation is one of the most common and important post-translational modifications of proteins in eukaryotic cells. N-glycosylation occurs when a triantennary glycan precursor is transferred en bloc to a nascent polypeptide (harboring the N-X-T/S sequon) as the peptide is cotranslationally translocated into the endoplasmic reticulum (ER). In addition to facilitating binding interactions with components of the ER proteostasis network, N-glycans can also have intrinsic effects on protein folding by directly altering the folding energy landscape. Previous work from our laboratories (Hanson et al. Proc. Natl. Acad. Sci. U.S.A. 2009, 109, 3131-3136; Shental-Bechor, D.; Levy, Y. Proc. Natl. Acad. Sci. U.S.A. 2008, 105, 8256-8261) suggested that the three sugar residues closest to the protein are sufficient for accelerating protein folding and stabilizing the resulting structure in vitro; even a monosaccharide can have a dramatic effect. The highly conserved nature of these three proximal sugars in N-glycans led us to speculate that introducing an N-glycosylation site into a protein that is not normally glycosylated would stabilize the protein and increase its folding rate in a manner that does not depend on the presence of specific stabilizing protein-saccharide interactions. Here, we test this hypothesis experimentally and computationally by incorporating an N-linked GlcNAc residue at various positions within the Pin WW domain, a small β-sheet-rich protein. The results show that an increased folding rate and enhanced thermodynamic stability are not general, context-independent consequences of N-glycosylation. Comparison between computational predictions and experimental observations suggests that generic glycan-based excluded volume effects are responsible for the destabilizing effect of glycosylation at highly structured positions. However, this reasoning does not adequately explain the observed destabilizing effect of glycosylation within flexible loops. Our data are consistent with the hypothesis that specific, evolved protein-glycan contacts must also play an important role in mediating the beneficial energetic effects on protein folding that glycosylation can confer.
Approximate Solutions for a Self-Folding Problem of Carbon Nanotubes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Y Mikata

2006-08-22

This paper treats approximate solutions for a self-folding problem of carbon nanotubes. It has been observed in the molecular dynamics calculations [1] that a carbon nanotube with a large aspect ratio can self-fold due to van der Waals force between the parts of the same carbon nanotube. The main issue in the self-folding problem is to determine the minimum threshold length of the carbon nanotube at which it becomes possible for the carbon nanotube to self-fold due to the van der Waals force. An approximate mathematical model based on the force method is constructed for the self-folding problem of carbonmore » nanotubes, and it is solved exactly as an elastica problem using elliptic functions. Additionally, three other mathematical models are constructed based on the energy method. As a particular example, the lower and upper estimates for the critical threshold (minimum) length are determined based on both methods for the (5,5) armchair carbon nanotube.« less
Multiple scales and phases in discrete chains with application to folded proteins

NASA Astrophysics Data System (ADS)

Sinelnikova, A.; Niemi, A. J.; Nilsson, Johan; Ulybyshev, M.

2018-05-01

Chiral heteropolymers such as large globular proteins can simultaneously support multiple length scales. The interplay between the different scales brings about conformational diversity, determines the phase properties of the polymer chain, and governs the structure of the energy landscape. Most importantly, multiple scales produce complex dynamics that enable proteins to sustain live matter. However, at the moment there is incomplete understanding of how to identify and distinguish the various scales that determine the structure and dynamics of a complex protein. Here we address this impending problem. We develop a methodology with the potential to systematically identify different length scales, in the general case of a linear polymer chain. For this we introduce and analyze the properties of an order parameter that can both reveal the presence of different length scales and can also probe the phase structure. We first develop our concepts in the case of chiral homopolymers. We introduce a variant of Kadanoff's block-spin transformation to coarse grain piecewise linear chains, such as the C α backbone of a protein. We derive analytically, and then verify numerically, a number of properties that the order parameter can display, in the case of a chiral polymer chain. In particular, we propose that in the case of a chiral heteropolymer the order parameter can reveal traits of several different phases, contingent on the length scale at which it is scrutinized. We confirm that this is the case with crystallographic protein structures in the Protein Data Bank. Thus our results suggest relations between the scales, the phases, and the complexity of folding pathways.
Topological switching between an alpha-beta parallel protein and a remarkably helical molten globule.

PubMed

Nabuurs, Sanne M; Westphal, Adrie H; aan den Toorn, Marije; Lindhoud, Simon; van Mierlo, Carlo P M

2009-06-17

Partially folded protein species transiently exist during folding of most proteins. Often these species are molten globules, which may be on- or off-pathway to native protein. Molten globules have a substantial amount of secondary structure but lack virtually all the tertiary side-chain packing characteristic of natively folded proteins. These ensembles of interconverting conformers are prone to aggregation and potentially play a role in numerous devastating pathologies, and thus attract considerable attention. The molten globule that is observed during folding of apoflavodoxin from Azotobacter vinelandii is off-pathway, as it has to unfold before native protein can be formed. Here we report that this species can be trapped under nativelike conditions by substituting amino acid residue F44 by Y44, allowing spectroscopic characterization of its conformation. Whereas native apoflavodoxin contains a parallel beta-sheet surrounded by alpha-helices (i.e., the flavodoxin-like or alpha-beta parallel topology), it is shown that the molten globule has a totally different topology: it is helical and contains no beta-sheet. The presence of this remarkably nonnative species shows that single polypeptide sequences can code for distinct folds that swap upon changing conditions. Topological switching between unrelated protein structures is likely a general phenomenon in the protein structure universe.
Free Energy Landscape - Settlements of Key Residues.

NASA Astrophysics Data System (ADS)

Aroutiounian, Svetlana

2007-03-01

FEL perspective in studies of protein folding transitions reflects notion that since there are ˜10^N conformations to scan in search of lowest free energy state, random search is beyond biological timescale. Protein folding must follow certain fel pathways and folding kinetics of evolutionary selected proteins dominates kinetic traps. Good model for functional robustness of natural proteins - coarse-grained model protein is not very accurate but affords bringing simulations closer to biological realm; Go-like potential secures the fel funnel shape; biochemical contacts signify the funnel bottleneck. Boltzmann-weighted ensemble of protein conformations and histogram method are used to obtain from MC sampling of protein conformational space the approximate probability distribution. The fel is F(rmsd) = -1/βLn[Hist(rmsd)], β=kBT and rmsd is root-mean-square-deviation from native conformation. The sperm whale myoglobin has rich dynamic behavior, is small and large - on computational scale, has a symmetry in architecture and unusual sextet of residue pairs. Main idea: there is a mathematical relation between protein fel and a key residues set providing stability to folding transition. Is the set evolutionary conserved also for functional reasons? Hypothesis: primary sequence determines the key residues positions conserved as stabilizers and the fel is the battlefield for the folding stability. Preliminary results: primary sequence - not the architecture, is the rule settler, indeed.
Nasal mucus proteomic changes reflect altered immune responses and epithelial permeability in patients with allergic rhinitis.

PubMed

Tomazic, Peter Valentin; Birner-Gruenberger, Ruth; Leitner, Anita; Obrist, Britta; Spoerk, Stefan; Lang-Loidolt, Doris

2014-03-01

Nasal mucus is the first-line defense barrier against (aero-) allergens. However, its proteome and function have not been clearly investigated. The role of nasal mucus in the pathophysiology of allergic rhinitis was investigated by analyzing its proteome in patients with allergic rhinitis (n = 29) and healthy control subjects (n = 29). Nasal mucus was collected with a suction device, tryptically digested, and analyzed by using liquid chromatography-tandem mass spectrometry. Proteins were identified by searching the SwissProt database and annotated by collecting gene ontology data from databases and existing literature. Gene enrichment analysis was performed by using Cytoscape/BINGO software tools. Proteins were quantified with spectral counting, and selected proteins were confirmed by means of Western blotting. In total, 267 proteins were identified, with 20 (7.5%) found exclusively in patients with allergic rhinitis and 25 (9.5%) found exclusively in healthy control subjects. Five proteins were found to be significantly upregulated in patients with allergic rhinitis (apolipoprotein A-2 [APOA2], 9.7-fold; α2-macroglobulin [A2M], 4.5-fold; apolipoprotein A-1 [APOA1], 3.2-fold; α1-antitrypsin [SERPINA1], 2.5-fold; and complement C3 [C3], 2.3-fold) and 5 were found to be downregulated (antileukoproteinase [SLPI], 0.6-fold; WAP 4-disulfide core domain protein [WFDC2], 0.5-fold; haptoglobin [HP], 0.7-fold; IgJ chain [IGJ], 0.7-fold; and Ig hc V-III region BRO, 0.8-fold) compared with levels seen in healthy control subjects. The allergic rhinitis mucus proteome shows an enhanced immune response in which apolipoproteins might play an important role. Furthermore, an imbalance between cysteine proteases and antiproteases could be seen, which negatively affects epithelial integrity on exposure to pollen protease activity. This reflects the important role of mucus as the first-line defense barrier against allergens. Copyright © 2013 American Academy of Allergy, Asthma & Immunology. Published by Mosby, Inc. All rights reserved.
Unconstrained Structure Formation in Coarse-Grained Protein Simulations

NASA Astrophysics Data System (ADS)

Bereau, Tristan

The ability of proteins to fold into well-defined structures forms the basis of a wide variety of biochemical functions in and out of the cell membrane. Many of these processes, however, operate at time- and length-scales that are currently unattainable by all-atom computer simulations. To cope with this difficulty, increasingly more accurate and sophisticated coarse-grained models are currently being developed. In the present thesis, we introduce a solvent-free coarse-grained model for proteins. Proteins are modeled by four beads per amino acid, providing enough backbone resolution to allow for accurate sampling of local conformations. It relies on simple interactions that emphasize structure, such as hydrogen bonds and hydrophobicity. Realistic alpha/beta content is achieved by including an effective nearest-neighbor dipolar interaction. Parameters are tuned to reproduce both local conformations and tertiary structures. By studying both helical and extended conformations we make sure the force field is not biased towards any particular secondary structure. Without any further adjustments or bias a realistic oligopeptide aggregation scenario is observed. The model is subsequently applied to various biophysical problems: (i) kinetics of folding of two model peptides, (ii) large-scale amyloid-beta oligomerization, and (iii) protein folding cooperativity. The last topic---defined by the nature of the finite-size thermodynamic transition exhibited upon folding---was investigated from a microcanonical perspective: the accurate evaluation of the density of states can unambiguously characterize the nature of the transition, unlike its corresponding canonical analysis. Extending the results of lattice simulations and theoretical models, we find that it is the interplay between secondary structure and the loss of non-native tertiary contacts which determines the nature of the transition. Finally, we combine the peptide model with a high-resolution, solvent-free, lipid model. The lipid force field was systematically tuned to reproduce the structural and mechanical properties of phosphatidylcholine bilayers. The two models were cross-parametrized against atomistic potential of mean force curves for the insertion of single amino acid side chains into a bilayer. Coarse-grained transmembrane protein simulations were then compared with experiments and atomistic simulations to validate the force field. The transferability of the two models across amino acid sequences and lipid species permits the investigation of a wide variety of scenarios, while the absence of explicit solvent allows for studies of large-scale phenomena.
Comparative Protein Structure Modeling Using MODELLER.

PubMed

Webb, Benjamin; Sali, Andrej

2014-09-08

Functional characterization of a protein sequence is one of the most frequent problems in biology. This task is usually facilitated by accurate three-dimensional (3-D) structure of the studied protein. In the absence of an experimentally determined structure, comparative or homology modeling can sometimes provide a useful 3-D model for a protein that is related to at least one known protein structure. Comparative modeling predicts the 3-D structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described. Copyright © 2014 John Wiley & Sons, Inc.
Stereochemistry and solvent role in protein folding: nuclear magnetic resonance and molecular dynamics studies of poly-L and alternating-L,D homopolypeptides in dimethyl sulfoxide.

PubMed

Srivastava, Kinshuk Raj; Kumar, Anil; Goyal, Bhupesh; Durani, Susheel

2011-05-26

The competing interactions folding and unfolding protein structure remain obscure. Using homopolypeptides, we ask if poly-L structure may have a role. We mutate the structure to alternating-L,D stereochemistry and substitute water as the fold-promoting solvent with methanol and dimethyl sulfoxide (DMSO) as the fold-denaturing solvents. Circular dichroism and molecular dynamics established previously that, while both isomers were folded in water, the poly-L isomer was unfolded and alternating-L,D isomer folded in methanol. Nuclear magnetic resonance and molecular dynamics establish now that both isomers are unfolded in DMSO. We calculated energetics of folding-unfolding equilibrium with water and methanol as solvents. We have now calculated interactions of unfolded polypeptide structures with DMSO as solvent. Methanol was found to unfold and water fold poly-L structure as a dielectric. DMSO has now been found to unfold both poly-L and alternating-L,D structures by strong solvation of peptides to disrupt their hydrogen bonds. Accordingly, we propose that while linked peptides fold protein structure with hydrogen bonds they unfold the structure electrostatically due to the stereochemical effect of the poly-L structure. Protein folding to ordering of peptide hydrogen bonds with water as canonical solvent may thus involve two specific and independent solvent effects-one, strong screening of electrostatics of poly-L linked peptides, and two, weak dipolar solvation of peptides. Correspondingly, protein denaturation may involve two independent solvent effects-one, weak dielectric to unfold poly-L structure electrostatically, and two, strong polarity to disrupt peptide hydrogen bonds by solvation of peptides.
Only Five of 10 Strictly Conserved Disulfide Bonds Are Essential for Folding and Eight for Function of the HIV-1 Envelope Glycoprotein

PubMed Central

van Anken, Eelco; Sanders, Rogier W.; Liscaljet, I. Marije; Land, Aafke; Bontjer, Ilja; Tillemans, Sonja; Nabatov, Alexey A.; Paxton, William A.; Berkhout, Ben

2008-01-01

Protein folding in the endoplasmic reticulum goes hand in hand with disulfide bond formation, and disulfide bonds are considered key structural elements for a protein's folding and function. We used the HIV-1 Envelope glycoprotein to examine in detail the importance of its 10 completely conserved disulfide bonds. We systematically mutated the cysteines in its ectodomain, assayed the mutants for oxidative folding, transport, and incorporation into the virus, and tested fitness of mutant viruses. We found that the protein was remarkably tolerant toward manipulation of its disulfide-bonded structure. Five of 10 disulfide bonds were dispensable for folding. Two of these were even expendable for viral replication in cell culture, indicating that the relevance of these disulfide bonds becomes manifest only during natural infection. Our findings refine old paradigms on the importance of disulfide bonds for proteins. PMID:18653472
How Adequate are One- and Two-Dimensional Free Energy Landscapes for Protein Folding Dynamics?

NASA Astrophysics Data System (ADS)

Maisuradze, Gia G.; Liwo, Adam; Scheraga, Harold A.

2009-06-01

The molecular dynamics trajectories of protein folding or unfolding, generated with the coarse-grained united-residue force field for the B domain of staphylococcal protein A, were analyzed by principal component analysis (PCA). The folding or unfolding process was examined by using free-energy landscapes (FELs) in PC space. By introducing a novel multidimensional FEL, it was shown that the low-dimensional FELs are not always sufficient for the description of folding or unfolding processes. Similarities between the topographies of FELs along low- and high-indexed principal components were observed.
Comparative analysis of the folding dynamics and kinetics of an engineered knotted protein and its variants derived from HP0242 of Helicobacter pylori

NASA Astrophysics Data System (ADS)

Wang, Liang-Wei; Liu, Yu-Nan; Lyu, Ping-Chiang; Jackson, Sophie E.; Hsu, Shang-Te Danny

2015-09-01

Understanding the mechanism by which a polypeptide chain thread itself spontaneously to attain a knotted conformation has been a major challenge in the field of protein folding. HP0242 is a homodimeric protein from Helicobacter pylori with intertwined helices to form a unique pseudo-knotted folding topology. A tandem HP0242 repeat has been constructed to become the first engineered trefoil-knotted protein. Its small size renders it a model system for computational analyses to examine its folding and knotting pathways. Here we report a multi-parametric study on the folding stability and kinetics of a library of HP0242 variants, including the trefoil-knotted tandem HP0242 repeat, using far-UV circular dichroism and fluorescence spectroscopy. Equilibrium chemical denaturation of HP0242 variants shows the presence of highly populated dimeric and structurally heterogeneous folding intermediates. Such equilibrium folding intermediates retain significant amount of helical structures except those at the N- and C-terminal regions in the native structure. Stopped-flow fluorescence measurements of HP0242 variants show that spontaneous refolding into knotted structures can be achieved within seconds, which is several orders of magnitude faster than previously observed for other knotted proteins. Nevertheless, the complex chevron plots indicate that HP0242 variants are prone to misfold into kinetic traps, leading to severely rolled-over refolding arms. The experimental observations are in general agreement with the previously reported molecular dynamics simulations. Based on our results, kinetic folding pathways are proposed to qualitatively describe the complex folding processes of HP0242 variants.
Competing Pathways and Multiple Folding Nuclei in a Large Multidomain Protein, Luciferase.

PubMed

Scholl, Zackary N; Yang, Weitao; Marszalek, Piotr E

2017-05-09

Proteins obtain their final functional configuration through incremental folding with many intermediate steps in the folding pathway. If known, these intermediate steps could be valuable new targets for designing therapeutics and the sequence of events could elucidate the mechanism of refolding. However, determining these intermediate steps is hardly an easy feat, and has been elusive for most proteins, especially large, multidomain proteins. Here, we effectively map part of the folding pathway for the model large multidomain protein, Luciferase, by combining single-molecule force-spectroscopy experiments and coarse-grained simulation. Single-molecule refolding experiments reveal the initial nucleation of folding while simulations corroborate these stable core structures of Luciferase, and indicate the relative propensities for each to propagate to the final folded native state. Both experimental refolding and Monte Carlo simulations of Markov state models generated from simulation reveal that Luciferase most often folds along a pathway originating from the nucleation of the N-terminal domain, and that this pathway is the least likely to form nonnative structures. We then engineer truncated variants of Luciferase whose sequences corresponded to the putative structure from simulation and we use atomic force spectroscopy to determine their unfolding and stability. These experimental results corroborate the structures predicted from the folding simulation and strongly suggest that they are intermediates along the folding pathway. Taken together, our results suggest that initial Luciferase refolding occurs along a vectorial pathway and also suggest a mechanism that chaperones may exploit to prevent misfolding. Copyright © 2017 Biophysical Society. Published by Elsevier Inc. All rights reserved.
PconsFold: improved contact predictions improve protein models.

PubMed

Michel, Mirco; Hayat, Sikander; Skwark, Marcin J; Sander, Chris; Marks, Debora S; Elofsson, Arne

2014-09-01

Recently it has been shown that the quality of protein contact prediction from evolutionary information can be improved significantly if direct and indirect information is separated. Given sufficiently large protein families, the contact predictions contain sufficient information to predict the structure of many protein families. However, since the first studies contact prediction methods have improved. Here, we ask how much the final models are improved if improved contact predictions are used. In a small benchmark of 15 proteins, we show that the TM-scores of top-ranked models are improved by on average 33% using PconsFold compared with the original version of EVfold. In a larger benchmark, we find that the quality is improved with 15-30% when using PconsC in comparison with earlier contact prediction methods. Further, using Rosetta instead of CNS does not significantly improve global model accuracy, but the chemistry of models generated with Rosetta is improved. PconsFold is a fully automated pipeline for ab initio protein structure prediction based on evolutionary information. PconsFold is based on PconsC contact prediction and uses the Rosetta folding protocol. Due to its modularity, the contact prediction tool can be easily exchanged. The source code of PconsFold is available on GitHub at https://www.github.com/ElofssonLab/pcons-fold under the MIT license. PconsC is available from http://c.pcons.net/. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.
Circuit topology of proteins and nucleic acids.

PubMed

Mashaghi, Alireza; van Wijk, Roeland J; Tans, Sander J

2014-09-02

Folded biomolecules display a bewildering structural complexity and diversity. They have therefore been analyzed in terms of generic topological features. For instance, folded proteins may be knotted, have beta-strands arranged into a Greek-key motif, or display high contact order. In this perspective, we present a method to formally describe the topology of all folded linear chains and hence provide a general classification and analysis framework for a range of biomolecules. Moreover, by identifying the fundamental rules that intrachain contacts must obey, the method establishes the topological constraints of folded linear chains. We also briefly illustrate how this circuit topology notion can be applied to study the equivalence of folded chains, the engineering of artificial RNA structures and DNA origami, the topological structure of genomes, and the role of topology in protein folding. Copyright © 2014 Elsevier Ltd. All rights reserved.
[CCT chaperonins and their cochaperons].

PubMed

Bregier, Cezary; Kupikowska, Barbara; Fabczak, Hanna; Fabczak, Stanisław

2008-01-01

Chaperonins are large oligomers consisting of two superimposed rings, each enclosing a cavity used for the folding of other proteins. They have been divided into two groups. Chaperonins of type I were identified in mitochondria and chloroplasts (Hsp60) or bacterial cytosol (GroEL) as well. Chaperonins type II were found in Archea and the eukaryotic cell cytosol (CCT). Protein folding occurs in the chaperonin after its conformational changes induced upon ATP binding. Mechanism of the protein folding, although still poorly defined, clearly differs from the one established for GroEL. Although CCT with prefoldin seems to be mainly involved in the folding of actin and tubulin, other substrates engaged in various cellular processes are beginning to be characterized, including proteins possessing WD40-repeats. Moreover, several lines of evidence suggest that beside prefoldin, CCT may work in concert with phosducin-like proteins (PhLPs).
Proteomic analysis of mouse thymoma EL4 cells treated with bis(tri-n-butyltin)oxide (TBTO).

PubMed

Osman, Ahmed M; van Kol, Sandra; Peijnenburg, Ad; Blokland, Marco; Pennings, Jeroen L A; Kleinjans, Jos C S; van Loveren, Henk

2009-09-01

Here, we report the results of proteomic analysis of the mouse thymoma EL4 cell line exposed to bis(tri-n-butylin)oxide (TBTO), an immunotoxic organotin compound. The objective of the work was to examine whether TBTO affects the expression of proteins in this cell line and to compare the differentially expressed proteins with the corresponding mRNA expression data. The identified proteins were quantified using a label-free quantitative method based on counting the observed peptides as an index of protein abundance. The calculation of the ratio of peptides obtained from exposed and control samples allowed us to evaluate the effect of TBTO on protein expression and to compare these results to those obtained in gene expression profiling studies. Correlation of some of the differentially expressed proteins and their corresponding mRNAs was observed. The analysis of the protein ratios revealed that 12 proteins were significantly affected. These proteins included cytoskeleton proteins myosin-9, spectrin beta 2 and plectin 8. The first two proteins were down-regulated 3-fold, whereas the third was up-regulated 2-fold. Ras-related Rab1, a GTP binding protein and T-complex protein-1 subunit alpha, a chaperonin, were decreased 2- and 3.6-fold, respectively. The ribosomal S10 and eukaryotic translation factor (eIf4G1), which are involved in protein synthesis, were down-regulated 2.6- and 3.7-fold, respectively. Also, proteins involved in splicing of pre-mRNA and in transcription, splicing factor arginine/serine-rich 2 and chromodomain-helicase-DNA binding protein 4 (Chd4), were decreased 2.6- and 4.5 times, respectively. Nuclear RNA helicase II was reduced 2.8-fold. Finally, prothymosin-alpha (ProTalpha), an essential protein for cell proliferation, and a protein similar to ProTalpha, (with a molecular weight and a pI (3.54) comparable to that of ProTalpha) were also down-regulated 6-and 8-fold, respectively. We propose that the observed down-regulation of the expression level of ProTalpha in the TBTO-exposed cells could account for the previously reported anti-proliferative effect of TBTO.
Evaluating Protein Structure and Dynamics Using Co-Solvents, Photochemical Triggers, and Site-Specific Spectroscopic Probes

NASA Astrophysics Data System (ADS)

Abaskharon, Rachel M.

As ubiquitous and diverse biopolymers, proteins are dynamic molecules that are constantly engaging in inter- and intramolecular interactions responsible for their structure, fold, and function. Because of this, gaining a comprehensive understanding of the factors that control protein conformation and dynamics remains elusive as current experimental techniques often lack the ability to initiate and probe a specific interaction or conformational transition. For this reason, this thesis aims to develop methods to control and monitor protein conformations, conformational transitions, and dynamics in a site-specific manner, as well as to understand how specific and non-specific interactions affect the protein folding energy landscape. First, by using the co-solvent, trifluoroethanol (TFE), we show that the rate at which a peptide folds can be greatly impacted and thus controlled by the excluded volume effect. Secondly, we demonstrate the utility of several light-responsive molecules and reactions as methods to manipulate and investigate protein-folding processes. Using an azobenzene linker as a photo-initiator, we are able to increase the folding rate of a protein system by an order of magnitude by channeling a sub-population through a parallel, faster folding pathway. Additionally, we utilize a tryptophan-mediated electron transfer process to a nearby disulfide bond to strategically unfold a protein molecule with ultraviolet light. We also demonstrate the potential of two ruthenium polypyridyl complexes as ultrafast phototriggers of protein reactions. Finally, we develop several site-specific spectroscopic probes of protein structure and environment. Specifically, we demonstrate that a 13C-labeled aspartic acid residue constitutes a useful site-specific infrared probe for investigating salt-bridges and hydration dynamics of proteins, particularly in proteins containing several acidic amino acids. We also show that a proline-derivative, 4-oxoproline, possesses novel infrared properties that can be exploited to monitor the cis-trans isomerization process of individual proline residues in proteins.

Physical-chemical features of non-detergent sulfobetaines active as protein-folding helpers.

PubMed

Expert-Bezançon, Nicole; Rabilloud, Thierry; Vuillard, Laurent; Goldberg, Michel E

2003-01-01

Some non-detergent sulfobetaines had been shown to prevent aggregation and improve the yield of active proteins when added to the buffer during in vitro protein renaturation. With the aim of designing more efficient folding helpers, a series of non-detergent sulfobetaines have been synthesized and their efficiency in improving the renaturation of a variety of proteins (E. coli tryptophan synthase and beta-D-galactosidase, hen lysozyme, bovine serum albumin, a monoclonal antibody) have been investigated. Attempts to correlate the structure of each sulfobetaines with its effect on folding revealed some molecular features that appear important in helping renaturation. This enabled us to design and synthesize new non-detergent sulfobetaines that act as potent folding helpers.
Folding of multidomain proteins: biophysical consequences of tethering even in apparently independent folding.

PubMed

Arviv, Oshrit; Levy, Yaakov

2012-12-01

Most eukaryotic and a substantial fraction of prokaryotic proteins are composed of more than one domain. The tethering of these evolutionary, structural, and functional units raises, among others, questions regarding the folding process of conjugated domains. Studying the folding of multidomain proteins in silico enables one to identify and isolate the tethering-induced biophysical determinants that govern crosstalks generated between neighboring domains. For this purpose, we carried out coarse-grained and atomistic molecular dynamics simulations of two two-domain constructs from the immunoglobulin-like β-sandwich fold. Each of these was experimentally shown to behave as the "sum of its parts," that is, the thermodynamic and kinetic folding behavior of the constituent domains of these constructs seems to occur independently, with the folding of each domain uncoupled from the folding of its partner in the two-domain construct. We show that the properties of the individual domains can be significantly affected by conjugation to another domain. The tethering may be accompanied by stabilizing as well as destabilizing factors whose magnitude depends on the size of the interface, the length, and the flexibility of the linker, and the relative stability of the domains. Accordingly, the folding of a multidomain protein should not be viewed as the sum of the folding patterns of each of its parts, but rather, it involves abrogating several effects that lead to this outcome. An imbalance between these effects may result in either stabilization or destabilization owing to the tethering. Copyright © 2012 Wiley Periodicals, Inc.
Investigation of protein folding by coarse-grained molecular dynamics with the UNRES force field.

PubMed

Maisuradze, Gia G; Senet, Patrick; Czaplewski, Cezary; Liwo, Adam; Scheraga, Harold A

2010-04-08

Coarse-grained molecular dynamics simulations offer a dramatic extension of the time-scale of simulations compared to all-atom approaches. In this article, we describe the use of the physics-based united-residue (UNRES) force field, developed in our laboratory, in protein-structure simulations. We demonstrate that this force field offers about a 4000-times extension of the simulation time scale; this feature arises both from averaging out the fast-moving degrees of freedom and reduction of the cost of energy and force calculations compared to all-atom approaches with explicit solvent. With massively parallel computers, microsecond folding simulation times of proteins containing about 1000 residues can be obtained in days. A straightforward application of canonical UNRES/MD simulations, demonstrated with the example of the N-terminal part of the B-domain of staphylococcal protein A (PDB code: 1BDD, a three-alpha-helix bundle), discerns the folding mechanism and determines kinetic parameters by parallel simulations of several hundred or more trajectories. Use of generalized-ensemble techniques, of which the multiplexed replica exchange method proved to be the most effective, enables us to compute thermodynamics of folding and carry out fully physics-based prediction of protein structure, in which the predicted structure is determined as a mean over the most populated ensemble below the folding-transition temperature. By using principal component analysis of the UNRES folding trajectories of the formin-binding protein WW domain (PDB code: 1E0L; a three-stranded antiparallel beta-sheet) and 1BDD, we identified representative structures along the folding pathways and demonstrated that only a few (low-indexed) principal components can capture the main structural features of a protein-folding trajectory; the potentials of mean force calculated along these essential modes exhibit multiple minima, as opposed to those along the remaining modes that are unimodal. In addition, a comparison between the structures that are representative of the minima in the free-energy profile along the essential collective coordinates of protein folding (computed by principal component analysis) and the free-energy profile projected along the virtual-bond dihedral angles gamma of the backbone revealed the key residues involved in the transitions between the different basins of the folding free-energy profile, in agreement with existing experimental data for 1E0L .
Structural Determinants of Sleeping Beauty Transposase Activity

PubMed Central

Abrusán, György; Yant, Stephen R; Szilágyi, András; Marsh, Joseph A; Mátés, Lajos; Izsvák, Zsuzsanna; Barabás, Orsolya; Ivics, Zoltán

2016-01-01

Transposases are important tools in genome engineering, and there is considerable interest in engineering more efficient ones. Here, we seek to understand the factors determining their activity using the Sleeping Beauty transposase. Recent work suggests that protein coevolutionary information can be used to classify groups of physically connected, coevolving residues into elements called “sectors”, which have proven useful for understanding the folding, allosteric interactions, and enzymatic activity of proteins. Using extensive mutagenesis data, protein modeling and analysis of folding energies, we show that (i) The Sleeping Beauty transposase contains two sectors, which span across conserved domains, and are enriched in DNA-binding residues, indicating that the DNA binding and endonuclease functions of the transposase coevolve; (ii) Sector residues are highly sensitive to mutations, and most mutations of these residues strongly reduce transposition rate; (iii) Mutations with a strong effect on free energy of folding in the DDE domain of the transposase significantly reduce transposition rate. (iv) Mutations that influence DNA and protein-protein interactions generally reduce transposition rate, although most hyperactive mutants are also located on the protein surface, including residues with protein-protein interactions. This suggests that hyperactivity results from the modification of protein interactions, rather than the stabilization of protein fold. PMID:27401040
How Well Does a Funneled Energy Landscape Capture the Folding Mechanism of Spectrin Domains?

PubMed Central

2013-01-01

Three structurally similar domains from α-spectrin have been shown to fold very differently. Firstly, there is a contrast in the folding mechanism, as probed by Φ-value analysis, between the R15 domain and the R16 and R17 domains. Secondly, there are very different contributions from internal friction to folding: the folding rate of the R15 domain was found to be inversely proportional to solvent viscosity, showing no apparent frictional contribution from the protein, but in the other two domains a large internal friction component was evident. Non-native misdocking of helices has been suggested to be responsible for this phenomenon. Here, I study the folding of these three proteins with minimalist coarse-grained models based on a funneled energy landscape. Remarkably, I find that, despite the absence of non-native interactions, the differences in folding mechanism of the domains are well captured by the model, and the agreement of the Φ-values with experiment is fairly good. On the other hand, within the context of this model, there are no significant differences in diffusion coefficient along the chosen folding coordinate, and the model cannot explain the large differences in folding rates between the proteins found experimentally. These results are nonetheless consistent with the expectations from the energy landscape perspective of protein folding: namely, that the folding mechanism is primarily determined by the native-like interactions present in the Gō-like model, with missing non-native interactions being required to explain the differences in “internal friction” seen in experiment. PMID:23947368
Mechanical Modeling and Computer Simulation of Protein Folding

ERIC Educational Resources Information Center

Prigozhin, Maxim B.; Scott, Gregory E.; Denos, Sharlene

2014-01-01

In this activity, science education and modern technology are bridged to teach students at the high school and undergraduate levels about protein folding and to strengthen their model building skills. Students are guided from a textbook picture of a protein as a rigid crystal structure to a more realistic view: proteins are highly dynamic…
The double life of the ribosome: When its protein folding activity supports prion propagation.

PubMed

Voisset, Cécile; Blondel, Marc; Jones, Gary W; Friocourt, Gaëlle; Stahl, Guillaume; Chédin, Stéphane; Béringue, Vincent; Gillet, Reynald

2017-03-04

It is no longer necessary to demonstrate that ribosome is the central machinery of protein synthesis. But it is less known that it is also key player of the protein folding process through another conserved function: the protein folding activity of the ribosome (PFAR). This ribozyme activity, discovered more than 2 decades ago, depends upon the domain V of the large rRNA within the large subunit of the ribosome. Surprisingly, we discovered that anti-prion compounds are also potent PFAR inhibitors, highlighting an unexpected link between PFAR and prion propagation. In this review, we discuss the ancestral origin of PFAR in the light of the ancient RNA world hypothesis. We also consider how this ribosomal activity fits into the landscape of cellular protein chaperones involved in the appearance and propagation of prions and other amyloids in mammals. Finally, we examine how drugs targeting the protein folding activity of the ribosome could be active against mammalian prion and other protein aggregation-based diseases, making PFAR a promising therapeutic target for various human protein misfolding diseases.
The Role of High-Dimensional Diffusive Search, Stabilization, and Frustration in Protein Folding

PubMed Central

Rimratchada, Supreecha; McLeish, Tom C.B.; Radford, Sheena E.; Paci, Emanuele

2014-01-01

Proteins are polymeric molecules with many degrees of conformational freedom whose internal energetic interactions are typically screened to small distances. Therefore, in the high-dimensional conformation space of a protein, the energy landscape is locally relatively flat, in contrast to low-dimensional representations, where, because of the induced entropic contribution to the full free energy, it appears funnel-like. Proteins explore the conformation space by searching these flat subspaces to find a narrow energetic alley that we call a hypergutter and then explore the next, lower-dimensional, subspace. Such a framework provides an effective representation of the energy landscape and folding kinetics that does justice to the essential characteristic of high-dimensionality of the search-space. It also illuminates the important role of nonnative interactions in defining folding pathways. This principle is here illustrated using a coarse-grained model of a family of three-helix bundle proteins whose conformations, once secondary structure has formed, can be defined by six rotational degrees of freedom. Two folding mechanisms are possible, one of which involves an intermediate. The stabilization of intermediate subspaces (or states in low-dimensional projection) in protein folding can either speed up or slow down the folding rate depending on the amount of native and nonnative contacts made in those subspaces. The folding rate increases due to reduced-dimension pathways arising from the mere presence of intermediate states, but decreases if the contacts in the intermediate are very stable and introduce sizeable topological or energetic frustration that needs to be overcome. Remarkably, the hypergutter framework, although depending on just a few physically meaningful parameters, can reproduce all the types of experimentally observed curvature in chevron plots for realizations of this fold. PMID:24739172
An all-atom structure-based potential for proteins: bridging minimal models with all-atom empirical forcefields.

PubMed

Whitford, Paul C; Noel, Jeffrey K; Gosavi, Shachi; Schug, Alexander; Sanbonmatsu, Kevin Y; Onuchic, José N

2009-05-01

Protein dynamics take place on many time and length scales. Coarse-grained structure-based (Go) models utilize the funneled energy landscape theory of protein folding to provide an understanding of both long time and long length scale dynamics. All-atom empirical forcefields with explicit solvent can elucidate our understanding of short time dynamics with high energetic and structural resolution. Thus, structure-based models with atomic details included can be used to bridge our understanding between these two approaches. We report on the robustness of folding mechanisms in one such all-atom model. Results for the B domain of Protein A, the SH3 domain of C-Src Kinase, and Chymotrypsin Inhibitor 2 are reported. The interplay between side chain packing and backbone folding is explored. We also compare this model to a C(alpha) structure-based model and an all-atom empirical forcefield. Key findings include: (1) backbone collapse is accompanied by partial side chain packing in a cooperative transition and residual side chain packing occurs gradually with decreasing temperature, (2) folding mechanisms are robust to variations of the energetic parameters, (3) protein folding free-energy barriers can be manipulated through parametric modifications, (4) the global folding mechanisms in a C(alpha) model and the all-atom model agree, although differences can be attributed to energetic heterogeneity in the all-atom model, and (5) proline residues have significant effects on folding mechanisms, independent of isomerization effects. Because this structure-based model has atomic resolution, this work lays the foundation for future studies to probe the contributions of specific energetic factors on protein folding and function.
An All-atom Structure-Based Potential for Proteins: Bridging Minimal Models with All-atom Empirical Forcefields

PubMed Central

Whitford, Paul C.; Noel, Jeffrey K.; Gosavi, Shachi; Schug, Alexander; Sanbonmatsu, Kevin Y.; Onuchic, José N.

2012-01-01

Protein dynamics take place on many time and length scales. Coarse-grained structure-based (Gō) models utilize the funneled energy landscape theory of protein folding to provide an understanding of both long time and long length scale dynamics. All-atom empirical forcefields with explicit solvent can elucidate our understanding of short time dynamics with high energetic and structural resolution. Thus, structure-based models with atomic details included can be used to bridge our understanding between these two approaches. We report on the robustness of folding mechanisms in one such all-atom model. Results for the B domain of Protein A, the SH3 domain of C-Src Kinase and Chymotrypsin Inhibitor 2 are reported. The interplay between side chain packing and backbone folding is explored. We also compare this model to a Cα structure-based model and an all-atom empirical forcefield. Key findings include 1) backbone collapse is accompanied by partial side chain packing in a cooperative transition and residual side chain packing occurs gradually with decreasing temperature 2) folding mechanisms are robust to variations of the energetic parameters 3) protein folding free energy barriers can be manipulated through parametric modifications 4) the global folding mechanisms in a Cα model and the all-atom model agree, although differences can be attributed to energetic heterogeneity in the all-atom model 5) proline residues have significant effects on folding mechanisms, independent of isomerization effects. Since this structure-based model has atomic resolution, this work lays the foundation for future studies to probe the contributions of specific energetic factors on protein folding and function. PMID:18837035
Regulation of protein turnover by heat shock proteins.

PubMed

Bozaykut, Perinur; Ozer, Nesrin Kartal; Karademir, Betul

2014-12-01

Protein turnover reflects the balance between synthesis and degradation of proteins, and it is a crucial process for the maintenance of the cellular protein pool. The folding of proteins, refolding of misfolded proteins, and also degradation of misfolded and damaged proteins are involved in the protein quality control (PQC) system. Correct protein folding and degradation are controlled by many different factors, one of the most important of which is the heat shock protein family. Heat shock proteins (HSPs) are in the class of molecular chaperones, which may prevent the inappropriate interaction of proteins and induce correct folding. On the other hand, these proteins play significant roles in the degradation pathways, including endoplasmic reticulum-associated degradation (ERAD), the ubiquitin-proteasome system, and autophagy. This review focuses on the emerging role of HSPs in the regulation of protein turnover; the effects of HSPs on the degradation machineries ERAD, autophagy, and proteasome; as well as the role of posttranslational modifications in the PQC system. Copyright © 2014 Elsevier Inc. All rights reserved.
Mechanistic Insight into the Reactivation of BCAII Enzyme from Denatured and Molten Globule States by Eukaryotic Ribosomes and Domain V rRNAs

PubMed Central

Chakraborty, Biprashekhar; Bhakta, Sayan; Sengupta, Jayati

2016-01-01

In all life forms, decoding of messenger-RNA into polypeptide chain is accomplished by the ribosome. Several protein chaperones are known to bind at the exit of ribosomal tunnel to ensure proper folding of the nascent chain by inhibiting their premature folding in the densely crowded environment of the cell. However, accumulating evidence suggests that ribosome may play a chaperone role in protein folding events in vitro. Ribosome-mediated folding of denatured proteins by prokaryotic ribosomes has been studied extensively. The RNA-assisted chaperone activity of the prokaryotic ribosome has been attributed to the domain V, a span of 23S rRNA at the intersubunit side of the large subunit encompassing the Peptidyl Transferase Centre. Evidently, this functional property of ribosome is unrelated to the nascent chain protein folding at the exit of the ribosomal tunnel. Here, we seek to scrutinize whether this unique function is conserved in a primitive kinetoplastid group of eukaryotic species Leishmania donovani where the ribosome structure possesses distinct additional features and appears markedly different compared to other higher eukaryotic ribosomes. Bovine Carbonic Anhydrase II (BCAII) enzyme was considered as the model protein. Our results manifest that domain V of the large subunit rRNA of Leishmania ribosomes preserves chaperone activity suggesting that ribosome-mediated protein folding is, indeed, a conserved phenomenon. Further, we aimed to investigate the mechanism underpinning the ribosome-assisted protein reactivation process. Interestingly, the surface plasmon resonance binding analyses exhibit that rRNA guides productive folding by directly interacting with molten globule-like states of the protein. In contrast, native protein shows no notable affinity to the rRNA. Thus, our study not only confirms conserved, RNA-mediated chaperoning role of ribosome but also provides crucial insight into the mechanism of the process. PMID:27099964
How Fast is Collapse of Proteins During Folding?

NASA Astrophysics Data System (ADS)

Chahine, J.; Onuchic, J. N.; Socci, N. D.

1998-03-01

Recent experiments in fast folding proteins are now starting to address the question of how fast is collapse relative to the total folding time. Using minimalist models, we are able to investigate the way in which different scenarios of folding can arise depending on the interplay between the collapse order parameter and the order parameter sensitive to specific tertiary contacts. Most of our earlier studies have focused on the limit that collapse is very fast compared to the total folding time. In this work we focus on the opposite limit, i.e., at the folding temperature, collapse and folding occurs simultaneously. The folding mechanism becomes very different in this limit. Particularly, the non-specific collapse transition, that occurs at temperatures higher than the folding temperature for the fast collapse limit, now occurs between the folding and the glass temperature. We show how this transition can be identified and its consequences for the folding kinetics.
Structural classification of small, disulfide-rich protein domains.

PubMed

Cheek, Sara; Krishna, S Sri; Grishin, Nick V

2006-05-26

Disulfide-rich domains are small protein domains whose global folds are stabilized primarily by the formation of disulfide bonds and, to a much lesser extent, by secondary structure and hydrophobic interactions. Disulfide-rich domains perform a wide variety of roles functioning as growth factors, toxins, enzyme inhibitors, hormones, pheromones, allergens, etc. These domains are commonly found both as independent (single-domain) proteins and as domains within larger polypeptides. Here, we present a comprehensive structural classification of approximately 3000 small, disulfide-rich protein domains. We find that these domains can be arranged into 41 fold groups on the basis of structural similarity. Our fold groups, which describe broader structural relationships than existing groupings of these domains, bring together representatives with previously unacknowledged similarities; 18 of the 41 fold groups include domains from several SCOP folds. Within the fold groups, the domains are assembled into families of homologs. We define 98 families of disulfide-rich domains, some of which include newly detected homologs, particularly among knottin-like domains. On the basis of this classification, we have examined cases of convergent and divergent evolution of functions performed by disulfide-rich proteins. Disulfide bonding patterns in these domains are also evaluated. Reducible disulfide bonding patterns are much less frequent, while symmetric disulfide bonding patterns are more common than expected from random considerations. Examples of variations in disulfide bonding patterns found within families and fold groups are discussed.
Folding behavior of ribosomal protein S6 studied by modified Go¯ -like model

NASA Astrophysics Data System (ADS)

Wu, L.; Zhang, J.; Wang, J.; Li, W. F.; Wang, W.

2007-03-01

Recent experimental and theoretical studies suggest that, although topology is the determinant factor in protein folding, especially for small single-domain proteins, energetic factors also play an important role in the folding process. The ribosomal protein S6 has been subjected to intensive studies. A radical change of the transition state in its circular permutants has been observed, which is believed to be caused by a biased distribution of contact energies. Since the simplistic topology-only Gō -like model is not able to reproduce such an observation, we modify the model by introducing variable contact energies between residues based on their physicochemical properties. The modified Gō -like model can successfully reproduce the Φ -value distributions, folding nucleus, and folding pathways of both the wild-type and circular permutants of S6. Furthermore, by comparing the results of the modified and the simplistic models, we find that the hydrophobic effect constructs the major force that balances the loop entropies. This may indicate that nature maintains the folding cooperativity of this protein by carefully arranging the location of hydrophobic residues in the sequence. Our study reveals a strategy or mechanism used by nature to get out of the dilemma when the native structure, possibly required by biological function, conflicts with folding cooperativity. Finally, the possible relationship between such a design of nature and amyloidosis is also discussed.
Theory of the Protein Equilibrium Population Snapshot by H/D Exchange Electrospray Ionization Mass Spectrometry (PEPS-HDX-ESI-MS) Method used to obtain Protein Folding Energies/Rates and Selected Supporting Experimental Evidence.

PubMed

Liyanage, Rohana; Devarapalli, Nagarjuna; Pyland, Derek B; Puckett, Latisha M; Phan, N H; Starch, Joel A; Okimoto, Mark R; Gidden, Jennifer; Stites, Wesley E; Lay, Jackson O

2012-12-15

Protein equilibrium snapshot by hydrogen/deuterium exchange electrospray ionization mass spectrometry (PEPS-HDX-ESI-MS or PEPS) is a method recently introduced for estimating protein folding energies and rates. Herein we describe the basis for this method using both theory and new experiments. Benchmark experiments were conducted using ubiquitin because of the availability of reference data for folding and unfolding rates from NMR studies. A second set of experiments was also conducted to illustrate the surprising resilience of the PEPS to changes in HDX time, using staphylococcal nuclease and time frames ranging from a few seconds to several minutes. Theory suggests that PEPS experiments should be conducted at relatively high denaturant concentrations, where the protein folding/unfolding rates are slow with respect to HDX and the life times of both the closed and open states are long enough to be sampled experimentally. Upon deliberate denaturation, changes in folding/unfolding are correlated with associated changes in the ESI-MS signal upon fast HDX. When experiments are done quickly, typically within a few seconds, ESI-MS signals, corresponding to the equilibrium population of the native (closed) and denatured (open) states can both be detected. The interior of folded proteins remains largely un-exchanged. Amongst MS methods, the simultaneous detection of both states in the spectrum is unique to PEPS and provides a "snapshot" of these populations. The associated ion intensities are used to estimate the protein folding equilibrium constant (or the free energy change, ΔG). Linear extrapolation method (LEM) plots of derived ΔG values for each denaturant concentration can then be used to calculate ΔG in the absence of denaturant, ΔG(H(2)O). In accordance with the requirement for detection of signals for both the folded and unfolded states, this theoretical framework predicts that PEPS experiments work best at the middle of the denaturation curve where natured and denatured protein molecules are equilibrated at easily detectable ratios, namely 1:1. It also requires that closed and open states have lifetimes measurable in the time frame of the HDX experiment. Because both conditions are met by PEPS, these measurements can provide an accurate assessment of closed/open state populations and thus protein folding energies/rates.
Neuroligin Trafficking Deficiencies Arising from Mutations in the α/β-Hydrolase Fold Protein Family*

PubMed Central

De Jaco, Antonella; Lin, Michael Z.; Dubi, Noga; Comoletti, Davide; Miller, Meghan T.; Camp, Shelley; Ellisman, Mark; Butko, Margaret T.; Tsien, Roger Y.; Taylor, Palmer

2010-01-01

Despite great functional diversity, characterization of the α/β-hydrolase fold proteins that encompass a superfamily of hydrolases, heterophilic adhesion proteins, and chaperone domains reveals a common structural motif. By incorporating the R451C mutation found in neuroligin (NLGN) and associated with autism and the thyroglobulin G2320R (G221R in NLGN) mutation responsible for congenital hypothyroidism into NLGN3, we show that mutations in the α/β-hydrolase fold domain influence folding and biosynthetic processing of neuroligin3 as determined by in vitro susceptibility to proteases, glycosylation processing, turnover, and processing rates. We also show altered interactions of the mutant proteins with chaperones in the endoplasmic reticulum and arrest of transport along the secretory pathway with diversion to the proteasome. Time-controlled expression of a fluorescently tagged neuroligin in hippocampal neurons shows that these mutations compromise neuronal trafficking of the protein, with the R451C mutation reducing and the G221R mutation virtually abolishing the export of NLGN3 from the soma to the dendritic spines. Although the R451C mutation causes a local folding defect, the G221R mutation appears responsible for more global misfolding of the protein, reflecting their sequence positions in the structure of the protein. Our results suggest that disease-related mutations in the α/β-hydrolase fold domain share common trafficking deficiencies yet lead to discrete congenital disorders of differing severity in the endocrine and nervous systems. PMID:20615874
Neuroligin trafficking deficiencies arising from mutations in the alpha/beta-hydrolase fold protein family.

PubMed

De Jaco, Antonella; Lin, Michael Z; Dubi, Noga; Comoletti, Davide; Miller, Meghan T; Camp, Shelley; Ellisman, Mark; Butko, Margaret T; Tsien, Roger Y; Taylor, Palmer

2010-09-10

Despite great functional diversity, characterization of the alpha/beta-hydrolase fold proteins that encompass a superfamily of hydrolases, heterophilic adhesion proteins, and chaperone domains reveals a common structural motif. By incorporating the R451C mutation found in neuroligin (NLGN) and associated with autism and the thyroglobulin G2320R (G221R in NLGN) mutation responsible for congenital hypothyroidism into NLGN3, we show that mutations in the alpha/beta-hydrolase fold domain influence folding and biosynthetic processing of neuroligin3 as determined by in vitro susceptibility to proteases, glycosylation processing, turnover, and processing rates. We also show altered interactions of the mutant proteins with chaperones in the endoplasmic reticulum and arrest of transport along the secretory pathway with diversion to the proteasome. Time-controlled expression of a fluorescently tagged neuroligin in hippocampal neurons shows that these mutations compromise neuronal trafficking of the protein, with the R451C mutation reducing and the G221R mutation virtually abolishing the export of NLGN3 from the soma to the dendritic spines. Although the R451C mutation causes a local folding defect, the G221R mutation appears responsible for more global misfolding of the protein, reflecting their sequence positions in the structure of the protein. Our results suggest that disease-related mutations in the alpha/beta-hydrolase fold domain share common trafficking deficiencies yet lead to discrete congenital disorders of differing severity in the endocrine and nervous systems.
RF-Phos: A Novel General Phosphorylation Site Prediction Tool Based on Random Forest.

PubMed

Ismail, Hamid D; Jones, Ahoi; Kim, Jung H; Newman, Robert H; Kc, Dukka B

2016-01-01

Protein phosphorylation is one of the most widespread regulatory mechanisms in eukaryotes. Over the past decade, phosphorylation site prediction has emerged as an important problem in the field of bioinformatics. Here, we report a new method, termed Random Forest-based Phosphosite predictor 2.0 (RF-Phos 2.0), to predict phosphorylation sites given only the primary amino acid sequence of a protein as input. RF-Phos 2.0, which uses random forest with sequence and structural features, is able to identify putative sites of phosphorylation across many protein families. In side-by-side comparisons based on 10-fold cross validation and an independent dataset, RF-Phos 2.0 compares favorably to other popular mammalian phosphosite prediction methods, such as PhosphoSVM, GPS2.1, and Musite.
Equilibrium folding of pro-HlyA from Escherichia coli reveals a stable calcium ion dependent folding intermediate.

PubMed

Thomas, Sabrina; Bakkes, Patrick J; Smits, Sander H J; Schmitt, Lutz

2014-09-01

HlyA from Escherichia coli is a member of the repeats in toxin (RTX) protein family, produced by a wide range of Gram-negative bacteria and secreted by a dedicated Type 1 Secretion System (T1SS). RTX proteins are thought to be secreted in an unfolded conformation and to fold upon secretion by Ca(2+) binding. However, the exact mechanism of secretion, ion binding and folding to the correct native state remains largely unknown. In this study we provide an easy protocol for high-level pro-HlyA purification from E. coli. Equilibrium folding studies, using intrinsic tryptophan fluorescence, revealed the well-known fact that Ca(2+) is essential for stability as well as correct folding of the whole protein. In the absence of Ca(2+), pro-HlyA adopts a non-native conformation. Such molecules could however be rescued by Ca(2+) addition, indicating that these are not dead-end species and that Ca(2+) drives pro-HlyA folding. More importantly, pro-HlyA unfolded via a two-state mechanism, whereas folding was a three-state process. The latter is indicative of the presence of a stable folding intermediate. Analysis of deletion and Trp mutants revealed that the first folding transition, at 6-7M urea, relates to Ca(2+) dependent structural changes at the extreme C-terminus of pro-HlyA, sensed exclusively by Trp914. Since all Trp residues of HlyA are located outside the RTX domain, our results demonstrate that Ca(2+) induced folding is not restricted to the RTX domain. Taken together, Ca(2+) binding to the pro-HlyA RTX domain is required to drive the folding of the entire protein to its native conformation. Copyright © 2014 Elsevier B.V. All rights reserved.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.