ERIC Educational Resources Information Center
Robic, Srebrenka
2010-01-01
To fully understand the roles proteins play in cellular processes, students need to grasp complex ideas about protein structure, folding, and stability. Our current understanding of these topics is based on mathematical models and experimental data. However, protein structure, folding, and stability are often introduced as descriptive, qualitative…
Sun, Yunxiang; Ming, Dengming
2014-01-01
Energetic frustration is becoming an important topic for understanding the mechanisms of protein folding, which is a long-standing big biological problem usually investigated by the free energy landscape theory. Despite the significant advances in probing the effects of folding frustrations on the overall features of protein folding pathways and folding intermediates, detailed characterizations of folding frustrations at an atomic or residue level are still lacking. In addition, how and to what extent folding frustrations interact with protein topology in determining folding mechanisms remains unclear. In this paper, we tried to understand energetic frustrations in the context of protein topology structures or native-contact networks by comparing the energetic frustrations of five homologous Im9 alpha-helix proteins that share very similar topology structures but have a single hydrophilic-to-hydrophobic mutual mutation. The folding simulations were performed using a coarse-grained Gō-like model, while non-native hydrophobic interactions were introduced as energetic frustrations using a Lennard-Jones potential function. Energetic frustrations were then examined at residue level based on φ-value analyses of the transition state ensemble structures and mapped back to native-contact networks. Our calculations show that energetic frustrations have highly heterogeneous influences on the folding of the four helices of the examined structures depending on the local environment of the frustration centers. Also, the closer the introduced frustration is to the center of the native-contact network, the larger the changes in the protein folding. Our findings add a new dimension to the understanding of protein folding the topology determination in that energetic frustrations works closely with native-contact networks to affect the protein folding.
Robic, Srebrenka
2010-01-01
To fully understand the roles proteins play in cellular processes, students need to grasp complex ideas about protein structure, folding, and stability. Our current understanding of these topics is based on mathematical models and experimental data. However, protein structure, folding, and stability are often introduced as descriptive, qualitative phenomena in undergraduate classes. In the process of learning about these topics, students often form incorrect ideas. For example, by learning about protein folding in the context of protein synthesis, students may come to an incorrect conclusion that once synthesized on the ribosome, a protein spends its entire cellular life time in its fully folded native confirmation. This is clearly not true; proteins are dynamic structures that undergo both local fluctuations and global unfolding events. To prevent and address such misconceptions, basic concepts of protein science can be introduced in the context of simple mathematical models and hands-on explorations of publicly available data sets. Ten common misconceptions about proteins are presented, along with suggestions for using equations, models, sequence, structure, and thermodynamic data to help students gain a deeper understanding of basic concepts relating to protein structure, folding, and stability.
Revealing the global map of protein folding space by large-scale simulations
NASA Astrophysics Data System (ADS)
Sinner, Claude; Lutz, Benjamin; Verma, Abhinav; Schug, Alexander
2015-12-01
The full characterization of protein folding is a remarkable long-standing challenge both for experiment and simulation. Working towards a complete understanding of this process, one needs to cover the full diversity of existing folds and identify the general principles driving the process. Here, we want to understand and quantify the diversity in folding routes for a large and representative set of protein topologies covering the full range from all alpha helical topologies towards beta barrels guided by the key question: Does the majority of the observed routes contribute to the folding process or only a particular route? We identified a set of two-state folders among non-homologous proteins with a sequence length of 40-120 residues. For each of these proteins, we ran native-structure based simulations both with homogeneous and heterogeneous contact potentials. For each protein, we simulated dozens of folding transitions in continuous uninterrupted simulations and constructed a large database of kinetic parameters. We investigate folding routes by tracking the formation of tertiary structure interfaces and discuss whether a single specific route exists for a topology or if all routes are equiprobable. These results permit us to characterize the complete folding space for small proteins in terms of folding barrier ΔG‡, number of routes, and the route specificity RT.
Lazim, Raudah; Mei, Ye; Zhang, Dawei
2012-03-01
Replica exchange molecular dynamics (REMD) simulation provides an efficient conformational sampling tool for the study of protein folding. In this study, we explore the mechanism directing the structure variation from α/4β-fold protein to 3α-fold protein after mutation by conducting REMD simulation on 42 replicas with temperatures ranging from 270 K to 710 K. The simulation began from a protein possessing the primary structure of GA88 but the tertiary structure of GB88, two G proteins with "high sequence identity." Albeit the large Cα-root mean square deviation (RMSD) of the folded protein (4.34 Å at 270 K and 4.75 Å at 304 K), a variation in tertiary structure was observed. Together with the analysis of secondary structure assignment, cluster analysis and principal component, it provides insights to the folding and unfolding pathway of 3α-fold protein and α/4β-fold protein respectively paving the way toward the understanding of the ongoings during conformational variation.
Protein Folding and Self-Organized Criticality
NASA Astrophysics Data System (ADS)
Bajracharya, Arun; Murray, Joelle
Proteins are known to fold into tertiary structures that determine their functionality in living organisms. However, the complex dynamics of protein folding and the way they consistently fold into the same structures is not fully understood. Self-organized criticality (SOC) has provided a framework for understanding complex systems in various systems (earthquakes, forest fires, financial markets, and epidemics) through scale invariance and the associated power law behavior. In this research, we use a simple hydrophobic-polar lattice-bound computational model to investigate self-organized criticality as a possible mechanism for generating complexity in protein folding.
On the Origin of Protein Superfamilies and Superfolds
NASA Astrophysics Data System (ADS)
Magner, Abram; Szpankowski, Wojciech; Kihara, Daisuke
2015-02-01
Distributions of protein families and folds in genomes are highly skewed, having a small number of prevalent superfamiles/superfolds and a large number of families/folds of a small size. Why are the distributions of protein families and folds skewed? Why are there only a limited number of protein families? Here, we employ an information theoretic approach to investigate the protein sequence-structure relationship that leads to the skewed distributions. We consider that protein sequences and folds constitute an information theoretic channel and computed the most efficient distribution of sequences that code all protein folds. The identified distributions of sequences and folds are found to follow a power law, consistent with those observed for proteins in nature. Importantly, the skewed distributions of sequences and folds are suggested to have different origins: the skewed distribution of sequences is due to evolutionary pressure to achieve efficient coding of necessary folds, whereas that of folds is based on the thermodynamic stability of folds. The current study provides a new information theoretic framework for proteins that could be widely applied for understanding protein sequences, structures, functions, and interactions.
Muthu Krishnan, S
2018-05-14
The receptor-associated protein (RAP) is an inhibitor of endocytic receptors that belong to the lipoprotein receptor gene family. In this study, a computational approach was tried to find the evolutionarily related fold of the RAP proteins. Through the structural and sequence-based analysis, found various protein folds that are very close to the RAP folds. Remote homolog datasets were used potentially to develop a different support vector machine (SVM) methods to recognize the homologous RAP fold. This study helps in understanding the relationship of RAP homologs folds based on the structure, function and evolutionary history. Copyright © 2018 Elsevier Ltd. All rights reserved.
Understanding disordered and unfolded proteins using single-molecule FRET and polymer theory.
Hofmann, Hagen
2016-11-17
Understanding protein folding and the functional properties of intrinsically disordered proteins (IDPs) requires detailed knowledge of the forces that act in polypeptide chains. These forces determine the dimensions and dynamics of unfolded and disordered proteins and have been suggested to impact processes such as the coupled binding and folding of IDPs, or the rate of protein folding reactions. Much of the progress in understanding the physical and chemical properties of unfolded and intrinsically disordered polypeptide chains has been made possible by the recent developments in single-molecule fluorescence techniques. However, the interpretation of the experimental results requires concepts from polymer physics in order to be understood. Here, I review some of the theories used to describe the dimensions of unfolded polypeptide chains under varying solvent conditions together with their more recent application to experimental data.
SeqRate: sequence-based protein folding type classification and rates prediction
2010-01-01
Background Protein folding rate is an important property of a protein. Predicting protein folding rate is useful for understanding protein folding process and guiding protein design. Most previous methods of predicting protein folding rate require the tertiary structure of a protein as an input. And most methods do not distinguish the different kinetic nature (two-state folding or multi-state folding) of the proteins. Here we developed a method, SeqRate, to predict both protein folding kinetic type (two-state versus multi-state) and real-value folding rate using sequence length, amino acid composition, contact order, contact number, and secondary structure information predicted from only protein sequence with support vector machines. Results We systematically studied the contributions of individual features to folding rate prediction. On a standard benchmark dataset, the accuracy of folding kinetic type classification is 80%. The Pearson correlation coefficient and the mean absolute difference between predicted and experimental folding rates (sec-1) in the base-10 logarithmic scale are 0.81 and 0.79 for two-state protein folders, and 0.80 and 0.68 for three-state protein folders. SeqRate is the first sequence-based method for protein folding type classification and its accuracy of fold rate prediction is improved over previous sequence-based methods. Its performance can be further enhanced with additional information, such as structure-based geometric contacts, as inputs. Conclusions Both the web server and software of predicting folding rate are publicly available at http://casp.rnet.missouri.edu/fold_rate/index.html. PMID:20438647
Whitford, Paul C; Noel, Jeffrey K; Gosavi, Shachi; Schug, Alexander; Sanbonmatsu, Kevin Y; Onuchic, José N
2009-05-01
Protein dynamics take place on many time and length scales. Coarse-grained structure-based (Go) models utilize the funneled energy landscape theory of protein folding to provide an understanding of both long time and long length scale dynamics. All-atom empirical forcefields with explicit solvent can elucidate our understanding of short time dynamics with high energetic and structural resolution. Thus, structure-based models with atomic details included can be used to bridge our understanding between these two approaches. We report on the robustness of folding mechanisms in one such all-atom model. Results for the B domain of Protein A, the SH3 domain of C-Src Kinase, and Chymotrypsin Inhibitor 2 are reported. The interplay between side chain packing and backbone folding is explored. We also compare this model to a C(alpha) structure-based model and an all-atom empirical forcefield. Key findings include: (1) backbone collapse is accompanied by partial side chain packing in a cooperative transition and residual side chain packing occurs gradually with decreasing temperature, (2) folding mechanisms are robust to variations of the energetic parameters, (3) protein folding free-energy barriers can be manipulated through parametric modifications, (4) the global folding mechanisms in a C(alpha) model and the all-atom model agree, although differences can be attributed to energetic heterogeneity in the all-atom model, and (5) proline residues have significant effects on folding mechanisms, independent of isomerization effects. Because this structure-based model has atomic resolution, this work lays the foundation for future studies to probe the contributions of specific energetic factors on protein folding and function.
Whitford, Paul C.; Noel, Jeffrey K.; Gosavi, Shachi; Schug, Alexander; Sanbonmatsu, Kevin Y.; Onuchic, José N.
2012-01-01
Protein dynamics take place on many time and length scales. Coarse-grained structure-based (Gō) models utilize the funneled energy landscape theory of protein folding to provide an understanding of both long time and long length scale dynamics. All-atom empirical forcefields with explicit solvent can elucidate our understanding of short time dynamics with high energetic and structural resolution. Thus, structure-based models with atomic details included can be used to bridge our understanding between these two approaches. We report on the robustness of folding mechanisms in one such all-atom model. Results for the B domain of Protein A, the SH3 domain of C-Src Kinase and Chymotrypsin Inhibitor 2 are reported. The interplay between side chain packing and backbone folding is explored. We also compare this model to a Cα structure-based model and an all-atom empirical forcefield. Key findings include 1) backbone collapse is accompanied by partial side chain packing in a cooperative transition and residual side chain packing occurs gradually with decreasing temperature 2) folding mechanisms are robust to variations of the energetic parameters 3) protein folding free energy barriers can be manipulated through parametric modifications 4) the global folding mechanisms in a Cα model and the all-atom model agree, although differences can be attributed to energetic heterogeneity in the all-atom model 5) proline residues have significant effects on folding mechanisms, independent of isomerization effects. Since this structure-based model has atomic resolution, this work lays the foundation for future studies to probe the contributions of specific energetic factors on protein folding and function. PMID:18837035
Dependence of Internal Friction on Folding Mechanism
2016-01-01
An outstanding challenge in protein folding is understanding the origin of “internal friction” in folding dynamics, experimentally identified from the dependence of folding rates on solvent viscosity. A possible origin suggested by simulation is the crossing of local torsion barriers. However, it was unclear why internal friction varied from protein to protein or for different folding barriers of the same protein. Using all-atom simulations with variable solvent viscosity, in conjunction with transition-path sampling to obtain reaction rates and analysis via Markov state models, we are able to determine the internal friction in the folding of several peptides and miniproteins. In agreement with experiment, we find that the folding events with greatest internal friction are those that mainly involve helix formation, while hairpin formation exhibits little or no evidence of friction. Via a careful analysis of folding transition paths, we show that internal friction arises when torsion angle changes are an important part of the folding mechanism near the folding free energy barrier. These results suggest an explanation for the variation of internal friction effects from protein to protein and across the energy landscape of the same protein. PMID:25721133
Dependence of internal friction on folding mechanism.
Zheng, Wenwei; De Sancho, David; Hoppe, Travis; Best, Robert B
2015-03-11
An outstanding challenge in protein folding is understanding the origin of "internal friction" in folding dynamics, experimentally identified from the dependence of folding rates on solvent viscosity. A possible origin suggested by simulation is the crossing of local torsion barriers. However, it was unclear why internal friction varied from protein to protein or for different folding barriers of the same protein. Using all-atom simulations with variable solvent viscosity, in conjunction with transition-path sampling to obtain reaction rates and analysis via Markov state models, we are able to determine the internal friction in the folding of several peptides and miniproteins. In agreement with experiment, we find that the folding events with greatest internal friction are those that mainly involve helix formation, while hairpin formation exhibits little or no evidence of friction. Via a careful analysis of folding transition paths, we show that internal friction arises when torsion angle changes are an important part of the folding mechanism near the folding free energy barrier. These results suggest an explanation for the variation of internal friction effects from protein to protein and across the energy landscape of the same protein.
Gelman, Hannah; Gruebele, Martin
2014-01-01
Fast folding proteins have been a major focus of computational and experimental study because they are accessible to both techniques: they are small and fast enough to be reasonably simulated with current computational power, but have dynamics slow enough to be observed with specially developed experimental techniques. This coupled study of fast folding proteins has provided insight into the mechanisms which allow some proteins to find their native conformation well less than 1 ms and has uncovered examples of theoretically predicted phenomena such as downhill folding. The study of fast folders also informs our understanding of even “slow” folding processes: fast folders are small, relatively simple protein domains and the principles that govern their folding also govern the folding of more complex systems. This review summarizes the major theoretical and experimental techniques used to study fast folding proteins and provides an overview of the major findings of fast folding research. Finally, we examine the themes that have emerged from studying fast folders and briefly summarize their application to protein folding in general as well as some work that is left to do. PMID:24641816
Chikenji, George; Fujitsuka, Yoshimi; Takada, Shoji
2006-02-28
Predicting protein tertiary structure by folding-like simulations is one of the most stringent tests of how much we understand the principle of protein folding. Currently, the most successful method for folding-based structure prediction is the fragment assembly (FA) method. Here, we address why the FA method is so successful and its lesson for the folding problem. To do so, using the FA method, we designed a structure prediction test of "chimera proteins." In the chimera proteins, local structural preference is specific to the target sequences, whereas nonlocal interactions are only sequence-independent compaction forces. We find that these chimera proteins can find the native folds of the intact sequences with high probability indicating dominant roles of the local interactions. We further explore roles of local structural preference by exact calculation of the HP lattice model of proteins. From these results, we suggest principles of protein folding: For small proteins, compact structures that are fully compatible with local structural preference are few, one of which is the native fold. These local biases shape up the funnel-like energy landscape.
Shaping up the protein folding funnel by local interaction: Lesson from a structure prediction study
Chikenji, George; Fujitsuka, Yoshimi; Takada, Shoji
2006-01-01
Predicting protein tertiary structure by folding-like simulations is one of the most stringent tests of how much we understand the principle of protein folding. Currently, the most successful method for folding-based structure prediction is the fragment assembly (FA) method. Here, we address why the FA method is so successful and its lesson for the folding problem. To do so, using the FA method, we designed a structure prediction test of “chimera proteins.” In the chimera proteins, local structural preference is specific to the target sequences, whereas nonlocal interactions are only sequence-independent compaction forces. We find that these chimera proteins can find the native folds of the intact sequences with high probability indicating dominant roles of the local interactions. We further explore roles of local structural preference by exact calculation of the HP lattice model of proteins. From these results, we suggest principles of protein folding: For small proteins, compact structures that are fully compatible with local structural preference are few, one of which is the native fold. These local biases shape up the funnel-like energy landscape. PMID:16488978
NASA Technical Reports Server (NTRS)
Shannon, G.; Wei, C.; Pohorille, A.
2017-01-01
Considering the range of functions proteins perform, it is surprising they fold into a relatively small set of structures or "folds" that facilitate such function. One explanation is that only a minority were fit enough to emerge from Darwinian selection during the early evolution of life. Alternatively, perhaps only a fraction of all possible folds were trialed. Understanding proto-catalyst selection will aid understanding of the origins and early evolution of life. To investigate which explanation is correct, we study a protein evolved in vitro to bind ATP by Jack Szostak (Fig. 1). This protein adopts a fold which is absent from nature. We are testing whether this fold would have possessed the capability to evolve that would have been essential to survive natural selection on early Earth. Folds that couldn't improve their fitness and evolve to perform new functions would have been replaced by rivals that could. To determine whether the fold is evolvable, we are attempting to change the function of the protein by rationally redesigning to bind GTP. Two design strategies in the region of the nucleobase have been implemented to provide hydrogen bonding partners for the ligand i) an insertion ii) a MET to ASN mutation. Redesigns are being studied computationally at Ames Research Center including free energy of binding calculations. Binding affinities of promising redesigns are to be validated by experimental collaborators at ForteBio using Super Streptavidin Biosensors. If the fold is found to be non-evolvable, this may suggest that many structures were trialed, but the majority were pruned on the basis of their evolvability. Alternatively, if the fold is demonstrated to be evolvable, it would be difficult to explain its absence from nature without considering the possibility that the fold simply wasn't sampled on early Earth. This would not only further our understanding of the origins of life on Earth but also suggest a common phe-nomenon of proto-catalyst evolution.
Structural Determinants of Sleeping Beauty Transposase Activity
Abrusán, György; Yant, Stephen R; Szilágyi, András; Marsh, Joseph A; Mátés, Lajos; Izsvák, Zsuzsanna; Barabás, Orsolya; Ivics, Zoltán
2016-01-01
Transposases are important tools in genome engineering, and there is considerable interest in engineering more efficient ones. Here, we seek to understand the factors determining their activity using the Sleeping Beauty transposase. Recent work suggests that protein coevolutionary information can be used to classify groups of physically connected, coevolving residues into elements called “sectors”, which have proven useful for understanding the folding, allosteric interactions, and enzymatic activity of proteins. Using extensive mutagenesis data, protein modeling and analysis of folding energies, we show that (i) The Sleeping Beauty transposase contains two sectors, which span across conserved domains, and are enriched in DNA-binding residues, indicating that the DNA binding and endonuclease functions of the transposase coevolve; (ii) Sector residues are highly sensitive to mutations, and most mutations of these residues strongly reduce transposition rate; (iii) Mutations with a strong effect on free energy of folding in the DDE domain of the transposase significantly reduce transposition rate. (iv) Mutations that influence DNA and protein-protein interactions generally reduce transposition rate, although most hyperactive mutants are also located on the protein surface, including residues with protein-protein interactions. This suggests that hyperactivity results from the modification of protein interactions, rather than the stabilization of protein fold. PMID:27401040
Solitons and protein folding: An In Silico experiment
NASA Astrophysics Data System (ADS)
Ilieva, N.; Dai, J.; Sieradzan, A.; Niemi, A.
2015-10-01
Protein folding [1] is the process of formation of a functional 3D structure from a random coil — the shape in which amino-acid chains leave the ribosome. Anfinsen's dogma states that the native 3D shape of a protein is completely determined by protein's amino acid sequence. Despite the progress in understanding the process rate and the success in folding prediction for some small proteins, with presently available physics-based methods it is not yet possible to reliably deduce the shape of a biologically active protein from its amino acid sequence. The protein-folding problem endures as one of the most important unresolved problems in science; it addresses the origin of life itself. Furthermore, a wrong fold is a common cause for a protein to lose its function or even endanger the living organism. Soliton solutions of a generalized discrete non-linear Schrödinger equation (GDNLSE) obtained from the energy function in terms of bond and torsion angles κ and τ provide a constructive theoretical framework for describing protein folds and folding patterns [2]. Here we study the dynamics of this process by means of molecular-dynamics simulations. The soliton manifestation is the pattern helix-loop-helix in the secondary structure of the protein, which explains the importance of understanding loop formation in helical proteins. We performed in silico experiments for unfolding one subunit of the core structure of gp41 from the HIV envelope glycoprotein (PDB ID: 1AIK [3]) by molecular-dynamics simulations with the MD package GROMACS. We analyzed 80 ns trajectories, obtained with one united-atom and two different all-atom force fields, to justify the side-chain orientation quantification scheme adopted in the studies and to eliminate force-field based artifacts. Our results are compatible with the soliton model of protein folding and provide first insight into soliton-formation dynamics.
PROTERAN: animated terrain evolution for visual analysis of patterns in protein folding trajectory.
Zhou, Ruhong; Parida, Laxmi; Kapila, Kush; Mudur, Sudhir
2007-01-01
The mechanism of protein folding remains largely a mystery in molecular biology, despite the enormous effort from many groups in the past decades. Currently, the protein folding mechanism is often characterized by calculating the free energy landscape versus various reaction coordinates such as the fraction of native contacts, the radius of gyration and so on. In this paper, we present an integrated approach towards understanding the folding process via visual analysis of patterns of these reaction coordinates. The three disparate processes (1) protein folding simulation, (2) pattern elicitation and (3) visualization of patterns, work in tandem. Thus as the protein folds, the changing landscape in the pattern space can be viewed via the visualization tool, PROTERAN, a program we developed for this purpose. We first present an incremental (on-line) trie-based pattern discovery algorithm to elicit the patterns and then describe the terrain metaphor based visualization tool. Using two example small proteins, a beta-hairpin and a designed protein Trp-cage, we next demonstrate that this combined pattern discovery and visualization approach extracts crucial information about protein folding intermediates and mechanism.
Effects of tethering a multistate folding protein to a surface
NASA Astrophysics Data System (ADS)
Wei, Shuai; Knotts, Thomas A.
2011-05-01
Protein/surface interactions are important in a variety of fields and devices, yet fundamental understanding of the relevant phenomena remains fragmented due to resolution limitations of experimental techniques. Molecular simulation has provided useful answers, but such studies have focused on proteins that fold through a two-state process. This study uses simulation to show how surfaces can affect proteins which fold through a multistate process by investigating the folding mechanism of lysozyme (PDB ID: 7LZM). The results demonstrate that in the bulk 7LZM folds through a process with four stable states: the folded state, the unfolded state, and two stable intermediates. The folding mechanism remains the same when the protein is tethered to a surface at most residues; however, in one case the folding mechanism changes in such a way as to eliminate one of the intermediates. An analysis of the molecular configurations shows that tethering at this site is advantageous for protein arrays because the active site is both presented to the bulk phase and stabilized. Taken as a whole, the results offer hope that rational design of protein arrays is possible once the behavior of the protein on the surface is ascertained.
A semi-analytical description of protein folding that incorporates detailed geometrical information
Suzuki, Yoko; Noel, Jeffrey K.; Onuchic, José N.
2011-01-01
Much has been done to study the interplay between geometric and energetic effects on the protein folding energy landscape. Numerical techniques such as molecular dynamics simulations are able to maintain a precise geometrical representation of the protein. Analytical approaches, however, often focus on the energetic aspects of folding, including geometrical information only in an average way. Here, we investigate a semi-analytical expression of folding that explicitly includes geometrical effects. We consider a Hamiltonian corresponding to a Gaussian filament with structure-based interactions. The model captures local features of protein folding often averaged over by mean-field theories, for example, loop contact formation and excluded volume. We explore the thermodynamics and folding mechanisms of beta-hairpin and alpha-helical structures as functions of temperature and Q, the fraction of native contacts formed. Excluded volume is shown to be an important component of a protein Hamiltonian, since it both dominates the cooperativity of the folding transition and alters folding mechanisms. Understanding geometrical effects in analytical formulae will help illuminate the consequences of the approximations required for the study of larger proteins. PMID:21721664
NASA Astrophysics Data System (ADS)
Iguchi, Kazumoto
We discuss the statistical mechanical foundation for the two-state transition in the protein folding of small globular proteins. In the standard arguments of protein folding, the statistical search for the ground state is carried out from astronomically many conformations in the configuration space. This leads us to the famous Levinthal's paradox. To resolve the paradox, Gō first postulated that the two-state transition - all-or-none type transition - is very crucial for the protein folding of small globular proteins and used the Gō's lattice model to show the two-state transition nature. Recently, there have been accumulated many experimental results that support the two-state transition for small globular proteins. Stimulated by such recent experiments, Zwanzig has introduced a minimal statistical mechanical model that exhibits the two-state transition. Also, Finkelstein and coworkers have discussed the solution of the paradox by considering the sequential folding of a small globular protein. On the other hand, recently Iguchi have introduced a toy model of protein folding using the Rubik's magic snake model, in which all folded structures are exactly known and mathematically represented in terms of the four types of conformations: cis-, trans-, left and right gauche-configurations between the unit polyhedrons. In this paper, we study the relationship between the Gō's two-state transition, the Zwanzig's statistical mechanics model and the Finkelsteinapos;s sequential folding model by applying them to the Rubik's magic snake models. We show that the foundation of the Gō's two-state transition model relies on the search within the equienergy surface that is labeled by the contact order of the hydrophobic condensation. This idea reproduces the Zwanzig's statistical model as a special case, realizes the Finkelstein's sequential folding model and fits together to understand the nature of the two-state transition of a small globular protein by calculating the physical quantities such as the free energy, the contact order and the specific heat. We point out the similarity between the liquid-gas transition in statistical mechanics and the two-state transition of protein folding. We also study morphology of the Rubik's magic snake models to give a prototype model for understanding the differences between α-helices proteins and β-sheets proteins.
Robustness of atomistic Gō models in predicting native-like folding intermediates
NASA Astrophysics Data System (ADS)
Estácio, S. G.; Fernandes, C. S.; Krobath, H.; Faísca, P. F. N.; Shakhnovich, E. I.
2012-08-01
Gō models are exceedingly popular tools in computer simulations of protein folding. These models are native-centric, i.e., they are directly constructed from the protein's native structure. Therefore, it is important to understand up to which extent the atomistic details of the native structure dictate the folding behavior exhibited by Gō models. Here we address this challenge by performing exhaustive discrete molecular dynamics simulations of a Gō potential combined with a full atomistic protein representation. In particular, we investigate the robustness of this particular type of Gō models in predicting the existence of intermediate states in protein folding. We focus on the N47G mutational form of the Spc-SH3 folding domain (x-ray structure) and compare its folding pathway with that of alternative native structures produced in silico. Our methodological strategy comprises equilibrium folding simulations, structural clustering, and principal component analysis.
Role of naturally occurring osmolytes in protein folding and stability.
Kumar, Raj
2009-11-01
Osmolytes are typically accumulated in the intracellular environment at relatively high concentrations when cells/tissues are subjected to stress conditions. Osmolytes are common in a variety of organisms, including microorganisms, plants, and animals. They enhance thermodynamic stability of proteins by providing natively folded conformations without perturbing other cellular processes. By burying the backbone into the core of folded proteins, osmolytes can provide significant stability to proteins. Two properties of osmolytes are particularly important: (i) their ability to impart increased thermodynamic stability to folded proteins; and (ii) their compatibility in the intracellular environment at high concentrations. Under physiological conditions, the cellular compositions of osmolytes may vary significantly. This may lead to different protein folding pathways utilized in cells depending upon the intracellular environment. Proper understanding of the role of osmolytes in cell regulation should allow predicting the action of osmolytes on macromolecular interactions in stressed and crowded environments typical of cellular conditions.
Visualizing chaperone-assisted protein folding
Horowitz, Scott; Salmon, Loïc; Koldewey, Philipp; ...
2016-05-30
We present that challenges in determining the structures of heterogeneous and dynamic protein complexes have greatly hampered past efforts to obtain a mechanistic understanding of many important biological processes. One such process is chaperone-assisted protein folding. Obtaining structural ensembles of chaperone–substrate complexes would ultimately reveal how chaperones help proteins fold into their native state. To address this problem, we devised a new structural biology approach based on X-ray crystallography, termed residual electron and anomalous density (READ). READ enabled us to visualize even sparsely populated conformations of the substrate protein immunity protein 7 (Im7) in complex with the Escherichia coli chaperonemore » Spy, and to capture a series of snapshots depicting the various folding states of Im7 bound to Spy. The ensemble shows that Spy-associated Im7 samples conformations ranging from unfolded to partially folded to native-like states and reveals how a substrate can explore its folding landscape while being bound to a chaperone.« less
Makabe, Koki; Nakamura, Takashi; Dhar, Debanjan; Ikura, Teikichi; Koide, Shohei; Kuwajima, Kunihiro
2018-04-27
Although many naturally occurring proteins consist of multiple domains, most studies on protein folding to date deal with single-domain proteins or isolated domains of multi-domain proteins. Studies of multi-domain protein folding are required for further advancing our understanding of protein folding mechanisms. Borrelia outer surface protein A (OspA) is a β-rich two-domain protein, in which two globular domains are connected by a rigid and stable single-layer β-sheet. Thus, OspA is particularly suited as a model system for studying the interplays of domains in protein folding. Here, we studied the equilibria and kinetics of the urea-induced folding-unfolding reactions of OspA probed with tryptophan fluorescence and ultraviolet circular dichroism. Global analysis of the experimental data revealed compelling lines of evidence for accumulation of an on-pathway intermediate during kinetic refolding and for the identity between the kinetic intermediate and a previously described equilibrium unfolding intermediate. The results suggest that the intermediate has the fully native structure in the N-terminal domain and the single layer β-sheet, with the C-terminal domain still unfolded. The observation of the productive on-pathway folding intermediate clearly indicates substantial interactions between the two domains mediated by the single-layer β-sheet. We propose that a rigid and stable intervening region between two domains creates an overlap between two folding units and can energetically couple their folding reactions. Copyright © 2018. Published by Elsevier Ltd.
Insights into the fold organization of TIM barrel from interaction energy based structure networks.
Vijayabaskar, M S; Vishveshwara, Saraswathi
2012-01-01
There are many well-known examples of proteins with low sequence similarity, adopting the same structural fold. This aspect of sequence-structure relationship has been extensively studied both experimentally and theoretically, however with limited success. Most of the studies consider remote homology or "sequence conservation" as the basis for their understanding. Recently "interaction energy" based network formalism (Protein Energy Networks (PENs)) was developed to understand the determinants of protein structures. In this paper we have used these PENs to investigate the common non-covalent interactions and their collective features which stabilize the TIM barrel fold. We have also developed a method of aligning PENs in order to understand the spatial conservation of interactions in the fold. We have identified key common interactions responsible for the conservation of the TIM fold, despite high sequence dissimilarity. For instance, the central beta barrel of the TIM fold is stabilized by long-range high energy electrostatic interactions and low-energy contiguous vdW interactions in certain families. The other interfaces like the helix-sheet or the helix-helix seem to be devoid of any high energy conserved interactions. Conserved interactions in the loop regions around the catalytic site of the TIM fold have also been identified, pointing out their significance in both structural and functional evolution. Based on these investigations, we have developed a novel network based phylogenetic analysis for remote homologues, which can perform better than sequence based phylogeny. Such an analysis is more meaningful from both structural and functional evolutionary perspective. We believe that the information obtained through the "interaction conservation" viewpoint and the subsequently developed method of structure network alignment, can shed new light in the fields of fold organization and de novo computational protein design.
Fitzpatrick, Anthony W.; Knowles, Tuomas P. J.; Waudby, Christopher A.; Vendruscolo, Michele; Dobson, Christopher M.
2011-01-01
Identifying the forces that drive proteins to misfold and aggregate, rather than to fold into their functional states, is fundamental to our understanding of living systems and to our ability to combat protein deposition disorders such as Alzheimer's disease and the spongiform encephalopathies. We report here the finding that the balance between hydrophobic and hydrogen bonding interactions is different for proteins in the processes of folding to their native states and misfolding to the alternative amyloid structures. We find that the minima of the protein free energy landscape for folding and misfolding tend to be respectively dominated by hydrophobic and by hydrogen bonding interactions. These results characterise the nature of the interactions that determine the competition between folding and misfolding of proteins by revealing that the stability of native proteins is primarily determined by hydrophobic interactions between side-chains, while the stability of amyloid fibrils depends more on backbone intermolecular hydrogen bonding interactions. PMID:22022239
Thermosensitivity of growth is determined by chaperone-mediated proteome reallocation
Chen, Ke; Gao, Ye; Mih, Nathan; O’Brien, Edward J.; Yang, Laurence; Palsson, Bernhard O.
2017-01-01
Maintenance of a properly folded proteome is critical for bacterial survival at notably different growth temperatures. Understanding the molecular basis of thermoadaptation has progressed in two main directions, the sequence and structural basis of protein thermostability and the mechanistic principles of protein quality control assisted by chaperones. Yet we do not fully understand how structural integrity of the entire proteome is maintained under stress and how it affects cellular fitness. To address this challenge, we reconstruct a genome-scale protein-folding network for Escherichia coli and formulate a computational model, FoldME, that provides statistical descriptions of multiscale cellular response consistent with many datasets. FoldME simulations show (i) that the chaperones act as a system when they respond to unfolding stress rather than achieving efficient folding of any single component of the proteome, (ii) how the proteome is globally balanced between chaperones for folding and the complex machinery synthesizing the proteins in response to perturbation, (iii) how this balancing determines growth rate dependence on temperature and is achieved through nonspecific regulation, and (iv) how thermal instability of the individual protein affects the overall functional state of the proteome. Overall, these results expand our view of cellular regulation, from targeted specific control mechanisms to global regulation through a web of nonspecific competing interactions that modulate the optimal reallocation of cellular resources. The methodology developed in this study enables genome-scale integration of environment-dependent protein properties and a proteome-wide study of cellular stress responses. PMID:29073085
Light-activated control of protein channel assembly mediated by membrane mechanics
NASA Astrophysics Data System (ADS)
Miller, David M.; Findlay, Heather E.; Ces, Oscar; Templer, Richard H.; Booth, Paula J.
2016-12-01
Photochemical processes provide versatile triggers of chemical reactions. Here, we use a photoactivated lipid switch to modulate the folding and assembly of a protein channel within a model biological membrane. In contrast to the information rich field of water-soluble protein folding, there is only a limited understanding of the assembly of proteins that are integral to biological membranes. It is however possible to exploit the foreboding hydrophobic lipid environment and control membrane protein folding via lipid bilayer mechanics. Mechanical properties such as lipid chain lateral pressure influence the insertion and folding of proteins in membranes, with different stages of folding having contrasting sensitivities to the bilayer properties. Studies to date have relied on altering bilayer properties through lipid compositional changes made at equilibrium, and thus can only be made before or after folding. We show that light-activation of photoisomerisable di-(5-[[4-(4-butylphenyl)azo]phenoxy]pentyl)phosphate (4-Azo-5P) lipids influences the folding and assembly of the pentameric bacterial mechanosensitive channel MscL. The use of a photochemical reaction enables the bilayer properties to be altered during folding, which is unprecedented. This mechanical manipulation during folding, allows for optimisation of different stages of the component insertion, folding and assembly steps within the same lipid system. The photochemical approach offers the potential to control channel assembly when generating synthetic devices that exploit the mechanosensitive protein as a nanovalve.
Forces Driving Chaperone Action
Koldewey, Philipp; Stull, Frederick; Horowitz, Scott; Martin, Raoul; Bardwell, James C. A.
2016-01-01
SUMMARY It is still unclear what molecular forces drive chaperone-mediated protein folding. Here, we obtain a detailed mechanistic understanding of the forces that dictate the four key steps of chaperone-client interaction: initial binding, complex stabilization, folding, and release. Contrary to the common belief that chaperones recognize unfolding intermediates by their hydrophobic nature, we discover that the model chaperone Spy uses long-range electrostatic interactions to rapidly bind to its unfolded client protein Im7. Short-range hydrophobic interactions follow, which serve to stabilize the complex. Hydrophobic collapse of the client protein then drives its folding. By burying hydrophobic residues in its core, the client’s affinity to Spy decreases, which causes client release. By allowing the client to fold itself, Spy circumvents the need for client-specific folding instructions. This mechanism might help explain how chaperones can facilitate the folding of various unrelated proteins. PMID:27293188
Minami, Shintaro; Sawada, Kengo; Chikenji, George
2014-01-01
It has been known that topologically different proteins of the same class sometimes share the same spatial arrangement of secondary structure elements (SSEs). However, the frequency by which topologically different structures share the same spatial arrangement of SSEs is unclear. It is important to estimate this frequency because it provides both a deeper understanding of the geometry of protein folds and a valuable suggestion for predicting protein structures with novel folds. Here we clarified the frequency with which protein folds share the same SSE packing arrangement with other folds, the types of spatial arrangement of SSEs that are frequently observed across different folds, and the diversity of protein folds that share the same spatial arrangement of SSEs with a given fold, using a protein structure alignment program MICAN, which we have been developing. By performing comprehensive structural comparison of SCOP fold representatives, we found that approximately 80% of protein folds share the same spatial arrangement of SSEs with other folds. We also observed that many protein pairs that share the same spatial arrangement of SSEs belong to the different classes, often with an opposing N- to C-terminal direction of the polypeptide chain. The most frequently observed spatial arrangement of SSEs was the 2-layer α/β packing arrangement and it was dispersed among as many as 27% of SCOP fold representatives. These results suggest that the same spatial arrangements of SSEs are adopted by a wide variety of different folds and that the spatial arrangement of SSEs is highly robust against the N- to C-terminal direction of the polypeptide chain. PMID:25243952
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ilieva, N., E-mail: nevena.ilieva@parallel.bas.bg; Dai, J., E-mail: daijing491@gmail.com; Sieradzan, A., E-mail: adams86@wp.pl
Protein folding [1] is the process of formation of a functional 3D structure from a random coil — the shape in which amino-acid chains leave the ribosome. Anfinsen’s dogma states that the native 3D shape of a protein is completely determined by protein’s amino acid sequence. Despite the progress in understanding the process rate and the success in folding prediction for some small proteins, with presently available physics-based methods it is not yet possible to reliably deduce the shape of a biologically active protein from its amino acid sequence. The protein-folding problem endures as one of the most important unresolvedmore » problems in science; it addresses the origin of life itself. Furthermore, a wrong fold is a common cause for a protein to lose its function or even endanger the living organism. Soliton solutions of a generalized discrete non-linear Schrödinger equation (GDNLSE) obtained from the energy function in terms of bond and torsion angles κ and τ provide a constructive theoretical framework for describing protein folds and folding patterns [2]. Here we study the dynamics of this process by means of molecular-dynamics simulations. The soliton manifestation is the pattern helix–loop–helix in the secondary structure of the protein, which explains the importance of understanding loop formation in helical proteins. We performed in silico experiments for unfolding one subunit of the core structure of gp41 from the HIV envelope glycoprotein (PDB ID: 1AIK [3]) by molecular-dynamics simulations with the MD package GROMACS. We analyzed 80 ns trajectories, obtained with one united-atom and two different all-atom force fields, to justify the side-chain orientation quantification scheme adopted in the studies and to eliminate force-field based artifacts. Our results are compatible with the soliton model of protein folding and provide first insight into soliton-formation dynamics.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Horowitz, Scott; Salmon, Loïc; Koldewey, Philipp
We present that challenges in determining the structures of heterogeneous and dynamic protein complexes have greatly hampered past efforts to obtain a mechanistic understanding of many important biological processes. One such process is chaperone-assisted protein folding. Obtaining structural ensembles of chaperone–substrate complexes would ultimately reveal how chaperones help proteins fold into their native state. To address this problem, we devised a new structural biology approach based on X-ray crystallography, termed residual electron and anomalous density (READ). READ enabled us to visualize even sparsely populated conformations of the substrate protein immunity protein 7 (Im7) in complex with the Escherichia coli chaperonemore » Spy, and to capture a series of snapshots depicting the various folding states of Im7 bound to Spy. The ensemble shows that Spy-associated Im7 samples conformations ranging from unfolded to partially folded to native-like states and reveals how a substrate can explore its folding landscape while being bound to a chaperone.« less
Gold, Nicola D; Jackson, Richard M
2006-02-03
The rapid growth in protein structural data and the emergence of structural genomics projects have increased the need for automatic structure analysis and tools for function prediction. Small molecule recognition is critical to the function of many proteins; therefore, determination of ligand binding site similarity is important for understanding ligand interactions and may allow their functional classification. Here, we present a binding sites database (SitesBase) that given a known protein-ligand binding site allows rapid retrieval of other binding sites with similar structure independent of overall sequence or fold similarity. However, each match is also annotated with sequence similarity and fold information to aid interpretation of structure and functional similarity. Similarity in ligand binding sites can indicate common binding modes and recognition of similar molecules, allowing potential inference of function for an uncharacterised protein or providing additional evidence of common function where sequence or fold similarity is already known. Alternatively, the resource can provide valuable information for detailed studies of molecular recognition including structure-based ligand design and in understanding ligand cross-reactivity. Here, we show examples of atomic similarity between superfamily or more distant fold relatives as well as between seemingly unrelated proteins. Assignment of unclassified proteins to structural superfamiles is also undertaken and in most cases substantiates assignments made using sequence similarity. Correct assignment is also possible where sequence similarity fails to find significant matches, illustrating the potential use of binding site comparisons for newly determined proteins.
Das, Payel; Matysiak, Silvina; Clementi, Cecilia
2005-01-01
Coarse-grained models have been extremely valuable in promoting our understanding of protein folding. However, the quantitative accuracy of existing simplified models is strongly hindered either from the complete removal of frustration (as in the widely used Gō-like models) or from the compromise with the minimal frustration principle and/or realistic protein geometry (as in the simple on-lattice models). We present a coarse-grained model that “naturally” incorporates sequence details and energetic frustration into an overall minimally frustrated folding landscape. The model is coupled with an optimization procedure to design the parameters of the protein Hamiltonian to fold into a desired native structure. The application to the study of src-Src homology 3 domain shows that this coarse-grained model contains the main physical-chemical ingredients that are responsible for shaping the folding landscape of this protein. The results illustrate the importance of nonnative interactions and energetic heterogeneity for a quantitative characterization of folding mechanisms. PMID:16006532
Kister, Alexander
2015-01-01
We present an alternative approach to protein 3D folding prediction based on determination of rules that specify distribution of “favorable” residues, that are mainly responsible for a given fold formation, and “unfavorable” residues, that are incompatible with that fold, in polypeptide sequences. The process of determining favorable and unfavorable residues is iterative. The starting assumptions are based on the general principles of protein structure formation as well as structural features peculiar to a protein fold under investigation. The initial assumptions are tested one-by-one for a set of all known proteins with a given structure. The assumption is accepted as a “rule of amino acid distribution” for the protein fold if it holds true for all, or near all, structures. If the assumption is not accepted as a rule, it can be modified to better fit the data and then tested again in the next step of the iterative search algorithm, or rejected. We determined the set of amino acid distribution rules for a large group of beta sandwich-like proteins characterized by a specific arrangement of strands in two beta sheets. It was shown that this set of rules is highly sensitive (~90%) and very specific (~99%) for identifying sequences of proteins with specified beta sandwich fold structure. The advantage of the proposed approach is that it does not require that query proteins have a high degree of homology to proteins with known structure. So long as the query protein satisfies residue distribution rules, it can be confidently assigned to its respective protein fold. Another advantage of our approach is that it allows for a better understanding of which residues play an essential role in protein fold formation. It may, therefore, facilitate rational protein engineering design. PMID:25625198
NASA Astrophysics Data System (ADS)
Abaskharon, Rachel M.
As ubiquitous and diverse biopolymers, proteins are dynamic molecules that are constantly engaging in inter- and intramolecular interactions responsible for their structure, fold, and function. Because of this, gaining a comprehensive understanding of the factors that control protein conformation and dynamics remains elusive as current experimental techniques often lack the ability to initiate and probe a specific interaction or conformational transition. For this reason, this thesis aims to develop methods to control and monitor protein conformations, conformational transitions, and dynamics in a site-specific manner, as well as to understand how specific and non-specific interactions affect the protein folding energy landscape. First, by using the co-solvent, trifluoroethanol (TFE), we show that the rate at which a peptide folds can be greatly impacted and thus controlled by the excluded volume effect. Secondly, we demonstrate the utility of several light-responsive molecules and reactions as methods to manipulate and investigate protein-folding processes. Using an azobenzene linker as a photo-initiator, we are able to increase the folding rate of a protein system by an order of magnitude by channeling a sub-population through a parallel, faster folding pathway. Additionally, we utilize a tryptophan-mediated electron transfer process to a nearby disulfide bond to strategically unfold a protein molecule with ultraviolet light. We also demonstrate the potential of two ruthenium polypyridyl complexes as ultrafast phototriggers of protein reactions. Finally, we develop several site-specific spectroscopic probes of protein structure and environment. Specifically, we demonstrate that a 13C-labeled aspartic acid residue constitutes a useful site-specific infrared probe for investigating salt-bridges and hydration dynamics of proteins, particularly in proteins containing several acidic amino acids. We also show that a proline-derivative, 4-oxoproline, possesses novel infrared properties that can be exploited to monitor the cis-trans isomerization process of individual proline residues in proteins.
Helix formation and stability in membranes.
McKay, Matthew J; Afrose, Fahmida; Koeppe, Roger E; Greathouse, Denise V
2018-02-13
In this article we review current understanding of basic principles for the folding of membrane proteins, focusing on the more abundant alpha-helical class. Membrane proteins, vital to many biological functions and implicated in numerous diseases, fold into their active conformations in the complex environment of the cell bilayer membrane. While many membrane proteins rely on the translocon and chaperone proteins to fold correctly, others can achieve their functional form in the absence of any translation apparatus or other aides. Nevertheless, the spontaneous folding process is not well understood at the molecular level. Recent findings suggest that helix fraying and loop formation may be important for overall structure, dynamics and regulation of function. Several types of membrane helices with ionizable amino acids change their topology with pH. Additionally we note that some peptides, including many that are rich in arginine, and a particular analogue of gramicidin, are able passively to translocate across cell membranes. The findings indicate that a final protein structure in a lipid-bilayer membrane is sequence-based, with lipids contributing to stability and regulation. While much progress has been made toward understanding the folding process for alpha-helical membrane proteins, it remains a work in progress. This article is part of a Special Issue entitled: Emergence of Complex Behavior in Biomembranes edited by Marjorie Longo. Copyright © 2018 Elsevier B.V. All rights reserved.
High Pressure ZZ-Exchange NMR Reveals Key Features of Protein Folding Transition States.
Zhang, Yi; Kitazawa, Soichiro; Peran, Ivan; Stenzoski, Natalie; McCallum, Scott A; Raleigh, Daniel P; Royer, Catherine A
2016-11-23
Understanding protein folding mechanisms and their sequence dependence requires the determination of residue-specific apparent kinetic rate constants for the folding and unfolding reactions. Conventional two-dimensional NMR, such as HSQC experiments, can provide residue-specific information for proteins. However, folding is generally too fast for such experiments. ZZ-exchange NMR spectroscopy allows determination of folding and unfolding rates on much faster time scales, yet even this regime is not fast enough for many protein folding reactions. The application of high hydrostatic pressure slows folding by orders of magnitude due to positive activation volumes for the folding reaction. We combined high pressure perturbation with ZZ-exchange spectroscopy on two autonomously folding protein domains derived from the ribosomal protein, L9. We obtained residue-specific apparent rates at 2500 bar for the N-terminal domain of L9 (NTL9), and rates at atmospheric pressure for a mutant of the C-terminal domain (CTL9) from pressure dependent ZZ-exchange measurements. Our results revealed that NTL9 folding is almost perfectly two-state, while small deviations from two-state behavior were observed for CTL9. Both domains exhibited large positive activation volumes for folding. The volumetric properties of these domains reveal that their transition states contain most of the internal solvent excluded voids that are found in the hydrophobic cores of the respective native states. These results demonstrate that by coupling it with high pressure, ZZ-exchange can be extended to investigate a large number of protein conformational transitions.
Hidden complexity of free energy surfaces for peptide (protein) folding.
Krivov, Sergei V; Karplus, Martin
2004-10-12
An understanding of the thermodynamics and kinetics of protein folding requires a knowledge of the free energy surface governing the motion of the polypeptide chain. Because of the many degrees of freedom involved, surfaces projected on only one or two progress variables are generally used in descriptions of the folding reaction. Such projections result in relatively smooth surfaces, but they could mask the complexity of the unprojected surface. Here we introduce an approach to determine the actual (unprojected) free energy surface and apply it to the second beta-hairpin of protein G, which has been used as a model system for protein folding. The surface is represented by a disconnectivity graph calculated from a long equilibrium folding-unfolding trajectory. The denatured state is found to have multiple low free energy basins. Nevertheless, the peptide shows exponential kinetics in folding to the native basin. Projected surfaces obtained from the present analysis have a simple form in agreement with other studies of the beta-hairpin. The hidden complexity found for the beta-hairpin surface suggests that the standard funnel picture of protein folding should be revisited.
Three key residues form a critical contact network in a protein folding transition state
NASA Astrophysics Data System (ADS)
Vendruscolo, Michele; Paci, Emanuele; Dobson, Christopher M.; Karplus, Martin
2001-02-01
Determining how a protein folds is a central problem in structural biology. The rate of folding of many proteins is determined by the transition state, so that a knowledge of its structure is essential for understanding the protein folding reaction. Here we use mutation measurements-which determine the role of individual residues in stabilizing the transition state-as restraints in a Monte Carlo sampling procedure to determine the ensemble of structures that make up the transition state. We apply this approach to the experimental data for the 98-residue protein acylphosphatase, and obtain a transition-state ensemble with the native-state topology and an average root-mean-square deviation of 6Å from the native structure. Although about 20 residues with small positional fluctuations form the structural core of this transition state, the native-like contact network of only three of these residues is sufficient to determine the overall fold of the protein. This result reveals how a nucleation mechanism involving a small number of key residues can lead to folding of a polypeptide chain to its unique native-state structure.
Analyzing the effect of homogeneous frustration in protein folding.
Contessoto, Vinícius G; Lima, Debora T; Oliveira, Ronaldo J; Bruni, Aline T; Chahine, Jorge; Leite, Vitor B P
2013-10-01
The energy landscape theory has been an invaluable theoretical framework in the understanding of biological processes such as protein folding, oligomerization, and functional transitions. According to the theory, the energy landscape of protein folding is funneled toward the native state, a conformational state that is consistent with the principle of minimal frustration. It has been accepted that real proteins are selected through natural evolution, satisfying the minimum frustration criterion. However, there is evidence that a low degree of frustration accelerates folding. We examined the interplay between topological and energetic protein frustration. We employed a Cα structure-based model for simulations with a controlled nonspecific energetic frustration added to the potential energy function. Thermodynamics and kinetics of a group of 19 proteins are completely characterized as a function of increasing level of energetic frustration. We observed two well-separated groups of proteins: one group where a little frustration enhances folding rates to an optimal value and another where any energetic frustration slows down folding. Protein energetic frustration regimes and their mechanisms are explained by the role of non-native contact interactions in different folding scenarios. These findings strongly correlate with the protein free-energy folding barrier and the absolute contact order parameters. These computational results are corroborated by principal component analysis and partial least square techniques. One simple theoretical model is proposed as a useful tool for experimentalists to predict the limits of improvements in real proteins. Copyright © 2013 Wiley Periodicals, Inc.
What amyloidoses may tell us about normal protein folding: The Alzheimer's disease story
NASA Astrophysics Data System (ADS)
Teplow, David B.
2002-03-01
Alzheimer's disease (AD) is a progressive, neurodegenerative disorder characterized by severe neuronal injury and death. A prominent histopathologic feature of AD is disseminated parenchymal and vascular amyloid deposition. The fibrils in these deposits are composed of the amyloid β-protein (Aβ), a peptide of 4 kDa mass. In vitro and in vivo studies of Aβ fibril formation have shown that both oligomeric and polymeric Aβ assemblies have neurotoxic activity. Understanding how these assemblies form thus could be of direct therapeutic relevance. However, the aggregation and fibril-forming propensities of Aβ have complicated structure determination. Nevertheless, careful morphologic, spectroscopic, protein chemical, and physiologic analyses of the time-dependent changes in Aβ conformation, assembly state, and biological activity which occur during fibrillogenesis have significantly advanced our understanding of this clinically important process. Here, I will discuss recent findings about the pathway(s) of Aβ folding and assembly and about key structural features of Aβ which control the associated kinetics. Interestingly, the amyloidogenic folding pathway of Aβ is in some respects the mirror image of that through which natively folded amyloidogenic proteins proceed.
Fernandez-Reche, Andres; Cobos, Eva S; Luque, Irene; Ruiz-Sanz, Javier; Martinez, Jose C
2018-01-04
In 1972 Christian B. Anfinsen received the Nobel Prize in Chemistry for "…his work on ribonuclease, especially concerning the connection between the amino acid sequence and the biologically active conformation." The understanding of this principle is crucial for physical biochemistry students, since protein folding studies, bio-computing sciences and protein design approaches are founded on such a well-demonstrated connection. Herein, we describe a detailed and easy-to-follow experiment to reproduce the most relevant assays carried out at Anfinsen's laboratory in the 60s. This experiment provides students with a platform to interpret by themselves the structural and kinetic experiments conceived to understand the protein folding problem. In addition, this three-day experiment brings students a nice opportunity for protein manipulation as well as for the setting up of spectroscopic and chromatographic techniques. © 2018 by The International Union of Biochemistry and Molecular Biology, 2018. © 2018 The International Union of Biochemistry and Molecular Biology.
Electrodynamic pressure modulation of protein stability in cosolvents.
Damodaran, Srinivasan
2013-11-19
Cosolvents affect structural stability of proteins in aqueous solutions. A clear understanding of the mechanism by which cosolvents impact protein stability is critical to understanding protein folding in a biological milieu. In this study, we investigated the Lifshitz-van der Waals dispersion interaction of seven different solutes with nine globular proteins and report that in an aqueous medium the structure-stabilizing solutes exert a positive electrodynamic pressure, whereas the structure-destabilizing solutes exert a negative electrodynamic pressure on the proteins. The net increase in the thermal denaturation temperature (ΔTd) of a protein in 1 M solution of various solutes was linearly related to the electrodynamic pressure (PvdW) between the solutes and the protein. The slope of the PvdW versus ΔTd plots was protein-dependent. However, we find a positive linear relationship (r(2) = 0.79) between the slope (i.e., d(ΔTd)/dPvdW) and the adiabatic compressibility (βs) of the proteins. Together, these results clearly indicate that the Lifshitz's dispersion forces are inextricably involved in solute-induced stabilization/destabilization of globular proteins. The positive and/or negative electrodynamic pressure generated by the solute-protein interaction across the water medium seems to be the fundamental mechanism by which solutes affect protein stability. This is at variance with the existing preferential hydration concept. The implication of these results is significant in the sense that, in addition to the hydrophobic effect that drives protein folding, the electrodynamic forces between the proteins and solutes in the biological milieu also might play a role in the folding process as well as in the stability of the folded state.
Buchner, Ginka S; Murphy, Ronan D; Buchete, Nicolae-Viorel; Kubelka, Jan
2011-08-01
The problem of spontaneous folding of amino acid chains into highly organized, biologically functional three-dimensional protein structures continues to challenge the modern science. Understanding how proteins fold requires characterization of the underlying energy landscapes as well as the dynamics of the polypeptide chains in all stages of the folding process. In recent years, important advances toward these goals have been achieved owing to the rapidly growing interdisciplinary interest and significant progress in both experimental techniques and theoretical methods. Improvements in the experimental time resolution led to determination of the timescales of the important elementary events in folding, such as formation of secondary structure and tertiary contacts. Sensitive single molecule methods made possible probing the distributions of the unfolded and folded states and following the folding reaction of individual protein molecules. Discovery of proteins that fold in microseconds opened the possibility of atomic-level theoretical simulations of folding and their direct comparisons with experimental data, as well as of direct experimental observation of the barrier-less folding transition. The ultra-fast folding also brought new questions, concerning the intrinsic limits of the folding rates and experimental signatures of barrier-less "downhill" folding. These problems will require novel approaches for even more detailed experimental investigations of the folding dynamics as well as for the analysis of the folding kinetic data. For theoretical simulations of folding, a main challenge is how to extract the relevant information from overwhelmingly detailed atomistic trajectories. New theoretical methods have been devised to allow a systematic approach towards a quantitative analysis of the kinetic network of folding-unfolding transitions between various configuration states of a protein, revealing the transition states and the associated folding pathways at multiple levels, from atomistic to coarse-grained representations. This article is part of a Special Issue entitled: Protein Dynamics: Experimental and Computational Approaches. Copyright © 2010 Elsevier B.V. All rights reserved.
Delic, Marizela; Göngrich, Rebecca; Mattanovich, Diethard; Gasser, Brigitte
2014-07-20
Recombinant protein production has developed into a huge market with enormous positive implications for human health and for the future direction of a biobased economy. Limitations in the economic and technical feasibility of production processes are often related to bottlenecks of in vivo protein folding. Based on cell biological knowledge, some major bottlenecks have been overcome by the overexpression of molecular chaperones and other folding related proteins, or by the deletion of deleterious pathways that may lead to misfolding, mistargeting, or degradation. While important success could be achieved by this strategy, the list of reported unsuccessful cases is disappointingly long and obviously dependent on the recombinant protein to be produced. Singular engineering of protein folding steps may not lead to desired results if the pathway suffers from several limitations. In particular, the connection between folding quality control and proteolytic degradation needs further attention. Based on recent understanding that multiple steps in the folding and secretion pathways limit productivity, synergistic combinations of the cell engineering approaches mentioned earlier need to be explored. In addition, systems biology-based whole cell analysis that also takes energy and redox metabolism into consideration will broaden the knowledge base for future rational engineering strategies.
Recent developments in the theory of protein folding: searching for the global energy minimum.
Scheraga, H A
1996-04-16
Statistical mechanical theories and computer simulation are being used to gain an understanding of the fundamental features of protein folding. A major obstacle in the computation of protein structures is the multiple-minima problem arising from the existence of many local minima in the multidimensional energy landscape of the protein. This problem has been surmounted for small open-chain and cyclic peptides, and for regular-repeating sequences of models of fibrous proteins. Progress is being made in resolving this problem for globular proteins.
Cooperativity and modularity in protein folding
Sasai, Masaki; Chikenji, George; Terada, Tomoki P.
2016-01-01
A simple statistical mechanical model proposed by Wako and Saitô has explained the aspects of protein folding surprisingly well. This model was systematically applied to multiple proteins by Muñoz and Eaton and has since been referred to as the Wako-Saitô-Muñoz-Eaton (WSME) model. The success of the WSME model in explaining the folding of many proteins has verified the hypothesis that the folding is dominated by native interactions, which makes the energy landscape globally biased toward native conformation. Using the WSME and other related models, Saitô emphasized the importance of the hierarchical pathway in protein folding; folding starts with the creation of contiguous segments having a native-like configuration and proceeds as growth and coalescence of these segments. The Φ-values calculated for barnase with the WSME model suggested that segments contributing to the folding nucleus are similar to the structural modules defined by the pattern of native atomic contacts. The WSME model was extended to explain folding of multi-domain proteins having a complex topology, which opened the way to comprehensively understanding the folding process of multi-domain proteins. The WSME model was also extended to describe allosteric transitions, indicating that the allosteric structural movement does not occur as a deterministic sequential change between two conformations but as a stochastic diffusive motion over the dynamically changing energy landscape. Statistical mechanical viewpoint on folding, as highlighted by the WSME model, has been renovated in the context of modern methods and ideas, and will continue to provide insights on equilibrium and dynamical features of proteins. PMID:28409080
BiP clustering facilitates protein folding in the endoplasmic reticulum.
Griesemer, Marc; Young, Carissa; Robinson, Anne S; Petzold, Linda
2014-07-01
The chaperone BiP participates in several regulatory processes within the endoplasmic reticulum (ER): translocation, protein folding, and ER-associated degradation. To facilitate protein folding, a cooperative mechanism known as entropic pulling has been proposed to demonstrate the molecular-level understanding of how multiple BiP molecules bind to nascent and unfolded proteins. Recently, experimental evidence revealed the spatial heterogeneity of BiP within the nuclear and peripheral ER of S. cerevisiae (commonly referred to as 'clusters'). Here, we developed a model to evaluate the potential advantages of accounting for multiple BiP molecules binding to peptides, while proposing that BiP's spatial heterogeneity may enhance protein folding and maturation. Scenarios were simulated to gauge the effectiveness of binding multiple chaperone molecules to peptides. Using two metrics: folding efficiency and chaperone cost, we determined that the single binding site model achieves a higher efficiency than models characterized by multiple binding sites, in the absence of cooperativity. Due to entropic pulling, however, multiple chaperones perform in concert to facilitate the resolubilization and ultimate yield of folded proteins. As a result of cooperativity, multiple binding site models used fewer BiP molecules and maintained a higher folding efficiency than the single binding site model. These insilico investigations reveal that clusters of BiP molecules bound to unfolded proteins may enhance folding efficiency through cooperative action via entropic pulling.
Topography of funneled landscapes determines the thermodynamics and kinetics of protein folding
Wang, Jin; Oliveira, Ronaldo J.; Chu, Xiakun; Whitford, Paul C.; Chahine, Jorge; Han, Wei; Wang, Erkang; Onuchic, José N.; Leite, Vitor B.P.
2012-01-01
The energy landscape approach has played a fundamental role in advancing our understanding of protein folding. Here, we quantify protein folding energy landscapes by exploring the underlying density of states. We identify three quantities essential for characterizing landscape topography: the stabilizing energy gap between the native and nonnative ensembles δE, the energetic roughness ΔE, and the scale of landscape measured by the entropy S. We show that the dimensionless ratio between the gap, roughness, and entropy of the system accurately predicts the thermodynamics, as well as the kinetics of folding. Large Λ implies that the energy gap (or landscape slope towards the native state) is dominant, leading to more funneled landscapes. We investigate the role of topological and energetic roughness for proteins of different sizes and for proteins of the same size, but with different structural topologies. The landscape topography ratio Λ is shown to be monotonically correlated with the thermodynamic stability against trapping, as characterized by the ratio of folding temperature versus trapping temperature. Furthermore, Λ also monotonically correlates with the folding kinetic rates. These results provide the quantitative bridge between the landscape topography and experimental folding measurements. PMID:23019359
Proteomic Alterations in Aqueous Humor From Patients With Primary Open Angle Glaucoma.
Sharma, Shruti; Bollinger, Kathryn E; Kodeboyina, Sai Karthik; Zhi, Wenbo; Patton, Jordan; Bai, Shan; Edwards, Blake; Ulrich, Lane; Bogorad, David; Sharma, Ashok
2018-05-01
Primary open angle glaucoma (POAG) is the most prevalent form of glaucoma, accounting for approximately 90% of all cases. The aqueous humor (AH), a biological fluid in the anterior and posterior chambers of the eye, is involved in a multitude of functions including the maintenance of IOP and ocular homeostasis. This fluid is very close to the pathologic site and is also known to have a significant role in glaucoma pathogenesis. The purpose of this study was to identify proteomic alterations in AH from patients with POAG. AH samples were extracted from 47 patients undergoing cataract surgery (controls: n = 32; POAG: n = 15). Proteomic analysis of the digested samples was accomplished by liquid-chromatography-mass spectrometry. The identified proteins were evaluated using a variety of statistical and bioinformatics methods. A total of 33 proteins were significantly altered in POAG subjects compared with the controls. The most abundant proteins in POAG subjects are IGKC (13.56-fold), ITIH4 (4.1-fold), APOC3 (3.36-fold), IDH3A (3.11-fold), LOC105369216 (2.98-fold). SERPINF2 (2.94-fold), NPC2 (2.88-fold), SUCLG2 (2.70-fold), KIAA0100 (2.29-fold), CNOT4 (2.23-fold), AQP4 (2.11-fold), COL18A1 (2.08-fold), NWD1 (2.07-fold), and TMEM120B (2.06-fold). A significant increasing trend in the odds ratios of having POAG was observed with increased levels of these proteins. Proteins identified in this study are implicated in signaling, glycosylation, immune response, molecular transport, and lipid metabolism. The identified candidate proteins may be potential biomarkers associated with POAG development and may lead to more insight in understanding the mechanisms underlying the pathogenesis of this disease.
NASA Astrophysics Data System (ADS)
Finkelstein, A. V.; Galzitskaya, O. V.
2004-04-01
Protein physics is grounded on three fundamental experimental facts: protein, this long heteropolymer, has a well defined compact three-dimensional structure; this structure can spontaneously arise from the unfolded protein chain in appropriate environment; and this structure is separated from the unfolded state of the chain by the “all-or-none” phase transition, which ensures robustness of protein structure and therefore of its action. The aim of this review is to consider modern understanding of physical principles of self-organization of protein structures and to overview such important features of this process, as finding out the unique protein structure among zillions alternatives, nucleation of the folding process and metastable folding intermediates. Towards this end we will consider the main experimental facts and simple, mostly phenomenological theoretical models. We will concentrate on relatively small (single-domain) water-soluble globular proteins (whose structure and especially folding are much better studied and understood than those of large or membrane and fibrous proteins) and consider kinetic and structural aspects of transition of initially unfolded protein chains into their final solid (“native”) 3D structures.
Transition Pathway and Its Free-Energy Profile: A Protocol for Protein Folding Simulations
Lee, In-Ho; Kim, Seung-Yeon; Lee, Jooyoung
2013-01-01
We propose a protocol that provides a systematic definition of reaction coordinate and related free-energy profile as the function of temperature for the protein-folding simulation. First, using action-derived molecular dynamics (ADMD), we investigate the dynamic folding pathway model of a protein between a fixed extended conformation and a compact conformation. We choose the pathway model to be the reaction coordinate, and the folding and unfolding processes are characterized by the ADMD step index, in contrast to the common a priori reaction coordinate as used in conventional studies. Second, we calculate free-energy profile as the function of temperature, by employing the replica-exchange molecular dynamics (REMD) method. The current method provides efficient exploration of conformational space and proper characterization of protein folding/unfolding dynamics from/to an arbitrary extended conformation. We demonstrate that combination of the two simulation methods, ADMD and REMD, provides understanding on molecular conformational changes in proteins. The protocol is tested on a small protein, penta-peptide of met-enkephalin. For the neuropeptide met-enkephalin system, folded, extended, and intermediate sates are well-defined through the free-energy profile over the reaction coordinate. Results are consistent with those in the literature. PMID:23917881
ERIC Educational Resources Information Center
Fernandez-Reche, Andres; Cobos, Eva S.; Luque, Irene; Ruiz-Sanz, Javier; Martinez, Jose C.
2018-01-01
In 1972 Christian B. Anfinsen received the Nobel Prize in Chemistry for "…his work on ribonuclease, especially concerning the connection between the amino acid sequence and the biologically active conformation." The understanding of this principle is crucial for physical biochemistry students, since protein folding studies, bio-computing…
Resolution of the unfolded state.
NASA Astrophysics Data System (ADS)
Beaucage, Gregory
2008-03-01
The unfolded states in proteins and nucleic acids remain weakly understood despite their importance to protein folding; misfolding diseases (Parkinson's & Alzheimer's); natively unfolded proteins (˜ 30% of eukaryotic proteins); and to understanding ribozymes. Research has been hindered by the inability to quantify the residual (native) structure present in an unfolded protein or nucleic acid. Here, a scaling model is proposed to quantify the degree of folding and the unfolded state (Beaucage, 2004, 2007). The model takes a global view of protein structure and can be applied to a number of analytic methods and to simulations. Three examples are given of application to small-angle scattering from pressure induced unfolding of SNase (Panick, 1998), from acid unfolded Cyt c (Kataoka, 1993) and from folding of Azoarcus ribozyme (Perez-Salas, 2004). These examples quantitatively show 3 characteristic unfolded states for proteins, the statistical nature of a folding pathway and the relationship between extent of folding and chain size during folding for charge driven folding in RNA. Beaucage, G., Biophys. J., in press (2007). Beaucage, G., Phys. Rev. E. 70, 031401 (2004). Kataoka, M., Y. Hagihara, K. Mihara, Y. Goto J. Mol. Biol. 229, 591 (1993). Panick, G., R. Malessa, R. Winter, G. Rapp, K. J. Frye, C. A. Royer J. Mol. Biol. 275, 389 (1998). Perez-Salas U. A., P. Rangan, S. Krueger, R. M. Briber, D. Thirumalai, S. A. Woodson, Biochemistry 43 1746 (2004).
Functional Tat transport of unstructured, small, hydrophilic proteins.
Richter, Silke; Lindenstrauss, Ute; Lücke, Christian; Bayliss, Richard; Brüser, Thomas
2007-11-16
The twin-arginine translocation (Tat) system is a protein translocation system that is adapted to the translocation of folded proteins across biological membranes. An understanding of the folding requirements for Tat substrates is of fundamental importance for the elucidation of the transport mechanism. We now demonstrate for the first time Tat transport for fully unstructured proteins, using signal sequence fusions to naturally unfolded FG repeats from the yeast Nsp1p nuclear pore protein. The transport of unfolded proteins becomes less efficient with increasing size, consistent with only a single interaction between the system and the substrate. Strikingly, the introduction of six residues from the hydrophobic core of a globular protein completely blocked translocation. Physiological data suggest that hydrophobic surface patches abort transport at a late stage, most likely by membrane interactions during transport. This study thus explains the observed restriction of the Tat system to folded globular proteins on a molecular level.
Ebrahimi-Fakhari, Darius; Saidi, Laiq-Jan; Wahlster, Lara
2013-12-05
Changes in protein metabolism are key to disease onset and progression in many neurodegenerative diseases. As a prime example, in Parkinson's disease, folding, post-translational modification and recycling of the synaptic protein α-synuclein are clearly altered, leading to a progressive accumulation of pathogenic protein species and the formation of intracellular inclusion bodies. Altered protein folding is one of the first steps of an increasingly understood cascade in which α-synuclein forms complex oligomers and finally distinct protein aggregates, termed Lewy bodies and Lewy neurites. In neurons, an elaborated network of chaperone and co-chaperone proteins is instrumental in mediating protein folding and re-folding. In addition to their direct influence on client proteins, chaperones interact with protein degradation pathways such as the ubiquitin-proteasome-system or autophagy in order to ensure the effective removal of irreversibly misfolded and potentially pathogenic proteins. Because of the vital role of proper protein folding for protein homeostasis, a growing number of studies have evaluated the contribution of chaperone proteins to neurodegeneration. We herein review our current understanding of the involvement of chaperones, co-chaperones and chaperone-mediated autophagy in synucleinopathies with a focus on the Hsp90 and Hsp70 chaperone system. We discuss genetic and pathological studies in Parkinson's disease as well as experimental studies in models of synucleinopathies that explore molecular chaperones and protein degradation pathways as a novel therapeutic target. To this end, we examine the capacity of chaperones to prevent or modulate neurodegeneration and summarize the current progress in models of Parkinson's disease and related neurodegenerative disorders.
Protein design to understand peptide ligand recognition by tetratricopeptide repeat proteins.
Cortajarena, Aitziber L; Kajander, Tommi; Pan, Weilan; Cocco, Melanie J; Regan, Lynne
2004-04-01
Protein design aims to understand the fundamentals of protein structure by creating novel proteins with pre-specified folds. An equally important goal is to understand protein function by creating novel proteins with pre-specified activities. Here we describe the design and characterization of a tetratricopeptide (TPR) protein, which binds to the C-terminal peptide of the eukaryotic chaperone Hsp90. The design emphasizes the importance of both direct, short-range protein-peptide interactions and of long-range electrostatic optimization. We demonstrate that the designed protein binds specifically to the desired peptide and discriminates between it and the similar C-terminal peptide of Hsp70.
NASA Astrophysics Data System (ADS)
Pappu, Rohit V.; Nussinov, Ruth
2009-03-01
In appropriate physiological milieux proteins spontaneously fold into their functional three-dimensional structures. The amino acid sequences of functional proteins contain all the information necessary to specify the folds. This remarkable observation has spawned research aimed at answering two major questions. (1) Of all the conceivable structures that a protein can adopt, why is the ensemble of native-like structures the most favorable? (2) What are the paths by which proteins manage to robustly and reproducibly fold into their native structures? Anfinsen's thermodynamic hypothesis has guided the pursuit of answers to the first question whereas Levinthal's paradox has influenced the development of models for protein folding dynamics. Decades of work have led to significant advances in the folding problem. Mean-field models have been developed to capture our current, coarse grain understanding of the driving forces for protein folding. These models are being used to predict three-dimensional protein structures from sequence and stability profiles as a function of thermodynamic and chemical perturbations. Impressive strides have also been made in the field of protein design, also known as the inverse folding problem, thereby testing our understanding of the determinants of the fold specificities of different sequences. Early work on protein folding pathways focused on the specific sequence of events that could lead to a simplification of the search process. However, unifying principles proved to be elusive. Proteins that show reversible two-state folding-unfolding transitions turned out to be a gift of natural selection. Focusing on these simple systems helped researchers to uncover general principles regarding the origins of cooperativity in protein folding thermodynamics and kinetics. On the theoretical front, concepts borrowed from polymer physics and the physics of spin glasses led to the development of a framework based on energy landscape theories. These theories predict that evolved sequences (functional proteins as opposed to random sequences) find their native folds by minimizing geometric (topological) frustration (i.e. avoiding entropic bottlenecks/kinetic traps). In some cases, following a dominant pathway is the optimal way to minimize frustration, whereas in extreme cases, proteins may fold without encountering bottlenecks. Experimental studies of two-state proteins led in turn to the development of quantitative descriptors that have allowed specific testing of theoretical predictions. These include methods such as phi value analysis to characterize transition state ensembles and descriptors that measure the effects of geometry/topology on folding rates. Interestingly, there exists a striking inverse correlation between the relative contact order (the distance in sequence space between spatially proximal contacts made in the native state) and the folding rates of several two-state proteins. The relative contact order provides a rough estimate of the net entropic cost associated with realizing the folded state, and theories have been developed to explain the observed correlation between the contact order and folding rates. Despite its maturity as a field, there are several areas that come under the rubric of protein folding that are just beginning to receive attention. For example, how do complications in vivo such as macromolecular crowding, confinement, the presence of cosolutes, membrane anchoring, and tethering to surfaces influence protein stabilities and folding dynamics? While we are accustomed to studying proteins at concentrations that are amenable to investigation via probes whose signal intensities grow with protein concentration, this does not make these readouts relevant to the in vivo setting. In cells, protein concentrations are tightly regulated and are likely to be orders of magnitude lower than what we are accustomed to using within in vitro experimental setups. Protein folding in vivo is a complex multi-scale dynamical problem when one considers the synergies between protein expression, spontaneous folding, chaperonin-assisted folding, protein targeting, the kinetics of post-translational modifications, protein degradation, and of course the drive to avoid aggregation. Further, there is growing recognition that cells not only tolerate but select for proteins that are intrinsically disordered. These proteins are essential for many crucial activities, and yet their inability to fold in isolation makes them prone to proteolytic processing and aggregation. In the series of papers that make up this special focus on protein folding in physical biology, leading researchers provide insights into diverse cross-sections of problems in protein folding. Barrick provides a concise review of what we have learned from the study of two-state folders and draws attention to how several unanswered questions are being approached using studies on large repeat proteins. Dissecting the contribution of hydration-mediated interactions to driving forces for protein folding and assembly has been extremely challenging. There is renewed interest in using hydrostatic pressure as a tool to access folding intermediates and decipher the role of partially hydrated states in folding, misfolding, and aggregation. Silva and Foguel review many of the nuances that have been uncovered by perturbing hydrostatic pressure as a thermodynamic parameter. As noted above, protein folding in vivo is expected to be considerably more complex than the folding of two-state proteins in dilute solutions. Lucent et al review the state-of-the-art in the development of quantitative theories to explain chaperonin-assisted folding in vivo. Additionally, they highlight unanswered questions pertaining to the processing of unfolded/misfolded proteins by the chaperone machinery. Zhuang et al present results that focus on the effects of surface tethering on transition state ensembles and folding mechanisms of a model two-state protein. Their results are important because several proteins in vivo fold while being anchored to membranes. Finally, several neurodegenerative and systemic diseases are associated with the aggregation of intrinsically disordered polypeptides. The search for cures in these debilitating and fatal diseases has focused attention on shared attributes in aggregation mechanisms of different proteins and the possibility of identifying druggable targets from mechanistic studies. Abedini and Raleigh review common features gleaned from mechanistic studies of the aggregation of several intrinsically disordered proteins. They propose that the population of helical intermediates and their stabilization via interactions with membranes might be an important route by which the process of aggregation leads to toxicity. The five papers that form this protein folding focus cover specific sub-topics within the larger field of protein folding. They address current questions and emphasize the importance of the growing and productive interface between the physical sciences and biology. We hope that these papers will stimulate much discussion and more importantly advances in the areas highlighted by the contributors.
Balancing Force Field Protein–Lipid Interactions To Capture Transmembrane Helix–Helix Association
2018-01-01
Atomistic simulations have recently been shown to be sufficiently accurate to reversibly fold globular proteins and have provided insights into folding mechanisms. Gaining similar understanding from simulations of membrane protein folding and association would be of great medical interest. All-atom simulations of the folding and assembly of transmembrane protein domains are much more challenging, not least due to very slow diffusion within the lipid bilayer membrane. Here, we focus on a simple and well-characterized prototype of membrane protein folding and assembly, namely the dimerization of glycophorin A, a homodimer of single transmembrane helices. We have determined the free energy landscape for association of the dimer using the CHARMM36 force field. We find that the native structure is a metastable state, but not stable as expected from experimental estimates of the dissociation constant and numerous experimental structures obtained under a variety of conditions. We explore two straightforward approaches to address this problem and demonstrate that they result in stable dimers with dissociation constants consistent with experimental data. PMID:29424543
ERIC Educational Resources Information Center
Farley, Peter C.
2013-01-01
This article describes a novel approach to teaching novice Biochemistry students visual literacy skills and understanding of some aspects of protein structure using the internet resource FoldIt and a worksheet based on selected Introductory Puzzles from this computer game. In responding to a questionnaire, students indicated that they (94%)…
NASA Astrophysics Data System (ADS)
Beedle, Amy E. M.; Lezamiz, Ainhoa; Stirnemann, Guillaume; Garcia-Manyes, Sergi
2015-08-01
Understanding the directionality and sequence of protein unfolding is crucial to elucidate the underlying folding free energy landscape. An extra layer of complexity is added in metalloproteins, where a metal cofactor participates in the correct, functional fold of the protein. However, the precise mechanisms by which organometallic interactions are dynamically broken and reformed on (un)folding are largely unknown. Here we use single molecule force spectroscopy AFM combined with protein engineering and MD simulations to study the individual unfolding pathways of the blue-copper proteins azurin and plastocyanin. Using the nanomechanical properties of the native copper centre as a structurally embedded molecular reporter, we demonstrate that both proteins unfold via two independent, competing pathways. Our results provide experimental evidence of a novel kinetic partitioning scenario whereby the protein can stochastically unfold through two distinct main transition states placed at the N and C termini that dictate the direction in which unfolding occurs.
Schuler, Benjamin; Soranno, Andrea; Hofmann, Hagen; Nettels, Daniel
2016-07-05
The properties of unfolded proteins have long been of interest because of their importance to the protein folding process. Recently, the surprising prevalence of unstructured regions or entirely disordered proteins under physiological conditions has led to the realization that such intrinsically disordered proteins can be functional even in the absence of a folded structure. However, owing to their broad conformational distributions, many of the properties of unstructured proteins are difficult to describe with the established concepts of structural biology. We have thus seen a reemergence of polymer physics as a versatile framework for understanding their structure and dynamics. An important driving force for these developments has been single-molecule spectroscopy, as it allows structural heterogeneity, intramolecular distance distributions, and dynamics to be quantified over a wide range of timescales and solution conditions. Polymer concepts provide an important basis for relating the physical properties of unstructured proteins to folding and function.
Frustration Sculpts the Early Stages of Protein Folding.
Di Silvio, Eva; Brunori, Maurizio; Gianni, Stefano
2015-09-07
The funneled energy landscape theory implies that protein structures are minimally frustrated. Yet, because of the divergent demands between folding and function, regions of frustrated patterns are present at the active site of proteins. To understand the effects of such local frustration in dictating the energy landscape of proteins, here we compare the folding mechanisms of the two alternative spliced forms of a PDZ domain (PDZ2 and PDZ2as) that share a nearly identical sequence and structure, while displaying different frustration patterns. The analysis, based on the kinetic characterization of a large number of site-directed mutants, reveals that although the late stages for folding are very robust and biased by native topology, the early stages are more malleable and dominated by local frustration. The results are briefly discussed in the context of the energy-landscape theory. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Peppytides: Interactive Models of Polypeptide Chains
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zuckermann, Ron; Chakraborty, Promita; Derisi, Joe
2014-01-21
Peppytides are scaled, 3D-printed models of polypeptide chains that can be folded into accurate protein structures. Designed and created by Berkeley Lab Researcher, Promita Chakraborty, and Berkeley Lab Senior Scientist, Dr. Ron Zuckermann, Peppytides are accurate physical models of polypeptide chains that anyone can interact with and fold intro various protein structures - proving to be a great educational tool, resulting in a deeper understanding of these fascinating structures and how they function. Build your own Peppytide model and learn about how nature's machines fold into their intricate architectures!
Peppytides: Interactive Models of Polypeptide Chains
Zuckermann, Ron; Chakraborty, Promita; Derisi, Joe
2018-06-08
Peppytides are scaled, 3D-printed models of polypeptide chains that can be folded into accurate protein structures. Designed and created by Berkeley Lab Researcher, Promita Chakraborty, and Berkeley Lab Senior Scientist, Dr. Ron Zuckermann, Peppytides are accurate physical models of polypeptide chains that anyone can interact with and fold intro various protein structures - proving to be a great educational tool, resulting in a deeper understanding of these fascinating structures and how they function. Build your own Peppytide model and learn about how nature's machines fold into their intricate architectures!
How the Sequence of a Gene Specifies Structural Symmetry in Proteins
Shen, Xiaojuan; Huang, Tongcheng; Wang, Guanyu; Li, Guanglin
2015-01-01
Internal symmetry is commonly observed in the majority of fundamental protein folds. Meanwhile, sufficient evidence suggests that nascent polypeptide chains of proteins have the potential to start the co-translational folding process and this process allows mRNA to contain additional information on protein structure. In this paper, we study the relationship between gene sequences and protein structures from the viewpoint of symmetry to explore how gene sequences code for structural symmetry in proteins. We found that, for a set of two-fold symmetric proteins from left-handed beta-helix fold, intragenic symmetry always exists in their corresponding gene sequences. Meanwhile, codon usage bias and local mRNA structure might be involved in modulating translation speed for the formation of structural symmetry: a major decrease of local codon usage bias in the middle of the codon sequence can be identified as a common feature; and major or consecutive decreases in local mRNA folding energy near the boundaries of the symmetric substructures can also be observed. The results suggest that gene duplication and fusion may be an evolutionarily conserved process for this protein fold. In addition, the usage of rare codons and the formation of higher order of secondary structure near the boundaries of symmetric substructures might have coevolved as conserved mechanisms to slow down translation elongation and to facilitate effective folding of symmetric substructures. These findings provide valuable insights into our understanding of the mechanisms of translation and its evolution, as well as the design of proteins via symmetric modules. PMID:26641668
Probing the Folding-Unfolding Transition of a Thermophilic Protein, MTH1880
Jung, Youngjin; Han, Jeongmin; Yun, Ji-Hye; Chang, Iksoo; Lee, Weontae
2016-01-01
The folding mechanism of typical proteins has been studied widely, while our understanding of the origin of the high stability of thermophilic proteins is still elusive. Of particular interest is how an atypical thermophilic protein with a novel fold maintains its structure and stability under extreme conditions. Folding-unfolding transitions of MTH1880, a thermophilic protein from Methanobacterium thermoautotrophicum, induced by heat, urea, and GdnHCl, were investigated using spectroscopic techniques including circular dichorism, fluorescence, NMR combined with molecular dynamics (MD) simulations. Our results suggest that MTH1880 undergoes a two-state N to D transition and it is extremely stable against temperature and denaturants. The reversibility of refolding was confirmed by spectroscopic methods and size exclusion chromatography. We found that the hyper-stability of the thermophilic MTH1880 protein originates from an extensive network of both electrostatic and hydrophobic interactions coordinated by the central β-sheet. Spectroscopic measurements, in combination with computational simulations, have helped to clarify the thermodynamic and structural basis for hyper-stability of the novel thermophilic protein MTH1880. PMID:26766214
Folding propensity of intrinsically disordered proteins by osmotic stress
Mansouri, Amanda L.; Grese, Laura N.; Rowe, Erica L.; ...
2016-10-11
Proteins imparted with intrinsic disorder conduct a range of essential cellular functions. To better understand the folding and hydration properties of intrinsically disordered proteins (IDPs), we used osmotic stress to induce conformational changes in nuclear co-activator binding domain (NCBD) and activator for thyroid hormone and retinoid receptor (ACTR). Osmotic stress was applied by the addition of small and polymeric osmolytes, where we discovered that water contributions to NCBD folding always exceeded those for ACTR. Both NCBD and ACTR were found to gain a-helical structure with increasing osmotic stress, consistent with their folding upon NCBD/ACTR complex formation. Using small-angle neutron scatteringmore » (SANS), we further characterized NCBD structural changes with the osmolyte ethylene glycol. Here a large reduction in overall size initially occurred before substantial secondary structural change. In conclusion, by focusing on folding propensity, and linked hydration changes, we uncover new insights that may be important for how IDP folding contributes to binding.« less
Folding propensity of intrinsically disordered proteins by osmotic stress
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mansouri, Amanda L.; Grese, Laura N.; Rowe, Erica L.
Proteins imparted with intrinsic disorder conduct a range of essential cellular functions. To better understand the folding and hydration properties of intrinsically disordered proteins (IDPs), we used osmotic stress to induce conformational changes in nuclear co-activator binding domain (NCBD) and activator for thyroid hormone and retinoid receptor (ACTR). Osmotic stress was applied by the addition of small and polymeric osmolytes, where we discovered that water contributions to NCBD folding always exceeded those for ACTR. Both NCBD and ACTR were found to gain a-helical structure with increasing osmotic stress, consistent with their folding upon NCBD/ACTR complex formation. Using small-angle neutron scatteringmore » (SANS), we further characterized NCBD structural changes with the osmolyte ethylene glycol. Here a large reduction in overall size initially occurred before substantial secondary structural change. In conclusion, by focusing on folding propensity, and linked hydration changes, we uncover new insights that may be important for how IDP folding contributes to binding.« less
Experimental investigation of protein folding and misfolding.
Dobson, Christopher M
2004-09-01
Newly synthesised proteins need to fold, often to intricate and close-packed structures, in order to function. The underlying mechanism by which this complex process takes place both in vitro and in vivo is now becoming understood, at least in general terms, as a result of the application of a wide range of biophysical and computational methods used in combination with the techniques of biochemistry and protein engineering. It is increasingly apparent, however, that folding is not only crucial for generating biological activity, but that it is also coupled to a wide range of processes within the cell, ranging from the trafficking of proteins to specific organelles to the regulation of cell growth and differentiation. Not surprisingly, therefore, the failure of proteins to fold appropriately, or to remain correctly folded, is associated with a large number of cellular malfunctions that give rise to disease. Misfolding, and its consequences such as aggregation, can be investigated by extending the types of techniques used to study the normal folding process. Application of these techniques is enabling the development of a unified description of the interconversion and regulation of the different conformational states available to proteins in living systems. Such a description proves a generic basis for understanding the fundamental links between protein misfolding and its associated clinical disorders, such as Alzheimer's disease and Type II diabetes, and for exploring novel therapeutic strategies directed at their prevention and treatment on a rational basis.
Melnik, T N; Nagibina, G S; Surin, A K; Glukhova, K A; Melnik, B S
2018-01-01
Studying the effect of cysteine bridges on different energy levels of multistage folding proteins will enable a better understanding of the process of folding and functioning of globular proteins. In particular, it will create prospects for directed change in the stability and rate of protein folding. In this work, using the method of differential scanning microcalorimetry, we have studied the effect of three cysteine bridges introduced in different structural elements of the green fluorescent protein on the denaturation enthalpies, activation energies, and heat-capacity increments when this protein passes from native to intermediate and transition states. The studies have allowed us to confirm that, with this protein denaturation, the process hardly damages the structure initially, but then changes occur in the protein structure in the region of 4-6 beta sheets. The cysteine bridge introduced in this region decreases the hydration of the second transition state and increases the hydration of the second intermediate state during the thermal denaturation of the green fluorescent protein.
DOE Office of Scientific and Technical Information (OSTI.GOV)
frydman, judith
Archaea offer a potentially cost effective and renewable source of energy. The methanogen M. maripaludis, a fast growing archaea that obtains energy by sequestering H2 and reducing CO2 to methane by the methanogenic pathway, is an attractive source for biofuel production. More recently, it has also been suggested that the methanogenesis pathway could be run in reverse, to produce H2 growing the organism in formate. A multi-level understanding of archaeal protein homeostasis, should be instrumental for improving the functionality and design of the enzyme pathways and complexes involved in energy production and storage. One additional importance consequence of a bettermore » understanding of archaeal protein homeostasis will be to increase their stress resistance, since their utilization for the efficient large-scale production of methane (and eventually also of H2) requires that the organisms are resistance to a range of growth conditions. This proposal was focused on understanding how archaea achieve protein folding and assembly and maintain protein homeostasis, which are essential for function and viability. We hypothesize that the homo-oligomeric ring shaped chaperonin from M. maripaludis, Mm-Cpn, is central to achaeal protein homeostasis and assists folding of a wide spectrum of metabolic, structural and regulatory archaeal proteins. Through a combination of biochemistry, systems biology, computational and structural biology, we have been testing this hypothesis through two complementary efforts: (i) identify the archaeal substrate repertoire of Mm-Cpn, and (ii) define mechanistic and structural principles of Mm-Cpn mediated protein folding.« less
Kozic, Mara; Fox, Stephen J; Thomas, Jens M; Verma, Chandra S; Rigden, Daniel J
2018-05-01
Antimicrobial resistance within a wide range of infectious agents is a severe and growing public health threat. Antimicrobial peptides (AMPs) are among the leading alternatives to current antibiotics, exhibiting broad spectrum activity. Their activity is determined by numerous properties such as cationic charge, amphipathicity, size, and amino acid composition. Currently, only around 10% of known AMP sequences have experimentally solved structures. To improve our understanding of the AMP structural universe we have carried out large scale ab initio 3D modeling of structurally uncharacterized AMPs that revealed similarities between predicted folds of the modeled sequences and structures of characterized AMPs. Two of the peptides whose models matched known folds are Lebocin Peptide 1A (LP1A) and Odorranain M, predicted to form β-hairpins but, interestingly, to lack the intramolecular disulfide bonds, cation-π or aromatic interactions that generally stabilize such AMP structures. Other examples include Ponericin Q42, Latarcin 4a, Kassinatuerin 1, Ceratotoxin D, and CPF-B1 peptide, which have α-helical folds, as well as mixed αβ folds of human Histatin 2 peptide and Garvicin A which are, to the best of our knowledge, the first linear αββ fold AMPs lacking intramolecular disulfide bonds. In addition to fold matches to experimentally derived structures, unique folds were also obtained, namely for Microcin M and Ipomicin. These results help in understanding the range of protein scaffolds that naturally bear antimicrobial activity and may facilitate protein design efforts towards better AMPs. © 2018 The Authors Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc.
Structural anatomy of telomere OB proteins.
Horvath, Martin P
2011-10-01
Telomere DNA-binding proteins protect the ends of chromosomes in eukaryotes. A subset of these proteins are constructed with one or more OB folds and bind with G+T-rich single-stranded DNA found at the extreme termini. The resulting DNA-OB protein complex interacts with other telomere components to coordinate critical telomere functions of DNA protection and DNA synthesis. While the first crystal and NMR structures readily explained protection of telomere ends, the picture of how single-stranded DNA becomes available to serve as primer and template for synthesis of new telomere DNA is only recently coming into focus. New structures of telomere OB fold proteins alongside insights from genetic and biochemical experiments have made significant contributions towards understanding how protein-binding OB proteins collaborate with DNA-binding OB proteins to recruit telomerase and DNA polymerase for telomere homeostasis. This review surveys telomere OB protein structures alongside highly comparable structures derived from replication protein A (RPA) components, with the goal of providing a molecular context for understanding telomere OB protein evolution and mechanism of action in protection and synthesis of telomere DNA.
Structural anatomy of telomere OB proteins
Horvath, Martin P.
2015-01-01
Telomere DNA-binding proteins protect the ends of chromosomes in eukaryotes. A subset of these proteins are constructed with one or more OB folds and bind with G+T-rich single-stranded DNA found at the extreme termini. The resulting DNA-OB protein complex interacts with other telomere components to coordinate critical telomere functions of DNA protection and DNA synthesis. While the first crystal and NMR structures readily explained protection of telomere ends, the picture of how single-stranded DNA becomes available to serve as primer and template for synthesis of new telomere DNA is only recently coming into focus. New structures of telomere OB fold proteins alongside insights from genetic and biochemical experiments have made significant contributions towards understanding how protein-binding OB proteins collaborate with DNA-binding OB proteins to recruit telomerase and DNA polymerase for telomere homeostasis. This review surveys telomere OB protein structures alongside highly comparable structures derived from replication protein A (RPA) components, with the goal of providing a molecular context for understanding telomere OB protein evolution and mechanism of action in protection and synthesis of telomere DNA. PMID:21950380
My 65 years in protein chemistry.
Scheraga, Harold A
2015-05-01
This is a tour of a physical chemist through 65 years of protein chemistry from the time when emphasis was placed on the determination of the size and shape of the protein molecule as a colloidal particle, with an early breakthrough by James Sumner, followed by Linus Pauling and Fred Sanger, that a protein was a real molecule, albeit a macromolecule. It deals with the recognition of the nature and importance of hydrogen bonds and hydrophobic interactions in determining the structure, properties, and biological function of proteins until the present acquisition of an understanding of the structure, thermodynamics, and folding pathways from a linear array of amino acids to a biological entity. Along the way, with a combination of experiment and theoretical interpretation, a mechanism was elucidated for the thrombin-induced conversion of fibrinogen to a fibrin blood clot and for the oxidative-folding pathways of ribonuclease A. Before the atomic structure of a protein molecule was determined by x-ray diffraction or nuclear magnetic resonance spectroscopy, experimental studies of the fundamental interactions underlying protein structure led to several distance constraints which motivated the theoretical approach to determine protein structure, and culminated in the Empirical Conformational Energy Program for Peptides (ECEPP), an all-atom force field, with which the structures of fibrous collagen-like proteins and the 46-residue globular staphylococcal protein A were determined. To undertake the study of larger globular proteins, a physics-based coarse-grained UNited-RESidue (UNRES) force field was developed, and applied to the protein-folding problem in terms of structure, thermodynamics, dynamics, and folding pathways. Initially, single-chain and, ultimately, multiple-chain proteins were examined, and the methodology was extended to protein-protein interactions and to nucleic acids and to protein-nucleic acid interactions. The ultimate results led to an understanding of a variety of biological processes underlying natural and disease phenomena.
Connecting Protein Structure to Intermolecular Interactions: A Computer Modeling Laboratory
ERIC Educational Resources Information Center
Abualia, Mohammed; Schroeder, Lianne; Garcia, Megan; Daubenmire, Patrick L.; Wink, Donald J.; Clark, Ginevra A.
2016-01-01
An understanding of protein folding relies on a solid foundation of a number of critical chemical concepts, such as molecular structure, intra-/intermolecular interactions, and relating structure to function. Recent reports show that students struggle on all levels to achieve these understandings and use them in meaningful ways. Further, several…
A rate distortion approach to protein symmetry.
Wallace, Rodrick
2010-08-01
A spontaneous symmetry breaking argument is applied to the problem of protein folding, via a rate distortion analysis of the relation between genome coding and the final condensation of the protein molten globule that is, in spirit, analogous to Tlusty's (2007) exploration of the evolution of the genetic code. In the 'energy' picture, the average distortion between codon message and final protein structure, under constraints driven by evolutionary selection, serves as a temperature analog, so that low values limit the possible distribution of protein forms, producing the canonical folding funnel. A dual 'developmental' perspective sees the rate distortion function itself as the temperature analog, and permits incorporation of chaperons or toxic exposures as catalysts, driving the system to different possible outcomes or affecting the rate of convergence. The rate distortion function appears constrained by the availability of metabolic free energy, with implications for prebiotic evolution, and a nonequilibrium empirical Onsager treatment provides an adaptable statistical model that can be fitted to data, in the same manner as a regression equation. In sum, mechanistic models of protein folding fail to account for the observed spectrum of protein folding and aggregation disorders, suggesting that a biologically based cognitive paradigm describing folding will be needed for understanding the etiology, prevention, and treatment of these diseases. The developmental formalism introduced here may contribute substantially to such a paradigm.
Hughes, Alexandria; Oxford, Alexandra E; Tawara, Ken; Jorcyk, Cheryl L; Oxford, Julia Thom
2017-03-20
Chondrocytes of the growth plate undergo apoptosis during the process of endochondral ossification, as well as during the progression of osteoarthritis. Although the regulation of this process is not completely understood, alterations in the precisely orchestrated programmed cell death during development can have catastrophic results, as exemplified by several chondrodystrophies which are frequently accompanied by early onset osteoarthritis. Understanding the mechanisms that underlie chondrocyte apoptosis during endochondral ossification in the growth plate has the potential to impact the development of therapeutic applications for chondrodystrophies and associated early onset osteoarthritis. In recent years, several chondrodysplasias and collagenopathies have been recognized as protein-folding diseases that lead to endoplasmic reticulum stress, endoplasmic reticulum associated degradation, and the unfolded protein response. Under conditions of prolonged endoplasmic reticulum stress in which the protein folding load outweighs the folding capacity of the endoplasmic reticulum, cellular dysfunction and death often occur. However, unfolded protein response (UPR) signaling is also required for the normal maturation of chondrocytes and osteoblasts. Understanding how UPR signaling may contribute to cartilage pathophysiology is an essential step toward therapeutic modulation of skeletal disorders that lead to osteoarthritis.
Selvaraj, S.; Gromiha, M. Michael
2003-01-01
Analysis on the three dimensional structures of (α/β)8 barrel proteins provides ample light to understand the factors that are responsible for directing and maintaining their common fold. In this work, the hydrophobically enriched clusters are identified in 92% of the considered (α/β)8 barrel proteins. The residue segments with hydrophobic clusters have high thermal stability. Further, these clusters are formed and stabilized through long-range interactions. Specifically, a network of long-range contacts connects adjacent β-strands of the (α/β)8 barrel domain and the hydrophobic clusters. The implications of hydrophobic clusters and long-range networks in providing a feasible common mechanism for the folding of (α/β)8 barrel proteins are proposed. PMID:12609894
The Safety Dance: Biophysics of Membrane Protein Folding and Misfolding in a Cellular Context
Schlebach, Jonathan P.; Sanders, Charles R.
2015-01-01
Most biological processes require the production and degradation of proteins, a task that weighs heavily on the cell. Mutations that compromise the conformational stability of proteins place both specific and general burdens on cellular protein homeostasis (proteostasis) in ways that contribute to numerous diseases. Efforts to elucidate the chain of molecular events responsible for diseases of protein folding address one of the foremost challenges in biomedical science. However, relatively little is known about the processes by which mutations prompt the misfolding of α-helical membrane proteins, which rely on an intricate network of cellular machinery to acquire and maintain their functional structures within cellular membranes. In this review, we summarize the current understanding of the physical principles that guide membrane protein biogenesis and folding in the context of mammalian cells. Additionally, we explore how pathogenic mutations that influence biogenesis may differ from those that disrupt folding and assembly, as well as how this may relate to disease mechanisms and therapeutic intervention. These perspectives indicate an imperative for the use of information from structural, cellular, and biochemical studies of membrane proteins in the design of novel therapeutics and in personalized medicine. PMID:25420508
NASA Astrophysics Data System (ADS)
Demaine, Erik
2012-02-01
Our understanding of the mathematics and algorithms behind paper folding, and geometric folding in general, has increased dramatically over the past several years. These developments have found a surprisingly broad range of applications. In the art of origami, it has helped spur the technical origami revolution. In engineering and science, it has helped solve problems in areas such as manufacturing, robotics, graphics, and protein folding. On the recreational side, it has led to new kinds of folding puzzles and magic. I will give an overview of the mathematics and algorithms of folding, with a focus on new mathematics and sculpture.
My 65 years in protein chemistry
Scheraga, Harold A.
2015-01-01
This is a tour of a physical chemist through 65 years of protein chemistry from the time when emphasis was placed on the determination of the size and shape of the protein molecule as a colloidal particle, with an early breakthrough by James Sumner, followed by Linus Pauling and Fred Sanger, that a protein was a real molecule, albeit a macromolecule. It deals with the recognition of the nature and importance of hydrogen bonds and hydrophobic interactions in determining the structure, properties, and biological function of proteins until the present acquisition of an understanding of the structure, thermodynamics, and folding pathways from a linear array of amino acids to a biological entity. Along the way, with a combination of experiment and theoretical interpretation, a mechanism was elucidated for the thrombin-induced conversion of fibrinogen to a fibrin blood clot and for the oxidative-folding pathways of ribonuclease A. Before the atomic structure of a protein molecule was determined by x-ray diffraction or nuclear magnetic resonance spectroscopy, experimental studies of the fundamental interactions underlying protein structure led to several distance constraints which motivated the theoretical approach to determine protein structure, and culminated in the Empirical Conformational Energy Program for Peptides (ECEPP), an all-atom force field, with which the structures of fibrous collagen-like proteins and the 46-residue globular staphylococcal protein A were determined. To undertake the study of larger globular proteins, a physics-based coarse-grained UNited-RESidue (UNRES) force field was developed, and applied to the protein-folding problem in terms of structure, thermodynamics, dynamics, and folding pathways. Initially, single-chain and, ultimately, multiple-chain proteins were examined, and the methodology was extended to protein–protein interactions and to nucleic acids and to protein–nucleic acid interactions. The ultimate results led to an understanding of a variety of biological processes underlying natural and disease phenomena. PMID:25850343
Mathematical methods for protein science
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hart, W.; Istrail, S.; Atkins, J.
1997-12-31
Understanding the structure and function of proteins is a fundamental endeavor in molecular biology. Currently, over 100,000 protein sequences have been determined by experimental methods. The three dimensional structure of the protein determines its function, but there are currently less than 4,000 structures known to atomic resolution. Accordingly, techniques to predict protein structure from sequence have an important role in aiding the understanding of the Genome and the effects of mutations in genetic disease. The authors describe current efforts at Sandia to better understand the structure of proteins through rigorous mathematical analyses of simple lattice models. The efforts have focusedmore » on two aspects of protein science: mathematical structure prediction, and inverse protein folding.« less
Llanes, Antonio; Muñoz, Andrés; Bueno-Crespo, Andrés; García-Valverde, Teresa; Sánchez, Antonia; Arcas-Túnez, Francisco; Pérez-Sánchez, Horacio; Cecilia, José M
2016-01-01
The protein-folding problem has been extensively studied during the last fifty years. The understanding of the dynamics of global shape of a protein and the influence on its biological function can help us to discover new and more effective drugs to deal with diseases of pharmacological relevance. Different computational approaches have been developed by different researchers in order to foresee the threedimensional arrangement of atoms of proteins from their sequences. However, the computational complexity of this problem makes mandatory the search for new models, novel algorithmic strategies and hardware platforms that provide solutions in a reasonable time frame. We present in this revision work the past and last tendencies regarding protein folding simulations from both perspectives; hardware and software. Of particular interest to us are both the use of inexact solutions to this computationally hard problem as well as which hardware platforms have been used for running this kind of Soft Computing techniques.
NASA Astrophysics Data System (ADS)
Wang, Liang-Wei; Liu, Yu-Nan; Lyu, Ping-Chiang; Jackson, Sophie E.; Hsu, Shang-Te Danny
2015-09-01
Understanding the mechanism by which a polypeptide chain thread itself spontaneously to attain a knotted conformation has been a major challenge in the field of protein folding. HP0242 is a homodimeric protein from Helicobacter pylori with intertwined helices to form a unique pseudo-knotted folding topology. A tandem HP0242 repeat has been constructed to become the first engineered trefoil-knotted protein. Its small size renders it a model system for computational analyses to examine its folding and knotting pathways. Here we report a multi-parametric study on the folding stability and kinetics of a library of HP0242 variants, including the trefoil-knotted tandem HP0242 repeat, using far-UV circular dichroism and fluorescence spectroscopy. Equilibrium chemical denaturation of HP0242 variants shows the presence of highly populated dimeric and structurally heterogeneous folding intermediates. Such equilibrium folding intermediates retain significant amount of helical structures except those at the N- and C-terminal regions in the native structure. Stopped-flow fluorescence measurements of HP0242 variants show that spontaneous refolding into knotted structures can be achieved within seconds, which is several orders of magnitude faster than previously observed for other knotted proteins. Nevertheless, the complex chevron plots indicate that HP0242 variants are prone to misfold into kinetic traps, leading to severely rolled-over refolding arms. The experimental observations are in general agreement with the previously reported molecular dynamics simulations. Based on our results, kinetic folding pathways are proposed to qualitatively describe the complex folding processes of HP0242 variants.
Zhou, Tao; Dang, Ying; Zheng, Yong-Hui
2014-03-01
The HIV-1 Env glycoprotein is folded in the endoplasmic reticulum (ER), which is necessary for viral entry and replication. Currently, it is still unclear how this process is regulated. The glycoprotein folding in the ER is controlled by the ER-associated protein degradation (ERAD) pathway, which specifically targets misfolded proteins for degradation. Previously, we reported that HIV-1 replication is restricted in the human CD4(+) T cell line CEM.NKR (NKR). To understand this mechanism, we first analyzed cellular protein expression in NKR cells and discovered that levels of the mitochondrial translocator protein TSPO were upregulated by ∼64-fold. Notably, when NKR cells were treated with TSPO antagonist PK-11195, Ro5-4864, or diazepam, HIV restriction was completely disrupted, and TSPO knockdown by short hairpin RNAs (shRNAs) achieved a similar effect. We next analyzed viral protein expression, and, interestingly, we discovered that Env expression was specifically inhibited. Both TSPO knockdown and treatment with TSPO antagonist could restore Env expression in NKR cells. We further discovered that Env proteins were rapidly degraded and that kifunensine, an ERAD pathway inhibitor, could restore Env expression and viral replication, indicating that Env proteins were misfolded and degraded through the ERAD pathway in NKR cells. We also knocked out the TSPO gene in 293T cells using CRISPR/Cas9 (clustered, regularly interspaced, short palindromic repeat [CRISPR]/CRISPR-associated-9) technology and found that TSPO could similarly inhibit Env expression in these cells. Taken together, these results demonstrate that TSPO inhibits Env protein expression through the ERAD pathway and suggest that mitochondria play an important role in regulating the Env folding process. The HIV-1 Env glycoprotein is absolutely required for viral infection, and an understanding of its expression pathway in infected cells will identify new targets for antiretroviral therapies. Env proteins are folded in the ER and secreted through the classical secretory pathway. The Env folding process involves extensive cross-linking of 10 Cys residues by disulfide bond formation and heavy N-glycosylation on ∼30 Asn residues. Currently, it is still unclear how this process is regulated. Here, we studied this mechanism in the HIV nonpermissive human CD4(+) T cell line CEM.NKR. We found that Env proteins were rapidly degraded through a cellular pathway that specifically targets misfolded proteins, resulting in inhibition of Env expression. Importantly, we have identified a mitochondrial translocator protein, TSPO, which could trigger this degradation by interfering with the Env folding process. Further characterization of TSPO antiviral activity will reveal a novel antiretroviral mechanism that targets the Env protein.
DOE Office of Scientific and Technical Information (OSTI.GOV)
FLANAGAN,J.M.; BEWLEY,M.C.
It is generally accepted that the information necessary to specify the native, functional, three-dimensional structure of a protein is encoded entirely within its amino acid sequence; however, efficient reversible folding and unfolding is observed only with a subset of small single-domain proteins. Refolding experiments often lead to the formation of kinetically-trapped, misfolded species that aggregate, even in dilute solution. In the cellular environment, the barriers to efficient protein folding and maintenance of native structure are even larger due to the nature of this process. First, nascent polypeptides must fold in an extremely crowded environment where the concentration of macromolecules approachesmore » 300-400 mg/mL and on average, each ribosome is within its own diameter of another ribosome (1-3). These conditions of severe molecular crowding, coupled with high concentrations of nascent polypeptide chains, favor nonspecific aggregation over productive folding (3). Second, folding of newly-translated polypeptides occurs in the context of their vehtorial synthesis process. Amino acids are added to a growing nascent chain at the rate of -5 residues per set, which means that for a 300 residue protein its N-terminus will be exposed to the cytosol {approx}1 min before its C-terminus and be free to begin the folding process. However, because protein folding is highly cooperative, the nascent polypeptide cannot reach its native state until a complete folding domain (50-250 residues) has emerged from the ribosome. Thus, for a single-domain protein, the final steps in folding are only completed post-translationally since {approx}40 residues of a nascent chain are sequestered within the exit channel of the ribosome and are not available for folding (4). A direct consequence of this limitation in cellular folding is that during translation incomplete domains will exist in partially-folded states that tend to expose hydrophobic residues that are prone to aggregation and/or misfolding. Thus it is not surprising that, in cells, the protein folding process is error prone and organisms have evolved ''editing'' or quality control (QC) systems to assist in the folding, maintenance and, when necessary, selective removal of damaged proteins. In fact, there is growing evidence that failure of these QC-systems contributes to a number of disease states (5-8). This chapter describes our current understanding of the nature and mechanisms of the protein quality control systems in the cytosol of bacteria. Parallel systems are exploited in the cytosol and mitochondria of eukaryotes to prevent the accumulation of misfolded proteins.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Sang Beom; Dsilva, Carmeline J.; Debenedetti, Pablo G., E-mail: pdebene@princeton.edu
Understanding the mechanisms by which proteins fold from disordered amino-acid chains to spatially ordered structures remains an area of active inquiry. Molecular simulations can provide atomistic details of the folding dynamics which complement experimental findings. Conventional order parameters, such as root-mean-square deviation and radius of gyration, provide structural information but fail to capture the underlying dynamics of the protein folding process. It is therefore advantageous to adopt a method that can systematically analyze simulation data to extract relevant structural as well as dynamical information. The nonlinear dimensionality reduction technique known as diffusion maps automatically embeds the high-dimensional folding trajectories inmore » a lower-dimensional space from which one can more easily visualize folding pathways, assuming the data lie approximately on a lower-dimensional manifold. The eigenvectors that parametrize the low-dimensional space, furthermore, are determined systematically, rather than chosen heuristically, as is done with phenomenological order parameters. We demonstrate that diffusion maps can effectively characterize the folding process of a Trp-cage miniprotein. By embedding molecular dynamics simulation trajectories of Trp-cage folding in diffusion maps space, we identify two folding pathways and intermediate structures that are consistent with the previous studies, demonstrating that this technique can be employed as an effective way of analyzing and constructing protein folding pathways from molecular simulations.« less
Minimal model for the secondary structures and conformational conversions in proteins
NASA Astrophysics Data System (ADS)
Imamura, Hideo
Better understanding of protein folding process can provide physical insights on the function of proteins and makes it possible to benefit from genetic information accumulated so far. Protein folding process normally takes place in less than seconds but even seconds are beyond reach of current computational power for simulations on a system of all-atom detail. Hence, to model and explore protein folding process it is crucial to construct a proper model that can adequately describe the physical process and mechanism for the relevant time scale. We discuss the reduced off-lattice model that can express _-helix and ?-hairpin conformations defined solely by a given sequence in order to investigate a protein folding mechanism of conformations such as a ?-hairpin and also to investigate conformational conversions in proteins. The first two chapters introduce and review essential concepts in protein folding modelling physical interaction in proteins, various simple models, and also review computational methods, in particular, the Metropolis Monte Carlo method, its dynamic interpretation and thermodynamic Monte Carlo algorithms. Chapter 3 describes the minimalist model that represents both _-helix and ?-sheet conformations using simple potentials. The native conformation can be specified by the sequence without particular conformational biases to a reference state. In Chapter 4, the model is used to investigate the folding mechanism of ?-hairpins exhaustively using the dynamic Monte Carlo and a thermodynamic Monte Carlo method an effcient combination of the multicanonical Monte Carlo and the weighted histogram analysis method. We show that the major folding pathways and folding rate depend on the location of a hydrophobic. The conformational conversions between _-helix and ?-sheet conformations are examined in Chapter 5 and 6. First, the conformational conversion due to mutation in a non-hydrophobic system and then the conformational conversion due to mutation with a hydrophobic pair at a different position at various temperatures are examined.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cornell, Thomas A.; Srivastava, Yogesh; Jauch, Ralf
Cage proteins assemble into nanoscale structures with large central cavities. They play roles, including those as virus capsids and chaperones, and have been applied to drug delivery and nanomaterials. Furthermore, protein cages have been used as model systems to understand and design protein quaternary structure. Ferritins are ubiquitous protein cages that manage iron homeostasis and oxidative damage. Two ferritin subfamilies have strongly similar tertiary structure yet distinct quaternary structure: maxi-ferritins normally assemble into 24-meric, octahedral cages with C-terminal E-helices centered around 4-fold symmetry axes, and mini-ferritins are 12-meric, tetrahedral cages with 3-fold axes defined by C-termini lacking E-domains. To understandmore » the role E-domains play in ferritin quaternary structure, we previously designed a chimera of a maxi-ferritin E-domain fused to the C-terminus of a mini-ferritin. The chimera is a 12-mer cage midway in size between those of the maxi- and mini-ferritin. The research described herein sets out to understand (a) whether the increase in size over a typical mini-ferritin is due to a frozen state where the E-domain is flipped out of the cage and (b) whether the symmetrical preference of the E-domain in the maxi-ferritin (4-fold axis) overrules the C-terminal preference in the mini-ferritin (3-fold axis). With a 1.99 Å resolution crystal structure, we determined that the chimera assembles into a tetrahedral cage that can be nearly superimposed with the parent mini-ferritin, and that the E-domains are flipped external to the cage at the 3-fold symmetry axes.« less
How long does it take to equilibrate the unfolded state of a protein?
Levy, Ronald M; Dai, Wei; Deng, Nan-Jie; Makarov, Dmitrii E
2013-01-01
How long does it take to equilibrate the unfolded state of a protein? The answer to this question has important implications for our understanding of why many small proteins fold with two state kinetics. When the equilibration within the unfolded state U is much faster than the folding, the folding kinetics will be two state even if there are many folding pathways with different barriers. Yet the mean first passage times (MFPTs) between different regions of the unfolded state can be much longer than the folding time. This seems to imply that the equilibration within U is much slower than the folding. In this communication we resolve this paradox. We present a formula for estimating the time to equilibrate the unfolded state of a protein. We also present a formula for the MFPT to any state within U, which is proportional to the average lifetime of that state divided by the state population. This relation is valid when the equilibration within U is very fast as compared with folding as it often is for small proteins. To illustrate the concepts, we apply the formulas to estimate the time to equilibrate the unfolded state of Trp-cage and MFPTs within the unfolded state based on a Markov State Model using an ultra-long 208 microsecond trajectory of the miniprotein to parameterize the model. The time to equilibrate the unfolded state of Trp-cage is ∼100 ns while the typical MFPTs within U are tens of microseconds or longer. PMID:23963761
CASP10-BCL::Fold efficiently samples topologies of large proteins.
Heinze, Sten; Putnam, Daniel K; Fischer, Axel W; Kohlmann, Tim; Weiner, Brian E; Meiler, Jens
2015-03-01
During CASP10 in summer 2012, we tested BCL::Fold for prediction of free modeling (FM) and template-based modeling (TBM) targets. BCL::Fold assembles the tertiary structure of a protein from predicted secondary structure elements (SSEs) omitting more flexible loop regions early on. This approach enables the sampling of conformational space for larger proteins with more complex topologies. In preparation of CASP11, we analyzed the quality of CASP10 models throughout the prediction pipeline to understand BCL::Fold's ability to sample the native topology, identify native-like models by scoring and/or clustering approaches, and our ability to add loop regions and side chains to initial SSE-only models. The standout observation is that BCL::Fold sampled topologies with a GDT_TS score > 33% for 12 of 18 and with a topology score > 0.8 for 11 of 18 test cases de novo. Despite the sampling success of BCL::Fold, significant challenges still exist in clustering and loop generation stages of the pipeline. The clustering approach employed for model selection often failed to identify the most native-like assembly of SSEs for further refinement and submission. It was also observed that for some β-strand proteins model refinement failed as β-strands were not properly aligned to form hydrogen bonds removing otherwise accurate models from the pool. Further, BCL::Fold samples frequently non-natural topologies that require loop regions to pass through the center of the protein. © 2015 Wiley Periodicals, Inc.
Structure Prediction and Analysis of Neuraminidase Sequence Variants
ERIC Educational Resources Information Center
Thayer, Kelly M.
2016-01-01
Analyzing protein structure has become an integral aspect of understanding systems of biochemical import. The laboratory experiment endeavors to introduce protein folding to ascertain structures of proteins for which the structure is unavailable, as well as to critically evaluate the quality of the prediction obtained. The model system used is the…
How Many Protein Sequences Fold to a Given Structure? A Coevolutionary Analysis.
Tian, Pengfei; Best, Robert B
2017-10-17
Quantifying the relationship between protein sequence and structure is key to understanding the protein universe. A fundamental measure of this relationship is the total number of amino acid sequences that can fold to a target protein structure, known as the "sequence capacity," which has been suggested as a proxy for how designable a given protein fold is. Although sequence capacity has been extensively studied using lattice models and theory, numerical estimates for real protein structures are currently lacking. In this work, we have quantitatively estimated the sequence capacity of 10 proteins with a variety of different structures using a statistical model based on residue-residue co-evolution to capture the variation of sequences from the same protein family. Remarkably, we find that even for the smallest protein folds, such as the WW domain, the number of foldable sequences is extremely large, exceeding the Avogadro constant. In agreement with earlier theoretical work, the calculated sequence capacity is positively correlated with the size of the protein, or better, the density of contacts. This allows the absolute sequence capacity of a given protein to be approximately predicted from its structure. On the other hand, the relative sequence capacity, i.e., normalized by the total number of possible sequences, is an extremely tiny number and is strongly anti-correlated with the protein length. Thus, although there may be more foldable sequences for larger proteins, it will be much harder to find them. Lastly, we have correlated the evolutionary age of proteins in the CATH database with their sequence capacity as predicted by our model. The results suggest a trade-off between the opposing requirements of high designability and the likelihood of a novel fold emerging by chance. Published by Elsevier Inc.
Small Scaffolds, Big Potential: Developing Miniature Proteins as Therapeutic Agents.
Holub, Justin M
2017-09-01
Preclinical Research Miniature proteins are a class of oligopeptide characterized by their short sequence lengths and ability to adopt well-folded, three-dimensional structures. Because of their biomimetic nature and synthetic tractability, miniature proteins have been used to study a range of biochemical processes including fast protein folding, signal transduction, catalysis and molecular transport. Recently, miniature proteins have been gaining traction as potential therapeutic agents because their small size and ability to fold into defined tertiary structures facilitates their development as protein-based drugs. This research overview discusses emerging developments involving the use of miniature proteins as scaffolds to design novel therapeutics for the treatment and study of human disease. Specifically, this review will explore strategies to: (i) stabilize miniature protein tertiary structure; (ii) optimize biomolecular recognition by grafting functional epitopes onto miniature protein scaffolds; and (iii) enhance cytosolic delivery of miniature proteins through the use of cationic motifs that facilitate endosomal escape. These objectives are discussed not only to address challenges in developing effective miniature protein-based drugs, but also to highlight the tremendous potential miniature proteins hold for combating and understanding human disease. Drug Dev Res 78 : 268-282, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Guerriero, Christopher J.; Brodsky, Jeffrey L.
2014-01-01
Protein folding is a complex, error-prone process that often results in an irreparable protein by-product. These by-products can be recognized by cellular quality control machineries and targeted for proteasome-dependent degradation. The folding of proteins in the secretory pathway adds another layer to the protein folding “problem,” as the endoplasmic reticulum maintains a unique chemical environment within the cell. In fact, a growing number of diseases are attributed to defects in secretory protein folding, and many of these by-products are targeted for a process known as endoplasmic reticulum-associated degradation (ERAD). Since its discovery, research on the mechanisms underlying the ERAD pathway has provided new insights into how ERAD contributes to human health during both normal and diseases states. Links between ERAD and disease are evidenced from the loss of protein function as a result of degradation, chronic cellular stress when ERAD fails to keep up with misfolded protein production, and the ability of some pathogens to coopt the ERAD pathway. The growing number of ERAD substrates has also illuminated the differences in the machineries used to recognize and degrade a vast array of potential clients for this pathway. Despite all that is known about ERAD, many questions remain, and new paradigms will likely emerge. Clearly, the key to successful disease treatment lies within defining the molecular details of the ERAD pathway and in understanding how this conserved pathway selects and degrades an innumerable cast of substrates. PMID:22535891
Shao, Qiang; Shi, Jiye; Zhu, Weiliang
2012-09-28
The ability of molecular dynamics simulation to capturing the transient states within the folding pathway of protein is important to the understanding of protein folding mechanism. In the present study, the integrated-tempering-sampling molecular dynamics (ITS-MD) simulation was performed to investigate the transient states including intermediate and unfolded ones in the folding pathway of a miniprotein, Trp-cage. Three force fields (FF03, FF99SB, and FF96) were tested, and both intermediate and unfolded states with their characteristics in good agreement with experiments were observed during the simulations, which supports the hypothesis that observable intermediates might present in the folding pathway of small polypeptides. In addition, it was demonstrated that FF03 force field as combined with ITS-MD is in overall a more proper force field than the others in reproducing experimentally recorded properties in UVRS, ECD, and NMR, Photo-CIDNP NMR, and IR T-jump experiments, and the folding∕unfolding thermodynamics parameters, such as ΔG(U), ΔC(p), and ΔH(U) (T(m)). In summary, the present study showed that using suitable force field and energy sampling method, molecular dynamics simulation could capture the transient states within the folding pathway of protein which are consistent with the experimental measurements, and thus provide information of protein folding mechanism and thermodynamics.
Effects of Polymer Hydrophobicity on Protein Structure and Aggregation Kinetics in Crowded Milieu.
Breydo, Leonid; Sales, Amanda E; Frege, Telma; Howell, Mark C; Zaslavsky, Boris Y; Uversky, Vladimir N
2015-05-19
We examined the effects of water-soluble polymers of various degrees of hydrophobicity on the folding and aggregation of proteins. The polymers we chose were polyethylene glycol (PEG) and UCON (1:1 copolymer of ethylene glycol and propylene glycol). The presence of additional methyl groups in UCON makes it more hydrophobic than PEG. Our earlier analysis revealed that similarly sized PEG and UCON produced different changes in the solvent properties of water in their solutions and induced morphologically different α-synuclein aggregates [Ferreira, L. A., et al. (2015) Role of solvent properties of aqueous media in macromolecular crowding effects. J. Biomol. Struct. Dyn., in press]. To improve our understanding of molecular mechanisms defining behavior of proteins in a crowded environment, we tested the effects of these polymers on secondary and tertiary structure and aromatic residue solvent accessibility of 10 proteins [five folded proteins, two hybrid proteins; i.e., protein containing ordered and disordered domains, and three intrinsically disordered proteins (IDPs)] and on the aggregation kinetics of insulin and α-synuclein. We found that effects of both polymers on secondary and tertiary structures of folded and hybrid proteins were rather limited with slight unfolding observed in some cases. Solvent accessibility of aromatic residues was significantly increased for the majority of the studied proteins in the presence of UCON but not PEG. PEG also accelerated the aggregation of protein into amyloid fibrils, whereas UCON promoted aggregation to amyloid oligomers instead. These results indicate that even a relatively small change in polymer structure leads to a significant change in the effect of this polymer on protein folding and aggregation. This is an indication that protein folding and especially aggregation are highly sensitive to the presence of other macromolecules, and an excluded volume effect is insufficient to describe their effect.
Exploring the sequence-structure protein landscape in the glycosyltransferase family
Zhang, Ziding; Kochhar, Sunil; Grigorov, Martin
2003-01-01
To understand the molecular basis of glycosyltransferases’ (GTFs) catalytic mechanism, extensive structural information is required. Here, fold recognition methods were employed to assign 3D protein shapes (folds) to the currently known GTF sequences, available in public databases such as GenBank and Swissprot. First, GTF sequences were retrieved and classified into clusters, based on sequence similarity only. Intracluster sequence similarity was chosen sufficiently high to ensure that the same fold is found within a given cluster. Then, a representative sequence from each cluster was selected to compose a subset of GTF sequences. The members of this reduced set were processed by three different fold recognition methods: 3D-PSSM, FUGUE, and GeneFold. Finally, the results from different fold recognition methods were analyzed and compared to sequence-similarity search methods (i.e., BLAST and PSI-BLAST). It was established that the folds of about 70% of all currently known GTF sequences can be confidently assigned by fold recognition methods, a value which is higher than the fold identification rate based on sequence comparison alone (48% for BLAST and 64% for PSI-BLAST). The identified folds were submitted to 3D clustering, and we found that most of the GTF sequences adopt the typical GTF A or GTF B folds. Our results indicate a lack of evidence that new GTF folds (i.e., folds other than GTF A and B) exist. Based on cases where fold identification was not possible, we suggest several sequences as the most promising targets for a structural genomics initiative focused on the GTF protein family. PMID:14500887
Effects of mutation, truncation, and temperature on the folding kinetics of a WW domain.
Maisuradze, Gia G; Zhou, Rui; Liwo, Adam; Xiao, Yi; Scheraga, Harold A
2012-07-20
The purpose of this work is to show how mutation, truncation, and change of temperature can influence the folding kinetics of a protein. This is accomplished by principal component analysis of molecular-dynamics-generated folding trajectories of the triple β-strand WW domain from formin binding protein 28 (FBP28) (Protein Data Bank ID: 1E0L) and its full-size, and singly- and doubly-truncated mutants at temperatures below and very close to the melting point. The reasons for biphasic folding kinetics [i.e., coexistence of slow (three-state) and fast (two-state) phases], including the involvement of a solvent-exposed hydrophobic cluster and another delocalized hydrophobic core in the folding kinetics, are discussed. New folding pathways are identified in free-energy landscapes determined in terms of principal components for full-size mutants. Three-state folding is found to be a main mechanism for folding the FBP28 WW domain and most of the full-size and truncated mutants. The results from the theoretical analysis are compared to those from experiment. Agreements and discrepancies between the theoretical and experimental results are discussed. Because of its importance in understanding protein kinetics and function, the diffusive mechanism by which the FBP28 WW domain and its full-size and truncated mutants explore their conformational space is examined in terms of the mean-square displacement and principal component analysis eigenvalue spectrum analyses. Subdiffusive behavior is observed for all studied systems. Copyright © 2012. Published by Elsevier Ltd.
Celebioglu, Hasan Ufuk; Ejby, Morten; Majumder, Avishek; Købler, Carsten; Goh, Yong Jun; Thorsen, Kristian; Schmidt, Bjarne; O'Flaherty, Sarah; Abou Hachem, Maher; Lahtinen, Sampo J; Jacobsen, Susanne; Klaenhammer, Todd R; Brix, Susanne; Mølhave, Kristian; Svensson, Birte
2016-05-01
Whole cell and surface proteomes were analyzed together with adhesive properties of the probiotic bacterium Lactobacillus acidophilus NCFM (NCFM) grown on the emerging prebiotic raffinose, exemplifying a synbiotic. Adhesion of NCFM to mucin and intestinal HT-29 cells increased three-fold after culture with raffinose versus glucose, as also visualized by scanning electron microscopy. Comparative proteomics using 2D-DIGE showed 43 unique proteins to change in relative abundance in whole cell lysates from NCFM grown on raffinose compared to glucose. Furthermore, 14 unique proteins in 18 spots of the surface subproteome underwent changes identified by differential 2DE, including elongation factor G, thermostable pullulanase, and phosphate starvation inducible stress-related protein increasing in a range of +2.1 - +4.7 fold. By contrast five known moonlighting proteins decreased in relative abundance by up to -2.4 fold. Enzymes involved in raffinose catabolism were elevated in the whole cell proteome; α-galactosidase (+13.9 fold); sucrose phosphorylase (+5.4 fold) together with metabolic enzymes from the Leloir pathway for galactose utilization and the glycolysis; β-galactosidase (+5.7 fold); galactose (+2.9/+3.1 fold) and fructose (+2.8 fold) kinases. The insights at the molecular and cellular levels contributed to the understanding of the interplay of a synbiotic composed of NCFM and raffinose with the host. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Hsiao, Hao-Ching; Gonzalez, Kim L.; Catanese, Daniel J.; Jordy, Kristopher E.; Matthews, Kathleen S.; Bondos, Sarah E.
2014-01-01
Interactions between structured proteins require a complementary topology and surface chemistry to form sufficient contacts for stable binding. However, approximately one third of protein interactions are estimated to involve intrinsically disordered regions of proteins. The dynamic nature of disordered regions before and, in some cases, after binding calls into question the role of partner topology in forming protein interactions. To understand how intrinsically disordered proteins identify the correct interacting partner proteins, we evaluated interactions formed by the Drosophila melanogaster Hox transcription factor Ultrabithorax (Ubx), which contains both structured and disordered regions. Ubx binding proteins are enriched in specific folds: 23 of its 39 partners include one of 7 folds, out of the 1195 folds recognized by SCOP. For the proteins harboring the two most populated folds, DNA-RNA binding 3-helical bundles and α-α superhelices, the regions of the partner proteins that exhibit these preferred folds are sufficient for Ubx binding. Three disorder-containing regions in Ubx are required to bind these partners. These regions are either alternatively spliced or multiply phosphorylated, providing a mechanism for cellular processes to regulate Ubx-partner interactions. Indeed, partner topology correlates with the ability of individual partner proteins to bind Ubx spliceoforms. Partners bind different disordered regions within Ubx to varying extents, creating the potential for competition between partners and cooperative binding by partners. The ability of partners to bind regions of Ubx that activate transcription and regulate DNA binding provides a mechanism for partners to modulate transcription regulation by Ubx, and suggests that one role of disorder in Ubx is to coordinate multiple molecular functions in response to tissue-specific cues. PMID:25286318
BPI-fold (BPIF) containing/plunc protein expression in human fetal major and minor salivary glands.
Alves, Daniel Berretta Moreira; Bingle, Lynne; Bingle, Colin David; Lourenço, Silvia Vanessa; Silva, Andréia Aparecida; Pereira, Débora Lima; Vargas, Pablo Agustin
2017-01-16
The aim of this study was to determine expression, not previously described, of PLUNC (palate, lung, and nasal epithelium clone) (BPI-fold containing) proteins in major and minor salivary glands from very early fetal tissue to the end of the second trimester and thus gain further insight into the function of these proteins. Early fetal heads, and major and minor salivary glands were collected retrospectively and glands were classified according to morphodifferentiation stage. Expression of BPI-fold containing proteins was localized through immunohistochemistry. BPIFA2, the major BPI-fold containing protein in adult salivary glands, was detected only in the laryngeal pharynx; the lack of staining in salivary glands suggested salivary expression is either very late in development or is only in adult tissues. Early expression of BPIFA1 was seen in the trachea and nasal cavity with salivary gland expression only seen in late morphodifferentiation stages. BPIFB1 was seen in early neural tissue and at later stages in submandibular and sublingual glands. BPIFA1 is significantly expressed in early fetal oral tissue but BPIFB1 has extremely limited expression and the major salivary BPIF protein (BPIFA2) is not produced in fetal development. Further studies, with more sensitive techniques, will confirm the expression pattern and enable a better understanding of embryonic BPIF protein function.
Mechanism of the eukaryotic chaperonin: protein folding in the chamber of secrets
Spiess, Christoph; Meyer, Anne S.; Reissmann, Stefanie; Frydman, Judith
2010-01-01
Chaperonins are key components of the cellular chaperone machinery. These large, cylindrical complexes contain a central cavity that binds to unfolded polypeptides and sequesters them from the cellular environment. Substrate folding then occurs in this central cavity in an ATP-dependent manner. The eukaryotic chaperonin TCP-1 ring complex (TRiC, also called CCT) is indispensable for cell survival because the folding of an essential subset of cytosolic proteins requires TRiC, and this function cannot be substituted by other chaperones. This specificity indicates that TRiC has evolved structural and mechanistic features that distinguish it from other chaperones. Although knowledge of this unique complex is in its infancy, we review recent advances that open the way to understanding the secrets of its folding chamber. PMID:15519848
Mitochondrial unfolded protein response controls matrix pre-RNA processing and translation.
Münch, Christian; Harper, J Wade
2016-06-30
The mitochondrial matrix is unique in that it must integrate the folding and assembly of proteins derived from the nuclear and mitochondrial genomes. In Caenorhabditis elegans, the mitochondrial unfolded protein response (UPRmt) senses matrix protein misfolding and induces a program of nuclear gene expression, including mitochondrial chaperonins, to promote mitochondrial proteostasis. While misfolded mitochondrial-matrix-localized ornithine transcarbamylase induces chaperonin expression, our understanding of mammalian UPRmt is rudimentary, reflecting a lack of acute triggers for UPRmt activation. This limitation has prevented analysis of the cellular responses to matrix protein misfolding and the effects of UPRmt on mitochondrial translation to control protein folding loads. Here we combine pharmacological inhibitors of matrix-localized HSP90/TRAP1 (ref. 8) or LON protease, which promote chaperonin expression, with global transcriptional and proteomic analysis to reveal an extensive and acute response of human cells to UPRmt. This response encompasses widespread induction of nuclear genes, including matrix-localized proteins involved in folding, pre-RNA processing and translation. Functional studies revealed rapid but reversible translation inhibition in mitochondria occurring concurrently with defects in pre-RNA processing caused by transcriptional repression and LON-dependent turnover of the mitochondrial pre-RNA processing nuclease MRPP3 (ref. 10). This study reveals that acute mitochondrial protein folding stress activates both increased chaperone availability within the matrix and reduced matrix-localized protein synthesis through translational inhibition, and provides a framework for further dissection of mammalian UPRmt.
Endoplasmic Reticulum Stress and Oxidative Stress in Cell Fate Decision and Human Disease
Cao, Stewart Siyan
2014-01-01
Abstract Significance: The endoplasmic reticulum (ER) is a specialized organelle for the folding and trafficking of proteins, which is highly sensitive to changes in intracellular homeostasis and extracellular stimuli. Alterations in the protein-folding environment cause accumulation of misfolded proteins in the ER that profoundly affect a variety of cellular signaling processes, including reduction–oxidation (redox) homeostasis, energy production, inflammation, differentiation, and apoptosis. The unfolded protein response (UPR) is a collection of adaptive signaling pathways that evolved to resolve protein misfolding and restore an efficient protein-folding environment. Recent Advances: Production of reactive oxygen species (ROS) has been linked to ER stress and the UPR. ROS play a critical role in many cellular processes and can be produced in the cytosol and several organelles, including the ER and mitochondria. Studies suggest that altered redox homeostasis in the ER is sufficient to cause ER stress, which could, in turn, induce the production of ROS in the ER and mitochondria. Critical Issues: Although ER stress and oxidative stress coexist in many pathologic states, whether and how these stresses interact is unknown. It is also unclear how changes in the protein-folding environment in the ER cause oxidative stress. In addition, how ROS production and protein misfolding commit the cell to an apoptotic death and contribute to various degenerative diseases is unknown. Future Directions: A greater fundamental understanding of the mechanisms that preserve protein folding homeostasis and redox status will provide new information toward the development of novel therapeutics for many human diseases. Antioxid. Redox Signal. 21, 396–413. PMID:24702237
Characterizing Conformational Dynamics of Proteins Using Evolutionary Couplings.
Feng, Jiangyan; Shukla, Diwakar
2018-01-25
Understanding of protein conformational dynamics is essential for elucidating molecular origins of protein structure-function relationship. Traditionally, reaction coordinates, i.e., some functions of protein atom positions and velocities have been used to interpret the complex dynamics of proteins obtained from experimental and computational approaches such as molecular dynamics simulations. However, it is nontrivial to identify the reaction coordinates a priori even for small proteins. Here, we evaluate the power of evolutionary couplings (ECs) to capture protein dynamics by exploring their use as reaction coordinates, which can efficiently guide the sampling of a conformational free energy landscape. We have analyzed 10 diverse proteins and shown that a few ECs are sufficient to characterize complex conformational dynamics of proteins involved in folding and conformational change processes. With the rapid strides in sequencing technology, we expect that ECs could help identify reaction coordinates a priori and enhance the sampling of the slow dynamical process associated with protein folding and conformational change.
Classification of proteins: available structural space for molecular modeling.
Andreeva, Antonina
2012-01-01
The wealth of available protein structural data provides unprecedented opportunity to study and better understand the underlying principles of protein folding and protein structure evolution. A key to achieving this lies in the ability to analyse these data and to organize them in a coherent classification scheme. Over the past years several protein classifications have been developed that aim to group proteins based on their structural relationships. Some of these classification schemes explore the concept of structural neighbourhood (structural continuum), whereas other utilize the notion of protein evolution and thus provide a discrete rather than continuum view of protein structure space. This chapter presents a strategy for classification of proteins with known three-dimensional structure. Steps in the classification process along with basic definitions are introduced. Examples illustrating some fundamental concepts of protein folding and evolution with a special focus on the exceptions to them are presented.
Rosen, Laura E.; Connell, Katelyn B.; Marqusee, Susan
2014-01-01
The molten globule, a conformational ensemble with significant secondary structure but only loosely packed tertiary structure, has been suggested to be a ubiquitous intermediate in protein folding. However, it is difficult to assess the tertiary packing of transiently populated species to evaluate this hypothesis. Escherichia coli RNase H is known to populate an intermediate before the rate-limiting barrier to folding that has long been thought to be a molten globule. We investigated this hypothesis by making mimics of the intermediate that are the ground-state conformation at equilibrium, using two approaches: a truncation to generate a fragment mimic of the intermediate, and selective destabilization of the native state using point mutations. Spectroscopic characterization and the response of the mimics to further mutation are consistent with studies on the transient kinetic intermediate, indicating that they model the early intermediate. Both mimics fold cooperatively and exhibit NMR spectra indicative of a closely packed conformation, in contrast to the hypothesis of molten tertiary packing. This result is important for understanding the nature of the subsequent rate-limiting barrier to folding and has implications for the assumption that many other proteins populate molten globule folding intermediates. PMID:25258414
Rosen, Laura E; Connell, Katelyn B; Marqusee, Susan
2014-10-14
The molten globule, a conformational ensemble with significant secondary structure but only loosely packed tertiary structure, has been suggested to be a ubiquitous intermediate in protein folding. However, it is difficult to assess the tertiary packing of transiently populated species to evaluate this hypothesis. Escherichia coli RNase H is known to populate an intermediate before the rate-limiting barrier to folding that has long been thought to be a molten globule. We investigated this hypothesis by making mimics of the intermediate that are the ground-state conformation at equilibrium, using two approaches: a truncation to generate a fragment mimic of the intermediate, and selective destabilization of the native state using point mutations. Spectroscopic characterization and the response of the mimics to further mutation are consistent with studies on the transient kinetic intermediate, indicating that they model the early intermediate. Both mimics fold cooperatively and exhibit NMR spectra indicative of a closely packed conformation, in contrast to the hypothesis of molten tertiary packing. This result is important for understanding the nature of the subsequent rate-limiting barrier to folding and has implications for the assumption that many other proteins populate molten globule folding intermediates.
Simulating protein folding initiation sites using an alpha-carbon-only knowledge-based force field
Buck, Patrick M.; Bystroff, Christopher
2015-01-01
Protein folding is a hierarchical process where structure forms locally first, then globally. Some short sequence segments initiate folding through strong structural preferences that are independent of their three-dimensional context in proteins. We have constructed a knowledge-based force field in which the energy functions are conditional on local sequence patterns, as expressed in the hidden Markov model for local structure (HMMSTR). Carbon-alpha force field (CALF) builds sequence specific statistical potentials based on database frequencies for α-carbon virtual bond opening and dihedral angles, pairwise contacts and hydrogen bond donor-acceptor pairs, and simulates folding via Brownian dynamics. We introduce hydrogen bond donor and acceptor potentials as α-carbon probability fields that are conditional on the predicted local sequence. Constant temperature simulations were carried out using 27 peptides selected as putative folding initiation sites, each 12 residues in length, representing several different local structure motifs. Each 0.6 μs trajectory was clustered based on structure. Simulation convergence or representativeness was assessed by subdividing trajectories and comparing clusters. For 21 of the 27 sequences, the largest cluster made up more than half of the total trajectory. Of these 21 sequences, 14 had cluster centers that were at most 2.6 Å root mean square deviation (RMSD) from their native structure in the corresponding full-length protein. To assess the adequacy of the energy function on nonlocal interactions, 11 full length native structures were relaxed using Brownian dynamics simulations. Equilibrated structures deviated from their native states but retained their overall topology and compactness. A simple potential that folds proteins locally and stabilizes proteins globally may enable a more realistic understanding of hierarchical folding pathways. PMID:19137613
Learning about protein solubility from bacterial inclusion bodies
Martínez-Alonso, Mónica; González-Montalbán, Nuria; García-Fruitós, Elena; Villaverde, Antonio
2009-01-01
The progressive solving of the conformation of aggregated proteins and the conceptual understanding of the biology of inclusion bodies in recombinant bacteria is providing exciting insights on protein folding and quality. Interestingly, newest data also show an unexpected functional and structural complexity of soluble recombinant protein species and picture the whole bacterial cell factory scenario as more intricate than formerly believed. PMID:19133126
Structure of human POFUT2: insights into thrombospondin type 1 repeat fold and O-fucosylation
Chen, Chun-I; Keusch, Jeremy J; Klein, Dominique; Hess, Daniel; Hofsteenge, Jan; Gut, Heinz
2012-01-01
Protein O-fucosylation is a post-translational modification found on serine/threonine residues of thrombospondin type 1 repeats (TSR). The fucose transfer is catalysed by the enzyme protein O-fucosyltransferase 2 (POFUT2) and >40 human proteins contain the TSR consensus sequence for POFUT2-dependent fucosylation. To better understand O-fucosylation on TSR, we carried out a structural and functional analysis of human POFUT2 and its TSR substrate. Crystal structures of POFUT2 reveal a variation of the classical GT-B fold and identify sugar donor and TSR acceptor binding sites. Structural findings are correlated with steady-state kinetic measurements of wild-type and mutant POFUT2 and TSR and give insight into the catalytic mechanism and substrate specificity. By using an artificial mini-TSR substrate, we show that specificity is not primarily encoded in the TSR protein sequence but rather in the unusual 3D structure of a small part of the TSR. Our findings uncover that recognition of distinct conserved 3D fold motifs can be used as a mechanism to achieve substrate specificity by enzymes modifying completely folded proteins of very wide sequence diversity and biological function. PMID:22588082
Thermodynamics of coupled protein adsorption and stability using hybrid Monte Carlo simulations.
Zhong, Ellen D; Shirts, Michael R
2014-05-06
A better understanding of changes in protein stability upon adsorption can improve the design of protein separation processes. In this study, we examine the coupling of the folding and the adsorption of a model protein, the B1 domain of streptococcal protein G, as a function of surface attraction using a hybrid Monte Carlo (HMC) approach with temperature replica exchange and umbrella sampling. In our HMC implementation, we are able to use a molecular dynamics (MD) time step that is an order of magnitude larger than in a traditional MD simulation protocol and observe a factor of 2 enhancement in the folding and unfolding rate. To demonstrate the convergence of our systems, we measure the travel of our order parameter the fraction of native contacts between folded and unfolded states throughout the length of our simulations. Thermodynamic quantities are extracted with minimum statistical variance using multistate reweighting between simulations at different temperatures and harmonic distance restraints from the surface. The resultant free energies, enthalpies, and entropies of the coupled unfolding and absorption processes are in qualitative agreement with previous experimental and computational observations, including entropic stabilization of the adsorbed, folded state relative to the bulk on surfaces with low attraction.
Progress in bioinformatics and the importance of being earnest.
Attwood, T K; Miller, C J
2002-01-01
In silico biology has gathered momentum as, worldwide, scientists have united in a common quest to sequence, store and analyse complete genomes. This year, a pivotal achievement of this cooperative endeavour was realised in the release of a public draft of the human genome, and with it the promises to improve our understanding of diverse aspects of biology and to yield a healthier future with safe personalized medicines. Key to these goals will be the need to elucidate and characterise the genes and gene products encoded not just in the human genome, but in many genomes. These tasks are underpinned by the concepts and processes of genome and gene/protein evolution, regulation of gene expression, mechanisms of protein folding, the manifestation of protein function, and so on, all of which must be understood in the context of complex, dynamic biological systems. Our use of computers to model such concepts and systems must be placed in the context of the current limits of our understanding of them:- it is important to recognise, for example, that we don't have a common understanding either of what constitutes a gene or a protein function; we can't invariably say that a particular sequence or fold has arisen via divergent or convergent evolution; and we don't fully understand the rules of protein folding. Accepting what we can't do in silico is essential in appreciating what we can do. Without this understanding, it is easy to be misled, as notions of what particular computational approaches can achieve are sometimes rather optimistic. There are valuable lessons to be learned here from the field of Artificial Intelligence, principal among which is the realisation that capturing and representing complex knowledge is time consuming, expensive and hard. Thus, we argue here that if bioinformatics is to tackle biological complexity in earnest, it would be wise to absorb the experience distilled from decades of artificial intelligence research, and to approach the road ahead with caution, rigour and pragmatism.
Albert, Armando; Yunta, Cristina; Arranz, Rocío; Peña, Álvaro; Salido, Eduardo; Valpuesta, José María; Martín-Benito, Jaime
2010-01-01
Primary hyperoxaluria type 1 is a rare autosomal recessive disease caused by mutations in the alanine glyoxylate aminotransferase gene (AGXT). We have previously shown that P11L and I340M polymorphisms together with I244T mutation (AGXT-LTM) represent a conformational disease that could be amenable to pharmacological intervention. Thus, the study of the folding mechanism of AGXT is crucial to understand the molecular basis of the disease. Here, we provide biochemical and structural data showing that AGXT-LTM is able to form non-native folding intermediates. The three-dimensional structure of a complex between the bacterial chaperonin GroEL and a folding intermediate of AGXT-LTM mutant has been solved by cryoelectron microscopy. The electron density map shows the protein substrate in a non-native extended conformation that crosses the GroEL central cavity. Addition of ATP to the complex induces conformational changes on the chaperonin and the internalization of the protein substrate into the folding cavity. The structure provides a three-dimensional picture of an in vivo early ATP-dependent step of the folding reaction cycle of the chaperonin and supports a GroEL functional model in which the chaperonin promotes folding of the AGXT-LTM mutant protein through forced unfolding mechanism. PMID:20056599
Albert, Armando; Yunta, Cristina; Arranz, Rocío; Peña, Alvaro; Salido, Eduardo; Valpuesta, José María; Martín-Benito, Jaime
2010-02-26
Primary hyperoxaluria type 1 is a rare autosomal recessive disease caused by mutations in the alanine glyoxylate aminotransferase gene (AGXT). We have previously shown that P11L and I340M polymorphisms together with I244T mutation (AGXT-LTM) represent a conformational disease that could be amenable to pharmacological intervention. Thus, the study of the folding mechanism of AGXT is crucial to understand the molecular basis of the disease. Here, we provide biochemical and structural data showing that AGXT-LTM is able to form non-native folding intermediates. The three-dimensional structure of a complex between the bacterial chaperonin GroEL and a folding intermediate of AGXT-LTM mutant has been solved by cryoelectron microscopy. The electron density map shows the protein substrate in a non-native extended conformation that crosses the GroEL central cavity. Addition of ATP to the complex induces conformational changes on the chaperonin and the internalization of the protein substrate into the folding cavity. The structure provides a three-dimensional picture of an in vivo early ATP-dependent step of the folding reaction cycle of the chaperonin and supports a GroEL functional model in which the chaperonin promotes folding of the AGXT-LTM mutant protein through forced unfolding mechanism.
Outer Membrane Protein Folding and Topology from a Computational Transfer Free Energy Scale.
Lin, Meishan; Gessmann, Dennis; Naveed, Hammad; Liang, Jie
2016-03-02
Knowledge of the transfer free energy of amino acids from aqueous solution to a lipid bilayer is essential for understanding membrane protein folding and for predicting membrane protein structure. Here we report a computational approach that can calculate the folding free energy of the transmembrane region of outer membrane β-barrel proteins (OMPs) by combining an empirical energy function with a reduced discrete state space model. We quantitatively analyzed the transfer free energies of 20 amino acid residues at the center of the lipid bilayer of OmpLA. Our results are in excellent agreement with the experimentally derived hydrophobicity scales. We further exhaustively calculated the transfer free energies of 20 amino acids at all positions in the TM region of OmpLA. We found that the asymmetry of the Gram-negative bacterial outer membrane as well as the TM residues of an OMP determine its functional fold in vivo. Our results suggest that the folding process of an OMP is driven by the lipid-facing residues in its hydrophobic core, and its NC-IN topology is determined by the differential stabilities of OMPs in the asymmetrical outer membrane. The folding free energy is further reduced by lipid A and assisted by general depth-dependent cooperativities that exist between polar and ionizable residues. Moreover, context-dependency of transfer free energies at specific positions in OmpLA predict regions important for protein function as well as structural anomalies. Our computational approach is fast, efficient and applicable to any OMP.
Proteomic data from human cell cultures refine mechanisms of chaperone-mediated protein homeostasis.
Finka, Andrija; Goloubinoff, Pierre
2013-09-01
In the crowded environment of human cells, folding of nascent polypeptides and refolding of stress-unfolded proteins is error prone. Accumulation of cytotoxic misfolded and aggregated species may cause cell death, tissue loss, degenerative conformational diseases, and aging. Nevertheless, young cells effectively express a network of molecular chaperones and folding enzymes, termed here "the chaperome," which can prevent formation of potentially harmful misfolded protein conformers and use the energy of adenosine triphosphate (ATP) to rehabilitate already formed toxic aggregates into native functional proteins. In an attempt to extend knowledge of chaperome mechanisms in cellular proteostasis, we performed a meta-analysis of human chaperome using high-throughput proteomic data from 11 immortalized human cell lines. Chaperome polypeptides were about 10% of total protein mass of human cells, half of which were Hsp90s and Hsp70s. Knowledge of cellular concentrations and ratios among chaperome polypeptides provided a novel basis to understand mechanisms by which the Hsp60, Hsp70, Hsp90, and small heat shock proteins (HSPs), in collaboration with cochaperones and folding enzymes, assist de novo protein folding, import polypeptides into organelles, unfold stress-destabilized toxic conformers, and control the conformal activity of native proteins in the crowded environment of the cell. Proteomic data also provided means to distinguish between stable components of chaperone core machineries and dynamic regulatory cochaperones.
Carvalho, Filomena A; Martins, Ivo C; Santos, Nuno C
2013-03-01
Atomic force microscopy (AFM) applied to biological systems can, besides generating high-quality and well-resolved images, be employed to study protein folding via AFM-based force spectroscopy. This approach allowed remarkable advances in the measurement of inter- and intramolecular interaction forces with piconewton resolution. The detection of specific interaction forces between molecules based on the AFM sensitivity and the manipulation of individual molecules greatly advanced the understanding of intra-protein and protein-ligand interactions. Apart from the academic interest in the resolution of basic scientific questions, this technique has also key importance on the clarification of several biological questions of immediate biomedical relevance. Force spectroscopy is an especially appropriate technique for "mechanical proteins" that can provide crucial information on single protein molecules and/or domains. Importantly, it also has the potential of combining in a single experiment spatial and kinetic measurements. Here, the main principles of this methodology are described, after which the ability to measure interactions at the single-molecule level is discussed, in the context of relevant protein-folding examples. We intend to demonstrate the potential of AFM-based force spectroscopy in the study of protein folding, especially since this technique is able to circumvent some of the difficulties typically encountered in classical thermal/chemical denaturation studies. Copyright © 2012 Elsevier Inc. All rights reserved.
Computational approaches for rational design of proteins with novel functionalities
Tiwari, Manish Kumar; Singh, Ranjitha; Singh, Raushan Kumar; Kim, In-Won; Lee, Jung-Kul
2012-01-01
Proteins are the most multifaceted macromolecules in living systems and have various important functions, including structural, catalytic, sensory, and regulatory functions. Rational design of enzymes is a great challenge to our understanding of protein structure and physical chemistry and has numerous potential applications. Protein design algorithms have been applied to design or engineer proteins that fold, fold faster, catalyze, catalyze faster, signal, and adopt preferred conformational states. The field of de novo protein design, although only a few decades old, is beginning to produce exciting results. Developments in this field are already having a significant impact on biotechnology and chemical biology. The application of powerful computational methods for functional protein designing has recently succeeded at engineering target activities. Here, we review recently reported de novo functional proteins that were developed using various protein design approaches, including rational design, computational optimization, and selection from combinatorial libraries, highlighting recent advances and successes. PMID:24688643
Hudson, Devin A; Gannon, Shawn A; Thorpe, Colin
2015-03-01
This review examines oxidative protein folding within the mammalian endoplasmic reticulum (ER) from an enzymological perspective. In protein disulfide isomerase-first (PDI-first) pathways of oxidative protein folding, PDI is the immediate oxidant of reduced client proteins and then addresses disulfide mispairings in a second isomerization phase. In PDI-second pathways the initial oxidation is PDI-independent. Evidence for the rapid reduction of PDI by reduced glutathione is presented in the context of PDI-first pathways. Strategies and challenges are discussed for determination of the concentrations of reduced and oxidized glutathione and of the ratios of PDI(red):PDI(ox). The preponderance of evidence suggests that the mammalian ER is more reducing than first envisaged. The average redox state of major PDI-family members is largely to almost totally reduced. These observations are consistent with model studies showing that oxidative protein folding proceeds most efficiently at a reducing redox poise consistent with a stoichiometric insertion of disulfides into client proteins. After a discussion of the use of natively encoded fluorescent probes to report the glutathione redox poise of the ER, this review concludes with an elaboration of a complementary strategy to discontinuously survey the redox state of as many redox-active disulfides as can be identified by ratiometric LC-MS-MS methods. Consortia of oxidoreductases that are in redox equilibrium can then be identified and compared to the glutathione redox poise of the ER to gain a more detailed understanding of the factors that influence oxidative protein folding within the secretory compartment. Copyright © 2014 Elsevier Inc. All rights reserved.
Slocum, Joshua D; Webb, Lauren J
2017-07-06
A photoactivatable variant of superfolder green fluorescent protein (GFP) was created by replacing the threonine at position 203 with aspartic acid. Photoactivation by exposure of this mutant to UV light resulted in conversion of the fluorophore from the neutral to the negatively charged form, accompanied by a ∼95-fold increase in fluorescence under 488 nm excitation. Mass spectrometry before and after exposure to UV light revealed a change in mass of 88 Da, attributed to the double decarboxylation of Glu 222 and Asp 203. Kinetics studies and nonlinear power-dependence of the initial rate of photoconversion indicated that the double decarboxylation occurred via a multiphoton absorption process at 254 nm. In addition to providing a photoactivatable GFP with robust folding properties, a detailed mechanistic understanding of this double decarboxylation in GFP will lead to a better understanding of charge transfer in fluorescent proteins.
Detecting protein folding by thermal fluctuations of microcantilevers
Aguilar-Sandoval, Felipe; Bellon, Ludovic; Melo, Francisco
2017-01-01
The accurate characterization of proteins in both their native and denatured states is essential to effectively understand protein function, folding and stability. As a proof of concept, a micro rheological method is applied, based on the characterization of thermal fluctuations of a micro cantilever immersed in a bovine serum albumin solution, to assess changes in the viscosity associated with modifications in the protein’s structure under the denaturant effect of urea. Through modeling the power spectrum density of the cantilever’s fluctuations over a broad frequency band, it is possible to implement a fitting procedure to accurately determine the viscosity of the fluid, even at low volumes. Increases in viscosity during the denaturant process are identified using the assumption that the protein is a hard sphere, with a hydrodynamic radius that increases during unfolding. This is modeled accordingly through the Einstein-Batchelor formula. The Einstein-Batchelor formula estimates are verified through dynamic light scattering, which measures the hydrodynamic radius of proteins. Thus, this methodology is proven to be suitable for the study of protein folding in samples of small size at vanishing shear stresses. PMID:29267316
Allostery in the Hsp70 chaperone proteins
Zuiderweg, Erik R.P.; Bertelsen, Eric B.; Rousaki, Aikaterini; Mayer, Matthias P.; Gestwicki, Jason E.; Ahmad, Atta
2013-01-01
Heat shock 70 kDa (Hsp70) chaperones are essential to in-vivo protein folding, protein transport and protein re-folding. They carry out these activities using repeated cycles of binding and release of client proteins. This process is under allosteric control of nucleotide binding and hydrolysis. X-ray crystallography, NMR spectroscopy and other biophysical techniques have contributed much to the understanding of the allosteric mechanism linking these activities and the effect of co-chaperones on this mechanism. In this chapter, these findings are critically reviewed. Studies on the allosteric mechanisms of Hsp70 have gained enhanced urgency, as recent studies have implicated this chaperone as a potential drug target in diseases such as Alzheimer's and cancer. Recent approaches to combat these diseases through interference with the Hsp70 allosteric mechanism are discussed. PMID:22576356
Dubaquié, Y; Looser, R; Fünfschilling, U; Jenö, P; Rospert, S
1998-01-01
The mechanism of chaperonin-assisted protein folding has been mostly analyzed in vitro using non-homologous substrate proteins. In order to understand the relative importance of hsp60 and hsp10 in the living cell, homologous substrate proteins need to be identified and analyzed. We have devised a novel screen to test the folding of a large variety of homologous substrates in the mitochondrial matrix in the absence or presence of functional hsp60 or hsp10. The identified substrates have an Mr of 15-90 kDa and fall into three groups: (i) proteins that require both hsp60 and hsp10 for correct folding; (ii) proteins that completely fail to fold after inactivation of hsp60 but are unaffected by the inactivation of hsp10; and (iii) newly imported hsp60 itself, which is more severely affected by inactivation of hsp10 than by inactivation of pre-existing hsp60. The majority of the identified substrates are group I proteins. For these, the lack of hsp60 function has a more pronounced effect than inactivation of hsp10. We suggest that homologous substrate proteins have differential chaperonin requirements, indicating that hsp60 and hsp10 do not always act as a single functional unit in vivo. PMID:9774331
Towards quantitative classification of folded proteins in terms of elementary functions.
Hu, Shuangwei; Krokhotin, Andrei; Niemi, Antti J; Peng, Xubiao
2011-04-01
A comparative classification scheme provides a good basis for several approaches to understand proteins, including prediction of relations between their structure and biological function. But it remains a challenge to combine a classification scheme that describes a protein starting from its well-organized secondary structures and often involves direct human involvement, with an atomary-level physics-based approach where a protein is fundamentally nothing more than an ensemble of mutually interacting carbon, hydrogen, oxygen, and nitrogen atoms. In order to bridge these two complementary approaches to proteins, conceptually novel tools need to be introduced. Here we explain how an approach toward geometric characterization of entire folded proteins can be based on a single explicit elementary function that is familiar from nonlinear physical systems where it is known as the kink soliton. Our approach enables the conversion of hierarchical structural information into a quantitative form that allows for a folded protein to be characterized in terms of a small number of global parameters that are in principle computable from atomary-level considerations. As an example we describe in detail how the native fold of the myoglobin 1M6C emerges from a combination of kink solitons with a very high atomary-level accuracy. We also verify that our approach describes longer loops and loops connecting α helices with β strands, with the same overall accuracy. ©2011 American Physical Society
Makarov, Alexey A; Schafer, Wes A; Helmy, Roy
2015-02-17
The market of protein therapeutics is exploding, and characterization methods for proteins are being further developed to understand and explore conformational structures with regards to function and activity. There are several spectroscopic techniques that allow for analyzing protein secondary structure in solution. However, a majority of these techniques need to use purified protein, concentrated enough in the solution to produce a relevant spectrum. In this study, we describe a novel approach which uses ultrahigh pressure liquid chromatography (UHPLC) coupled with mass-spectrometry (MS) to explore compressibility of the secondary structure of proteins under increasing pressure detected by hydrogen-deuterium exchange (HDX). Several model proteins were used for these studies. The studies were conducted with UHPLC in isocratic mode at constant flow rate and temperature. The pressure was modified by a backpressure regulator up to about 1200 bar. It was found that the increase of retention factors upon pressure increase, at constant flow rate and temperature, was based on reduction of the proteins' molecular molar volume. The change in the proteins' molecular molar volume was caused by changes in protein folding, as was revealed by differential deuterium exchange. The degree of protein folding under certain UHPLC conditions can be controlled by pressure, at constant temperature and flow rate. By modifying pressure during UHPLC separation, it was possible to achieve changes in protein folding, which were manifested as changes in the number of labile protons exchanged to deuterons, or vice versa. Moreover, it was demonstrated with bovine insulin that a small difference in the number of protons exchanged to deuterons (based on protein folding under pressure) could be observed between batches obtained from different sources. The use of HDX during UHPLC separation allowed one to examine protein folding by pressure at constant flow rate and temperature in a mixture of sample solution with minimal amounts of sample used for analysis.
Weinkam, Patrick; Romesberg, Floyd E.; Wolynes, Peter G.
2010-01-01
A grand canonical formalism is developed to combine discrete simulations for chemically distinct species in equilibrium. Each simulation is based on a perturbed funneled landscape. The formalism is illustrated using the alkaline-induced transitions of cytochrome c as observed by FTIR spectroscopy and with various other experimental approaches. The grand canonical simulation method accounts for the acid/base chemistry of deprotonation, the inorganic chemistry of heme ligation and misligation, and the minimally frustrated folding energy landscape, thus elucidating the physics of protein folding involved with an acid/base titration of a protein. The formalism combines simulations for each of the relevant chemical species, varying by protonation and ligation states. In contrast to models based on perfectly funneled energy landscapes that contain only contacts found in the native structure, the current study introduces “chemical frustration” from deprotonation and misligation that gives rise to many intermediates at alkaline pH. While the nature of these intermediates cannot be easily inferred from available experimental data, the current study provides specific structural details of these intermediates thus extending our understanding of how cytochrome c changes with increasing pH. The results demonstrate the importance of chemical frustration for understanding biomolecular energy landscapes. PMID:19199810
What makes proteins work: exploring life in P-T-X
NASA Astrophysics Data System (ADS)
Ichiye, Toshiko
2016-12-01
Although considerable progress has been made in the molecular biophysics of proteins, it is still not possible to reliably design an enzyme for a given function. The current understanding of enzyme function is that both structure and flexibility are important. Much attention has been focused recently on protein folding and thus structure, spurred on by insights from the folding funnel concept. For experimental studies of protein folding, variations in temperature (T) and chemical composition (X) of the solution have been traditionally exploited, although more recent studies using variations in pressure (P) made possible through new instrumentation have led to a deeper understanding of the energy landscape of protein folding. Other work has shown that flexibility is also essential for enzymes, although it is still not clear what type is important. Another avenue has been to take advantage of ‘Nature’s laboratory’ by exploring homologous proteins from organisms that live in extreme conditions, or ‘extremophiles’. While the most studied extremophiles live at extremes of T and X, recent exploration of deep-sea environments has led to the discovery of organisms living under high P, or ‘piezophiles’. An exploration of targeted enzymes from organisms with various P-T-X growth conditions coupled with advances in biophysical instrumentation and computer simulations that allow studies of these enzymes at different P-T-X conditions may lead to a better understanding of ‘flexibility’ and to general design criteria for active enzymes. Preface. Kamal Shukla’s great contribution to science has been his vision that physical sciences could bring new insights to biological sciences, and that the marriage of methodologies, particularly theoretical/computational with experimental, was needed to tackle the complexities of biology. Furthermore, his openness to new methods and different ideas outside the current fad has helped make his vision a reality. In my remarks below, I have not tried to limit myself to projects that I know Kamal had sponsored, nor have I tried to highlight all that he has sponsored. Instead, everything I mention has been influenced directly or indirectly by his efforts. Perhaps the indirect influences are most telling, because they would not have happened without Kamal.
Nonlinear laser scanning microscopy of human vocal folds.
Miri, Amir K; Tripathy, Umakanta; Mongeau, Luc; Wiseman, Paul W
2012-02-01
The purpose of this work was to apply nonlinear laser scanning microscopy (NLSM) for visualizing the morphology of extracellular matrix proteins within human vocal folds. This technique may potentially assist clinicians in making rapid diagnoses of vocal fold tissue disease or damage. Microstructural characterization based on NLSM provides valuable information for better understanding molecular mechanisms and tissue structure. Experimental, ex vivo human vocal fold. A custom-built multimodal nonlinear laser scanning microscope was used to scan fibrillar proteins in three 4% formaldehyde-fixed cadaveric samples. Collagen and elastin, key extracellular matrix proteins in the vocal fold lamina propria, were imaged by two nonlinear microscopy modalities: second harmonic generation (SHG) and two-photon fluorescence (TPF), respectively. An experimental protocol was introduced to characterize the geometrical properties of the imaged fibrous proteins. NLSM revealed the biomorphology of the human vocal fold fibrous proteins. No photobleaching was observed for the incident laser power of ∼60 mW before the excitation objective. Types I and III fibrillar collagen were imaged without label in the tissue by intrinsic SHG. Imaging while rotating the incident laser light-polarization direction confirmed a helical shape for the collagen fibers. The amplitude, periodicity, and overall orientation were then computed for the helically distributed collagen network. The elastin network was simultaneously imaged via TPF and found to have a basket-like structure. In some regions, particularly close to the epithelium, colocalization of both extracellular matrix components were observed. A benchmark study is presented for quantitative real-time, ex vivo, NLSM imaging of the extracellular macromolecules in human vocal fold lamina propria. The results are promising for clinical applications. Copyright © 2011 The American Laryngological, Rhinological, and Otological Society, Inc.
A molecular ensemble in the rER for procollagen maturation.
Ishikawa, Yoshihiro; Bächinger, Hans Peter
2013-11-01
Extracellular matrix (ECM) proteins create structural frameworks in tissues such as bone, skin, tendon and cartilage etc. These connective tissues play important roles in the development and homeostasis of organs. Collagen is the most abundant ECM protein and represents one third of all proteins in humans. The biosynthesis of ECM proteins occurs in the rough endoplasmic reticulum (rER). This review describes the current understanding of the biosynthesis and folding of procollagens, which are the precursor molecules of collagens, in the rER. Multiple folding enzymes and molecular chaperones are required for procollagen to establish specific posttranslational modifications, and facilitate folding and transport to the cell surface. Thus, this molecular ensemble in the rER contributes to ECM maturation and to the development and homeostasis of tissues. Mutations in this ensemble are likely candidates for connective tissue disorders. This article is part of a Special Issue entitled: Functional and structural diversity of endoplasmic reticulum. Copyright © 2013 Elsevier B.V. All rights reserved.
Protein chainmail variants in dsDNA viruses
Zhou, Z. Hong; Chiou, Joshua
2017-01-01
First discovered in bacteriophage HK97, biological chainmail is a highly stable system formed by concatenated protein rings. Each subunit of the ring contains the HK97-like fold, which is characterized by its submarine-like shape with a 5-stranded β sheet in the axial (A) domain, spine helix in the peripheral (P) domain, and an extended (E) loop. HK97 capsid consists of covalently-linked copies of just one HK97-like fold protein and represents the most effective strategy to form highly stable chainmail needed for dsDNA genome encapsidation. Recently, near-atomic resolution structures enabled by cryo electron microscopy (cryoEM) have revealed a range of other, more complex variants of this strategy for constructing dsDNA viruses. The first strategy, exemplified by P22-like phages, is the attachment of an insertional (I) domain to the core 5-stranded β sheet of the HK97-like fold. The atomic models of the Bordetella phage BPP-1 showcases an alternative topology of the classic HK97 topology of the HK97-like fold, as well as the second strategy for constructing stable capsids, where an auxiliary jellyroll protein dimer serves to cement the non-covalent chainmail formed by capsid protein subunits. The third strategy, found in lambda-like phages, uses auxiliary protein trimers to stabilize the underlying non-covalent chainmail near the 3-fold axis. Herpesviruses represent highly complex viruses that use a combination of these strategies, resulting in four-level hierarchical organization including a non-covalent chainmail formed by the HK97-like fold domain found in the floor region. A thorough understanding of these structures should help unlock the enigma of the emergence and evolution of dsDNA viruses and inform bioengineering efforts based on these viruses. PMID:29177192
Mudgal, Richa; Srinivasan, Narayanaswamy; Chandra, Nagasuma
2017-07-01
Functional annotation is seldom straightforward with complexities arising due to functional divergence in protein families or functional convergence between non-homologous protein families, leading to mis-annotations. An enzyme may contain multiple domains and not all domains may be involved in a given function, adding to the complexity in function annotation. To address this, we use binding site information from bound cognate ligands and catalytic residues, since it can help in resolving fold-function relationships at a finer level and with higher confidence. A comprehensive database of 2,020 fold-function-binding site relationships has been systematically generated. A network-based approach is employed to capture the complexity in these relationships, from which different types of associations are deciphered, that identify versatile protein folds performing diverse functions, same function associated with multiple folds and one-to-one relationships. Binding site similarity networks integrated with fold, function, and ligand similarity information are generated to understand the depth of these relationships. Apart from the observed continuity in the functional site space, network properties of these revealed versatile families with topologically different or dissimilar binding sites and structural families that perform very similar functions. As a case study, subtle changes in the active site of a set of evolutionarily related superfamilies are studied using these networks. Tracing of such similarities in evolutionarily related proteins provide clues into the transition and evolution of protein functions. Insights from this study will be helpful in accurate and reliable functional annotations of uncharacterized proteins, poly-pharmacology, and designing enzymes with new functional capabilities. Proteins 2017; 85:1319-1335. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Structural biology of intrinsically disordered proteins: Revisiting unsolved mysteries.
Sigalov, Alexander B
2016-06-01
The emergence of intrinsically disordered proteins (IDPs) has challenged the classical protein structure-function paradigm by introducing a new paradigm of "coupled binding and folding". This paradigm suggests that IDPs fold upon binding to their partners. Further studies, however, revealed a novel and previously unrecognized phenomenon of "uncoupled binding and folding" suggesting that IDPs do not necessarily fold upon interaction with their lipid and protein partners. The complex and often unusual biophysics of IDPs makes structural characterization of these proteins and their complexes not only challenging but often resulting in opposite conclusions. For this reason, some crucial questions in this field remain unsolved for well over a decade. Considering an important role of IDPs in cellular regulation, signaling and control in health and disease, more efforts are needed to solve these mysteries. Here, I focus on two long-standing contradictions in the literature concerning dimerization and membrane-binding activities of IDPs. Molecular explanation of these discrepancies is provided. I also demonstrate how resolution of these critical issues in the field of IDPs results in our expanded understanding of cell function and has multiple applications in biology and medicine. Copyright © 2016 Elsevier B.V. and Société Française de Biochimie et Biologie Moléculaire (SFBBM). All rights reserved.
Effects of mutation, truncation and temperature on the folding kinetics of a WW domain
Maisuradze, Gia G.; Zhou, Rui; Liwo, Adam; Xiao, Yi; Scheraga, Harold A.
2013-01-01
The purpose of this work is to show how mutation, truncation and change of temperature can influence the folding kinetics of a protein. This is accomplished by principal component analysis (PCA) of molecular dynamics (MD)-generated folding trajectories of the triple β-strand WW domain from the Formin binding protein 28 (FBP) [PDB: 1E0L] and its full-size, and singly- and doubly-truncated mutants at temperatures below and very close to the melting point. The reasons for biphasic folding kinetics [i.e., coexistence of slow (three-state) and fast (two-state) phases], including the involvement of a solvent-exposed hydrophobic cluster and another delocalized hydrophobic core in the folding kinetics, are discussed. New folding pathways are identified in free-energy landscapes determined in terms of principal components for full-size mutants. Three-state folding is found to be a main mechanism for folding FBP28 WW domain and most of the full-size and truncated mutants. The results from the theoretical analysis are compared to those from experiment. Agreements and discrepancies between the theoretical and experimental results are discussed. Because of its importance in understanding protein kinetics and function, the diffusive mechanism by which FBP28 WW domain and its full-size and truncated mutants explore their conformational space is examined in terms of the mean-square displacement, (MSD), and PCA eigenvalue spectrum analyses. Subdiffusive behavior is observed for all studied systems. PMID:22560992
Combinatorial pattern discovery approach for the folding trajectory analysis of a beta-hairpin.
Parida, Laxmi; Zhou, Ruhong
2005-06-01
The study of protein folding mechanisms continues to be one of the most challenging problems in computational biology. Currently, the protein folding mechanism is often characterized by calculating the free energy landscape versus various reaction coordinates, such as the fraction of native contacts, the radius of gyration, RMSD from the native structure, and so on. In this paper, we present a combinatorial pattern discovery approach toward understanding the global state changes during the folding process. This is a first step toward an unsupervised (and perhaps eventually automated) approach toward identification of global states. The approach is based on computing biclusters (or patterned clusters)-each cluster is a combination of various reaction coordinates, and its signature pattern facilitates the computation of the Z-score for the cluster. For this discovery process, we present an algorithm of time complexity c in RO((N + nm) log n), where N is the size of the output patterns and (n x m) is the size of the input with n time frames and m reaction coordinates. To date, this is the best time complexity for this problem. We next apply this to a beta-hairpin folding trajectory and demonstrate that this approach extracts crucial information about protein folding intermediate states and mechanism. We make three observations about the approach: (1) The method recovers states previously obtained by visually analyzing free energy surfaces. (2) It also succeeds in extracting meaningful patterns and structures that had been overlooked in previous works, which provides a better understanding of the folding mechanism of the beta-hairpin. These new patterns also interconnect various states in existing free energy surfaces versus different reaction coordinates. (3) The approach does not require calculating the free energy values, yet it offers an analysis comparable to, and sometimes better than, the methods that use free energy landscapes, thus validating the choice of reaction coordinates. (An abstract version of this work was presented at the 2005 Asia Pacific Bioinformatics Conference [1].).
Pokkuluri, P Raj; Dwulit-Smith, Jeff; Duke, Norma E; Wilton, Rosemarie; Mack, Jamey C; Bearden, Jessica; Rakowski, Ella; Babnigg, Gyorgy; Szurmant, Hendrik; Joachimiak, Andrzej; Schiffer, Marianne
2013-01-01
Anaeromyxobacter dehalogenans is a δ-proteobacterium found in diverse soils and sediments. It is of interest in bioremediation efforts due to its dechlorination and metal-reducing capabilities. To gain an understanding on A. dehalogenans' abilities to adapt to diverse environments we analyzed its signal transduction proteins. The A. dehalogenans genome codes for a large number of sensor histidine kinases (HK) and methyl-accepting chemotaxis proteins (MCP); among these 23 HK and 11 MCP proteins have a sensor domain in the periplasm. These proteins most likely contribute to adaptation to the organism's surroundings. We predicted their three-dimensional folds and determined the structures of two of the periplasmic sensor domains by X-ray diffraction. Most of the domains are predicted to have either PAS-like or helical bundle structures, with two predicted to have solute-binding protein fold, and another predicted to have a 6-phosphogluconolactonase like fold. Atomic structures of two sensor domains confirmed the respective fold predictions. The Adeh_2942 sensor (HK) was found to have a helical bundle structure, and the Adeh_3718 sensor (MCP) has a PAS-like structure. Interestingly, the Adeh_3718 sensor has an acetate moiety bound in a binding site typical for PAS-like domains. Future work is needed to determine whether Adeh_3718 is involved in acetate sensing by A. dehalogenans. PMID:23897711
Stabilities and Dynamics of Protein Folding Nuclei by Molecular Dynamics Simulation
NASA Astrophysics Data System (ADS)
Song, Yong-Shun; Zhou, Xin; Zheng, Wei-Mou; Wang, Yan-Ting
2017-07-01
To understand how the stabilities of key nuclei fragments affect protein folding dynamics, we simulate by molecular dynamics (MD) simulation in aqueous solution four fragments cut out of a protein G, including one α-helix (seqB: KVFKQYAN), two β-turns (seqA: LNGKTLKG and seqC: YDDATKTF), and one β-strand (seqD: DGEWTYDD). The Markov State Model clustering method combined with the coarse-grained conformation letters method are employed to analyze the data sampled from 2-μs equilibrium MD simulation trajectories. We find that seqA and seqB have more stable structures than their native structures which become metastable when cut out of the protein structure. As expected, seqD alone is flexible and does not have a stable structure. Throughout our simulations, the native structure of seqC is stable but cannot be reached if starting from a structure other than the native one, implying a funnel-shape free energy landscape of seqC in aqueous solution. All the above results suggest that different nuclei have different formation dynamics during protein folding, which may have a major contribution to the hierarchy of protein folding dynamics. Supported by the National Basic Research Program of China under Grant No. 2013CB932804, the National Natural Science Foundation of China under Grant No. 11421063, and the CAS Biophysics Interdisciplinary Innovation Team Project
Ayuso-Tejedor, Sara; Angarica, Vladimir Espinosa; Bueno, Marta; Campos, Luis A; Abián, Olga; Bernadó, Pau; Sancho, Javier; Jiménez, M Angeles
2010-07-23
Partly unfolded protein conformations close to the native state may play important roles in protein function and in protein misfolding. Structural analyses of such conformations which are essential for their fully physicochemical understanding are complicated by their characteristic low populations at equilibrium. We stabilize here with a single mutation the equilibrium intermediate of apoflavodoxin thermal unfolding and determine its solution structure by NMR. It consists of a large native region identical with that observed in the X-ray structure of the wild-type protein plus an unfolded region. Small-angle X-ray scattering analysis indicates that the calculated ensemble of structures is consistent with the actual degree of expansion of the intermediate. The unfolded region encompasses discontinuous sequence segments that cluster in the 3D structure of the native protein forming the FMN cofactor binding loops and the binding site of a variety of partner proteins. Analysis of the apoflavodoxin inner interfaces reveals that those becoming destabilized in the intermediate are more polar than other inner interfaces of the protein. Natively folded proteins contain hydrophobic cores formed by the packing of hydrophobic surfaces, while natively unfolded proteins are rich in polar residues. The structure of the apoflavodoxin thermal intermediate suggests that the regions of natively folded proteins that are easily responsive to thermal activation may contain cores of intermediate hydrophobicity. Copyright (c) 2010 Elsevier Ltd. All rights reserved.
Re-visiting protein-centric two-tier classification of existing DNA-protein complexes
2012-01-01
Background Precise DNA-protein interactions play most important and vital role in maintaining the normal physiological functioning of the cell, as it controls many high fidelity cellular processes. Detailed study of the nature of these interactions has paved the way for understanding the mechanisms behind the biological processes in which they are involved. Earlier in 2000, a systematic classification of DNA-protein complexes based on the structural analysis of the proteins was proposed at two tiers, namely groups and families. With the advancement in the number and resolution of structures of DNA-protein complexes deposited in the Protein Data Bank, it is important to revisit the existing classification. Results On the basis of the sequence analysis of DNA binding proteins, we have built upon the protein centric, two-tier classification of DNA-protein complexes by adding new members to existing families and making new families and groups. While classifying the new complexes, we also realised the emergence of new groups and families. The new group observed was where β-propeller was seen to interact with DNA. There were 34 SCOP folds which were observed to be present in the complexes of both old and new classifications, whereas 28 folds are present exclusively in the new complexes. Some new families noticed were NarL transcription factor, Z-α DNA binding proteins, Forkhead transcription factor, AP2 protein, Methyl CpG binding protein etc. Conclusions Our results suggest that with the increasing number of availability of DNA-protein complexes in Protein Data Bank, the number of families in the classification increased by approximately three fold. The folds present exclusively in newly classified complexes is suggestive of inclusion of proteins with new function in new classification, the most populated of which are the folds responsible for DNA damage repair. The proposed re-visited classification can be used to perform genome-wide surveys in the genomes of interest for the presence of DNA-binding proteins. Further analysis of these complexes can aid in developing algorithms for identifying DNA-binding proteins and their family members from mere sequence information. PMID:22800292
Re-visiting protein-centric two-tier classification of existing DNA-protein complexes.
Malhotra, Sony; Sowdhamini, Ramanathan
2012-07-16
Precise DNA-protein interactions play most important and vital role in maintaining the normal physiological functioning of the cell, as it controls many high fidelity cellular processes. Detailed study of the nature of these interactions has paved the way for understanding the mechanisms behind the biological processes in which they are involved. Earlier in 2000, a systematic classification of DNA-protein complexes based on the structural analysis of the proteins was proposed at two tiers, namely groups and families. With the advancement in the number and resolution of structures of DNA-protein complexes deposited in the Protein Data Bank, it is important to revisit the existing classification. On the basis of the sequence analysis of DNA binding proteins, we have built upon the protein centric, two-tier classification of DNA-protein complexes by adding new members to existing families and making new families and groups. While classifying the new complexes, we also realised the emergence of new groups and families. The new group observed was where β-propeller was seen to interact with DNA. There were 34 SCOP folds which were observed to be present in the complexes of both old and new classifications, whereas 28 folds are present exclusively in the new complexes. Some new families noticed were NarL transcription factor, Z-α DNA binding proteins, Forkhead transcription factor, AP2 protein, Methyl CpG binding protein etc. Our results suggest that with the increasing number of availability of DNA-protein complexes in Protein Data Bank, the number of families in the classification increased by approximately three fold. The folds present exclusively in newly classified complexes is suggestive of inclusion of proteins with new function in new classification, the most populated of which are the folds responsible for DNA damage repair. The proposed re-visited classification can be used to perform genome-wide surveys in the genomes of interest for the presence of DNA-binding proteins. Further analysis of these complexes can aid in developing algorithms for identifying DNA-binding proteins and their family members from mere sequence information.
Sen, Siddhartha; Voorheis, H Paul
2014-12-21
The mechanism of protein folding during early stages of the process has three determinants. First, moving water molecules obey the rules of low Reynolds number physics without an inertial component. Molecular movement is instantaneous and size insensitive. Proteins emerging from the ribosome move and rotate without an external force if they change shape, forming and propagating helical structures that increases translocational efficiency. Forward motion ceases when the shape change or propelling force ceases. Second, application of quantum field theory to water structure predicts the spontaneous formation of low density coherent units of fixed size that expel dissolved atmospheric gases. Structured water layers with both coherent and non-coherent domains, form a sheath around the new protein. The surface of exposed hydrophobic amino acids is protected from water contact by small nanobubbles of dissolved atmospheric gases, 5 or 6 molecules on average, that vibrate, attracting even widely separated resonating nanobubbles. This force results from quantum effects, appearing only when the system is within and interacts with an oscillating electromagnetic field. The newly recognized quantum force sharply bends the peptide and is part of a dynamic field determining the pathway of protein folding. Third, the force initiating the tertiary folding of proteins arises from twists at the position of each hydrophobic amino acid, that minimizes surface exposure of the hydrophobic amino acids and propagates along the protein. When the total bend reaches 360°, the leading segment of water sheath intersects the trailing segment. This steric self-intersection expels water from overlapping segments of the sheath and by Newton׳s second law moves the polypeptide chain in an opposite direction. Consequently, with very few exceptions that we enumerate and discuss, tertiary structures are absent from proteins without hydrophobic amino acids, which control the early stages of protein folding and the overall shape of protein. Consequently, proteins only adopt a limited number of forms. The formation of quaternary structures is not necessarily prevented by the absence of hydrophobic amino acids. Copyright © 2014 Elsevier Ltd. All rights reserved.
Protein Design Using Unnatural Amino Acids
NASA Astrophysics Data System (ADS)
Bilgiçer, Basar; Kumar, Krishna
2003-11-01
With the increasing availability of whole organism genome sequences, understanding protein structure and function is of capital importance. Recent developments in the methodology of incorporation of unnatural amino acids into proteins allow the exploration of proteins at a very detailed level. Furthermore, de novo design of novel protein structures and function is feasible with unprecedented sophistication. Using examples from the literature, this article describes the available methods for unnatural amino acid incorporation and highlights some recent applications including the design of hyperstable protein folds.
In-cell thermodynamics and a new role for protein surfaces.
Smith, Austin E; Zhou, Larry Z; Gorensek, Annelise H; Senske, Michael; Pielak, Gary J
2016-02-16
There is abundant, physiologically relevant knowledge about protein cores; they are hydrophobic, exquisitely well packed, and nearly all hydrogen bonds are satisfied. An equivalent understanding of protein surfaces has remained elusive because proteins are almost exclusively studied in vitro in simple aqueous solutions. Here, we establish the essential physiological roles played by protein surfaces by measuring the equilibrium thermodynamics and kinetics of protein folding in the complex environment of living Escherichia coli cells, and under physiologically relevant in vitro conditions. Fluorine NMR data on the 7-kDa globular N-terminal SH3 domain of Drosophila signal transduction protein drk (SH3) show that charge-charge interactions are fundamental to protein stability and folding kinetics in cells. Our results contradict predictions from accepted theories of macromolecular crowding and show that cosolutes commonly used to mimic the cellular interior do not yield physiologically relevant information. As such, we provide the foundation for a complete picture of protein chemistry in cells.
LANL Transfers Glowing Bio Technology to Sandia Biotech
Nakhla, Tony; Pino, Tony; Hadley, David
2018-03-02
Partnering with Los Alamos National Laboratory, an Albuquerque-based company is seeking to transform the way protein and peptide analysis is conducted around the world. Sandia Biotech is using a biological technology licensed from Los Alamos called split green fluorescent protein (sGFP), as a detecting and tracking tool for the protein and peptide industry, valuable in the fields of Alzheimer's research, drug development and other biotechnology fields using protein folding to understand protein expression and mechanisms of action.
LANL Transfers Glowing Bio Technology to Sandia Biotech
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nakhla, Tony; Pino, Tony; Hadley, David
2012-05-21
Partnering with Los Alamos National Laboratory, an Albuquerque-based company is seeking to transform the way protein and peptide analysis is conducted around the world. Sandia Biotech is using a biological technology licensed from Los Alamos called split green fluorescent protein (sGFP), as a detecting and tracking tool for the protein and peptide industry, valuable in the fields of Alzheimer's research, drug development and other biotechnology fields using protein folding to understand protein expression and mechanisms of action.
A Proteomic Analysis of the Upper and Lower Flanks of the Base of Rice Shoot in the Gravitropism
NASA Astrophysics Data System (ADS)
Hu, Liwei; Chen, Haiying; Dou, Xianying; Jin, Jing; Sun, Weining; Cai, Weiming
2015-11-01
Due to gravitational stimulation, the lower part of a shoot base grows faster than the upper part, leading the shoot to curve upward. Though much research has been done on the mechanism of plant gravitropism, it still requires extensive elucidation. Recently, functional genomic strategies have been applied to study this mechanism in plants. The present study carried out a proteomic analysis to gain a better understanding of gravity stimulation in rice. Three-week-old rice seedlings were gravitropically stimulated and samples were harvested at 4 different time points: 0.5, 3, 6, and 9 h. Then, the total crude proteins were extracted from the lower and upper parts of the shoot base, separated by 2-DE, and silver stained. At each time point, proteins in the lower and upper parts were compared, and the differently expressed proteins were identified using MALDI TOF or ESI-MS/MS. After gravity stimulation, proteins involved in nine different functional categories were either up-regulated or down-regulated. Sugar metabolism, glycolysis, the tricarboxylic acid (TCA/citric) cycle, pyruvate metabolism, and transcription regulation-related proteins were regulated. Although the initiation of defense reactions mainly occurred in roots, some different defense mechanisms were also evoked in the aerial tissues. Interestingly, the abundance of some proteins changed drastically at only 0.5 h after reorientation: inosine monophosphate dehydrogenase (up to 6.49-fold higher in lower flanks at 0.5 h), ATP synthase D (4.25-fold), and ribulose-1,5 -bisphosphate carboxylase oxygenase (3.62-fold). These findings may aid in understanding the mechanism of the gravitropism.
Malfolded Protein Structure and Proteostasis in Lung Diseases
Balch, William E.; Sznajder, Jacob I.; Budinger, Scott; Finley, Daniel; Laposky, Aaron D.; Cuervo, Ana Maria; Benjamin, Ivor J.; Barreiro, Esther; Morimoto, Richard I.; Postow, Lisa; Weissman, Allan M.; Gail, Dorothy; Banks-Schlegel, Susan; Croxton, Thomas
2014-01-01
Recent discoveries indicate that disorders of protein folding and degradation play a particularly important role in the development of lung diseases and their associated complications. The overarching purpose of the National Heart, Lung, and Blood Institute workshop on “Malformed Protein Structure and Proteostasis in Lung Diseases” was to identify mechanistic and clinical research opportunities indicated by these recent discoveries in proteostasis science that will advance our molecular understanding of lung pathobiology and facilitate the development of new diagnostic and therapeutic strategies for the prevention and treatment of lung disease. The workshop's discussion focused on identifying gaps in scientific knowledge with respect to proteostasis and lung disease, discussing new research advances and opportunities in protein folding science, and highlighting novel technologies with potential therapeutic applications for diagnosis and treatment. PMID:24033344
Malfolded protein structure and proteostasis in lung diseases.
Balch, William E; Sznajder, Jacob I; Budinger, Scott; Finley, Daniel; Laposky, Aaron D; Cuervo, Ana Maria; Benjamin, Ivor J; Barreiro, Esther; Morimoto, Richard I; Postow, Lisa; Weissman, Allan M; Gail, Dorothy; Banks-Schlegel, Susan; Croxton, Thomas; Gan, Weiniu
2014-01-01
Recent discoveries indicate that disorders of protein folding and degradation play a particularly important role in the development of lung diseases and their associated complications. The overarching purpose of the National Heart, Lung, and Blood Institute workshop on "Malformed Protein Structure and Proteostasis in Lung Diseases" was to identify mechanistic and clinical research opportunities indicated by these recent discoveries in proteostasis science that will advance our molecular understanding of lung pathobiology and facilitate the development of new diagnostic and therapeutic strategies for the prevention and treatment of lung disease. The workshop's discussion focused on identifying gaps in scientific knowledge with respect to proteostasis and lung disease, discussing new research advances and opportunities in protein folding science, and highlighting novel technologies with potential therapeutic applications for diagnosis and treatment.
Current Understanding and Future Directions for Vocal Fold Mechanobiology
Li, Nicole Y.K.; Heris, Hossein K.; Mongeau, Luc
2013-01-01
The vocal folds, which are located in the larynx, are the main organ of voice production for human communication. The vocal folds are under continuous biomechanical stress similar to other mechanically active organs, such as the heart, lungs, tendons and muscles. During speech and singing, the vocal folds oscillate at frequencies ranging from 20 Hz to 3 kHz with amplitudes of a few millimeters. The biomechanical stress associated with accumulated phonation is believed to alter vocal fold cell activity and tissue structure in many ways. Excessive phonatory stress can damage tissue structure and induce a cell-mediated inflammatory response, resulting in a pathological vocal fold lesion. On the other hand, phonatory stress is one major factor in the maturation of the vocal folds into a specialized tri-layer structure. One specific form of vocal fold oscillation, which involves low impact and large amplitude excursion, is prescribed therapeutically for patients with mild vocal fold injuries. Although biomechanical forces affect vocal fold physiology and pathology, there is little understanding of how mechanical forces regulate these processes at the cellular and molecular level. Research into vocal fold mechanobiology has burgeoned over the past several years. Vocal fold bioreactors are being developed in several laboratories to provide a biomimic environment that allows the systematic manipulation of physical and biological factors on the cells of interest in vitro. Computer models have been used to simulate the integrated response of cells and proteins as a function of phonation stress. The purpose of this paper is to review current research on the mechanobiology of the vocal folds as it relates to growth, pathogenesis and treatment as well as to propose specific research directions that will advance our understanding of this subject. PMID:24812638
Bellapadrona, Giuliano; Stefanini, Simonetta; Zamparelli, Carlotta; Theil, Elizabeth C; Chiancone, Emilia
2009-07-10
Elucidating pore function at the 3-fold channels of 12-subunit, microbial Dps proteins is important in understanding their role in the management of iron/hydrogen peroxide. The Dps pores are called "ferritin-like" because of the structural resemblance to the 3-fold channels of 24-subunit ferritins used for iron entry and exit to and from the protein cage. In ferritins, negatively charged residues lining the pores generate a negative electrostatic gradient that guides iron ions toward the ferroxidase centers for catalysis with oxidant and destined for the mineralization cavity. To establish whether the set of three aspartate residues that line the pores in Listeria innocua Dps act in a similar fashion, D121N, D126N, D130N, and D121N/D126N/D130N proteins were produced; kinetics of iron uptake/release and the size distribution of the iron mineral in the protein cavity were compared. The results, discussed in the framework of crystal growth in a confined space, indicate that iron uses the hydrophilic 3-fold pores to traverse the protein shell. For the first time, the strength of the electrostatic potential is observed to modulate kinetic cooperativity in the iron uptake/release processes and accordingly the size distribution of the microcrystalline iron minerals in the Dps protein population.
Pey, Angel L.; Albert, Armando; Salido, Eduardo
2013-01-01
Alanine-glyoxylate aminotransferase catalyzes the transamination between L-alanine and glyoxylate to produce pyruvate and glycine using pyridoxal 5′-phosphate (PLP) as cofactor. Human alanine-glyoxylate aminotransferase is a peroxisomal enzyme expressed in the hepatocytes, the main site of glyoxylate detoxification. Its deficit causes primary hyperoxaluria type I, a rare but severe inborn error of metabolism. Single amino acid changes are the main type of mutation causing this disease, and considerable effort has been dedicated to the understanding of the molecular consequences of such missense mutations. In this review, we summarize the role of protein homeostasis in the basic mechanisms of primary hyperoxaluria. Intrinsic physicochemical properties of polypeptide chains such as thermodynamic stability, folding, unfolding, and misfolding rates as well as the interaction of different folding states with protein homeostasis networks are essential to understand this disease. The view presented has important implications for the development of new therapeutic strategies based on targeting specific elements of alanine-glyoxylate aminotransferase homeostasis. PMID:23956997
Miyazaki, Ryoji; Myougo, Naomi; Mori, Hiroyuki; Akiyama, Yoshinori
2018-01-12
Many proteins form multimeric complexes that play crucial roles in various cellular processes. Studying how proteins are correctly folded and assembled into such complexes in a living cell is important for understanding the physiological roles and the qualitative and quantitative regulation of the complex. However, few methods are suitable for analyzing these rapidly occurring processes. Site-directed in vivo photo-cross-linking is an elegant technique that enables analysis of protein-protein interactions in living cells with high spatial resolution. However, the conventional site-directed in vivo photo-cross-linking method is unsuitable for analyzing dynamic processes. Here, by combining an improved site-directed in vivo photo-cross-linking technique with a pulse-chase approach, we developed a new method that can analyze the folding and assembly of a newly synthesized protein with high spatiotemporal resolution. We demonstrate that this method, named the pulse-chase and in vivo photo-cross-linking experiment (PiXie), enables the kinetic analysis of the formation of an Escherichia coli periplasmic (soluble) protein complex (PhoA). We also used our new technique to investigate assembly/folding processes of two membrane complexes (SecD-SecF in the inner membrane and LptD-LptE in the outer membrane), which provided new insights into the biogenesis of these complexes. Our PiXie method permits analysis of the dynamic behavior of various proteins and enables examination of protein-protein interactions at the level of individual amino acid residues. We anticipate that our new technique will have valuable utility for studies of protein dynamics in many organisms. © 2018 by The American Society for Biochemistry and Molecular Biology, Inc.
Similar folds with different stabilization mechanisms: the cases of prion and doppel proteins
Colacino, Stefano; Tiana, Guido; Colombo, Giorgio
2006-01-01
Background Protein misfolding is the main cause of a group of fatal neurodegenerative diseases in humans and animals. In particular, in Prion-related diseases the normal cellular form of the Prion Protein PrP (PrPC) is converted into the infectious PrPSc through a conformational process during which it acquires a high β-sheet content. Doppel is a protein that shares a similar native fold, but lacks the scrapie isoform. Understanding the molecular determinants of these different behaviours is important both for biomedical and biophysical research. Results In this paper, the dynamical and energetic properties of the two proteins in solution is comparatively analyzed by means of long time scale explicit solvent, all-atom molecular dynamics in different temperature conditions. The trajectories are analyzed by means of a recently introduced energy decomposition approach (Tiana et al, Prot. Sci. 2004) aimed at identifying the key residues for the stabilization and folding of the protein. Our analysis shows that Prion and Doppel have two different cores stabilizing the native state and that the relative contribution of the nucleus to the global stability of the protein for Doppel is sensitively higher than for PrP. Moreover, under misfolding conditions the Doppel core is conserved, while the energy stabilization network of PrP is disrupted. Conclusion These observations suggest that different sequences can share similar native topology with different stabilizing interactions and that the sequences of the Prion and Doppel proteins may have diverged under different evolutionary constraints resulting in different folding and stabilization mechanisms. PMID:16857062
Principles of Protein Recognition and Properties of Protein-protein Interfaces
NASA Astrophysics Data System (ADS)
Keskin, Ozlem; Gursoy, Attila; Nussinov, Ruth
In this chapter we address two aspects - the static physical interactions which allow the information transfer for the function to be performed; and the dynamic, i.e. how the information is transmitted between the binding sites in the single protein molecule and in the network. We describe the single protein molecules and their complexes; and the analogy between protein folding and protein binding. Eventually, to fully understand the interactome and how it performs the essential cellular functions, we have to put all together - and hierarchically progress through the system.
Electrostatically Accelerated Coupled Binding and Folding of Intrinsically Disordered Proteins
Ganguly, Debabani; Otieno, Steve; Waddell, Brett; Iconaru, Luigi; Kriwacki, Richard W.; Chen, Jianhan
2012-01-01
Intrinsically disordered proteins (IDPs) are now recognized to be prevalent in biology, and many potential functional benefits have been discussed. However, the frequent requirement of peptide folding in specific interactions of IDPs could impose a kinetic bottleneck, which could be overcome only by efficient folding upon encounter. Intriguingly, existing kinetic data suggest that specific binding of IDPs is generally no slower than that of globular proteins. Here, we exploited the cell cycle regulator p27Kip1 (p27) as a model system to understand how IDPs might achieve efficient folding upon encounter for facile recognition. Combining experiments and coarse-grained modeling, we demonstrate that long-range electrostatic interactions between enriched charges on p27 and near its binding site on cyclin A not only enhance the encounter rate (i.e., electrostatic steering), but also promote folding-competent topologies in the encounter complexes, allowing rapid subsequent formation of short-range native interactions en route to the specific complex. In contrast, nonspecific hydrophobic interactions, while hardly affecting the encounter rate, can significantly reduce the efficiency of folding upon encounter and lead to slower binding kinetics. Further analysis of charge distributions in a set of known IDP complexes reveals that, although IDP binding sites tend to be more hydrophobic compared to the rest of the target surface, their vicinities are frequently enriched with charges to complement those on IDPs. This observation suggests that electrostatically accelerated encounter and induced folding might represent a prevalent mechanism for promoting facile IDP recognition. PMID:22721951
Changes in serum proteins after endotoxin administration in healthy and choline-treated calves.
Yilmaz, Z; Eralp Inan, O; Kocaturk, M; Baykal, A T; Hacariz, O; Hatipoglu, I; Tvarijonaviciute, A; Cansev, M; Ceron, J; Ulus, I H
2016-09-20
This study aimed to investigate the possible serum protein changes after endotoxin administration in healthy and choline-treated calves using proteomics. These results are expected to contribute to the understanding of the pathophysiological mechanisms of endotoxemia and the beneficial effect of choline administration in this clinical situation. Healthy-calves (n = 20) were divided into 4 groups: Control, Choline treated (C), Lipopolysaccharide administered (LPS), and LPS + C. Control calves received 0.9 % NaCl injection. Calves in C and LPS + C groups received choline chloride (1 mg/kg/iv). Endotoxin (LPS) was injected (2 μg/kg/iv) to the calves in LPS and LPS + C groups. Serum samples were collected before and after the treatments. Differentially expressed proteins (> 1.5 fold-change relative to controls) were identified by LC-MS/MS. After LPS administration, 14 proteins increased, and 13 proteins decreased within 48 h as compared to controls. In the LPS group, there were significant increases in serum levels of ragulator complex protein (189-fold) and galectin-3-binding protein (10-fold), but transcription factor MafF and corticosteroid binding globulin were down regulated (≥ 5 fold). As compared with the LPS group, in LPS + C group, fibrinogen gamma-B-chain and antithrombin were up-regulated, while hemopexin and histone H4 were down-regulated. Choline treatment attenuated actin alpha cardiac muscle-1 overexpression after LPS. LPS administration produces changes in serum proteins associated with lipid metabolism, immune and inflammatory response, protein binding/transport, cell adhesion, venous thrombosis, cardiac contractility and blood coagulation. The administration of choline is associated with changes in proteins which can be related with its beneficial effect in this clinical situation.
2017-01-01
Recent advances in understanding protein folding have benefitted from coarse-grained representations of protein structures. Empirical energy functions derived from these techniques occasionally succeed in distinguishing native structures from their corresponding ensembles of nonnative folds or decoys which display varying degrees of structural dissimilarity to the native proteins. Here we utilized atomic coordinates of single protein chains, comprising a large diverse training set, to develop and evaluate twelve all-atom four-body statistical potentials obtained by exploring alternative values for a pair of inherent parameters. Delaunay tessellation was performed on the atomic coordinates of each protein to objectively identify all quadruplets of interacting atoms, and atomic potentials were generated via statistical analysis of the data and implementation of the inverted Boltzmann principle. Our potentials were evaluated using benchmarking datasets from Decoys-‘R'-Us, and comparisons were made with twelve other physics- and knowledge-based potentials. Ranking 3rd, our best potential tied CHARMM19 and surpassed AMBER force field potentials. We illustrate how a generalized version of our potential can be used to empirically calculate binding energies for target-ligand complexes, using HIV-1 protease-inhibitor complexes for a practical application. The combined results suggest an accurate and efficient atomic four-body statistical potential for protein structure prediction and assessment. PMID:29119109
Heat Capacity Changes Associated with Nucleic Acid Folding
Mikulecky, Peter J.; Feig, Andrew L.
2008-01-01
Whereas heat capacity changes (ΔCPs) associated with folding transitions are commonplace in the literature of protein folding, they have long been considered a minor energetic contributor in nucleic acid folding. Recent advances in the understanding of nucleic acid folding and improved technology for measuring the energetics of folding transitions have allowed a greater experimental window for measuring these effects. We present in this review a survey of current literature that confronts the issue of ΔCPs associated with nucleic acid folding transitions. This work helps to gather the molecular insights that can be gleaned from analysis of ΔCPs and points toward the challenges that will need to be overcome if the energetic contribution of ΔCP terms are to be put to use in improving free energy calculations for nucleic acid structure prediction. PMID:16429398
Biomolecular Dynamics: Order-Disorder Transitions and Energy Landscapes
Whitford, Paul C.; Sanbonmatsu, Karissa Y.; Onuchic, José N.
2013-01-01
While the energy landscape theory of protein folding is now a widely accepted view for understanding how relatively-weak molecular interactions lead to rapid and cooperative protein folding, such a framework must be extended to describe the large-scale functional motions observed in molecular machines. In this review, we discuss 1) the development of the energy landscape theory of biomolecular folding, 2) recent advances towards establishing a consistent understanding of folding and function, and 3) emerging themes in the functional motions of enzymes, biomolecular motors, and other biomolecular machines. Recent theoretical, computational, and experimental lines of investigation are providing a very dynamic picture of biomolecular motion. In contrast to earlier ideas, where molecular machines were thought to function similarly to macroscopic machines, with rigid components that move along a few degrees of freedom in a deterministic fashion, biomolecular complexes are only marginally stable. Since the stabilizing contribution of each atomic interaction is on the order of the thermal fluctuations in solution, the rigid body description of molecular function must be revisited. An emerging theme is that functional motions encompass order-disorder transitions and structural flexibility provide significant contributions to the free-energy. In this review, we describe the biological importance of order-disorder transitions and discuss the statistical-mechanical foundation of theoretical approaches that can characterize such transitions. PMID:22790780
Understanding curcumin-induced modulation of protein aggregation.
Ahmad, Basir; Borana, Mohanish S; Chaudhary, Ankur P
2017-07-01
Curcumin, a diarylheptanoid compound, found in spice turmeric is known to alter the aggregation of proteins and reduce the toxicity of the aggregates. This review looks at the molecular basis of modulating protein aggregation and toxicity of the aggregates. Foremost, we identify the interaction of curcumin and its derivatives with proteins/peptides and the effect of their interaction on the conformational stability and unfolding/folding pathway(s). The unfolding/folding processes generate partially folded/unfolded intermediate, which serve as aggregation precursor state. Secondly, we discuss the effect of curcumin binding on the kinetics parameters of the aggregation process, which give information about the mechanism of the aggregation inhibition. We describe, in addition, that curcumin can accelerate/promote fibril formation by binding to oligomeric intermediate(s) accumulated in the aggregation pathway. Finally, we discuss the correlation of curcumin-induced monomeric and/or oligomeric precursor states with aggregate structure and toxicity. On the basis of these discussions, we propose a model describing curcumin-induced inhibition/promotion of formation of amyloid-like fibrils. Copyright © 2016 Elsevier B.V. All rights reserved.
Vocal fold ion transport and mucin expression following acrolein exposure
Levendoski, Elizabeth Erickson; Sivasankar, M. Preeti
2014-01-01
The vocal fold epithelium is exposed to inhaled particulates including pollutants during breathing in everyday environments. Yet, our understanding of the effects of pollutants on vocal fold epithelial function is extremely limited. The objective of this study was to investigate the effect of the pollutant acrolein on two vocal fold epithelial mechanisms: ion transport and mucin synthesis. These mechanisms were chosen as each plays a critical role in vocal defense and in maintaining surface hydration which is necessary for optimal voice production. Healthy, native porcine vocal folds (N=85) were excised and exposed to an acrolein or sham challenge. A 60 minute acrolein, but not sham challenge significantly reduced ion transport and inhibited cyclic adenosine monophosphate-dependent increases in ion transport. Decreases in ion transport were associated with reduced sodium absorption. Within the same timeline, no significant acrolein-induced changes in mucin gene or protein expression were observed. These results improve our understanding of the effects of acrolein on key vocal fold epithelial functions and inform the development of future investigations that seek to elucidate the impact of a wide range of pollutant exposures on vocal fold health. PMID:24648011
Folding superfunnel to describe cooperative folding of interacting proteins.
Smeller, László
2016-07-01
This paper proposes a generalization of the well-known folding funnel concept of proteins. In the funnel model the polypeptide chain is treated as an individual object not interacting with other proteins. Since biological systems are considerably crowded, protein-protein interaction is a fundamental feature during the life cycle of proteins. The folding superfunnel proposed here describes the folding process of interacting proteins in various situations. The first example discussed is the folding of the freshly synthesized protein with the aid of chaperones. Another important aspect of protein-protein interactions is the folding of the recently characterized intrinsically disordered proteins, where binding to target proteins plays a crucial role in the completion of the folding process. The third scenario where the folding superfunnel is used is the formation of aggregates from destabilized proteins, which is an important factor in case of several conformational diseases. The folding superfunnel constructed here with the minimal assumption about the interaction potential explains all three cases mentioned above. Proteins 2016; 84:1009-1016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Sinha, Pallavi; Pazhamala, Lekha T.; Singh, Vikas K.; Saxena, Rachit K.; Krishnamurthy, L.; Azam, Sarwar; Khan, Aamir W.; Varshney, Rajeev K.
2016-01-01
Pigeonpea is a resilient crop, which is relatively more drought tolerant than many other legume crops. To understand the molecular mechanisms of this unique feature of pigeonpea, 51 genes were selected using the Hidden Markov Models (HMM) those codes for proteins having close similarity to universal stress protein domain. Validation of these genes was conducted on three pigeonpea genotypes (ICPL 151, ICPL 8755, and ICPL 227) having different levels of drought tolerance. Gene expression analysis using qRT-PCR revealed 6, 8, and 18 genes to be ≥2-fold differentially expressed in ICPL 151, ICPL 8755, and ICPL 227, respectively. A total of 10 differentially expressed genes showed ≥2-fold up-regulation in the more drought tolerant genotype, which encoded four different classes of proteins. These include plant U-box protein (four genes), universal stress protein A-like protein (four genes), cation/H(+) antiporter protein (one gene) and an uncharacterized protein (one gene). Genes C.cajan_29830 and C.cajan_33874 belonging to uspA, were found significantly expressed in all the three genotypes with ≥2-fold expression variations. Expression profiling of these two genes on the four other legume crops revealed their specific role in pigeonpea. Therefore, these genes seem to be promising candidates for conferring drought tolerance specifically to pigeonpea. PMID:26779199
Impact of hydrodynamic interactions on protein folding rates depends on temperature
NASA Astrophysics Data System (ADS)
Zegarra, Fabio C.; Homouz, Dirar; Eliaz, Yossi; Gasic, Andrei G.; Cheung, Margaret S.
2018-03-01
We investigated the impact of hydrodynamic interactions (HI) on protein folding using a coarse-grained model. The extent of the impact of hydrodynamic interactions, whether it accelerates, retards, or has no effect on protein folding, has been controversial. Together with a theoretical framework of the energy landscape theory (ELT) for protein folding that describes the dynamics of the collective motion with a single reaction coordinate across a folding barrier, we compared the kinetic effects of HI on the folding rates of two protein models that use a chain of single beads with distinctive topologies: a 64-residue α /β chymotrypsin inhibitor 2 (CI2) protein, and a 57-residue β -barrel α -spectrin Src-homology 3 domain (SH3) protein. When comparing the protein folding kinetics simulated with Brownian dynamics in the presence of HI to that in the absence of HI, we find that the effect of HI on protein folding appears to have a "crossover" behavior about the folding temperature. This means that at a temperature greater than the folding temperature, the enhanced friction from the hydrodynamic solvents between the beads in an unfolded configuration results in lowered folding rate; conversely, at a temperature lower than the folding temperature, HI accelerates folding by the backflow of solvent toward the folded configuration of a protein. Additionally, the extent of acceleration depends on the topology of a protein: for a protein like CI2, where its folding nucleus is rather diffuse in a transition state, HI channels the formation of contacts by favoring a major folding pathway in a complex free energy landscape, thus accelerating folding. For a protein like SH3, where its folding nucleus is already specific and less diffuse, HI matters less at a temperature lower than the folding temperature. Our findings provide further theoretical insight to protein folding kinetic experiments and simulations.
On the Role of Entropy in the Protein Folding Process
NASA Astrophysics Data System (ADS)
Hoppe, Travis
2011-12-01
A protein's ultimate function and activity is determined by the unique three-dimensional structure taken by the folding process. Protein malfunction due to misfolding is the culprit of many clinical disorders, such as abnormal protein aggregations. This leads to neurodegenerative disorders like Huntington's and Alzheimer's disease. We focus on a subset of the folding problem, exploring the role and effects of entropy on the process of protein folding. Four major concepts and models are developed and each pertains to a specific aspect of the folding process: entropic forces, conformational states under crowding, aggregation, and macrostate kinetics from microstate trajectories. The exclusive focus on entropy is well-suited for crowding studies, as many interactions are nonspecific. We show how a stabilizing entropic force can arise purely from the motion of crowders in solution. In addition we are able to make a a quantitative prediction of the crowding effect with an implicit crowding approximation using an aspherical scaled-particle theory. In order to investigate the effects of aggregation, we derive a new operator expansion method to solve the Ising/Potts model with external fields over an arbitrary graph. Here the external fields are representative of the entropic forces. We show that this method reduces the problem of calculating the partition function to the solution of recursion relations. Many of the methods employed are coarse-grained approximations. As such, it is useful to have a viable method for extracting macrostate information from time series data. We develop a method to cluster the microstates into physically meaningful macrostates by grouping similar relaxation times from a transition matrix. Overall, the studied topics allow us to understand deeper the complicated process involving proteins.
Solis, Armando D
2015-12-01
To reduce complexity, understand generalized rules of protein folding, and facilitate de novo protein design, the 20-letter amino acid alphabet is commonly reduced to a smaller alphabet by clustering amino acids based on some measure of similarity. In this work, we seek the optimal alphabet that preserves as much of the structural information found in long-range (contact) interactions among amino acids in natively-folded proteins. We employ the Information Maximization Device, based on information theory, to partition the amino acids into well-defined clusters. Numbering from 2 to 19 groups, these optimal clusters of amino acids, while generated automatically, embody well-known properties of amino acids such as hydrophobicity/polarity, charge, size, and aromaticity, and are demonstrated to maintain the discriminative power of long-range interactions with minimal loss of mutual information. Our measurements suggest that reduced alphabets (of less than 10) are able to capture virtually all of the information residing in native contacts and may be sufficient for fold recognition, as demonstrated by extensive threading tests. In an expansive survey of the literature, we observe that alphabets derived from various approaches-including those derived from physicochemical intuition, local structure considerations, and sequence alignments of remote homologs-fare consistently well in preserving contact interaction information, highlighting a convergence in the various factors thought to be relevant to the folding code. Moreover, we find that alphabets commonly used in experimental protein design are nearly optimal and are largely coherent with observations that have arisen in this work. © 2015 Wiley Periodicals, Inc.
Wu, Cheng-Wei; Biggar, Kyle K; Zhang, Jing; Tessier, Shannon N; Pifferi, Fabien; Perret, Martine; Storey, Kenneth B
2015-04-01
A natural tolerance of various environmental stresses is typically supported by various cytoprotective mechanisms that protect macromolecules and promote extended viability. Among these are antioxidant defenses that help to limit damage from reactive oxygen species and chaperones that help to minimize protein misfolding or unfolding under stress conditions. To understand the molecular mechanisms that act to protect cells during primate torpor, the present study characterizes antioxidant and heat shock protein (HSP) responses in various organs of control (aroused) and torpid gray mouse lemurs, Microcebus murinus. Protein expression of HSP70 and HSP90α was elevated to 1.26 and 1.49 fold, respectively, in brown adipose tissue during torpor as compared with control animals, whereas HSP60 in liver of torpid animals was 1.15 fold of that in control (P<0.05). Among antioxidant enzymes, protein levels of thioredoxin 1 were elevated to 2.19 fold in white adipose tissue during torpor, whereas Cu-Zn superoxide dismutase 1 levels rose to 1.1 fold in skeletal muscle (P<0.05). Additionally, total antioxidant capacity was increased to 1.6 fold in liver during torpor (P<0.05), while remaining unchanged in the five other tissues. Overall, our data suggest that antioxidant and HSP responses are modified in a tissue-specific manner during daily torpor in gray mouse lemurs. Furthermore, our data also show that cytoprotective strategies employed during primate torpor are distinct from the strategies in rodent hibernation as reported in previous studies. Copyright © 2015. Production and hosting by Elsevier Ltd.
Group II chaperonins: new TRiC(k)s and turns of a protein folding machine.
Gutsche, I; Essen, L O; Baumeister, W
1999-10-22
In the past decade, the eubacterial group I chaperonin GroEL became the paradigm of a protein folding machine. More recently, electron microscopy and X-ray crystallography offered insights into the structure of the thermosome, the archetype of the group II chaperonins which also comprise the chaperonin from the eukaryotic cytosol TRiC. Some structural differences from GroEL were revealed, namely the existence of a built-in lid provided by the helical protrusions of the apical domains instead of a GroES-like co-chaperonin. These structural studies provide a framework for understanding the differences in the mode of action between the group II and the group I chaperonins. In vitro analyses of the folding of non-native substrates coupled to ATP binding and hydrolysis are progressing towards establishing a functional cycle for group II chaperonins. A protein complex called GimC/prefoldin has recently been found to cooperate with TRiC in vivo, and its characterization is under way. Copyright 1999 Academic Press.
Structure of the human TRiC/CCT Subunit 5 associated with hereditary sensory neuropathy
Pereira, Jose H.; McAndrew, Ryan P.; Sergeeva, Oksana A.; ...
2017-06-16
The human chaperonin TRiC consists of eight non-identical subunits, and its protein-folding activity is critical for cellular health. Misfolded proteins are associated with many human diseases, such as amyloid diseases, cancer, and neuropathies, making TRiC a potential therapeutic target. A detailed structural understanding of its ATP-dependent folding mechanism and substrate recognition is therefore of great importance. Of particular health-related interest is the mutation Histidine 147 to Arginine (H147R) in human TRiC subunit 5 (CCT5), which has been associated with hereditary sensory neuropathy. In this paper, we describe the crystal structures of CCT5 and the CCT5-H147R mutant, which provide important structuralmore » information for this vital protein-folding machine in humans. This first X-ray crystallographic study of a single human CCT subunit in the context of a hexadecameric complex can be expanded in the future to the other 7 subunits that form the TRiC complex.« less
Structure of the human TRiC/CCT Subunit 5 associated with hereditary sensory neuropathy.
Pereira, Jose H; McAndrew, Ryan P; Sergeeva, Oksana A; Ralston, Corie Y; King, Jonathan A; Adams, Paul D
2017-06-16
The human chaperonin TRiC consists of eight non-identical subunits, and its protein-folding activity is critical for cellular health. Misfolded proteins are associated with many human diseases, such as amyloid diseases, cancer, and neuropathies, making TRiC a potential therapeutic target. A detailed structural understanding of its ATP-dependent folding mechanism and substrate recognition is therefore of great importance. Of particular health-related interest is the mutation Histidine 147 to Arginine (H147R) in human TRiC subunit 5 (CCT5), which has been associated with hereditary sensory neuropathy. In this paper, we describe the crystal structures of CCT5 and the CCT5-H147R mutant, which provide important structural information for this vital protein-folding machine in humans. This first X-ray crystallographic study of a single human CCT subunit in the context of a hexadecameric complex can be expanded in the future to the other 7 subunits that form the TRiC complex.
Structure of the human TRiC/CCT Subunit 5 associated with hereditary sensory neuropathy
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pereira, Jose H.; McAndrew, Ryan P.; Sergeeva, Oksana A.
The human chaperonin TRiC consists of eight non-identical subunits, and its protein-folding activity is critical for cellular health. Misfolded proteins are associated with many human diseases, such as amyloid diseases, cancer, and neuropathies, making TRiC a potential therapeutic target. A detailed structural understanding of its ATP-dependent folding mechanism and substrate recognition is therefore of great importance. Of particular health-related interest is the mutation Histidine 147 to Arginine (H147R) in human TRiC subunit 5 (CCT5), which has been associated with hereditary sensory neuropathy. In this paper, we describe the crystal structures of CCT5 and the CCT5-H147R mutant, which provide important structuralmore » information for this vital protein-folding machine in humans. This first X-ray crystallographic study of a single human CCT subunit in the context of a hexadecameric complex can be expanded in the future to the other 7 subunits that form the TRiC complex.« less
Protein structure recognition: From eigenvector analysis to structural threading method
NASA Astrophysics Data System (ADS)
Cao, Haibo
In this work, we try to understand the protein folding problem using pair-wise hydrophobic interaction as the dominant interaction for the protein folding process. We found a strong correlation between amino acid sequence and the corresponding native structure of the protein. Some applications of this correlation were discussed in this dissertation include the domain partition and a new structural threading method as well as the performance of this method in the CASP5 competition. In the first part, we give a brief introduction to the protein folding problem. Some essential knowledge and progress from other research groups was discussed. This part include discussions of interactions among amino acids residues, lattice HP model, and the designablity principle. In the second part, we try to establish the correlation between amino acid sequence and the corresponding native structure of the protein. This correlation was observed in our eigenvector study of protein contact matrix. We believe the correlation is universal, thus it can be used in automatic partition of protein structures into folding domains. In the third part, we discuss a threading method based on the correlation between amino acid sequence and ominant eigenvector of the structure contact-matrix. A mathematically straightforward iteration scheme provides a self-consistent optimum global sequence-structure alignment. The computational efficiency of this method makes it possible to search whole protein structure databases for structural homology without relying on sequence similarity. The sensitivity and specificity of this method is discussed, along with a case of blind test prediction. In the appendix, we list the overall performance of this threading method in CASP5 blind test in comparison with other existing approaches.
NASA Astrophysics Data System (ADS)
2009-07-01
So what is the site about? Like the popular SETI@home program, which uses the downtime of home computers to sift radio-telescope data for evidence of alien life, Foldit draws on the idle hours of several thousand data-crunchers for help in solving scientific puzzles. But there is a twist. For a start, Foldit is all about biophysics. The project's goal is to understand how proteins - the chains of amino acids that drive processes inside living cells - fold themselves into a myriad of different shapes. But the most striking difference is that Foldit's protein-folding operators are actual human beings, and the datasets they are sifting are disguised as an amazingly addictive computer game.
Mittal, A; Jayaram, B; Shenoy, Sandhya; Bawa, Tejdeep Singh
2010-10-01
Protein folding is at least a six decade old problem, since the times of Pauling and Anfinsen. However, rules of protein folding remain elusive till date. In this work, rigorous analyses of several thousand crystal structures of folded proteins reveal a surprisingly simple unifying principle of backbone organization in protein folding. We find that protein folding is a direct consequence of a narrow band of stoichiometric occurrences of amino-acids in primary sequences, regardless of the size and the fold of a protein. We observe that "preferential interactions" between amino-acids do not drive protein folding, contrary to all prevalent views. We dedicate our discovery to the seminal contribution of Chargaff which was one of the major keys to elucidation of the stoichiometry-driven spatially organized double helical structure of DNA.
pH dependent unfolding characteristics of DLC8 dimer: Residue level details from NMR.
Mohan, P M Krishna; Hosur, Ramakrishna V
2008-11-01
Environment dependence of folding and unfolding of a protein is central to its function. In the same vein, knowledge of pH dependence of stability and folding/unfolding is crucial for many biophysical equilibrium and kinetic studies designed to understand protein folding mechanisms. In the present study we investigated the guanidine induced unfolding transition of dynein light chain protein (DLC8), a cargo adaptor of the dynein complex in the pH range 7-10. It is observed that while the protein remains a dimer in the entire pH range, its stability is somewhat reduced at alkaline pH. Global unfolding features monitored using fluorescence spectroscopy revealed that the unfolding transition of DLC8 at pH 7 is best described by a three-state model, whereas, that at pH 10 is best described by a two-state model. Chemical shift perturbations due to pH change provided insights into the corresponding residue level structural perturbations in the DLC8 dimer. Likewise, backbone (15)N relaxation measurements threw light on the corresponding motional changes in the dimeric protein. These observations have been rationalized on the basis of expected changes with increasing pH in the protonation states of the titratable residues on the structure of the protein. These, in turn provide an explanation for the change from three-state to two-state guanidine induced unfolding transition as the pH is increased from 7 to 10. All these results exemplify and highlight the role of environment vis-à-vis the sequence and structure of a given protein in dictating its folding/unfolding characteristics.
Use of designed sequences in protein structure recognition.
Kumar, Gayatri; Mudgal, Richa; Srinivasan, Narayanaswamy; Sandhya, Sankaran
2018-05-09
Knowledge of the protein structure is a pre-requisite for improved understanding of molecular function. The gap in the sequence-structure space has increased in the post-genomic era. Grouping related protein sequences into families can aid in narrowing the gap. In the Pfam database, structure description is provided for part or full-length proteins of 7726 families. For the remaining 52% of the families, information on 3-D structure is not yet available. We use the computationally designed sequences that are intermediately related to two protein domain families, which are already known to share the same fold. These strategically designed sequences enable detection of distant relationships and here, we have employed them for the purpose of structure recognition of protein families of yet unknown structure. We first measured the success rate of our approach using a dataset of protein families of known fold and achieved a success rate of 88%. Next, for 1392 families of yet unknown structure, we made structural assignments for part/full length of the proteins. Fold association for 423 domains of unknown function (DUFs) are provided as a step towards functional annotation. The results indicate that knowledge-based filling of gaps in protein sequence space is a lucrative approach for structure recognition. Such sequences assist in traversal through protein sequence space and effectively function as 'linkers', where natural linkers between distant proteins are unavailable. This article was reviewed by Oliviero Carugo, Christine Orengo and Srikrishna Subramanian.
Characterization of protein folding by a Φ-value calculation with a statistical-mechanical model.
Wako, Hiroshi; Abe, Haruo
2016-01-01
The Φ-value analysis approach provides information about transition-state structures along the folding pathway of a protein by measuring the effects of an amino acid mutation on folding kinetics. Here we compared the theoretically calculated Φ values of 27 proteins with their experimentally observed Φ values; the theoretical values were calculated using a simple statistical-mechanical model of protein folding. The theoretically calculated Φ values reflected the corresponding experimentally observed Φ values with reasonable accuracy for many of the proteins, but not for all. The correlation between the theoretically calculated and experimentally observed Φ values strongly depends on whether the protein-folding mechanism assumed in the model holds true in real proteins. In other words, the correlation coefficient can be expected to illuminate the folding mechanisms of proteins, providing the answer to the question of which model more accurately describes protein folding: the framework model or the nucleation-condensation model. In addition, we tried to characterize protein folding with respect to various properties of each protein apart from the size and fold class, such as the free-energy profile, contact-order profile, and sensitivity to the parameters used in the Φ-value calculation. The results showed that any one of these properties alone was not enough to explain protein folding, although each one played a significant role in it. We have confirmed the importance of characterizing protein folding from various perspectives. Our findings have also highlighted that protein folding is highly variable and unique across different proteins, and this should be considered while pursuing a unified theory of protein folding.
Characterization of protein folding by a Φ-value calculation with a statistical-mechanical model
Wako, Hiroshi; Abe, Haruo
2016-01-01
The Φ-value analysis approach provides information about transition-state structures along the folding pathway of a protein by measuring the effects of an amino acid mutation on folding kinetics. Here we compared the theoretically calculated Φ values of 27 proteins with their experimentally observed Φ values; the theoretical values were calculated using a simple statistical-mechanical model of protein folding. The theoretically calculated Φ values reflected the corresponding experimentally observed Φ values with reasonable accuracy for many of the proteins, but not for all. The correlation between the theoretically calculated and experimentally observed Φ values strongly depends on whether the protein-folding mechanism assumed in the model holds true in real proteins. In other words, the correlation coefficient can be expected to illuminate the folding mechanisms of proteins, providing the answer to the question of which model more accurately describes protein folding: the framework model or the nucleation-condensation model. In addition, we tried to characterize protein folding with respect to various properties of each protein apart from the size and fold class, such as the free-energy profile, contact-order profile, and sensitivity to the parameters used in the Φ-value calculation. The results showed that any one of these properties alone was not enough to explain protein folding, although each one played a significant role in it. We have confirmed the importance of characterizing protein folding from various perspectives. Our findings have also highlighted that protein folding is highly variable and unique across different proteins, and this should be considered while pursuing a unified theory of protein folding. PMID:28409079
When fast is better: protein folding fundamentals and mechanisms from ultrafast approaches
Muñoz, Victor; Cerminara, Michele
2016-01-01
Protein folding research stalled for decades because conventional experiments indicated that proteins fold slowly and in single strokes, whereas theory predicted a complex interplay between dynamics and energetics resulting in myriad microscopic pathways. Ultrafast kinetic methods turned the field upside down by providing the means to probe fundamental aspects of folding, test theoretical predictions and benchmark simulations. Accordingly, experimentalists could measure the timescales for all relevant folding motions, determine the folding speed limit and confirm that folding barriers are entropic bottlenecks. Moreover, a catalogue of proteins that fold extremely fast (microseconds) could be identified. Such fast-folding proteins cross shallow free energy barriers or fold downhill, and thus unfold with minimal co-operativity (gradually). A new generation of thermodynamic methods has exploited this property to map folding landscapes, interaction networks and mechanisms at nearly atomic resolution. In parallel, modern molecular dynamics simulations have finally reached the timescales required to watch fast-folding proteins fold and unfold in silico. All of these findings have buttressed the fundamentals of protein folding predicted by theory, and are now offering the first glimpses at the underlying mechanisms. Fast folding appears to also have functional implications as recent results connect downhill folding with intrinsically disordered proteins, their complex binding modes and ability to moonlight. These connections suggest that the coupling between downhill (un)folding and binding enables such protein domains to operate analogically as conformational rheostats. PMID:27574021
Thermostability in endoglucanases is fold-specific
2011-01-01
Background Endoglucanases are usually considered to be synergistically involved in the initial stages of cellulose breakdown-an essential step in the bioprocessing of lignocellulosic plant materials into bioethanol. Despite their economic importance, we currently lack a basic understanding of how some endoglucanases can sustain their ability to function at elevated temperatures required for bioprocessing, while others cannot. In this study, we present a detailed comparative analysis of both thermophilic and mesophilic endoglucanases in order to gain insights into origins of thermostability. We analyzed the sequences and structures for sets of endoglucanase proteins drawn from the Carbohydrate-Active enZymes (CAZy) database. Results Our results demonstrate that thermophilic endoglucanases and their mesophilic counterparts differ significantly in their amino acid compositions. Strikingly, these compositional differences are specific to protein folds and enzyme families, and lead to differences in intramolecular interactions in a fold-dependent fashion. Conclusions Here, we provide fold-specific guidelines to control thermostability in endoglucanases that will aid in making production of biofuels from plant biomass more efficient. PMID:21291533
Deng, Nan-jie; Dai, Wei
2013-01-01
Understanding how kinetics in the unfolded state affects protein folding is a fundamentally important yet less well-understood issue. Here we employ three different models to analyze the unfolded landscape and folding kinetics of the miniprotein Trp-cage. The first is a 208 μs explicit solvent molecular dynamics (MD) simulation from D. E. Shaw Research containing tens of folding events. The second is a Markov state model (MSM-MD) constructed from the same ultra-long MD simulation; MSM-MD can be used to generate thousands of folding events. The third is a Markov state model built from temperature replica exchange MD simulations in implicit solvent (MSM-REMD). All the models exhibit multiple folding pathways, and there is a good correspondence between the folding pathways from direct MD and those computed from the MSMs. The unfolded populations interconvert rapidly between extended and collapsed conformations on time scales ≤ 40 ns, compared with the folding time of ≈ 5 μs. The folding rates are independent of where the folding is initiated from within the unfolded ensemble. About 90 % of the unfolded states are sampled within the first 40 μs of the ultra-long MD trajectory, which on average explores ~27 % of the unfolded state ensemble between consecutive folding events. We clustered the folding pathways according to structural similarity into “tubes”, and kinetically partitioned the unfolded state into populations that fold along different tubes. From our analysis of the simulations and a simple kinetic model, we find that when the mixing within the unfolded state is comparable to or faster than folding, the folding waiting times for all the folding tubes are similar and the folding kinetics is essentially single exponential despite the presence of heterogeneous folding paths with non-uniform barriers. When the mixing is much slower than folding, different unfolded populations fold independently leading to non-exponential kinetics. A kinetic partition of the Trp-cage unfolded state is constructed which reveals that different unfolded populations have almost the same probability to fold along any of the multiple folding paths. We are investigating whether the results for the kinetics in the unfolded state of the twenty-residue Trp-cage is representative of larger single domain proteins. PMID:23705683
NASA Astrophysics Data System (ADS)
Angell, C. A.
1997-02-01
We show how energy landscape concepts can rationalize the observations on glassforming liquids over the whole range of behavior, strong to fragile. In particular, we show how the existence of landscapes with both strong and fragile megabasins can provide a basis for understanding the nature of quasi-first-order transitions between amorphous states such as those observed to occur in the glassy states of “strong” glassformers. We show how this propensity originates in the liquid state and then emphasize the analogy provided, at the mesoscopic level, by the folding transition in proteins. Recognition that the folding transition is an equilibrium first-order transition between polyamorphic forms of a complex system implies recognition of the need for a nucleation step in the process. When nucleated phase transitions are kinetically retarded, their probability can be influenced by time-temperature history and by the presence of nucleating agents. Nucleation events are statistically rare in mesoscopic systems, hence the ability to fold rapidly should require special features in the folding molecular structure or the presence of nucleating agents. We propose that the unwanted folding events leading to pathogenic forms of certain proteins (prions) can be stimulated by nucleating agents, which thus may be the unidentified infectious agents in “mad cow” disease and related maladies.
Membrane Topology and Insertion of Membrane Proteins: Search for Topogenic Signals
van Geest, Marleen; Lolkema, Juke S.
2000-01-01
Integral membrane proteins are found in all cellular membranes and carry out many of the functions that are essential to life. The membrane-embedded domains of integral membrane proteins are structurally quite simple, allowing the use of various prediction methods and biochemical methods to obtain structural information about membrane proteins. A critical step in the biosynthetic pathway leading to the folded protein in the membrane is its insertion into the lipid bilayer. Understanding of the fundamentals of the insertion and folding processes will significantly improve the methods used to predict the three-dimensional membrane protein structure from the amino acid sequence. In the first part of this review, biochemical approaches to elucidate membrane protein topology are reviewed and evaluated, and in the second part, the use of similar techniques to study membrane protein insertion is discussed. The latter studies search for signals in the polypeptide chain that direct the insertion process. Knowledge of the topogenic signals in the nascent chain of a membrane protein is essential for the evaluation of membrane topology studies. PMID:10704472
In Touch with Molecules: Improving Student Learning with Innovative Molecular Models
ERIC Educational Resources Information Center
Davenport, Jodi; Silberglitt, Matt; Olson, Arthur
2013-01-01
How do viruses self-assemble? Why do DNA bases pair the way they do? What factors determine whether strands of proteins fold into sheets or helices? Why does handedness matter? A deep understanding of core issues in biology requires students to understand both complex spatial structures of molecules and the interactions involved in dynamic…
Unfolded Protein Response of the Endoplasmic Reticulum in Tumor Progression and Immunogenicity
Yoo, Yoon Seon; Han, Hye Gyeong
2017-01-01
The endoplasmic reticulum (ER) is a pivotal regulator of folding, quality control, trafficking, and targeting of secreted and transmembrane proteins, and accordingly, eukaryotic cells have evolved specialized machinery to ensure that the ER enables these proteins to acquire adequate folding and maturation in the presence of intrinsic and extrinsic insults. This adaptive capacity of the ER to intrinsic and extrinsic perturbations is important for maintaining protein homeostasis, which is termed proteostasis. Failure in adaptation to these perturbations leads to accumulation of misfolded or unassembled proteins in the ER, which is termed ER stress, resulting in the activation of unfolded protein response (UPR) of the ER and the execution of ER-associated degradation (ERAD) to restore homeostasis. Furthermore, both of the two axes play key roles in the control of tumor progression, inflammation, immunity, and aging. Therefore, understanding UPR of the ER and subsequent ERAD will provide new insights into the pathogenesis of many human diseases and contribute to therapeutic intervention in these diseases. PMID:29430279
Energy landscape of knotted protein folding
Sułkowska, Joanna I.; Noel, Jeffrey K.; Onuchic, Jose N.
2012-01-01
Recent experiments have conclusively shown that proteins are able to fold from an unknotted, denatured polypeptide to the knotted, native state without the aid of chaperones. These experiments are consistent with a growing body of theoretical work showing that a funneled, minimally frustrated energy landscape is sufficient to fold small proteins with complex topologies. Here, we present a theoretical investigation of the folding of a knotted protein, 2ouf, engineered in the laboratory by a domain fusion that mimics an evolutionary pathway for knotted proteins. Unlike a previously studied knotted protein of similar length, we see reversible folding/knotting and a surprising lack of deep topological traps with a coarse-grained structure-based model. Our main interest is to investigate how evolution might further select the geometry and stiffness of the threading region of the newly fused protein. We compare the folding of the wild-type protein to several mutants. Similarly to the wild-type protein, all mutants show robust and reversible folding, and knotting coincides with the transition state ensemble. As observed experimentally, our simulations show that the knotted protein folds about ten times slower than an unknotted construct with an identical contact map. Simulated folding kinetics reflect the experimentally observed rollover in the folding limbs of chevron plots. Successful folding of the knotted protein is restricted to a narrow range of temperature as compared to the unknotted protein and fits of the kinetic folding data below folding temperature suggest slow, nondiffusive dynamics for the knotted protein. PMID:22891304
Mesoscale Modeling of Chromatin Folding
NASA Astrophysics Data System (ADS)
Schlick, Tamar
2009-03-01
Eukaryotic chromatin is the fundamental protein/nucleic acid unit that stores the genetic material. Understanding how chromatin fibers fold and unfold in physiological conditions is important for interpreting fundamental biological processes like DNA replication and transcription regulation. Using a mesoscopic model of oligonucleosome chains and tailored sampling protocols, we elucidate the energetics of oligonucleosome folding/unfolding and the role of each histone tail, linker histones, and divalent ions in regulating chromatin structure. The resulting compact topologies reconcile features of the zigzag model with straight linker DNAs with the solenoid model with bent linker DNAs for optimal fiber organization and reveal dynamic and energetic aspects involved.
Improving Protein Fold Recognition by Deep Learning Networks.
Jo, Taeho; Hou, Jie; Eickholt, Jesse; Cheng, Jianlin
2015-12-04
For accurate recognition of protein folds, a deep learning network method (DN-Fold) was developed to predict if a given query-template protein pair belongs to the same structural fold. The input used stemmed from the protein sequence and structural features extracted from the protein pair. We evaluated the performance of DN-Fold along with 18 different methods on Lindahl's benchmark dataset and on a large benchmark set extracted from SCOP 1.75 consisting of about one million protein pairs, at three different levels of fold recognition (i.e., protein family, superfamily, and fold) depending on the evolutionary distance between protein sequences. The correct recognition rate of ensembled DN-Fold for Top 1 predictions is 84.5%, 61.5%, and 33.6% and for Top 5 is 91.2%, 76.5%, and 60.7% at family, superfamily, and fold levels, respectively. We also evaluated the performance of single DN-Fold (DN-FoldS), which showed the comparable results at the level of family and superfamily, compared to ensemble DN-Fold. Finally, we extended the binary classification problem of fold recognition to real-value regression task, which also show a promising performance. DN-Fold is freely available through a web server at http://iris.rnet.missouri.edu/dnfold.
Competition between protein folding and aggregation: A three-dimensional lattice-model simulation
NASA Astrophysics Data System (ADS)
Bratko, D.; Blanch, H. W.
2001-01-01
Aggregation of protein molecules resulting in the loss of biological activity and the formation of insoluble deposits represents a serious problem for the biotechnology and pharmaceutical industries and in medicine. Considerable experimental and theoretical efforts are being made in order to improve our understanding of, and ability to control, the process. In the present work, we describe a Monte Carlo study of a multichain system of coarse-grained model proteins akin to lattice models developed for simulations of protein folding. The model is designed to examine the competition between intramolecular interactions leading to the native protein structure, and intermolecular association, resulting in the formation of aggregates of misfolded chains. Interactions between the segments are described by a variation of the Go potential [N. Go and H. Abe, Biopolymers 20, 1013 (1981)] that extends the recognition between attracting types of segments to pairs on distinct chains. For the particular model we adopt, the global free energy minimum of a pair of protein molecules corresponds to a dimer of native proteins. When three or more molecules interact, clusters of misfolded chains can be more stable than aggregates of native folds. A considerable fraction of native structure, however, is preserved in these cases. Rates of conformational changes rapidly decrease with the size of the protein cluster. Within the timescale accessible to computer simulations, the folding-aggregation balance is strongly affected by kinetic considerations. Both the native form and aggregates can persist in metastable states, even if conditions such as temperature or concentration favor a transition to an alternative form. Refolding yield can be affected by the presence of an additional polymer species mimicking the function of a molecular chaperone.
Atomic-level description of ubiquitin folding
Piana, Stefano; Lindorff-Larsen, Kresten; Shaw, David E.
2013-01-01
Equilibrium molecular dynamics simulations, in which proteins spontaneously and repeatedly fold and unfold, have recently been used to help elucidate the mechanistic principles that underlie the folding of fast-folding proteins. The extent to which the conclusions drawn from the analysis of such proteins, which fold on the microsecond timescale, apply to the millisecond or slower folding of naturally occurring proteins is, however, unclear. As a first attempt to address this outstanding issue, we examine here the folding of ubiquitin, a 76-residue-long protein found in all eukaryotes that is known experimentally to fold on a millisecond timescale. Ubiquitin folding has been the subject of many experimental studies, but its slow folding rate has made it difficult to observe and characterize the folding process through all-atom molecular dynamics simulations. Here we determine the mechanism, thermodynamics, and kinetics of ubiquitin folding through equilibrium atomistic simulations. The picture emerging from the simulations is in agreement with a view of ubiquitin folding suggested from previous experiments. Our findings related to the folding of ubiquitin are also consistent, for the most part, with the folding principles derived from the simulation of fast-folding proteins, suggesting that these principles may be applicable to a wider range of proteins. PMID:23503848
When fast is better: protein folding fundamentals and mechanisms from ultrafast approaches.
Muñoz, Victor; Cerminara, Michele
2016-09-01
Protein folding research stalled for decades because conventional experiments indicated that proteins fold slowly and in single strokes, whereas theory predicted a complex interplay between dynamics and energetics resulting in myriad microscopic pathways. Ultrafast kinetic methods turned the field upside down by providing the means to probe fundamental aspects of folding, test theoretical predictions and benchmark simulations. Accordingly, experimentalists could measure the timescales for all relevant folding motions, determine the folding speed limit and confirm that folding barriers are entropic bottlenecks. Moreover, a catalogue of proteins that fold extremely fast (microseconds) could be identified. Such fast-folding proteins cross shallow free energy barriers or fold downhill, and thus unfold with minimal co-operativity (gradually). A new generation of thermodynamic methods has exploited this property to map folding landscapes, interaction networks and mechanisms at nearly atomic resolution. In parallel, modern molecular dynamics simulations have finally reached the timescales required to watch fast-folding proteins fold and unfold in silico All of these findings have buttressed the fundamentals of protein folding predicted by theory, and are now offering the first glimpses at the underlying mechanisms. Fast folding appears to also have functional implications as recent results connect downhill folding with intrinsically disordered proteins, their complex binding modes and ability to moonlight. These connections suggest that the coupling between downhill (un)folding and binding enables such protein domains to operate analogically as conformational rheostats. © 2016 The Author(s).
Wang, Minglei; Jiang, Ying-Ying; Kim, Kyung Mo; Qu, Ge; Ji, Hong-Fang; Mittenthal, Jay E; Zhang, Hong-Yu; Caetano-Anollés, Gustavo
2011-01-01
The standard molecular clock describes a constant rate of molecular evolution and provides a powerful framework for evolutionary timescales. Here, we describe the existence and implications of a molecular clock of folds, a universal recurrence in the discovery of new structures in the world of proteins. Using a phylogenomic structural census in hundreds of proteomes, we build phylogenies and time lines of domains at fold and fold superfamily levels of structural complexity. These time lines correlate approximately linearly with geological timescales and were here used to date two crucial events in life history, planet oxygenation and organism diversification. We first dissected the structures and functions of enzymes in simulated metabolic networks. The placement of anaerobic and aerobic enzymes in the time line revealed that aerobic metabolism emerged about 2.9 billion years (giga-annum; Ga) ago and expanded during a period of about 400 My, reaching what is known as the Great Oxidation Event. During this period, enzymes recruited old and new folds for oxygen-mediated enzymatic activities. Remarkably, the first fold lost by a superkingdom disappeared in Archaea 2.6 Ga ago, within the span of oxygen rise, suggesting that oxygen also triggered diversification of life. The implications of a molecular clock of folds are many and important for the neutral theory of molecular evolution and for understanding the growth and diversity of the protein world. The clock also extends the standard concept that was specific to molecules and their timescales and turns it into a universal timescale-generating tool.
From Sequence and Forces to Structure, Function and Evolution of Intrinsically Disordered Proteins
Forman-Kay, Julie D.; Mittag, Tanja
2015-01-01
Intrinsically disordered proteins (IDPs), which lack persistent structure, are a challenge to structural biology due to the inapplicability of standard methods for characterization of folded proteins as well as their deviation from the dominant structure/function paradigm. Their widespread presence and involvement in biological function, however, has spurred the growing acceptance of the importance of IDPs and the development of new tools for studying their structure, dynamics and function. The interplay of folded and disordered domains or regions for function and the existence of a continuum of protein states with respect to conformational energetics, motional timescales and compactness is shaping a unified understanding of structure-dynamics-disorder/function relationships. On the 20th anniversary of this journal, Structure, we provide a historical perspective on the investigation of IDPs and summarize the sequence features and physical forces that underlie their unique structural, functional and evolutionary properties. PMID:24010708
From sequence and forces to structure, function, and evolution of intrinsically disordered proteins.
Forman-Kay, Julie D; Mittag, Tanja
2013-09-03
Intrinsically disordered proteins (IDPs), which lack persistent structure, are a challenge to structural biology due to the inapplicability of standard methods for characterization of folded proteins as well as their deviation from the dominant structure/function paradigm. Their widespread presence and involvement in biological function, however, has spurred the growing acceptance of the importance of IDPs and the development of new tools for studying their structure, dynamics, and function. The interplay of folded and disordered domains or regions for function and the existence of a continuum of protein states with respect to conformational energetics, motional timescales, and compactness are shaping a unified understanding of structure-dynamics-disorder/function relationships. In the 20(th) anniversary of Structure, we provide a historical perspective on the investigation of IDPs and summarize the sequence features and physical forces that underlie their unique structural, functional, and evolutionary properties. Copyright © 2013 Elsevier Ltd. All rights reserved.
Okura, Hiromichi; Takahashi, Tsuyoshi; Mihara, Hisakazu
2012-06-01
Successful approaches of de novo protein design suggest a great potential to create novel structural folds and to understand natural rules of protein folding. For these purposes, smaller and simpler de novo proteins have been developed. Here, we constructed smaller proteins by removing the terminal sequences from stable de novo vTAJ proteins and compared stabilities between mutant and original proteins. vTAJ proteins were screened from an α3β3 binary-patterned library which was designed with polar/ nonpolar periodicities of α-helix and β-sheet. vTAJ proteins have the additional terminal sequences due to the method of constructing the genetically repeated library sequences. By removing the parts of the sequences, we successfully obtained the stable smaller de novo protein mutants with fewer amino acid alphabets than the originals. However, these mutants showed the differences on ANS binding properties and stabilities against denaturant and pH change. The terminal sequences, which were designed just as flexible linkers not as secondary structure units, sufficiently affected these physicochemical details. This study showed implications for adjusting protein stabilities by designing N- and C-terminal sequences.
Fisher, Kaitlin M.; Haglund, Ellinor; Noel, Jeffrey K.; Hailey, Kendra L.; Onuchic, José N.; Jennings, Patricia A.
2015-01-01
Interleukin-33 (IL-33) is currently the focus of multiple investigations into targeting pernicious inflammatory disorders. This mediator of inflammation plays a prevalent role in chronic disorders such as asthma, rheumatoid arthritis, and progressive heart disease. In order to better understand the possible link between the folding free energy landscape and functional regions in IL-33, a combined experimental and theoretical approach was applied. IL-33 is a pseudo- symmetrical protein composed of three distinct structural elements that complicate the folding mechanism due to competition for nucleation on the dominant folding route. Trefoil 1 constitutes the majority of the binding interface with the receptor whereas Trefoils 2 and 3 provide the stable scaffold to anchor Trefoil 1. We identified that IL-33 folds with a three-state mechanism, leading to a rollover in the refolding arm of its chevron plots in strongly native conditions. In addition, there is a second slower refolding phase that exhibits the same rollover suggesting similar limitations in folding along parallel routes. Characterization of the intermediate state and the rate limiting steps required for folding suggests that the rollover is attributable to a moving transition state, shifting from a post- to pre-intermediate transition state as you move from strongly native conditions to the midpoint of the transition. On a structural level, we found that initially, all independent Trefoil units fold equally well until a QCA of 0.35 when Trefoil 1 will backtrack in order to allow Trefoils 2 and 3 to fold in the intermediate state, creating a stable scaffold for Trefoil 1 to fold onto during the final folding transition. The formation of this intermediate state and subsequent moving transition state is a result of balancing the difficulty in folding the functionally important Trefoil 1 onto the remainder of the protein. Taken together our results indicate that the functional element of the protein is geometrically frustrated, requiring the more stable elements to fold first, acting as a scaffold for docking of the functional element to allow productive folding to the native state. PMID:26630011
Bi, Rui; Pan, Yiou; Shang, Qingli; Peng, Tianfei; Yang, Shuang; Wang, Shang; Xin, Xuecheng; Liu, Yan; Xi, Jinghui
2016-09-01
Lambda-cyhalothrin is now widely used in China to control the soybean aphid Aphis glycines. To dissect the resistance mechanism, a laboratory-selected resistant soybean aphid strain (CRR) was established with a 43.42-fold resistance ratio to λ-cyhalothrin than the susceptible strain (CSS) in adult aphids. In this study, a comparative proteomic analysis between the CRR and CSS strains revealed important differences between the susceptible and resistant strains of soybean aphids for λ-cyhalothrin. Approximately 493 protein spots were detected in two-dimensional polyacrylamide gel electrophoresis (2-DE). Thirty-six protein spots displayed differential expression of >2-fold in the CRR strain compared to the CSS strain. Out of these 36 protein spots, 21 had elevated and 15 had decreased expression. Twenty-four differentially expressed proteins were identified by MALDI TOF MS/MS and categorized into the functional groups cytoskeleton-related protein, carbohydrate and energy metabolism, protein folding, antioxidant system, and nucleotide and amino acid metabolism. Function analysis showed that cytoskeleton-related proteins and energy metabolism proteins have been associated with the λ-cyhalothrin resistance of A. glycines. The differential expression of λ-cyhalothrin responsive proteins reflected the overall change in cellular structure and metabolism after insecticide treatment in aphids. In summary, our studies improve understanding of the molecular mechanism resistance of soybean aphid to lambda-cyhalothrin, which will facilitate the development of rational approaches to improve the management of this pest and to improve the yield of soybean. Copyright © 2016. Published by Elsevier Inc.
NASA Astrophysics Data System (ADS)
Tavenor, Nathan Albert
Protein-based supramolecular polymers (SMPs) are a class of biomaterials which draw inspiration from and expand upon the many examples of complex protein quaternary structures observed in nature: collagen, microtubules, viral capsids, etc. Designing synthetic supramolecular protein scaffolds both increases our understanding of natural superstructures and allows for the creation of novel materials. Similar to small-molecule SMPs, protein-based SMPs form due to self-assembly driven by intermolecular interactions between monomers, and monomer structure determines the properties of the overall material. Using protein-based monomers takes advantage of the self-assembly and highly specific molecular recognition properties encodable in polypeptide sequences to rationally design SMP architectures. The central hypothesis underlying our work is that alpha-helical coiled coils, a well-studied protein quaternary folding motif, are well-suited to SMP design through the addition of synthetic linkers at solvent-exposed sites. Through small changes in the structures of the cross-links and/or peptide sequence, we have been able to control both the nanoscale organization and the macroscopic properties of the SMPs. Changes to the linker and hydrophobic core of the peptide can be used to control polymer rigidity, stability, and dimensionality. The gaps in knowledge that this thesis sought to fill on this project were 1) the relationship between the molecular structure of the cross-linked polypeptides and the macroscopic properties of the SMPs and 2) a means of creating materials exhibiting multi-dimensional net or framework topologies. Separate from the above efforts on supramolecular architectures was work on improving backbone modification strategies for an alpha-helix in the context of a complex protein tertiary fold. Earlier work in our lab had successfully incorporated unnatural building blocks into every major secondary structure (beta-sheet, alpha-helix, loops and beta-turns) of a small protein with a tertiary fold. Although the tertiary fold of the native sequence was mimicked by the resulting artificial protein, the thermodynamic stability was greatly compromised. Most of this energetic penalty derived from the modifications present in the alpha-helix. The contribution within this thesis was direct comparison of several alpha-helical design strategies and establishment of the thermodynamic consequences of each.
Codon Optimization to Enhance Expression Yields Insights into Chloroplast Translation1[OPEN
Chan, Hui-Ting; Williams-Carrier, Rosalind; Barkan, Alice
2016-01-01
Codon optimization based on psbA genes from 133 plant species eliminated 105 (human clotting factor VIII heavy chain [FVIII HC]) and 59 (polio VIRAL CAPSID PROTEIN1 [VP1]) rare codons; replacement with only the most highly preferred codons decreased transgene expression (77- to 111-fold) when compared with the codon usage hierarchy of the psbA genes. Targeted proteomic quantification by parallel reaction monitoring analysis showed 4.9- to 7.1-fold or 22.5- to 28.1-fold increase in FVIII or VP1 codon-optimized genes when normalized with stable isotope-labeled standard peptides (or housekeeping protein peptides), but quantitation using western blots showed 6.3- to 8-fold or 91- to 125-fold increase of transgene expression from the same batch of materials, due to limitations in quantitative protein transfer, denaturation, solubility, or stability. Parallel reaction monitoring, to our knowledge validated here for the first time for in planta quantitation of biopharmaceuticals, is especially useful for insoluble or multimeric proteins required for oral drug delivery. Northern blots confirmed that the increase of codon-optimized protein synthesis is at the translational level rather than any impact on transcript abundance. Ribosome footprints did not increase proportionately with VP1 translation or even decreased after FVIII codon optimization but is useful in diagnosing additional rate-limiting steps. A major ribosome pause at CTC leucine codons in the native gene of FVIII HC was eliminated upon codon optimization. Ribosome stalls observed at clusters of serine codons in the codon-optimized VP1 gene provide an opportunity for further optimization. In addition to increasing our understanding of chloroplast translation, these new tools should help to advance this concept toward human clinical studies. PMID:27465114
Zhuravlev, Pavel I; Papoian, Garegin A
2010-08-01
Energy landscape theories have provided a common ground for understanding the protein folding problem, which once seemed to be overwhelmingly complicated. At the same time, the native state was found to be an ensemble of interconverting states with frustration playing a more important role compared to the folding problem. The landscape of the folded protein - the native landscape - is glassier than the folding landscape; hence, a general description analogous to the folding theories is difficult to achieve. On the other hand, the native basin phase volume is much smaller, allowing a protein to fully sample its native energy landscape on the biological timescales. Current computational resources may also be used to perform this sampling for smaller proteins, to build a 'topographical map' of the native landscape that can be used for subsequent analysis. Several major approaches to representing this topographical map are highlighted in this review, including the construction of kinetic networks, hierarchical trees and free energy surfaces with subsequent structural and kinetic analyses. In this review, we extensively discuss the important question of choosing proper collective coordinates characterizing functional motions. In many cases, the substates on the native energy landscape, which represent different functional states, can be used to obtain variables that are well suited for building free energy surfaces and analyzing the protein's functional dynamics. Normal mode analysis can provide such variables in cases where functional motions are dictated by the molecule's architecture. Principal component analysis is a more expensive way of inferring the essential variables from the protein's motions, one that requires a long molecular dynamics simulation. Finally, the two popular models for the allosteric switching mechanism, 'preexisting equilibrium' and 'induced fit', are interpreted within the energy landscape paradigm as extreme points of a continuum of transition mechanisms. Some experimental evidence illustrating each of these two models, as well as intermediate mechanisms, is presented and discussed.
Folding of a single domain protein entering the endoplasmic reticulum precedes disulfide formation.
Robinson, Philip J; Pringle, Marie Anne; Woolhead, Cheryl A; Bulleid, Neil J
2017-04-28
The relationship between protein synthesis, folding, and disulfide formation within the endoplasmic reticulum (ER) is poorly understood. Previous studies have suggested that pre-existing disulfide links are absolutely required to allow protein folding and, conversely, that protein folding occurs prior to disulfide formation. To address the question of what happens first within the ER, that is, protein folding or disulfide formation, we studied folding events at the early stages of polypeptide chain translocation into the mammalian ER using stalled translation intermediates. Our results demonstrate that polypeptide folding can occur without complete domain translocation. Protein disulfide isomerase (PDI) interacts with these early intermediates, but disulfide formation does not occur unless the entire sequence of the protein domain is translocated. This is the first evidence that folding of the polypeptide chain precedes disulfide formation within a cellular context and highlights key differences between protein folding in the ER and refolding of purified proteins. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Improving Protein Fold Recognition by Deep Learning Networks
NASA Astrophysics Data System (ADS)
Jo, Taeho; Hou, Jie; Eickholt, Jesse; Cheng, Jianlin
2015-12-01
For accurate recognition of protein folds, a deep learning network method (DN-Fold) was developed to predict if a given query-template protein pair belongs to the same structural fold. The input used stemmed from the protein sequence and structural features extracted from the protein pair. We evaluated the performance of DN-Fold along with 18 different methods on Lindahl’s benchmark dataset and on a large benchmark set extracted from SCOP 1.75 consisting of about one million protein pairs, at three different levels of fold recognition (i.e., protein family, superfamily, and fold) depending on the evolutionary distance between protein sequences. The correct recognition rate of ensembled DN-Fold for Top 1 predictions is 84.5%, 61.5%, and 33.6% and for Top 5 is 91.2%, 76.5%, and 60.7% at family, superfamily, and fold levels, respectively. We also evaluated the performance of single DN-Fold (DN-FoldS), which showed the comparable results at the level of family and superfamily, compared to ensemble DN-Fold. Finally, we extended the binary classification problem of fold recognition to real-value regression task, which also show a promising performance. DN-Fold is freely available through a web server at http://iris.rnet.missouri.edu/dnfold.
Unique Features of Halophilic Proteins.
Arakawa, Tsutomu; Yamaguchi, Rui; Tokunaga, Hiroko; Tokunaga, Masao
2017-01-01
Proteins from moderate and extreme halophiles have unique characteristics. They are highly acidic and hydrophilic, similar to intrinsically disordered proteins. These characteristics make the halophilic proteins soluble in water and fold reversibly. In addition to reversible folding, the rate of refolding of halophilic proteins from denatured structure is generally slow, often taking several days, for example, for extremely halophilic proteins. This slow folding rate makes the halophilic proteins a novel model system for folding mechanism analysis. High solubility and reversible folding also make the halophilic proteins excellent fusion partners for soluble expression of recombinant proteins.
PASS2: an automated database of protein alignments organised as structural superfamilies.
Bhaduri, Anirban; Pugalenthi, Ganesan; Sowdhamini, Ramanathan
2004-04-02
The functional selection and three-dimensional structural constraints of proteins in nature often relates to the retention of significant sequence similarity between proteins of similar fold and function despite poor sequence identity. Organization of structure-based sequence alignments for distantly related proteins, provides a map of the conserved and critical regions of the protein universe that is useful for the analysis of folding principles, for the evolutionary unification of protein families and for maximizing the information return from experimental structure determination. The Protein Alignment organised as Structural Superfamily (PASS2) database represents continuously updated, structural alignments for evolutionary related, sequentially distant proteins. An automated and updated version of PASS2 is, in direct correspondence with SCOP 1.63, consisting of sequences having identity below 40% among themselves. Protein domains have been grouped into 628 multi-member superfamilies and 566 single member superfamilies. Structure-based sequence alignments for the superfamilies have been obtained using COMPARER, while initial equivalencies have been derived from a preliminary superposition using LSQMAN or STAMP 4.0. The final sequence alignments have been annotated for structural features using JOY4.0. The database is supplemented with sequence relatives belonging to different genomes, conserved spatially interacting and structural motifs, probabilistic hidden markov models of superfamilies based on the alignments and useful links to other databases. Probabilistic models and sensitive position specific profiles obtained from reliable superfamily alignments aid annotation of remote homologues and are useful tools in structural and functional genomics. PASS2 presents the phylogeny of its members both based on sequence and structural dissimilarities. Clustering of members allows us to understand diversification of the family members. The search engine has been improved for simpler browsing of the database. The database resolves alignments among the structural domains consisting of evolutionarily diverged set of sequences. Availability of reliable sequence alignments of distantly related proteins despite poor sequence identity and single-member superfamilies permit better sampling of structures in libraries for fold recognition of new sequences and for the understanding of protein structure-function relationships of individual superfamilies. PASS2 is accessible at http://www.ncbs.res.in/~faculty/mini/campass/pass2.html
NASA Astrophysics Data System (ADS)
Bergasa-Caceres, Fernando; Rabitz, Herschel A.
2013-06-01
A model of protein folding kinetics is applied to study the effects of macromolecular crowding on protein folding rate and stability. Macromolecular crowding is found to promote a decrease of the entropic cost of folding of proteins that produces an increase of both the stability and the folding rate. The acceleration of the folding rate due to macromolecular crowding is shown to be a topology-dependent effect. The model is applied to the folding dynamics of the murine prion protein (121-231). The differential effect of macromolecular crowding as a function of protein topology suffices to make non-native configurations relatively more accessible.
Protein degradation and protection against misfolded or damaged proteins
NASA Astrophysics Data System (ADS)
Goldberg, Alfred L.
2003-12-01
The ultimate mechanism that cells use to ensure the quality of intracellular proteins is the selective destruction of misfolded or damaged polypeptides. In eukaryotic cells, the large ATP-dependent proteolytic machine, the 26S proteasome, prevents the accumulation of non-functional, potentially toxic proteins. This process is of particular importance in protecting cells against harsh conditions (for example, heat shock or oxidative stress) and in a variety of diseases (for example, cystic fibrosis and the major neurodegenerative diseases). A full understanding of the pathogenesis of the protein-folding diseases will require greater knowledge of how misfolded proteins are recognized and selectively degraded.
Evidence from mixed hydrate nucleation for a funnel model of crystallization.
Hall, Kyle Wm; Carpendale, Sheelagh; Kusalik, Peter G
2016-10-25
The molecular-level details of crystallization remain unclear for many systems. Previous work has speculated on the phenomenological similarities between molecular crystallization and protein folding. Here we demonstrate that molecular crystallization can involve funnel-shaped potential energy landscapes through a detailed analysis of mixed gas hydrate nucleation, a prototypical multicomponent crystallization process. Through this, we contribute both: (i) a powerful conceptual framework for exploring and rationalizing molecular crystallization, and (ii) an explanation of phenomenological similarities between protein folding and crystallization. Such funnel-shaped potential energy landscapes may be typical of broad classes of molecular ordering processes, and can provide a new perspective for both studying and understanding these processes.
Evidence from mixed hydrate nucleation for a funnel model of crystallization
Hall, Kyle Wm.; Carpendale, Sheelagh; Kusalik, Peter G.
2016-01-01
The molecular-level details of crystallization remain unclear for many systems. Previous work has speculated on the phenomenological similarities between molecular crystallization and protein folding. Here we demonstrate that molecular crystallization can involve funnel-shaped potential energy landscapes through a detailed analysis of mixed gas hydrate nucleation, a prototypical multicomponent crystallization process. Through this, we contribute both: (i) a powerful conceptual framework for exploring and rationalizing molecular crystallization, and (ii) an explanation of phenomenological similarities between protein folding and crystallization. Such funnel-shaped potential energy landscapes may be typical of broad classes of molecular ordering processes, and can provide a new perspective for both studying and understanding these processes. PMID:27790987
Interstitial protein alterations in rabbit vocal fold with scar.
Thibeault, Susan L; Bless, Diane M; Gray, Steven D
2003-09-01
Fibrous and interstitial proteins compose the extracellular matrix of the vocal fold lamina propria and account for its biomechanic properties. Vocal fold scarring is characterized by altered biomechanical properties, which create dysphonia. Although alterations of the fibrous proteins have been confirmed in the rabbit vocal fold scar, interstitial proteins, which are known to be important in wound repair, have not been investigated to date. Using a rabbit model, interstitial proteins decorin, fibromodulin, and fibronectin were examined immunohistologically, two months postinduction of vocal fold scar by means of forcep biopsy. Significantly decreased decorin and fibromodulin with significantly increased fibronectin characterized scarred vocal fold tissue. The implications of altered interstitial proteins levels and their affect on the fibrous proteins will be discussed in relation to increased vocal fold stiffness and viscosity, which characterizes vocal fold scar.
Zhuravleva, Anastasia; Korzhnev, Dmitry M
2017-05-01
Protein folding is a highly complex process proceeding through a number of disordered and partially folded nonnative states with various degrees of structural organization. These transiently and sparsely populated species on the protein folding energy landscape play crucial roles in driving folding toward the native conformation, yet some of these nonnative states may also serve as precursors for protein misfolding and aggregation associated with a range of devastating diseases, including neuro-degeneration, diabetes and cancer. Therefore, in vivo protein folding is often reshaped co- and post-translationally through interactions with the ribosome, molecular chaperones and/or other cellular components. Owing to developments in instrumentation and methodology, solution NMR spectroscopy has emerged as the central experimental approach for the detailed characterization of the complex protein folding processes in vitro and in vivo. NMR relaxation dispersion and saturation transfer methods provide the means for a detailed characterization of protein folding kinetics and thermodynamics under native-like conditions, as well as modeling high-resolution structures of weakly populated short-lived conformational states on the protein folding energy landscape. Continuing development of isotope labeling strategies and NMR methods to probe high molecular weight protein assemblies, along with advances of in-cell NMR, have recently allowed protein folding to be studied in the context of ribosome-nascent chain complexes and molecular chaperones, and even inside living cells. Here we review solution NMR approaches to investigate the protein folding energy landscape, and discuss selected applications of NMR methodology to studying protein folding in vitro and in vivo. Together, these examples highlight a vast potential of solution NMR in providing atomistic insights into molecular mechanisms of protein folding and homeostasis in health and disease. Copyright © 2016 Elsevier B.V. All rights reserved.
Barrick, Doug
2011-01-01
Mapping the stability distributions of proteins in their native folded states provides a critical link between structure, thermodynamics, and function. Linear repeat proteins have proven more amenable to this kind of mapping than globular proteins. C-terminal deletion studies of YopM, a large, linear leucine-rich repeat (LRR) protein, show that stability is distributed quite heterogeneously, yet a high level of cooperativity is maintained [1]. Key components of this distribution are three interfaces that strongly stabilize adjacent sequences, thereby maintaining structural integrity and promoting cooperativity. To better understand the distribution of interaction energy around these critical interfaces, we studied internal (rather than terminal) deletions of three LRRs in this region, including one of these stabilizing interfaces. Contrary to our expectation that deletion of structured repeats should be destabilizing, we find that internal deletion of folded repeats can actually stabilize the native state, suggesting that these repeats are destabilizing, although paradoxically, they are folded in the native state. We identified two residues within this destabilizing segment that deviate from the consensus sequence at a position that normally forms a stacked leucine ladder in the hydrophobic core. Replacement of these nonconsensus residues with leucine is stabilizing. This stability enhancement can be reproduced in the context of nonnative interfaces, but it requires an extended hydrophobic core. Our results demonstrate that different LRRs vary widely in their contribution to stability, and that this variation is context-dependent. These two factors are likely to determine the types of rearrangements that lead to folded, functional proteins, and in turn, are likely to restrict the pathways available for the evolution of linear repeat proteins. PMID:21764506
How the folding rates of two- and multistate proteins depend on the amino acid properties.
Huang, Jitao T; Huang, Wei; Huang, Shanran R; Li, Xin
2014-10-01
Proteins fold by either two-state or multistate kinetic mechanism. We observe that amino acids play different roles in different mechanism. Many residues that are easy to form regular secondary structures (α helices, β sheets and turns) can promote the two-state folding reactions of small proteins. Most of hydrophilic residues can speed up the multistate folding reactions of large proteins. Folding rates of large proteins are equally responsive to the flexibility of partial amino acids. Other properties of amino acids (including volume, polarity, accessible surface, exposure degree, isoelectric point, and phase transfer energy) have contributed little to folding kinetics of the proteins. Cysteine is a special residue, it triggers two-state folding reaction and but inhibits multistate folding reaction. These findings not only provide a new insight into protein structure prediction, but also could be used to direct the point mutations that can change folding rate. © 2014 Wiley Periodicals, Inc.
Course 12: Proteins: Structural, Thermodynamic and Kinetic Aspects
NASA Astrophysics Data System (ADS)
Finkelstein, A. V.
1 Introduction 2 Overview of protein architectures and discussion of physical background of their natural selection 2.1 Protein structures 2.2 Physical selection of protein structures 3 Thermodynamic aspects of protein folding 3.1 Reversible denaturation of protein structures 3.2 What do denatured proteins look like? 3.3 Why denaturation of a globular protein is the first-order phase transition 3.4 "Gap" in energy spectrum: The main characteristic that distinguishes protein chains from random polymers 4 Kinetic aspects of protein folding 4.1 Protein folding in vivo 4.2 Protein folding in vitro (in the test-tube) 4.3 Theory of protein folding rates and solution of the Levinthal paradox
Mapping hydration dynamics and coupled water-protein fluctuations around a protein surface
NASA Astrophysics Data System (ADS)
Zhang, Luyuan; Wang, Lijuan; Kao, Ya-Ting; Qiu, Weihong; Yang, Yi; Okobiah, Oghaghare; Zhong, Dongping
2009-03-01
Elucidation of the molecular mechanism of water-protein interactions is critical to understanding many fundamental aspects of protein science, such as protein folding and misfolding and enzyme catalysis. We recently carried out a global mapping of protein-surface hydration dynamics around a globular α-helical protein apomyoglobin. The intrinsic optical probe tryptophan was employed to scan the protein surface one at a time by site-specific mutagenesis. With femtosecond resolution, we mapped out the dynamics of water-protein interactions with more than 20 mutants and for two states, native and molten globular. A robust bimodal distribution of time scales was observed, representing two types of water motions: local relaxation and protein-coupled fluctuations. The time scales show a strong correlation with the local protein structural rigidity and chemical identity. We also resolved two distinct contributions to the overall Stokes-shifts from the two time scales. These results are significant to understanding the role of hydration water on protein structural stability, dynamics and function.
Folding and unfolding single RNA molecules under tension
Woodside, Michael T; García-García, Cuauhtémoc; Block, Steven M
2010-01-01
Single-molecule force spectroscopy constitutes a powerful method for probing RNA folding: it allows the kinetic, energetic, and structural properties of intermediate and transition states to be determined quantitatively, yielding new insights into folding pathways and energy landscapes. Recent advances in experimental and theoretical methods, including fluctuation theorems, kinetic theories, novel force clamps, and ultrastable instruments, have opened new avenues for study. These tools have been used to probe folding in simple model systems, for example, RNA and DNA hairpins. Knowledge gained from such systems is helping to build our understanding of more complex RNA structures composed of multiple elements, as well as how nucleic acids interact with proteins involved in key cellular activities, such as transcription and translation. PMID:18786653
De Jaco, Antonella; Dubi, Noga; Camp, Shelley; Taylor, Palmer
2017-01-01
The α/β-hydrolase fold superfamily of proteins is composed of structurally related members that, despite great diversity in their catalytic, recognition, adhesion and chaperone functions, share a common fold governed by homologous residues and conserved disulfide bridges. Non-synonymous single nucleotide polymorphisms within the α/β-hydrolase fold domain in various family members have been found for congenital endocrine, metabolic and nervous system disorders. By examining the amino acid sequence from the various proteins, mutations were found to be prevalent in conserved residues within the α/β-hydrolase fold of the homologous proteins. This is the case for the thyroglobulin mutations linked to congenital hypothyroidism. To address whether correct folding of the common domain is required for protein export, we inserted the thyroglobulin mutations at homologous positions in two correlated but simpler α/β-hydrolase fold proteins known to be exported to the cell surface: neuroligin3 and acetylcholinesterase. Here we show that these mutations in the cholinesterase homologous region alter the folding properties of the α/β-hydrolase fold domain, which are reflected in defects in protein trafficking, folding and function, and ultimately result in retention of the partially processed proteins in the endoplasmic reticulum. Accordingly, mutations at conserved residues may be transferred amongst homologous proteins to produce common processing defects despite disparate functions, protein complexity and tissue-specific expression of the homologous proteins. More importantly, a similar assembly of the α/β-hydrolase fold domain tertiary structure among homologous members of the superfamily is required for correct trafficking of the proteins to their final destination. PMID:23035660
Extant fold-switching proteins are widespread.
Porter, Lauren L; Looger, Loren L
2018-06-05
A central tenet of biology is that globular proteins have a unique 3D structure under physiological conditions. Recent work has challenged this notion by demonstrating that some proteins switch folds, a process that involves remodeling of secondary structure in response to a few mutations (evolved fold switchers) or cellular stimuli (extant fold switchers). To date, extant fold switchers have been viewed as rare byproducts of evolution, but their frequency has been neither quantified nor estimated. By systematically and exhaustively searching the Protein Data Bank (PDB), we found ∼100 extant fold-switching proteins. Furthermore, we gathered multiple lines of evidence suggesting that these proteins are widespread in nature. Based on these lines of evidence, we hypothesized that the frequency of extant fold-switching proteins may be underrepresented by the structures in the PDB. Thus, we sought to identify other putative extant fold switchers with only one solved conformation. To do this, we identified two characteristic features of our ∼100 extant fold-switching proteins, incorrect secondary structure predictions and likely independent folding cooperativity, and searched the PDB for other proteins with similar features. Reassuringly, this method identified dozens of other proteins in the literature with indication of a structural change but only one solved conformation in the PDB. Thus, we used it to estimate that 0.5-4% of PDB proteins switch folds. These results demonstrate that extant fold-switching proteins are likely more common than the PDB reflects, which has implications for cell biology, genomics, and human health. Copyright © 2018 the Author(s). Published by PNAS.
Combinatorial Enzyme Design Probes Allostery and Cooperativity in the Trypsin Fold
DOE Office of Scientific and Technical Information (OSTI.GOV)
Page, Michael J.; Di Cera, Enrico; St. Louis-MED)
2010-06-14
Converting one enzyme into another is challenging due to the uneven distribution of important amino acids for function in both protein sequence and structure. We report a strategy for protein engineering allowing an organized mixing and matching of genetic material that leverages lower throughput with increased quality of screens. Our approach successfully tested the contribution of each surface-exposed loop in the trypsin fold alone and the cooperativity of their combinations towards building the substrate selectivity and Na{sup +}-dependent allosteric activation of the protease domain of human coagulation factor Xa into a bacterial trypsin. As the created proteases lack additional proteinmore » domains and protein co-factor activation mechanism requisite for the complexity of blood coagulation, they are stepping-stones towards further understanding and engineering of artificial clotting factors.« less
NASA Astrophysics Data System (ADS)
Bergasa-Caceres, Fernando; Rabitz, Herschel A.
2014-01-01
A model of protein folding kinetics is applied to study the combined effects of protein flexibility and macromolecular crowding on protein folding rate and stability. It is found that the increase in stability and folding rate promoted by macromolecular crowding is damped for proteins with highly flexible native structures. The model is applied to the folding dynamics of the murine prion protein (121-231). It is found that the high flexibility of the native isoform of the murine prion protein (121-231) reduces the effects of macromolecular crowding on its folding dynamics. The relevance of these findings for the pathogenic mechanism are discussed.
Cytosolic thioredoxin reductase 1 is required for correct disulfide formation in the ER.
Poet, Greg J; Oka, Ojore Bv; van Lith, Marcel; Cao, Zhenbo; Robinson, Philip J; Pringle, Marie Anne; Arnér, Elias Sj; Bulleid, Neil J
2017-03-01
Folding of proteins entering the secretory pathway in mammalian cells frequently requires the insertion of disulfide bonds. Disulfide insertion can result in covalent linkages found in the native structure as well as those that are not, so-called non-native disulfides. The pathways for disulfide formation are well characterized, but our understanding of how non-native disulfides are reduced so that the correct or native disulfides can form is poor. Here, we use a novel assay to demonstrate that the reduction in non-native disulfides requires NADPH as the ultimate electron donor, and a robust cytosolic thioredoxin system, driven by thioredoxin reductase 1 (TrxR1 or TXNRD1). Inhibition of this reductive pathway prevents the correct folding and secretion of proteins that are known to form non-native disulfides during their folding. Hence, we have shown for the first time that mammalian cells have a pathway for transferring reducing equivalents from the cytosol to the ER, which is required to ensure correct disulfide formation in proteins entering the secretory pathway. © 2017 The Authors. Published under the terms of the CC BY 4.0 license.
Protein Folding Using a Vortex Fluidic Device.
Britton, Joshua; Smith, Joshua N; Raston, Colin L; Weiss, Gregory A
2017-01-01
Essentially all biochemistry and most molecular biology experiments require recombinant proteins. However, large, hydrophobic proteins typically aggregate into insoluble and misfolded species, and are directed into inclusion bodies. Current techniques to fold proteins recovered from inclusion bodies rely on denaturation followed by dialysis or rapid dilution. Such approaches can be time consuming, wasteful, and inefficient. Here, we describe rapid protein folding using a vortex fluidic device (VFD). This process uses mechanical energy introduced into thin films to rapidly and efficiently fold proteins. With the VFD in continuous flow mode, large volumes of protein solution can be processed per day with 100-fold reductions in both folding times and buffer volumes.
NASA Astrophysics Data System (ADS)
Kochańczyk, Tomasz; Nowakowski, Michał; Wojewska, Dominika; Kocyła, Anna; Ejchart, Andrzej; Koźmiński, Wiktor; Krężel, Artur
2016-11-01
The binding of metal ions at the interface of protein complexes presents a unique and poorly understood mechanism of molecular assembly. A remarkable example is the Rad50 zinc hook domain, which is highly conserved and facilitates the Zn2+-mediated homodimerization of Rad50 proteins. Here, we present a detailed analysis of the structural and thermodynamic effects governing the formation and stability (logK12 = 20.74) of this evolutionarily conserved protein assembly. We have dissected the determinants of the stability contributed by the small β-hairpin of the domain surrounding the zinc binding motif and the coiled-coiled regions using peptides of various lengths from 4 to 45 amino acid residues, alanine substitutions and peptide bond-to-ester perturbations. In the studied series of peptides, an >650 000-fold increase of the formation constant of the dimeric complex arises from favorable enthalpy because of the increased acidity of the cysteine thiols in metal-free form and the structural properties of the dimer. The dependence of the enthalpy on the domain fragment length is partially compensated by the entropic penalty of domain folding, indicating enthalpy-entropy compensation. This study facilitates understanding of the metal-mediated protein-protein interactions in which the metal ion is critical for the tight association of protein subunits.
Kochańczyk, Tomasz; Nowakowski, Michał; Wojewska, Dominika; Kocyła, Anna; Ejchart, Andrzej; Koźmiński, Wiktor; Krężel, Artur
2016-01-01
The binding of metal ions at the interface of protein complexes presents a unique and poorly understood mechanism of molecular assembly. A remarkable example is the Rad50 zinc hook domain, which is highly conserved and facilitates the Zn2+-mediated homodimerization of Rad50 proteins. Here, we present a detailed analysis of the structural and thermodynamic effects governing the formation and stability (logK12 = 20.74) of this evolutionarily conserved protein assembly. We have dissected the determinants of the stability contributed by the small β-hairpin of the domain surrounding the zinc binding motif and the coiled-coiled regions using peptides of various lengths from 4 to 45 amino acid residues, alanine substitutions and peptide bond-to-ester perturbations. In the studied series of peptides, an >650 000-fold increase of the formation constant of the dimeric complex arises from favorable enthalpy because of the increased acidity of the cysteine thiols in metal-free form and the structural properties of the dimer. The dependence of the enthalpy on the domain fragment length is partially compensated by the entropic penalty of domain folding, indicating enthalpy-entropy compensation. This study facilitates understanding of the metal-mediated protein-protein interactions in which the metal ion is critical for the tight association of protein subunits. PMID:27808280
Secreted Proteins Defy the Expression Level–Evolutionary Rate Anticorrelation
Feyertag, Felix; Berninsone, Patricia M.; Alvarez-Ponce, David
2017-01-01
The rates of evolution of the proteins of any organism vary across orders of magnitude. A primary factor influencing rates of protein evolution is expression. A strong negative correlation between expression levels and evolutionary rates (the so-called E–R anticorrelation) has been observed in virtually all studied organisms. This effect is currently attributed to the abundance-dependent fitness costs of misfolding and unspecific protein–protein interactions, among other factors. Secreted proteins are folded in the endoplasmic reticulum, a compartment where chaperones, folding catalysts, and stringent quality control mechanisms promote their correct folding and may reduce the fitness costs of misfolding. In addition, confinement of secreted proteins to the extracellular space may reduce misinteractions and their deleterious effects. We hypothesize that each of these factors (the secretory pathway quality control and extracellular location) may reduce the strength of the E–R anticorrelation. Indeed, here we show that among human proteins that are secreted to the extracellular space, rates of evolution do not correlate with protein abundances. This trend is robust to controlling for several potentially confounding factors and is also observed when analyzing protein abundance data for 6 human tissues. In addition, analysis of mRNA abundance data for 32 human tissues shows that the E–R correlation is always less negative, and sometimes nonsignificant, in secreted proteins. Similar observations were made in Caenorhabditis elegans and in Escherichia coli, and to a lesser extent in Drosophila melanogaster, Saccharomyces cerevisiae and Arabidopsis thaliana. Our observations contribute to understand the causes of the E–R anticorrelation. PMID:28007979
Exhaustive comparison and classification of ligand-binding surfaces in proteins
Murakami, Yoichi; Kinoshita, Kengo; Kinjo, Akira R; Nakamura, Haruki
2013-01-01
Many proteins function by interacting with other small molecules (ligands). Identification of ligand-binding sites (LBS) in proteins can therefore help to infer their molecular functions. A comprehensive comparison among local structures of LBSs was previously performed, in order to understand their relationships and to classify their structural motifs. However, similar exhaustive comparison among local surfaces of LBSs (patches) has never been performed, due to computational complexity. To enhance our understanding of LBSs, it is worth performing such comparisons among patches and classifying them based on similarities of their surface configurations and electrostatic potentials. In this study, we first developed a rapid method to compare two patches. We then clustered patches corresponding to the same PDB chemical component identifier for a ligand, and selected a representative patch from each cluster. We subsequently exhaustively as compared the representative patches and clustered them using similarity score, PatSim. Finally, the resultant PatSim scores were compared with similarities of atomic structures of the LBSs and those of the ligand-binding protein sequences and functions. Consequently, we classified the patches into ∼2000 well-characterized clusters. We found that about 63% of these clusters are used in identical protein folds, although about 25% of the clusters are conserved in distantly related proteins and even in proteins with cross-fold similarity. Furthermore, we showed that patches with higher PatSim score have potential to be involved in similar biological processes. PMID:23934772
NASA Technical Reports Server (NTRS)
Weaver, D. L.
1982-01-01
Theoretical methods and solutions of the dynamics of protein folding, protein aggregation, protein structure, and the origin of life are discussed. The elements of a dynamic model representing the initial stages of protein folding are presented. The calculation and experimental determination of the model parameters are discussed. The use of computer simulation for modeling protein folding is considered.
Modeling repetitive, non‐globular proteins
Basu, Koli; Campbell, Robert L.; Guo, Shuaiqi; Sun, Tianjun
2016-01-01
Abstract While ab initio modeling of protein structures is not routine, certain types of proteins are more straightforward to model than others. Proteins with short repetitive sequences typically exhibit repetitive structures. These repetitive sequences can be more amenable to modeling if some information is known about the predominant secondary structure or other key features of the protein sequence. We have successfully built models of a number of repetitive structures with novel folds using knowledge of the consensus sequence within the sequence repeat and an understanding of the likely secondary structures that these may adopt. Our methods for achieving this success are reviewed here. PMID:26914323
LANL Transfers Glowing Bio Technology to Sandia Biotech
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rorick, Kevin; Nakhla, Tony; Pino, Tony
2012-05-21
Partnering with Los Alamos National Laboratory, an Albuquerque-based company is seeking to transform the way protein and peptide analysis is conducted around the world. Sandia Biotech is using a biological technology licensed from Los Alamos called split green fluorescent protein (sGFP), as a detecting and tracking tool for the protein and peptide industry, valuable in the fields of Alzheimer's research, drug development and other biotechnology fields using protein folding to understand protein expression and mechanisms of action. http://www.lanl.gov/news/stories/glowing-future-for-los-alamos-and-sandia-b iotech-partnership.html
LANL Transfers Glowing Bio Technology to Sandia Biotech
Rorick, Kevin; Nakhla, Tony; Pino, Tony; Hadley, David
2018-03-02
Partnering with Los Alamos National Laboratory, an Albuquerque-based company is seeking to transform the way protein and peptide analysis is conducted around the world. Sandia Biotech is using a biological technology licensed from Los Alamos called split green fluorescent protein (sGFP), as a detecting and tracking tool for the protein and peptide industry, valuable in the fields of Alzheimer's research, drug development and other biotechnology fields using protein folding to understand protein expression and mechanisms of action. http://www.lanl.gov/news/stories/glowing-future-for-los-alamos-and-sandia-b iotech-partnership.html
DOE Office of Scientific and Technical Information (OSTI.GOV)
Guu, Tom S.Y.; Liu, Zheng; Ye, Qiaozhen
Hepatitis E virus (HEV), a small, non-enveloped RNA virus in the family Hepeviridae, is associated with endemic and epidemic acute viral hepatitis in developing countries. Our 3.5-{angstrom} structure of a HEV-like particle (VLP) shows that each capsid protein contains 3 linear domains that form distinct structural elements: S, the continuous capsid; P1, 3-fold protrusions; and P2, 2-fold spikes. The S domain adopts a jelly-roll fold commonly observed in small RNA viruses. The P1 and P2 domains both adopt {beta}-barrel folds. Each domain possesses a potential polysaccharide-binding site that may function in cell-receptor binding. Sugar binding to P1 at the capsidmore » protein interface may lead to capsid disassembly and cell entry. Structural modeling indicates that native T = 3 capsid contains flat dimers, with less curvature than those of T = 1 VLP. Our findings significantly advance the understanding of HEV molecular biology and have application to the development of vaccines and antiviral medications.« less
Progress towards mapping the universe of protein folds
Grant, Alastair; Lee, David; Orengo, Christine
2004-01-01
Although the precise aims differ between the various international structural genomics initiatives currently aiming to illuminate the universe of protein folds, many selectively target protein families for which the fold is unknown. How well can the current set of known protein families and folds be used to estimate the total number of folds in nature, and will structural genomics initiatives yield representatives for all the major protein families within a reasonable time scale? PMID:15128436
Xia, Jiaqi; Peng, Zhenling; Qi, Dawei; Mu, Hongbo; Yang, Jianyi
2017-03-15
Protein fold classification is a critical step in protein structure prediction. There are two possible ways to classify protein folds. One is through template-based fold assignment and the other is ab-initio prediction using machine learning algorithms. Combination of both solutions to improve the prediction accuracy was never explored before. We developed two algorithms, HH-fold and SVM-fold for protein fold classification. HH-fold is a template-based fold assignment algorithm using the HHsearch program. SVM-fold is a support vector machine-based ab-initio classification algorithm, in which a comprehensive set of features are extracted from three complementary sequence profiles. These two algorithms are then combined, resulting to the ensemble approach TA-fold. We performed a comprehensive assessment for the proposed methods by comparing with ab-initio methods and template-based threading methods on six benchmark datasets. An accuracy of 0.799 was achieved by TA-fold on the DD dataset that consists of proteins from 27 folds. This represents improvement of 5.4-11.7% over ab-initio methods. After updating this dataset to include more proteins in the same folds, the accuracy increased to 0.971. In addition, TA-fold achieved >0.9 accuracy on a large dataset consisting of 6451 proteins from 184 folds. Experiments on the LE dataset show that TA-fold consistently outperforms other threading methods at the family, superfamily and fold levels. The success of TA-fold is attributed to the combination of template-based fold assignment and ab-initio classification using features from complementary sequence profiles that contain rich evolution information. http://yanglab.nankai.edu.cn/TA-fold/. yangjy@nankai.edu.cn or mhb-506@163.com. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Waldo, Geoffrey S.
2007-09-18
The current invention provides methods of improving folding of polypeptides using a poorly folding domain as a component of a fusion protein comprising the poorly folding domain and a polypeptide of interest to be improved. The invention also provides novel green fluorescent proteins (GFPs) and red fluorescent proteins that have enhanced folding properties.
Internal Structure and Preferential Protein Binding of Colloidal Aggregates.
Duan, Da; Torosyan, Hayarpi; Elnatan, Daniel; McLaughlin, Christopher K; Logie, Jennifer; Shoichet, Molly S; Agard, David A; Shoichet, Brian K
2017-01-20
Colloidal aggregates of small molecules are the most common artifact in early drug discovery, sequestering and inhibiting target proteins without specificity. Understanding their structure and mechanism has been crucial to developing tools to control for, and occasionally even exploit, these particles. Unfortunately, their polydispersity and transient stability have prevented exploration of certain elementary properties, such as how they pack. Dye-stabilized colloidal aggregates exhibit enhanced homogeneity and stability when compared to conventional colloidal aggregates, enabling investigation of some of these properties. By small-angle X-ray scattering and multiangle light scattering, pair distance distribution functions suggest that the dye-stabilized colloids are filled, not hollow, spheres. Stability of the coformulated colloids enabled investigation of their preference for binding DNA, peptides, or folded proteins, and their ability to purify one from the other. The coformulated colloids showed little ability to bind DNA. Correspondingly, the colloids preferentially sequestered protein from even a 1600-fold excess of peptides that are themselves the result of a digest of the same protein. This may reflect the avidity advantage that a protein has in a surface-to-surface interaction with the colloids. For the first time, colloids could be shown to have preferences of up to 90-fold for particular proteins over others. Loaded onto the colloids, bound enzyme could be spun down, resuspended, and released back into buffer, regaining most of its activity. Implications of these observations for colloid mechanisms and utility will be considered.
From Split to Sibenik: The Tortuous Pathway in the Cholinesterase Field
Taylor, Palmer
2010-01-01
The interim between the first and tenth International Cholinesterase meetings has seen remarkable advances associated with the applications of structural biology and recombinant DNA methodology to our field. The cloning of the cholinesterase genes led to the identification of a new super family of proteins, termed the α,β–hydrolase fold; members of this family possess a four helix bundle capable of linking structural subunits to the functioning globular protein. Sequence comparisons and three dimensional structural studies revealed unexpected cousins possessing this fold that, in turn, revealed three distinct functions for the α,β-hydrolase proteins. These encompass: (1) a capacity for hydrolytic cleavage of a great variety of substrates, (2) a heterophilic adhesion function that results in trans-synaptic associations in linked neurons, (3) a chaperone function leading to stabilization of nascent protein and its trafficking to an extracellular or secretory storage location. The analysis and modification of structure may go beyond understanding mechanism, since it may be possible to convert the cholinesterases to efficient detoxifying agents of organophosphatases assisted by added oximes. Also, the study of the relationship between the α,β–hydrolase fold proteins and their biosynthesis may yield means by which aberrant trafficking may be corrected, enhancing expression of mutant proteins. Those engaged in cholinesterase research should take great pride in our accomplishments punctuated by the series of ten meetings. The momentum established and initial studies with related proteins all hold great promise for the future. PMID:20493179
Nissley, Daniel A.; Sharma, Ajeet K.; Ahmed, Nabeel; Friedrich, Ulrike A.; Kramer, Günter; Bukau, Bernd; O'Brien, Edward P.
2016-01-01
The rates at which domains fold and codons are translated are important factors in determining whether a nascent protein will co-translationally fold and function or misfold and malfunction. Here we develop a chemical kinetic model that calculates a protein domain's co-translational folding curve during synthesis using only the domain's bulk folding and unfolding rates and codon translation rates. We show that this model accurately predicts the course of co-translational folding measured in vivo for four different protein molecules. We then make predictions for a number of different proteins in yeast and find that synonymous codon substitutions, which change translation-elongation rates, can switch some protein domains from folding post-translationally to folding co-translationally—a result consistent with previous experimental studies. Our approach explains essential features of co-translational folding curves and predicts how varying the translation rate at different codon positions along a transcript's coding sequence affects this self-assembly process. PMID:26887592
Glyakina, Anna V; Pereyaslavets, Leonid B; Galzitskaya, Oxana V
2013-09-01
Despite the large number of publications on three-helix protein folding, there is no study devoted to the influence of handedness on the rate of three-helix protein folding. From the experimental studies, we make a conclusion that the left-handed three-helix proteins fold faster than the right-handed ones. What may explain this difference? An important question arising in this paper is whether the modeling of protein folding can catch the difference between the protein folding rates of proteins with similar structures but with different folding mechanisms. To answer this question, the folding of eight three-helix proteins (four right-handed and four left-handed), which are similar in size, was modeled using the Monte Carlo and dynamic programming methods. The studies allowed us to determine the orders of folding of the secondary-structure elements in these domains and amino acid residues which are important for the folding. The obtained data are in good correlation with each other and with the experimental data. Structural analysis of these proteins demonstrated that the left-handed domains have a lesser number of contacts per residue and a smaller radius of cross section than the right-handed domains. This may be one of the explanations of the observed fact. The same tendency is observed for the large dataset consisting of 332 three-helix proteins (238 right- and 94 left-handed). From our analysis, we found that the left-handed three-helix proteins have some less-dense packing that should result in faster folding for some proteins as compared to the case of right-handed proteins. Copyright © 2013 Wiley Periodicals, Inc.
Accelerated molecular dynamics simulations of protein folding.
Miao, Yinglong; Feixas, Ferran; Eun, Changsun; McCammon, J Andrew
2015-07-30
Folding of four fast-folding proteins, including chignolin, Trp-cage, villin headpiece and WW domain, was simulated via accelerated molecular dynamics (aMD). In comparison with hundred-of-microsecond timescale conventional molecular dynamics (cMD) simulations performed on the Anton supercomputer, aMD captured complete folding of the four proteins in significantly shorter simulation time. The folded protein conformations were found within 0.2-2.1 Å of the native NMR or X-ray crystal structures. Free energy profiles calculated through improved reweighting of the aMD simulations using cumulant expansion to the second-order are in good agreement with those obtained from cMD simulations. This allows us to identify distinct conformational states (e.g., unfolded and intermediate) other than the native structure and the protein folding energy barriers. Detailed analysis of protein secondary structures and local key residue interactions provided important insights into the protein folding pathways. Furthermore, the selections of force fields and aMD simulation parameters are discussed in detail. Our work shows usefulness and accuracy of aMD in studying protein folding, providing basic references in using aMD in future protein-folding studies. © 2015 Wiley Periodicals, Inc.
Synergistic Inhibition of Protein Fibrillation by Proline and Sorbitol: Biophysical Investigations
Choudhary, Sinjan; Save, Shreyada N.; Kishore, Nand; Hosur, Ramakrishna V.
2016-01-01
We report here interesting synergistic effects of proline and sorbitol, two well-known chemical chaperones, in the inhibition of fibrillation of two proteins, insulin and lysozyme. A combination of many biophysical techniques has been used to understand the structural morphology and modes of interaction of the chaperones with the proteins during fibrillation. Both the chaperones establish stronger polar interactions in the elongation and saturation stages of fibrillation compared to that in the native stage. However, when presented as a mixture, we also see contribution of hydrophobic interactions. Thus, a co-operative adjustment of polar and hydrophobic interactions between the chaperones and the protein surface seems to drive the synergistic effects in the fibrillation process. In insulin, this synergy is quantitatively similar in all the stages of the fibrillation process. These observations would have significant implications for understanding protein folding concepts, in general, and for designing combination therapies against protein fibrillation, in particular. PMID:27870861
Synergistic Inhibition of Protein Fibrillation by Proline and Sorbitol: Biophysical Investigations.
Choudhary, Sinjan; Save, Shreyada N; Kishore, Nand; Hosur, Ramakrishna V
2016-01-01
We report here interesting synergistic effects of proline and sorbitol, two well-known chemical chaperones, in the inhibition of fibrillation of two proteins, insulin and lysozyme. A combination of many biophysical techniques has been used to understand the structural morphology and modes of interaction of the chaperones with the proteins during fibrillation. Both the chaperones establish stronger polar interactions in the elongation and saturation stages of fibrillation compared to that in the native stage. However, when presented as a mixture, we also see contribution of hydrophobic interactions. Thus, a co-operative adjustment of polar and hydrophobic interactions between the chaperones and the protein surface seems to drive the synergistic effects in the fibrillation process. In insulin, this synergy is quantitatively similar in all the stages of the fibrillation process. These observations would have significant implications for understanding protein folding concepts, in general, and for designing combination therapies against protein fibrillation, in particular.
NASA Astrophysics Data System (ADS)
Zorgani, Mohamed Amine; Patron, Kevin; Desvaux, Mickaël
2014-07-01
Proteins from halophilic archaea, which live in extreme saline conditions, have evolved to remain folded, active and stable at very high ionic strengths. Understanding the mechanism of haloadaptation is the first step toward engineering of halostable biomolecules. Amylases are one of the main enzymes used in industry. Yet, no three-dimensional structure has been experimentally resolved for α-amylases from halophilic archaea. In this study, homology structure modeling of α-amylases from the halophilic archaea Haloarcula marismortui, Haloarcula hispanica, and Halalkalicoccus jeotgali were performed. The resulting models were subjected to energy minimization, evaluation, and structural analysis. Calculations of the amino acid composition, salt bridges and hydrophobic interactions were also performed and compared to a set of non-halophilic counterparts. It clearly appeared that haloarchaeal α-amylases exhibited lower propensities for helix formation and higher propensities for coil-forming regions. Furthermore, they could maintain a folded and stable conformation in high salt concentration through highly negative charged surface with over representation of acidic residues, especially Asp, and low hydrophobicity with increase of salt bridges and decrease in hydrophobic interactions on the protein surface. This study sheds some light on the stability of α-amylases from halophilic archaea and provides strong basis not only to understand haloadaptation mechanisms of proteins in microorganisms from hypersalines environments but also for biotechnological applications.
Zorgani, Mohamed Amine; Patron, Kevin; Desvaux, Mickaël
2014-07-01
Proteins from halophilic archaea, which live in extreme saline conditions, have evolved to remain folded, active and stable at very high ionic strengths. Understanding the mechanism of haloadaptation is the first step toward engineering of halostable biomolecules. Amylases are one of the main enzymes used in industry. Yet, no three-dimensional structure has been experimentally resolved for α-amylases from halophilic archaea. In this study, homology structure modeling of α-amylases from the halophilic archaea Haloarcula marismortui, Haloarcula hispanica, and Halalkalicoccus jeotgali were performed. The resulting models were subjected to energy minimization, evaluation, and structural analysis. Calculations of the amino acid composition, salt bridges and hydrophobic interactions were also performed and compared to a set of non-halophilic counterparts. It clearly appeared that haloarchaeal α-amylases exhibited lower propensities for helix formation and higher propensities for coil-forming regions. Furthermore, they could maintain a folded and stable conformation in high salt concentration through highly negative charged surface with over representation of acidic residues, especially Asp, and low hydrophobicity with increase of salt bridges and decrease in hydrophobic interactions on the protein surface. This study sheds some light on the stability of α-amylases from halophilic archaea and provides strong basis not only to understand haloadaptation mechanisms of proteins in microorganisms from hypersalines environments but also for biotechnological applications.
Shao, Qiang
2014-06-05
A comparative study on the folding of multiple three-α-helix bundle proteins including α3D, α3W, and the B domain of protein A (BdpA) is presented. The use of integrated-tempering-sampling molecular dynamics simulations achieves reversible folding and unfolding events in individual short trajectories, which thus provides an efficient approach to sufficiently sample the configuration space of protein and delineate the folding pathway of α-helix bundle. The detailed free energy landscape analyses indicate that the folding mechanism of α-helix bundle is not uniform but sequence dependent. A simple model is then proposed to predict folding mechanism of α-helix bundle on the basis of amino acid composition: α-helical proteins containing higher percentage of hydrophobic residues than charged ones fold via nucleation-condensation mechanism (e.g., α3D and BdpA) whereas proteins having opposite tendency in amino acid composition more likely fold via the framework mechanism (e.g., α3W). The model is tested on various α-helix bundle proteins, and the predicted mechanism is similar to the most approved one for each protein. In addition, the common features in the folding pathway of α-helix bundle protein are also deduced. In summary, the present study provides comprehensive, atomic-level picture of the folding of α-helix bundle proteins.
Serohijos, Adrian W.R.; Shakhnovich, Eugene I.
2014-01-01
The variation among sequences and structures in nature is both determined by physical laws and by evolutionary history. However, these two factors are traditionally investigated by disciplines with different emphasis and philosophy—molecular biophysics on one hand and evolutionary population genetics in another. Here, we review recent theoretical and computational approaches that address the critical need to integrate these two disciplines. We first articulate the elements of these integrated approaches. Then, we survey their contribution to our mechanistic understanding of molecular evolution, the polymorphisms in coding region, the distribution of fitness effects (DFE) of mutations, the observed folding stability of proteins in nature, and the distribution of protein folds in genomes. PMID:24952216
Serohijos, Adrian W R; Shakhnovich, Eugene I
2014-06-01
The variation among sequences and structures in nature is both determined by physical laws and by evolutionary history. However, these two factors are traditionally investigated by disciplines with different emphasis and philosophy-molecular biophysics on one hand and evolutionary population genetics in another. Here, we review recent theoretical and computational approaches that address the crucial need to integrate these two disciplines. We first articulate the elements of these approaches. Then, we survey their contribution to our mechanistic understanding of molecular evolution, the polymorphisms in coding region, the distribution of fitness effects (DFE) of mutations, the observed folding stability of proteins in nature, and the distribution of protein folds in genomes. Copyright © 2014 Elsevier Ltd. All rights reserved.
General mechanism of two-state protein folding kinetics.
Rollins, Geoffrey C; Dill, Ken A
2014-08-13
We describe here a general model of the kinetic mechanism of protein folding. In the Foldon Funnel Model, proteins fold in units of secondary structures, which form sequentially along the folding pathway, stabilized by tertiary interactions. The model predicts that the free energy landscape has a volcano shape, rather than a simple funnel, that folding is two-state (single-exponential) when secondary structures are intrinsically unstable, and that each structure along the folding path is a transition state for the previous structure. It shows how sequential pathways are consistent with multiple stochastic routes on funnel landscapes, and it gives good agreement with the 9 order of magnitude dependence of folding rates on protein size for a set of 93 proteins, at the same time it is consistent with the near independence of folding equilibrium constant on size. This model gives estimates of folding rates of proteomes, leading to a median folding time in Escherichia coli of about 5 s.
NASA Astrophysics Data System (ADS)
Lei, Hongxing; Wu, Chun; Wang, Zhi-Xiang; Zhou, Yaoqi; Duan, Yong
2008-06-01
Reaching the native states of small proteins, a necessary step towards a comprehensive understanding of the folding mechanisms, has remained a tremendous challenge to ab initio protein folding simulations despite the extensive effort. In this work, the folding process of the B domain of protein A (BdpA) has been simulated by both conventional and replica exchange molecular dynamics using AMBER FF03 all-atom force field. Started from an extended chain, a total of 40 conventional (each to 1.0 μs) and two sets of replica exchange (each to 200.0 ns per replica) molecular dynamics simulations were performed with different generalized-Born solvation models and temperature control schemes. The improvements in both the force field and solvent model allowed successful simulations of the folding process to the native state as demonstrated by the 0.80 A˚ Cα root mean square deviation (RMSD) of the best folded structure. The most populated conformation was the native folded structure with a high population. This was a significant improvement over the 2.8 A˚ Cα RMSD of the best nativelike structures from previous ab initio folding studies on BdpA. To the best of our knowledge, our results demonstrate, for the first time, that ab initio simulations can reach the native state of BdpA. Consistent with experimental observations, including Φ-value analyses, formation of helix II/III hairpin was a crucial step that provides a template upon which helix I could form and the folding process could complete. Early formation of helix III was observed which is consistent with the experimental results of higher residual helical content of isolated helix III among the three helices. The calculated temperature-dependent profile and the melting temperature were in close agreement with the experimental results. The simulations further revealed that phenylalanine 31 may play critical to achieve the correct packing of the three helices which is consistent with the experimental observation. In addition to the mechanistic studies, an ab initio structure prediction was also conducted based on both the physical energy and a statistical potential. Based on the lowest physical energy, the predicted structure was 2.0 A˚ Cα RMSD away from the experimentally determined structure.
Secreted Proteins Defy the Expression Level-Evolutionary Rate Anticorrelation.
Feyertag, Felix; Berninsone, Patricia M; Alvarez-Ponce, David
2017-03-01
The rates of evolution of the proteins of any organism vary across orders of magnitude. A primary factor influencing rates of protein evolution is expression. A strong negative correlation between expression levels and evolutionary rates (the so-called E-R anticorrelation) has been observed in virtually all studied organisms. This effect is currently attributed to the abundance-dependent fitness costs of misfolding and unspecific protein-protein interactions, among other factors. Secreted proteins are folded in the endoplasmic reticulum, a compartment where chaperones, folding catalysts, and stringent quality control mechanisms promote their correct folding and may reduce the fitness costs of misfolding. In addition, confinement of secreted proteins to the extracellular space may reduce misinteractions and their deleterious effects. We hypothesize that each of these factors (the secretory pathway quality control and extracellular location) may reduce the strength of the E-R anticorrelation. Indeed, here we show that among human proteins that are secreted to the extracellular space, rates of evolution do not correlate with protein abundances. This trend is robust to controlling for several potentially confounding factors and is also observed when analyzing protein abundance data for 6 human tissues. In addition, analysis of mRNA abundance data for 32 human tissues shows that the E-R correlation is always less negative, and sometimes nonsignificant, in secreted proteins. Similar observations were made in Caenorhabditis elegans and in Escherichia coli, and to a lesser extent in Drosophila melanogaster, Saccharomyces cerevisiae and Arabidopsis thaliana. Our observations contribute to understand the causes of the E-R anticorrelation. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Naimuddin, Mohammed; Kubo, Tai
2011-12-01
We report an efficient system to produce and display properly folded disulfide-rich proteins facilitated by coupled complementary DNA (cDNA) display and protein disulfide isomerase-assisted folding. The results show that a neurotoxin protein containing four disulfide linkages can be displayed in the folded state. Furthermore, it can be refolded on a solid support that binds efficiently to its natural acetylcholine receptor. Probing the efficiency of the display proteins prepared by these methods provided up to 8-fold higher enrichment by the selective enrichment method compared with cDNA display alone, more than 10-fold higher binding to its receptor by the binding assays, and more than 10-fold higher affinities by affinity measurements. Cotranslational folding was found to have better efficiency than posttranslational refolding between the two investigated methods. We discuss the utilities of efficient display of such proteins in the preparation of superior quality proteins and protein libraries for directed evolution leading to ligand discovery. Copyright © 2011 Elsevier Inc. All rights reserved.
Small protein domains fold inside the ribosome exit tunnel.
Marino, Jacopo; von Heijne, Gunnar; Beckmann, Roland
2016-03-01
Cotranslational folding of small protein domains within the ribosome exit tunnel may be an important cellular strategy to avoid protein misfolding. However, the pathway of cotranslational folding has so far been described only for a few proteins, and therefore, it is unclear whether folding in the ribosome exit tunnel is a common feature for small protein domains. Here, we have analyzed nine small protein domains and determined at which point during translation their folding generates sufficient force on the nascent chain to release translational arrest by the SecM arrest peptide, both in vitro and in live E. coli cells. We find that all nine protein domains initiate folding while still located well within the ribosome exit tunnel. © 2016 Federation of European Biochemical Societies.
Strategies for achieving high-level expression of genes in Escherichia coli.
Makrides, S C
1996-01-01
Progress in our understanding of several biological processes promises to broaden the usefulness of Escherichia coli as a tool for gene expression. There is an expanding choice of tightly regulated prokaryotic promoters suitable for achieving high-level gene expression. New host strains facilitate the formation of disulfide bonds in the reducing environment of the cytoplasm and offer higher protein yields by minimizing proteolytic degradation. Insights into the process of protein translocation across the bacterial membranes may eventually make it possible to achieve robust secretion of specific proteins into the culture medium. Studies involving molecular chaperones have shown that in specific cases, chaperones can be very effective for improved protein folding, solubility, and membrane transport. Negative results derived from such studies are also instructive in formulating different strategies. The remarkable increase in the availability of fusion partners offers a wide range of tools for improved protein folding, solubility, protection from proteases, yield, and secretion into the culture medium, as well as for detection and purification of recombinant proteins. Codon usage is known to present a potential impediment to high-level gene expression in E. coli. Although we still do not understand all the rules governing this phenomenon, it is apparent that "rare" codons, depending on their frequency and context, can have an adverse effect on protein levels. Usually, this problem can be alleviated by modification of the relevant codons or by coexpression of the cognate tRNA genes. Finally, the elucidation of specific determinants of protein degradation, a plethora of protease-deficient host strains, and methods to stabilize proteins afford new strategies to minimize proteolytic susceptibility of recombinant proteins in E. coli. PMID:8840785
Improvement on a simplified model for protein folding simulation.
Zhang, Ming; Chen, Changjun; He, Yi; Xiao, Yi
2005-11-01
Improvements were made on a simplified protein model--the Ramachandran model-to achieve better computer simulation of protein folding. To check the validity of such improvements, we chose the ultrafast folding protein Engrailed Homeodomain as an example and explored several aspects of its folding. The engrailed homeodomain is a mainly alpha-helical protein of 61 residues from Drosophila melanogaster. We found that the simplified model of Engrailed Homeodomain can fold into a global minimum state with a tertiary structure in good agreement with its native structure.
Theoretical and computational studies in protein folding, design, and function
NASA Astrophysics Data System (ADS)
Morrissey, Michael Patrick
2000-10-01
In this work, simplified statistical models are used to understand an array of processes related to protein folding and design. In Part I, lattice models are utilized to test several theories about the statistical properties of protein-like systems. In Part II, sequence analysis and all-atom simulations are used to advance a novel theory for the behavior of a particular protein. Part I is divided into five chapters. In Chapter 2, a method of sequence design for model proteins, based on statistical mechanical first-principles, is developed. The cumulant design method uses a mean-field approximation to expand the free energy of a sequence in temperature. The method successfully designs sequences which fold to a target lattice structure at a specific temperature, a feat which was not possible using previous design methods. The next three chapters are computational studies of the double mutant cycle, which has been used experimentally to predict intra-protein interactions. Complete structure prediction is demonstrated for a model system using exhaustive, and also sub-exhaustive, double mutants. Nonadditivity of enthalpy, rather than of free energy, is proposed and demonstrated to be a superior marker for inter-residue contact. Next, a new double mutant protocol, called exchange mutation, is introduced. Although simple statistical arguments predict exchange mutation to be a more accurate contact predictor than standard mutant cycles, this hypothesis was not upheld in lattice simulations. Reasons for this inconsistency will be discussed. Finally, a multi-chain folding algorithm is introduced. Known as LINKS, this algorithm was developed to test a method of structure prediction which utilizes chain-break mutants. While structure prediction was not successful, LINKS should nevertheless be a useful tool for the study of protein-protein and protein-ligand interactions. The last chapter of Part I utilizes the lattice to explore the differences between standard folding, from the fully denatured state, and cotranslational folding, whereby one end of a protein is synthesized and released before the other. Cotranslational folding is shown to accelerate folding kinetics, particularly when the target backbone contains many local contacts. Additionally, cotranslation is shown capable of "guiding" a model protein into a metastable, local contact-rich state, despite the existence of a true native state of much lower energy. In Part II, a model is developed for the behavior of PrP, a unique mammalian protein which has been shown to possess two native states. The pathogenic "scrapie" state PrPSc, which has not been structurally characterized, is known to trigger conversion of the characterized endogenous conformation PrPC into additional PrPSc, Residues 144--153 are shown to form the most hydrophilic naturally occurring alpha-helix, out of a broad database with more than 10,000 candidates. The novel beta-nucleation model proposes that PrPSc, is not a distinct mono-molecular state, but is rather a beta-sheet-like aggregate centered around helix-1 components of multiple PrP molecules. The remainder of Part II uses molecular dynamics simulations to support the beta-nucleation hypothesis, and to propose a system of peptide ligands which may arrest the process of prion propagation.
Krivoruchko, Anastasia; Storey, Kenneth B
2014-10-01
ChREBP (carbohydrate response element binding protein) is a glucose-responsive transcription factor that is known to be an important regulator of glycolytic and lipogenic genes in response to glucose. We hypothesized that activation of ChREBP could be relevant to anoxia survival by the anoxia-tolerant turtle, Trachemys scripta elegans. Expression of ChREBP in response to 5 and 20h of anoxia was examined using RT-PCR and Western immunoblotting. In addition, subcellular localization and DNA-binding activity of ChREBP protein were assessed and transcript levels of liver pyruvate kinase (LPK), a downstream gene under ChREBP control were quantified using RT-PCR. ChREBP was anoxia-responsive in kidney and liver, with transcript levels increasing by 1.2-1.8 fold in response to anoxia and protein levels increasing by 1.8-1.9 fold. Enhanced nuclear presence under anoxia was also observed in both tissues by 2.2-2.8 fold. A 4.2 fold increase in DNA binding activity of ChREBP was also observed in liver in response to 5h of anoxia. In addition, transcript levels of LPK increased by 2.1 fold in response to 5h of anoxia in the liver. The results suggest that activation of ChREBP in response to anoxia might be a crucial factor for anoxia survival in turtle liver by contributing to elevated glycolytic flux in the initial phases of oxygen limitation. This study provides the first demonstration of activation of ChREBP in response to anoxia in a natural model of anoxia tolerance, further improving our understanding of the molecular nature of anoxia tolerance. Copyright © 2014 Elsevier B.V. All rights reserved.
Amyloid Polymorphism in the Protein Folding and Aggregation Energy Landscape.
Adamcik, Jozef; Mezzenga, Raffaele
2018-02-15
Protein folding involves a large number of steps and conformations in which the folding protein samples different thermodynamic states characterized by local minima. Kinetically trapped on- or off-pathway intermediates are metastable folding intermediates towards the lowest absolute energy minima, which have been postulated to be the natively folded state where intramolecular interactions dominate, and the amyloid state where intermolecular interactions dominate. However, this view largely neglects the rich polymorphism found within amyloid species. We review the protein folding energy landscape in view of recent findings identifying specific transition routes among different amyloid polymorphs. Observed transitions such as twisted ribbon→crystal or helical ribbon→nanotube, and forbidden transitions such helical ribbon↛crystal, are discussed and positioned within the protein folding and aggregation energy landscape. Finally, amyloid crystals are identified as the ground state of the protein folding and aggregation energy landscape. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins.
Raimondi, Daniele; Orlando, Gabriele; Pancsa, Rita; Khan, Taushif; Vranken, Wim F
2017-08-18
Protein folding is a complex process that can lead to disease when it fails. Especially poorly understood are the very early stages of protein folding, which are likely defined by intrinsic local interactions between amino acids close to each other in the protein sequence. We here present EFoldMine, a method that predicts, from the primary amino acid sequence of a protein, which amino acids are likely involved in early folding events. The method is based on early folding data from hydrogen deuterium exchange (HDX) data from NMR pulsed labelling experiments, and uses backbone and sidechain dynamics as well as secondary structure propensities as features. The EFoldMine predictions give insights into the folding process, as illustrated by a qualitative comparison with independent experimental observations. Furthermore, on a quantitative proteome scale, the predicted early folding residues tend to become the residues that interact the most in the folded structure, and they are often residues that display evolutionary covariation. The connection of the EFoldMine predictions with both folding pathway data and the folded protein structure suggests that the initial statistical behavior of the protein chain with respect to local structure formation has a lasting effect on its subsequent states.
Insights Into the Role of Collagen in Vocal Fold Health and Disease.
Tang, Sharon S; Mohad, Vidisha; Gowda, Madhu; Thibeault, Susan L
2017-09-01
As one of the key fibrous proteins in the extracellular matrix, collagen plays a significant role in the structural and biomechanical characteristics of the vocal fold. Anchored fibrils of collagen create secure structural regions within the vocal folds and are strong enough to sustain vibratory impact and stretch during phonation. This contributes tensile strength, density, and organization to the vocal folds and influences health and pathogenesis. This review offers a comprehensive summary for a current understanding of collagen within normal vocal fold tissues throughout the life span as well as vocal pathology and wound repair. Further, collagen's molecular structure and biosynthesis are discussed. Finally, collagen alterations in tissue injury and repair and the incorporation of collagen-based biomaterials as a method of treating voice disorders are reviewed. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Balakrishnan, Swati; Sarma, Siddhartha P
2017-08-22
Aromatic interactions are an important force in protein folding as they combine the stability of a hydrophobic interaction with the selectivity of a hydrogen bond. Much of our understanding of aromatic interactions comes from "bioinformatics" based analyses of protein structures and from the contribution of these interactions to stabilizing secondary structure motifs in model peptides. In this study, the structural consequences of aromatic interactions on protein folding have been explored in engineered mutants of the molten globule protein apo-cytochrome b 5 . Structural changes from disorder to order due to aromatic interactions in two variants of the protein, viz., WF-cytb5 and FF-cytb5, result in significant long-range secondary and tertiary structure. The results show that 54 and 52% of the residues in WF-cytb5 and FF-cytb5, respectively, occupy ordered regions versus 26% in apo-cytochrome b 5 . The interactions between the aromatic groups are offset-stacked and edge-to-face for the Trp-Phe and Phe-Phe mutants, respectively. Urea denaturation studies indicate that both mutants have a C m higher than that of apo-cytochrome b 5 and are more stable to chaotropic agents than apo-cytochrome b 5 . The introduction of these aromatic residues also results in "trimer" interactions with existing aromatic groups, reaffirming the selectivity of the aromatic interactions. These studies provide insights into the aromatic interactions that drive disorder-to-order transitions in intrinsically disordered regions of proteins and will aid in de novo protein design beyond small peptide scaffolds.
Goyal, Megha; Chaudhuri, Tapan K
2015-07-01
Folding of aggregation prone recombinant proteins through co-expression of chaperonin GroEL and GroES has been a popular practice in the effort to optimize preparation of functional protein in Escherichia coli. Considering the demand for functional recombinant protein products, it is desirable to apply the chaperone assisted protein folding strategy for enhancing the yield of properly folded protein. Toward the same direction, it is also worth attempting folding of multiple recombinant proteins simultaneously over-expressed in E. coli through the assistance of co-expressed GroEL-ES. The genesis of this thinking was originated from the fact that cellular GroEL and GroES assist in the folding of several endogenous proteins expressed in the bacterial cell. Here we present the experimental findings from our study on co-expressed GroEL-GroES assisted folding of simultaneously over-expressed proteins maltodextrin glucosidase (MalZ) and yeast mitochondrial aconitase (mAco). Both proteins mentioned here are relatively larger and aggregation prone, mostly form inclusion bodies, and undergo GroEL-ES assisted folding in E. coli cells during over-expression. It has been reported that the relative yield of properly folded functional forms of MalZ and mAco with the exogenous GroEL-ES assistance were comparable with the results when these proteins were overexpressed alone. This observation is quite promising and highlights the fact that GroEL and GroES can assist in the folding of multiple substrate proteins simultaneously when over-expressed in E. coli. This method might be a potential tool for enhanced production of multiple functional recombinant proteins simultaneously in E. coli. Copyright © 2015 Elsevier Ltd. All rights reserved.
Method of generating ploynucleotides encoding enhanced folding variants
Bradbury, Andrew M.; Kiss, Csaba; Waldo, Geoffrey S.
2017-05-02
The invention provides directed evolution methods for improving the folding, solubility and stability (including thermostability) characteristics of polypeptides. In one aspect, the invention provides a method for generating folding and stability-enhanced variants of proteins, including but not limited to fluorescent proteins, chromophoric proteins and enzymes. In another aspect, the invention provides methods for generating thermostable variants of a target protein or polypeptide via an internal destabilization baiting strategy. Internally destabilization a protein of interest is achieved by inserting a heterologous, folding-destabilizing sequence (folding interference domain) within DNA encoding the protein of interest, evolving the protein sequences adjacent to the heterologous insertion to overcome the destabilization (using any number of mutagenesis methods), thereby creating a library of variants. The variants in the library are expressed, and those with enhanced folding characteristics selected.
Sekhar, Ashok; Vallurupalli, Pramodh; Kay, Lewis E
2012-11-20
Friction plays a critical role in protein folding. Frictional forces originating from random solvent and protein fluctuations both retard motion along the folding pathway and activate protein molecules to cross free energy barriers. Studies of friction thus may provide insights into the driving forces underlying protein conformational dynamics. However, the molecular origin of friction in protein folding remains poorly understood because, with the exception of the native conformer, there generally is little detailed structural information on the other states participating in the folding process. Here, we study the folding of the four-helix bundle FF domain that proceeds via a transiently formed, sparsely populated compact on-pathway folding intermediate whose structure was elucidated previously. Because the intermediate is stabilized by both native and nonnative interactions, friction in the folding transition between intermediate and folded states is expected to arise from intrachain reorganization in the protein. However, the viscosity dependencies of rates of folding from or unfolding to the intermediate, as established by relaxation dispersion NMR spectroscopy, clearly indicate that contributions from internal friction are small relative to those from solvent, so solvent frictional forces drive the folding process. Our results emphasize the importance of solvent dynamics in mediating the interconversion between protein configurations, even those that are highly compact, and in equilibrium folding/unfolding fluctuations in general.
Coexistence of Native and Denatured Phases in a Single Proteinlike Molecule
NASA Astrophysics Data System (ADS)
Du, Rose; Grosberg, Alexander Yu.; Tanaka, Toyoichi
1999-11-01
In order to understand the nuclei which develop during the course of protein folding and unfolding, we examine equilibrium coexistence of phases within a single heteropolymer chain. We computationally generate the phase segregation by applying a ``folding pressure,'' or adding an energetic bonus for native monomer-monomer contacts. The computer models reveal that in a polymer system some nuclei hinder folding via topological constraints. Using this insight, we show that the critical nucleus size is of the order of the entire chain and that unfolding time scales as exp\\(cN2/3\\), in the large N limit, N and c being the chain length and a constant, respectively.
van der Vaart, Arjan
2015-05-01
Protein-DNA binding often involves dramatic conformational changes such as protein folding and DNA bending. While thermodynamic aspects of this behavior are understood, and its biological function is often known, the mechanism by which the conformational changes occur is generally unclear. By providing detailed structural and energetic data, molecular dynamics simulations have been helpful in elucidating and rationalizing protein-DNA binding. This review will summarize recent atomistic molecular dynamics simulations of the conformational dynamics of DNA and protein-DNA binding. A brief overview of recent developments in DNA force fields is given as well. Simulations have been crucial in rationalizing the intrinsic flexibility of DNA, and have been instrumental in identifying the sequence of binding events, the triggers for the conformational motion, and the mechanism of binding for a number of important DNA-binding proteins. Molecular dynamics simulations are an important tool for understanding the complex binding behavior of DNA-binding proteins. With recent advances in force fields and rapid increases in simulation time scales, simulations will become even more important for future studies. This article is part of a Special Issue entitled Recent developments of molecular dynamics. Copyright © 2014. Published by Elsevier B.V.
De Novo Proteins with Life-Sustaining Functions Are Structurally Dynamic.
Murphy, Grant S; Greisman, Jack B; Hecht, Michael H
2016-01-29
Designing and producing novel proteins that fold into stable structures and provide essential biological functions are key goals in synthetic biology. In initial steps toward achieving these goals, we constructed a combinatorial library of de novo proteins designed to fold into 4-helix bundles. As described previously, screening this library for sequences that function in vivo to rescue conditionally lethal mutants of Escherichia coli (auxotrophs) yielded several de novo sequences, termed SynRescue proteins, which rescued four different E. coli auxotrophs. In an effort to understand the structural requirements necessary for auxotroph rescue, we investigated the biophysical properties of the SynRescue proteins, using both computational and experimental approaches. Results from circular dichroism, size-exclusion chromatography, and NMR demonstrate that the SynRescue proteins are α-helical and relatively stable. Surprisingly, however, they do not form well-ordered structures. Instead, they form dynamic structures that fluctuate between monomeric and dimeric states. These findings show that a well-ordered structure is not a prerequisite for life-sustaining functions, and suggests that dynamic structures may have been important in the early evolution of protein function. Copyright © 2015 Elsevier Ltd. All rights reserved.
Concerted dihedral rotations give rise to internal friction in unfolded proteins.
Echeverria, Ignacia; Makarov, Dmitrii E; Papoian, Garegin A
2014-06-18
Protein chains undergo conformational diffusion during folding and dynamics, experiencing both thermal kicks and viscous drag. Recent experiments have shown that the corresponding friction can be separated into wet friction, which is determined by the solvent viscosity, and dry friction, where frictional effects arise due to the interactions within the protein chain. Despite important advances, the molecular origins underlying dry friction in proteins have remained unclear. To address this problem, we studied the dynamics of the unfolded cold-shock protein at different solvent viscosities and denaturant concentrations. Using extensive all-atom molecular dynamics simulations we estimated the internal friction time scales and found them to agree well with the corresponding experimental measurements (Soranno et al. Proc. Natl. Acad. Sci. U.S.A. 2012, 109, 17800-17806). Analysis of the reconfiguration dynamics of the unfolded chain further revealed that hops in the dihedral space provide the dominant mechanism of internal friction. Furthermore, the increased number of concerted dihedral moves at physiological conditions suggest that, in such conditions, the concerted motions result in higher frictional forces. These findings have important implications for understanding the folding kinetics of proteins as well as the dynamics of intrinsically disordered proteins.
Characterization of the Protein Unfolding Processes Induced by Urea and Temperature
Rocco, Alessandro Guerini; Mollica, Luca; Ricchiuto, Piero; Baptista, António M.; Gianazza, Elisabetta; Eberini, Ivano
2008-01-01
Correct folding is critical for the biological activities of proteins. As a contribution to a better understanding of the protein (un)folding problem, we studied the effect of temperature and of urea on peptostreptococcal Protein L destructuration. We performed standard molecular dynamics simulations at 300 K, 350 K, 400 K, and 480 K, both in 10 M urea and in water. Protein L followed at least two alternative unfolding pathways. Urea caused the loss of secondary structure acting preferentially on the β-sheets, while leaving the α-helices almost intact; on the contrary, high temperature preserved the β-sheets and led to a complete loss of the α-helices. These data suggest that urea and high temperature act through different unfolding mechanisms, and protein secondary motives reveal a differential sensitivity to various denaturant treatments. As further validation of our results, replica-exchange molecular dynamics simulations of the temperature-induced unfolding process in the presence of urea were performed. This set of simulations allowed us to compute the thermodynamical parameters of the process and confirmed that, in the configurational space of Protein L unfolding, both of the above pathways are accessible, although to a different relative extent. PMID:18065481
A consensus view of fold space: Combining SCOP, CATH, and the Dali Domain Dictionary
Day, Ryan; Beck, David A.C.; Armen, Roger S.; Daggett, Valerie
2003-01-01
We have determined consensus protein-fold classifications on the basis of three classification methods, SCOP, CATH, and Dali. These classifications make use of different methods of defining and categorizing protein folds that lead to different views of protein-fold space. Pairwise comparisons of domains on the basis of their fold classifications show that much of the disagreement between the classification systems is due to differing domain definitions rather than assigning the same domain to different folds. However, there are significant differences in the fold assignments between the three systems. These remaining differences can be explained primarily in terms of the breadth of the fold classifications. Many structures may be defined as having one fold in one system, whereas far fewer are defined as having the analogous fold in another system. By comparing these folds for a nonredundant set of proteins, the consensus method breaks up broad fold classifications and combines restrictive fold classifications into metafolds, creating, in effect, an averaged view of fold space. This averaged view requires that the structural similarities between proteins having the same metafold be recognized by multiple classification systems. Thus, the consensus map is useful for researchers looking for fold similarities that are relatively independent of the method used to compare proteins. The 30 most populated metafolds, representing the folds of about half of a nonredundant subset of the PDB, are presented here. The full list of metafolds is presented on the Web. PMID:14500873
A consensus view of fold space: combining SCOP, CATH, and the Dali Domain Dictionary.
Day, Ryan; Beck, David A C; Armen, Roger S; Daggett, Valerie
2003-10-01
We have determined consensus protein-fold classifications on the basis of three classification methods, SCOP, CATH, and Dali. These classifications make use of different methods of defining and categorizing protein folds that lead to different views of protein-fold space. Pairwise comparisons of domains on the basis of their fold classifications show that much of the disagreement between the classification systems is due to differing domain definitions rather than assigning the same domain to different folds. However, there are significant differences in the fold assignments between the three systems. These remaining differences can be explained primarily in terms of the breadth of the fold classifications. Many structures may be defined as having one fold in one system, whereas far fewer are defined as having the analogous fold in another system. By comparing these folds for a nonredundant set of proteins, the consensus method breaks up broad fold classifications and combines restrictive fold classifications into metafolds, creating, in effect, an averaged view of fold space. This averaged view requires that the structural similarities between proteins having the same metafold be recognized by multiple classification systems. Thus, the consensus map is useful for researchers looking for fold similarities that are relatively independent of the method used to compare proteins. The 30 most populated metafolds, representing the folds of about half of a nonredundant subset of the PDB, are presented here. The full list of metafolds is presented on the Web.
Conformational Transitions in Molecular Systems
NASA Astrophysics Data System (ADS)
Bachmann, M.; Janke, W.
2008-11-01
Proteins are the "work horses" in biological systems. In almost all functions specific proteins are involved. They control molecular transport processes, stabilize the cell structure, enzymatically catalyze chemical reactions; others act as molecular motors in the complex machinery of molecular synthetization processes. Due to their significance, misfolds and malfunctions of proteins typically entail disastrous diseases, such as Alzheimer's disease and bovine spongiform encephalopathy (BSE). Therefore, the understanding of the trinity of amino acid composition, geometric structure, and biological function is one of the most essential challenges for the natural sciences. Here, we glance at conformational transitions accompanying the structure formation in protein folding processes.
Question 3: The Worlds of the Prebiotic and Never Born Proteins
NASA Astrophysics Data System (ADS)
Chiarabelli, Cristiano; de Lucrezia, Davide
2007-10-01
Starting from the statement that no reliable methods are known to produce high molecular weight polypeptides under prebiotic conditions, a possible approach, at least to understand the differences between extant proteins and the possible large number of never born proteins, could be biological. Using the phage display method a large library of totally random amino acidic sequences was obtained. Consequently, different experiments to directly consider the frequency of stable folds were performed, and the interesting results obtained from such new approach are discussed in terms of contingency, contributing to the discussion on the selection mechanism of extant proteins.
Shin, Kwang-Soo; Park, Hee-Soo; Kim, Young; Heo, In-Beom; Kim, Young Hwan; Yu, Jae-Hyuk
2016-10-04
Aspergillus fumigatus reproduces and infects host by forming a high number of small asexual spores (conidia). The velvet proteins are global transcriptional regulators governing the complex process of conidiogenesis in this fungus. Here, to further understand the velvet-mediated regulation, we carried out comparative proteomic analyses of conidia of wild type (WT) and three velvet mutants (ΔveA, ΔvelB and ΔvosA). Cluster analysis of 184 protein spots showing at least 1.5-fold differential accumulation between WT and mutants reveal the clustering of WT- ΔveA and ΔvelB-ΔvosA. Among 43 proteins identified by Nano-LC-ESI-MS/MS, 23 including several heat shock proteins showed more than two-fold reduction in both the ∆velB and ∆vosA conidia. On the contrary, three proteins exhibited more than five-fold increase in ∆veA only, including the putative RNA polymerase II degradation factor DefA. The deletion of defA resulted in a reduced number of conidia and restricted colony growth. In addition, the defA deletion mutant conidia showed hypersensitivity against the DNA damaging agents NQO and MMS, while the ΔveA mutant conidia were more resistant against to NQO. Taken together, we propose that VeA controls protein level of DefA in conidia, which are dormant and equipped with multiple layers of protection against environmental cues. Copyright © 2016 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Krokhotin, Andrey; Dokholyan, Nikolay V.
2017-07-01
Most proteins fold into unique three-dimensional (3D) structures that determine their biological functions, such as catalytic activity or macromolecular binding. Misfolded proteins can pose a threat through aberrant interactions with other proteins leading to a number of diseases including Alzheimer's disease, Parkinson's disease, and amyotrophic lateral sclerosis [1,2]. What does determine 3D structure of proteins? The first clue to this question came more than fifty years ago when Anfinsen demonstrated that unfolded proteins can spontaneously fold to their native 3D structures [3,4]. Anfinsen's experiments lead to the conclusion that proteins fold to unique native structure corresponding to the stable and kinetically accessible free energy minimum, and protein native structure is solely determined by its amino acid sequence. The question of how exactly proteins find their free energy minimum proved to be a difficult problem. One of the puzzles, initially pointed out by Levinthal, was an inconsistency between observed protein folding times and theoretical estimates. A self-avoiding polymer model of a globular protein of 100-residues length on a cubic lattice can sample at least 1047 states. Based on the assumption that conformational sampling occurs at the highest vibrational mode of proteins (∼picoseconds), predicted folding time by searching among all the possible conformations leads to ∼1027 years (much larger than the age of the universe) [5]. In contrast, observed protein folding time range from microseconds to minutes. Due to tremendous theoretical progress in protein folding field that has been achieved in past decades, the source of this inconsistency is currently understood that is thoroughly described in the review by Finkelstein et al. [6].
Effects of lengthscales and attractions on the collapse of hydrophobic polymers in water
Athawale, Manoj V.; Goel, Gaurav; Ghosh, Tuhin; Truskett, Thomas M.; Garde, Shekhar
2007-01-01
We present results from extensive molecular dynamics simulations of collapse transitions of hydrophobic polymers in explicit water focused on understanding effects of lengthscale of the hydrophobic surface and of attractive interactions on folding. Hydrophobic polymers display parabolic, protein-like, temperature-dependent free energy of unfolding. Folded states of small attractive polymers are marginally stable at 300 K and can be unfolded by heating or cooling. Increasing the lengthscale or decreasing the polymer–water attractions stabilizes folded states significantly, the former dominated by the hydration contribution. That hydration contribution can be described by the surface tension model, ΔG = γ(T)ΔA, where the surface tension, γ, is lengthscale-dependent and decreases monotonically with temperature. The resulting variation of the hydration entropy with polymer lengthscale is consistent with theoretical predictions of Huang and Chandler [Huang DM, Chandler D (2000) Proc Natl Acad Sci USA 97:8324–8327] that explain the blurring of entropy convergence observed in protein folding thermodynamics. Analysis of water structure shows that the polymer–water hydrophobic interface is soft and weakly dewetted, and is characterized by enhanced interfacial density fluctuations. Formation of this interface, which induces polymer folding, is strongly opposed by enthalpy and favored by entropy, similar to the vapor–liquid interface. PMID:17215352
Backbone hydration determines the folding signature of amino acid residues.
Bignucolo, Olivier; Leung, Hoi Tik Alvin; Grzesiek, Stephan; Bernèche, Simon
2015-04-08
The relation between the sequence of a protein and its three-dimensional structure remains largely unknown. A lasting dream is to elucidate the side-chain-dependent driving forces that govern the folding process. Different structural data suggest that aromatic amino acids play a particular role in the stabilization of protein structures. To better understand the underlying mechanism, we studied peptides of the sequence EGAAXAASS (X = Gly, Ile, Tyr, Trp) through comparison of molecular dynamics (MD) trajectories and NMR residual dipolar coupling (RDC) measurements. The RDC data for aromatic substitutions provide evidence for a kink in the peptide backbone. Analysis of the MD simulations shows that the formation of internal hydrogen bonds underlying a helical turn is key to reproduce the experimental RDC values. The simulations further reveal that the driving force leading to such helical-turn conformations arises from the lack of hydration of the peptide chain on either side of the bulky aromatic side chain, which can potentially act as a nucleation point initiating the folding process.
Developing a molecular dynamics force field for both folded and disordered protein states.
Robustelli, Paul; Piana, Stefano; Shaw, David E
2018-05-07
Molecular dynamics (MD) simulation is a valuable tool for characterizing the structural dynamics of folded proteins and should be similarly applicable to disordered proteins and proteins with both folded and disordered regions. It has been unclear, however, whether any physical model (force field) used in MD simulations accurately describes both folded and disordered proteins. Here, we select a benchmark set of 21 systems, including folded and disordered proteins, simulate these systems with six state-of-the-art force fields, and compare the results to over 9,000 available experimental data points. We find that none of the tested force fields simultaneously provided accurate descriptions of folded proteins, of the dimensions of disordered proteins, and of the secondary structure propensities of disordered proteins. Guided by simulation results on a subset of our benchmark, however, we modified parameters of one force field, achieving excellent agreement with experiment for disordered proteins, while maintaining state-of-the-art accuracy for folded proteins. The resulting force field, a99SB- disp , should thus greatly expand the range of biological systems amenable to MD simulation. A similar approach could be taken to improve other force fields. Copyright © 2018 the Author(s). Published by PNAS.
Qin, Zhao; Fabre, Andrea; Buehler, Markus J
2013-05-01
The stability of alpha helices is important in protein folding, bioinspired materials design, and controls many biological properties under physiological and disease conditions. Here we show that a naturally favored alpha helix length of 9 to 17 amino acids exists at which the propensity towards the formation of this secondary structure is maximized. We use a combination of thermodynamical analysis, well-tempered metadynamics molecular simulation and statistical analyses of experimental alpha helix length distributions and find that the favored alpha helix length is caused by a competition between alpha helix folding, unfolding into a random coil and formation of higher-order tertiary structures. The theoretical result is suggested to be used to explain the statistical distribution of the length of alpha helices observed in natural protein structures. Our study provides mechanistic insight into fundamental controlling parameters in alpha helix structure formation and potentially other biopolymers or synthetic materials. The result advances our fundamental understanding of size effects in the stability of protein structures and may enable the design of de novo alpha-helical protein materials.
A Simple and Effective Protein Folding Activity Suitable for Large Lectures
ERIC Educational Resources Information Center
White, Brian
2006-01-01
This article describes a simple and inexpensive hands-on simulation of protein folding suitable for use in large lecture classes. This activity uses a minimum of parts, tools, and skill to simulate some of the fundamental principles of protein folding. The major concepts targeted are that proteins begin as linear polypeptides and fold to…
General Mechanism of Two-State Protein Folding Kinetics
Rollins, Geoffrey C.; Dill, Ken A.
2016-01-01
We describe here a general model of the kinetic mechanism of protein folding. In the Foldon Funnel Model, proteins fold in units of secondary structures, which form sequentially along the folding pathway, stabilized by tertiary interactions. The model predicts that the free energy landscape has a volcano shape, rather than a simple funnel, that folding is two-state (single-exponential) when secondary structures are intrinsically unstable, and that each structure along the folding path is a transition state for the previous structure. It shows how sequential pathways are consistent with multiple stochastic routes on funnel landscapes, and it gives good agreement with the 9 order of magnitude dependence of folding rates on protein size for a set of 93 proteins, at the same time it is consistent with the near independence of folding equilibrium constant on size. This model gives estimates of folding rates of proteomes, leading to a median folding time in Escherichia coli of about 5 s. PMID:25056406
Investigation of Natural Bombyx mori Silk Fibroin Proteins Using INS
NASA Astrophysics Data System (ADS)
Crain, Christopher; Strange, Nicholas; Larese, J. Z.
The mechanical properties of many protein comprised biomaterials are a direct reflection of non-covalent (i.e. weak) interacting ions such as F-actin in muscles, tubulin in the cytoskeleton of cells, viral capsids, and silk. Porter and Vollrath underscored the two main factors that are critical for understanding the high mechanical strength of silks: the nanoscale semi-crystalline folding structure, which gives it exceptional toughness and strength, and the degree of hydration of the disordered fraction, which acts to modify these properties. Understanding and controlling these two principal factors are the key to the functionality of protein elastomers, and render silk an ideal model protein for (bio)material design. We will describe our investigation of electrospun silk of the Bombyx mori (silk worm), using Inelastic Neutron Scattering (INS). These techniques were used to investigate the microscopic dynamics of the dry and hydrated protein.
There and back again: Two views on the protein folding puzzle.
Finkelstein, Alexei V; Badretdin, Azat J; Galzitskaya, Oxana V; Ivankov, Dmitry N; Bogatyreva, Natalya S; Garbuzynskiy, Sergiy O
2017-07-01
The ability of protein chains to spontaneously form their spatial structures is a long-standing puzzle in molecular biology. Experimentally measured folding times of single-domain globular proteins range from microseconds to hours: the difference (10-11 orders of magnitude) is the same as that between the life span of a mosquito and the age of the universe. This review describes physical theories of rates of overcoming the free-energy barrier separating the natively folded (N) and unfolded (U) states of protein chains in both directions: "U-to-N" and "N-to-U". In the theory of protein folding rates a special role is played by the point of thermodynamic (and kinetic) equilibrium between the native and unfolded state of the chain; here, the theory obtains the simplest form. Paradoxically, a theoretical estimate of the folding time is easier to get from consideration of protein unfolding (the "N-to-U" transition) rather than folding, because it is easier to outline a good unfolding pathway of any structure than a good folding pathway that leads to the stable fold, which is yet unknown to the folding protein chain. And since the rates of direct and reverse reactions are equal at the equilibrium point (as follows from the physical "detailed balance" principle), the estimated folding time can be derived from the estimated unfolding time. Theoretical analysis of the "N-to-U" transition outlines the range of protein folding rates in a good agreement with experiment. Theoretical analysis of folding (the "U-to-N" transition), performed at the level of formation and assembly of protein secondary structures, outlines the upper limit of protein folding times (i.e., of the time of search for the most stable fold). Both theories come to essentially the same results; this is not a surprise, because they describe overcoming one and the same free-energy barrier, although the way to the top of this barrier from the side of the unfolded state is very different from the way from the side of the native state; and both theories agree with experiment. In addition, they predict the maximal size of protein domains that fold under solely thermodynamic (rather than kinetic) control and explain the observed maximal size of the "foldable" protein domains. Copyright © 2017 Elsevier B.V. All rights reserved.
There and back again: Two views on the protein folding puzzle
NASA Astrophysics Data System (ADS)
Finkelstein, Alexei V.; Badretdin, Azat J.; Galzitskaya, Oxana V.; Ivankov, Dmitry N.; Bogatyreva, Natalya S.; Garbuzynskiy, Sergiy O.
2017-07-01
The ability of protein chains to spontaneously form their spatial structures is a long-standing puzzle in molecular biology. Experimentally measured folding times of single-domain globular proteins range from microseconds to hours: the difference (10-11 orders of magnitude) is the same as that between the life span of a mosquito and the age of the universe. This review describes physical theories of rates of overcoming the free-energy barrier separating the natively folded (N) and unfolded (U) states of protein chains in both directions: ;U-to-N; and ;N-to-U;. In the theory of protein folding rates a special role is played by the point of thermodynamic (and kinetic) equilibrium between the native and unfolded state of the chain; here, the theory obtains the simplest form. Paradoxically, a theoretical estimate of the folding time is easier to get from consideration of protein unfolding (the ;N-to-U; transition) rather than folding, because it is easier to outline a good unfolding pathway of any structure than a good folding pathway that leads to the stable fold, which is yet unknown to the folding protein chain. And since the rates of direct and reverse reactions are equal at the equilibrium point (as follows from the physical ;detailed balance; principle), the estimated folding time can be derived from the estimated unfolding time. Theoretical analysis of the ;N-to-U; transition outlines the range of protein folding rates in a good agreement with experiment. Theoretical analysis of folding (the ;U-to-N; transition), performed at the level of formation and assembly of protein secondary structures, outlines the upper limit of protein folding times (i.e., of the time of search for the most stable fold). Both theories come to essentially the same results; this is not a surprise, because they describe overcoming one and the same free-energy barrier, although the way to the top of this barrier from the side of the unfolded state is very different from the way from the side of the native state; and both theories agree with experiment. In addition, they predict the maximal size of protein domains that fold under solely thermodynamic (rather than kinetic) control and explain the observed maximal size of the ;foldable; protein domains.
Protein folding simulations: from coarse-grained model to all-atom model.
Zhang, Jian; Li, Wenfei; Wang, Jun; Qin, Meng; Wu, Lei; Yan, Zhiqiang; Xu, Weixin; Zuo, Guanghong; Wang, Wei
2009-06-01
Protein folding is an important and challenging problem in molecular biology. During the last two decades, molecular dynamics (MD) simulation has proved to be a paramount tool and was widely used to study protein structures, folding kinetics and thermodynamics, and structure-stability-function relationship. It was also used to help engineering and designing new proteins, and to answer even more general questions such as the minimal number of amino acid or the evolution principle of protein families. Nowadays, the MD simulation is still undergoing rapid developments. The first trend is to toward developing new coarse-grained models and studying larger and more complex molecular systems such as protein-protein complex and their assembling process, amyloid related aggregations, and structure and motion of chaperons, motors, channels and virus capsides; the second trend is toward building high resolution models and explore more detailed and accurate pictures of protein folding and the associated processes, such as the coordination bond or disulfide bond involved folding, the polarization, charge transfer and protonate/deprotonate process involved in metal coupled folding, and the ion permeation and its coupling with the kinetics of channels. On these new territories, MD simulations have given many promising results and will continue to offer exciting views. Here, we review several new subjects investigated by using MD simulations as well as the corresponding developments of appropriate protein models. These include but are not limited to the attempt to go beyond the topology based Gō-like model and characterize the energetic factors in protein structures and dynamics, the study of the thermodynamics and kinetics of disulfide bond involved protein folding, the modeling of the interactions between chaperonin and the encapsulated protein and the protein folding under this circumstance, the effort to clarify the important yet still elusive folding mechanism of protein BBL, the development of discrete MD and its application in studying the alpha-beta conformational conversion and oligomer assembling process, and the modeling of metal ion involved protein folding. (c) 2009 IUBMB.
Complete fold annotation of the human proteome using a novel structural feature space.
Middleton, Sarah A; Illuminati, Joseph; Kim, Junhyong
2017-04-13
Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this method by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families.
Complete fold annotation of the human proteome using a novel structural feature space
Middleton, Sarah A.; Illuminati, Joseph; Kim, Junhyong
2017-01-01
Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this method by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families. PMID:28406174
Identification of the protein folding transition state from molecular dynamics trajectories
NASA Astrophysics Data System (ADS)
Muff, S.; Caflisch, A.
2009-03-01
The rate of protein folding is governed by the transition state so that a detailed characterization of its structure is essential for understanding the folding process. In vitro experiments have provided a coarse-grained description of the folding transition state ensemble (TSE) of small proteins. Atomistic details could be obtained by molecular dynamics (MD) simulations but it is not straightforward to extract the TSE directly from the MD trajectories, even for small peptides. Here, the structures in the TSE are isolated by the cut-based free-energy profile (cFEP) using the network whose nodes and links are configurations sampled by MD and direct transitions among them, respectively. The cFEP is a barrier-preserving projection that does not require arbitrarily chosen progress variables. First, a simple two-dimensional free-energy surface is used to illustrate the successful determination of the TSE by the cFEP approach and to explain the difficulty in defining boundary conditions of the Markov state model for an entropically stabilized free-energy minimum. The cFEP is then used to extract the TSE of a β-sheet peptide with a complex free-energy surface containing multiple basins and an entropic region. In contrast, Markov state models with boundary conditions defined by projected variables and conventional histogram-based free-energy profiles are not able to identify the TSE of the β-sheet peptide.
Direct Observation of Parallel Folding Pathways Revealed Using a Symmetric Repeat Protein System
Aksel, Tural; Barrick, Doug
2014-01-01
Although progress has been made to determine the native fold of a polypeptide from its primary structure, the diversity of pathways that connect the unfolded and folded states has not been adequately explored. Theoretical and computational studies predict that proteins fold through parallel pathways on funneled energy landscapes, although experimental detection of pathway diversity has been challenging. Here, we exploit the high translational symmetry and the direct length variation afforded by linear repeat proteins to directly detect folding through parallel pathways. By comparing folding rates of consensus ankyrin repeat proteins (CARPs), we find a clear increase in folding rates with increasing size and repeat number, although the size of the transition states (estimated from denaturant sensitivity) remains unchanged. The increase in folding rate with chain length, as opposed to a decrease expected from typical models for globular proteins, is a clear demonstration of parallel pathways. This conclusion is not dependent on extensive curve-fitting or structural perturbation of protein structure. By globally fitting a simple parallel-Ising pathway model, we have directly measured nucleation and propagation rates in protein folding, and have quantified the fluxes along each path, providing a detailed energy landscape for folding. This finding of parallel pathways differs from results from kinetic studies of repeat-proteins composed of sequence-variable repeats, where modest repeat-to-repeat energy variation coalesces folding into a single, dominant channel. Thus, for globular proteins, which have much higher variation in local structure and topology, parallel pathways are expected to be the exception rather than the rule. PMID:24988356
Pastor, Ashutosh; Singh, Amit K.; Fisher, Mark T.; Chaudhuri, Tapan K.
2016-01-01
Protein folding has been extensively studied for past four decades by employing solution based experiments such as solubility, enzymatic activity, secondary structure analysis, and analytical methods like FRET, NMR and HD exchange. However, for rapid analysis of the folding process, solution based approaches are often plagued with aggregation side reactions resulting in poor yields. In this work we demonstrate that a Bio-Layer Interferometry (BLI) chaperonin detection system can be potentially applied to identify superior refolding conditions for denatured proteins. The degree of immobilized protein folding as a function of time can be detected by monitoring the binding of the high-affinity nucleotide-free form of the chaperonin GroEL. GroEL preferentially interacts with proteins that have hydrophobic surfaces exposed in their unfolded or partially folded form so a decrease in GroEL binding can be correlated with burial of hydrophobic surfaces as folding progresses. The magnitude of GroEL binding to the protein immobilized on Bio-layer interferometry biosensor inversely reflects the extent of protein folding and hydrophobic residue burial. We demonstrate conditions where accelerated folding can be observed for the aggregation prone protein Maltodextrin glucosidase (MalZ). Superior immobilized folding conditions identified on the Bio-layer interferometry biosensor surface were reproduced on Ni-NTA sepharose bead surfaces and resulted in significant improvement in folding yields of released MalZ (measured by enzymatic activity) compared to bulk refolding conditions in solution. PMID:27367928
Bueno, Marta; Camacho, Carlos J; Sancho, Javier
2007-09-01
The bioinformatics revolution of the last decade has been instrumental in the development of empirical potentials to quantitatively estimate protein interactions for modeling and design. Although computationally efficient, these potentials hide most of the relevant thermodynamics in 5-to-40 parameters that are fitted against a large experimental database. Here, we revisit this longstanding problem and show that a careful consideration of the change in hydrophobicity, electrostatics, and configurational entropy between the folded and unfolded state of aliphatic point mutations predicts 20-30% less false positives and yields more accurate predictions than any published empirical energy function. This significant improvement is achieved with essentially no free parameters, validating past theoretical and experimental efforts to understand the thermodynamics of protein folding. Our first principle analysis strongly suggests that both the solute-solute van der Waals interactions in the folded state and the electrostatics free energy change of exposed aliphatic mutations are almost completely compensated by similar interactions operating in the unfolded ensemble. Not surprisingly, the problem of properly accounting for the solvent contribution to the free energy of polar and charged group mutations, as well as of mutations that disrupt the protein backbone remains open. 2007 Wiley-Liss, Inc.
The affinity of a major Ca2+ binding site on GRP78 is differentially enhanced by ADP and ATP.
Lamb, Heather K; Mee, Christopher; Xu, Weiming; Liu, Lizhi; Blond, Sylvie; Cooper, Alan; Charles, Ian G; Hawkins, Alastair R
2006-03-31
GRP78 is a major protein regulated by the mammalian endoplasmic reticulum stress response, and up-regulation has been shown to be important in protecting cells from challenge with cytotoxic agents. GRP78 has ATPase activity, acts as a chaperone, and interacts specifically with other proteins, such as caspases, as part of a mechanism regulating apoptosis. GRP78 is also reported to have a possible role as a Ca2+ storage protein. In order to understand the potential biological effects of Ca2+ and ATP/ADP binding on the biology of GRP78, we have determined its ligand binding properties. We show here for the first time that GRP78 can bind Ca2+, ATP, and ADP, each with a 1:1 stoichiometry, and that the binding of cation and nucleotide is cooperative. These observations do not support the hypothesis that GRP78 is a dynamic Ca2+ storage protein. Furthermore, we demonstrate that whereas Mg2+ enhances GRP78 binding to ADP and ATP to the same extent, Ca2+ shows a differential enhancement. In the presence of Ca2+, the KD for ATP is lowered approximately 11-fold, and the KD for ADP is lowered around 930-fold. The KD for Ca2+ is lowered approximately 40-fold in the presence of ATP and around 880-fold with ADP. These findings may explain the biological requirement for a nucleotide exchange factor to remove ADP from GRP78. Taken together, our data suggest that the Ca2+-binding property of GRP78 may be part of a signal transduction pathway that modulates complex interactions between GRP78, ATP/ADP, secretory proteins, and caspases, and this ultimately has important consequences for cell viability.
Mondal, Abhisek; Datta, Saumen
2017-06-01
Hydrogen bond plays a unique role in governing macromolecular interactions with exquisite specificity. These interactions govern the fundamental biological processes like protein folding, enzymatic catalysis, molecular recognition. Despite extensive research work, till date there is no proper report available about the hydrogen bond's energy surface with respect to its geometric parameters, directly derived from proteins. Herein, we have deciphered the potential energy landscape of hydrogen bond directly from the macromolecular coordinates obtained from Protein Data Bank using quantum mechanical electronic structure calculations. The findings unravel the hydrogen bonding energies of proteins in parametric space. These data can be used to understand the energies of such directional interactions involved in biological molecules. Quantitative characterization has also been performed using Shannon entropic calculations for atoms participating in hydrogen bond. Collectively, our results constitute an improved way of understanding hydrogen bond energies in case of proteins and complement the knowledge-based potential. Proteins 2017; 85:1046-1055. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Solvent viscosity and friction in protein folding dynamics.
Hagen, Stephen J
2010-08-01
The famous Kramers rate theory for diffusion-controlled reactions has been extended in numerous ways and successfully applied to many types of reactions. Its application to protein folding reactions has been of particular interest in recent years, as many researchers have performed experiments and simulations to test whether folding reactions are diffusion-controlled, whether the solvent is the source of the reaction friction, and whether the friction-dependence of folding rates generally can provide insight into folding dynamics. These experiments involve many practical difficulties, however. They have also produced some unexpected results. Here we briefly review the Kramers theory for reactions in the presence of strong friction and summarize some of the subtle problems that arise in the application of the theory to protein folding. We discuss how the results of these experiments ultimately point to a significant role for internal friction in protein folding dynamics. Studies of friction in protein folding, far from revealing any weakness in Kramers theory, may actually lead to new approaches for probing diffusional dynamics and energy landscapes in protein folding.
Predicting the Dynamics of Protein Abundance
Mehdi, Ahmed M.; Patrick, Ralph; Bailey, Timothy L.; Bodén, Mikael
2014-01-01
Protein synthesis is finely regulated across all organisms, from bacteria to humans, and its integrity underpins many important processes. Emerging evidence suggests that the dynamic range of protein abundance is greater than that observed at the transcript level. Technological breakthroughs now mean that sequencing-based measurement of mRNA levels is routine, but protocols for measuring protein abundance remain both complex and expensive. This paper introduces a Bayesian network that integrates transcriptomic and proteomic data to predict protein abundance and to model the effects of its determinants. We aim to use this model to follow a molecular response over time, from condition-specific data, in order to understand adaptation during processes such as the cell cycle. With microarray data now available for many conditions, the general utility of a protein abundance predictor is broad. Whereas most quantitative proteomics studies have focused on higher organisms, we developed a predictive model of protein abundance for both Saccharomyces cerevisiae and Schizosaccharomyces pombe to explore the latitude at the protein level. Our predictor primarily relies on mRNA level, mRNA–protein interaction, mRNA folding energy and half-life, and tRNA adaptation. The combination of key features, allowing for the low certainty and uneven coverage of experimental observations, gives comparatively minor but robust prediction accuracy. The model substantially improved the analysis of protein regulation during the cell cycle: predicted protein abundance identified twice as many cell-cycle-associated proteins as experimental mRNA levels. Predicted protein abundance was more dynamic than observed mRNA expression, agreeing with experimental protein abundance from a human cell line. We illustrate how the same model can be used to predict the folding energy of mRNA when protein abundance is available, lending credence to the emerging view that mRNA folding affects translation efficiency. The software and data used in this research are available at http://bioinf.scmb.uq.edu.au/proteinabundance/. PMID:24532840
Predicting the dynamics of protein abundance.
Mehdi, Ahmed M; Patrick, Ralph; Bailey, Timothy L; Bodén, Mikael
2014-05-01
Protein synthesis is finely regulated across all organisms, from bacteria to humans, and its integrity underpins many important processes. Emerging evidence suggests that the dynamic range of protein abundance is greater than that observed at the transcript level. Technological breakthroughs now mean that sequencing-based measurement of mRNA levels is routine, but protocols for measuring protein abundance remain both complex and expensive. This paper introduces a Bayesian network that integrates transcriptomic and proteomic data to predict protein abundance and to model the effects of its determinants. We aim to use this model to follow a molecular response over time, from condition-specific data, in order to understand adaptation during processes such as the cell cycle. With microarray data now available for many conditions, the general utility of a protein abundance predictor is broad. Whereas most quantitative proteomics studies have focused on higher organisms, we developed a predictive model of protein abundance for both Saccharomyces cerevisiae and Schizosaccharomyces pombe to explore the latitude at the protein level. Our predictor primarily relies on mRNA level, mRNA-protein interaction, mRNA folding energy and half-life, and tRNA adaptation. The combination of key features, allowing for the low certainty and uneven coverage of experimental observations, gives comparatively minor but robust prediction accuracy. The model substantially improved the analysis of protein regulation during the cell cycle: predicted protein abundance identified twice as many cell-cycle-associated proteins as experimental mRNA levels. Predicted protein abundance was more dynamic than observed mRNA expression, agreeing with experimental protein abundance from a human cell line. We illustrate how the same model can be used to predict the folding energy of mRNA when protein abundance is available, lending credence to the emerging view that mRNA folding affects translation efficiency. The software and data used in this research are available at http://bioinf.scmb.uq.edu.au/proteinabundance/.
Kang, Wen-Bin; He, Chuan; Liu, Zhen-Xing; Wang, Jun; Wang, Wei
2018-05-16
Previous studies based on bioinformatics showed that there is a sharp distinction of structural features and residue composition between the intrinsically disordered proteins and the folded proteins. What induces such a composition-related structural transition? How do various kinds of interactions work in such processes? In this work, we investigate these problems based on a survey on peptides randomly composed of charged residues (including glutamic acids and lysines) and the residues with different hydrophobicity, such as alanines, glycines, or phenylalanines. Based on simulations using all-atom model and replica-exchange Monte Carlo method, a coil-globule transition is observed for each peptide. The corresponding transition temperature is found to be dependent on the contents of the hydrophobic and charged residues. For several cases, when the mean hydrophobicity is larger than a certain threshold, the transition temperature is higher than the room temperature, and vise versa. These thresholds of hydrophobicity and net charge are quantitatively consistent with the border line observed from the study of bioinformatics. These results outline the basic physical reasons for the compositional distinction between the intrinsically disordered proteins and the folded proteins. Furthermore, the contributions of various interactions to the structural variation of peptides are analyzed based on the contact statistics and the charge-pattern dependence of the gyration radii of the peptides. Our observations imply that the hydrophobicity contributes essentially to such composition-related transitions. Thus, we achieve a better understanding on composition-structure relation of the natural proteins and the underlying physics.
Zhu, Jianwei; Zhang, Haicang; Li, Shuai Cheng; Wang, Chao; Kong, Lupeng; Sun, Shiwei; Zheng, Wei-Mou; Bu, Dongbo
2017-12-01
Accurate recognition of protein fold types is a key step for template-based prediction of protein structures. The existing approaches to fold recognition mainly exploit the features derived from alignments of query protein against templates. These approaches have been shown to be successful for fold recognition at family level, but usually failed at superfamily/fold levels. To overcome this limitation, one of the key points is to explore more structurally informative features of proteins. Although residue-residue contacts carry abundant structural information, how to thoroughly exploit these information for fold recognition still remains a challenge. In this study, we present an approach (called DeepFR) to improve fold recognition at superfamily/fold levels. The basic idea of our approach is to extract fold-specific features from predicted residue-residue contacts of proteins using deep convolutional neural network (DCNN) technique. Based on these fold-specific features, we calculated similarity between query protein and templates, and then assigned query protein with fold type of the most similar template. DCNN has showed excellent performance in image feature extraction and image recognition; the rational underlying the application of DCNN for fold recognition is that contact likelihood maps are essentially analogy to images, as they both display compositional hierarchy. Experimental results on the LINDAHL dataset suggest that even using the extracted fold-specific features alone, our approach achieved success rate comparable to the state-of-the-art approaches. When further combining these features with traditional alignment-related features, the success rate of our approach increased to 92.3%, 82.5% and 78.8% at family, superfamily and fold levels, respectively, which is about 18% higher than the state-of-the-art approach at fold level, 6% higher at superfamily level and 1% higher at family level. An independent assessment on SCOP_TEST dataset showed consistent performance improvement, indicating robustness of our approach. Furthermore, bi-clustering results of the extracted features are compatible with fold hierarchy of proteins, implying that these features are fold-specific. Together, these results suggest that the features extracted from predicted contacts are orthogonal to alignment-related features, and the combination of them could greatly facilitate fold recognition at superfamily/fold levels and template-based prediction of protein structures. Source code of DeepFR is freely available through https://github.com/zhujianwei31415/deepfr, and a web server is available through http://protein.ict.ac.cn/deepfr. zheng@itp.ac.cn or dbu@ict.ac.cn. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
McClure, Michelle L.; Barnes, Stephen; Brodsky, Jeffrey L.
2016-01-01
Posttranslational modifications add diversity to protein function. Throughout its life cycle, the cystic fibrosis transmembrane conductance regulator (CFTR) undergoes numerous covalent posttranslational modifications (PTMs), including glycosylation, ubiquitination, sumoylation, phosphorylation, and palmitoylation. These modifications regulate key steps during protein biogenesis, such as protein folding, trafficking, stability, function, and association with protein partners and therefore may serve as targets for therapeutic manipulation. More generally, an improved understanding of molecular mechanisms that underlie CFTR PTMs may suggest novel treatment strategies for CF and perhaps other protein conformational diseases. This review provides a comprehensive summary of co- and posttranslational CFTR modifications and their significance with regard to protein biogenesis. PMID:27474090
What determines the spectrum of protein native state structures?
Lezon, Timothy R; Banavar, Jayanth R; Lesk, Arthur M; Maritan, Amos
2006-05-01
We present a brief summary of the key factors underlying protein structure, as developed in the investigations of Pauling, Ramachandran, and Rose. We then outline a simplified physical model of proteins that focuses on geometry and symmetry. Although this model superficially appears unrelated to the detailed chemical descriptions commonly applied to proteins, we show that it captures the essential elements of the chemistry and provides a unified framework for understanding the common characteristics of folded proteins. We suggest that the spectrum of protein native state structures is determined by geometry and symmetry and the role of the sequence is to choose its native state structure from this predetermined menu. 2006 Wiley-Liss, Inc.
Theory and simulation of explicit solvent effects on protein folding in vitro and in vivo
NASA Astrophysics Data System (ADS)
England, Jeremy L.
The aim of this work is to develop theoretical tools for understanding what happens to water that is confined in amphipathic cavities, and for testing the consequences of this understanding for protein folding in vitro and in vivo. We begin in the first chapter with a brief review of the theoretical and simulation literature on the hydrophobic effect and the aqueous solvation of charged species that also puts forward a simple theoretical framework within which various solvation phenomena reported in past studies may be unified. Subsequently, in the second chapter we also review past computational and theoretical work on the specific question of how chaperonin complexes assist the folding of their substrates. With the context set, we turn in Chapter 3 to the case of an open system with water trapped between hydrophobic plates that experiences a uniform electric field normal to and between the plates. Classic bulk theory of electrostriction in polarizable fluids tells us that the electric field should cause an increase in local water density as it rises, yet some simulations have suggested the opposite. We present a mean-field Potts model we have developed to explain this discrepancy, and show how such a simple, coarse-grained lattice description can capture the fundamental consequences of the fact that external electric fields can frustrate the hydrogen bond network in confined water. Chapter 4 continues to pursue the issue of solvent evacuation between hydrophobic plates, but focuses on the impact of chemical denaturants on hydrophobic effects using molecular dynamics simulations of hydrophobic dewetting. We find that while urea and guanidinium have similar qualitative effects at the bulk level, they seem to differ in the microscopic mechanism by which they denature proteins, although both inhibit the onset of dewetting. Lastly, Chapters 5 and 6 examine the potential importance of solvent-mediated forces to protein folding in vivo. Chapter 5 develops a Landau-Ginzburg-type model for solvent free energy and lays out a theoretical argument for a mechanism by which chaperonins may promote the folding of their substrates through a local enhancement of the hydrophobic effect. With this argument in hand, we show results in Chapter 6 from molecular dynamics simulations we performed of different mutants of the bacterial chaperonin GroEL, which demonstrate that the hydrophilicity of the chaperonin cavity correlates with the experimentally measured ability of the cavity to facilitate folding.
Kwa, Lee Gyan; Wensley, Beth G.; Alexander, Crispin G.; Browning, Stuart J.; Lichman, Benjamin R.; Clarke, Jane
2014-01-01
Three homologous spectrin domains have remarkably different folding characteristics. We have previously shown that the slow-folding R16 and R17 spectrin domains can be altered to resemble the fast folding R15, in terms of speed of folding (and unfolding), landscape roughness and folding mechanism, simply by substituting five residues in the core. Here we show that, by contrast, R15 cannot be engineered to resemble R16 and R17. It is possible to engineer a slow-folding version of R15, but our analysis shows that this protein neither has a rougher energy landscape nor does change its folding mechanism. Quite remarkably, R15 appears to be a rare example of a protein with a folding nucleus that does not change in position or in size when its folding nucleus is disrupted. Thus, while two members of this protein family are remarkably plastic, the third has apparently a restricted folding landscape. PMID:24373753
Hydrogen-Bond Driven Loop-Closure Kinetics in Unfolded Polypeptide Chains
Daidone, Isabella; Neuweiler, Hannes; Doose, Sören; Sauer, Markus; Smith, Jeremy C.
2010-01-01
Characterization of the length dependence of end-to-end loop-closure kinetics in unfolded polypeptide chains provides an understanding of early steps in protein folding. Here, loop-closure in poly-glycine-serine peptides is investigated by combining single-molecule fluorescence spectroscopy with molecular dynamics simulation. For chains containing more than 10 peptide bonds loop-closing rate constants on the 20–100 nanosecond time range exhibit a power-law length dependence. However, this scaling breaks down for shorter peptides, which exhibit slower kinetics arising from a perturbation induced by the dye reporter system used in the experimental setup. The loop-closure kinetics in the longer peptides is found to be determined by the formation of intra-peptide hydrogen bonds and transient β-sheet structure, that accelerate the search for contacts among residues distant in sequence relative to the case of a polypeptide chain in which hydrogen bonds cannot form. Hydrogen-bond-driven polypeptide-chain collapse in unfolded peptides under physiological conditions found here is not only consistent with hierarchical models of protein folding, that highlights the importance of secondary structure formation early in the folding process, but is also shown to speed up the search for productive folding events. PMID:20098498
DOE Office of Scientific and Technical Information (OSTI.GOV)
Middleton, Sarah A.; Illuminati, Joseph; Kim, Junhyong
Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this methodmore » by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Finally, our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families.« less
Banci, Lucia; Bertini, Ivano; Cefaro, Chiara; Cenacchi, Lucia; Ciofi-Baffoni, Simone; Felli, Isabella Caterina; Gallo, Angelo; Gonnelli, Leonardo; Luchinat, Enrico; Sideris, Dionisia; Tokatlidis, Kostas
2010-01-01
Several proteins of the mitochondrial intermembrane space are targeted by internal targeting signals. A class of such proteins with α-helical hairpin structure bridged by two intramolecular disulfides is trapped by a Mia40-dependent oxidative process. Here, we describe the oxidative folding mechanism underpinning this process by an exhaustive structural characterization of the protein in all stages and as a complex with Mia40. Two consecutive induced folding steps are at the basis of the protein-trapping process. In the first one, Mia40 functions as a molecular chaperone assisting α-helical folding of the internal targeting signal of the substrate. Subsequently, in a Mia40-independent manner, folding of the second substrate helix is induced by the folded targeting signal functioning as a folding scaffold. The Mia40-induced folding pathway provides a proof of principle for the general concept that internal targeting signals may operate as a folding nucleus upon compartment-specific activation. PMID:21059946
A Particle Swarm Optimization-Based Approach with Local Search for Predicting Protein Folding.
Yang, Cheng-Hong; Lin, Yu-Shiun; Chuang, Li-Yeh; Chang, Hsueh-Wei
2017-10-01
The hydrophobic-polar (HP) model is commonly used for predicting protein folding structures and hydrophobic interactions. This study developed a particle swarm optimization (PSO)-based algorithm combined with local search algorithms; specifically, the high exploration PSO (HEPSO) algorithm (which can execute global search processes) was combined with three local search algorithms (hill-climbing algorithm, greedy algorithm, and Tabu table), yielding the proposed HE-L-PSO algorithm. By using 20 known protein structures, we evaluated the performance of the HE-L-PSO algorithm in predicting protein folding in the HP model. The proposed HE-L-PSO algorithm exhibited favorable performance in predicting both short and long amino acid sequences with high reproducibility and stability, compared with seven reported algorithms. The HE-L-PSO algorithm yielded optimal solutions for all predicted protein folding structures. All HE-L-PSO-predicted protein folding structures possessed a hydrophobic core that is similar to normal protein folding.
Cotranslational structure acquisition of nascent polypeptides monitored by NMR spectroscopy.
Eichmann, Cédric; Preissler, Steffen; Riek, Roland; Deuerling, Elke
2010-05-18
The folding of proteins in living cells may start during their synthesis when the polypeptides emerge gradually at the ribosomal exit tunnel. However, our current understanding of cotranslational folding processes at the atomic level is limited. We employed NMR spectroscopy to monitor the conformation of the SH3 domain from alpha-spectrin at sequential stages of elongation via in vivo ribosome-arrested (15)N,(13)C-labeled nascent polypeptides. These nascent chains exposed either the entire SH3 domain or C-terminally truncated segments thereof, thus providing snapshots of the translation process. We show that nascent SH3 polypeptides remain unstructured during elongation but fold into a compact, native-like beta-sheet assembly when the entire sequence information is available. Moreover, the ribosome neither imposes major conformational constraints nor significantly interacts with exposed unfolded nascent SH3 domain moieties. Our data provide evidence for a domainwise folding of the SH3 domain on ribosomes without significant population of folding intermediates. The domain follows a thermodynamically favorable pathway in which sequential folding units are stabilized, thus avoiding kinetic traps during the process of cotranslational folding.
Mining sequential patterns for protein fold recognition.
Exarchos, Themis P; Papaloukas, Costas; Lampros, Christos; Fotiadis, Dimitrios I
2008-02-01
Protein data contain discriminative patterns that can be used in many beneficial applications if they are defined correctly. In this work sequential pattern mining (SPM) is utilized for sequence-based fold recognition. Protein classification in terms of fold recognition plays an important role in computational protein analysis, since it can contribute to the determination of the function of a protein whose structure is unknown. Specifically, one of the most efficient SPM algorithms, cSPADE, is employed for the analysis of protein sequence. A classifier uses the extracted sequential patterns to classify proteins in the appropriate fold category. For training and evaluating the proposed method we used the protein sequences from the Protein Data Bank and the annotation of the SCOP database. The method exhibited an overall accuracy of 25% in a classification problem with 36 candidate categories. The classification performance reaches up to 56% when the five most probable protein folds are considered.
Confinement in nanopores can destabilize α-helix folding proteins and stabilize the β structures
NASA Astrophysics Data System (ADS)
Javidpour, Leili; Sahimi, Muhammad
2011-09-01
Protein folding in confined media has attracted wide attention over the past decade due to its importance in both in vivo and in vitro applications. Currently, it is generally believed that protein stability increases by decreasing the size of the confining medium, if its interaction with the confining walls is repulsive, and that the maximum folding temperature in confinement occurs for a pore size only slightly larger than the smallest dimension of the folded state of a protein. Protein stability in pore sizes, very close to the size of the folded state, has not however received the attention that it deserves. Using detailed, 0.3-ms-long molecular dynamics simulations, we show that proteins with an α-helix native state can have an optimal folding temperature in pore sizes that do not affect the folded-state structure. In contradiction to the current theoretical explanations, we find that the maximum folding temperature occurs in larger pores for smaller α-helices. In highly confined pores the free energy surface becomes rough, and a new barrier for protein folding may appear close to the unfolded state. In addition, in small nanopores the protein states that contain the β structures are entropically stabilized, in contrast to the bulk. As a consequence, folding rates decrease notably and the free energy surface becomes rougher. The results shed light on many recent experimental observations that cannot be explained by the current theories, and demonstrate the importance of entropic effects on proteins' misfolded states in highly confined environments. They also support the concept of passive effect of chaperonin GroEL on protein folding by preventing it from aggregation in crowded environment of biological cells, and provide deeper clues to the α → β conformational transition, believed to contribute to Alzheimer's and Parkinson's diseases. The strategy of protein and enzyme stabilization in confined media may also have to be revisited in the case of tight confinement. For in silico studies of protein folding in confined media, use of non-Go potentials may be more appropriate.
Shaheen, Alaa
2018-05-05
The unfolded protein response (UPR) is an adaptive cellular response that aims to relieve endoplasmic reticulum (ER) stress via several mechanisms, including inhibition of protein synthesis and enhancement of protein folding and degradation. There is a controversy over the effect of the UPR on ER protein export. While some investigators suggested that ER export is inhibited during ER stress, others suggested the opposite. In this article, their conflicting studies are analyzed and compared in attempt to solve this controversy. The UPR appears indeed to enhance ER export, possibly via multiple mechanisms. However, another factor, which is the integrity of the folding machinery/environment inside ER, determines whether ER export will appear increased or decreased during experimentation. Also, different methods of stress induction appear to have different effects on ER export. Thus, improvement of ER export may represent a new mechanism by which the UPR alleviates ER stress. This may help researchers to understand how the UPR works inside cells and how to manipulate it to alter cell fate during stress, either to promote cell survival or death. This may open up new approaches for the treatment of ER stress-related diseases.
Castillo, Virginia; Graña-Montes, Ricardo; Sabate, Raimon; Ventura, Salvador
2011-06-01
In the cell, protein folding into stable globular conformations is in competition with aggregation into non-functional and usually toxic structures, since the biophysical properties that promote folding also tend to favor intermolecular contacts, leading to the formation of β-sheet-enriched insoluble assemblies. The formation of protein deposits is linked to at least 20 different human disorders, ranging from dementia to diabetes. Furthermore, protein deposition inside cells represents a major obstacle for the biotechnological production of polypeptides. Importantly, the aggregation behavior of polypeptides appears to be strongly influenced by the intrinsic properties encoded in their sequences and specifically by the presence of selective short regions with high aggregation propensity. This allows computational methods to be used to analyze the aggregation properties of proteins without the previous requirement for structural information. Applications range from the identification of individual amyloidogenic regions in disease-linked polypeptides to the analysis of the aggregation properties of complete proteomes. Herein, we review these theoretical approaches and illustrate how they have become important and useful tools in understanding the molecular mechanisms underlying protein aggregation. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
How Does Your Protein Fold? Elucidating the Apomyoglobin Folding Pathway
Dyson, H. Jane; Wright, Peter E.
2017-01-01
Conspectus Although each type of protein fold and in some cases individual proteins within a fold classification can have very different mechanisms of folding, the underlying biophysical and biochemical principles that operate to cause a linear polypeptide chain to fold into a globular structure must be the same. In an aqueous solution, the protein takes up the thermodynamically most stable structure, but the pathway along which the polypeptide proceeds in order to reach that structure is a function of the amino acid sequence, which must be the final determining factor, not only in shaping the final folded structure, but in dictating the folding pathway. A number of groups have focused on a single protein or group of proteins, to determine in detail the factors that influence the rate and mechanism of folding in a defined system, with the hope that hypothesis-driven experiments can elucidate the underlying principles governing the folding process. Our research group has focused on the folding of the globin family of proteins, and in particular on the monomeric protein apomyoglobin. Apomyoglobin (apoMb) folds relatively slowly (~2 seconds) via an ensemble of obligatory intermediates that form rapidly after the initiation of folding. The folding pathway can be dissected using rapid-mixing techniques, which can probe processes in the millisecond time range. Stopped-flow measurements detected by circular dichroism (CD) or fluorescence spectroscopy give information on the rates of folding events. Quench-flow experiments utilize the differential rates of hydrogen-deuterium exchange of amide protons protected in parts of the structure that are folded early; protection of amides can be detected by mass spectrometry or proton nuclear magnetic resonance spectroscopy (NMR). In addition, apoMb forms an intermediate at equilibrium at pH ~ 4, which is sufficiently stable for it to be structurally characterized by solution methods such as CD, fluorescence and NMR spectroscopies, and the conformational ensembles formed in the presence of denaturing agents and low pH can be characterized as models for the unfolded states of the protein. Newer NMR techniques such as measurement of residual dipolar couplings in the various partly folded states, and relaxation dispersion measurements to probe invisible states present at low concentrations, have contributed to providing a detailed picture of the apomyoglobin folding pathway. The research summarized in this review was aimed at characterizing and comparing the equilibrium and kinetic intermediates both structurally and dynamically, as well as delineating the complete folding pathway at a residue-specific level, in order to answer the question “What is it about the amino acid sequence that causes each molecule in the unfolded protein ensemble to start folding, and, once started, to proceed towards the formation of the correctly folded three-dimensional structure?” PMID:28032989
Ab initio folding of proteins using all-atom discrete molecular dynamics
Ding, Feng; Tsao, Douglas; Nie, Huifen; Dokholyan, Nikolay V.
2008-01-01
Summary Discrete molecular dynamics (DMD) is a rapid sampling method used in protein folding and aggregation studies. Until now, DMD was used to perform simulations of simplified protein models in conjunction with structure-based force fields. Here, we develop an all-atom protein model and a transferable force field featuring packing, solvation, and environment-dependent hydrogen bond interactions. Using the replica exchange method, we perform folding simulations of six small proteins (20–60 residues) with distinct native structures. In all cases, native or near-native states are reached in simulations. For three small proteins, multiple folding transitions are observed and the computationally-characterized thermodynamics are in quantitative agreement with experiments. The predictive power of all-atom DMD highlights the importance of environment-dependent hydrogen bond interactions in modeling protein folding. The developed approach can be used for accurate and rapid sampling of conformational spaces of proteins and protein-protein complexes, and applied to protein engineering and design of protein-protein interactions. PMID:18611374
Ganguly, Debabani; Zhang, Weihong; Chen, Jianhan
2013-01-01
Achieving facile specific recognition is essential for intrinsically disordered proteins (IDPs) that are involved in cellular signaling and regulation. Consideration of the physical time scales of protein folding and diffusion-limited protein-protein encounter has suggested that the frequent requirement of protein folding for specific IDP recognition could lead to kinetic bottlenecks. How IDPs overcome such potential kinetic bottlenecks to viably function in signaling and regulation in general is poorly understood. Our recent computational and experimental study of cell-cycle regulator p27 (Ganguly et al., J. Mol. Biol. (2012)) demonstrated that long-range electrostatic forces exerted on enriched charges of IDPs could accelerate protein-protein encounter via “electrostatic steering” and at the same time promote “folding-competent” encounter topologies to enhance the efficiency of IDP folding upon encounter. Here, we further investigated the coupled binding and folding mechanisms and the roles of electrostatic forces in the formation of three IDP complexes with more complex folded topologies. The surface electrostatic potentials of these complexes lack prominent features like those observed for the p27/Cdk2/cyclin A complex to directly suggest the ability of electrostatic forces to facilitate folding upon encounter. Nonetheless, similar electrostatically accelerated encounter and folding mechanisms were consistently predicted for all three complexes using topology-based coarse-grained simulations. Together with our previous analysis of charge distributions in known IDP complexes, our results support a prevalent role of electrostatic interactions in promoting efficient coupled binding and folding for facile specific recognition. These results also suggest that there is likely a co-evolution of IDP folded topology, charge characteristics, and coupled binding and folding mechanisms, driven at least partially by the need to achieve fast association kinetics for cellular signaling and regulation. PMID:24278008
Hydrogen bonds are a primary driving force for de novo protein folding
Lee, Schuyler; Wang, Chao; Liu, Haolin; ...
2017-11-10
The protein-folding mechanism remains a major puzzle in life science. Purified soluble activation-induced cytidine deaminase (AID) is one of the most difficult proteins to obtain. Starting from inclusion bodies containing a C-terminally truncated version of AID (residues 1–153; AID 153 ), an optimized in vitro folding procedure was derived to obtain large amounts of AID 153 , which led to crystals with good quality and to final structural determination. Interestingly, it was found that the final refolding yield of the protein is proline residue-dependent. The difference in the distribution of cis and trans configurations of proline residues in the proteinmore » after complete denaturation is a major determining factor of the final yield. A point mutation of one of four proline residues to an asparagine led to a near-doubling of the yield of refolded protein after complete denaturation. It was concluded that the driving force behind protein folding could not overcome the cis -to- trans proline isomerization, or vice versa , during the protein-folding process. Furthermore, it was found that successful refolding of proteins optimally occurs at high pH values, which may mimic protein folding in vivo . It was found that high pH values could induce the polarization of peptide bonds, which may trigger the formation of protein secondary structures through hydrogen bonds. It is proposed that a hydrophobic environment coupled with negative charges is essential for protein folding. Combined with our earlier discoveries on protein-unfolding mechanisms, it is proposed that hydrogen bonds are a primary driving force for de novo protein folding.« less
Hydrogen bonds are a primary driving force for de novo protein folding
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, Schuyler; Wang, Chao; Liu, Haolin
The protein-folding mechanism remains a major puzzle in life science. Purified soluble activation-induced cytidine deaminase (AID) is one of the most difficult proteins to obtain. Starting from inclusion bodies containing a C-terminally truncated version of AID (residues 1–153; AID 153 ), an optimized in vitro folding procedure was derived to obtain large amounts of AID 153 , which led to crystals with good quality and to final structural determination. Interestingly, it was found that the final refolding yield of the protein is proline residue-dependent. The difference in the distribution of cis and trans configurations of proline residues in the proteinmore » after complete denaturation is a major determining factor of the final yield. A point mutation of one of four proline residues to an asparagine led to a near-doubling of the yield of refolded protein after complete denaturation. It was concluded that the driving force behind protein folding could not overcome the cis -to- trans proline isomerization, or vice versa , during the protein-folding process. Furthermore, it was found that successful refolding of proteins optimally occurs at high pH values, which may mimic protein folding in vivo . It was found that high pH values could induce the polarization of peptide bonds, which may trigger the formation of protein secondary structures through hydrogen bonds. It is proposed that a hydrophobic environment coupled with negative charges is essential for protein folding. Combined with our earlier discoveries on protein-unfolding mechanisms, it is proposed that hydrogen bonds are a primary driving force for de novo protein folding.« less
The Folding of de Novo Designed Protein DS119 via Molecular Dynamics Simulations.
Wang, Moye; Hu, Jie; Zhang, Zhuqing
2016-04-26
As they are not subjected to natural selection process, de novo designed proteins usually fold in a manner different from natural proteins. Recently, a de novo designed mini-protein DS119, with a βαβ motif and 36 amino acids, has folded unusually slowly in experiments, and transient dimers have been detected in the folding process. Here, by means of all-atom replica exchange molecular dynamics (REMD) simulations, several comparably stable intermediate states were observed on the folding free-energy landscape of DS119. Conventional molecular dynamics (CMD) simulations showed that when two unfolded DS119 proteins bound together, most binding sites of dimeric aggregates were located at the N-terminal segment, especially residues 5-10, which were supposed to form β-sheet with its own C-terminal segment. Furthermore, a large percentage of individual proteins in the dimeric aggregates adopted conformations similar to those in the intermediate states observed in REMD simulations. These results indicate that, during the folding process, DS119 can easily become trapped in intermediate states. Then, with diffusion, a transient dimer would be formed and stabilized with the binding interface located at N-terminals. This means that it could not quickly fold to the native structure. The complicated folding manner of DS119 implies the important influence of natural selection on protein-folding kinetics, and more improvement should be achieved in rational protein design.
The Folding of de Novo Designed Protein DS119 via Molecular Dynamics Simulations
Wang, Moye; Hu, Jie; Zhang, Zhuqing
2016-01-01
As they are not subjected to natural selection process, de novo designed proteins usually fold in a manner different from natural proteins. Recently, a de novo designed mini-protein DS119, with a βαβ motif and 36 amino acids, has folded unusually slowly in experiments, and transient dimers have been detected in the folding process. Here, by means of all-atom replica exchange molecular dynamics (REMD) simulations, several comparably stable intermediate states were observed on the folding free-energy landscape of DS119. Conventional molecular dynamics (CMD) simulations showed that when two unfolded DS119 proteins bound together, most binding sites of dimeric aggregates were located at the N-terminal segment, especially residues 5–10, which were supposed to form β-sheet with its own C-terminal segment. Furthermore, a large percentage of individual proteins in the dimeric aggregates adopted conformations similar to those in the intermediate states observed in REMD simulations. These results indicate that, during the folding process, DS119 can easily become trapped in intermediate states. Then, with diffusion, a transient dimer would be formed and stabilized with the binding interface located at N-terminals. This means that it could not quickly fold to the native structure. The complicated folding manner of DS119 implies the important influence of natural selection on protein-folding kinetics, and more improvement should be achieved in rational protein design. PMID:27128902
Hartl, F. Ulrich
2017-01-01
Protein folding in the cell was originally assumed to be a spontaneous process, based on Anfinsen’s discovery that purified proteins can fold on their own after removal from denaturant. Consequently cell biologists showed little interest in the protein folding process. This changed only in the mid and late 1980s, when the chaperone story began to unfold. As a result, we now know that in vivo, protein folding requires assistance by a complex machinery of molecular chaperones. To ensure efficient folding, members of different chaperone classes receive the nascent protein chain emerging from the ribosome and guide it along an ordered pathway toward the native state. I was fortunate to contribute to these developments early on. In this short essay, I will describe some of the critical steps leading to the current concept of protein folding as a highly organized cellular process. PMID:29084909
Conformational dynamics of a protein in the folded and the unfolded state
NASA Astrophysics Data System (ADS)
Fitter, Jörg
2003-08-01
In a quasielastic neutron scattering experiment, the picosecond dynamics of α-amylase was investigated for the folded and the unfolded state of the protein. In order to ensure a reasonable interpretation of the internal protein dynamics, the protein was measured in D 2O-buffer solution. The much higher structural flexibility of the pH induced unfolded state as compared to the native folded state was quantified using a simple analytical model, describing a local diffusion inside a sphere. In terms of this model the conformational volume, which is explored mainly by confined protein side-chain movements, is parameterized by the radius of a sphere (folded state, r=1.2 Å; unfolded state, 1.8 Å). Differences in conformational dynamics between the folded and the unfolded state of a protein are of fundamental interest in the field of protein science, because they are assumed to play an important role for the thermodynamics of folding/unfolding transition and for protein stability.
Charlier, Cyril; Alderson, T. Reid; Courtney, Joseph M.; Ying, Jinfa; Anfinrud, Philip
2018-01-01
In general, small proteins rapidly fold on the timescale of milliseconds or less. For proteins with a substantial volume difference between the folded and unfolded states, their thermodynamic equilibrium can be altered by varying the hydrostatic pressure. Using a pressure-sensitized mutant of ubiquitin, we demonstrate that rapidly switching the pressure within an NMR sample cell enables study of the unfolded protein under native conditions and, vice versa, study of the native protein under denaturing conditions. This approach makes it possible to record 2D and 3D NMR spectra of the unfolded protein at atmospheric pressure, providing residue-specific information on the folding process. 15N and 13C chemical shifts measured immediately after dropping the pressure from 2.5 kbar (favoring unfolding) to 1 bar (native) are close to the random-coil chemical shifts observed for a large, disordered peptide fragment of the protein. However, 15N relaxation data show evidence for rapid exchange, on a ∼100-μs timescale, between the unfolded state and unstable, structured states that can be considered as failed folding events. The NMR data also provide direct evidence for parallel folding pathways, with approximately one-half of the protein molecules efficiently folding through an on-pathway kinetic intermediate, whereas the other half fold in a single step. At protein concentrations above ∼300 μM, oligomeric off-pathway intermediates compete with folding of the native state. PMID:29666248
GroEL actively stimulates folding of the endogenous substrate protein PepQ.
Weaver, Jeremy; Jiang, Mengqiu; Roth, Andrew; Puchalla, Jason; Zhang, Junjie; Rye, Hays S
2017-06-30
Many essential proteins cannot fold without help from chaperonins, like the GroELS system of Escherichia coli. How chaperonins accelerate protein folding remains controversial. Here we test key predictions of both passive and active models of GroELS-stimulated folding, using the endogenous E. coli metalloprotease PepQ. While GroELS increases the folding rate of PepQ by over 15-fold, we demonstrate that slow spontaneous folding of PepQ is not caused by aggregation. Fluorescence measurements suggest that, when folding inside the GroEL-GroES cavity, PepQ populates conformations not observed during spontaneous folding in free solution. Using cryo-electron microscopy, we show that the GroEL C-termini make physical contact with the PepQ folding intermediate and help retain it deep within the GroEL cavity, resulting in reduced compactness of the PepQ monomer. Our findings strongly support an active model of chaperonin-mediated protein folding, where partial unfolding of misfolded intermediates plays a key role.
Complete fold annotation of the human proteome using a novel structural feature space
Middleton, Sarah A.; Illuminati, Joseph; Kim, Junhyong
2017-04-13
Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this methodmore » by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Finally, our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families.« less
Li, Bai; Lin, Mu; Liu, Qiao; Li, Ya; Zhou, Changjun
2015-10-01
Protein folding is a fundamental topic in molecular biology. Conventional experimental techniques for protein structure identification or protein folding recognition require strict laboratory requirements and heavy operating burdens, which have largely limited their applications. Alternatively, computer-aided techniques have been developed to optimize protein structures or to predict the protein folding process. In this paper, we utilize a 3D off-lattice model to describe the original protein folding scheme as a simplified energy-optimal numerical problem, where all types of amino acid residues are binarized into hydrophobic and hydrophilic ones. We apply a balance-evolution artificial bee colony (BE-ABC) algorithm as the minimization solver, which is featured by the adaptive adjustment of search intensity to cater for the varying needs during the entire optimization process. In this work, we establish a benchmark case set with 13 real protein sequences from the Protein Data Bank database and evaluate the convergence performance of BE-ABC algorithm through strict comparisons with several state-of-the-art ABC variants in short-term numerical experiments. Besides that, our obtained best-so-far protein structures are compared to the ones in comprehensive previous literature. This study also provides preliminary insights into how artificial intelligence techniques can be applied to reveal the dynamics of protein folding. Graphical Abstract Protein folding optimization using 3D off-lattice model and advanced optimization techniques.
Molecular dynamics studies of protein folding and aggregation
NASA Astrophysics Data System (ADS)
Ding, Feng
This thesis applies molecular dynamics simulations and statistical mechanics to study: (i) protein folding; and (ii) protein aggregation. Most small proteins fold into their native states via a first-order-like phase transition with a major free energy barrier between the folded and unfolded states. A set of protein conformations corresponding to the free energy barrier, Delta G >> kBT, are the folding transition state ensemble (TSE). Due to their evasive nature, TSE conformations are hard to capture (probability ∝ exp(-DeltaG/k BT)) and characterize. A coarse-grained discrete molecular dynamics model with realistic steric constraints is constructed to reproduce the experimentally observed two-state folding thermodynamics. A kinetic approach is proposed to identify the folding TSE. A specific set of contacts, common to the TSE conformations, is identified as the folding nuclei which are necessary to be formed in order for the protein to fold. Interestingly, the amino acids at the site of the identified folding nuclei are highly conserved for homologous proteins sharing the same structures. Such conservation suggests that amino acids that are important for folding kinetics are under selective pressure to be preserved during the course of molecular evolution. In addition, studies of the conformations close to the transition states uncover the importance of topology in the construction of order parameter for protein folding transition. Misfolded proteins often form insoluble aggregates, amyloid fibrils, that deposit in the extracellular space and lead to a type of disease known as amyloidosis. Due to its insoluble and non-crystalline nature, the aggregation structure and, thus the aggregation mechanism, has yet to be uncovered. Discrete molecular dynamics studies reveal an aggregate structure with the same structural signatures as in experimental observations and show a nucleation aggregation scenario. The simulations also suggest a generic aggregation mechanism that globular proteins under a denaturing environment partially unfold and aggregate by forming stabilizing hydrogen bonds between the backbones of the partial folded substructures. Proteins or peptides rich in alpha-helices also aggregate into beta-rich amyloid fibrils. Upon aggregation, the protein or peptide undergoes a conformational transition from alpha-helices to beta-sheets. The transition of alpha-helix to beta-hairpin (two-stranded beta-sheet) is studied in an all-heavy-atom discrete molecular dynamics model of a polyalanine chain. An entropical driving scenario for the alpha-helix to beta-hairpin transition is discovered.
Generation of a consensus protein domain dictionary
Schaeffer, R. Dustin; Jonsson, Amanda L.; Simms, Andrew M.; Daggett, Valerie
2011-01-01
Motivation: The discovery of new protein folds is a relatively rare occurrence even as the rate of protein structure determination increases. This rarity reinforces the concept of folds as reusable units of structure and function shared by diverse proteins. If the folding mechanism of proteins is largely determined by their topology, then the folding pathways of members of existing folds could encompass the full set used by globular protein domains. Results: We have used recent versions of three common protein domain dictionaries (SCOP, CATH and Dali) to generate a consensus domain dictionary (CDD). Surprisingly, 40% of the metafolds in the CDD are not composed of autonomous structural domains, i.e. they are not plausible independent folding units. This finding has serious ramifications for bioinformatics studies mining these domain dictionaries for globular protein properties. However, our main purpose in deriving this CDD was to generate an updated CDD to choose targets for MD simulation as part of our dynameomics effort, which aims to simulate the native and unfolding pathways of representatives of all globular protein consensus folds (metafolds). Consequently, we also compiled a list of representative protein targets of each metafold in the CDD. Availability and implementation: This domain dictionary is available at www.dynameomics.org. Contact: daggett@u.washington.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:21068000
Warepam, Marina; Sharma, Gurumayum Suraj; Dar, Tanveer Ali; Khan, Md. Khurshid Alam; Singh, Laishram Rajendrakumar
2014-01-01
Osmolytes are low molecular weight organic molecules accumulated by organisms to assist proper protein folding, and to provide protection to the structural integrity of proteins under denaturing stress conditions. It is known that osmolyte-induced protein folding is brought by unfavorable interaction of osmolytes with the denatured/unfolded states. The interaction of osmolyte with the native state does not significantly contribute to the osmolyte-induced protein folding. We have therefore investigated if different denatured states of a protein (generated by different denaturing agents) interact differently with the osmolytes to induce protein folding. We observed that osmolyte-assisted refolding of protein obtained from heat-induced denatured state produces native molecules with higher enzyme activity than those initiated from GdmCl- or urea-induced denatured state indicating that the structural property of the initial denatured state during refolding by osmolytes determines the catalytic efficiency of the folded protein molecule. These conclusions have been reached from the systematic measurements of enzymatic kinetic parameters (K m and k cat), thermodynamic stability (T m and ΔH m) and secondary and tertiary structures of the folded native proteins obtained from refolding of various denatured states (due to heat-, urea- and GdmCl-induced denaturation) of RNase-A in the presence of various osmolytes. PMID:25313668
Selection of stably folded proteins by phage-display with proteolysis.
Bai, Yawen; Feng, Hanqiao
2004-05-01
To facilitate the process of protein design and learn the basic rules that control the structure and stability of proteins, combinatorial methods have been developed to select or screen proteins with desired properties from libraries of mutants. One such method uses phage-display and proteolysis to select stably folded proteins. This method does not rely on specific properties of proteins for selection. Therefore, in principle it can be applied to any protein. Since its first demonstration in 1998, the method has been used to create hyperthermophilic proteins, to evolve novel folded domains from a library generated by combinatorial shuffling of polypeptide segments and to convert a partially unfolded structure to a fully folded protein.
Gromiha, M Michael; Anoosha, P; Huang, Liang-Tsung
2016-01-01
Protein stability is the free energy difference between unfolded and folded states of a protein, which lies in the range of 5-25 kcal/mol. Experimentally, protein stability is measured with circular dichroism, differential scanning calorimetry, and fluorescence spectroscopy using thermal and denaturant denaturation methods. These experimental data have been accumulated in the form of a database, ProTherm, thermodynamic database for proteins and mutants. It also contains sequence and structure information of a protein, experimental methods and conditions, and literature information. Different features such as search, display, and sorting options and visualization tools have been incorporated in the database. ProTherm is a valuable resource for understanding/predicting the stability of proteins and it can be accessed at http://www.abren.net/protherm/ . ProTherm has been effectively used to examine the relationship among thermodynamics, structure, and function of proteins. We describe the recent progress on the development of methods for understanding/predicting protein stability, such as (1) general trends on mutational effects on stability, (2) relationship between the stability of protein mutants and amino acid properties, (3) applications of protein three-dimensional structures for predicting their stability upon point mutations, (4) prediction of protein stability upon single mutations from amino acid sequence, and (5) prediction methods for addressing double mutants. A list of online resources for predicting has also been provided.
Atomic interaction networks in the core of protein domains and their native folds.
Soundararajan, Venkataramanan; Raman, Rahul; Raguram, S; Sasisekharan, V; Sasisekharan, Ram
2010-02-23
Vastly divergent sequences populate a majority of protein folds. In the quest to identify features that are conserved within protein domains belonging to the same fold, we set out to examine the entire protein universe on a fold-by-fold basis. We report that the atomic interaction network in the solvent-unexposed core of protein domains are fold-conserved, extraordinary sequence divergence notwithstanding. Further, we find that this feature, termed protein core atomic interaction network (or PCAIN) is significantly distinguishable across different folds, thus appearing to be "signature" of a domain's native fold. As part of this study, we computed the PCAINs for 8698 representative protein domains from families across the 1018 known protein folds to construct our seed database and an automated framework was developed for PCAIN-based characterization of the protein fold universe. A test set of randomly selected domains that are not in the seed database was classified with over 97% accuracy, independent of sequence divergence. As an application of this novel fold signature, a PCAIN-based scoring scheme was developed for comparative (homology-based) structure prediction, with 1-2 angstroms (mean 1.61A) C(alpha) RMSD generally observed between computed structures and reference crystal structures. Our results are consistent across the full spectrum of test domains including those from recent CASP experiments and most notably in the 'twilight' and 'midnight' zones wherein <30% and <10% target-template sequence identity prevails (mean twilight RMSD of 1.69A). We further demonstrate the utility of the PCAIN protocol to derive biological insight into protein structure-function relationships, by modeling the structure of the YopM effector novel E3 ligase (NEL) domain from plague-causative bacterium Yersinia Pestis and discussing its implications for host adaptive and innate immune modulation by the pathogen. Considering the several high-throughput, sequence-identity-independent applications demonstrated in this work, we suggest that the PCAIN is a fundamental fold feature that could be a valuable addition to the arsenal of protein modeling and analysis tools.
Atomic Interaction Networks in the Core of Protein Domains and Their Native Folds
Soundararajan, Venkataramanan; Raman, Rahul; Raguram, S.; Sasisekharan, V.; Sasisekharan, Ram
2010-01-01
Vastly divergent sequences populate a majority of protein folds. In the quest to identify features that are conserved within protein domains belonging to the same fold, we set out to examine the entire protein universe on a fold-by-fold basis. We report that the atomic interaction network in the solvent-unexposed core of protein domains are fold-conserved, extraordinary sequence divergence notwithstanding. Further, we find that this feature, termed protein core atomic interaction network (or PCAIN) is significantly distinguishable across different folds, thus appearing to be “signature” of a domain's native fold. As part of this study, we computed the PCAINs for 8698 representative protein domains from families across the 1018 known protein folds to construct our seed database and an automated framework was developed for PCAIN-based characterization of the protein fold universe. A test set of randomly selected domains that are not in the seed database was classified with over 97% accuracy, independent of sequence divergence. As an application of this novel fold signature, a PCAIN-based scoring scheme was developed for comparative (homology-based) structure prediction, with 1–2 angstroms (mean 1.61A) Cα RMSD generally observed between computed structures and reference crystal structures. Our results are consistent across the full spectrum of test domains including those from recent CASP experiments and most notably in the ‘twilight’ and ‘midnight’ zones wherein <30% and <10% target-template sequence identity prevails (mean twilight RMSD of 1.69A). We further demonstrate the utility of the PCAIN protocol to derive biological insight into protein structure-function relationships, by modeling the structure of the YopM effector novel E3 ligase (NEL) domain from plague-causative bacterium Yersinia Pestis and discussing its implications for host adaptive and innate immune modulation by the pathogen. Considering the several high-throughput, sequence-identity-independent applications demonstrated in this work, we suggest that the PCAIN is a fundamental fold feature that could be a valuable addition to the arsenal of protein modeling and analysis tools. PMID:20186337
Reversible Aggregation Plays a Crucial Role on the Folding Landscape of p53 Core Domain
Ishimaru, Daniella; Lima, Luis M. T. R.; Maia, Lenize F.; Lopez, Priscila M.; Ano Bom, Ana P.; Valente, Ana P.; Silva, Jerson L.
2004-01-01
The role of tumor suppressor protein p53 in cell cycle control depends on its flexible and partially unstructured conformation, which makes it crucial to understand its folding landscape. Here we report an intermediate structure of the core domain of the tumor suppressor protein p53 (p53C) during equilibrium and kinetic folding/unfolding transitions induced by guanidinium chloride. This partially folded structure was undetectable when investigated by intrinsic fluorescence. Indeed, the fluorescence data showed a simple two-state transition. On the other hand, analysis of far ultraviolet circular dichroism in 1.0 M guanidinium chloride demonstrated a high content of secondary structure, and the use of an extrinsic fluorescent probe, 4,4′-dianilino-1,1′ binaphthyl-5,5′-disulfonic acid, indicated an increase in exposure of the hydrophobic core at 1 M guanidinium chloride. This partially folded conformation of p53C was plagued by aggregation, as suggested by one-dimensional NMR and demonstrated by light-scattering and gel-filtration chromatography. Dissociation by high pressure of these aggregates reveals the reversibility of the process and that the aggregates have water-excluded cavities. Kinetic measurements show that the intermediate formed in a parallel reaction between unfolded and folded structures and that it is under fine energetic control. They are not only crucial to the folding pathway of p53C but may explain as well the vulnerability of p53C to undergo departure of the native to an inactive state, which makes the cell susceptible to malignant transformation. PMID:15298872
Folding and Stabilization of Native-Sequence-Reversed Proteins
Zhang, Yuanzhao; Weber, Jeffrey K; Zhou, Ruhong
2016-01-01
Though the problem of sequence-reversed protein folding is largely unexplored, one might speculate that reversed native protein sequences should be significantly more foldable than purely random heteropolymer sequences. In this article, we investigate how the reverse-sequences of native proteins might fold by examining a series of small proteins of increasing structural complexity (α-helix, β-hairpin, α-helix bundle, and α/β-protein). Employing a tandem protein structure prediction algorithmic and molecular dynamics simulation approach, we find that the ability of reverse sequences to adopt native-like folds is strongly influenced by protein size and the flexibility of the native hydrophobic core. For β-hairpins with reverse-sequences that fail to fold, we employ a simple mutational strategy for guiding stable hairpin formation that involves the insertion of amino acids into the β-turn region. This systematic look at reverse sequence duality sheds new light on the problem of protein sequence-structure mapping and may serve to inspire new protein design and protein structure prediction protocols. PMID:27113844
Folding and Stabilization of Native-Sequence-Reversed Proteins
NASA Astrophysics Data System (ADS)
Zhang, Yuanzhao; Weber, Jeffrey K.; Zhou, Ruhong
2016-04-01
Though the problem of sequence-reversed protein folding is largely unexplored, one might speculate that reversed native protein sequences should be significantly more foldable than purely random heteropolymer sequences. In this article, we investigate how the reverse-sequences of native proteins might fold by examining a series of small proteins of increasing structural complexity (α-helix, β-hairpin, α-helix bundle, and α/β-protein). Employing a tandem protein structure prediction algorithmic and molecular dynamics simulation approach, we find that the ability of reverse sequences to adopt native-like folds is strongly influenced by protein size and the flexibility of the native hydrophobic core. For β-hairpins with reverse-sequences that fail to fold, we employ a simple mutational strategy for guiding stable hairpin formation that involves the insertion of amino acids into the β-turn region. This systematic look at reverse sequence duality sheds new light on the problem of protein sequence-structure mapping and may serve to inspire new protein design and protein structure prediction protocols.
Statistical mechanics of protein structural transitions: Insights from the island model
Kobayashi, Yukio
2016-01-01
The so-called island model of protein structural transition holds that hydrophobic interactions are the key to both the folding and function of proteins. Herein, the genesis and statistical mechanical basis of the island model of transitions are reviewed, by presenting the results of simulations of such transitions. Elucidating the physicochemical mechanism of protein structural formation is the foundation for understanding the hierarchical structure of life at the microscopic level. Based on the results obtained to date using the island model, remaining problems and future work in the field of protein structures are discussed, referencing Professor Saitô’s views on the hierarchic structure of science. PMID:28409078
Approaches for Defining the Hsp90-dependent Proteome
Hartson, Steven D.; Matts, Robert L.
2011-01-01
Hsp90 is the target of ongoing drug discovery studies seeking new compounds to treat cancer, neurodegenerative diseases, and protein folding disorders. To better understand Hsp90’s roles in cellular pathologies and in normal cells, numerous studies have utilized proteomics assays and related high-throughput tools to characterize its physical and functional protein partnerships. This review surveys these studies, and summarizes the strengths and limitations of the individual attacks. We also include downloadable spreadsheets compiling all of the Hsp90-interacting proteins identified in more than 23 studies. These tools include cross-references among gene aliases, human homologues of yeast Hsp90-interacting proteins, hyperlinks to database entries, summaries of canonical pathways that are enriched in the Hsp90 interactome, and additional bioinformatic annotations. In addition to summarizing Hsp90 proteomics studies performed to date and the insights they have provided, we identify gaps in our current understanding of Hsp90-mediated proteostasis. PMID:21906632
Protein classification using sequential pattern mining.
Exarchos, Themis P; Papaloukas, Costas; Lampros, Christos; Fotiadis, Dimitrios I
2006-01-01
Protein classification in terms of fold recognition can be employed to determine the structural and functional properties of a newly discovered protein. In this work sequential pattern mining (SPM) is utilized for sequence-based fold recognition. One of the most efficient SPM algorithms, cSPADE, is employed for protein primary structure analysis. Then a classifier uses the extracted sequential patterns for classifying proteins of unknown structure in the appropriate fold category. The proposed methodology exhibited an overall accuracy of 36% in a multi-class problem of 17 candidate categories. The classification performance reaches up to 65% when the three most probable protein folds are considered.
REVIEW ARTICLE: How do biomolecular systems speed up and regulate rates?
NASA Astrophysics Data System (ADS)
Zhou, Huan-Xiang
2005-09-01
The viability of a biological system depends upon careful regulation of the rates of various processes. These rates have limits imposed by intrinsic chemical or physical steps (e.g., diffusion). These limits can be expanded by interactions and dynamics of the biomolecules. For example, (a) a chemical reaction is catalyzed when its transition state is preferentially bound to an enzyme; (b) the folding of a protein molecule is speeded up by specific interactions within the transition-state ensemble and may be assisted by molecular chaperones; (c) the rate of specific binding of a protein molecule to a cellular target can be enhanced by mechanisms such as long-range electrostatic interactions, nonspecific binding and folding upon binding; (d) directional movement of motor proteins is generated by capturing favorable Brownian motion through intermolecular binding energy; and (e) conduction and selectivity of ions through membrane channels are controlled by interactions and the dynamics of channel proteins. Simple physical models are presented here to illustrate these processes and provide a unifying framework for understanding speed attainment and regulation in biomolecular systems.
Yoo, Tae Yeon; Adhikari, Aashish; Xia, Zhen; Huynh, Tien; Freed, Karl F.; Zhou, Ruhong; Sosnick, Tobin R.
2012-01-01
Progress in understanding protein folding relies heavily upon an interplay between experiment and theory. In particular, readily interpretable experimental data are required that can be meaningfully compared to simulations. According to standard mutational φ analysis, the transition state for Protein L contains only a single hairpin. However, we demonstrate here using ψ analysis with engineered metal ion binding sites that the transition state is extensive, containing the entire four-stranded β sheet. Underreporting of the structural content of the transition state by φ analysis also occurs for acyl phosphatase1, ubiquitin2 and BdpA3. The carboxy terminal hairpin in the transition state of Protein L is found to be non-native, a significant result that agrees with our PDB-based backbone sampling and all-atom simulations. The non-native character partially explains the failure of accepted experimental and native-centric computational approaches to adequately describe the transition state. Hence, caution is required even when an apparent agreement exists between experiment and theory, thus highlighting the importance of having alternative methods for characterizing transition states. PMID:22522126
Role of α-crystallin B in regulation of stress induced cardiomyocyte apoptosis.
Ganguly, Subhalakshmi; Mitra, Arkadeep; Sarkar, Sagartirtha
2014-01-01
Cardiovascular disease is the leading cause of death worldwide. Recently emerging evidence suggests that cardiomyocyte apoptosis is one of the major pathogenic factors in heart diseases leading to heart failure. Cardiomyocytes undergo apoptosis in response to a wide variety of cellular stresses including protein folding stress at Endoplasmic reticulum (ER). Stressed myocytes elicit an adaptive response referred as Unfolded Protein Response (UPR) by inducing accumulation of heat shock proteins (HSPs) to mitigate the ER stress. HSPs act as molecular chaperons by assisting correct folding of the aggregated misfolded proteins in ER lumen. α-Crystallin B (CRYAB) is an abundant small HSP that confers protection to cardiomyocytes against various stress stimuli. Recent evidence indicates that CRYAB directly interacts with several components of ER stress and also mitochondrial apoptotic pathway. Based on currently available literature this mini review will focus on how CRYAB confers protection to stressed myocardium thereby emphasizing its function as antiapoptotic molecule. Understanding the interplay between CRYAB and the key components in the apoptotic signaling cascade mediated by ER and mitochondria will help in development of novel therapies for cardiac diseases.
Lundberg, Kathleen C.; Fritz, Yi; Johnston, Andrew; Foster, Alexander M.; Baliwag, Jaymie; Gudjonsson, Johann E.; Schlatzer, Daniela; Gokulrangan, Giridharan; McCormick, Thomas S.; Chance, Mark R.; Ward, Nicole L.
2015-01-01
Herein, we demonstrate the efficacy of an unbiased proteomics screening approach for studying protein expression changes in the KC-Tie2 psoriasis mouse model, identifying multiple protein expression changes in the mouse and validating these changes in human psoriasis. KC-Tie2 mouse skin samples (n = 3) were compared with littermate controls (n = 3) using gel-based fractionation followed by label-free protein expression analysis. 5482 peptides mapping to 1281 proteins were identified and quantitated: 105 proteins exhibited fold-changes ≥2.0 including: stefin A1 (average fold change of 342.4 and an average p = 0.0082; cystatin A, human ortholog); slc25a5 (average fold change of 46.2 and an average p = 0.0318); serpinb3b (average fold change of 35.6 and an average p = 0.0345; serpinB1, human ortholog); and kallikrein related peptidase 6 (average fold change of 4.7 and an average p = 0.2474; KLK6). We independently confirmed mouse gene expression-based increases of selected genes including serpinb3b (17.4-fold, p < 0.0001), KLK6 (9-fold, p = 0.002), stefin A1 (7.3-fold; p < 0.001), and slc25A5 (1.5-fold; p = 0.05) using qRT-PCR on a second cohort of animals (n = 8). Parallel LC/MS/MS analyses on these same samples verified protein-level increases of 1.3-fold (slc25a5; p < 0.05), 29,000-fold (stefinA1; p < 0.01), 322-fold (KLK6; p < 0.0001) between KC-Tie2 and control mice. To underscore the utility and translatability of our combined approach, we analyzed gene and protein expression levels in psoriasis patient skin and primary keratinocytes versus healthy controls. Increases in gene expression for slc25a5 (1.8-fold), cystatin A (3-fold), KLK6 (5.8-fold), and serpinB1 (76-fold; all p < 0.05) were observed between healthy controls and involved lesional psoriasis skin and primary psoriasis keratinocytes. Moreover, slc25a5, cystatin A, KLK6, and serpinB1 protein were all increased in lesional psoriasis skin compared with normal skin. These results highlight the usefulness of preclinical disease models using readily-available mouse skin and demonstrate the utility of proteomic approaches for identifying novel peptides/proteins that are differentially regulated in psoriasis that could serve as sources of auto-antigens or provide novel therapeutic targets for the development of new anti-psoriatic treatments. PMID:25351201
Proteomic Profiling of Cigarette Smoke Induced Changes in Retinal Pigment Epithelium Cells.
Merl-Pham, Juliane; Gruhn, Fabian; Hauck, Stefanie M
2016-01-01
Age-related macular degeneration (AMD) is a medical condition usually affecting older adults and resulting in a loss of vision in the macula, the center of the visual field. The dry form of this disease presents with atrophy of the retinal pigment epithelium, resulting in the detachment of the retina and loss of photoreceptors. Cigarette smoke is one main risk factor for dry AMD and increases the risk of developing the disease by three times. In order to understand the influence of cigarette smoke on retinal pigment epithelial cells, cultured human ARPE-19 cells were treated with cigarette smoke extract for 24 h. Using quantitative mass spectrometry more than 3000 proteins were identified and their respective abundances were compared between cigarette smoke-treated and untreated cells. Altogether 1932 proteins were quantified with at least two unique peptides, with 686 proteins found to be significantly differentially abundant with p > 0.05. Of these proteins the abundance of 64 proteins was at least 2-fold down-regulated after cigarette smoke treatment while 120 proteins were 2-fold up-regulated. The analysis of associated biological processes revealed an alteration of proteins involved in RNA processing and transport as well as extracellular matrix remodelling in response to cigarette smoke treatment.
Frustration in Condensed Matter and Protein Folding
NASA Astrophysics Data System (ADS)
Li, Z.; Tanner, S.; Conroy, B.; Owens, F.; Tran, M. M.; Boekema, C.
2014-03-01
By means of computer modeling, we are studying frustration in condensed matter and protein folding, including the influence of temperature and Thomson-figure formation. Frustration is due to competing interactions in a disordered state. The key issue is how the particles interact to reach the lowest frustration. The relaxation for frustration is mostly a power function (randomly assigned pattern) or an exponential function (regular patterns like Thomson figures). For the atomic Thomson model, frustration is predicted to decrease with the formation of Thomson figures at zero kelvin. We attempt to apply our frustration modeling to protein folding and dynamics. We investigate the homogeneous protein frustration that would cause the speed of the protein folding to increase. Increase of protein frustration (where frustration and hydrophobicity interplay with protein folding) may lead to a protein mutation. Research is supported by WiSE@SJSU and AFC San Jose.
NASA Astrophysics Data System (ADS)
Guinn, Emily J.; Jagannathan, Bharat; Marqusee, Susan
2015-04-01
A fundamental question in protein folding is whether proteins fold through one or multiple trajectories. While most experiments indicate a single pathway, simulations suggest proteins can fold through many parallel pathways. Here, we use a combination of chemical denaturant, mechanical force and site-directed mutations to demonstrate the presence of multiple unfolding pathways in a simple, two-state folding protein. We show that these multiple pathways have structurally different transition states, and that seemingly small changes in protein sequence and environment can strongly modulate the flux between the pathways. These results suggest that in vivo, the crowded cellular environment could strongly influence the mechanisms of protein folding and unfolding. Our study resolves the apparent dichotomy between experimental and theoretical studies, and highlights the advantage of using a multipronged approach to reveal the complexities of a protein's free-energy landscape.
Probing biological redox chemistry with large amplitude Fourier transformed ac voltammetry
Adamson, Hope
2017-01-01
Biological electron-exchange reactions are fundamental to life on earth. Redox reactions underpin respiration, photosynthesis, molecular biosynthesis, cell signalling and protein folding. Chemical, biomedical and future energy technology developments are also inspired by these natural electron transfer processes. Further developments in techniques and data analysis are required to gain a deeper understanding of the redox biochemistry processes that power Nature. This review outlines the new insights gained from developing Fourier transformed ac voltammetry as a tool for protein film electrochemistry. PMID:28804798
Kannan, Srinivasaraghavan; Zacharias, Martin
2014-01-01
The 20 residue Trp-cage mini-protein is one of smallest proteins that adopt a stable folded structure containing also well-defined secondary structure elements. The hydrophobic core is arranged around a single central Trp residue. Despite several experimental and simulation studies the detailed folding mechanism of the Trp-cage protein is still not completely understood. Starting from fully extended as well as from partially folded Trp-cage structures a series of molecular dynamics simulations in explicit solvent and using four different force fields was performed. All simulations resulted in rapid collapse of the protein to on average relatively compact states. The simulations indicate a significant dependence of the speed of folding to near-native states on the side chain rotamer state of the central Trp residue. Whereas the majority of intermediate start structures with the central Trp side chain in a near-native rotameric state folded successfully within less than 100 ns only a fraction of start structures reached near-native folded states with an initially non-native Trp side chain rotamer state. Weak restraining of the Trp side chain dihedral angles to the state in the folded protein resulted in significant acceleration of the folding both starting from fully extended or intermediate conformations. The results indicate that the side chain conformation of the central Trp residue can create a significant barrier for controlling transitions to a near native folded structure. Similar mechanisms might be of importance for the folding of other protein structures. PMID:24563686
High-Resolution Mapping of a Repeat Protein Folding Free Energy Landscape.
Fossat, Martin J; Dao, Thuy P; Jenkins, Kelly; Dellarole, Mariano; Yang, Yinshan; McCallum, Scott A; Garcia, Angel E; Barrick, Doug; Roumestand, Christian; Royer, Catherine A
2016-12-06
A complete description of the pathways and mechanisms of protein folding requires a detailed structural and energetic characterization of the conformational ensemble along the entire folding reaction coordinate. Simulations can provide this level of insight for small proteins. In contrast, with the exception of hydrogen exchange, which does not monitor folding directly, experimental studies of protein folding have not yielded such structural and energetic detail. NMR can provide residue specific atomic level structural information, but its implementation in protein folding studies using chemical or temperature perturbation is problematic. Here we present a highly detailed structural and energetic map of the entire folding landscape of the leucine-rich repeat protein, pp32 (Anp32), obtained by combining pressure-dependent site-specific 1 H- 15 N HSQC data with coarse-grained molecular dynamics simulations. The results obtained using this equilibrium approach demonstrate that the main barrier to folding of pp32 is quite broad and lies near the unfolded state, with structure apparent only in the C-terminal region. Significant deviation from two-state unfolding under pressure reveals an intermediate on the folded side of the main barrier in which the N-terminal region is disordered. A nonlinear temperature dependence of the population of this intermediate suggests a large heat capacity change associated with its formation. The combination of pressure, which favors the population of folding intermediates relative to chemical denaturants; NMR, which allows their observation; and constrained structure-based simulations yield unparalleled insight into protein folding mechanisms. Copyright © 2016 Biophysical Society. Published by Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Shea, Joan-Emma; Brooks, Charles L., III
2001-10-01
Beginning with simplified lattice and continuum "minimalist" models and progressing to detailed atomic models, simulation studies have augmented and directed development of the modern landscape perspective of protein folding. In this review we discuss aspects of detailed atomic simulation methods applied to studies of protein folding free energy surfaces, using biased-sampling free energy methods and temperature-induced protein unfolding. We review studies from each on systems of particular experimental interest and assess the strengths and weaknesses of each approach in the context of "exact" results for both free energies and kinetics of a minimalist model for a beta-barrel protein. We illustrate in detail how each approach is implemented and discuss analysis methods that have been developed as components of these studies. We describe key insights into the relationship between protein topology and the folding mechanism emerging from folding free energy surface calculations. We further describe the determination of detailed "pathways" and models of folding transition states that have resulted from unfolding studies. Our assessment of the two methods suggests that both can provide, often complementary, details of folding mechanism and thermodynamics, but this success relies on (a) adequate sampling of diverse conformational regions for the biased-sampling free energy approach and (b) many trajectories at multiple temperatures for unfolding studies. Furthermore, we find that temperature-induced unfolding provides representatives of folding trajectories only when the topology and sequence (energy) provide a relatively funneled landscape and "off-pathway" intermediates do not exist.
Protein expression profiling in head fragments during planarian regeneration after amputation.
Chen, Xiaoguang; Xu, Cunshuan
2015-04-01
Following amputation, a planarian tail fragment can regrow into a complete organism including a well-organized brain within about 2-3 weeks, thus restoring the structure and function to presurgical levels. Despite the enormous potential of these animals for regenerative medicine, our understanding of the exact mechanism of planarian regeneration is incomplete. To better understand the molecular nature of planarian head regeneration, we applied two-dimensional electrophoresis (2-DE)/matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF)/time-of-flight mass spectrometry (TOF MS) technique to analyze the dynamic proteomic expression profiles over the course of 6 to 168 h post-decapitation. This approach identified a total of 141 differentially expressed proteins, 47 of which exhibited exceptionally high fold changes (≥3-fold change). Of these, Rx protein, an important regulator of head and brain development, was considered to be closely related to planarian head regeneration because of its exceptional high expression almost throughout the time course of regeneration process. Functional annotation analysis classified the 141 proteins into eight categories: (1) signaling, (2) Ca(2+) binding and translocation, (3) transcription and translation, (4) cytoskeleton, (5) metabolism, (6) cell protection, (7) tissue differentiation, and (8) cell cycle. Signaling pathway analysis indicated that Wnt1/Ca(2+) signaling pathway was activated during head regeneration. Integrating the analyses of proteome expression profiling, functional annotation, and signaling pathway, amputation-induced head reformation requires some mechanisms to promote cell proliferation and differentiation, including differential regulation of proapoptotic and antiapoptotic proteins, and the regulation of proliferation and differentiation-related proteins. Importantly, Wnt1/Ca(2+) signaling pathway upregulates Rx expression, finally facilitating the differentiation of neoblasts into various cell types. Taken together, our study demonstrated that proteomic analysis approach used by us is a powerful tool in understanding molecular process related to head regeneration of planarian.
Duraisamy, Yasotha; Lambert, Daniel; O'Neill, Catherine A; Padfield, Philip J
2007-09-07
We investigated the influence of docosahexaenoic acid (DHA) on the fatty acid and protein compositions of two populations of membrane rafts present in Caco-2 cells. DHA (100 microM) had no significant influence on the fatty acid or protein compositions of tight junction-associated, Lubrol insoluble, membrane rafts. However, DHA did significantly alter the fatty acid and protein compositions of "archetypal" Triton X-100 insoluble membrane rafts. The DHA content of the raft lipids increased 25-fold and was accompanied by a redistribution of src and fyn out of the rafts. DHA also increased Caco-2 cell monolayer permeability producing a 95% drop in transepithelial electrical resistance and a 8.56-fold increase in the flux of dextran. In conclusion, the data demonstrate that DHA does not increase permeability through modifying the TJ-associated rafts. The data do, however, show that DHA is differentially incorporated into different classes of membrane rafts, which has significant implications to our understanding of how omega-3 PUFAs modulate plasma membrane organization and cell function.
Atomic-level characterization of the structural dynamics of proteins.
Shaw, David E; Maragakis, Paul; Lindorff-Larsen, Kresten; Piana, Stefano; Dror, Ron O; Eastwood, Michael P; Bank, Joseph A; Jumper, John M; Salmon, John K; Shan, Yibing; Wriggers, Willy
2010-10-15
Molecular dynamics (MD) simulations are widely used to study protein motions at an atomic level of detail, but they have been limited to time scales shorter than those of many biologically critical conformational changes. We examined two fundamental processes in protein dynamics--protein folding and conformational change within the folded state--by means of extremely long all-atom MD simulations conducted on a special-purpose machine. Equilibrium simulations of a WW protein domain captured multiple folding and unfolding events that consistently follow a well-defined folding pathway; separate simulations of the protein's constituent substructures shed light on possible determinants of this pathway. A 1-millisecond simulation of the folded protein BPTI reveals a small number of structurally distinct conformational states whose reversible interconversion is slower than local relaxations within those states by a factor of more than 1000.
Protein folding and misfolding: mechanism and principles
Englander, S. Walter; Mayne, Leland; Krishna, Mallela M. G.
2012-01-01
Two fundamentally different views of how proteins fold are now being debated. Do proteins fold through multiple unpredictable routes directed only by the energetically downhill nature of the folding landscape or do they fold through specific intermediates in a defined pathway that systematically puts predetermined pieces of the target native protein into place? It has now become possible to determine the structure of protein folding intermediates, evaluate their equilibrium and kinetic parameters, and establish their pathway relationships. Results obtained for many proteins have serendipitously revealed a new dimension of protein structure. Cooperative structural units of the native protein, called foldons, unfold and refold repeatedly even under native conditions. Much evidence obtained by hydrogen exchange and other methods now indicates that cooperative foldon units and not individual amino acids account for the unit steps in protein folding pathways. The formation of foldons and their ordered pathway assembly systematically puts native-like foldon building blocks into place, guided by a sequential stabilization mechanism in which prior native-like structure templates the formation of incoming foldons with complementary structure. Thus the same propensities and interactions that specify the final native state, encoded in the amino-acid sequence of every protein, determine the pathway for getting there. Experimental observations that have been interpreted differently, in terms of multiple independent pathways, appear to be due to chance misfolding errors that cause different population fractions to block at different pathway points, populate different pathway intermediates, and fold at different rates. This paper summarizes the experimental basis for these three determining principles and their consequences. Cooperative native-like foldon units and the sequential stabilization process together generate predetermined stepwise pathways. Optional misfolding errors are responsible for 3-state and heterogeneous kinetic folding. PMID:18405419
Studying the unfolding process of protein G and protein L under physical property space
Zhao, Liling; Wang, Jihua; Dou, Xianghua; Cao, Zanxia
2009-01-01
Background The studies on protein folding/unfolding indicate that the native state topology is an important determinant of protein folding mechanism. The folding/unfolding behaviors of proteins which have similar topologies have been studied under Cartesian space and the results indicate that some proteins share the similar folding/unfolding characters. Results We construct physical property space with twelve different physical properties. By studying the unfolding process of the protein G and protein L under the property space, we find that the two proteins have the similar unfolding pathways that can be divided into three types and the one which with the umbrella-shape represents the preferred pathway. Moreover, the unfolding simulation time of the two proteins is different and protein L unfolding faster than protein G. Additionally, the distributing area of unfolded state ensemble of protein L is larger than that of protein G. Conclusion Under the physical property space, the protein G and protein L have the similar folding/unfolding behaviors, which agree with the previous results obtained from the studies under Cartesian coordinate space. At the same time, some different unfolding properties can be detected easily, which can not be analyzed under Cartesian coordinate space. PMID:19208146
Unusual biophysics of intrinsically disordered proteins.
Uversky, Vladimir N
2013-05-01
Research of a past decade and a half leaves no doubt that complete understanding of protein functionality requires close consideration of the fact that many functional proteins do not have well-folded structures. These intrinsically disordered proteins (IDPs) and proteins with intrinsically disordered protein regions (IDPRs) are highly abundant in nature and play a number of crucial roles in a living cell. Their functions, which are typically associated with a wide range of intermolecular interactions where IDPs possess remarkable binding promiscuity, complement functional repertoire of ordered proteins. All this requires a close attention to the peculiarities of biophysics of these proteins. In this review, some key biophysical features of IDPs are covered. In addition to the peculiar sequence characteristics of IDPs these biophysical features include sequential, structural, and spatiotemporal heterogeneity of IDPs; their rough and relatively flat energy landscapes; their ability to undergo both induced folding and induced unfolding; the ability to interact specifically with structurally unrelated partners; the ability to gain different structures at binding to different partners; and the ability to keep essential amount of disorder even in the bound form. IDPs are also characterized by the "turned-out" response to the changes in their environment, where they gain some structure under conditions resulting in denaturation or even unfolding of ordered proteins. It is proposed that the heterogeneous spatiotemporal structure of IDPs/IDPRs can be described as a set of foldons, inducible foldons, semi-foldons, non-foldons, and unfoldons. They may lose their function when folded, and activation of some IDPs is associated with the awaking of the dormant disorder. It is possible that IDPs represent the "edge of chaos" systems which operate in a region between order and complete randomness or chaos, where the complexity is maximal. This article is part of a Special Issue entitled: The emerging dynamic view of proteins: Protein plasticity in allostery, evolution and self-assembly. Copyright © 2012 Elsevier B.V. All rights reserved.
Heterochiral Knottin Protein: Folding and Solution Structure.
Mong, Surin K; Cochran, Frank V; Yu, Hongtao; Graziano, Zachary; Lin, Yu-Shan; Cochran, Jennifer R; Pentelute, Bradley L
2017-10-31
Homochirality is a general feature of biological macromolecules, and Nature includes few examples of heterochiral proteins. Herein, we report on the design, chemical synthesis, and structural characterization of heterochiral proteins possessing loops of amino acids of chirality opposite to that of the rest of a protein scaffold. Using the protein Ecballium elaterium trypsin inhibitor II, we discover that selective β-alanine substitution favors the efficient folding of our heterochiral constructs. Solution nuclear magnetic resonance spectroscopy of one such heterochiral protein reveals a homogeneous global fold. Additionally, steered molecular dynamics simulation indicate β-alanine reduces the free energy required to fold the protein. We also find these heterochiral proteins to be more resistant to proteolysis than homochiral l-proteins. This work informs the design of heterochiral protein architectures containing stretches of both d- and l-amino acids.
De Jaco, Antonella; Comoletti, Davide; Dubi, Noga; Camp, Shelley; Taylor, Palmer
2016-01-01
The α/β hydrolase fold family is perhaps the largest group of proteins presenting significant structural homology with divergent functions, ranging from catalytic hydrolysis to heterophilic cell adhesive interactions to chaperones in hormone production. All the proteins of the family share a common three-dimensional core structure containing the α/β-hydrolase fold domain that is crucial for proper protein function. Several mutations associated with congenital diseases or disorders have been reported in conserved residues within the α/β-hydrolase fold domain of cholinesterase-like proteins, neuroligins, butyrylcholinesterase and thyroglobulin. These mutations are known to disrupt the architecture of the common structural domain either globally or locally. Characterization of the natural mutations affecting the α/β-hydrolase fold domain in these proteins has shown that they mainly impair processing and trafficking along the secretory pathway causing retention of the mutant protein in the endoplasmic reticulum. Studying the processing of α/β-hydrolase fold mutant proteins should uncover new functions for this domain, that in some cases require structural integrity for both export of the protein from the ER and for facilitating subunit dimerization. A comparative study of homologous mutations in proteins that are closely related family members, along with the definition of new three-dimensional crystal structures, will identify critical residues for the assembly of the α/β-hydrolase fold. PMID:21933121
Effect of interactions with the chaperonin cavity on protein folding and misfolding†
Sirur, Anshul; Knott, Michael; Best, Robert B.
2015-01-01
Recent experimental and computational results have suggested that attractive interactions between a chaperonin and an enclosed substrate can have an important effect on the protein folding rate: it appears that folding may even be slower inside the cavity than under unconfined conditions, in contrast to what we would expect from excluded volume effects on the unfolded state. Here we examine systematically the dependence of the protein stability and folding rate on the strength of such attractive interactions between the chaperonin and substrate, by using molecular simulations of model protein systems in an idealised attractive cavity. Interestingly, we find a maximum in stability, and a rate which indeed slows down at high attraction strengths. We have developed a simple phenomenological model which can explain the variations in folding rate and stability due to differing effects on the free energies of the unfolded state, folded state, and transition state; changes in the diffusion coefficient along the folding coordinate are relatively small, at least for our simplified model. In order to investigate a possible role for these attractive interactions in folding, we have studied a recently developed model for misfolding in multidomain proteins. We find that, while encapsulation in repulsive cavities greatly increases the fraction of misfolded protein, sufficiently strong attractive protein-cavity interactions can strongly reduce the fraction of proteins reaching misfolded traps. PMID:24077053
The porous borders of the protein world.
Cordes, Matthew H J; Stewart, Katie L
2012-02-08
Fold switching may play a role in the evolution of new protein folds and functions. He et al., in this issue of Structure, use protein design to illustrate that the same drastic change in a protein fold can occur via multiple different mutational pathways. Copyright © 2012 Elsevier Ltd. All rights reserved.
Li, Ying; Gupta, Ruchi; Cho, Jae-Hyun; Raleigh, Daniel P
2007-01-30
The C-terminal domain of ribosomal protein L9 (CTL9) is a 92-residue alpha-beta protein which contains an unusual three-stranded mixed parallel and antiparallel beta-sheet. The protein folds in a two-state fashion, and the folding rate is slow. It is thought that the slow folding may be caused by the necessity of forming this unusual beta-sheet architecture in the transition state for folding. This hypothesis makes CTL9 an interesting target for folding studies. The transition state for the folding of CTL9 was characterized by phi-value analysis. The folding of a set of hydrophobic core mutants was analyzed together with a set of truncation mutants. The results revealed a few positions with high phi-values (> or = 0.5), notably, V131, L133, H134, V137, and L141. All of these residues were found in the beta-hairpin region, indicating that the formation of this structure is likely to be the rate-limiting step in the folding of CTL9. One face of the beta-hairpin docks against the N-terminal helix. Analysis of truncation mutants of this helix confirmed its importance in folding. Mutations at other sites in the protein gave small phi-values, despite the fact that some of them had major effects on stability. The analysis indicates that formation of the antiparallel hairpin is critical and its interactions with the first helix are also important. Thus, the slow folding is not a consequence of the need to fully form the unusual three-stranded beta-sheet in the transition state. Analysis of the urea dependence of the folding rates indicates that mutations modulate the unfolded state. The folding of CTL9 is broadly consistent with the nucleation-condensation model of protein folding.
Intermediates and the folding of proteins L and G
DOE Office of Scientific and Technical Information (OSTI.GOV)
Brown, Scott; Head-Gordon, Teresa
We use a minimalist protein model, in combination with a sequence design strategy, to determine differences in primary structure for proteins L and G that are responsible for the two proteins folding through distinctly different folding mechanisms. We find that the folding of proteins L and G are consistent with a nucleation-condensation mechanism, each of which is described as helix-assisted {beta}-1 and {beta}-2 hairpin formation, respectively. We determine that the model for protein G exhibits an early intermediate that precedes the rate-limiting barrier of folding and which draws together misaligned secondary structure elements that are stabilized by hydrophobic core contactsmore » involving the third {beta}-strand, and presages the later transition state in which the correct strand alignment of these same secondary structure elements is restored. Finally the validity of the targeted intermediate ensemble for protein G was analyzed by fitting the kinetic data to a two-step first order reversible reaction, proving that protein G folding involves an on-pathway early intermediate, and should be populated and therefore observable by experiment.« less
Intermediates and the folding of proteins L and G
Brown, Scott; Head-Gordon, Teresa
2004-01-01
We use a minimalist protein model, in combination with a sequence design strategy, to determine differences in primary structure for proteins L and G, which are responsible for the two proteins folding through distinctly different folding mechanisms. We find that the folding of proteins L and G are consistent with a nucleation-condensation mechanism, each of which is described as helix-assisted β-1 and β-2 hairpin formation, respectively. We determine that the model for protein G exhibits an early intermediate that precedes the rate-limiting barrier of folding, and which draws together misaligned secondary structure elements that are stabilized by hydrophobic core contacts involving the third β-strand, and presages the later transition state in which the correct strand alignment of these same secondary structure elements is restored. Finally, the validity of the targeted intermediate ensemble for protein G was analyzed by fitting the kinetic data to a two-step first-order reversible reaction, proving that protein G folding involves an on-pathway early intermediate, and should be populated and therefore observable by experiment. PMID:15044729
Evolution, Energy Landscapes and the Paradoxes of Protein Folding
Wolynes, Peter G.
2014-01-01
Protein folding has been viewed as a difficult problem of molecular self-organization. The search problem involved in folding however has been simplified through the evolution of folding energy landscapes that are funneled. The funnel hypothesis can be quantified using energy landscape theory based on the minimal frustration principle. Strong quantitative predictions that follow from energy landscape theory have been widely confirmed both through laboratory folding experiments and from detailed simulations. Energy landscape ideas also have allowed successful protein structure prediction algorithms to be developed. The selection constraint of having funneled folding landscapes has left its imprint on the sequences of existing protein structural families. Quantitative analysis of co-evolution patterns allows us to infer the statistical characteristics of the folding landscape. These turn out to be consistent with what has been obtained from laboratory physicochemical folding experiments signalling a beautiful confluence of genomics and chemical physics. PMID:25530262
Designing pH induced fold switch in proteins
NASA Astrophysics Data System (ADS)
Baruah, Anupaul; Biswas, Parbati
2015-05-01
This work investigates the computational design of a pH induced protein fold switch based on a self-consistent mean-field approach by identifying the ensemble averaged characteristics of sequences that encode a fold switch. The primary challenge to balance the alternative sets of interactions present in both target structures is overcome by simultaneously optimizing two foldability criteria corresponding to two target structures. The change in pH is modeled by altering the residual charge on the amino acids. The energy landscape of the fold switch protein is found to be double funneled. The fold switch sequences stabilize the interactions of the sites with similar relative surface accessibility in both target structures. Fold switch sequences have low sequence complexity and hence lower sequence entropy. The pH induced fold switch is mediated by attractive electrostatic interactions rather than hydrophobic-hydrophobic contacts. This study may provide valuable insights to the design of fold switch proteins.
Wood, Christopher W; Bruning, Marc; Ibarra, Amaurys Á; Bartlett, Gail J; Thomson, Andrew R; Sessions, Richard B; Brady, R Leo; Woolfson, Derek N
2014-11-01
The ability to accurately model protein structures at the atomistic level underpins efforts to understand protein folding, to engineer natural proteins predictably and to design proteins de novo. Homology-based methods are well established and produce impressive results. However, these are limited to structures presented by and resolved for natural proteins. Addressing this problem more widely and deriving truly ab initio models requires mathematical descriptions for protein folds; the means to decorate these with natural, engineered or de novo sequences; and methods to score the resulting models. We present CCBuilder, a web-based application that tackles the problem for a defined but large class of protein structure, the α-helical coiled coils. CCBuilder generates coiled-coil backbones, builds side chains onto these frameworks and provides a range of metrics to measure the quality of the models. Its straightforward graphical user interface provides broad functionality that allows users to build and assess models, in which helix geometry, coiled-coil architecture and topology and protein sequence can be varied rapidly. We demonstrate the utility of CCBuilder by assembling models for 653 coiled-coil structures from the PDB, which cover >96% of the known coiled-coil types, and by generating models for rarer and de novo coiled-coil structures. CCBuilder is freely available, without registration, at http://coiledcoils.chm.bris.ac.uk/app/cc_builder/. © The Author 2014. Published by Oxford University Press.
Computational Modeling of Proteins based on Cellular Automata: A Method of HP Folding Approximation.
Madain, Alia; Abu Dalhoum, Abdel Latif; Sleit, Azzam
2018-06-01
The design of a protein folding approximation algorithm is not straightforward even when a simplified model is used. The folding problem is a combinatorial problem, where approximation and heuristic algorithms are usually used to find near optimal folds of proteins primary structures. Approximation algorithms provide guarantees on the distance to the optimal solution. The folding approximation approach proposed here depends on two-dimensional cellular automata to fold proteins presented in a well-studied simplified model called the hydrophobic-hydrophilic model. Cellular automata are discrete computational models that rely on local rules to produce some overall global behavior. One-third and one-fourth approximation algorithms choose a subset of the hydrophobic amino acids to form H-H contacts. Those algorithms start with finding a point to fold the protein sequence into two sides where one side ignores H's at even positions and the other side ignores H's at odd positions. In addition, blocks or groups of amino acids fold the same way according to a predefined normal form. We intend to improve approximation algorithms by considering all hydrophobic amino acids and folding based on the local neighborhood instead of using normal forms. The CA does not assume a fixed folding point. The proposed approach guarantees one half approximation minus the H-H endpoints. This lower bound guaranteed applies to short sequences only. This is proved as the core and the folds of the protein will have two identical sides for all short sequences.
Mitochondrial metabolic regulation by GRP78
Prasad, Manoj; Pawlak, Kevin J.; Burak, William E.; Perry, Elizabeth E.; Marshall, Brendan; Whittal, Randy M.; Bose, Himangshu S.
2017-01-01
Steroids, essential for mammalian survival, are initiated by cholesterol transport by steroidogenic acute regulatory protein (StAR). Appropriate protein folding is an essential requirement of activity. Endoplasmic reticulum (ER) chaperones assist in folding of cytoplasmic proteins, whereas mitochondrial chaperones fold only mitochondrial proteins. We show that glucose regulatory protein 78 (GRP78), a master ER chaperone, is also present at the mitochondria-associated ER membrane (MAM), where it folds StAR for delivery to the outer mitochondrial membrane. StAR expression and activity are drastically reduced following GRP78 knockdown. StAR folding starts at the MAM region; thus, its cholesterol fostering capacity is regulated by GRP78 long before StAR reaches the mitochondria. In summary, GRP78 is an acute regulator of steroidogenesis at the MAM, regulating the intermediate folding of StAR that is crucial for its activity. PMID:28275724
GroEL stimulates protein folding through forced unfolding
Lin, Zong; Madan, Damian; Rye, Hays S
2013-01-01
Many proteins cannot fold without the assistance of chaperonin machines like GroEL and GroES. The nature of this assistance, however, remains poorly understood. Here we demonstrate that unfolding of a substrate protein by GroEL enhances protein folding. We first show that capture of a protein on the open ring of a GroEL–ADP–GroES complex, GroEL’s physiological acceptor state for non-native proteins in vivo, leaves the substrate protein in an unexpectedly compact state. Subsequent binding of ATP to the same GroEL ring causes rapid, forced unfolding of the substrate protein. Notably, the fraction of the substrate protein that commits to the native state following GroES binding and protein release into the GroEL–GroES cavity is proportional to the extent of substrate-protein unfolding. Forced protein unfolding is thus a central component of the multilayered stimulatory mechanism used by GroEL to drive protein folding. PMID:18311152
Characterization of protein-folding pathways by reduced-space modeling.
Kmiecik, Sebastian; Kolinski, Andrzej
2007-07-24
Ab initio simulations of the folding pathways are currently limited to very small proteins. For larger proteins, some approximations or simplifications in protein models need to be introduced. Protein folding and unfolding are among the basic processes in the cell and are very difficult to characterize in detail by experiment or simulation. Chymotrypsin inhibitor 2 (CI2) and barnase are probably the best characterized experimentally in this respect. For these model systems, initial folding stages were simulated by using CA-CB-side chain (CABS), a reduced-space protein-modeling tool. CABS employs knowledge-based potentials that proved to be very successful in protein structure prediction. With the use of isothermal Monte Carlo (MC) dynamics, initiation sites with a residual structure and weak tertiary interactions were identified. Such structures are essential for the initiation of the folding process through a sequential reduction of the protein conformational space, overcoming the Levinthal paradox in this manner. Furthermore, nucleation sites that initiate a tertiary interactions network were located. The MC simulations correspond perfectly to the results of experimental and theoretical research and bring insights into CI2 folding mechanism: unambiguous sequence of folding events was reported as well as cooperative substructures compatible with those obtained in recent molecular dynamics unfolding studies. The correspondence between the simulation and experiment shows that knowledge-based potentials are not only useful in protein structure predictions but are also capable of reproducing the folding pathways. Thus, the results of this work significantly extend the applicability range of reduced models in the theoretical study of proteins.
In vitro folding of inclusion body proteins.
Rudolph, R; Lilie, H
1996-01-01
Insoluble, inactive inclusion bodies are frequently formed upon recombinant protein production in transformed microorganisms. These inclusion bodies, which contain the recombinant protein in an highly enriched form, can be isolated by solid/liquid separation. After solubilization, native proteins can be generated from the inactive material by using in vitro folding techniques. New folding procedures have been developed for efficient in vitro reconstitution of complex hydrophobic, multidomain, oligomeric, or highly disulfide-bonded proteins. These protocols take into account process parameters such as protein concentration, catalysis of disulfide bond formation, temperature, pH, and ionic strength, as well as specific solvent ingredients that reduce unproductive side reactions. Modification of the protein sequence has been exploited to improve in vitro folding.
Chong, Wai Chin; Shastri, Madhur D.; Eri, Rajaraman
2017-01-01
The endoplasmic reticulum (ER) is a complex protein folding and trafficking organelle. Alteration and discrepancy in the endoplasmic reticulum environment can affect the protein folding process and hence, can result in the production of misfolded proteins. The accumulation of misfolded proteins causes cellular damage and elicits endoplasmic reticulum stress. Under such stress conditions, cells exhibit reduced functional synthesis, and will undergo apoptosis if the stress is prolonged. To resolve the ER stress, cells trigger an intrinsic mechanism called an unfolded protein response (UPR). UPR is an adaptive signaling process that triggers multiple pathways through the endoplasmic reticulum transmembrane transducers, to reduce and remove misfolded proteins and improve the protein folding mechanism, in order to improve and maintain endoplasmic reticulum homeostasis. An increasing number of studies support the view that oxidative stress has a strong connection with ER stress. During the protein folding process, reactive oxygen species are produced as by-products, leading to impaired reduction-oxidation (redox) balance conferring oxidative stress. As the protein folding process is dependent on redox homeostasis, the oxidative stress can disrupt the protein folding mechanism and enhance the production of misfolded proteins, causing further ER stress. It is proposed that endoplasmic reticulum stress and oxidative stress together play significant roles in the pathophysiology of bowel diseases. PMID:28379196
Chong, Wai Chin; Shastri, Madhur D; Eri, Rajaraman
2017-04-05
The endoplasmic reticulum (ER) is a complex protein folding and trafficking organelle. Alteration and discrepancy in the endoplasmic reticulum environment can affect the protein folding process and hence, can result in the production of misfolded proteins. The accumulation of misfolded proteins causes cellular damage and elicits endoplasmic reticulum stress. Under such stress conditions, cells exhibit reduced functional synthesis, and will undergo apoptosis if the stress is prolonged. To resolve the ER stress, cells trigger an intrinsic mechanism called an unfolded protein response (UPR). UPR is an adaptive signaling process that triggers multiple pathways through the endoplasmic reticulum transmembrane transducers, to reduce and remove misfolded proteins and improve the protein folding mechanism, in order to improve and maintain endoplasmic reticulum homeostasis. An increasing number of studies support the view that oxidative stress has a strong connection with ER stress. During the protein folding process, reactive oxygen species are produced as by-products, leading to impaired reduction-oxidation (redox) balance conferring oxidative stress. As the protein folding process is dependent on redox homeostasis, the oxidative stress can disrupt the protein folding mechanism and enhance the production of misfolded proteins, causing further ER stress. It is proposed that endoplasmic reticulum stress and oxidative stress together play significant roles in the pathophysiology of bowel diseases.
Huang, Wenxi; Liu, Wanting; Jin, Jingjie; Xiao, Qilan; Lu, Ruibin; Chen, Wei; Xiong, Sheng; Zhang, Gong
2018-03-25
Translational pausing coordinates protein synthesis and co-translational folding. It is a common factor that facilitates the correct folding of large, multi-domain proteins. For small proteins, pausing sites rarely occurs in the gene body, and the 3'-end pausing sites are only essential for the folding of a fraction of proteins. The determinant of the necessity of the pausings remains obscure. In this study, we demonstrated that the steady-state structural fluctuation is a predictor of the necessity of pausing-mediated co-translational folding for small proteins. Validated by experiments with 5 model proteins, we found that the rigid protein structures do not, while the flexible structures do need 3'-end pausings to fold correctly. Therefore, rational optimization of translational pausing can improve soluble expression of small proteins with flexible structures, but not the rigid ones. The rigidity of the structure can be quantitatively estimated in silico using molecular dynamic simulation. Nevertheless, we also found that the translational pausing optimization increases the fitness of the expression host, and thus benefits the recombinant protein production, independent from the soluble expression. These results shed light on the structural basis of the translational pausing and provided a practical tool for industrial protein fermentation. Copyright © 2017. Published by Elsevier Inc.
Deiana, Antonio; Giansanti, Andrea
2010-04-21
Natively unfolded proteins lack a well defined three dimensional structure but have important biological functions, suggesting a re-assignment of the structure-function paradigm. To assess that a given protein is natively unfolded requires laborious experimental investigations, then reliable sequence-only methods for predicting whether a sequence corresponds to a folded or to an unfolded protein are of interest in fundamental and applicative studies. Many proteins have amino acidic compositions compatible both with the folded and unfolded status, and belong to a twilight zone between order and disorder. This makes difficult a dichotomic classification of protein sequences into folded and natively unfolded ones. In this work we propose an operational method to identify proteins belonging to the twilight zone by combining into a consensus score good performing single predictors of folding. In this methodological paper dichotomic folding indexes are considered: hydrophobicity-charge, mean packing, mean pairwise energy, Poodle-W and a new global index, that is called here gVSL2, based on the local disorder predictor VSL2. The performance of these indexes is evaluated on different datasets, in particular on a new dataset composed by 2369 folded and 81 natively unfolded proteins. Poodle-W, gVSL2 and mean pairwise energy have good performance and stability in all the datasets considered and are combined into a strictly unanimous combination score SSU, that leaves proteins unclassified when the consensus of all combined indexes is not reached. The unclassified proteins: i) belong to an overlap region in the vector space of amino acidic compositions occupied by both folded and unfolded proteins; ii) are composed by approximately the same number of order-promoting and disorder-promoting amino acids; iii) have a mean flexibility intermediate between that of folded and that of unfolded proteins. Our results show that proteins unclassified by SSU belong to a twilight zone. Proteins left unclassified by the consensus score SSU have physical properties intermediate between those of folded and those of natively unfolded proteins and their structural properties and evolutionary history are worth to be investigated.
2010-01-01
Background Natively unfolded proteins lack a well defined three dimensional structure but have important biological functions, suggesting a re-assignment of the structure-function paradigm. To assess that a given protein is natively unfolded requires laborious experimental investigations, then reliable sequence-only methods for predicting whether a sequence corresponds to a folded or to an unfolded protein are of interest in fundamental and applicative studies. Many proteins have amino acidic compositions compatible both with the folded and unfolded status, and belong to a twilight zone between order and disorder. This makes difficult a dichotomic classification of protein sequences into folded and natively unfolded ones. In this work we propose an operational method to identify proteins belonging to the twilight zone by combining into a consensus score good performing single predictors of folding. Results In this methodological paper dichotomic folding indexes are considered: hydrophobicity-charge, mean packing, mean pairwise energy, Poodle-W and a new global index, that is called here gVSL2, based on the local disorder predictor VSL2. The performance of these indexes is evaluated on different datasets, in particular on a new dataset composed by 2369 folded and 81 natively unfolded proteins. Poodle-W, gVSL2 and mean pairwise energy have good performance and stability in all the datasets considered and are combined into a strictly unanimous combination score SSU, that leaves proteins unclassified when the consensus of all combined indexes is not reached. The unclassified proteins: i) belong to an overlap region in the vector space of amino acidic compositions occupied by both folded and unfolded proteins; ii) are composed by approximately the same number of order-promoting and disorder-promoting amino acids; iii) have a mean flexibility intermediate between that of folded and that of unfolded proteins. Conclusions Our results show that proteins unclassified by SSU belong to a twilight zone. Proteins left unclassified by the consensus score SSU have physical properties intermediate between those of folded and those of natively unfolded proteins and their structural properties and evolutionary history are worth to be investigated. PMID:20409339
Hydrolytic catalysis and structural stabilization in a designed metalloprotein
Zastrow, Melissa L.; Peacock, Anna F. A.; Stuckey, Jeanne A.; Pecoraro, Vincent L.
2011-01-01
Metal ions are an important part of many natural proteins, providing structural, catalytic and electron transfer functions. Reproducing these functions in a designed protein is the ultimate challenge to our understanding of them. Here, we present an artificial metallohydrolase, which has been shown by X-ray crystallography to contain two different metal ions – a Zn(II) ion which is important for catalytic activity and a Hg(II) ion which provides structural stability. This metallohydrolase displays catalytic activity that compares well with several characteristic reactions of natural enzymes. It catalyses p-nitrophenyl acetate hydrolysis (pNPA) to within ~100-fold of the efficiency of human carbonic anhydrase (CA)II and is at least 550-fold better than comparable synthetic complexes. Similarly, CO2 hydration occurs with an efficiency within ~500-fold of CAII. While histidine residues in the absence of Zn(II) exhibit pNPA hydrolysis, miniscule apopeptide activity is observed for CO2 hydration. The kinetic and structural analysis of this first de novo designed hydrolytic metalloenzyme uncovers necessary design features for future metalloenzymes containing one or more metals. PMID:22270627
Action of the Hsp70 chaperone system observed with single proteins
NASA Astrophysics Data System (ADS)
Nunes, João M.; Mayer-Hartl, Manajit; Hartl, F. Ulrich; Müller, Daniel J.
2015-02-01
In Escherichia coli, the binding of non-native protein substrates to the Hsp70 chaperone DnaK is mediated by the co-chaperone DnaJ. DnaJ accelerates ATP hydrolysis on DnaK, by closing the peptide-binding cleft of DnaK. GrpE catalysed nucleotide exchange and ATP re-binding then lead to substrate release from DnaK, allowing folding. Here we refold immunoglobulin 27 (I27) to better understand how DnaJ-DnaK-GrpE chaperones cooperate. When DnaJ is present, I27 is less likely to misfold and more likely to fold, whereas the unfolded state remains unaffected. Thus, the ‘holdase’ DnaJ shows foldase behaviour. Misfolding of I27 is fully abrogated when DnaJ cooperates with DnaK, which stabilizes the unfolded state and increases the probability of folding. Addition of GrpE shifts the unfolded fraction of I27 to pre-chaperone levels. These insights reveal synergistic mechanisms within the evolutionary highly conserved Hsp70 system that prevent substrates from misfolding and promote their productive transition to the native state.
Hook, Vivian; Toneff, Thomas; Baylon, Sheley; Sei, Catherine
2009-01-01
Neuropeptides function as peptide neurotransmitters and hormones to mediate cell-cell communication. The goal of this study was to understand how different neuropeptides may be similarly or differentially regulated by protein kinase A (PKA) and protein kinase C (PKC) intracellular signaling mechanisms. Therefore, this study compared the differential effects of treating neuroendocrine chromaffin cells with stimulators of PKA and PKC on the production of the neuropeptides (Met)enkephalin, galanin, somatostatin, NPY, and VIP. Significantly, selective increases in production of these neuropeptides was observed by forskolin or PMA (phorbol myristate acetate) which stimulate PKA and PKC mechanisms, respectively. (Met)enkephalin production was stimulated by up to 2-fold by forskolin treatment, but not by PMA. In contrast, PMA treatment (but not forskolin) resulted in a 2-fold increase in production of galanin and somatostatin, and a 3-fold increase in NPY production. Notably, VIP production was highly stimulated by forskolin and PMA, with increases of 3-fold and 10–15-fold, respectively. Differences in elevated neuropeptides occurred in cell extracts compared to secretion media, which consisted of (i) increased NPY primarily in cell extracts, (ii) increased (Met)enkephalin and somatostatin in secretion media (not cell extracts), and (iii) increased galanin and VIP in both cell extracts and secretion media. Involvement of PKA or PKC for forskolin or PMA regulation of neuropeptide biosynthesis, respectively, was confirmed with direct inhibitors of PKA and PKC. The selective activation of neuropeptide production by forskolin and PMA demonstrates that PKA and PKC pathways are involved in the differential regulation of neuropeptide production. PMID:18619673
Randomized Subspace Learning for Proline Cis-Trans Isomerization Prediction.
Al-Jarrah, Omar Y; Yoo, Paul D; Taha, Kamal; Muhaidat, Sami; Shami, Abdallah; Zaki, Nazar
2015-01-01
Proline residues are common source of kinetic complications during folding. The X-Pro peptide bond is the only peptide bond for which the stability of the cis and trans conformations is comparable. The cis-trans isomerization (CTI) of X-Pro peptide bonds is a widely recognized rate-limiting factor, which can not only induces additional slow phases in protein folding but also modifies the millisecond and sub-millisecond dynamics of the protein. An accurate computational prediction of proline CTI is of great importance for the understanding of protein folding, splicing, cell signaling, and transmembrane active transport in both the human body and animals. In our earlier work, we successfully developed a biophysically motivated proline CTI predictor utilizing a novel tree-based consensus model with a powerful metalearning technique and achieved 86.58 percent Q2 accuracy and 0.74 Mcc, which is a better result than the results (70-73 percent Q2 accuracies) reported in the literature on the well-referenced benchmark dataset. In this paper, we describe experiments with novel randomized subspace learning and bootstrap seeding techniques as an extension to our earlier work, the consensus models as well as entropy-based learning methods, to obtain better accuracy through a precise and robust learning scheme for proline CTI prediction.
Principal component analysis for protein folding dynamics.
Maisuradze, Gia G; Liwo, Adam; Scheraga, Harold A
2009-01-09
Protein folding is considered here by studying the dynamics of the folding of the triple beta-strand WW domain from the Formin-binding protein 28. Starting from the unfolded state and ending either in the native or nonnative conformational states, trajectories are generated with the coarse-grained united residue (UNRES) force field. The effectiveness of principal components analysis (PCA), an already established mathematical technique for finding global, correlated motions in atomic simulations of proteins, is evaluated here for coarse-grained trajectories. The problems related to PCA and their solutions are discussed. The folding and nonfolding of proteins are examined with free-energy landscapes. Detailed analyses of many folding and nonfolding trajectories at different temperatures show that PCA is very efficient for characterizing the general folding and nonfolding features of proteins. It is shown that the first principal component captures and describes in detail the dynamics of a system. Anomalous diffusion in the folding/nonfolding dynamics is examined by the mean-square displacement (MSD) and the fractional diffusion and fractional kinetic equations. The collisionless (or ballistic) behavior of a polypeptide undergoing Brownian motion along the first few principal components is accounted for.
Retarded protein folding of deficient human α1-antitrypsin D256V and L41P variants
Jung, Chan-Hun; Na, Yu-Ran; Im, Hana
2004-01-01
α1-Antitrypsin is the most abundant protease inhibitor in plasma and is the archetype of the serine protease inhibitor superfamily. Genetic variants of human α1-antitrypsin are associated with early-onset emphysema and liver cirrhosis. However, the detailed molecular mechanism for the pathogenicity of most variant α1-antitrypsin molecules is not known. Here we examined the structural basis of a dozen deficient α1-antitrypsin variants. Unlike most α1-antitrypsin variants, which were unstable, D256V and L41P variants exhibited extremely retarded protein folding as compared with the wild-type molecule. Once folded, however, the stability and inhibitory activity of these variant proteins were comparable to those of the wild-type molecule. Retarded protein folding may promote protein aggregation by allowing the accumulation of aggregation-prone folding intermediates. Repeated observations of retarded protein folding indicate that it is an important mechanism causing α1-antitrypsin deficiency by variant molecules, which have to fold into the metastable native form to be functional. PMID:14767073
The Dominant Folding Route Minimizes Backbone Distortion in SH3
Lammert, Heiko; Noel, Jeffrey K.; Onuchic, José N.
2012-01-01
Energetic frustration in protein folding is minimized by evolution to create a smooth and robust energy landscape. As a result the geometry of the native structure provides key constraints that shape protein folding mechanisms. Chain connectivity in particular has been identified as an essential component for realistic behavior of protein folding models. We study the quantitative balance of energetic and geometrical influences on the folding of SH3 in a structure-based model with minimal energetic frustration. A decomposition of the two-dimensional free energy landscape for the folding reaction into relevant energy and entropy contributions reveals that the entropy of the chain is not responsible for the folding mechanism. Instead the preferred folding route through the transition state arises from a cooperative energetic effect. Off-pathway structures are penalized by excess distortion in local backbone configurations and contact pair distances. This energy cost is a new ingredient in the malleable balance of interactions that controls the choice of routes during protein folding. PMID:23166485
Evidence for the principle of minimal frustration in the evolution of protein folding landscapes.
Tzul, Franco O; Vasilchuk, Daniel; Makhatadze, George I
2017-02-28
Theoretical and experimental studies have firmly established that protein folding can be described by a funneled energy landscape. This funneled energy landscape is the result of foldable protein sequences evolving following the principle of minimal frustration, which allows proteins to rapidly fold to their native biologically functional conformations. For a protein family with a given functional fold, the principle of minimal frustration suggests that, independent of sequence, all proteins within this family should fold with similar rates. However, depending on the optimal living temperature of the organism, proteins also need to modulate their thermodynamic stability. Consequently, the difference in thermodynamic stability should be primarily caused by differences in the unfolding rates. To test this hypothesis experimentally, we performed comprehensive thermodynamic and kinetic analyses of 15 different proteins from the thioredoxin family. Eight of these thioredoxins were extant proteins from psychrophilic, mesophilic, or thermophilic organisms. The other seven protein sequences were obtained using ancestral sequence reconstruction and can be dated back over 4 billion years. We found that all studied proteins fold with very similar rates but unfold with rates that differ up to three orders of magnitude. The unfolding rates correlate well with the thermodynamic stability of the proteins. Moreover, proteins that unfold slower are more resistant to proteolysis. These results provide direct experimental support to the principle of minimal frustration hypothesis.
Guarracino, Danielle A; Gentile, Kayla; Grossman, Alec; Li, Evan; Refai, Nader; Mohnot, Joy; King, Daniel
2018-02-01
Determining the minimal sequence necessary to induce protein folding is beneficial in understanding the role of protein-protein interactions in biological systems, as their three-dimensional structures often dictate their activity. Proteins are generally comprised of discrete secondary structures, from α-helices to β-turns and larger β-sheets, each of which is influenced by its primary structure. Manipulating the sequence of short, moderately helical peptides can help elucidate the influences on folding. We created two new scaffolds based on a modestly helical eight-residue peptide, PT3, we previously published. Using circular dichroism (CD) spectroscopy and changing the possible salt-bridging residues to new combinations of Lys, Arg, Glu, and Asp, we found that our most helical improvements came from the Arg-Glu combination, whereas the Lys-Asp was not significantly different from the Lys-Glu of the parent scaffold, PT3. The marked 3 10 -helical contributions in PT3 were lessened in the Arg-Glu-containing peptide with the beginning of cooperative unfolding seen through a thermal denaturation. However, a unique and unexpected signature was seen for the denaturation of the Lys-Asp peptide which could help elucidate the stages of folding between the 3 10 and α-helix. In addition, we developed a short six-residue peptide with β-turn/sheet CD signature, again to help study minimal sequences needed for folding. Overall, the results indicate that improvements made to short peptide scaffolds by fine-tuning the salt-bridging residues can enhance scaffold structure. Likewise, with the results from the new, short β-turn motif, these can help impact future peptidomimetic designs in creating biologically useful, short, structured β-sheet-forming peptides.
Niu, Xiaofan; Ah-Fong, Audrey M V; Lopez, Lilianna A; Judelson, Howard S
2018-01-01
Sexual reproduction remains an understudied feature of oomycete biology. To expand our knowledge of this process, we used RNA-seq and quantitative proteomics to examine matings in Phytophthora infestans. Exhibiting significant changes in mRNA abundance in three matings between different A1 and A2 strains compared to nonmating controls were 1170 genes, most being mating-induced. Rising by >10-fold in at least one cross were 455 genes, and 182 in all three crosses. Most genes had elevated expression in a self-fertile strain. Many mating-induced genes were associated with cell wall biosynthesis, which may relate to forming the thick-walled sexual spore (oospore). Several gene families were induced during mating including one encoding histidine, serine, and tyrosine-rich putative wall proteins, and another encoding prolyl hydroxylases which may strengthen the extracellular matrix. The sizes of these families vary >10-fold between Phytophthora species and one exhibits concerted evolution, highlighting two features of genome dynamics within the genus. Proteomic analyses of mature oospores and nonmating hyphae using isobaric tags for quantification identified 835 shared proteins, with 5% showing >2-fold changes in abundance between the tissues. Enriched in oospores were β-glucanases potentially involved in digesting the oospore wall during germination. Despite being dormant, oospores contained a mostly normal complement of proteins required for core cellular functions. The RNA-seq data generated here and in prior studies were used to identify new housekeeping controls for gene expression studies that are more stable than existing normalization standards. We also observed >2-fold variation in the fraction of polyA+ RNA between life stages, which should be considered when quantifying transcripts and may also be relevant to understanding translational control during development.
The Endoplasmic Reticulum and the Unfolded Protein Response
Malhotra, Jyoti D.; Kaufman, Randal J.
2009-01-01
The endoplasmic reticulum (ER) is the site where proteins enter the secretory pathway. Proteins are translocated into the ER lumen in an unfolded state and require protein chaperones and catalysts of protein folding to attain their final appropriate conformation. A sensitive surveillance mechanism exists to prevent misfolded proteins from transiting the secretory pathway and ensures that persistently misfolded proteins are directed towards a degradative pathway. In addition, those processes that prevent accumulation of unfolded proteins in the ER lumen are highly regulated by an intracellular signaling pathway known as the unfolded protein response (UPR). The UPR provides a mechanism by which cells can rapidly adapt to alterations in client protein-folding load in the ER lumen by expanding the capacity for protein folding. In addition, a variety of insults that disrupt protein folding in the ER lumen also activate the UPR. These include changes in intralumenal calcium, altered glycosylation, nutrient deprivation, pathogen infection, expression of folding-defective proteins, and changes in redox status. Persistent protein misfolding initiates apoptotic cascades that are now known to play fundamental roles in the pathogenesis of multiple human diseases including diabetes, atherosclerosis and neurodegenerative diseases. PMID:18023214
Ligand-promoted protein folding by biased kinetic partitioning.
Hingorani, Karan S; Metcalf, Matthew C; Deming, Derrick T; Garman, Scott C; Powers, Evan T; Gierasch, Lila M
2017-04-01
Protein folding in cells occurs in the presence of high concentrations of endogenous binding partners, and exogenous binding partners have been exploited as pharmacological chaperones. A combined mathematical modeling and experimental approach shows that a ligand improves the folding of a destabilized protein by biasing the kinetic partitioning between folding and alternative fates (aggregation or degradation). Computationally predicted inhibition of test protein aggregation and degradation as a function of ligand concentration are validated by experiments in two disparate cellular systems.
Ligand-Promoted Protein Folding by Biased Kinetic Partitioning
Hingorani, Karan S.; Metcalf, Matthew C.; Deming, Derrick T.; Garman, Scott C.; Powers, Evan T.; Gierasch, Lila M.
2017-01-01
Protein folding in cells occurs in the presence of high concentrations of endogenous binding partners, and exogenous binding partners have been exploited as pharmacological chaperones. A combined mathematical modeling and experimental approach shows that a ligand improves the folding of a destabilized protein by biasing the kinetic partitioning between folding and alternative fates (aggregation or degradation). Computationally predicted inhibition of test protein aggregation and degradation as a function of ligand concentration are validated by experiments in two disparate cellular systems. PMID:28218913
Global Survey of Protein Expression during Gonadal Sex Determination in Mice*
Ewen, Katherine; Baker, Mark; Wilhelm, Dagmar; Aitken, R. John; Koopman, Peter
2009-01-01
The development of an embryo as male or female depends on differentiation of the gonads as either testes or ovaries. A number of genes are known to be important for gonadal differentiation, but our understanding of the regulatory networks underpinning sex determination remains fragmentary. To advance our understanding of sexual development beyond the transcriptome level, we performed the first global survey of the mouse gonad proteome at the time of sex determination by using two-dimensional nanoflow LC-MS/MS. The resulting data set contains a total of 1037 gene products (154 non-redundant and 883 redundant proteins) identified from 620 peptides. Functional classification and biological network construction suggested that the identified proteins primarily serve in RNA post-transcriptional modification and trafficking, protein synthesis and folding, and post-translational modification. The data set contains potential novel regulators of gonad development and sex determination not revealed previously by transcriptomics and proteomics studies and more than 60 proteins with potential links to human disorders of sexual development. PMID:19617587
Lu, Diannan; Liu, Zheng; Wu, Jianzhong
2006-01-01
Proteins fold in a confined space not only in vivo, i.e., folding assisted by molecular chaperons and chaperonins in a crowded cellular medium, but also in vitro as in production of recombinant proteins. Despite extensive work on protein folding in bulk, little is known about how and to what extent the thermodynamics and kinetics of protein folding are altered by confinement. In this work, we use a Gō-like off-lattice model to investigate the folding and stability of an all β-sheet protein in spherical cages of different sizes and surface hydrophobicity. We find whereas extreme confinement inhibits correct folding, a hydrophilic cage stabilizes the protein due to restriction of the unfolded configurations. In a hydrophobic cage, however, strong attraction from the cage surface destabilizes the confined protein because of competition between self-aggregation and adsorption of hydrophobic residues. We show that the kinetics of protein collapse and folding is strongly correlated with both the cage size and the surface hydrophobicity. It is demonstrated that a cage of moderate size and hydrophobicity optimizes both the folding yield and kinetics of structural transitions. To support the simulation results, we have also investigated the refolding of hen-egg lysozyme in the presence of cetyltrimethylammoniumbromide (CTAB) surfactants that provide an effective confinement of the proteins by micellization. The influence of the surfactant hydrophobicity on the structural and biological activity of the protein is determined with circular dichroism spectrum, fluorescence emission spectrum, and biological activity assay. It is shown that, as predicted by coarse-grained simulations, CTAB micelles facilitate the collapse of denatured lysozyme, whereas the addition of β-cyclodextrin-grafted-PNIPAAm, a weakly hydrophobic stripper, dissociates CTAB micelles and promotes the conformational rearrangement and thereby gives an improved recovery of lysozyme activity. PMID:16461405
Absolute comparison of simulated and experimental protein-folding dynamics
NASA Astrophysics Data System (ADS)
Snow, Christopher D.; Nguyen, Houbi; Pande, Vijay S.; Gruebele, Martin
2002-11-01
Protein folding is difficult to simulate with classical molecular dynamics. Secondary structure motifs such as α-helices and β-hairpins can form in 0.1-10µs (ref. 1), whereas small proteins have been shown to fold completely in tens of microseconds. The longest folding simulation to date is a single 1-µs simulation of the villin headpiece; however, such single runs may miss many features of the folding process as it is a heterogeneous reaction involving an ensemble of transition states. Here, we have used a distributed computing implementation to produce tens of thousands of 5-20-ns trajectories (700µs) to simulate mutants of the designed mini-protein BBA5. The fast relaxation dynamics these predict were compared with the results of laser temperature-jump experiments. Our computational predictions are in excellent agreement with the experimentally determined mean folding times and equilibrium constants. The rapid folding of BBA5 is due to the swift formation of secondary structure. The convergence of experimentally and computationally accessible timescales will allow the comparison of absolute quantities characterizing in vitro and in silico (computed) protein folding.
NASA Astrophysics Data System (ADS)
Xu, Zhijun; Lazim, Raudah; Sun, Tiedong; Mei, Ye; Zhang, Dawei
2012-04-01
Solvent effect on protein conformation and folding mechanism of E6-associated protein (E6ap) peptide are investigated using a recently developed charge update scheme termed as adaptive hydrogen bond-specific charge (AHBC). On the basis of the close agreement between the calculated helix contents from AHBC simulations and experimental results, we observed based on the presented simulations that the two ends of the peptide may simultaneously take part in the formation of the helical structure at the early stage of folding and finally merge to form a helix with lowest backbone RMSD of about 0.9 Å in 40% 2,2,2-trifluoroethanol solution. However, in pure water, the folding may start at the center of the peptide sequence instead of at the two opposite ends. The analysis of the free energy landscape indicates that the solvent may determine the folding clusters of E6ap, which subsequently leads to the different final folded structure. The current study demonstrates new insight to the role of solvent in the determination of protein structure and folding dynamics.
The transcriptional response of Escherichia coli to recombinant protein insolubility.
Smith, Harold E
2007-03-01
Bacterial production of recombinant proteins offers several advantages over alternative expression methods and remains the system of choice for many structural genomics projects. However, a large percentage of targets accumulate as insoluble inclusion bodies rather than soluble protein, creating a significant bottleneck in the protein production pipeline. Numerous strategies have been reported that can improve in vivo protein solubility, but most do not scale easily for high-throughput expression screening. To understand better the host cell response to the accumulation of insoluble protein, we determined genome-wide changes in bacterial gene expression upon induction of either soluble or insoluble target proteins. By comparing transcriptional profiles for multiple examples from the soluble or insoluble class, we identified a pattern of gene expression that correlates strongly with protein solubility. Direct targets of the sigma32 heat shock sigma factor, which includes genes involved in protein folding and degradation, were highly expressed in response to induction of insoluble protein. This same group of genes was also upregulated by insoluble protein accumulation under a different growth regime, indicating that sigma32-mediated gene expression is a general response to protein insolubility. This knowledge provides a starting point for the rational design of growth parameters and host strains with improved protein solubility characteristics. Summary Problems with protein solubility are frequently encountered when recombinant proteins are expressed in E. coli. The bacterial host responds to this problem by increasing expression of the protein folding machinery via the heat shock sigma factor sigma32. Manipulation of the sigma32 regulon might provide a general mechanism for improving recombinant protein solubility.
Identifying Affordances of 3D Printed Tangible Models for Understanding Core Biological Concepts
ERIC Educational Resources Information Center
Davenport, Jodi L.; Silberglitt, Matt; Boxerman, Jonathan; Olson, Arthur
2014-01-01
3D models derived from actual molecular structures have the potential to transform student learning in biology. We share findings related to our research questions: 1) what types of interactions with a protein folding kit promote specific learning objectives?, and 2) what features of the instructional environment (e.g., peer interactions, teacher…
Dodging the crisis of folding proteins with knots
NASA Astrophysics Data System (ADS)
Sulkowska, Joanna
2009-03-01
Proteins with nontrivial topology, containing knots and slipknots, have the ability to fold to their native states without any additional external forces invoked. A mechanism is suggested for folding of these proteins, such as YibK and YbeA, which involves an intermediate configuration with a slipknot. It elucidates the role of topological barriers and backtracking during the folding event. It also illustrates that native contacts are sufficient to guarantee folding in around 1-2% of the simulations, and how slipknot intermediates are needed to reduce the topological bottlenecks. As expected, simulations of proteins with similar structure but with knot removed fold much more efficiently, clearly demonstrating the origin of these topological barriers. Although these studies are based on a simple coarse-grained model, they are already able to extract some of the underlying principles governing folding in such complex topologies.
Shamim, Mohammad Tabrez Anwar; Anwaruddin, Mohammad; Nagarajaram, H A
2007-12-15
Fold recognition is a key step in the protein structure discovery process, especially when traditional sequence comparison methods fail to yield convincing structural homologies. Although many methods have been developed for protein fold recognition, their accuracies remain low. This can be attributed to insufficient exploitation of fold discriminatory features. We have developed a new method for protein fold recognition using structural information of amino acid residues and amino acid residue pairs. Since protein fold recognition can be treated as a protein fold classification problem, we have developed a Support Vector Machine (SVM) based classifier approach that uses secondary structural state and solvent accessibility state frequencies of amino acids and amino acid pairs as feature vectors. Among the individual properties examined secondary structural state frequencies of amino acids gave an overall accuracy of 65.2% for fold discrimination, which is better than the accuracy by any method reported so far in the literature. Combination of secondary structural state frequencies with solvent accessibility state frequencies of amino acids and amino acid pairs further improved the fold discrimination accuracy to more than 70%, which is approximately 8% higher than the best available method. In this study we have also tested, for the first time, an all-together multi-class method known as Crammer and Singer method for protein fold classification. Our studies reveal that the three multi-class classification methods, namely one versus all, one versus one and Crammer and Singer method, yield similar predictions. Dataset and stand-alone program are available upon request.
Chen, Minyong; Samuelson, James C
2016-06-14
The requirements for target protein folding in M13 phage display are largely underappreciated. Here we chose Fbs1, a carbohydrate binding protein, as a model to address this issue. Importantly, folding of Fbs1 is impaired in an oxidative environment. Fbs1 can be displayed on M13 phage using the SRP or Sec pathway. However, the displayed Fbs1 protein is properly folded only when Fbs1 is translocated via the SRP pathway and displayed using Escherichia coli cells with a DsbA-negative periplasm. This study indicates M13 phage display may be improved using a system specifically designed according to the folding requirements of each target protein.
Roles of beta-turns in protein folding: from peptide models to protein engineering.
Marcelino, Anna Marie C; Gierasch, Lila M
2008-05-01
Reverse turns are a major class of protein secondary structure; they represent sites of chain reversal and thus sites where the globular character of a protein is created. It has been speculated for many years that turns may nucleate the formation of structure in protein folding, as their propensity to occur will favor the approximation of their flanking regions and their general tendency to be hydrophilic will favor their disposition at the solvent-accessible surface. Reverse turns are local features, and it is therefore not surprising that their structural properties have been extensively studied using peptide models. In this article, we review research on peptide models of turns to test the hypothesis that the propensities of turns to form in short peptides will relate to the roles of corresponding sequences in protein folding. Turns with significant stability as isolated entities should actively promote the folding of a protein, and by contrast, turn sequences that merely allow the chain to adopt conformations required for chain reversal are predicted to be passive in the folding mechanism. We discuss results of protein engineering studies of the roles of turn residues in folding mechanisms. Factors that correlate with the importance of turns in folding indeed include their intrinsic stability, as well as their topological context and their participation in hydrophobic networks within the protein's structure.
Protein domain definition should allow for conditional disorder
Yegambaram, Kavestri; Bulloch, Esther MM; Kingston, Richard L
2013-01-01
Abstract: Proteins are often classified in a binary fashion as either structured or disordered. However this approach has several deficits. Firstly, protein folding is always conditional on the physiochemical environment. A protein which is structured in some circumstances will be disordered in others. Secondly, it hides a fundamental asymmetry in behavior. While all structured proteins can be unfolded through a change in environment, not all disordered proteins have the capacity for folding. Failure to accommodate these complexities confuses the definition of both protein structural domains and intrinsically disordered regions. We illustrate these points with an experimental study of a family of small binding domains, drawn from the RNA polymerase of mumps virus and its closest relatives. Assessed at face value the domains fall on a structural continuum, with folded, partially folded, and near unstructured members. Yet the disorder present in the family is conditional, and these closely related polypeptides can access the same folded state under appropriate conditions. Any heuristic definition of the protein domain emphasizing conformational stability divides this domain family in two, in a way that makes no biological sense. Structural domains would be better defined by their ability to adopt a specific tertiary structure: a structure that may or may not be realized, dependent on the circumstances. This explicitly allows for the conditional nature of protein folding, and more clearly demarcates structural domains from intrinsically disordered regions that may function without folding. PMID:23963781
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.
Wang, Sheng; Sun, Siqi; Li, Zhen; Zhang, Renyu; Xu, Jinbo
2017-01-01
Protein contacts contain key information for the understanding of protein structure and function and thus, contact prediction from sequence is an important problem. Recently exciting progress has been made on this problem, but the predicted contacts for proteins without many sequence homologs is still of low quality and not very useful for de novo structure prediction. This paper presents a new deep learning method that predicts contacts by integrating both evolutionary coupling (EC) and sequence conservation information through an ultra-deep neural network formed by two deep residual neural networks. The first residual network conducts a series of 1-dimensional convolutional transformation of sequential features; the second residual network conducts a series of 2-dimensional convolutional transformation of pairwise information including output of the first residual network, EC information and pairwise potential. By using very deep residual networks, we can accurately model contact occurrence patterns and complex sequence-structure relationship and thus, obtain higher-quality contact prediction regardless of how many sequence homologs are available for proteins in question. Our method greatly outperforms existing methods and leads to much more accurate contact-assisted folding. Tested on 105 CASP11 targets, 76 past CAMEO hard targets, and 398 membrane proteins, the average top L long-range prediction accuracy obtained by our method, one representative EC method CCMpred and the CASP11 winner MetaPSICOV is 0.47, 0.21 and 0.30, respectively; the average top L/10 long-range accuracy of our method, CCMpred and MetaPSICOV is 0.77, 0.47 and 0.59, respectively. Ab initio folding using our predicted contacts as restraints but without any force fields can yield correct folds (i.e., TMscore>0.6) for 203 of the 579 test proteins, while that using MetaPSICOV- and CCMpred-predicted contacts can do so for only 79 and 62 of them, respectively. Our contact-assisted models also have much better quality than template-based models especially for membrane proteins. The 3D models built from our contact prediction have TMscore>0.5 for 208 of the 398 membrane proteins, while those from homology modeling have TMscore>0.5 for only 10 of them. Further, even if trained mostly by soluble proteins, our deep learning method works very well on membrane proteins. In the recent blind CAMEO benchmark, our fully-automated web server implementing this method successfully folded 6 targets with a new fold and only 0.3L-2.3L effective sequence homologs, including one β protein of 182 residues, one α+β protein of 125 residues, one α protein of 140 residues, one α protein of 217 residues, one α/β of 260 residues and one α protein of 462 residues. Our method also achieved the highest F1 score on free-modeling targets in the latest CASP (Critical Assessment of Structure Prediction), although it was not fully implemented back then. http://raptorx.uchicago.edu/ContactMap/.
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model
Li, Zhen; Zhang, Renyu
2017-01-01
Motivation Protein contacts contain key information for the understanding of protein structure and function and thus, contact prediction from sequence is an important problem. Recently exciting progress has been made on this problem, but the predicted contacts for proteins without many sequence homologs is still of low quality and not very useful for de novo structure prediction. Method This paper presents a new deep learning method that predicts contacts by integrating both evolutionary coupling (EC) and sequence conservation information through an ultra-deep neural network formed by two deep residual neural networks. The first residual network conducts a series of 1-dimensional convolutional transformation of sequential features; the second residual network conducts a series of 2-dimensional convolutional transformation of pairwise information including output of the first residual network, EC information and pairwise potential. By using very deep residual networks, we can accurately model contact occurrence patterns and complex sequence-structure relationship and thus, obtain higher-quality contact prediction regardless of how many sequence homologs are available for proteins in question. Results Our method greatly outperforms existing methods and leads to much more accurate contact-assisted folding. Tested on 105 CASP11 targets, 76 past CAMEO hard targets, and 398 membrane proteins, the average top L long-range prediction accuracy obtained by our method, one representative EC method CCMpred and the CASP11 winner MetaPSICOV is 0.47, 0.21 and 0.30, respectively; the average top L/10 long-range accuracy of our method, CCMpred and MetaPSICOV is 0.77, 0.47 and 0.59, respectively. Ab initio folding using our predicted contacts as restraints but without any force fields can yield correct folds (i.e., TMscore>0.6) for 203 of the 579 test proteins, while that using MetaPSICOV- and CCMpred-predicted contacts can do so for only 79 and 62 of them, respectively. Our contact-assisted models also have much better quality than template-based models especially for membrane proteins. The 3D models built from our contact prediction have TMscore>0.5 for 208 of the 398 membrane proteins, while those from homology modeling have TMscore>0.5 for only 10 of them. Further, even if trained mostly by soluble proteins, our deep learning method works very well on membrane proteins. In the recent blind CAMEO benchmark, our fully-automated web server implementing this method successfully folded 6 targets with a new fold and only 0.3L-2.3L effective sequence homologs, including one β protein of 182 residues, one α+β protein of 125 residues, one α protein of 140 residues, one α protein of 217 residues, one α/β of 260 residues and one α protein of 462 residues. Our method also achieved the highest F1 score on free-modeling targets in the latest CASP (Critical Assessment of Structure Prediction), although it was not fully implemented back then. Availability http://raptorx.uchicago.edu/ContactMap/ PMID:28056090
Tang, Nou-Ying; Chueh, Fu-Shin; Yu, Chien-Chih; Liao, Ching-Lung; Lin, Jen-Jyh; Hsia, Te-Chun; Wu, King-Chuen; Liu, Hsin-Chung; Lu, Kung-Wen; Chung, Jing-Gung
2016-04-01
Glioblastoma multiforme (GBM) is a highly malignant devastating brain tumor in adults. Benzyl isothiocyanate (BITC) is one of the isothiocyanates that have been shown to induce human cancer cell apoptosis and cell cycle arrest. Herein, the effect of BITC on cell viability and apoptotic cell death and the genetic levels of human brain glioblastoma GBM 8401 cells in vitro were investigated. We found that BITC induced cell morphological changes, decreased cell viability and the induction of cell apoptosis in GBM 8401 cells was time-dependent. cDNA microarray was used to examine the effects of BITC on GBM 8401 cells and we found that numerous genes associated with cell death and cell cycle regulation in GBM 8401 cells were altered after BITC treatment. The results show that expression of 317 genes was upregulated, and two genes were associated with DNA damage, the DNA-damage-inducible transcript 3 (DDIT3) was increased 3.66-fold and the growth arrest and DNA-damage-inducible α (GADD45A) was increased 2.34-fold. We also found that expression of 182 genes was downregulated and two genes were associated with receptor for cell responses to stimuli, the EGF containing fibulin-like extracellular matrix protein 1 (EFEMP1) was inhibited 2.01-fold and the TNF receptor-associated protein 1 (TRAP1) was inhibited 2.08-fold. BITC inhibited seven mitochondria ribosomal genes, the mitochondrial ribosomal protein; tumor protein D52 (MRPS28) was inhibited 2.06-fold, the mitochondria ribosomal protein S2 (MRPS2) decreased 2.07-fold, the mitochondria ribosomal protein L23 (MRPL23) decreased 2.08-fold, the mitochondria ribosomal protein S2 (MRPS2) decreased 2.07-fold, the mitochondria ribosomal protein S12 (MRPS12) decreased 2.08-fold, the mitochondria ribosomal protein L12 (MRPL12) decreased 2.25-fold and the mitochondria ribosomal protein S34 (MRPS34) was decreased 2.30-fold in GBM 8401 cells. These changes of gene expression can provide the effects of BITC on the genetic level and are potential biomarkers for glioblastoma therapy.
NASA Astrophysics Data System (ADS)
Qin, Sanbo; Mittal, Jeetain; Zhou, Huan-Xiang
2013-08-01
We have developed a ‘postprocessing’ method for modeling biochemical processes such as protein folding under crowded conditions (Qin and Zhou 2009 Biophys. J. 97 12-19). In contrast to the direct simulation approach, in which the protein undergoing folding is simulated along with crowders, the postprocessing method requires only the folding simulation without crowders. The influence of the crowders is then obtained by taking conformations from the crowder-free simulation and calculating the free energies of transferring to the crowders. This postprocessing yields the folding free energy surface of the protein under crowding. Here the postprocessing results for the folding of three small proteins under ‘repulsive’ crowding are validated by those obtained previously by the direct simulation approach (Mittal and Best 2010 Biophys. J. 98 315-20). This validation confirms the accuracy of the postprocessing approach and highlights its distinct advantages in modeling biochemical processes under cell-like crowded conditions, such as enabling an atomistic representation of the test proteins.
Ross, Jennifer L
2016-09-06
The inside of the cell is full of important, yet invisible species of molecules and proteins that interact weakly but couple together to have huge and important effects in many biological processes. Such "dark matter" inside cells remains mostly hidden, because our tools were developed to investigate strongly interacting species and folded proteins. Example dark-matter species include intrinsically disordered proteins, posttranslational states, ion species, and rare, transient, and weak interactions undetectable by biochemical assays. The dark matter of biology is likely to have multiple, vital roles to regulate signaling, rates of reactions, water structure and viscosity, crowding, and other cellular activities. We need to create new tools to image, detect, and understand these dark-matter species if we are to truly understand fundamental physical principles of biology. Copyright © 2016 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Global analysis of protein folding using massively parallel design, synthesis and testing
Rocklin, Gabriel J.; Chidyausiku, Tamuka M.; Goreshnik, Inna; Ford, Alex; Houliston, Scott; Lemak, Alexander; Carter, Lauren; Ravichandran, Rashmi; Mulligan, Vikram K.; Chevalier, Aaron; Arrowsmith, Cheryl H.; Baker, David
2017-01-01
Proteins fold into unique native structures stabilized by thousands of weak interactions that collectively overcome the entropic cost of folding. Though these forces are “encoded” in the thousands of known protein structures, “decoding” them is challenging due to the complexity of natural proteins that have evolved for function, not stability. Here we combine computational protein design, next-generation gene synthesis, and a high-throughput protease susceptibility assay to measure folding and stability for over 15,000 de novo designed miniproteins, 1,000 natural proteins, 10,000 point-mutants, and 30,000 negative control sequences, identifying over 2,500 new stable designed proteins in four basic folds. This scale—three orders of magnitude greater than that of previous studies of design or folding—enabled us to systematically examine how sequence determines folding and stability in uncharted protein space. Iteration between design and experiment increased the design success rate from 6% to 47%, produced stable proteins unlike those found in nature for topologies where design was initially unsuccessful, and revealed subtle contributions to stability as designs became increasingly optimized. Our approach achieves the long-standing goal of a tight feedback cycle between computation and experiment, and promises to transform computational protein design into a data-driven science. PMID:28706065
Engineered fluorescent proteins illuminate the bacterial periplasm
Dammeyer, Thorben; Tinnefeld, Philip
2012-01-01
The bacterial periplasm is of special interest whenever cell factories are designed and engineered. Recombinantely produced proteins are targeted to the periplasmic space of Gram negative bacteria to take advantage of the authentic N-termini, disulfide bridge formation and easy accessibility for purification with less contaminating cellular proteins. The oxidizing environment of the periplasm promotes disulfide bridge formation - a prerequisite for proper folding of many proteins into their active conformation. In contrast, the most popular reporter protein in all of cell biology, Green Fluorescent Protein (GFP), remains inactive if translocated to the periplasmic space prior to folding. Here, the self-catalyzed chromophore maturation is blocked by formation of covalent oligomers via interchain disulfide bonds in the oxidizing environment. However, different protein engineering approaches addressing folding and stability of GFP resulted in improved proteins with enhanced folding properties. Recent studies describe GFP variants that are not only active if translocated in their folded form via the twin-arginine translocation (Tat) pathway, but actively fold in the periplasm following general secretory pathway (Sec) and signal recognition particle (SRP) mediated secretion. This mini-review highlights the progress that enables new insights into bacterial export and periplasmic protein organization, as well as new biotechnological applications combining the advantages of the periplasmic production and the Aequorea-based fluorescent reporter proteins. PMID:24688673
Zaksas, Nataliya; Gluhcheva, Yordanka; Sedykh, Sergey; Madzharova, Maria; Atanassova, Nina; Nevinsky, Georgy
2013-01-01
Cobalt (Co) is a transition metal and an essential trace element, required for vitamin B(12) biosynthesis, enzyme activation and other biological processes, but toxic in high concentrations. There is lack of data for the effect of long-term Co(II) treatment on the concentrations of other trace elements. We estimate the influence of cobalt chloride (CoCl(2)) on the relative content of different metals in mouse plasma using two-jet arc plasmatron atomic emission and on the total protein content. On average, the content of different elements in the plasma of 2-month-old balb/c mice (control group) decreased in the order: Ca>Mg>Si>Fe>Zn>Cu≥Al≥B. The treatment of mice for 60 days with CoCl(2) (daily dose 125 mg/kg) did not appreciably change the relative content of Ca, Cu, and Zn, while a 2.4-fold statistically significant decrease in the content of B and significant increase in the content of Mg (1.4-fold), Al and Fe (2.0-fold) and Si (3.2-fold) was found. A detectable amount of Mo was observed only for two control mice, while the plasma of 9 out of 16 mice of the treated group contained this metal. The administration of Co made its concentration detectable in the plasma of all mice of the treated group, but the relative content varied significantly. The treatment led to a 2.2-fold decrease in the concentration of the total plasma protein. Chronic exposure to CoCl(2) affects homeostasis as well as the concentrations and metabolism of other essential elements, probably due to competition of Co ions for similar binding sites within cells, altered signal transduction and protein biosynthesis. Long-term treatment also leads to significant weight changes and reduces the total protein concentration. The data may be useful for an understanding of Co toxicity, its effect on the concentration of other metal ions and different physiological processes. Copyright © 2012 Elsevier GmbH. All rights reserved.
Da Costa, Gustavo Góes; Gomig, Talita Helen Bombardelli; Kaviski, Rodrigo; Santos Sousa, Karla; Kukolj, Caroline; De Lima, Rubens Silveira; De Andrade Urban, Cicero; Cavalli, Iglenir J; Ribeiro, Enilze M S F
2015-01-01
Breast cancer is the most common type of cancer among women worldwide, and about 57,000 new cases are expected for the Brazilian population in 2015. Elucidation of protein expression and modification is essential for the biological understanding, early diagnosis and therapeutics of breast cancer. The main objectives of the study are comparison between the proteome of tumor and paired non-tumor breast cancer tissues, describing all identified proteins, highlighting the ones most differentially expressed and comparing the data with existing literature. The five paired samples from patients with invasive ductal carcinoma were analyzed by 2-DE and MS. We collected 161 identified spots corresponding to 110 distinct proteins. Forty-three differentially-expressed spots were common to at least two samples, and the ten proteins with the highest-fold changes were CASPE, ENOG, TPM1, CAPG, VIME, TPM3, TRFE, PDIA6, WDR61 and PDIA3. Metabolic enzymes and proteins with binding functions were the most representative functional classes of proteins with increased and decreased expression in tumor tissue respectively. Taking the fold change as a parameter, we point to future targets to be studied by functional methods in a search for biomarkers for initiation and progress of breast cancer. Copyright© 2015, International Institute of Anticancer Research (Dr. John G. Delinasios), All rights reserved.
Labudde, Dirk
2015-01-01
The importance of short membrane sequence motifs has been shown in many works and emphasizes the related sequence motif analysis. Together with specific transmembrane helix-helix interactions, the analysis of interacting sequence parts is helpful for understanding the process during membrane protein folding and in retaining the three-dimensional fold. Here we present a simple high-throughput analysis method for deriving mutational information of interacting sequence parts. Applied on aquaporin water channel proteins, our approach supports the analysis of mutational variants within different interacting subsequences and finally the investigation of natural variants which cause diseases like, for example, nephrogenic diabetes insipidus. In this work we demonstrate a simple method for massive membrane protein data analysis. As shown, the presented in silico analyses provide information about interacting sequence parts which are constrained by protein evolution. We present a simple graphical visualization medium for the representation of evolutionary influenced interaction pattern pairs (EIPPs) adapted to mutagen investigations of aquaporin-2, a protein whose mutants are involved in the rare endocrine disorder known as nephrogenic diabetes insipidus, and membrane proteins in general. Furthermore, we present a new method to derive new evolutionary variations within EIPPs which can be used for further mutagen laboratory investigations. PMID:26180540
Grunert, Steffen; Labudde, Dirk
2015-01-01
The importance of short membrane sequence motifs has been shown in many works and emphasizes the related sequence motif analysis. Together with specific transmembrane helix-helix interactions, the analysis of interacting sequence parts is helpful for understanding the process during membrane protein folding and in retaining the three-dimensional fold. Here we present a simple high-throughput analysis method for deriving mutational information of interacting sequence parts. Applied on aquaporin water channel proteins, our approach supports the analysis of mutational variants within different interacting subsequences and finally the investigation of natural variants which cause diseases like, for example, nephrogenic diabetes insipidus. In this work we demonstrate a simple method for massive membrane protein data analysis. As shown, the presented in silico analyses provide information about interacting sequence parts which are constrained by protein evolution. We present a simple graphical visualization medium for the representation of evolutionary influenced interaction pattern pairs (EIPPs) adapted to mutagen investigations of aquaporin-2, a protein whose mutants are involved in the rare endocrine disorder known as nephrogenic diabetes insipidus, and membrane proteins in general. Furthermore, we present a new method to derive new evolutionary variations within EIPPs which can be used for further mutagen laboratory investigations.
Structural perturbations on huntingtin N17 domain during its folding on 2D-nanomaterials
NASA Astrophysics Data System (ADS)
Zhang, Leili; Feng, Mei; Zhou, Ruhong; Luan, Binquan
2017-09-01
A globular protein’s folded structure in its physiological environment is largely determined by its amino acid sequence. Recently, newly discovered transformer proteins as well as intrinsically disordered proteins may adopt the folding-upon-binding mechanism where their secondary structures are highly dependent on their binding partners. Due to the various applications of nanomaterials in biological sensors and potential wearable devices, it is important to discover possible conformational changes of proteins on nanomaterials. Here, through molecular dynamics simulations, we show that the first 17 residues of the huntingtin protein (HTT-N17) exhibit appreciable differences during its folding on 2D-nanomaterials, such as graphene and MoS2 nanosheets. Namely, the protein is disordered on the graphene surface but is helical on the MoS2 surface. Despite that the amphiphilic environment at the nanosheet-water interface promotes the folding of the amphipathic proteins (such as HTT-N17), competitions between protein-nanosheet and intra-protein interactions yield very different protein conformations. Therefore, as engineered binding partners, nanomaterials might significantly affect the structures of adsorbed proteins.
Modelling the metabolism of protein secretion through the Tat route in Streptomyces lividans.
Valverde, José R; Gullón, Sonia; Mellado, Rafael P
2018-06-14
Streptomyces lividans has demonstrated its value as an efficient host for protein production due to its ability to secrete functional proteins directly to the media. Secretory proteins that use the major Sec route need to be properly folded outside the cell, whereas secretory proteins using the Tat route appear outside the cell correctly folded. This feature makes the Tat system very attractive for the production of natural or engineered Tat secretory proteins. S. lividans cells are known to respond differently to overproduction and secretion of Tat versus Sec proteins. Increased understanding of the impact of protein secretion through the Tat route can be obtained by a deeper analysis of the metabolic impact associated with protein production, and its dependence on protein origin, composition, secretion mechanisms, growth phases and nutrients. Flux Balance Analysis of Genome-Scale Metabolic Network models provides a theoretical framework to investigate cell metabolism under different constraints. We have built new models for various S. lividans strains to better understand the mechanisms associated with overproduction of proteins secreted through the Tat route. We compare models of an S. lividans Tat-dependent agarase overproducing strain with those of the S. lividans wild-type, an S. lividans strain carrying the multi-copy plasmid vector and an α-amylase Sec-dependent overproducing strain. Using updated genomic, transcriptomic and experimental data we could extend existing S. lividans models and produce a new model which produces improved results largely extending the coverage of S. lividans strains, the number of genes and reactions being considered, the predictive behaviour and the dependence on specification of exchange constraints. Comparison of the optimized solutions obtained highlights numerous changes between Tat- and Sec-dependent protein secreting strains affecting the metabolism of carbon, amino acids, nucleotides, lipids and cofactors, and variability analysis predicts a large potential for protein overproduction. This work provides a detailed look to metabolic changes associated to Tat-dependent protein secretion reproducing experimental observations and identifying changes that are specific to each secretory route, presenting a novel, improved, more accurate and strain-independent model of S. lividans, thus opening the way for enhanced metabolic engineering of protein overproduction in S. lividans.
Proteomic analysis of blastema formation in regenerating axolotl limbs
2009-01-01
Background Following amputation, urodele salamander limbs reprogram somatic cells to form a blastema that self-organizes into the missing limb parts to restore the structure and function of the limb. To help understand the molecular basis of blastema formation, we used quantitative label-free liquid chromatography-mass spectrometry/mass spectrometry (LC-MS/MS)-based methods to analyze changes in the proteome that occurred 1, 4 and 7 days post amputation (dpa) through the mid-tibia/fibula of axolotl hind limbs. Results We identified 309 unique proteins with significant fold change relative to controls (0 dpa), representing 10 biological process categories: (1) signaling, (2) Ca2+ binding and translocation, (3) transcription, (4) translation, (5) cytoskeleton, (6) extracellular matrix (ECM), (7) metabolism, (8) cell protection, (9) degradation, and (10) cell cycle. In all, 43 proteins exhibited exceptionally high fold changes. Of these, the ecotropic viral integrative factor 5 (EVI5), a cell cycle-related oncoprotein that prevents cells from entering the mitotic phase of the cell cycle prematurely, was of special interest because its fold change was exceptionally high throughout blastema formation. Conclusion Our data were consistent with previous studies indicating the importance of inositol triphosphate and Ca2+ signaling in initiating the ECM and cytoskeletal remodeling characteristic of histolysis and cell dedifferentiation. In addition, the data suggested that blastema formation requires several mechanisms to avoid apoptosis, including reduced metabolism, differential regulation of proapoptotic and antiapoptotic proteins, and initiation of an unfolded protein response (UPR). Since there is virtually no mitosis during blastema formation, we propose that high levels of EVI5 function to arrest dedifferentiated cells somewhere in the G1/S/G2 phases of the cell cycle until they have accumulated under the wound epidermis and enter mitosis in response to neural and epidermal factors. Our findings indicate the general value of quantitative proteomic analysis in understanding the regeneration of complex structures. PMID:19948009
Folding and self-assembly of polypeptides: Dynamics and thermodynamics from molecular simulation
NASA Astrophysics Data System (ADS)
Fluitt, Aaron Michael
Empowered by their exquisite three-dimensional structures, or "folds," proteins carry out biological tasks with high specificity, efficiency, and fidelity. The fold that optimizes biological function represents a stable configuration of the constituent polypeptide molecule(s) under physiological conditions. Proteins and polypeptides are not static, however: battered by thermal motion, they explore a distribution of folds that is determined by the sequence of amino acids, the presence and identity of other molecules, and the thermodynamic conditions. In this dissertation, we apply molecular simulation techniques to the study of two polypeptides that have unusually diffuse distributions of folds under physiological conditions: polyglutamine (polyQ) and islet amyloid polypeptide (IAPP). Neither polyQ nor IAPP adopts a predominant fold in dilute aqueous solution, but at sufficient concentrations, both are prone to self-assemble into stable, periodic, and highly regular aggregate structures known as amyloid. The appearance of amyloid deposits of polyQ in the brain, and of IAPP in the pancreas, are associated with Huntington's disease and type 2 diabetes, respectively. A molecular view of the mechanism(s) by which polyQ and IAPP fold and self-assemble will enhance our understanding of disease pathogenesis, and it has the potential to accelerate the development of therapeutics that target early-stage aggregates. Using molecular simulations with spatial and temporal resolution on the atomic scale, we present analyses of the structural distributions of polyQ and IAPP under various conditions, both in and out of equilibrium. In particular, we examine amyloid fibers of polyQ, the IAPP dimer in solution, and single IAPP fragments at a lipid bilayer. We also benchmark the molecular models, or "force fields," available for such studies, and we introduce a novel simulation algorithm.
A protein block based fold recognition method for the annotation of twilight zone sequences.
Suresh, V; Ganesan, K; Parthasarathy, S
2013-03-01
The description of protein backbone was recently improved with a group of structural fragments called Structural Alphabets instead of the regular three states (Helix, Sheet and Coil) secondary structure description. Protein Blocks is one of the Structural Alphabets used to describe each and every region of protein backbone including the coil. According to de Brevern (2000) the Protein Blocks has 16 structural fragments and each one has 5 residues in length. Protein Blocks fragments are highly informative among the available Structural Alphabets and it has been used for many applications. Here, we present a protein fold recognition method based on Protein Blocks for the annotation of twilight zone sequences. In our method, we align the predicted Protein Blocks of a query amino acid sequence with a library of assigned Protein Blocks of 953 known folds using the local pair-wise alignment. The alignment results with z-value ≥ 2.5 and P-value ≤ 0.08 are predicted as possible folds. Our method is able to recognize the possible folds for nearly 35.5% of the twilight zone sequences with their predicted Protein Block sequence obtained by pb_prediction, which is available at Protein Block Export server.
Ogawa, Seiji; Watanabe, Toshihide; Moriyuki, Kazumi; Goto, Yoshikazu; Yamane, Shinsaku; Watanabe, Akio; Tsuboi, Kazuma; Kinoshita, Atsushi; Okada, Takuya; Takeda, Hiroyuki; Tani, Kousuke; Maruyama, Toru
2016-05-15
The modification of the novel G protein-biased EP2 agonist 1 has been investigated to improve its G protein activity and develop a better understanding of its structure-functional selectivity relationship (SFSR). The optimization of the substituents on the phenyl ring of 1, followed by the inversion of the hydroxyl group on the cyclopentane moiety led to compound 9, which showed a 100-fold increase in its G protein activity compared with 1 without any increase in β-arrestin recruitment. Furthermore, SFSR studies revealed that the combination of meta and para substituents on the phenyl moiety was crucial to the functional selectivity. Copyright © 2016 Elsevier Ltd. All rights reserved.
Zhang, Lina; Boeren, Sjef; Hageman, Jos A; van Hooijdonk, Toon; Vervoort, Jacques; Hettinga, Kasper
2015-01-01
In order to better understand the milk proteome and its changes from colostrum to mature milk, samples taken at seven time points in the first 9 days from 4 individual cows were analyzed using proteomic techniques. Both the similarity in changes from day 0 to day 9 in the quantitative milk proteome, and the differences in specific protein abundance, were observed among four cows. One third of the quantified proteins showed a significant decrease in concentration over the first 9 days after calving, especially in the immune proteins (as much as 40 fold). Three relative high abundant enzymes (XDH, LPL, and RNASE1) and cell division and proliferation protein (CREG1) may be involved in the maturation of the gastro-intestinal tract. In addition, high correlations between proteins involved in complement and blood coagulation cascades illustrates the complex nature of biological interrelationships between milk proteins. The linear decrease of protease inhibitors and proteins involved in innate and adaptive immune system implies a protective role for protease inhibitor against degradation. In conclusion, the results found in this study not only improve our understanding of the role of colostrum in both host defense and development of the newborn calf but also provides guidance for the improvement of infant formula through better understanding of the complex interactions between milk proteins.
Solvent Effects on Protein Folding/Unfolding
NASA Astrophysics Data System (ADS)
García, A. E.; Hillson, N.; Onuchic, J. N.
Pressure effects on the hydrophobic potential of mean force led Hummer et al. to postulate a model for pressure denaturation of proteins in which denaturation occurs by means of water penetration into the protein interior, rather than by exposing the protein hydrophobic core to the solvent --- commonly used to describe temperature denaturation. We study the effects of pressure in protein folding/unfolding kinetics in an off-lattice minimalist model of a protein in which pressure effects have been incorporated by means of the pair-wise potential of mean force of hydrophobic groups in water. We show that pressure slows down the kinetics of folding by decreasing the reconfigurational diffusion coefficient and moves the location of the folding transition state.
Oxidative Folding and N-terminal Cyclization of Onconase+
Welker, Ervin; Hathaway, Laura; Xu, Guoqiang; Narayan, Mahesh; Pradeep, Lovy; Shin, Hang-Cheol; Scheraga, Harold A.
2008-01-01
Cyclization of the N-terminal glutamine residue to pyroglutamic acid in onconase, an anti-cancer chemotherapeutic agent, increases the activity and stability of the protein. Here, we examine the correlated effects of the folding/unfolding process and the formation of this N-terminal pyroglutamic acid. The results in this study indicate that cyclization of the N-terminal glutamine has no significant effect on the rate of either reductive unfolding or oxidative folding of the protein. Both the cyclized and uncyclized proteins seem to follow the same oxidative folding pathways; however, cyclization altered the relative flux of the protein in these two pathways by increasing the rate of formation of a kinetically trapped intermediate. Glutaminyl cyclase (QC) catalyzed the cyclization of the unfolded, reduced protein, but had no effect on the disulfide-intact, uncyclized, folded protein. The structured intermediates of uncyclized onconase were also resistant to QC-catalysis, consistent with their having a native-like fold. These observations suggest that, in vivo, cyclization takes place during the initial stages of oxidative folding, specifically, before the formation of structured intermediates. The competition between oxidative folding and QC-mediated cyclization suggests that QC-catalyzed cyclization of the N-terminal glutamine in onconase occurs in the endoplasmic reticulum, probably co-translationally. PMID:17439243
NASA Astrophysics Data System (ADS)
Camilloni, Carlo; Broglia, Ricardo A.; Tiana, Guido
2011-01-01
The study of the mechanism which is at the basis of the phenomenon of protein folding requires the knowledge of multiple folding trajectories under biological conditions. Using a biasing molecular-dynamics algorithm based on the physics of the ratchet-and-pawl system, we carry out all-atom, explicit solvent simulations of the sequence of folding events which proteins G, CI2, and ACBP undergo in evolving from the denatured to the folded state. Starting from highly disordered conformations, the algorithm allows the proteins to reach, at the price of a modest computational effort, nativelike conformations, within a root mean square deviation (RMSD) of approximately 1 Å. A scheme is developed to extract, from the myriad of events, information concerning the sequence of native contact formation and of their eventual correlation. Such an analysis indicates that all the studied proteins fold hierarchically, through pathways which, although not deterministic, are well-defined with respect to the order of contact formation. The algorithm also allows one to study unfolding, a process which looks, to a large extent, like the reverse of the major folding pathway. This is also true in situations in which many pathways contribute to the folding process, like in the case of protein G.
Pandurangan, Sudhakar; Diapari, Marwan; Yin, Fuqiang; Munholland, Seth; Perry, Gregory E.; Chapman, B. Patrick; Huang, Shangzhi; Sparvoli, Francesca; Bollini, Roberto; Crosby, William L.; Pauls, Karl P.; Marsolais, Frédéric
2016-01-01
A series of genetically related lines of common bean (Phaseolus vulgaris L.) integrate a progressive deficiency in major storage proteins, the 7S globulin phaseolin and lectins. SARC1 integrates a lectin-like protein, arcelin-1 from a wild common bean accession. SMARC1N-PN1 is deficient in major lectins, including erythroagglutinating phytohemagglutinin (PHA-E) but not α-amylase inhibitor, and incorporates also a deficiency in phaseolin. SMARC1-PN1 is intermediate and shares the phaseolin deficiency. Sanilac is the parental background. To understand the genomic basis for variations in protein profiles previously determined by proteomics, the genotypes were submitted to short-fragment genome sequencing using an Illumina HiSeq 2000/2500 platform. Reads were aligned to reference sequences and subjected to de novo assembly. The results of the analyses identified polymorphisms responsible for the lack of specific storage proteins, as well as those associated with large differences in storage protein expression. SMARC1N-PN1 lacks the lectin genes pha-E and lec4-B17, and has the pseudogene pdlec1 in place of the functional pha-L gene. While the α-phaseolin gene appears absent, an approximately 20-fold decrease in β-phaseolin accumulation is associated with a single nucleotide polymorphism converting a G-box to an ACGT motif in the proximal promoter. Among residual lectins compensating for storage protein deficiency, mannose lectin FRIL and α-amylase inhibitor 1 genes are uniquely present in SMARC1N-PN1. An approximately 50-fold increase in α-amylase inhibitor like protein accumulation is associated with multiple polymorphisms introducing up to eight potential positive cis-regulatory elements in the proximal promoter specific to SMARC1N-PN1. An approximately 7-fold increase in accumulation of 11S globulin legumin is not associated with variation in proximal promoter sequence, suggesting that the identity of individual proteins involved in proteome rebalancing might also be determined at the translational level. PMID:27066039
Coarse-grained sequences for protein folding and design.
Brown, Scott; Fawzi, Nicolas J; Head-Gordon, Teresa
2003-09-16
We present the results of sequence design on our off-lattice minimalist model in which no specification of native-state tertiary contacts is needed. We start with a sequence that adopts a target topology and build on it through sequence mutation to produce new sequences that comprise distinct members within a target fold class. In this work, we use the alpha/beta ubiquitin fold class and design two new sequences that, when characterized through folding simulations, reproduce the differences in folding mechanism seen experimentally for proteins L and G. The primary implication of this work is that patterning of hydrophobic and hydrophilic residues is the physical origin for the success of relative contact-order descriptions of folding, and that these physics-based potentials provide a predictive connection between free energy landscapes and amino acid sequence (the original protein folding problem). We present results of the sequence mapping from a 20- to the three-letter code for determining a sequence that folds into the WW domain topology to illustrate future extensions to protein design.
Coarse-grained sequences for protein folding and design
Brown, Scott; Fawzi, Nicolas J.; Head-Gordon, Teresa
2003-01-01
We present the results of sequence design on our off-lattice minimalist model in which no specification of native-state tertiary contacts is needed. We start with a sequence that adopts a target topology and build on it through sequence mutation to produce new sequences that comprise distinct members within a target fold class. In this work, we use the α/β ubiquitin fold class and design two new sequences that, when characterized through folding simulations, reproduce the differences in folding mechanism seen experimentally for proteins L and G. The primary implication of this work is that patterning of hydrophobic and hydrophilic residues is the physical origin for the success of relative contact-order descriptions of folding, and that these physics-based potentials provide a predictive connection between free energy landscapes and amino acid sequence (the original protein folding problem). We present results of the sequence mapping from a 20- to the three-letter code for determining a sequence that folds into the WW domain topology to illustrate future extensions to protein design. PMID:12963815
Effective Potentials for Folding Proteins
NASA Astrophysics Data System (ADS)
Chen, Nan-Yow; Su, Zheng-Yao; Mou, Chung-Yu
2006-02-01
A coarse-grained off-lattice model that is not biased in any way to the native state is proposed to fold proteins. To predict the native structure in a reasonable time, the model has included the essential effects of water in an effective potential. Two new ingredients, the dipole-dipole interaction and the local hydrophobic interaction, are introduced and are shown to be as crucial as the hydrogen bonding. The model allows successful folding of the wild-type sequence of protein G and may have provided important hints to the study of protein folding.
From laws of inference to protein folding dynamics.
Tseng, Chih-Yuan; Yu, Chun-Ping; Lee, H C
2010-08-01
Protein folding dynamics is one of major issues constantly investigated in the study of protein functions. The molecular dynamic (MD) simulation with the replica exchange method (REM) is a common theoretical approach considered. Yet a trade-off in applying the REM is that the dynamics toward the native configuration in the simulations seems lost. In this work, we show that given REM-MD simulation results, protein folding dynamics can be directly derived from laws of inference. The applicability of the resulting approach, the entropic folding dynamics, is illustrated by investigating a well-studied Trp-cage peptide. Our results are qualitatively comparable with those from other studies. The current studies suggest that the incorporation of laws of inference and physics brings in a comprehensive perspective on exploring the protein folding dynamics.
Universality and diversity of folding mechanics for three-helix bundle proteins.
Yang, Jae Shick; Wallin, Stefan; Shakhnovich, Eugene I
2008-01-22
In this study we evaluate, at full atomic detail, the folding processes of two small helical proteins, the B domain of protein A and the Villin headpiece. Folding kinetics are studied by performing a large number of ab initio Monte Carlo folding simulations using a single transferable all-atom potential. Using these trajectories, we examine the relaxation behavior, secondary structure formation, and transition-state ensembles (TSEs) of the two proteins and compare our results with experimental data and previous computational studies. To obtain a detailed structural information on the folding dynamics viewed as an ensemble process, we perform a clustering analysis procedure based on graph theory. Moreover, rigorous p(fold) analysis is used to obtain representative samples of the TSEs and a good quantitative agreement between experimental and simulated Phi values is obtained for protein A. Phi values for Villin also are obtained and left as predictions to be tested by future experiments. Our analysis shows that the two-helix hairpin is a common partially stable structural motif that gets formed before entering the TSE in the studied proteins. These results together with our earlier study of Engrailed Homeodomain and recent experimental studies provide a comprehensive, atomic-level picture of folding mechanics of three-helix bundle proteins.
Analysis of the Free-Energy Surface of Proteins from Reversible Folding Simulations
Allen, Lucy R.; Krivov, Sergei V.; Paci, Emanuele
2009-01-01
Computer generated trajectories can, in principle, reveal the folding pathways of a protein at atomic resolution and possibly suggest general and simple rules for predicting the folded structure of a given sequence. While such reversible folding trajectories can only be determined ab initio using all-atom transferable force-fields for a few small proteins, they can be determined for a large number of proteins using coarse-grained and structure-based force-fields, in which a known folded structure is by construction the absolute energy and free-energy minimum. Here we use a model of the fast folding helical λ-repressor protein to generate trajectories in which native and non-native states are in equilibrium and transitions are accurately sampled. Yet, representation of the free-energy surface, which underlies the thermodynamic and dynamic properties of the protein model, from such a trajectory remains a challenge. Projections over one or a small number of arbitrarily chosen progress variables often hide the most important features of such surfaces. The results unequivocally show that an unprojected representation of the free-energy surface provides important and unbiased information and allows a simple and meaningful description of many-dimensional, heterogeneous trajectories, providing new insight into the possible mechanisms of fast-folding proteins. PMID:19593364
Analysis of the free-energy surface of proteins from reversible folding simulations.
Allen, Lucy R; Krivov, Sergei V; Paci, Emanuele
2009-07-01
Computer generated trajectories can, in principle, reveal the folding pathways of a protein at atomic resolution and possibly suggest general and simple rules for predicting the folded structure of a given sequence. While such reversible folding trajectories can only be determined ab initio using all-atom transferable force-fields for a few small proteins, they can be determined for a large number of proteins using coarse-grained and structure-based force-fields, in which a known folded structure is by construction the absolute energy and free-energy minimum. Here we use a model of the fast folding helical lambda-repressor protein to generate trajectories in which native and non-native states are in equilibrium and transitions are accurately sampled. Yet, representation of the free-energy surface, which underlies the thermodynamic and dynamic properties of the protein model, from such a trajectory remains a challenge. Projections over one or a small number of arbitrarily chosen progress variables often hide the most important features of such surfaces. The results unequivocally show that an unprojected representation of the free-energy surface provides important and unbiased information and allows a simple and meaningful description of many-dimensional, heterogeneous trajectories, providing new insight into the possible mechanisms of fast-folding proteins.
Roles of β-Turns in Protein Folding: From Peptide Models to Protein Engineering
Marcelino, Anna Marie C.; Gierasch, Lila M.
2010-01-01
Reverse turns are a major class of protein secondary structure; they represent sites of chain reversal and thus sites where the globular character of a protein is created. It has been speculated for many years that turns may nucleate the formation of structure in protein folding, as their propensity to occur will favor the approximation of their flanking regions and their general tendency to be hydrophilic will favor their disposition at the solvent-accessible surface. Reverse turns are local features, and it is therefore not surprising that their structural properties have been extensively studied using peptide models. In this article, we review research on peptide models of turns to test the hypothesis that the propensities of turns to form in short peptides will relate to the roles of corresponding sequences in protein folding. Turns with significant stability as isolated entities should actively promote the folding of a protein, and by contrast, turn sequences that merely allow the chain to adopt conformations required for chain reversal are predicted to be passive in the folding mechanism. We discuss results of protein engineering studies of the roles of turn residues in folding mechanisms. Factors that correlate with the importance of turns in folding indeed include their intrinsic stability, as well as their topological context and their participation in hydrophobic networks within the protein’s structure. PMID:18275088
How cooperative are protein folding and unfolding transitions?
Malhotra, Pooja
2016-01-01
Abstract A thermodynamically and kinetically simple picture of protein folding envisages only two states, native (N) and unfolded (U), separated by a single activation free energy barrier, and interconverting by cooperative two‐state transitions. The folding/unfolding transitions of many proteins occur, however, in multiple discrete steps associated with the formation of intermediates, which is indicative of reduced cooperativity. Furthermore, much advancement in experimental and computational approaches has demonstrated entirely non‐cooperative (gradual) transitions via a continuum of states and a multitude of small energetic barriers between the N and U states of some proteins. These findings have been instrumental towards providing a structural rationale for cooperative versus noncooperative transitions, based on the coupling between interaction networks in proteins. The cooperativity inherent in a folding/unfolding reaction appears to be context dependent, and can be tuned via experimental conditions which change the stabilities of N and U. The evolution of cooperativity in protein folding transitions is linked closely to the evolution of function as well as the aggregation propensity of the protein. A large activation energy barrier in a fully cooperative transition can provide the kinetic control required to prevent the accumulation of partially unfolded forms, which may promote aggregation. Nevertheless, increasing evidence for barrier‐less “downhill” folding, as well as for continuous “uphill” unfolding transitions, indicate that gradual non‐cooperative processes may be ubiquitous features on the free energy landscape of protein folding. PMID:27522064
Detergent-associated solution conformations of helical and beta-barrel membrane proteins.
Mo, Yiming; Lee, Byung-Kwon; Ankner, John F; Becker, Jeffrey M; Heller, William T
2008-10-23
Membrane proteins present major challenges for structural biology. In particular, the production of suitable crystals for high-resolution structural determination continues to be a significant roadblock for developing an atomic-level understanding of these vital cellular systems. The use of detergents for extracting membrane proteins from the native membrane for either crystallization or reconstitution into model lipid membranes for further study is assumed to leave the protein with the proper fold with a belt of detergent encompassing the membrane-spanning segments of the structure. Small-angle X-ray scattering was used to probe the detergent-associated solution conformations of three membrane proteins, namely bacteriorhodopsin (BR), the Ste2p G-protein coupled receptor from Saccharomyces cerevisiae, and the Escherichia coli porin OmpF. The results demonstrate that, contrary to the traditional model of a detergent-associated membrane protein, the helical proteins BR and Ste2p are not in the expected, compact conformation and associated with detergent micelles, while the beta-barrel OmpF is indeed embedded in a disk-like micelle in a properly folded state. The comparison provided by the BR and Ste2p, both members of the 7TM family of helical membrane proteins, further suggests that the interhelical interactions between the transmembrane helices of the two proteins differ, such that BR, like other rhodopsins, can properly refold to crystallize, while Ste2p continues to prove resistant to crystallization from an initially detergent-associated state.
Detergent-associated Solution Conformations of Helical and Beta-barrel Membrane Proteins
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mo, Yiming; Lee, Byung-Kwon; Ankner, John Francis
2008-01-01
Membrane proteins present major challenges for structural biology. In particular, the production of suitable crystals for high-resolution structural determination continues to be a significant roadblock for developing an atomic-level understanding of these vital cellular systems. The use of detergents for extracting membrane proteins from the native membrane for either crystallization or reconstitution into model lipid membranes for further study is assumed to leave the protein with the proper fold with a belt of detergent encompassing the membrane-spanning segments of the structure. Small-angle X-ray scattering was used to probe the detergent-associated solution conformations of three membrane proteins, namely bacteriorhodopsin (BR), themore » Ste2p G-protein coupled receptor from Saccharomyces cerevisiae, and the Escherichia coli porin OmpF. The results demonstrate that, contrary to the traditional model of a detergent-associated membrane protein, the helical proteins BR and Ste2p are not in the expected, compact conformation and associated with detergent micelles, while the ?-barrel OmpF is indeed embedded in a disk-like micelle in a properly folded state. The comparison provided by the BR and Ste2p, both members of the 7TM family of helical membrane proteins, further suggests that the interhelical interactions between the transmembrane helices of the two proteins differ, such that BR, like other rhodopsins, can properly refold to crystallize, while Ste2p continues to prove resistant to crystallization from an initially detergent-associated state.« less
von Holst, Hans; Li, Xiaogai
2013-07-01
Although the consequences of traumatic brain injury (TBI) and its treatment have been improved, there is still a substantial lack of understanding the mechanisms. Numerical simulation of the impact can throw further lights on site and mechanism of action. A finite element model of the human head and brain tissue was used to simulate TBI. The consequences of gradually increased kinetic energy transfer was analyzed by evaluating the impact intracranial pressure (ICP), strain level, and their potential influences on binding forces in folded protein structures. The gradually increased kinetic energy was found to have the potential to break apart bonds of Van der Waals in all impacts and hydrogen bonds at simulated impacts from 6 m/s and higher, thereby superseding the energy in folded protein structures. Further, impacts below 6 m/s showed none or very slight increase in impact ICP and strain levels, whereas impacts of 6 m/s or higher showed a gradual increase of the impact ICP and strain levels reaching over 1000 KPa and over 30%, respectively. The present simulation study shows that the free kinetic energy transfer, impact ICP, and strain levels all have the potential to initiate cytotoxic brain tissue edema by unfolding protein structures. The definition of mild, moderate, and severe TBI should thus be looked upon as the same condition and separated only by a gradual severity of impact.
Communication: Role of explicit water models in the helix folding/unfolding processes
NASA Astrophysics Data System (ADS)
Palazzesi, Ferruccio; Salvalaglio, Matteo; Barducci, Alessandro; Parrinello, Michele
2016-09-01
In the last years, it has become evident that computer simulations can assume a relevant role in modelling protein dynamical motions for their ability to provide a full atomistic image of the processes under investigation. The ability of the current protein force-fields in reproducing the correct thermodynamics and kinetics systems behaviour is thus an essential ingredient to improve our understanding of many relevant biological functionalities. In this work, employing the last developments of the metadynamics framework, we compare the ability of state-of-the-art all-atom empirical functions and water models to consistently reproduce the folding and unfolding of a helix turn motif in a model peptide. This theoretical study puts in evidence that the choice of the water models can influence the thermodynamic and the kinetics of the system under investigation, and for this reason cannot be considered trivial.
Fragility of Liquids, Polyamorphism, Nucleation, and Folding Directions, in the Landscape Paradigm
NASA Astrophysics Data System (ADS)
Angell, C. A.
1998-03-01
Combination of the observations that cold glasses are rigid over eons, warm glasses (near their glass transitions) can densify, and most cooling liquids would, by extrapolation of observed entropy-temperature functions, achieve negative entropies well above OK if not salvaged by kinetics leads to the notion that substances which cannot find deep energy minima by 3 dimensional ordering of their particles must ``live" on a landscape of innumerable closely spaced and interconnected energy ``basins of probability" , the configuration space of amorphous substances. This energy hypersurface, which current studies suggest has a comparable multiplicity per heavy-atom particle regardless of how the particles are connected, provides the source of both excess liquid entropy and low temperature glass rigidity. If the energy distribution is narrow, the total entropy can be excited over a narrow temperature interval above T_g. The liquid then exhibits a high excess heat capacity and, directionally bonded cases aside, so called ``fragile" kinetic characteristics. A dynamic crossover to a liquid with different behavior occurs when the landscape entropy is almost fully excited. In extreme (highly cooperative) cases the crossover can occur via a first order transition in which much of the landscape entropy is excited isothermally. While this is rare in liquids, and in any case is difficult to observe because of promoted crystallization, it is quite common in mesoscopic systems which lack ordered packing possibilities but do have low energy amorphous configurations available. This is the case with proteins which often have ``all or nothing" transitions between folded and unfolded states, and behave like ``glassy" systems in their native (folded) states. Since phase transitions in many particle systems require nucleation, the identification of the nucleation step and its kinetic distinction from overall folding kinetics should be an important part of the understanding of the protein folding problem. The possibility exists that in certain cases an aberrant step in the nucleation event, facilitated by mutant nucleotide sequences or by third agents (heterogeneous nucleating agents), will trigger folding down an alternative and pathogenic route to a second stable state. This possibility should be evaluated, using nucleation kinetics analysis techniques, as an approach to understanding the initiation of ``mad cow" disease cerebral pathology.
Picosecond to nanosecond dynamics provide a source of conformational entropy for protein folding.
Stadler, Andreas M; Demmel, Franz; Ollivier, Jacques; Seydel, Tilo
2016-08-03
Myoglobin can be trapped in fully folded structures, partially folded molten globules, and unfolded states under stable equilibrium conditions. Here, we report an experimental study on the conformational dynamics of different folded conformational states of apo- and holomyoglobin in solution. Global protein diffusion and internal molecular motions were probed by neutron time-of-flight and neutron backscattering spectroscopy on the picosecond and nanosecond time scales. Global protein diffusion was found to depend on the α-helical content of the protein suggesting that charges on the macromolecule increase the short-time diffusion of protein. With regard to the molten globules, a gel-like phase due to protein entanglement and interactions with neighbouring macromolecules was visible due to a reduction of the global diffusion coefficients on the nanosecond time scale. Diffusion coefficients, residence and relaxation times of internal protein dynamics and root mean square displacements of localised internal motions were determined for the investigated structural states. The difference in conformational entropy ΔSconf of the protein between the unfolded and the partially or fully folded conformations was extracted from the measured root mean square displacements. Using thermodynamic parameters from the literature and the experimentally determined ΔSconf values we could identify the entropic contribution of the hydration shell ΔShydr of the different folded states. Our results point out the relevance of conformational entropy of the protein and the hydration shell for stability and folding of myoglobin.
Calosci, Nicoletta; Chi, Celestine N.; Richter, Barbara; Camilloni, Carlo; Engström, Åke; Eklund, Lars; Travaglini-Allocatelli, Carlo; Gianni, Stefano; Vendruscolo, Michele; Jemth, Per
2008-01-01
The energy landscape theory provides a general framework for describing protein folding reactions. Because a large number of studies, however, have focused on two-state proteins with single well-defined folding pathways and without detectable intermediates, the extent to which free energy landscapes are shaped up by the native topology at the early stages of the folding process has not been fully characterized experimentally. To this end, we have investigated the folding mechanisms of two homologous three-state proteins, PTP-BL PDZ2 and PSD-95 PDZ3, and compared the early and late transition states on their folding pathways. Through a combination of Φ value analysis and molecular dynamics simulations we obtained atomic-level structures of the transition states of these homologous three-state proteins and found that the late transition states are much more structurally similar than the early ones. Our findings thus reveal that, while the native state topology defines essentially in a unique way the late stages of folding, it leaves significant freedom to the early events, a result that reflects the funneling of the free energy landscape toward the native state. PMID:19033470
Shojaei Saadi, Habib A; van Riemsdijk, Evine; Dance, Alysha L; Rajamanickam, Gayathri D; Kastelic, John P; Thundathil, Jacob C
2013-04-26
The objective was to investigate expression patterns of proteins in pyriform sperm, a common morphological abnormality in bull sperm. Ejaculates were collected from sexually mature Holstein bulls (n=3) twice weekly for 10 weeks (pre-thermal insult samples). Testicular temperature was elevated in all bulls by scrotal insulation for 72 consecutive hours during week 2. Total sperm proteins were extracted from pre- and post-thermal insult sperm samples and subjected to two-dimensional gel electrophoresis. Among the protein spots detected, 131 spots were significantly expressed (False Detection Rate <0.01) with ≥ 2 fold changes between normal and pyriform sperm. Among them, 25 spots with ≥ 4 fold difference in expression patterns were identified using liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS). Expression of several proteins involved in sperm capacitation, sperm-egg interaction and sperm cytoskeletal structure was decreased in pyriform sperm, whereas proteins regulating antioxidant activity, apoptosis and metabolic activity were increased. Contents of reactive oxygen species and ubiquitinated proteins were higher in pyriform sperm. In addition to understanding the molecular basis of functional deficiencies in sperm with specific morphological abnormalities, comparing normal versus morphologically abnormal sperm appeared to be a suitable experimental model for identifying important sperm functional proteins. To our knowledge, this study is the first report on differential expression of proteins in pyriform bovine sperm versus morphologically normal sperm. We report that expression of several proteins involved in sperm capacitation, sperm-egg interaction and sperm cytoskeletal structure was decreased in pyriform sperm, whereas proteins which regulate antioxidant activity, apoptosis and metabolic activity were increased. Contents of reactive oxygen species and ubiquitinated proteins were higher in pyriform sperm. In addition to understanding the molecular basis of functional deficiencies in sperm with specific morphological abnormalities, our results suggest that comparing normal versus morphologically abnormal sperm appeared to be a suitable experimental model for identifying important sperm functional proteins. Copyright © 2013 Elsevier B.V. All rights reserved.
Acceleration through passive destabilization: protein folding in a weak hydrophobic environment
NASA Astrophysics Data System (ADS)
Jewett, Andrew; Baumketner, Andrij; Shea, Joan-Emma
2004-03-01
The GroEL chaperonin is a biomolecule which assists the folding of an extremely diverse range of proteins in Eubacteria. Some proteins undergo many rounds of ATP-regulated binding and dissociation from GroEL/ES before folding. It has been proposed that transient stress from ATP-regulated binding and release from GroEL/ES frees frustrated proteins from misfolded conformations. However recent evidence suggests that chaperonin-accelerated protein folding can take place entirely within a mutated GroEL+ES cavity that is unable to open and release the protein. Using molecular dynamics, we demonstrate that static confinement within a weakly hydrophobic (attractive) cavity (similar to the interior of the cavity formed by the GroEL+ES complex) is sufficient to significantly accelerate the folding of a highly frustrated protein-like heteropolymer. Our frustrated molecule benifits kinetically from a static hydrophobic environment that destabilizes misfolded conformations. This may shed light on the mechanisms used by other chaperones which do not depend on ATP.
Folding energy landscape and network dynamics of small globular proteins
Hori, Naoto; Chikenji, George; Berry, R. Stephen; Takada, Shoji
2009-01-01
The folding energy landscape of proteins has been suggested to be funnel-like with some degree of ruggedness on the slope. How complex the landscape, however, is still rather unclear. Many experiments for globular proteins suggested relative simplicity, whereas molecular simulations of shorter peptides implied more complexity. Here, by using complete conformational sampling of 2 globular proteins, protein G and src SH3 domain and 2 related random peptides, we investigated their energy landscapes, topological properties of folding networks, and folding dynamics. The projected energy surfaces of globular proteins were funneled in the vicinity of the native but also have other quite deep, accessible minima, whereas the randomized peptides have many local basins, including some leading to seriously misfolded forms. Dynamics in the denatured part of the network exhibited basin-hopping itinerancy among many conformations, whereas the protein reached relatively well-defined final stages that led to their native states. We also found that the folding network has the hierarchic nature characterized by the scale-free and the small-world properties. PMID:19114654
Folding energy landscape and network dynamics of small globular proteins.
Hori, Naoto; Chikenji, George; Berry, R Stephen; Takada, Shoji
2009-01-06
The folding energy landscape of proteins has been suggested to be funnel-like with some degree of ruggedness on the slope. How complex the landscape, however, is still rather unclear. Many experiments for globular proteins suggested relative simplicity, whereas molecular simulations of shorter peptides implied more complexity. Here, by using complete conformational sampling of 2 globular proteins, protein G and src SH3 domain and 2 related random peptides, we investigated their energy landscapes, topological properties of folding networks, and folding dynamics. The projected energy surfaces of globular proteins were funneled in the vicinity of the native but also have other quite deep, accessible minima, whereas the randomized peptides have many local basins, including some leading to seriously misfolded forms. Dynamics in the denatured part of the network exhibited basin-hopping itinerancy among many conformations, whereas the protein reached relatively well-defined final stages that led to their native states. We also found that the folding network has the hierarchic nature characterized by the scale-free and the small-world properties.
High-throughput SRCD using multi-well plates and its applications
NASA Astrophysics Data System (ADS)
Hussain, Rohanah; Jávorfi, Tamás; Rudd, Timothy R.; Siligardi, Giuliano
2016-12-01
The sample compartment for high-throughput synchrotron radiation circular dichroism (HT-SRCD) has been developed to satisfy an increased demand of protein characterisation in terms of folding and binding interaction properties not only in the traditional field of structural biology but also in the growing research area of material science with the potential to save time by 80%. As the understanding of protein behaviour in different solvent environments has increased dramatically the development of novel functions such as recombinant proteins modified to have different functions from harvesting solar energy to metabolonics for cleaning heavy and metal and organic molecule pollutions, there is a need to characterise speedily these system.
Evolutionary trend toward kinetic stability in the folding trajectory of RNases H
Lim, Shion A.; Hart, Kathryn M.; Marqusee, Susan
2016-01-01
Proper folding of proteins is critical to producing the biological machinery essential for cellular function. The rates and energetics of a protein’s folding process, which is described by its energy landscape, are encoded in the amino acid sequence. Over the course of evolution, this landscape must be maintained such that the protein folds and remains folded over a biologically relevant time scale. How exactly a protein’s energy landscape is maintained or altered throughout evolution is unclear. To study how a protein’s energy landscape changed over time, we characterized the folding trajectories of ancestral proteins of the ribonuclease H (RNase H) family using ancestral sequence reconstruction to access the evolutionary history between RNases H from mesophilic and thermophilic bacteria. We found that despite large sequence divergence, the overall folding pathway is conserved over billions of years of evolution. There are robust trends in the rates of protein folding and unfolding; both modern RNases H evolved to be more kinetically stable than their most recent common ancestor. Finally, our study demonstrates how a partially folded intermediate provides a readily adaptable folding landscape by allowing the independent tuning of kinetics and thermodynamics. PMID:27799545
Bastolla, Ugo
2014-01-01
The properties of biomolecules depend both on physics and on the evolutionary process that formed them. These two points of view produce a powerful synergism. Physics sets the stage and the constraints that molecular evolution has to obey, and evolutionary theory helps in rationalizing the physical properties of biomolecules, including protein folding thermodynamics. To complete the parallelism, protein thermodynamics is founded on the statistical mechanics in the space of protein structures, and molecular evolution can be viewed as statistical mechanics in the space of protein sequences. In this review, we will integrate both points of view, applying them to detecting selection on the stability of the folded state of proteins. We will start discussing positive design, which strengthens the stability of the folded against the unfolded state of proteins. Positive design justifies why statistical potentials for protein folding can be obtained from the frequencies of structural motifs. Stability against unfolding is easier to achieve for longer proteins. On the contrary, negative design, which consists in destabilizing frequently formed misfolded conformations, is more difficult to achieve for longer proteins. The folding rate can be enhanced by strengthening short-range native interactions, but this requirement contrasts with negative design, and evolution has to trade-off between them. Finally, selection can accelerate functional movements by favoring low frequency normal modes of the dynamics of the native state that strongly correlate with the functional conformation change. PMID:24970217
Theil, Elizabeth C.; Turano, Paola; Ghini, Veronica; Allegrozzi, Marco; Bernacchioni, Caterina
2014-01-01
Integrated ferritin protein cage function is the reversible synthesis of protein-caged, solid Fe2O3•H2O minerals from Fe2+, for metabolic iron concentrates and oxidant protection; biomineral order varies in different ferritin proteins. The conserved 4, 3, 2 geometric symmetry of ferritin protein cages, parallels subunit dimer, trimer and tetramer interfaces, and coincides with function at several cage axes. Multiple subdomains distributed in the self- assembling ferritin nanocages have functional relationships to cage symmetry such as Fe2+ transport though ion channels (3-fold symmetry), biomineral nucleation/order (4-fold symmetry) and mineral dissolution (3-fold symmetry) studied in ferritin variants. Cage subunit dimers (2-fold symmetry) influence iron oxidation and mineral dissolution, based on effects of natural or synthetic subunit dimer crosslinks. 2Fe2+/O2 catalysis in ferritin occurs in single subunits, but with cooperativity (n=3) that is possibly related to the structure/function of the ion channels, which are constructed from segments of 3 subunits. Here, we study 2Fe2+ + O2 protein catalysis (diferric peroxo formation) and dissolution of ferritin Fe2O3•H2O biominerals in variants with altered subunit interfaces for trimers (ion channels), E130I, and external dimer surfaces (E88A) as controls, and altered tetramer subunit interfaces (L165I and H169F). The results extend observations on the functional importance of structure at ferritin protein 2-fold and 3-fold cage axes to show function at ferritin 4-fold cage axes. Here, conserved amino acids facilitate dissolution of ferritin protein-caged iron biominerals. Biological and nanotechnological uses of ferritin protein cage 4-fold symmetry and solid state mineral properties remain largely unexplored. PMID:24504941
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lewis, Hal A.; Zhao, Xun; Wang, Chi
Cystic fibrosis is caused by defects in the cystic fibrosis transmembrane conductance regulator (CFTR), commonly the deletion of residue Phe-508 (DeltaF508) in the first nucleotide-binding domain (NBD1), which results in a severe reduction in the population of functional channels at the epithelial cell surface. Previous studies employing incomplete NBD1 domains have attributed this to aberrant folding of DeltaF508 NBD1. We report structural and biophysical studies on complete human NBD1 domains, which fail to demonstrate significant changes of in vitro stability or folding kinetics in the presence or absence of the DeltaF508 mutation. Crystal structures show minimal changes in protein conformationmore » but substantial changes in local surface topography at the site of the mutation, which is located in the region of NBD1 believed to interact with the first membrane spanning domain of CFTR. These results raise the possibility that the primary effect of DeltaF508 is a disruption of proper interdomain interactions at this site in CFTR rather than interference with the folding of NBD1. Interestingly, increases in the stability of NBD1 constructs are observed upon introduction of second-site mutations that suppress the trafficking defect caused by the DeltaF508 mutation, suggesting that these suppressors might function indirectly by improving the folding efficiency of NBD1 in the context of the full-length protein. The human NBD1 structures also solidify the understanding of CFTR regulation by showing that its two protein segments that can be phosphorylated both adopt multiple conformations that modulate access to the ATPase active site and functional interdomain interfaces.« less
Structural and kinetic mapping of side-chain exposure onto the protein energy landscape.
Bernstein, Rachel; Schmidt, Kierstin L; Harbury, Pehr B; Marqusee, Susan
2011-06-28
Identification and characterization of structural fluctuations that occur under native conditions is crucial for understanding protein folding and function, but such fluctuations are often rare and transient, making them difficult to study. Native-state hydrogen exchange (NSHX) has been a powerful tool for identifying such rarely populated conformations, but it generally reveals no information about the placement of these species along the folding reaction coordinate or the barriers separating them from the folded state and provides little insight into side-chain packing. To complement such studies, we have performed native-state alkyl-proton exchange, a method analogous to NSHX that monitors cysteine modification rather than backbone amide exchange, to examine the folding landscape of Escherichia coli ribonuclease H, a protein well characterized by hydrogen exchange. We have chosen experimental conditions such that the rate-limiting barrier acts as a kinetic partition: residues that become exposed only upon crossing the unfolding barrier are modified in the EX1 regime (alkylation rates report on the rate of unfolding), while those exposed on the native side of the barrier are modified predominantly in the EX2 regime (alkylation rates report on equilibrium populations). This kinetic partitioning allows for identification and placement of partially unfolded forms along the reaction coordinate. Using this approach we detect previously unidentified, rarely populated conformations residing on the native side of the barrier and identify side chains that are modified only upon crossing the unfolding barrier. Thus, in a single experiment under native conditions, both sides of the rate-limiting barrier are investigated.
Structural and kinetic mapping of side-chain exposure onto the protein energy landscape
Bernstein, Rachel; Schmidt, Kierstin L.; Harbury, Pehr B.; Marqusee, Susan
2011-01-01
Identification and characterization of structural fluctuations that occur under native conditions is crucial for understanding protein folding and function, but such fluctuations are often rare and transient, making them difficult to study. Native-state hydrogen exchange (NSHX) has been a powerful tool for identifying such rarely populated conformations, but it generally reveals no information about the placement of these species along the folding reaction coordinate or the barriers separating them from the folded state and provides little insight into side-chain packing. To complement such studies, we have performed native-state alkyl-proton exchange, a method analogous to NSHX that monitors cysteine modification rather than backbone amide exchange, to examine the folding landscape of Escherichia coli ribonuclease H, a protein well characterized by hydrogen exchange. We have chosen experimental conditions such that the rate-limiting barrier acts as a kinetic partition: residues that become exposed only upon crossing the unfolding barrier are modified in the EX1 regime (alkylation rates report on the rate of unfolding), while those exposed on the native side of the barrier are modified predominantly in the EX2 regime (alkylation rates report on equilibrium populations). This kinetic partitioning allows for identification and placement of partially unfolded forms along the reaction coordinate. Using this approach we detect previously unidentified, rarely populated conformations residing on the native side of the barrier and identify side chains that are modified only upon crossing the unfolding barrier. Thus, in a single experiment under native conditions, both sides of the rate-limiting barrier are investigated. PMID:21670244
The IRE1/bZIP60 pathway are activated by potexvirus and potyvirus small membrane binding proteins
USDA-ARS?s Scientific Manuscript database
The endoplasmic reticulum provides an environment for protein synthesis, folding and distribution to all corners of the cell. With respect to protein synthesis and folding, quality production is central to maintaining homeostasis. When conditions occur that disrupt the folding capacity of the ER cau...
Principles of protein folding--a perspective from simple exact models.
Dill, K. A.; Bromberg, S.; Yue, K.; Fiebig, K. M.; Yee, D. P.; Thomas, P. D.; Chan, H. S.
1995-01-01
General principles of protein structure, stability, and folding kinetics have recently been explored in computer simulations of simple exact lattice models. These models represent protein chains at a rudimentary level, but they involve few parameters, approximations, or implicit biases, and they allow complete explorations of conformational and sequence spaces. Such simulations have resulted in testable predictions that are sometimes unanticipated: The folding code is mainly binary and delocalized throughout the amino acid sequence. The secondary and tertiary structures of a protein are specified mainly by the sequence of polar and nonpolar monomers. More specific interactions may refine the structure, rather than dominate the folding code. Simple exact models can account for the properties that characterize protein folding: two-state cooperativity, secondary and tertiary structures, and multistage folding kinetics--fast hydrophobic collapse followed by slower annealing. These studies suggest the possibility of creating "foldable" chain molecules other than proteins. The encoding of a unique compact chain conformation may not require amino acids; it may require only the ability to synthesize specific monomer sequences in which at least one monomer type is solvent-averse. PMID:7613459
Unfolding the chaperone story.
Hartl, F Ulrich
2017-11-01
Protein folding in the cell was originally assumed to be a spontaneous process, based on Anfinsen's discovery that purified proteins can fold on their own after removal from denaturant. Consequently cell biologists showed little interest in the protein folding process. This changed only in the mid and late 1980s, when the chaperone story began to unfold. As a result, we now know that in vivo, protein folding requires assistance by a complex machinery of molecular chaperones. To ensure efficient folding, members of different chaperone classes receive the nascent protein chain emerging from the ribosome and guide it along an ordered pathway toward the native state. I was fortunate to contribute to these developments early on. In this short essay, I will describe some of the critical steps leading to the current concept of protein folding as a highly organized cellular process. © 2017 Hartl. This article is distributed by The American Society for Cell Biology under license from the author(s). Two months after publication it is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).
Transiently disordered tails accelerate folding of globular proteins.
Mallik, Saurav; Ray, Tanaya; Kundu, Sudip
2017-07-01
Numerous biological proteins exhibit intrinsic disorder at their termini, which are associated with multifarious functional roles. Here, we show the surprising result that an increased percentage of terminal short transiently disordered regions with enhanced flexibility (TstDREF) is associated with accelerated folding rates of globular proteins. Evolutionary conservation of predicted disorder at TstDREFs and drastic alteration of folding rates upon point-mutations suggest critical regulatory role(s) of TstDREFs in shaping the folding kinetics. TstDREFs are associated with long-range intramolecular interactions and the percentage of native secondary structural elements physically contacted by TstDREFs exhibit another surprising positive correlation with folding kinetics. These results allow us to infer probable molecular mechanisms behind the TstDREF-mediated regulation of folding kinetics that challenge protein biochemists to assess by direct experimental testing. © 2017 Federation of European Biochemical Societies.
Evolution of the arginase fold and functional diversity
Dowling, Daniel P.; Costanzo, Luigi Di; Gennadios, Heather A.; Christianson, David W.
2009-01-01
The large number of protein structures deposited in the Protein Data Bank allows for the identification of novel structural superfamilies based on conservation of fold in addition to conservation of amino acid sequence. Since sequence diverges more rapidly than fold in protein evolution, proteins with little or no significant sequence identity are occasionally observed to adopt similar folds, thereby reflecting unanticipated evolutionary relationships. Here, we review the unique α/β fold first observed in the manganese metalloenzyme rat liver arginase, consisting of a parallel 8 stranded β-sheet surrounded by several helices, and its evolutionary relationship with the zinc-requiring and/or iron-requiring histone deacetylases and acetylpolyamine amidohydrolases. Structural comparisons reveal key features of the core α/β fold that contribute to the divergent metal ion specificity and stoichiometry required for the chemical and biological functions of these enzymes. PMID:18360740
NASA Astrophysics Data System (ADS)
Maity, Hiranmay; Reddy, Govardhan
2018-04-01
Small single-domain globular proteins, which are believed to be dominantly two-state folders, played an important role in elucidating various aspects of the protein folding mechanism. However, recent single molecule fluorescence resonance energy transfer experiments [H. Y. Aviram et al. J. Chem. Phys. 148, 123303 (2018)] on a single-domain two-state folding protein L showed evidence for the population of an intermediate state and it was suggested that in this state, a β-hairpin present near the C-terminal of the native protein state is unfolded. We performed molecular dynamics simulations using a coarse-grained self-organized-polymer model with side chains to study the folding pathways of protein L. In agreement with the experiments, an intermediate is populated in the simulation folding pathways where the C-terminal β-hairpin detaches from the rest of the protein structure. The lifetime of this intermediate structure increased with the decrease in temperature. In low temperature conditions, we also observed a second intermediate state, which is globular with a significant fraction of the native-like tertiary contacts satisfying the features of a dry molten globule.
Han, Wei; Schulten, Klaus
2013-01-01
In this study, we apply a hybrid-resolution model, namely PACE, to characterize the free energy surfaces (FESs) of trp-cage and a WW domain variant along with the respective folding mechanisms. Unbiased, independent simulations with PACE are found to achieve together multiple folding and unfolding events for both proteins, allowing us to perform network analysis of the FESs to identify folding pathways. PACE reproduces for both proteins expected complexity hidden in the folding FESs, in particular, meta-stable non-native intermediates. Pathway analysis shows that some of these intermediates are, actually, on-pathway folding intermediates and that intermediates kinetically closest to the native states can be either critical on-pathway or off-pathway intermediates, depending on the protein. Apart from general insights into folding, specific folding mechanisms of the proteins are resolved. We find that trp-cage folds via a dominant pathway in which hydrophobic collapse occurs before the N-terminal helix forms; full incorporation of Trp6 into the hydrophobic core takes place as the last step of folding, which, however, may not be the rate-limiting step. For the WW domain variant studied we observe two main folding pathways with opposite orders of formation of the two hairpins involved in the structure; for either pathway, formation of hairpin 1 is more likely to be the rate-limiting step. Altogether, our results suggest that PACE combined with network analysis is a computationally efficient and valuable tool for the study of protein folding. PMID:23915394
A hybrid MD-kMC algorithm for folding proteins in explicit solvent.
Peter, Emanuel Karl; Shea, Joan-Emma
2014-04-14
We present a novel hybrid MD-kMC algorithm that is capable of efficiently folding proteins in explicit solvent. We apply this algorithm to the folding of a small protein, Trp-Cage. Different kMC move sets that capture different possible rate limiting steps are implemented. The first uses secondary structure formation as a relevant rate event (a combination of dihedral rotations and hydrogen-bonding formation and breakage). The second uses tertiary structure formation events through formation of contacts via translational moves. Both methods fold the protein, but via different mechanisms and with different folding kinetics. The first method leads to folding via a structured helical state, with kinetics fit by a single exponential. The second method leads to folding via a collapsed loop, with kinetics poorly fit by single or double exponentials. In both cases, folding times are faster than experimentally reported values, The secondary and tertiary move sets are integrated in a third MD-kMC implementation, which now leads to folding of the protein via both pathways, with single and double-exponential fits to the rates, and to folding rates in good agreement with experimental values. The competition between secondary and tertiary structure leads to a longer search for the helix-rich intermediate in the case of the first pathway, and to the emergence of a kinetically trapped long-lived molten-globule collapsed state in the case of the second pathway. The algorithm presented not only captures experimentally observed folding intermediates and kinetics, but yields insights into the relative roles of local and global interactions in determining folding mechanisms and rates.
Engineering Globular Protein Vesicles through Tunable Self-Assembly of Recombinant Fusion Proteins
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jang, Yeongseon; Choi, Won Tae; Heller, William T.
Vesicles assembled from folded, globular proteins have potential for functions different from traditional lipid or polymeric vesicles. However, they also present challenges in understanding the assembly process and controlling vesicle properties. From detailed investigation of the assembly behavior of recombinant fusion proteins, this work reports a simple strategy to engineer protein vesicles containing functional, globular domains. This is achieved through tunable self-assembly of recombinant globular fusion proteins containing leucine zippers and elastin-like polypeptides. The fusion proteins form complexes in solution via high affinity binding of the zippers, and transition through dynamic coacervates to stable hollow vesicles upon warming. The thermalmore » driving force, which can be tuned by protein concentration or temperature, controls both vesicle size and whether vesicles are single or bi-layered. Lastly, these results provide critical information to engineer globular protein vesicles via self-assembly with desired size and membrane structure.« less
Engineering Globular Protein Vesicles through Tunable Self-Assembly of Recombinant Fusion Proteins
Jang, Yeongseon; Choi, Won Tae; Heller, William T.; ...
2017-07-27
Vesicles assembled from folded, globular proteins have potential for functions different from traditional lipid or polymeric vesicles. However, they also present challenges in understanding the assembly process and controlling vesicle properties. From detailed investigation of the assembly behavior of recombinant fusion proteins, this work reports a simple strategy to engineer protein vesicles containing functional, globular domains. This is achieved through tunable self-assembly of recombinant globular fusion proteins containing leucine zippers and elastin-like polypeptides. The fusion proteins form complexes in solution via high affinity binding of the zippers, and transition through dynamic coacervates to stable hollow vesicles upon warming. The thermalmore » driving force, which can be tuned by protein concentration or temperature, controls both vesicle size and whether vesicles are single or bi-layered. Lastly, these results provide critical information to engineer globular protein vesicles via self-assembly with desired size and membrane structure.« less
Homochiral stereochemistry: the missing link of structure to energetics in protein folding.
Kumar, Anil; Ramakrishnan, Vibin; Ranbhor, Ranjit; Patel, Kirti; Durani, Susheel
2009-12-24
The notion is tested that homochiral stereochemistry being ubiquitous to protein structure could be critical to protein folding as well, causing it to become frustrated energetically providing the basis for its solvent- and sequence-mediated control. The proof in support of the notion is found in a consensus of experiment and computation according to which suitable oligopeptides are in their folding-unfolding equilibria, at both macrostate and microstate levels, susceptible to dielectric because of the conflict of peptide-chain electrostatics with interpeptide hydrogen bonds when the structure is poly-L but not when it is alternating-L,D. The argument is thus made that homochiral stereochemistry may in protein folding provide the unifying basis for its solvent- and sequence-mediated control based on screening of peptide-chain electrostatics under conflict with folding of the chain due to homochiral stereochemistry. Dielectric is brought into spotlight as the effect comparatively obscure but presumably critical to the folding in protein structure for its control.
Prokaryotic Ubiquitin-Like Protein Modification
Maupin-Furlow, Julie A.
2016-01-01
Prokaryotes form ubiquitin (Ub)-like isopeptide bonds on the lysine residues of proteins by at least two distinct pathways that are reversible and regulated. In mycobacteria, the C-terminal Gln of Pup (prokaryotic ubiquitin-like protein) is deamidated and isopeptide linked to proteins by a mechanism distinct from ubiquitylation in enzymology yet analogous to ubiquitylation in targeting proteins for destruction by proteasomes. Ub-fold proteins of archaea (SAMPs, small archaeal modifier proteins) and Thermus (TtuB, tRNA-two-thiouridine B) that differ from Ub in amino acid sequence, yet share a common β-grasp fold, also form isopeptide bonds by a mechanism that appears streamlined compared with ubiquitylation. SAMPs and TtuB are found to be members of a small group of Ub-fold proteins that function not only in protein modification but also in sulfur-transfer pathways associated with tRNA thiolation and molybdopterin biosynthesis. These multifunctional Ub-fold proteins are thought to be some of the most ancient of Ub-like protein modifiers. PMID:24995873
Bandyopadhyay, Boudhayan; Goldenzweig, Adi; Unger, Tamar; Adato, Orit; Fleishman, Sarel J; Unger, Ron; Horovitz, Amnon
2017-12-15
The GroE chaperonin system in Escherichia coli comprises GroEL and GroES and facilitates ATP-dependent protein folding in vivo and in vitro Proteins with very similar sequences and structures can differ in their dependence on GroEL for efficient folding. One potential but unverified source for GroEL dependence is frustration, wherein not all interactions in the native state are optimized energetically, thereby potentiating slow folding and misfolding. Here, we chose enhanced green fluorescent protein as a model system and subjected it to random mutagenesis, followed by screening for variants whose in vivo folding displays increased or decreased GroEL dependence. We confirmed the altered GroEL dependence of these variants with in vitro folding assays. Strikingly, mutations at positions predicted to be highly frustrated were found to correlate with decreased GroEL dependence. Conversely, mutations at positions with low frustration were found to correlate with increased GroEL dependence. Further support for this finding was obtained by showing that folding of an enhanced green fluorescent protein variant designed computationally to have reduced frustration is indeed less GroEL-dependent. Our results indicate that changes in local frustration also affect partitioning in vivo between spontaneous and chaperonin-mediated folding. Hence, the design of minimally frustrated sequences can reduce chaperonin dependence and improve protein expression levels. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Molecular recognition features (MoRFs) in three domains of life.
Yan, Jing; Dunker, A Keith; Uversky, Vladimir N; Kurgan, Lukasz
2016-03-01
Intrinsically disordered proteins and protein regions offer numerous advantages in the context of protein-protein interactions when compared to the structured proteins and domains. These advantages include ability to interact with multiple partners, to fold into different conformations when bound to different partners, and to undergo disorder-to-order transitions concomitant with their functional activity. Molecular recognition features (MoRFs) are widespread elements located in disordered regions that undergo disorder-to-order transition upon binding to their protein partners. We characterize abundance, composition, and functions of MoRFs and their association with the disordered regions across 868 species spread across Eukaryota, Bacteria and Archaea. We found that although disorder is substantially elevated in Eukaryota, MoRFs have similar abundance and amino acid composition across the three domains of life. The abundance of MoRFs is highly correlated with the amount of intrinsic disorder in Bacteria and Archaea but only modestly correlated in Eukaryota. Proteins with MoRFs have significantly more disorder and MoRFs are present in many disordered regions, with Eukaryota having more MoRF-free disordered regions. MoRF-containing proteins are enriched in the ribosome, nucleus, nucleolus and microtubule and are involved in translation, protein transport, protein folding, and interactions with DNAs. Our insights into the nature and function of MoRFs enhance our understanding of the mechanisms underlying the disorder-to-order transition and protein-protein recognition and interactions. The fMoRFpred method that we used to annotate MoRFs is available at http://biomine.ece.ualberta.ca/fMoRFpred/.
Bose, Sayantan; Heath, Carissa M.; Shah, Priya A.; Alayyoubi, Maher; Jardetzky, Theodore S.
2013-01-01
Paramyxovirus membrane glycoproteins F (fusion protein) and HN, H, or G (attachment protein) are critical for virus entry, which occurs through fusion of viral and cellular envelopes. The F protein folds into a homotrimeric, metastable prefusion form that can be triggered by the attachment protein to undergo a series of structural rearrangements, ultimately folding into a stable postfusion form. In paramyxovirus-infected cells, the F protein is activated in the Golgi apparatus by cleavage adjacent to a hydrophobic fusion peptide that inserts into the target membrane, eventually bringing the membranes together by F refolding. However, it is not clear how the attachment protein, known as HN in parainfluenza virus 5 (PIV5), interacts with F and triggers F to initiate fusion. To understand the roles of various F protein domains in fusion triggering and metastability, single point mutations were introduced into the PIV5 F protein. By extensive study of F protein cleavage activation, surface expression, and energetics of fusion triggering, we found a role for an immunoglobulin-like (Ig-like) domain, where multiple hydrophobic residues on the PIV5 F protein may mediate F-HN interactions. Additionally, destabilizing mutations of PIV5 F that resulted in HN trigger-independent mutant F proteins were identified in a region along the border of F trimer subunits. The positions of the potential HN-interacting region and the region important for F stability in the lower part of the PIV5 F prefusion structure provide clues to the receptor-binding initiated, HN-mediated F trigger. PMID:24089572
Construction of a Chassis for a Tripartite Protein-Based Molecular Motor.
Small, Lara S R; Bruning, Marc; Thomson, Andrew R; Boyle, Aimee L; Davies, Roberta B; Curmi, Paul M G; Forde, Nancy R; Linke, Heiner; Woolfson, Derek N; Bromley, Elizabeth H C
2017-06-16
Improving our understanding of biological motors, both to fully comprehend their activities in vital processes, and to exploit their impressive abilities for use in bionanotechnology, is highly desirable. One means of understanding these systems is through the production of synthetic molecular motors. We demonstrate the use of orthogonal coiled-coil dimers (including both parallel and antiparallel coiled coils) as a hub for linking other components of a previously described synthetic molecular motor, the Tumbleweed. We use circular dichroism, analytical ultracentrifugation, dynamic light scattering, and disulfide rearrangement studies to demonstrate the ability of this six-peptide set to form the structure designed for the Tumbleweed motor. The successful formation of a suitable hub structure is both a test of the transferability of design rules for protein folding as well as an important step in the production of a synthetic protein-based molecular motor.
The folding mechanism of two closely related proteins in the intracellular lipid binding protein family, human bile acid binding protein (hBABP) and rat bile acid binding protein (rBABP) were examined. These proteins are 77% identical (93% similar) in sequence Both of these singl...
Galzitskaya, Oxana; Deryusheva, Eugenia; Machulin, Andrey; Nemashkalova, Ekaterina; Glyakina, Anna
2018-06-21
High prediction accuracy of flexible loops in different protein families is a challenge because of the crucial functions associated with these regions. Results of the currently available programs for prediction of loops vary from protein to protein. For prediction of flexible regions in the G-domain for 23 representatives of G-proteins with the known 3D structure we have used eight programs. The results of predictions demonstrate that the FoldUnfold program predicts better loop positions than the PONDR, RОNN, DisEMBL, IUPred, GlobPlot 2, FoldIndex, and MobiDB programs. When classifying the predicted loops (rigid/flexible) according to the Debye-Waller fluctuation factors, our data reveal the existing weak correlation between the B-factors and the average number of closed residues according to the FoldUnfold program; the percentage of overlapping characteristics (residue fold/unfold status) of the protein residues from the two methods is about 60-70%. According to the FoldUnfold program, for G-proteins with the posttranslational modifications, the surrounding binding site residues by disordered-promoting glycine and alanine residues conduces to a more flexible position of the binding sites for fatty acid, while methionine, cysteine and isoleucine residues provide more rigid binding sites. Thus, our research demonstrates additional possibilities of the FoldUnfold program for prediction of flexible regions and characteristics of individual residues in a different protein family. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Solvent friction changes the folding pathway of the tryptophan zipper TZ2.
Narayanan, Ranjani; Pelakh, Leslie; Hagen, Stephen J
2009-07-17
Because the rate of a diffusional process such as protein folding is controlled by friction encountered along the reaction pathway, the speed of folding is readily tunable through adjustment of solvent viscosity. The precise relationship between solvent viscosity and the rate of diffusion is complex and even conformation-dependent, however, because both solvent friction and protein internal friction contribute to the total reaction friction. The heterogeneity of the reaction friction along the folding pathway may have subtle consequences. For proteins that fold on a multidimensional free-energy surface, an increase in solvent friction may drive a qualitative change in folding trajectory. Our time-resolved experiments on the rapidly and heterogeneously folding beta-hairpin TZ2 show a shift in the folding pathway as viscosity increases, even though the energetics of folding is unaltered. We also observe a nonlinear or saturating behavior of the folding relaxation time with rising solvent viscosity, potentially an experimental signature of the shifting pathway for unfolding. Our results show that manipulations of solvent viscosity in folding experiments and simulations may have subtle and unexpected consequences on the folding dynamics being studied.
Protein kinesis: The dynamics of protein trafficking and stability
DOE Office of Scientific and Technical Information (OSTI.GOV)
NONE
The purpose of this conference is to provide a multidisciplinary forum for exchange of state-of-the-art information on protein kinesis. This volume contains abstracts of papers in the following areas: protein folding and modification in the endoplasmic reticulum; protein trafficking; protein translocation and folding; protein degradation; polarity; nuclear trafficking; membrane dynamics; and protein import into organelles.
Comparing Residue Clusters from Thermophilic and Mesophilic Enzymes Reveals Adaptive Mechanisms.
Sammond, Deanne W; Kastelowitz, Noah; Himmel, Michael E; Yin, Hang; Crowley, Michael F; Bomble, Yannick J
2016-01-01
Understanding how proteins adapt to function at high temperatures is important for deciphering the energetics that dictate protein stability and folding. While multiple principles important for thermostability have been identified, we lack a unified understanding of how internal protein structural and chemical environment determine qualitative or quantitative impact of evolutionary mutations. In this work we compare equivalent clusters of spatially neighboring residues between paired thermophilic and mesophilic homologues to evaluate adaptations under the selective pressure of high temperature. We find the residue clusters in thermophilic enzymes generally display improved atomic packing compared to mesophilic enzymes, in agreement with previous research. Unlike residue clusters from mesophilic enzymes, however, thermophilic residue clusters do not have significant cavities. In addition, anchor residues found in many clusters are highly conserved with respect to atomic packing between both thermophilic and mesophilic enzymes. Thus the improvements in atomic packing observed in thermophilic homologues are not derived from these anchor residues but from neighboring positions, which may serve to expand optimized protein core regions.
Comparing Residue Clusters from Thermophilic and Mesophilic Enzymes Reveals Adaptive Mechanisms
Sammond, Deanne W.; Kastelowitz, Noah; Himmel, Michael E.; Yin, Hang; Crowley, Michael F.; Bomble, Yannick J.
2016-01-01
Understanding how proteins adapt to function at high temperatures is important for deciphering the energetics that dictate protein stability and folding. While multiple principles important for thermostability have been identified, we lack a unified understanding of how internal protein structural and chemical environment determine qualitative or quantitative impact of evolutionary mutations. In this work we compare equivalent clusters of spatially neighboring residues between paired thermophilic and mesophilic homologues to evaluate adaptations under the selective pressure of high temperature. We find the residue clusters in thermophilic enzymes generally display improved atomic packing compared to mesophilic enzymes, in agreement with previous research. Unlike residue clusters from mesophilic enzymes, however, thermophilic residue clusters do not have significant cavities. In addition, anchor residues found in many clusters are highly conserved with respect to atomic packing between both thermophilic and mesophilic enzymes. Thus the improvements in atomic packing observed in thermophilic homologues are not derived from these anchor residues but from neighboring positions, which may serve to expand optimized protein core regions. PMID:26741367
Naranjo, Yandi; Pons, Miquel; Konrat, Robert
2012-01-01
The number of existing protein sequences spans a very small fraction of sequence space. Natural proteins have overcome a strong negative selective pressure to avoid the formation of insoluble aggregates. Stably folded globular proteins and intrinsically disordered proteins (IDPs) use alternative solutions to the aggregation problem. While in globular proteins folding minimizes the access to aggregation prone regions, IDPs on average display large exposed contact areas. Here, we introduce the concept of average meta-structure correlation maps to analyze sequence space. Using this novel conceptual view we show that representative ensembles of folded and ID proteins show distinct characteristics and respond differently to sequence randomization. By studying the way evolutionary constraints act on IDPs to disable a negative function (aggregation) we might gain insight into the mechanisms by which function-enabling information is encoded in IDPs.
Introduction to the Minireview Series on Modern Technologies for In-cell Biochemistry.
Lutsenko, Svetlana
2016-02-19
The last decade has seen enormous progress in the exploration and understanding of the behavior of molecules in their natural cellular environments at increasingly high spatial and temporal resolution. Advances in microscopy and the development of new fluorescent reagents as well as genetic editing techniques have enabled quantitative analysis of protein interactions, intracellular trafficking, metabolic changes, and signaling. Modern biochemistry now faces new and exciting challenges. Can traditionally "in vitro" experiments, e.g. analysis of protein folding and conformational transitions, be done in cells? Can the structure and behavior of endogenous and/or non-tagged recombinant proteins be analyzed and altered within the cell or in cellular compartments? How can molecules and their actions be studied mechanistically in tissues and organs? Is personalized cellular biochemistry a reality? This thematic series summarizes recent studies that illustrate some first steps toward successfully answering these modern biochemical questions. The first minireview focuses on utilization of three-dimensional primary enteroids and organoids for mechanistic studies of intestinal biology with molecular resolution. The second minireview describes application of single chain antibodies (nanobodies) for monitoring and regulating protein dynamics in vitro and in cells. The third minireview highlights advances in using NMR spectroscopy for analysis of protein folding and assembly in cells. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Populations of the Minor α-Conformation in AcGXGNH2 and the α-Helical Nucleation Propensities
NASA Astrophysics Data System (ADS)
Zhou, Yanjun; He, Liu; Zhang, Wenwen; Hu, Jingjing; Shi, Zhengshuang
2016-06-01
Intrinsic backbone conformational preferences of different amino acids are important for understanding the local structure of unfolded protein chains. Recent evidence suggests α-structure is relatively minor among three major backbone conformations for unfolded proteins. The α-helices are the dominant structures in many proteins. For these proteins, how could the α-structures occur from the least in unfolded to the most in folded states? Populations of the minor α-conformation in model peptides provide vital information. Reliable determination of populations of the α-conformers in these peptides that exist in multiple equilibriums of different conformations remains a challenge. Combined analyses on data from AcGXPNH2 and AcGXGNH2 peptides allow us to derive the populations of PII, β and α in AcGXGNH2. Our results show that on average residue X in AcGXGNH2 adopt PII, β, and α 44.7%, 44.5% and 10.8% of time, respectively. The contents of α-conformations for different amino acids define an α-helix nucleation propensity scale. With derived PII, β and α-contents, we can construct a free energy-conformation diagram on each AcGXGNH2 in aqueous solution for the three major backbone conformations. Our results would have broad implications on early-stage events of protein folding.
Habisov, Sabrina; Huber, Jessica; Ichimura, Yoshinobu; Akutsu, Masato; Rogova, Natalia; Loehr, Frank; McEwan, David G.; Johansen, Terje; Dikic, Ivan; Doetsch, Volker; Komatsu, Masaaki; Rogov, Vladimir V.; Kirkin, Vladimir
2016-01-01
The covalent conjugation of ubiquitin-fold modifier 1 (UFM1) to proteins generates a signal that regulates transcription, response to cell stress, and differentiation. Ufmylation is initiated by ubiquitin-like modifier activating enzyme 5 (UBA5), which activates and transfers UFM1 to ubiquitin-fold modifier-conjugating enzyme 1 (UFC1). The details of the interaction between UFM1 and UBA5 required for UFM1 activation and its downstream transfer are however unclear. In this study, we described and characterized a combined linear LC3-interacting region/UFM1-interacting motif (LIR/UFIM) within the C terminus of UBA5. This single motif ensures that UBA5 binds both UFM1 and light chain 3/γ-aminobutyric acid receptor-associated proteins (LC3/GABARAP), two ubiquitin (Ub)-like proteins. We demonstrated that LIR/UFIM is required for the full biological activity of UBA5 and for the effective transfer of UFM1 onto UFC1 and a downstream protein substrate both in vitro and in cells. Taken together, our study provides important structural and functional insights into the interaction between UBA5 and Ub-like modifiers, improving the understanding of the biology of the ufmylation pathway. PMID:26929408
Hepatic transcriptional changes in critical genes for gluconeogenesis following castration of bulls
Fassah, Dilla Mareistia; Jeong, Jin Young
2018-01-01
Objective This study was performed to understand transcriptional changes in the genes involved in gluconeogenesis and glycolysis pathways following castration of bulls. Methods Twenty Korean bulls were weaned at average 3 months of age, and castrated at 6 months. Liver tissues were collected from bulls (n = 10) and steers (n = 10) of Korean cattle, and hepatic gene expression levels were measured using quantitative real-time polymerase chain reaction. We examined hepatic transcription levels of genes encoding enzymes for irreversible reactions in both gluconeogenesis and glycolysis as well as genes encoding enzymes for the utilization of several glucogenic substrates. Correlations between hepatic gene expression and carcass characteristics were performed to understand their associations. Results Castration increased the mRNA (3.6 fold; p<0.01) and protein levels (1.4 fold; p< 0.05) of pyruvate carboxylase and mitochondrial phosphoenolpyruvate carboxykinase genes (1.7 fold; p<0.05). Hepatic mRNA levels of genes encoding the glycolysis enzymes were not changed by castration. Castration increased mRNA levels of both lactate dehydrogenase A (1.5 fold; p<0.05) and lactate dehydrogenase B (2.2 fold; p<0.01) genes for lactate utilization. Castration increased mRNA levels of glycerol kinase (2.7 fold; p<0.05) and glycerol-3-phosphate dehydrogenase 1 (1.5 fold; p<0.05) genes for glycerol utilization. Castration also increased mRNA levels of propionyl-CoA carboxylase beta (mitochondrial) (3.5 fold; p<0.01) and acyl-CoA synthetase short chain family member 3 (1.3 fold; p = 0.06) genes for propionate incorporation. Conclusion Castration increases transcription levels of critical genes coding for enzymes involved in irreversible gluconeogenesis reactions from pyruvate to glucose and enzymes responsible for incorporation of glucogenic substrates including lactate, glycerol, and propionate. Hepatic gluconeogenic gene expression levels were associated with intramuscular fat deposition. PMID:29502393
Hepatic transcriptional changes in critical genes for gluconeogenesis following castration of bulls.
Fassah, Dilla Mareistia; Jeong, Jin Young; Baik, Myunggi
2018-04-01
This study was performed to understand transcriptional changes in the genes involved in gluconeogenesis and glycolysis pathways following castration of bulls. Twenty Korean bulls were weaned at average 3 months of age, and castrated at 6 months. Liver tissues were collected from bulls (n = 10) and steers (n = 10) of Korean cattle, and hepatic gene expression levels were measured using quantitative real-time polymerase chain reaction. We examined hepatic transcription levels of genes encoding enzymes for irreversible reactions in both gluconeogenesis and glycolysis as well as genes encoding enzymes for the utilization of several glucogenic substrates. Correlations between hepatic gene expression and carcass characteristics were performed to understand their associations. Castration increased the mRNA (3.6 fold; p<0.01) and protein levels (1.4 fold; p< 0.05) of pyruvate carboxylase and mitochondrial phosphoenolpyruvate carboxykinase genes (1.7 fold; p<0.05). Hepatic mRNA levels of genes encoding the glycolysis enzymes were not changed by castration. Castration increased mRNA levels of both lactate dehydrogenase A (1.5 fold; p<0.05) and lactate dehydrogenase B (2.2 fold; p<0.01) genes for lactate utilization. Castration increased mRNA levels of glycerol kinase (2.7 fold; p<0.05) and glycerol-3-phosphate dehydrogenase 1 (1.5 fold; p<0.05) genes for glycerol utilization. Castration also increased mRNA levels of propionyl-CoA carboxylase beta (mitochondrial) (3.5 fold; p<0.01) and acyl-CoA synthetase short chain family member 3 (1.3 fold; p = 0.06) genes for propionate incorporation. Castration increases transcription levels of critical genes coding for enzymes involved in irreversible gluconeogenesis reactions from pyruvate to glucose and enzymes responsible for incorporation of glucogenic substrates including lactate, glycerol, and propionate. Hepatic gluconeogenic gene expression levels were associated with intramuscular fat deposition.
Mechanisms of protein-folding diseases at a glance.
Valastyan, Julie S; Lindquist, Susan
2014-01-01
For a protein to function appropriately, it must first achieve its proper conformation and location within the crowded environment inside the cell. Multiple chaperone systems are required to fold proteins correctly. In addition, degradation pathways participate by destroying improperly folded proteins. The intricacy of this multisystem process provides many opportunities for error. Furthermore, mutations cause misfolded, nonfunctional forms of proteins to accumulate. As a result, many pathological conditions are fundamentally rooted in the protein-folding problem that all cells must solve to maintain their function and integrity. Here, to illustrate the breadth of this phenomenon, we describe five examples of protein-misfolding events that can lead to disease: improper degradation, mislocalization, dominant-negative mutations, structural alterations that establish novel toxic functions, and amyloid accumulation. In each case, we will highlight current therapeutic options for battling such diseases.
Zhou, Ruhong
2004-05-01
A highly parallel replica exchange method (REM) that couples with a newly developed molecular dynamics algorithm particle-particle particle-mesh Ewald (P3ME)/RESPA has been proposed for efficient sampling of protein folding free energy landscape. The algorithm is then applied to two separate protein systems, beta-hairpin and a designed protein Trp-cage. The all-atom OPLSAA force field with an explicit solvent model is used for both protein folding simulations. Up to 64 replicas of solvated protein systems are simulated in parallel over a wide range of temperatures. The combined trajectories in temperature and configurational space allow a replica to overcome free energy barriers present at low temperatures. These large scale simulations reveal detailed results on folding mechanisms, intermediate state structures, thermodynamic properties and the temperature dependences for both protein systems.
Borgia, Alessandro; Wensley, Beth G.; Soranno, Andrea; Nettels, Daniel; Borgia, Madeleine B.; Hoffmann, Armin; Pfeil, Shawn H.; Lipman, Everett A.; Clarke, Jane; Schuler, Benjamin
2012-01-01
Theory, simulations and experimental results have suggested an important role of internal friction in the kinetics of protein folding. Recent experiments on spectrin domains provided the first evidence for a pronounced contribution of internal friction in proteins that fold on the millisecond timescale. However, it has remained unclear how this contribution is distributed along the reaction and what influence it has on the folding dynamics. Here we use a combination of single-molecule Förster resonance energy transfer, nanosecond fluorescence correlation spectroscopy, microfluidic mixing and denaturant- and viscosity-dependent protein-folding kinetics to probe internal friction in the unfolded state and at the early and late transition states of slow- and fast-folding spectrin domains. We find that the internal friction affecting the folding rates of spectrin domains is highly localized to the early transition state, suggesting an important role of rather specific interactions in the rate-limiting conformational changes. PMID:23149740
Borgia, Alessandro; Wensley, Beth G; Soranno, Andrea; Nettels, Daniel; Borgia, Madeleine B; Hoffmann, Armin; Pfeil, Shawn H; Lipman, Everett A; Clarke, Jane; Schuler, Benjamin
2012-01-01
Theory, simulations and experimental results have suggested an important role of internal friction in the kinetics of protein folding. Recent experiments on spectrin domains provided the first evidence for a pronounced contribution of internal friction in proteins that fold on the millisecond timescale. However, it has remained unclear how this contribution is distributed along the reaction and what influence it has on the folding dynamics. Here we use a combination of single-molecule Förster resonance energy transfer, nanosecond fluorescence correlation spectroscopy, microfluidic mixing and denaturant- and viscosity-dependent protein-folding kinetics to probe internal friction in the unfolded state and at the early and late transition states of slow- and fast-folding spectrin domains. We find that the internal friction affecting the folding rates of spectrin domains is highly localized to the early transition state, suggesting an important role of rather specific interactions in the rate-limiting conformational changes.
Gough, Jonathan D; Barrett, Elvis J; Silva, Yenia; Lees, Watson J
2006-08-20
Thiol based redox buffers are used to enhance the folding rates of disulfide-containing proteins in vitro. Traditionally, small molecule aliphatic thiols such as glutathione are employed. Recently, we have demonstrated that aromatic thiols can further enhance protein-folding rates. In the presence of para-substituted aromatic thiols the folding rate of a disulfide-containing protein was increased by 4-23 times over that measured for glutathione. However, several important practical issues remain to be addressed. Aromatic thiols have never been tested in the presence of denaturants such as guanidine hydrochloride. Only two of the para-substituted aromatic thiols previously examined are commercially available. To expand the number of aromatic thiols for protein folding, several commercially available meta- and ortho-substituted aromatic thiols were studied. Furthermore, an ortho-substituted aromatic thiol, easily obtained from inexpensive starting materials, was investigated. Folding rates of scrambled ribonuclease A at pH 6.0, 7.0 and 7.7, with ortho- and meta-substituted aromatic thiols, were up to 10 times greater than those with glutathione. In the presence of the common denaturant guanidine hydrochloride (0.5M) aromatic thiols provided 100% yield of active protein while maintaining equivalent folding rates.
Romero, Freddy; Summer, Ross
2017-11-01
Alveolar epithelial type II (AEII) cells are "professional" secretory cells that synthesize and secrete massive quantities of proteins to produce pulmonary surfactant and maintain airway immune defenses. To facilitate this high level of protein synthesis, AEII cells are equipped with an elaborate endoplasmic reticulum (ER) structure and possess an abundance of the machinery needed to fold, assemble, and secrete proteins. However, conditions that suddenly increase the quantity of new proteins entering the ER or that impede the capacity of the ER to fold proteins can cause misfolded or unfolded proteins to accumulate in the ER lumen, also called ER stress. To minimize this stress, AEII cells adapt by (1) reducing the quantity of proteins entering the ER, (2) increasing the amount of protein-folding machinery, and (3) removing misfolded proteins when they accumulate. Although these adaptive responses, aptly named the unfolded protein response, are usually effective in reducing ER stress, chronic aggregation of misfolded proteins is recognized as a hallmark feature of AEII cells in patients with idiopathic pulmonary fibrosis (IPF). Although mutations in surfactant proteins are linked to the development of ER stress in some rare IPF cases, the mechanisms causing protein misfolding in most cases are unknown. In this article, we review the mechanisms regulating ER proteostasis and highlight specific aspects of protein folding and the unfolded protein response that are most vulnerable to failure. Then, we postulate mechanisms other than genetic mutations that might contribute to protein aggregation in the alveolar epithelium of IPF lung.
Metamorphic Proteins: Emergence of Dual Protein Folds from One Primary Sequence.
Lella, Muralikrishna; Mahalakshmi, Radhakrishnan
2017-06-20
Every amino acid exhibits a different propensity for distinct structural conformations. Hence, decoding how the primary amino acid sequence undergoes the transition to a defined secondary structure and its final three-dimensional fold is presently considered predictable with reasonable certainty. However, protein sequences that defy the first principles of secondary structure prediction (they attain two different folds) have recently been discovered. Such proteins, aptly named metamorphic proteins, decrease the conformational constraint by increasing flexibility in the secondary structure and thereby result in efficient functionality. In this review, we discuss the major factors driving the conformational switch related both to protein sequence and to structure using illustrative examples. We discuss the concept of an evolutionary transition in sequence and structure, the functional impact of the tertiary fold, and the pressure of intrinsic and external factors that give rise to metamorphic proteins. We mainly focus on the major components of protein architecture, namely, the α-helix and β-sheet segments, which are involved in conformational switching within the same or highly similar sequences. These chameleonic sequences are widespread in both cytosolic and membrane proteins, and these folds are equally important for protein structure and function. We discuss the implications of metamorphic proteins and chameleonic peptide sequences in de novo peptide design.
Tiwari, Sandhya P.; Reuter, Nathalie
2016-01-01
The conservation of the intrinsic dynamics of proteins emerges as we attempt to understand the relationship between sequence, structure and functional conservation. We characterise the conservation of such dynamics in a case where the structure is conserved but function differs greatly. The triosephosphate isomerase barrel fold (TBF), renowned for its 8 β-strand-α-helix repeats that close to form a barrel, is one of the most diverse and abundant folds found in known protein structures. Proteins with this fold have diverse enzymatic functions spanning five of six Enzyme Commission classes, and we have picked five different superfamily candidates for our analysis using elastic network models. We find that the overall shape is a large determinant in the similarity of the intrinsic dynamics, regardless of function. In particular, the β-barrel core is highly rigid, while the α-helices that flank the β-strands have greater relative mobility, allowing for the many possibilities for placement of catalytic residues. We find that these elements correlate with each other via the loops that link them, as opposed to being directly correlated. We are also able to analyse the types of motions encoded by the normal mode vectors of the α-helices. We suggest that the global conservation of the intrinsic dynamics in the TBF contributes greatly to its success as an enzymatic scaffold both through evolution and enzyme design. PMID:27015412
Amyloidogenesis of Natively Unfolded Proteins
Uversky, Vladimir N.
2009-01-01
Aggregation and subsequent development of protein deposition diseases originate from conformational changes in corresponding amyloidogenic proteins. The accumulated data support the model where protein fibrillogenesis proceeds via the formation of a relatively unfolded amyloidogenic conformation, which shares many structural properties with the pre-molten globule state, a partially folded intermediate first found during the equilibrium and kinetic (un)folding studies of several globular proteins and later described as one of the structural forms of natively unfolded proteins. The flexibility of this structural form is essential for the conformational rearrangements driving the formation of the core cross-beta structure of the amyloid fibril. Obviously, molecular mechanisms describing amyloidogenesis of ordered and natively unfolded proteins are different. For ordered protein to fibrillate, its unique and rigid structure has to be destabilized and partially unfolded. On the other hand, fibrillogenesis of a natively unfolded protein involves the formation of partially folded conformation; i.e., partial folding rather than unfolding. In this review recent findings are surveyed to illustrate some unique features of the natively unfolded proteins amyloidogenesis. PMID:18537543
Proteins improving recombinant antibody production in mammalian cells.
Nishimiya, Daisuke
2014-02-01
Mammalian cells have been successfully used for the industrial manufacture of antibodies due to their ability to synthesize antibodies correctly. Nascent polypeptides must be subjected to protein folding and assembly in the ER and the Golgi to be secreted as mature proteins. If these reactions do not proceed appropriately, unfolded or misfolded proteins are degraded by the ER-associated degradation (ERAD) pathway. The accumulation of unfolded proteins or intracellular antibody crystals accompanied by this failure triggers the unfolded protein response (UPR), which can considerably attenuate the levels of translation, folding, assembly, and secretion, resulting in reduction of antibody productivity. Accumulating studies by omics-based analysis of recombinant mammalian cells suggest that not only protein secretion processes including protein folding and assembly but also translation are likely to be the rate-limiting factors for increasing antibody production. Here, this review describes the mechanism of antibody folding and assembly and recent advantages which could improve recombinant antibody production in mammalian cells by utilizing proteins such as ER chaperones or UPR-related proteins.
Jinek, Martin; Eulalio, Ana; Lingel, Andreas; Helms, Sigrun; Conti, Elena; Izaurralde, Elisa
2008-10-01
The removal of the 5' cap structure by the DCP1-DCP2 decapping complex irreversibly commits eukaryotic mRNAs to degradation. In human cells, the interaction between DCP1 and DCP2 is bridged by the Ge-1 protein. Ge-1 contains an N-terminal WD40-repeat domain connected by a low-complexity region to a conserved C-terminal domain. It was reported that the C-terminal domain interacts with DCP2 and mediates Ge-1 oligomerization and P-body localization. To understand the molecular basis for these functions, we determined the three-dimensional crystal structure of the most conserved region of the Drosophila melanogaster Ge-1 C-terminal domain. The region adopts an all alpha-helical fold related to ARM- and HEAT-repeat proteins. Using structure-based mutants we identified an invariant surface residue affecting P-body localization. The conservation of critical surface and structural residues suggests that the C-terminal region adopts a similar fold with conserved functions in all members of the Ge-1 protein family.
Varamini, Behzad; Sikalidis, Angelos K; Bradford, Kathryn L
2014-02-01
Alzheimer's disease (AD) is characterized by intraneuronal β-amyloid plaques and hyperphosphorylated tau, leading to neuronal cell death and progressive memory losses. This exploratory work investigates if dietary resveratrol, previously shown to have broad anti-aging effects and improve AD pathology in vivo, leads to neuroprotective changes in specific protein targets in the mouse brain. Both wild-type and APP/PS1 mice, a transgenic AD mouse model, received control AIN-93G diet or AIN-93G supplemented with resveratrol. Pathology parameters and AD risk were assessed via measurements on plaque burden, levels of phosphorylated glycogen synthase kinase 3-β (GSK3-β), tau, transthyretin and drebrin. Dietary resveratrol treatment did not decrease plaque burden in APP/PS1 mice. However, resveratrol-fed mice demonstrated increases in GSK3-β phosphorylation, a 3.8-fold increase in protein levels of transthyretin, and a 2.2-fold increase in drebrin. This study broadens our understanding of specific mechanisms and targets whereby resveratrol provides neuroprotection.
Inflating bacterial cells by increased protein synthesis
Basan, Markus; Zhu, Manlu; Dai, Xiongfeng; Warren, Mya; Sévin, Daniel; Wang, Yi-Ping; Hwa, Terence
2015-01-01
Understanding how the homeostasis of cellular size and composition is accomplished by different organisms is an outstanding challenge in biology. For exponentially growing Escherichia coli cells, it is long known that the size of cells exhibits a strong positive relation with their growth rates in different nutrient conditions. Here, we characterized cell sizes in a set of orthogonal growth limitations. We report that cell size and mass exhibit positive or negative dependences with growth rate depending on the growth limitation applied. In particular, synthesizing large amounts of “useless” proteins led to an inversion of the canonical, positive relation, with slow growing cells enlarged 7- to 8-fold compared to cells growing at similar rates under nutrient limitation. Strikingly, this increase in cell size was accompanied by a 3- to 4-fold increase in cellular DNA content at slow growth, reaching up to an amount equivalent to ∼8 chromosomes per cell. Despite drastic changes in cell mass and macromolecular composition, cellular dry mass density remained constant. Our findings reveal an important role of protein synthesis in cell division control. PMID:26519362
Gulten, Gulcin; Sacchettini, James C
2013-10-08
CarD from Mycobacterium tuberculosis (Mtb) is an essential protein shown to be involved in stringent response through downregulation of rRNA and ribosomal protein genes. CarD interacts with the β-subunit of RNAP and this interaction is vital for Mtb's survival during the persistent infection state. We have determined the crystal structure of CarD in complex with the RNAP β-subunit β1 and β2 domains at 2.1 Å resolution. The structure reveals the molecular basis of CarD/RNAP interaction, providing a basis to further our understanding of RNAP regulation by CarD. The structural fold of the CarD N-terminal domain is conserved in RNAP interacting proteins such as TRCF-RID and CdnL, and displays similar interactions to the predicted homology model based on the TRCF/RNAP β1 structure. Interestingly, the structure of the C-terminal domain, which is required for complete CarD function in vivo, represents a distinct DNA-binding fold. Copyright © 2013 Elsevier Ltd. All rights reserved.
Sgourakis, Nikolaos G; Natarajan, Kannan; Ying, Jinfa; Vogeli, Beat; Boyd, Lisa F; Margulies, David H; Bax, Ad
2014-09-02
Immunoevasins are key proteins used by viruses to subvert host immune responses. Determining their high-resolution structures is key to understanding virus-host interactions toward the design of vaccines and other antiviral therapies. Mouse cytomegalovirus encodes a unique set of immunoevasins, the m02-m06 family, that modulates major histocompatibility complex class I (MHC-I) antigen presentation to CD8+ T cells and natural killer cells. Notwithstanding the large number of genetic and functional studies, the structural biology of immunoevasins remains incompletely understood, largely because of crystallization bottlenecks. Here we implement a technology using sparse nuclear magnetic resonance data and integrative Rosetta modeling to determine the structure of the m04/gp34 immunoevasin extracellular domain. The structure reveals a β fold that is representative of the m02-m06 family of viral proteins, several of which are known to bind MHC-I molecules and interfere with antigen presentation, suggesting its role as a diversified immune regulation module. Copyright © 2014 Elsevier Ltd. All rights reserved.
Thermodynamics of protein folding using a modified Wako-Saitô-Muñoz-Eaton model.
Tsai, Min-Yeh; Yuan, Jian-Min; Teranishi, Yoshiaki; Lin, Sheng Hsien
2012-09-01
Herein, we propose a modified version of the Wako-Saitô-Muñoz-Eaton (WSME) model. The proposed model introduces an empirical temperature parameter for the hypothetical structural units (i.e., foldons) in proteins to include site-dependent thermodynamic behavior. The thermodynamics for both our proposed model and the original WSME model were investigated. For a system with beta-hairpin topology, a mathematical treatment (contact-pair treatment) to facilitate the calculation of its partition function was developed. The results show that the proposed model provides better insight into the site-dependent thermodynamic behavior of the system, compared with the original WSME model. From this site-dependent point of view, the relationship between probe-dependent experimental results and model's thermodynamic predictions can be explained. The model allows for suggesting a general principle to identify foldon behavior. We also find that the backbone hydrogen bonds may play a role of structural constraints in modulating the cooperative system. Thus, our study may contribute to the understanding of the fundamental principles for the thermodynamics of protein folding.
Handl, Julia; Lovell, Simon C.
2016-01-01
ABSTRACT Energy functions, fragment libraries, and search methods constitute three key components of fragment‐assembly methods for protein structure prediction, which are all crucial for their ability to generate high‐accuracy predictions. All of these components are tightly coupled; efficient searching becomes more important as the quality of fragment libraries decreases. Given these relationships, there is currently a poor understanding of the strengths and weaknesses of the sampling approaches currently used in fragment‐assembly techniques. Here, we determine how the performance of search techniques can be assessed in a meaningful manner, given the above problems. We describe a set of techniques that aim to reduce the impact of the energy function, and assess exploration in view of the search space defined by a given fragment library. We illustrate our approach using Rosetta and EdaFold, and show how certain features of these methods encourage or limit conformational exploration. We demonstrate that individual trajectories of Rosetta are susceptible to local minima in the energy landscape, and that this can be linked to non‐uniform sampling across the protein chain. We show that EdaFold's novel approach can help balance broad exploration with locating good low‐energy conformations. This occurs through two mechanisms which cannot be readily differentiated using standard performance measures: exclusion of false minima, followed by an increasingly focused search in low‐energy regions of conformational space. Measures such as ours can be helpful in characterizing new fragment‐based methods in terms of the quality of conformational exploration realized. Proteins 2016; 84:411–426. © 2016 The Authors Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc. PMID:26799916
Kolhe, Parag; Badkar, Advait
2011-01-01
Active pharmaceutical ingredient for biotechnology-based drugs, commonly known as drug substance (DS), is often stored frozen for longer shelf-life. Freezing DS enhances stability by slowing down reaction rates that lead to protein instability, minimizes the risk of microbial growth, and eliminates the risk of transport-related stress. High density polyethylene bottles are commonly used for storing monoclonal antibody DS due to good mechanical stress/strain resistant properties even at low temperatures. Despite the aforementioned advantages for frozen storage of DS, this is not devoid of risks. Proteins are known to undergo ice-water surface denaturation, cryoconcentration, and cold denaturation during freezing. A systematic investigation was performed to better understand the protein and solute distribution along with potential of aggregate formation during freeze and thaw process. A significant solute and protein concentration gradient was observed for both frozen and thawed DS bottles. In case of thawed DS, cryoconcentration was localized in the bottom layer and a linear increase in concentration as a function of liquid depth was observed. On the other hand, for frozen DS, a "bell shaped" cryoconcentration distribution was observed between the bottom layers and centre position. A cryoconcentration of almost three-fold was observed for frozen DS in the most concentrated part when freezing was conducted at -20 and -40 °C and 2.5-fold cryoconcentration was observed in the thawed DS before mixing. The information obtained in this study is critical to design freeze thaw experiments, storage condition determination, and process improvement in manufacturing environment. Copyright © 2011 American Institute of Chemical Engineers (AIChE).
Molecular Determinants of Mutant Phenotypes, Inferred from Saturation Mutagenesis Data.
Tripathi, Arti; Gupta, Kritika; Khare, Shruti; Jain, Pankaj C; Patel, Siddharth; Kumar, Prasanth; Pulianmackal, Ajai J; Aghera, Nilesh; Varadarajan, Raghavan
2016-11-01
Understanding how mutations affect protein activity and organismal fitness is a major challenge. We used saturation mutagenesis combined with deep sequencing to determine mutational sensitivity scores for 1,664 single-site mutants of the 101 residue Escherichia coli cytotoxin, CcdB at seven different expression levels. Active-site residues could be distinguished from buried ones, based on their differential tolerance to aliphatic and charged amino acid substitutions. At nonactive-site positions, the average mutational tolerance correlated better with depth from the protein surface than with accessibility. Remarkably, similar results were observed for two other small proteins, PDZ domain (PSD95 pdz3 ) and IgG-binding domain of protein G (GB1). Mutational sensitivity data obtained with CcdB were used to derive a procedure for predicting functional effects of mutations. Results compared favorably with those of two widely used computational predictors. In vitro characterization of 80 single, nonactive-site mutants of CcdB showed that activity in vivo correlates moderately with thermal stability and solubility. The inability to refold reversibly, as well as a decreased folding rate in vitro, is associated with decreased activity in vivo. Upon probing the effect of modulating expression of various proteases and chaperones on mutant phenotypes, most deleterious mutants showed an increased in vivo activity and solubility only upon over-expression of either Trigger factor or SecB ATP-independent chaperones. Collectively, these data suggest that folding kinetics rather than protein stability is the primary determinant of activity in vivo This study enhances our understanding of how mutations affect phenotype, as well as the ability to predict fitness effects of point mutations. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
The Mechanism and Function of Group II Chaperonins
Lopez, Tom; Dalton, Kevin; Frydman, Judith
2015-04-30
We report protein folding in the cell requires the assistance of enzymes collectively called chaperones. Among these, the chaperonins are 1-MDa ring-shaped oligomeric complexes that bind unfolded polypeptides and promote their folding within an isolated chamber in an ATP-dependent manner. Group II chaperonins, found in archaea and eukaryotes, contain a built-in lid that opens and closes over the central chamber. In eukaryotes, the chaperonin TRiC/CCT is hetero-oligomeric, consisting of two stacked rings of eight paralogous subunits each. TRiC facilitates folding of approximately 10% of the eukaryotic proteome, including many cytoskeletal components and cell cycle regulators. Folding of many cellular substratesmore » of TRiC cannot be assisted by any other chaperone. A complete structural and mechanistic understanding of this highly conserved and essential chaperonin remains elusive. However, recent work is beginning to shed light on key aspects of chaperonin function and how their unique properties underlie their contribution to maintaining cellular proteostasis.« less