protein structures show: Topics by Science.gov

Sample records for protein structures show

Heat-induced Protein Structure and Subfractions in Relation to Protein Degradation Kinetics and Intestinal Availability in Dairy Cattle

DOE Office of Scientific and Technical Information (OSTI.GOV)

Doiron, K.; Yu, P; McKinnon, J

2009-01-01

The objectives of this study were to reveal protein structures of feed tissues affected by heat processing at a cellular level, using the synchrotron-based Fourier transform infrared microspectroscopy as a novel approach, and quantify protein structure in relation to protein digestive kinetics and nutritive value in the rumen and intestine in dairy cattle. The parameters assessed included (1) protein structure a-helix to e-sheet ratio; (2) protein subfractions profiles; (3) protein degradation kinetics and effective degradability; (4) predicted nutrient supply using the intestinally absorbed protein supply (DVE)/degraded protein balance (OEB) system for dairy cattle. In this study, Vimy flaxseed protein wasmore » used as a model feed protein and was autoclave-heated at 120C for 20, 40, and 60 min in treatments T1, T2, and T3, respectively. The results showed that using the synchrotron-based Fourier transform infrared microspectroscopy revealed and identified the heat-induced protein structure changes. Heating at 120C for 40 and 60 min increased the protein structure a-helix to e-sheet ratio. There were linear effects of heating time on the ratio. The heating also changed chemical profiles, which showed soluble CP decreased upon heating with concomitant increases in nonprotein nitrogen, neutral, and acid detergent insoluble nitrogen. The protein subfractions with the greatest changes were PB1, which showed a dramatic reduction, and PB2, which showed a dramatic increase, demonstrating a decrease in overall protein degradability. In situ results showed a reduction in rumen-degradable protein and in rumen-degradable dry matter without differences between the treatments. Intestinal digestibility, determined using a 3-step in vitro procedure, showed no changes to rumen undegradable protein. Modeling results showed that heating increased total intestinally absorbable protein (feed DVE value) and decreased degraded protein balance (feed OEB value), but there were no differences between the treatments. There was a linear effect of heating time on the DVE and a cubic effect on the OEB value. Our results showed that heating changed chemical profiles, protein structure a-helix to e-sheet ratio, and protein subfractions; decreased rumen-degradable protein and rumen-degradable dry matter; and increased potential nutrient supply to dairy cattle. The protein structure a-helix to e-sheet ratio had a significant positive correlation with total intestinally absorbed protein supply and negative correlation with degraded protein balance.« less
Investigating Molecular Structures of Bio-Fuel and Bio-Oil Seeds as Predictors To Estimate Protein Bioavailability for Ruminants by Advanced Nondestructive Vibrational Molecular Spectroscopy.

PubMed

Ban, Yajing; L Prates, Luciana; Yu, Peiqiang

2017-10-18

This study was conducted to (1) determine protein and carbohydrate molecular structure profiles and (2) quantify the relationship between structural features and protein bioavailability of newly developed carinata and canola seeds for dairy cows by using Fourier transform infrared molecular spectroscopy. Results showed similarity in protein structural makeup within the entire protein structural region between carinata and canola seeds. The highest area ratios related to structural CHO, total CHO, and cellulosic compounds were obtained for carinata seeds. Carinata and canola seeds showed similar carbohydrate and protein molecular structures by multivariate analyses. Carbohydrate molecular structure profiles were highly correlated to protein rumen degradation and intestinal digestion characteristics. In conclusion, the molecular spectroscopy can detect inherent structural characteristics in carinata and canola seeds in which carbohydrate-relative structural features are related to protein metabolism and utilization. Protein and carbohydrate spectral profiles could be used as predictors of rumen protein bioavailability in cows.
Streptococcus pneumonia YlxR at 1.35 A shows a putative new fold.

PubMed

Osipiuk, J; Górnicki, P; Maj, L; Dementieva, I; Laskowski, R; Joachimiak, A

2001-11-01

The structure of the YlxR protein of unknown function from Streptococcus pneumonia was determined to 1.35 A. YlxR is expressed from the nusA/infB operon in bacteria and belongs to a small protein family (COG2740) that shares a conserved sequence motif GRGA(Y/W). The family shows no significant amino-acid sequence similarity with other proteins. Three-wavelength diffraction MAD data were collected to 1.7 A from orthorhombic crystals using synchrotron radiation and the structure was determined using a semi-automated approach. The YlxR structure resembles a two-layer alpha/beta sandwich with the overall shape of a cylinder and shows no structural homology to proteins of known structure. Structural analysis revealed that the YlxR structure represents a new protein fold that belongs to the alpha-beta plait superfamily. The distribution of the electrostatic surface potential shows a large positively charged patch on one side of the protein, a feature often found in nucleic acid-binding proteins. Three sulfate ions bind to this positively charged surface. Analysis of potential binding sites uncovered several substantial clefts, with the largest spanning 3/4 of the protein. A similar distribution of binding sites and a large sharply bent cleft are observed in RNA-binding proteins that are unrelated in sequence and structure. It is proposed that YlxR is an RNA-binding protein.
Composition, structure and functional properties of protein concentrates and isolates produced from walnut (Juglans regia L.).

PubMed

Mao, Xiaoying; Hua, Yufei

2012-01-01

In this study, composition, structure and the functional properties of protein concentrate (WPC) and protein isolate (WPI) produced from defatted walnut flour (DFWF) were investigated. The results showed that the composition and structure of walnut protein concentrate (WPC) and walnut protein isolate (WPI) were significantly different. The molecular weight distribution of WPI was uniform and the protein composition of DFWF and WPC was complex with the protein aggregation. H(0) of WPC was significantly higher (p < 0.05) than those of DFWF and WPI, whilst WPI had a higher H(0) compared to DFWF. The secondary structure of WPI was similar to WPC. WPI showed big flaky plate like structures; whereas WPC appeared as a small flaky and more compact structure. The most functional properties of WPI were better than WPC. In comparing most functional properties of WPI and WPC with soybean protein concentrate and isolate, WPI and WPC showed higher fat absorption capacity (FAC). Emulsifying properties and foam properties of WPC and WPI in alkaline pH were comparable with that of soybean protein concentrate and isolate. Walnut protein concentrates and isolates can be considered as potential functional food ingredients.
Predicting nucleic acid binding interfaces from structural models of proteins

PubMed Central

Dror, Iris; Shazman, Shula; Mukherjee, Srayanta; Zhang, Yang; Glaser, Fabian; Mandel-Gutfreund, Yael

2011-01-01

The function of DNA- and RNA-binding proteins can be inferred from the characterization and accurate prediction of their binding interfaces. However the main pitfall of various structure-based methods for predicting nucleic acid binding function is that they are all limited to a relatively small number of proteins for which high-resolution three dimensional structures are available. In this study, we developed a pipeline for extracting functional electrostatic patches from surfaces of protein structural models, obtained using the I-TASSER protein structure predictor. The largest positive patches are extracted from the protein surface using the patchfinder algorithm. We show that functional electrostatic patches extracted from an ensemble of structural models highly overlap the patches extracted from high-resolution structures. Furthermore, by testing our pipeline on a set of 55 known nucleic acid binding proteins for which I-TASSER produces high-quality models, we show that the method accurately identifies the nucleic acids binding interface on structural models of proteins. Employing a combined patch approach we show that patches extracted from an ensemble of models better predicts the real nucleic acid binding interfaces compared to patches extracted from independent models. Overall, these results suggest that combining information from a collection of low-resolution structural models could be a valuable approach for functional annotation. We suggest that our method will be further applicable for predicting other functional surfaces of proteins with unknown structure. PMID:22086767
Taking advantage of local structure descriptors to analyze interresidue contacts in protein structures and protein complexes.

PubMed

Martin, Juliette; Regad, Leslie; Etchebest, Catherine; Camproux, Anne-Claude

2008-11-15

Interresidue protein contacts in proteins structures and at protein-protein interface are classically described by the amino acid types of interacting residues and the local structural context of the contact, if any, is described using secondary structures. In this study, we present an alternate analysis of interresidue contact using local structures defined by the structural alphabet introduced by Camproux et al. This structural alphabet allows to describe a 3D structure as a sequence of prototype fragments called structural letters, of 27 different types. Each residue can then be assigned to a particular local structure, even in loop regions. The analysis of interresidue contacts within protein structures defined using Voronoï tessellations reveals that pairwise contact specificity is greater in terms of structural letters than amino acids. Using a simple heuristic based on specificity score comparison, we find that 74% of the long-range contacts within protein structures are better described using structural letters than amino acid types. The investigation is extended to a set of protein-protein complexes, showing that the similar global rules apply as for intraprotein contacts, with 64% of the interprotein contacts best described by local structures. We then present an evaluation of pairing functions integrating structural letters to decoy scoring and show that some complexes could benefit from the use of structural letter-based pairing functions.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Osipiuk, J.; Gornicki, P.; Maj, L.

The structure of the YlxR protein of unknown function from Streptococcus pneumonia was determined to 1.35 Angstroms. YlxR is expressed from the nusA/infB operon in bacteria and belongs to a small protein family (COG2740) that shares a conserved sequence motif GRGA(Y/W). The family shows no significant amino-acid sequence similarity with other proteins. Three-wavelength diffraction MAD data were collected to 1.7 Angstroms from orthorhombic crystals using synchrotron radiation and the structure was determined using a semi-automated approach. The YlxR structure resembles a two-layer {alpha}/{beta} sandwich with the overall shape of a cylinder and shows no structural homology to proteins of knownmore » structure. Structural analysis revealed that the YlxR structure represents a new protein fold that belongs to the {alpha}-{beta} plait superfamily. The distribution of the electrostatic surface potential shows a large positively charged patch on one side of the protein, a feature often found in nucleic acid-binding proteins. Three sulfate ions bind to this positively charged surface. Analysis of potential binding sites uncovered several substantial clefts, with the largest spanning 3/4 of the protein. A similar distribution of binding sites and a large sharply bent cleft are observed in RNA-binding proteins that are unrelated in sequence and structure. It is proposed that YlxR is an RNA-binding protein.« less
A Sand Fly Salivary Protein Vaccine Shows Efficacy Against Vector-Transmitted Cutaneous Leishmaniasis in Nonhuman Primates

DTIC Science & Technology

2015-06-03

demonstrating its immunogenicity in humans. PdSP15 sequence and structure show no homol- ogy to mammalian proteins, further demonstrating its potential...sequence or structure homology to known human proteins The protective salivary antigen PdSP15 shares sequence homology only to the small odorant binding...salivary proteins PpSP15 and PsSP15, respectively (Fig. 4B). To exclude any structural similarities to human pro teins, the crystal structure of PdPS15
Isolation and in silico analysis of a novel H+-pyrophosphatase gene orthologue from the halophytic grass Leptochloa fusca

NASA Astrophysics Data System (ADS)

Rauf, Muhammad; Saeed, Nasir A.; Habib, Imran; Ahmed, Moddassir; Shahzad, Khurram; Mansoor, Shahid; Ali, Rashid

2017-02-01

Structure prediction can provide information about function and active sites of protein which helps to design new functional proteins. H+-pyrophosphatase is transmembrane protein involved in establishing proton motive force for active transport of Na+ across membrane by Na+/H+ antiporters. A full length novel H+-pyrophosphatase gene was isolated from halophytic grass Leptochloa fusca using RT-PCR and RACE method. Full length LfVP1 gene sequence of 2292 nucleotides encodes protein of 764 amino acids. DNA and protein sequences were used for characterization using bioinformatics tools. Various important potential sites were predicted by PROSITE webserver. Primary structural analysis showed LfVP1 as stable protein and Grand average hydropathy (GRAVY) indicated that LfVP1 protein has good hydrosolubility. Secondary structure analysis showed that LfVP1 protein sequence contains significant proportion of alpha helix and random coil. Protein membrane topology suggested the presence of 14 transmembrane domains and presence of catalytic domain in TM3. Three dimensional structure from LfVP1 protein sequence also indicated the presence of 14 transmembrane domains and hydrophobicity surface model showed amino acid hydrophobicity. Ramachandran plot showed that 98% amino acid residues were predicted in the favored region.
Predicting nucleic acid binding interfaces from structural models of proteins.

PubMed

Dror, Iris; Shazman, Shula; Mukherjee, Srayanta; Zhang, Yang; Glaser, Fabian; Mandel-Gutfreund, Yael

2012-02-01

The function of DNA- and RNA-binding proteins can be inferred from the characterization and accurate prediction of their binding interfaces. However, the main pitfall of various structure-based methods for predicting nucleic acid binding function is that they are all limited to a relatively small number of proteins for which high-resolution three-dimensional structures are available. In this study, we developed a pipeline for extracting functional electrostatic patches from surfaces of protein structural models, obtained using the I-TASSER protein structure predictor. The largest positive patches are extracted from the protein surface using the patchfinder algorithm. We show that functional electrostatic patches extracted from an ensemble of structural models highly overlap the patches extracted from high-resolution structures. Furthermore, by testing our pipeline on a set of 55 known nucleic acid binding proteins for which I-TASSER produces high-quality models, we show that the method accurately identifies the nucleic acids binding interface on structural models of proteins. Employing a combined patch approach we show that patches extracted from an ensemble of models better predicts the real nucleic acid binding interfaces compared with patches extracted from independent models. Overall, these results suggest that combining information from a collection of low-resolution structural models could be a valuable approach for functional annotation. We suggest that our method will be further applicable for predicting other functional surfaces of proteins with unknown structure. Copyright © 2011 Wiley Periodicals, Inc.
Binding free energy analysis of protein-protein docking model structures by evERdock.

PubMed

Takemura, Kazuhiro; Matubayasi, Nobuyuki; Kitao, Akio

2018-03-14

To aid the evaluation of protein-protein complex model structures generated by protein docking prediction (decoys), we previously developed a method to calculate the binding free energies for complexes. The method combines a short (2 ns) all-atom molecular dynamics simulation with explicit solvent and solution theory in the energy representation (ER). We showed that this method successfully selected structures similar to the native complex structure (near-native decoys) as the lowest binding free energy structures. In our current work, we applied this method (evERdock) to 100 or 300 model structures of four protein-protein complexes. The crystal structures and the near-native decoys showed the lowest binding free energy of all the examined structures, indicating that evERdock can successfully evaluate decoys. Several decoys that show low interface root-mean-square distance but relatively high binding free energy were also identified. Analysis of the fraction of native contacts, hydrogen bonds, and salt bridges at the protein-protein interface indicated that these decoys were insufficiently optimized at the interface. After optimizing the interactions around the interface by including interfacial water molecules, the binding free energies of these decoys were improved. We also investigated the effect of solute entropy on binding free energy and found that consideration of the entropy term does not necessarily improve the evaluations of decoys using the normal model analysis for entropy calculation.
Binding free energy analysis of protein-protein docking model structures by evERdock

NASA Astrophysics Data System (ADS)

Takemura, Kazuhiro; Matubayasi, Nobuyuki; Kitao, Akio

2018-03-01

To aid the evaluation of protein-protein complex model structures generated by protein docking prediction (decoys), we previously developed a method to calculate the binding free energies for complexes. The method combines a short (2 ns) all-atom molecular dynamics simulation with explicit solvent and solution theory in the energy representation (ER). We showed that this method successfully selected structures similar to the native complex structure (near-native decoys) as the lowest binding free energy structures. In our current work, we applied this method (evERdock) to 100 or 300 model structures of four protein-protein complexes. The crystal structures and the near-native decoys showed the lowest binding free energy of all the examined structures, indicating that evERdock can successfully evaluate decoys. Several decoys that show low interface root-mean-square distance but relatively high binding free energy were also identified. Analysis of the fraction of native contacts, hydrogen bonds, and salt bridges at the protein-protein interface indicated that these decoys were insufficiently optimized at the interface. After optimizing the interactions around the interface by including interfacial water molecules, the binding free energies of these decoys were improved. We also investigated the effect of solute entropy on binding free energy and found that consideration of the entropy term does not necessarily improve the evaluations of decoys using the normal model analysis for entropy calculation.
Structural hot spots for the solubility of globular proteins

PubMed Central

Ganesan, Ashok; Siekierska, Aleksandra; Beerten, Jacinte; Brams, Marijke; Van Durme, Joost; De Baets, Greet; Van der Kant, Rob; Gallardo, Rodrigo; Ramakers, Meine; Langenberg, Tobias; Wilkinson, Hannah; De Smet, Frederik; Ulens, Chris; Rousseau, Frederic; Schymkowitz, Joost

2016-01-01

Natural selection shapes protein solubility to physiological requirements and recombinant applications that require higher protein concentrations are often problematic. This raises the question whether the solubility of natural protein sequences can be improved. We here show an anti-correlation between the number of aggregation prone regions (APRs) in a protein sequence and its solubility, suggesting that mutational suppression of APRs provides a simple strategy to increase protein solubility. We show that mutations at specific positions within a protein structure can act as APR suppressors without affecting protein stability. These hot spots for protein solubility are both structure and sequence dependent but can be computationally predicted. We demonstrate this by reducing the aggregation of human α-galactosidase and protective antigen of Bacillus anthracis through mutation. Our results indicate that many proteins possess hot spots allowing to adapt protein solubility independently of structure and function. PMID:26905391
MD simulations of papillomavirus DNA-E2 protein complexes hints at a protein structural code for DNA deformation.

PubMed

Falconi, M; Oteri, F; Eliseo, T; Cicero, D O; Desideri, A

2008-08-01

The structural dynamics of the DNA binding domains of the human papillomavirus strain 16 and the bovine papillomavirus strain 1, complexed with their DNA targets, has been investigated by modeling, molecular dynamics simulations, and nuclear magnetic resonance analysis. The simulations underline different dynamical features of the protein scaffolds and a different mechanical interaction of the two proteins with DNA. The two protein structures, although very similar, show differences in the relative mobility of secondary structure elements. Protein structural analyses, principal component analysis, and geometrical and energetic DNA analyses indicate that the two transcription factors utilize a different strategy in DNA recognition and deformation. Results show that the protein indirect DNA readout is not only addressable to the DNA molecule flexibility but it is finely tuned by the mechanical and dynamical properties of the protein scaffold involved in the interaction.
Effect of protein solution components in the adsorption of Herbaspirillum seropedicae GlnB protein on mica.

PubMed

Ferreira, Cecília F G; Benelli, Elaine M; Klein, Jorge J; Schreiner, Wido; Camargo, Paulo C

2009-10-15

The adsorption of proteins and its buffer solution on mica surfaces was investigated by atomic force microscopy (AFM). Different salt concentration of the Herbaspirillum seropedicae GlnB protein (GlnB-Hs) solution deposited on mica was investigated. This protein is a globular, soluble homotrimer (36kDa), member of PII-like proteins family involved in signal transducing in prokaryote. Supramolecular structures were formed when this protein was deposited onto bare mica surface. The topographic AFM images of the GlnB-Hs films showed that at high salt concentration the supramolecular structures are spherical-like, instead of the typical doughnut-like shape for low salt concentration. AFM images of NaCl and Tris from the buffer solution showed structures with the same pattern as those observed for high salt protein solution, misleading the image interpretation. XPS experiments showed that GlnB protein film covers the mica surface without chemical reaction.
Restricted N-glycan conformational space in the PDB and its implication in glycan structure modeling.

PubMed

Jo, Sunhwan; Lee, Hui Sun; Skolnick, Jeffrey; Im, Wonpil

2013-01-01

Understanding glycan structure and dynamics is central to understanding protein-carbohydrate recognition and its role in protein-protein interactions. Given the difficulties in obtaining the glycan's crystal structure in glycoconjugates due to its flexibility and heterogeneity, computational modeling could play an important role in providing glycosylated protein structure models. To address if glycan structures available in the PDB can be used as templates or fragments for glycan modeling, we present a survey of the N-glycan structures of 35 different sequences in the PDB. Our statistical analysis shows that the N-glycan structures found on homologous glycoproteins are significantly conserved compared to the random background, suggesting that N-glycan chains can be confidently modeled with template glycan structures whose parent glycoproteins share sequence similarity. On the other hand, N-glycan structures found on non-homologous glycoproteins do not show significant global structural similarity. Nonetheless, the internal substructures of these N-glycans, particularly, the substructures that are closer to the protein, show significantly similar structures, suggesting that such substructures can be used as fragments in glycan modeling. Increased interactions with protein might be responsible for the restricted conformational space of N-glycan chains. Our results suggest that structure prediction/modeling of N-glycans of glycoconjugates using structure database could be effective and different modeling approaches would be needed depending on the availability of template structures.
Restricted N-glycan Conformational Space in the PDB and Its Implication in Glycan Structure Modeling

PubMed Central

Jo, Sunhwan; Lee, Hui Sun; Skolnick, Jeffrey; Im, Wonpil

2013-01-01

Understanding glycan structure and dynamics is central to understanding protein-carbohydrate recognition and its role in protein-protein interactions. Given the difficulties in obtaining the glycan's crystal structure in glycoconjugates due to its flexibility and heterogeneity, computational modeling could play an important role in providing glycosylated protein structure models. To address if glycan structures available in the PDB can be used as templates or fragments for glycan modeling, we present a survey of the N-glycan structures of 35 different sequences in the PDB. Our statistical analysis shows that the N-glycan structures found on homologous glycoproteins are significantly conserved compared to the random background, suggesting that N-glycan chains can be confidently modeled with template glycan structures whose parent glycoproteins share sequence similarity. On the other hand, N-glycan structures found on non-homologous glycoproteins do not show significant global structural similarity. Nonetheless, the internal substructures of these N-glycans, particularly, the substructures that are closer to the protein, show significantly similar structures, suggesting that such substructures can be used as fragments in glycan modeling. Increased interactions with protein might be responsible for the restricted conformational space of N-glycan chains. Our results suggest that structure prediction/modeling of N-glycans of glycoconjugates using structure database could be effective and different modeling approaches would be needed depending on the availability of template structures. PMID:23516343
Structural deformation upon protein-protein interaction: A structural alphabet approach

PubMed Central

Martin, Juliette; Regad, Leslie; Lecornet, Hélène; Camproux, Anne-Claude

2008-01-01

Background In a number of protein-protein complexes, the 3D structures of bound and unbound partners significantly differ, supporting the induced fit hypothesis for protein-protein binding. Results In this study, we explore the induced fit modifications on a set of 124 proteins available in both bound and unbound forms, in terms of local structure. The local structure is described thanks to a structural alphabet of 27 structural letters that allows a detailed description of the backbone. Using a control set to distinguish induced fit from experimental error and natural protein flexibility, we show that the fraction of structural letters modified upon binding is significantly greater than in the control set (36% versus 28%). This proportion is even greater in the interface regions (41%). Interface regions preferentially involve coils. Our analysis further reveals that some structural letters in coil are not favored in the interface. We show that certain structural letters in coil are particularly subject to modifications at the interface, and that the severity of structural change also varies. These information are used to derive a structural letter substitution matrix that summarizes the local structural changes observed in our data set. We also illustrate the usefulness of our approach to identify common binding motifs in unrelated proteins. Conclusion Our study provides qualitative information about induced fit. These results could be of help for flexible docking. PMID:18307769
Structural deformation upon protein-protein interaction: a structural alphabet approach.

PubMed

Martin, Juliette; Regad, Leslie; Lecornet, Hélène; Camproux, Anne-Claude

2008-02-28

In a number of protein-protein complexes, the 3D structures of bound and unbound partners significantly differ, supporting the induced fit hypothesis for protein-protein binding. In this study, we explore the induced fit modifications on a set of 124 proteins available in both bound and unbound forms, in terms of local structure. The local structure is described thanks to a structural alphabet of 27 structural letters that allows a detailed description of the backbone. Using a control set to distinguish induced fit from experimental error and natural protein flexibility, we show that the fraction of structural letters modified upon binding is significantly greater than in the control set (36% versus 28%). This proportion is even greater in the interface regions (41%). Interface regions preferentially involve coils. Our analysis further reveals that some structural letters in coil are not favored in the interface. We show that certain structural letters in coil are particularly subject to modifications at the interface, and that the severity of structural change also varies. These information are used to derive a structural letter substitution matrix that summarizes the local structural changes observed in our data set. We also illustrate the usefulness of our approach to identify common binding motifs in unrelated proteins. Our study provides qualitative information about induced fit. These results could be of help for flexible docking.
Same but not alike: Structure, flexibility and energetics of domains in multi-domain proteins are influenced by the presence of other domains

PubMed Central

Vishwanath, Sneha

2018-01-01

The majority of the proteins encoded in the genomes of eukaryotes contain more than one domain. Reasons for high prevalence of multi-domain proteins in various organisms have been attributed to higher stability and functional and folding advantages over single-domain proteins. Despite these advantages, many proteins are composed of only one domain while their homologous domains are part of multi-domain proteins. In the study presented here, differences in the properties of protein domains in single-domain and multi-domain systems and their influence on functions are discussed. We studied 20 pairs of identical protein domains, which were crystallized in two forms (a) tethered to other proteins domains and (b) tethered to fewer protein domains than (a) or not tethered to any protein domain. Results suggest that tethering of domains in multi-domain proteins influences the structural, dynamic and energetic properties of the constituent protein domains. 50% of the protein domain pairs show significant structural deviations while 90% of the protein domain pairs show differences in dynamics and 12% of the residues show differences in the energetics. To gain further insights on the influence of tethering on the function of the domains, 4 pairs of homologous protein domains, where one of them is a full-length single-domain protein and the other protein domain is a part of a multi-domain protein, were studied. Analyses showed that identical and structurally equivalent functional residues show differential dynamics in homologous protein domains; though comparable dynamics between in-silico generated chimera protein and multi-domain proteins were observed. From these observations, the differences observed in the functions of homologous proteins could be attributed to the presence of tethered domain. Overall, we conclude that tethered domains in multi-domain proteins not only provide stability or folding advantages but also influence pathways resulting in differences in function or regulatory properties. PMID:29432415

Same but not alike: Structure, flexibility and energetics of domains in multi-domain proteins are influenced by the presence of other domains.

PubMed

Vishwanath, Sneha; de Brevern, Alexandre G; Srinivasan, Narayanaswamy

2018-02-01

The majority of the proteins encoded in the genomes of eukaryotes contain more than one domain. Reasons for high prevalence of multi-domain proteins in various organisms have been attributed to higher stability and functional and folding advantages over single-domain proteins. Despite these advantages, many proteins are composed of only one domain while their homologous domains are part of multi-domain proteins. In the study presented here, differences in the properties of protein domains in single-domain and multi-domain systems and their influence on functions are discussed. We studied 20 pairs of identical protein domains, which were crystallized in two forms (a) tethered to other proteins domains and (b) tethered to fewer protein domains than (a) or not tethered to any protein domain. Results suggest that tethering of domains in multi-domain proteins influences the structural, dynamic and energetic properties of the constituent protein domains. 50% of the protein domain pairs show significant structural deviations while 90% of the protein domain pairs show differences in dynamics and 12% of the residues show differences in the energetics. To gain further insights on the influence of tethering on the function of the domains, 4 pairs of homologous protein domains, where one of them is a full-length single-domain protein and the other protein domain is a part of a multi-domain protein, were studied. Analyses showed that identical and structurally equivalent functional residues show differential dynamics in homologous protein domains; though comparable dynamics between in-silico generated chimera protein and multi-domain proteins were observed. From these observations, the differences observed in the functions of homologous proteins could be attributed to the presence of tethered domain. Overall, we conclude that tethered domains in multi-domain proteins not only provide stability or folding advantages but also influence pathways resulting in differences in function or regulatory properties.
Exploring representations of protein structure for automated remote homology detection and mapping of protein structure space

PubMed Central

2014-01-01

Background Due to rapid sequencing of genomes, there are now millions of deposited protein sequences with no known function. Fast sequence-based comparisons allow detecting close homologs for a protein of interest to transfer functional information from the homologs to the given protein. Sequence-based comparison cannot detect remote homologs, in which evolution has adjusted the sequence while largely preserving structure. Structure-based comparisons can detect remote homologs but most methods for doing so are too expensive to apply at a large scale over structural databases of proteins. Recently, fragment-based structural representations have been proposed that allow fast detection of remote homologs with reasonable accuracy. These representations have also been used to obtain linearly-reducible maps of protein structure space. It has been shown, as additionally supported from analysis in this paper that such maps preserve functional co-localization of the protein structure space. Methods Inspired by a recent application of the Latent Dirichlet Allocation (LDA) model for conducting structural comparisons of proteins, we propose higher-order LDA-obtained topic-based representations of protein structures to provide an alternative route for remote homology detection and organization of the protein structure space in few dimensions. Various techniques based on natural language processing are proposed and employed to aid the analysis of topics in the protein structure domain. Results We show that a topic-based representation is just as effective as a fragment-based one at automated detection of remote homologs and organization of protein structure space. We conduct a detailed analysis of the information content in the topic-based representation, showing that topics have semantic meaning. The fragment-based and topic-based representations are also shown to allow prediction of superfamily membership. Conclusions This work opens exciting venues in designing novel representations to extract information about protein structures, as well as organizing and mining protein structure space with mature text mining tools. PMID:25080993
Key Structures and Interactions for Binding of Mycobacterium tuberculosis Protein Kinase B Inhibitors from Molecular Dynamics Simulation.

PubMed

Punkvang, Auradee; Kamsri, Pharit; Saparpakorn, Patchreenart; Hannongbua, Supa; Wolschann, Peter; Irle, Stephan; Pungpo, Pornpan

2015-07-01

Substituted aminopyrimidine inhibitors have recently been introduced as antituberculosis agents. These inhibitors show impressive activity against protein kinase B, a Ser/Thr protein kinase that is essential for cell growth of M. tuberculosis. However, up to now, X-ray structures of the protein kinase B enzyme complexes with the substituted aminopyrimidine inhibitors are currently unavailable. Consequently, structural details of their binding modes are questionable, prohibiting the structural-based design of more potent protein kinase B inhibitors in the future. Here, molecular dynamics simulations, in conjunction with molecular mechanics/Poisson-Boltzmann surface area binding free-energy analysis, were employed to gain insight into the complex structures of the protein kinase B inhibitors and their binding energetics. The complex structures obtained by the molecular dynamics simulations show binding free energies in good agreement with experiment. The detailed analysis of molecular dynamics results shows that Glu93, Val95, and Leu17 are key residues responsible to the binding of the protein kinase B inhibitors. The aminopyrazole group and the pyrimidine core are the crucial moieties of substituted aminopyrimidine inhibitors for interaction with the key residues. Our results provide a structural concept that can be used as a guide for the future design of protein kinase B inhibitors with highly increased antagonistic activity. © 2014 John Wiley & Sons A/S.
Protein Structure Determination using Metagenome sequence data

PubMed Central

Ovchinnikov, Sergey; Park, Hahnbeom; Varghese, Neha; Huang, Po-Ssu; Pavlopoulos, Georgios A.; Kim, David E.; Kamisetty, Hetunandan; Kyrpides, Nikos C.; Baker, David

2017-01-01

Despite decades of work by structural biologists, there are still ~5200 protein families with unknown structure outside the range of comparative modeling. We show that Rosetta structure prediction guided by residue-residue contacts inferred from evolutionary information can accurately model proteins that belong to large families, and that metagenome sequence data more than triples the number of protein families with sufficient sequences for accurate modeling. We then integrate metagenome data, contact based structure matching and Rosetta structure calculations to generate models for 614 protein families with currently unknown structures; 206 are membrane proteins and 137 have folds not represented in the PDB. This approach provides the representative models for large protein families originally envisioned as the goal of the protein structure initiative at a fraction of the cost. PMID:28104891
An Evolution-Based Approach to De Novo Protein Design and Case Study on Mycobacterium tuberculosis

PubMed Central

Brender, Jeffrey R.; Czajka, Jeff; Marsh, David; Gray, Felicia; Cierpicki, Tomasz; Zhang, Yang

2013-01-01

Computational protein design is a reverse procedure of protein folding and structure prediction, where constructing structures from evolutionarily related proteins has been demonstrated to be the most reliable method for protein 3-dimensional structure prediction. Following this spirit, we developed a novel method to design new protein sequences based on evolutionarily related protein families. For a given target structure, a set of proteins having similar fold are identified from the PDB library by structural alignments. A structural profile is then constructed from the protein templates and used to guide the conformational search of amino acid sequence space, where physicochemical packing is accommodated by single-sequence based solvation, torsion angle, and secondary structure predictions. The method was tested on a computational folding experiment based on a large set of 87 protein structures covering different fold classes, which showed that the evolution-based design significantly enhances the foldability and biological functionality of the designed sequences compared to the traditional physics-based force field methods. Without using homologous proteins, the designed sequences can be folded with an average root-mean-square-deviation of 2.1 Å to the target. As a case study, the method is extended to redesign all 243 structurally resolved proteins in the pathogenic bacteria Mycobacterium tuberculosis, which is the second leading cause of death from infectious disease. On a smaller scale, five sequences were randomly selected from the design pool and subjected to experimental validation. The results showed that all the designed proteins are soluble with distinct secondary structure and three have well ordered tertiary structure, as demonstrated by circular dichroism and NMR spectroscopy. Together, these results demonstrate a new avenue in computational protein design that uses knowledge of evolutionary conservation from protein structural families to engineer new protein molecules of improved fold stability and biological functionality. PMID:24204234
Comparative analyses of quaternary arrangements in homo-oligomeric proteins in superfamilies: Functional implications.

PubMed

Sudha, Govindarajan; Srinivasan, Narayanaswamy

2016-09-01

A comprehensive analysis of the quaternary features of distantly related homo-oligomeric proteins is the focus of the current study. This study has been performed at the levels of quaternary state, symmetry, and quaternary structure. Quaternary state and quaternary structure refers to the number of subunits and spatial arrangements of subunits, respectively. Using a large dataset of available 3D structures of biologically relevant assemblies, we show that only 53% of the distantly related homo-oligomeric proteins have the same quaternary state. Considering these homologous homo-oligomers with the same quaternary state, conservation of quaternary structures is observed only in 38% of the pairs. In 36% of the pairs of distantly related homo-oligomers with different quaternary states the larger assembly in a pair shows high structural similarity with the entire quaternary structure of the related protein with lower quaternary state and it is referred as "Russian doll effect." The differences in quaternary state and structure have been suggested to contribute to the functional diversity. Detailed investigations show that even though the gross functions of many distantly related homo-oligomers are the same, finer level differences in molecular functions are manifested by differences in quaternary states and structures. Comparison of structures of biological assemblies in distantly and closely related homo-oligomeric proteins throughout the study differentiates the effects of sequence divergence on the quaternary structures and function. Knowledge inferred from this study can provide insights for improved protein structure classification and function prediction of homo-oligomers. Proteins 2016; 84:1190-1202. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Projections for fast protein structure retrieval

PubMed Central

Bhattacharya, Sourangshu; Bhattacharyya, Chiranjib; Chandra, Nagasuma R

2006-01-01

Background In recent times, there has been an exponential rise in the number of protein structures in databases e.g. PDB. So, design of fast algorithms capable of querying such databases is becoming an increasingly important research issue. This paper reports an algorithm, motivated from spectral graph matching techniques, for retrieving protein structures similar to a query structure from a large protein structure database. Each protein structure is specified by the 3D coordinates of residues of the protein. The algorithm is based on a novel characterization of the residues, called projections, leading to a similarity measure between the residues of the two proteins. This measure is exploited to efficiently compute the optimal equivalences. Results Experimental results show that, the current algorithm outperforms the state of the art on benchmark datasets in terms of speed without losing accuracy. Search results on SCOP 95% nonredundant database, for fold similarity with 5 proteins from different SCOP classes show that the current method performs competitively with the standard algorithm CE. The algorithm is also capable of detecting non-topological similarities between two proteins which is not possible with most of the state of the art tools like Dali. PMID:17254310
A Group 6 Late Embryogenesis Abundant Protein from Common Bean Is a Disordered Protein with Extended Helical Structure and Oligomer-forming Properties*

PubMed Central

Rivera-Najera, Lucero Y.; Saab-Rincón, Gloria; Battaglia, Marina; Amero, Carlos; Pulido, Nancy O.; García-Hernández, Enrique; Solórzano, Rosa M.; Reyes, José L.; Covarrubias, Alejandra A.

2014-01-01

Late embryogenesis-abundant proteins accumulate to high levels in dry seeds. Some of them also accumulate in response to water deficit in vegetative tissues, which leads to a remarkable association between their presence and low water availability conditions. A major sub-group of these proteins, also known as typical LEA proteins, shows high hydrophilicity and a high percentage of glycine and other small amino acid residues, distinctive physicochemical properties that predict a high content of structural disorder. Although all typical LEA proteins share these characteristics, seven groups can be distinguished by sequence similarity, indicating structural and functional diversity among them. Some of these groups have been extensively studied; however, others require a more detailed analysis to advance in their functional understanding. In this work, we report the structural characterization of a group 6 LEA protein from a common bean (Phaseolus vulgaris L.) (PvLEA6) by circular dichroism and nuclear magnetic resonance showing that it is a disordered protein in aqueous solution. Using the same techniques, we show that despite its unstructured nature, the addition of trifluoroethanol exhibited an intrinsic potential in this protein to gain helicity. This property was also promoted by high osmotic potentials or molecular crowding. Furthermore, we demonstrate that PvLEA6 protein is able to form soluble homo-oligomeric complexes that also show high levels of structural disorder. The association between PvLEA6 monomers to form dimers was shown to occur in plant cells by bimolecular fluorescence complementation, pointing to the in vivo functional relevance of this association. PMID:25271167
Fourier-based classification of protein secondary structures.

PubMed

Shu, Jian-Jun; Yong, Kian Yan

2017-04-15

The correct prediction of protein secondary structures is one of the key issues in predicting the correct protein folded shape, which is used for determining gene function. Existing methods make use of amino acids properties as indices to classify protein secondary structures, but are faced with a significant number of misclassifications. The paper presents a technique for the classification of protein secondary structures based on protein "signal-plotting" and the use of the Fourier technique for digital signal processing. New indices are proposed to classify protein secondary structures by analyzing hydrophobicity profiles. The approach is simple and straightforward. Results show that the more types of protein secondary structures can be classified by means of these newly-proposed indices. Copyright © 2017 Elsevier Inc. All rights reserved.
In Search of Functional Advantages of Knots in Proteins.

PubMed

Dabrowski-Tumanski, Pawel; Stasiak, Andrzej; Sulkowska, Joanna I

2016-01-01

We analysed the structure of deeply knotted proteins representing three unrelated families of knotted proteins. We looked at the correlation between positions of knotted cores in these proteins and such local structural characteristics as the number of intra-chain contacts, structural stability and solvent accessibility. We observed that the knotted cores and especially their borders showed strong enrichment in the number of contacts. These regions showed also increased thermal stability, whereas their solvent accessibility was decreased. Interestingly, the active sites within these knotted proteins preferentially located in the regions with increased number of contacts that also have increased thermal stability and decreased solvent accessibility. Our results suggest that knotting of polypeptide chains provides a favourable environment for the active sites observed in knotted proteins. Some knotted proteins have homologues without a knot. Interestingly, these unknotted homologues form local entanglements that retain structural characteristics of the knotted cores.
Process and Formulation Effects on Protein Structure in Lyophilized Solids using Mass Spectrometric Methods

PubMed Central

Iyer, Lavanya K.; Sacha, Gregory A.; Moorthy, Balakrishnan S.; Nail, Steven L.; Topp, Elizabeth M.

2016-01-01

Myoglobin (Mb) was lyophilized in the absence (Mb-A) and presence (Mb-B) of sucrose in a pilot-scale lyophilizer with or without controlled ice nucleation. Cake morphology was characterized using scanning electron microscopy (SEM) and changes in protein structure were monitored using solid-state Fourier-transform infrared spectroscopy (ssFTIR), solid-state hydrogen-deuterium exchange-mass spectrometry (ssHDX-MS) and solid-state photolytic labeling-mass spectrometry (ssPL-MS). The results showed greater variability in nucleation temperature and irregular cake structure for formulations lyophilized without controlled nucleation. Controlled nucleation resulted in nucleation at ~ −5 °C and uniform cake structure. Formulations containing sucrose showed better retention of protein structure by all measures than formulations without sucrose. Samples lyophilized with and without controlled nucleation were similar by most measures of protein structure. However, ssPL-MS showed the greatest pLeu incorporation and more labeled regions for Mb-B lyophilized with controlled nucleation. The data support the use of ssHDX-MS and ssPL-MS to study formulation and process-induced conformational changes in lyophilized proteins. PMID:27044943
The crystal structure of Erwinia amylovora AmyR, a member of the YbjN protein family, shows similarity to type III secretion chaperones but suggests different cellular functions

PubMed Central

Bartho, Joseph D.; Bellini, Dom; Wuerges, Jochen; Demitri, Nicola; Toccafondi, Mirco; Schmitt, Armin O.; Zhao, Youfu; Walsh, Martin A.

2017-01-01

AmyR is a stress and virulence associated protein from the plant pathogenic Enterobacteriaceae species Erwinia amylovora, and is a functionally conserved ortholog of YbjN from Escherichia coli. The crystal structure of E. amylovora AmyR reveals a class I type III secretion chaperone-like fold, despite the lack of sequence similarity between these two classes of protein and lacking any evidence of a secretion-associated role. The results indicate that AmyR, and YbjN proteins in general, function through protein-protein interactions without any enzymatic action. The YbjN proteins of Enterobacteriaceae show remarkably low sequence similarity with other members of the YbjN protein family in Eubacteria, yet a high level of structural conservation is observed. Across the YbjN protein family sequence conservation is limited to residues stabilising the protein core and dimerization interface, while interacting regions are only conserved between closely related species. This study presents the first structure of a YbjN protein from Enterobacteriaceae, the most highly divergent and well-studied subgroup of YbjN proteins, and an in-depth sequence and structural analysis of this important but poorly understood protein family. PMID:28426806
The crystal structure of Erwinia amylovora AmyR, a member of the YbjN protein family, shows similarity to type III secretion chaperones but suggests different cellular functions.

PubMed

Bartho, Joseph D; Bellini, Dom; Wuerges, Jochen; Demitri, Nicola; Toccafondi, Mirco; Schmitt, Armin O; Zhao, Youfu; Walsh, Martin A; Benini, Stefano

2017-01-01

AmyR is a stress and virulence associated protein from the plant pathogenic Enterobacteriaceae species Erwinia amylovora, and is a functionally conserved ortholog of YbjN from Escherichia coli. The crystal structure of E. amylovora AmyR reveals a class I type III secretion chaperone-like fold, despite the lack of sequence similarity between these two classes of protein and lacking any evidence of a secretion-associated role. The results indicate that AmyR, and YbjN proteins in general, function through protein-protein interactions without any enzymatic action. The YbjN proteins of Enterobacteriaceae show remarkably low sequence similarity with other members of the YbjN protein family in Eubacteria, yet a high level of structural conservation is observed. Across the YbjN protein family sequence conservation is limited to residues stabilising the protein core and dimerization interface, while interacting regions are only conserved between closely related species. This study presents the first structure of a YbjN protein from Enterobacteriaceae, the most highly divergent and well-studied subgroup of YbjN proteins, and an in-depth sequence and structural analysis of this important but poorly understood protein family.
Structural changes in gluten protein structure after addition of emulsifier. A Raman spectroscopy study

NASA Astrophysics Data System (ADS)

Ferrer, Evelina G.; Gómez, Analía V.; Añón, María C.; Puppo, María C.

2011-06-01

Food protein product, gluten protein, was chemically modified by varying levels of sodium stearoyl lactylate (SSL); and the extent of modifications (secondary and tertiary structures) of this protein was analyzed by using Raman spectroscopy. Analysis of the Amide I band showed an increase in its intensity mainly after the addition of the 0.25% of SSL to wheat flour to produced modified gluten protein, pointing the formation of a more ordered structure. Side chain vibrations also confirmed the observed changes.
Application potential of ATR-FT/IR molecular spectroscopy in animal nutrition: revelation of protein molecular structures of canola meal and presscake, as affected by heat-processing methods, in relationship with their protein digestive behavior and utilization for dairy cattle.

PubMed

Theodoridou, Katerina; Yu, Peiqiang

2013-06-12

Protein quality relies not only on total protein but also on protein inherent structures. The most commonly occurring protein secondary structures (α-helix and β-sheet) may influence protein quality, nutrient utilization, and digestive behavior. The objectives of this study were to reveal the protein molecular structures of canola meal (yellow and brown) and presscake as affected by the heat-processing methods and to investigate the relationship between structure changes and protein rumen degradations kinetics, estimated protein intestinal digestibility, degraded protein balance, and metabolizable protein. Heat-processing conditions resulted in a higher value for α-helix and β-sheet for brown canola presscake compared to brown canola meal. The multivariate molecular spectral analyses (PCA, CLA) showed that there were significant molecular structural differences in the protein amide I and II fingerprint region (ca. 1700-1480 cm(-1)) between the brown canola meal and presscake. The in situ degradation parameters, amide I and II, and α-helix to β-sheet ratio (R_a_β) were positively correlated with the degradable fraction and the degradation rate. Modeling results showed that α-helix was positively correlated with the truly absorbed rumen synthesized microbial protein in the small intestine when using both the Dutch DVE/OEB system and the NRC-2001 model. Concerning the protein profiles, R_a_β was a better predictor for crude protein (79%) and for neutral detergent insoluble crude protein (68%). In conclusion, ATR-FT/IR molecular spectroscopy may be used to rapidly characterize feed structures at the molecular level and also as a potential predictor of feed functionality, digestive behavior, and nutrient utilization of canola feed.
Effect of thermal processing on estimated metabolizable protein supply to dairy cattle from camelina seeds: relationship with protein molecular structural changes.

PubMed

Peng, Quanhui; Khan, Nazir A; Wang, Zhisheng; Zhang, Xuewei; Yu, Peiqiang

2014-08-20

This study evaluated the effect of thermal processing on the estimated metabolizable protein (MP) supply to dairy cattle from camelina seeds (Camelina sativa L. Crantz) and determined the relationship between heat-induced changes in protein molecular structural characteristics and the MP supply. Seeds from two camelina varieties were sampled in two consecutive years and were either kept raw or were heated in an autoclave (moist heating) or in an air-draft oven (dry heating) at 120 °C for 1 h. The MP supply to dairy cattle was modeled by three commonly used protein evaluation systems. The protein molecular structures were analyzed by Fourier transform/infrared-attenuated total reflectance molecular spectroscopy. The results showed that both the dry and moist heating increased the contents of truly absorbable rumen-undegraded protein (ARUP) and total MP and decreased the degraded protein balance (DPB). However, the moist-heated camelina seeds had a significantly higher (P < 0.05) content of ARUP and total MP and a significantly lower (P < 0.05) content of DPB than did the dry-heated camelina seeds. The regression equations showed that intensities of the protein molecular structural bands can be used to estimate the contents of ARUP, MP, and DPB with high accuracy (R(2) > 0.70). These results show that protein molecular structural characteristics can be used to rapidly assess the MP supply to dairy cattle from raw and heat-treated camelina seeds.
Buried and accessible surface area control intrinsic protein flexibility.

PubMed

Marsh, Joseph A

2013-09-09

Proteins experience a wide variety of conformational dynamics that can be crucial for facilitating their diverse functions. How is the intrinsic flexibility required for these motions encoded in their three-dimensional structures? Here, the overall flexibility of a protein is demonstrated to be tightly coupled to the total amount of surface area buried within its fold. A simple proxy for this, the relative solvent-accessible surface area (Arel), therefore shows excellent agreement with independent measures of global protein flexibility derived from various experimental and computational methods. Application of Arel on a large scale demonstrates its utility by revealing unique sequence and structural properties associated with intrinsic flexibility. In particular, flexibility as measured by Arel shows little correspondence with intrinsic disorder, but instead tends to be associated with multiple domains and increased α-helical structure. Furthermore, the apparent flexibility of monomeric proteins is found to be useful for identifying quaternary-structure errors in published crystal structures. There is also a strong tendency for the crystal structures of more flexible proteins to be solved to lower resolutions. Finally, local solvent accessibility is shown to be a primary determinant of local residue flexibility. Overall, this work provides both fundamental mechanistic insight into the origin of protein flexibility and a simple, practical method for predicting flexibility from protein structures. © 2013 Elsevier Ltd. All rights reserved.
Assessing the Potential of Folded Globular Polyproteins As Hydrogel Building Blocks

PubMed Central

2016-01-01

The native states of proteins generally have stable well-defined folded structures endowing these biomolecules with specific functionality and molecular recognition abilities. Here we explore the potential of using folded globular polyproteins as building blocks for hydrogels. Photochemically cross-linked hydrogels were produced from polyproteins containing either five domains of I27 ((I27)5), protein L ((pL)5), or a 1:1 blend of these proteins. SAXS analysis showed that (I27)5 exists as a single rod-like structure, while (pL)5 shows signatures of self-aggregation in solution. SANS measurements showed that both polyprotein hydrogels have a similar nanoscopic structure, with protein L hydrogels being formed from smaller and more compact clusters. The polyprotein hydrogels showed small energy dissipation in a load/unload cycle, which significantly increased when the hydrogels were formed in the unfolded state. This study demonstrates the use of folded proteins as building blocks in hydrogels, and highlights the potential versatility that can be offered in tuning the mechanical, structural, and functional properties of polyproteins. PMID:28006103
Crystal structure and confirmation of the alanine:glyoxylate aminotransferase activity of the YFL030w yeast protein.

PubMed

Meyer, Philippe; Liger, Dominique; Leulliot, Nicolas; Quevillon-Cheruel, Sophie; Zhou, Cong-Zhao; Borel, Franck; Ferrer, Jean-Luc; Poupon, Anne; Janin, Joël; van Tilbeurgh, Herman

2005-12-01

We have determined the three-dimensional crystal structure of the protein encoded by the open reading frame YFL030w from Saccharomyces cerevisiae to a resolution of 2.6 A using single wavelength anomalous diffraction. YFL030w is a 385 amino-acid protein with sequence similarity to the aminotransferase family. The structure of the protein reveals a homodimer adopting the fold-type I of pyridoxal 5'-phosphate (PLP)-dependent aminotransferases. The PLP co-factor is covalently bound to the active site in the crystal structure. The protein shows close structural resemblance with the human alanine:glyoxylate aminotransferase (EC 2.6.1.44), an enzyme involved in the hereditary kidney stone disease primary hyperoxaluria type 1. In this paper we show that YFL030w codes for an alanine:glyoxylate aminotransferase, highly specific for its amino donor and acceptor substrates.
Utilization of protein intrinsic disorder knowledge in structural proteomics

PubMed Central

Oldfield, Christopher J.; Xue, Bin; Van, Ya-Yue; Ulrich, Eldon L.; Markley, John L.; Dunker, A. Keith; Uversky, Vladimir N.

2014-01-01

Intrinsically disordered proteins (IDPs) and proteins with long disordered regions are highly abundant in various proteomes. Despite their lack of well-defined ordered structure, these proteins and regions are frequently involved in crucial biological processes. Although in recent years these proteins have attracted the attention of many researchers, IDPs represent a significant challenge for structural characterization since these proteins can impact many of the processes in the structure determination pipeline. Here we investigate the effects of IDPs on the structure determination process and the utility of disorder prediction in selecting and improving proteins for structural characterization. Examination of the extent of intrinsic disorder in existing crystal structures found that relatively few protein crystal structures contain extensive regions of intrinsic disorder. Although intrinsic disorder is not the only cause of crystallization failures and many structured proteins cannot be crystallized, filtering out highly disordered proteins from structure-determination target lists is still likely to be cost effective. Therefore it is desirable to avoid highly disordered proteins from structure-determination target lists and we show that disorder prediction can be applied effectively to enrich structure determination pipelines with proteins more likely to yield crystal structures. For structural investigation of specific proteins, disorder prediction can be used to improve targets for structure determination. Finally, a framework for considering intrinsic disorder in the structure determination pipeline is proposed. PMID:23232152

General overview on structure prediction of twilight-zone proteins.

PubMed

Khor, Bee Yin; Tye, Gee Jun; Lim, Theam Soon; Choong, Yee Siew

2015-09-04

Protein structure prediction from amino acid sequence has been one of the most challenging aspects in computational structural biology despite significant progress in recent years showed by critical assessment of protein structure prediction (CASP) experiments. When experimentally determined structures are unavailable, the predictive structures may serve as starting points to study a protein. If the target protein consists of homologous region, high-resolution (typically <1.5 Å) model can be built via comparative modelling. However, when confronted with low sequence similarity of the target protein (also known as twilight-zone protein, sequence identity with available templates is less than 30%), the protein structure prediction has to be initiated from scratch. Traditionally, twilight-zone proteins can be predicted via threading or ab initio method. Based on the current trend, combination of different methods brings an improved success in the prediction of twilight-zone proteins. In this mini review, the methods, progresses and challenges for the prediction of twilight-zone proteins were discussed.
Protein single-model quality assessment by feature-based probability density functions.

PubMed

Cao, Renzhi; Cheng, Jianlin

2016-04-04

Protein quality assessment (QA) has played an important role in protein structure prediction. We developed a novel single-model quality assessment method-Qprob. Qprob calculates the absolute error for each protein feature value against the true quality scores (i.e. GDT-TS scores) of protein structural models, and uses them to estimate its probability density distribution for quality assessment. Qprob has been blindly tested on the 11th Critical Assessment of Techniques for Protein Structure Prediction (CASP11) as MULTICOM-NOVEL server. The official CASP result shows that Qprob ranks as one of the top single-model QA methods. In addition, Qprob makes contributions to our protein tertiary structure predictor MULTICOM, which is officially ranked 3rd out of 143 predictors. The good performance shows that Qprob is good at assessing the quality of models of hard targets. These results demonstrate that this new probability density distribution based method is effective for protein single-model quality assessment and is useful for protein structure prediction. The webserver of Qprob is available at: http://calla.rnet.missouri.edu/qprob/. The software is now freely available in the web server of Qprob.
Crystal Structure Analysis and the Identification of Distinctive Functional Regions of the Protein Elicitor Mohrip2.

PubMed

Liu, Mengjie; Duan, Liangwei; Wang, Meifang; Zeng, Hongmei; Liu, Xinqi; Qiu, Dewen

2016-01-01

The protein elicitor MoHrip2, which was extracted from Magnaporthe oryzae as an exocrine protein, triggers the tobacco immune system and enhances blast resistance in rice. However, the detailed mechanisms by which MoHrip2 acts as an elicitor remain unclear. Here, we investigated the structure of MoHrip2 to elucidate its functions based on molecular structure. The three-dimensional structure of MoHrip2 was obtained. Overall, the crystal structure formed a β-barrel structure and showed high similarity to the pathogenesis-related (PR) thaumatin superfamily protein thaumatin-like xylanase inhibitor (TL-XI). To investigate the functional regions responsible for MoHrip2 elicitor activities, the full length and eight truncated proteins were expressed in Escherichia coli and were evaluated for elicitor activity in tobacco. Biological function analysis showed that MoHrip2 triggered the defense system against Botrytis cinerea in tobacco. Moreover, only MoHrip2M14 and other fragments containing the 14 amino acids residues in the middle region of the protein showed the elicitor activity of inducing a hypersensitive response and resistance related pathways, which were similar to that of full-length MoHrip2. These results revealed that the central 14 amino acid residues were essential for anti-pathogenic activity.
A method for partitioning the information contained in a protein sequence between its structure and function.

PubMed

Possenti, Andrea; Vendruscolo, Michele; Camilloni, Carlo; Tiana, Guido

2018-05-23

Proteins employ the information stored in the genetic code and translated into their sequences to carry out well-defined functions in the cellular environment. The possibility to encode for such functions is controlled by the balance between the amount of information supplied by the sequence and that left after that the protein has folded into its structure. We study the amount of information necessary to specify the protein structure, providing an estimate that keeps into account the thermodynamic properties of protein folding. We thus show that the information remaining in the protein sequence after encoding for its structure (the 'information gap') is very close to what needed to encode for its function and interactions. Then, by predicting the information gap directly from the protein sequence, we show that it may be possible to use these insights from information theory to discriminate between ordered and disordered proteins, to identify unknown functions, and to optimize artificially-designed protein sequences. This article is protected by copyright. All rights reserved. © 2018 Wiley Periodicals, Inc.
Structural basis for the fast maturation of Arthropoda green fluorescent protein

PubMed Central

Evdokimov, Artem G; Pokross, Matthew E; Egorov, Nikolay S; Zaraisky, Andrey G; Yampolsky, Ilya V; Merzlyak, Ekaterina M; Shkoporov, Andrey N; Sander, Ian; Lukyanov, Konstantin A; Chudakov, Dmitriy M

2006-01-01

Since the cloning of Aequorea victoria green fluorescent protein (GFP) in 1992, a family of known GFP-like proteins has been growing rapidly. Today, it includes more than a hundred proteins with different spectral characteristics cloned from Cnidaria species. For some of these proteins, crystal structures have been solved, showing diversity in chromophore modifications and conformational states. However, we are still far from a complete understanding of the origin, functions and evolution of the GFP family. Novel proteins of the family were recently cloned from evolutionarily distant marine Copepoda species, phylum Arthropoda, demonstrating an extremely rapid generation of fluorescent signal. Here, we have generated a non-aggregating mutant of Copepoda fluorescent protein and solved its high-resolution crystal structure. It was found that the protein β-barrel contains a pore, leading to the chromophore. Using site-directed mutagenesis, we showed that this feature is critical for the fast maturation of the chromophore. PMID:16936637
Alanine and proline content modulate global sensitivity to discrete perturbations in disordered proteins

PubMed Central

Perez, Romel B.; Tischer, Alexander; Auton, Matthew; Whitten, Steven T.

2014-01-01

Molecular transduction of biological signals is understood primarily in terms of the cooperative structural transitions of protein macromolecules, providing a mechanism through which discrete local structure perturbations affect global macromolecular properties. The recognition that proteins lacking tertiary stability, commonly referred to as intrinsically disordered proteins, mediate key signaling pathways suggests that protein structures without cooperative intramolecular interactions may also have the ability to couple local and global structure changes. Presented here are results from experiments that measured and tested the ability of disordered proteins to couple local changes in structure to global changes in structure. Using the intrinsically disordered N-terminal region of the p53 protein as an experimental model, a set of proline and alanine to glycine substitution variants were designed to modulate backbone conformational propensities without introducing non-native intramolecular interactions. The hydrodynamic radius (Rh) was used to monitor changes in global structure. Circular dichroism spectroscopy showed that the glycine substitutions decreased polyproline II (PPII) propensities relative to the wild type, as expected, and fluorescence methods indicated that substitution-induced changes in Rh were not associated with folding. The experiments showed that changes in local PPII structure cause changes in Rh that are variable and that depend on the intrinsic chain propensities of proline and alanine residues, demonstrating a mechanism for coupling local and global structure changes. Molecular simulations that model our results were used to extend the analysis to other proteins and illustrate the generality of the observed proline and alanine effects on the structures of intrinsically disordered proteins. PMID:25244701
The compositional transition of vertebrate genomes: an analysis of the secondary structure of the proteins encoded by human genes.

PubMed

D'Onofrio, Giuseppe; Ghosh, Tapash Chandra

2005-01-17

Fluctuations and increments of both C(3) and G(3) levels along the human coding sequences were investigated comparing two sets of Xenopus/human orthologous genes. The first set of genes shows minor differences of the GC(3) levels, the second shows considerable increments of the GC(3) levels in the human genes. In both data sets, the fluctuations of C(3) and G(3) levels along the coding sequences correlated with the secondary structures of the encoded proteins. The human genes that underwent the compositional transition showed a different increment of the C(3) and G(3) levels within and among the structural units of the proteins. The relative synonymous codon usage (RSCU) of several amino acids were also affected during the compositional transition, showing that there exists a correlation between RSCU and protein secondary structures in human genes. The importance of natural selection for the formation of isochore organization of the human genome has been discussed on the basis of these results.
G-LoSA for Prediction of Protein-Ligand Binding Sites and Structures.

PubMed

Lee, Hui Sun; Im, Wonpil

2017-01-01

Recent advances in high-throughput structure determination and computational protein structure prediction have significantly enriched the universe of protein structure. However, there is still a large gap between the number of available protein structures and that of proteins with annotated function in high accuracy. Computational structure-based protein function prediction has emerged to reduce this knowledge gap. The identification of a ligand binding site and its structure is critical to the determination of a protein's molecular function. We present a computational methodology for predicting small molecule ligand binding site and ligand structure using G-LoSA, our protein local structure alignment and similarity measurement tool. All the computational procedures described here can be easily implemented using G-LoSA Toolkit, a package of standalone software programs and preprocessed PDB structure libraries. G-LoSA and G-LoSA Toolkit are freely available to academic users at http://compbio.lehigh.edu/GLoSA . We also illustrate a case study to show the potential of our template-based approach harnessing G-LoSA for protein function prediction.
Binding of DNA-bending non-histone proteins destabilizes regular 30-nm chromatin structure

PubMed Central

Bajpai, Gaurav; Jain, Ishutesh; Inamdar, Mandar M.; Das, Dibyendu; Padinhateeri, Ranjith

2017-01-01

Why most of the in vivo experiments do not find the 30-nm chromatin fiber, well studied in vitro, is a puzzle. Two basic physical inputs that are crucial for understanding the structure of the 30-nm fiber are the stiffness of the linker DNA and the relative orientations of the DNA entering/exiting nucleosomes. Based on these inputs we simulate chromatin structure and show that the presence of non-histone proteins, which bind and locally bend linker DNA, destroys any regular higher order structures (e.g., zig-zag). Accounting for the bending geometry of proteins like nhp6 and HMG-B, our theory predicts phase-diagram for the chromatin structure as a function of DNA-bending non-histone protein density and mean linker DNA length. For a wide range of linker lengths, we show that as we vary one parameter, that is, the fraction of bent linker region due to non-histone proteins, the steady-state structure will show a transition from zig-zag to an irregular structure—a structure that is reminiscent of what is observed in experiments recently. Our theory can explain the recent in vivo observation of irregular chromatin having co-existence of finite fraction of the next-neighbor (i + 2) and neighbor (i + 1) nucleosome interactions. PMID:28135276
Three-dimensional (3D) structure prediction of the American and African oil-palms β-ketoacyl-[ACP] synthase-II protein by comparative modelling

PubMed Central

Wang, Edina; Chinni, Suresh; Bhore, Subhash Janardhan

2014-01-01

Background: The fatty-acid profile of the vegetable oils determines its properties and nutritional value. Palm-oil obtained from the African oil-palm [Elaeis guineensis Jacq. (Tenera)] contains 44% palmitic acid (C16:0), but, palm-oil obtained from the American oilpalm [Elaeis oleifera] contains only 25% C16:0. In part, the b-ketoacyl-[ACP] synthase II (KASII) [EC: 2.3.1.179] protein is responsible for the high level of C16:0 in palm-oil derived from the African oil-palm. To understand more about E. guineensis KASII (EgKASII) and E. oleifera KASII (EoKASII) proteins, it is essential to know its structures. Hence, this study was undertaken. Objective: The objective of this study was to predict three-dimensional (3D) structure of EgKASII and EoKASII proteins using molecular modelling tools. Materials and Methods: The amino-acid sequences for KASII proteins were retrieved from the protein database of National Center for Biotechnology Information (NCBI), USA. The 3D structures were predicted for both proteins using homology modelling and ab-initio technique approach of protein structure prediction. The molecular dynamics (MD) simulation was performed to refine the predicted structures. The predicted structure models were evaluated and root mean square deviation (RMSD) and root mean square fluctuation (RMSF) values were calculated. Results: The homology modelling showed that EgKASII and EoKASII proteins are 78% and 74% similar with Streptococcus pneumonia KASII and Brucella melitensis KASII, respectively. The EgKASII and EoKASII structures predicted by using ab-initio technique approach shows 6% and 9% deviation to its structures predicted by homology modelling, respectively. The structure refinement and validation confirmed that the predicted structures are accurate. Conclusion: The 3D structures for EgKASII and EoKASII proteins were predicted. However, further research is essential to understand the interaction of EgKASII and EoKASII proteins with its substrates. PMID:24748752
Three-dimensional (3D) structure prediction of the American and African oil-palms β-ketoacyl-[ACP] synthase-II protein by comparative modelling.

PubMed

Wang, Edina; Chinni, Suresh; Bhore, Subhash Janardhan

2014-01-01

The fatty-acid profile of the vegetable oils determines its properties and nutritional value. Palm-oil obtained from the African oil-palm [Elaeis guineensis Jacq. (Tenera)] contains 44% palmitic acid (C16:0), but, palm-oil obtained from the American oilpalm [Elaeis oleifera] contains only 25% C16:0. In part, the b-ketoacyl-[ACP] synthase II (KASII) [EC: 2.3.1.179] protein is responsible for the high level of C16:0 in palm-oil derived from the African oil-palm. To understand more about E. guineensis KASII (EgKASII) and E. oleifera KASII (EoKASII) proteins, it is essential to know its structures. Hence, this study was undertaken. The objective of this study was to predict three-dimensional (3D) structure of EgKASII and EoKASII proteins using molecular modelling tools. The amino-acid sequences for KASII proteins were retrieved from the protein database of National Center for Biotechnology Information (NCBI), USA. The 3D structures were predicted for both proteins using homology modelling and ab-initio technique approach of protein structure prediction. The molecular dynamics (MD) simulation was performed to refine the predicted structures. The predicted structure models were evaluated and root mean square deviation (RMSD) and root mean square fluctuation (RMSF) values were calculated. The homology modelling showed that EgKASII and EoKASII proteins are 78% and 74% similar with Streptococcus pneumonia KASII and Brucella melitensis KASII, respectively. The EgKASII and EoKASII structures predicted by using ab-initio technique approach shows 6% and 9% deviation to its structures predicted by homology modelling, respectively. The structure refinement and validation confirmed that the predicted structures are accurate. The 3D structures for EgKASII and EoKASII proteins were predicted. However, further research is essential to understand the interaction of EgKASII and EoKASII proteins with its substrates.
Dissecting protein loops with a statistical scalpel suggests a functional implication of some structural motifs.

PubMed

Regad, Leslie; Martin, Juliette; Camproux, Anne-Claude

2011-06-20

One of the strategies for protein function annotation is to search particular structural motifs that are known to be shared by proteins with a given function. Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet), which allows simplification of protein structures into uni-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i) ubiquitous motifs, shared by several superfamilies and (ii) superfamily-specific motifs, over-represented in few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in the binding sites of small ligands, such as ATP/GTP, NAD(P) and SAH/SAM. Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins.
Dissecting protein loops with a statistical scalpel suggests a functional implication of some structural motifs

PubMed Central

2011-01-01

Background One of the strategies for protein function annotation is to search particular structural motifs that are known to be shared by proteins with a given function. Results Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet), which allows simplification of protein structures into uni-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i) ubiquitous motifs, shared by several superfamilies and (ii) superfamily-specific motifs, over-represented in few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in the binding sites of small ligands, such as ATP/GTP, NAD(P) and SAH/SAM. Conclusions Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins. PMID:21689388
Finding Correlation between Protein Protein Interaction Modules Using Semantic Web Techniques

NASA Astrophysics Data System (ADS)

Kargar, Mehdi; Moaven, Shahrouz; Abolhassani, Hassan

Many complex networks such as social networks and computer show modular structures, where edges between nodes are much denser within modules than between modules. It is strongly believed that cellular networks are also modular, reflecting the relative independence and coherence of different functional units in a cell. In this paper we used a human curated dataset. In this paper we consider each module in the PPI network as ontology. Using techniques in ontology alignment, we compare each pair of modules in the network. We want to see that is there a correlation between the structure of each module or they have totally different structures. Our results show that there is no correlation between proteins in a protein protein interaction network.
From protein sequence to dynamics and disorder with DynaMine.

PubMed

Cilia, Elisa; Pancsa, Rita; Tompa, Peter; Lenaerts, Tom; Vranken, Wim F

2013-01-01

Protein function and dynamics are closely related; however, accurate dynamics information is difficult to obtain. Here based on a carefully assembled data set derived from experimental data for proteins in solution, we quantify backbone dynamics properties on the amino-acid level and develop DynaMine--a fast, high-quality predictor of protein backbone dynamics. DynaMine uses only protein sequence information as input and shows great potential in distinguishing regions of different structural organization, such as folded domains, disordered linkers, molten globules and pre-structured binding motifs of different sizes. It also identifies disordered regions within proteins with an accuracy comparable to the most sophisticated existing predictors, without depending on prior disorder knowledge or three-dimensional structural information. DynaMine provides molecular biologists with an important new method that grasps the dynamical characteristics of any protein of interest, as we show here for human p53 and E1A from human adenovirus 5.
Modeling Protein Excited-state Structures from "Over-length" Chemical Cross-links.

PubMed

Ding, Yue-He; Gong, Zhou; Dong, Xu; Liu, Kan; Liu, Zhu; Liu, Chao; He, Si-Min; Dong, Meng-Qiu; Tang, Chun

2017-01-27

Chemical cross-linking coupled with mass spectroscopy (CXMS) provides proximity information for the cross-linked residues and is used increasingly for modeling protein structures. However, experimentally identified cross-links are sometimes incompatible with the known structure of a protein, as the distance calculated between the cross-linked residues far exceeds the maximum length of the cross-linker. The discrepancies may persist even after eliminating potentially false cross-links and excluding intermolecular ones. Thus the "over-length" cross-links may arise from alternative excited-state conformation of the protein. Here we present a method and associated software DynaXL for visualizing the ensemble structures of multidomain proteins based on intramolecular cross-links identified by mass spectrometry with high confidence. Representing the cross-linkers and cross-linking reactions explicitly, we show that the protein excited-state structure can be modeled with as few as two over-length cross-links. We demonstrate the generality of our method with three systems: calmodulin, enzyme I, and glutamine-binding protein, and we show that these proteins alternate between different conformations for interacting with other proteins and ligands. Taken together, the over-length chemical cross-links contain valuable information about protein dynamics, and our findings here illustrate the relationship between dynamic domain movement and protein function. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
An Evolutionarily Structured Universe of Protein Architecture

PubMed Central

Caetano-Anollés, Gustavo; Caetano-Anollés, Derek

2003-01-01

Protein structural diversity encompasses a finite set of architectural designs. Embedded in these topologies are evolutionary histories that we here uncover using cladistic principles and measurements of protein-fold usage and sharing. The reconstructed phylogenies are inherently rooted and depict histories of protein and proteome diversification. Proteome phylogenies showed two monophyletic sister-groups delimiting Bacteria and Archaea, and a topology rooted in Eucarya. This suggests three dramatic evolutionary events and a common ancestor with a eukaryotic-like, gene-rich, and relatively modern organization. Conversely, a general phylogeny of protein architectures showed that structural classes of globular proteins appeared early in evolution and in defined order, the α/β class being the first. Although most ancestral folds shared a common architecture of barrels or interleaved β-sheets and α-helices, many were clearly derived, such as polyhedral folds in the all-α class and β-sandwiches, β-propellers, and β-prisms in all-β proteins. We also describe transformation pathways of architectures that are prevalently used in nature. For example, β-barrels with increased curl and stagger were favored evolutionary outcomes in the all-β class. Interestingly, we found cases where structural change followed the α-to-β tendency uncovered in the tree of architectures. Lastly, we traced the total number of enzymatic functions associated with folds in the trees and show that there is a general link between structure and enzymatic function. PMID:12840035
Alanine and proline content modulate global sensitivity to discrete perturbations in disordered proteins.

PubMed

Perez, Romel B; Tischer, Alexander; Auton, Matthew; Whitten, Steven T

2014-12-01

Molecular transduction of biological signals is understood primarily in terms of the cooperative structural transitions of protein macromolecules, providing a mechanism through which discrete local structure perturbations affect global macromolecular properties. The recognition that proteins lacking tertiary stability, commonly referred to as intrinsically disordered proteins (IDPs), mediate key signaling pathways suggests that protein structures without cooperative intramolecular interactions may also have the ability to couple local and global structure changes. Presented here are results from experiments that measured and tested the ability of disordered proteins to couple local changes in structure to global changes in structure. Using the intrinsically disordered N-terminal region of the p53 protein as an experimental model, a set of proline (PRO) and alanine (ALA) to glycine (GLY) substitution variants were designed to modulate backbone conformational propensities without introducing non-native intramolecular interactions. The hydrodynamic radius (R(h)) was used to monitor changes in global structure. Circular dichroism spectroscopy showed that the GLY substitutions decreased polyproline II (PP(II)) propensities relative to the wild type, as expected, and fluorescence methods indicated that substitution-induced changes in R(h) were not associated with folding. The experiments showed that changes in local PP(II) structure cause changes in R(h) that are variable and that depend on the intrinsic chain propensities of PRO and ALA residues, demonstrating a mechanism for coupling local and global structure changes. Molecular simulations that model our results were used to extend the analysis to other proteins and illustrate the generality of the observed PRO and alanine effects on the structures of IDPs. © 2014 Wiley Periodicals, Inc.
Study of the interactions between a proline-rich protein and a flavan-3-ol by NMR: residual structures in the natively unfolded protein provides anchorage points for the ligands.

PubMed

Pascal, Christine; Paté, Franck; Cheynier, Véronique; Delsuc, Marc-André

2009-09-01

Astringency is one of the major organoleptic properties of food and beverages that are made from plants, such as tea, chocolate, beer, or red wine. This sensation is thought to be due to interactions between tannins and salivary proline-rich proteins, which are natively unfolded proteins. A human salivary proline-rich protein, namely IB-5, was produced by the recombinant method. Its interactions with a model tannin, epigallocatechin gallate (EGCG), the major flavan-3-ol in green tea, were studied here. Circular dichroism experiments showed that IB-5 presents residual structures (PPII helices) when the ionic strength is close to that in saliva. In the presence of these residual structures, IB-5 undergoes an increase in structural content upon binding to EGCG. NMR data corroborated the presence of preformed structural elements within the protein prior to binding and a partial assignment was proposed, showing partial structuration. TOCSY experiments showed that amino acids that are involved in PPII helices are more likely to interact with EGCG than those in random coil regions, as if they were anchorage points for the ligand. The signal from IB-5 in the DOSY NMR spectrum revealed an increase in polydispersity upon addition of EGCG while the mean hydrodynamic radius remained unchanged. This strongly suggests the formation of IB-5/EGCG aggregates.
Characterization of protein and carbohydrate mid-IR spectral features in crop residues

NASA Astrophysics Data System (ADS)

Xin, Hangshu; Zhang, Yonggen; Wang, Mingjun; Li, Zhongyu; Wang, Zhibo; Yu, Peiqiang

2014-08-01

To the best of our knowledge, a few studies have been conducted on inherent structure spectral traits related to biopolymers of crop residues. The objective of this study was to characterize protein and carbohydrate structure spectral features of three field crop residues (rice straw, wheat straw and millet straw) in comparison with two crop vines (peanut vine and pea vine) by using Fourier transform infrared spectroscopy (FTIR) technique with attenuated total reflectance (ATR). Also, multivariate analyses were performed on spectral data sets within the regions mainly related to protein and carbohydrate in this study. The results showed that spectral differences existed in mid-IR peak intensities that are mainly related to protein and carbohydrate among these crop residue samples. With regard to protein spectral profile, peanut vine showed the greatest mid-IR band intensities that are related to protein amide and protein secondary structures, followed by pea vine and the rest three field crop straws. The crop vines had 48-134% higher spectral band intensity than the grain straws in spectral features associated with protein. Similar trends were also found in the bands that are mainly related to structural carbohydrates (such as cellulosic compounds). However, the field crop residues had higher peak intensity in total carbohydrates region than the crop vines. Furthermore, spectral ratios varied among the residue samples, indicating that these five crop residues had different internal structural conformation. However, multivariate spectral analyses showed that structural similarities still exhibited among crop residues in the regions associated with protein biopolymers and carbohydrate. Further study is needed to find out whether there is any relationship between spectroscopic information and nutrition supply in various kinds of crop residue when fed to animals.

Characterization of protein and carbohydrate mid-IR spectral features in crop residues.

PubMed

Xin, Hangshu; Zhang, Yonggen; Wang, Mingjun; Li, Zhongyu; Wang, Zhibo; Yu, Peiqiang

2014-08-14

To the best of our knowledge, a few studies have been conducted on inherent structure spectral traits related to biopolymers of crop residues. The objective of this study was to characterize protein and carbohydrate structure spectral features of three field crop residues (rice straw, wheat straw and millet straw) in comparison with two crop vines (peanut vine and pea vine) by using Fourier transform infrared spectroscopy (FTIR) technique with attenuated total reflectance (ATR). Also, multivariate analyses were performed on spectral data sets within the regions mainly related to protein and carbohydrate in this study. The results showed that spectral differences existed in mid-IR peak intensities that are mainly related to protein and carbohydrate among these crop residue samples. With regard to protein spectral profile, peanut vine showed the greatest mid-IR band intensities that are related to protein amide and protein secondary structures, followed by pea vine and the rest three field crop straws. The crop vines had 48-134% higher spectral band intensity than the grain straws in spectral features associated with protein. Similar trends were also found in the bands that are mainly related to structural carbohydrates (such as cellulosic compounds). However, the field crop residues had higher peak intensity in total carbohydrates region than the crop vines. Furthermore, spectral ratios varied among the residue samples, indicating that these five crop residues had different internal structural conformation. However, multivariate spectral analyses showed that structural similarities still exhibited among crop residues in the regions associated with protein biopolymers and carbohydrate. Further study is needed to find out whether there is any relationship between spectroscopic information and nutrition supply in various kinds of crop residue when fed to animals. Copyright © 2014 Elsevier B.V. All rights reserved.
Detect the sensitivity and response of protein molecular structure of whole canola seed (yellow and brown) to different heat processing methods and relation to protein utilization and availability using ATR-FT/IR molecular spectroscopy with chemometrics.

PubMed

Samadi; Theodoridou, Katerina; Yu, Peiqiang

2013-03-15

The objectives of this experiment were to detect the sensitivity and response of protein molecular structure of whole canola seed to different heat processing [moisture (autoclaving) vs. dry (roasting) heating] and quantify heat-induced protein molecular structure changes in relation to protein utilization and availability. In this study, whole canola seeds were autoclaved (moisture heating) and dry (roasting) heated at 120 °C for 1h, respectively. The parameters assessed included changes in (1) chemical composition profile, (2) CNCPS protein subfractions (PA, PB1, PB2, PB3, PC), (3) intestinal absorbed true protein supply, (4) energy values, and (5) protein molecular structures (amide I, amide II, ratio of amide I to II, α-helix, β-sheet, ratio of α-helix to β-sheet). The results showed that autoclave heating significantly decreased (P<0.05) but dry heating increased (P<0.05) the ratio of protein α-helix to β-sheet (with the ratios of 1.07, 0.95, 1.10 for the control (raw), autoclave heating and dry heating, respectively). The multivariate molecular spectral analyses (PCA, CLA) showed that there were significantly molecular structural differences in the protein amide I and II fingerprint region (ca. 1714-1480 cm(-1)) among the control, autoclave and dry heating. These differences were indicated by the form of separate class (PCA) and group of separate ellipse (CLA) between the treatments. The correlation analysis with spearman method showed that there were significantly and highly positive correlation (P<0.05) between heat-induced protein molecular structure changes in terms of α-helix to β-sheet ratios and in situ protein degradation and significantly negative correlation between the protein α-helix to β-sheet ratios and intestinal digestibility of undegraded protein. The results indicated that heat-induced changes of protein molecular structure revealed by vibration molecular spectroscopy could be used as a potential predictor to protein degradation and intestinal protein digestion of whole canola seed. Future study is needed to study response and impact of heat processing to each inherent layer of canola seed from outside to inside tissues and between yellow canola and brown canola. Copyright © 2012 Elsevier B.V. All rights reserved.
Packing in protein cores

NASA Astrophysics Data System (ADS)

Gaines, J. C.; Clark, A. H.; Regan, L.; O'Hern, C. S.

2017-07-01

Proteins are biological polymers that underlie all cellular functions. The first high-resolution protein structures were determined by x-ray crystallography in the 1960s. Since then, there has been continued interest in understanding and predicting protein structure and stability. It is well-established that a large contribution to protein stability originates from the sequestration from solvent of hydrophobic residues in the protein core. How are such hydrophobic residues arranged in the core; how can one best model the packing of these residues, and are residues loosely packed with multiple allowed side chain conformations or densely packed with a single allowed side chain conformation? Here we show that to properly model the packing of residues in protein cores it is essential that amino acids are represented by appropriately calibrated atom sizes, and that hydrogen atoms are explicitly included. We show that protein cores possess a packing fraction of φ ≈ 0.56 , which is significantly less than the typically quoted value of 0.74 obtained using the extended atom representation. We also compare the results for the packing of amino acids in protein cores to results obtained for jammed packings from discrete element simulations of spheres, elongated particles, and composite particles with bumpy surfaces. We show that amino acids in protein cores pack as densely as disordered jammed packings of particles with similar values for the aspect ratio and bumpiness as found for amino acids. Knowing the structural properties of protein cores is of both fundamental and practical importance. Practically, it enables the assessment of changes in the structure and stability of proteins arising from amino acid mutations (such as those identified as a result of the massive human genome sequencing efforts) and the design of new folded, stable proteins and protein-protein interactions with tunable specificity and affinity.
Dancing Protein Clouds: The Strange Biology and Chaotic Physics of Intrinsically Disordered Proteins*

PubMed Central

2016-01-01

Biologically active but floppy proteins represent a new reality of modern protein science. These intrinsically disordered proteins (IDPs) and hybrid proteins containing ordered and intrinsically disordered protein regions (IDPRs) constitute a noticeable part of any given proteome. Functionally, they complement ordered proteins, and their conformational flexibility and structural plasticity allow them to perform impossible tricks and be engaged in biological activities that are inaccessible to well folded proteins with their unique structures. The major goals of this minireview are to show that, despite their simplified amino acid sequences, IDPs/IDPRs are complex entities often resembling chaotic systems, are structurally and functionally heterogeneous, and can be considered an important part of the structure-function continuum. Furthermore, IDPs/IDPRs are everywhere, and are ubiquitously engaged in various interactions characterized by a wide spectrum of binding scenarios and an even wider spectrum of structural and functional outputs. PMID:26851286
The phylogenomic roots of modern biochemistry: origins of proteins, cofactors and protein biosynthesis.

PubMed

Caetano-Anollés, Gustavo; Kim, Kyung Mo; Caetano-Anollés, Derek

2012-02-01

The complexity of modern biochemistry developed gradually on early Earth as new molecules and structures populated the emerging cellular systems. Here, we generate a historical account of the gradual discovery of primordial proteins, cofactors, and molecular functions using phylogenomic information in the sequence of 420 genomes. We focus on structural and functional annotations of the 54 most ancient protein domains. We show how primordial functions are linked to folded structures and how their interaction with cofactors expanded the functional repertoire. We also reveal protocell membranes played a crucial role in early protein evolution and show translation started with RNA and thioester cofactor-mediated aminoacylation. Our findings allow elaboration of an evolutionary model of early biochemistry that is firmly grounded in phylogenomic information and biochemical, biophysical, and structural knowledge. The model describes how primordial α-helical bundles stabilized membranes, how these were decorated by layered arrangements of β-sheets and α-helices, and how these arrangements became globular. Ancient forms of aminoacyl-tRNA synthetase (aaRS) catalytic domains and ancient non-ribosomal protein synthetase (NRPS) modules gave rise to primordial protein synthesis and the ability to generate a code for specificity in their active sites. These structures diversified producing cofactor-binding molecular switches and barrel structures. Accretion of domains and molecules gave rise to modern aaRSs, NRPS, and ribosomal ensembles, first organized around novel emerging cofactors (tRNA and carrier proteins) and then more complex cofactor structures (rRNA). The model explains how the generation of protein structures acted as scaffold for nucleic acids and resulted in crystallization of modern translation.
Automatic Classification of Protein Structure Using the Maximum Contact Map Overlap Metric

DOE Office of Scientific and Technical Information (OSTI.GOV)

Andonov, Rumen; Djidjev, Hristo Nikolov; Klau, Gunnar W.

In this paper, we propose a new distance measure for comparing two protein structures based on their contact map representations. We show that our novel measure, which we refer to as the maximum contact map overlap (max-CMO) metric, satisfies all properties of a metric on the space of protein representations. Having a metric in that space allows one to avoid pairwise comparisons on the entire database and, thus, to significantly accelerate exploring the protein space compared to no-metric spaces. We show on a gold standard superfamily classification benchmark set of 6759 proteins that our exact k-nearest neighbor (k-NN) scheme classifiesmore » up to 224 out of 236 queries correctly and on a larger, extended version of the benchmark with 60; 850 additional structures, up to 1361 out of 1369 queries. Finally, our k-NN classification thus provides a promising approach for the automatic classification of protein structures based on flexible contact map overlap alignments.« less
Automatic Classification of Protein Structure Using the Maximum Contact Map Overlap Metric

DOE PAGES

Andonov, Rumen; Djidjev, Hristo Nikolov; Klau, Gunnar W.; ...

2015-10-09

In this paper, we propose a new distance measure for comparing two protein structures based on their contact map representations. We show that our novel measure, which we refer to as the maximum contact map overlap (max-CMO) metric, satisfies all properties of a metric on the space of protein representations. Having a metric in that space allows one to avoid pairwise comparisons on the entire database and, thus, to significantly accelerate exploring the protein space compared to no-metric spaces. We show on a gold standard superfamily classification benchmark set of 6759 proteins that our exact k-nearest neighbor (k-NN) scheme classifiesmore » up to 224 out of 236 queries correctly and on a larger, extended version of the benchmark with 60; 850 additional structures, up to 1361 out of 1369 queries. Finally, our k-NN classification thus provides a promising approach for the automatic classification of protein structures based on flexible contact map overlap alignments.« less
Rescore protein-protein docked ensembles with an interface contact statistics.

PubMed

Mezei, Mihaly

2017-02-01

The recently developed statistical measure for the type of residue-residue contact at protein complex interfaces, based on a parameter-free definition of contact, has been used to define a contact score that is correlated with the likelihood of correctness of a proposed complex structure. Comparing the proposed contact scores on the native structure and on a set of model structures the proposed measure was shown to generally favor the native structure but in itself was not able to reliably score the native structure to be the best. Adjusting the scores of redocking experiments with the contact score showed that the adjusted score was able to move up the ranking of the native-like structure among the proposed complexes when the native-like was not ranked the best by the respective program. Tests on docking of unbound proteins compared the contact scores of the complexes with the contact score of the crystal structure again showing the tendency of the contact score to favor native-like conformations. The possibility of using the contact score to improve the determination of biological dimers in a crystal structure was also explored. Proteins 2017; 85:235-241. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Investigating the Structural Compaction of Biomolecules Upon Transition to the Gas-Phase Using ESI-TWIMS-MS.

PubMed

Devine, Paul W A; Fisher, Henry C; Calabrese, Antonio N; Whelan, Fiona; Higazi, Daniel R; Potts, Jennifer R; Lowe, David C; Radford, Sheena E; Ashcroft, Alison E

2017-09-01

Collision cross-section (CCS) measurements obtained from ion mobility spectrometry-mass spectrometry (IMS-MS) analyses often provide useful information concerning a protein's size and shape and can be complemented by modeling procedures. However, there have been some concerns about the extent to which certain proteins maintain a native-like conformation during the gas-phase analysis, especially proteins with dynamic or extended regions. Here we have measured the CCSs of a range of biomolecules including non-globular proteins and RNAs of different sequence, size, and stability. Using traveling wave IMS-MS, we show that for the proteins studied, the measured CCS deviates significantly from predicted CCS values based upon currently available structures. The results presented indicate that these proteins collapse to different extents varying on their elongated structures upon transition into the gas-phase. Comparing two RNAs of similar mass but different solution structures, we show that these biomolecules may also be susceptible to gas-phase compaction. Together, the results suggest that caution is needed when predicting structural models based on CCS data for RNAs as well as proteins with non-globular folds. Graphical Abstract ᅟ.
Relation between native ensembles and experimental structures of proteins

PubMed Central

Best, Robert B.; Lindorff-Larsen, Kresten; DePristo, Mark A.; Vendruscolo, Michele

2006-01-01

Different experimental structures of the same protein or of proteins with high sequence similarity contain many small variations. Here we construct ensembles of “high-sequence similarity Protein Data Bank” (HSP) structures and consider the extent to which such ensembles represent the structural heterogeneity of the native state in solution. We find that different NMR measurements probing structure and dynamics of given proteins in solution, including order parameters, scalar couplings, and residual dipolar couplings, are remarkably well reproduced by their respective high-sequence similarity Protein Data Bank ensembles; moreover, we show that the effects of uncertainties in structure determination are insufficient to explain the results. These results highlight the importance of accounting for native-state protein dynamics in making comparisons with ensemble-averaged experimental data and suggest that even a modest number of structures of a protein determined under different conditions, or with small variations in sequence, capture a representative subset of the true native-state ensemble. PMID:16829580
Low-resolution structure of Drosophila translin

PubMed Central

Kumar, Vinay; Gupta, Gagan D.

2012-01-01

Crystals of native Drosophila melanogaster translin diffracted to 7 Å resolution. Reductive methylation of the protein improved crystal quality. The native and methylated proteins showed similar profiles in size-exclusion chromatography analyses but the methylated protein displayed reduced DNA-binding activity. Crystals of the methylated protein diffracted to 4.2 Å resolution at BM14 of the ESRF synchrotron. Crystals with 49% solvent content belonged to monoclinic space group P21 with eight protomers in the asymmetric unit. Only 2% of low-resolution structures with similar low percentage solvent content were found in the PDB. The crystal structure, solved by molecular replacement method, refined to Rwork (Rfree) of 0.24 (0.29) with excellent stereochemistry. The crystal structure clearly shows that drosophila protein exists as an octamer, and not as a decamer as expected from gel-filtration elution profiles. The similar octameric quaternary fold in translin orthologs and in translin–TRAX complexes suggests an up-down dimer as the basic structural subunit of translin-like proteins. The drosophila oligomer displays asymmetric assembly and increased radius of gyration that accounts for the observed differences between the elution profiles of human and drosophila proteins on gel-filtration columns. This study demonstrates clearly that low-resolution X-ray structure can be useful in understanding complex biological oligomers. PMID:23650579
Protein domain assignment from the recurrence of locally similar structures

PubMed Central

Tai, Chin-Hsien; Sam, Vichetra; Gibrat, Jean-Francois; Garnier, Jean; Munson, Peter J.

2010-01-01

Domains are basic units of protein structure and essential for exploring protein fold space and structure evolution. With the structural genomics initiative, the number of protein structures in the Protein Databank (PDB) is increasing dramatically and domain assignments need to be done automatically. Most existing structural domain assignment programs define domains using the compactness of the domains and/or the number and strength of intra-domain versus inter-domain contacts. Here we present a different approach based on the recurrence of locally similar structural pieces (LSSPs) found by one-against-all structure comparisons with a dataset of 6,373 protein chains from the PDB. Residues of the query protein are clustered using LSSPs via three different procedures to define domains. This approach gives results that are comparable to several existing programs that use geometrical and other structural information explicitly. Remarkably, most of the proteins that contribute the LSSPs defining a domain do not themselves contain the domain of interest. This study shows that domains can be defined by a collection of relatively small locally similar structural pieces containing, on average, four secondary structure elements. In addition, it indicates that domains are indeed made of recurrent small structural pieces that are used to build protein structures of many different folds as suggested by recent studies. PMID:21287617
Structural study of surfactant-dependent interaction with protein

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mehan, Sumit; Aswal, Vinod K., E-mail: vkaswal@barc.gov.in; Kohlbrecher, Joachim

2015-06-24

Small-angle neutron scattering (SANS) has been used to study the complex structure of anionic BSA protein with three different (cationic DTAB, anionic SDS and non-ionic C12E10) surfactants. These systems form very different surfactant-dependent complexes. We show that the structure of protein-surfactant complex is initiated by the site-specific electrostatic interaction between the components, followed by the hydrophobic interaction at high surfactant concentrations. It is also found that hydrophobic interaction is preferred over the electrostatic interaction in deciding the resultant structure of protein-surfactant complexes.
Structural study of surfactant-dependent interaction with protein

NASA Astrophysics Data System (ADS)

Mehan, Sumit; Aswal, Vinod K.; Kohlbrecher, Joachim

2015-06-01

Small-angle neutron scattering (SANS) has been used to study the complex structure of anionic BSA protein with three different (cationic DTAB, anionic SDS and non-ionic C12E10) surfactants. These systems form very different surfactant-dependent complexes. We show that the structure of protein-surfactant complex is initiated by the site-specific electrostatic interaction between the components, followed by the hydrophobic interaction at high surfactant concentrations. It is also found that hydrophobic interaction is preferred over the electrostatic interaction in deciding the resultant structure of protein-surfactant complexes.
Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions.

PubMed

Krissinel, E; Henrick, K

2004-12-01

The present paper describes the SSM algorithm of protein structure comparison in three dimensions, which includes an original procedure of matching graphs built on the protein's secondary-structure elements, followed by an iterative three-dimensional alignment of protein backbone Calpha atoms. The SSM results are compared with those obtained from other protein comparison servers, and the advantages and disadvantages of different scores that are used for structure recognition are discussed. A new score, balancing the r.m.s.d. and alignment length Nalign, is proposed. It is found that different servers agree reasonably well on the new score, while showing considerable differences in r.m.s.d. and Nalign.
What determines the spectrum of protein native state structures?

PubMed

Lezon, Timothy R; Banavar, Jayanth R; Lesk, Arthur M; Maritan, Amos

2006-05-01

We present a brief summary of the key factors underlying protein structure, as developed in the investigations of Pauling, Ramachandran, and Rose. We then outline a simplified physical model of proteins that focuses on geometry and symmetry. Although this model superficially appears unrelated to the detailed chemical descriptions commonly applied to proteins, we show that it captures the essential elements of the chemistry and provides a unified framework for understanding the common characteristics of folded proteins. We suggest that the spectrum of protein native state structures is determined by geometry and symmetry and the role of the sequence is to choose its native state structure from this predetermined menu. 2006 Wiley-Liss, Inc.
A study of the thermal denaturation of the S-layer protein from Lactobacillus salivarius

NASA Astrophysics Data System (ADS)

Lighezan, Liliana; Georgieva, Ralitsa; Neagu, Adrian

2012-09-01

Surface layer (S-layer) proteins display an intrinsic self-assembly property, forming monomolecular crystalline arrays, identified in outermost structures of the cell envelope in many organisms, such as bacteria and archaea. Isolated S-layer proteins also possess the ability to recrystallize into regular lattices, being used in biotechnological applications, such as controlling the architecture of biomimetic surfaces. To this end, the stability of the S-layer proteins under high-temperature conditions is very important. In this study, the S-layer protein has been isolated from Lactobacillus salivarius 16 strain of human origin, and purified by cation-exchange chromatography. Using circular dichroism (CD) spectroscopy, we have investigated the thermal denaturation of the S-layer protein. The far- and near-UV CD spectra have been collected, and the temperature dependence of the CD signal in these spectral domains has been analyzed. The variable temperature results show that the secondary and tertiary structures of the S-layer protein change irreversibly due to the heating of the sample. After the cooling of the heated protein, the secondary and tertiary structures are partially recovered. The denaturation curves show that the protein unfolding depends on the sample concentration and on the heating rate. The secondary and tertiary structures of the protein suffer changes in the same temperature range. We have also detected an intermediate state in the protein denaturation pathway. Our results on the thermal behavior of the S-layer protein may be important for the use of S-layer proteins in biotechnological applications, as well as for a better understanding of the structure and function of S-layer proteins.
Prediction of Spontaneous Protein Deamidation from Sequence-Derived Secondary Structure and Intrinsic Disorder.

PubMed

Lorenzo, J Ramiro; Alonso, Leonardo G; Sánchez, Ignacio E

2015-01-01

Asparagine residues in proteins undergo spontaneous deamidation, a post-translational modification that may act as a molecular clock for the regulation of protein function and turnover. Asparagine deamidation is modulated by protein local sequence, secondary structure and hydrogen bonding. We present NGOME, an algorithm able to predict non-enzymatic deamidation of internal asparagine residues in proteins in the absence of structural data, using sequence-based predictions of secondary structure and intrinsic disorder. Compared to previous algorithms, NGOME does not require three-dimensional structures yet yields better predictions than available sequence-only methods. Four case studies of specific proteins show how NGOME may help the user identify deamidation-prone asparagine residues, often related to protein gain of function, protein degradation or protein misfolding in pathological processes. A fifth case study applies NGOME at a proteomic scale and unveils a correlation between asparagine deamidation and protein degradation in yeast. NGOME is freely available as a webserver at the National EMBnet node Argentina, URL: http://www.embnet.qb.fcen.uba.ar/ in the subpage "Protein and nucleic acid structure and sequence analysis".
Evaluation of variability in high-resolution protein structures by global distance scoring.

PubMed

Anzai, Risa; Asami, Yoshiki; Inoue, Waka; Ueno, Hina; Yamada, Koya; Okada, Tetsuji

2018-01-01

Systematic analysis of the statistical and dynamical properties of proteins is critical to understanding cellular events. Extraction of biologically relevant information from a set of high-resolution structures is important because it can provide mechanistic details behind the functional properties of protein families, enabling rational comparison between families. Most of the current structural comparisons are pairwise-based, which hampers the global analysis of increasing contents in the Protein Data Bank. Additionally, pairing of protein structures introduces uncertainty with respect to reproducibility because it frequently accompanies other settings for superimposition. This study introduces intramolecular distance scoring for the global analysis of proteins, for each of which at least several high-resolution structures are available. As a pilot study, we have tested 300 human proteins and showed that the method is comprehensively used to overview advances in each protein and protein family at the atomic level. This method, together with the interpretation of the model calculations, provide new criteria for understanding specific structural variation in a protein, enabling global comparison of the variability in proteins from different species.
Structure of a new crystal form of human Hsp70 ATPase domain.

PubMed

Osipiuk, J; Walsh, M A; Freeman, B C; Morimoto, R I; Joachimiak, A

1999-05-01

Hsp70 proteins are highly conserved proteins induced by heat shock and other stress conditions. An ATP-binding domain of human Hsp70 protein has been crystallized in two major morphological forms at pH 7.0 in the presence of PEG 8000 and CaCl2. Both crystal forms belong to the orthorhombic space group P212121, but show no resemblance in unit-cell parameters. Analysis of the crystal structures for both forms shows a 1-2 A shift of one of the subdomains of the protein. This conformational change could reflect a 'natural' flexibility of the protein which might be relevant to ATP binding and may facilitate the interaction of other proteins with Hsp70 protein.

Evaluating the efficacy of a structure-derived amino acid substitution matrix in detecting protein homologs by BLAST and PSI-BLAST.

PubMed

Goonesekere, Nalin Cw

2009-01-01

The large numbers of protein sequences generated by whole genome sequencing projects require rapid and accurate methods of annotation. The detection of homology through computational sequence analysis is a powerful tool in determining the complex evolutionary and functional relationships that exist between proteins. Homology search algorithms employ amino acid substitution matrices to detect similarity between proteins sequences. The substitution matrices in common use today are constructed using sequences aligned without reference to protein structure. Here we present amino acid substitution matrices constructed from the alignment of a large number of protein domain structures from the structural classification of proteins (SCOP) database. We show that when incorporated into the homology search algorithms BLAST and PSI-blast, the structure-based substitution matrices enhance the efficacy of detecting remote homologs.
MyPMFs: a simple tool for creating statistical potentials to assess protein structural models.

PubMed

Postic, Guillaume; Hamelryck, Thomas; Chomilier, Jacques; Stratmann, Dirk

2018-05-29

Evaluating the model quality of protein structures that evolve in environments with particular physicochemical properties requires scoring functions that are adapted to their specific residue compositions and/or structural characteristics. Thus, computational methods developed for structures from the cytosol cannot work properly on membrane or secreted proteins. Here, we present MyPMFs, an easy-to-use tool that allows users to train statistical potentials of mean force (PMFs) on the protein structures of their choice, with all parameters being adjustable. We demonstrate its use by creating an accurate statistical potential for transmembrane protein domains. We also show its usefulness to study the influence of the physical environment on residue interactions within protein structures. Our open-source software is freely available for download at https://github.com/bibip-impmc/mypmfs. Copyright © 2018. Published by Elsevier B.V.
Structural Disorder within Henipavirus Nucleoprotein and Phosphoprotein: From Predictions to Experimental Assessment

PubMed Central

Darbon, Hervé; Longhi, Sonia

2010-01-01

Henipaviruses are newly emerged viruses within the Paramyxoviridae family. Their negative-strand RNA genome is packaged by the nucleoprotein (N) within α-helical nucleocapsid that recruits the polymerase complex made of the L protein and the phosphoprotein (P). To date structural data on Henipaviruses are scarce, and their N and P proteins have never been characterized so far. Using both computational and experimental approaches we herein show that Henipaviruses N and P proteins possess large intrinsically disordered regions. By combining several disorder prediction methods, we show that the N-terminal domain of P (PNT) and the C-terminal domain of N (NTAIL) are both mostly disordered, although they contain short order-prone segments. We then report the cloning, the bacterial expression, purification and characterization of Henipavirus PNT and NTAIL domains. By combining gel filtration, dynamic light scattering, circular dichroism and nuclear magnetic resonance, we show that both NTAIL and PNT belong to the premolten globule sub-family within the class of intrinsically disordered proteins. This study is the first reported experimental characterization of Henipavirus P and N proteins. The evidence that their respective N-terminal and C-terminal domains are highly disordered under native conditions is expected to be invaluable for future structural studies by helping to delineate N and P protein domains amenable to crystallization. In addition, following previous hints establishing a relationship between structural disorder and protein interactivity, the present results suggest that Henipavirus PNT and NTAIL domains could be involved in manifold protein-protein interactions. PMID:20657787
Fourier transform infrared microspectroscopic analysis of the effects of cereal type and variety within a type of grain on structural makeup in relation to rumen degradation kinetics.

PubMed

Walker, Amanda M; Yu, Peiqiang; Christensen, Colleen R; Christensen, David A; McKinnon, John J

2009-08-12

The objectives of this study were to use Fourier transform infrared microspectroscopy (FTIRM) to determine structural makeup (features) of cereal grain endosperm tissue and to reveal and identify differences in protein and carbohydrate structural makeup between different cereal types (corn vs barley) and between different varieties within a grain (barley CDC Bold, CDC Dolly, Harrington, and Valier). Another objective was to investigate how these structural features relate to rumen degradation kinetics. The items assessed included (1) structural differences in protein amide I to nonstructural carbohydrate (NSC, starch) intensity and ratio within cellular dimensions; (2) molecular structural differences in the secondary structure profile of protein, alpha-helix, beta-sheet, and their ratio; (3) structural differences in NSC to amide I ratio profile. From the results, it was observed that (1) comparison between grain types [corn (cv. Pioneer 39P78) vs barley (cv. Harrington)] showed significant differences in structural makeup in terms of NSC, amide I to NSC ratio, and rumen degradation kinetics (degradation ratio, effective degradability of dry matter, protein and NSC) (P < 0.05); (2) comparison between varieties within a grain (barley varieties) also showed significant differences in structural makeup in terms of amide I, NSC, amide I to NSC ratio, alpha-helix and beta-sheet protein structures, and rumen degradation kinetics (effective degradability of dry matter, protein, and NSC) (P < 0.05); (3) correlation analysis showed that the amide I to NSC ratio was strongly correlated with rumen degradation kinetics in terms of the degradation rate (R = 0.91, P = 0.086) and effective degradability of dry matter (R = 0.93, P = 0.071). The results suggest that with the FTIRM technique, the structural makeup differences between cereal types and between different varieties within a type of grain could be revealed. These structural makeup differences were related to the rate and extent of rumen degradation.
Inferences from structural comparison: flexibility, secondary structure wobble and sequence alignment optimization.

PubMed

Zhang, Gaihua; Su, Zhen

2012-01-01

Work on protein structure prediction is very useful in biological research. To evaluate their accuracy, experimental protein structures or their derived data are used as the 'gold standard'. However, as proteins are dynamic molecular machines with structural flexibility such a standard may be unreliable. To investigate the influence of the structure flexibility, we analysed 3,652 protein structures of 137 unique sequences from 24 protein families. The results showed that (1) the three-dimensional (3D) protein structures were not rigid: the root-mean-square deviation (RMSD) of the backbone Cα of structures with identical sequences was relatively large, with the average of the maximum RMSD from each of the 137 sequences being 1.06 Å; (2) the derived data of the 3D structure was not constant, e.g. the highest ratio of the secondary structure wobble site was 60.69%, with the sequence alignments from structural comparisons of two proteins in the same family sometimes being completely different. Proteins may have several stable conformations and the data derived from resolved structures as a 'gold standard' should be optimized before being utilized as criteria to evaluate the prediction methods, e.g. sequence alignment from structural comparison. Helix/β-sheet transition exists in normal free proteins. The coil ratio of the 3D structure could affect its resolution as determined by X-ray crystallography.
Application of long-range order to predict unfolding rates of two-state proteins.

PubMed

Harihar, B; Selvaraj, S

2011-03-01

Predicting the experimental unfolding rates of two-state proteins and models describing the unfolding rates of these proteins is quite limited because of the complexity present in the unfolding mechanism and the lack of experimental unfolding data compared with folding data. In this work, 25 two-state proteins characterized by Maxwell et al. (Protein Sci 2005;14:602–616) using a consensus set of experimental conditions were taken, and the parameter long-range order (LRO) derived from their three-dimensional structures were related with their experimental unfolding rates ln(k(u)). From the total data set of 30 proteins used by Maxwell et al. (Protein Sci 2005;14:602–616), five slow-unfolding proteins with very low unfolding rates were considered to be outliers and were not included in our data set. Except all beta structural class, LRO of both the all-alpha and mixed-class proteins showed a strong inverse correlation of r = -0.99 and -0.88, respectively, with experimental ln(k(u)). LRO shows a correlation of -0.62 with experimental ln(k(u)) for all-beta proteins. For predicting the unfolding rates, a simple statistical method has been used and linear regression equations were developed for individual structural classes of proteins using LRO, and the results obtained showed a better agreement with experimental results. Copyright © 2010 Wiley-Liss, Inc.
Requirements on paramagnetic relaxation enhancement data for membrane protein structure determination by NMR.

PubMed

Gottstein, Daniel; Reckel, Sina; Dötsch, Volker; Güntert, Peter

2012-06-06

Nuclear magnetic resonance (NMR) structure calculations of the α-helical integral membrane proteins DsbB, GlpG, and halorhodopsin show that distance restraints from paramagnetic relaxation enhancement (PRE) can provide sufficient structural information to determine their structure with an accuracy of about 1.5 Å in the absence of other long-range conformational restraints. Our systematic study with simulated NMR data shows that about one spin label per transmembrane helix is necessary for obtaining enough PRE distance restraints to exclude wrong topologies, such as pseudo mirror images, if only limited other NMR restraints are available. Consequently, an experimentally realistic amount of PRE data enables α-helical membrane protein structure determinations that would not be feasible with the very limited amount of conventional NOESY data normally available for these systems. These findings are in line with our recent first de novo NMR structure determination of a heptahelical integral membrane protein, proteorhodopsin, that relied extensively on PRE data. Copyright © 2012 Elsevier Ltd. All rights reserved.
Interaction of sucralose with whey protein: Experimental and molecular modeling studies

NASA Astrophysics Data System (ADS)

Zhang, Hongmei; Sun, Shixin; Wang, Yanqing; Cao, Jian

2017-12-01

The objective of this research was to study the interactions of sucralose with whey protein isolate (WPI) by using the three-dimensional fluorescence spectroscopy, circular dichroism spectroscopy and molecular modeling. The results showed that the peptide strands structure of WPI had been changed by sucralose. Sucralose binding induced the secondary structural changes and increased content of aperiodic structure of WPI. Sucralose decreased the thermal stability of WPI and acted as a structure destabilizer during the thermal unfolding process of protein. In addition, the existence of sucralose decreased the reversibility of the unfolding of WPI. Nonetheless, sucralose-WPI complex was less stable than protein alone. The molecular modeling result showed that van der Waals and hydrogen bonding interactions contribute to the complexation free binding energy. There are more than one possible binding sites of WPI with sucralose by surface binding mode.
Structural study of the Fox-1 RRM protein hydration reveals a role for key water molecules in RRM-RNA recognition

PubMed Central

Blatter, Markus; Cléry, Antoine; Damberger, Fred F.

2017-01-01

Abstract The Fox-1 RNA recognition motif (RRM) domain is an important member of the RRM protein family. We report a 1.8 Å X-ray structure of the free Fox-1 containing six distinct monomers. We use this and the nuclear magnetic resonance (NMR) structure of the Fox-1 protein/RNA complex for molecular dynamics (MD) analyses of the structured hydration. The individual monomers of the X-ray structure show diverse hydration patterns, however, MD excellently reproduces the most occupied hydration sites. Simulations of the protein/RNA complex show hydration consistent with the isolated protein complemented by hydration sites specific to the protein/RNA interface. MD predicts intricate hydration sites with water-binding times extending up to hundreds of nanoseconds. We characterize two of them using NMR spectroscopy, RNA binding with switchSENSE and free-energy calculations of mutant proteins. Both hydration sites are experimentally confirmed and their abolishment reduces the binding free-energy. A quantitative agreement between theory and experiment is achieved for the S155A substitution but not for the S122A mutant. The S155 hydration site is evolutionarily conserved within the RRM domains. In conclusion, MD is an effective tool for predicting and interpreting the hydration patterns of protein/RNA complexes. Hydration is not easily detectable in NMR experiments but can affect stability of protein/RNA complexes. PMID:28505313
Application of far-infrared spectroscopy to the structural identification of protein materials.

PubMed

Han, Yanchen; Ling, Shengjie; Qi, Zeming; Shao, Zhengzhong; Chen, Xin

2018-05-03

Although far-infrared (IR) spectroscopy has been shown to be a powerful tool to determine peptide structure and to detect structural transitions in peptides, it has been overlooked in the characterization of proteins. Herein, we used far-IR spectroscopy to monitor the structure of four abundant non-bioactive proteins, namely, soybean protein isolate (SPI), pea protein isolate (PPI) and two types of silk fibroins (SFs), domestic Bombyx mori and wild Antheraea pernyi. The two globular proteins SPI and PPI result in broad and weak far-IR bands (between 50 and 700 cm-1), in agreement with those of some other bioactive globular proteins previously studied (lysozyme, myoglobin, hemoglobin, etc.) that generally only have random amino acid sequences. Interestingly, the two SFs, which are characterized by a structure composed of highly repetitive motifs, show several sharp far-IR characteristic absorption peaks. Moreover, some of these characteristic peaks (such as the peaks at 260 and 428 cm-1 in B. mori, and the peaks at 245 and 448 cm-1 in A. pernyi) are sensitive to conformational changes; hence, they can be directly used to monitor conformational transitions in SFs. Furthermore, since SF absorption bands clearly differ from those of globular proteins and different SFs even show distinct adsorption bands, far-IR spectroscopy can be applied to distinguish and determine the specific SF component within protein blends.
Modularity in protein structures: study on all-alpha proteins.

PubMed

Khan, Taushif; Ghosh, Indira

2015-01-01

Modularity is known as one of the most important features of protein's robust and efficient design. The architecture and topology of proteins play a vital role by providing necessary robust scaffolds to support organism's growth and survival in constant evolutionary pressure. These complex biomolecules can be represented by several layers of modular architecture, but it is pivotal to understand and explore the smallest biologically relevant structural component. In the present study, we have developed a component-based method, using protein's secondary structures and their arrangements (i.e. patterns) in order to investigate its structural space. Our result on all-alpha protein shows that the known structural space is highly populated with limited set of structural patterns. We have also noticed that these frequently observed structural patterns are present as modules or "building blocks" in large proteins (i.e. higher secondary structure content). From structural descriptor analysis, observed patterns are found to be within similar deviation; however, frequent patterns are found to be distinctly occurring in diverse functions e.g. in enzymatic classes and reactions. In this study, we are introducing a simple approach to explore protein structural space using combinatorial- and graph-based geometry methods, which can be used to describe modularity in protein structures. Moreover, analysis indicates that protein function seems to be the driving force that shapes the known structure space.
Understanding the Structural Ensembles of a Highly Extended Disordered Protein†

PubMed Central

Daughdrill, Gary W.; Kashtanov, Stepan; Stancik, Amber; Hill, Shannon E.; Helms, Gregory; Muschol, Martin

2013-01-01

Developing a comprehensive description of the equilibrium structural ensembles for intrinsically disordered proteins (IDPs) is essential to understanding their function. The p53 transactivation domain (p53TAD) is an IDP that interacts with multiple protein partners and contains numerous phosphorylation sites. Multiple techniques were used to investigate the equilibrium structural ensemble of p53TAD in its native and chemically unfolded states. The results from these experiments show that the native state of p53TAD has dimensions similar to a classical random coil while the chemically unfolded state is more extended. To investigate the molecular properties responsible for this behavior, a novel algorithm that generates diverse and unbiased structural ensembles of IDPs was developed. This algorithm was used to generate a large pool of plausible p53TAD structures that were reweighted to identify a subset of structures with the best fit to small angle X-ray scattering data. High weight structures in the native state ensemble show features that are localized to protein binding sites and regions with high proline content. The features localized to the protein binding sites are mostly eliminated in the chemically unfolded ensemble; while, the regions with high proline content remain relatively unaffected. Data from NMR experiments support these results, showing that residues from the protein binding sites experience larger environmental changes upon unfolding by urea than regions with high proline content. This behavior is consistent with the urea-induced exposure of nonpolar and aromatic side-chains in the protein binding sites that are partially excluded from solvent in the native state ensemble. PMID:21979461
Modeling complexes of modeled proteins.

PubMed

Anishchenko, Ivan; Kundrotas, Petras J; Vakser, Ilya A

2017-03-01

Structural characterization of proteins is essential for understanding life processes at the molecular level. However, only a fraction of known proteins have experimentally determined structures. This fraction is even smaller for protein-protein complexes. Thus, structural modeling of protein-protein interactions (docking) primarily has to rely on modeled structures of the individual proteins, which typically are less accurate than the experimentally determined ones. Such "double" modeling is the Grand Challenge of structural reconstruction of the interactome. Yet it remains so far largely untested in a systematic way. We present a comprehensive validation of template-based and free docking on a set of 165 complexes, where each protein model has six levels of structural accuracy, from 1 to 6 Å C α RMSD. Many template-based docking predictions fall into acceptable quality category, according to the CAPRI criteria, even for highly inaccurate proteins (5-6 Å RMSD), although the number of such models (and, consequently, the docking success rate) drops significantly for models with RMSD > 4 Å. The results show that the existing docking methodologies can be successfully applied to protein models with a broad range of structural accuracy, and the template-based docking is much less sensitive to inaccuracies of protein models than the free docking. Proteins 2017; 85:470-478. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Solenopsis invicta virus 3: mapping of structural proteins, ribosomal frameshifting, and similarities to Acyrthosiphon pisum virus and Kelp fly virus.

PubMed

Valles, Steven M; Bell, Susanne; Firth, Andrew E

2014-01-01

Solenopsis invicta virus 3 (SINV-3) is a positive-sense single-stranded RNA virus that infects the red imported fire ant, Solenopsis invicta. We show that the second open reading frame (ORF) of the dicistronic genome is expressed via a frameshifting mechanism and that the sequences encoding the structural proteins map to both ORF2 and the 3' end of ORF1, downstream of the sequence that encodes the RNA-dependent RNA polymerase. The genome organization and structural protein expression strategy resemble those of Acyrthosiphon pisum virus (APV), an aphid virus. The capsid protein that is encoded by the 3' end of ORF1 in SINV-3 and APV is predicted to have a jelly-roll fold similar to the capsid proteins of picornaviruses and caliciviruses. The capsid-extension protein that is produced by frameshifting, includes the jelly-roll fold domain encoded by ORF1 as its N-terminus, while the C-terminus encoded by the 5' half of ORF2 has no clear homology with other viral structural proteins. A third protein, encoded by the 3' half of ORF2, is associated with purified virions at sub-stoichiometric ratios. Although the structural proteins can be translated from the genomic RNA, we show that SINV-3 also produces a subgenomic RNA encoding the structural proteins. Circumstantial evidence suggests that APV may also produce such a subgenomic RNA. Both SINV-3 and APV are unclassified picorna-like viruses distantly related to members of the order Picornavirales and the family Caliciviridae. Within this grouping, features of the genome organization and capsid domain structure of SINV-3 and APV appear more similar to caliciviruses, perhaps suggesting the basis for a "Calicivirales" order.
Ana3 is a conserved protein required for the structural integrity of centrioles and basal bodies.

PubMed

Stevens, Naomi R; Dobbelaere, Jeroen; Wainman, Alan; Gergely, Fanni; Raff, Jordan W

2009-11-02

Recent studies have identified a conserved "core" of proteins that are required for centriole duplication. A small number of additional proteins have recently been identified as potential duplication factors, but it is unclear whether any of these proteins are components of the core duplication machinery. In this study, we investigate the function of one of these proteins, Drosophila melanogaster Ana3. We show that Ana3 is present in centrioles and basal bodies, but its behavior is distinct from that of the core duplication proteins. Most importantly, we find that Ana3 is required for the structural integrity of both centrioles and basal bodies and for centriole cohesion, but it is not essential for centriole duplication. We show that Ana3 has a mammalian homologue, Rotatin, that also localizes to centrioles and basal bodies and appears to be essential for cilia function. Thus, Ana3 defines a conserved family of centriolar proteins and plays an important part in ensuring the structural integrity of centrioles and basal bodies.
A Systematic Analysis of the Structures of Heterologously Expressed Proteins and Those from Their Native Hosts in the RCSB PDB Archive.

PubMed

Zhou, Ren-Bin; Lu, Hui-Meng; Liu, Jie; Shi, Jian-Yu; Zhu, Jing; Lu, Qin-Qin; Yin, Da-Chuan

2016-01-01

Recombinant expression of proteins has become an indispensable tool in modern day research. The large yields of recombinantly expressed proteins accelerate the structural and functional characterization of proteins. Nevertheless, there are literature reported that the recombinant proteins show some differences in structure and function as compared with the native ones. Now there have been more than 100,000 structures (from both recombinant and native sources) publicly available in the Protein Data Bank (PDB) archive, which makes it possible to investigate if there exist any proteins in the RCSB PDB archive that have identical sequence but have some difference in structures. In this paper, we present the results of a systematic comparative study of the 3D structures of identical naturally purified versus recombinantly expressed proteins. The structural data and sequence information of the proteins were mined from the RCSB PDB archive. The combinatorial extension (CE), FATCAT-flexible and TM-Align methods were employed to align the protein structures. The root-mean-square distance (RMSD), TM-score, P-value, Z-score, secondary structural elements and hydrogen bonds were used to assess the structure similarity. A thorough analysis of the PDB archive generated five-hundred-seventeen pairs of native and recombinant proteins that have identical sequence. There were no pairs of proteins that had the same sequence and significantly different structural fold, which support the hypothesis that expression in a heterologous host usually could fold correctly into their native forms.
A Systematic Analysis of the Structures of Heterologously Expressed Proteins and Those from Their Native Hosts in the RCSB PDB Archive

PubMed Central

Zhou, Ren-Bin; Lu, Hui-Meng; Liu, Jie; Shi, Jian-Yu; Zhu, Jing; Lu, Qin-Qin; Yin, Da-Chuan

2016-01-01

Recombinant expression of proteins has become an indispensable tool in modern day research. The large yields of recombinantly expressed proteins accelerate the structural and functional characterization of proteins. Nevertheless, there are literature reported that the recombinant proteins show some differences in structure and function as compared with the native ones. Now there have been more than 100,000 structures (from both recombinant and native sources) publicly available in the Protein Data Bank (PDB) archive, which makes it possible to investigate if there exist any proteins in the RCSB PDB archive that have identical sequence but have some difference in structures. In this paper, we present the results of a systematic comparative study of the 3D structures of identical naturally purified versus recombinantly expressed proteins. The structural data and sequence information of the proteins were mined from the RCSB PDB archive. The combinatorial extension (CE), FATCAT-flexible and TM-Align methods were employed to align the protein structures. The root-mean-square distance (RMSD), TM-score, P-value, Z-score, secondary structural elements and hydrogen bonds were used to assess the structure similarity. A thorough analysis of the PDB archive generated five-hundred-seventeen pairs of native and recombinant proteins that have identical sequence. There were no pairs of proteins that had the same sequence and significantly different structural fold, which support the hypothesis that expression in a heterologous host usually could fold correctly into their native forms. PMID:27517583
Structural modeling of the N-terminal signal–receiving domain of IκBα

PubMed Central

Yazdi, Samira; Durdagi, Serdar; Naumann, Michael; Stein, Matthias

2015-01-01

The transcription factor nuclear factor-κB (NF-κB) exerts essential roles in many biological processes including cell growth, apoptosis and innate and adaptive immunity. The NF-κB inhibitor (IκBα) retains NF-κB in the cytoplasm and thus inhibits nuclear localization of NF-κB and its association with DNA. Recent protein crystal structures of the C-terminal part of IκBα in complex with NF-κB provided insights into the protein-protein interactions but could not reveal structural details about the N-terminal signal receiving domain (SRD). The SRD of IκBα contains a degron, formed following phosphorylation by IκB kinases (IKK). In current protein X-ray structures, however, the SRD is not resolved and assumed to be disordered. Here, we combined secondary structure annotation and domain threading followed by long molecular dynamics (MD) simulations and showed that the SRD possesses well-defined secondary structure elements. We show that the SRD contains 3 additional stable α-helices supplementing the six ARDs present in crystallized IκBα. The IκBα/NF-κB protein-protein complex remained intact and stable during the entire simulations. Also in solution, free IκBα retains its structural integrity. Differences in structural topology and dynamics were observed by comparing the structures of NF-κB free and NF-κB bound IκBα-complex. This study paves the way for investigating the signaling properties of the SRD in the IκBα degron. A detailed atomic scale understanding of molecular mechanism of NF-κB activation, regulation and the protein-protein interactions may assist to design and develop novel chronic inflammation modulators. PMID:26157801
Crystal structure of AFV1-102, a protein from the acidianus filamentous virus 1

PubMed Central

Keller, Jenny; Leulliot, Nicolas; Collinet, Bruno; Campanacci, Valerie; Cambillau, Christian; Pranghisvilli, David; van Tilbeurgh, Herman

2009-01-01

Viruses infecting hyperthermophilic archaea have intriguing morphologies and genomic properties. The vast majority of their genes do not have homologs other than in other hyperthermophilic viruses, and the biology of these viruses is poorly understood. As part of a structural genomics project on the proteins of these viruses, we present here the structure of a 102 amino acid protein from acidianus filamentous virus 1 (AFV1-102). The structure shows that it is made of two identical motifs that have poor sequence similarity. Although no function can be proposed from structural analysis, tight binding of the gateway tag peptide in a groove between the two motifs suggests AFV1-102 is involved in protein protein interactions. PMID:19319936
Dancing Protein Clouds: The Strange Biology and Chaotic Physics of Intrinsically Disordered Proteins.

PubMed

Uversky, Vladimir N

2016-03-25

Biologically active but floppy proteins represent a new reality of modern protein science. These intrinsically disordered proteins (IDPs) and hybrid proteins containing ordered and intrinsically disordered protein regions (IDPRs) constitute a noticeable part of any given proteome. Functionally, they complement ordered proteins, and their conformational flexibility and structural plasticity allow them to perform impossible tricks and be engaged in biological activities that are inaccessible to well folded proteins with their unique structures. The major goals of this minireview are to show that, despite their simplified amino acid sequences, IDPs/IDPRs are complex entities often resembling chaotic systems, are structurally and functionally heterogeneous, and can be considered an important part of the structure-function continuum. Furthermore, IDPs/IDPRs are everywhere, and are ubiquitously engaged in various interactions characterized by a wide spectrum of binding scenarios and an even wider spectrum of structural and functional outputs. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

Compact structure and proteins of pasta retard in vitro digestive evolution of branched starch molecular structure.

PubMed

Zou, Wei; Sissons, Mike; Warren, Frederick J; Gidley, Michael J; Gilbert, Robert G

2016-11-05

The roles that the compact structure and proteins in pasta play in retarding evolution of starch molecular structure during in vitro digestion are explored, using four types of cooked samples: whole pasta, pasta powder, semolina (with proteins) and extracted starch without proteins. These were subjected to in vitro digestion with porcine α-amylase, collecting samples at different times and characterizing the weight distribution of branched starch molecules using size-exclusion chromatography. Measurement of α-amylase activity showed that a protein (or proteins) from semolina or pasta powder interacted with α-amylase, causing reduced enzymatic activity and retarding digestion of branched starch molecules with hydrodynamic radius (Rh)<100nm; this protein(s) was susceptible to proteolysis. Thus the compact structure of pasta protects the starch and proteins in the interior of the whole pasta, reducing the enzymatic degradation of starch molecules, especially for molecules with Rh>100nm. Copyright © 2016 Elsevier Ltd. All rights reserved.
Sequence co-evolution gives 3D contacts and structures of protein complexes

PubMed Central

Hopf, Thomas A; Schärfe, Charlotta P I; Rodrigues, João P G L M; Green, Anna G; Kohlbacher, Oliver; Sander, Chris; Bonvin, Alexandre M J J; Marks, Debora S

2014-01-01

Protein–protein interactions are fundamental to many biological processes. Experimental screens have identified tens of thousands of interactions, and structural biology has provided detailed functional insight for select 3D protein complexes. An alternative rich source of information about protein interactions is the evolutionary sequence record. Building on earlier work, we show that analysis of correlated evolutionary sequence changes across proteins identifies residues that are close in space with sufficient accuracy to determine the three-dimensional structure of the protein complexes. We evaluate prediction performance in blinded tests on 76 complexes of known 3D structure, predict protein–protein contacts in 32 complexes of unknown structure, and demonstrate how evolutionary couplings can be used to distinguish between interacting and non-interacting protein pairs in a large complex. With the current growth of sequences, we expect that the method can be generalized to genome-wide elucidation of protein–protein interaction networks and used for interaction predictions at residue resolution. DOI: http://dx.doi.org/10.7554/eLife.03430.001 PMID:25255213
A new definition and properties of the similarity value between two protein structures.

PubMed

Saberi Fathi, S M

2016-10-01

Knowledge regarding the 3D structure of a protein provides useful information about the protein's functional properties. Particularly, structural similarity between proteins can be used as a good predictor of functional similarity. One method that uses the 3D geometrical structure of proteins in order to compare them is the similarity value (SV). In this paper, we introduce a new definition of the SV measure for comparing two proteins. To this end, we consider the mass of the protein's atoms and concentrate on the number of protein's atoms to be compared. This defines a new measure, called the weighted similarity value (WSV), adding physical properties to geometrical properties. We also show that our results are in good agreement with the results obtained by TM-SCORE and DALILITE. WSV can be of use in protein classification and in drug discovery.
Detection of functionally important regions in "hypothetical proteins" of known structure.

PubMed

Nimrod, Guy; Schushan, Maya; Steinberg, David M; Ben-Tal, Nir

2008-12-10

Structural genomics initiatives provide ample structures of "hypothetical proteins" (i.e., proteins of unknown function) at an ever increasing rate. However, without function annotation, this structural goldmine is of little use to biologists who are interested in particular molecular systems. To this end, we used (an improved version of) the PatchFinder algorithm for the detection of functional regions on the protein surface, which could mediate its interactions with, e.g., substrates, ligands, and other proteins. Examination, using a data set of annotated proteins, showed that PatchFinder outperforms similar methods. We collected 757 structures of hypothetical proteins and their predicted functional regions in the N-Func database. Inspection of several of these regions demonstrated that they are useful for function prediction. For example, we suggested an interprotein interface and a putative nucleotide-binding site. A web-server implementation of PatchFinder and the N-Func database are available at http://patchfinder.tau.ac.il/.
Segmented molecular design of self-healing proteinaceous materials

NASA Astrophysics Data System (ADS)

Sariola, Veikko; Pena-Francesch, Abdon; Jung, Huihun; Çetinkaya, Murat; Pacheco, Carlos; Sitti, Metin; Demirel, Melik C.

2015-09-01

Hierarchical assembly of self-healing adhesive proteins creates strong and robust structural and interfacial materials, but understanding of the molecular design and structure-property relationships of structural proteins remains unclear. Elucidating this relationship would allow rational design of next generation genetically engineered self-healing structural proteins. Here we report a general self-healing and -assembly strategy based on a multiphase recombinant protein based material. Segmented structure of the protein shows soft glycine- and tyrosine-rich segments with self-healing capability and hard beta-sheet segments. The soft segments are strongly plasticized by water, lowering the self-healing temperature close to body temperature. The hard segments self-assemble into nanoconfined domains to reinforce the material. The healing strength scales sublinearly with contact time, which associates with diffusion and wetting of autohesion. The finding suggests that recombinant structural proteins from heterologous expression have potential as strong and repairable engineering materials.
Segmented molecular design of self-healing proteinaceous materials.

PubMed

Sariola, Veikko; Pena-Francesch, Abdon; Jung, Huihun; Çetinkaya, Murat; Pacheco, Carlos; Sitti, Metin; Demirel, Melik C

2015-09-01

Hierarchical assembly of self-healing adhesive proteins creates strong and robust structural and interfacial materials, but understanding of the molecular design and structure-property relationships of structural proteins remains unclear. Elucidating this relationship would allow rational design of next generation genetically engineered self-healing structural proteins. Here we report a general self-healing and -assembly strategy based on a multiphase recombinant protein based material. Segmented structure of the protein shows soft glycine- and tyrosine-rich segments with self-healing capability and hard beta-sheet segments. The soft segments are strongly plasticized by water, lowering the self-healing temperature close to body temperature. The hard segments self-assemble into nanoconfined domains to reinforce the material. The healing strength scales sublinearly with contact time, which associates with diffusion and wetting of autohesion. The finding suggests that recombinant structural proteins from heterologous expression have potential as strong and repairable engineering materials.
Molecular modeling of the human sperm associated antigen 11 B (SPAG11B) proteins.

PubMed

Narmadha, Ganapathy; Yenugu, Suresh

2015-04-01

Antimicrobial proteins and peptides are ubiquitous in nature with diverse structural and biological properties. Among them, the human beta-defensins are known to contribute to the innate immune response. Besides the defensins, a number of defensin-like proteins and peptides are expressed in many organ systems including the male reproductive system. Some of the protein isoforms encoded by the sperm associated antigen 11B (SPAG11) gene in humans are beta-defensin-like and exhibit structure dependent and salt tolerant antimicrobial activity, besides contributing to sperm maturation. Though some of the functional roles of these proteins are reported, the structural and molecular features that contribute to their antimicrobial activity is not yet reported. In this study, using in silico tools, we report the three dimensional structure of the human SPAG11B proteins and their C-terminal peptides. web-based hydropathy, amphipathicity, and topology (WHAT) analyses and grand average of hydropathy (GRAVY) indices show that these proteins and peptides are amphipathic and highly hydrophilic. Self-optimized prediction method with alignment (SOPMA) analyses and circular dichroism data suggest that the secondary structure of these proteins and peptides primarily contain beta-sheet and random coil structure and alpha-helix to a lesser extent. Ramachandran plots show that majority of the amino acids in these proteins and peptides fall in the permissible regions, thus indicating stable structures. The secondary structure of SPAG11B isoforms and their peptides were not perturbed with increasing NaCl concentration (0-300 mM) and at different pH (3, 7, and 10), thus reinforcing our previously reported observation that their antimicrobial activity is salt tolerant. To the best of our knowledge, for the first time, results of our study provide vital information on the structural features of SPAG11B protein isoforms and their contribution to antimicrobial activity.
Genome Pool Strategy for Structural Coverage of Protein Families

PubMed Central

Jaroszewski, Lukasz; Slabinski, Lukasz; Wooley, John; Deacon, Ashley M.; Lesley, Scott A.; Wilson, Ian. A.; Godzik, Adam

2010-01-01

As noticed by generations of structural biologists, closely homologous proteins may have substantially different crystallization properties and propensities. These observations can be used to systematically introduce additional dimensionality into crystallization trials by targeting homologous proteins from multiple genomes in a “genome pool” strategy. Through extensive use of our recently introduced “crystallization feasibility score” (Slabinski et al., 2007a), we can explain that the genome pool strategy works well because the crystallization feasibility scores are surprisingly broad within families of homologous proteins, with most families containing a range of optimal to very difficult targets. We also show that some families can be regarded as relatively “easy”, where a significant number of proteins are predicted to have optimal crystallization features, and others are “very difficult”, where almost none are predicted to result in a crystal structure. Thus, the outcome of such variable distributions of such crystallizability' preferences leads to uneven structural coverage of known families, with “easier” or “optimal” families having several times more solved structures than “very difficult” ones. Nevertheless, this latter category can be successfully targeted by increasing the number of genomes that are used to select targets from a given family. On average, adding 10 new genomes to the “genome pool” provides more promising targets for 7 “very difficult” families. In contrast, our crystallization feasibility score does not indicate that any specific microbial genomes can be readily classified as “easier” or “very difficult” with respect to providing suitable candidates for crystallization and structure determination. Finally, our analyses show that specific physicochemical properties of the protein sequence favor successful outcomes for structure determination and, hence, the group of proteins with known 3D structures is systematically different from the general pool of known proteins. We, therefore, assess the structural consequences of these differences in protein sequence and protein biophysical properties. PMID:19000818
Modified Aequorin Shows Increased Bioluminescence Activity

DTIC Science & Technology

1993-08-18

Primary structure of the Aequorea victoria green - fluorescent protein . Gene 111 (2):229-233. PATENTS U.S...and Initial Characterization of Crystals of the Photoprotein Aequorin from Aequorea victoria . Proteins , Structure , & Genetics 15: 103-107. RELATED...Hexapeptide Chromophore of the Aequorea Green - Fluorescent Protein . Biochemistry 32: 1212-1218. 1992 Dennis J. O’Kane, and Douglas C.
Understand protein functions by comparing the similarity of local structural environments.

PubMed

Chen, Jiawen; Xie, Zhong-Ru; Wu, Yinghao

2017-02-01

The three-dimensional structures of proteins play an essential role in regulating binding between proteins and their partners, offering a direct relationship between structures and functions of proteins. It is widely accepted that the function of a protein can be determined if its structure is similar to other proteins whose functions are known. However, it is also observed that proteins with similar global structures do not necessarily correspond to the same function, while proteins with very different folds can share similar functions. This indicates that function similarity is originated from the local structural information of proteins instead of their global shapes. We assume that proteins with similar local environments prefer binding to similar types of molecular targets. In order to testify this assumption, we designed a new structural indicator to define the similarity of local environment between residues in different proteins. This indicator was further used to calculate the probability that a given residue binds to a specific type of structural neighbors, including DNA, RNA, small molecules and proteins. After applying the method to a large-scale non-redundant database of proteins, we show that the positive signal of binding probability calculated from the local structural indicator is statistically meaningful. In summary, our studies suggested that the local environment of residues in a protein is a good indicator to recognize specific binding partners of the protein. The new method could be a potential addition to a suite of existing template-based approaches for protein function prediction. Copyright © 2016 Elsevier B.V. All rights reserved.
Hydrophobic core malleability of a de novo designed three-helix bundle protein.

PubMed

Walsh, S T; Sukharev, V I; Betz, S F; Vekshin, N L; DeGrado, W F

2001-01-12

De novo protein design provides a tool for testing the principles that stabilize the structures of proteins. Recently, we described the design and structure determination of alpha(3)D, a three-helix bundle protein with a well-packed hydrophobic core. Here, we test the malleability and adaptability of this protein's structure by mutating a small, Ala residue (A60) in its core to larger, hydrophobic side-chains, Leu and Ile. Such changes introduce strain into the structures of natural proteins, and therefore generally destabilize the native state. By contrast, these mutations were slightly stabilizing ( approximately 1.5 kcal mol(-1)) to the tertiary structure of alpha(3)D. The value of DeltaC(p) for unfolding of these mutants was not greatly affected relative to wild-type, indicating that the change in solvent accessibility for unfolding was similar. However, two-dimensional heteronuclear single quantum coherence spectra indicate that the protein adjusts to the introduction of steric bulk in different ways. A60L-alpha(3)D showed serious erosion in the dispersion of both the amide backbone as well as the side-chain methyl chemical shifts. By contrast, A60I-alpha(3)D showed excellent dispersion of the backbone resonances, and selective changes in dispersion of the aliphatic side-chains proximal to the site of mutation. Together, these data suggest that alpha(3)D, although folded into a unique three-dimensional structure, is nevertheless more malleable and flexible than most natural, native proteins. Copyright 2001 Academic Press.
Design, production and molecular structure of a new family of artificial alpha-helicoidal repeat proteins (αRep) based on thermostable HEAT-like repeats.

PubMed

Urvoas, Agathe; Guellouz, Asma; Valerio-Lepiniec, Marie; Graille, Marc; Durand, Dominique; Desravines, Danielle C; van Tilbeurgh, Herman; Desmadril, Michel; Minard, Philippe

2010-11-26

Repeat proteins have a modular organization and a regular architecture that make them attractive models for design and directed evolution experiments. HEAT repeat proteins, although very common, have not been used as a scaffold for artificial proteins, probably because they are made of long and irregular repeats. Here, we present and validate a consensus sequence for artificial HEAT repeat proteins. The sequence was defined from the structure-based sequence analysis of a thermostable HEAT-like repeat protein. Appropriate sequences were identified for the N- and C-caps. A library of genes coding for artificial proteins based on this sequence design, named αRep, was assembled using new and versatile methodology based on circular amplification. Proteins picked randomly from this library are expressed as soluble proteins. The biophysical properties of proteins with different numbers of repeats and different combinations of side chains in hypervariable positions were characterized. Circular dichroism and differential scanning calorimetry experiments showed that all these proteins are folded cooperatively and are very stable (T(m) >70 °C). Stability of these proteins increases with the number of repeats. Detailed gel filtration and small-angle X-ray scattering studies showed that the purified proteins form either monomers or dimers. The X-ray structure of a stable dimeric variant structure was solved. The protein is folded with a highly regular topology and the repeat structure is organized, as expected, as pairs of alpha helices. In this protein variant, the dimerization interface results directly from the variable surface enriched in aromatic residues located in the randomized positions of the repeats. The dimer was crystallized both in an apo and in a PEG-bound form, revealing a very well defined binding crevice and some structure flexibility at the interface. This fortuitous binding site could later prove to be a useful binding site for other low molecular mass partners. Copyright © 2010 Elsevier Ltd. All rights reserved.
Molecular basis of protein structure in proanthocyanidin and anthocyanin-enhanced Lc-transgenic alfalfa in relation to nutritive value using synchrotron-radiation FTIR microspectroscopy: A novel approach

NASA Astrophysics Data System (ADS)

Yu, Peiqiang; Jonker, Arjan; Gruber, Margaret

2009-09-01

To date there has been very little application of synchrotron radiation-based Fourier transform infrared microspectroscopy (SRFTIRM) to the study of molecular structures in plant forage in relation to livestock digestive behavior and nutrient availability. Protein inherent structure, among other factors such as protein matrix, affects nutritive quality, fermentation and degradation behavior in both humans and animals. The relative percentage of protein secondary structure influences protein value. A high percentage of β-sheets usually reduce the access of gastrointestinal digestive enzymes to the protein. Reduced accessibility results in poor digestibility and as a result, low protein value. The objective of this study was to use SRFTIRM to compare protein molecular structure of alfalfa plant tissues transformed with the maize Lc regulatory gene with non-transgenic alfalfa protein within cellular and subcellular dimensions and to quantify protein inherent structure profiles using Gaussian and Lorentzian methods of multi-component peak modeling. Protein molecular structure revealed by this method included α-helices, β-sheets and other structures such as β-turns and random coils. Hierarchical cluster analysis and principal component analysis of the synchrotron data, as well as accurate spectral analysis based on curve fitting, showed that transgenic alfalfa contained a relatively lower ( P < 0.05) percentage of the model-fitted α-helices (29 vs. 34) and model-fitted β-sheets (22 vs. 27) and a higher ( P < 0.05) percentage of other model-fitted structures (49 vs. 39). Transgenic alfalfa protein displayed no difference ( P > 0.05) in the ratio of α-helices to β-sheets (average: 1.4) and higher ( P < 0.05) ratios of α-helices to others (0.7 vs. 0.9) and β-sheets to others (0.5 vs. 0.8) than the non-transgenic alfalfa protein. The transgenic protein structures also exhibited no difference ( P > 0.05) in the vibrational intensity of protein amide I (average of 24) and amide II areas (average of 10) and their ratio (average of 2.4) compared with non-transgenic alfalfa. Cluster analysis and principal component analysis showed no significant differences between the two genotypes in the broad molecular fingerprint region, amides I and II regions, and the carbohydrate molecular region, indicating they are highly related to each other. The results suggest that transgenic Lc-alfalfa leaves contain similar proteins to non-transgenic alfalfa (because amide I and II intensities were identical), but a subtle difference in protein molecular structure after freeze drying. Further study is needed to understand the relationship between these structural profiles and biological features such as protein nutrient availability, protein bypass and digestive behavior of livestock fed with this type of forage.
Molecular Basis of Protein Structure in Proanthocyanidin and Anthocyanin-Enhanced Lc-transgenic Alfalfa in Relation to Nutritive Value Using Synchrotron-Radiation FTIR Microspectroscopy: A Novel Approach

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yu, P.; Jonker, A; Gruber, M

2009-01-01

To date there has been very little application of synchrotron radiation-based Fourier transform infrared microspectroscopy (SRFTIRM) to the study of molecular structures in plant forage in relation to livestock digestive behavior and nutrient availability. Protein inherent structure, among other factors such as protein matrix, affects nutritive quality, fermentation and degradation behavior in both humans and animals. The relative percentage of protein secondary structure influences protein value. A high percentage of e-sheets usually reduce the access of gastrointestinal digestive enzymes to the protein. Reduced accessibility results in poor digestibility and as a result, low protein value. The objective of this studymore » was to use SRFTIRM to compare protein molecular structure of alfalfa plant tissues transformed with the maize Lc regulatory gene with non-transgenic alfalfa protein within cellular and subcellular dimensions and to quantify protein inherent structure profiles using Gaussian and Lorentzian methods of multi-component peak modeling. Protein molecular structure revealed by this method included a-helices, e-sheets and other structures such as e-turns and random coils. Hierarchical cluster analysis and principal component analysis of the synchrotron data, as well as accurate spectral analysis based on curve fitting, showed that transgenic alfalfa contained a relatively lower (P < 0.05) percentage of the model-fitted a-helices (29 vs. 34) and model-fitted e-sheets (22 vs. 27) and a higher (P < 0.05) percentage of other model-fitted structures (49 vs. 39). Transgenic alfalfa protein displayed no difference (P > 0.05) in the ratio of a-helices to e-sheets (average: 1.4) and higher (P < 0.05) ratios of a-helices to others (0.7 vs. 0.9) and e-sheets to others (0.5 vs. 0.8) than the non-transgenic alfalfa protein. The transgenic protein structures also exhibited no difference (P > 0.05) in the vibrational intensity of protein amide I (average of 24) and amide II areas (average of 10) and their ratio (average of 2.4) compared with non-transgenic alfalfa. Cluster analysis and principal component analysis showed no significant differences between the two genotypes in the broad molecular fingerprint region, amides I and II regions, and the carbohydrate molecular region, indicating they are highly related to each other. The results suggest that transgenic Lc-alfalfa leaves contain similar proteins to non-transgenic alfalfa (because amide I and II intensities were identical), but a subtle difference in protein molecular structure after freeze drying. Further study is needed to understand the relationship between these structural profiles and biological features such as protein nutrient availability, protein bypass and digestive behavior of livestock fed with this type of forage.« less
MEGADOCK: An All-to-All Protein-Protein Interaction Prediction System Using Tertiary Structure Data

PubMed Central

Ohue, Masahito; Matsuzaki, Yuri; Uchikoga, Nobuyuki; Ishida, Takashi; Akiyama, Yutaka

2014-01-01

The elucidation of protein-protein interaction (PPI) networks is important for understanding cellular structure and function and structure-based drug design. However, the development of an effective method to conduct exhaustive PPI screening represents a computational challenge. We have been investigating a protein docking approach based on shape complementarity and physicochemical properties. We describe here the development of the protein-protein docking software package “MEGADOCK” that samples an extremely large number of protein dockings at high speed. MEGADOCK reduces the calculation time required for docking by using several techniques such as a novel scoring function called the real Pairwise Shape Complementarity (rPSC) score. We showed that MEGADOCK is capable of exhaustive PPI screening by completing docking calculations 7.5 times faster than the conventional docking software, ZDOCK, while maintaining an acceptable level of accuracy. When MEGADOCK was applied to a subset of a general benchmark dataset to predict 120 relevant interacting pairs from 120 x 120 = 14,400 combinations of proteins, an F-measure value of 0.231 was obtained. Further, we showed that MEGADOCK can be applied to a large-scale protein-protein interaction-screening problem with accuracy better than random. When our approach is combined with parallel high-performance computing systems, it is now feasible to search and analyze protein-protein interactions while taking into account three-dimensional structures at the interactome scale. MEGADOCK is freely available at http://www.bi.cs.titech.ac.jp/megadock. PMID:23855673
Protein Secondary Structure Prediction Using AutoEncoder Network and Bayes Classifier

NASA Astrophysics Data System (ADS)

Wang, Leilei; Cheng, Jinyong

2018-03-01

Protein secondary structure prediction is belong to bioinformatics,and it's important in research area. In this paper, we propose a new prediction way of protein using bayes classifier and autoEncoder network. Our experiments show some algorithms including the construction of the model, the classification of parameters and so on. The data set is a typical CB513 data set for protein. In terms of accuracy, the method is the cross validation based on the 3-fold. Then we can get the Q3 accuracy. Paper results illustrate that the autoencoder network improved the prediction accuracy of protein secondary structure.
On the relationship between residue structural environment and sequence conservation in proteins.

PubMed

Liu, Jen-Wei; Lin, Jau-Ji; Cheng, Chih-Wen; Lin, Yu-Feng; Hwang, Jenn-Kang; Huang, Tsun-Tsao

2017-09-01

Residues that are crucial to protein function or structure are usually evolutionarily conserved. To identify the important residues in protein, sequence conservation is estimated, and current methods rely upon the unbiased collection of homologous sequences. Surprisingly, our previous studies have shown that the sequence conservation is closely correlated with the weighted contact number (WCN), a measure of packing density for residue's structural environment, calculated only based on the C α positions of a protein structure. Moreover, studies have shown that sequence conservation is correlated with environment-related structural properties calculated based on different protein substructures, such as a protein's all atoms, backbone atoms, side-chain atoms, or side-chain centroid. To know whether the C α atomic positions are adequate to show the relationship between residue environment and sequence conservation or not, here we compared C α atoms with other substructures in their contributions to the sequence conservation. Our results show that C α positions are substantially equivalent to the other substructures in calculations of various measures of residue environment. As a result, the overlapping contributions between C α atoms and the other substructures are high, yielding similar structure-conservation relationship. Take the WCN as an example, the average overlapping contribution to sequence conservation is 87% between C α and all-atom substructures. These results indicate that only C α atoms of a protein structure could reflect sequence conservation at the residue level. © 2017 Wiley Periodicals, Inc.
Protein docking by the interface structure similarity: how much structure is needed?

PubMed

Sinha, Rohita; Kundrotas, Petras J; Vakser, Ilya A

2012-01-01

The increasing availability of co-crystallized protein-protein complexes provides an opportunity to use template-based modeling for protein-protein docking. Structure alignment techniques are useful in detection of remote target-template similarities. The size of the structure involved in the alignment is important for the success in modeling. This paper describes a systematic large-scale study to find the optimal definition/size of the interfaces for the structure alignment-based docking applications. The results showed that structural areas corresponding to the cutoff values <12 Å across the interface inadequately represent structural details of the interfaces. With the increase of the cutoff beyond 12 Å, the success rate for the benchmark set of 99 protein complexes, did not increase significantly for higher accuracy models, and decreased for lower-accuracy models. The 12 Å cutoff was optimal in our interface alignment-based docking, and a likely best choice for the large-scale (e.g., on the scale of the entire genome) applications to protein interaction networks. The results provide guidelines for the docking approaches, including high-throughput applications to modeled structures.
Rebelling for a Reason: Protein Structural “Outliers”

PubMed Central

Arumugam, Gandhimathi; Nair, Anu G.; Hariharaputran, Sridhar; Ramanathan, Sowdhamini

2013-01-01

Analysis of structural variation in domain superfamilies can reveal constraints in protein evolution which aids protein structure prediction and classification. Structure-based sequence alignment of distantly related proteins, organized in PASS2 database, provides clues about structurally conserved regions among different functional families. Some superfamily members show large structural differences which are functionally relevant. This paper analyses the impact of structural divergence on function for multi-member superfamilies, selected from the PASS2 superfamily alignment database. Functional annotations within superfamilies, with structural outliers or ‘rebels’, are discussed in the context of structural variations. Overall, these data reinforce the idea that functional similarities cannot be extrapolated from mere structural conservation. The implication for fold-function prediction is that the functional annotations can only be inherited with very careful consideration, especially at low sequence identities. PMID:24073209
Alpha-Helical Protein Networks Are Self-Protective and Flaw-Tolerant

PubMed Central

Ackbarow, Theodor; Sen, Dipanjan; Thaulow, Christian; Buehler, Markus J.

2009-01-01

Alpha-helix based protein networks as they appear in intermediate filaments in the cell’s cytoskeleton and the nuclear membrane robustly withstand large deformation of up to several hundred percent strain, despite the presence of structural imperfections or flaws. This performance is not achieved by most synthetic materials, which typically fail at much smaller deformation and show a great sensitivity to the existence of structural flaws. Here we report a series of molecular dynamics simulations with a simple coarse-grained multi-scale model of alpha-helical protein domains, explaining the structural and mechanistic basis for this observed behavior. We find that the characteristic properties of alpha-helix based protein networks are due to the particular nanomechanical properties of their protein constituents, enabling the formation of large dissipative yield regions around structural flaws, effectively protecting the protein network against catastrophic failure. We show that the key for these self protecting properties is a geometric transformation of the crack shape that significantly reduces the stress concentration at corners. Specifically, our analysis demonstrates that the failure strain of alpha-helix based protein networks is insensitive to the presence of structural flaws in the protein network, only marginally affecting their overall strength. Our findings may help to explain the ability of cells to undergo large deformation without catastrophic failure while providing significant mechanical resistance. PMID:19547709

PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics.

PubMed

von Grotthuss, Marcin; Plewczynski, Dariusz; Ginalski, Krzysztof; Rychlewski, Leszek; Shakhnovich, Eugene I

2006-02-06

The number of protein structures from structural genomics centers dramatically increases in the Protein Data Bank (PDB). Many of these structures are functionally unannotated because they have no sequence similarity to proteins of known function. However, it is possible to successfully infer function using only structural similarity. Here we present the PDB-UF database, a web-accessible collection of predictions of enzymatic properties using structure-function relationship. The assignments were conducted for three-dimensional protein structures of unknown function that come from structural genomics initiatives. We show that 4 hypothetical proteins (with PDB accession codes: 1VH0, 1NS5, 1O6D, and 1TO0), for which standard BLAST tools such as PSI-BLAST or RPS-BLAST failed to assign any function, are probably methyltransferase enzymes. We suggest that the structure-based prediction of an EC number should be conducted having the different similarity score cutoff for different protein folds. Moreover, performing the annotation using two different algorithms can reduce the rate of false positive assignments. We believe, that the presented web-based repository will help to decrease the number of protein structures that have functions marked as "unknown" in the PDB file. http://paradox.harvard.edu/PDB-UF and http://bioinfo.pl/PDB-UF.
Functional structural motifs for protein-ligand, protein-protein, and protein-nucleic acid interactions and their connection to supersecondary structures.

PubMed

Kinjo, Akira R; Nakamura, Haruki

2013-01-01

Protein functions are mediated by interactions between proteins and other molecules. One useful approach to analyze protein functions is to compare and classify the structures of interaction interfaces of proteins. Here, we describe the procedures for compiling a database of interface structures and efficiently comparing the interface structures. To do so requires a good understanding of the data structures of the Protein Data Bank (PDB). Therefore, we also provide a detailed account of the PDB exchange dictionary necessary for extracting data that are relevant for analyzing interaction interfaces and secondary structures. We identify recurring structural motifs by classifying similar interface structures, and we define a coarse-grained representation of supersecondary structures (SSS) which represents a sequence of two or three secondary structure elements including their relative orientations as a string of four to seven letters. By examining the correspondence between structural motifs and SSS strings, we show that no SSS string has particularly high propensity to be found interaction interfaces in general, indicating any SSS can be used as a binding interface. When individual structural motifs are examined, there are some SSS strings that have high propensity for particular groups of structural motifs. In addition, it is shown that while the SSS strings found in particular structural motifs for nonpolymer and protein interfaces are as abundant as in other structural motifs that belong to the same subunit, structural motifs for nucleic acid interfaces exhibit somewhat stronger preference for SSS strings. In regard to protein folds, many motif-specific SSS strings were found across many folds, suggesting that SSS may be a useful description to investigate the universality of ligand binding modes.
Detection of amide I signals of interfacial proteins in situ using SFG.

PubMed

Wang, Jie; Even, Mark A; Chen, Xiaoyun; Schmaier, Alvin H; Waite, J Herbert; Chen, Zhan

2003-08-20

In this Communication, we demonstrate the novel observation that it is feasible to collect amide signals from polymer/protein solution interfaces in situ using sum frequency generation (SFG) vibrational spectroscopy. Such SFG amide signals allow for acquisition of more detailed molecular level information of entire interfacial protein structures. Proteins investigated include bovine serum albumin, mussel protein mefp-2, factor XIIa, and ubiquitin. Our studies indicate that different proteins generate different SFG amide signals at the polystyrene/protein solution interface, showing that they have different interfacial coverage, secondary structure, or orientation.
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification.

PubMed

Sinclair, Robert M; Ravantti, Janne J; Bamford, Dennis H

2017-04-15

Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids. Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. Copyright © 2017 Sinclair et al.
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification

PubMed Central

Sinclair, Robert M.; Ravantti, Janne J.

2017-01-01

ABSTRACT Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids. Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. PMID:28122979
Rapid search for tertiary fragments reveals protein sequence–structure relationships

PubMed Central

Zhou, Jianfu; Grigoryan, Gevorg

2015-01-01

Finding backbone substructures from the Protein Data Bank that match an arbitrary query structural motif, composed of multiple disjoint segments, is a problem of growing relevance in structure prediction and protein design. Although numerous protein structure search approaches have been proposed, methods that address this specific task without additional restrictions and on practical time scales are generally lacking. Here, we propose a solution, dubbed MASTER, that is both rapid, enabling searches over the Protein Data Bank in a matter of seconds, and provably correct, finding all matches below a user-specified root-mean-square deviation cutoff. We show that despite the potentially exponential time complexity of the problem, running times in practice are modest even for queries with many segments. The ability to explore naturally plausible structural and sequence variations around a given motif has the potential to synthesize its design principles in an automated manner; so we go on to illustrate the utility of MASTER to protein structural biology. We demonstrate its capacity to rapidly establish structure–sequence relationships, uncover the native designability landscapes of tertiary structural motifs, identify structural signatures of binding, and automatically rewire protein topologies. Given the broad utility of protein tertiary fragment searches, we hope that providing MASTER in an open-source format will enable novel advances in understanding, predicting, and designing protein structure. PMID:25420575
In-situ and real-time growth observation of high-quality protein crystals under quasi-microgravity on earth.

PubMed

Nakamura, Akira; Ohtsuka, Jun; Kashiwagi, Tatsuki; Numoto, Nobutaka; Hirota, Noriyuki; Ode, Takahiro; Okada, Hidehiko; Nagata, Koji; Kiyohara, Motosuke; Suzuki, Ei-Ichiro; Kita, Akiko; Wada, Hitoshi; Tanokura, Masaru

2016-02-26

Precise protein structure determination provides significant information on life science research, although high-quality crystals are not easily obtained. We developed a system for producing high-quality protein crystals with high throughput. Using this system, gravity-controlled crystallization are made possible by a magnetic microgravity environment. In addition, in-situ and real-time observation and time-lapse imaging of crystal growth are feasible for over 200 solution samples independently. In this paper, we also report results of crystallization experiments for two protein samples. Crystals grown in the system exhibited magnetic orientation and showed higher and more homogeneous quality compared with the control crystals. The structural analysis reveals that making use of the magnetic microgravity during the crystallization process helps us to build a well-refined protein structure model, which has no significant structural differences with a control structure. Therefore, the system contributes to improvement in efficiency of structural analysis for "difficult" proteins, such as membrane proteins and supermolecular complexes.
Structural characterization and physicochemical properties of protein extracted from soybean meal assisted by steam flash-explosion with dilute acid soaking.

PubMed

Zhang, Yanpeng; Yang, Ruijin; Zhang, Weinong; Hu, Zhixiong; Zhao, Wei

2017-03-15

The aim of this work was to analyze the influence of steam flash-explosion (SFE) with dilute acid soaking pretreatment on the structural characteristics and physiochemical properties of protein from soybean meal (SBM). The pretreatment led to depolymerisation of soy protein isolate (SPI) and formation of new protein aggregation through non-disulfide covalent bonds, which resulted in broader MW distribution of SPI. The analysis of CD spectroscopy showed that the SFE treatment induced minor changes in secondary structure, however, the intrinsic tryptophan fluorescence revealed that acid soaking and SFE treatment pronouncedly altered the tertiary structure of SPI. The protein zeta potential was shown to be increased after SFE treatment attributed to the changes in protein structure and the covalent coupling between carbohydrate and protein. These results contribute to clarifying the mechanisms of the effect of pretreatment on SPI structure, thus moving further toward implementing SFE in the processing chain of SPI. Copyright © 2016 Elsevier Ltd. All rights reserved.
Protein Secondary Structures (alpha-helix and beta-sheet) at a Cellular Levle and Protein Fractions in Relation to Rumen Degradation Behaviours of Protein: A New Approach

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yu,P.

2007-01-01

Studying the secondary structure of proteins leads to an understanding of the components that make up a whole protein, and such an understanding of the structure of the whole protein is often vital to understanding its digestive behaviour and nutritive value in animals. The main protein secondary structures are the {alpha}-helix and {beta}-sheet. The percentage of these two structures in protein secondary structures influences protein nutritive value, quality and digestive behaviour. A high percentage of {beta}-sheet structure may partly cause a low access to gastrointestinal digestive enzymes, which results in a low protein value. The objectives of the present studymore » were to use advanced synchrotron-based Fourier transform IR (S-FTIR) microspectroscopy as a new approach to reveal the molecular chemistry of the protein secondary structures of feed tissues affected by heat-processing within intact tissue at a cellular level, and to quantify protein secondary structures using multicomponent peak modelling Gaussian and Lorentzian methods, in relation to protein digestive behaviours and nutritive value in the rumen, which was determined using the Cornell Net Carbohydrate Protein System. The synchrotron-based molecular chemistry research experiment was performed at the National Synchrotron Light Source at Brookhaven National Laboratory, US Department of Energy. The results showed that, with S-FTIR microspectroscopy, the molecular chemistry, ultrastructural chemical make-up and nutritive characteristics could be revealed at a high ultraspatial resolution ({approx}10 {mu}m). S-FTIR microspectroscopy revealed that the secondary structure of protein differed between raw and roasted golden flaxseeds in terms of the percentages and ratio of {alpha}-helixes and {beta}-sheets in the mid-IR range at the cellular level. By using multicomponent peak modelling, the results show that the roasting reduced (P <0.05) the percentage of {alpha}-helixes (from 47.1% to 36.1%: S-FTIR absorption intensity), increased the percentage of {beta}-sheets (from 37.2% to 49.8%: S-FTIR absorption intensity) and reduced the {alpha}-helix to {beta}-sheet ratio (from 0.3 to 0.7) in the golden flaxseeds, which indicated a negative effect of the roasting on protein values, utilisation and bioavailability. These results were proved by the Cornell Net Carbohydrate Protein System in situ animal trial, which also revealed that roasting increased the amount of protein bound to lignin, and well as of the Maillard reaction protein (both of which are poorly used by ruminants), and increased the level of indigestible and undegradable protein in ruminants. The present results demonstrate the potential of highly spatially resolved synchrotron-based infrared microspectroscopy to locate 'pure' protein in feed tissues, and reveal protein secondary structures and digestive behaviour, making a significant step forward in and an important contribution to protein nutritional research. Further study is needed to determine the sensitivities of protein secondary structures to various heat-processing conditions, and to quantify the relationship between protein secondary structures and the nutrient availability and digestive behaviour of various protein sources. Information from the present study arising from the synchrotron-based IR probing of the protein secondary structures of protein sources at the cellular level will be valuable as a guide to maintaining protein quality and predicting digestive behaviours.« less
Identification of a Unique Fe-S Cluster Binding Site in a Glycyl-Radical Type Microcompartment Shell Protein

PubMed Central

Thompson, Michael C.; Wheatley, Nicole M.; Jorda, Julien; Sawaya, Michael R.; Gidaniyan, Soheil D.; Ahmed, Hoda; Yang, Zhongyu; McCarty, Krystal N.; Whitelegge, Julian P.; Yeates, Todd O.

2014-01-01

Recently, progress has been made toward understanding the functional diversity of bacterial microcompartment (MCP) systems, which serve as protein-based metabolic organelles in diverse microbes. New types of MCPs have been identified, including the glycyl-radical propanediol (Grp) MCP. Within these elaborate protein complexes, BMC-domain shell proteins assemble to form a polyhedral barrier that encapsulates the enzymatic contents of the MCP. Interestingly, the Grp MCP contains a number of shell proteins with unusual sequence features. GrpU is one such shell protein, whose amino acid sequence is particularly divergent from other members of the BMC-domain superfamily of proteins that effectively defines all MCPs. Expression, purification, and subsequent characterization of the protein showed, unexpectedly, that it binds an iron-sulfur cluster. We determined X-ray crystal structures of two GrpU orthologs, providing the first structural insight into the homohexameric BMC-domain shell proteins of the Grp system. The X-ray structures of GrpU, both obtained in the apo form, combined with spectroscopic analyses and computational modeling, show that the metal cluster resides in the central pore of the BMC shell protein at a position of broken 6-fold symmetry. The result is a structurally polymorphic iron-sulfur cluster binding site that appears to be unique among metalloproteins studied to date. PMID:25102080
Water entrapment and structure ordering as protection mechanisms for protein structural preservation

NASA Astrophysics Data System (ADS)

Arsiccio, A.; Pisano, R.

2018-02-01

In this paper, molecular dynamics is used to further gain insight into the mechanisms by which typical pharmaceutical excipients preserve the protein structure. More specifically, the water entrapment scenario will be analyzed, which states that excipients form a cage around the protein, entrapping and slowing water molecules. Human growth hormone will be used as a model protein, but the results obtained are generally applicable. We will show that water entrapment, as well as the other mechanisms of protein stabilization in the dried state proposed so far, may be related to the formation of a dense hydrogen bonding network between excipient molecules. We will also present a simple phenomenological model capable of explaining the behavior and stabilizing effect provided by typical cryo- and lyo-protectants. This model uses, as input data, molecular properties which can be easily evaluated. We will finally show that the model predictions compare fairly well with experimental data.
Non-Uniform Sampling and J-UNIO Automation for Efficient Protein NMR Structure Determination.

PubMed

Didenko, Tatiana; Proudfoot, Andrew; Dutta, Samit Kumar; Serrano, Pedro; Wüthrich, Kurt

2015-08-24

High-resolution structure determination of small proteins in solution is one of the big assets of NMR spectroscopy in structural biology. Improvements in the efficiency of NMR structure determination by advances in NMR experiments and automation of data handling therefore attracts continued interest. Here, non-uniform sampling (NUS) of 3D heteronuclear-resolved [(1)H,(1)H]-NOESY data yielded two- to three-fold savings of instrument time for structure determinations of soluble proteins. With the 152-residue protein NP_372339.1 from Staphylococcus aureus and the 71-residue protein NP_346341.1 from Streptococcus pneumonia we show that high-quality structures can be obtained with NUS NMR data, which are equally well amenable to robust automated analysis as the corresponding uniformly sampled data. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Structural perturbation of proteins in low denaturant concentrations.

PubMed

Basak, S; Debnath, D; Haque, E; Ray, S; Chakrabarti, A

2001-01-01

The presence of very low concentrations of the widely used chemical denaturants, guanidinium chloride and urea, induce changes in the tertiary structure of proteins. We have presented results on such changes in four structurally unrelated proteins to show that such structural perturbations are common irrespective of their origin. Data representative of such structural changes are shown for the monomeric globular proteins such as horseradish peroxidase (HRP) from a plant, human serum albumin (HSA) and prothrombin from ovine blood serum, and for the membrane-associated, worm-like elongated protein, spectrin, from ovine erythrocytes. Structural alterations in these proteins were reflected in quenching studies of tryptophan fluorescence using the widely used quencher acrylamide. Stern-Volmer quenching constants measured in presence of the denaturants, even at concentrations below 100 mM, were higher than those measured in absence of the denaturants. Both steady-state and time-resolved fluorescence emission properties of tryptophan and of the extrinsic probe PRODAN were used for monitoring conformational changes in the proteins in presence of different low concentrations of the denaturants. These results are consistent with earlier studies from our laboratory indicating structural perturbations in proteins at the tertiary level, keeping their native-like secondary structure and their biological activity more or less intact.
Structural protein 4.1 is located in mammalian centrosomes

PubMed Central

Krauss, Sharon Wald; Chasis, Joel Anne; Rogers, Catherine; Mohandas, Narla; Krockmalnic, Gabriela; Penman, Sheldon

1997-01-01

Structural protein 4.1 was first characterized as an important 80-kDa protein in the mature red cell membrane skeleton. It is now known to be a member of a family of protein isoforms detected at diverse intracellular sites in many nucleated mammalian cells. We recently reported that protein 4.1 isoforms are present at interphase in nuclear matrix and are rearranged during the cell cycle. Here we report that protein 4.1 epitopes are present in centrosomes of human and murine cells and are detected by using affinity-purified antibodies specific for 80-kDa red cell 4.1 and for 4.1 peptides. Immunofluorescence, by both conventional and confocal microscopy, showed that protein 4.1 epitopes localized in the pericentriolar region. Protein 4.1 epitopes remained in centrosomes after extraction of cells with detergent, salt, and DNase. Higher resolution electron microscopy of detergent-extracted cell whole mounts showed centrosomal protein 4.1 epitopes distributed along centriolar cylinders and on pericentriolar fibers, at least some of which constitute the filamentous network surrounding each centriole. Double-label electron microscopy showed that protein 4.1 epitopes were predominately localized in regions also occupied by epitopes for centrosome-specific autoimmune serum 5051 but were not found on microtubules. Our results suggest that protein 4.1 is an integral component of centrosome structure, in which it may play an important role in centrosome function during cell division and organization of cellular architecture. PMID:9207085
Effects of power ultrasound on oxidation and structure of beef proteins during curing processing.

PubMed

Kang, Da-Cheng; Zou, Yun-He; Cheng, Yu-Ping; Xing, Lu-Juan; Zhou, Guang-Hong; Zhang, Wan-Gang

2016-11-01

The aim of this study was to evaluate the effects of power ultrasound intensity (PUS, 2.39, 6.23, 11.32 and 20.96Wcm(-2)) and treatment time (30, 60, 90 and 120min) on the oxidation and structure of beef proteins during the brining procedure with 6% NaCl concentration. The investigation was conducted with an ultrasonic generator with the frequency of 20kHz and fresh beef at 48h after slaughter. Analysis of TBARS (Thiobarbituric acid reactive substances) contents showed that PUS treatment significantly increased the extent of lipid oxidation compared to static brining (P<0.05). As indicators of protein oxidation, the carbonyl contents were significantly affected by PUS (P<0.05). SDS-PAGE analysis showed that PUS treatment increased protein aggregation through disulfide cross-linking, indicated by the decreasing content of total sulfhydryl groups which would contribute to protein oxidation. In addition, changes in protein structure after PUS treatment are suggested by the increases in free sulfhydryl residues and protein surface hydrophobicity. Fourier transformed infrared spectroscopy (FTIR) provided further information about the changes in protein secondary structures with increases in β-sheet and decreases in α-helix contents after PUS processing. These results indicate that PUS leads to changes in structures and oxidation of beef proteins caused by mechanical effects of cavitation and the resultant generation of free radicals. Copyright © 2016 Elsevier B.V. All rights reserved.
Deciphering Cryptic Binding Sites on Proteins by Mixed-Solvent Molecular Dynamics.

PubMed

Kimura, S Roy; Hu, Hai Peng; Ruvinsky, Anatoly M; Sherman, Woody; Favia, Angelo D

2017-06-26

In recent years, molecular dynamics simulations of proteins in explicit mixed solvents have been applied to various problems in protein biophysics and drug discovery, including protein folding, protein surface characterization, fragment screening, allostery, and druggability assessment. In this study, we perform a systematic study on how mixtures of organic solvent probes in water can reveal cryptic ligand binding pockets that are not evident in crystal structures of apo proteins. We examine a diverse set of eight PDB proteins that show pocket opening induced by ligand binding and investigate whether solvent MD simulations on the apo structures can induce the binding site observed in the holo structures. The cosolvent simulations were found to induce conformational changes on the protein surface, which were characterized and compared with the holo structures. Analyses of the biological systems, choice of probes and concentrations, druggability of the resulting induced pockets, and application to drug discovery are discussed here.
Amyloid formation and inhibition of an all-beta protein: A study on fungal polygalacturonase

NASA Astrophysics Data System (ADS)

Chinisaz, Maryam; Ghasemi, Atiyeh; Larijani, Bagher; Ebrahim-Habibi, Azadeh

2014-02-01

Theoretically, all proteins can adopt the nanofibrillar structures known as amyloid, which contain cross-beta structures. The all-beta folded proteins are particularly interesting in this regard, since they appear to be naturally more predisposed toward this structural arrangement. In this study, methanol has been used to drive the beta-helix protein polygalacturonase (PG), toward amyloid fibril formation. Congo red absorbance, thioflavin T fluorescence, circular dichroism (CD) and transmission electron microscopy have been used to characterize this process. Similar to other all-beta proteins, PG shows a non-cooperative fibrillation mechanism, but the structural changes that are monitored by CD indicate a different pattern. Furthermore, several compounds containing aromatic components were tested as potential inhibitors of amyloid formation. Another protein predominantly composed of alpha-helices (human serum albumin) was also targeted by these ligands, in order to get an insight into their potential anti-aggregation property toward structurally different proteins. Among tested compounds, silibinin and chlorpropamide were able to considerably affect both proteins fibrillation process.
Tuning structure of oppositely charged nanoparticle and protein complexes

NASA Astrophysics Data System (ADS)

Kumar, Sugam; Aswal, V. K.; Callow, P.

2014-04-01

Small-angle neutron scattering (SANS) has been used to probe the structures of anionic silica nanoparticles (LS30) and cationic lyszyme protein (M.W. 14.7kD, I.P. ˜ 11.4) by tuning their interaction through the pH variation. The protein adsorption on nanoparticles is found to be increasing with pH and determined by the electrostatic attraction between two components as well as repulsion between protein molecules. We show the strong electrostatic attraction between nanoparticles and protein molecules leads to protein-mediated aggregation of nanoparticles which are characterized by fractal structures. At pH 5, the protein adsorption gives rise to nanoparticle aggregation having surface fractal morphology with close packing of nanoparticles. The surface fractals transform to open structures of mass fractal morphology at higher pH (7 and 9) on approaching isoelectric point (I.P.).
Evolution of plant cell wall: Arabinogalactan-proteins from three moss genera show structural differences compared to seed plants.

PubMed

Bartels, Desirée; Baumann, Alexander; Maeder, Malte; Geske, Thomas; Heise, Esther Marie; von Schwartzenberg, Klaus; Classen, Birgit

2017-05-01

Arabinogalactan-proteins (AGPs) are important proteoglycans of plant cell walls. They seem to be present in most, if not all seed plants, but their occurrence and structure in bryophytes is widely unknown and actually the focus of AGP research. With regard to evolution of plant cell wall, we isolated AGPs from the three mosses Sphagnum sp., Physcomitrella patens and Polytrichastrum formosum. The moss AGPs show structural characteristics common for AGPs of seed plants, but also unique features, especially 3-O-methyl-rhamnose (trivial name acofriose) as terminal monosaccharide not found in arabinogalactan-proteins of angiosperms and 1,2,3-linked galactose as branching point never found in arabinogalactan-proteins before. Copyright © 2017 Elsevier Ltd. All rights reserved.
Superimposition of protein structures with dynamically weighted RMSD.

PubMed

Wu, Di; Wu, Zhijun

2010-02-01

In protein modeling, one often needs to superimpose a group of structures for a protein. A common way to do this is to translate and rotate the structures so that the square root of the sum of squares of coordinate differences of the atoms in the structures, called the root-mean-square deviation (RMSD) of the structures, is minimized. While it has provided a general way of aligning a group of structures, this approach has not taken into account the fact that different atoms may have different properties and they should be compared differently. For this reason, when superimposed with RMSD, the coordinate differences of different atoms should be evaluated with different weights. The resulting RMSD is called the weighted RMSD (wRMSD). Here we investigate the use of a special wRMSD for superimposing a group of structures with weights assigned to the atoms according to certain thermal motions of the atoms. We call such an RMSD the dynamically weighted RMSD (dRMSD). We show that the thermal motions of the atoms can be obtained from several sources such as the mean-square fluctuations that can be estimated by Gaussian network model analysis. We show that the superimposition of structures with dRMSD can successfully identify protein domains and protein motions, and that it has important implications in practice, e.g., in aligning the ensemble of structures determined by nuclear magnetic resonance.

Analysis of protein circular dichroism spectra for secondary structure using a simple matrix multiplication.

PubMed

Compton, L A; Johnson, W C

1986-05-15

Inverse circular dichroism (CD) spectra are presented for each of the five major secondary structures of proteins: alpha-helix, antiparallel and parallel beta-sheet, beta-turn, and other (random) structures. The fraction of the each secondary structure in a protein is predicted by forming the dot product of the corresponding inverse CD spectrum, expressed as a vector, with the CD spectrum of the protein digitized in the same way. We show how this method is based on the construction of the generalized inverse from the singular value decomposition of a set of CD spectra corresponding to proteins whose secondary structures are known from X-ray crystallography. These inverse spectra compute secondary structure directly from protein CD spectra without resorting to least-squares fitting and standard matrix inversion techniques. In addition, spectra corresponding to the individual secondary structures, analogous to the CD spectra of synthetic polypeptides, are generated from the five most significant CD eigenvectors.
Structural effects of protein aging: Terminal marking by deamidation in human triosephosphate isomerase

DOE PAGES

Torres-Larios, Alfredo; Enríquez-Flores, Sergio; Méndez, Sara -Teresa; ...

2015-04-17

Deamidation, the loss of the ammonium group of asparagine and glutamine to form aspartic and glutamic acid, is one of the most commonly occurring post-translational modifications in proteins. Since deamidation rates are encoded in the protein structure, it has been proposed that they can serve as molecular clocks for the timing of biological processes such as protein turnover, development and aging. Despite the importance of this process, there is a lack of detailed structural information explaining the effects of deamidation on the structure of proteins. Here, we studied the effects of deamidation on human triosephosphate isomerase (HsTIM), an enzyme formore » which deamidation of N15 and N71 has been long recognized as the signal for terminal marking of the protein. Deamidation was mimicked by site directed mutagenesis; thus, three mutants of HsTIM (N15D, N71D and N15D/N71D) were characterized. The results show that the N71D mutant resembles, structurally and functionally, the wild type enzyme. In contrast, the N15D mutant displays all the detrimental effects related to deamidation. The N15D/N71D mutant shows only minor additional effects when compared with the N15D mutation, supporting that deamidation of N71 induces negligible effects. The crystal structures show that, in contrast to the N71D mutant, where minimal alterations are observed, the N15D mutation forms new interactions that perturb the structure of loop 1 and loop 3, both critical components of the catalytic site and the interface of HsTIM. Based on a phylogenetic analysis of TIM sequences, we propose the conservation of this mechanism for mammalian TIMs.« less
Structural effects of protein aging: Terminal marking by deamidation in human triosephosphate isomerase

DOE Office of Scientific and Technical Information (OSTI.GOV)

Torres-Larios, Alfredo; Enríquez-Flores, Sergio; Méndez, Sara -Teresa

Deamidation, the loss of the ammonium group of asparagine and glutamine to form aspartic and glutamic acid, is one of the most commonly occurring post-translational modifications in proteins. Since deamidation rates are encoded in the protein structure, it has been proposed that they can serve as molecular clocks for the timing of biological processes such as protein turnover, development and aging. Despite the importance of this process, there is a lack of detailed structural information explaining the effects of deamidation on the structure of proteins. Here, we studied the effects of deamidation on human triosephosphate isomerase (HsTIM), an enzyme formore » which deamidation of N15 and N71 has been long recognized as the signal for terminal marking of the protein. Deamidation was mimicked by site directed mutagenesis; thus, three mutants of HsTIM (N15D, N71D and N15D/N71D) were characterized. The results show that the N71D mutant resembles, structurally and functionally, the wild type enzyme. In contrast, the N15D mutant displays all the detrimental effects related to deamidation. The N15D/N71D mutant shows only minor additional effects when compared with the N15D mutation, supporting that deamidation of N71 induces negligible effects. The crystal structures show that, in contrast to the N71D mutant, where minimal alterations are observed, the N15D mutation forms new interactions that perturb the structure of loop 1 and loop 3, both critical components of the catalytic site and the interface of HsTIM. Based on a phylogenetic analysis of TIM sequences, we propose the conservation of this mechanism for mammalian TIMs.« less
Protein Structure Prediction Using Gas Phase Molecular Dynamics Simulation: EOTAXIN-3 Cytokine as a Case Study

NASA Astrophysics Data System (ADS)

Khairudin, Nurul Bahiyah Ahmad; Wahab, Habibah A.

In the current work, the structure of the enzyme CC chemokine eotaxin-3 (1G2S) was chosen as a case study to investigate the effects of gas phase on the predicted protein conformation using molecular dynamics simulation. Generally, simulating proteins in the gas phase tend to suffer from various drawbacks, among which excessive numbers of protein-protein hydrogen bonds. However, current results showed that the effects of gas phase simulation on 1G2S did not amplify the protein-protein hydrogen bonds. It was also found that some of the hydrogen bonds which were crucial in maintaining the secondary structural elements were disrupted. The predicted models showed high values of RMSD, 11.5 Å and 13.5 Å for both vacuum and explicit solvent simulations, respectively, indicating that the conformers were very much different from the native conformation. Even though the RMSD value for the in vacuo model was slightly lower, it somehow suffered from lower fraction of native contacts, poor hydrogen bonding networks and fewer occurrences of secondary structural elements compared to the solvated model. This finding supports the notion that water plays a dominant role in guiding the protein to fold along the correct path.
Structural studies of the Sputnik virophage.

PubMed

Sun, Siyang; La Scola, Bernard; Bowman, Valorie D; Ryan, Christopher M; Whitelegge, Julian P; Raoult, Didier; Rossmann, Michael G

2010-01-01

The virophage Sputnik is a satellite virus of the giant mimivirus and is the only satellite virus reported to date whose propagation adversely affects its host virus' production. Genome sequence analysis showed that Sputnik has genes related to viruses infecting all three domains of life. Here, we report structural studies of Sputnik, which show that it is about 740 A in diameter, has a T=27 icosahedral capsid, and has a lipid membrane inside the protein shell. Structural analyses suggest that the major capsid protein of Sputnik is likely to have a double jelly-roll fold, although sequence alignments do not show any detectable similarity with other viral double jelly-roll capsid proteins. Hence, the origin of Sputnik's capsid might have been derived from other viruses prior to its association with mimivirus.
Structural Studies of the Sputnik Virophage▿

PubMed Central

Sun, Siyang; La Scola, Bernard; Bowman, Valorie D.; Ryan, Christopher M.; Whitelegge, Julian P.; Raoult, Didier; Rossmann, Michael G.

2010-01-01

The virophage Sputnik is a satellite virus of the giant mimivirus and is the only satellite virus reported to date whose propagation adversely affects its host virus' production. Genome sequence analysis showed that Sputnik has genes related to viruses infecting all three domains of life. Here, we report structural studies of Sputnik, which show that it is about 740 Å in diameter, has a T=27 icosahedral capsid, and has a lipid membrane inside the protein shell. Structural analyses suggest that the major capsid protein of Sputnik is likely to have a double jelly-roll fold, although sequence alignments do not show any detectable similarity with other viral double jelly-roll capsid proteins. Hence, the origin of Sputnik's capsid might have been derived from other viruses prior to its association with mimivirus. PMID:19889775
The hypothetical protein Atu4866 from Agrobacterium tumefaciens adopts a streptavidin-like fold

PubMed Central

Ai, Xuanjun; Semesi, Anthony; Yee, Adelinda; Arrowsmith, Cheryl H.; Choy, Wing-Yiu; Li, Shawn S.C.

2008-01-01

Atu4866 is a 79-residue conserved hypothetical protein of unknown function from Agrobacterium tumefaciens. Protein sequence alignments show that it shares ≥60% sequence identity with 20 other hypothetical proteins of bacterial origin. However, the structures and functions of these proteins remain unknown so far. To gain insight into the function of this family of proteins, we have determined the structure of Atu4866 as a target of a structural genomics project using solution NMR spectroscopy. Our results reveal that Atu4866 adopts a streptavidin-like fold featuring a β-barrel/sandwich formed by eight antiparallel β-strands. Further structural analysis identified a continuous patch of conserved residues on the surface of Atu4866 that may constitute a potential ligand-binding site. PMID:18042676
A novel Multi-Agent Ada-Boost algorithm for predicting protein structural class with the information of protein secondary structure.

PubMed

Fan, Ming; Zheng, Bin; Li, Lihua

2015-10-01

Knowledge of the structural class of a given protein is important for understanding its folding patterns. Although a lot of efforts have been made, it still remains a challenging problem for prediction of protein structural class solely from protein sequences. The feature extraction and classification of proteins are the main problems in prediction. In this research, we extended our earlier work regarding these two aspects. In protein feature extraction, we proposed a scheme by calculating the word frequency and word position from sequences of amino acid, reduced amino acid, and secondary structure. For an accurate classification of the structural class of protein, we developed a novel Multi-Agent Ada-Boost (MA-Ada) method by integrating the features of Multi-Agent system into Ada-Boost algorithm. Extensive experiments were taken to test and compare the proposed method using four benchmark datasets in low homology. The results showed classification accuracies of 88.5%, 96.0%, 88.4%, and 85.5%, respectively, which are much better compared with the existing methods. The source code and dataset are available on request.
Structure of the Get3 targeting factor in complex with its membrane protein cargo

DOE PAGES

Mateja, Agnieszka; Paduch, Marcin; Chang, Hsin-Yang; ...

2015-03-06

Tail-anchored (TA) proteins are a physiologically important class of membrane proteins targeted to the endoplasmic reticulum by the conserved guided-entry of TA proteins (GET) pathway. During transit, their hydrophobic transmembrane domains (TMDs) are chaperoned by the cytosolic targeting factor Get3, but the molecular nature of the functional Get3-TA protein targeting complex remains unknown. In this paper, we reconstituted the physiologic assembly pathway for a functional targeting complex and showed that it comprises a TA protein bound to a Get3 homodimer. Crystal structures of Get3 bound to different TA proteins showed an α-helical TMD occupying a hydrophobic groove that spans themore » Get3 homodimer. Finally, our data elucidate the mechanism of TA protein recognition and shielding by Get3 and suggest general principles of hydrophobic domain chaperoning by cellular targeting factors.« less
Meta-structure correlation in protein space unveils different selection rules for folded and intrinsically disordered proteins.

PubMed

Naranjo, Yandi; Pons, Miquel; Konrat, Robert

2012-01-01

The number of existing protein sequences spans a very small fraction of sequence space. Natural proteins have overcome a strong negative selective pressure to avoid the formation of insoluble aggregates. Stably folded globular proteins and intrinsically disordered proteins (IDPs) use alternative solutions to the aggregation problem. While in globular proteins folding minimizes the access to aggregation prone regions, IDPs on average display large exposed contact areas. Here, we introduce the concept of average meta-structure correlation maps to analyze sequence space. Using this novel conceptual view we show that representative ensembles of folded and ID proteins show distinct characteristics and respond differently to sequence randomization. By studying the way evolutionary constraints act on IDPs to disable a negative function (aggregation) we might gain insight into the mechanisms by which function-enabling information is encoded in IDPs.
Connecting Protein Structure to Intermolecular Interactions: A Computer Modeling Laboratory

ERIC Educational Resources Information Center

Abualia, Mohammed; Schroeder, Lianne; Garcia, Megan; Daubenmire, Patrick L.; Wink, Donald J.; Clark, Ginevra A.

2016-01-01

An understanding of protein folding relies on a solid foundation of a number of critical chemical concepts, such as molecular structure, intra-/intermolecular interactions, and relating structure to function. Recent reports show that students struggle on all levels to achieve these understandings and use them in meaningful ways. Further, several…
Identification of a new protein in the centrosome-like "atractophore" of Trichomonas vaginalis.

PubMed

Bricheux, Geneviève; Coffe, Gérard; Brugerolle, Guy

2007-06-01

The human parasite Trichomonas vaginalis has specific structural bodies, atractophores, associated at one end to the kinetosomes and at the other to the spindle during division. A monoclonal antibody specific for a component of this structure was obtained. It recognizes a protein with a predicted molecular mass of 477 kDa. Sequence analysis of this protein shows that P477 belongs to the family of large coiled-coil proteins, sharing a highly versatile protein folding motif adaptable to many biological functions. P477-might act as an anchor to localize cellular activities and components to the golgi centrosomal region. It may represent a new class of structural proteins, since similar proteins were found in many protozoans.
Effect of polarization on the stability of a helix dimer

NASA Astrophysics Data System (ADS)

Wang, Xing Y.; Zhang, John Z. H.

2011-01-01

Molecular dynamics (MD) simulations have been carried out to study helix-helix interaction using both standard AMBER and polarized force fields. Comparison of the two simulations shows that electrostatic polarization of intra-protein hydrogen bonds plays a significant role in stabilizing the structure of helix dimer. This stabilizing effect is clearly demonstrated by examining the monomer structure, helix crossing angle and stability of backbone hydrogen bonds under AMBER and PPC. Since reliable prediction of protein-protein structure is a significant challenge, the current study should help shed light on the importance of electrostatic polarization of protein in helix-helix interaction and helix bundle structures.
Segmented molecular design of self-healing proteinaceous materials

PubMed Central

Sariola, Veikko; Pena-Francesch, Abdon; Jung, Huihun; Çetinkaya, Murat; Pacheco, Carlos; Sitti, Metin; Demirel, Melik C.

2015-01-01

Hierarchical assembly of self-healing adhesive proteins creates strong and robust structural and interfacial materials, but understanding of the molecular design and structure–property relationships of structural proteins remains unclear. Elucidating this relationship would allow rational design of next generation genetically engineered self-healing structural proteins. Here we report a general self-healing and -assembly strategy based on a multiphase recombinant protein based material. Segmented structure of the protein shows soft glycine- and tyrosine-rich segments with self-healing capability and hard beta-sheet segments. The soft segments are strongly plasticized by water, lowering the self-healing temperature close to body temperature. The hard segments self-assemble into nanoconfined domains to reinforce the material. The healing strength scales sublinearly with contact time, which associates with diffusion and wetting of autohesion. The finding suggests that recombinant structural proteins from heterologous expression have potential as strong and repairable engineering materials. PMID:26323335
Computational Analysis Reveals the Association of Threonine 118 Methionine Mutation in PMP22 Resulting in CMT-1A

PubMed Central

Swetha, Rayapadi G.

2014-01-01

The T118M mutation in PMP22 gene is associated with Charcot Marie Tooth, type 1A (CMT1A). CMT1A is a form of Charcot-Marie-Tooth disease, the most common inherited disorder of the peripheral nervous system. Mutations in CMT related disorder are seen to increase the stability of the protein resulting in the diseased state. We performed SNP analysis for all the nsSNPs of PMP22 protein and carried out molecular dynamics simulation for T118M mutation to compare the stability difference between the wild type protein structure and the mutant protein structure. The mutation T118M resulted in the overall increase in the stability of the mutant protein. The superimposed structure shows marked structural variation between the wild type and the mutant protein structures. PMID:25400662
Hidden Markov model-derived structural alphabet for proteins: the learning of protein local shapes captures sequence specificity.

PubMed

Camproux, A C; Tufféry, P

2005-08-05

Understanding and predicting protein structures depend on the complexity and the accuracy of the models used to represent them. We have recently set up a Hidden Markov Model to optimally compress protein three-dimensional conformations into a one-dimensional series of letters of a structural alphabet. Such a model learns simultaneously the shape of representative structural letters describing the local conformation and the logic of their connections, i.e. the transition matrix between the letters. Here, we move one step further and report some evidence that such a model of protein local architecture also captures some accurate amino acid features. All the letters have specific and distinct amino acid distributions. Moreover, we show that words of amino acids can have significant propensities for some letters. Perspectives point towards the prediction of the series of letters describing the structure of a protein from its amino acid sequence.
Structure of the parainfluenza virus 5 F protein in its metastable, prefusion conformation.

PubMed

Yin, Hsien-Sheng; Wen, Xiaolin; Paterson, Reay G; Lamb, Robert A; Jardetzky, Theodore S

2006-01-05

Enveloped viruses have evolved complex glycoprotein machinery that drives the fusion of viral and cellular membranes, permitting entry of the viral genome into the cell. For the paramyxoviruses, the fusion (F) protein catalyses this membrane merger and entry step, and it has been postulated that the F protein undergoes complex refolding during this process. Here we report the crystal structure of the parainfluenza virus 5 F protein in its prefusion conformation, stabilized by the addition of a carboxy-terminal trimerization domain. The structure of the F protein shows that there are profound conformational differences between the pre- and postfusion states, involving transformations in secondary and tertiary structure. The positions and structural transitions of key parts of the fusion machinery, including the hydrophobic fusion peptide and two helical heptad repeat regions, clarify the mechanism of membrane fusion mediated by the F protein.
Utilizing knowledge base of amino acids structural neighborhoods to predict protein-protein interaction sites.

PubMed

Jelínek, Jan; Škoda, Petr; Hoksza, David

2017-12-06

Protein-protein interactions (PPI) play a key role in an investigation of various biochemical processes, and their identification is thus of great importance. Although computational prediction of which amino acids take part in a PPI has been an active field of research for some time, the quality of in-silico methods is still far from perfect. We have developed a novel prediction method called INSPiRE which benefits from a knowledge base built from data available in Protein Data Bank. All proteins involved in PPIs were converted into labeled graphs with nodes corresponding to amino acids and edges to pairs of neighboring amino acids. A structural neighborhood of each node was then encoded into a bit string and stored in the knowledge base. When predicting PPIs, INSPiRE labels amino acids of unknown proteins as interface or non-interface based on how often their structural neighborhood appears as interface or non-interface in the knowledge base. We evaluated INSPiRE's behavior with respect to different types and sizes of the structural neighborhood. Furthermore, we examined the suitability of several different features for labeling the nodes. Our evaluations showed that INSPiRE clearly outperforms existing methods with respect to Matthews correlation coefficient. In this paper we introduce a new knowledge-based method for identification of protein-protein interaction sites called INSPiRE. Its knowledge base utilizes structural patterns of known interaction sites in the Protein Data Bank which are then used for PPI prediction. Extensive experiments on several well-established datasets show that INSPiRE significantly surpasses existing PPI approaches.
Oligomerization of a molecular chaperone modulates its activity

PubMed Central

Kawagoe, Soichiro; Ishimori, Koichiro

2018-01-01

Molecular chaperones alter the folding properties of cellular proteins via mechanisms that are not well understood. Here, we show that Trigger Factor (TF), an ATP-independent chaperone, exerts strikingly contrasting effects on the folding of non-native proteins as it transitions between a monomeric and a dimeric state. We used NMR spectroscopy to determine the atomic resolution structure of the 100 kDa dimeric TF. The structural data show that some of the substrate-binding sites are buried in the dimeric interface, explaining the lower affinity for protein substrates of the dimeric compared to the monomeric TF. Surprisingly, the dimeric TF associates faster with proteins and it exhibits stronger anti-aggregation and holdase activity than the monomeric TF. The structural data show that the dimer assembles in a way that substrate-binding sites in the two subunits form a large contiguous surface inside a cavity, thus accounting for the observed accelerated association with unfolded proteins. Our results demonstrate how the activity of a chaperone can be modulated to provide distinct functional outcomes in the cell. PMID:29714686
Chemical synthesis and X-ray structure of a heterochiral {D-protein antagonist plus vascular endothelial growth factor} protein complex by racemic crystallography.

PubMed

Mandal, Kalyaneswar; Uppalapati, Maruti; Ault-Riché, Dana; Kenney, John; Lowitz, Joshua; Sidhu, Sachdev S; Kent, Stephen B H

2012-09-11

Total chemical synthesis was used to prepare the mirror image (D-protein) form of the angiogenic protein vascular endothelial growth factor (VEGF-A). Phage display against D-VEGF-A was used to screen designed libraries based on a unique small protein scaffold in order to identify a high affinity ligand. Chemically synthesized D- and L- forms of the protein ligand showed reciprocal chiral specificity in surface plasmon resonance binding experiments: The L-protein ligand bound only to D-VEGF-A, whereas the D-protein ligand bound only to L-VEGF-A. The D-protein ligand, but not the L-protein ligand, inhibited the binding of natural VEGF(165) to the VEGFR1 receptor. Racemic protein crystallography was used to determine the high resolution X-ray structure of the heterochiral complex consisting of {D-protein antagonist + L-protein form of VEGF-A}. Crystallization of a racemic mixture of these synthetic proteins in appropriate stoichiometry gave a racemic protein complex of more than 73 kDa containing six synthetic protein molecules. The structure of the complex was determined to a resolution of 1.6 Å. Detailed analysis of the interaction between the D-protein antagonist and the VEGF-A protein molecule showed that the binding interface comprised a contact surface area of approximately 800 Å(2) in accord with our design objectives, and that the D-protein antagonist binds to the same region of VEGF-A that interacts with VEGFR1-domain 2.

Effect of drying methods on the structure, thermo and functional properties of fenugreek (Trigonella foenum graecum) protein isolate.

PubMed

Feyzi, Samira; Varidi, Mehdi; Zare, Fatemeh; Varidi, Mohammad Javad

2018-03-01

Different drying methods due to protein denaturation could alter the functional properties of proteins, as well as their structure. So, this study focused on the effect of different drying methods on amino acid content, thermo and functional properties, and protein structure of fenugreek protein isolate. Freeze and spray drying methods resulted in comparable protein solubility, dynamic surface and interfacial tensions, foaming and emulsifying properties except for emulsion stability. Vacuum oven drying promoted emulsion stability, surface hydrophobicity and viscosity of fenugreek protein isolate at the expanse of its protein solubility. Vacuum oven process caused a higher level of Maillard reaction followed by the spray drying process, which was confirmed by the lower amount of lysine content and less lightness, also more browning intensity. ΔH of fenugreek protein isolates was higher than soy protein isolate, which confirmed the presence of more ordered structures. Also, the bands which are attributed to the α-helix structures in the FTIR spectrum were in the shorter wave number region for freeze and spray dried fenugreek protein isolates that show more possibility of such structures. This research suggests that any drying method must be conducted in its gentle state in order to sustain native structure of proteins and promote their functionalities. © 2017 Society of Chemical Industry. © 2017 Society of Chemical Industry.
Surface layer protein characterization by small angle x-ray scattering and a fractal mean force concept: from protein structure to nanodisk assemblies.

PubMed

Horejs, Christine; Pum, Dietmar; Sleytr, Uwe B; Peterlik, Herwig; Jungbauer, Alois; Tscheliessnig, Rupert

2010-11-07

Surface layers (S-layers) are the most commonly observed cell surface structure of prokaryotic organisms. They are made up of proteins that spontaneously self-assemble into functional crystalline lattices in solution, on various solid surfaces, and interfaces. While classical experimental techniques failed to recover a complete structural model of an unmodified S-layer protein, small angle x-ray scattering (SAXS) provides an opportunity to study the structure of S-layer monomers in solution and of self-assembled two-dimensional sheets. For the protein under investigation we recently suggested an atomistic structural model by the use of molecular dynamics simulations. This structural model is now refined on the basis of SAXS data together with a fractal assembly approach. Here we show that a nondiluted critical system of proteins, which crystallize into monomolecular structures, might be analyzed by SAXS if protein-protein interactions are taken into account by relating a fractal local density distribution to a fractal local mean potential, which has to fulfill the Poisson equation. The present work demonstrates an important step into the elucidation of the structure of S-layers and offers a tool to analyze the structure of self-assembling systems in solution by means of SAXS and computer simulations.
Surface layer protein characterization by small angle x-ray scattering and a fractal mean force concept: From protein structure to nanodisk assemblies

NASA Astrophysics Data System (ADS)

Horejs, Christine; Pum, Dietmar; Sleytr, Uwe B.; Peterlik, Herwig; Jungbauer, Alois; Tscheliessnig, Rupert

2010-11-01

Surface layers (S-layers) are the most commonly observed cell surface structure of prokaryotic organisms. They are made up of proteins that spontaneously self-assemble into functional crystalline lattices in solution, on various solid surfaces, and interfaces. While classical experimental techniques failed to recover a complete structural model of an unmodified S-layer protein, small angle x-ray scattering (SAXS) provides an opportunity to study the structure of S-layer monomers in solution and of self-assembled two-dimensional sheets. For the protein under investigation we recently suggested an atomistic structural model by the use of molecular dynamics simulations. This structural model is now refined on the basis of SAXS data together with a fractal assembly approach. Here we show that a nondiluted critical system of proteins, which crystallize into monomolecular structures, might be analyzed by SAXS if protein-protein interactions are taken into account by relating a fractal local density distribution to a fractal local mean potential, which has to fulfill the Poisson equation. The present work demonstrates an important step into the elucidation of the structure of S-layers and offers a tool to analyze the structure of self-assembling systems in solution by means of SAXS and computer simulations.
BAYESIAN PROTEIN STRUCTURE ALIGNMENT.

PubMed

Rodriguez, Abel; Schmidler, Scott C

The analysis of the three-dimensional structure of proteins is an important topic in molecular biochemistry. Structure plays a critical role in defining the function of proteins and is more strongly conserved than amino acid sequence over evolutionary timescales. A key challenge is the identification and evaluation of structural similarity between proteins; such analysis can aid in understanding the role of newly discovered proteins and help elucidate evolutionary relationships between organisms. Computational biologists have developed many clever algorithmic techniques for comparing protein structures, however, all are based on heuristic optimization criteria, making statistical interpretation somewhat difficult. Here we present a fully probabilistic framework for pairwise structural alignment of proteins. Our approach has several advantages, including the ability to capture alignment uncertainty and to estimate key "gap" parameters which critically affect the quality of the alignment. We show that several existing alignment methods arise as maximum a posteriori estimates under specific choices of prior distributions and error models. Our probabilistic framework is also easily extended to incorporate additional information, which we demonstrate by including primary sequence information to generate simultaneous sequence-structure alignments that can resolve ambiguities obtained using structure alone. This combined model also provides a natural approach for the difficult task of estimating evolutionary distance based on structural alignments. The model is illustrated by comparison with well-established methods on several challenging protein alignment examples.
Surface layer protein characterization by small angle x-ray scattering and a fractal mean force concept: From protein structure to nanodisk assemblies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Horejs, Christine; Pum, Dietmar; Sleytr, Uwe B.

2010-11-07

Surface layers (S-layers) are the most commonly observed cell surface structure of prokaryotic organisms. They are made up of proteins that spontaneously self-assemble into functional crystalline lattices in solution, on various solid surfaces, and interfaces. While classical experimental techniques failed to recover a complete structural model of an unmodified S-layer protein, small angle x-ray scattering (SAXS) provides an opportunity to study the structure of S-layer monomers in solution and of self-assembled two-dimensional sheets. For the protein under investigation we recently suggested an atomistic structural model by the use of molecular dynamics simulations. This structural model is now refined on themore » basis of SAXS data together with a fractal assembly approach. Here we show that a nondiluted critical system of proteins, which crystallize into monomolecular structures, might be analyzed by SAXS if protein-protein interactions are taken into account by relating a fractal local density distribution to a fractal local mean potential, which has to fulfill the Poisson equation. The present work demonstrates an important step into the elucidation of the structure of S-layers and offers a tool to analyze the structure of self-assembling systems in solution by means of SAXS and computer simulations.« less
Conformational switching upon phosphorylation: a predictive framework based on energy landscape principles.

PubMed

Lätzer, Joachim; Shen, Tongye; Wolynes, Peter G

2008-02-19

We investigate how post-translational phosphorylation modifies the global conformation of a protein by changing its free energy landscape using two test proteins, cystatin and NtrC. We first examine the changes in a free energy landscape caused by phosphorylation using a model containing information about both structural forms. For cystatin the free energy cost is fairly large indicating a low probability of sampling the phosphorylated conformation in a perfectly funneled landscape. The predicted barrier for NtrC conformational transition is several times larger than the barrier for cystatin, indicating that the switch protein NtrC most probably follows a partial unfolding mechanism to move from one basin to the other. Principal component analysis and linear response theory show how the naturally occurring conformational changes in unmodified proteins are captured and stabilized by the change of interaction potential. We also develop a partially guided structure prediction Hamiltonian which is capable of predicting the global structure of a phosphorylated protein using only knowledge of the structure of the unphosphorylated protein or vice versa. This algorithm makes use of a generic transferable long-range residue contact potential along with details of structure short range in sequence. By comparing the results obtained with this guided transferable potential to those from the native-only, perfectly funneled Hamiltonians, we show that the transferable Hamiltonian correctly captures the nature of the global conformational changes induced by phosphorylation and can sample substantially correct structures for the modified protein with high probability.
InterPred: A pipeline to identify and model protein-protein interactions.

PubMed

Mirabello, Claudio; Wallner, Björn

2017-06-01

Protein-protein interactions (PPI) are crucial for protein function. There exist many techniques to identify PPIs experimentally, but to determine the interactions in molecular detail is still difficult and very time-consuming. The fact that the number of PPIs is vastly larger than the number of individual proteins makes it practically impossible to characterize all interactions experimentally. Computational approaches that can bridge this gap and predict PPIs and model the interactions in molecular detail are greatly needed. Here we present InterPred, a fully automated pipeline that predicts and model PPIs from sequence using structural modeling combined with massive structural comparisons and molecular docking. A key component of the method is the use of a novel random forest classifier that integrate several structural features to distinguish correct from incorrect protein-protein interaction models. We show that InterPred represents a major improvement in protein-protein interaction detection with a performance comparable or better than experimental high-throughput techniques. We also show that our full-atom protein-protein complex modeling pipeline performs better than state of the art protein docking methods on a standard benchmark set. In addition, InterPred was also one of the top predictors in the latest CAPRI37 experiment. InterPred source code can be downloaded from http://wallnerlab.org/InterPred Proteins 2017; 85:1159-1170. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Structure Prediction and Analysis of DNA Transposon and LINE Retrotransposon Proteins*

PubMed Central

Abrusán, György; Zhang, Yang; Szilágyi, András

2013-01-01

Despite the considerable amount of research on transposable elements, no large-scale structural analyses of the TE proteome have been performed so far. We predicted the structures of hundreds of proteins from a representative set of DNA and LINE transposable elements and used the obtained structural data to provide the first general structural characterization of TE proteins and to estimate the frequency of TE domestication and horizontal transfer events. We show that 1) ORF1 and Gag proteins of retrotransposons contain high amounts of structural disorder; thus, despite their very low conservation, the presence of disordered regions and probably their chaperone function is conserved. 2) The distribution of SCOP classes in DNA transposons and LINEs indicates that the proteins of DNA transposons are more ancient, containing folds that already existed when the first cellular organisms appeared. 3) DNA transposon proteins have lower contact order than randomly selected reference proteins, indicating rapid folding, most likely to avoid protein aggregation. 4) Structure-based searches for TE homologs indicate that the overall frequency of TE domestication events is low, whereas we found a relatively high number of cases where horizontal transfer, frequently involving parasites, is the most likely explanation for the observed homology. PMID:23530042
Crystal structure of the Melampsora lini effector AvrP reveals insights into a possible nuclear function and recognition by the flax disease resistance protein P.

PubMed

Zhang, Xiaoxiao; Farah, Nadya; Rolston, Laura; Ericsson, Daniel J; Catanzariti, Ann-Maree; Bernoux, Maud; Ve, Thomas; Bendak, Katerina; Chen, Chunhong; Mackay, Joel P; Lawrence, Gregory J; Hardham, Adrienne; Ellis, Jeffrey G; Williams, Simon J; Dodds, Peter N; Jones, David A; Kobe, Bostjan

2018-05-01

The effector protein AvrP is secreted by the flax rust fungal pathogen (Melampsora lini) and recognized specifically by the flax (Linum usitatissimum) P disease resistance protein, leading to effector-triggered immunity. To investigate the biological function of this effector and the mechanisms of specific recognition by the P resistance protein, we determined the crystal structure of AvrP. The structure reveals an elongated zinc-finger-like structure with a novel interleaved zinc-binding topology. The residues responsible for zinc binding are conserved in AvrP effector variants and mutations of these motifs result in a loss of P-mediated recognition. The first zinc-coordinating region of the structure displays a positively charged surface and shows some limited similarities to nucleic acid-binding and chromatin-associated proteins. We show that the majority of the AvrP protein accumulates in the plant nucleus when transiently expressed in Nicotiana benthamiana cells, suggesting a nuclear pathogenic function. Polymorphic residues in AvrP and its allelic variants map to the protein surface and could be associated with differences in recognition specificity. Several point mutations of residues on the non-conserved surface patch result in a loss of recognition by P, suggesting that these residues are required for recognition. © 2017 BSPP AND JOHN WILEY & SONS LTD.
The calcium binding properties and structure prediction of the Hax-1 protein.

PubMed

Balcerak, Anna; Rowinski, Sebastian; Szafron, Lukasz M; Grzybowska, Ewa A

2017-01-01

Hax-1 is a protein involved in regulation of different cellular processes, but its properties and exact mechanisms of action remain unknown. In this work, using purified, recombinant Hax-1 and by applying an in vitro autoradiography assay we have shown that this protein binds Ca 2+ . Additionally, we performed structure prediction analysis which shows that Hax-1 displays definitive structural features, such as two α-helices, short β-strands and four disordered segments.
Qualitative and quantitative changes in phospholipids and proteins investigated by spectroscopic techniques in olfactory bulbectomy animal depression model.

PubMed

Depciuch, J; Parlinska-Wojtan, M

2018-01-30

Depression becomes nowadays a high mortality civilization disease with one of the potential causes being impaired smell. In this study Raman, Fourier Transform Infra Red (FTIR) and Ultraviolet-Visible (UV-vis) spectroscopies were used to determine the changes in the quantity and structure of phospholipids and proteins in the blood serum of bulbectomized rats (OB_NaCl), which is a common animal depression model. The efficiency of amitriptyline (AMI) treatment was also evaluated. The obtained results show a significant decrease in the phospholipid and protein fractions (as well as changes in their secondary structures) in blood serum of bulbectomized rats. AMI treatment in bulbectomized rats increased protein level and did not affect the level of phospholipids. Structural information from phospholipids and proteins was obtained from UV-vis spectroscopy combined with the second derivative of the FTIR spectra. Indeed, the structure of proteins in blood serum of bulbectomized rats was normalized after amitriptyline therapy, while the damaged structure of phospholipids remained unaffected. These findings strongly suggest that impaired smell could be one of the causes of depression and may induce permanent (irreversible) damages into the phospholipid structure identified as shortened carbon chains. This study shows a possible new application of spectroscopic techniques in the diagnosis and therapy monitoring of depression. Copyright © 2017 Elsevier B.V. All rights reserved.
Relating protein conformational changes to packing efficiency and disorder

PubMed Central

Bhardwaj, Nitin; Gerstein, Mark

2009-01-01

Changes in protein conformation play key roles in facilitating various biochemical processes, ranging from signaling and phosphorylation to transport and catalysis. While various factors that drive these motions such as environmental changes and binding of small molecules are well understood, specific causative effects on the structural features of the protein due to these conformational changes have not been studied on a large scale. Here, we study protein conformational changes in relation to two key structural metrics: packing efficiency and disorder. Packing has been shown to be crucial for protein stability and function by many protein design and engineering studies. We study changes in packing efficiency during conformational changes, thus extending the analysis from a static context to a dynamic perspective and report some interesting observations. First, we study various proteins that adopt alternate conformations and find that tendencies to show motion and change in packing efficiency are correlated: residues that change their packing efficiency show larger motions. Second, our results suggest that residues that show higher changes in packing during motion are located on the changing interfaces which are formed during these conformational changes. These changing interfaces are slightly different from shear or static interfaces that have been analyzed in previous studies. Third, analysis of packing efficiency changes in the context of secondary structure shows that, as expected, residues buried in helices show the least change in packing efficiency, whereas those embedded in bends are most likely to change packing. Finally, by relating protein disorder to motions, we show that marginally disordered residues which are ordered enough to be crystallized but have sequence patterns indicative of disorder show higher dislocation and a higher change in packing than ordered ones and are located mostly on the changing interfaces. Overall, our results demonstrate that between the two conformations, the cores of the proteins remain mostly intact, whereas the interfaces display the most elasticity, both in terms of disorder and change in packing efficiency. By doing a variety of tests, we also show that our observations are robust to the solvation state of the proteins. PMID:19472340
Bayesian comparison of protein structures using partial Procrustes distance.

PubMed

Ejlali, Nasim; Faghihi, Mohammad Reza; Sadeghi, Mehdi

2017-09-26

An important topic in bioinformatics is the protein structure alignment. Some statistical methods have been proposed for this problem, but most of them align two protein structures based on the global geometric information without considering the effect of neighbourhood in the structures. In this paper, we provide a Bayesian model to align protein structures, by considering the effect of both local and global geometric information of protein structures. Local geometric information is incorporated to the model through the partial Procrustes distance of small substructures. These substructures are composed of β-carbon atoms from the side chains. Parameters are estimated using a Markov chain Monte Carlo (MCMC) approach. We evaluate the performance of our model through some simulation studies. Furthermore, we apply our model to a real dataset and assess the accuracy and convergence rate. Results show that our model is much more efficient than previous approaches.
Exploring the repeat protein universe through computational protein design

DOE PAGES

Brunette, TJ; Parmeggiani, Fabio; Huang, Po-Ssu; ...

2015-12-16

A central question in protein evolution is the extent to which naturally occurring proteins sample the space of folded structures accessible to the polypeptide chain. Repeat proteins composed of multiple tandem copies of a modular structure unit are widespread in nature and have critical roles in molecular recognition, signalling, and other essential biological processes. Naturally occurring repeat proteins have been re-engineered for molecular recognition and modular scaffolding applications. In this paper, we use computational protein design to investigate the space of folded structures that can be generated by tandem repeating a simple helix–loop–helix–loop structural motif. Eighty-three designs with sequences unrelatedmore » to known repeat proteins were experimentally characterized. Of these, 53 are monomeric and stable at 95 °C, and 43 have solution X-ray scattering spectra consistent with the design models. Crystal structures of 15 designs spanning a broad range of curvatures are in close agreement with the design models with root mean square deviations ranging from 0.7 to 2.5 Å. Finally, our results show that existing repeat proteins occupy only a small fraction of the possible repeat protein sequence and structure space and that it is possible to design novel repeat proteins with precisely specified geometries, opening up a wide array of new possibilities for biomolecular engineering.« less
Identify High-Quality Protein Structural Models by Enhanced K-Means.

PubMed

Wu, Hongjie; Li, Haiou; Jiang, Min; Chen, Cheng; Lv, Qiang; Wu, Chuang

2017-01-01

Background. One critical issue in protein three-dimensional structure prediction using either ab initio or comparative modeling involves identification of high-quality protein structural models from generated decoys. Currently, clustering algorithms are widely used to identify near-native models; however, their performance is dependent upon different conformational decoys, and, for some algorithms, the accuracy declines when the decoy population increases. Results. Here, we proposed two enhanced K -means clustering algorithms capable of robustly identifying high-quality protein structural models. The first one employs the clustering algorithm SPICKER to determine the initial centroids for basic K -means clustering ( SK -means), whereas the other employs squared distance to optimize the initial centroids ( K -means++). Our results showed that SK -means and K -means++ were more robust as compared with SPICKER alone, detecting 33 (59%) and 42 (75%) of 56 targets, respectively, with template modeling scores better than or equal to those of SPICKER. Conclusions. We observed that the classic K -means algorithm showed a similar performance to that of SPICKER, which is a widely used algorithm for protein-structure identification. Both SK -means and K -means++ demonstrated substantial improvements relative to results from SPICKER and classical K -means.
Identify High-Quality Protein Structural Models by Enhanced K-Means

PubMed Central

Li, Haiou; Chen, Cheng; Lv, Qiang; Wu, Chuang

2017-01-01

Background. One critical issue in protein three-dimensional structure prediction using either ab initio or comparative modeling involves identification of high-quality protein structural models from generated decoys. Currently, clustering algorithms are widely used to identify near-native models; however, their performance is dependent upon different conformational decoys, and, for some algorithms, the accuracy declines when the decoy population increases. Results. Here, we proposed two enhanced K-means clustering algorithms capable of robustly identifying high-quality protein structural models. The first one employs the clustering algorithm SPICKER to determine the initial centroids for basic K-means clustering (SK-means), whereas the other employs squared distance to optimize the initial centroids (K-means++). Our results showed that SK-means and K-means++ were more robust as compared with SPICKER alone, detecting 33 (59%) and 42 (75%) of 56 targets, respectively, with template modeling scores better than or equal to those of SPICKER. Conclusions. We observed that the classic K-means algorithm showed a similar performance to that of SPICKER, which is a widely used algorithm for protein-structure identification. Both SK-means and K-means++ demonstrated substantial improvements relative to results from SPICKER and classical K-means. PMID:28421198
Dynamic protein interaction networks and new structural paradigms in signaling

PubMed Central

Csizmok, Veronika; Follis, Ariele Viacava; Kriwacki, Richard W.; Forman-Kay, Julie D.

2017-01-01

Understanding signaling and other complex biological processes requires elucidating the critical roles of intrinsically disordered proteins and regions (IDPs/IDRs), which represent ~30% of the proteome and enable unique regulatory mechanisms. In this review we describe the structural heterogeneity of disordered proteins that underpins these mechanisms and the latest progress in obtaining structural descriptions of ensembles of disordered proteins that are needed for linking structure and dynamics to function. We describe the diverse interactions of IDPs that can have unusual characteristics such as “ultrasensitivity” and “regulated folding and unfolding”. We also summarize the mounting data showing that large-scale assembly and protein phase separation occurs within a variety of signaling complexes and cellular structures. In addition, we discuss efforts to therapeutically target disordered proteins with small molecules. Overall, we interpret the remodeling of disordered state ensembles due to binding and post-translational modifications within an expanded framework for allostery that provides significant insights into how disordered proteins transmit biological information. PMID:26922996
Membrane remodeling by amyloidogenic and non-amyloidogenic proteins studied by EPR

NASA Astrophysics Data System (ADS)

Varkey, Jobin; Langen, Ralf

2017-07-01

The advancement in site-directed spin labeling of proteins has enabled EPR studies to expand into newer research areas within the umbrella of protein-membrane interactions. Recently, membrane remodeling by amyloidogenic and non-amyloidogenic proteins has gained a substantial interest in relation to driving and controlling vital cellular processes such as endocytosis, exocytosis, shaping of organelles like endoplasmic reticulum, Golgi and mitochondria, intracellular vesicular trafficking, formation of filopedia and multivesicular bodies, mitochondrial fusion and fission, and synaptic vesicle fusion and recycling in neurotransmission. Misregulation in any of these processes due to an aberrant protein (mutation or misfolding) or alteration of lipid metabolism can be detrimental to the cell and cause disease. Dissection of the structural basis of membrane remodeling by proteins is thus quite necessary for an understanding of the underlying mechanisms, but it remains a formidable task due to the difficulties of various common biophysical tools in monitoring the dynamic process of membrane binding and bending by proteins. This is largely since membranes generally complicate protein structure analysis and this problem is amplified for structural analysis in the presence of different types of membrane curvatures. Recent EPR studies on membrane remodeling by proteins show that a significant structural information can be generated to delineate the role of different protein modules, domains and individual amino acids in the generation of membrane curvature. These studies also show how EPR can complement the data obtained by high resolution techniques such as X-ray and NMR. This perspective covers the application of EPR in recent studies for understanding membrane remodeling by amyloidogenic and non-amyloidogenic proteins that is useful for researchers interested in using or complimenting EPR to gain better understanding of membrane remodeling. We also discuss how a single protein can generate different type of membrane curvatures using specific conformations for specific membrane structures and how EPR is a versatile tool well-suited to analyze subtle alterations in structures under such modifying conditions which otherwise would have been difficult using other biophysical tools.
Evolution of a protein folding nucleus.

PubMed

Xia, Xue; Longo, Liam M; Sutherland, Mason A; Blaber, Michael

2016-07-01

The folding nucleus (FN) is a cryptic element within protein primary structure that enables an efficient folding pathway and is the postulated heritable element in the evolution of protein architecture; however, almost nothing is known regarding how the FN structurally changes as complex protein architecture evolves from simpler peptide motifs. We report characterization of the FN of a designed purely symmetric β-trefoil protein by ϕ-value analysis. We compare the structure and folding properties of key foldable intermediates along the evolutionary trajectory of the β-trefoil. The results show structural acquisition of the FN during gene fusion events, incorporating novel turn structure created by gene fusion. Furthermore, the FN is adjusted by circular permutation in response to destabilizing functional mutation. FN plasticity by way of circular permutation is made possible by the intrinsic C3 cyclic symmetry of the β-trefoil architecture, identifying a possible selective advantage that helps explain the prevalence of cyclic structural symmetry in the proteome. © 2015 The Protein Society.
Protein Structure Determination from Pseudocontact Shifts Using ROSETTA

PubMed Central

Schmitz, Christophe; Vernon, Robert; Otting, Gottfried; Baker, David; Huber, Thomas

2013-01-01

Paramagnetic metal ions generate pseudocontact shifts (PCSs) in nuclear magnetic resonance spectra that are manifested as easily measurable changes in chemical shifts. Metals can be incorporated into proteins through metal binding tags, and PCS data constitute powerful long-range restraints on the positions of nuclear spins relative to the coordinate system of the magnetic susceptibility anisotropy tensor (Δχ-tensor) of the metal ion. We show that three-dimensional structures of proteins can reliably be determined using PCS data from a single metal binding site combined with backbone chemical shifts. The program PCS-ROSETTA automatically determines the Δχ-tensor and metal position from the PCS data during the structure calculations, without any prior knowledge of the protein structure. The program can determine structures accurately for proteins of up to 150 residues, offering a powerful new approach to protein structure determination that relies exclusively on readily measurable backbone chemical shifts and easily discriminates between correctly and incorrectly folded conformations. PMID:22285518

Structural Determination of Functional Domains in Early B-cell Factor (EBF) Family of Transcription Factors Reveals Similarities to Rel DNA-binding Proteins and a Novel Dimerization Motif*

PubMed Central

Siponen, Marina I.; Wisniewska, Magdalena; Lehtiö, Lari; Johansson, Ida; Svensson, Linda; Raszewski, Grzegorz; Nilsson, Lennart; Sigvardsson, Mikael; Berglund, Helena

2010-01-01

The early B-cell factor (EBF) transcription factors are central regulators of development in several organs and tissues. This protein family shows low sequence similarity to other protein families, which is why structural information for the functional domains of these proteins is crucial to understand their biochemical features. We have used a modular approach to determine the crystal structures of the structured domains in the EBF family. The DNA binding domain reveals a striking resemblance to the DNA binding domains of the Rel homology superfamily of transcription factors but contains a unique zinc binding structure, termed zinc knuckle. Further the EBF proteins contain an IPT/TIG domain and an atypical helix-loop-helix domain with a novel type of dimerization motif. The data presented here provide insights into unique structural features of the EBF proteins and open possibilities for detailed molecular investigations of this important transcription factor family. PMID:20592035
Crystal structure of the YGR205w protein from Saccharomyces cerevisiae: close structural resemblance to E. coli pantothenate kinase.

PubMed

Li de La Sierra-Gallay, Ines; Collinet, Bruno; Graille, Marc; Quevillon-Cheruel, Sophie; Liger, Dominique; Minard, Philippe; Blondeau, Karine; Henckes, Gilles; Aufrère, Robert; Leulliot, Nicolas; Zhou, Cong-Zhao; Sorel, Isabelle; Ferrer, Jean-Luc; Poupon, Anne; Janin, Joël; van Tilbeurgh, Herman

2004-03-01

The protein product of the YGR205w gene of Saccharomyces cerevisiae was targeted as part of our yeast structural genomics project. YGR205w codes for a small (290 amino acids) protein with unknown structure and function. The only recognizable sequence feature is the presence of a Walker A motif (P loop) indicating a possible nucleotide binding/converting function. We determined the three-dimensional crystal structure of Se-methionine substituted protein using multiple anomalous diffraction. The structure revealed a well known mononucleotide fold and strong resemblance to the structure of small metabolite phosphorylating enzymes such as pantothenate and phosphoribulo kinase. Biochemical experiments show that YGR205w binds specifically ATP and, less tightly, ADP. The structure also revealed the presence of two bound sulphate ions, occupying opposite niches in a canyon that corresponds to the active site of the protein. One sulphate is bound to the P-loop in a position that corresponds to the position of beta-phosphate in mononucleotide protein ATP complex, suggesting the protein is indeed a kinase. The nature of the phosphate accepting substrate remains to be determined. Copyright 2004 Wiley-Liss, Inc.
Improved cryoEM-Guided Iterative Molecular Dynamics–Rosetta Protein Structure Refinement Protocol for High Precision Protein Structure Prediction

PubMed Central

2016-01-01

Many excellent methods exist that incorporate cryo-electron microscopy (cryoEM) data to constrain computational protein structure prediction and refinement. Previously, it was shown that iteration of two such orthogonal sampling and scoring methods – Rosetta and molecular dynamics (MD) simulations – facilitated exploration of conformational space in principle. Here, we go beyond a proof-of-concept study and address significant remaining limitations of the iterative MD–Rosetta protein structure refinement protocol. Specifically, all parts of the iterative refinement protocol are now guided by medium-resolution cryoEM density maps, and previous knowledge about the native structure of the protein is no longer necessary. Models are identified solely based on score or simulation time. All four benchmark proteins showed substantial improvement through three rounds of the iterative refinement protocol. The best-scoring final models of two proteins had sub-Ångstrom RMSD to the native structure over residues in secondary structure elements. Molecular dynamics was most efficient in refining secondary structure elements and was thus highly complementary to the Rosetta refinement which is most powerful in refining side chains and loop regions. PMID:25883538
Proteopedia: A Collaborative, Virtual 3D Web-Resource for Protein and Biomolecule Structure and Function

ERIC Educational Resources Information Center

Hodis, Eran; Prilusky, Jaime, Sussman, Joel L.

2010-01-01

Protein structures are hard to represent on paper. They are large, complex, and three-dimensional (3D)--four-dimensional if conformational changes count! Unlike most of their substrates, which can easily be drawn out in full chemical formula, drawing every atom in a protein would usually be a mess. Simplifications like showing only the surface of…
Structural elucidation of estrus urinary lipocalin protein (EULP) and evaluating binding affinity with pheromones using molecular docking and fluorescence study

PubMed Central

Rajesh, Durairaj; Muthukumar, Subramanian; Saibaba, Ganesan; Siva, Durairaj; Akbarsha, Mohammad Abdulkader; Gulyás, Balázs; Padmanabhan, Parasuraman; Archunan, Govindaraju

2016-01-01

Transportation of pheromones bound with carrier proteins belonging to lipocalin superfamily is known to prolong chemo-signal communication between individuals belonging to the same species. Members of lipocalin family (MLF) proteins have three structurally conserved motifs for delivery of hydrophobic molecules to the specific recognizer. However, computational analyses are critically required to validate and emphasize the sequence and structural annotation of MLF. This study focused to elucidate the evolution, structural documentation, stability and binding efficiency of estrus urinary lipocalin protein (EULP) with endogenous pheromones adopting in-silico and fluorescence study. The results revealed that: (i) EULP perhaps originated from fatty acid binding protein (FABP) revealed in evolutionary analysis; (ii) Dynamic simulation study shows that EULP is highly stable at below 0.45 Å of root mean square deviation (RMSD); (iii) Docking evaluation shows that EULP has higher binding energy with farnesol and 2-iso-butyl-3-methoxypyrazine (IBMP) than 2-naphthol; and (iv) Competitive binding and quenching assay revealed that purified EULP has good binding interaction with farnesol. Both, In-silico and experimental studies showed that EULP is an efficient binding partner to pheromones. The present study provides impetus to create a point mutation for increasing longevity of EULP to develop pheromone trap for rodent pest management. PMID:27782155
Structural elucidation of estrus urinary lipocalin protein (EULP) and evaluating binding affinity with pheromones using molecular docking and fluorescence study.

PubMed

Rajesh, Durairaj; Muthukumar, Subramanian; Saibaba, Ganesan; Siva, Durairaj; Akbarsha, Mohammad Abdulkader; Gulyás, Balázs; Padmanabhan, Parasuraman; Archunan, Govindaraju

2016-10-26

Transportation of pheromones bound with carrier proteins belonging to lipocalin superfamily is known to prolong chemo-signal communication between individuals belonging to the same species. Members of lipocalin family (MLF) proteins have three structurally conserved motifs for delivery of hydrophobic molecules to the specific recognizer. However, computational analyses are critically required to validate and emphasize the sequence and structural annotation of MLF. This study focused to elucidate the evolution, structural documentation, stability and binding efficiency of estrus urinary lipocalin protein (EULP) with endogenous pheromones adopting in-silico and fluorescence study. The results revealed that: (i) EULP perhaps originated from fatty acid binding protein (FABP) revealed in evolutionary analysis; (ii) Dynamic simulation study shows that EULP is highly stable at below 0.45 Å of root mean square deviation (RMSD); (iii) Docking evaluation shows that EULP has higher binding energy with farnesol and 2-iso-butyl-3-methoxypyrazine (IBMP) than 2-naphthol; and (iv) Competitive binding and quenching assay revealed that purified EULP has good binding interaction with farnesol. Both, In-silico and experimental studies showed that EULP is an efficient binding partner to pheromones. The present study provides impetus to create a point mutation for increasing longevity of EULP to develop pheromone trap for rodent pest management.
Three-dimensional structure of the lithostathine protofibril, a protein involved in Alzheimer's disease.

PubMed

Grégoire, C; Marco, S; Thimonier, J; Duplan, L; Laurine, E; Chauvin, J P; Michel, B; Peyrot, V; Verdier, J M

2001-07-02

Neurodegenerative diseases are characterized by the presence of filamentous aggregates of proteins. We previously established that lithostathine is a protein overexpressed in the pre-clinical stages of Alzheimer's disease. Furthermore, it is present in the pathognomonic lesions associated with Alzheimer's disease. After self-proteolysis, the N-terminally truncated form of lithostathine leads to the formation of fibrillar aggregates. Here we observed using atomic force microscopy that these aggregates consisted of a network of protofibrils, each of which had a twisted appearance. Electron microscopy and image analysis showed that this twisted protofibril has a quadruple helical structure. Three-dimensional X-ray structural data and the results of biochemical experiments showed that when forming a protofibril, lithostathine was first assembled via lateral hydrophobic interactions into a tetramer. Each tetramer then linked up with another tetramer as the result of longitudinal electrostatic interactions. All these results were used to build a structural model for the lithostathine protofibril called the quadruple-helical filament (QHF-litho). In conclusion, lithostathine strongly resembles the prion protein in its dramatic proteolysis and amyloid proteins in its ability to form fibrils.
Protein simulation using coarse-grained two-bead multipole force field with polarizable water models.

PubMed

Li, Min; Zhang, John Z H

2017-02-14

A recently developed two-bead multipole force field (TMFF) is employed in coarse-grained (CG) molecular dynamics (MD) simulation of proteins in combination with polarizable CG water models, the Martini polarizable water model, and modified big multipole water model. Significant improvement in simulated structures and dynamics of proteins is observed in terms of both the root-mean-square deviations (RMSDs) of the structures and residue root-mean-square fluctuations (RMSFs) from the native ones in the present simulation compared with the simulation result with Martini's non-polarizable water model. Our result shows that TMFF simulation using CG water models gives much stable secondary structures of proteins without the need for adding extra interaction potentials to constrain the secondary structures. Our result also shows that by increasing the MD time step from 2 fs to 6 fs, the RMSD and RMSF results are still in excellent agreement with those from all-atom simulations. The current study demonstrated clearly that the application of TMFF together with a polarizable CG water model significantly improves the accuracy and efficiency for CG simulation of proteins.
Protein simulation using coarse-grained two-bead multipole force field with polarizable water models

NASA Astrophysics Data System (ADS)

Li, Min; Zhang, John Z. H.

2017-02-01

A recently developed two-bead multipole force field (TMFF) is employed in coarse-grained (CG) molecular dynamics (MD) simulation of proteins in combination with polarizable CG water models, the Martini polarizable water model, and modified big multipole water model. Significant improvement in simulated structures and dynamics of proteins is observed in terms of both the root-mean-square deviations (RMSDs) of the structures and residue root-mean-square fluctuations (RMSFs) from the native ones in the present simulation compared with the simulation result with Martini's non-polarizable water model. Our result shows that TMFF simulation using CG water models gives much stable secondary structures of proteins without the need for adding extra interaction potentials to constrain the secondary structures. Our result also shows that by increasing the MD time step from 2 fs to 6 fs, the RMSD and RMSF results are still in excellent agreement with those from all-atom simulations. The current study demonstrated clearly that the application of TMFF together with a polarizable CG water model significantly improves the accuracy and efficiency for CG simulation of proteins.
Structural changes induced by binding of the high-mobility group I protein to a mouse satellite DNA sequence.

PubMed Central

Slama-Schwok, A; Zakrzewska, K; Léger, G; Leroux, Y; Takahashi, M; Käs, E; Debey, P

2000-01-01

Using spectroscopic methods, we have studied the structural changes induced in both protein and DNA upon binding of the High-Mobility Group I (HMG-I) protein to a 21-bp sequence derived from mouse satellite DNA. We show that these structural changes depend on the stoichiometry of the protein/DNA complexes formed, as determined by Job plots derived from experiments using pyrene-labeled duplexes. Circular dichroism and melting temperature experiments extended in the far ultraviolet range show that while native HMG-I is mainly random coiled in solution, it adopts a beta-turn conformation upon forming a 1:1 complex in which the protein first binds to one of two dA.dT stretches present in the duplex. HMG-I structure in the 1:1 complex is dependent on the sequence of its DNA target. A 3:1 HMG-I/DNA complex can also form and is characterized by a small increase in the DNA natural bend and/or compaction coupled to a change in the protein conformation, as determined from fluorescence resonance energy transfer (FRET) experiments. In addition, a peptide corresponding to an extended DNA-binding domain of HMG-I induces an ordered condensation of DNA duplexes. Based on the constraints derived from pyrene excimer measurements, we present a model of these nucleated structures. Our results illustrate an extreme case of protein structure induced by DNA conformation that may bear on the evolutionary conservation of the DNA-binding motifs of HMG-I. We discuss the functional relevance of the structural flexibility of HMG-I associated with the nature of its DNA targets and the implications of the binding stoichiometry for several aspects of chromatin structure and gene regulation. PMID:10777751
Revealing Abrupt and Spontaneous Ruptures of Protein Native Structure under picoNewton Compressive Force Manipulation.

PubMed

Chowdhury, S Roy; Cao, Jin; He, Yufan; Lu, H Peter

2018-03-27

Manipulating protein conformations for exploring protein structure-function relationship has shown great promise. Although protein conformational changes under pulling force manipulation have been extensively studied, protein conformation changes under a compressive force have not been explored quantitatively. The latter is even more biologically significant and relevant in revealing protein functions in living cells associated with protein crowdedness, distribution fluctuations, and cell osmotic stress. Here we report our experimental observations on abrupt ruptures of protein native structures under compressive force, demonstrated and studied by single-molecule AFM-FRET spectroscopic nanoscopy. Our results show that the protein ruptures are abrupt and spontaneous events occurred when the compressive force reaches a threshold of 12-75 pN, a force amplitude accessible from thermal fluctuations in a living cell. The abrupt ruptures are sensitive to local environment, likely a general and important pathway of protein unfolding in living cells.
Analysis of protein-protein docking decoys using interaction fingerprints: application to the reconstruction of CaM-ligand complexes.

PubMed

Uchikoga, Nobuyuki; Hirokawa, Takatsugu

2010-05-11

Protein-protein docking for proteins with large conformational changes was analyzed by using interaction fingerprints, one of the scales for measuring similarities among complex structures, utilized especially for searching near-native protein-ligand or protein-protein complex structures. Here, we have proposed a combined method for analyzing protein-protein docking by taking large conformational changes into consideration. This combined method consists of ensemble soft docking with multiple protein structures, refinement of complexes, and cluster analysis using interaction fingerprints and energy profiles. To test for the applicability of this combined method, various CaM-ligand complexes were reconstructed from the NMR structures of unbound CaM. For the purpose of reconstruction, we used three known CaM-ligands, namely, the CaM-binding peptides of cyclic nucleotide gateway (CNG), CaM kinase kinase (CaMKK) and the plasma membrane Ca2+ ATPase pump (PMCA), and thirty-one structurally diverse CaM conformations. For each ligand, 62000 CaM-ligand complexes were generated in the docking step and the relationship between their energy profiles and structural similarities to the native complex were analyzed using interaction fingerprint and RMSD. Near-native clusters were obtained in the case of CNG and CaMKK. The interaction fingerprint method discriminated near-native structures better than the RMSD method in cluster analysis. We showed that a combined method that includes the interaction fingerprint is very useful for protein-protein docking analysis of certain cases.
Deciphering RNA-Recognition Patterns of Intrinsically Disordered Proteins.

PubMed

Srivastava, Ambuj; Ahmad, Shandar; Gromiha, M Michael

2018-05-29

Intrinsically disordered regions (IDRs) and protein (IDPs) are highly flexible owing to their lack of well-defined structures. A subset of such proteins interacts with various substrates; including RNA; frequently adopting regular structures in the final complex. In this work; we have analysed a dataset of protein⁻RNA complexes undergoing disorder-to-order transition (DOT) upon binding. We found that DOT regions are generally small in size (less than 3 residues) for RNA binding proteins. Like structured proteins; positively charged residues are found to interact with RNA molecules; indicating the dominance of electrostatic and cation-π interactions. However, a comparison of binding frequency shows that interface hydrophobic and aromatic residues have more interactions in only DOT regions than in a protein. Further; DOT regions have significantly higher exposure to water than their structured counterparts. Interactions of DOT regions with RNA increase the sheet formation with minor changes in helix forming residues. We have computed the interaction energy for amino acids⁻nucleotide pairs; which showed the preference of His⁻G; Asn⁻U and Ser⁻U at for the interface of DOT regions. This study provides insights to understand protein⁻RNA interactions and the results could also be used for developing a tool for identifying DOT regions in RNA binding proteins.
Lessons in molecular recognition. 2. Assessing and improving cross-docking accuracy.

PubMed

Sutherland, Jeffrey J; Nandigam, Ravi K; Erickson, Jon A; Vieth, Michal

2007-01-01

Docking methods are used to predict the manner in which a ligand binds to a protein receptor. Many studies have assessed the success rate of programs in self-docking tests, whereby a ligand is docked into the protein structure from which it was extracted. Cross-docking, or using a protein structure from a complex containing a different ligand, provides a more realistic assessment of a docking program's ability to reproduce X-ray results. In this work, cross-docking was performed with CDocker, Fred, and Rocs using multiple X-ray structures for eight proteins (two kinases, one nuclear hormone receptor, one serine protease, two metalloproteases, and two phosphodiesterases). While average cross-docking accuracy is not encouraging, it is shown that using the protein structure from the complex that contains the bound ligand most similar to the docked ligand increases docking accuracy for all methods ("similarity selection"). Identifying the most successful protein conformer ("best selection") and similarity selection substantially reduce the difference between self-docking and average cross-docking accuracy. We identify universal predictors of docking accuracy (i.e., showing consistent behavior across most protein-method combinations), and show that models for predicting docking accuracy built using these parameters can be used to select the most appropriate docking method.
Crystal Structure of the Catalytic Domain of Drosophila [beta]1,4-Galactosyltransferase-7

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ramakrishnan, Boopathy; Qasba, Pradman K.

2010-11-03

The {beta}1,4-galactosyltransferase-7 ({beta}4Gal-T7) enzyme, one of seven members of the {beta}4Gal-T family, transfers in the presence of manganese Gal from UDP-Gal to an acceptor sugar (xylose) that is attached to a side chain hydroxyl group of Ser/Thr residues of proteoglycan proteins. It exhibits the least protein sequence similarity with the other family members, including the well studied family member {beta}4Gal-T1, which, in the presence of manganese, transfers Gal from UDP-Gal to GlcNAc. We report here the crystal structure of the catalytic domain of {beta}4Gal-T7 from Drosophila in the presence of manganese and UDP at 1.81 {angstrom} resolution. In the crystalmore » structure, a new manganese ion-binding motif (HXH) has been observed. Superposition of the crystal structures of {beta}4Gal-T7 and {beta}4Gal-T1 shows that the catalytic pocket and the substrate-binding sites in these proteins are similar. Compared with GlcNAc, xylose has a hydroxyl group (instead of an N-acetyl group) at C2 and lacks the CH{sub 2}OH group at C5; thus, these protein structures show significant differences in their acceptor-binding site. Modeling of xylose in the acceptor-binding site of the {beta}4Gal-T7 crystal structure shows that the aromatic side chain of Tyr{sup 177} interacts strongly with the C5 atom of xylose, causing steric hindrance to any additional group at C5. Because Drosophila Cd7 has a 73% protein sequence similarity to human Cd7, the present crystal structure offers a structure-based explanation for the mutations in human Cd7 that have been linked to Ehlers-Danlos syndrome.« less
Effects of Bacillus fermentation on the protein microstructure and anti-nutritional factors of soybean meal.

PubMed

Zheng, L; Li, D; Li, Z-L; Kang, L-N; Jiang, Y-Y; Liu, X-Y; Chi, Y-P; Li, Y-Q; Wang, J-H

2017-12-01

This study evaluated the effects of Bacillus fermentation on soybean meal protein (SBMP) microstructure and major anti-nutritional factors (ANFs) in soybean meal (SBM). The Bacillus siamensis isolate JL8 producing high yield of protease at 519·1 U g -1 was selected for the laboratory production of fermented soybean meal (FSBM). After 24 h fermentation, the FSBM showed better properties compared with those of SBM, the ANFs such as glycinin, β-conglycinin and trypsin inhibitor significantly decreased by 86·0, 70·3 and 95·01%, while in vitro digestibility and absorbability increased by 8·7 and 18·9% respectively. Scanning electron microscopy (SEM) image of fermented soybean meal protein showed smaller aggregates and looser network than that of SBMP. Secondary structure examination of proteins revealed fermentation significantly decreased the content of β-sheet structure by 43·2% and increased the random coil structure by 59·9%. It is demonstrated that Bacillus fermentation improved the nutritional quality of SBM through degrading ANFs and changing the microstructure of SBMP. There is limited information about the structural property changes of soybean protein during fermentation. In this study, physicochemical analysis of soybean meal protein showed evidence that the increase in in vitro digestibility and absorbability of fermented soybean meal reflected the decrease in β-conformation and destruction of original structure in soybean meal protein. The results directly gained the understanding of nutritional quality improvement of soybean meal by Bacillus fermentation, and supply the potential use of Bacillus siamensis for fermented soybean meal production. © 2017 The Society for Applied Microbiology.
G-LoSA: An efficient computational tool for local structure-centric biological studies and drug design.

PubMed

Lee, Hui Sun; Im, Wonpil

2016-04-01

Molecular recognition by protein mostly occurs in a local region on the protein surface. Thus, an efficient computational method for accurate characterization of protein local structural conservation is necessary to better understand biology and drug design. We present a novel local structure alignment tool, G-LoSA. G-LoSA aligns protein local structures in a sequence order independent way and provides a GA-score, a chemical feature-based and size-independent structure similarity score. Our benchmark validation shows the robust performance of G-LoSA to the local structures of diverse sizes and characteristics, demonstrating its universal applicability to local structure-centric comparative biology studies. In particular, G-LoSA is highly effective in detecting conserved local regions on the entire surface of a given protein. In addition, the applications of G-LoSA to identifying template ligands and predicting ligand and protein binding sites illustrate its strong potential for computer-aided drug design. We hope that G-LoSA can be a useful computational method for exploring interesting biological problems through large-scale comparison of protein local structures and facilitating drug discovery research and development. G-LoSA is freely available to academic users at http://im.compbio.ku.edu/GLoSA/. © 2016 The Protein Society.
A Viral-Human Interactome Based on Structural Motif-Domain Interactions Captures the Human Infectome

PubMed Central

Guo, Xianwu; Rodríguez-Pérez, Mario A.

2013-01-01

Protein interactions between a pathogen and its host are fundamental in the establishment of the pathogen and underline the infection mechanism. In the present work, we developed a single predictive model for building a host-viral interactome based on the identification of structural descriptors from motif-domain interactions of protein complexes deposited in the Protein Data Bank (PDB). The structural descriptors were used for searching, in a database of protein sequences of human and five clinically important viruses; therefore, viral and human proteins sharing a descriptor were predicted as interacting proteins. The analysis of the host-viral interactome allowed to identify a set of new interactions that further explain molecular mechanism associated with viral infections and showed that it was able to capture human proteins already associated to viral infections (human infectome) and non-infectious diseases (human diseasome). The analysis of human proteins targeted by viral proteins in the context of a human interactome showed that their neighbors are enriched in proteins reported with differential expression under infection and disease conditions. It is expected that the findings of this work will contribute to the development of systems biology for infectious diseases, and help guide the rational identification and prioritization of novel drug targets. PMID:23951184
Recombinant dengue 2 virus NS3 protein conserves structural antigenic and immunological properties relevant for dengue vaccine design.

PubMed

Ramírez, Rosa; Falcón, Rosabel; Izquierdo, Alienys; García, Angélica; Alvarez, Mayling; Pérez, Ana Beatriz; Soto, Yudira; Muné, Mayra; da Silva, Emiliana Mandarano; Ortega, Oney; Mohana-Borges, Ronaldo; Guzmán, María G

2014-10-01

The NS3 protein is a multifunctional non-structural protein of flaviviruses implicated in the polyprotein processing. The predominance of cytotoxic T cell lymphocytes epitopes on the NS3 protein suggests a protective role of this protein in limiting virus replication. In this work, we studied the antigenicity and immunogenicity of a recombinant NS3 protein of the Dengue virus 2. The full-length NS3 gene was cloned and expressed as a His-tagged fusion protein in Escherichia coli. The pNS3 protein was purified by two chromatography steps. The recombinant NS3 protein was recognized by anti-protease NS3 polyclonal antibody and anti-DENV2 HMAF by Western Blot. This purified protein was able to stimulate the secretion of high levels of gamma interferon and low levels of interleukin-10 and tumor necrosis factor-α in mice splenocytes, suggesting a predominantly Th-1-type T cell response. Immunized BALB/c mice with the purified NS3 protein showed a strong induction of anti-NS3 IgG antibodies, essentially IgG2b, as determined by ELISA. Immunized mice sera with recombinant NS3 protein showed specific recognition of native dengue protein by Western blotting and immunofluorescence techniques. The successfully purified recombinant protein was able to preserv the structural and antigenic determinants of the native dengue protein. The antigenicity shown by the recombinant NS3 protein suggests its possible inclusion into future DENV vaccine preparations.
Structure Refinement of Protein Low Resolution Models Using the GNEIMO Constrained Dynamics Method

PubMed Central

Park, In-Hee; Gangupomu, Vamshi; Wagner, Jeffrey; Jain, Abhinandan; Vaidehi, Nagara-jan

2012-01-01

The challenge in protein structure prediction using homology modeling is the lack of reliable methods to refine the low resolution homology models. Unconstrained all-atom molecular dynamics (MD) does not serve well for structure refinement due to its limited conformational search. We have developed and tested the constrained MD method, based on the Generalized Newton-Euler Inverse Mass Operator (GNEIMO) algorithm for protein structure refinement. In this method, the high-frequency degrees of freedom are replaced with hard holonomic constraints and a protein is modeled as a collection of rigid body clusters connected by flexible torsional hinges. This allows larger integration time steps and enhances the conformational search space. In this work, we have demonstrated the use of a constraint free GNEIMO method for protein structure refinement that starts from low-resolution decoy sets derived from homology methods. In the eight proteins with three decoys for each, we observed an improvement of ~2 Å in the RMSD to the known experimental structures of these proteins. The GNEIMO method also showed enrichment in the population density of native-like conformations. In addition, we demonstrated structural refinement using a “Freeze and Thaw” clustering scheme with the GNEIMO framework as a viable tool for enhancing localized conformational search. We have derived a robust protocol based on the GNEIMO replica exchange method for protein structure refinement that can be readily extended to other proteins and possibly applicable for high throughput protein structure refinement. PMID:22260550

Physicochemical characterization of native and modified sodium caseinate- Vitamin A complexes.

PubMed

Gupta, Chitra; Arora, Sumit; Syama, M A; Sharma, Apurva

2018-04-01

Native and modified sodium caseinate- Vitamin A complexes {Sodium caseinate- Vit A complex by stirring (NaCas-VA ST), succinylated sodium caseinate- Vit A complex by stirring (SNaCas-VA ST), reassembled sodium caseinate- Vit A complex (RNaCas-VA) and reassembled succinylated sodium caseinate- Vit A complex (RSNaCas-VA)} were prepared and characterized for their physicochemical characteristics e.g. particle size, zeta potential, turbidity analysis and tryptophan intensities which confirmed structural modification of both native (NaCas-VA ST) and modified (SNaCas-VA ST, RNaCas-VA and RSNaCas- VA) proteins upon complex formation with vitamin A. Binding of vitamin A to milk protein reduced the turbidity caused by vitamin A, however, the particle size and zeta potential of milk protein increased after complexation. Microstructure details of NaCas (spray dried) showed uniform spherical structure, however, other milk proteins and milk protein- Vit A complexes (freeze dried) showed broken glass and flaky structures. Tiny particles were observed on the surface of reassembled protein and reassembled protein- Vit A complexes. Binding of vitamin A to milk protein did not have an influence on the electrophoretic mobility and elution profile (RP-HPLC). Copyright © 2018 Elsevier Ltd. All rights reserved.
Combining protein sequence, structure, and dynamics: A novel approach for functional evolution analysis of PAS domain superfamily.

PubMed

Dong, Zheng; Zhou, Hongyu; Tao, Peng

2018-02-01

PAS domains are widespread in archaea, bacteria, and eukaryota, and play important roles in various functions. In this study, we aim to explore functional evolutionary relationship among proteins in the PAS domain superfamily in view of the sequence-structure-dynamics-function relationship. We collected protein sequences and crystal structure data from RCSB Protein Data Bank of the PAS domain superfamily belonging to three biological functions (nucleotide binding, photoreceptor activity, and transferase activity). Protein sequences were aligned and then used to select sequence-conserved residues and build phylogenetic tree. Three-dimensional structure alignment was also applied to obtain structure-conserved residues. The protein dynamics were analyzed using elastic network model (ENM) and validated by molecular dynamics (MD) simulation. The result showed that the proteins with same function could be grouped by sequence similarity, and proteins in different functional groups displayed statistically significant difference in their vibrational patterns. Interestingly, in all three functional groups, conserved amino acid residues identified by sequence and structure conservation analysis generally have a lower fluctuation than other residues. In addition, the fluctuation of conserved residues in each biological function group was strongly correlated with the corresponding biological function. This research suggested a direct connection in which the protein sequences were related to various functions through structural dynamics. This is a new attempt to delineate functional evolution of proteins using the integrated information of sequence, structure, and dynamics. © 2017 The Protein Society.
Rate Kinetics and Molecular Dynamics of the Structural Transitions in Amyloidogenic Proteins

NASA Astrophysics Data System (ADS)

Steckmann, Timothy M.

Amyloid fibril aggregation is associated with several horrific diseases such as Alzheimer's, Creutzfeld-Jacob, diabetes, Parkinson's and others. The process of amyloid aggregation involves forming myriad different metastable intermediate aggregates. Amyloid fibrils are composed of proteins that originate in an innocuous alpha-helix or random-coil structure. The alpha-helices convert their structure to beta-strands that aggregate into beta-sheets, and then into protofibrils, and ultimately into fully formed amyloid fibrils. On the basis of experimental data, I have developed a mathematical model for the kinetics of the reaction pathways and determined rate parameters for peptide secondary structural conversion and aggregation during the entire fibrillogenesis process from random coil to fibrils, including the molecular species that accelerate the conversions. The specific steps of the model and the rate constants that are determined by fitting to experimental data provide insight on the molecular species involved in the fibril formation process. To better understand the molecular basis of the protein structural transitions and aggregation, I report on molecular dynamics (MD) computational studies on the formation of amyloid protofibrillar structures in the small model protein ccbeta, which undergoes many of the structural transitions of the larger, naturally occurring amyloid forming proteins. Two different structural transition processes involving hydrogen bonds are observed for aggregation into fibrils: the breaking of intrachain hydrogen bonds to allow beta-hairpin proteins to straighten, and the subsequent formation of interchain hydrogen bonds during aggregation into amyloid fibrils. For my MD simulations, I found that the temperature dependence of these two different structural transition processes results in the existence of a temperature window that the ccbeta protein experiences during the process of forming protofibrillar structures. Both the mathematical modeling of the kinetics and the MD simulations show that molecular structural heterogeneity is a major factor in the process. The MD simulations also show that intrachain and interchain hydrogen bonds breaking and forming is strongly correlated to the process of amyloid formation.
Efficient protein structure search using indexing methods

PubMed Central

2013-01-01

Understanding functions of proteins is one of the most important challenges in many studies of biological processes. The function of a protein can be predicted by analyzing the functions of structurally similar proteins, thus finding structurally similar proteins accurately and efficiently from a large set of proteins is crucial. A protein structure can be represented as a vector by 3D-Zernike Descriptor (3DZD) which compactly represents the surface shape of the protein tertiary structure. This simplified representation accelerates the searching process. However, computing the similarity of two protein structures is still computationally expensive, thus it is hard to efficiently process many simultaneous requests of structurally similar protein search. This paper proposes indexing techniques which substantially reduce the search time to find structurally similar proteins. In particular, we first exploit two indexing techniques, i.e., iDistance and iKernel, on the 3DZDs. After that, we extend the techniques to further improve the search speed for protein structures. The extended indexing techniques build and utilize an reduced index constructed from the first few attributes of 3DZDs of protein structures. To retrieve top-k similar structures, top-10 × k similar structures are first found using the reduced index, and top-k structures are selected among them. We also modify the indexing techniques to support θ-based nearest neighbor search, which returns data points less than θ to the query point. The results show that both iDistance and iKernel significantly enhance the searching speed. In top-k nearest neighbor search, the searching time is reduced 69.6%, 77%, 77.4% and 87.9%, respectively using iDistance, iKernel, the extended iDistance, and the extended iKernel. In θ-based nearest neighbor serach, the searching time is reduced 80%, 81%, 95.6% and 95.6% using iDistance, iKernel, the extended iDistance, and the extended iKernel, respectively. PMID:23691543
Efficient protein structure search using indexing methods.

PubMed

Kim, Sungchul; Sael, Lee; Yu, Hwanjo

2013-01-01

Understanding functions of proteins is one of the most important challenges in many studies of biological processes. The function of a protein can be predicted by analyzing the functions of structurally similar proteins, thus finding structurally similar proteins accurately and efficiently from a large set of proteins is crucial. A protein structure can be represented as a vector by 3D-Zernike Descriptor (3DZD) which compactly represents the surface shape of the protein tertiary structure. This simplified representation accelerates the searching process. However, computing the similarity of two protein structures is still computationally expensive, thus it is hard to efficiently process many simultaneous requests of structurally similar protein search. This paper proposes indexing techniques which substantially reduce the search time to find structurally similar proteins. In particular, we first exploit two indexing techniques, i.e., iDistance and iKernel, on the 3DZDs. After that, we extend the techniques to further improve the search speed for protein structures. The extended indexing techniques build and utilize an reduced index constructed from the first few attributes of 3DZDs of protein structures. To retrieve top-k similar structures, top-10 × k similar structures are first found using the reduced index, and top-k structures are selected among them. We also modify the indexing techniques to support θ-based nearest neighbor search, which returns data points less than θ to the query point. The results show that both iDistance and iKernel significantly enhance the searching speed. In top-k nearest neighbor search, the searching time is reduced 69.6%, 77%, 77.4% and 87.9%, respectively using iDistance, iKernel, the extended iDistance, and the extended iKernel. In θ-based nearest neighbor serach, the searching time is reduced 80%, 81%, 95.6% and 95.6% using iDistance, iKernel, the extended iDistance, and the extended iKernel, respectively.
FINDSITE-metal: Integrating evolutionary information and machine learning for structure-based metal binding site prediction at the proteome level

PubMed Central

Brylinski, Michal; Skolnick, Jeffrey

2010-01-01

The rapid accumulation of gene sequences, many of which are hypothetical proteins with unknown function, has stimulated the development of accurate computational tools for protein function prediction with evolution/structure-based approaches showing considerable promise. In this paper, we present FINDSITE-metal, a new threading-based method designed specifically to detect metal binding sites in modeled protein structures. Comprehensive benchmarks using different quality protein structures show that weakly homologous protein models provide sufficient structural information for quite accurate annotation by FINDSITE-metal. Combining structure/evolutionary information with machine learning results in highly accurate metal binding annotations; for protein models constructed by TASSER, whose average Cα RMSD from the native structure is 8.9 Å, 59.5% (71.9%) of the best of top five predicted metal locations are within 4 Å (8 Å) from a bound metal in the crystal structure. For most of the targets, multiple metal binding sites are detected with the best predicted binding site at rank 1 and within the top 2 ranks in 65.6% and 83.1% of the cases, respectively. Furthermore, for iron, copper, zinc, calcium and magnesium ions, the binding metal can be predicted with high, typically 70-90%, accuracy. FINDSITE-metal also provides a set of confidence indexes that help assess the reliability of predictions. Finally, we describe the proteome-wide application of FINDSITE-metal that quantifies the metal binding complement of the human proteome. FINDSITE-metal is freely available to the academic community at http://cssb.biology.gatech.edu/findsite-metal/. PMID:21287609
Thermodynamic effects of proline introduction on protein stability.

PubMed

Prajapati, Ravindra Singh; Das, Mili; Sreeramulu, Sridhar; Sirajuddin, Minhajuddin; Srinivasan, Sankaranarayanan; Krishnamurthy, Vaishnavi; Ranjani, Ranganathan; Ramakrishnan, C; Varadarajan, Raghavan

2007-02-01

The amino acid Pro is more rigid than other naturally occurring amino acids and, in proteins, lacks an amide hydrogen. To understand the structural and thermodynamic effects of Pro substitutions, it was introduced at 13 different positions in four different proteins, leucine-isoleucine-valine binding protein, maltose binding protein, ribose binding protein, and thioredoxin. Three of the maltose binding protein mutants were characterized by X-ray crystallography to confirm that no structural changes had occurred upon mutation. In the remaining cases, fluorescence and CD spectroscopy were used to show the absence of structural change. Stabilities of wild type and mutant proteins were characterized by chemical denaturation at neutral pH and by differential scanning calorimetry as a function of pH. The mutants did not show enhanced stability with respect to chemical denaturation at room temperature. However, 6 of the 13 single mutants showed a small but significant increase in the free energy of thermal unfolding in the range of 0.3-2.4 kcal/mol, 2 mutants showed no change, and 5 were destabilized. In five of the six cases, the stabilization was because of reduced entropy of unfolding. However, the magnitude of the reduction in entropy of unfolding was typically several fold larger than the theoretical estimate of -4 cal K(-1) mol(-1) derived from the relative areas in the Ramachandran map accessible to Pro and Ala residues, respectively. Two double mutants were constructed. In both cases, the effects of the single mutations on the free energy of thermal unfolding were nonadditive. Copyright 2006 Wiley-Liss, Inc.
The Classification of Protein Domains.

PubMed

Dawson, Natalie; Sillitoe, Ian; Marsden, Russell L; Orengo, Christine A

2017-01-01

The significant expansion in protein sequence and structure data that we are now witnessing brings with it a pressing need to bring order to the protein world. Such order enables us to gain insights into the evolution of proteins, their function and the extent to which the functional repertoire can vary across the three kingdoms of life. This has lead to the creation of a wide range of protein family classifications that aim to group proteins based upon their evolutionary relationships.In this chapter we discuss the approaches and methods that are frequently used in the classification of proteins, with a specific emphasis on the classification of protein domains. The construction of both domain sequence and domain structure databases is considered and we show how the use of domain family annotations to assign structural and functional information is enhancing our understanding of genomes.
Chimeric microbial rhodopsins for optical activation of Gs-proteins

PubMed Central

Yoshida, Kazuho; Yamashita, Takahiro; Sasaki, Kengo; Inoue, Keiichi; Shichida, Yoshinori; Kandori, Hideki

2017-01-01

We previously showed that the chimeric proteins of microbial rhodopsins, such as light-driven proton pump bacteriorhodopsin (BR) and Gloeobacter rhodopsin (GR) that contain cytoplasmic loops of bovine rhodopsin, are able to activate Gt protein upon light absorption. These facts suggest similar protein structural changes in both the light-driven proton pump and animal rhodopsin. Here we report two trials to engineer chimeric rhodopsins, one for the inserted loop, and another for the microbial rhodopsin template. For the former, we successfully activated Gs protein by light through the incorporation of the cytoplasmic loop of β2-adrenergic receptor (β2AR). For the latter, we did not observe any G-protein activation for the light-driven sodium pump from Indibacter alkaliphilus (IndiR2) or a light-driven chloride pump halorhodopsin from Natronomonas pharaonis (NpHR), whereas the light-driven proton pump GR showed light-dependent G-protein activation. This fact suggests that a helix opening motion is common to G protein coupled receptor (GPCR) and GR, but not to IndiR2 and NpHR. Light-induced difference FTIR spectroscopy revealed similar structural changes between WT and the third loop chimera for each light-driven pump. A helical structural perturbation, which was largest for GR, was further enhanced in the chimera. We conclude that similar structural dynamics that occur on the cytoplasmic side of GPCR are needed to design chimeric microbial rhodopsins. PMID:29362703
Parameter optimization on the convergence surface of path simulations

NASA Astrophysics Data System (ADS)

Chandrasekaran, Srinivas Niranj

Computational treatments of protein conformational changes tend to focus on the trajectories themselves, despite the fact that it is the transition state structures that contain information about the barriers that impose multi-state behavior. PATH is an algorithm that computes a transition pathway between two protein crystal structures, along with the transition state structure, by minimizing the Onsager-Machlup action functional. It is rapid but depends on several unknown input parameters whose range of different values can potentially generate different transition-state structures. Transition-state structures arising from different input parameters cannot be uniquely compared with those generated by other methods. I outline modifications that I have made to the PATH algorithm that estimates these input parameters in a manner that circumvents these difficulties, and describe two complementary tests that validate the transition-state structures found by the PATH algorithm. First, I show that although the PATH algorithm and two other approaches to computing transition pathways produce different low-energy structures connecting the initial and final ground-states with the transition state, all three methods agree closely on the configurations of their transition states. Second, I show that the PATH transition states are close to the saddle points of free-energy surfaces connecting initial and final states generated by replica-exchange Discrete Molecular Dynamics simulations. I show that aromatic side-chain rearrangements create similar potential energy barriers in the transition-state structures identified by PATH for a signaling protein, a contractile protein, and an enzyme. Finally, I observed, but cannot account for, the fact that trajectories obtained for all-atom and Calpha-only simulations identify transition state structures in which the Calpha atoms are in essentially the same positions. The consistency between transition-state structures derived by different algorithms for unrelated protein systems argues that although functionally important protein conformational change trajectories are to a degree stochastic, they nonetheless pass through a well-defined transition state whose detailed structural properties can rapidly be identified using PATH. In the end, I outline the strategies that could enhance the efficiency and applicability of PATH.
Accurate secondary structure prediction and fold recognition for circular dichroism spectroscopy

PubMed Central

Micsonai, András; Wien, Frank; Kernya, Linda; Lee, Young-Ho; Goto, Yuji; Réfrégiers, Matthieu; Kardos, József

2015-01-01

Circular dichroism (CD) spectroscopy is a widely used technique for the study of protein structure. Numerous algorithms have been developed for the estimation of the secondary structure composition from the CD spectra. These methods often fail to provide acceptable results on α/β-mixed or β-structure–rich proteins. The problem arises from the spectral diversity of β-structures, which has hitherto been considered as an intrinsic limitation of the technique. The predictions are less reliable for proteins of unusual β-structures such as membrane proteins, protein aggregates, and amyloid fibrils. Here, we show that the parallel/antiparallel orientation and the twisting of the β-sheets account for the observed spectral diversity. We have developed a method called β-structure selection (BeStSel) for the secondary structure estimation that takes into account the twist of β-structures. This method can reliably distinguish parallel and antiparallel β-sheets and accurately estimates the secondary structure for a broad range of proteins. Moreover, the secondary structure components applied by the method are characteristic to the protein fold, and thus the fold can be predicted to the level of topology in the CATH classification from a single CD spectrum. By constructing a web server, we offer a general tool for a quick and reliable structure analysis using conventional CD or synchrotron radiation CD (SRCD) spectroscopy for the protein science research community. The method is especially useful when X-ray or NMR techniques fail. Using BeStSel on data collected by SRCD spectroscopy, we investigated the structure of amyloid fibrils of various disease-related proteins and peptides. PMID:26038575
Structural Conservation of the Myoviridae Phage Tail Sheath Protein Fold

DOE Office of Scientific and Technical Information (OSTI.GOV)

Aksyuk, Anastasia A.; Kurochkina, Lidia P.; Fokine, Andrei

2012-02-21

Bacteriophage phiKZ is a giant phage that infects Pseudomonas aeruginosa, a human pathogen. The phiKZ virion consists of a 1450 {angstrom} diameter icosahedral head and a 2000 {angstrom}-long contractile tail. The structure of the whole virus was previously reported, showing that its tail organization in the extended state is similar to the well-studied Myovirus bacteriophage T4 tail. The crystal structure of a tail sheath protein fragment of phiKZ was determined to 2.4 {angstrom} resolution. Furthermore, crystal structures of two prophage tail sheath proteins were determined to 1.9 and 3.3 {angstrom} resolution. Despite low sequence identity between these proteins, all ofmore » these structures have a similar fold. The crystal structure of the phiKZ tail sheath protein has been fitted into cryo-electron-microscopy reconstructions of the extended tail sheath and of a polysheath. The structural rearrangement of the phiKZ tail sheath contraction was found to be similar to that of phage T4.« less
Chemical synthesis and X-ray structure of a heterochiral {D-protein antagonist plus vascular endothelial growth factor} protein complex by racemic crystallography

PubMed Central

Mandal, Kalyaneswar; Uppalapati, Maruti; Ault-Riché, Dana; Kenney, John; Lowitz, Joshua; Sidhu, Sachdev S.; Kent, Stephen B.H.

2012-01-01

Total chemical synthesis was used to prepare the mirror image (D-protein) form of the angiogenic protein vascular endothelial growth factor (VEGF-A). Phage display against D-VEGF-A was used to screen designed libraries based on a unique small protein scaffold in order to identify a high affinity ligand. Chemically synthesized D- and L- forms of the protein ligand showed reciprocal chiral specificity in surface plasmon resonance binding experiments: The L-protein ligand bound only to D-VEGF-A, whereas the D-protein ligand bound only to L-VEGF-A. The D-protein ligand, but not the L-protein ligand, inhibited the binding of natural VEGF165 to the VEGFR1 receptor. Racemic protein crystallography was used to determine the high resolution X-ray structure of the heterochiral complex consisting of {D-protein antagonist + L-protein form ofVEGF-A}. Crystallization of a racemic mixture of these synthetic proteins in appropriate stoichiometry gave a racemic protein complex of more than 73 kDa containing six synthetic protein molecules. The structure of the complex was determined to a resolution of 1.6 Å. Detailed analysis of the interaction between the D-protein antagonist and the VEGF-A protein molecule showed that the binding interface comprised a contact surface area of approximately 800 Å2 in accord with our design objectives, and that the D-protein antagonist binds to the same region of VEGF-A that interacts with VEGFR1-domain 2. PMID:22927390
Infrared light-induced protein crystallization. Structuring of protein interfacial water and periodic self-assembly

NASA Astrophysics Data System (ADS)

Kowacz, Magdalena; Marchel, Mateusz; Juknaité, Lina; Esperança, José M. S. S.; Romão, Maria João; Carvalho, Ana Luísa; Rebelo, Luís Paulo N.

2017-01-01

We show that a physical trigger, a non-ionizing infrared (IR) radiation at wavelengths strongly absorbed by liquid water, can be used to induce and kinetically control protein (periodic) self-assembly in solution. This phenomenon is explained by considering the effect of IR light on the structuring of protein interfacial water. Our results indicate that the IR radiation can promote enhanced mutual correlations of water molecules in the protein hydration shell. We report on the radiation-induced increase in both the strength and cooperativeness of H-bonds. The presence of a structured dipolar hydration layer can lead to attractive interactions between like-charged biomacromolecules in solution (and crystal nucleation events). Furthermore, our study suggests that enveloping the protein within a layer of structured solvent (an effect enhanced by IR light) can prevent the protein non-specific aggregation favoring periodic self-assembly. Recognizing the ability to affect protein-water interactions by means of IR radiation may have important implications for biological and bio-inspired systems.
Increasing Sequence Diversity with Flexible Backbone Protein Design: The Complete Redesign of a Protein Hydrophobic Core

DOE Office of Scientific and Technical Information (OSTI.GOV)

Murphy, Grant S.; Mills, Jeffrey L.; Miley, Michael J.

2015-10-15

Protein design tests our understanding of protein stability and structure. Successful design methods should allow the exploration of sequence space not found in nature. However, when redesigning naturally occurring protein structures, most fixed backbone design algorithms return amino acid sequences that share strong sequence identity with wild-type sequences, especially in the protein core. This behavior places a restriction on functional space that can be explored and is not consistent with observations from nature, where sequences of low identity have similar structures. Here, we allow backbone flexibility during design to mutate every position in the core (38 residues) of a four-helixmore » bundle protein. Only small perturbations to the backbone, 12 {angstrom}, were needed to entirely mutate the core. The redesigned protein, DRNN, is exceptionally stable (melting point >140C). An NMR and X-ray crystal structure show that the side chains and backbone were accurately modeled (all-atom RMSD = 1.3 {angstrom}).« less
Protein Structure Validation and Refinement Using Amide Proton Chemical Shifts Derived from Quantum Mechanics

PubMed Central

Christensen, Anders S.; Linnet, Troels E.; Borg, Mikael; Boomsma, Wouter; Lindorff-Larsen, Kresten; Hamelryck, Thomas; Jensen, Jan H.

2013-01-01

We present the ProCS method for the rapid and accurate prediction of protein backbone amide proton chemical shifts - sensitive probes of the geometry of key hydrogen bonds that determine protein structure. ProCS is parameterized against quantum mechanical (QM) calculations and reproduces high level QM results obtained for a small protein with an RMSD of 0.25 ppm (r = 0.94). ProCS is interfaced with the PHAISTOS protein simulation program and is used to infer statistical protein ensembles that reflect experimentally measured amide proton chemical shift values. Such chemical shift-based structural refinements, starting from high-resolution X-ray structures of Protein G, ubiquitin, and SMN Tudor Domain, result in average chemical shifts, hydrogen bond geometries, and trans-hydrogen bond (h3 JNC') spin-spin coupling constants that are in excellent agreement with experiment. We show that the structural sensitivity of the QM-based amide proton chemical shift predictions is needed to obtain this agreement. The ProCS method thus offers a powerful new tool for refining the structures of hydrogen bonding networks to high accuracy with many potential applications such as protein flexibility in ligand binding. PMID:24391900
Evolution of strigolactone receptors by gradual neo-functionalization of KAI2 paralogues.

PubMed

Bythell-Douglas, Rohan; Rothfels, Carl J; Stevenson, Dennis W D; Graham, Sean W; Wong, Gane Ka-Shu; Nelson, David C; Bennett, Tom

2017-06-29

Strigolactones (SLs) are a class of plant hormones that control many aspects of plant growth. The SL signalling mechanism is homologous to that of karrikins (KARs), smoke-derived compounds that stimulate seed germination. In angiosperms, the SL receptor is an α/β-hydrolase known as DWARF14 (D14); its close homologue, KARRIKIN INSENSITIVE2 (KAI2), functions as a KAR receptor and likely recognizes an uncharacterized, endogenous signal ('KL'). Previous phylogenetic analyses have suggested that the KAI2 lineage is ancestral in land plants, and that canonical D14-type SL receptors only arose in seed plants; this is paradoxical, however, as non-vascular plants synthesize and respond to SLs. We have used a combination of phylogenetic and structural approaches to re-assess the evolution of the D14/KAI2 family in land plants. We analysed 339 members of the D14/KAI2 family from land plants and charophyte algae. Our phylogenetic analyses show that the divergence between the eu-KAI2 lineage and the DDK (D14/DLK2/KAI2) lineage that includes D14 occurred very early in land plant evolution. We show that eu-KAI2 proteins are highly conserved, and have unique features not found in DDK proteins. Conversely, we show that DDK proteins show considerable sequence and structural variation to each other, and lack clearly definable characteristics. We use homology modelling to show that the earliest members of the DDK lineage structurally resemble KAI2 and that SL receptors in non-seed plants likely do not have D14-like structure. We also show that certain groups of DDK proteins lack the otherwise conserved MORE AXILLARY GROWTH2 (MAX2) interface, and may thus function independently of MAX2, which we show is highly conserved throughout land plant evolution. Our results suggest that D14-like structure is not required for SL perception, and that SL perception has relatively relaxed structural requirements compared to KAI2-mediated signalling. We suggest that SL perception gradually evolved by neo-functionalization within the DDK lineage, and that the transition from KAI2-like to D14-like protein may have been driven by interactions with protein partners, rather than being required for SL perception per se.
Structures and Free Energy Landscapes of the A53T Mutant-Type α-Synuclein Protein and Impact of A53T Mutation on the Structures of the Wild-Type α-Synuclein Protein with Dynamics

PubMed Central

2013-01-01

The A53T genetic missense mutation of the wild-type α-synuclein (αS) protein was initially identified in Greek and Italian families with familial Parkinson’s disease. Detailed understanding of the structures and the changes induced in the wild-type αS structure by the A53T mutation, as well as establishing the direct relationships between the rapid conformational changes and free energy landscapes of these intrinsically disordered fibrillogenic proteins, helps to enhance our fundamental knowledge and to gain insights into the pathogenic mechanism of Parkinson’s disease. We employed extensive parallel tempering molecular dynamics simulations along with thermodynamic calculations to determine the secondary and tertiary structural properties as well as the conformational free energy surfaces of the wild-type and A53T mutant-type αS proteins in an aqueous solution medium using both implicit and explicit water models. The confined aqueous volume effect in the simulations of disordered proteins using an explicit model for water is addressed for a model disordered protein. We also assessed the stabilities of the residual secondary structure component interconversions in αS based on free energy calculations at the atomic level with dynamics using our recently developed theoretical strategy. To the best of our knowledge, this study presents the first detailed comparison of the structural properties linked directly to the conformational free energy landscapes of the monomeric wild-type and A53T mutant-type α-synuclein proteins in an aqueous solution environment. Results demonstrate that the β-sheet structure is significantly more altered than the helical structure upon A53T mutation of the monomeric wild-type αS protein in aqueous solution. The β-sheet content close to the mutation site in the N-terminal region is more abundant while the non-amyloid-β component (NAC) and C-terminal regions show a decrease in β-sheet abundance upon A53T mutation. Obtained results utilizing our new theoretical strategy show that the residual secondary structure conversion stabilities resulting in α-helix formation are not significantly affected by the mutation. Interestingly, the residual secondary structure conversion stabilities show that secondary structure conversions resulting in β-sheet formation are influenced by the A53T mutation and the most stable residual transition yielding β-sheet occurs directly from the coil structure. Long-range interactions detected between the NAC region and the N- or C-terminal regions of the wild-type αS disappear upon A53T mutation. The A53T mutant-type αS structures are thermodynamically more stable than those of the wild-type αS protein structures in aqueous solution. Overall, the higher propensity of the A53T mutant-type αS protein to aggregate in comparison to the wild-type αS protein is related to the increased β-sheet formation and lack of strong intramolecular long-range interactions in the N-terminal region in comparison to its wild-type form. The specific residual secondary structure component stabilities reported herein provide information helpful for designing and synthesizing small organic molecules that can block the β-sheet forming residues, which are reactive toward aggregation. PMID:23607785
Structures and free energy landscapes of the A53T mutant-type α-synuclein protein and impact of A53T mutation on the structures of the wild-type α-synuclein protein with dynamics.

PubMed

Coskuner, Orkid; Wise-Scira, Olivia

2013-07-17

The A53T genetic missense mutation of the wild-type α-synuclein (αS) protein was initially identified in Greek and Italian families with familial Parkinson's disease. Detailed understanding of the structures and the changes induced in the wild-type αS structure by the A53T mutation, as well as establishing the direct relationships between the rapid conformational changes and free energy landscapes of these intrinsically disordered fibrillogenic proteins, helps to enhance our fundamental knowledge and to gain insights into the pathogenic mechanism of Parkinson's disease. We employed extensive parallel tempering molecular dynamics simulations along with thermodynamic calculations to determine the secondary and tertiary structural properties as well as the conformational free energy surfaces of the wild-type and A53T mutant-type αS proteins in an aqueous solution medium using both implicit and explicit water models. The confined aqueous volume effect in the simulations of disordered proteins using an explicit model for water is addressed for a model disordered protein. We also assessed the stabilities of the residual secondary structure component interconversions in αS based on free energy calculations at the atomic level with dynamics using our recently developed theoretical strategy. To the best of our knowledge, this study presents the first detailed comparison of the structural properties linked directly to the conformational free energy landscapes of the monomeric wild-type and A53T mutant-type α-synuclein proteins in an aqueous solution environment. Results demonstrate that the β-sheet structure is significantly more altered than the helical structure upon A53T mutation of the monomeric wild-type αS protein in aqueous solution. The β-sheet content close to the mutation site in the N-terminal region is more abundant while the non-amyloid-β component (NAC) and C-terminal regions show a decrease in β-sheet abundance upon A53T mutation. Obtained results utilizing our new theoretical strategy show that the residual secondary structure conversion stabilities resulting in α-helix formation are not significantly affected by the mutation. Interestingly, the residual secondary structure conversion stabilities show that secondary structure conversions resulting in β-sheet formation are influenced by the A53T mutation and the most stable residual transition yielding β-sheet occurs directly from the coil structure. Long-range interactions detected between the NAC region and the N- or C-terminal regions of the wild-type αS disappear upon A53T mutation. The A53T mutant-type αS structures are thermodynamically more stable than those of the wild-type αS protein structures in aqueous solution. Overall, the higher propensity of the A53T mutant-type αS protein to aggregate in comparison to the wild-type αS protein is related to the increased β-sheet formation and lack of strong intramolecular long-range interactions in the N-terminal region in comparison to its wild-type form. The specific residual secondary structure component stabilities reported herein provide information helpful for designing and synthesizing small organic molecules that can block the β-sheet forming residues, which are reactive toward aggregation.
De Novo Proteins with Life-Sustaining Functions Are Structurally Dynamic.

PubMed

Murphy, Grant S; Greisman, Jack B; Hecht, Michael H

2016-01-29

Designing and producing novel proteins that fold into stable structures and provide essential biological functions are key goals in synthetic biology. In initial steps toward achieving these goals, we constructed a combinatorial library of de novo proteins designed to fold into 4-helix bundles. As described previously, screening this library for sequences that function in vivo to rescue conditionally lethal mutants of Escherichia coli (auxotrophs) yielded several de novo sequences, termed SynRescue proteins, which rescued four different E. coli auxotrophs. In an effort to understand the structural requirements necessary for auxotroph rescue, we investigated the biophysical properties of the SynRescue proteins, using both computational and experimental approaches. Results from circular dichroism, size-exclusion chromatography, and NMR demonstrate that the SynRescue proteins are α-helical and relatively stable. Surprisingly, however, they do not form well-ordered structures. Instead, they form dynamic structures that fluctuate between monomeric and dimeric states. These findings show that a well-ordered structure is not a prerequisite for life-sustaining functions, and suggests that dynamic structures may have been important in the early evolution of protein function. Copyright © 2015 Elsevier Ltd. All rights reserved.

Structural features that predict real-value fluctuations of globular proteins.

PubMed

Jamroz, Michal; Kolinski, Andrzej; Kihara, Daisuke

2012-05-01

It is crucial to consider dynamics for understanding the biological function of proteins. We used a large number of molecular dynamics (MD) trajectories of nonhomologous proteins as references and examined static structural features of proteins that are most relevant to fluctuations. We examined correlation of individual structural features with fluctuations and further investigated effective combinations of features for predicting the real value of residue fluctuations using the support vector regression (SVR). It was found that some structural features have higher correlation than crystallographic B-factors with fluctuations observed in MD trajectories. Moreover, SVR that uses combinations of static structural features showed accurate prediction of fluctuations with an average Pearson's correlation coefficient of 0.669 and a root mean square error of 1.04 Å. This correlation coefficient is higher than the one observed in predictions by the Gaussian network model (GNM). An advantage of the developed method over the GNMs is that the former predicts the real value of fluctuation. The results help improve our understanding of relationships between protein structure and fluctuation. Furthermore, the developed method provides a convienient practial way to predict fluctuations of proteins using easily computed static structural features of proteins. Copyright © 2012 Wiley Periodicals, Inc.
Structural features that predict real-value fluctuations of globular proteins

PubMed Central

Jamroz, Michal; Kolinski, Andrzej; Kihara, Daisuke

2012-01-01

It is crucial to consider dynamics for understanding the biological function of proteins. We used a large number of molecular dynamics trajectories of non-homologous proteins as references and examined static structural features of proteins that are most relevant to fluctuations. We examined correlation of individual structural features with fluctuations and further investigated effective combinations of features for predicting the real-value of residue fluctuations using the support vector regression. It was found that some structural features have higher correlation than crystallographic B-factors with fluctuations observed in molecular dynamics trajectories. Moreover, support vector regression that uses combinations of static structural features showed accurate prediction of fluctuations with an average Pearson’s correlation coefficient of 0.669 and a root mean square error of 1.04 Å. This correlation coefficient is higher than the one observed for the prediction by the Gaussian network model. An advantage of the developed method over the Gaussian network models is that the former predicts the real-value of fluctuation. The results help improve our understanding of relationships between protein structure and fluctuation. Furthermore, the developed method provides a convienient practial way to predict fluctuations of proteins using easily computed static structural features of proteins. PMID:22328193
Fast iodide-SAD phasing for high-throughput membrane protein structure determination.

PubMed

Melnikov, Igor; Polovinkin, Vitaly; Kovalev, Kirill; Gushchin, Ivan; Shevtsov, Mikhail; Shevchenko, Vitaly; Mishin, Alexey; Alekseev, Alexey; Rodriguez-Valera, Francisco; Borshchevskiy, Valentin; Cherezov, Vadim; Leonard, Gordon A; Gordeliy, Valentin; Popov, Alexander

2017-05-01

We describe a fast, easy, and potentially universal method for the de novo solution of the crystal structures of membrane proteins via iodide-single-wavelength anomalous diffraction (I-SAD). The potential universality of the method is based on a common feature of membrane proteins-the availability at the hydrophobic-hydrophilic interface of positively charged amino acid residues with which iodide strongly interacts. We demonstrate the solution using I-SAD of four crystal structures representing different classes of membrane proteins, including a human G protein-coupled receptor (GPCR), and we show that I-SAD can be applied using data collection strategies based on either standard or serial x-ray crystallography techniques.
Structure of a designed protein cage that self-assembles into a highly porous cube

DOE PAGES

Lai, Yen-Ting; Reading, Eamonn; Hura, Greg L.; ...

2014-11-10

Natural proteins can be versatile building blocks for multimeric, self-assembling structures. Yet, creating protein-based assemblies with specific geometries and chemical properties remains challenging. Highly porous materials represent particularly interesting targets for designed assembly. Here we utilize a strategy of fusing two natural protein oligomers using a continuous alpha-helical linker to design a novel protein that self assembles into a 750 kDa, 225 Å diameter, cube-shaped cage with large openings into a 130 Å diameter inner cavity. A crystal structure of the cage showed atomic level agreement with the designed model, while electron microscopy, native mass spectrometry, and small angle x-raymore » scattering revealed alternate assembly forms in solution. These studies show that accurate design of large porous assemblies with specific shapes is feasible, while further specificity improvements will likely require limiting flexibility to select against alternative forms. Finally, these results provide a foundation for the design of advanced materials with applications in bionanotechnology, nanomedicine and material sciences.« less
Analysis of Protein Thermostability Enhancing Factors in Industrially Important Thermus Bacteria Species

PubMed Central

Kumwenda, Benjamin; Litthauer, Derek; Bishop, Özlem Tastan; Reva, Oleg

2013-01-01

Elucidation of evolutionary factors that enhance protein thermostability is a critical problem and was the focus of this work on Thermus species. Pairs of orthologous sequences of T. scotoductus SA-01 and T. thermophilus HB27, with the largest negative minimum folding energy (MFE) as predicted by the UNAFold algorithm, were statistically analyzed. Favored substitutions of amino acids residues and their properties were determined. Substitutions were analyzed in modeled protein structures to determine their locations and contribution to energy differences using PyMOL and FoldX programs respectively. Dominant trends in amino acid substitutions consistent with differences in thermostability between orthologous sequences were observed. T. thermophilus thermophilic proteins showed an increase in non-polar, tiny, and charged amino acids. An abundance of alanine substituted by serine and threonine, as well as arginine substituted by glutamine and lysine was observed in T. thermophilus HB27. Structural comparison showed that stabilizing mutations occurred on surfaces and loops in protein structures. PMID:24023508
Structural perturbations on huntingtin N17 domain during its folding on 2D-nanomaterials

NASA Astrophysics Data System (ADS)

Zhang, Leili; Feng, Mei; Zhou, Ruhong; Luan, Binquan

2017-09-01

A globular protein’s folded structure in its physiological environment is largely determined by its amino acid sequence. Recently, newly discovered transformer proteins as well as intrinsically disordered proteins may adopt the folding-upon-binding mechanism where their secondary structures are highly dependent on their binding partners. Due to the various applications of nanomaterials in biological sensors and potential wearable devices, it is important to discover possible conformational changes of proteins on nanomaterials. Here, through molecular dynamics simulations, we show that the first 17 residues of the huntingtin protein (HTT-N17) exhibit appreciable differences during its folding on 2D-nanomaterials, such as graphene and MoS2 nanosheets. Namely, the protein is disordered on the graphene surface but is helical on the MoS2 surface. Despite that the amphiphilic environment at the nanosheet-water interface promotes the folding of the amphipathic proteins (such as HTT-N17), competitions between protein-nanosheet and intra-protein interactions yield very different protein conformations. Therefore, as engineered binding partners, nanomaterials might significantly affect the structures of adsorbed proteins.
Construction of ontology augmented networks for protein complex prediction.

PubMed

Zhang, Yijia; Lin, Hongfei; Yang, Zhihao; Wang, Jian

2013-01-01

Protein complexes are of great importance in understanding the principles of cellular organization and function. The increase in available protein-protein interaction data, gene ontology and other resources make it possible to develop computational methods for protein complex prediction. Most existing methods focus mainly on the topological structure of protein-protein interaction networks, and largely ignore the gene ontology annotation information. In this article, we constructed ontology augmented networks with protein-protein interaction data and gene ontology, which effectively unified the topological structure of protein-protein interaction networks and the similarity of gene ontology annotations into unified distance measures. After constructing ontology augmented networks, a novel method (clustering based on ontology augmented networks) was proposed to predict protein complexes, which was capable of taking into account the topological structure of the protein-protein interaction network, as well as the similarity of gene ontology annotations. Our method was applied to two different yeast protein-protein interaction datasets and predicted many well-known complexes. The experimental results showed that (i) ontology augmented networks and the unified distance measure can effectively combine the structure closeness and gene ontology annotation similarity; (ii) our method is valuable in predicting protein complexes and has higher F1 and accuracy compared to other competing methods.
NMR in structural genomics to increase structural coverage of the protein universe: Delivered by Prof. Kurt Wüthrich on 7 July 2013 at the 38th FEBS Congress in St. Petersburg, Russia.

PubMed

Serrano, Pedro; Dutta, Samit K; Proudfoot, Andrew; Mohanty, Biswaranjan; Susac, Lukas; Martin, Bryan; Geralt, Michael; Jaroszewski, Lukasz; Godzik, Adam; Elsliger, Marc; Wilson, Ian A; Wüthrich, Kurt

2016-11-01

For more than a decade, the Joint Center for Structural Genomics (JCSG; www.jcsg.org) worked toward increased three-dimensional structure coverage of the protein universe. This coordinated quest was one of the main goals of the four high-throughput (HT) structure determination centers of the Protein Structure Initiative (PSI; www.nigms.nih.gov/Research/specificareas/PSI). To achieve the goals of the PSI, the JCSG made use of the complementarity of structure determination by X-ray crystallography and nuclear magnetic resonance (NMR) spectroscopy to increase and diversify the range of targets entering the HT structure determination pipeline. The overall strategy, for both techniques, was to determine atomic resolution structures for representatives of large protein families, as defined by the Pfam database, which had no structural coverage and could make significant contributions to biological and biomedical research. Furthermore, the experimental structures could be leveraged by homology modeling to further expand the structural coverage of the protein universe and increase biological insights. Here, we describe what could be achieved by this structural genomics approach, using as an illustration the contributions from 20 NMR structure determinations out of a total of 98 JCSG NMR structures, which were selected because they are the first three-dimensional structure representations of the respective Pfam protein families. The information from this small sample is representative for the overall results from crystal and NMR structure determination in the JCSG. There are five new folds, which were classified as domains of unknown functions (DUF), three of the proteins could be functionally annotated based on three-dimensional structure similarity with previously characterized proteins, and 12 proteins showed only limited similarity with previous deposits in the Protein Data Bank (PDB) and were classified as DUFs. © 2016 Federation of European Biochemical Societies.
Structure of the SPRY domain of the human RNA helicase DDX1, a putative interaction platform within a DEAD-box protein

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kellner, Julian N.; Meinhart, Anton, E-mail: anton.meinhart@mpimf-heidelberg.mpg.de

The structure of the SPRY domain of the human RNA helicase DDX1 was determined at 2.0 Å resolution. The SPRY domain provides a putative protein–protein interaction platform within DDX1 that differs from other SPRY domains in its structure and conserved regions. The human RNA helicase DDX1 in the DEAD-box family plays an important role in RNA processing and has been associated with HIV-1 replication and tumour progression. Whereas previously described DEAD-box proteins have a structurally conserved core, DDX1 shows a unique structural feature: a large SPRY-domain insertion in its RecA-like consensus fold. SPRY domains are known to function as protein–proteinmore » interaction platforms. Here, the crystal structure of the SPRY domain of human DDX1 (hDSPRY) is reported at 2.0 Å resolution. The structure reveals two layers of concave, antiparallel β-sheets that stack onto each other and a third β-sheet beneath the β-sandwich. A comparison with SPRY-domain structures from other eukaryotic proteins showed that the general β-sandwich fold is conserved; however, differences were detected in the loop regions, which were identified in other SPRY domains to be essential for interaction with cognate partners. In contrast, in hDSPRY these loop regions are not strictly conserved across species. Interestingly, though, a conserved patch of positive surface charge is found that may replace the connecting loops as a protein–protein interaction surface. The data presented here comprise the first structural information on DDX1 and provide insights into the unique domain architecture of this DEAD-box protein. By providing the structure of a putative interaction domain of DDX1, this work will serve as a basis for further studies of the interaction network within the hetero-oligomeric complexes of DDX1 and of its recruitment to the HIV-1 Rev protein as a viral replication factor.« less
Structural and functional studies of a 50 kDa antigenic protein from Salmonella enterica serovar Typhi.

PubMed

Choong, Yee Siew; Lim, Theam Soon; Chew, Ai Lan; Aziah, Ismail; Ismail, Asma

2011-04-01

The high typhoid incidence rate in developing and under-developed countries emphasizes the need for a rapid, affordable and accessible diagnostic test for effective therapy and disease management. TYPHIDOT®, a rapid dot enzyme immunoassay test for typhoid, was developed from the discovery of a ∼50 kDa protein specific for Salmonella enterica serovar Typhi. However, the structure of this antigen remains unknown till today. Studies on the structure of this antigen are important to elucidate its function, which will in turn increase the efficiency of the development and improvement of the typhoid detection test. This paper described the predictive structure and function of the antigenically specific protein. The homology modeling approach was employed to construct the three-dimensional structure of the antigen. The built structure possesses the features of TolC-like outer membrane protein. Molecular docking simulation was also performed to further probe the functionality of the antigen. Docking results showed that hexamminecobalt, Co(NH(3))(6)(3+), as an inhibitor of TolC protein, formed favorable hydrogen bonds with D368 and D371 of the antigen. The single point (D368A, D371A) and double point (D368A and D371A) mutations of the antigen showed a decrease (single point mutation) and loss (double point mutations) of binding affinity towards hexamminecobalt. The architecture features of the built model and the docking simulation reinforced and supported that this antigen is indeed the variant of outer membrane protein, TolC. As channel proteins are important for the virulence and survival of bacteria, therefore this ∼50 kDa channel protein is a good specific target for typhoid detection test. Copyright © 2011 Elsevier Inc. All rights reserved.
LucY: A Versatile New Fluorescent Reporter Protein

PubMed Central

Auldridge, Michele E.; Franz, Laura P.; Bingman, Craig A.; Yennamalli, Ragothaman M.; Phillips, George N.; Mead, David; Steinmetz, Eric J.

2015-01-01

We report on the discovery, isolation, and use of a novel yellow fluorescent protein. Lucigen Yellow (LucY) binds one FAD molecule within its core, thus shielding it from water and maintaining its structure so that fluorescence is 10-fold higher than freely soluble FAD. LucY displays excitation and emission spectra characteristic of FAD, with 3 excitation peaks at 276nm, 377nm, and 460nm and a single emission peak at 530nm. These excitation and emission maxima provide the large Stokes shift beneficial to fluorescence experimentation. LucY belongs to the MurB family of UDP-N-acetylenolpyruvylglucosamine reductases. The high resolution crystal structure shows that in contrast to other structurally resolved MurB enzymes, LucY does not contain a potentially quenching aromatic residue near the FAD isoalloxazine ring, which may explain its increased fluorescence over related proteins. Using E. coli as a system in which to develop LucY as a reporter, we show that it is amenable to circular permutation and use as a reporter of protein-protein interaction. Fragmentation between its distinct domains renders LucY non-fluorescent, but fluorescence can be partially restored by fusion of the fragments to interacting protein domains. Thus, LucY may find application in Protein-fragment Complementation Assays for evaluating protein-protein interactions. PMID:25906065
LucY: A Versatile New Fluorescent Reporter Protein.

PubMed

Auldridge, Michele E; Cao, Hongnan; Sen, Saurabh; Franz, Laura P; Bingman, Craig A; Yennamalli, Ragothaman M; Phillips, George N; Mead, David; Steinmetz, Eric J

2015-01-01

We report on the discovery, isolation, and use of a novel yellow fluorescent protein. Lucigen Yellow (LucY) binds one FAD molecule within its core, thus shielding it from water and maintaining its structure so that fluorescence is 10-fold higher than freely soluble FAD. LucY displays excitation and emission spectra characteristic of FAD, with 3 excitation peaks at 276 nm, 377 nm, and 460 nm and a single emission peak at 530 nm. These excitation and emission maxima provide the large Stokes shift beneficial to fluorescence experimentation. LucY belongs to the MurB family of UDP-N-acetylenolpyruvylglucosamine reductases. The high resolution crystal structure shows that in contrast to other structurally resolved MurB enzymes, LucY does not contain a potentially quenching aromatic residue near the FAD isoalloxazine ring, which may explain its increased fluorescence over related proteins. Using E. coli as a system in which to develop LucY as a reporter, we show that it is amenable to circular permutation and use as a reporter of protein-protein interaction. Fragmentation between its distinct domains renders LucY non-fluorescent, but fluorescence can be partially restored by fusion of the fragments to interacting protein domains. Thus, LucY may find application in Protein-fragment Complementation Assays for evaluating protein-protein interactions.
LucY: A versatile new fluorescent reporter protein

DOE PAGES

Auldridge, Michele E.; Cao, Hongnan; Sen, Saurabh; ...

2015-04-23

We report on the discovery, isolation, and use of a novel yellow fluorescent protein. Lucigen Yellow (LucY) binds one FAD molecule within its core, thus shielding it from water and maintaining its structure so that fluorescence is 10-fold higher than freely soluble FAD. LucY displays excitation and emission spectra characteristic of FAD, with 3 excitation peaks at 276nm, 377nm, and 460nm and a single emission peak at 530nm. These excitation and emission maxima provide the large Stokes shift beneficial to fluorescence experimentation. LucY belongs to the MurB family of UDP-N-acetylenolpyruvylglucosamine reductases. The high resolution crystal structure shows that in contrastmore » to other structurally resolved MurB enzymes, LucY does not contain a potentially quenching aromatic residue near the FAD isoalloxazine ring, which may explain its increased fluorescence over related proteins. Using E. coli as a system in which to develop LucY as a reporter, we show that it is amenable to circular permutation and use as a reporter of protein-protein interaction. Fragmentation between its distinct domains renders LucY non-fluorescent, but fluorescence can be partially restored by fusion of the fragments to interacting protein domains. Thus, LucY may find application in Protein-fragment Complementation Assays for evaluating protein-protein interactions.« less
Salivary proline-rich proteins and gluten: Do structural similarities suggest a role in celiac disease?

PubMed

Tian, Na; Messana, Irene; Leffler, Daniel A; Kelly, Ciaran P; Hansen, Joshua; Cabras, Tiziana; D'Alessandro, Alfredo; Schuppan, Detlef; Castagnola, Massimo; Helmerhorst, Eva J

2015-10-01

Gluten proteins, the culprits in celiac disease (CD), show striking similarities in primary structure with human salivary proline-rich proteins (PRPs). Both are enriched in proline and glutamine residues that often occur consecutively in their sequences. We investigated potential differences in the spectrum of salivary PRPs in health and CD. Stimulated salivary secretions were collected from CD patients, patients with refractory CD, patients with gastrointestinal complaints but no CD, and healthy controls. PRP isoforms/peptides were characterized by anionic and SDS-PAGE, PCR, and LC-ESI-MS. The gene frequencies of the acidic PRP isoforms PIF, Db, Pa, PRP1, and PRP2 did not differ between groups. At the protein level, PRPs peptides showed minor group differences, but these could not differentiate the CD and/or refractory CDs groups from the controls. This extensive study established that salivary PRPs, despite similarity to gluten proteins, show no apparent correlation with CD and thus will not serve as diagnostic markers for the disease. The structural basis for the tolerance to the gluten-like PRP proteins in CD is worthy of further exploration and may lead to the development of gluten-like analogs lacking immunogenicity that could be used therapeutically. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
TOUCHSTONE II: a new approach to ab initio protein structure prediction.

PubMed

Zhang, Yang; Kolinski, Andrzej; Skolnick, Jeffrey

2003-08-01

We have developed a new combined approach for ab initio protein structure prediction. The protein conformation is described as a lattice chain connecting C(alpha) atoms, with attached C(beta) atoms and side-chain centers of mass. The model force field includes various short-range and long-range knowledge-based potentials derived from a statistical analysis of the regularities of protein structures. The combination of these energy terms is optimized through the maximization of correlation for 30 x 60,000 decoys between the root mean square deviation (RMSD) to native and energies, as well as the energy gap between native and the decoy ensemble. To accelerate the conformational search, a newly developed parallel hyperbolic sampling algorithm with a composite movement set is used in the Monte Carlo simulation processes. We exploit this strategy to successfully fold 41/100 small proteins (36 approximately 120 residues) with predicted structures having a RMSD from native below 6.5 A in the top five cluster centroids. To fold larger-size proteins as well as to improve the folding yield of small proteins, we incorporate into the basic force field side-chain contact predictions from our threading program PROSPECTOR where homologous proteins were excluded from the data base. With these threading-based restraints, the program can fold 83/125 test proteins (36 approximately 174 residues) with structures having a RMSD to native below 6.5 A in the top five cluster centroids. This shows the significant improvement of folding by using predicted tertiary restraints, especially when the accuracy of side-chain contact prediction is >20%. For native fold selection, we introduce quantities dependent on the cluster density and the combination of energy and free energy, which show a higher discriminative power to select the native structure than the previously used cluster energy or cluster size, and which can be used in native structure identification in blind simulations. These procedures are readily automated and are being implemented on a genomic scale.
Similarity of a 16.5kDa tegumental protein of the human liver fluke Opisthorchis viverrini to nematode cytoplasmic motility protein.

PubMed

Labbunruang, Nipawan; Phadungsil, Wansika; Tesana, Smarn; Smooker, Peter M; Grams, Rudi

2016-05-01

Opisthorchis viverrini is the causative agent of human opisthorchiasis in Thailand and long lasting infection with the parasite has been correlated with the development of cholangiocarcinoma. In this work we have molecularly characterized the first member of a protein family carrying two DM9 repeats in this parasite (OvDM9-1). InterPro and other protein family databases describe the DM9 repeat as a protein domain of unknown function that has been first noted in Drosophila melanogaster. Two paralogous proteins have been partially characterized in the genus Fasciola, Fasciola hepatica TP16.5, a novel tegumental antigen in human fascioliasis and, recently F. gigantica DM9-1, a parenchymal protein with structural similarity to nematode cytoplasmic motility protein (MFP2). In this study, we show further evidence that this family of trematode proteins is related to MFP2 in sequence and structure. Soluble recombinant OvDM9-1 was used for structural analyses and for production of specific antisera. The native protein was detected in soluble and insoluble crude worm extracts and in seemingly various oligomeric forms in the latter. The potential for oligomerization was supported by cross-linking experiments of recombinant OvDM9-1. Structure prediction suggested a β-rich secondary structure of the protein and this was supported by a circular dichroism analysis. Molecular modeling in Phyre2 identified both MFP2 domains as distant homologs of OvDM9-1. The protein was located in tegumental type tissue and the cecal epithelium in the mature parasite. Recombinant OvDM9-1 was used as target in indirect ELISA but sera from infected hamsters showed only marginal reactivity towards it. It is proposed that OvDM9-1 and other members of this protein family have a role in cellular transport through functions on the cytoskeleton. Copyright © 2016 Elsevier B.V. All rights reserved.
Ligand Binding Site Detection by Local Structure Alignment and Its Performance Complementarity

PubMed Central

Lee, Hui Sun; Im, Wonpil

2013-01-01

Accurate determination of potential ligand binding sites (BS) is a key step for protein function characterization and structure-based drug design. Despite promising results of template-based BS prediction methods using global structure alignment (GSA), there is a room to improve the performance by properly incorporating local structure alignment (LSA) because BS are local structures and often similar for proteins with dissimilar global folds. We present a template-based ligand BS prediction method using G-LoSA, our LSA tool. A large benchmark set validation shows that G-LoSA predicts drug-like ligands’ positions in single-chain protein targets more precisely than TM-align, a GSA-based method, while the overall success rate of TM-align is better. G-LoSA is particularly efficient for accurate detection of local structures conserved across proteins with diverse global topologies. Recognizing the performance complementarity of G-LoSA to TM-align and a non-template geometry-based method, fpocket, a robust consensus scoring method, CMCS-BSP (Complementary Methods and Consensus Scoring for ligand Binding Site Prediction), is developed and shows improvement on prediction accuracy. The G-LoSA source code is freely available at http://im.bioinformatics.ku.edu/GLoSA. PMID:23957286
Complete fold annotation of the human proteome using a novel structural feature space.

PubMed

Middleton, Sarah A; Illuminati, Joseph; Kim, Junhyong

2017-04-13

Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this method by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families.
Complete fold annotation of the human proteome using a novel structural feature space

PubMed Central

Middleton, Sarah A.; Illuminati, Joseph; Kim, Junhyong

2017-01-01

Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this method by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families. PMID:28406174
VoroMQA: Assessment of protein structure quality using interatomic contact areas.

PubMed

Olechnovič, Kliment; Venclovas, Česlovas

2017-06-01

In the absence of experimentally determined protein structure many biological questions can be addressed using computational structural models. However, the utility of protein structural models depends on their quality. Therefore, the estimation of the quality of predicted structures is an important problem. One of the approaches to this problem is the use of knowledge-based statistical potentials. Such methods typically rely on the statistics of distances and angles of residue-residue or atom-atom interactions collected from experimentally determined structures. Here, we present VoroMQA (Voronoi tessellation-based Model Quality Assessment), a new method for the estimation of protein structure quality. Our method combines the idea of statistical potentials with the use of interatomic contact areas instead of distances. Contact areas, derived using Voronoi tessellation of protein structure, are used to describe and seamlessly integrate both explicit interactions between protein atoms and implicit interactions of protein atoms with solvent. VoroMQA produces scores at atomic, residue, and global levels, all in the fixed range from 0 to 1. The method was tested on the CASP data and compared to several other single-model quality assessment methods. VoroMQA showed strong performance in the recognition of the native structure and in the structural model selection tests, thus demonstrating the efficacy of interatomic contact areas in estimating protein structure quality. The software implementation of VoroMQA is freely available as a standalone application and as a web server at http://bioinformatics.lt/software/voromqa. Proteins 2017; 85:1131-1145. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.

The structure and dynamics in solution of Cu(I) pseudoazurin from Paracoccus pantotrophus.

PubMed Central

Thompson, G. S.; Leung, Y. C.; Ferguson, S. J.; Radford, S. E.; Redfield, C.

2000-01-01

The solution structure and backbone dynamics of Cu(I) pseudoazurin, a 123 amino acid electron transfer protein from Paracoccus pantotrophus, have been determined using NMR methods. The structure was calculated to high precision, with a backbone RMS deviation for secondary structure elements of 0.35+/-0.06 A, using 1,498 distance and 55 torsion angle constraints. The protein has a double-wound Greek-key fold with two alpha-helices toward its C-terminus, similar to that of its oxidized counterpart determined by X-ray crystallography. Comparison of the Cu(I) solution structure with the X-ray structure of the Cu(II) protein shows only small differences in the positions of some of the secondary structure elements. Order parameters S2, measured for amide nitrogens, indicate that the backbone of the protein is rigid on the picosecond to nanosecond timescale. PMID:10850794
Synchrotron IR microspectroscopy for protein structure analysis: Potential and questions

DOE PAGES

Yu, Peiqiang

2006-01-01

Synchrotron radiation-based Fourier transform infrared microspectroscopy (S-FTIR) has been developed as a rapid, direct, non-destructive, bioanalytical technique. This technique takes advantage of synchrotron light brightness and small effective source size and is capable of exploring the molecular chemical make-up within microstructures of a biological tissue without destruction of inherent structures at ultra-spatial resolutions within cellular dimension. To date there has been very little application of this advanced technique to the study of pure protein inherent structure at a cellular level in biological tissues. In this review, a novel approach was introduced to show the potential of the newly developed, advancedmore » synchrotron-based analytical technology, which can be used to localize relatively “pure“ protein in the plant tissues and relatively reveal protein inherent structure and protein molecular chemical make-up within intact tissue at cellular and subcellular levels. Several complex protein IR spectra data analytical techniques (Gaussian and Lorentzian multi-component peak modeling, univariate and multivariate analysis, principal component analysis (PCA), and hierarchical cluster analysis (CLA) are employed to relatively reveal features of protein inherent structure and distinguish protein inherent structure differences between varieties/species and treatments in plant tissues. By using a multi-peak modeling procedure, RELATIVE estimates (but not EXACT determinations) for protein secondary structure analysis can be made for comparison purpose. The issues of pro- and anti-multi-peaking modeling/fitting procedure for relative estimation of protein structure were discussed. By using the PCA and CLA analyses, the plant molecular structure can be qualitatively separate one group from another, statistically, even though the spectral assignments are not known. The synchrotron-based technology provides a new approach for protein structure research in biological tissues at ultraspatial resolutions.« less
Effects of temperature and SDS on the structure of beta-glycosidase from the thermophilic archaeon Sulfolobus solfataricus.

PubMed Central

D'auria, S; Barone, R; Rossi, M; Nucci, R; Barone, G; Fessas, D; Bertoli, E; Tanfani, F

1997-01-01

The effects of temperature and SDS on the three-dimensional organization and secondary structure of beta-glycosidase from the thermophilic archaeon Sulfolobus solfataricus were investigated by CD, IR spectroscopy and differential scanning calorimetry. CD spectra in the near UV region showed that the detergent caused a remarkable change in the protein tertiary structure, and far-UV CD analysis revealed only a slight effect on secondary structure. Infrared spectroscopy showed that low concentrations of the detergent (up to 0.02%) induced slight changes in the enzyme secondary structure, whereas high concentrations caused the alpha-helix content to increase at high temperatures and prevented protein aggregation. PMID:9169619
Solution structures, dynamics, and ice growth inhibitory activity of peptide fragments derived from an antarctic yeast protein.

PubMed

Shah, Syed Hussinien H; Kar, Rajiv K; Asmawi, Azren A; Rahman, Mohd Basyaruddin A; Murad, Abdul Munir A; Mahadi, Nor M; Basri, Mahiran; Rahman, Raja Noor Zaliha A; Salleh, Abu B; Chatterjee, Subhrangsu; Tejo, Bimo A; Bhunia, Anirban

2012-01-01

Exotic functions of antifreeze proteins (AFP) and antifreeze glycopeptides (AFGP) have recently been attracted with much interest to develop them as commercial products. AFPs and AFGPs inhibit ice crystal growth by lowering the water freezing point without changing the water melting point. Our group isolated the Antarctic yeast Glaciozyma antarctica that expresses antifreeze protein to assist it in its survival mechanism at sub-zero temperatures. The protein is unique and novel, indicated by its low sequence homology compared to those of other AFPs. We explore the structure-function relationship of G. antarctica AFP using various approaches ranging from protein structure prediction, peptide design and antifreeze activity assays, nuclear magnetic resonance (NMR) studies and molecular dynamics simulation. The predicted secondary structure of G. antarctica AFP shows several α-helices, assumed to be responsible for its antifreeze activity. We designed several peptide fragments derived from the amino acid sequences of α-helical regions of the parent AFP and they also showed substantial antifreeze activities, below that of the original AFP. The relationship between peptide structure and activity was explored by NMR spectroscopy and molecular dynamics simulation. NMR results show that the antifreeze activity of the peptides correlates with their helicity and geometrical straightforwardness. Furthermore, molecular dynamics simulation also suggests that the activity of the designed peptides can be explained in terms of the structural rigidity/flexibility, i.e., the most active peptide demonstrates higher structural stability, lower flexibility than that of the other peptides with lower activities, and of lower rigidity. This report represents the first detailed report of downsizing a yeast AFP into its peptide fragments with measurable antifreeze activities.
Solution Structures, Dynamics, and Ice Growth Inhibitory Activity of Peptide Fragments Derived from an Antarctic Yeast Protein

PubMed Central

Asmawi, Azren A.; Rahman, Mohd Basyaruddin A.; Murad, Abdul Munir A.; Mahadi, Nor M.; Basri, Mahiran; Rahman, Raja Noor Zaliha A.; Salleh, Abu B.; Chatterjee, Subhrangsu; Tejo, Bimo A.; Bhunia, Anirban

2012-01-01

Exotic functions of antifreeze proteins (AFP) and antifreeze glycopeptides (AFGP) have recently been attracted with much interest to develop them as commercial products. AFPs and AFGPs inhibit ice crystal growth by lowering the water freezing point without changing the water melting point. Our group isolated the Antarctic yeast Glaciozyma antarctica that expresses antifreeze protein to assist it in its survival mechanism at sub-zero temperatures. The protein is unique and novel, indicated by its low sequence homology compared to those of other AFPs. We explore the structure-function relationship of G. antarctica AFP using various approaches ranging from protein structure prediction, peptide design and antifreeze activity assays, nuclear magnetic resonance (NMR) studies and molecular dynamics simulation. The predicted secondary structure of G. antarctica AFP shows several α-helices, assumed to be responsible for its antifreeze activity. We designed several peptide fragments derived from the amino acid sequences of α-helical regions of the parent AFP and they also showed substantial antifreeze activities, below that of the original AFP. The relationship between peptide structure and activity was explored by NMR spectroscopy and molecular dynamics simulation. NMR results show that the antifreeze activity of the peptides correlates with their helicity and geometrical straightforwardness. Furthermore, molecular dynamics simulation also suggests that the activity of the designed peptides can be explained in terms of the structural rigidity/flexibility, i.e., the most active peptide demonstrates higher structural stability, lower flexibility than that of the other peptides with lower activities, and of lower rigidity. This report represents the first detailed report of downsizing a yeast AFP into its peptide fragments with measurable antifreeze activities. PMID:23209600
The crystal structure of the streptococcal collagen-like protein 2 globular domain from invasive M3-type group A Streptococcus shows significant similarity to immunomodulatory HIV protein gp41.

PubMed

Squeglia, Flavia; Bachert, Beth; De Simone, Alfonso; Lukomski, Slawomir; Berisio, Rita

2014-02-21

The arsenal of virulence factors deployed by streptococci includes streptococcal collagen-like (Scl) proteins. These proteins, which are characterized by a globular domain and a collagen-like domain, play key roles in host adhesion, host immune defense evasion, and biofilm formation. In this work, we demonstrate that the Scl2.3 protein is expressed on the surface of invasive M3-type strain MGAS315 of Streptococcus pyogenes. We report the crystal structure of Scl2.3 globular domain, the first of any Scl. This structure shows a novel fold among collagen trimerization domains of either bacterial or human origin. Despite there being low sequence identity, we observed that Scl2.3 globular domain structurally resembles the gp41 subunit of the envelope glycoprotein from human immunodeficiency virus type 1, an essential subunit for viral fusion to human T cells. We combined crystallographic data with modeling and molecular dynamics techniques to gather information on the entire lollipop-like Scl2.3 structure. Molecular dynamics data evidence a high flexibility of Scl2.3 with remarkable interdomain motions that are likely instrumental to the protein biological function in mediating adhesive or immune-modulatory functions in host-pathogen interactions. Altogether, our results provide molecular tools for the understanding of Scl-mediated streptococcal pathogenesis and important structural insights for the future design of small molecular inhibitors of streptococcal invasion.
Structural and Functional Studies of H. seropedicae RecA Protein - Insights into the Polymerization of RecA Protein as Nucleoprotein Filament.

PubMed

Leite, Wellington C; Galvão, Carolina W; Saab, Sérgio C; Iulek, Jorge; Etto, Rafael M; Steffens, Maria B R; Chitteni-Pattu, Sindhu; Stanage, Tyler; Keck, James L; Cox, Michael M

2016-01-01

The bacterial RecA protein plays a role in the complex system of DNA damage repair. Here, we report the functional and structural characterization of the Herbaspirillum seropedicae RecA protein (HsRecA). HsRecA protein is more efficient at displacing SSB protein from ssDNA than Escherichia coli RecA protein. HsRecA also promotes DNA strand exchange more efficiently. The three dimensional structure of HsRecA-ADP/ATP complex has been solved to 1.7 Å resolution. HsRecA protein contains a small N-terminal domain, a central core ATPase domain and a large C-terminal domain, that are similar to homologous bacterial RecA proteins. Comparative structural analysis showed that the N-terminal polymerization motif of archaeal and eukaryotic RecA family proteins are also present in bacterial RecAs. Reconstruction of electrostatic potential from the hexameric structure of HsRecA-ADP/ATP revealed a high positive charge along the inner side, where ssDNA is bound inside the filament. The properties of this surface may explain the greater capacity of HsRecA protein to bind ssDNA, forming a contiguous nucleoprotein filament, displace SSB and promote DNA exchange relative to EcRecA. Our functional and structural analyses provide insight into the molecular mechanisms of polymerization of bacterial RecA as a helical nucleoprotein filament.
Predictive energy landscapes for folding membrane protein assemblies

NASA Astrophysics Data System (ADS)

Truong, Ha H.; Kim, Bobby L.; Schafer, Nicholas P.; Wolynes, Peter G.

2015-12-01

We study the energy landscapes for membrane protein oligomerization using the Associative memory, Water mediated, Structure and Energy Model with an implicit membrane potential (AWSEM-membrane), a coarse-grained molecular dynamics model previously optimized under the assumption that the energy landscapes for folding α-helical membrane protein monomers are funneled once their native topology within the membrane is established. In this study we show that the AWSEM-membrane force field is able to sample near native binding interfaces of several oligomeric systems. By predicting candidate structures using simulated annealing, we further show that degeneracies in predicting structures of membrane protein monomers are generally resolved in the folding of the higher order assemblies as is the case in the assemblies of both nicotinic acetylcholine receptor and V-type Na+-ATPase dimers. The physics of the phenomenon resembles domain swapping, which is consistent with the landscape following the principle of minimal frustration. We revisit also the classic Khorana study of the reconstitution of bacteriorhodopsin from its fragments, which is the close analogue of the early Anfinsen experiment on globular proteins. Here, we show the retinal cofactor likely plays a major role in selecting the final functional assembly.
Structure of faustovirus, a large dsDNA virus

DOE Office of Scientific and Technical Information (OSTI.GOV)

Klose, Thomas; Reteno, Dorine G.; Benamar, Samia

Many viruses protect their genome with a combination of a protein shell with or without a membrane layer. In this paper, we describe the structure of faustovirus, the first DNA virus (to our knowledge) that has been found to use two protein shells to encapsidate and protect its genome. The crystal structure of the major capsid protein, in combination with cryo-electron microscopy structures of two different maturation stages of the virus, shows that the outer virus shell is composed of a double jelly-roll protein that can be found in many double-stranded DNA viruses. The structure of the repeating hexameric unitmore » of the inner shell is different from all other known capsid proteins. In addition to the unique architecture, the region of the genome that encodes the major capsid protein stretches over 17,000 bp and contains a large number of introns and exons. Finally, this complexity might help the virus to rapidly adapt to new environments or hosts.« less
Structure of faustovirus, a large dsDNA virus

DOE PAGES

Klose, Thomas; Reteno, Dorine G.; Benamar, Samia; ...

2016-05-16

Many viruses protect their genome with a combination of a protein shell with or without a membrane layer. In this paper, we describe the structure of faustovirus, the first DNA virus (to our knowledge) that has been found to use two protein shells to encapsidate and protect its genome. The crystal structure of the major capsid protein, in combination with cryo-electron microscopy structures of two different maturation stages of the virus, shows that the outer virus shell is composed of a double jelly-roll protein that can be found in many double-stranded DNA viruses. The structure of the repeating hexameric unitmore » of the inner shell is different from all other known capsid proteins. In addition to the unique architecture, the region of the genome that encodes the major capsid protein stretches over 17,000 bp and contains a large number of introns and exons. Finally, this complexity might help the virus to rapidly adapt to new environments or hosts.« less
Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields.

PubMed

Wang, Sheng; Peng, Jian; Ma, Jianzhu; Xu, Jinbo

2016-01-11

Protein secondary structure (SS) prediction is important for studying protein structure and function. When only the sequence (profile) information is used as input feature, currently the best predictors can obtain ~80% Q3 accuracy, which has not been improved in the past decade. Here we present DeepCNF (Deep Convolutional Neural Fields) for protein SS prediction. DeepCNF is a Deep Learning extension of Conditional Neural Fields (CNF), which is an integration of Conditional Random Fields (CRF) and shallow neural networks. DeepCNF can model not only complex sequence-structure relationship by a deep hierarchical architecture, but also interdependency between adjacent SS labels, so it is much more powerful than CNF. Experimental results show that DeepCNF can obtain ~84% Q3 accuracy, ~85% SOV score, and ~72% Q8 accuracy, respectively, on the CASP and CAMEO test proteins, greatly outperforming currently popular predictors. As a general framework, DeepCNF can be used to predict other protein structure properties such as contact number, disorder regions, and solvent accessibility.
Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields

NASA Astrophysics Data System (ADS)

Wang, Sheng; Peng, Jian; Ma, Jianzhu; Xu, Jinbo

2016-01-01

Protein secondary structure (SS) prediction is important for studying protein structure and function. When only the sequence (profile) information is used as input feature, currently the best predictors can obtain ~80% Q3 accuracy, which has not been improved in the past decade. Here we present DeepCNF (Deep Convolutional Neural Fields) for protein SS prediction. DeepCNF is a Deep Learning extension of Conditional Neural Fields (CNF), which is an integration of Conditional Random Fields (CRF) and shallow neural networks. DeepCNF can model not only complex sequence-structure relationship by a deep hierarchical architecture, but also interdependency between adjacent SS labels, so it is much more powerful than CNF. Experimental results show that DeepCNF can obtain ~84% Q3 accuracy, ~85% SOV score, and ~72% Q8 accuracy, respectively, on the CASP and CAMEO test proteins, greatly outperforming currently popular predictors. As a general framework, DeepCNF can be used to predict other protein structure properties such as contact number, disorder regions, and solvent accessibility.
Tertiary structural propensities reveal fundamental sequence/structure relationships.

PubMed

Zheng, Fan; Zhang, Jian; Grigoryan, Gevorg

2015-05-05

Extracting useful generalizations from the continually growing Protein Data Bank (PDB) is of central importance. We hypothesize that the PDB contains valuable quantitative information on the level of local tertiary structural motifs (TERMs). We show that by breaking a protein structure into its constituent TERMs, and querying the PDB to characterize the natural ensemble matching each, we can estimate the compatibility of the structure with a given amino acid sequence through a metric we term "structure score." Considering submissions from recent Critical Assessment of Structure Prediction (CASP) experiments, we found a strong correlation (R = 0.69) between structure score and model accuracy, with poorly predicted regions readily identifiable. This performance exceeds that of leading atomistic statistical energy functions. Furthermore, TERM-based analysis of two prototypical multi-state proteins rapidly produced structural insights fully consistent with prior extensive experimental studies. We thus find that TERM-based analysis should have considerable utility for protein structural biology. Copyright © 2015 Elsevier Ltd. All rights reserved.
Encounter complexes and dimensionality reduction in protein-protein association.

PubMed

Kozakov, Dima; Li, Keyong; Hall, David R; Beglov, Dmitri; Zheng, Jiefu; Vakili, Pirooz; Schueler-Furman, Ora; Paschalidis, Ioannis Ch; Clore, G Marius; Vajda, Sandor

2014-04-08

An outstanding challenge has been to understand the mechanism whereby proteins associate. We report here the results of exhaustively sampling the conformational space in protein-protein association using a physics-based energy function. The agreement between experimental intermolecular paramagnetic relaxation enhancement (PRE) data and the PRE profiles calculated from the docked structures shows that the method captures both specific and non-specific encounter complexes. To explore the energy landscape in the vicinity of the native structure, the nonlinear manifold describing the relative orientation of two solid bodies is projected onto a Euclidean space in which the shape of low energy regions is studied by principal component analysis. Results show that the energy surface is canyon-like, with a smooth funnel within a two dimensional subspace capturing over 75% of the total motion. Thus, proteins tend to associate along preferred pathways, similar to sliding of a protein along DNA in the process of protein-DNA recognition. DOI: http://dx.doi.org/10.7554/eLife.01370.001.
Msp1 Is a Membrane Protein Dislocase for Tail-Anchored Proteins.

PubMed

Wohlever, Matthew L; Mateja, Agnieszka; McGilvray, Philip T; Day, Kasey J; Keenan, Robert J

2017-07-20

Mislocalized tail-anchored (TA) proteins of the outer mitochondrial membrane are cleared by a newly identified quality control pathway involving the conserved eukaryotic protein Msp1 (ATAD1 in humans). Msp1 is a transmembrane AAA-ATPase, but its role in TA protein clearance is not known. Here, using purified components reconstituted into proteoliposomes, we show that Msp1 is both necessary and sufficient to drive the ATP-dependent extraction of TA proteins from the membrane. A crystal structure of the Msp1 cytosolic region modeled into a ring hexamer suggests that active Msp1 contains a conserved membrane-facing surface adjacent to a central pore. Structure-guided mutagenesis of the pore residues shows that they are critical for TA protein extraction in vitro and for functional complementation of an msp1 deletion in yeast. Together, these data provide a molecular framework for Msp1-dependent extraction of mislocalized TA proteins from the outer mitochondrial membrane. Copyright © 2017 Elsevier Inc. All rights reserved.
Mycobacterium tuberculosis acyl carrier protein synthase adopts two different pH-dependent structural conformations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gokulan, Kuppan; Aggarwal, Anup; Shipman, Lance

2011-07-01

Bacterial acyl carrier protein synthase plays an essential role in the synthesis of fatty acids, nonribosomal peptides and polyketides. In Mycobacterium tuberculosis, AcpS or group I phosphopentatheine transferase exhibits two different structural conformations depending upon the pH. The crystal structures of acyl carrier protein synthase (AcpS) from Mycobacterium tuberculosis (Mtb) and Corynebacterium ammoniagenes determined at pH 5.3 and pH 6.5, respectively, are reported. Comparison of the Mtb apo-AcpS structure with the recently reported structure of the Mtb AcpS–ADP complex revealed that AcpS adopts two different conformations: the orthorhombic and trigonal space-group structures show structural differences in the α2 helix andmore » in the conformation of the α3–α4 connecting loop, which is in a closed conformation. The apo-AcpS structure shows electron density for the entire model and was obtained at lower pH values (4.4–6.0). In contrast, at a higher pH value (6.5) AcpS undergoes significant conformational changes, resulting in disordered regions that show no electron density in the AcpS model. The solved structures also reveal that C. ammoniagenes AcpS undergoes structural rearrangement in two regions, similar to the recently reported Mtb AcpS–ADP complex structure. In vitro reconstitution experiments show that AcpS has a higher post-translational modification activity between pH 4.4 and 6.0 than at pH values above 6.5, where the activity drops owing to the change in conformation. The results show that apo-AcpS and AcpS–ADP adopt different conformations depending upon the pH conditions of the crystallization solution.« less
Acyl carrier protein structural classification and normal mode analysis

PubMed Central

Cantu, David C; Forrester, Michael J; Charov, Katherine; Reilly, Peter J

2012-01-01

All acyl carrier protein primary and tertiary structures were gathered into the ThYme database. They are classified into 16 families by amino acid sequence similarity, with members of the different families having sequences with statistically highly significant differences. These classifications are supported by tertiary structure superposition analysis. Tertiary structures from a number of families are very similar, suggesting that these families may come from a single distant ancestor. Normal vibrational mode analysis was conducted on experimentally determined freestanding structures, showing greater fluctuations at chain termini and loops than in most helices. Their modes overlap more so within families than between different families. The tertiary structures of three acyl carrier protein families that lacked any known structures were predicted as well. PMID:22374859
Structure prediction of polyglutamine disease proteins: comparison of methods

PubMed Central

2014-01-01

Background The expansion of polyglutamine (poly-Q) repeats in several unrelated proteins is associated with at least ten neurodegenerative diseases. The length of the poly-Q regions plays an important role in the progression of the diseases. The number of glutamines (Q) is inversely related to the onset age of these polyglutamine diseases, and the expansion of poly-Q repeats has been associated with protein misfolding. However, very little is known about the structural changes induced by the expansion of the repeats. Computational methods can provide an alternative to determine the structure of these poly-Q proteins, but it is important to evaluate their performance before large scale prediction work is done. Results In this paper, two popular protein structure prediction programs, I-TASSER and Rosetta, have been used to predict the structure of the N-terminal fragment of a protein associated with Huntington's disease with 17 glutamines. Results show that both programs have the ability to find the native structures, but I-TASSER performs better for the overall task. Conclusions Both I-TASSER and Rosetta can be used for structure prediction of proteins with poly-Q repeats. Knowledge of poly-Q structure may significantly contribute to development of therapeutic strategies for poly-Q diseases. PMID:25080018
Protein structure determination by exhaustive search of Protein Data Bank derived databases.

PubMed

Stokes-Rees, Ian; Sliz, Piotr

2010-12-14

Parallel sequence and structure alignment tools have become ubiquitous and invaluable at all levels in the study of biological systems. We demonstrate the application and utility of this same parallel search paradigm to the process of protein structure determination, benefitting from the large and growing corpus of known structures. Such searches were previously computationally intractable. Through the method of Wide Search Molecular Replacement, developed here, they can be completed in a few hours with the aide of national-scale federated cyberinfrastructure. By dramatically expanding the range of models considered for structure determination, we show that small (less than 12% structural coverage) and low sequence identity (less than 20% identity) template structures can be identified through multidimensional template scoring metrics and used for structure determination. Many new macromolecular complexes can benefit significantly from such a technique due to the lack of known homologous protein folds or sequences. We demonstrate the effectiveness of the method by determining the structure of a full-length p97 homologue from Trichoplusia ni. Example cases with the MHC/T-cell receptor complex and the EmoB protein provide systematic estimates of minimum sequence identity, structure coverage, and structural similarity required for this method to succeed. We describe how this structure-search approach and other novel computationally intensive workflows are made tractable through integration with the US national computational cyberinfrastructure, allowing, for example, rapid processing of the entire Structural Classification of Proteins protein fragment database.
The modular architecture of protein-protein binding interfaces.

PubMed

Reichmann, D; Rahat, O; Albeck, S; Meged, R; Dym, O; Schreiber, G

2005-01-04

Protein-protein interactions are essential for life. Yet, our understanding of the general principles governing binding is not complete. In the present study, we show that the interface between proteins is built in a modular fashion; each module is comprised of a number of closely interacting residues, with few interactions between the modules. The boundaries between modules are defined by clustering the contact map of the interface. We show that mutations in one module do not affect residues located in a neighboring module. As a result, the structural and energetic consequences of the deletion of entire modules are surprisingly small. To the contrary, within their module, mutations cause complex energetic and structural consequences. Experimentally, this phenomenon is shown on the interaction between TEM1-beta-lactamase and beta-lactamase inhibitor protein (BLIP) by using multiple-mutant analysis and x-ray crystallography. Replacing an entire module of five interface residues with Ala created a large cavity in the interface, with no effect on the detailed structure of the remaining interface. The modular architecture of binding sites, which resembles human engineering design, greatly simplifies the design of new protein interactions and provides a feasible view of how these interactions evolved.

Comparative analysis of changes in gene expression due to RNA melting activities of translation initiation factor IF1 and a cold shock protein of the CspA family.

PubMed

Phadtare, Sangita; Severinov, Konstantin

2009-11-01

In Escherichia coli, temperature downshift elicits cold shock response, which is characterized by induction of cold shock proteins. CspA, the major cold shock protein of E. coli, helps cells to acclimatize to low temperature by melting the secondary structures in nucleic acids and acting as a transcription antiterminator. CspA and its homologues contain the cold shock domain and belong to the oligomer binding protein family, which also includes S1 domain proteins such as IF1. Structural similarity between IF1 and CspA homologues suggested a functional overlap between these proteins. Indeed IF1 can melt secondary structures in RNA and acts as transcription antiterminator in vivo and in vitro. Here, we show that in spite of having these critical activities, IF1 does not complement cold-sensitivity of a csp quadruple deletion strain. DNA microarray analysis shows that overproduction of IF1 and Csp leads to changes in expression of different sets of genes. Importantly, several genes which were previously shown to require Csp proteins for their expression at low temperature did not respond to IF1. Moreover, in vitro, we show that a transcription terminator responsive to Csp does not respond to IF1. Our results suggest that Csp proteins and IF1 have different sets of target genes as they may be suppressing the function of different types of transcription termination elements in specific genes.
Structural insights into a secretory abundant heat-soluble protein from an anhydrobiotic tardigrade, Ramazzottius varieornatus.

PubMed

Fukuda, Yohta; Miura, Yoshimasa; Mizohata, Eiichi; Inoue, Tsuyoshi

2017-08-01

Upon stopping metabolic processes, some tardigrades can undergo anhydrobiosis. Secretory abundant heat-soluble (SAHS) proteins have been reported as candidates for anhydrobiosis-related proteins in tardigrades, which seem to protect extracellular components and/or secretory organelles. We determined structures of a SAHS protein from Ramazzottius varieornatus (RvSAHS1), which is one of the toughest tardigrades. RvSAHS1 shows a β-barrel structure similar to fatty acid-binding proteins (FABPs), in which hydrophilic residues form peculiar hydrogen bond networks, which would provide RvSAHS1 with better tolerance against dehydration. We identified two putative ligand-binding sites: one that superimposes on those of some FABPs and the other, unique to and conserved in SAHS proteins. These results indicate that SAHS proteins constitute a new FABP family. © 2017 Federation of European Biochemical Societies.
Characterization of the motion of membrane proteins using high-speed atomic force microscopy

NASA Astrophysics Data System (ADS)

Casuso, Ignacio; Khao, Jonathan; Chami, Mohamed; Paul-Gilloteaux, Perrine; Husain, Mohamed; Duneau, Jean-Pierre; Stahlberg, Henning; Sturgis, James N.; Scheuring, Simon

2012-08-01

For cells to function properly, membrane proteins must be able to diffuse within biological membranes. The functions of these membrane proteins depend on their position and also on protein-protein and protein-lipid interactions. However, so far, it has not been possible to study simultaneously the structure and dynamics of biological membranes. Here, we show that the motion of unlabelled membrane proteins can be characterized using high-speed atomic force microscopy. We find that the molecules of outer membrane protein F (OmpF) are widely distributed in the membrane as a result of diffusion-limited aggregation, and while the overall protein motion scales roughly with the local density of proteins in the membrane, individual protein molecules can also diffuse freely or become trapped by protein-protein interactions. Using these measurements, and the results of molecular dynamics simulations, we determine an interaction potential map and an interaction pathway for a membrane protein, which should provide new insights into the connection between the structures of individual proteins and the structures and dynamics of supramolecular membranes.
Deciphering the shape and deformation of secondary structures through local conformation analysis

PubMed Central

2011-01-01

Background Protein deformation has been extensively analysed through global methods based on RMSD, torsion angles and Principal Components Analysis calculations. Here we use a local approach, able to distinguish among the different backbone conformations within loops, α-helices and β-strands, to address the question of secondary structures' shape variation within proteins and deformation at interface upon complexation. Results Using a structural alphabet, we translated the 3 D structures of large sets of protein-protein complexes into sequences of structural letters. The shape of the secondary structures can be assessed by the structural letters that modeled them in the structural sequences. The distribution analysis of the structural letters in the three protein compartments (surface, core and interface) reveals that secondary structures tend to adopt preferential conformations that differ among the compartments. The local description of secondary structures highlights that curved conformations are preferred on the surface while straight ones are preferred in the core. Interfaces display a mixture of local conformations either preferred in core or surface. The analysis of the structural letters transition occurring between protein-bound and unbound conformations shows that the deformation of secondary structure is tightly linked to the compartment preference of the local conformations. Conclusion The conformation of secondary structures can be further analysed and detailed thanks to a structural alphabet which allows a better description of protein surface, core and interface in terms of secondary structures' shape and deformation. Induced-fit modification tendencies described here should be valuable information to identify and characterize regions under strong structural constraints for functional reasons. PMID:21284872
Deciphering the shape and deformation of secondary structures through local conformation analysis.

PubMed

Baussand, Julie; Camproux, Anne-Claude

2011-02-01

Protein deformation has been extensively analysed through global methods based on RMSD, torsion angles and Principal Components Analysis calculations. Here we use a local approach, able to distinguish among the different backbone conformations within loops, α-helices and β-strands, to address the question of secondary structures' shape variation within proteins and deformation at interface upon complexation. Using a structural alphabet, we translated the 3 D structures of large sets of protein-protein complexes into sequences of structural letters. The shape of the secondary structures can be assessed by the structural letters that modeled them in the structural sequences. The distribution analysis of the structural letters in the three protein compartments (surface, core and interface) reveals that secondary structures tend to adopt preferential conformations that differ among the compartments. The local description of secondary structures highlights that curved conformations are preferred on the surface while straight ones are preferred in the core. Interfaces display a mixture of local conformations either preferred in core or surface. The analysis of the structural letters transition occurring between protein-bound and unbound conformations shows that the deformation of secondary structure is tightly linked to the compartment preference of the local conformations. The conformation of secondary structures can be further analysed and detailed thanks to a structural alphabet which allows a better description of protein surface, core and interface in terms of secondary structures' shape and deformation. Induced-fit modification tendencies described here should be valuable information to identify and characterize regions under strong structural constraints for functional reasons.
Structure and assembly of a paramyxovirus matrix protein

PubMed Central

Battisti, Anthony J.; Meng, Geng; Winkler, Dennis C.; McGinnes, Lori W.; Plevka, Pavel; Steven, Alasdair C.; Morrison, Trudy G.; Rossmann, Michael G.

2012-01-01

Many pleomorphic, lipid-enveloped viruses encode matrix proteins that direct their assembly and budding, but the mechanism of this process is unclear. We have combined X-ray crystallography and cryoelectron tomography to show that the matrix protein of Newcastle disease virus, a paramyxovirus and relative of measles virus, forms dimers that assemble into pseudotetrameric arrays that generate the membrane curvature necessary for virus budding. We show that the glycoproteins are anchored in the gaps between the matrix proteins and that the helical nucleocapsids are associated in register with the matrix arrays. About 90% of virions lack matrix arrays, suggesting that, in agreement with previous biological observations, the matrix protein needs to dissociate from the viral membrane during maturation, as is required for fusion and release of the nucleocapsid into the host’s cytoplasm. Structure and sequence conservation imply that other paramyxovirus matrix proteins function similarly. PMID:22891297
Structure and assembly of a paramyxovirus matrix protein.

PubMed

Battisti, Anthony J; Meng, Geng; Winkler, Dennis C; McGinnes, Lori W; Plevka, Pavel; Steven, Alasdair C; Morrison, Trudy G; Rossmann, Michael G

2012-08-28

Many pleomorphic, lipid-enveloped viruses encode matrix proteins that direct their assembly and budding, but the mechanism of this process is unclear. We have combined X-ray crystallography and cryoelectron tomography to show that the matrix protein of Newcastle disease virus, a paramyxovirus and relative of measles virus, forms dimers that assemble into pseudotetrameric arrays that generate the membrane curvature necessary for virus budding. We show that the glycoproteins are anchored in the gaps between the matrix proteins and that the helical nucleocapsids are associated in register with the matrix arrays. About 90% of virions lack matrix arrays, suggesting that, in agreement with previous biological observations, the matrix protein needs to dissociate from the viral membrane during maturation, as is required for fusion and release of the nucleocapsid into the host's cytoplasm. Structure and sequence conservation imply that other paramyxovirus matrix proteins function similarly.
Thermodynamic prediction of protein neutrality.

PubMed

Bloom, Jesse D; Silberg, Jonathan J; Wilke, Claus O; Drummond, D Allan; Adami, Christoph; Arnold, Frances H

2005-01-18

We present a simple theory that uses thermodynamic parameters to predict the probability that a protein retains the wild-type structure after one or more random amino acid substitutions. Our theory predicts that for large numbers of substitutions the probability that a protein retains its structure will decline exponentially with the number of substitutions, with the severity of this decline determined by properties of the structure. Our theory also predicts that a protein can gain extra robustness to the first few substitutions by increasing its thermodynamic stability. We validate our theory with simulations on lattice protein models and by showing that it quantitatively predicts previously published experimental measurements on subtilisin and our own measurements on variants of TEM1 beta-lactamase. Our work unifies observations about the clustering of functional proteins in sequence space, and provides a basis for interpreting the response of proteins to substitutions in protein engineering applications.
Thermodynamic prediction of protein neutrality

PubMed Central

Bloom, Jesse D.; Silberg, Jonathan J.; Wilke, Claus O.; Drummond, D. Allan; Adami, Christoph; Arnold, Frances H.

2005-01-01

We present a simple theory that uses thermodynamic parameters to predict the probability that a protein retains the wild-type structure after one or more random amino acid substitutions. Our theory predicts that for large numbers of substitutions the probability that a protein retains its structure will decline exponentially with the number of substitutions, with the severity of this decline determined by properties of the structure. Our theory also predicts that a protein can gain extra robustness to the first few substitutions by increasing its thermodynamic stability. We validate our theory with simulations on lattice protein models and by showing that it quantitatively predicts previously published experimental measurements on subtilisin and our own measurements on variants of TEM1 β-lactamase. Our work unifies observations about the clustering of functional proteins in sequence space, and provides a basis for interpreting the response of proteins to substitutions in protein engineering applications. PMID:15644440
Correlation between protein sequence similarity and x-ray diffraction quality in the protein data bank.

PubMed

Lu, Hui-Meng; Yin, Da-Chuan; Ye, Ya-Jing; Luo, Hui-Min; Geng, Li-Qiang; Li, Hai-Sheng; Guo, Wei-Hong; Shang, Peng

2009-01-01

As the most widely utilized technique to determine the 3-dimensional structure of protein molecules, X-ray crystallography can provide structure of the highest resolution among the developed techniques. The resolution obtained via X-ray crystallography is known to be influenced by many factors, such as the crystal quality, diffraction techniques, and X-ray sources, etc. In this paper, the authors found that the protein sequence could also be one of the factors. We extracted information of the resolution and the sequence of proteins from the Protein Data Bank (PDB), classified the proteins into different clusters according to the sequence similarity, and statistically analyzed the relationship between the sequence similarity and the best resolution obtained. The results showed that there was a pronounced correlation between the sequence similarity and the obtained resolution. These results indicate that protein structure itself is one variable that may affect resolution when X-ray crystallography is used.
Tracing Primordial Protein Evolution through Structurally Guided Stepwise Segment Elongation*

PubMed Central

Watanabe, Hideki; Yamasaki, Kazuhiko; Honda, Shinya

2014-01-01

The understanding of how primordial proteins emerged has been a fundamental and longstanding issue in biology and biochemistry. For a better understanding of primordial protein evolution, we synthesized an artificial protein on the basis of an evolutionary hypothesis, segment-based elongation starting from an autonomously foldable short peptide. A 10-residue protein, chignolin, the smallest foldable polypeptide ever reported, was used as a structural support to facilitate higher structural organization and gain-of-function in the development of an artificial protein. Repetitive cycles of segment elongation and subsequent phage display selection successfully produced a 25-residue protein, termed AF.2A1, with nanomolar affinity against the Fc region of immunoglobulin G. AF.2A1 shows exquisite molecular recognition ability such that it can distinguish conformational differences of the same molecule. The structure determined by NMR measurements demonstrated that AF.2A1 forms a globular protein-like conformation with the chignolin-derived β-hairpin and a tryptophan-mediated hydrophobic core. Using sequence analysis and a mutation study, we discovered that the structural organization and gain-of-function emerged from the vicinity of the chignolin segment, revealing that the structural support served as the core in both structural and functional development. Here, we propose an evolutionary model for primordial proteins in which a foldable segment serves as the evolving core to facilitate structural and functional evolution. This study provides insights into primordial protein evolution and also presents a novel methodology for designing small sized proteins useful for industrial and pharmaceutical applications. PMID:24356963
High-throughput crystallization screening.

PubMed

Skarina, Tatiana; Xu, Xiaohui; Evdokimova, Elena; Savchenko, Alexei

2014-01-01

Protein structure determination by X-ray crystallography is dependent on obtaining a single protein crystal suitable for diffraction data collection. Due to this requirement, protein crystallization represents a key step in protein structure determination. The conditions for protein crystallization have to be determined empirically for each protein, making this step also a bottleneck in the structure determination process. Typical protein crystallization practice involves parallel setup and monitoring of a considerable number of individual protein crystallization experiments (also called crystallization trials). In these trials the aliquots of purified protein are mixed with a range of solutions composed of a precipitating agent, buffer, and sometimes an additive that have been previously successful in prompting protein crystallization. The individual chemical conditions in which a particular protein shows signs of crystallization are used as a starting point for further crystallization experiments. The goal is optimizing the formation of individual protein crystals of sufficient size and quality to make them suitable for diffraction data collection. Thus the composition of the primary crystallization screen is critical for successful crystallization.Systematic analysis of crystallization experiments carried out on several hundred proteins as part of large-scale structural genomics efforts allowed the optimization of the protein crystallization protocol and identification of a minimal set of 96 crystallization solutions (the "TRAP" screen) that, in our experience, led to crystallization of the maximum number of proteins.
Structural Analysis of PTM Hotspots (SAPH-ire) – A Quantitative Informatics Method Enabling the Discovery of Novel Regulatory Elements in Protein Families*

PubMed Central

Dewhurst, Henry M.; Choudhury, Shilpa; Torres, Matthew P.

2015-01-01

Predicting the biological function potential of post-translational modifications (PTMs) is becoming increasingly important in light of the exponential increase in available PTM data from high-throughput proteomics. We developed structural analysis of PTM hotspots (SAPH-ire)—a quantitative PTM ranking method that integrates experimental PTM observations, sequence conservation, protein structure, and interaction data to allow rank order comparisons within or between protein families. Here, we applied SAPH-ire to the study of PTMs in diverse G protein families, a conserved and ubiquitous class of proteins essential for maintenance of intracellular structure (tubulins) and signal transduction (large and small Ras-like G proteins). A total of 1728 experimentally verified PTMs from eight unique G protein families were clustered into 451 unique hotspots, 51 of which have a known and cited biological function or response. Using customized software, the hotspots were analyzed in the context of 598 unique protein structures. By comparing distributions of hotspots with known versus unknown function, we show that SAPH-ire analysis is predictive for PTM biological function. Notably, SAPH-ire revealed high-ranking hotspots for which a functional impact has not yet been determined, including phosphorylation hotspots in the N-terminal tails of G protein gamma subunits—conserved protein structures never before reported as regulators of G protein coupled receptor signaling. To validate this prediction we used the yeast model system for G protein coupled receptor signaling, revealing that gamma subunit–N-terminal tail phosphorylation is activated in response to G protein coupled receptor stimulation and regulates protein stability in vivo. These results demonstrate the utility of integrating protein structural and sequence features into PTM prioritization schemes that can improve the analysis and functional power of modification-specific proteomics data. PMID:26070665
Domain analyses of Usher syndrome causing Clarin-1 and GPR98 protein models.

PubMed

Khan, Sehrish Haider; Javed, Muhammad Rizwan; Qasim, Muhammad; Shahzadi, Samar; Jalil, Asma; Rehman, Shahid Ur

2014-01-01

Usher syndrome is an autosomal recessive disorder that causes hearing loss, Retinitis Pigmentosa (RP) and vestibular dysfunction. It is clinically and genetically heterogeneous disorder which is clinically divided into three types i.e. type I, type II and type III. To date, there are about twelve loci and ten identified genes which are associated with Usher syndrome. A mutation in any of these genes e.g. CDH23, CLRN1, GPR98, MYO7A, PCDH15, USH1C, USH1G, USH2A and DFNB31 can result in Usher syndrome or non-syndromic deafness. These genes provide instructions for making proteins that play important roles in normal hearing, balance and vision. Studies have shown that protein structures of only seven genes have been determined experimentally and there are still three genes whose structures are unavailable. These genes are Clarin-1, GPR98 and Usherin. In the absence of an experimentally determined structure, homology modeling and threading often provide a useful 3D model of a protein. Therefore in the current study Clarin-1 and GPR98 proteins have been analyzed for signal peptide, domains and motifs. Clarin-1 protein was found to be without any signal peptide and consists of prokar lipoprotein domain. Clarin-1 is classified within claudin 2 super family and consists of twelve motifs. Whereas, GPR98 has a 29 amino acids long signal peptide and classified within GPCR family 2 having Concanavalin A-like lectin/glucanase superfamily. It was found to be consists of GPS and G protein receptor F2 domains and twenty nine motifs. Their 3D structures have been predicted using I-TASSER server. The model of Clarin-1 showed only α-helix but no beta sheets while model of GPR98 showed both α-helix and β sheets. The predicted structures were then evaluated and validated by MolProbity and Ramachandran plot. The evaluation of the predicted structures showed 78.9% residues of Clarin-1 and 78.9% residues of GPR98 within favored regions. The findings of present study has resulted in the three dimensional structure prediction and conserved domain analysis which will be quite beneficial in better understanding of molecular components, protein-protein interaction, clinical heterogeneity and pathophysiology of Usher syndrome.
Domain analyses of Usher syndrome causing Clarin-1 and GPR98 protein models

PubMed Central

Khan, Sehrish Haider; Javed, Muhammad Rizwan; Qasim, Muhammad; Shahzadi, Samar; Jalil, Asma; Rehman, Shahid ur

2014-01-01

Usher syndrome is an autosomal recessive disorder that causes hearing loss, Retinitis Pigmentosa (RP) and vestibular dysfunction. It is clinically and genetically heterogeneous disorder which is clinically divided into three types i.e. type I, type II and type III. To date, there are about twelve loci and ten identified genes which are associated with Usher syndrome. A mutation in any of these genes e.g. CDH23, CLRN1, GPR98, MYO7A, PCDH15, USH1C, USH1G, USH2A and DFNB31 can result in Usher syndrome or non-syndromic deafness. These genes provide instructions for making proteins that play important roles in normal hearing, balance and vision. Studies have shown that protein structures of only seven genes have been determined experimentally and there are still three genes whose structures are unavailable. These genes are Clarin-1, GPR98 and Usherin. In the absence of an experimentally determined structure, homology modeling and threading often provide a useful 3D model of a protein. Therefore in the current study Clarin-1 and GPR98 proteins have been analyzed for signal peptide, domains and motifs. Clarin-1 protein was found to be without any signal peptide and consists of prokar lipoprotein domain. Clarin-1 is classified within claudin 2 super family and consists of twelve motifs. Whereas, GPR98 has a 29 amino acids long signal peptide and classified within GPCR family 2 having Concanavalin A-like lectin/glucanase superfamily. It was found to be consists of GPS and G protein receptor F2 domains and twenty nine motifs. Their 3D structures have been predicted using I-TASSER server. The model of Clarin-1 showed only α-helix but no beta sheets while model of GPR98 showed both α-helix and β sheets. The predicted structures were then evaluated and validated by MolProbity and Ramachandran plot. The evaluation of the predicted structures showed 78.9% residues of Clarin-1 and 78.9% residues of GPR98 within favored regions. The findings of present study has resulted in the three dimensional structure prediction and conserved domain analysis which will be quite beneficial in better understanding of molecular components, protein-protein interaction, clinical heterogeneity and pathophysiology of Usher syndrome. PMID:25258483
Distributions of experimental protein structures on coarse-grained free energy landscapes

PubMed Central

Liu, Jie; Jernigan, Robert L.

2015-01-01

Predicting conformational changes of proteins is needed in order to fully comprehend functional mechanisms. With the large number of available structures in sets of related proteins, it is now possible to directly visualize the clusters of conformations and their conformational transitions through the use of principal component analysis. The most striking observation about the distributions of the structures along the principal components is their highly non-uniform distributions. In this work, we use principal component analysis of experimental structures of 50 diverse proteins to extract the most important directions of their motions, sample structures along these directions, and estimate their free energy landscapes by combining knowledge-based potentials and entropy computed from elastic network models. When these resulting motions are visualized upon their coarse-grained free energy landscapes, the basis for conformational pathways becomes readily apparent. Using three well-studied proteins, T4 lysozyme, serum albumin, and sarco-endoplasmic reticular Ca2+ adenosine triphosphatase (SERCA), as examples, we show that such free energy landscapes of conformational changes provide meaningful insights into the functional dynamics and suggest transition pathways between different conformational states. As a further example, we also show that Monte Carlo simulations on the coarse-grained landscape of HIV-1 protease can directly yield pathways for force-driven conformational changes. PMID:26723638
The impact of CRISPR repeat sequence on structures of a Cas6 protein-RNA complex

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wang, Ruiying; Zheng, Han; Preamplume, Gan

The repeat-associated mysterious proteins (RAMPs) comprise the most abundant family of proteins involved in prokaryotic immunity against invading genetic elements conferred by the clustered regularly interspaced short palindromic repeat (CRISPR) system. Cas6 is one of the first characterized RAMP proteins and is a key enzyme required for CRISPR RNA maturation. Despite a strong structural homology with other RAMP proteins that bind hairpin RNA, Cas6 distinctly recognizes single-stranded RNA. Previous structural and biochemical studies show that Cas6 captures the 5' end while cleaving the 3' end of the CRISPR RNA. Here, we describe three structures and complementary biochemical analysis of amore » noncatalytic Cas6 homolog from Pyrococcus horikoshii bound to CRISPR repeat RNA of different sequences. Our study confirms the specificity of the Cas6 protein for single-stranded RNA and further reveals the importance of the bases at Positions 5-7 in Cas6-RNA interactions. Substitutions of these bases result in structural changes in the protein-RNA complex including its oligomerization state.« less
An Unusual Hydrophobic Core Confers Extreme Flexibility to HEAT Repeat Proteins

PubMed Central

Kappel, Christian; Zachariae, Ulrich; Dölker, Nicole; Grubmüller, Helmut

2010-01-01

Alpha-solenoid proteins are suggested to constitute highly flexible macromolecules, whose structural variability and large surface area is instrumental in many important protein-protein binding processes. By equilibrium and nonequilibrium molecular dynamics simulations, we show that importin-β, an archetypical α-solenoid, displays unprecedentedly large and fully reversible elasticity. Our stretching molecular dynamics simulations reveal full elasticity over up to twofold end-to-end extensions compared to its bound state. Despite the absence of any long-range intramolecular contacts, the protein can return to its equilibrium structure to within 3 Å backbone RMSD after the release of mechanical stress. We find that this extreme degree of flexibility is based on an unusually flexible hydrophobic core that differs substantially from that of structurally similar but more rigid globular proteins. In that respect, the core of importin-β resembles molten globules. The elastic behavior is dominated by nonpolar interactions between HEAT repeats, combined with conformational entropic effects. Our results suggest that α-solenoid structures such as importin-β may bridge the molecular gap between completely structured and intrinsically disordered proteins. PMID:20816072
Construction of Matryoshka-type structures from supercharged protein nanocages.

PubMed

Beck, Tobias; Tetter, Stephan; Künzle, Matthias; Hilvert, Donald

2015-01-12

Designing nanoscaled hierarchical structures with increasing levels of complexity is challenging. Here we show that electrostatic interactions between two complementarily supercharged protein nanocages can be effectively utilized to create nested Matryoshka-type structures. Cage-within-cage complexes containing spatially ordered iron oxide nanoparticles spontaneously self-assemble upon mixing positively supercharged ferritin compartments with AaLS-13, a larger shell-forming protein with a negatively supercharged lumen. Exploiting engineered Coulombic interactions and protein dynamics in this way opens up new avenues for creating hierarchically organized supramolecular assemblies for application as delivery vehicles, reaction chambers, and artificial organelles. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Golden rule for buttressing vulnerable soluble proteins.

PubMed

Fernández, Ariel; Berry, R Stephen

2010-05-07

Local weaknesses in the structure of soluble proteins have received little attention. The structure may be inherently weak at sites where hydration of the protein backbone is locally hampered by formation of an intramolecular hydrogen bond which in turn is not fully stabilized through burial within a hydrophobic environment. The result is insufficient compensation for the thermodynamic cost of dehydrating the backbone polar groups. This work shows that these structural deficiencies, the unburied backbone hydrogen bonds, are compensated in natural proteins by disulfide bonds that are needed to maintain the structural integrity. Examination of all PDB-reported soluble structures reveals that, after suitable normalization, the number of disulfide bonds, X, correlates tightly with the number of unburied backbone hydrogen bonds, Y, beyond the baseline level Y = 20, revealing a simple balance relation: Y = 5X + 20. This equation introduces a 1:5 ratio associated with the buttressing of soluble proteins with structural deficiencies. The results are justified on thermodynamic grounds and have implications for biomolecular engineering as they introduce two constants of universal applicability determining the architecture of soluble proteins.

Interactive and Versatile Navigation of Structural Databases.

PubMed

Korb, Oliver; Kuhn, Bernd; Hert, Jérôme; Taylor, Neil; Cole, Jason; Groom, Colin; Stahl, Martin

2016-05-12

We present CSD-CrossMiner, a novel tool for pharmacophore-based searches in crystal structure databases. Intuitive pharmacophore queries describing, among others, protein-ligand interaction patterns, ligand scaffolds, or protein environments can be built and modified interactively. Matching crystal structures are overlaid onto the query and visualized as soon as they are available, enabling the researcher to quickly modify a hypothesis on the fly. We exemplify the utility of the approach by showing applications relevant to real-world drug discovery projects, including the identification of novel fragments for a specific protein environment or scaffold hopping. The ability to concurrently search protein-ligand binding sites extracted from the Protein Data Bank (PDB) and small organic molecules from the Cambridge Structural Database (CSD) using the same pharmacophore query further emphasizes the flexibility of CSD-CrossMiner. We believe that CSD-CrossMiner closes an important gap in mining structural data and will allow users to extract more value from the growing number of available crystal structures.
Prevention of IcaA regulated poly N-acetyl glucosamine formation in Staphylococcus aureus biofilm through new-drug like inhibitors: In silico approach and MD simulation study.

PubMed

Gupta, Ayushi; Mishra, Swechha; Singh, Sangeeta; Mishra, Sonali

2017-09-01

The effectiveness of various ligands against the protein structure of IcaA of the IcaABCD gene locus of Staphylococcus aureus were examined using the approach of structure based drug designing in reference with the protein's efficiency to form biofilms. Four compounds CID42738592, CID90468752, CID24277882, and CID6435208 were secluded from a database of 31,242 inhibitory ligands on the justification of the evaluated values falling under the four - tier structure based virtual screening. Under this principle value of least binding energy, human oral absorption and ADME properties were taken into consideration. Using the Glide module of Schrödinger, the above mentioned ligands showed an effective action against the protein IcaA which showed reduced activity as a glucosaminyl transferase. The complex of protein and ligand with best docking score was chosen for simulation studies. Structure based drug designing for the protein IcaA has given us potential leads as anti - biofilm agents. These screened out ligands might enable the development of new therapeutic strategies aimed at disrupting Staphylococcus aureus biofilms. The complex was showing stability towards the end of time for which it has been put for simulation. Thus molecule could be considered for making of biofilms. Copyright © 2017 Elsevier Ltd. All rights reserved.
Relationships between residue Voronoi volume and sequence conservation in proteins.

PubMed

Liu, Jen-Wei; Cheng, Chih-Wen; Lin, Yu-Feng; Chen, Shao-Yu; Hwang, Jenn-Kang; Yen, Shih-Chung

2018-02-01

Functional and biophysical constraints can cause different levels of sequence conservation in proteins. Previously, structural properties, e.g., relative solvent accessibility (RSA) and packing density of the weighted contact number (WCN), have been found to be related to protein sequence conservation (CS). The Voronoi volume has recently been recognized as a new structural property of the local protein structural environment reflecting CS. However, for surface residues, it is sensitive to water molecules surrounding the protein structure. Herein, we present a simple structural determinant termed the relative space of Voronoi volume (RSV); it uses the Voronoi volume and the van der Waals volume of particular residues to quantify the local structural environment. RSV (range, 0-1) is defined as (Voronoi volume-van der Waals volume)/Voronoi volume of the target residue. The concept of RSV describes the extent of available space for every protein residue. RSV and Voronoi profiles with and without water molecules (RSVw, RSV, VOw, and VO) were compared for 554 non-homologous proteins. RSV (without water) showed better Pearson's correlations with CS than did RSVw, VO, or VOw values. The mean correlation coefficient between RSV and CS was 0.51, which is comparable to the correlation between RSA and CS (0.49) and that between WCN and CS (0.56). RSV is a robust structural descriptor with and without water molecules and can quantitatively reflect evolutionary information in a single protein structure. Therefore, it may represent a practical structural determinant to study protein sequence, structure, and function relationships. Copyright © 2017 Elsevier B.V. All rights reserved.
Identification of Atg3 as an intrinsically disordered polypeptide yields insights into the molecular dynamics of autophagy-related proteins in yeast.

PubMed

Popelka, Hana; Uversky, Vladimir N; Klionsky, Daniel J

2014-06-01

The mechanism of autophagy relies on complex cell signaling and regulatory processes. Each cell contains many proteins that lack a rigid 3-dimensional structure under physiological conditions. These dynamic proteins, called intrinsically disordered proteins (IDPs) and protein regions (IDPRs), are predominantly involved in cell signaling and regulation. Yet, very little is known about their presence among proteins of the core autophagy machinery. In this work, we characterized the autophagy protein Atg3 from yeast and human along with 2 variants to show that Atg3 is an IDPRs-containing protein and that disorder/order predicted for these proteins from their amino acid sequence corresponds to their experimental characteristics. Based on this consensus, we applied the same prediction methods to all known Atg proteins from Saccharomyces cerevisiae. The data presented here provide an insight into the structural dynamics of each Atg protein. They also show that intrinsic disorder at various levels has to be taken into consideration for about half of the Atg proteins. This work should become a useful tool that will facilitate and encourage exploration of protein intrinsic disorder in autophagy.
TALEs from a spring--superelasticity of Tal effector protein structures.

PubMed

Flechsig, Holger

2014-01-01

Transcription activator-like effectors (TALEs) are DNA-related proteins that recognise and bind specific target sequences to manipulate gene expression. Recently determined crystal structures show that their common architecture reveals a superhelical overall structure that may undergo drastic conformational changes. To establish a link between structure and dynamics in TALE proteins we have employed coarse-grained elastic-network modelling of currently available structural data and implemented a force-probe setup that allowed us to investigate their mechanical behaviour in computer experiments. Based on the measured force-extension curves we conclude that TALEs exhibit superelastic dynamical properties allowing for large-scale global conformational changes along their helical axis, which represents the soft direction in such proteins. For moderate external forcing the TALE models behave like linear springs, obeying Hooke's law, and the investigated structures can be characterised and compared by a corresponding spring constant. We show that conformational flexibility underlying the large-scale motions is not homogeneously distributed over the TALE structure, but instead soft spot residues around which strain is accumulated and which turn out to represent key agents in the transmission of conformational motions are identified. They correspond to the RVD loop residues that have been experimentally determined to play an eminent role in the binding process of target DNA.
TALEs from a Spring – Superelasticity of Tal Effector Protein Structures

PubMed Central

Flechsig, Holger

2014-01-01

Transcription activator-like effectors (TALEs) are DNA-related proteins that recognise and bind specific target sequences to manipulate gene expression. Recently determined crystal structures show that their common architecture reveals a superhelical overall structure that may undergo drastic conformational changes. To establish a link between structure and dynamics in TALE proteins we have employed coarse-grained elastic-network modelling of currently available structural data and implemented a force-probe setup that allowed us to investigate their mechanical behaviour in computer experiments. Based on the measured force-extension curves we conclude that TALEs exhibit superelastic dynamical properties allowing for large-scale global conformational changes along their helical axis, which represents the soft direction in such proteins. For moderate external forcing the TALE models behave like linear springs, obeying Hooke's law, and the investigated structures can be characterised and compared by a corresponding spring constant. We show that conformational flexibility underlying the large-scale motions is not homogeneously distributed over the TALE structure, but instead soft spot residues around which strain is accumulated and which turn out to represent key agents in the transmission of conformational motions are identified. They correspond to the RVD loop residues that have been experimentally determined to play an eminent role in the binding process of target DNA. PMID:25313859
Efficient Multicriteria Protein Structure Comparison on Modern Processor Architectures

PubMed Central

Manolakos, Elias S.

2015-01-01

Fast increasing computational demand for all-to-all protein structures comparison (PSC) is a result of three confounding factors: rapidly expanding structural proteomics databases, high computational complexity of pairwise protein comparison algorithms, and the trend in the domain towards using multiple criteria for protein structures comparison (MCPSC) and combining results. We have developed a software framework that exploits many-core and multicore CPUs to implement efficient parallel MCPSC in modern processors based on three popular PSC methods, namely, TMalign, CE, and USM. We evaluate and compare the performance and efficiency of the two parallel MCPSC implementations using Intel's experimental many-core Single-Chip Cloud Computer (SCC) as well as Intel's Core i7 multicore processor. We show that the 48-core SCC is more efficient than the latest generation Core i7, achieving a speedup factor of 42 (efficiency of 0.9), making many-core processors an exciting emerging technology for large-scale structural proteomics. We compare and contrast the performance of the two processors on several datasets and also show that MCPSC outperforms its component methods in grouping related domains, achieving a high F-measure of 0.91 on the benchmark CK34 dataset. The software implementation for protein structure comparison using the three methods and combined MCPSC, along with the developed underlying rckskel algorithmic skeletons library, is available via GitHub. PMID:26605332
Structure prediction, expression, and antigenicity of c-terminal of GRP78.

PubMed

Aghamollaei, Hossein; Mousavi Gargari, Seyed Latif; Ghanei, Mostafa; Rasaee, Mohamad Javad; Amani, Jafar; Bakherad, Hamid; Farnoosh, Gholamreza

2017-01-01

Glucose-regulated protein 78 (GRP78) is a typical endoplasmic reticulum luminal chaperone having a main role in the activation of the unfolded protein response. Because of hypoxia and nutrient deprivation in the tumor microenvironment, expression of GRP78 in these cells becomes higher than the native cells, which makes it a suitable candidate for cancer targeting. Suppression of survival signals by antibody production against C-terminal domain of GR78 (CGRP) can induce apoptosis of cancer cells. The aim of this study was in silico analysis, recombinant production, and characterization of CGRP in Escherichia coli. Structural prediction of CGRP by bioinformatics tools was done and the construct containing optimized sequence was transferred to E. coli T7 shuffle. Expression was induced by isopropyl-β-d-thiogalactoside, and recombinant protein was purified by Ni-NTA agarose resin. The content of secondary structures was obtained by circular dichroism (CD) spectrum. CGRP immunogenicity was evaluated from the immunized mouse sera. SDS-PAGE analysis showed CGRP expression in E. coli. CD spectrum also confirmed prediction of structures by bioinformatics tools. The enzyme-linked immunosorbent assay using sera from immunized mice revealed CGRP as a good immunogen. The results obtained in this study showed that the structure of truncated CGRP is very similar to its structure in the whole protein context. This protein can be used in cancer researches. © 2015 International Union of Biochemistry and Molecular Biology, Inc.
Efficient Multicriteria Protein Structure Comparison on Modern Processor Architectures.

PubMed

Sharma, Anuj; Manolakos, Elias S

2015-01-01

Fast increasing computational demand for all-to-all protein structures comparison (PSC) is a result of three confounding factors: rapidly expanding structural proteomics databases, high computational complexity of pairwise protein comparison algorithms, and the trend in the domain towards using multiple criteria for protein structures comparison (MCPSC) and combining results. We have developed a software framework that exploits many-core and multicore CPUs to implement efficient parallel MCPSC in modern processors based on three popular PSC methods, namely, TMalign, CE, and USM. We evaluate and compare the performance and efficiency of the two parallel MCPSC implementations using Intel's experimental many-core Single-Chip Cloud Computer (SCC) as well as Intel's Core i7 multicore processor. We show that the 48-core SCC is more efficient than the latest generation Core i7, achieving a speedup factor of 42 (efficiency of 0.9), making many-core processors an exciting emerging technology for large-scale structural proteomics. We compare and contrast the performance of the two processors on several datasets and also show that MCPSC outperforms its component methods in grouping related domains, achieving a high F-measure of 0.91 on the benchmark CK34 dataset. The software implementation for protein structure comparison using the three methods and combined MCPSC, along with the developed underlying rckskel algorithmic skeletons library, is available via GitHub.
Algorithm, applications and evaluation for protein comparison by Ramanujan Fourier transform.

PubMed

Zhao, Jian; Wang, Jiasong; Hua, Wei; Ouyang, Pingkai

2015-12-01

The amino acid sequence of a protein determines its chemical properties, chain conformation and biological functions. Protein sequence comparison is of great importance to identify similarities of protein structures and infer their functions. Many properties of a protein correspond to the low-frequency signals within the sequence. Low frequency modes in protein sequences are linked to the secondary structures, membrane protein types, and sub-cellular localizations of the proteins. In this paper, we present Ramanujan Fourier transform (RFT) with a fast algorithm to analyze the low-frequency signals of protein sequences. The RFT method is applied to similarity analysis of protein sequences with the Resonant Recognition Model (RRM). The results show that the proposed fast RFT method on protein comparison is more efficient than commonly used discrete Fourier transform (DFT). RFT can detect common frequencies as significant feature for specific protein families, and the RFT spectrum heat-map of protein sequences demonstrates the information conservation in the sequence comparison. The proposed method offers a new tool for pattern recognition, feature extraction and structural analysis on protein sequences. Copyright © 2015 Elsevier Ltd. All rights reserved.
Quality assessment of protein model-structures based on structural and functional similarities.

PubMed

Konopka, Bogumil M; Nebel, Jean-Christophe; Kotulska, Malgorzata

2012-09-21

Experimental determination of protein 3D structures is expensive, time consuming and sometimes impossible. A gap between number of protein structures deposited in the World Wide Protein Data Bank and the number of sequenced proteins constantly broadens. Computational modeling is deemed to be one of the ways to deal with the problem. Although protein 3D structure prediction is a difficult task, many tools are available. These tools can model it from a sequence or partial structural information, e.g. contact maps. Consequently, biologists have the ability to generate automatically a putative 3D structure model of any protein. However, the main issue becomes evaluation of the model quality, which is one of the most important challenges of structural biology. GOBA--Gene Ontology-Based Assessment is a novel Protein Model Quality Assessment Program. It estimates the compatibility between a model-structure and its expected function. GOBA is based on the assumption that a high quality model is expected to be structurally similar to proteins functionally similar to the prediction target. Whereas DALI is used to measure structure similarity, protein functional similarity is quantified using standardized and hierarchical description of proteins provided by Gene Ontology combined with Wang's algorithm for calculating semantic similarity. Two approaches are proposed to express the quality of protein model-structures. One is a single model quality assessment method, the other is its modification, which provides a relative measure of model quality. Exhaustive evaluation is performed on data sets of model-structures submitted to the CASP8 and CASP9 contests. The validation shows that the method is able to discriminate between good and bad model-structures. The best of tested GOBA scores achieved 0.74 and 0.8 as a mean Pearson correlation to the observed quality of models in our CASP8 and CASP9-based validation sets. GOBA also obtained the best result for two targets of CASP8, and one of CASP9, compared to the contest participants. Consequently, GOBA offers a novel single model quality assessment program that addresses the practical needs of biologists. In conjunction with other Model Quality Assessment Programs (MQAPs), it would prove useful for the evaluation of single protein models.
An estimated 5% of new protein structures solved today represent a new Pfam family

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mistry, Jaina; Kloppmann, Edda; Rost, Burkhard

2013-11-01

This study uses the Pfam database to show that the sequence redundancy of protein structures deposited in the PDB is increasing. The possible reasons behind this trend are discussed. High-resolution structural knowledge is key to understanding how proteins function at the molecular level. The number of entries in the Protein Data Bank (PDB), the repository of all publicly available protein structures, continues to increase, with more than 8000 structures released in 2012 alone. The authors of this article have studied how structural coverage of the protein-sequence space has changed over time by monitoring the number of Pfam families that acquiredmore » their first representative structure each year from 1976 to 2012. Twenty years ago, for every 100 new PDB entries released, an estimated 20 Pfam families acquired their first structure. By 2012, this decreased to only about five families per 100 structures. The reasons behind the slower pace at which previously uncharacterized families are being structurally covered were investigated. It was found that although more than 50% of current Pfam families are still without a structural representative, this set is enriched in families that are small, functionally uncharacterized or rich in problem features such as intrinsically disordered and transmembrane regions. While these are important constraints, the reasons why it may not yet be time to give up the pursuit of a targeted but more comprehensive structural coverage of the protein-sequence space are discussed.« less
An experimental point of view on hydration/solvation in halophilic proteins

PubMed Central

Talon, Romain; Coquelle, Nicolas; Madern, Dominique; Girard, Eric

2014-01-01

Protein-solvent interactions govern the behaviors of proteins isolated from extreme halophiles. In this work, we compared the solvent envelopes of two orthologous tetrameric malate dehydrogenases (MalDHs) from halophilic and non-halophilic bacteria. The crystal structure of the MalDH from the non-halophilic bacterium Chloroflexus aurantiacus (Ca MalDH) solved, de novo, at 1.7 Å resolution exhibits numerous water molecules in its solvation shell. We observed that a large number of these water molecules are arranged in pentagonal polygons in the first hydration shell of Ca MalDH. Some of them are clustered in large networks, which cover non-polar amino acid surface. The crystal structure of MalDH from the extreme halophilic bacterium Salinibacter ruber (Sr) solved at 1.55 Å resolution shows that its surface is strongly enriched in acidic amino acids. The structural comparison of these two models is the first direct observation of the relative impact of acidic surface enrichment on the water structure organization between a halophilic protein and its non-adapted counterpart. The data show that surface acidic amino acids disrupt pentagonal water networks in the hydration shell. These crystallographic observations are discussed with respect to halophilic protein behaviors in solution PMID:24600446
Three-dimensional structure of the lithostathine protofibril, a protein involved in Alzheimer’s disease

PubMed Central

Grégoire, Catherine; Marco, Sergio; Thimonier, Jean; Duplan, Laure; Laurine, Emmanuelle; Chauvin, Jean-Paul; Michel, Bernard; Peyrot, Vincent; Verdier, Jean-Michel

2001-01-01

Neurodegenerative diseases are characterized by the presence of filamentous aggregates of proteins. We previously established that lithostathine is a protein overexpressed in the pre-clinical stages of Alzheimer’s disease. Furthermore, it is present in the pathognomonic lesions associated with Alzheimer’s disease. After self-proteolysis, the N-terminally truncated form of lithostathine leads to the formation of fibrillar aggregates. Here we observed using atomic force microscopy that these aggregates consisted of a network of protofibrils, each of which had a twisted appearance. Electron microscopy and image analysis showed that this twisted protofibril has a quadruple helical structure. Three-dimensional X-ray structural data and the results of biochemical experiments showed that when forming a protofibril, lithostathine was first assembled via lateral hydrophobic interactions into a tetramer. Each tetramer then linked up with another tetramer as the result of longitudinal electrostatic interactions. All these results were used to build a structural model for the lithostathine protofibril called the quadruple-helical filament (QHF-litho). In conclusion, lithostathine strongly resembles the prion protein in its dramatic proteolysis and amyloid proteins in its ability to form fibrils. PMID:11432819
Subfamily-specific adaptations in the structures of two penicillin-binding proteins from Mycobacterium tuberculosis

DOE PAGES

Prigozhin, Daniil M.; Krieger, Inna V.; Huizar, John P.; ...

2014-12-31

Beta-lactam antibiotics target penicillin-binding proteins including several enzyme classes essential for bacterial cell-wall homeostasis. To better understand the functional and inhibitor-binding specificities of penicillin-binding proteins from the pathogen, Mycobacterium tuberculosis, we carried out structural and phylogenetic analysis of two predicted D,D-carboxypeptidases, Rv2911 and Rv3330. Optimization of Rv2911 for crystallization using directed evolution and the GFP folding reporter method yielded a soluble quadruple mutant. Structures of optimized Rv2911 bound to phenylmethylsulfonyl fluoride and Rv3330 bound to meropenem show that, in contrast to the nonspecific inhibitor, meropenem forms an extended interaction with the enzyme along a conserved surface. Phylogenetic analysis shows thatmore » Rv2911 and Rv3330 belong to different clades that emerged in Actinobacteria and are not represented in model organisms such as Escherichia coli and Bacillus subtilis. Clade-specific adaptations allow these enzymes to fulfill distinct physiological roles despite strict conservation of core catalytic residues. The characteristic differences include potential protein-protein interaction surfaces and specificity-determining residues surrounding the catalytic site. Overall, these structural insights lay the groundwork to develop improved beta-lactam therapeutics for tuberculosis.« less
Subfamily-specific adaptations in the structures of two penicillin-binding proteins from Mycobacterium tuberculosis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Prigozhin, Daniil M.; Krieger, Inna V.; Huizar, John P.

Beta-lactam antibiotics target penicillin-binding proteins including several enzyme classes essential for bacterial cell-wall homeostasis. To better understand the functional and inhibitor-binding specificities of penicillin-binding proteins from the pathogen, Mycobacterium tuberculosis, we carried out structural and phylogenetic analysis of two predicted D,D-carboxypeptidases, Rv2911 and Rv3330. Optimization of Rv2911 for crystallization using directed evolution and the GFP folding reporter method yielded a soluble quadruple mutant. Structures of optimized Rv2911 bound to phenylmethylsulfonyl fluoride and Rv3330 bound to meropenem show that, in contrast to the nonspecific inhibitor, meropenem forms an extended interaction with the enzyme along a conserved surface. Phylogenetic analysis shows thatmore » Rv2911 and Rv3330 belong to different clades that emerged in Actinobacteria and are not represented in model organisms such as Escherichia coli and Bacillus subtilis. Clade-specific adaptations allow these enzymes to fulfill distinct physiological roles despite strict conservation of core catalytic residues. The characteristic differences include potential protein-protein interaction surfaces and specificity-determining residues surrounding the catalytic site. Overall, these structural insights lay the groundwork to develop improved beta-lactam therapeutics for tuberculosis.« less
An experimental point of view on hydration/solvation in halophilic proteins.

PubMed

Talon, Romain; Coquelle, Nicolas; Madern, Dominique; Girard, Eric

2014-01-01

Protein-solvent interactions govern the behaviors of proteins isolated from extreme halophiles. In this work, we compared the solvent envelopes of two orthologous tetrameric malate dehydrogenases (MalDHs) from halophilic and non-halophilic bacteria. The crystal structure of the MalDH from the non-halophilic bacterium Chloroflexus aurantiacus (Ca MalDH) solved, de novo, at 1.7 Å resolution exhibits numerous water molecules in its solvation shell. We observed that a large number of these water molecules are arranged in pentagonal polygons in the first hydration shell of Ca MalDH. Some of them are clustered in large networks, which cover non-polar amino acid surface. The crystal structure of MalDH from the extreme halophilic bacterium Salinibacter ruber (Sr) solved at 1.55 Å resolution shows that its surface is strongly enriched in acidic amino acids. The structural comparison of these two models is the first direct observation of the relative impact of acidic surface enrichment on the water structure organization between a halophilic protein and its non-adapted counterpart. The data show that surface acidic amino acids disrupt pentagonal water networks in the hydration shell. These crystallographic observations are discussed with respect to halophilic protein behaviors in solution.
Role of ice structuring proteins on freezing-thawing cycles of pasta sauces.

PubMed

Calderara, Marianna; Deorsola, Fabio A; Bensaid, Samir; Fino, Debora; Russo, Nunzio; Geobaldo, Francesco

2016-12-01

The freezing of the food is one of the most important technological developments for the storage of food in terms of quality and safety. The aim of this work was to study the role of an ice structuring protein (ISP) on freezing-thawing cycles of different solutions and commercial Italian pasta sauces. Ice structuring proteins were related to the modification of the structure of ice. The results showed that the freezing time of an aqueous solution containing the protein was reduced to about 20% with respect to a pure water solution. The same effect was demonstrated in sugar-containing solutions and in lipid-containing sauces. The study proved a specific role of ISP during thawing, inducing a time decrease similar to that of freezing and even more important in the case of tomato-based sauces. This work demonstrated the role of ISP in the freezing-thawing process, showing a significant reduction of processing in the freezing and thawing phase by adding the protein to pure water and different sugar-, salt- and lipid-containing solutions and commercial sauces, with considerable benefits for the food industry in terms of costs and food quality.
Structural Integrity of Proteins under Applied Bias during Solid-State Nanopore Translocation

NASA Astrophysics Data System (ADS)

Hasan, Mohammad R.; Khanzada, Raja Raheel; Mahmood, Mohammed A. I.; Ashfaq, Adnan; Iqbal, Samir M.

2015-03-01

The translocation behavior of proteins through solid-state nanopores can be used as a new way to detect and identify proteins. The ionic current through a nanopore that flows under applied bias gets perturbed when a biomolecule traverses the Nanopore. It is important for a protein detection scheme to know of any changes in the three-dimensional structure of the molecule during the process. Here we report the data on structural integrity of protein during translocation through nanopore under different applied biases. Nanoscale Molecular Dynamic was used to establish a framework to study the changes in protein structures as these travelled across the nanopore. The analysis revealed the contributions of structural changes of protein to its ionic current signature. As a model, thrombin protein crystalline structure was imported and positioned inside a 6 nm diameter pore in a 6 nm thick silicon nitride membrane. The protein was solvated in 1 M KCl at 295 K and the system was equilibrated for 20 ns to attain its minimum energy state. The simulation was performed at different electric fields from 0 to 1 kCal/(mol.Å.e). RMSD, radial distribution function, movement of the center of mass and velocity of the protein were calculated. The results showed linear increments in the velocity and perturbations in ionic current profile with increasing electric potential. Support Acknowledged from NSF through ECCS-1201878.
Protein complex prediction in large ontology attributed protein-protein interaction networks.

PubMed

Zhang, Yijia; Lin, Hongfei; Yang, Zhihao; Wang, Jian; Li, Yanpeng; Xu, Bo

2013-01-01

Protein complexes are important for unraveling the secrets of cellular organization and function. Many computational approaches have been developed to predict protein complexes in protein-protein interaction (PPI) networks. However, most existing approaches focus mainly on the topological structure of PPI networks, and largely ignore the gene ontology (GO) annotation information. In this paper, we constructed ontology attributed PPI networks with PPI data and GO resource. After constructing ontology attributed networks, we proposed a novel approach called CSO (clustering based on network structure and ontology attribute similarity). Structural information and GO attribute information are complementary in ontology attributed networks. CSO can effectively take advantage of the correlation between frequent GO annotation sets and the dense subgraph for protein complex prediction. Our proposed CSO approach was applied to four different yeast PPI data sets and predicted many well-known protein complexes. The experimental results showed that CSO was valuable in predicting protein complexes and achieved state-of-the-art performance.

Reversible and irreversible conformational transitions in myoglobin: role of hydrated amino acid ionic liquid.

PubMed

Sankaranarayanan, Kamatchi; Sathyaraj, Gopal; Nair, B U; Dhathathreyan, A

2012-04-12

Hydrated phenylalanine ionic liquid (Phe-IL) has been used to solubilize myoglobin (Mb). Structural stability of Mb in Phe-IL analyzed using fluorescence and circular dichroism spectroscopy shows that for low levels of hydration of Phe-IL there is a large red shift in the fluorescence emission wavelength and the protein transforms to complete β sheet from its native helical conformation. Rehydration or dilution reverses the β sheet to an α helix which on aging organizes to micrometer-sized fibrils. At concentrations higher than 200 μM, the protein changes from β to a more random coiled structure. Organization of the protein in Phe-IL in a Langmuir film at the air/water interface has been investigated using the surface pressure-molecular area isotherm and shows nearly the same surface tension for both pure Mb and Mb in Phe-IL. Scanning electron microscopy of the films of Mb in Phe-IL transferred using the Langmuir-Blodgett film technique show layered morphology. This study shows that the conformation of Mb is completely reversible going from β → helix → β sheet up to 200 μM of Phe-IL. Similar surface tension values for Mb in water and in Phe-IL suggests that direct ion binding interactions with the protein coupled with the change in local viscosity from the IL seems to not only alter the secondary structure of individual proteins but also drives the self-assembly of the protein molecules leading finally to fibril formation.
Prediction of Carbohydrate Binding Sites on Protein Surfaces with 3-Dimensional Probability Density Distributions of Interacting Atoms

PubMed Central

Tsai, Keng-Chang; Jian, Jhih-Wei; Yang, Ei-Wen; Hsu, Po-Chiang; Peng, Hung-Pin; Chen, Ching-Tai; Chen, Jun-Bo; Chang, Jeng-Yih; Hsu, Wen-Lian; Yang, An-Suei

2012-01-01

Non-covalent protein-carbohydrate interactions mediate molecular targeting in many biological processes. Prediction of non-covalent carbohydrate binding sites on protein surfaces not only provides insights into the functions of the query proteins; information on key carbohydrate-binding residues could suggest site-directed mutagenesis experiments, design therapeutics targeting carbohydrate-binding proteins, and provide guidance in engineering protein-carbohydrate interactions. In this work, we show that non-covalent carbohydrate binding sites on protein surfaces can be predicted with relatively high accuracy when the query protein structures are known. The prediction capabilities were based on a novel encoding scheme of the three-dimensional probability density maps describing the distributions of 36 non-covalent interacting atom types around protein surfaces. One machine learning model was trained for each of the 30 protein atom types. The machine learning algorithms predicted tentative carbohydrate binding sites on query proteins by recognizing the characteristic interacting atom distribution patterns specific for carbohydrate binding sites from known protein structures. The prediction results for all protein atom types were integrated into surface patches as tentative carbohydrate binding sites based on normalized prediction confidence level. The prediction capabilities of the predictors were benchmarked by a 10-fold cross validation on 497 non-redundant proteins with known carbohydrate binding sites. The predictors were further tested on an independent test set with 108 proteins. The residue-based Matthews correlation coefficient (MCC) for the independent test was 0.45, with prediction precision and sensitivity (or recall) of 0.45 and 0.49 respectively. In addition, 111 unbound carbohydrate-binding protein structures for which the structures were determined in the absence of the carbohydrate ligands were predicted with the trained predictors. The overall prediction MCC was 0.49. Independent tests on anti-carbohydrate antibodies showed that the carbohydrate antigen binding sites were predicted with comparable accuracy. These results demonstrate that the predictors are among the best in carbohydrate binding site predictions to date. PMID:22848404
Data on crystal organization in the structure of the Fab fragment from the NIST reference antibody, RM 8671.

PubMed

Gallagher, D T; Karageorgos, I; Hudgens, J W; Galvin, C V

2018-02-01

The reported data describe the crystallization, crystal packing, structure determination and twinning of the unliganded Fab (antigen-binding fragment) from the NISTmAb (standard reference material 8671). The raw atomic coordinates are available as Protein Data Bank structure 5K8A and biological aspects are described in the article, (Karageorgos et al., 2017) [1]. Crystal data show that the packing is unique, and show the basis for the crystal's twinned growth. Twinning is a common and often serious problem in protein structure determination by x-ray crystallography [2]. In the present case the twinning is due to a small deviation (about 0.3 nm) from 4-fold symmetry in the primary intermolecular interface. The deviation produces pseudosymmetry, generating slightly different conformations of the protein, and alternating strong and weak forms of key packing interfaces throughout the lattice.
The structure of the KlcA and ArdB proteins reveals a novel fold and antirestriction activity against Type I DNA restriction systems in vivo but not in vitro

PubMed Central

Serfiotis-Mitsa, Dimitra; Herbert, Andrew P.; Roberts, Gareth A.; Soares, Dinesh C.; White, John H.; Blakely, Garry W.; Uhrín, Dušan; Dryden, David T. F.

2010-01-01

Plasmids, conjugative transposons and phage frequently encode anti-restriction proteins to enhance their chances of entering a new bacterial host that is highly likely to contain a Type I DNA restriction and modification (RM) system. The RM system usually destroys the invading DNA. Some of the anti-restriction proteins are DNA mimics and bind to the RM enzyme to prevent it binding to DNA. In this article, we characterize ArdB anti-restriction proteins and their close homologues, the KlcA proteins from a range of mobile genetic elements; including an ArdB encoded on a pathogenicity island from uropathogenic Escherichia coli and a KlcA from an IncP-1b plasmid, pBP136 isolated from Bordetella pertussis. We show that all the ArdB and KlcA act as anti-restriction proteins and inhibit the four main families of Type I RM systems in vivo, but fail to block the restriction endonuclease activity of the archetypal Type I RM enzyme, EcoKI, in vitro indicating that the action of ArdB is indirect and very different from that of the DNA mimics. We also present the structure determined by NMR spectroscopy of the pBP136 KlcA protein. The structure shows a novel protein fold and it is clearly not a DNA structural mimic. PMID:20007596
Structure of the thermally stable Zika virus.

PubMed

Kostyuchenko, Victor A; Lim, Elisa X Y; Zhang, Shuijun; Fibriansah, Guntur; Ng, Thiam-Seng; Ooi, Justin S G; Shi, Jian; Lok, Shee-Mei

2016-05-19

Zika virus (ZIKV), formerly a neglected pathogen, has recently been associated with microcephaly in fetuses, and with Guillian-Barré syndrome in adults. Here we present the 3.7 Å resolution cryo-electron microscopy structure of ZIKV, and show that the overall architecture of the virus is similar to that of other flaviviruses. Sequence and structural comparisons of the ZIKV envelope (E) protein with other flaviviruses show that parts of the E protein closely resemble the neurovirulent West Nile and Japanese encephalitis viruses, while others are similar to dengue virus (DENV). However, the contribution of the E protein to flavivirus pathobiology is currently not understood. The virus particle was observed to be structurally stable even when incubated at 40 °C, in sharp contrast to the less thermally stable DENV. This is also reflected in the infectivity of ZIKV compared to DENV serotypes 2 and 4 (DENV2 and DENV4) at different temperatures. The cryo-electron microscopy structure shows a virus with a more compact surface. This structural stability of the virus may help it to survive in the harsh conditions of semen, saliva and urine. Antibodies or drugs that destabilize the structure may help to reduce the disease outcome or limit the spread of the virus.
Structural basis of carbohydrate recognition by lectin II from Ulex europaeus, a protein with a promiscuous carbohydrate-binding site.

PubMed

Loris, R; De Greve, H; Dao-Thi, M H; Messens, J; Imberty, A; Wyns, L

2000-08-25

Protein-carbohydrate interactions are the language of choice for inter- cellular communication. The legume lectins form a large family of homologous proteins that exhibit a wide variety of carbohydrate specificities. The legume lectin family is therefore highly suitable as a model system to study the structural principles of protein-carbohydrate recognition. Until now, structural data are only available for two specificity families: Man/Glc and Gal/GalNAc. No structural data are available for any of the fucose or chitobiose specific lectins. The crystal structure of Ulex europaeus (UEA-II) is the first of a legume lectin belonging to the chitobiose specificity group. The complexes with N-acetylglucosamine, galactose and fucosylgalactose show a promiscuous primary binding site capable of accommodating both N-acetylglucos amine or galactose in the primary binding site. The hydrogen bonding network in these complexes can be considered suboptimal, in agreement with the low affinities of these sugars. In the complexes with chitobiose, lactose and fucosyllactose this suboptimal hydrogen bonding network is compensated by extensive hydrophobic interactions in a Glc/GlcNAc binding subsite. UEA-II thus forms the first example of a legume lectin with a promiscuous binding site and illustrates the importance of hydrophobic interactions in protein-carbohydrate complexes. Together with other known legume lectin crystal structures, it shows how different specificities can be grafted upon a conserved structural framework. Copyright 2000 Academic Press.
Thermally induced disintegration of the oligomeric structure of alphaB-crystallin mutant F28S is associated with diminished chaperone activity.

PubMed

Kelley, Patrick B; Abraham, Edathara C

2003-10-01

alphaB-crystallin, a member of the small heat-shock protein (hsp) family of proteins, is able to function as a molecular chaperone by protecting other proteins from stress-induced aggregation by recognizing and binding to partially unfolded species of damaged proteins. The present work has investigated the role of phenylalanine-28 (F28) of the 22RLFDQFF28 region of alphaB-crystallin in maintaining chaperone function and oligomeric structure under physiological condition and under thermal stress. Bovine alphaB-crystallin was cloned for the first time and the cDNA sequence revealed greater than 90% homology to that of human, rat and mouse alphaB-crystallins. F28 was mutated to a serine followed by expression of the mutant F28S and the wild-type alphaB (alphaB-wt) in E. coli and subsequent purification of the protein by size-exclusion chromatography. Secondary and tertiary structure analyses showed some structural changes in the mutant. Chaperone activity and oligomeric size of the mutant was unchanged at 37 degrees C whereas at 58 degrees C the chaperone activity was significantly decreased and the oligomeric size ranged from low molecular weight to high molecular weight showing disintegration of the oligomeric structure. The data support the idea that the participation of large oligomeric structure rather than smaller units is required to have optimal chaperone activity and the hydrophobic F28 residue is needed for maintaining the native oligomeric structure under thermal stress.
Structure and dynamics of Ebola virus matrix protein VP40 by a coarse-grained Monte Carlo simulation

NASA Astrophysics Data System (ADS)

Pandey, Ras; Farmer, Barry

Ebola virus matrix protein VP40 (consisting of 326 residues) plays a critical role in viral assembly and its functions such as regulation of viral transcription, packaging, and budding of mature virions into the plasma membrane of infected cells. How does the protein VP40 go through structural evolution during the viral life cycle remains an open question? Using a coarse-grained Monte Carlo simulation we investigate the structural evolution of VP40 as a function of temperature with the input of a knowledge-based residue-residue interaction. A number local and global physical quantities (e.g. mobility profile, contact map, radius of gyration, structure factor) are analyzed with our large-scale simulations. Our preliminary data show that the structure of the protein evolves through different state with well-defined morphologies which can be identified and quantified via a detailed analysis of structure factor.
Rational design to improve thermostability and specific activity of the truncated Fibrobacter succinogenes 1,3-1,4-β-D-glucanase.

PubMed

Huang, Jian-Wen; Cheng, Ya-Shan; Ko, Tzu-Ping; Lin, Cheng-Yen; Lai, Hui-Lin; Chen, Chun-Chi; Ma, Yanhe; Zheng, Yingying; Huang, Chun-Hsiang; Zou, Peijian; Liu, Je-Ruei; Guo, Rey-Ting

2012-04-01

1,3-1,4-β-D-Glucanase has been widely used as a feed additive to help non-ruminant animals digest plant fibers, with potential in increasing nutrition turnover rate and reducing sanitary problems. Engineering of enzymes for better thermostability is of great importance because it not only can broaden their industrial applications, but also facilitate exploring the mechanism of enzyme stability from structural point of view. To obtain enzyme with higher thermostability and specific activity, structure-based rational design was carried out in this study. Eleven mutants of Fibrobacter succinogenes 1,3-1,4-β-D-glucanase were constructed in attempt to improve the enzyme properties. In particular, the crude proteins expressed in Pichia pastoris were examined firstly to ensure that the protein productions meet the need for industrial fermentation. The crude protein of V18Y mutant showed a 2 °C increment of Tm and W203Y showed ∼30% increment of the specific activity. To further investigate the structure-function relationship, some mutants were expressed and purified from P. pastoris and Escherichia coli. Notably, the specific activity of purified W203Y which was expressed in E. coli was 63% higher than the wild-type protein. The double mutant V18Y/W203Y showed the same increments of Tm and specific activity as the single mutants did. When expressed and purified from E. coli, V18Y/W203Y showed similar pattern of thermostability increment and 75% higher specific activity. Furthermore, the apo-form and substrate complex structures of V18Y/W203Y were solved by X-ray crystallography. Analyzing protein structure of V18Y/W203Y helps elucidate how the mutations could enhance the protein stability and enzyme activity.
Membrane remodeling by amyloidogenic and non-amyloidogenic proteins studied by EPR.

PubMed

Varkey, Jobin; Langen, Ralf

2017-07-01

The advancement in site-directed spin labeling of proteins has enabled EPR studies to expand into newer research areas within the umbrella of protein-membrane interactions. Recently, membrane remodeling by amyloidogenic and non-amyloidogenic proteins has gained a substantial interest in relation to driving and controlling vital cellular processes such as endocytosis, exocytosis, shaping of organelles like endoplasmic reticulum, Golgi and mitochondria, intracellular vesicular trafficking, formation of filopedia and multivesicular bodies, mitochondrial fusion and fission, and synaptic vesicle fusion and recycling in neurotransmission. Misregulation in any of these processes due to an aberrant protein (mutation or misfolding) or alteration of lipid metabolism can be detrimental to the cell and cause disease. Dissection of the structural basis of membrane remodeling by proteins is thus quite necessary for an understanding of the underlying mechanisms, but it remains a formidable task due to the difficulties of various common biophysical tools in monitoring the dynamic process of membrane binding and bending by proteins. This is largely since membranes generally complicate protein structure analysis and this problem is amplified for structural analysis in the presence of different types of membrane curvatures. Recent EPR studies on membrane remodeling by proteins show that a significant structural information can be generated to delineate the role of different protein modules, domains and individual amino acids in the generation of membrane curvature. These studies also show how EPR can complement the data obtained by high resolution techniques such as X-ray and NMR. This perspective covers the application of EPR in recent studies for understanding membrane remodeling by amyloidogenic and non-amyloidogenic proteins that is useful for researchers interested in using or complimenting EPR to gain better understanding of membrane remodeling. We also discuss how a single protein can generate different type of membrane curvatures using specific conformations for specific membrane structures and how EPR is a versatile tool well-suited to analyze subtle alterations in structures under such modifying conditions which otherwise would have been difficult using other biophysical tools. Copyright © 2017 Elsevier Inc. All rights reserved.
Structure and functional dynamics characterization of the ion channel of the human respiratory syncytial virus (hRSV) small hydrophobic protein (SH) transmembrane domain by combining molecular dynamics with excited normal modes.

PubMed

Araujo, Gabriela C; Silva, Ricardo H T; Scott, Luis P B; Araujo, Alexandre S; Souza, Fatima P; de Oliveira, Ronaldo Junio

2016-12-01

The human respiratory syncytial virus (hRSV) is the major cause of lower respiratory tract infection in children and elderly people worldwide. Its genome encodes 11 proteins including SH protein, whose functions are not well known. Studies show that SH protein increases RSV virulence degree and permeability to small compounds, suggesting it is involved in the formation of ion channels. The knowledge of SH structure and function is fundamental for a better understanding of its infection mechanism. The aim of this study was to model, characterize, and analyze the structural behavior of SH protein in the phospholipids bilayer environment. Molecular modeling of SH pentameric structure was performed, followed by traditional molecular dynamics (MD) simulations of the protein immersed in the lipid bilayer. Molecular dynamics with excited normal modes (MDeNM) was applied in the resulting system in order to investigate long time scale pore dynamics. MD simulations support that SH protein is stable in its pentameric form. Simulations also showed the presence of water molecules within the bilayer by density distribution, thus confirming that SH protein is a viroporin. This water transport was also observed in MDeNM studies with histidine residues of five chains (His22 and His51), playing a key role in pore permeability. The combination of traditional MD and MDeNM was a very efficient protocol to investigate functional conformational changes of transmembrane proteins that act as molecular channels. This protocol can support future investigations of drug candidates by acting on SH protein to inhibit viral infection. Graphical Abstract The ion channel of the human respiratory syncytial virus (hRSV) small hydrophobic protein (SH) transmembrane domainᅟ.
Terminal sequence importance of de novo proteins from binary-patterned library: stable artificial proteins with 11- or 12-amino acid alphabet.

PubMed

Okura, Hiromichi; Takahashi, Tsuyoshi; Mihara, Hisakazu

2012-06-01

Successful approaches of de novo protein design suggest a great potential to create novel structural folds and to understand natural rules of protein folding. For these purposes, smaller and simpler de novo proteins have been developed. Here, we constructed smaller proteins by removing the terminal sequences from stable de novo vTAJ proteins and compared stabilities between mutant and original proteins. vTAJ proteins were screened from an α3β3 binary-patterned library which was designed with polar/ nonpolar periodicities of α-helix and β-sheet. vTAJ proteins have the additional terminal sequences due to the method of constructing the genetically repeated library sequences. By removing the parts of the sequences, we successfully obtained the stable smaller de novo protein mutants with fewer amino acid alphabets than the originals. However, these mutants showed the differences on ANS binding properties and stabilities against denaturant and pH change. The terminal sequences, which were designed just as flexible linkers not as secondary structure units, sufficiently affected these physicochemical details. This study showed implications for adjusting protein stabilities by designing N- and C-terminal sequences.
Aggregation of trypsin and trypsin inhibitor by Al cation.

PubMed

Chanphai, P; Kreplak, L; Tajmir-Riahi, H A

2017-04-01

Al cation may trigger protein structural changes such as aggregation and fibrillation, causing neurodegenerative diseases. We report the effect of Al cation on the solution structures of trypsin (try) and trypsin inhibitor (tryi), using thermodynamic analysis, UV-Visible, Fourier transform infrared (FTIR) spectroscopic methods and atomic force microscopy (AFM). Thermodynamic parameters showed Al-protein bindings occur via H-bonding and van der Waals contacts for trypsin and trypsin inhibitor. AFM showed that Al cations are able to force trypsin into larger or more robust aggregates than trypsin inhibitor, with trypsin 5±1 SE (n=52) proteins per aggregate and for trypsin inhibitor 8.3±0.7 SE (n=118). Thioflavin T test showed no major protein fibrillation in the presence of Al cation. Al complexation induced more alterations of trypsin inhibitor conformation than trypsin. Copyright © 2017 Elsevier B.V. All rights reserved.
De novo protein structure prediction by dynamic fragment assembly and conformational space annealing.

PubMed

Lee, Juyong; Lee, Jinhyuk; Sasaki, Takeshi N; Sasai, Masaki; Seok, Chaok; Lee, Jooyoung

2011-08-01

Ab initio protein structure prediction is a challenging problem that requires both an accurate energetic representation of a protein structure and an efficient conformational sampling method for successful protein modeling. In this article, we present an ab initio structure prediction method which combines a recently suggested novel way of fragment assembly, dynamic fragment assembly (DFA) and conformational space annealing (CSA) algorithm. In DFA, model structures are scored by continuous functions constructed based on short- and long-range structural restraint information from a fragment library. Here, DFA is represented by the full-atom model by CHARMM with the addition of the empirical potential of DFIRE. The relative contributions between various energy terms are optimized using linear programming. The conformational sampling was carried out with CSA algorithm, which can find low energy conformations more efficiently than simulated annealing used in the existing DFA study. The newly introduced DFA energy function and CSA sampling algorithm are implemented into CHARMM. Test results on 30 small single-domain proteins and 13 template-free modeling targets of the 8th Critical Assessment of protein Structure Prediction show that the current method provides comparable and complementary prediction results to existing top methods. Copyright © 2011 Wiley-Liss, Inc.
Synthetic beta-solenoid proteins with the fragment-free computational design of a beta-hairpin extension

PubMed Central

MacDonald, James T.; Kabasakal, Burak V.; Godding, David; Kraatz, Sebastian; Henderson, Louie; Barber, James; Freemont, Paul S.; Murray, James W.

2016-01-01

The ability to design and construct structures with atomic level precision is one of the key goals of nanotechnology. Proteins offer an attractive target for atomic design because they can be synthesized chemically or biologically and can self-assemble. However, the generalized protein folding and design problem is unsolved. One approach to simplifying the problem is to use a repetitive protein as a scaffold. Repeat proteins are intrinsically modular, and their folding and structures are better understood than large globular domains. Here, we have developed a class of synthetic repeat proteins based on the pentapeptide repeat family of beta-solenoid proteins. We have constructed length variants of the basic scaffold and computationally designed de novo loops projecting from the scaffold core. The experimentally solved 3.56-Å resolution crystal structure of one designed loop matches closely the designed hairpin structure, showing the computational design of a backbone extension onto a synthetic protein core without the use of backbone fragments from known structures. Two other loop designs were not clearly resolved in the crystal structures, and one loop appeared to be in an incorrect conformation. We have also shown that the repeat unit can accommodate whole-domain insertions by inserting a domain into one of the designed loops. PMID:27573845
Membrane Protein Structure, Function, and Dynamics: a Perspective from Experiments and Theory

DOE PAGES

Cournia, Zoe; Allen, Toby W.; Andricioaei, Ioan; ...

2015-06-11

It is fundamental for the flourishing biological cells that membrane proteins mediate the process. Membrane-embedded transporters move ions and larger solutes across membranes; receptors mediate communication between the cell and its environment and membrane-embedded enzymes catalyze chemical reactions. Understanding these mechanisms of action requires knowledge of how the proteins couple to their fluid, hydrated lipid membrane environment. Here, we present here current studies in computational and experimental membrane protein biophysics, and show how they address outstanding challenges in understanding the complex environmental effects on the structure, function, and dynamics of membrane proteins.
Immunological characterization of recombinant soy protein allergen produced by Escherichia coli expression system.

PubMed

Babiker, E E; Azakami, H; Ogawa, T; Kato, A

2000-02-01

To elucidate the molecular mechanism of the allergenicity of soybean P34 protein recognized as the most allergenic protein in soybean, the protein was expressed in Escherichia coli transformed with a plasmid carrying P34 cDNA. SDS-PAGE pattern showed that the molecular weight of the recombinant P34 was approximately 2 kDa less than that of the native soybean P34. The difference in the molecular mass between these two proteins could be due to the native P34 in soybean being glycosylated at position Asn(170), whereas the recombinant protein generated in E. coli lacks this post-translational modification. Immunoblot analysis showed that both soybean and recombinant P34 proteins cross-reacted not only with polyclonal and monoclonal antibodies produced against P34 and crude soybean protein but also with patients' sera. The results suggest that the recombinant P34 is immunologically reactive, indicating that both proteins have similar epitope structures. Thus, the recombinant P34 produced by the E. coli expression system can be used as a standard allergen for molecular design to reduce the allergenic structure.
Analysis of core-periphery organization in protein contact networks reveals groups of structurally and functionally critical residues.

PubMed

Isaac, Arnold Emerson; Sinha, Sitabhra

2015-10-01

The representation of proteins as networks of interacting amino acids, referred to as protein contact networks (PCN), and their subsequent analyses using graph theoretic tools, can provide novel insights into the key functional roles of specific groups of residues. We have characterized the networks corresponding to the native states of 66 proteins (belonging to different families) in terms of their core-periphery organization. The resulting hierarchical classification of the amino acid constituents of a protein arranges the residues into successive layers - having higher core order - with increasing connection density, ranging from a sparsely linked periphery to a densely intra-connected core (distinct from the earlier concept of protein core defined in terms of the three-dimensional geometry of the native state, which has least solvent accessibility). Our results show that residues in the inner cores are more conserved than those at the periphery. Underlining the functional importance of the network core, we see that the receptor sites for known ligand molecules of most proteins occur in the innermost core. Furthermore, the association of residues with structural pockets and cavities in binding or active sites increases with the core order. From mutation sensitivity analysis, we show that the probability of deleterious or intolerant mutations also increases with the core order. We also show that stabilization centre residues are in the innermost cores, suggesting that the network core is critically important in maintaining the structural stability of the protein. A publicly available Web resource for performing core-periphery analysis of any protein whose native state is known has been made available by us at http://www.imsc.res.in/ ~sitabhra/proteinKcore/index.html.
Using more than 801 296 small-molecule crystal structures to aid in protein structure refinement and analysis

PubMed Central

Cole, Jason C.

2017-01-01

The Cambridge Structural Database (CSD) is the worldwide resource for the dissemination of all published three-dimensional structures of small-molecule organic and metal–organic compounds. This paper briefly describes how this collection of crystal structures can be used en masse in the context of macromolecular crystallography. Examples highlight how the CSD and associated software aid protein–ligand complex validation, and show how the CSD could be further used in the generation of geometrical restraints for protein structure refinement. PMID:28291758
In vitro characterization of six STUB1 variants in spinocerebellar ataxia 16 reveals altered structural properties for the encoded CHIP proteins

PubMed Central

Pakdaman, Yasaman; Sanchez-Guixé, Monica; Kleppe, Rune; Erdal, Sigrid; Bustad, Helene J.; Bjørkhaug, Lise; Haugarvoll, Kristoffer; Tzoulis, Charalampos; Heimdal, Ketil; Knappskog, Per M.; Johansson, Stefan

2017-01-01

Spinocerebellar ataxia, autosomal recessive 16 (SCAR16) is caused by biallelic mutations in the STIP1 homology and U-box containing protein 1 (STUB1) gene encoding the ubiquitin E3 ligase and dimeric co-chaperone C-terminus of Hsc70-interacting protein (CHIP). It has been proposed that the disease mechanism is related to CHIP’s impaired E3 ubiquitin ligase properties and/or interaction with its chaperones. However, there is limited knowledge on how these mutations affect the stability, folding, and protein structure of CHIP itself. To gain further insight, six previously reported pathogenic STUB1 variants (E28K, N65S, K145Q, M211I, S236T, and T246M) were expressed as recombinant proteins and studied using limited proteolysis, size-exclusion chromatography (SEC), and circular dichroism (CD). Our results reveal that N65S shows increased CHIP dimerization, higher levels of α-helical content, and decreased degradation rate compared with wild-type (WT) CHIP. By contrast, T246M demonstrates a strong tendency for aggregation, a more flexible protein structure, decreased levels of α-helical structures, and increased degradation rate compared with WT CHIP. E28K, K145Q, M211I, and S236T also show defects on structural properties compared with WT CHIP, although less profound than what observed for N65S and T246M. In conclusion, our results illustrate that some STUB1 mutations known to cause recessive SCAR16 have a profound impact on the protein structure, stability, and ability of CHIP to dimerize in vitro. These results add to the growing understanding on the mechanisms behind the disorder. PMID:28396517

In vitro characterization of six STUB1 variants in spinocerebellar ataxia 16 reveals altered structural properties for the encoded CHIP proteins.

PubMed

Pakdaman, Yasaman; Sanchez-Guixé, Monica; Kleppe, Rune; Erdal, Sigrid; Bustad, Helene J; Bjørkhaug, Lise; Haugarvoll, Kristoffer; Tzoulis, Charalampos; Heimdal, Ketil; Knappskog, Per M; Johansson, Stefan; Aukrust, Ingvild

2017-04-30

Spinocerebellar ataxia, autosomal recessive 16 (SCAR16) is caused by biallelic mutations in the STIP1 homology and U-box containing protein 1 ( STUB1 ) gene encoding the ubiquitin E3 ligase and dimeric co-chaperone C-terminus of Hsc70-interacting protein (CHIP). It has been proposed that the disease mechanism is related to CHIP's impaired E3 ubiquitin ligase properties and/or interaction with its chaperones. However, there is limited knowledge on how these mutations affect the stability, folding, and protein structure of CHIP itself. To gain further insight, six previously reported pathogenic STUB1 variants (E28K, N65S, K145Q, M211I, S236T, and T246M) were expressed as recombinant proteins and studied using limited proteolysis, size-exclusion chromatography (SEC), and circular dichroism (CD). Our results reveal that N65S shows increased CHIP dimerization, higher levels of α-helical content, and decreased degradation rate compared with wild-type (WT) CHIP. By contrast, T246M demonstrates a strong tendency for aggregation, a more flexible protein structure, decreased levels of α-helical structures, and increased degradation rate compared with WT CHIP. E28K, K145Q, M211I, and S236T also show defects on structural properties compared with WT CHIP, although less profound than what observed for N65S and T246M. In conclusion, our results illustrate that some STUB1 mutations known to cause recessive SCAR16 have a profound impact on the protein structure, stability, and ability of CHIP to dimerize in vitro. These results add to the growing understanding on the mechanisms behind the disorder. © 2017 The Author(s).
The nonchromatin substructures of the nucleus: the ribonucleoprotein (RNP)-containing and RNP-depleted matrices analyzed by sequential fractionation and resinless section electron microscopy

PubMed Central

1986-01-01

The nonchromatin structure or matrix of the nucleus has been studied using an improved fractionation in concert with resinless section electron microscopy. The resinless sections show the nucleus of the intact cell to be filled with a dense network or lattice composed of soluble proteins and chromatin in addition to the structural nuclear constituents. In the first fractionation step, soluble proteins are removed by extraction with Triton X-100, and the dense nuclear lattice largely disappears. Chromatin and nonchromatin nuclear fibers are now sharply imaged. Nuclear constituents are further separated into three well-defined, distinct protein fractions. Chromatin proteins are those that require intact DNA for their association with the nucleus and are released by 0.25 M ammonium sulfate after internucleosomal DNA is cut with DNAase I. The resulting structure retains most heterogeneous nuclear ribonucleoprotein (hnRNP) and is designated the RNP-containing nuclear matrix. The proteins of hnRNP are those associated with the nucleus only if RNA is intact. These are released when nuclear RNA is briefly digested with RNAase A. Ribonuclease digestion releases 97% of the hnRNA and its associated proteins. These proteins correspond to the hnRNP described by Pederson (Pederson, T., 1974, J. Mol. Biol., 83:163- 184) and are distinct from the proteins that remain in the ribonucleoprotein (RNP)-depleted nuclear matrix. The RNP-depleted nuclear matrix is a core structure that retains lamins A and C, the intermediate filaments, and a unique set of nuclear matrix proteins (Fey, E. G., K. M. Wan, and S. Penman, 1984, J. Cell Biol. 98:1973- 1984). This core had been previously designated the nuclear matrix- intermediate filament scaffold and its proteins are a third, distinct, and nonoverlapping subset of the nuclear nonhistone proteins. Visualizing the nuclear matrix using resinless sections shows that nuclear RNA plays an important role in matrix organization. Conventional Epon-embedded electron microscopy sections show comparatively little of the RNP-containing and RNP-depleted nuclear matrix structure. In contrast, resinless sections show matrix interior to be a three-dimensional network of thick filaments bounded by the nuclear lamina. The filaments are covered with 20-30-nm electron dense particles which may contain the hnRNA. The large electron dense bodies, enmeshed in the interior matrix fibers, have the characteristic morphology of nucleoli. Treatment of the nuclear matrix with RNAase results in the aggregation of the interior fibers and the extensive loss of the 20-30-nm particles.(ABSTRACT TRUNCATED AT 400 WORDS) PMID:3700470
Structure of Pseudoknot PK26 Shows 3D Domain Swapping in an RNA

NASA Technical Reports Server (NTRS)

Lietzke, Susan E; Barnes, Cindy L.

1998-01-01

3D domain swapping provides a facile pathway for the evolution of oligomeric proteins and allosteric mechanisms and a means for using monomer-oligomer equilibria to regulate biological activity. The term "3D domain swapping" describes the exchange of identical domains between two protein monomers to create an oligomer. 3D domain swapping has, so far, only been recognized in proteins. In this study, the structure of the pseudoknot PK26 is reported and it is a clear example of 3D domain swapping in RNA. PK26 was chosen for study because RNA pseudoknots are required structures in several biological processes and they arise frequently in in vitro selection experiments directed against protein targets. PK26 specifically inhibits HIV-1 reverse transcriptase with nanomolar affinity. We have now determined the 3.1 A resolution crystal structure of PK26 and find that it forms a 3D domain swapped dimer. PK26 shows extensive base pairing between and within strands. Formation of the dimer requires the linker region between the pseudoknot folds to adopt a unique conformation that allows a base within a helical stem to skip one base in the stacking register. Rearrangement of the linker would permit a monomeric pseudoknot to form. This structure shows how RNA can use 3D domain swapping to build large scale oligomers like the putative hexamer in the packaging RNA of bacteriophage Phi29.
Structure-function analysis of the extracellular domain of the pneumococcal cell division site positioning protein MapZ

NASA Astrophysics Data System (ADS)

Manuse, Sylvie; Jean, Nicolas L.; Guinot, Mégane; Lavergne, Jean-Pierre; Laguri, Cédric; Bougault, Catherine M.; Vannieuwenhze, Michael S.; Grangeasse, Christophe; Simorre, Jean-Pierre

2016-06-01

Accurate placement of the bacterial division site is a prerequisite for the generation of two viable and identical daughter cells. In Streptococcus pneumoniae, the positive regulatory mechanism involving the membrane protein MapZ positions precisely the conserved cell division protein FtsZ at the cell centre. Here we characterize the structure of the extracellular domain of MapZ and show that it displays a bi-modular structure composed of two subdomains separated by a flexible serine-rich linker. We further demonstrate in vivo that the N-terminal subdomain serves as a pedestal for the C-terminal subdomain, which determines the ability of MapZ to mark the division site. The C-terminal subdomain displays a patch of conserved amino acids and we show that this patch defines a structural motif crucial for MapZ function. Altogether, this structure-function analysis of MapZ provides the first molecular characterization of a positive regulatory process of bacterial cell division.
Structure of the Z Ring-associated Protein, ZapD, Bound to the C-terminal Domain of the Tubulin-like Protein, FtsZ, Suggests Mechanism of Z Ring Stabilization through FtsZ Cross-linking.

PubMed

Schumacher, Maria A; Huang, Kuo-Hsiang; Zeng, Wenjie; Janakiraman, Anuradha

2017-03-03

Cell division in most bacteria is mediated by the tubulin-like FtsZ protein, which polymerizes in a GTP-dependent manner to form the cytokinetic Z ring. A diverse repertoire of FtsZ-binding proteins affects FtsZ localization and polymerization to ensure correct Z ring formation. Many of these proteins bind the C-terminal domain (CTD) of FtsZ, which serves as a hub for FtsZ regulation. FtsZ ring-associated proteins, ZapA-D (Zaps), are important FtsZ regulatory proteins that stabilize FtsZ assembly and enhance Z ring formation by increasing lateral assembly of FtsZ protofilaments, which then form the Z ring. There are no structures of a Zap protein bound to FtsZ; therefore, how these proteins affect FtsZ polymerization has been unclear. Recent data showed ZapD binds specifically to the FtsZ CTD. Thus, to obtain insight into the ZapD-CTD interaction and how it may mediate FtsZ protofilament assembly, we determined the Escherichia coli ZapD-FtsZ CTD structure to 2.67 Å resolution. The structure shows that the CTD docks within a hydrophobic cleft in the ZapD helical domain and adopts an unusual structure composed of two turns of helix separated by a proline kink. FtsZ CTD residue Phe-377 inserts into the ZapD pocket, anchoring the CTD in place and permitting hydrophobic contacts between FtsZ residues Ile-374, Pro-375, and Leu-378 with ZapD residues Leu-74, Trp-77, Leu-91, and Leu-174. The structural findings were supported by mutagenesis coupled with biochemical and in vivo studies. The combined data suggest that ZapD acts as a molecular cross-linking reagent between FtsZ protofilaments to enhance FtsZ assembly. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Direct protein-protein conjugation by genetically introducing bioorthogonal functional groups into proteins.

PubMed

Kim, Sanggil; Ko, Wooseok; Sung, Bong Hyun; Kim, Sun Chang; Lee, Hyun Soo

2016-11-15

Proteins often function as complex structures in conjunction with other proteins. Because these complex structures are essential for sophisticated functions, developing protein-protein conjugates has gained research interest. In this study, site-specific protein-protein conjugation was performed by genetically incorporating an azide-containing amino acid into one protein and a bicyclononyne (BCN)-containing amino acid into the other. Three to four sites in each of the proteins were tested for conjugation efficiency, and three combinations showed excellent conjugation efficiency. The genetic incorporation of unnatural amino acids (UAAs) is technically simple and produces the mutant protein in high yield. In addition, the conjugation reaction can be conducted by simple mixing, and does not require additional reagents or linker molecules. Therefore, this method may prove very useful for generating protein-protein conjugates and protein complexes of biochemical significance. Copyright © 2016. Published by Elsevier Ltd.
Mapping of ligand-binding cavities in proteins.

PubMed

Andersson, C David; Chen, Brian Y; Linusson, Anna

2010-05-01

The complex interactions between proteins and small organic molecules (ligands) are intensively studied because they play key roles in biological processes and drug activities. Here, we present a novel approach to characterize and map the ligand-binding cavities of proteins without direct geometric comparison of structures, based on Principal Component Analysis of cavity properties (related mainly to size, polarity, and charge). This approach can provide valuable information on the similarities and dissimilarities, of binding cavities due to mutations, between-species differences and flexibility upon ligand-binding. The presented results show that information on ligand-binding cavity variations can complement information on protein similarity obtained from sequence comparisons. The predictive aspect of the method is exemplified by successful predictions of serine proteases that were not included in the model construction. The presented strategy to compare ligand-binding cavities of related and unrelated proteins has many potential applications within protein and medicinal chemistry, for example in the characterization and mapping of "orphan structures", selection of protein structures for docking studies in structure-based design, and identification of proteins for selectivity screens in drug design programs. 2009 Wiley-Liss, Inc.
Layers: A molecular surface peeling algorithm and its applications to analyze protein structures

PubMed Central

Karampudi, Naga Bhushana Rao; Bahadur, Ranjit Prasad

2015-01-01

We present an algorithm ‘Layers’ to peel the atoms of proteins as layers. Using Layers we show an efficient way to transform protein structures into 2D pattern, named residue transition pattern (RTP), which is independent of molecular orientations. RTP explains the folding patterns of proteins and hence identification of similarity between proteins is simple and reliable using RTP than with the standard sequence or structure based methods. Moreover, Layers generates a fine-tunable coarse model for the molecular surface by using non-random sampling. The coarse model can be used for shape comparison, protein recognition and ligand design. Additionally, Layers can be used to develop biased initial configuration of molecules for protein folding simulations. We have developed a random forest classifier to predict the RTP of a given polypeptide sequence. Layers is a standalone application; however, it can be merged with other applications to reduce the computational load when working with large datasets of protein structures. Layers is available freely at http://www.csb.iitkgp.ernet.in/applications/mol_layers/main. PMID:26553411
Structure-Based Phylogenetic Analysis of the Lipocalin Superfamily.

PubMed

Lakshmi, Balasubramanian; Mishra, Madhulika; Srinivasan, Narayanaswamy; Archunan, Govindaraju

2015-01-01

Lipocalins constitute a superfamily of extracellular proteins that are found in all three kingdoms of life. Although very divergent in their sequences and functions, they show remarkable similarity in 3-D structures. Lipocalins bind and transport small hydrophobic molecules. Earlier sequence-based phylogenetic studies of lipocalins highlighted that they have a long evolutionary history. However the molecular and structural basis of their functional diversity is not completely understood. The main objective of the present study is to understand functional diversity of the lipocalins using a structure-based phylogenetic approach. The present study with 39 protein domains from the lipocalin superfamily suggests that the clusters of lipocalins obtained by structure-based phylogeny correspond well with the functional diversity. The detailed analysis on each of the clusters and sub-clusters reveals that the 39 lipocalin domains cluster based on their mode of ligand binding though the clustering was performed on the basis of gross domain structure. The outliers in the phylogenetic tree are often from single member families. Also structure-based phylogenetic approach has provided pointers to assign putative function for the domains of unknown function in lipocalin family. The approach employed in the present study can be used in the future for the functional identification of new lipocalin proteins and may be extended to other protein families where members show poor sequence similarity but high structural similarity.
As Simple As Possible, but Not Simpler: Exploring the Fidelity of Coarse-Grained Protein Models for Simulated Force Spectroscopy

PubMed Central

Rottler, Jörg; Plotkin, Steven S.

2016-01-01

Mechanical unfolding of a single domain of loop-truncated superoxide dismutase protein has been simulated via force spectroscopy techniques with both all-atom (AA) models and several coarse-grained models having different levels of resolution: A Gō model containing all heavy atoms in the protein (HA-Gō), the associative memory, water mediated, structure and energy model (AWSEM) which has 3 interaction sites per amino acid, and a Gō model containing only one interaction site per amino acid at the Cα position (Cα-Gō). To systematically compare results across models, the scales of time, energy, and force had to be suitably renormalized in each model. Surprisingly, the HA-Gō model gives the softest protein, exhibiting much smaller force peaks than all other models after the above renormalization. Clustering to render a structural taxonomy as the protein unfolds showed that the AA, HA-Gō, and Cα-Gō models exhibit a single pathway for early unfolding, which eventually bifurcates repeatedly to multiple branches only after the protein is about half-unfolded. The AWSEM model shows a single dominant unfolding pathway over the whole range of unfolding, in contrast to all other models. TM alignment, clustering analysis, and native contact maps show that the AWSEM pathway has however the most structural similarity to the AA model at high nativeness, but the least structural similarity to the AA model at low nativeness. In comparison to the AA model, the sequence of native contact breakage is best predicted by the HA-Gō model. All models consistently predict a similar unfolding mechanism for early force-induced unfolding events, but diverge in their predictions for late stage unfolding events when the protein is more significantly disordered. PMID:27898663
As Simple As Possible, but Not Simpler: Exploring the Fidelity of Coarse-Grained Protein Models for Simulated Force Spectroscopy.

PubMed

Habibi, Mona; Rottler, Jörg; Plotkin, Steven S

2016-11-01

Mechanical unfolding of a single domain of loop-truncated superoxide dismutase protein has been simulated via force spectroscopy techniques with both all-atom (AA) models and several coarse-grained models having different levels of resolution: A Gō model containing all heavy atoms in the protein (HA-Gō), the associative memory, water mediated, structure and energy model (AWSEM) which has 3 interaction sites per amino acid, and a Gō model containing only one interaction site per amino acid at the Cα position (Cα-Gō). To systematically compare results across models, the scales of time, energy, and force had to be suitably renormalized in each model. Surprisingly, the HA-Gō model gives the softest protein, exhibiting much smaller force peaks than all other models after the above renormalization. Clustering to render a structural taxonomy as the protein unfolds showed that the AA, HA-Gō, and Cα-Gō models exhibit a single pathway for early unfolding, which eventually bifurcates repeatedly to multiple branches only after the protein is about half-unfolded. The AWSEM model shows a single dominant unfolding pathway over the whole range of unfolding, in contrast to all other models. TM alignment, clustering analysis, and native contact maps show that the AWSEM pathway has however the most structural similarity to the AA model at high nativeness, but the least structural similarity to the AA model at low nativeness. In comparison to the AA model, the sequence of native contact breakage is best predicted by the HA-Gō model. All models consistently predict a similar unfolding mechanism for early force-induced unfolding events, but diverge in their predictions for late stage unfolding events when the protein is more significantly disordered.
Host nuclear proteins expressed in simian virus 40-transformed and -infected cells.

PubMed Central

Melero, J A; Tur, S; Carroll, R B

1980-01-01

Two new families of host proteins (Mr, 48,000 and 55,000), in additional to the viral large (T) and small tumor antigens, are precipitable, with anti-T antiserum, from cells transformed or infected by the DNA tumor virus simian virus 40 (SV40). Rabbit anti-mouse 48,000 protein antiserum reacts specifically with SV40-infected or -transformed mouse cells to give nuclear staining indistinguishable from T-antigen staining but does not react with SV40-transformed human cells which nevertheless have structurally analogous 48,000 proteins, nor does it give nuclear fluorescence with untransformed mouse cells. Comparison of the partial proteolytic digests of the 48,000 proteins from cultured cells of various mammalian species shows that they are structurally related but not related to the 55,000 or large T-antigen proteins. The 55,000 proteins from the various mammalian species were also structurally related. Images PMID:6244576
Site-directed mutagenesis of Azotobacter vinelandii ferredoxin I: [Fe-S] cluster-driven protein rearrangement.

PubMed Central

Martín, A E; Burgess, B K; Stout, C D; Cash, V L; Dean, D R; Jensen, G M; Stephens, P J

1990-01-01

Azotobacter vinelandii ferredoxin I is a small protein that contains one [4Fe-4S] cluster and one [3Fe-4S] cluster. Recently the x-ray crystal structure has been redetermined and the fdxA gene, which encodes the protein, has been cloned and sequenced. Here we report the site-directed mutation of Cys-20, which is a ligand of the [4Fe-4S] cluster in the native protein, to alanine and the characterization of the protein product by x-ray crystallographic and spectroscopic methods. The data show that the mutant protein again contains one [4Fe-4S] cluster and one [3Fe-4S] cluster. The new [4Fe-4S] cluster obtains its fourth ligand from Cys-24, a free cysteine in the native structure. The formation of this [4Fe-4S] cluster drives rearrangement of the protein structure. PMID:2153958
Biomolecular interactions modulate macromolecular structure and dynamics in atomistic model of a bacterial cytoplasm.

PubMed

Yu, Isseki; Mori, Takaharu; Ando, Tadashi; Harada, Ryuhei; Jung, Jaewoon; Sugita, Yuji; Feig, Michael

2016-11-01

Biological macromolecules function in highly crowded cellular environments. The structure and dynamics of proteins and nucleic acids are well characterized in vitro, but in vivo crowding effects remain unclear. Using molecular dynamics simulations of a comprehensive atomistic model cytoplasm we found that protein-protein interactions may destabilize native protein structures, whereas metabolite interactions may induce more compact states due to electrostatic screening. Protein-protein interactions also resulted in significant variations in reduced macromolecular diffusion under crowded conditions, while metabolites exhibited significant two-dimensional surface diffusion and altered protein-ligand binding that may reduce the effective concentration of metabolites and ligands in vivo. Metabolic enzymes showed weak non-specific association in cellular environments attributed to solvation and entropic effects. These effects are expected to have broad implications for the in vivo functioning of biomolecules. This work is a first step towards physically realistic in silico whole-cell models that connect molecular with cellular biology.
Identification of Extracellular Segments by Mass Spectrometry Improves Topology Prediction of Transmembrane Proteins.

PubMed

Langó, Tamás; Róna, Gergely; Hunyadi-Gulyás, Éva; Turiák, Lilla; Varga, Julia; Dobson, László; Várady, György; Drahos, László; Vértessy, Beáta G; Medzihradszky, Katalin F; Szakács, Gergely; Tusnády, Gábor E

2017-02-13

Transmembrane proteins play crucial role in signaling, ion transport, nutrient uptake, as well as in maintaining the dynamic equilibrium between the internal and external environment of cells. Despite their important biological functions and abundance, less than 2% of all determined structures are transmembrane proteins. Given the persisting technical difficulties associated with high resolution structure determination of transmembrane proteins, additional methods, including computational and experimental techniques remain vital in promoting our understanding of their topologies, 3D structures, functions and interactions. Here we report a method for the high-throughput determination of extracellular segments of transmembrane proteins based on the identification of surface labeled and biotin captured peptide fragments by LC/MS/MS. We show that reliable identification of extracellular protein segments increases the accuracy and reliability of existing topology prediction algorithms. Using the experimental topology data as constraints, our improved prediction tool provides accurate and reliable topology models for hundreds of human transmembrane proteins.
Impact of Protein-Metal Ion Interactions on the Crystallization of Silk Fibroin Protein

NASA Astrophysics Data System (ADS)

Hu, Xiao; Lu, Qiang; Kaplan, David; Cebe, Peggy

2009-03-01

Proteins can easily form bonds with a variety of metal ions, which provides many unique biological functions for the protein structures, and therefore controls the overall structural transformation of proteins. We use advanced thermal analysis methods such as temperature modulated differential scanning calorimetry and quasi-isothermal TMDSC, combined with Fourier transform infrared spectroscopy, and scanning electron microscopy, to investigate the protein-metallic ion interactions in Bombyx mori silk fibroin proteins. Silk samples were mixed with different metal ions (Ca^2+, K^+, Ma^2+, Na^+, Cu^2+, Mn^2+) with different mass ratios, and compared with the physical conditions in the silkworm gland. Results show that all metallic ions can directly affect the crystallization behavior and glass transition of silk fibroin. However, different ions tend to have different structural impact, including their role as plasticizer or anti-plasticizer. Detailed studies reveal important information allowing us better to understand the natural silk spinning and crystallization process.
Characterization of a novel domain ‘GATE’ in the ABC protein DrrA and its role in drug efflux by the DrrAB complex

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Han; Rahman, Sadia; Li, Wen

2015-03-27

A novel domain, GATE (Glycine-loop And Transducer Element), is identified in the ABC protein DrrA. This domain shows sequence and structural conservation among close homologs of DrrA as well as distantly-related ABC proteins. Among the highly conserved residues in this domain are three glycines, G215, G221 and G231, of which G215 was found to be critical for stable expression of the DrrAB complex. Other conserved residues, including E201, G221, K227 and G231, were found to be critical for the catalytic and transport functions of the DrrAB transporter. Structural analysis of both the previously published crystal structure of the DrrA homologmore » MalK and the modeled structure of DrrA showed that G215 makes close contacts with residues in and around the Walker A motif, suggesting that these interactions may be critical for maintaining the integrity of the ATP binding pocket as well as the complex. It is also shown that G215A or K227R mutation diminishes some of the atomic interactions essential for ATP catalysis and overall transport function. Therefore, based on both the biochemical and structural analyses, it is proposed that the GATE domain, located outside of the previously identified ATP binding and hydrolysis motifs, is an additional element involved in ATP catalysis. - Highlights: • A novel domain ‘GATE’ is identified in the ABC protein DrrA. • GATE shows high sequence and structural conservation among diverse ABC proteins. • GATE is located outside of the previously studied ATP binding and hydrolysis motifs. • Conserved GATE residues are critical for stability of DrrAB and for ATP catalysis.« less
Template-Based Modeling of Protein-RNA Interactions.

PubMed

Zheng, Jinfang; Kundrotas, Petras J; Vakser, Ilya A; Liu, Shiyong

2016-09-01

Protein-RNA complexes formed by specific recognition between RNA and RNA-binding proteins play an important role in biological processes. More than a thousand of such proteins in human are curated and many novel RNA-binding proteins are to be discovered. Due to limitations of experimental approaches, computational techniques are needed for characterization of protein-RNA interactions. Although much progress has been made, adequate methodologies reliably providing atomic resolution structural details are still lacking. Although protein-RNA free docking approaches proved to be useful, in general, the template-based approaches provide higher quality of predictions. Templates are key to building a high quality model. Sequence/structure relationships were studied based on a representative set of binary protein-RNA complexes from PDB. Several approaches were tested for pairwise target/template alignment. The analysis revealed a transition point between random and correct binding modes. The results showed that structural alignment is better than sequence alignment in identifying good templates, suitable for generating protein-RNA complexes close to the native structure, and outperforms free docking, successfully predicting complexes where the free docking fails, including cases of significant conformational change upon binding. A template-based protein-RNA interaction modeling protocol PRIME was developed and benchmarked on a representative set of complexes.
Biodegradation of the chitin-protein complex in crustacean cuticle

USGS Publications Warehouse

Artur, Stankiewicz B.; Mastalerz, Maria; Hof, C.H.J.; Bierstedt, A.; Flannery, M.B.; Briggs, D.E.G.; Evershed, R.P.

1998-01-01

Arthropod cuticles consist predominantly of chitin cross-linked with proteins. While there is some experimental evidence that this chitin-protein complex may resist decay, the chemical changes that occur during degradation have not been investigated in detail. The stomatopod crustacean Neogonodactylus oerstedii was decayed in the laboratory under anoxic conditions. A combination of pyrolysis-gas chromatography/mass spectrometry and FTIR revealed extensive chemical changes after just 2 weeks that resulted in a cuticle composition dominated by chitin. Quantitative analysis of amino acids (by HPLC) and chitin showed that the major loss of proteins and chitin occurred between weeks 1 and 2. After 8 weeks tyrosine, tryptophan and valine are the most prominent amino acid moieties, showing their resistance to degradation. The presence of cyclic ketones in the pyrolysates indicates that mucopolysaccharides or other bound non-chitinous carbohydrates are also resistant to decay. There is no evidence of structural degradation of chitin prior to 8 weeks when FTIR revealed a reduction in chitin-specific bands. The chemical changes are paralleled by structural changes in the cuticle, which becomes an increasingly open structure consisting of loose chitinous fibres. The rapid rate of decay in the experiments suggests that where chitin and protein are preserved in fossil cuticles degradation must have been inhibited.Arthropod cuticles consist predominantly of chitin cross-linked with proteins. While there is some experimental evidence that this chitin-protein complex may resist decay, the chemical changes that occur during degradation have not been investigated in detail. The stomatopod crustacean Neogonodactylus oerstedii was decayed in the laboratory under anoxic conditions. A combination of pyrolysis-gas chromatography/mass spectrometry and FTIR revealed extensive chemical changes after just 2 weeks that resulted in a cuticle composition dominated by chitin. Quantitative analysis of amino acids (by HPLC) and chitin showed that the major loss of proteins and chitin occurred between weeks 1 and 2. After 8 weeks tyrosine, tryptophan and valine are the most prominent amino acid moieties, showing their resistance to degradation. The presence of cyclic ketones in the pyrolysates indicates that mucopolysaccharides or other bound non-chitinous carbohydrates are also resistant to decay. There is no evidence of structural degradation of chitin prior to 8 weeks when FTIR revealed a reduction in chitin-specific bands. The chemical changes are paralleled by structural changes in the cuticle, which becomes an increasingly open structure consisting of loose chitinous fibres. The rapid rate of decay in the experiments suggests, that where chitin and protein are preserved in fossil cuticles degradation must have been inhibited.
Structural and Functional Studies of H. seropedicae RecA Protein – Insights into the Polymerization of RecA Protein as Nucleoprotein Filament

DOE Office of Scientific and Technical Information (OSTI.GOV)

Leite, Wellington C.; Galvão, Carolina W.; Saab, Sérgio C.

The bacterial RecA protein plays a role in the complex system of DNA damage repair. Here, we report the functional and structural characterization of the Herbaspirillum seropedicae RecA protein (HsRecA). HsRecA protein is more efficient at displacing SSB protein from ssDNA than Escherichia coli RecA protein. HsRecA also promotes DNA strand exchange more efficiently. The three dimensional structure of HsRecA-ADP/ATP complex has been solved to 1.7 Å resolution. HsRecA protein contains a small N-terminal domain, a central core ATPase domain and a large C-terminal domain, that are similar to homologous bacterial RecA proteins. Comparative structural analysis showed that the N-terminalmore » polymerization motif of archaeal and eukaryotic RecA family proteins are also present in bacterial RecAs. Reconstruction of electrostatic potential from the hexameric structure of HsRecA-ADP/ATP revealed a high positive charge along the inner side, where ssDNA is bound inside the filament. The properties of this surface may explain the greater capacity of HsRecA protein to bind ssDNA, forming a contiguous nucleoprotein filament, displace SSB and promote DNA exchange relative to EcRecA. In conclusion, our functional and structural analyses provide insight into the molecular mechanisms of polymerization of bacterial RecA as a helical nucleoprotein filament.« less

Structural and Functional Studies of H. seropedicae RecA Protein – Insights into the Polymerization of RecA Protein as Nucleoprotein Filament

PubMed Central

Galvão, Carolina W.; Saab, Sérgio C.; Iulek, Jorge; Etto, Rafael M.; Steffens, Maria B. R.; Chitteni-Pattu, Sindhu; Stanage, Tyler; Keck, James L.; Cox, Michael M.

2016-01-01

The bacterial RecA protein plays a role in the complex system of DNA damage repair. Here, we report the functional and structural characterization of the Herbaspirillum seropedicae RecA protein (HsRecA). HsRecA protein is more efficient at displacing SSB protein from ssDNA than Escherichia coli RecA protein. HsRecA also promotes DNA strand exchange more efficiently. The three dimensional structure of HsRecA-ADP/ATP complex has been solved to 1.7 Å resolution. HsRecA protein contains a small N-terminal domain, a central core ATPase domain and a large C-terminal domain, that are similar to homologous bacterial RecA proteins. Comparative structural analysis showed that the N-terminal polymerization motif of archaeal and eukaryotic RecA family proteins are also present in bacterial RecAs. Reconstruction of electrostatic potential from the hexameric structure of HsRecA-ADP/ATP revealed a high positive charge along the inner side, where ssDNA is bound inside the filament. The properties of this surface may explain the greater capacity of HsRecA protein to bind ssDNA, forming a contiguous nucleoprotein filament, displace SSB and promote DNA exchange relative to EcRecA. Our functional and structural analyses provide insight into the molecular mechanisms of polymerization of bacterial RecA as a helical nucleoprotein filament. PMID:27447485
Protein secondary structure and stability determined by combining exoproteolysis and matrix-assisted laser desorption/ionization time-of-flight mass spectrometry.

PubMed

Villanueva, Josep; Villegas, Virtudes; Querol, Enrique; Avilés, Francesc X; Serrano, Luis

2002-09-01

In the post-genomic era, several projects focused on the massive experimental resolution of the three-dimensional structures of all the proteins of different organisms have been initiated. Simultaneously, significant progress has been made in the ab initio prediction of protein three-dimensional structure. One of the keys to the success of such a prediction is the use of local information (i.e. secondary structure). Here we describe a new limited proteolysis methodology, based on the use of unspecific exoproteases coupled with matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS), to map quickly secondary structure elements of a protein from both ends, the N- and C-termini. We show that the proteolytic patterns (mass spectra series) obtained can be interpreted in the light of the conformation and local stability of the analyzed proteins, a direct correlation being observed between the predicted and the experimentally derived protein secondary structure. Further, this methodology can be easily applied to check rapidly the folding state of a protein and characterize mutational effects on protein conformation and stability. Moreover, given global stability information, this methodology allows one to locate the protein regions of increased or decreased conformational stability. All of this can be done with a small fraction of the amount of protein required by most of the other methods for conformational analysis. Thus limited exoproteolysis, together with MALDI-TOF MS, can be a useful tool to achieve quickly the elucidation of protein structure and stability. Copyright 2002 John Wiley & Sons, Ltd.
Characterization of protein hydration by solution NMR spectroscopy

NASA Astrophysics Data System (ADS)

Wand, Joshua

A comprehensive understanding of the interactions between protein molecules and hydration water remains elusive. Solution nuclear magnetic resonance (NMR) spectroscopy has been proposed as a means to characterize these interactions but is plagued with artifacts when employed in bulk aqueous solution. Encapsulation of proteins in reverse micelles prepared in short chain alkane solvents can overcome these technical limitations. Application of this approach has revealed that the interaction of water with the surface of protein molecules is quite heterogeneous with some regions of the protein having long-lived interactions while other regions show relatively transient hydration. Results from several proteins will be presented including ubiquitin, staphylococcal nuclease, interleukin 1beta, hen egg white lysozyme (HEWL) and T4 lysozyme. Ubiquitin and interleukin 1beta are signaling proteins and interact with other proteins through formation of dry protein-protein interfaces. Interestingly, the protein surfaces of the free proteins show relatively slowed (restricted) motion at the surface, which is indicative of low residual entropy. Other regions of the protein surface have relatively high mobility water. These results are consistent with the idea that proteins have evolved to maximize the hydrophobic effect in optimization of binding with protein partners. As predicted by simulation and theory, we find that hydration of internal hydrophobic cavities of interleukin 1beta and T4 lysozyme is highly disfavored. In contrast, the hydrophilic polar cavity of HEWL is occupied by water. Initial structural correlations suggest that hydration of alpha helical structure is characterized by relatively mobile water while those of beta strands and loops are more ordered and slowed. These and other results from this set of proteins reveals that the dynamical and structural character of hydration of proteins is heterogeneous and complex. Supported by the National Science Foundation.
Structural and functional characterizations of SsgB, a conserved activator of developmental cell division in morphologically complex actinomycetes.

PubMed

Xu, Qingping; Traag, Bjørn A; Willemse, Joost; McMullan, Daniel; Miller, Mitchell D; Elsliger, Marc-André; Abdubek, Polat; Astakhova, Tamara; Axelrod, Herbert L; Bakolitsa, Constantina; Carlton, Dennis; Chen, Connie; Chiu, Hsiu-Ju; Chruszcz, Maksymilian; Clayton, Thomas; Das, Debanu; Deller, Marc C; Duan, Lian; Ellrott, Kyle; Ernst, Dustin; Farr, Carol L; Feuerhelm, Julie; Grant, Joanna C; Grzechnik, Anna; Grzechnik, Slawomir K; Han, Gye Won; Jaroszewski, Lukasz; Jin, Kevin K; Klock, Heath E; Knuth, Mark W; Kozbial, Piotr; Krishna, S Sri; Kumar, Abhinav; Marciano, David; Minor, Wladek; Mommaas, A Mieke; Morse, Andrew T; Nigoghossian, Edward; Nopakun, Amanda; Okach, Linda; Oommachen, Silvya; Paulsen, Jessica; Puckett, Christina; Reyes, Ron; Rife, Christopher L; Sefcovic, Natasha; Tien, Henry J; Trame, Christine B; van den Bedem, Henry; Wang, Shuren; Weekes, Dana; Hodgson, Keith O; Wooley, John; Deacon, Ashley M; Godzik, Adam; Lesley, Scott A; Wilson, Ian A; van Wezel, Gilles P

2009-09-11

SsgA-like proteins (SALPs) are a family of homologous cell division-related proteins that occur exclusively in morphologically complex actinomycetes. We show that SsgB, a subfamily of SALPs, is the archetypal SALP that is functionally conserved in all sporulating actinomycetes. Sporulation-specific cell division of Streptomyces coelicolor ssgB mutants is restored by introduction of distant ssgB orthologues from other actinomycetes. Interestingly, the number of septa (and spores) of the complemented null mutants is dictated by the specific ssgB orthologue that is expressed. The crystal structure of the SsgB from Thermobifida fusca was determined at 2.6 A resolution and represents the first structure for this family. The structure revealed similarities to a class of eukaryotic "whirly" single-stranded DNA/RNA-binding proteins. However, the electro-negative surface of the SALPs suggests that neither SsgB nor any of the other SALPs are likely to interact with nucleotide substrates. Instead, we show that a conserved hydrophobic surface is likely to be important for SALP function and suggest that proteins are the likely binding partners.
How curvature-generating proteins build scaffolds on membrane nanotubes

PubMed Central

Evergren, Emma; Golushko, Ivan; Prévost, Coline; Renard, Henri-François; Johannes, Ludger; McMahon, Harvey T.; Lorman, Vladimir; Voth, Gregory A.; Bassereau, Patricia

2016-01-01

Bin/Amphiphysin/Rvs (BAR) domain proteins control the curvature of lipid membranes in endocytosis, trafficking, cell motility, the formation of complex subcellular structures, and many other cellular phenomena. They form 3D assemblies that act as molecular scaffolds to reshape the membrane and alter its mechanical properties. It is unknown, however, how a protein scaffold forms and how BAR domains interact in these assemblies at protein densities relevant for a cell. In this work, we use various experimental, theoretical, and simulation approaches to explore how BAR proteins organize to form a scaffold on a membrane nanotube. By combining quantitative microscopy with analytical modeling, we demonstrate that a highly curving BAR protein endophilin nucleates its scaffolds at the ends of a membrane tube, contrary to a weaker curving protein centaurin, which binds evenly along the tube’s length. Our work implies that the nature of local protein–membrane interactions can affect the specific localization of proteins on membrane-remodeling sites. Furthermore, we show that amphipathic helices are dispensable in forming protein scaffolds. Finally, we explore a possible molecular structure of a BAR-domain scaffold using coarse-grained molecular dynamics simulations. Together with fluorescence microscopy, the simulations show that proteins need only to cover 30–40% of a tube’s surface to form a rigid assembly. Our work provides mechanical and structural insights into the way BAR proteins may sculpt the membrane as a high-order cooperative assembly in important biological processes. PMID:27655892
Electronic polarization stabilizes tertiary structure prediction of HP-36.

PubMed

Duan, Li L; Zhu, Tong; Zhang, Qing G; Tang, Bo; Zhang, John Z H

2014-04-01

Molecular dynamic (MD) simulations with both implicit and explicit solvent models have been carried out to study the folding dynamics of HP-36 protein. Starting from the extended conformation, the secondary structure of all three helices in HP-36 was formed in about 50 ns and remained stable in the remaining simulation. However, the formation of the tertiary structure was difficult. Although some intermediates were close to the native structure, the overall conformation was not stable. Further analysis revealed that the large structure fluctuation of loop and hydrophobic core regions was devoted mostly to the instability of the structure during MD simulation. The backbone root-mean-square deviation (RMSD) of the loop and hydrophobic core regions showed strong correlation with the backbone RMSD of the whole protein. The free energy landscape indicated that the distribution of main chain torsions in loop and turn regions was far away from the native state. Starting from an intermediate structure extracted from the initial AMBER simulation, HP-36 was found to generally fold to the native state under the dynamically adjusted polarized protein-specific charge (DPPC) simulation, while the peptide did not fold into the native structure when AMBER force filed was used. The two best folded structures were extracted and taken into further simulations in water employing AMBER03 charge and DPPC for 25 ns. Result showed that introducing polarization effect into interacting potential could stabilize the near-native protein structure.
Critical Features of Fragment Libraries for Protein Structure Prediction

PubMed Central

dos Santos, Karina Baptista

2017-01-01

The use of fragment libraries is a popular approach among protein structure prediction methods and has proven to substantially improve the quality of predicted structures. However, some vital aspects of a fragment library that influence the accuracy of modeling a native structure remain to be determined. This study investigates some of these features. Particularly, we analyze the effect of using secondary structure prediction guiding fragments selection, different fragments sizes and the effect of structural clustering of fragments within libraries. To have a clearer view of how these factors affect protein structure prediction, we isolated the process of model building by fragment assembly from some common limitations associated with prediction methods, e.g., imprecise energy functions and optimization algorithms, by employing an exact structure-based objective function under a greedy algorithm. Our results indicate that shorter fragments reproduce the native structure more accurately than the longer. Libraries composed of multiple fragment lengths generate even better structures, where longer fragments show to be more useful at the beginning of the simulations. The use of many different fragment sizes shows little improvement when compared to predictions carried out with libraries that comprise only three different fragment sizes. Models obtained from libraries built using only sequence similarity are, on average, better than those built with a secondary structure prediction bias. However, we found that the use of secondary structure prediction allows greater reduction of the search space, which is invaluable for prediction methods. The results of this study can be critical guidelines for the use of fragment libraries in protein structure prediction. PMID:28085928
Critical Features of Fragment Libraries for Protein Structure Prediction.

PubMed

Trevizani, Raphael; Custódio, Fábio Lima; Dos Santos, Karina Baptista; Dardenne, Laurent Emmanuel

2017-01-01

The use of fragment libraries is a popular approach among protein structure prediction methods and has proven to substantially improve the quality of predicted structures. However, some vital aspects of a fragment library that influence the accuracy of modeling a native structure remain to be determined. This study investigates some of these features. Particularly, we analyze the effect of using secondary structure prediction guiding fragments selection, different fragments sizes and the effect of structural clustering of fragments within libraries. To have a clearer view of how these factors affect protein structure prediction, we isolated the process of model building by fragment assembly from some common limitations associated with prediction methods, e.g., imprecise energy functions and optimization algorithms, by employing an exact structure-based objective function under a greedy algorithm. Our results indicate that shorter fragments reproduce the native structure more accurately than the longer. Libraries composed of multiple fragment lengths generate even better structures, where longer fragments show to be more useful at the beginning of the simulations. The use of many different fragment sizes shows little improvement when compared to predictions carried out with libraries that comprise only three different fragment sizes. Models obtained from libraries built using only sequence similarity are, on average, better than those built with a secondary structure prediction bias. However, we found that the use of secondary structure prediction allows greater reduction of the search space, which is invaluable for prediction methods. The results of this study can be critical guidelines for the use of fragment libraries in protein structure prediction.
Maltose-neopentyl glycol (MNG) amphiphiles for solubilization, stabilization and crystallization of membrane proteins.

PubMed

Chae, Pil Seok; Rasmussen, Søren G F; Rana, Rohini R; Gotfryd, Kamil; Chandra, Richa; Goren, Michael A; Kruse, Andrew C; Nurva, Shailika; Loland, Claus J; Pierre, Yves; Drew, David; Popot, Jean-Luc; Picot, Daniel; Fox, Brian G; Guan, Lan; Gether, Ulrik; Byrne, Bernadette; Kobilka, Brian; Gellman, Samuel H

2010-12-01

The understanding of integral membrane protein (IMP) structure and function is hampered by the difficulty of handling these proteins. Aqueous solubilization, necessary for many types of biophysical analysis, generally requires a detergent to shield the large lipophilic surfaces of native IMPs. Many proteins remain difficult to study owing to a lack of suitable detergents. We introduce a class of amphiphiles, each built around a central quaternary carbon atom derived from neopentyl glycol, with hydrophilic groups derived from maltose. Representatives of this maltose-neopentyl glycol (MNG) amphiphile family show favorable behavior relative to conventional detergents, as manifested in multiple membrane protein systems, leading to enhanced structural stability and successful crystallization. MNG amphiphiles are promising tools for membrane protein science because of the ease with which they may be prepared and the facility with which their structures may be varied.
Structural insight into GRIP1-PDZ6 in Alzheimer's disease: study from protein expression data to molecular dynamics simulations.

PubMed

Chatterjee, Paulami; Roy, Debjani

2017-08-01

Protein-protein interaction domain, PDZ, plays a critical role in efficient synaptic transmission in brain. Dysfunction of synaptic transmission is thought to be the underlying basis of many neuropsychiatric and neurodegenerative disorders including Alzheimer's disease (AD). In this study, Glutamate Receptor Interacting Protein1 (GRIP1) was identified as one of the most important differentially expressed, topologically significant proteins in the protein-protein interaction network. To date, very few studies have analyzed the detailed structural basis of PDZ-mediated protein interaction of GRIP1. In order to gain better understanding of structural and dynamic basis of these interactions, we employed molecular dynamics (MD) simulations of GRIP1-PDZ6 dimer bound with Liprin-alpha and GRIP1-PDZ6 dimer alone each with 100 ns simulations. The analyses of MD simulations of Liprin-alpha bound GRIP1-PDZ6 dimer show considerable conformational differences than that of peptide-free dimer in terms of SASA, hydrogen bonding patterns, and along principal component 1 (PC1). Our study also furnishes insight into the structural attunement of the PDZ6 domains of Liprin-alpha bound GRIP1 that is attributed by significant shift of the Liprin-alpha recognition helix in the simulated peptide-bound dimer compared to the crystal structure and simulated peptide-free dimer. It is evident that PDZ6 domains of peptide-bound dimer show differential movements along PC1 than that of peptide-free dimers. Thus, Liprin-alpha also serves an important role in conferring conformational changes along the dimeric interface of the peptide-bound dimer. Results reported here provide information that may lead to novel therapeutic approaches in AD.
Accurate protein structure modeling using sparse NMR data and homologous structure information.

PubMed

Thompson, James M; Sgourakis, Nikolaos G; Liu, Gaohua; Rossi, Paolo; Tang, Yuefeng; Mills, Jeffrey L; Szyperski, Thomas; Montelione, Gaetano T; Baker, David

2012-06-19

While information from homologous structures plays a central role in X-ray structure determination by molecular replacement, such information is rarely used in NMR structure determination because it can be incorrect, both locally and globally, when evolutionary relationships are inferred incorrectly or there has been considerable evolutionary structural divergence. Here we describe a method that allows robust modeling of protein structures of up to 225 residues by combining (1)H(N), (13)C, and (15)N backbone and (13)Cβ chemical shift data, distance restraints derived from homologous structures, and a physically realistic all-atom energy function. Accurate models are distinguished from inaccurate models generated using incorrect sequence alignments by requiring that (i) the all-atom energies of models generated using the restraints are lower than models generated in unrestrained calculations and (ii) the low-energy structures converge to within 2.0 Å backbone rmsd over 75% of the protein. Benchmark calculations on known structures and blind targets show that the method can accurately model protein structures, even with very remote homology information, to a backbone rmsd of 1.2-1.9 Å relative to the conventional determined NMR ensembles and of 0.9-1.6 Å relative to X-ray structures for well-defined regions of the protein structures. This approach facilitates the accurate modeling of protein structures using backbone chemical shift data without need for side-chain resonance assignments and extensive analysis of NOESY cross-peak assignments.
Molecular and functional characterization of single-box high-mobility group B (HMGB) chromosomal protein from Aedes aegypti.

PubMed

de Abreu da Silva, Isabel Caetano; Vicentino, Amanda Roberta Revoredo; Dos Santos, Renata Coutinho; da Fonseca, Rodrigo Nunes; de Mendonça Amarante, Anderson; Carneiro, Vitor Coutinho; de Amorim Pinto, Marcia; Aguilera, Estefania Anahi; Mohana-Borges, Ronaldo; Bisch, Paulo Mascarello; da Silva-Neto, Mario Alberto Cardoso; Fantappié, Marcelo Rosado

2018-05-30

High-mobility group B (HMGB) proteins have highly conserved, unique DNA-binding domains, HMG boxes, that can bind non-B-type DNA structures, such as bent, kinked and unwound structures, with high affinity. HMGB proteins also promote DNA bending, looping and unwinding. In this study, we determined the role of the Aedes aegypti single HMG-box domain protein AaHMGB; characterized its structure, spatiotemporal expression levels, subcellular localization, and nucleic acid binding activities; and compared these properties with those of its double-HMG-box counterpart protein, AaHMGB1. Via qRT-PCR, we showed that AaHMGB is expressed at much higher levels than AaHMGB1 throughout mosquito development. In situ hybridization results suggested a role for AaHMGB and AaHMGB1 during embryogenesis. Immunolocalization in the midgut revealed that AaHMGB is exclusively nuclear. Circular dichroism and fluorescence spectroscopy analyses showed that AaHMGB exhibits common features of α-helical structures and is more stably folded than AaHMGB1, likely due to the presence of one or two HMG boxes. Using several DNA substrates or single-stranded RNAs as probes, we observed significant differences between AaHMGB and AaHMGB1 in terms of their binding patterns, activity and/or specificity. Importantly, we showed that the phosphorylation of AaHMGB plays a critical role in its DNA-binding activity. Our study provides additional insight into the roles of single- versus double-HMG-box-containing proteins in nucleic acid interactions for better understanding of mosquito development, physiology and homeostasis. Copyright © 2017. Published by Elsevier B.V.
Recent advances in automated protein design and its future challenges.

PubMed

Setiawan, Dani; Brender, Jeffrey; Zhang, Yang

2018-04-25

Protein function is determined by protein structure which is in turn determined by the corresponding protein sequence. If the rules that cause a protein to adopt a particular structure are understood, it should be possible to refine or even redefine the function of a protein by working backwards from the desired structure to the sequence. Automated protein design attempts to calculate the effects of mutations computationally with the goal of more radical or complex transformations than are accessible by experimental techniques. Areas covered: The authors give a brief overview of the recent methodological advances in computer-aided protein design, showing how methodological choices affect final design and how automated protein design can be used to address problems considered beyond traditional protein engineering, including the creation of novel protein scaffolds for drug development. Also, the authors address specifically the future challenges in the development of automated protein design. Expert opinion: Automated protein design holds potential as a protein engineering technique, particularly in cases where screening by combinatorial mutagenesis is problematic. Considering solubility and immunogenicity issues, automated protein design is initially more likely to make an impact as a research tool for exploring basic biology in drug discovery than in the design of protein biologics.
Intermolecular detergent-membrane protein noes for the characterization of the dynamics of membrane protein-detergent complexes.

PubMed

Eichmann, Cédric; Orts, Julien; Tzitzilonis, Christos; Vögeli, Beat; Smrt, Sean; Lorieau, Justin; Riek, Roland

2014-12-11

The interaction between membrane proteins and lipids or lipid mimetics such as detergents is key for the three-dimensional structure and dynamics of membrane proteins. In NMR-based structural studies of membrane proteins, qualitative analysis of intermolecular nuclear Overhauser enhancements (NOEs) or paramagnetic resonance enhancement are used in general to identify the transmembrane segments of a membrane protein. Here, we employed a quantitative characterization of intermolecular NOEs between (1)H of the detergent and (1)H(N) of (2)H-perdeuterated, (15)N-labeled α-helical membrane protein-detergent complexes following the exact NOE (eNOE) approach. Structural considerations suggest that these intermolecular NOEs should show a helical-wheel-type behavior along a transmembrane helix or a membrane-attached helix within a membrane protein as experimentally demonstrated for the complete influenza hemagglutinin fusion domain HAfp23. The partial absence of such a NOE pattern along the amino acid sequence as shown for a truncated variant of HAfp23 and for the Escherichia coli inner membrane protein YidH indicates the presence of large tertiary structure fluctuations such as an opening between helices or the presence of large rotational dynamics of the helices. Detergent-protein NOEs thus appear to be a straightforward probe for a qualitative characterization of structural and dynamical properties of membrane proteins embedded in detergent micelles.
Structure-Energy Relationships of Halogen Bonds in Proteins.

PubMed

Scholfield, Matthew R; Ford, Melissa Coates; Carlsson, Anna-Carin C; Butta, Hawera; Mehl, Ryan A; Ho, P Shing

2017-06-06

The structures and stabilities of proteins are defined by a series of weak noncovalent electrostatic, van der Waals, and hydrogen bond (HB) interactions. In this study, we have designed and engineered halogen bonds (XBs) site-specifically to study their structure-energy relationship in a model protein, T4 lysozyme. The evidence for XBs is the displacement of the aromatic side chain toward an oxygen acceptor, at distances that are equal to or less than the sums of their respective van der Waals radii, when the hydroxyl substituent of the wild-type tyrosine is replaced by a halogen. In addition, thermal melting studies show that the iodine XB rescues the stabilization energy from an otherwise destabilizing substitution (at an equivalent noninteracting site), indicating that the interaction is also present in solution. Quantum chemical calculations show that the XB complements an HB at this site and that solvent structure must also be considered in trying to design molecular interactions such as XBs into biological systems. A bromine substitution also shows displacement of the side chain, but the distances and geometries do not indicate formation of an XB. Thus, we have dissected the contributions from various noncovalent interactions of halogens introduced into proteins, to drive the application of XBs, particularly in biomolecular design.
A Generic Force Field for Protein Coarse-Grained Molecular Dynamics Simulation

PubMed Central

Gu, Junfeng; Bai, Fang; Li, Honglin; Wang, Xicheng

2012-01-01

Coarse-grained (CG) force fields have become promising tools for studies of protein behavior, but the balance of speed and accuracy is still a challenge in the research of protein coarse graining methodology. In this work, 20 CG beads have been designed based on the structures of amino acid residues, with which an amino acid can be represented by one or two beads, and a CG solvent model with five water molecules was adopted to ensure the consistence with the protein CG beads. The internal interactions in protein were classified according to the types of the interacting CG beads, and adequate potential functions were chosen and systematically parameterized to fit the energy distributions. The proposed CG force field has been tested on eight proteins, and each protein was simulated for 1000 ns. Even without any extra structure knowledge of the simulated proteins, the Cα root mean square deviations (RMSDs) with respect to their experimental structures are close to those of relatively short time all atom molecular dynamics simulations. However, our coarse grained force field will require further refinement to improve agreement with and persistence of native-like structures. In addition, the root mean square fluctuations (RMSFs) relative to the average structures derived from the simulations show that the conformational fluctuations of the proteins can be sampled. PMID:23203075
Native State Volume Fluctuations in Proteins as a Mechanism for Dynamic Allostery.

PubMed

Law, Anthony B; Sapienza, Paul J; Zhang, Jun; Zuo, Xiaobing; Petit, Chad M

2017-03-15

Allostery enables tight regulation of protein function in the cellular environment. Although existing models of allostery are firmly rooted in the current structure-function paradigm, the mechanistic basis for allostery in the absence of structural change remains unclear. In this study, we show that a typical globular protein is able to undergo significant changes in volume under native conditions while exhibiting no additional changes in protein structure. These native state volume fluctuations were found to correlate with changes in internal motions that were previously recognized as a source of allosteric entropy. This finding offers a novel mechanistic basis for allostery in the absence of canonical structural change. The unexpected observation that function can be derived from expanded, low density protein states has broad implications for our understanding of allostery and suggests that the general concept of the native state be expanded to allow for more variable physical dimensions with looser packing.
Structural insights into the multifunctional protein VP3 of birnaviruses.

PubMed

Casañas, Arnau; Navarro, Aitor; Ferrer-Orta, Cristina; González, Dolores; Rodríguez, José F; Verdaguer, Núria

2008-01-01

Infectious bursal disease virus (IBDV), a member of the Birnaviridae family, is the causative agent of one of the most harmful poultry diseases. The IBDV genome encodes five mature proteins; of these, the multifunctional protein VP3 plays an essential role in virus morphogenesis. This protein, which interacts with the structural protein VP2, with the double-stranded RNA genome, and with the virus-encoded, RNA-dependent RNA polymerase, VP1, is involved not only in the formation of the viral capsid, but also in the recruitment of VP1 into the capsid and in the encapsidation of the viral genome. Here, we report the X-ray structure of the central region of VP3, residues 92-220, consisting of two alpha-helical domains connected by a long and flexible hinge that are organized as a dimer. Unexpectedly, the overall fold of the second VP3 domain shows significant structural similarities with different transcription regulation factors.
Modeling helical proteins using residual dipolar couplings, sparse long-range distance constraints and a simple residue-based force field

PubMed Central

Eggimann, Becky L.; Vostrikov, Vitaly V.; Veglia, Gianluigi; Siepmann, J. Ilja

2013-01-01

We present a fast and simple protocol to obtain moderate-resolution backbone structures of helical proteins. This approach utilizes a combination of sparse backbone NMR data (residual dipolar couplings and paramagnetic relaxation enhancements) or EPR data with a residue-based force field and Monte Carlo/simulated annealing protocol to explore the folding energy landscape of helical proteins. By using only backbone NMR data, which are relatively easy to collect and analyze, and strategically placed spin relaxation probes, we show that it is possible to obtain protein structures with correct helical topology and backbone RMS deviations well below 4 Å. This approach offers promising alternatives for the structural determination of proteins in which nuclear Overha-user effect data are difficult or impossible to assign and produces initial models that will speed up the high-resolution structure determination by NMR spectroscopy. PMID:24639619
General mechanism of two-state protein folding kinetics.

PubMed

Rollins, Geoffrey C; Dill, Ken A

2014-08-13

We describe here a general model of the kinetic mechanism of protein folding. In the Foldon Funnel Model, proteins fold in units of secondary structures, which form sequentially along the folding pathway, stabilized by tertiary interactions. The model predicts that the free energy landscape has a volcano shape, rather than a simple funnel, that folding is two-state (single-exponential) when secondary structures are intrinsically unstable, and that each structure along the folding path is a transition state for the previous structure. It shows how sequential pathways are consistent with multiple stochastic routes on funnel landscapes, and it gives good agreement with the 9 order of magnitude dependence of folding rates on protein size for a set of 93 proteins, at the same time it is consistent with the near independence of folding equilibrium constant on size. This model gives estimates of folding rates of proteomes, leading to a median folding time in Escherichia coli of about 5 s.

Structure and assembly of scalable porous protein cages

NASA Astrophysics Data System (ADS)

Sasaki, Eita; Böhringer, Daniel; van de Waterbeemd, Michiel; Leibundgut, Marc; Zschoche, Reinhard; Heck, Albert J. R.; Ban, Nenad; Hilvert, Donald

2017-03-01

Proteins that self-assemble into regular shell-like polyhedra are useful, both in nature and in the laboratory, as molecular containers. Here we describe cryo-electron microscopy (EM) structures of two versatile encapsulation systems that exploit engineered electrostatic interactions for cargo loading. We show that increasing the number of negative charges on the lumenal surface of lumazine synthase, a protein that naturally assembles into a ~1-MDa dodecahedron composed of 12 pentamers, induces stepwise expansion of the native protein shell, giving rise to thermostable ~3-MDa and ~6-MDa assemblies containing 180 and 360 subunits, respectively. Remarkably, these expanded particles assume unprecedented tetrahedrally and icosahedrally symmetric structures constructed entirely from pentameric units. Large keyhole-shaped pores in the shell, not present in the wild-type capsid, enable diffusion-limited encapsulation of complementarily charged guests. The structures of these supercharged assemblies demonstrate how programmed electrostatic effects can be effectively harnessed to tailor the architecture and properties of protein cages.
Sequence-similar, structure-dissimilar protein pairs in the PDB.

PubMed

Kosloff, Mickey; Kolodny, Rachel

2008-05-01

It is often assumed that in the Protein Data Bank (PDB), two proteins with similar sequences will also have similar structures. Accordingly, it has proved useful to develop subsets of the PDB from which "redundant" structures have been removed, based on a sequence-based criterion for similarity. Similarly, when predicting protein structure using homology modeling, if a template structure for modeling a target sequence is selected by sequence alone, this implicitly assumes that all sequence-similar templates are equivalent. Here, we show that this assumption is often not correct and that standard approaches to create subsets of the PDB can lead to the loss of structurally and functionally important information. We have carried out sequence-based structural superpositions and geometry-based structural alignments of a large number of protein pairs to determine the extent to which sequence similarity ensures structural similarity. We find many examples where two proteins that are similar in sequence have structures that differ significantly from one another. The source of the structural differences usually has a functional basis. The number of such proteins pairs that are identified and the magnitude of the dissimilarity depend on the approach that is used to calculate the differences; in particular sequence-based structure superpositioning will identify a larger number of structurally dissimilar pairs than geometry-based structural alignments. When two sequences can be aligned in a statistically meaningful way, sequence-based structural superpositioning provides a meaningful measure of structural differences. This approach and geometry-based structure alignments reveal somewhat different information and one or the other might be preferable in a given application. Our results suggest that in some cases, notably homology modeling, the common use of nonredundant datasets, culled from the PDB based on sequence, may mask important structural and functional information. We have established a data base of sequence-similar, structurally dissimilar protein pairs that will help address this problem (http://luna.bioc.columbia.edu/rachel/seqsimstrdiff.htm).
Insights into structural features determining odorant affinities to honey bee odorant binding protein 14.

PubMed

Schwaighofer, Andreas; Pechlaner, Maria; Oostenbrink, Chris; Kotlowski, Caroline; Araman, Can; Mastrogiacomo, Rosa; Pelosi, Paolo; Knoll, Wolfgang; Nowak, Christoph; Larisika, Melanie

2014-04-18

Molecular interactions between odorants and odorant binding proteins (OBPs) are of major importance for understanding the principles of selectivity of OBPs towards the wide range of semiochemicals. It is largely unknown on a structural basis, how an OBP binds and discriminates between odorant molecules. Here we examine this aspect in greater detail by comparing the C-minus OBP14 of the honey bee (Apis mellifera L.) to a mutant form of the protein that comprises the third disulfide bond lacking in C-minus OBPs. Affinities of structurally analogous odorants featuring an aromatic phenol group with different side chains were assessed based on changes of the thermal stability of the protein upon odorant binding monitored by circular dichroism spectroscopy. Our results indicate a tendency that odorants show higher affinity to the wild-type OBP suggesting that the introduced rigidity in the mutant protein has a negative effect on odorant binding. Furthermore, we show that OBP14 stability is very sensitive to the position and type of functional groups in the odorant. Copyright © 2014 Elsevier Inc. All rights reserved.
Structural and biological mimicry of protein surface recognition by [alpha/beta]-peptide foldamers

DOE Office of Scientific and Technical Information (OSTI.GOV)

Horne, W. Seth; Johnson, Lisa M.; Ketas, Thomas J.

Unnatural oligomers that can mimic protein surfaces offer a potentially useful strategy for blocking biomedically important protein-protein interactions. Here we evaluate an approach based on combining {alpha}- and {beta}-amino acid residues in the context of a polypeptide sequence from the HIV protein gp41, which represents an excellent testbed because of the wealth of available structural and biological information. We show that {alpha}/{beta}-peptides can mimic structural and functional properties of a critical gp41 subunit. Physical studies in solution, crystallographic data, and results from cell-fusion and virus-infectivity assays collectively indicate that the gp41-mimetic {alpha}/{beta}-peptides effectively block HIV-cell fusion via a mechanism comparablemore » to that of gp41-derived {alpha}-peptides. An optimized {alpha}/{beta}-peptide is far less susceptible to proteolytic degradation than is an analogous {alpha}-peptide. Our findings show how a two-stage design approach, in which sequence-based {alpha} {yields} {beta} replacements are followed by site-specific backbone rigidification, can lead to physical and biological mimicry of a natural biorecognition process.« less
Structural elucidation of the interaction between neurodegenerative disease-related tau protein with model lipid membranes

NASA Astrophysics Data System (ADS)

Jones, Emmalee M.

A protein's sequence of amino acids determines how it folds. That folded structure is linked to protein function, and misfolding to dysfunction. Protein misfolding and aggregation into beta-sheet rich fibrillar aggregates is connected with over 20 neurodegenerative diseases, including Alzheimer's disease (AD). AD is characterized in part by misfolding, aggregation and deposition of the microtubule associated tau protein into neurofibrillary tangles (NFTs). However, two questions remain: What is tau's fibrillization mechanism, and what is tau's cytotoxicity mechanism? Tau is prone to heterogeneous interactions, including with lipid membranes. Lipids have been found in NFTs, anionic lipid vesicles induced aggregation of the microtubule binding domain of tau, and other protein aggregates induced ion permeability in cells. This evidence prompted our investigation of tau's interaction with model lipid membranes to elucidate the structural perturbations those interactions induced in tau protein and in the membrane. We show that although tau is highly charged and soluble, it is highly surface active and preferentially interacts with anionic membranes. To resolve molecular-scale structural details of tau and model membranes, we utilized X-ray and neutron scattering techniques. X-ray reflectivity indicated tau aggregated at air/water and anionic lipid membrane interfaces and penetrated into membranes. More significantly, membrane interfaces induced tau protein to partially adopt a more compact conformation with density similar to folded protein and ordered structure characteristic of beta-sheet formation. This suggests possible membrane-based mechanisms of tau aggregation. Membrane morphological changes were seen using fluorescence microscopy, and X-ray scattering techniques showed tau completely disrupts anionic membranes, suggesting an aggregate-based cytotoxicity mechanism. Further investigation of protein constructs and a "hyperphosphorylation" disease mimic helped clarify the role of the microtubule binding domain in anionic lipid affinity and demonstrated even "hyperphosphorylation" did not prevent interaction with anionic membranes. Additional studies investigated more complex membrane models to increase physiological relevance. These insights revealed structural changes in tau protein and lipid membranes after interaction. We observed tau's affinity for interfaces, and aggregation and compaction once tau partitions to interfaces. We observed the beginnings of beta-sheet formation in tau at anionic lipid membranes. We also examined disruption to the membrane on a molecular scale.
Ion-binding properties of Calnuc, Ca2+ versus Mg2+--Calnuc adopts additional and unusual Ca2+-binding sites upon interaction with G-protein.

PubMed

Kanuru, Madhavi; Samuel, Jebakumar J; Balivada, Lavanya M; Aradhyam, Gopala K

2009-05-01

Calnuc is a novel, highly modular, EF-hand containing, Ca(2+)-binding, Golgi resident protein whose functions are not clear. Using amino acid sequences, we demonstrate that Calnuc is a highly conserved protein among various organisms, from Ciona intestinalis to humans. Maximum homology among all sequences is found in the region that binds to G-proteins. In humans, it is known to be expressed in a variety of tissues, and it interacts with several important protein partners. Among other proteins, Calnuc is known to interact with heterotrimeric G-proteins, specifically with the alpha-subunit. Herein, we report the structural implications of Ca(2+) and Mg(2+) binding, and illustrate that Calnuc functions as a downstream effector for G-protein alpha-subunit. Our results show that Ca(2+) binds with an affinity of 7 mum and causes structural changes. Although Mg(2+) binds to Calnuc with very weak affinity, the structural changes that it causes are further enhanced by Ca(2+) binding. Furthermore, isothermal titration calorimetry results show that Calnuc and the G-protein bind with an affinity of 13 nm. We also predict a probable function for Calnuc, that of maintaining Ca(2+) homeostasis in the cell. Using Stains-all and terbium as Ca(2+) mimic probes, we demonstrate that the Ca(2+)-binding ability of Calnuc is governed by the activity-based conformational state of the G-protein. We propose that Calnuc adopts structural sites similar to the ones seen in proteins such as annexins, c2 domains or chromogrannin A, and therefore binds more calcium ions upon binding to Gialpha. With the number of organelle-targeted G-protein-coupled receptors increasing, intracellular communication mediated by G-proteins could become a new paradigm. In this regard, we propose that Calnuc could be involved in the downstream signaling of G-proteins.
Solution structure of the tandem acyl carrier protein domains from a polyunsaturated fatty acid synthase reveals beads-on-a-string configuration.

PubMed

Trujillo, Uldaeliz; Vázquez-Rosa, Edwin; Oyola-Robles, Delise; Stagg, Loren J; Vassallo, David A; Vega, Irving E; Arold, Stefan T; Baerga-Ortiz, Abel

2013-01-01

The polyunsaturated fatty acid (PUFA) synthases from deep-sea bacteria invariably contain multiple acyl carrier protein (ACP) domains in tandem. This conserved tandem arrangement has been implicated in both amplification of fatty acid production (additive effect) and in structural stabilization of the multidomain protein (synergistic effect). While the more accepted model is one in which domains act independently, recent reports suggest that ACP domains may form higher oligomers. Elucidating the three-dimensional structure of tandem arrangements may therefore give important insights into the functional relevance of these structures, and hence guide bioengineering strategies. In an effort to elucidate the three-dimensional structure of tandem repeats from deep-sea anaerobic bacteria, we have expressed and purified a fragment consisting of five tandem ACP domains from the PUFA synthase from Photobacterium profundum. Analysis of the tandem ACP fragment by analytical gel filtration chromatography showed a retention time suggestive of a multimeric protein. However, small angle X-ray scattering (SAXS) revealed that the multi-ACP fragment is an elongated monomer which does not form a globular unit. Stokes radii calculated from atomic monomeric SAXS models were comparable to those measured by analytical gel filtration chromatography, showing that in the gel filtration experiment, the molecular weight was overestimated due to the elongated protein shape. Thermal denaturation monitored by circular dichroism showed that unfolding of the tandem construct was not cooperative, and that the tandem arrangement did not stabilize the protein. Taken together, these data are consistent with an elongated beads-on-a-string arrangement of the tandem ACP domains in PUFA synthases, and speak against synergistic biocatalytic effects promoted by quaternary structuring. Thus, it is possible to envision bioengineering strategies which simply involve the artificial linking of multiple ACP domains for increasing the yield of fatty acids in bacterial cultures.
Solution Structure of the Tandem Acyl Carrier Protein Domains from a Polyunsaturated Fatty Acid Synthase Reveals Beads-on-a-String Configuration

PubMed Central

Trujillo, Uldaeliz; Vázquez-Rosa, Edwin; Oyola-Robles, Delise; Stagg, Loren J.; Vassallo, David A.; Vega, Irving E.; Arold, Stefan T.; Baerga-Ortiz, Abel

2013-01-01

The polyunsaturated fatty acid (PUFA) synthases from deep-sea bacteria invariably contain multiple acyl carrier protein (ACP) domains in tandem. This conserved tandem arrangement has been implicated in both amplification of fatty acid production (additive effect) and in structural stabilization of the multidomain protein (synergistic effect). While the more accepted model is one in which domains act independently, recent reports suggest that ACP domains may form higher oligomers. Elucidating the three-dimensional structure of tandem arrangements may therefore give important insights into the functional relevance of these structures, and hence guide bioengineering strategies. In an effort to elucidate the three-dimensional structure of tandem repeats from deep-sea anaerobic bacteria, we have expressed and purified a fragment consisting of five tandem ACP domains from the PUFA synthase from Photobacterium profundum. Analysis of the tandem ACP fragment by analytical gel filtration chromatography showed a retention time suggestive of a multimeric protein. However, small angle X-ray scattering (SAXS) revealed that the multi-ACP fragment is an elongated monomer which does not form a globular unit. Stokes radii calculated from atomic monomeric SAXS models were comparable to those measured by analytical gel filtration chromatography, showing that in the gel filtration experiment, the molecular weight was overestimated due to the elongated protein shape. Thermal denaturation monitored by circular dichroism showed that unfolding of the tandem construct was not cooperative, and that the tandem arrangement did not stabilize the protein. Taken together, these data are consistent with an elongated beads-on-a-string arrangement of the tandem ACP domains in PUFA synthases, and speak against synergistic biocatalytic effects promoted by quaternary structuring. Thus, it is possible to envision bioengineering strategies which simply involve the artificial linking of multiple ACP domains for increasing the yield of fatty acids in bacterial cultures. PMID:23469090
Quantitative Protein Topography Analysis and High-Resolution Structure Prediction Using Hydroxyl Radical Labeling and Tandem-Ion Mass Spectrometry (MS)*

PubMed Central

Kaur, Parminder; Kiselar, Janna; Yang, Sichun; Chance, Mark R.

2015-01-01

Hydroxyl radical footprinting based MS for protein structure assessment has the goal of understanding ligand induced conformational changes and macromolecular interactions, for example, protein tertiary and quaternary structure, but the structural resolution provided by typical peptide-level quantification is limiting. In this work, we present experimental strategies using tandem-MS fragmentation to increase the spatial resolution of the technique to the single residue level to provide a high precision tool for molecular biophysics research. Overall, in this study we demonstrated an eightfold increase in structural resolution compared with peptide level assessments. In addition, to provide a quantitative analysis of residue based solvent accessibility and protein topography as a basis for high-resolution structure prediction; we illustrate strategies of data transformation using the relative reactivity of side chains as a normalization strategy and predict side-chain surface area from the footprinting data. We tested the methods by examination of Ca+2-calmodulin showing highly significant correlations between surface area and side-chain contact predictions for individual side chains and the crystal structure. Tandem ion based hydroxyl radical footprinting-MS provides quantitative high-resolution protein topology information in solution that can fill existing gaps in structure determination for large proteins and macromolecular complexes. PMID:25687570
StruLocPred: structure-based protein subcellular localisation prediction using multi-class support vector machine.

PubMed

Zhou, Wengang; Dickerson, Julie A

2012-01-01

Knowledge of protein subcellular locations can help decipher a protein's biological function. This work proposes new features: sequence-based: Hybrid Amino Acid Pair (HAAP) and two structure-based: Secondary Structural Element Composition (SSEC) and solvent accessibility state frequency. A multi-class Support Vector Machine is developed to predict the locations. Testing on two established data sets yields better prediction accuracies than the best available systems. Comparisons with existing methods show comparable results to ESLPred2. When StruLocPred is applied to the entire Arabidopsis proteome, over 77% of proteins with known locations match the prediction results. An implementation of this system is at http://wgzhou.ece. iastate.edu/StruLocPred/.
Structural optimization and structure-functional selectivity relationship studies of G protein-biased EP2 receptor agonists.

PubMed

Ogawa, Seiji; Watanabe, Toshihide; Moriyuki, Kazumi; Goto, Yoshikazu; Yamane, Shinsaku; Watanabe, Akio; Tsuboi, Kazuma; Kinoshita, Atsushi; Okada, Takuya; Takeda, Hiroyuki; Tani, Kousuke; Maruyama, Toru

2016-05-15

The modification of the novel G protein-biased EP2 agonist 1 has been investigated to improve its G protein activity and develop a better understanding of its structure-functional selectivity relationship (SFSR). The optimization of the substituents on the phenyl ring of 1, followed by the inversion of the hydroxyl group on the cyclopentane moiety led to compound 9, which showed a 100-fold increase in its G protein activity compared with 1 without any increase in β-arrestin recruitment. Furthermore, SFSR studies revealed that the combination of meta and para substituents on the phenyl moiety was crucial to the functional selectivity. Copyright © 2016 Elsevier Ltd. All rights reserved.
Evolutionarily Conserved Linkage between Enzyme Fold, Flexibility, and Catalysis

PubMed Central

Ramanathan, Arvind; Agarwal, Pratul K.

2011-01-01

Proteins are intrinsically flexible molecules. The role of internal motions in a protein's designated function is widely debated. The role of protein structure in enzyme catalysis is well established, and conservation of structural features provides vital clues to their role in function. Recently, it has been proposed that the protein function may involve multiple conformations: the observed deviations are not random thermodynamic fluctuations; rather, flexibility may be closely linked to protein function, including enzyme catalysis. We hypothesize that the argument of conservation of important structural features can also be extended to identification of protein flexibility in interconnection with enzyme function. Three classes of enzymes (prolyl-peptidyl isomerase, oxidoreductase, and nuclease) that catalyze diverse chemical reactions have been examined using detailed computational modeling. For each class, the identification and characterization of the internal protein motions coupled to the chemical step in enzyme mechanisms in multiple species show identical enzyme conformational fluctuations. In addition to the active-site residues, motions of protein surface loop regions (>10 Å away) are observed to be identical across species, and networks of conserved interactions/residues connect these highly flexible surface regions to the active-site residues that make direct contact with substrates. More interestingly, examination of reaction-coupled motions in non-homologous enzyme systems (with no structural or sequence similarity) that catalyze the same biochemical reaction shows motions that induce remarkably similar changes in the enzyme–substrate interactions during catalysis. The results indicate that the reaction-coupled flexibility is a conserved aspect of the enzyme molecular architecture. Protein motions in distal areas of homologous and non-homologous enzyme systems mediate similar changes in the active-site enzyme–substrate interactions, thereby impacting the mechanism of catalyzed chemistry. These results have implications for understanding the mechanism of allostery, and for protein engineering and drug design. PMID:22087074
Evolutionarily conserved linkage between enzyme fold, flexibility, and catalysis.

PubMed

Ramanathan, Arvind; Agarwal, Pratul K

2011-11-01

Proteins are intrinsically flexible molecules. The role of internal motions in a protein's designated function is widely debated. The role of protein structure in enzyme catalysis is well established, and conservation of structural features provides vital clues to their role in function. Recently, it has been proposed that the protein function may involve multiple conformations: the observed deviations are not random thermodynamic fluctuations; rather, flexibility may be closely linked to protein function, including enzyme catalysis. We hypothesize that the argument of conservation of important structural features can also be extended to identification of protein flexibility in interconnection with enzyme function. Three classes of enzymes (prolyl-peptidyl isomerase, oxidoreductase, and nuclease) that catalyze diverse chemical reactions have been examined using detailed computational modeling. For each class, the identification and characterization of the internal protein motions coupled to the chemical step in enzyme mechanisms in multiple species show identical enzyme conformational fluctuations. In addition to the active-site residues, motions of protein surface loop regions (>10 Å away) are observed to be identical across species, and networks of conserved interactions/residues connect these highly flexible surface regions to the active-site residues that make direct contact with substrates. More interestingly, examination of reaction-coupled motions in non-homologous enzyme systems (with no structural or sequence similarity) that catalyze the same biochemical reaction shows motions that induce remarkably similar changes in the enzyme-substrate interactions during catalysis. The results indicate that the reaction-coupled flexibility is a conserved aspect of the enzyme molecular architecture. Protein motions in distal areas of homologous and non-homologous enzyme systems mediate similar changes in the active-site enzyme-substrate interactions, thereby impacting the mechanism of catalyzed chemistry. These results have implications for understanding the mechanism of allostery, and for protein engineering and drug design.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Ramanathan, Arvind; Agarwal, Pratul K

Proteins are intrinsically flexible molecules. The role of internal motions in a protein's designated function is widely debated. The role of protein structure in enzyme catalysis is well established, and conservation of structural features provides vital clues to their role in function. Recently, it has been proposed that the protein function may involve multiple conformations: the observed deviations are not random thermodynamic fluctuations; rather, flexibility may be closely linked to protein function, including enzyme catalysis. We hypothesize that the argument of conservation of important structural features can also be extended to identification of protein flexibility in interconnection with enzyme function.more » Three classes of enzymes (prolyl-peptidyl isomerase, oxidoreductase, and nuclease) that catalyze diverse chemical reactions have been examined using detailed computational modeling. For each class, the identification and characterization of the internal protein motions coupled to the chemical step in enzyme mechanisms in multiple species show identical enzyme conformational fluctuations. In addition to the active-site residues, motions of protein surface loop regions (>10 away) are observed to be identical across species, and networks of conserved interactions/residues connect these highly flexible surface regions to the active-site residues that make direct contact with substrates. More interestingly, examination of reaction-coupled motions in non-homologous enzyme systems (with no structural or sequence similarity) that catalyze the same biochemical reaction shows motions that induce remarkably similar changes in the enzyme substrate interactions during catalysis. The results indicate that the reaction-coupled flexibility is a conserved aspect of the enzyme molecular architecture. Protein motions in distal areas of homologous and non-homologous enzyme systems mediate similar changes in the active-site enzyme substrate interactions, thereby impacting the mechanism of catalyzed chemistry. These results have implications for understanding the mechanism of allostery, and for protein engineering and drug design.« less
Fold independent structural comparisons of protein-ligand binding sites for exploring functional relationships.

PubMed

Gold, Nicola D; Jackson, Richard M

2006-02-03

The rapid growth in protein structural data and the emergence of structural genomics projects have increased the need for automatic structure analysis and tools for function prediction. Small molecule recognition is critical to the function of many proteins; therefore, determination of ligand binding site similarity is important for understanding ligand interactions and may allow their functional classification. Here, we present a binding sites database (SitesBase) that given a known protein-ligand binding site allows rapid retrieval of other binding sites with similar structure independent of overall sequence or fold similarity. However, each match is also annotated with sequence similarity and fold information to aid interpretation of structure and functional similarity. Similarity in ligand binding sites can indicate common binding modes and recognition of similar molecules, allowing potential inference of function for an uncharacterised protein or providing additional evidence of common function where sequence or fold similarity is already known. Alternatively, the resource can provide valuable information for detailed studies of molecular recognition including structure-based ligand design and in understanding ligand cross-reactivity. Here, we show examples of atomic similarity between superfamily or more distant fold relatives as well as between seemingly unrelated proteins. Assignment of unclassified proteins to structural superfamiles is also undertaken and in most cases substantiates assignments made using sequence similarity. Correct assignment is also possible where sequence similarity fails to find significant matches, illustrating the potential use of binding site comparisons for newly determined proteins.
Protein family clustering for structural genomics.

PubMed

Yan, Yongpan; Moult, John

2005-10-28

A major goal of structural genomics is the provision of a structural template for a large fraction of protein domains. The magnitude of this task depends on the number and nature of protein sequence families. With a large number of bacterial genomes now fully sequenced, it is possible to obtain improved estimates of the number and diversity of families in that kingdom. We have used an automated clustering procedure to group all sequences in a set of genomes into protein families. Bench-marking shows the clustering method is sensitive at detecting remote family members, and has a low level of false positives. This comprehensive protein family set has been used to address the following questions. (1) What is the structure coverage for currently known families? (2) How will the number of known apparent families grow as more genomes are sequenced? (3) What is a practical strategy for maximizing structure coverage in future? Our study indicates that approximately 20% of known families with three or more members currently have a representative structure. The study indicates also that the number of apparent protein families will be considerably larger than previously thought: We estimate that, by the criteria of this work, there will be about 250,000 protein families when 1000 microbial genomes have been sequenced. However, the vast majority of these families will be small, and it will be possible to obtain structural templates for 70-80% of protein domains with an achievable number of representative structures, by systematically sampling the larger families.
Atomic Force Microscopy Analysis of the Role of Major DNA-Binding Proteins in Organization of the Nucleoid in Escherichia coli

PubMed Central

Ohniwa, Ryosuke L.; Muchaku, Hiroki; Saito, Shinji; Wada, Chieko; Morikawa, Kazuya

2013-01-01

Bacterial genomic DNA is packed within the nucleoid of the cell along with various proteins and RNAs. We previously showed that the nucleoid in log phase cells consist of fibrous structures with diameters ranging from 30 to 80 nm, and that these structures, upon RNase A treatment, are converted into homogeneous thinner fibers with diameter of 10 nm. In this study, we investigated the role of major DNA-binding proteins in nucleoid organization by analyzing the nucleoid of mutant Escherichia coli strains lacking HU, IHF, H–NS, StpA, Fis, or Hfq using atomic force microscopy. Deletion of particular DNA-binding protein genes altered the nucleoid structure in different ways, but did not release the naked DNA even after the treatment with RNase A. This suggests that major DNA-binding proteins are involved in the formation of higher order structure once 10-nm fiber structure is built up from naked DNA. PMID:23951337
The Ramachandran Number: An Order Parameter for Protein Geometry

DOE PAGES

Mannige, Ranjan V.; Kundu, Joyjit; Whitelam, Stephen; ...

2016-08-04

Three-dimensional protein structures usually contain regions of local order, called secondary structure, such as α-helices and β-sheets. Secondary structure is characterized by the local rotational state of the protein backbone, quantified by two dihedral angles called Øand Ψ. Particular types of secondary structure can generally be described by a single (diffuse) location on a two-dimensional plot drawn in the space of the angles Ø andΨ, called a Ramachandran plot. By contrast, a recently-discovered nanomaterial made from peptoids, structural isomers of peptides, displays a secondary-structure motif corresponding to two regions on the Ramachandran plot [Mannige et al., Nature 526, 415 (2015)].more » In order to describe such 'higher-order' secondary structure in a compact way we introduce here a means of describing regions on the Ramachandran plot in terms of a single Ramachandran number, R, which is a structurally meaningful combination of Ø andΨ. We show that the potential applications of R are numerous: it can be used to describe the geometric content of protein structures, and can be used to draw diagrams that reveal, at a glance, the frequency of occurrence of regular secondary structures and disordered regions in large protein datasets. We propose that R might be used as an order parameter for protein geometry for a wide range of applications.« less
Text Mining for Protein Docking

PubMed Central

Badal, Varsha D.; Kundrotas, Petras J.; Vakser, Ilya A.

2015-01-01

The rapidly growing amount of publicly available information from biomedical research is readily accessible on the Internet, providing a powerful resource for predictive biomolecular modeling. The accumulated data on experimentally determined structures transformed structure prediction of proteins and protein complexes. Instead of exploring the enormous search space, predictive tools can simply proceed to the solution based on similarity to the existing, previously determined structures. A similar major paradigm shift is emerging due to the rapidly expanding amount of information, other than experimentally determined structures, which still can be used as constraints in biomolecular structure prediction. Automated text mining has been widely used in recreating protein interaction networks, as well as in detecting small ligand binding sites on protein structures. Combining and expanding these two well-developed areas of research, we applied the text mining to structural modeling of protein-protein complexes (protein docking). Protein docking can be significantly improved when constraints on the docking mode are available. We developed a procedure that retrieves published abstracts on a specific protein-protein interaction and extracts information relevant to docking. The procedure was assessed on protein complexes from Dockground (http://dockground.compbio.ku.edu). The results show that correct information on binding residues can be extracted for about half of the complexes. The amount of irrelevant information was reduced by conceptual analysis of a subset of the retrieved abstracts, based on the bag-of-words (features) approach. Support Vector Machine models were trained and validated on the subset. The remaining abstracts were filtered by the best-performing models, which decreased the irrelevant information for ~ 25% complexes in the dataset. The extracted constraints were incorporated in the docking protocol and tested on the Dockground unbound benchmark set, significantly increasing the docking success rate. PMID:26650466
Quality assessment of protein model-structures using evolutionary conservation.

PubMed

Kalman, Matan; Ben-Tal, Nir

2010-05-15

Programs that evaluate the quality of a protein structural model are important both for validating the structure determination procedure and for guiding the model-building process. Such programs are based on properties of native structures that are generally not expected for faulty models. One such property, which is rarely used for automatic structure quality assessment, is the tendency for conserved residues to be located at the structural core and for variable residues to be located at the surface. We present ConQuass, a novel quality assessment program based on the consistency between the model structure and the protein's conservation pattern. We show that it can identify problematic structural models, and that the scores it assigns to the server models in CASP8 correlate with the similarity of the models to the native structure. We also show that when the conservation information is reliable, the method's performance is comparable and complementary to that of the other single-structure quality assessment methods that participated in CASP8 and that do not use additional structural information from homologs. A perl implementation of the method, as well as the various perl and R scripts used for the analysis are available at http://bental.tau.ac.il/ConQuass/. nirb@tauex.tau.ac.il Supplementary data are available at Bioinformatics online.

Structural changes of malt proteins during boiling.

PubMed

Jin, Bei; Li, Lin; Liu, Guo-Qin; Li, Bing; Zhu, Yu-Kui; Liao, Liao-Ning

2009-03-09

Changes in the physicochemical properties and structure of proteins derived from two malt varieties (Baudin and Guangmai) during wort boiling were investigated by differential scanning calorimetry, SDS-PAGE, two-dimensional electrophoresis, gel filtration chromatography and circular dichroism spectroscopy. The results showed that both protein content and amino acid composition changed only slightly during boiling, and that boiling might cause a gradual unfolding of protein structures, as indicated by the decrease in surface hydrophobicity and free sulfhydryl content and enthalpy value, as well as reduced alpha-helix contents and markedly increased random coil contents. It was also found that major component of both worts was a boiling-resistant protein with a molecular mass of 40 kDa, and that according to the two-dimensional electrophoresis and SE-HPLC analyses, a small amount of soluble aggregates might be formed via hydrophobic interactions. It was thus concluded that changes of protein structure caused by boiling that might influence beer quality are largely independent of malt variety.
Universal features of fluctuations in globular proteins.

PubMed

Erman, Burak

2016-06-01

Using data from 2000 non-homologous protein crystal structures, we show that the distribution of residue B factors of proteins collapses onto a single master curve. We show by maximum entropy arguments that this curve is a Gamma function whose order and dispersion are obtained from experimental data. The distribution for any given specific protein can be generated from the master curve by a linear transformation. Any perturbation of the B factor distribution of a protein, imposed at constant energy, causes a decrease in the entropy of the protein relative to that of the reference state. Proteins 2016; 84:721-725. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Visualisation of variable binding pockets on protein surfaces by probabilistic analysis of related structure sets.

PubMed

Ashford, Paul; Moss, David S; Alex, Alexander; Yeap, Siew K; Povia, Alice; Nobeli, Irene; Williams, Mark A

2012-03-14

Protein structures provide a valuable resource for rational drug design. For a protein with no known ligand, computational tools can predict surface pockets that are of suitable size and shape to accommodate a complementary small-molecule drug. However, pocket prediction against single static structures may miss features of pockets that arise from proteins' dynamic behaviour. In particular, ligand-binding conformations can be observed as transiently populated states of the apo protein, so it is possible to gain insight into ligand-bound forms by considering conformational variation in apo proteins. This variation can be explored by considering sets of related structures: computationally generated conformers, solution NMR ensembles, multiple crystal structures, homologues or homology models. It is non-trivial to compare pockets, either from different programs or across sets of structures. For a single structure, difficulties arise in defining particular pocket's boundaries. For a set of conformationally distinct structures the challenge is how to make reasonable comparisons between them given that a perfect structural alignment is not possible. We have developed a computational method, Provar, that provides a consistent representation of predicted binding pockets across sets of related protein structures. The outputs are probabilities that each atom or residue of the protein borders a predicted pocket. These probabilities can be readily visualised on a protein using existing molecular graphics software. We show how Provar simplifies comparison of the outputs of different pocket prediction algorithms, of pockets across multiple simulated conformations and between homologous structures. We demonstrate the benefits of use of multiple structures for protein-ligand and protein-protein interface analysis on a set of complexes and consider three case studies in detail: i) analysis of a kinase superfamily highlights the conserved occurrence of surface pockets at the active and regulatory sites; ii) a simulated ensemble of unliganded Bcl2 structures reveals extensions of a known ligand-binding pocket not apparent in the apo crystal structure; iii) visualisations of interleukin-2 and its homologues highlight conserved pockets at the known receptor interfaces and regions whose conformation is known to change on inhibitor binding. Through post-processing of the output of a variety of pocket prediction software, Provar provides a flexible approach to the analysis and visualization of the persistence or variability of pockets in sets of related protein structures.
Interplay between Peptide Bond Geometrical Parameters in Nonglobular Structural Contexts

PubMed Central

Esposito, Luciana; De Simone, Alfonso; Vitagliano, Luigi

2013-01-01

Several investigations performed in the last two decades have unveiled that geometrical parameters of protein backbone show a remarkable variability. Although these studies have provided interesting insights into one of the basic aspects of protein structure, they have been conducted on globular and water-soluble proteins. We report here a detailed analysis of backbone geometrical parameters in nonglobular proteins/peptides. We considered membrane proteins and two distinct fibrous systems (amyloid-forming and collagen-like peptides). Present data show that in these systems the local conformation plays a major role in dictating the amplitude of the bond angle N-Cα-C and the propensity of the peptide bond to adopt planar/nonplanar states. Since the trends detected here are in line with the concept of the mutual influence of local geometry and conformation previously established for globular and water-soluble proteins, our analysis demonstrates that the interplay of backbone geometrical parameters is an intrinsic and general property of protein/peptide structures that is preserved also in nonglobular contexts. For amyloid-forming peptides significant distortions of the N-Cα-C bond angle, indicative of sterical hidden strain, may occur in correspondence with side chain interdigitation. The correlation between the dihedral angles Δω/ψ in collagen-like models may have interesting implications for triple helix stability. PMID:24455689
Interplay between peptide bond geometrical parameters in nonglobular structural contexts.

PubMed

Esposito, Luciana; Balasco, Nicole; De Simone, Alfonso; Berisio, Rita; Vitagliano, Luigi

2013-01-01

Several investigations performed in the last two decades have unveiled that geometrical parameters of protein backbone show a remarkable variability. Although these studies have provided interesting insights into one of the basic aspects of protein structure, they have been conducted on globular and water-soluble proteins. We report here a detailed analysis of backbone geometrical parameters in nonglobular proteins/peptides. We considered membrane proteins and two distinct fibrous systems (amyloid-forming and collagen-like peptides). Present data show that in these systems the local conformation plays a major role in dictating the amplitude of the bond angle N-C(α)-C and the propensity of the peptide bond to adopt planar/nonplanar states. Since the trends detected here are in line with the concept of the mutual influence of local geometry and conformation previously established for globular and water-soluble proteins, our analysis demonstrates that the interplay of backbone geometrical parameters is an intrinsic and general property of protein/peptide structures that is preserved also in nonglobular contexts. For amyloid-forming peptides significant distortions of the N-C(α)-C bond angle, indicative of sterical hidden strain, may occur in correspondence with side chain interdigitation. The correlation between the dihedral angles Δω/ψ in collagen-like models may have interesting implications for triple helix stability.
Salting-in effect on muscle protein extracted from giant squid (Dosidicus gigas).

PubMed

Zhang, Rui; Zhou, Ru; Pan, Weichun; Lin, Weiwei; Zhang, Xiuzhen; Li, Mengya; Li, Jianrong; Niu, Fuge; Li, Ang

2017-01-15

The salting-in effect on muscle protein is well-known in food science but hard to explain using conventional theories. Myofibrillar protein extracted from the giant squid (Dosidicus gigas) was selected as a model muscle protein to study this mechanism in KCl solutions. Changes in the secondary structures of myofibrillar protein molecules caused by concentrated salts, particularly in the paramyosin molecule conformation, have been reported. Zeta-potential determinations showed that these secondary structures have modified protein molecule surfaces. The zeta-potential of the myofibrillar protein molecules fell from -7.24±0.82 to -9.99±1.65mV with increasing salt concentration from 0.1 to 0.5M. The corresponding second virial coefficient increased from -85.43±3.8×10(-7) to -3.45±1.3×10(-7) molmLg(-2). The extended law of corresponding states suggests that reduced attractive interactions increase the protein solubility. Solubility measurements in alternating KCl concentrations showed that the conformational change was reversible. Copyright © 2016 Elsevier Ltd. All rights reserved.
Computational prediction of hinge axes in proteins

PubMed Central

2014-01-01

Background A protein's function is determined by the wide range of motions exhibited by its 3D structure. However, current experimental techniques are not able to reliably provide the level of detail required for elucidating the exact mechanisms of protein motion essential for effective drug screening and design. Computational tools are instrumental in the study of the underlying structure-function relationship. We focus on a special type of proteins called "hinge proteins" which exhibit a motion that can be interpreted as a rotation of one domain relative to another. Results This work proposes a computational approach that uses the geometric structure of a single conformation to predict the feasible motions of the protein and is founded in recent work from rigidity theory, an area of mathematics that studies flexibility properties of general structures. Given a single conformational state, our analysis predicts a relative axis of motion between two specified domains. We analyze a dataset of 19 structures known to exhibit this hinge-like behavior. For 15, the predicted axis is consistent with a motion to a second, known conformation. We present a detailed case study for three proteins whose dynamics have been well-studied in the literature: calmodulin, the LAO binding protein and the Bence-Jones protein. Conclusions Our results show that incorporating rigidity-theoretic analyses can lead to effective computational methods for understanding hinge motions in macromolecules. This initial investigation is the first step towards a new tool for probing the structure-dynamics relationship in proteins. PMID:25080829
Gi- and Gs-coupled GPCRs show different modes of G-protein binding.

PubMed

Van Eps, Ned; Altenbach, Christian; Caro, Lydia N; Latorraca, Naomi R; Hollingsworth, Scott A; Dror, Ron O; Ernst, Oliver P; Hubbell, Wayne L

2018-03-06

More than two decades ago, the activation mechanism for the membrane-bound photoreceptor and prototypical G protein-coupled receptor (GPCR) rhodopsin was uncovered. Upon light-induced changes in ligand-receptor interaction, movement of specific transmembrane helices within the receptor opens a crevice at the cytoplasmic surface, allowing for coupling of heterotrimeric guanine nucleotide-binding proteins (G proteins). The general features of this activation mechanism are conserved across the GPCR superfamily. Nevertheless, GPCRs have selectivity for distinct G-protein family members, but the mechanism of selectivity remains elusive. Structures of GPCRs in complex with the stimulatory G protein, G s , and an accessory nanobody to stabilize the complex have been reported, providing information on the intermolecular interactions. However, to reveal the structural selectivity filters, it will be necessary to determine GPCR-G protein structures involving other G-protein subtypes. In addition, it is important to obtain structures in the absence of a nanobody that may influence the structure. Here, we present a model for a rhodopsin-G protein complex derived from intermolecular distance constraints between the activated receptor and the inhibitory G protein, G i , using electron paramagnetic resonance spectroscopy and spin-labeling methodologies. Molecular dynamics simulations demonstrated the overall stability of the modeled complex. In the rhodopsin-G i complex, G i engages rhodopsin in a manner distinct from previous GPCR-G s structures, providing insight into specificity determinants. Copyright © 2018 the Author(s). Published by PNAS.
Immunoglobulin subunits of murine B lymphocytes: structure and associations with other membrane proteins.

PubMed Central

Vogel, L; Haustein, D

1989-01-01

The Ig subunit structure of murine B lymphocytes was studied by employing different radiolabelling techniques in combination with chemical cross-linking. The main membrane structure of IgM was a half molecule that was disulphide-linked to proteins with MW 30,000, 45,000 and 55,000, respectively. Small amounts of mu 2L2, microL disulphide-linked to a protein with MW 50,000, and free microL were also detected. The main IgD structures were half molecules disulphide-linked to two proteins with MW 14,000 and two proteins with MW 16,000. Furthermore, IgD half molecules disulphide-linked to a protein with MW 16,000 and free half molecules could be demonstrated. Labelling with hydrophobic reagents showed that all Ig molecules and the protein with MW 50,000, linked to microL, penetrated the lipid bilayer, whereas the other IgM- and IgD-linked proteins probably did not. Additional proteins which were associated exclusively with IgM were detected by chemical cross-linking. These findings offer new possibilities for the investigation of the function(s) of antigen receptors on B cells. Images Figure 1 Figure 2 Figure 4 Figure 5 PMID:2787780
The use of experimental structures to model protein dynamics.

PubMed

Katebi, Ataur R; Sankar, Kannan; Jia, Kejue; Jernigan, Robert L

2015-01-01

The number of solved protein structures submitted in the Protein Data Bank (PDB) has increased dramatically in recent years. For some specific proteins, this number is very high-for example, there are over 550 solved structures for HIV-1 protease, one protein that is essential for the life cycle of human immunodeficiency virus (HIV) which causes acquired immunodeficiency syndrome (AIDS) in humans. The large number of structures for the same protein and its variants include a sample of different conformational states of the protein. A rich set of structures solved experimentally for the same protein has information buried within the dataset that can explain the functional dynamics and structural mechanism of the protein. To extract the dynamics information and functional mechanism from the experimental structures, this chapter focuses on two methods-Principal Component Analysis (PCA) and Elastic Network Models (ENM). PCA is a widely used statistical dimensionality reduction technique to classify and visualize high-dimensional data. On the other hand, ENMs are well-established simple biophysical method for modeling the functionally important global motions of proteins. This chapter covers the basics of these two. Moreover, an improved ENM version that utilizes the variations found within a given set of structures for a protein is described. As a practical example, we have extracted the functional dynamics and mechanism of HIV-1 protease dimeric structure by using a set of 329 PDB structures of this protein. We have described, step by step, how to select a set of protein structures, how to extract the needed information from the PDB files for PCA, how to extract the dynamics information using PCA, how to calculate ENM modes, how to measure the congruency between the dynamics computed from the principal components (PCs) and the ENM modes, and how to compute entropies using the PCs. We provide the computer programs or references to software tools to accomplish each step and show how to use these programs and tools. We also include computer programs to generate movies based on PCs and ENM modes and describe how to visualize them.
The Use of Experimental Structures to Model Protein Dynamics

PubMed Central

Katebi, Ataur R.; Sankar, Kannan; Jia, Kejue; Jernigan, Robert L.

2014-01-01

Summary The number of solved protein structures submitted in the Protein Data Bank (PDB) has increased dramatically in recent years. For some specific proteins, this number is very high – for example, there are over 550 solved structures for HIV-1 protease, one protein that is essential for the life cycle of human immunodeficiency virus (HIV) which causes acquired immunodeficiency syndrome (AIDS) in humans. The large number of structures for the same protein and its variants include a sample of different conformational states of the protein. A rich set of structures solved experimentally for the same protein has information buried within the dataset that can explain the functional dynamics and structural mechanism of the protein. To extract the dynamics information and functional mechanism from the experimental structures, this chapter focuses on two methods – Principal Component Analysis (PCA) and Elastic Network Models (ENM). PCA is a widely used statistical dimensionality reduction technique to classify and visualize high-dimensional data. On the other hand, ENMs are well-established simple biophysical method for modeling the functionally important global motions of proteins. This chapter covers the basics of these two. Moreover, an improved ENM version that utilizes the variations found within a given set of structures for a protein is described. As a practical example, we have extracted the functional dynamics and mechanism of HIV-1 protease dimeric structure by using a set of 329 PDB structures of this protein. We have described, step by step, how to select a set of protein structures, how to extract the needed information from the PDB files for PCA, how to extract the dynamics information using PCA, how to calculate ENM modes, how to measure the congruency between the dynamics computed from the principal components (PCs) and the ENM modes, and how to compute entropies using the PCs. We provide the computer programs or references to software tools to accomplish each step and show how to use these programs and tools. We also include computer programs to generate movies based on PCs and ENM modes and describe how to visualize them. PMID:25330965
Structure and application of antifreeze proteins from Antarctic bacteria.

PubMed

Muñoz, Patricio A; Márquez, Sebastián L; González-Nilo, Fernando D; Márquez-Miranda, Valeria; Blamey, Jenny M

2017-08-07

Antifreeze proteins (AFPs) production is a survival strategy of psychrophiles in ice. These proteins have potential in frozen food industry avoiding the damage in the structure of animal or vegetal foods. Moreover, there is not much information regarding the interaction of Antarctic bacterial AFPs with ice, and new determinations are needed to understand the behaviour of these proteins at the water/ice interface. Different Antarctic places were screened for antifreeze activity and microorganisms were selected for the presence of thermal hysteresis in their crude extracts. Isolates GU1.7.1, GU3.1.1, and AFP5.1 showed higher thermal hysteresis and were characterized using a polyphasic approach. Studies using cucumber and zucchini samples showed cellular protection when samples were treated with partially purified AFPs or a commercial AFP as was determined using toluidine blue O and neutral red staining. Additionally, genome analysis of these isolates revealed the presence of genes that encode for putative AFPs. Deduced amino acids sequences from GU3.1.1 (gu3A and gu3B) and AFP5.1 (afp5A) showed high similarity to reported AFPs which crystal structures are solved, allowing then generating homology models. Modelled proteins showed a triangular prism form similar to β-helix AFPs with a linear distribution of threonine residues at one side of the prism that could correspond to the putative ice binding side. The statistically best models were used to build a protein-water system. Molecular dynamics simulations were then performed to compare the antifreezing behaviour of these AFPs at the ice/water interface. Docking and molecular dynamics simulations revealed that gu3B could have the most efficient antifreezing behavior, but gu3A could have a higher affinity for ice. AFPs from Antarctic microorganisms GU1.7.1, GU3.1.1 and AFP5.1 protect cellular structures of frozen food showing a potential for frozen food industry. Modeled proteins possess a β-helix structure, and molecular docking analysis revealed the AFP gu3B could be the most efficient AFPs in order to avoid the formation of ice crystals, even when gu3A has a higher affinity for ice. By determining the interaction of AFPs at the ice/water interface, it will be possible to understand the process of adaptation of psychrophilic bacteria to Antarctic ice.
Molecular Cloning and Characterization of cDNA Encoding a Putative Stress-Induced Heat-Shock Protein from Camelus dromedarius

PubMed Central

Elrobh, Mohamed S.; Alanazi, Mohammad S.; Khan, Wajahatullah; Abduljaleel, Zainularifeen; Al-Amri, Abdullah; Bazzi, Mohammad D.

2011-01-01

Heat shock proteins are ubiquitous, induced under a number of environmental and metabolic stresses, with highly conserved DNA sequences among mammalian species. Camelus dromedaries (the Arabian camel) domesticated under semi-desert environments, is well adapted to tolerate and survive against severe drought and high temperatures for extended periods. This is the first report of molecular cloning and characterization of full length cDNA of encoding a putative stress-induced heat shock HSPA6 protein (also called HSP70B′) from Arabian camel. A full-length cDNA (2417 bp) was obtained by rapid amplification of cDNA ends (RACE) and cloned in pET-b expression vector. The sequence analysis of HSPA6 gene showed 1932 bp-long open reading frame encoding 643 amino acids. The complete cDNA sequence of the Arabian camel HSPA6 gene was submitted to NCBI GeneBank (accession number HQ214118.1). The BLAST analysis indicated that C. dromedaries HSPA6 gene nucleotides shared high similarity (77–91%) with heat shock gene nucleotide of other mammals. The deduced 643 amino acid sequences (accession number ADO12067.1) showed that the predicted protein has an estimated molecular weight of 70.5 kDa with a predicted isoelectric point (pI) of 6.0. The comparative analyses of camel HSPA6 protein sequences with other mammalian heat shock proteins (HSPs) showed high identity (80–94%). Predicted camel HSPA6 protein structure using Protein 3D structural analysis high similarities with human and mouse HSPs. Taken together, this study indicates that the cDNA sequences of HSPA6 gene and its amino acid and protein structure from the Arabian camel are highly conserved and have similarities with other mammalian species. PMID:21845074
Looping and clustering model for the organization of protein-DNA complexes on the bacterial genome

NASA Astrophysics Data System (ADS)

Walter, Jean-Charles; Walliser, Nils-Ole; David, Gabriel; Dorignac, Jérôme; Geniet, Frédéric; Palmeri, John; Parmeggiani, Andrea; Wingreen, Ned S.; Broedersz, Chase P.

2018-03-01

The bacterial genome is organized by a variety of associated proteins inside a structure called the nucleoid. These proteins can form complexes on DNA that play a central role in various biological processes, including chromosome segregation. A prominent example is the large ParB-DNA complex, which forms an essential component of the segregation machinery in many bacteria. ChIP-Seq experiments show that ParB proteins localize around centromere-like parS sites on the DNA to which ParB binds specifically, and spreads from there over large sections of the chromosome. Recent theoretical and experimental studies suggest that DNA-bound ParB proteins can interact with each other to condense into a coherent 3D complex on the DNA. However, the structural organization of this protein-DNA complex remains unclear, and a predictive quantitative theory for the distribution of ParB proteins on DNA is lacking. Here, we propose the looping and clustering model, which employs a statistical physics approach to describe protein-DNA complexes. The looping and clustering model accounts for the extrusion of DNA loops from a cluster of interacting DNA-bound proteins that is organized around a single high-affinity binding site. Conceptually, the structure of the protein-DNA complex is determined by a competition between attractive protein interactions and loop closure entropy of this protein-DNA cluster on the one hand, and the positional entropy for placing loops within the cluster on the other. Indeed, we show that the protein interaction strength determines the ‘tightness’ of the loopy protein-DNA complex. Thus, our model provides a theoretical framework for quantitatively computing the binding profiles of ParB-like proteins around a cognate (parS) binding site.
Modulation of the Extent of Cooperative Structural Change During Protein Folding by Chemical Denaturant.

PubMed

Jethva, Prashant N; Udgaonkar, Jayant B

2017-09-07

Protein folding and unfolding reactions invariably appear to be highly cooperative reactions, but the structural and sequence determinants of cooperativity are poorly understood. Importantly, it is not known whether cooperative structural change occurs throughout the protein, or whether some parts change cooperatively and other parts change noncooperatively. In the current study, hydrogen exchange mass spectrometry has been used to show that the mechanism of unfolding of the PI3K SH3 domain is similar in the absence and presence of 5 M urea. The data are well described by a four state N ↔ I N ↔ I 2 ↔ U model, in which structural changes occur noncooperatively during the N ↔ I N and I N ↔ I 2 transitions, and occur cooperatively during the I 2 ↔ U transition. The nSrc-loop and RT-loop, as well as β strands 4 and 5 undergo noncooperative unfolding, while β strands 1, 2, and 3 unfold cooperatively in the absence of urea. However, in the presence of 5 M urea, the unfolding of β strand 4 switches to become cooperative, leading to an increase in the extent of cooperative structural change. The current study highlights the relationship between protein stability and cooperativity, by showing how the extent of cooperativity can be varied, using chemical denaturant to alter protein stability.
Fast and Accurate Multivariate Gaussian Modeling of Protein Families: Predicting Residue Contacts and Protein-Interaction Partners

PubMed Central

Feinauer, Christoph; Procaccini, Andrea; Zecchina, Riccardo; Weigt, Martin; Pagnani, Andrea

2014-01-01

In the course of evolution, proteins show a remarkable conservation of their three-dimensional structure and their biological function, leading to strong evolutionary constraints on the sequence variability between homologous proteins. Our method aims at extracting such constraints from rapidly accumulating sequence data, and thereby at inferring protein structure and function from sequence information alone. Recently, global statistical inference methods (e.g. direct-coupling analysis, sparse inverse covariance estimation) have achieved a breakthrough towards this aim, and their predictions have been successfully implemented into tertiary and quaternary protein structure prediction methods. However, due to the discrete nature of the underlying variable (amino-acids), exact inference requires exponential time in the protein length, and efficient approximations are needed for practical applicability. Here we propose a very efficient multivariate Gaussian modeling approach as a variant of direct-coupling analysis: the discrete amino-acid variables are replaced by continuous Gaussian random variables. The resulting statistical inference problem is efficiently and exactly solvable. We show that the quality of inference is comparable or superior to the one achieved by mean-field approximations to inference with discrete variables, as done by direct-coupling analysis. This is true for (i) the prediction of residue-residue contacts in proteins, and (ii) the identification of protein-protein interaction partner in bacterial signal transduction. An implementation of our multivariate Gaussian approach is available at the website http://areeweb.polito.it/ricerca/cmp/code. PMID:24663061
regSNPs-splicing: a tool for prioritizing synonymous single-nucleotide substitution.

PubMed

Zhang, Xinjun; Li, Meng; Lin, Hai; Rao, Xi; Feng, Weixing; Yang, Yuedong; Mort, Matthew; Cooper, David N; Wang, Yue; Wang, Yadong; Wells, Clark; Zhou, Yaoqi; Liu, Yunlong

2017-09-01

While synonymous single-nucleotide variants (sSNVs) have largely been unstudied, since they do not alter protein sequence, mounting evidence suggests that they may affect RNA conformation, splicing, and the stability of nascent-mRNAs to promote various diseases. Accurately prioritizing deleterious sSNVs from a pool of neutral ones can significantly improve our ability of selecting functional genetic variants identified from various genome-sequencing projects, and, therefore, advance our understanding of disease etiology. In this study, we develop a computational algorithm to prioritize sSNVs based on their impact on mRNA splicing and protein function. In addition to genomic features that potentially affect splicing regulation, our proposed algorithm also includes dozens structural features that characterize the functions of alternatively spliced exons on protein function. Our systematical evaluation on thousands of sSNVs suggests that several structural features, including intrinsic disorder protein scores, solvent accessible surface areas, protein secondary structures, and known and predicted protein family domains, show significant differences between disease-causing and neutral sSNVs. Our result suggests that the protein structure features offer an added dimension of information while distinguishing disease-causing and neutral synonymous variants. The inclusion of structural features increases the predictive accuracy for functional sSNV prioritization.
Complete fold annotation of the human proteome using a novel structural feature space

DOE PAGES

Middleton, Sarah A.; Illuminati, Joseph; Kim, Junhyong

2017-04-13

Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this methodmore » by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Finally, our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families.« less
The Structure of the Poxvirus A33 Protein Reveals a Dimer of Unique C-Type Lectin-Like Domains

DOE Office of Scientific and Technical Information (OSTI.GOV)

Su, Hua-Poo; Singh, Kavita; Gittis, Apostolos G.

2010-11-03

The current vaccine against smallpox is an infectious form of vaccinia virus that has significant side effects. Alternative vaccine approaches using recombinant viral proteins are being developed. A target of subunit vaccine strategies is the poxvirus protein A33, a conserved protein in the Chordopoxvirinae subfamily of Poxviridae that is expressed on the outer viral envelope. Here we have determined the structure of the A33 ectodomain of vaccinia virus. The structure revealed C-type lectin-like domains (CTLDs) that occur as dimers in A33 crystals with five different crystal lattices. Comparison of the A33 dimer models shows that the A33 monomers have amore » degree of flexibility in position within the dimer. Structural comparisons show that the A33 monomer is a close match to the Link module class of CTLDs but that the A33 dimer is most similar to the natural killer (NK)-cell receptor class of CTLDs. Structural data on Link modules and NK-cell receptor-ligand complexes suggest a surface of A33 that could interact with viral or host ligands. The dimer interface is well conserved in all known A33 sequences, indicating an important role for the A33 dimer. The structure indicates how previously described A33 mutations disrupt protein folding and locates the positions of N-linked glycosylations and the epitope of a protective antibody.« less
Varicella-zoster virus induces the formation of dynamic nuclear capsid aggregates

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lebrun, Marielle; Thelen, Nicolas; Thiry, Marc

2014-04-15

The first step of herpesviruses virion assembly occurs in the nucleus. However, the exact site where nucleocapsids are assembled, where the genome and the inner tegument are acquired, remains controversial. We created a recombinant VZV expressing ORF23 (homologous to HSV-1 VP26) fused to the eGFP and dually fluorescent viruses with a tegument protein additionally fused to a red tag (ORF9, ORF21 and ORF22 corresponding to HSV-1 UL49, UL37 and UL36). We identified nuclear dense structures containing the major capsid protein, the scaffold protein and maturing protease, as well as ORF21 and ORF22. Correlative microscopy demonstrated that the structures correspond tomore » capsid aggregates and time-lapse video imaging showed that they appear prior to the accumulation of cytoplasmic capsids, presumably undergoing the secondary egress, and are highly dynamic. Our observations suggest that these structures might represent a nuclear area important for capsid assembly and/or maturation before the budding at the inner nuclear membrane. - Highlights: • We created a recombinant VZV expressing the small capsid protein fused to the eGFP. • We identified nuclear dense structures containing capsid and procapsid proteins. • Correlative microscopy showed that the structures correspond to capsid aggregates. • Procapsids and partial capsids are found within the aggregates of WT and eGFP-23 VZV. • FRAP and FLIP experiments demonstrated that they are dynamic structures.« less

Water at protein surfaces studied with femtosecond nonlinear spectroscopy

NASA Astrophysics Data System (ADS)

Bakker, Huib J.

We report on an investigation of the structure and dynamics of water molecules near protein surfaces with femtosecond nonlinear spectroscopic techniques. We measured the reorientation dynamics of water molecules near the surface of several globular protein surfaces, using polarization-resolved femtosecond infrared spectroscopy. We found that water molecules near the protein surface have a much slower reorientation than water molecules in bulk liquid water. The number of slow water molecules scales scales with the size of the hydrophobic surface of the protein. When we denature the proteins by adding an increasing amount of urea to the protein solution, we observe that the water-exposed surface increases by 50% before the secondary structure of the proteins changes. This finding indicates that protein unfolding starts with the protein structure becoming less tight, thereby allowing water to enter. With surface vibrational sum frequency generation (VSFG) spectroscopy, we studied the structure of water at the surface of antifreeze protein III. The measured VSFG spectra showed the presence of ice-like water layers at the ice-binding site of the protein in aqueous solution, at temperatures well above the freezing point. This ordered ice-like hydration layers at the protein surface likely plays an important role in the specific recognition and binding of anti-freeze protein III to nascent ice crystallites, and thus in its anti-freeze mechanism. This research is supported by the ''Nederlandse organisatie voor Wetenschappelijk Onderzoek (NWO).
Functional Evolution of PLP-dependent Enzymes based on Active-Site Structural Similarities

PubMed Central

Catazaro, Jonathan; Caprez, Adam; Guru, Ashu; Swanson, David; Powers, Robert

2014-01-01

Families of distantly related proteins typically have very low sequence identity, which hinders evolutionary analysis and functional annotation. Slowly evolving features of proteins, such as an active site, are therefore valuable for annotating putative and distantly related proteins. To date, a complete evolutionary analysis of the functional relationship of an entire enzyme family based on active-site structural similarities has not yet been undertaken. Pyridoxal-5’-phosphate (PLP) dependent enzymes are primordial enzymes that diversified in the last universal ancestor. Using the Comparison of Protein Active Site Structures (CPASS) software and database, we show that the active site structures of PLP-dependent enzymes can be used to infer evolutionary relationships based on functional similarity. The enzymes successfully clustered together based on substrate specificity, function, and three-dimensional fold. This study demonstrates the value of using active site structures for functional evolutionary analysis and the effectiveness of CPASS. PMID:24920327
Functional evolution of PLP-dependent enzymes based on active-site structural similarities.

PubMed

Catazaro, Jonathan; Caprez, Adam; Guru, Ashu; Swanson, David; Powers, Robert

2014-10-01

Families of distantly related proteins typically have very low sequence identity, which hinders evolutionary analysis and functional annotation. Slowly evolving features of proteins, such as an active site, are therefore valuable for annotating putative and distantly related proteins. To date, a complete evolutionary analysis of the functional relationship of an entire enzyme family based on active-site structural similarities has not yet been undertaken. Pyridoxal-5'-phosphate (PLP) dependent enzymes are primordial enzymes that diversified in the last universal ancestor. Using the comparison of protein active site structures (CPASS) software and database, we show that the active site structures of PLP-dependent enzymes can be used to infer evolutionary relationships based on functional similarity. The enzymes successfully clustered together based on substrate specificity, function, and three-dimensional-fold. This study demonstrates the value of using active site structures for functional evolutionary analysis and the effectiveness of CPASS. © 2014 Wiley Periodicals, Inc.
Is the isolated ligand binding domain a good model of the domain in the native receptor?

PubMed

Deming, Dustin; Cheng, Qing; Jayaraman, Vasanthi

2003-05-16

Numerous studies have used the atomic level structure of the isolated ligand binding domain of the glutamate receptor to elucidate the agonist-induced activation and desensitization processes in this group of proteins. However, no study has demonstrated the structural equivalence of the isolated ligand binding fragments and the protein in the native receptor. In this report, using visible absorption spectroscopy we show that the electronic environment of the antagonist 6-cyano-7-nitro-2,3-dihydroxyquinoxaline is identical for the isolated protein and the native glutamate receptors expressed in cells. Our results hence establish that the local structure of the ligand binding site is the same in the two proteins and validate the detailed structure-function relationships that have been developed based on a comparison of the structure of the isolated ligand binding domain and electrophysiological consequences in the native receptor.
Protein Assembly and Building Blocks: Beyond the Limits of the LEGO Brick Metaphor.

PubMed

Levy, Yaakov

2017-09-26

Proteins, like other biomolecules, have a modular and hierarchical structure. Various building blocks are used to construct proteins of high structural complexity and diverse functionality. In multidomain proteins, for example, domains are fused to each other in different combinations to achieve different functions. Although the LEGO brick metaphor is justified as a means of simplifying the complexity of three-dimensional protein structures, several fundamental properties (such as allostery or the induced-fit mechanism) make deviation from it necessary to respect the plasticity, softness, and cross-talk that are essential to protein function. In this work, we illustrate recently reported protein behavior in multidomain proteins that deviates from the LEGO brick analogy. While earlier studies showed that a protein domain is often unaffected by being fused to another domain or becomes more stable following the formation of a new interface between the tethered domains, destabilization due to tethering has been reported for several systems. We illustrate that tethering may sometimes result in a multidomain protein behaving as "less than the sum of its parts". We survey these cases for which structure additivity does not guarantee thermodynamic additivity. Protein destabilization due to fusion to other domains may be linked in some cases to biological function and should be taken into account when designing large assemblies.
Discrete Molecular Dynamics Approach to the Study of Disordered and Aggregating Proteins.

PubMed

Emperador, Agustí; Orozco, Modesto

2017-03-14

We present a refinement of the Coarse Grained PACSAB force field for Discrete Molecular Dynamics (DMD) simulations of proteins in aqueous conditions. As the original version, the refined method provides good representation of the structure and dynamics of folded proteins but provides much better representations of a variety of unfolded proteins, including some very large, impossible to analyze by atomistic simulation methods. The PACSAB/DMD method also reproduces accurately aggregation properties, providing good pictures of the structural ensembles of proteins showing a folded core and an intrinsically disordered region. The combination of accuracy and speed makes the method presented here a good alternative for the exploration of unstructured protein systems.
Slowing Translation between Protein Domains by Increasing Affinity between mRNAs and the Ribosomal Anti-Shine-Dalgarno Sequence Improves Solubility.

PubMed

Vasquez, Kevin A; Hatridge, Taylor A; Curtis, Nicholas C; Contreras, Lydia M

2016-02-19

Recent studies have demonstrated that effective protein production requires coordination of multiple cotranslational cellular processes, which are heavily affected by translation timing. Until recently, protein engineering has focused on codon optimization to maximize protein production rates, mostly considering the effect of tRNA abundance. However, as it relates to complex multidomain proteins, it has been hypothesized that strategic translational pauses between domains and between distinct individual structural motifs can prevent interactions between nascent chain fragments that generate kinetically trapped misfolded peptides and thereby enhance protein yields. In this study, we introduce synthetic transient pauses between structural domains in a heterologous model protein based on designed patterns of affinity between the mRNA and the anti-Shine-Dalgarno (aSD) sequence on the ribosome. We demonstrate that optimizing translation attenuation at domain boundaries can predictably affect solubility patterns in bacteria. Exploration of the affinity space showed that modifying less than 1% of the nucleotides (on a small 12 amino acid linker) can vary soluble protein yields up to ∼7-fold without altering the primary sequence of the protein. In the context of longer linkers, where a larger number of distinct structural motifs can fold outside the ribosome, optimal synonymous codon variations resulted in an additional 2.1-fold increase in solubility, relative to that of nonoptimized linkers of the same length. While rational construction of 54 linkers of various affinities showed a significant correlation between protein solubility and predicted affinity, only weaker correlations were observed between tRNA abundance and protein solubility. We also demonstrate that naturally occurring high-affinity clusters are present between structural domains of β-galactosidase, one of Escherichia coli's largest native proteins. Interdomain ribosomal affinity is an important factor that has not previously been explored in the context of protein engineering.
Systems Mechanobiology: Tension-Inhibited Protein Turnover Is Sufficient to Physically Control Gene Circuits

PubMed Central

Dingal, P.C. Dave P.; Discher, Dennis E.

2014-01-01

Mechanotransduction pathways convert forces that stress and strain structures within cells into gene expression levels that impact development, homeostasis, and disease. The levels of some key structural proteins in the nucleus, cytoskeleton, or extracellular matrix have been recently reported to scale with tissue- and cell-level forces or mechanical properties such as stiffness, and so the mathematics of mechanotransduction becomes important to understand. Here, we show that if a given structural protein positively regulates its own gene expression, then stresses need only inhibit degradation of that protein to achieve stable, mechanosensitive gene expression. This basic use-it-or-lose-it module is illustrated by application to meshworks of nuclear lamin A, minifilaments of myosin II, and extracellular matrix collagen fibers—all of which possess filamentous coiled-coil/supercoiled structures. Past experiments not only suggest that tension suppresses protein degradation mediated and/or initiated by various enzymes but also that transcript levels vary with protein levels because key transcription factors are regulated by these structural proteins. Coupling between modules occurs within single cells and between cells in tissue, as illustrated during embryonic heart development where cardiac fibroblasts make collagen that cardiomyocytes contract. With few additional assumptions, the basic module has sufficient physics to control key structural genes in both development and disease. PMID:25468352
G‐LoSA: An efficient computational tool for local structure‐centric biological studies and drug design

PubMed Central

2016-01-01

Abstract Molecular recognition by protein mostly occurs in a local region on the protein surface. Thus, an efficient computational method for accurate characterization of protein local structural conservation is necessary to better understand biology and drug design. We present a novel local structure alignment tool, G‐LoSA. G‐LoSA aligns protein local structures in a sequence order independent way and provides a GA‐score, a chemical feature‐based and size‐independent structure similarity score. Our benchmark validation shows the robust performance of G‐LoSA to the local structures of diverse sizes and characteristics, demonstrating its universal applicability to local structure‐centric comparative biology studies. In particular, G‐LoSA is highly effective in detecting conserved local regions on the entire surface of a given protein. In addition, the applications of G‐LoSA to identifying template ligands and predicting ligand and protein binding sites illustrate its strong potential for computer‐aided drug design. We hope that G‐LoSA can be a useful computational method for exploring interesting biological problems through large‐scale comparison of protein local structures and facilitating drug discovery research and development. G‐LoSA is freely available to academic users at http://im.compbio.ku.edu/GLoSA/. PMID:26813336
Pretreatment of flaxseed protein isolate by high hydrostatic pressure: Impacts on protein structure, enzymatic hydrolysis and final hydrolysate antioxidant capacities.

PubMed

Perreault, Véronique; Hénaux, Loïc; Bazinet, Laurent; Doyen, Alain

2017-04-15

The effect of high hydrostatic pressure (HHP) on flaxseed protein structure and peptide profiles, obtained after protein hydrolysis, was investigated. Isolated flaxseed protein (1%, m/v) was subjected to HHP (600MPa, 5min or 20min at 20°C) prior to hydrolysis with trypsin only and trypsin-pronase. The results demonstrated that HHP treatment induced dissociation of flaxseed proteins and generated higher molecular weight aggregates as a function of processing duration. Fluorescence spectroscopy showed that HHP treatment, as well as processing duration, had an impact on flaxseed protein structure since exposition of hydrophobic amino acid tyrosine was modified. Except for some specific peptides, the concentrations of which were modified, similar peptide profiles were obtained after hydrolysis of pressure-treated proteins using trypsin. Finally, hydrolysates obtained using trypsin-pronase had a greater antioxidant capacity (ORAC) than control samples; these results confirmed that HHP enhanced the generation of antioxidant peptides. Copyright © 2016 Elsevier Ltd. All rights reserved.
Predicting Real-Valued Protein Residue Fluctuation Using FlexPred.

PubMed

Peterson, Lenna; Jamroz, Michal; Kolinski, Andrzej; Kihara, Daisuke

2017-01-01

The conventional view of a protein structure as static provides only a limited picture. There is increasing evidence that protein dynamics are often vital to protein function including interaction with partners such as other proteins, nucleic acids, and small molecules. Considering flexibility is also important in applications such as computational protein docking and protein design. While residue flexibility is partially indicated by experimental measures such as the B-factor from X-ray crystallography and ensemble fluctuation from nuclear magnetic resonance (NMR) spectroscopy as well as computational molecular dynamics (MD) simulation, these techniques are resource-intensive. In this chapter, we describe the web server and stand-alone version of FlexPred, which rapidly predicts absolute per-residue fluctuation from a three-dimensional protein structure. On a set of 592 nonredundant structures, comparing the fluctuations predicted by FlexPred to the observed fluctuations in MD simulations showed an average correlation coefficient of 0.669 and an average root mean square error of 1.07 Å. FlexPred is available at http://kiharalab.org/flexPred/ .
Crystallization of Membrane Proteins by Vapor Diffusion

PubMed Central

Delmar, Jared A.; Bolla, Jani Reddy; Su, Chih-Chia; Yu, Edward W.

2016-01-01

X-ray crystallography remains the most robust method to determine protein structure at the atomic level. However, the bottlenecks of protein expression and purification often discourage further study. In this chapter, we address the most common problems encountered at these stages. Based on our experiences in expressing and purifying antimicrobial efflux proteins, we explain how a pure and homogenous protein sample can be successfully crystallized by the vapor diffusion method. We present our current protocols and methodologies for this technique. Case studies show step-by-step how we have overcome problems related to expression and diffraction, eventually producing high quality membrane protein crystals for structural determinations. It is our hope that a rational approach can be made of the often anecdotal process of membrane protein crystallization. PMID:25950974
A Predictive Model of Intein Insertion Site for Use in the Engineering of Molecular Switches

PubMed Central

Apgar, James; Ross, Mary; Zuo, Xiao; Dohle, Sarah; Sturtevant, Derek; Shen, Binzhang; de la Vega, Humberto; Lessard, Philip; Lazar, Gabor; Raab, R. Michael

2012-01-01

Inteins are intervening protein domains with self-splicing ability that can be used as molecular switches to control activity of their host protein. Successfully engineering an intein into a host protein requires identifying an insertion site that permits intein insertion and splicing while allowing for proper folding of the mature protein post-splicing. By analyzing sequence and structure based properties of native intein insertion sites we have identified four features that showed significant correlation with the location of the intein insertion sites, and therefore may be useful in predicting insertion sites in other proteins that provide native-like intein function. Three of these properties, the distance to the active site and dimer interface site, the SVM score of the splice site cassette, and the sequence conservation of the site showed statistically significant correlation and strong predictive power, with area under the curve (AUC) values of 0.79, 0.76, and 0.73 respectively, while the distance to secondary structure/loop junction showed significance but with less predictive power (AUC of 0.54). In a case study of 20 insertion sites in the XynB xylanase, two features of native insertion sites showed correlation with the splice sites and demonstrated predictive value in selecting non-native splice sites. Structural modeling of intein insertions at two sites highlighted the role that the insertion site location could play on the ability of the intein to modulate activity of the host protein. These findings can be used to enrich the selection of insertion sites capable of supporting intein splicing and hosting an intein switch. PMID:22649521
Molecular Simulation-Based Structural Prediction of Protein Complexes in Mass Spectrometry: The Human Insulin Dimer

PubMed Central

Li, Jinyu; Rossetti, Giulia; Dreyer, Jens; Raugei, Simone; Ippoliti, Emiliano; Lüscher, Bernhard; Carloni, Paolo

2014-01-01

Protein electrospray ionization (ESI) mass spectrometry (MS)-based techniques are widely used to provide insight into structural proteomics under the assumption that non-covalent protein complexes being transferred into the gas phase preserve basically the same intermolecular interactions as in solution. Here we investigate the applicability of this assumption by extending our previous structural prediction protocol for single proteins in ESI-MS to protein complexes. We apply our protocol to the human insulin dimer (hIns2) as a test case. Our calculations reproduce the main charge and the collision cross section (CCS) measured in ESI-MS experiments. Molecular dynamics simulations for 0.075 ms show that the complex maximizes intermolecular non-bonded interactions relative to the structure in water, without affecting the cross section. The overall gas-phase structure of hIns2 does exhibit differences with the one in aqueous solution, not inferable from a comparison with calculated CCS. Hence, care should be exerted when interpreting ESI-MS proteomics data based solely on NMR and/or X-ray structural information. PMID:25210764
Structure and mechanism of maximum stability of isolated alpha-helical protein domains at a critical length scale.

PubMed

Qin, Zhao; Fabre, Andrea; Buehler, Markus J

2013-05-01

The stability of alpha helices is important in protein folding, bioinspired materials design, and controls many biological properties under physiological and disease conditions. Here we show that a naturally favored alpha helix length of 9 to 17 amino acids exists at which the propensity towards the formation of this secondary structure is maximized. We use a combination of thermodynamical analysis, well-tempered metadynamics molecular simulation and statistical analyses of experimental alpha helix length distributions and find that the favored alpha helix length is caused by a competition between alpha helix folding, unfolding into a random coil and formation of higher-order tertiary structures. The theoretical result is suggested to be used to explain the statistical distribution of the length of alpha helices observed in natural protein structures. Our study provides mechanistic insight into fundamental controlling parameters in alpha helix structure formation and potentially other biopolymers or synthetic materials. The result advances our fundamental understanding of size effects in the stability of protein structures and may enable the design of de novo alpha-helical protein materials.
Quality assessment of protein model-structures based on structural and functional similarities

PubMed Central

2012-01-01

Background Experimental determination of protein 3D structures is expensive, time consuming and sometimes impossible. A gap between number of protein structures deposited in the World Wide Protein Data Bank and the number of sequenced proteins constantly broadens. Computational modeling is deemed to be one of the ways to deal with the problem. Although protein 3D structure prediction is a difficult task, many tools are available. These tools can model it from a sequence or partial structural information, e.g. contact maps. Consequently, biologists have the ability to generate automatically a putative 3D structure model of any protein. However, the main issue becomes evaluation of the model quality, which is one of the most important challenges of structural biology. Results GOBA - Gene Ontology-Based Assessment is a novel Protein Model Quality Assessment Program. It estimates the compatibility between a model-structure and its expected function. GOBA is based on the assumption that a high quality model is expected to be structurally similar to proteins functionally similar to the prediction target. Whereas DALI is used to measure structure similarity, protein functional similarity is quantified using standardized and hierarchical description of proteins provided by Gene Ontology combined with Wang's algorithm for calculating semantic similarity. Two approaches are proposed to express the quality of protein model-structures. One is a single model quality assessment method, the other is its modification, which provides a relative measure of model quality. Exhaustive evaluation is performed on data sets of model-structures submitted to the CASP8 and CASP9 contests. Conclusions The validation shows that the method is able to discriminate between good and bad model-structures. The best of tested GOBA scores achieved 0.74 and 0.8 as a mean Pearson correlation to the observed quality of models in our CASP8 and CASP9-based validation sets. GOBA also obtained the best result for two targets of CASP8, and one of CASP9, compared to the contest participants. Consequently, GOBA offers a novel single model quality assessment program that addresses the practical needs of biologists. In conjunction with other Model Quality Assessment Programs (MQAPs), it would prove useful for the evaluation of single protein models. PMID:22998498
LECTINPred: web Server that Uses Complex Networks of Protein Structure for Prediction of Lectins with Potential Use as Cancer Biomarkers or in Parasite Vaccine Design.

PubMed

Munteanu, Cristian R; Pedreira, Nieves; Dorado, Julián; Pazos, Alejandro; Pérez-Montoto, Lázaro G; Ubeira, Florencio M; González-Díaz, Humberto

2014-04-01

Lectins (Ls) play an important role in many diseases such as different types of cancer, parasitic infections and other diseases. Interestingly, the Protein Data Bank (PDB) contains +3000 protein 3D structures with unknown function. Thus, we can in principle, discover new Ls mining non-annotated structures from PDB or other sources. However, there are no general models to predict new biologically relevant Ls based on 3D chemical structures. We used the MARCH-INSIDE software to calculate the Markov-Shannon 3D electrostatic entropy parameters for the complex networks of protein structure of 2200 different protein 3D structures, including 1200 Ls. We have performed a Linear Discriminant Analysis (LDA) using these parameters as inputs in order to seek a new Quantitative Structure-Activity Relationship (QSAR) model, which is able to discriminate 3D structure of Ls from other proteins. We implemented this predictor in the web server named LECTINPred, freely available at http://bio-aims.udc.es/LECTINPred.php. This web server showed the following goodness-of-fit statistics: Sensitivity=96.7 % (for Ls), Specificity=87.6 % (non-active proteins), and Accuracy=92.5 % (for all proteins), considering altogether both the training and external prediction series. In mode 2, users can carry out an automatic retrieval of protein structures from PDB. We illustrated the use of this server, in operation mode 1, performing a data mining of PDB. We predicted Ls scores for +2000 proteins with unknown function and selected the top-scored ones as possible lectins. In operation mode 2, LECTINPred can also upload 3D structural models generated with structure-prediction tools like LOMETS or PHYRE2. The new Ls are expected to be of relevance as cancer biomarkers or useful in parasite vaccine design. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Building proteins from C alpha coordinates using the dihedral probability grid Monte Carlo method.

PubMed Central

Mathiowetz, A. M.; Goddard, W. A.

1995-01-01

Dihedral probability grid Monte Carlo (DPG-MC) is a general-purpose method of conformational sampling that can be applied to many problems in peptide and protein modeling. Here we present the DPG-MC method and apply it to predicting complete protein structures from C alpha coordinates. This is useful in such endeavors as homology modeling, protein structure prediction from lattice simulations, or fitting protein structures to X-ray crystallographic data. It also serves as an example of how DPG-MC can be applied to systems with geometric constraints. The conformational propensities for individual residues are used to guide conformational searches as the protein is built from the amino-terminus to the carboxyl-terminus. Results for a number of proteins show that both the backbone and side chain can be accurately modeled using DPG-MC. Backbone atoms are generally predicted with RMS errors of about 0.5 A (compared to X-ray crystal structure coordinates) and all atoms are predicted to an RMS error of 1.7 A or better. PMID:7549885
The Vip3Ag4 Insecticidal Protoxin from Bacillus thuringiensis Adopts A Tetrameric Configuration That Is Maintained on Proteolysis

PubMed Central

Palma, Leopoldo; Scott, David J.; Harris, Gemma; Din, Salah-Ud; Williams, Thomas L.; Roberts, Oliver J.; Young, Mark T.; Caballero, Primitivo; Berry, Colin

2017-01-01

The Vip3 proteins produced during vegetative growth by strains of the bacterium Bacillus thuringiensis show insecticidal activity against lepidopteran insects with a mechanism of action that may involve pore formation and apoptosis. These proteins are promising supplements to our arsenal of insecticidal proteins, but the molecular details of their activity are not understood. As a first step in the structural characterisation of these proteins, we have analysed their secondary structure and resolved the surface topology of a tetrameric complex of the Vip3Ag4 protein by transmission electron microscopy. Sites sensitive to proteolysis by trypsin are identified and the trypsin-cleaved protein appears to retain a similar structure as an octomeric complex comprising four copies each of the ~65 kDa and ~21 kDa products of proteolysis. This processed form of the toxin may represent the active toxin. The quality and monodispersity of the protein produced in this study make Vip3Ag4 a candidate for more detailed structural analysis using cryo-electron microscopy. PMID:28505109
Structures of minimal catalytic fragments of topoisomerase V reveals conformational changes relevant for DNA binding

PubMed Central

Rajan, Rakhi; Taneja, Bhupesh; Mondragón, Alfonso

2010-01-01

Summary Topoisomerase V is an archaeal type I topoisomerase that is unique among topoisomerases due to presence of both topoisomerase and DNA repair activities in the same protein. It is organized as an N-terminal topoisomerase domain followed by 24 tandem helix hairpin helix (HhH) motifs. Structural studies have shown that the active site is buried by the (HhH) motifs. Here we show that the N-terminal domain can relax DNA in the absence of any HhH motifs and that the HhH motifs are required for stable protein-DNA complex formation. Crystal structures of various topoisomerase V fragments show changes in the relative orientation of the domains mediated by a long bent linker helix, and these movements are essential for the DNA to enter the active site. Phosphate ions bound to the protein near the active site helped model DNA in the topoisomerase domain and shows how topoisomerase V may interact with DNA. PMID:20637419

Investigating the effect of an arterial hypertension drug on the structural properties of plasma protein.

PubMed

Hassan, Natalia; Maldonado-Valderrama, Julia; Gunning, A Patrick; Morris, V J; Ruso, Juan M

2011-10-15

Propanolol is a betablocker drug used in the treatment of arterial hypertension related diseases. In order to achieve an optimal performance of this drug it is important to consider the possible interactions of propanolol with plasma proteins. In this work, we have used several experimental techniques to characterise the effect of addition of the betablocker propanolol on the properties of bovine plasma fibrinogen (FB). Differential scanning calorimeter (DSC), circular dichroism (CD), dynamic light scattering (DLS), surface tension techniques and atomic force microscopy (AFM) measurements have been combined to carry out a detailed physicochemical and surface characterization of the mixed system. As a result, DSC measurements show that propranolol can play two opposite roles, either acting as a structure stabilizer at low molar concentrations or as a structure destabilizer at higher concentrations, in different domains of fibrinogen. CD measurements have revealed that the effect of propanolol on the secondary structure of fibrinogen depends on the temperature and the drug concentration and the DLS analysis showed evidence for protein aggregation. Interestingly, surface tension measurements provided further evidence of the conformational change induced by propanolol on the secondary structure of FB by importantly increasing the surface tension of the system. Finally, AFM imaging of the fibrinogen system provided direct visualization of the protein structure in the presence of propanolol. Combination of these techniques has produced complementary information on the behavior of the mixed system, providing new insights into the structural properties of proteins with potential medical interest. Copyright © 2011 Elsevier B.V. All rights reserved.
Automatic classification of protein structures relying on similarities between alignments

PubMed Central

2012-01-01

Background Identification of protein structural cores requires isolation of sets of proteins all sharing a same subset of structural motifs. In the context of an ever growing number of available 3D protein structures, standard and automatic clustering algorithms require adaptations so as to allow for efficient identification of such sets of proteins. Results When considering a pair of 3D structures, they are stated as similar or not according to the local similarities of their matching substructures in a structural alignment. This binary relation can be represented in a graph of similarities where a node represents a 3D protein structure and an edge states that two 3D protein structures are similar. Therefore, classifying proteins into structural families can be viewed as a graph clustering task. Unfortunately, because such a graph encodes only pairwise similarity information, clustering algorithms may include in the same cluster a subset of 3D structures that do not share a common substructure. In order to overcome this drawback we first define a ternary similarity on a triple of 3D structures as a constraint to be satisfied by the graph of similarities. Such a ternary constraint takes into account similarities between pairwise alignments, so as to ensure that the three involved protein structures do have some common substructure. We propose hereunder a modification algorithm that eliminates edges from the original graph of similarities and gives a reduced graph in which no ternary constraints are violated. Our approach is then first to build a graph of similarities, then to reduce the graph according to the modification algorithm, and finally to apply to the reduced graph a standard graph clustering algorithm. Such method was used for classifying ASTRAL-40 non-redundant protein domains, identifying significant pairwise similarities with Yakusa, a program devised for rapid 3D structure alignments. Conclusions We show that filtering similarities prior to standard graph based clustering process by applying ternary similarity constraints i) improves the separation of proteins of different classes and consequently ii) improves the classification quality of standard graph based clustering algorithms according to the reference classification SCOP. PMID:22974051
Encounter complexes and dimensionality reduction in protein–protein association

PubMed Central

Kozakov, Dima; Li, Keyong; Hall, David R; Beglov, Dmitri; Zheng, Jiefu; Vakili, Pirooz; Schueler-Furman, Ora; Paschalidis, Ioannis Ch; Clore, G Marius; Vajda, Sandor

2014-01-01

An outstanding challenge has been to understand the mechanism whereby proteins associate. We report here the results of exhaustively sampling the conformational space in protein–protein association using a physics-based energy function. The agreement between experimental intermolecular paramagnetic relaxation enhancement (PRE) data and the PRE profiles calculated from the docked structures shows that the method captures both specific and non-specific encounter complexes. To explore the energy landscape in the vicinity of the native structure, the nonlinear manifold describing the relative orientation of two solid bodies is projected onto a Euclidean space in which the shape of low energy regions is studied by principal component analysis. Results show that the energy surface is canyon-like, with a smooth funnel within a two dimensional subspace capturing over 75% of the total motion. Thus, proteins tend to associate along preferred pathways, similar to sliding of a protein along DNA in the process of protein-DNA recognition. DOI: http://dx.doi.org/10.7554/eLife.01370.001 PMID:24714491
Peptide-oligonucleotide conjugates as nanoscale building blocks for assembly of an artificial three-helix protein mimic

NASA Astrophysics Data System (ADS)

Lou, Chenguang; Martos-Maldonado, Manuel C.; Madsen, Charlotte S.; Thomsen, Rasmus P.; Midtgaard, Søren Roi; Christensen, Niels Johan; Kjems, Jørgen; Thulstrup, Peter W.; Wengel, Jesper; Jensen, Knud J.

2016-07-01

Peptide-based structures can be designed to yield artificial proteins with specific folding patterns and functions. Template-based assembly of peptide units is one design option, but the use of two orthogonal self-assembly principles, oligonucleotide triple helix and a coiled coil protein domain formation have never been realized for de novo protein design. Here, we show the applicability of peptide-oligonucleotide conjugates for self-assembly of higher-ordered protein-like structures. The resulting nano-assemblies were characterized by ultraviolet-melting, gel electrophoresis, circular dichroism (CD) spectroscopy, small-angle X-ray scattering and transmission electron microscopy. These studies revealed the formation of the desired triple helix and coiled coil domains at low concentrations, while a dimer of trimers was dominating at high concentration. CD spectroscopy showed an extraordinarily high degree of α-helicity for the peptide moieties in the assemblies. The results validate the use of orthogonal self-assembly principles as a paradigm for de novo protein design.
Three reasons protein disorder analysis makes more sense in the light of collagen

PubMed Central

Oates, Matt E.; Tompa, Peter; Gough, Julian

2016-01-01

Abstract We have identified that the collagen helix has the potential to be disruptive to analyses of intrinsically disordered proteins. The collagen helix is an extended fibrous structure that is both promiscuous and repetitive. Whilst its sequence is predicted to be disordered, this type of protein structure is not typically considered as intrinsic disorder. Here, we show that collagen‐encoding proteins skew the distribution of exon lengths in genes. We find that previous results, demonstrating that exons encoding disordered regions are more likely to be symmetric, are due to the abundance of the collagen helix. Other related results, showing increased levels of alternative splicing in disorder‐encoding exons, still hold after considering collagen‐containing proteins. Aside from analyses of exons, we find that the set of proteins that contain collagen significantly alters the amino acid composition of regions predicted as disordered. We conclude that research in this area should be conducted in the light of the collagen helix. PMID:26941008
Geometry motivated alternative view on local protein backbone structures.

PubMed

Zacharias, Jan; Knapp, Ernst Walter

2013-11-01

We present an alternative to the classical Ramachandran plot (R-plot) to display local protein backbone structure. Instead of the (φ, ψ)-backbone angles relating to the chemical architecture of polypeptides generic helical parameters are used. These are the rotation or twist angle ϑ and the helical rise parameter d. Plots with these parameters provide a different view on the nature of local protein backbone structures. It allows to display the local structures in polar (d, ϑ)-coordinates, which is not possible for an R-plot, where structural regimes connected by periodicity appear disconnected. But there are other advantages, like a clear discrimination of the handedness of a local structure, a larger spread of the different local structure domains--the latter can yield a better separation of different local secondary structure motives--and many more. Compared to the R-plot we are not aware of any major disadvantage to classify local polypeptide structures with the (d, ϑ)-plot, except that it requires some elementary computations. To facilitate usage of the new (d, ϑ)-plot for protein structures we provide a web application (http://agknapp.chemie.fu-berlin.de/secsass), which shows the (d, ϑ)-plot side-by-side with the R-plot. © 2013 The Protein Society.
DNA wrapping and distortion by an oligomeric homeodomain protein.

PubMed

Williams, Hannah; Jayaraman, Padma-Sheela; Gaston, Kevin

2008-10-31

Many transcription factors alter DNA or chromatin structure. Changes in chromatin structure are often brought about by the recruitment of chromatin-binding proteins, chromatin-modifying proteins, or other transcription co-activator or co-repressor proteins. However, some transcription factors form oligomeric assemblies that may themselves induce changes in DNA conformation and chromatin structure. The proline-rich homeodomain (PRH/Hex) protein is a transcription factor that regulates cell differentiation and cell proliferation, and has multiple roles in embryonic development. Earlier, we showed that PRH can repress transcription by multiple mechanisms, including the recruitment of co-repressor proteins belonging to the TLE family of chromatin-binding proteins. Our in vivo crosslinking studies have shown that PRH forms oligomeric complexes in cells and a variety of biophysical techniques suggest that the protein forms octamers. However, as yet we have little knowledge of the role played by PRH oligomerisation in the regulation of promoter activity or of the architecture of promoters that are regulated directly by PRH in cells. Here, we compare the binding of PRH and the isolated PRH homeodomain to DNA fragments with single and multiple PRH sites, using gel retardation assays and DNase I and chemical footprinting. We show that the PRH oligomer binds to multiple sites within the human Goosecoid promoter with high affinity and that the binding of PRH brings about DNA distortion. We suggest that PRH octamers wrap DNA in order to bring about transcriptional repression.
Sequence and structure insights of kazal type thrombin inhibitor protein: Studied with phylogeny, homology modeling and dynamic MM/GBSA studies.

PubMed

Jadhav, Aparna; Dash, RadhaCharan; Hirwani, Raj; Abdin, Malik

2018-03-01

Despite the wide medical importance of serine protease inhibitors, many of kazal type proteins are still to be explored. These thrombin inhibiting proteins are found in the digestive system of hematophagous organisms mainly Arthropods. We studied one of such protein i.e. Kazal type-1 protein from sand-fly Phlebotomus papatasi as its structure and interaction with thrombin is unclear. Initially, Dipetalin a kazal-follistasin domain protein was run through PSI-BLAST to retrieve related sequences. Using this set of sequence a phylogenetic tree was constructed, which identified a distantly related kazal type-1 protein. A three-dimensional structure was predicted for this protein and was aligned with Rhodniin for further evaluation. To have a comparative understanding of it's binding at the thrombin active site, the aligned kazal model-thrombin and rhodniin-thrombin complexes were subjected to molecular dynamics simulations. Dynamics analysis with reference to main chain RMSD, H-chain residue RMSF and total energy showed rhodniin-thrombin complex as a more stable system. Further, the MM/GBSA method was applied that calculated the binding free energy (ΔG binding ) for rhodniin and kazal model as -220.32kcal/Mol and -90.70kcal/Mol, respectively. Thus, it shows that kazal model has weaker bonding with thrombin, unlike rhodniin. Copyright © 2017 Elsevier B.V. All rights reserved.
Glycosylation of the severe acute respiratory syndrome coronavirus triple-spanning membrane proteins 3a and M.

PubMed

Oostra, M; de Haan, C A M; de Groot, R J; Rottier, P J M

2006-03-01

The severe acute respiratory syndrome coronavirus (SARS-CoV) open reading frame 3a protein has recently been shown to be a structural protein. The protein is encoded by one of the so-called group-specific genes and has no sequence homology with any of the known structural or group-specific proteins of coronaviruses. It does, however, have several similarities to the coronavirus M proteins; (i) they are triple membrane spanning with the same topology, (ii) they have similar intracellular localizations (predominantly Golgi), (iii) both are viral structural proteins, and (iv) they appear to interact with the E and S proteins, as well as with each other. The M protein plays a crucial role in coronavirus assembly and is glycosylated in all coronaviruses, either by N-linked or by O-linked oligosaccharides. The conserved glycosylation of the coronavirus M proteins and the resemblance of the 3a protein to them led us to investigate the glycosylation of these two SARS-CoV membrane proteins. The proteins were expressed separately using the vaccinia virus T7 expression system, followed by metabolic labeling. Pulse-chase analysis showed that both proteins were modified, although in different ways. While the M protein acquired cotranslationally oligosaccharides that could be removed by PNGaseF, the 3a protein acquired its modifications posttranslationally, and they were not sensitive to the N-glycosidase enzyme. The SARS-CoV 3a protein, however, was demonstrated to contain sialic acids, indicating the presence of oligosaccharides. O-glycosylation of the 3a protein was indeed confirmed using an in situ O-glycosylation assay of endoplasmic reticulum-retained mutants. In addition, we showed that substitution of serine and threonine residues in the ectodomain of the 3a protein abolished the addition of the O-linked sugars. Thus, the SARS-CoV 3a protein is an O-glycosylated glycoprotein, like the group 2 coronavirus M proteins but unlike the SARS-CoV M protein, which is N glycosylated.
Temperature inducible β-sheet structure in the transactivation domains of retroviral regulatory proteins of the Rev family

NASA Astrophysics Data System (ADS)

Thumb, Werner; Graf, Christine; Parslow, Tristram; Schneider, Rainer; Auer, Manfred

1999-11-01

The interaction of the human immunodeficiency virus type 1 (HIV-1) regulatory protein Rev with cellular cofactors is crucial for the viral life cycle. The HIV-1 Rev transactivation domain is functionally interchangeable with analog regions of Rev proteins of other retroviruses suggesting common folding patterns. In order to obtain experimental evidence for similar structural features mediating protein-protein contacts we investigated activation domain peptides from HIV-1, HIV-2, VISNA virus, feline immunodeficiency virus (FIV) and equine infectious anemia virus (EIAV) by CD spectroscopy, secondary structure prediction and sequence analysis. Although different in polarity and hydrophobicity, all peptides showed a similar behavior with respect to solution conformation, concentration dependence and variations in ionic strength and pH. Temperature studies revealed an unusual induction of β-structure with rising temperatures in all activation domain peptides. The high stability of β-structure in this region was demonstrated in three different peptides of the activation domain of HIV-1 Rev in solutions containing 40% hexafluoropropanol, a reagent usually known to induce α-helix into amino acid sequences. Sequence alignments revealed similarities between the polar effector domains from FIV and EIAV and the leucine rich (hydrophobic) effector domains found in HIV-1, HIV-2 and VISNA. Studies on activation domain peptides of two dominant negative HIV-1 Rev mutants, M10 and M32, pointed towards different reasons for the biological behavior. Whereas the peptide containing the M10 mutation (L 78E 79→D 78L 79) showed wild-type structure, the M32 mutant peptide (L 78L 81L 83→A 78A 81A 83) revealed a different protein fold to be the reason for the disturbed binding to cellular cofactors. From our data, we conclude, that the activation domain of Rev proteins from different viral origins adopt a similar fold and that a β-structural element is involved in binding to a cellular cofactor.
Accurate determination of interfacial protein secondary structure by combining interfacial-sensitive amide I and amide III spectral signals.

PubMed

Ye, Shuji; Li, Hongchun; Yang, Weilai; Luo, Yi

2014-01-29

Accurate determination of protein structures at the interface is essential to understand the nature of interfacial protein interactions, but it can only be done with a few, very limited experimental methods. Here, we demonstrate for the first time that sum frequency generation vibrational spectroscopy can unambiguously differentiate the interfacial protein secondary structures by combining surface-sensitive amide I and amide III spectral signals. This combination offers a powerful tool to directly distinguish random-coil (disordered) and α-helical structures in proteins. From a systematic study on the interactions between several antimicrobial peptides (including LKα14, mastoparan X, cecropin P1, melittin, and pardaxin) and lipid bilayers, it is found that the spectral profiles of the random-coil and α-helical structures are well separated in the amide III spectra, appearing below and above 1260 cm(-1), respectively. For the peptides with a straight backbone chain, the strength ratio for the peaks of the random-coil and α-helical structures shows a distinct linear relationship with the fraction of the disordered structure deduced from independent NMR experiments reported in the literature. It is revealed that increasing the fraction of negatively charged lipids can induce a conformational change of pardaxin from random-coil to α-helical structures. This experimental protocol can be employed for determining the interfacial protein secondary structures and dynamics in situ and in real time without extraneous labels.
Structural and Functional Characterization of an Ancient Bacterial Transglutaminase Sheds Light on the Minimal Requirements for Protein Cross-Linking.

PubMed

Fernandes, Catarina G; Plácido, Diana; Lousa, Diana; Brito, José A; Isidro, Anabela; Soares, Cláudio M; Pohl, Jan; Carrondo, Maria A; Archer, Margarida; Henriques, Adriano O

2015-09-22

Transglutaminases are best known for their ability to catalyze protein cross-linking reactions that impart chemical and physical resilience to cellular structures. Here, we report the crystal structure and characterization of Tgl, a transglutaminase from the bacterium Bacillus subtilis. Tgl is produced during sporulation and cross-links the surface of the highly resilient spore. Tgl-like proteins are found only in spore-forming bacteria of the Bacillus and Clostridia classes, indicating an ancient origin. Tgl is a single-domain protein, produced in active form, and the smallest transglutaminase characterized to date. We show that Tgl is structurally similar to bacterial cell wall endopeptidases and has an NlpC/P60 catalytic core, thought to represent the ancestral unit of the cysteine protease fold. We show that Tgl functions through a unique partially redundant catalytic dyad formed by Cys116 and Glu187 or Glu115. Strikingly, the catalytic Cys is insulated within a hydrophobic tunnel that traverses the molecule from side to side. The lack of similarity of Tgl to other transglutaminases together with its small size suggests that an NlpC/P60 catalytic core and insulation of the active site during catalysis may be essential requirements for protein cross-linking.
Structural Analysis of PTM Hotspots (SAPH-ire)--A Quantitative Informatics Method Enabling the Discovery of Novel Regulatory Elements in Protein Families.

PubMed

Dewhurst, Henry M; Choudhury, Shilpa; Torres, Matthew P

2015-08-01

Predicting the biological function potential of post-translational modifications (PTMs) is becoming increasingly important in light of the exponential increase in available PTM data from high-throughput proteomics. We developed structural analysis of PTM hotspots (SAPH-ire)--a quantitative PTM ranking method that integrates experimental PTM observations, sequence conservation, protein structure, and interaction data to allow rank order comparisons within or between protein families. Here, we applied SAPH-ire to the study of PTMs in diverse G protein families, a conserved and ubiquitous class of proteins essential for maintenance of intracellular structure (tubulins) and signal transduction (large and small Ras-like G proteins). A total of 1728 experimentally verified PTMs from eight unique G protein families were clustered into 451 unique hotspots, 51 of which have a known and cited biological function or response. Using customized software, the hotspots were analyzed in the context of 598 unique protein structures. By comparing distributions of hotspots with known versus unknown function, we show that SAPH-ire analysis is predictive for PTM biological function. Notably, SAPH-ire revealed high-ranking hotspots for which a functional impact has not yet been determined, including phosphorylation hotspots in the N-terminal tails of G protein gamma subunits--conserved protein structures never before reported as regulators of G protein coupled receptor signaling. To validate this prediction we used the yeast model system for G protein coupled receptor signaling, revealing that gamma subunit-N-terminal tail phosphorylation is activated in response to G protein coupled receptor stimulation and regulates protein stability in vivo. These results demonstrate the utility of integrating protein structural and sequence features into PTM prioritization schemes that can improve the analysis and functional power of modification-specific proteomics data. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Direct Observation of Insulin Association Dynamics with Time-Resolved X-ray Scattering

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rimmerman, Dolev; Leshchev, Denis; Hsu, Darren J.

Biological functions frequently require protein-protein interactions that involve secondary and tertiary structural perturbation. Here we study protein-protein dissociation and reassociation dynamics in insulin, a model system for protein oligomerization. Insulin dimer dissociation into monomers was induced by a nanosecond temperature-jump (T-jump) of ~8 °C in aqueous solution, and the resulting protein and solvent dynamics were tracked by time-resolved X-ray solution scattering (TRXSS) on time scales of 10 ns to 100 ms. The protein scattering signals revealed the formation of five distinguishable transient species during the association process that deviate from simple two state kinetics. Our results show that the combinationmore » of T-jump pump coupled to TRXSS probe allows for direct tracking of structural dynamics in nonphotoactive proteins.« less
Disulfide Bridges: Bringing Together Frustrated Structure in a Bioactive Peptide.

PubMed

Zhang, Yi; Schulten, Klaus; Gruebele, Martin; Bansal, Paramjit S; Wilson, David; Daly, Norelle L

2016-04-26

Disulfide bridges are commonly found covalent bonds that are usually believed to maintain structural stability of proteins. Here, we investigate the influence of disulfide bridges on protein dynamics through molecular dynamics simulations on the cysteine-rich trypsin inhibitor MCoTI-II with three disulfide bridges. Correlation analysis of the reduced cyclic peptide shows that two of the three disulfide distances (Cys(11)-Cys(23) and Cys(17)-Cys(29)) are anticorrelated within ∼1 μs of bridge formation or dissolution: when the peptide is in nativelike structures and one of the distances shortens to allow bond formation, the other tends to lengthen. Simulations over longer timescales, when the denatured state is less structured, do not show the anticorrelation. We propose that the native state contains structural elements that frustrate one another's folding, and that the two bridges are critical for snapping the frustrated native structure into place. In contrast, the Cys(4)-Cys(21) bridge is predicted to form together with either of the other two bridges. Indeed, experimental chromatography and nuclear magnetic resonance data show that an engineered peptide with the Cys(4)-Cys(21) bridge deleted can still fold into its near-native structure even in its noncyclic form, confirming the lesser role of the Cys(4)-Cys(21) bridge. The results highlight the importance of disulfide bridges in a small bioactive peptide to bring together frustrated structure in addition to maintaining protein structural stability. Copyright © 2016 Biophysical Society. Published by Elsevier Inc. All rights reserved.
The network organization of protein interactions in the spliceosome is reproduced by the simple rules of food-web models

PubMed Central

Pires, Mathias M.; Cantor, Maurício; Guimarães, Paulo R.; de Aguiar, Marcus A. M.; dos Reis, Sérgio F.; Coltri, Patricia P.

2015-01-01

The network structure of biological systems provides information on the underlying processes shaping their organization and dynamics. Here we examined the structure of the network depicting protein interactions within the spliceosome, the macromolecular complex responsible for splicing in eukaryotic cells. We show the interactions of less connected spliceosome proteins are nested subsets of the connections of the highly connected proteins. At the same time, the network has a modular structure with groups of proteins sharing similar interaction patterns. We then investigated the role of affinity and specificity in shaping the spliceosome network by adapting a probabilistic model originally designed to reproduce food webs. This food-web model was as successful in reproducing the structure of protein interactions as it is in reproducing interactions among species. The good performance of the model suggests affinity and specificity, partially determined by protein size and the timing of association to the complex, may be determining network structure. Moreover, because network models allow building ensembles of realistic networks while encompassing uncertainty they can be useful to examine the dynamics and vulnerability of intracelullar processes. Unraveling the mechanisms organizing the spliceosome interactions is important to characterize the role of individual proteins on splicing catalysis and regulation. PMID:26443080
Unexpected features of the dark proteome.

PubMed

Perdigão, Nelson; Heinrich, Julian; Stolte, Christian; Sabir, Kenneth S; Buckley, Michael J; Tabor, Bruce; Signal, Beth; Gloss, Brian S; Hammang, Christopher J; Rost, Burkhard; Schafferhans, Andrea; O'Donoghue, Seán I

2015-12-29

We surveyed the "dark" proteome-that is, regions of proteins never observed by experimental structure determination and inaccessible to homology modeling. For 546,000 Swiss-Prot proteins, we found that 44-54% of the proteome in eukaryotes and viruses was dark, compared with only ∼14% in archaea and bacteria. Surprisingly, most of the dark proteome could not be accounted for by conventional explanations, such as intrinsic disorder or transmembrane regions. Nearly half of the dark proteome comprised dark proteins, in which the entire sequence lacked similarity to any known structure. Dark proteins fulfill a wide variety of functions, but a subset showed distinct and largely unexpected features, such as association with secretion, specific tissues, the endoplasmic reticulum, disulfide bonding, and proteolytic cleavage. Dark proteins also had short sequence length, low evolutionary reuse, and few known interactions with other proteins. These results suggest new research directions in structural and computational biology.
An object programming based environment for protein secondary structure prediction.

PubMed

Giacomini, M; Ruggiero, C; Sacile, R

1996-01-01

The most frequently used methods for protein secondary structure prediction are empirical statistical methods and rule based methods. A consensus system based on object-oriented programming is presented, which integrates the two approaches with the aim of improving the prediction quality. This system uses an object-oriented knowledge representation based on the concepts of conformation, residue and protein, where the conformation class is the basis, the residue class derives from it and the protein class derives from the residue class. The system has been tested with satisfactory results on several proteins of the Brookhaven Protein Data Bank. Its results have been compared with the results of the most widely used prediction methods, and they show a higher prediction capability and greater stability. Moreover, the system itself provides an index of the reliability of its current prediction. This system can also be regarded as a basis structure for programs of this kind.
Protein Tertiary Structure Prediction Based on Main Chain Angle Using a Hybrid Bees Colony Optimization Algorithm

NASA Astrophysics Data System (ADS)

Mahmood, Zakaria N.; Mahmuddin, Massudi; Mahmood, Mohammed Nooraldeen

Encoding proteins of amino acid sequence to predict classified into their respective families and subfamilies is important research area. However for a given protein, knowing the exact action whether hormonal, enzymatic, transmembranal or nuclear receptors does not depend solely on amino acid sequence but on the way the amino acid thread folds as well. This study provides a prototype system that able to predict a protein tertiary structure. Several methods are used to develop and evaluate the system to produce better accuracy in protein 3D structure prediction. The Bees Optimization algorithm which inspired from the honey bees food foraging method, is used in the searching phase. In this study, the experiment is conducted on short sequence proteins that have been used by the previous researches using well-known tools. The proposed approach shows a promising result.
Unexpected features of the dark proteome

PubMed Central

Perdigão, Nelson; Heinrich, Julian; Stolte, Christian; Sabir, Kenneth S.; Buckley, Michael J.; Tabor, Bruce; Signal, Beth; Gloss, Brian S.; Hammang, Christopher J.; Rost, Burkhard; Schafferhans, Andrea

2015-01-01

We surveyed the “dark” proteome–that is, regions of proteins never observed by experimental structure determination and inaccessible to homology modeling. For 546,000 Swiss-Prot proteins, we found that 44–54% of the proteome in eukaryotes and viruses was dark, compared with only ∼14% in archaea and bacteria. Surprisingly, most of the dark proteome could not be accounted for by conventional explanations, such as intrinsic disorder or transmembrane regions. Nearly half of the dark proteome comprised dark proteins, in which the entire sequence lacked similarity to any known structure. Dark proteins fulfill a wide variety of functions, but a subset showed distinct and largely unexpected features, such as association with secretion, specific tissues, the endoplasmic reticulum, disulfide bonding, and proteolytic cleavage. Dark proteins also had short sequence length, low evolutionary reuse, and few known interactions with other proteins. These results suggest new research directions in structural and computational biology. PMID:26578815

The HPr Proteins from the Thermophile Bacillus stearothermophilus Can Form Domain-swapped Dimers

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sridharan, Sudharsan; Razvi, Abbas; Scholtz, J. Martin

2010-07-20

The study of proteins from extremophilic organisms continues to generate interest in the field of protein folding because paradigms explaining the enhanced stability of these proteins still elude us and such studies have the potential to further our knowledge of the forces stabilizing proteins. We have undertaken such a study with our model protein HPr from a mesophile, Bacillus subtilis, and a thermophile, Bacillus stearothermophilus. We report here the high-resolution structures of the wild-type HPr protein from the thermophile and a variant, F29W. The variant proved to crystallize in two forms: a monomeric form with a structure very similar tomore » the wild-type protein as well as a domain-swapped dimer. Interestingly, the structure of the domain-swapped dimer for HPr is very different from that observed for a homologous protein, Crh, from B. subtilis. The existence of a domain-swapped dimer has implications for amyloid formation and is consistent with recent results showing that the HPr proteins can form amyloid fibrils. We also characterized the conformational stability of the thermophilic HPr proteins using thermal and solvent denaturation methods and have used the high-resolution structures in an attempt to explain the differences in stability between the different HPr proteins. Finally, we present a detailed analysis of the solution properties of the HPr proteins using a variety of biochemical and biophysical methods.« less
A probabilistic model for detecting rigid domains in protein structures.

PubMed

Nguyen, Thach; Habeck, Michael

2016-09-01

Large-scale conformational changes in proteins are implicated in many important biological functions. These structural transitions can often be rationalized in terms of relative movements of rigid domains. There is a need for objective and automated methods that identify rigid domains in sets of protein structures showing alternative conformational states. We present a probabilistic model for detecting rigid-body movements in protein structures. Our model aims to approximate alternative conformational states by a few structural parts that are rigidly transformed under the action of a rotation and a translation. By using Bayesian inference and Markov chain Monte Carlo sampling, we estimate all parameters of the model, including a segmentation of the protein into rigid domains, the structures of the domains themselves, and the rigid transformations that generate the observed structures. We find that our Gibbs sampling algorithm can also estimate the optimal number of rigid domains with high efficiency and accuracy. We assess the power of our method on several thousand entries of the DynDom database and discuss applications to various complex biomolecular systems. The Python source code for protein ensemble analysis is available at: https://github.com/thachnguyen/motion_detection : mhabeck@gwdg.de. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
JavaProtein Dossier: a novel web-based data visualization tool for comprehensive analysis of protein structure

PubMed Central

Neshich, Goran; Rocchia, Walter; Mancini, Adauto L.; Yamagishi, Michel E. B.; Kuser, Paula R.; Fileto, Renato; Baudet, Christian; Pinto, Ivan P.; Montagner, Arnaldo J.; Palandrani, Juliana F.; Krauchenco, Joao N.; Torres, Renato C.; Souza, Savio; Togawa, Roberto C.; Higa, Roberto H.

2004-01-01

JavaProtein Dossier (JPD) is a new concept, database and visualization tool providing one of the largest collections of the physicochemical parameters describing proteins' structure, stability, function and interaction with other macromolecules. By collecting as many descriptors/parameters as possible within a single database, we can achieve a better use of the available data and information. Furthermore, data grouping allows us to generate different parameters with the potential to provide new insights into the sequence–structure–function relationship. In JPD, residue selection can be performed according to multiple criteria. JPD can simultaneously display and analyze all the physicochemical parameters of any pair of structures, using precalculated structural alignments, allowing direct parameter comparison at corresponding amino acid positions among homologous structures. In order to focus on the physicochemical (and consequently pharmacological) profile of proteins, visualization tools (showing the structure and structural parameters) also had to be optimized. Our response to this challenge was the use of Java technology with its exceptional level of interactivity. JPD is freely accessible (within the Gold Sting Suite) at http://sms.cbi.cnptia.embrapa.br, http://mirrors.rcsb.org/SMS, http://trantor.bioc.columbia.edu/SMS and http://www.es.embnet.org/SMS/ (Option: JavaProtein Dossier). PMID:15215458
The Methionine-aromatic Motif Plays a Unique Role in Stabilizing Protein Structure*

PubMed Central

Valley, Christopher C.; Cembran, Alessandro; Perlmutter, Jason D.; Lewis, Andrew K.; Labello, Nicholas P.; Gao, Jiali; Sachs, Jonathan N.

2012-01-01

Of the 20 amino acids, the precise function of methionine (Met) remains among the least well understood. To establish a determining characteristic of methionine that fundamentally differentiates it from purely hydrophobic residues, we have used in vitro cellular experiments, molecular simulations, quantum calculations, and a bioinformatics screen of the Protein Data Bank. We show that approximately one-third of all known protein structures contain an energetically stabilizing Met-aromatic motif and, remarkably, that greater than 10,000 structures contain this motif more than 10 times. Critically, we show that as compared with a purely hydrophobic interaction, the Met-aromatic motif yields an additional stabilization of 1–1.5 kcal/mol. To highlight its importance and to dissect the energetic underpinnings of this motif, we have studied two clinically relevant TNF ligand-receptor complexes, namely TRAIL-DR5 and LTα-TNFR1. In both cases, we show that the motif is necessary for high affinity ligand binding as well as function. Additionally, we highlight previously overlooked instances of the motif in several disease-related Met mutations. Our results strongly suggest that the Met-aromatic motif should be exploited in the rational design of therapeutics targeting a range of proteins. PMID:22859300
Geomfinder: a multi-feature identifier of similar three-dimensional protein patterns: a ligand-independent approach.

PubMed

Núñez-Vivanco, Gabriel; Valdés-Jiménez, Alejandro; Besoaín, Felipe; Reyes-Parada, Miguel

2016-01-01

Since the structure of proteins is more conserved than the sequence, the identification of conserved three-dimensional (3D) patterns among a set of proteins, can be important for protein function prediction, protein clustering, drug discovery and the establishment of evolutionary relationships. Thus, several computational applications to identify, describe and compare 3D patterns (or motifs) have been developed. Often, these tools consider a 3D pattern as that described by the residues surrounding co-crystallized/docked ligands available from X-ray crystal structures or homology models. Nevertheless, many of the protein structures stored in public databases do not provide information about the location and characteristics of ligand binding sites and/or other important 3D patterns such as allosteric sites, enzyme-cofactor interaction motifs, etc. This makes necessary the development of new ligand-independent methods to search and compare 3D patterns in all available protein structures. Here we introduce Geomfinder, an intuitive, flexible, alignment-free and ligand-independent web server for detailed estimation of similarities between all pairs of 3D patterns detected in any two given protein structures. We used around 1100 protein structures to form pairs of proteins which were assessed with Geomfinder. In these analyses each protein was considered in only one pair (e.g. in a subset of 100 different proteins, 50 pairs of proteins can be defined). Thus: (a) Geomfinder detected identical pairs of 3D patterns in a series of monoamine oxidase-B structures, which corresponded to the effectively similar ligand binding sites at these proteins; (b) we identified structural similarities among pairs of protein structures which are targets of compounds such as acarbose, benzamidine, adenosine triphosphate and pyridoxal phosphate; these similar 3D patterns are not detected using sequence-based methods; (c) the detailed evaluation of three specific cases showed the versatility of Geomfinder, which was able to discriminate between similar and different 3D patterns related to binding sites of common substrates in a range of diverse proteins. Geomfinder allows detecting similar 3D patterns between any two pair of protein structures, regardless of the divergency among their amino acids sequences. Although the software is not intended for simultaneous multiple comparisons in a large number of proteins, it can be particularly useful in cases such as the structure-based design of multitarget drugs, where a detailed analysis of 3D patterns similarities between a few selected protein targets is essential.
Discrete Molecular Dynamics Can Predict Helical Prestructured Motifs in Disordered Proteins

PubMed Central

Han, Kyou-Hoon; Dokholyan, Nikolay V.; Tompa, Péter; Kalmár, Lajos; Hegedűs, Tamás

2014-01-01

Intrinsically disordered proteins (IDPs) lack a stable tertiary structure, but their short binding regions termed Pre-Structured Motifs (PreSMo) can form transient secondary structure elements in solution. Although disordered proteins are crucial in many biological processes and designing strategies to modulate their function is highly important, both experimental and computational tools to describe their conformational ensembles and the initial steps of folding are sparse. Here we report that discrete molecular dynamics (DMD) simulations combined with replica exchange (RX) method efficiently samples the conformational space and detects regions populating α-helical conformational states in disordered protein regions. While the available computational methods predict secondary structural propensities in IDPs based on the observation of protein-protein interactions, our ab initio method rests on physical principles of protein folding and dynamics. We show that RX-DMD predicts α-PreSMos with high confidence confirmed by comparison to experimental NMR data. Moreover, the method also can dissect α-PreSMos in close vicinity to each other and indicate helix stability. Importantly, simulations with disordered regions forming helices in X-ray structures of complexes indicate that a preformed helix is frequently the binding element itself, while in other cases it may have a role in initiating the binding process. Our results indicate that RX-DMD provides a breakthrough in the structural and dynamical characterization of disordered proteins by generating the structural ensembles of IDPs even when experimental data are not available. PMID:24763499
The HsiB1C1 (TssB-TssC) Complex of the Pseudomonas aeruginosa Type VI Secretion System Forms a Bacteriophage Tail Sheathlike Structure

PubMed Central

Lossi, Nadine S.; Manoli, Eleni; Förster, Andreas; Dajani, Rana; Pape, Tillmann; Freemont, Paul; Filloux, Alain

2013-01-01

Protein secretion systems in Gram-negative bacteria evolved into a variety of molecular nanomachines. They are related to cell envelope complexes, which are involved in assembly of surface appendages or transport of solutes. They are classified as types, the most recent addition being the type VI secretion system (T6SS). The T6SS displays similarities to bacteriophage tail, which drives DNA injection into bacteria. The Hcp protein is related to the T4 bacteriophage tail tube protein gp19, whereas VgrG proteins structurally resemble the gp27/gp5 puncturing device of the phage. The tube and spike of the phage are pushed through the bacterial envelope upon contraction of a tail sheath composed of gp18. In Vibrio cholerae it was proposed that VipA and VipB assemble into a tail sheathlike structure. Here we confirm these previous data by showing that HsiB1 and HsiC1 of the Pseudomonas aeruginosa H1-T6SS assemble into tubules resulting from stacking of cogwheel-like structures showing predominantly 12-fold symmetry. The internal diameter of the cogwheels is ∼100 Å, which is large enough to accommodate an Hcp tube whose external diameter has been reported to be 85 Å. The N-terminal 212 residues of HsiC1 are sufficient to form a stable complex with HsiB1, but the C terminus of HsiC1 is essential for the formation of the tubelike structure. Bioinformatics analysis suggests that HsiC1 displays similarities to gp18-like proteins in its C-terminal region. In conclusion, we provide further structural and mechanistic insights into the T6SS and show that a phage sheathlike structure is likely to be a conserved element across all T6SSs. PMID:23341461
In silico characterization and analysis of RTBP1 and NgTRF1 protein through MD simulation and molecular docking - A comparative study.

PubMed

Mukherjee, Koel; Pandey, Dev Mani; Vidyarthi, Ambarish Saran

2015-02-06

Gaining access to sequence and structure information of telomere binding proteins helps in understanding the essential biological processes involve in conserved sequence specific interaction between DNA and the proteins. Rice telomere binding protein (RTBP1) and Nicotiana glutinosa telomere repeat binding factor (NgTRF1) are helix turn helix motif type of proteins that plays role in telomeric DNA protection and length regulation. Both the proteins share same type of domain but till now there is very less communication on the in silico studies of these complete proteins.Here we intend to do a comparative study between two proteins through modeling of the complete proteins, physiochemical characterization, MD simulation and DNA-protein docking. I-TASSER and CLC protein work bench was performed to find out the protein 3D structure as well as the different parameters to characterize the proteins. MD simulation was completed by GROMOS forcefield of GROMACS for 10 ns of time stretch. The simulated 3D structures were docked with template DNA (3D DNA modeled through 3D-DART) of TTTAGGG conserved sequence motif using HADDOCK web server.Digging up all the facts about the proteins it was reveled that around 120 amino acids in the tail part was showing a good sequence similarity between the proteins. Molecular modeling, sequence characterization and secondary structure prediction also indicates the similarity between the protein's structure and sequence. The result of MD simulation highlights on the RMSD, RMSF, Rg, PCA and Energy plots which also conveys the similar type of motional behavior between them. The best complex formation for both the proteins in docking result also indicates for the first interaction site which is mainly the helix3 region of the DNA binding domain. The overall computational analysis reveals that RTBP1 and NgTRF1 proteins display good amount of similarity in their physicochemical properties, structure, dynamics and binding mode.
In Silico Characterization and Analysis of RTBP1 and NgTRF1 Protein Through MD Simulation and Molecular Docking: A Comparative Study.

PubMed

Mukherjee, Koel; Pandey, Dev Mani; Vidyarthi, Ambarish Saran

2015-09-01

Gaining access to sequence and structure information of telomere-binding proteins helps in understanding the essential biological processes involve in conserved sequence-specific interaction between DNA and the proteins. Rice telomere-binding protein (RTBP1) and Nicotiana glutinosa telomere repeat binding factor (NgTRF1) are helix-turn-helix motif type of proteins that plays role in telomeric DNA protection and length regulation. Both the proteins share same type of domain, but till now there is very less communication on the in silico studies of these complete proteins. Here we intend to do a comparative study between two proteins through modeling of the complete proteins, physiochemical characterization, MD simulation and DNA-protein docking. I-TASSER and CLC protein work bench was performed to find out the protein 3D structure as well as the different parameters to characterize the proteins. MD simulation was completed by GROMOS forcefield of GROMACS for 10 ns of time stretch. The simulated 3D structures were docked with template DNA (3D DNA modeled through 3D-DART) of TTTAGGG conserved sequence motif using HADDOCK Web server. By digging up all the facts about the proteins, it was revealed that around 120 amino acids in the tail part were showing a good sequence similarity between the proteins. Molecular modeling, sequence characterization and secondary structure prediction also indicate the similarity between the protein's structure and sequence. The result of MD simulation highlights on the RMSD, RMSF, Rg, PCA and energy plots which also conveys the similar type of motional behavior between them. The best complex formation for both the proteins in docking result also indicates for the first interaction site which is mainly the helix3 region of the DNA-binding domain. The overall computational analysis reveals that RTBP1 and NgTRF1 proteins display good amount of similarity in their physicochemical properties, structure, dynamics and binding mode.
Tighter Ligand Binding Can Compensate for Impaired Stability of an RNA-Binding Protein.

PubMed

Wallis, Christopher P; Richman, Tara R; Filipovska, Aleksandra; Rackham, Oliver

2018-06-15

It has been widely shown that ligand-binding residues, by virtue of their orientation, charge, and solvent exposure, often have a net destabilizing effect on proteins that is offset by stability conferring residues elsewhere in the protein. This structure-function trade-off can constrain possible adaptive evolutionary changes of function and may hamper protein engineering efforts to design proteins with new functions. Here, we present evidence from a large randomized mutant library screen that, in the case of PUF RNA-binding proteins, this structural relationship may be inverted and that active-site mutations that increase protein activity are also able to compensate for impaired stability. We show that certain mutations in RNA-protein binding residues are not necessarily destabilizing and that increased ligand-binding can rescue an insoluble, unstable PUF protein. We hypothesize that these mutations restabilize the protein via thermodynamic coupling of protein folding and RNA binding.
Arc is a flexible modular protein capable of reversible self-oligomerization

PubMed Central

Myrum, Craig; Baumann, Anne; Bustad, Helene J.; Flydal, Marte Innselset; Mariaule, Vincent; Alvira, Sara; Cuéllar, Jorge; Haavik, Jan; Soulé, Jonathan; Valpuesta, José Maria; Márquez, José Antonio; Martinez, Aurora; Bramham, Clive R.

2015-01-01

The immediate early gene product Arc (activity-regulated cytoskeleton-associated protein) is posited as a master regulator of long-term synaptic plasticity and memory. However, the physicochemical and structural properties of Arc have not been elucidated. In the present study, we expressed and purified recombinant human Arc (hArc) and performed the first biochemical and biophysical analysis of hArc's structure and stability. Limited proteolysis assays and MS analysis indicate that hArc has two major domains on either side of a central more disordered linker region, consistent with in silico structure predictions. hArc's secondary structure was estimated using CD, and stability was analysed by CD-monitored thermal denaturation and differential scanning fluorimetry (DSF). Oligomerization states under different conditions were studied by dynamic light scattering (DLS) and visualized by AFM and EM. Biophysical analyses show that hArc is a modular protein with defined secondary structure and loose tertiary structure. hArc appears to be pyramid-shaped as a monomer and is capable of reversible self-association, forming large soluble oligomers. The N-terminal domain of hArc is highly basic, which may promote interaction with cytoskeletal structures or other polyanionic surfaces, whereas the C-terminal domain is acidic and stabilized by ionic conditions that promote oligomerization. Upon binding of presenilin-1 (PS1) peptide, hArc undergoes a large structural change. A non-synonymous genetic variant of hArc (V231G) showed properties similar to the wild-type (WT) protein. We conclude that hArc is a flexible multi-domain protein that exists in monomeric and oligomeric forms, compatible with a diverse, hub-like role in plasticity-related processes. PMID:25748042
Computational analysis of sequence selection mechanisms.

PubMed

Meyerguz, Leonid; Grasso, Catherine; Kleinberg, Jon; Elber, Ron

2004-04-01

Mechanisms leading to gene variations are responsible for the diversity of species and are important components of the theory of evolution. One constraint on gene evolution is that of protein foldability; the three-dimensional shapes of proteins must be thermodynamically stable. We explore the impact of this constraint and calculate properties of foldable sequences using 3660 structures from the Protein Data Bank. We seek a selection function that receives sequences as input, and outputs survival probability based on sequence fitness to structure. We compute the number of sequences that match a particular protein structure with energy lower than the native sequence, the density of the number of sequences, the entropy, and the "selection" temperature. The mechanism of structure selection for sequences longer than 200 amino acids is approximately universal. For shorter sequences, it is not. We speculate on concrete evolutionary mechanisms that show this behavior.
2BC Non-Structural Protein of Enterovirus A71 Interacts with SNARE Proteins to Trigger Autolysosome Formation.

PubMed

Lai, Jeffrey K F; Sam, I-Ching; Verlhac, Pauline; Baguet, Joël; Eskelinen, Eeva-Liisa; Faure, Mathias; Chan, Yoke Fun

2017-07-04

Viruses have evolved unique strategies to evade or subvert autophagy machinery. Enterovirus A71 (EV-A71) induces autophagy during infection in vitro and in vivo. In this study, we report that EV-A71 triggers autolysosome formation during infection in human rhabdomyosarcoma (RD) cells to facilitate its replication. Blocking autophagosome-lysosome fusion with chloroquine inhibited virus RNA replication, resulting in lower viral titres, viral RNA copies and viral proteins. Overexpression of the non-structural protein 2BC of EV-A71 induced autolysosome formation. Yeast 2-hybrid and co-affinity purification assays showed that 2BC physically and specifically interacted with a N -ethylmaleimide-sensitive factor attachment receptor (SNARE) protein, syntaxin-17 (STX17). Co-immunoprecipitation assay further showed that 2BC binds to SNARE proteins, STX17 and synaptosome associated protein 29 (SNAP29). Transient knockdown of STX17, SNAP29, and microtubule-associated protein 1 light chain 3B (LC3B), crucial proteins in the fusion between autophagosomes and lysosomes) as well as the lysosomal-associated membrane protein 1 (LAMP1) impaired production of infectious EV-A71 in RD cells. Collectively, these results demonstrate that the generation of autolysosomes triggered by the 2BC non-structural protein is important for EV-A71 replication, revealing a potential molecular pathway targeted by the virus to exploit autophagy. This study opens the possibility for the development of novel antivirals that specifically target 2BC to inhibit formation of autolysosomes during EV-A71 infection.
2BC Non-Structural Protein of Enterovirus A71 Interacts with SNARE Proteins to Trigger Autolysosome Formation

PubMed Central

Lai, Jeffrey K. F.; Sam, I-Ching; Verlhac, Pauline; Baguet, Joël; Faure, Mathias

2017-01-01

Viruses have evolved unique strategies to evade or subvert autophagy machinery. Enterovirus A71 (EV-A71) induces autophagy during infection in vitro and in vivo. In this study, we report that EV-A71 triggers autolysosome formation during infection in human rhabdomyosarcoma (RD) cells to facilitate its replication. Blocking autophagosome-lysosome fusion with chloroquine inhibited virus RNA replication, resulting in lower viral titres, viral RNA copies and viral proteins. Overexpression of the non-structural protein 2BC of EV-A71 induced autolysosome formation. Yeast 2-hybrid and co-affinity purification assays showed that 2BC physically and specifically interacted with a N-ethylmaleimide-sensitive factor attachment receptor (SNARE) protein, syntaxin-17 (STX17). Co-immunoprecipitation assay further showed that 2BC binds to SNARE proteins, STX17 and synaptosome associated protein 29 (SNAP29). Transient knockdown of STX17, SNAP29, and microtubule-associated protein 1 light chain 3B (LC3B), crucial proteins in the fusion between autophagosomes and lysosomes) as well as the lysosomal-associated membrane protein 1 (LAMP1) impaired production of infectious EV-A71 in RD cells. Collectively, these results demonstrate that the generation of autolysosomes triggered by the 2BC non-structural protein is important for EV-A71 replication, revealing a potential molecular pathway targeted by the virus to exploit autophagy. This study opens the possibility for the development of novel antivirals that specifically target 2BC to inhibit formation of autolysosomes during EV-A71 infection. PMID:28677644
Polymeric assembly of gluten proteins in an aqueous ethanol solvent.

PubMed

Dahesh, Mohsen; Banc, Amélie; Duri, Agnès; Morel, Marie-Hélène; Ramos, Laurence

2014-09-25

The supramolecular organization of wheat gluten proteins is largely unknown due to the intrinsic complexity of this family of proteins and their insolubility in water. We fractionate gluten in a water/ethanol mixture (50/50 v/v) and obtain a protein extract which is depleted in gliadin, the monomeric part of wheat gluten proteins, and enriched in glutenin, the polymeric part of wheat gluten proteins. We investigate the structure of the proteins in the solvent used for extraction over a wide range of concentration, by combining X-ray scattering and multiangle static and dynamic light scattering. Our data show that, in the ethanol/water mixture, the proteins display features characteristic of flexible polymer chains in a good solvent. In the dilute regime, the proteins form very loose structures of characteristic size 150 nm, with an internal dynamics which is quantitatively similar to that of branched polymer coils. In more concentrated regimes, data highlight a hierarchical structure with one characteristic length scale of the order of a few nm, which displays the scaling with concentration expected for a semidilute polymer in good solvent, and a fractal arrangement at a much larger length scale. This structure is strikingly similar to that of polymeric gels, thus providing some factual knowledge to rationalize the viscoelastic properties of wheat gluten proteins and their assemblies.
Template-Based Modeling of Protein-RNA Interactions

PubMed Central

Zheng, Jinfang; Kundrotas, Petras J.; Vakser, Ilya A.

2016-01-01

Protein-RNA complexes formed by specific recognition between RNA and RNA-binding proteins play an important role in biological processes. More than a thousand of such proteins in human are curated and many novel RNA-binding proteins are to be discovered. Due to limitations of experimental approaches, computational techniques are needed for characterization of protein-RNA interactions. Although much progress has been made, adequate methodologies reliably providing atomic resolution structural details are still lacking. Although protein-RNA free docking approaches proved to be useful, in general, the template-based approaches provide higher quality of predictions. Templates are key to building a high quality model. Sequence/structure relationships were studied based on a representative set of binary protein-RNA complexes from PDB. Several approaches were tested for pairwise target/template alignment. The analysis revealed a transition point between random and correct binding modes. The results showed that structural alignment is better than sequence alignment in identifying good templates, suitable for generating protein-RNA complexes close to the native structure, and outperforms free docking, successfully predicting complexes where the free docking fails, including cases of significant conformational change upon binding. A template-based protein-RNA interaction modeling protocol PRIME was developed and benchmarked on a representative set of complexes. PMID:27662342
An Integrated Framework Advancing Membrane Protein Modeling and Design

PubMed Central

Weitzner, Brian D.; Duran, Amanda M.; Tilley, Drew C.; Elazar, Assaf; Gray, Jeffrey J.

2015-01-01

Membrane proteins are critical functional molecules in the human body, constituting more than 30% of open reading frames in the human genome. Unfortunately, a myriad of difficulties in overexpression and reconstitution into membrane mimetics severely limit our ability to determine their structures. Computational tools are therefore instrumental to membrane protein structure prediction, consequently increasing our understanding of membrane protein function and their role in disease. Here, we describe a general framework facilitating membrane protein modeling and design that combines the scientific principles for membrane protein modeling with the flexible software architecture of Rosetta3. This new framework, called RosettaMP, provides a general membrane representation that interfaces with scoring, conformational sampling, and mutation routines that can be easily combined to create new protocols. To demonstrate the capabilities of this implementation, we developed four proof-of-concept applications for (1) prediction of free energy changes upon mutation; (2) high-resolution structural refinement; (3) protein-protein docking; and (4) assembly of symmetric protein complexes, all in the membrane environment. Preliminary data show that these algorithms can produce meaningful scores and structures. The data also suggest needed improvements to both sampling routines and score functions. Importantly, the applications collectively demonstrate the potential of combining the flexible nature of RosettaMP with the power of Rosetta algorithms to facilitate membrane protein modeling and design. PMID:26325167
Predicting beta-turns in proteins using support vector machines with fractional polynomials

PubMed Central

2013-01-01

Background β-turns are secondary structure type that have essential role in molecular recognition, protein folding, and stability. They are found to be the most common type of non-repetitive structures since 25% of amino acids in protein structures are situated on them. Their prediction is considered to be one of the crucial problems in bioinformatics and molecular biology, which can provide valuable insights and inputs for the fold recognition and drug design. Results We propose an approach that combines support vector machines (SVMs) and logistic regression (LR) in a hybrid prediction method, which we call (H-SVM-LR) to predict β-turns in proteins. Fractional polynomials are used for LR modeling. We utilize position specific scoring matrices (PSSMs) and predicted secondary structure (PSS) as features. Our simulation studies show that H-SVM-LR achieves Qtotal of 82.87%, 82.84%, and 82.32% on the BT426, BT547, and BT823 datasets respectively. These values are the highest among other β-turns prediction methods that are based on PSSMs and secondary structure information. H-SVM-LR also achieves favorable performance in predicting β-turns as measured by the Matthew's correlation coefficient (MCC) on these datasets. Furthermore, H-SVM-LR shows good performance when considering shape strings as additional features. Conclusions In this paper, we present a comprehensive approach for β-turns prediction. Experiments show that our proposed approach achieves better performance compared to other competing prediction methods. PMID:24565438
Predicting beta-turns in proteins using support vector machines with fractional polynomials.

PubMed

Elbashir, Murtada; Wang, Jianxin; Wu, Fang-Xiang; Wang, Lusheng

2013-11-07

β-turns are secondary structure type that have essential role in molecular recognition, protein folding, and stability. They are found to be the most common type of non-repetitive structures since 25% of amino acids in protein structures are situated on them. Their prediction is considered to be one of the crucial problems in bioinformatics and molecular biology, which can provide valuable insights and inputs for the fold recognition and drug design. We propose an approach that combines support vector machines (SVMs) and logistic regression (LR) in a hybrid prediction method, which we call (H-SVM-LR) to predict β-turns in proteins. Fractional polynomials are used for LR modeling. We utilize position specific scoring matrices (PSSMs) and predicted secondary structure (PSS) as features. Our simulation studies show that H-SVM-LR achieves Qtotal of 82.87%, 82.84%, and 82.32% on the BT426, BT547, and BT823 datasets respectively. These values are the highest among other β-turns prediction methods that are based on PSSMs and secondary structure information. H-SVM-LR also achieves favorable performance in predicting β-turns as measured by the Matthew's correlation coefficient (MCC) on these datasets. Furthermore, H-SVM-LR shows good performance when considering shape strings as additional features. In this paper, we present a comprehensive approach for β-turns prediction. Experiments show that our proposed approach achieves better performance compared to other competing prediction methods.
Effect of ethanol concentrations on temperature driven structural changes of chymotrypsin inhibitor 2

NASA Astrophysics Data System (ADS)

Mohanta, Dayanidhi; Jana, Madhurima

2016-04-01

A series of atomistic molecular dynamics (MD) simulations of a small enzymatic protein Chymotrypsin Inhibitor 2 (CI2) in water-ethanol mixed solutions were carried out to explore the underlying mechanism of ethanol driven conformational changes of the protein. Efforts have been made to probe the influence of ethanol concentrations ranging from 0% to 75% (v/v) at ambient condition (300 K (T1)) and at elevated temperatures (375 K (T2) and 450 K (T3)) to investigate the temperature induced conformational changes of the protein further. Our study showed that the effect of varying ethanol concentrations on protein's structure is almost insignificant at T1 and T2 temperatures whereas at T3 temperature, partial unfolding of CI2 in 10% ethanol solution followed by full unfolding of the protein at ethanol concentrations above 25% occurs. However, interestingly, at T3 temperature CI2's native structure was found to be retained in pure water (0% ethanol solution) indicating that the cosolvent ethanol do play an important role in thermal denaturation of CI2. Such observations were quantified in the light of root-mean-square deviations (RMSDs) and radius of gyration. Although higher RMSD values of β-sheet over α-helix indicate complete destruction of the β-structure of CI2 at high ethanol concentrations, the associated time scale showed that the faster melting of α-helix happens over β-sheet. Around 60%-80% of initial native contacts of the protein were found broken with the separation of hydrophobic core consisting eleven residues at ethanol concentrations greater than 25%. This leads protein to expand with the increase in solvent accessible surface area. The interactions between protein and solvent molecules showed that protein's solvation shell preferred to accommodate ethanol molecules as compared to water thereby excluded water molecules from CI2's surface. Further, concentration dependent differential self-aggregation behavior of ethanol is likely to regulate the replacement of relatively fast diffused water by low diffused ethanol molecules from protein's surface during the unfolding process.

Protein Folding and Structure Prediction from the Ground Up: The Atomistic Associative Memory, Water Mediated, Structure and Energy Model.

PubMed

Chen, Mingchen; Lin, Xingcheng; Zheng, Weihua; Onuchic, José N; Wolynes, Peter G

2016-08-25

The associative memory, water mediated, structure and energy model (AWSEM) is a coarse-grained force field with transferable tertiary interactions that incorporates local in sequence energetic biases using bioinformatically derived structural information about peptide fragments with locally similar sequences that we call memories. The memory information from the protein data bank (PDB) database guides proper protein folding. The structural information about available sequences in the database varies in quality and can sometimes lead to frustrated free energy landscapes locally. One way out of this difficulty is to construct the input fragment memory information from all-atom simulations of portions of the complete polypeptide chain. In this paper, we investigate this approach first put forward by Kwac and Wolynes in a more complete way by studying the structure prediction capabilities of this approach for six α-helical proteins. This scheme which we call the atomistic associative memory, water mediated, structure and energy model (AAWSEM) amounts to an ab initio protein structure prediction method that starts from the ground up without using bioinformatic input. The free energy profiles from AAWSEM show that atomistic fragment memories are sufficient to guide the correct folding when tertiary forces are included. AAWSEM combines the efficiency of coarse-grained simulations on the full protein level with the local structural accuracy achievable from all-atom simulations of only parts of a large protein. The results suggest that a hybrid use of atomistic fragment memory and database memory in structural predictions may well be optimal for many practical applications.
The crystal structure of the catalytic domain of the ser/thr kinase PknA from M. tuberculosis shows an Src-like autoinhibited conformation.

PubMed

Wagner, Tristan; Alexandre, Matthieu; Duran, Rosario; Barilone, Nathalie; Wehenkel, Annemarie; Alzari, Pedro M; Bellinzoni, Marco

2015-05-01

Signal transduction mediated by Ser/Thr phosphorylation in Mycobacterium tuberculosis has been intensively studied in the last years, as its genome harbors eleven genes coding for eukaryotic-like Ser/Thr kinases. Here we describe the crystal structure and the autophosphorylation sites of the catalytic domain of PknA, one of two protein kinases essential for pathogen's survival. The structure of the ligand-free kinase domain shows an auto-inhibited conformation similar to that observed in human Tyr kinases of the Src-family. These results reinforce the high conservation of structural hallmarks and regulation mechanisms between prokaryotic and eukaryotic protein kinases. © 2015 Wiley Periodicals, Inc.
MetaGO: Predicting Gene Ontology of Non-homologous Proteins Through Low-Resolution Protein Structure Prediction and Protein-Protein Network Mapping.

PubMed

Zhang, Chengxin; Zheng, Wei; Freddolino, Peter L; Zhang, Yang

2018-03-10

Homology-based transferal remains the major approach to computational protein function annotations, but it becomes increasingly unreliable when the sequence identity between query and template decreases below 30%. We propose a novel pipeline, MetaGO, to deduce Gene Ontology attributes of proteins by combining sequence homology-based annotation with low-resolution structure prediction and comparison, and partner's homology-based protein-protein network mapping. The pipeline was tested on a large-scale set of 1000 non-redundant proteins from the CAFA3 experiment. Under the stringent benchmark conditions where templates with >30% sequence identity to the query are excluded, MetaGO achieves average F-measures of 0.487, 0.408, and 0.598, for Molecular Function, Biological Process, and Cellular Component, respectively, which are significantly higher than those achieved by other state-of-the-art function annotations methods. Detailed data analysis shows that the major advantage of the MetaGO lies in the new functional homolog detections from partner's homology-based network mapping and structure-based local and global structure alignments, the confidence scores of which can be optimally combined through logistic regression. These data demonstrate the power of using a hybrid model incorporating protein structure and interaction networks to deduce new functional insights beyond traditional sequence homology-based referrals, especially for proteins that lack homologous function templates. The MetaGO pipeline is available at http://zhanglab.ccmb.med.umich.edu/MetaGO/. Copyright © 2018. Published by Elsevier Ltd.
Identification of DNA-Binding Proteins Using Structural, Electrostatic and Evolutionary Features

PubMed Central

Nimrod, Guy; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir

2009-01-01

Summary DNA binding proteins (DBPs) often take part in various crucial processes of the cell's life cycle. Therefore, the identification and characterization of these proteins are of great importance. We present here a random forests classifier for identifying DBPs among proteins with known three-dimensional structures. First, clusters of evolutionarily conserved regions (patches) on the protein's surface are detected using the PatchFinder algorithm; previous studies showed that these regions are typically the proteins' functionally important regions. Next, we train a classifier using features like the electrostatic potential, cluster-based amino acid conservation patterns and the secondary structure content of the patches, as well as features of the whole protein including its dipole moment. Using 10-fold cross validation on a dataset of 138 DNA-binding proteins and 110 proteins which do not bind DNA, the classifier achieved a sensitivity and a specificity of 0.90, which is overall better than the performance of previously published methods. Furthermore, when we tested 5 different methods on 11 new DBPs which did not appear in the original dataset, only our method annotated all correctly. The resulting classifier was applied to a collection of 757 proteins of known structure and unknown function. Of these proteins, 218 were predicted to bind DNA, and we anticipate that some of them interact with DNA using new structural motifs. The use of complementary computational tools supports the notion that at least some of them do bind DNA. PMID:19233205
In situ structural analysis of the Yersinia enterocolitica injectisome

PubMed Central

Kudryashev, Mikhail; Stenta, Marco; Schmelz, Stefan; Amstutz, Marlise; Wiesand, Ulrich; Castaño-Díez, Daniel; Degiacomi, Matteo T; Münnich, Stefan; Bleck, Christopher KE; Kowal, Julia; Diepold, Andreas; Heinz, Dirk W; Dal Peraro, Matteo; Cornelis, Guy R; Stahlberg, Henning

2013-01-01

Injectisomes are multi-protein transmembrane machines allowing pathogenic bacteria to inject effector proteins into eukaryotic host cells, a process called type III secretion. Here we present the first three-dimensional structure of Yersinia enterocolitica and Shigella flexneri injectisomes in situ and the first structural analysis of the Yersinia injectisome. Unexpectedly, basal bodies of injectisomes inside the bacterial cells showed length variations of 20%. The in situ structures of the Y. enterocolitica and S. flexneri injectisomes had similar dimensions and were significantly longer than the isolated structures of related injectisomes. The crystal structure of the inner membrane injectisome component YscD appeared elongated compared to a homologous protein, and molecular dynamics simulations documented its elongation elasticity. The ring-shaped secretin YscC at the outer membrane was stretched by 30–40% in situ, compared to its isolated liposome-embedded conformation. We suggest that elasticity is critical for some two-membrane spanning protein complexes to cope with variations in the intermembrane distance. DOI: http://dx.doi.org/10.7554/eLife.00792.001 PMID:23908767
Transmembrane Helices Tilt, Bend, Slide, Torque, and Unwind between Functional States of Rhodopsin

PubMed Central

Ren, Zhong; Ren, Peter X.; Balusu, Rohith; Yang, Xiaojing

2016-01-01

The seven-helical bundle of rhodopsin and other G-protein coupled receptors undergoes structural rearrangements as the transmembrane receptor protein is activated. These structural changes are known to involve tilting and bending of various transmembrane helices. However, the cause and effect relationship among structural events leading to a cytoplasmic crevasse for G-protein binding is less well defined. Here we present a mathematical model of the protein helix and a simple procedure to determine multiple parameters that offer precise depiction of a helical conformation. A comprehensive survey of bovine rhodopsin structures shows that the helical rearrangements during the activation of rhodopsin involve a variety of angular and linear motions such as torsion, unwinding, and sliding in addition to the previously reported tilting and bending. These hitherto undefined motion components unify the results obtained from different experimental approaches, and demonstrate conformational similarity between the active opsin structure and the photoactivated structures in crystallo near the retinal anchor despite their marked differences. PMID:27658480
NeEMO: a method using residue interaction networks to improve prediction of protein stability upon mutation.

PubMed

Giollo, Manuel; Martin, Alberto J M; Walsh, Ian; Ferrari, Carlo; Tosatto, Silvio C E

2014-01-01

The rapid growth of un-annotated missense variants poses challenges requiring novel strategies for their interpretation. From the thermodynamic point of view, amino acid changes can lead to a change in the internal energy of a protein and induce structural rearrangements. This is of great relevance for the study of diseases and protein design, justifying the development of prediction methods for variant-induced stability changes. Here we propose NeEMO, a tool for the evaluation of stability changes using an effective representation of proteins based on residue interaction networks (RINs). RINs are used to extract useful features describing interactions of the mutant amino acid with its structural environment. Benchmarking shows NeEMO to be very effective, allowing reliable predictions in different parts of the protein such as β-strands and buried residues. Validation on a previously published independent dataset shows that NeEMO has a Pearson correlation coefficient of 0.77 and a standard error of 1 Kcal/mol, outperforming nine recent methods. The NeEMO web server can be freely accessed from URL: http://protein.bio.unipd.it/neemo/. NeEMO offers an innovative and reliable tool for the annotation of amino acid changes. A key contribution are RINs, which can be used for modeling proteins and their interactions effectively. Interestingly, the approach is very general, and can motivate the development of a new family of RIN-based protein structure analyzers. NeEMO may suggest innovative strategies for bioinformatics tools beyond protein stability prediction.
The vaccinia virus I3L gene product is localized to a complex endoplasmic reticulum-associated structure that contains the viral parental DNA.

PubMed

Welsch, Sonja; Doglio, Laura; Schleich, Sibylle; Krijnse Locker, Jacomine

2003-05-01

The vaccinia virus (VV) I3L gene product is a single-stranded DNA-binding protein made early in infection that localizes to the cytoplasmic sites of viral DNA replication (S. C. Rochester and P. Traktman, J. Virol. 72:2917-2926, 1998). Surprisingly, when replication was blocked, the protein localized to distinct cytoplasmic spots (A. Domi and G. Beaud, J. Gen. Virol. 81:1231-1235, 2000). Here these I3L-positive spots were characterized in more detail. By using an anti-I3L peptide antibody we confirmed that the protein localized to the cytoplasmic sites of viral DNA replication by both immunofluorescence and electron microscopy (EM). Before replication had started or when replication was inhibited with hydroxyurea or cytosine arabinoside, I3L localized to distinct cytoplasmic punctate structures of homogeneous size. We show that these structures are not incoming cores or cytoplasmic sites of VV early mRNA accumulation. Instead, morphological and quantitative data indicate that they are specialized sites where the parental DNA accumulates after its release from incoming viral cores. By EM, these sites appeared as complex, electron-dense structures that were intimately associated with the cellular endoplasmic reticulum (ER). By double labeling of cryosections we show that they contain DNA and a viral early protein, the gene product of E8R. Since E8R is a membrane protein that is able to bind to DNA, the localization of this protein to the I3L puncta suggests that they are composed of membranes. The results are discussed in relation to our previous data showing that the process of viral DNA replication also occurs in close association with the ER.
Electrostatics, structure prediction, and the energy landscapes for protein folding and binding.

PubMed

Tsai, Min-Yeh; Zheng, Weihua; Balamurugan, D; Schafer, Nicholas P; Kim, Bobby L; Cheung, Margaret S; Wolynes, Peter G

2016-01-01

While being long in range and therefore weakly specific, electrostatic interactions are able to modulate the stability and folding landscapes of some proteins. The relevance of electrostatic forces for steering the docking of proteins to each other is widely acknowledged, however, the role of electrostatics in establishing specifically funneled landscapes and their relevance for protein structure prediction are still not clear. By introducing Debye-Hückel potentials that mimic long-range electrostatic forces into the Associative memory, Water mediated, Structure, and Energy Model (AWSEM), a transferable protein model capable of predicting tertiary structures, we assess the effects of electrostatics on the landscapes of thirteen monomeric proteins and four dimers. For the monomers, we find that adding electrostatic interactions does not improve structure prediction. Simulations of ribosomal protein S6 show, however, that folding stability depends monotonically on electrostatic strength. The trend in predicted melting temperatures of the S6 variants agrees with experimental observations. Electrostatic effects can play a range of roles in binding. The binding of the protein complex KIX-pKID is largely assisted by electrostatic interactions, which provide direct charge-charge stabilization of the native state and contribute to the funneling of the binding landscape. In contrast, for several other proteins, including the DNA-binding protein FIS, electrostatics causes frustration in the DNA-binding region, which favors its binding with DNA but not with its protein partner. This study highlights the importance of long-range electrostatics in functional responses to problems where proteins interact with their charged partners, such as DNA, RNA, as well as membranes. © 2015 The Protein Society.
Tuning of protein-surfactant interaction to modify the resultant structure.

PubMed

Mehan, Sumit; Aswal, Vinod K; Kohlbrecher, Joachim

2015-09-01

Small-angle neutron scattering and dynamic light scattering studies have been carried out to examine the interaction of bovine serum albumin (BSA) protein with different surfactants under varying solution conditions. We show that the interaction of anionic BSA protein (pH7) with surfactant and the resultant structure are strongly modified by the charge head group of the surfactant, ionic strength of the solution, and mixed surfactants. The protein-surfactant interaction is maximum when two components are oppositely charged, followed by components being similarly charged through the site-specific binding, and no interaction in the case of a nonionic surfactant. This interaction of protein with ionic surfactants is characterized by the fractal structure representing a bead-necklace structure of micellelike clusters adsorbed along the unfolded protein chain. The interaction is enhanced with ionic strength only in the case of site-specific binding of an anionic surfactant with an anionic protein, whereas it is almost unchanged for other complexes of cationic and nonionic surfactants with anionic proteins. Interestingly, the interaction of BSA protein with ionic surfactants is significantly suppressed in the presence of nonionic surfactant. These results with mixed surfactants thus can be used to fold back the unfolded protein as well as to prevent surfactant-induced protein unfolding. For different solution conditions, the results are interpreted in terms of a change in fractal dimension, the overall size of the protein-surfactant complex, and the number of micelles attached to the protein. The interplay of electrostatic and hydrophobic interactions is found to govern the resultant structure of complexes.
Tuning of protein-surfactant interaction to modify the resultant structure

NASA Astrophysics Data System (ADS)

Mehan, Sumit; Aswal, Vinod K.; Kohlbrecher, Joachim

2015-09-01

Small-angle neutron scattering and dynamic light scattering studies have been carried out to examine the interaction of bovine serum albumin (BSA) protein with different surfactants under varying solution conditions. We show that the interaction of anionic BSA protein (p H 7 ) with surfactant and the resultant structure are strongly modified by the charge head group of the surfactant, ionic strength of the solution, and mixed surfactants. The protein-surfactant interaction is maximum when two components are oppositely charged, followed by components being similarly charged through the site-specific binding, and no interaction in the case of a nonionic surfactant. This interaction of protein with ionic surfactants is characterized by the fractal structure representing a bead-necklace structure of micellelike clusters adsorbed along the unfolded protein chain. The interaction is enhanced with ionic strength only in the case of site-specific binding of an anionic surfactant with an anionic protein, whereas it is almost unchanged for other complexes of cationic and nonionic surfactants with anionic proteins. Interestingly, the interaction of BSA protein with ionic surfactants is significantly suppressed in the presence of nonionic surfactant. These results with mixed surfactants thus can be used to fold back the unfolded protein as well as to prevent surfactant-induced protein unfolding. For different solution conditions, the results are interpreted in terms of a change in fractal dimension, the overall size of the protein-surfactant complex, and the number of micelles attached to the protein. The interplay of electrostatic and hydrophobic interactions is found to govern the resultant structure of complexes.
Aggregation of gluten proteins in model dough after fibre polysaccharide addition.

PubMed

Nawrocka, Agnieszka; Szymańska-Chargot, Monika; Miś, Antoni; Wilczewska, Agnieszka Z; Markiewicz, Karolina H

2017-09-15

FT-Raman spectroscopy, thermogravimetry and differential scanning calorimetry were used to study changes in structure of gluten proteins and their thermal properties influenced by four dietary fibre polysaccharides (microcrystalline cellulose, inulin, apple pectin and citrus pectin) during development of a model dough. The flour reconstituted from wheat starch and wheat gluten was mixed with the polysaccharides in five concentrations: 3%, 6%, 9%, 12% and 18%. The obtained results showed that all polysaccharides induced similar changes in secondary structure of gluten proteins concerning formation of aggregates (1604cm -1 ), H-bonded parallel- and antiparallel-β-sheets (1690cm -1 ) and H-bonded β-turns (1664cm -1 ). These changes concerned mainly glutenins since β-structures are characteristic for them. The observed structural changes confirmed hypothesis about partial dehydration of gluten network after polysaccharides addition. The gluten aggregation and dehydration processes were also reflected in the DSC results, while the TGA ones showed that gluten network remained thermally stable after polysaccharides addition. Copyright © 2017 Elsevier Ltd. All rights reserved.
In situ X-ray data collection and structure phasing of protein crystals at Structural Biology Center 19-ID

DOE Office of Scientific and Technical Information (OSTI.GOV)

Michalska, Karolina; Tan, Kemin; Chang, Changsoo

A prototype of a 96-well plate scanner forin situdata collection has been developed at the Structural Biology Center (SBC) beamline 19-ID, located at the Advanced Photon Source, USA. The applicability of this instrument for protein crystal diffraction screening and data collection at ambient temperature has been demonstrated. Several different protein crystals, including selenium-labeled, were used for data collection and successful SAD phasing. Without the common procedure of crystal handling and subsequent cryo-cooling for data collection atT= 100 K, crystals in a crystallization buffer show remarkably low mosaicity (<0.1°) until deterioration by radiation damage occurs. Data presented here show that cryo-coolingmore » can cause some unexpected structural changes. Based on the results of this study, the integration of the plate scanner into the 19-ID end-station with automated controls is being prepared. With improvement of hardware and software,in situdata collection will become available for the SBC user program including remote access.« less
Effects of Chain Length and Degree of Unsaturation of Fatty Acids on Structure and in Vitro Digestibility of Starch-Protein-Fatty Acid Complexes.

PubMed

Zheng, Mengge; Chao, Chen; Yu, Jinglin; Copeland, Les; Wang, Shuo; Wang, Shujun

2018-02-28

The effects of chain length and degree of unsaturation of fatty acids (FAs) on structure and in vitro digestibility of starch-protein-FA complexes were investigated in model systems. Studies with the rapid visco analyzer (RVA) showed that the formation of ternary complex resulted in higher viscosities than those of binary complex during the cooling and holding stages. The results of differential scanning calorimetry (DSC), Raman, and X-ray diffraction (XRD) showed that the structural differences for ternary complexes were much less than those for binary complexes. Starch-protein-FA complexes presented lower in vitro enzymatic digestibility compared with starch-FAs complexes. We conclude that shorter chain and lower unsaturation FAs favor the formation of ternary complexes but decrease the thermal stability of these complexes. FAs had a smaller effect on the ordered structures of ternary complexes than on those of binary complexes and little effect on enzymatic digestibility of both binary and ternary complexes.
Characterization of the low-temperature properties of a simplified protein model

NASA Astrophysics Data System (ADS)

Hagmann, Johannes-Geert; Nakagawa, Naoko; Peyrard, Michel

2014-01-01

Prompted by results that showed that a simple protein model, the frustrated Gō model, appears to exhibit a transition reminiscent of the protein dynamical transition, we examine the validity of this model to describe the low-temperature properties of proteins. First, we examine equilibrium fluctuations. We calculate its incoherent neutron-scattering structure factor and show that it can be well described by a theory using the one-phonon approximation. By performing an inherent structure analysis, we assess the transitions among energy states at low temperatures. Then, we examine nonequilibrium fluctuations after a sudden cooling of the protein. We investigate the violation of the fluctuation-dissipation theorem in order to analyze the protein glass transition. We find that the effective temperature of the quenched protein deviates from the temperature of the thermostat, however it relaxes towards the actual temperature with an Arrhenius behavior as the waiting time increases. These results of the equilibrium and nonequilibrium studies converge to the conclusion that the apparent dynamical transition of this coarse-grained model cannot be attributed to a glassy behavior.
Cross-Link Guided Molecular Modeling with ROSETTA

PubMed Central

Leitner, Alexander; Rosenberger, George; Aebersold, Ruedi; Malmström, Lars

2013-01-01

Chemical cross-links identified by mass spectrometry generate distance restraints that reveal low-resolution structural information on proteins and protein complexes. The technology to reliably generate such data has become mature and robust enough to shift the focus to the question of how these distance restraints can be best integrated into molecular modeling calculations. Here, we introduce three workflows for incorporating distance restraints generated by chemical cross-linking and mass spectrometry into ROSETTA protocols for comparative and de novo modeling and protein-protein docking. We demonstrate that the cross-link validation and visualization software Xwalk facilitates successful cross-link data integration. Besides the protocols we introduce XLdb, a database of chemical cross-links from 14 different publications with 506 intra-protein and 62 inter-protein cross-links, where each cross-link can be mapped on an experimental structure from the Protein Data Bank. Finally, we demonstrate on a protein-protein docking reference data set the impact of virtual cross-links on protein docking calculations and show that an inter-protein cross-link can reduce on average the RMSD of a docking prediction by 5.0 Å. The methods and results presented here provide guidelines for the effective integration of chemical cross-link data in molecular modeling calculations and should advance the structural analysis of particularly large and transient protein complexes via hybrid structural biology methods. PMID:24069194
Conservative mutation Met8 --> Leu affects the folding process and structural stability of squash trypsin inhibitor CMTI-I.

PubMed Central

Zhukov, I.; Jaroszewski, L.; Bierzyński, A.

2000-01-01

Protein molecules can accommodate a large number of mutations without noticeable effects on their stability and folding kinetics. On the other hand, some mutations can have quite strong effects on protein conformational properties. Such mutations either destabilize secondary structures, e.g., alpha-helices, are incompatible with close packing of protein hydrophobic cores, or lead to disruption of some specific interactions such as disulfide cross links, salt bridges, hydrogen bonds, or aromatic-aromatic contacts. The Met8 --> Leu mutation in CMTI-I results in significant destabilization of the protein structure. This effect could hardly be expected since the mutation is highly conservative, and the side chain of residue 8 is situated on the protein surface. We show that the protein destabilization is caused by rearrangement of a hydrophobic cluster formed by side chains of residues 8, Ile6, and Leu17 that leads to partial breaking of a hydrogen bond formed by the amide group of Leu17 with water and to a reduction of a hydrophobic surface buried within the cluster. The mutation perturbs also the protein folding. In aerobic conditions the reduced wild-type protein folds effectively into its native structure, whereas more then 75% of the mutant molecules are trapped in various misfolded species. The main conclusion of this work is that conservative mutations of hydrophobic residues can destabilize a protein structure even if these residues are situated on the protein surface and partially accessible to water. Structural rearrangement of small hydrophobic clusters formed by such residues can lead to local changes in protein hydration, and consequently, can affect considerably protein stability and folding process. PMID:10716179
Influence of xanthan gum on the structural characteristics of myofibrillar proteins treated by high pressure.

PubMed

Villamonte, Gina; Jury, Vanessa; Jung, Stéphanie; de Lamballerie, Marie

2015-03-01

The effects of xanthan gum on the structural modifications of myofibrillar proteins (0.3 M NaCl, pH 6) induced by high pressure (200, 400, and 600 MPa, 6 min) were investigated. The changes in the secondary and tertiary structures of myofibrillar proteins were analyzed by circular dichroism. The protein denaturation was also evaluated by differential scanning calorimetry. Likewise, the protein surface hydrophobicity and the solubility of myofibrillar proteins were measured. High pressure (600 MPa) induced the loss of α-helix structures and an increase of β-sheet structures. However, the presence of xanthan gum hindered the former mechanism of protein denaturation by high pressure. In fact, changes in the secondary (600 MPa) and the tertiary structure fingerprint of high-pressure-treated myofibrillar proteins (400 to 600 MPa) were observed in the presence of xanthan gum. These modifications were confirmed by the thermal analysis, the thermal transitions of high-pressure (400 to 600 MPa)-treated myofibrillar proteins were modified in systems containing xanthan gum. As consequence, the high-pressure-treated myofibrillar proteins with xanthan gum showed increased solubility from 400 MPa, in contrast to high-pressure treatment (600 MPa) without xanthan gum. Moreover, the surface hydrophobicity of high-pressure-treated myofibrillar proteins was enhanced in the presence of xanthan gum. These effects could be due to the unfolding of myofibrillar proteins at high-pressure levels, which exposed sites that most likely interacted with the anionic polysaccharide. This study suggests that the role of food additives could be considered for the development of meat products produced by high-pressure processing. © 2015 Institute of Food Technologists®
Imaging and three-dimensional reconstruction of chemical groups inside a protein complex using atomic force microscopy

NASA Astrophysics Data System (ADS)

Kim, Duckhoe; Sahin, Ozgur

2015-03-01

Scanning probe microscopes can be used to image and chemically characterize surfaces down to the atomic scale. However, the localized tip-sample interactions in scanning probe microscopes limit high-resolution images to the topmost atomic layer of surfaces, and characterizing the inner structures of materials and biomolecules is a challenge for such instruments. Here, we show that an atomic force microscope can be used to image and three-dimensionally reconstruct chemical groups inside a protein complex. We use short single-stranded DNAs as imaging labels that are linked to target regions inside a protein complex, and T-shaped atomic force microscope cantilevers functionalized with complementary probe DNAs allow the labels to be located with sequence specificity and subnanometre resolution. After measuring pairwise distances between labels, we reconstruct the three-dimensional structure formed by the target chemical groups within the protein complex using simple geometric calculations. Experiments with the biotin-streptavidin complex show that the predicted three-dimensional loci of the carboxylic acid groups of biotins are within 2 Å of their respective loci in the corresponding crystal structure, suggesting that scanning probe microscopes could complement existing structural biological techniques in solving structures that are difficult to study due to their size and complexity.
The 15-K neutron structure of saccharide-free concanavalin A.

PubMed

Blakeley, M P; Kalb, A J; Helliwell, J R; Myles, D A A

2004-11-23

The positions of the ordered hydrogen isotopes of a protein and its bound solvent can be determined by using neutron crystallography. Furthermore, by collecting neutron data at cryo temperatures, the dynamic disorder within a protein crystal is reduced, which may lead to improved definition of the nuclear density. It has proved possible to cryo-cool very large Con A protein crystals (>1.5 mm3) suitable for high-resolution neutron and x-ray structure analysis. We can thereby report the neutron crystal structure of the saccharide-free form of Con A and its bound water, including 167 intact D2O molecules and 60 oxygen atoms at 15 K to 2.5-A resolution, along with the 1.65-A x-ray structure of an identical crystal at 100 K. Comparison with the 293-K neutron structure shows that the bound water molecules are better ordered and have lower average B factors than those at room temperature. Overall, twice as many bound waters (as D2O) are identified at 15 K than at 293 K. We note that alteration of bound water orientations occurs between 293 and 15 K; such changes, as illustrated here with this example, could be important more generally in protein crystal structure analysis and ligand design. Methodologically, this successful neutron cryo protein structure refinement opens up categories of neutron protein crystallography, including freeze-trapped structures and cryo to room temperature comparisons.

Post processing of protein-compound docking for fragment-based drug discovery (FBDD): in-silico structure-based drug screening and ligand-binding pose prediction.

PubMed

Fukunishi, Yoshifumi

2010-01-01

For fragment-based drug development, both hit (active) compound prediction and docking-pose (protein-ligand complex structure) prediction of the hit compound are important, since chemical modification (fragment linking, fragment evolution) subsequent to the hit discovery must be performed based on the protein-ligand complex structure. However, the naïve protein-compound docking calculation shows poor accuracy in terms of docking-pose prediction. Thus, post-processing of the protein-compound docking is necessary. Recently, several methods for the post-processing of protein-compound docking have been proposed. In FBDD, the compounds are smaller than those for conventional drug screening. This makes it difficult to perform the protein-compound docking calculation. A method to avoid this problem has been reported. Protein-ligand binding free energy estimation is useful to reduce the procedures involved in the chemical modification of the hit fragment. Several prediction methods have been proposed for high-accuracy estimation of protein-ligand binding free energy. This paper summarizes the various computational methods proposed for docking-pose prediction and their usefulness in FBDD.
Visualization of a radical B 12 enzyme with its G-protein chaperone

DOE PAGES

Jost, Marco; Cracan, Valentin; Hubbard, Paul A.; ...

2015-02-09

G-protein metallochaperones ensure fidelity during cofactor assembly for a variety of metalloproteins, including adenosylcobalamin (AdoCbl)-dependent methylmalonyl-CoA mutase and hydrogenase, and thus have both medical and biofuel development applications. In this paper, we present crystal structures of IcmF, a natural fusion protein of AdoCbl-dependent isobutyryl-CoA mutase and its corresponding G-protein chaperone, which reveal the molecular architecture of a G-protein metallochaperone in complex with its target protein. These structures show that conserved G-protein elements become ordered upon target protein association, creating the molecular pathways that both sense and report on the cofactor loading state. Structures determined of both apo- and holo-forms ofmore » IcmF depict both open and closed enzyme states, in which the cofactor-binding domain is alternatively positioned for cofactor loading and for catalysis. Finally and notably, the G protein moves as a unit with the cofactor-binding domain, providing a visualization of how a chaperone assists in the sequestering of a precious cofactor inside an enzyme active site.« less
Reduced native state stability in crowded cellular environment due to protein-protein interactions.

PubMed

Harada, Ryuhei; Tochio, Naoya; Kigawa, Takanori; Sugita, Yuji; Feig, Michael

2013-03-06

The effect of cellular crowding environments on protein structure and stability is a key issue in molecular and cellular biology. The classical view of crowding emphasizes the volume exclusion effect that generally favors compact, native states. Here, results from molecular dynamics simulations and NMR experiments show that protein crowders may destabilize native states via protein-protein interactions. In the model system considered here, mixtures of villin head piece and protein G at high concentrations, villin structures become increasingly destabilized upon increasing crowder concentrations. The denatured states observed in the simulation involve partial unfolding as well as more subtle conformational shifts. The unfolded states remain overall compact and only partially overlap with unfolded ensembles at high temperature and in the presence of urea. NMR measurements on the same systems confirm structural changes upon crowding based on changes of chemical shifts relative to dilute conditions. An analysis of protein-protein interactions and energetic aspects suggests the importance of enthalpic and solvation contributions to the crowding free energies that challenge an entropic-centered view of crowding effects.
Molecular dynamics simulations and statistical coupling analysis reveal functional coevolution network of oncogenic mutations in the CDKN2A-CDK6 complex.

PubMed

Wang, Jingwen; Zhao, Yuqi; Wang, Yanjie; Huang, Jingfei

2013-01-16

Coevolution between proteins is crucial for understanding protein-protein interaction. Simultaneous changes allow a protein complex to maintain its overall structural-functional integrity. In this study, we combined statistical coupling analysis (SCA) and molecular dynamics simulations on the CDK6-CDKN2A protein complex to evaluate coevolution between proteins. We reconstructed an inter-protein residue coevolution network, consisting of 37 residues and 37 interactions. It shows that most of the coevolved residue pairs are spatially proximal. When the mutations happened, the stable local structures were broken up and thus the protein interaction was decreased or inhibited, with a following increased risk of melanoma. The identification of inter-protein coevolved residues in the CDK6-CDKN2A complex can be helpful for designing protein engineering experiments. Copyright © 2012 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
The amino-terminal structure of human fragile X mental retardation protein obtained using precipitant-immobilized imprinted polymers

NASA Astrophysics Data System (ADS)

Hu, Yufeng; Chen, Zhenhang; Fu, Yanjun; He, Qingzhong; Jiang, Lun; Zheng, Jiangge; Gao, Yina; Mei, Pinchao; Chen, Zhongzhou; Ren, Xueqin

2015-03-01

Flexibility is an intrinsic property of proteins and essential for their biological functions. However, because of structural flexibility, obtaining high-quality crystals of proteins with heterogeneous conformations remain challenging. Here, we show a novel approach to immobilize traditional precipitants onto molecularly imprinted polymers (MIPs) to facilitate protein crystallization, especially for flexible proteins. By applying this method, high-quality crystals of the flexible N-terminus of human fragile X mental retardation protein are obtained, whose absence causes the most common inherited mental retardation. A novel KH domain and an intermolecular disulfide bond are discovered, and several types of dimers are found in solution, thus providing insights into the function of this protein. Furthermore, the precipitant-immobilized MIPs (piMIPs) successfully facilitate flexible protein crystal formation for five model proteins with increased diffraction resolution. This highlights the potential of piMIPs for the crystallization of flexible proteins.
Determination of the X-ray structure of the snake venom protein omwaprin by total chemical synthesis and racemic protein crystallography.

PubMed

Banigan, James R; Mandal, Kalyaneswar; Sawaya, Michael R; Thammavongsa, Vilasak; Hendrickx, Antoni P A; Schneewind, Olaf; Yeates, Todd O; Kent, Stephen B H

2010-10-01

The 50-residue snake venom protein L-omwaprin and its enantiomer D-omwaprin were prepared by total chemical synthesis. Radial diffusion assays were performed against Bacillus megaterium and Bacillus anthracis; both L- and D-omwaprin showed antibacterial activity against B. megaterium. The native protein enantiomer, made of L-amino acids, failed to crystallize readily. However, when a racemic mixture containing equal amounts of L- and D-omwaprin was used, diffraction quality crystals were obtained. The racemic protein sample crystallized in the centrosymmetric space group P2(1)/c and its structure was determined at atomic resolution (1.33 A) by a combination of Patterson and direct methods based on the strong scattering from the sulfur atoms in the eight cysteine residues per protein. Racemic crystallography once again proved to be a valuable method for obtaining crystals of recalcitrant proteins and for determining high-resolution X-ray structures by direct methods.
Biomolecular interactions modulate macromolecular structure and dynamics in atomistic model of a bacterial cytoplasm

PubMed Central

Yu, Isseki; Mori, Takaharu; Ando, Tadashi; Harada, Ryuhei; Jung, Jaewoon; Sugita, Yuji; Feig, Michael

2016-01-01

Biological macromolecules function in highly crowded cellular environments. The structure and dynamics of proteins and nucleic acids are well characterized in vitro, but in vivo crowding effects remain unclear. Using molecular dynamics simulations of a comprehensive atomistic model cytoplasm we found that protein-protein interactions may destabilize native protein structures, whereas metabolite interactions may induce more compact states due to electrostatic screening. Protein-protein interactions also resulted in significant variations in reduced macromolecular diffusion under crowded conditions, while metabolites exhibited significant two-dimensional surface diffusion and altered protein-ligand binding that may reduce the effective concentration of metabolites and ligands in vivo. Metabolic enzymes showed weak non-specific association in cellular environments attributed to solvation and entropic effects. These effects are expected to have broad implications for the in vivo functioning of biomolecules. This work is a first step towards physically realistic in silico whole-cell models that connect molecular with cellular biology. DOI: http://dx.doi.org/10.7554/eLife.19274.001 PMID:27801646
Solution structure and interactions of the Escherichia coli cell division activator protein CedA.

PubMed

Chen, Ho An; Simpson, Peter; Huyton, Trevor; Roper, David; Matthews, Stephen

2005-05-10

CedA is a protein that is postulated to be involved in the regulation of cell division in Escherichia coli and related organisms; however, little biological data about its possible mode of action are available. Here we present a three-dimensional structure of this protein as determined by NMR spectroscopy. The protein is made up of four antiparallel beta-strands, an alpha-helix, and a large unstructured stretch of residues at the N-terminus. It shows structural similarity to a family of DNA-binding proteins which interact with dsDNA via a three-stranded beta-sheet, suggesting that CedA may be a DNA-binding protein. The putative binding surface of CedA is predominantly positively charged with a number of basic residues surrounding a groove largely dominated by aromatic residues. NMR chemical shift perturbations and gel-shift experiments performed with CedA confirm that the protein binds dsDNA, and its interaction is mediated primarily via the beta-sheet.
Comprehensive 3D-modeling of allergenic proteins and amino acid composition of potential conformational IgE epitopes

PubMed Central

Oezguen, Numan; Zhou, Bin; Negi, Surendra S.; Ivanciuc, Ovidiu; Schein, Catherine H.; Labesse, Gilles; Braun, Werner

2008-01-01

Similarities in sequences and 3D structures of allergenic proteins provide vital clues to identify clinically relevant IgE cross-reactivities. However, experimental 3D structures are available in the Protein Data Bank for only 5% (45/829) of all allergens catalogued in the Structural Database of Allergenic Proteins (SDAP, http://fermi.utmb.edu/SDAP). Here, an automated procedure was used to prepare 3D-models of all allergens where there was no experimentally determined 3D structure or high identity (95%) to another protein of known 3D structure. After a final selection by quality criteria, 433 reliable 3D models were retained and are available from our SDAP Website. The new 3D models extensively enhance our knowledge of allergen structures. As an example of their use, experimentally derived “continuous IgE epitopes” were mapped on 3 experimentally determined structures and 13 of our 3D-models of allergenic proteins. Large portions of these continuous sequences are not entirely on the surface and therefore cannot interact with IgE or other proteins. Only the surface exposed residues are constituents of “conformational IgE epitopes” which are not in all cases continuous in sequence. The surface exposed parts of the experimental determined continuous IgE epitopes showed a distinct statistical distribution as compared to their presence in typical protein-protein interfaces. The amino acids Ala, Ser, Asn, Gly and particularly Lys have a high propensity to occur in IgE binding sites. The 3D-models will facilitate further analysis of the common properties of IgE binding sites of allergenic proteins. PMID:18621419
Sterilization mechanism of nitrogen gas plasma: induction of secondary structural change in protein.

PubMed

Sakudo, Akikazu; Higa, Masato; Maeda, Kojiro; Shimizu, Naohiro; Imanishi, Yuichiro; Shintani, Hideharu

2013-07-01

The mechanism of action on biomolecules of N₂ gas plasma, a novel sterilization technique, remains unclear. Here, the effect of N₂ gas plasma on protein structure was investigated. BSA, which was used as the model protein, was exposed to N₂ gas plasma generated by short-time high voltage pulses from a static induction thyristor power supply. N₂ gas plasma-treated BSA at 1.5 kilo pulses per second showed evidence of degradation and modification when assessed by Coomassie brilliant blue staining and ultraviolet spectroscopy at 280 nm. Fourier transform infrared spectroscopy analysis was used to determine the protein's secondary structure. When the amide I region was analyzed in the infrared spectra according to curve fitting and Fourier self-deconvolution, N₂ gas plasma-treated BSA showed increased α-helix and decreased β-turn content. Because heating decreased α-helix and increased β-sheet content, the structural changes induced by N₂ gas plasma-treatment of BSA were not caused by high temperatures. Thus, the present results suggest that conformational changes induced by N₂ gas plasma are mediated by mechanisms distinct from heat denaturation. © 2013 The Societies and Wiley Publishing Asia Pty Ltd.
Structural and Functional Characterizations of SsgB, a Conserved Activator of Developmental Cell Division in Morphologically Complex Actinomycetes*

PubMed Central

Xu, Qingping; Traag, Bjørn A.; Willemse, Joost; McMullan, Daniel; Miller, Mitchell D.; Elsliger, Marc-André; Abdubek, Polat; Astakhova, Tamara; Axelrod, Herbert L.; Bakolitsa, Constantina; Carlton, Dennis; Chen, Connie; Chiu, Hsiu-Ju; Chruszcz, Maksymilian; Clayton, Thomas; Das, Debanu; Deller, Marc C.; Duan, Lian; Ellrott, Kyle; Ernst, Dustin; Farr, Carol L.; Feuerhelm, Julie; Grant, Joanna C.; Grzechnik, Anna; Grzechnik, Slawomir K.; Han, Gye Won; Jaroszewski, Lukasz; Jin, Kevin K.; Klock, Heath E.; Knuth, Mark W.; Kozbial, Piotr; Krishna, S. Sri; Kumar, Abhinav; Marciano, David; Minor, Wladek; Mommaas, A. Mieke; Morse, Andrew T.; Nigoghossian, Edward; Nopakun, Amanda; Okach, Linda; Oommachen, Silvya; Paulsen, Jessica; Puckett, Christina; Reyes, Ron; Rife, Christopher L.; Sefcovic, Natasha; Tien, Henry J.; Trame, Christine B.; van den Bedem, Henry; Wang, Shuren; Weekes, Dana; Hodgson, Keith O.; Wooley, John; Deacon, Ashley M.; Godzik, Adam; Lesley, Scott A.; Wilson, Ian A.; van Wezel, Gilles P.

2009-01-01

SsgA-like proteins (SALPs) are a family of homologous cell division-related proteins that occur exclusively in morphologically complex actinomycetes. We show that SsgB, a subfamily of SALPs, is the archetypal SALP that is functionally conserved in all sporulating actinomycetes. Sporulation-specific cell division of Streptomyces coelicolor ssgB mutants is restored by introduction of distant ssgB orthologues from other actinomycetes. Interestingly, the number of septa (and spores) of the complemented null mutants is dictated by the specific ssgB orthologue that is expressed. The crystal structure of the SsgB from Thermobifida fusca was determined at 2.6 Å resolution and represents the first structure for this family. The structure revealed similarities to a class of eukaryotic “whirly” single-stranded DNA/RNA-binding proteins. However, the electro-negative surface of the SALPs suggests that neither SsgB nor any of the other SALPs are likely to interact with nucleotide substrates. Instead, we show that a conserved hydrophobic surface is likely to be important for SALP function and suggest that proteins are the likely binding partners. PMID:19567872
Isolation, characterization, and aggregation of a structured bacterial matrix precursor.

PubMed

Chai, Liraz; Romero, Diego; Kayatekin, Can; Akabayov, Barak; Vlamakis, Hera; Losick, Richard; Kolter, Roberto

2013-06-14

Biofilms are surface-associated groups of microbial cells that are embedded in an extracellular matrix (ECM). The ECM is a network of biopolymers, mainly polysaccharides, proteins, and nucleic acids. ECM proteins serve a variety of structural roles and often form amyloid-like fibers. Despite the extensive study of the formation of amyloid fibers from their constituent subunits in humans, much less is known about the assembly of bacterial functional amyloid-like precursors into fibers. Using dynamic light scattering, atomic force microscopy, circular dichroism, and infrared spectroscopy, we show that our unique purification method of a Bacillus subtilis major matrix protein component results in stable oligomers that retain their native α-helical structure. The stability of these oligomers enabled us to control the external conditions that triggered their aggregation. In particular, we show that stretched fibers are formed on a hydrophobic surface, whereas plaque-like aggregates are formed in solution under acidic pH conditions. TasA is also shown to change conformation upon aggregation and gain some β-sheet structure. Our studies of the aggregation of a bacterial matrix protein from its subunits shed new light on assembly processes of the ECM within bacterial biofilms.
Solution structure of an antifreeze protein CfAFP-501 from Choristoneura fumiferana.

PubMed

Li, Congmin; Guo, Xianrong; Jia, Zongchao; Xia, Bin; Jin, Changwen

2005-07-01

Antifreeze proteins (AFPs) are widely employed by various organisms as part of their overwintering survival strategy. AFPs have the unique ability to suppress the freezing point of aqueous solution and inhibit ice recrystallization through binding to the ice seed crystals and restricting their growth. The solution structure of CfAFP-501 from spruce budworm has been determined by NMR spectroscopy. Our result demonstrates that CfAFP-501 retains its rigid and highly regular structure in solution. Overall, the solution structure is similar to the crystal structure except the N- and C-terminal regions. NMR spin-relaxation experiments further indicate the overall rigidity of the protein and identify a collection of residues with greater flexibilities. Furthermore, Pro91 shows a cis conformation in solution instead of the trans conformation determined in the crystal structure.
Sulfolobus turreted icosahedral virus c92 protein responsible for the formation of pyramid-like cellular lysis structures.

PubMed

Snyder, Jamie C; Brumfield, Susan K; Peng, Nan; She, Qunxin; Young, Mark J

2011-07-01

Host cells infected by Sulfolobus turreted icosahedral virus (STIV) have been shown to produce unusual pyramid-like structures on the cell surface. These structures represent a virus-induced lysis mechanism that is present in Archaea and appears to be distinct from the holin/endolysin system described for DNA bacteriophages. This study investigated the STIV gene products required for pyramid formation in its host Sulfolobus solfataricus. Overexpression of STIV open reading frame (ORF) c92 in S. solfataricus alone is sufficient to produce the pyramid-like lysis structures in cells. Gene disruption of c92 within STIV demonstrates that c92 is an essential protein for virus replication. Immunolocalization of c92 shows that the protein is localized to the cellular membranes forming the pyramid-like structures.
BeStSel: a web server for accurate protein secondary structure prediction and fold recognition from the circular dichroism spectra.

PubMed

Micsonai, András; Wien, Frank; Bulyáki, Éva; Kun, Judit; Moussong, Éva; Lee, Young-Ho; Goto, Yuji; Réfrégiers, Matthieu; Kardos, József

2018-06-11

Circular dichroism (CD) spectroscopy is a widely used method to study the protein secondary structure. However, for decades, the general opinion was that the correct estimation of β-sheet content is challenging because of the large spectral and structural diversity of β-sheets. Recently, we showed that the orientation and twisting of β-sheets account for the observed spectral diversity, and developed a new method to estimate accurately the secondary structure (PNAS, 112, E3095). BeStSel web server provides the Beta Structure Selection method to analyze the CD spectra recorded by conventional or synchrotron radiation CD equipment. Both normalized and measured data can be uploaded to the server either as a single spectrum or series of spectra. The originality of BeStSel is that it carries out a detailed secondary structure analysis providing information on eight secondary structure components including parallel-β structure and antiparallel β-sheets with three different groups of twist. Based on these, it predicts the protein fold down to the topology/homology level of the CATH protein fold classification. The server also provides a module to analyze the structures deposited in the PDB for BeStSel secondary structure contents in relation to Dictionary of Secondary Structure of Proteins data. The BeStSel server is freely accessible at http://bestsel.elte.hu.
Influence of heat and shear induced protein aggregation on the in vitro digestion rate of whey proteins.

PubMed

Singh, Tanoj K; Øiseth, Sofia K; Lundin, Leif; Day, Li

2014-11-01

Protein intake is essential for growth and repair of body cells, the normal functioning of muscles, and health related immune functions. Most food proteins are consumed after undergoing various degrees of processing. Changes in protein structure and assembly as a result of processing impact the digestibility of proteins. Research in understanding to what extent the protein structure impacts the rate of proteolysis under human physiological conditions has gained considerable interest. In this work, four whey protein gels were prepared using heat processing at two different pH values, 6.8 and 4.6, with and without applied shear. The gels showed different protein network microstructures due to heat induced unfolding (at pH 6.8) or lack of unfolding, thus resulting in fine stranded protein networks. When shear was applied during heating, particulate protein networks were formed. The differences in the gel microstructures resulted in considerable differences in their rheological properties. An in vitro gastric and intestinal model was used to investigate the resulting effects of these different gel structures on whey protein digestion. In addition, the rate of digestion was monitored by taking samples at various time points throughout the in vitro digestion process. The peptides in the digesta were profiled using SDS-polyacrylamide gel electrophoresis, reversed-phase-HPLC and LC-MS. Under simulated gastric conditions, whey proteins in structured gels were hydrolysed faster than native proteins in solution. The rate of peptides released during in vitro digestion differed depending on the structure of the gels and extent of protein aggregation. The outcomes of this work highlighted that changes in the network structure of the protein can influence the rate and pattern of its proteolysis under gastrointestinal conditions. Such knowledge could assist the food industry in designing novel food formulations to control the digestion kinetics and the release of biologically active peptides for desired health outcome.
Assessment of the utility of contact-based restraints in accelerating the prediction of protein structure using molecular dynamics simulations.

PubMed

Raval, Alpan; Piana, Stefano; Eastwood, Michael P; Shaw, David E

2016-01-01

Molecular dynamics (MD) simulation is a well-established tool for the computational study of protein structure and dynamics, but its application to the important problem of protein structure prediction remains challenging, in part because extremely long timescales can be required to reach the native structure. Here, we examine the extent to which the use of low-resolution information in the form of residue-residue contacts, which can often be inferred from bioinformatics or experimental studies, can accelerate the determination of protein structure in simulation. We incorporated sets of 62, 31, or 15 contact-based restraints in MD simulations of ubiquitin, a benchmark system known to fold to the native state on the millisecond timescale in unrestrained simulations. One-third of the restrained simulations folded to the native state within a few tens of microseconds-a speedup of over an order of magnitude compared with unrestrained simulations and a demonstration of the potential for limited amounts of structural information to accelerate structure determination. Almost all of the remaining ubiquitin simulations reached near-native conformations within a few tens of microseconds, but remained trapped there, apparently due to the restraints. We discuss potential methodological improvements that would facilitate escape from these near-native traps and allow more simulations to quickly reach the native state. Finally, using a target from the Critical Assessment of protein Structure Prediction (CASP) experiment, we show that distance restraints can improve simulation accuracy: In our simulations, restraints stabilized the native state of the protein, enabling a reasonable structural model to be inferred. © 2015 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.
Chromophore Structure of Photochromic Fluorescent Protein Dronpa: Acid-Base Equilibrium of Two Cis Configurations.

PubMed

Higashino, Asuka; Mizuno, Misao; Mizutani, Yasuhisa

2016-04-07

Dronpa is a novel photochromic fluorescent protein that exhibits fast response to light. The present article is the first report of the resonance and preresonance Raman spectra of Dronpa. We used the intensity and frequency of Raman bands to determine the structure of the Dronpa chromophore in two thermally stable photochromic states. The acid-base equilibrium in one photochromic state was observed by spectroscopic pH titration. The Raman spectra revealed that the chromophore in this state shows a protonation/deprotonation transition with a pKa of 5.2 ± 0.3 and maintains the cis configuration. The observed resonance Raman bands showed that the other photochromic state of the chromophore is in a trans configuration. The results demonstrate that Raman bands selectively enhanced for the chromophore yield valuable information on the molecular structure of the chromophore in photochromic fluorescent proteins after careful elimination of the fluorescence background.
Hydration water and bulk water in proteins have distinct properties in radial distributions calculated from 105 atomic resolution crystal structures.

PubMed

Chen, Xianfeng; Weber, Irene; Harrison, Robert W

2008-09-25

Water plays a critical role in the structure and function of proteins, although the experimental properties of water around protein structures are not well understood. The water can be classified by the separation from the protein surface into bulk water and hydration water. Hydration water interacts closely with the protein and contributes to protein folding, stability, and dynamics, as well as interacting with the bulk water. Water potential functions are often parametrized to fit bulk water properties because of the limited experimental data for hydration water. Therefore, the structural and energetic properties of the hydration water were assessed for 105 atomic resolution (
Self-similar assemblies of globular whey proteins at the air-water interface: effect of the structure.

PubMed

Mahmoudi, Najet; Gaillard, Cédric; Boué, François; Axelos, Monique A V; Riaublanc, Alain

2010-05-01

We investigated the structure of heat-induced assemblies of whey globular proteins using small angle neutron scattering (SANS), static and dynamic light scattering (SLS and DLS), and cryogenic transmission electron microscopy (Cryo-TEM). Whey protein molecules self-assemble in fractal aggregates with a structure density depending on the electrostatic interactions. We determined the static and dynamic properties of interfacial layer formed by the protein assemblies, upon adsorption and spreading at the air-water interface using surface film balance and interfacial dilatational rheology. Upon spreading, all whey protein systems show a power-law scaling behavior of the surface pressure versus concentration in the semi-dilute surface concentration regime, with an exponent ranging from 5.5 to 9 depending on the electrostatic interactions and the aggregation state. The dilatational modulus derived from surface pressure isotherms shows a main peak at 6-8 mN/m, generally considered to be the onset of a conformational change in the monolayer, and a second peak or a shoulder at 15 mN/m. Long-time adsorption kinetics give similar results for both the native whey proteins and the corresponding self-similar assemblies, with a systematic effect of the ionic strength. Copyright 2010 Elsevier Inc. All rights reserved.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.