Recent developments in structural proteomics for protein structure determination.
Liu, Hsuan-Liang; Hsu, Jyh-Ping
2005-05-01
The major challenges in structural proteomics include identifying all the proteins on the genome-wide scale, determining their structure-function relationships, and outlining the precise three-dimensional structures of the proteins. Protein structures are typically determined by experimental approaches such as X-ray crystallography or nuclear magnetic resonance (NMR) spectroscopy. However, the coverage of three-dimensional structure space achieved by these techniques is still limited. Thus, computational methods such as comparative and de novo approaches and molecular dynamics simulations are intensively used as alternative tools to predict the three-dimensional structures and dynamic behavior of proteins. This review summarizes recent developments in structural proteomics for protein structure determination, including instrumental methods such as X-ray crystallography and NMR spectroscopy, and computational methods such as comparative and de novo structure prediction and molecular dynamics simulations.
Density functional study of molecular interactions in secondary structures of proteins.
Takano, Yu; Kusaka, Ayumi; Nakamura, Haruki
2016-01-01
Proteins play diverse and vital roles in biology, which are dominated by their three-dimensional structures. The three-dimensional structure of a protein determines its functions and chemical properties. Protein secondary structures, including α-helices and β-sheets, are key components of the protein architecture. Molecular interactions, in particular hydrogen bonds, play significant roles in the formation of protein secondary structures. Precise and quantitative estimations of these interactions are required to understand the principles underlying the formation of three-dimensional protein structures. In the present study, we have investigated the molecular interactions in α-helices and β-sheets, using ab initio wave function-based methods, the Hartree-Fock method (HF) and the second-order Møller-Plesset perturbation theory (MP2), density functional theory, and molecular mechanics. The characteristic interactions essential for forming the secondary structures are discussed quantitatively.
Rigidity of transmembrane proteins determines their cluster shape
NASA Astrophysics Data System (ADS)
Jafarinia, Hamidreza; Khoshnood, Atefeh; Jalali, Mir Abbas
2016-01-01
Protein aggregation in the cell membrane is vital for the majority of biological functions. Recent experimental results suggest that transmembrane domains of proteins such as α-helices and β-sheets have different structural rigidities. We use molecular dynamics simulation of a coarse-grained model of protein-embedded lipid membranes to investigate the mechanisms of protein clustering. For a variety of protein concentrations, our simulations under thermal equilibrium conditions reveal that the structural rigidity of transmembrane domains dramatically affects interactions and changes the shape of the cluster. We have observed stable large aggregates even in the absence of hydrophobic mismatch, which has been previously proposed as the mechanism of protein aggregation. According to our results, semiflexible proteins aggregate to form two-dimensional clusters, while rigid proteins, by contrast, form one-dimensional string-like structures. By assuming two probable scenarios for the formation of a two-dimensional triangular structure, we calculate the lipid density around protein clusters and find that the difference in lipid distribution around rigid and semiflexible proteins determines the one- or two-dimensional nature of aggregates. It is found that lipids move faster around semiflexible proteins than rigid ones. The aggregation mechanism suggested in this paper can be tested by current state-of-the-art experimental facilities.
Fast computational methods for predicting protein structure from primary amino acid sequence
Agarwal, Pratul Kumar [Knoxville, TN]
2011-07-19
The present invention provides a method utilizing primary amino acid sequence of a protein, energy minimization, molecular dynamics and protein vibrational modes to predict three-dimensional structure of a protein. The present invention also determines possible intermediates in the protein folding pathway. The present invention has important applications to the design of novel drugs as well as protein engineering. The present invention predicts the three-dimensional structure of a protein independent of size of the protein, overcoming a significant limitation in the prior art.
A discrete search algorithm for finding the structure of protein backbones and side chains.
Sallaume, Silas; Martins, Simone de Lima; Ochi, Luiz Satoru; Da Silva, Warley Gramacho; Lavor, Carlile; Liberti, Leo
2013-01-01
Some information about protein structure can be obtained by using Nuclear Magnetic Resonance (NMR) techniques, but they provide only a sparse set of distances between atoms in a protein. The Molecular Distance Geometry Problem (MDGP) consists in determining the three-dimensional structure of a molecule using a set of known distances between some atoms. Recently, a Branch and Prune (BP) algorithm was proposed to calculate the backbone of a protein, based on a discrete formulation for the MDGP. We present an extension of the BP algorithm that can calculate not only the protein backbone, but the whole three-dimensional structure of proteins.
Yeates, Todd O.; Padilla, Jennifer; Colovos, Chris
2004-06-29
Novel fusion proteins capable of self-assembling into regular structures, as well as nucleic acids encoding the same, are provided. The subject fusion proteins comprise at least two oligomerization domains rigidly linked together, e.g. through an alpha helical linking group. Also provided are regular structures comprising a plurality of self-assembled fusion proteins of the subject invention, and methods for producing the same. The subject fusion proteins find use in the preparation of a variety of nanostructures, where such structures include: cages, shells, double-layer rings, two-dimensional layers, three-dimensional crystals, filaments, and tubes.
Nannenga, Brent L; Iadanza, Matthew G; Vollmar, Breanna S; Gonen, Tamir
2013-01-01
Electron cryomicroscopy, or cryoEM, is an emerging technique for studying the three-dimensional structures of proteins and large macromolecular machines. Electron crystallography is a branch of cryoEM in which structures of proteins can be studied at resolutions that rival those achieved by X-ray crystallography. Electron crystallography employs two-dimensional crystals of a membrane protein embedded within a lipid bilayer. The key to a successful electron crystallographic experiment is the crystallization, or reconstitution, of the protein of interest. This unit describes ways in which protein can be expressed, purified, and reconstituted into well-ordered two-dimensional crystals. A protocol is also provided for negative stain electron microscopy as a tool for screening crystallization trials. When large and well-ordered crystals are obtained, the structures of both protein and its surrounding membrane can be determined to atomic resolution.
Nagata, Koji
2010-01-01
Peptides and proteins with similar amino acid sequences can have different biological functions. Knowledge of their three-dimensional molecular structures is critically important in identifying their functional determinants. In this review, I describe the results of our and other groups' structure-based functional characterization of insect insulin-like peptides, a crustacean hyperglycemic hormone-family peptide, a mammalian epidermal growth factor-family protein, and an intracellular signaling domain that recognizes proline-rich sequences.
NASA Technical Reports Server (NTRS)
Luo, Ming (Inventor); Sha, Bingdong (Inventor)
2000-01-01
The matrix protein, M1, of influenza virus strain A/PR/8/34 has been purified from virions and crystallized. The crystals consist of a stable fragment (18 kDa) of the M1 protein. X-ray diffraction studies indicated that the crystals have a space group of P3₁21 or P3₂21. Vm calculations showed that there are two monomers in an asymmetric unit. A crystallized N-terminal domain of M1, wherein the N-terminal domain of M1 is crystallized such that the three-dimensional structure of the crystallized N-terminal domain of M1 can be determined to a resolution of about 2.1 Å or better, and wherein the three-dimensional structure of the uncrystallized N-terminal domain of M1 cannot be determined to a resolution of about 2.1 Å or better. A method of purifying M1 and a method of crystallizing M1. A method of using the three-dimensional crystal structure of M1 to screen for antiviral, influenza virus treating or preventing compounds. A method of using the three-dimensional crystal structure of M1 to screen for improved binding to or inhibition of influenza virus M1. The use of the three-dimensional crystal structure of the M1 protein of influenza virus in the manufacture of an inhibitor of influenza virus M1. The use of the three-dimensional crystal structure of the M1 protein of influenza virus in the screening of candidates for inhibition of influenza virus M1.
ERIC Educational Resources Information Center
Hodis, Eran; Prilusky, Jaime; Sussman, Joel L.
2010-01-01
Protein structures are hard to represent on paper. They are large, complex, and three-dimensional (3D)--four-dimensional if conformational changes count! Unlike most of their substrates, which can easily be drawn out in full chemical formula, drawing every atom in a protein would usually be a mess. Simplifications like showing only the surface of…
Protein Engineering Approaches in the Post-Genomic Era.
Singh, Raushan K; Lee, Jung-Kul; Selvaraj, Chandrabose; Singh, Ranjitha; Li, Jinglin; Kim, Sang-Yong; Kalia, Vipin C
2018-01-01
Proteins are one of the most multifaceted macromolecules in living systems. Proteins have evolved to function under physiological conditions and, therefore, are not usually tolerant of harsh experimental and environmental conditions. The growing use of proteins in industrial processes as a greener alternative to chemical catalysts often demands constant innovation to improve their performance. Protein engineering aims to design new proteins or modify the sequence of a protein to create proteins with new or desirable functions. With the emergence of structural and functional genomics, protein engineering has been invigorated in the post-genomic era. The three-dimensional structures of proteins with known functions facilitate protein engineering approaches to design variants with desired properties. There are three major approaches to protein engineering research, namely, directed evolution, rational design, and de novo design. Rational design is an effective method of protein engineering when the three-dimensional structure and mechanism of the protein are well known. In contrast, directed evolution does not require extensive information or a three-dimensional structure of the protein of interest. Instead, it involves random mutagenesis and selection to screen enzymes with desired properties. De novo design uses computational protein design algorithms to tailor synthetic proteins by using the three-dimensional structures of natural proteins and their folding rules. The present review highlights and summarizes recent protein engineering approaches, and their challenges and limitations in the post-genomic era.
Xiaodan, Chen; Xiurong, Zhan; Xinyu, Wu; Chunyan, Zhao; Wanghong, Zhao
2015-04-01
The aim of this study is to analyze the three-dimensional crystal structure of SMU.2055 protein, a putative acetyltransferase from the major caries pathogen Streptococcus mutans (S. mutans). The design and selection of structure-based small molecule inhibitors are also studied. The three-dimensional crystal structure of SMU.2055 protein was obtained by structural genomics research methods of gene cloning and expression, protein purification with Ni²⁺-chelating affinity chromatography, crystal screening, and X-ray diffraction data collection. A virtual inhibitor model matching the target protein structure was set up using computer-aided drug design methods, virtual screening and fine docking, and Libdock and Autodock procedures. The crystal of SMU.2055 protein was obtained, and its three-dimensional crystal structure was analyzed. The crystal diffracted to a resolution of 0.23 nm. It belongs to the orthorhombic space group C222₁, with unit cell parameters of a = 9.20 nm, b = 9.46 nm, and c = 19.39 nm. The asymmetric unit contained four molecules, with a solvent content of 56.7%. Moreover, five small molecule compounds, whose structures closely matched that of the target protein, were designed and selected. Protein crystallography research of S. mutans SMU.2055 helps to understand the structures and functions of proteins from S. mutans at the atomic level. These five compounds may be considered effective inhibitors of SMU.2055. The virtual model of small molecule inhibitors we built will lay a foundation for anticaries research based on the crystal structure of proteins.
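The crystal parameters quoted above can be cross-checked with the Matthews relation, solvent fraction ≈ 1 − 1.23/V_M (V_M in Å³/Da). The following is only an illustrative sketch, not the authors' calculation: the function names are hypothetical, the factor of 8 asymmetric units per cell is the general-position multiplicity of the reported orthorhombic space group, and the molecular mass it prints is merely what the reported numbers imply, not a value stated in the abstract.

```python
def orthorhombic_cell_volume_A3(a_nm, b_nm, c_nm):
    """Volume of an orthorhombic unit cell in cubic angstroms (1 nm = 10 A)."""
    return (a_nm * 10.0) * (b_nm * 10.0) * (c_nm * 10.0)


def matthews_from_solvent(solvent_fraction):
    """Invert the Matthews relation: solvent = 1 - 1.23 / V_M."""
    return 1.23 / (1.0 - solvent_fraction)


def implied_mass_da(volume_A3, molecules_per_cell, v_m):
    """Mass per molecule implied by V_M = V / (n_molecules * mass)."""
    return volume_A3 / (molecules_per_cell * v_m)


# Values taken from the abstract: a, b, c in nm and 56.7% solvent.
volume = orthorhombic_cell_volume_A3(9.20, 9.46, 19.39)
v_m = matthews_from_solvent(0.567)

# Assumption: 8 asymmetric units per cell, 4 molecules in each.
molecules = 8 * 4
mass = implied_mass_da(volume, molecules, v_m)

print(f"cell volume = {volume:.0f} A^3")
print(f"V_M = {v_m:.2f} A^3/Da")
print(f"implied mass per molecule = {mass / 1000:.1f} kDa")
```

The implied mass is a back-calculation and should be read as a plausibility check only.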
Yeh, Chun-Ting; Brunette, T J; Baker, David; McIntosh-Smith, Simon; Parmeggiani, Fabio
2018-02-01
Computational protein design methods have enabled the design of novel protein structures, but they are often still limited to small proteins and symmetric systems. To expand the size of designable proteins while controlling the overall structure, we developed Elfin, a genetic algorithm for the design of novel proteins with custom shapes using structural building blocks derived from experimentally verified repeat proteins. By combining building blocks with compatible interfaces, it is possible to rapidly build non-symmetric large structures (>1000 amino acids) that match three-dimensional geometric descriptions provided by the user. A run time of about 20 min on a laptop computer for a 3000 amino acid structure makes Elfin accessible to users with limited computational resources. Protein structures with controlled geometry will allow the systematic study of the effect of spatial arrangement of enzymes and signaling molecules, and provide new scaffolds for functional nanomaterials.
Feng, Yingang
2017-01-01
The use of NMR methods to determine the three-dimensional structures of carbohydrates and glycoproteins is still challenging, in part because of the lack of standard protocols. In order to increase the convenience of structure determination, the topology and parameter files for carbohydrates in the program Crystallography & NMR System (CNS) were investigated and new files were developed to be compatible with the standard simulated annealing protocols for proteins and nucleic acids. Recalculating the published structures of protein-carbohydrate complexes and glycosylated proteins demonstrates that the results are comparable to the published structures which employed more complex procedures for structure calculation. Integrating the new carbohydrate parameters into the standard structure calculation protocol will facilitate three-dimensional structural study of carbohydrates and glycosylated proteins by NMR spectroscopy. PMID:29232406
Kato, Koichi; Nakayoshi, Tomoki; Fukuyoshi, Shuichi; Kurimoto, Eiji; Oda, Akifumi
2017-10-12
Although various higher-order protein structure prediction methods have been developed, almost all of them were developed based on the three-dimensional (3D) structure information of known proteins. Here we predicted short protein structures by molecular dynamics (MD) simulations in which only Newton's equations of motion were used and 3D structural information of known proteins was not required. To evaluate the ability of MD simulation to predict protein structures, we simulated seven short test proteins (10-46 residues) starting from the denatured state and compared their predicted and experimental structures. The predicted structure for Trp-cage (20 residues) was close to the experimental structure after a 200-ns MD simulation. For proteins shorter or longer than Trp-cage, root-mean-square deviation values were larger than those for Trp-cage. However, secondary structures could be reproduced by MD simulations for proteins with 10-34 residues. Simulations by replica exchange MD were performed, but the results were similar to those from normal MD simulations. These results suggest that normal MD simulations can roughly predict short protein structures and that 200-ns simulations are frequently sufficient for estimating the secondary structures of proteins (approximately 20 residues). Structure prediction methods using only fundamental physical laws are useful for investigating non-natural proteins, such as primitive proteins and artificial proteins for peptide-based drug delivery systems.
Shao, W; Fernandez, E; Wilken, J; Thompson, D A; Siani, M A; West, J; Lolis, E; Schweitzer, B I
1998-12-11
The determination of high resolution three-dimensional structures by X-ray crystallography or nuclear magnetic resonance (NMR) is a time-consuming process. Here we describe an approach to circumvent the cloning and expression of a recombinant protein as well as screening for heavy atom derivatives. The selenomethionine-modified chemokine macrophage inflammatory protein-II (MIP-II) from human herpesvirus-8 has been produced by total chemical synthesis, crystallized, and characterized by NMR. The protein has a secondary structure typical of other chemokines and forms a monomer in solution. These results indicate that total chemical synthesis can be used to accelerate the determination of three-dimensional structures of new proteins identified in genome programs.
X-ray scattering data and structural genomics
NASA Astrophysics Data System (ADS)
Doniach, Sebastian
2003-03-01
High throughput structural genomics has the ambitious goal of determining the structure of all, or a very large number of, protein folds using the high-resolution techniques of protein crystallography and NMR. However, the program is facing significant bottlenecks in reaching this goal, which include problems of protein expression and crystallization. In this talk, some preliminary results on how the low-resolution technique of small-angle X-ray solution scattering (SAXS) can help ameliorate some of these bottlenecks will be presented. One of the most significant bottlenecks arises from the difficulty of crystallizing integral membrane proteins, where only a handful of structures are available compared to thousands of structures for soluble proteins. By 3-dimensional reconstruction from SAXS data, the size and shape of detergent-solubilized integral membrane proteins can be characterized. This information can then be used to classify membrane proteins, which constitute some 25% of all genomes. SAXS may also be used to study the dependence of interparticle interference scattering on solvent conditions so that regions of the protein solution phase diagram which favor crystallization can be elucidated. As a further application, SAXS may be used to provide physical constraints on computational methods for protein structure prediction based on primary sequence information. This in turn can help in identifying structural homologs of a given protein, which can then give clues to its function. References: D. Walther, F. Cohen and S. Doniach, "Reconstruction of low resolution three-dimensional density maps from one-dimensional small angle x-ray scattering data for biomolecules," J. Appl. Cryst. 33(2):350-363 (2000). W. J. Zheng and S. Doniach, "Protein structure prediction constrained by solution X-ray scattering data and structural homology identification," J. Mol. Biol. 316(1):173-187 (2002).
Serrano, Pedro; Dutta, Samit K; Proudfoot, Andrew; Mohanty, Biswaranjan; Susac, Lukas; Martin, Bryan; Geralt, Michael; Jaroszewski, Lukasz; Godzik, Adam; Elsliger, Marc; Wilson, Ian A; Wüthrich, Kurt
2016-11-01
For more than a decade, the Joint Center for Structural Genomics (JCSG; www.jcsg.org) worked toward increased three-dimensional structure coverage of the protein universe. This coordinated quest was one of the main goals of the four high-throughput (HT) structure determination centers of the Protein Structure Initiative (PSI; www.nigms.nih.gov/Research/specificareas/PSI). To achieve the goals of the PSI, the JCSG made use of the complementarity of structure determination by X-ray crystallography and nuclear magnetic resonance (NMR) spectroscopy to increase and diversify the range of targets entering the HT structure determination pipeline. The overall strategy, for both techniques, was to determine atomic resolution structures for representatives of large protein families, as defined by the Pfam database, which had no structural coverage and could make significant contributions to biological and biomedical research. Furthermore, the experimental structures could be leveraged by homology modeling to further expand the structural coverage of the protein universe and increase biological insights. Here, we describe what could be achieved by this structural genomics approach, using as an illustration the contributions from 20 NMR structure determinations out of a total of 98 JCSG NMR structures, which were selected because they are the first three-dimensional structure representations of the respective Pfam protein families. The information from this small sample is representative of the overall results from crystal and NMR structure determination in the JCSG. There are five new folds, which were classified as domains of unknown function (DUFs), three of the proteins could be functionally annotated based on three-dimensional structure similarity with previously characterized proteins, and 12 proteins showed only limited similarity with previous deposits in the Protein Data Bank (PDB) and were classified as DUFs.
Jian, Jhih-Wei; Elumalai, Pavadai; Pitti, Thejkiran; Wu, Chih Yuan; Tsai, Keng-Chang; Chang, Jeng-Yih; Peng, Hung-Pin; Yang, An-Suei
2016-01-01
Predicting ligand binding sites (LBSs) on protein structures, which are obtained either from experimental or computational methods, is a useful first step in functional annotation or structure-based drug design for the protein structures. In this work, the structure-based machine learning algorithm ISMBLab-LIG was developed to predict LBSs on protein surfaces with input attributes derived from the three-dimensional probability density maps of interacting atoms, which were reconstructed on the query protein surfaces and were relatively insensitive to local conformational variations of the tentative ligand binding sites. The prediction accuracy of the ISMBLab-LIG predictors is comparable to that of the best LBS predictors benchmarked on several well-established testing datasets. More importantly, the ISMBLab-LIG algorithm has substantial tolerance to the prediction uncertainties of computationally derived protein structure models. As such, the method is particularly useful for predicting LBSs not only on experimental protein structures without known LBS templates in the database but also on computationally predicted model protein structures with structural uncertainties in the tentative ligand binding sites. PMID:27513851
Representing and comparing protein structures as paths in three-dimensional space
Zhi, Degui; Krishna, S Sri; Cao, Haibo; Pevzner, Pavel; Godzik, Adam
2006-01-01
Background: Most existing formulations of protein structure comparison are based on detailed atomic level descriptions of protein structures and bypass potential insights that arise from a higher-level abstraction. Results: We propose a structure comparison approach based on a simplified representation of a protein that describes its three-dimensional path by local curvature along the generalized backbone of the polypeptide. We have implemented a dynamic programming procedure that aligns curvatures of proteins by optimizing a defined sum turning angle deviation measure. Conclusion: Although our procedure does not directly optimize global structural similarity as measured by RMSD, our benchmarking results indicate that it recovers surprisingly well the structural similarity defined by structure classification databases and traditional structure alignment programs. In addition, our program can recognize similarities between structures with extensive conformation changes that are beyond the ability of traditional structure alignment programs. We demonstrate the applications of the procedure to several contexts of structure comparison. An implementation of our procedure, CURVE, is available as a public webserver. PMID:17052359
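The idea of comparing backbone paths by their local turning can be sketched in a few lines. This is a simplified illustration and not the CURVE implementation: the turning angle here is just the angle between consecutive backbone segments, the cost is the absolute angle difference, and the gap penalty of 0.3 rad is an arbitrary assumption; all function names are hypothetical.

```python
import math


def turning_angles(points):
    """Angle (radians) between consecutive segments of a 3D path."""
    angles = []
    for p, q, r in zip(points, points[1:], points[2:]):
        u = [q[i] - p[i] for i in range(3)]
        v = [r[i] - q[i] for i in range(3)]
        dot = sum(a * b for a, b in zip(u, v))
        norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
        cosine = max(-1.0, min(1.0, dot / norm))  # guard against rounding
        angles.append(math.acos(cosine))
    return angles


def alignment_cost(a, b, gap=0.3):
    """Global alignment of two angle profiles by dynamic programming.

    Matching two positions costs their absolute angle difference;
    skipping a position in either profile costs `gap` (assumed value).
    """
    rows, cols = len(a) + 1, len(b) + 1
    dp = [[0.0] * cols for _ in range(rows)]
    for i in range(1, rows):
        dp[i][0] = i * gap
    for j in range(1, cols):
        dp[0][j] = j * gap
    for i in range(1, rows):
        for j in range(1, cols):
            dp[i][j] = min(
                dp[i - 1][j - 1] + abs(a[i - 1] - b[j - 1]),  # match
                dp[i - 1][j] + gap,                            # gap in b
                dp[i][j - 1] + gap,                            # gap in a
            )
    return dp[-1][-1]


# A right-angle bend followed by a straight continuation.
path = [(0, 0, 0), (1, 0, 0), (1, 1, 0), (1, 2, 0)]
profile = turning_angles(path)
print(profile, alignment_cost(profile, profile))
```

Because the profile depends only on local geometry, two structures that differ by a hinge motion can still share long stretches of near-identical angles, which is the property the abstract highlights.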
Camproux, A C; Tufféry, P
2005-08-05
Understanding and predicting protein structures depend on the complexity and the accuracy of the models used to represent them. We have recently set up a Hidden Markov Model to optimally compress protein three-dimensional conformations into a one-dimensional series of letters of a structural alphabet. Such a model learns simultaneously the shape of representative structural letters describing the local conformation and the logic of their connections, i.e. the transition matrix between the letters. Here, we move one step further and report some evidence that such a model of protein local architecture also captures some accurate amino acid features. All the letters have specific and distinct amino acid distributions. Moreover, we show that words of amino acids can have significant propensities for some letters. Perspectives point towards the prediction of the series of letters describing the structure of a protein from its amino acid sequence.
The structure of a cholesterol-trapping protein
Berkeley Lab Science Beat
2003-02-28
… Institute researchers determined the three-dimensional structure of a protein that controls cholesterol level in the bloodstream. Knowing the structure of the protein, a cellular receptor that ensnares …
Computational methods for constructing protein structure models from 3D electron microscopy maps.
Esquivel-Rodríguez, Juan; Kihara, Daisuke
2013-10-01
Protein structure determination by cryo-electron microscopy (EM) has made significant progress in the past decades. Resolutions of EM maps have been improving as evidenced by recently reported structures that are solved at high resolutions close to 3 Å. Computational methods play a key role in interpreting EM data. Among many computational procedures applied to an EM map to obtain protein structure information, in this article we focus on reviewing computational methods that model protein three-dimensional (3D) structures from a 3D EM density map that is constructed from two-dimensional (2D) maps. The computational methods we discuss range from de novo methods, which identify structural elements in an EM map, to structure fitting methods, where known high resolution structures are fit into a low-resolution EM map. A list of available computational tools is also provided.
NASA Astrophysics Data System (ADS)
Kim, Duckhoe; Sahin, Ozgur
2015-03-01
Scanning probe microscopes can be used to image and chemically characterize surfaces down to the atomic scale. However, the localized tip-sample interactions in scanning probe microscopes limit high-resolution images to the topmost atomic layer of surfaces, and characterizing the inner structures of materials and biomolecules is a challenge for such instruments. Here, we show that an atomic force microscope can be used to image and three-dimensionally reconstruct chemical groups inside a protein complex. We use short single-stranded DNAs as imaging labels that are linked to target regions inside a protein complex, and T-shaped atomic force microscope cantilevers functionalized with complementary probe DNAs allow the labels to be located with sequence specificity and subnanometre resolution. After measuring pairwise distances between labels, we reconstruct the three-dimensional structure formed by the target chemical groups within the protein complex using simple geometric calculations. Experiments with the biotin-streptavidin complex show that the predicted three-dimensional loci of the carboxylic acid groups of biotins are within 2 Å of their respective loci in the corresponding crystal structure, suggesting that scanning probe microscopes could complement existing structural biological techniques in solving structures that are difficult to study due to their size and complexity.
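Reconstructing positions from pairwise distances, as described above, can be illustrated for four labels. The sketch below is an assumption-laden toy, not the authors' procedure: it fixes the first point at the origin, the second on the x-axis, and the third in the xy-plane, then solves for the fourth. The function name is hypothetical, and the sign of the last z-coordinate is chosen positive because distances alone cannot distinguish a structure from its mirror image.

```python
import math


def place_four_points(d01, d02, d12, d03, d13, d23):
    """Coordinates of four points from their six pairwise distances."""
    p0 = (0.0, 0.0, 0.0)
    p1 = (d01, 0.0, 0.0)

    # Third point: law of cosines in the xy-plane.
    x2 = (d01 ** 2 + d02 ** 2 - d12 ** 2) / (2.0 * d01)
    y2 = math.sqrt(max(d02 ** 2 - x2 ** 2, 0.0))
    p2 = (x2, y2, 0.0)

    # Fourth point: intersect the three spheres around p0, p1, p2.
    x3 = (d01 ** 2 + d03 ** 2 - d13 ** 2) / (2.0 * d01)
    y3 = (d03 ** 2 - d23 ** 2 + x2 ** 2 + y2 ** 2 - 2.0 * x2 * x3) / (2.0 * y2)
    z3 = math.sqrt(max(d03 ** 2 - x3 ** 2 - y3 ** 2, 0.0))  # mirror: -z3
    p3 = (x3, y3, z3)
    return p0, p1, p2, p3


# Distances computed from the points (0,0,0), (3,0,0), (1,2,0), (1,1,2).
points = place_four_points(
    3.0, math.sqrt(5.0), math.sqrt(8.0),
    math.sqrt(6.0), 3.0, math.sqrt(5.0),
)
print(points)
```

The construction requires the first three points not to be collinear (otherwise `y2` is zero); with noisy measurements a least-squares fit over all distances would be more robust than this direct solution.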
Impact of genetic variation on three dimensional structure and function of proteins
Bhattacharya, Roshni; Rose, Peter W.; Burley, Stephen K.
2017-01-01
The Protein Data Bank (PDB; http://wwpdb.org) was established in 1971 as the first open access digital data resource in biology with seven protein structures as its initial holdings. The global PDB archive now contains more than 126,000 experimentally determined atomic level three-dimensional (3D) structures of biological macromolecules (proteins, DNA, RNA), all of which are freely accessible via the Internet. Knowledge of the 3D structure of the gene product can help in understanding its function and role in disease. Of particular interest in the PDB archive are proteins for which 3D structures of genetic variant proteins have been determined, thus revealing atomic-level structural differences caused by the variation at the DNA level. Herein, we present a systematic and qualitative analysis of such cases. We observe a wide range of structural and functional changes caused by single amino acid differences, including changes in enzyme activity, aggregation propensity, structural stability, binding, and dissociation, some in the context of large assemblies. Structural comparison of wild type and mutated proteins, when both are available, provides insights into atomic-level structural differences caused by the genetic variation. PMID:28296894
An approach to large scale identification of non-obvious structural similarities between proteins
Cherkasov, Artem; Jones, Steven JM
2004-01-01
Background: A new sequence-independent bioinformatics approach allowing genome-wide search for proteins with similar three-dimensional structures has been developed. By utilizing the numerical output of sequence threading, it establishes putative non-obvious structural similarities between proteins. When applied to a testing set of proteins with known three-dimensional structures, the developed approach was able to recognize structurally similar proteins with high accuracy. Results: The method has been developed to identify pathogenic proteins with low sequence identity and high structural similarity to host analogues. Such protein structure relationships would be hypothesized to arise through convergent evolution or through ancient horizontal gene transfer events, now undetectable using current sequence alignment techniques. The pathogen proteins, which could mimic or interfere with host activities, would represent candidate virulence factors. The developed approach utilizes the numerical outputs from sequence-structure threading. It identifies the potential structural similarity between a pair of proteins by correlating the threading scores of the corresponding two primary sequences against the library of standard folds. This approach allowed up to 64% sensitivity and 99.9% specificity in distinguishing protein pairs with high structural similarity. Conclusion: Preliminary results obtained by comparison of the genomes of Homo sapiens and several strains of Chlamydia trachomatis have demonstrated the potential usefulness of the method in the identification of bacterial proteins with known or potential roles in virulence. PMID:15147578
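The core step described above, correlating two sequences' threading scores across a common fold library, might look roughly like the following. This is a hedged sketch: the abstract does not say which correlation measure was used, so Pearson correlation is an assumption, the function name is hypothetical, and the score vectors are made-up numbers for illustration only.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length score profiles."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("profiles must have equal length of at least 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0.0 or var_y == 0.0:
        raise ValueError("a constant profile has no defined correlation")
    return cov / math.sqrt(var_x * var_y)


# Illustrative (invented) threading scores of three sequences against
# the same five library folds, in the same fold order.
host_protein = [12.1, 3.4, 8.8, 1.0, 6.5]
pathogen_a = [11.5, 2.9, 9.3, 1.4, 6.0]   # similar fold preferences
pathogen_b = [2.0, 9.8, 1.1, 10.2, 3.3]   # different fold preferences

print(pearson(host_protein, pathogen_a))
print(pearson(host_protein, pathogen_b))
```

A pair whose profiles correlate strongly would be flagged as a candidate structural match even when the two sequences themselves share little identity; the decision threshold is a separate choice not specified here.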
Structure and function of seed storage proteins in faba bean (Vicia faba L.).
Liu, Yujiao; Wu, Xuexia; Hou, Wanwei; Li, Ping; Sha, Weichao; Tian, Yingying
2017-05-01
The protein subunit is the most important basic unit of protein, and its study can unravel the structure and function of seed storage proteins in faba bean. In this study, we identified six specific protein subunits in Faba bean (cv. Qinghai 13) combining liquid chromatography (LC), liquid chromatography-electronic spray ionization mass (LC-ESI-MS/MS) and bio-information technology. The results suggested a diversity of seed storage proteins in faba bean, and a total of 16 proteins (four GroEL molecular chaperones and 12 plant-specific proteins) were identified from 97-, 96-, 64-, 47-, 42-, and 38-kD-specific protein subunits in faba bean based on the peptide sequence. We also analyzed the composition and abundance of the amino acids, the physicochemical characteristics, secondary structure, three-dimensional structure, transmembrane domain, and possible subcellular localization of these identified proteins in faba bean seed, and finally predicted function and structure. The three-dimensional structures were generated based on homologous modeling, and the protein function was analyzed based on the annotation from the non-redundant protein database (NR database, NCBI) and function analysis of optimal modeling. The objective of this study was to identify the seed storage proteins in faba bean and confirm the structure and function of these proteins. Our results can be useful for the study of protein nutrition and achieve breeding goals for optimal protein quality in faba bean.
X-ray diffraction study of Penicillium Vitale catalase in the complex with aminotriazole
DOE Office of Scientific and Technical Information (OSTI.GOV)
Borovik, A. A.; Grebenko, A. I.; Melik-Adamyan, V. R., E-mail: mawr@ns.crys.ras.ru
2011-07-15
The three-dimensional structure of the enzyme catalase from Penicillium vitale in a complex with the inhibitor aminotriazole was solved and refined by protein X-ray crystallography methods. An analysis of the three-dimensional structure of the complex showed that the inhibition of the enzyme occurs as a result of the covalent binding of aminotriazole to the amino-acid residue His64 in the active site of the enzyme. An investigation of the three-dimensional structure of the complex resulted in the amino-acid residues being more precisely identified. The binding sites of saccharide residues and calcium ions in the protein molecule were found.
Johann Deisenhofer, Crystallography, and Proteins
research using X-ray crystallography to elucidate for the first time the three-dimensional structure of a large membrane-bound protein molecule. This structure helped explain the process of photosynthesis, by a protein structure determination that relied on complementary features of two different beam lines
Conformational Sampling in Template-Free Protein Loop Structure Modeling: An Overview
Li, Yaohang
2013-01-01
Accurately modeling protein loops is an important step to predict three-dimensional structures as well as to understand functions of many proteins. Because of their high flexibility, modeling the three-dimensional structures of loops is difficult and is usually treated as a “mini protein folding problem” under geometric constraints. In the past decade, there has been remarkable progress in template-free loop structure modeling due to advances of computational methods as well as stably increasing number of known structures available in PDB. This mini review provides an overview on the recent computational approaches for loop structure modeling. In particular, we focus on the approaches of sampling loop conformation space, which is a critical step to obtain high resolution models in template-free methods. We review the potential energy functions for loop modeling, loop buildup mechanisms to satisfy geometric constraints, and loop conformation sampling algorithms. The recent loop modeling results are also summarized. PMID:24688696
Investigations of photosynthetic light harvesting by two-dimensional electronic spectroscopy
NASA Astrophysics Data System (ADS)
Read, Elizabeth Louise
Photosynthesis begins with the harvesting of sunlight by antenna pigments, organized in a network of pigment-protein complexes that rapidly funnel energy to photochemical reaction centers. The intricate design of these systems---the widely varying structural motifs of pigment organization within proteins and protein organization within a larger, cooperative network---underlies the remarkable speed and efficiency of light harvesting. Advances in femtosecond laser spectroscopy have enabled researchers to follow light energy on its course through the energetic levels of photosynthetic systems. Now, newly-developed femtosecond two-dimensional electronic spectroscopy reveals deeper insight into the fundamental molecular interactions and dynamics that emerge in these structures. The following chapters present investigations of a number of natural light-harvesting complexes using two-dimensional electronic spectroscopy. These studies demonstrate the various types of information contained in experimental two-dimensional spectra, and they show that the technique makes it possible to probe pigment-protein complexes on the length- and time-scales relevant to their functioning. New methods are described that further extend the capabilities of two-dimensional electronic spectroscopy, for example, by independently controlling the excitation laser pulse polarizations. The experiments, coupled with theoretical simulation, elucidate spatial pathways of energy flow, unravel molecular and electronic structures, and point to potential new quantum mechanical mechanisms of light harvesting.
2008-05-01
4). The three-dimensional spatial orientation of the atoms for these resolved solution structures (Protein Data Bank accession codes: 2gt3...Crystal structure of the Escherichia coli peptide methionine sulphoxide reductase at 1.9 Å resolution. Struct. Fold. Des. 8: 1167–1178. 2. Brot...sources (8). There is a 67% sequence identity between the E. coli and human MsrA (2). N-terminus C-terminus Figure 2. Three-dimensional structure
Motivated Proteins: A web application for studying small three-dimensional protein motifs
Leader, David P; Milner-White, E James
2009-01-01
Background Small loop-shaped motifs are common constituents of the three-dimensional structure of proteins. Typically they comprise between three and seven amino acid residues, and are defined by a combination of dihedral angles and hydrogen bonding partners. The most abundant of these are αβ-motifs, asx-motifs, asx-turns, β-bulges, β-bulge loops, β-turns, nests, niches, Schellmann loops, ST-motifs, ST-staples and ST-turns. We have constructed a database of such motifs from a range of high-quality protein structures and built a web application as a visual interface to this. Description The web application, Motivated Proteins, provides access to these 12 motifs (with 48 sub-categories) in a database of over 400 representative proteins. Queries can be made for specific categories or sub-categories of motif, motifs in the vicinity of ligands, motifs which include part of an enzyme active site, overlapping motifs, or motifs which include a particular amino acid sequence. Individual proteins can be specified, or, where appropriate, motifs for all proteins listed. The results of queries are presented in textual form as an (X)HTML table, and may be saved as parsable plain text or XML. Motifs can be viewed and manipulated either individually or in the context of the protein in the Jmol applet structural viewer. Cartoons of the motifs imposed on a linear representation of protein secondary structure are also provided. Summary information for the motifs is available, as are histograms of amino acid distribution, and graphs of dihedral angles at individual positions in the motifs. Conclusion Motivated Proteins is a publicly and freely accessible web application that enables protein scientists to study small three-dimensional motifs without requiring knowledge of either Structured Query Language or the underlying database schema. PMID:19210785
Protein crystallization studies
NASA Technical Reports Server (NTRS)
Lyne, James Evans
1996-01-01
The Structural Biology laboratory at NASA Marshall Spaceflight Center uses x-ray crystallographic techniques to conduct research into the three-dimensional structure of a wide variety of proteins. A major effort in the laboratory involves an ongoing study of human serum albumin (the principal protein in human plasma) and its interaction with various endogenous substances and pharmaceutical agents. Another focus is on antigenic and functional proteins from several pathogenic organisms including the human immunodeficiency virus (HIV) and the widespread parasitic genus, Schistosoma. My efforts this summer have been twofold: first, to identify clinically significant drug interactions involving albumin binding displacement and to initiate studies of the three-dimensional structure of albumin complexed with these agents, and secondly, to establish collaborative efforts to extend the lab's work on human pathogens.
Resource for structure related information on transmembrane proteins
NASA Astrophysics Data System (ADS)
Tusnády, Gábor E.; Simon, István
Transmembrane proteins are involved in a wide variety of vital biological processes including transport of water-soluble molecules, flow of information and energy production. Despite significant efforts to determine the structures of these proteins, only a few thousand solved structures are known so far. Here, we review the various resources for structure-related information on these types of proteins ranging from the 3D structure to the topology and from the up-to-date databases to the various Internet sites and servers dealing with structure prediction and structure analysis. Abbreviations: 3D, three dimensional; PDB, Protein Data Bank; TMP, transmembrane protein.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Sang Beom; Dsilva, Carmeline J.; Debenedetti, Pablo G., E-mail: pdebene@princeton.edu
Understanding the mechanisms by which proteins fold from disordered amino-acid chains to spatially ordered structures remains an area of active inquiry. Molecular simulations can provide atomistic details of the folding dynamics which complement experimental findings. Conventional order parameters, such as root-mean-square deviation and radius of gyration, provide structural information but fail to capture the underlying dynamics of the protein folding process. It is therefore advantageous to adopt a method that can systematically analyze simulation data to extract relevant structural as well as dynamical information. The nonlinear dimensionality reduction technique known as diffusion maps automatically embeds the high-dimensional folding trajectories in a lower-dimensional space from which one can more easily visualize folding pathways, assuming the data lie approximately on a lower-dimensional manifold. The eigenvectors that parametrize the low-dimensional space, furthermore, are determined systematically, rather than chosen heuristically, as is done with phenomenological order parameters. We demonstrate that diffusion maps can effectively characterize the folding process of a Trp-cage miniprotein. By embedding molecular dynamics simulation trajectories of Trp-cage folding in diffusion maps space, we identify two folding pathways and intermediate structures that are consistent with the previous studies, demonstrating that this technique can be employed as an effective way of analyzing and constructing protein folding pathways from molecular simulations.
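For orientation, the diffusion-map construction mentioned above is usually written along the following lines. This is one common textbook convention, given here as an assumption: the abstract does not state the distance measure d, the kernel scale ε, or the normalization the authors actually used.

```latex
% Gaussian kernel on pairwise distances between simulation snapshots x_i, x_j
K_{ij} = \exp\!\left(-\frac{d(x_i, x_j)^2}{2\varepsilon}\right), \qquad
D_{ii} = \sum_j K_{ij}, \qquad
M = D^{-1} K

% Eigen-decomposition of the row-stochastic matrix M
M \psi_k = \lambda_k \psi_k, \qquad
1 = \lambda_0 \ge \lambda_1 \ge \lambda_2 \ge \dots

% Embedding of snapshot x_i into the first m nontrivial coordinates
x_i \mapsto \bigl(\lambda_1 \psi_1(i), \ldots, \lambda_m \psi_m(i)\bigr)
```

The leading eigenvector ψ_0 is constant and carries no information, which is why the embedding starts at ψ_1; the coordinates ψ_1, ψ_2, ... play the role of the systematically determined order parameters referred to in the abstract.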
Arana-Daniel, Nancy; Gallegos, Alberto A; López-Franco, Carlos; Alanís, Alma Y; Morales, Jacob; López-Franco, Adriana
2016-01-01
With the increasing power of computers, the amount of data that can be processed in small periods of time has grown exponentially, as has the importance of classifying large-scale data efficiently. Support vector machines have shown good results classifying large amounts of high-dimensional data, such as data generated by protein structure prediction, spam recognition, medical diagnosis, optical character recognition and text classification, etc. Most state of the art approaches for large-scale learning use traditional optimization methods, such as quadratic programming or gradient descent, which makes the use of evolutionary algorithms for training support vector machines an area to be explored. The present paper proposes an approach that is simple to implement based on evolutionary algorithms and Kernel-Adatron for solving large-scale classification problems, focusing on protein structure prediction. The functional properties of proteins depend upon their three-dimensional structures. Knowing the structures of proteins is crucial for biology and can lead to improvements in areas such as medicine, agriculture and biofuels.
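As background for the Kernel-Adatron component named above, the classic Kernel-Adatron update can be sketched in a few lines. This is the plain, bias-free textbook form, not the evolutionary variant the paper proposes; the RBF kernel, the learning rate `eta`, the epoch count and the toy data are all assumptions chosen for illustration.

```python
import math

def rbf_kernel(x, z, gamma=1.0):
    """Gaussian (RBF) kernel between two points."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, z)))

def kernel_adatron(X, y, kernel=rbf_kernel, eta=0.1, epochs=200):
    """Train multipliers alpha with the classic Kernel-Adatron rule."""
    n = len(X)
    K = [[kernel(X[i], X[j]) for j in range(n)] for i in range(n)]
    alpha = [0.0] * n
    for _ in range(epochs):
        for i in range(n):
            # Margin of sample i under the current multipliers.
            margin = y[i] * sum(alpha[j] * y[j] * K[i][j] for j in range(n))
            # Gradient-ascent step on the dual, clipped so alpha_i stays >= 0.
            alpha[i] = max(0.0, alpha[i] + eta * (1.0 - margin))
    return alpha

def predict(x, X, y, alpha, kernel=rbf_kernel):
    """Classify x by the sign of the kernel expansion (no bias term)."""
    score = sum(a * yi * kernel(xi, x) for a, yi, xi in zip(alpha, y, X))
    return 1 if score >= 0 else -1

# Toy two-cluster data set (illustrative only).
X = [(0, 0), (0, 1), (1, 0), (3, 3), (3, 4), (4, 3)]
y = [-1, -1, -1, 1, 1, 1]
alpha = kernel_adatron(X, y)
train_predictions = [predict(x, X, y, alpha) for x in X]
```

The inner loop costs O(n) per sample, so a full epoch is O(n²); that quadratic cost is the usual motivation for looking at alternative training schemes on large data sets.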
Structural changes of malt proteins during boiling.
Jin, Bei; Li, Lin; Liu, Guo-Qin; Li, Bing; Zhu, Yu-Kui; Liao, Liao-Ning
2009-03-09
Changes in the physicochemical properties and structure of proteins derived from two malt varieties (Baudin and Guangmai) during wort boiling were investigated by differential scanning calorimetry, SDS-PAGE, two-dimensional electrophoresis, gel filtration chromatography and circular dichroism spectroscopy. The results showed that both protein content and amino acid composition changed only slightly during boiling, and that boiling might cause a gradual unfolding of protein structures, as indicated by the decrease in surface hydrophobicity and free sulfhydryl content and enthalpy value, as well as reduced alpha-helix contents and markedly increased random coil contents. It was also found that major component of both worts was a boiling-resistant protein with a molecular mass of 40 kDa, and that according to the two-dimensional electrophoresis and SE-HPLC analyses, a small amount of soluble aggregates might be formed via hydrophobic interactions. It was thus concluded that changes of protein structure caused by boiling that might influence beer quality are largely independent of malt variety.
SA-Search: a web tool for protein structure mining based on a Structural Alphabet
Guyon, Frédéric; Camproux, Anne-Claude; Hochez, Joëlle; Tufféry, Pierre
2004-01-01
SA-Search is a web tool that can be used to mine for protein structures and extract structural similarities. It is based on a hidden Markov model derived Structural Alphabet (SA) that allows the compression of three-dimensional (3D) protein conformations into a one-dimensional (1D) representation using a limited number of prototype conformations. Using such a representation, classical methods developed for amino acid sequences can be employed. Currently, SA-Search permits the performance of fast 3D similarity searches such as the extraction of exact words using a suffix tree approach, and the search for fuzzy words viewed as a simple 1D sequence alignment problem. SA-Search is available at http://bioserv.rpbs.jussieu.fr/cgi-bin/SA-Search. PMID:15215446
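Once a 3D conformation has been compressed into a string of structural-alphabet letters, exact-word search becomes ordinary string matching. The sketch below illustrates the idea with a suffix array and binary search, which is a simpler stand-in for the suffix tree that SA-Search uses; the encoded string and its letters are invented, and the function names are hypothetical.

```python
def build_suffix_array(text):
    """Start positions of all suffixes of text, in lexicographic order."""
    return sorted(range(len(text)), key=lambda i: text[i:])

def find_exact_word(text, suffix_array, word):
    """Return sorted start positions of every exact occurrence of word."""
    k = len(word)
    lo, hi = 0, len(suffix_array)
    # Binary search for the first suffix whose k-letter prefix is >= word.
    while lo < hi:
        mid = (lo + hi) // 2
        start = suffix_array[mid]
        if text[start:start + k] < word:
            lo = mid + 1
        else:
            hi = mid
    hits = []
    # All matching suffixes are contiguous in the sorted order.
    while lo < len(suffix_array):
        start = suffix_array[lo]
        if text[start:start + k] != word:
            break
        hits.append(start)
        lo += 1
    return sorted(hits)

# Hypothetical structural-alphabet encoding of one protein chain.
sa_string = "AABCDABCDDABC"
index = build_suffix_array(sa_string)
abc_hits = find_exact_word(sa_string, index, "ABC")
dda_hits = find_exact_word(sa_string, index, "DDA")
```

Building the index this way is quadratic in the worst case and is meant only to show the lookup; a production tool would use a linear-time suffix tree or suffix array construction.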
SA-Search: a web tool for protein structure mining based on a Structural Alphabet.
Guyon, Frédéric; Camproux, Anne-Claude; Hochez, Joëlle; Tufféry, Pierre
2004-07-01
SA-Search is a web tool that can be used to mine for protein structures and extract structural similarities. It is based on a hidden Markov model derived Structural Alphabet (SA) that allows the compression of three-dimensional (3D) protein conformations into a one-dimensional (1D) representation using a limited number of prototype conformations. Using such a representation, classical methods developed for amino acid sequences can be employed. Currently, SA-Search permits the performance of fast 3D similarity searches such as the extraction of exact words using a suffix tree approach, and the search for fuzzy words viewed as a simple 1D sequence alignment problem. SA-Search is available at http://bioserv.rpbs.jussieu.fr/cgi-bin/SA-Search.
Neumann, Sindy; Hartmann, Holger; Martin-Galiano, Antonio J; Fuchs, Angelika; Frishman, Dmitrij
2012-03-01
Structural bioinformatics of membrane proteins is still in its infancy, and the picture of their fold space is only beginning to emerge. Because only a handful of three-dimensional structures are available, sequence comparison and structure prediction remain the main tools for investigating sequence-structure relationships in membrane protein families. Here we present a comprehensive analysis of the structural families corresponding to α-helical membrane proteins with at least three transmembrane helices. The new version of our CAMPS database (CAMPS 2.0) covers nearly 1300 eukaryotic, prokaryotic, and viral genomes. Using an advanced classification procedure, which is based on high-order hidden Markov models and considers both sequence similarity as well as the number of transmembrane helices and loop lengths, we identified 1353 structurally homogeneous clusters roughly corresponding to membrane protein folds. Only 53 clusters are associated with experimentally determined three-dimensional structures, and for these clusters CAMPS is in reasonable agreement with structure-based classification approaches such as SCOP and CATH. We therefore estimate that ∼1300 structures would need to be determined to provide a sufficient structural coverage of polytopic membrane proteins. CAMPS 2.0 is available at http://webclu.bio.wzw.tum.de/CAMPS2.0/. Copyright © 2011 Wiley Periodicals, Inc.
Predicting residue-wise contact orders in proteins by support vector regression.
Song, Jiangning; Burrage, Kevin
2006-10-03
The residue-wise contact order (RWCO) describes the sequence separations between the residues of interest and its contacting residues in a protein sequence. It is a new kind of one-dimensional protein structure that represents the extent of long-range contacts and is considered as a generalization of contact order. Together with secondary structure, accessible surface area, the B factor, and contact number, RWCO provides comprehensive and indispensable important information to reconstructing the protein three-dimensional structure from a set of one-dimensional structural properties. Accurately predicting RWCO values could have many important applications in protein three-dimensional structure prediction and protein folding rate prediction, and give deep insights into protein sequence-structure relationships. We developed a novel approach to predict residue-wise contact order values in proteins based on support vector regression (SVR), starting from primary amino acid sequences. We explored seven different sequence encoding schemes to examine their effects on the prediction performance, including local sequence in the form of PSI-BLAST profiles, local sequence plus amino acid composition, local sequence plus molecular weight, local sequence plus secondary structure predicted by PSIPRED, local sequence plus molecular weight and amino acid composition, local sequence plus molecular weight and predicted secondary structure, and local sequence plus molecular weight, amino acid composition and predicted secondary structure. When using local sequences with multiple sequence alignments in the form of PSI-BLAST profiles, we could predict the RWCO distribution with a Pearson correlation coefficient (CC) between the predicted and observed RWCO values of 0.55, and root mean square error (RMSE) of 0.82, based on a well-defined dataset with 680 protein sequences. 
Moreover, by incorporating global features such as molecular weight and amino acid composition we could further improve the prediction performance with the CC to 0.57 and an RMSE of 0.79. In addition, combining the predicted secondary structure by PSIPRED was found to significantly improve the prediction performance and could yield the best prediction accuracy with a CC of 0.60 and RMSE of 0.78, which provided at least comparable performance compared with the other existing methods. The SVR method shows a prediction performance competitive with or at least comparable to the previously developed linear regression-based methods for predicting RWCO values. In contrast to support vector classification (SVC), SVR is very good at estimating the raw value profiles of the samples. The successful application of the SVR approach in this study reinforces the fact that support vector regression is a powerful tool in extracting the protein sequence-structure relationship and in estimating the protein structural profiles from amino acid sequences.
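To make the quantities in this abstract concrete, the sketch below computes a residue-wise contact order from coordinates and the two evaluation metrics quoted (Pearson CC and RMSE). The exact definition is an assumption: it uses a hard 8 Å distance cutoff, ignores pairs closer than three residues in sequence, and normalizes by chain length, none of which is specified in the abstract. The hairpin coordinates are invented.

```python
import math

def residue_wise_contact_order(coords, cutoff=8.0, min_sep=3):
    """RWCO_i = (1/L) * sum of |i - j| over residues j in contact with i.

    Assumed definition: contact means distance < cutoff (in angstroms),
    and only pairs with sequence separation >= min_sep are counted.
    """
    n = len(coords)
    rwco = []
    for i in range(n):
        total = 0
        for j in range(n):
            sep = abs(i - j)
            if sep >= min_sep and math.dist(coords[i], coords[j]) < cutoff:
                total += sep
        rwco.append(total / n)
    return rwco

def pearson_cc(pred, obs):
    """Pearson correlation coefficient between predictions and observations."""
    n = len(pred)
    mp, mo = sum(pred) / n, sum(obs) / n
    cov = sum((p - mp) * (o - mo) for p, o in zip(pred, obs))
    sp = math.sqrt(sum((p - mp) ** 2 for p in pred))
    so = math.sqrt(sum((o - mo) ** 2 for o in obs))
    return cov / (sp * so)

def rmse(pred, obs):
    """Root mean square error between predictions and observations."""
    return math.sqrt(sum((p - o) ** 2 for p, o in zip(pred, obs)) / len(pred))

# Toy six-residue hairpin: two antiparallel strands 5 angstroms apart.
hairpin = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0),
           (7.6, 5.0, 0.0), (3.8, 5.0, 0.0), (0.0, 5.0, 0.0)]
rwco = residue_wise_contact_order(hairpin)
```

Residues at the open ends of the hairpin pair with partners far away in sequence and so receive the largest values, while the two turn residues have no qualifying contacts.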
Life in the fast lane for protein crystallization and X-ray crystallography
NASA Technical Reports Server (NTRS)
Pusey, Marc L.; Liu, Zhi-Jie; Tempel, Wolfram; Praissman, Jeremy; Lin, Dawei; Wang, Bi-Cheng; Gavira, Jose A.; Ng, Joseph D.
2005-01-01
The common goal for structural genomic centers and consortiums is to decipher as quickly as possible the three-dimensional structures for a multitude of recombinant proteins derived from known genomic sequences. Since X-ray crystallography is the foremost method to acquire atomic resolution for macromolecules, the limiting step is obtaining protein crystals that can be useful for structure determination. High-throughput methods have been developed in recent years to clone, express, purify, crystallize and determine the three-dimensional structure of a protein gene product rapidly using automated devices, commercialized kits and consolidated protocols. However, the average number of protein structures obtained for most structural genomic groups has been very low compared to the total number of proteins purified. As more entire genomic sequences are obtained for different organisms from the three kingdoms of life, only the proteins that can be crystallized and whose structures can be obtained easily are studied. Consequently, an astonishing number of genomic proteins remain unexamined. In the era of high-throughput processes, traditional methods in molecular biology, protein chemistry and crystallization are eclipsed by automation and pipeline practices. The necessity for high-rate production of protein crystals and structures has prevented the usage of more intellectual strategies and creative approaches in experimental executions. Fundamental principles and personal experiences in protein chemistry and crystallization are minimally exploited only to obtain "low-hanging fruit" protein structures. We review the practical aspects of today's high-throughput manipulations and discuss the challenges in fast-paced protein crystallization and tools for crystallography. Structural genomic pipelines can be improved with information gained from low-throughput tactics that may help us reach the higher-bearing fruits.
Examples of recent developments in this area are reported from the efforts of the Southeast Collaboratory for Structural Genomics (SECSG).
Life in the Fast Lane for Protein Crystallization and X-Ray Crystallography
NASA Technical Reports Server (NTRS)
Pusey, Marc L.; Liu, Zhi-Jie; Tempel, Wolfram; Praissman, Jeremy; Lin, Dawei; Wang, Bi-Cheng; Gavira, Jose A.; Ng, Joseph D.
2004-01-01
The common goal for structural genomic centers and consortiums is to decipher as quickly as possible the three-dimensional structures for a multitude of recombinant proteins derived from known genomic sequences. Since X-ray crystallography is the foremost method to acquire atomic resolution for macromolecules, the limiting step is obtaining protein crystals that can be useful for structure determination. High-throughput methods have been developed in recent years to clone, express, purify, crystallize and determine the three-dimensional structure of a protein gene product rapidly using automated devices, commercialized kits and consolidated protocols. However, the average number of protein structures obtained for most structural genomic groups has been very low compared to the total number of proteins purified. As more entire genomic sequences are obtained for different organisms from the three kingdoms of life, only the proteins that can be crystallized and whose structures can be obtained easily are studied. Consequently, an astonishing number of genomic proteins remain unexamined. In the era of high-throughput processes, traditional methods in molecular biology, protein chemistry and crystallization are eclipsed by automation and pipeline practices. The necessity for high-rate production of protein crystals and structures has prevented the usage of more intellectual strategies and creative approaches in experimental executions. Fundamental principles and personal experiences in protein chemistry and crystallization are minimally exploited only to obtain "low-hanging fruit" protein structures. We review the practical aspects of today's high-throughput manipulations and discuss the challenges in fast-paced protein crystallization and tools for crystallography. Structural genomic pipelines can be improved with information gained from low-throughput tactics that may help us reach the higher-bearing fruits.
Examples of recent developments in this area are reported from the efforts of the Southeast Collaboratory for Structural Genomics (SECSG).
Improved method for predicting protein fold patterns with ensemble classifiers.
Chen, W; Liu, X; Huang, Y; Jiang, Y; Zou, Q; Lin, C
2012-01-27
Protein folding is recognized as a critical problem in the field of biophysics in the 21st century. Predicting protein-folding patterns is challenging due to the complex structure of proteins. In an attempt to solve this problem, we employed ensemble classifiers to improve prediction accuracy. In our experiments, 188-dimensional features were extracted based on the composition and physical-chemical property of proteins and 20-dimensional features were selected using a coupled position-specific scoring matrix. Compared with traditional prediction methods, these methods were superior in terms of prediction accuracy. The 188-dimensional feature-based method achieved 71.2% accuracy in five cross-validations. The accuracy rose to 77% when we used a 20-dimensional feature vector. These methods were used on recent data, with 54.2% accuracy. Source codes and dataset, together with web server and software tools for prediction, are available at: http://datamining.xmu.edu.cn/main/~cwc/ProteinPredict.html.
Poppe, Leszek; Jordan, John B; Rogers, Gary; Schnier, Paul D
2015-06-02
An important aspect in the analytical characterization of protein therapeutics is the comprehensive characterization of higher order structure (HOS). Nuclear magnetic resonance (NMR) is arguably the most sensitive method for fingerprinting HOS of a protein in solution. Traditionally, ¹H-¹⁵N or ¹H-¹³C correlation spectra are used as a "structural fingerprint" of HOS. Here, we demonstrate that protein fingerprint by line shape enhancement (PROFILE), a 1D ¹H NMR spectroscopy fingerprinting approach, is superior to traditional two-dimensional methods using monoclonal antibody samples and a heavily glycosylated protein therapeutic (Epoetin Alfa). PROFILE generates a high resolution structural fingerprint of a therapeutic protein in a fraction of the time required for a 2D NMR experiment. The cross-correlation analysis of PROFILE spectra allows one to distinguish contributions from HOS vs protein heterogeneity, which is difficult to accomplish by 2D NMR. We demonstrate that the major analytical limitation of two-dimensional methods is poor selectivity, which renders these approaches problematic for the purpose of fingerprinting large biological macromolecules.
Mathematical methods for protein science
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hart, W.; Istrail, S.; Atkins, J.
1997-12-31
Understanding the structure and function of proteins is a fundamental endeavor in molecular biology. Currently, over 100,000 protein sequences have been determined by experimental methods. The three dimensional structure of the protein determines its function, but there are currently less than 4,000 structures known to atomic resolution. Accordingly, techniques to predict protein structure from sequence have an important role in aiding the understanding of the Genome and the effects of mutations in genetic disease. The authors describe current efforts at Sandia to better understand the structure of proteins through rigorous mathematical analyses of simple lattice models. The efforts have focused on two aspects of protein science: mathematical structure prediction, and inverse protein folding.
NASA Astrophysics Data System (ADS)
Ratamero, Erick Martins; Bellini, Dom; Dowson, Christopher G.; Römer, Rudolf A.
2018-06-01
The ability to precisely visualize the atomic geometry of the interactions between a drug and its protein target in structural models is critical in predicting the correct modifications in previously identified inhibitors to create more effective next generation drugs. It is currently common practice among medicinal chemists while attempting the above to access the information contained in three-dimensional structures by using two-dimensional projections, which can preclude disclosure of useful features. A more accessible and intuitive visualization of the three-dimensional configuration of the atomic geometry in the models can be achieved through the implementation of immersive virtual reality (VR). While bespoke commercial VR suites are available, in this work, we present a freely available software pipeline for visualising protein structures through VR. New consumer hardware, such as the HTC Vive and the Oculus Rift utilized in this study, are available at reasonable prices. As an instructive example, we have combined VR visualization with fast algorithms for simulating intramolecular motions of protein flexibility, in an effort to further improve structure-led drug design by exposing molecular interactions that might be hidden in the less informative static models. This is a paradigmatic test case scenario for many similar applications in computer-aided molecular studies and design.
Ratamero, Erick Martins; Bellini, Dom; Dowson, Christopher G; Römer, Rudolf A
2018-06-07
The ability to precisely visualize the atomic geometry of the interactions between a drug and its protein target in structural models is critical in predicting the correct modifications in previously identified inhibitors to create more effective next generation drugs. It is currently common practice among medicinal chemists while attempting the above to access the information contained in three-dimensional structures by using two-dimensional projections, which can preclude disclosure of useful features. A more accessible and intuitive visualization of the three-dimensional configuration of the atomic geometry in the models can be achieved through the implementation of immersive virtual reality (VR). While bespoke commercial VR suites are available, in this work, we present a freely available software pipeline for visualising protein structures through VR. New consumer hardware, such as the HTC VIVE and the OCULUS RIFT utilized in this study, are available at reasonable prices. As an instructive example, we have combined VR visualization with fast algorithms for simulating intramolecular motions of protein flexibility, in an effort to further improve structure-led drug design by exposing molecular interactions that might be hidden in the less informative static models. This is a paradigmatic test case scenario for many similar applications in computer-aided molecular studies and design.
Hsing, Michael; Cherkasov, Artem
2008-06-25
Insertions and deletions (indels) represent a common type of sequence variations, which are less studied and pose many important biological questions. Recent research has shown that the presence of sizable indels in protein sequences may be indicative of protein essentiality and their role in protein interaction networks. Examples of utilization of indels for structure-based drug design have also been recently demonstrated. Nonetheless many structural and functional characteristics of indels remain less researched or unknown. We have created a web-based resource, Indel PDB, representing a structural database of insertions/deletions identified from the sequence alignments of highly similar proteins found in the Protein Data Bank (PDB). Indel PDB utilized large amounts of available structural information to characterize 1-, 2- and 3-dimensional features of indel sites. Indel PDB contains 117,266 non-redundant indel sites extracted from 11,294 indel-containing proteins. Unlike loop databases, Indel PDB features more indel sequences with secondary structures including alpha-helices and beta-sheets in addition to loops. The insertion fragments have been characterized by their sequences, lengths, locations, secondary structure composition, solvent accessibility, protein domain association and three dimensional structures. By utilizing the data available in Indel PDB, we have studied and presented here several sequence and structural features of indels. We anticipate that Indel PDB will not only enable future functional studies of indels, but will also assist protein modeling efforts and identification of indel-directed drug binding sites.
Martínez-Castilla, León P.; Rodríguez-Sotres, Rogelio
2010-01-01
Background Despite the remarkable progress of bioinformatics, how the primary structure of a protein leads to a three-dimensional fold, and in turn determines its function remains an elusive question. Alignments of sequences with known function can be used to identify proteins with the same or similar function with high success. However, identification of function-related and structure-related amino acid positions is only possible after a detailed study of every protein. Folding pattern diversity seems to be much narrower than sequence diversity, and the amino acid sequences of natural proteins have evolved under a selective pressure comprising structural and functional requirements acting in parallel. Principal Findings The approach described in this work begins by generating a large number of amino acid sequences using ROSETTA [Dantas G et al. (2003) J Mol Biol 332:449–460], a program with notable robustness in the assignment of amino acids to a known three-dimensional structure. The resulting sequence-sets showed no conservation of amino acids at active sites, or protein-protein interfaces. Hidden Markov models built from the resulting sequence sets were used to search sequence databases. Surprisingly, the models retrieved from the database sequences belonged to proteins with the same or a very similar function. Given an appropriate cutoff, the rate of false positives was zero. According to our results, this protocol, here referred to as Rd.HMM, detects fine structural details on the folding patterns, that seem to be tightly linked to the fitness of a structural framework for a specific biological function. Conclusion Because the sequence of the native protein used to create the Rd.HMM model was always amongst the top hits, the procedure is a reliable tool to score, very accurately, the quality and appropriateness of computer-modeled 3D-structures, without the need for spectroscopy data. 
However, Rd.HMM is very sensitive to the conformational features of the models' backbone. PMID:20830209
Fitting Multimeric Protein Complexes into Electron Microscopy Maps Using 3D Zernike Descriptors
Esquivel-Rodríguez, Juan; Kihara, Daisuke
2012-01-01
A novel computational method for fitting high-resolution structures of multiple proteins into a cryoelectron microscopy map is presented. The method named EMLZerD generates a pool of candidate multiple protein docking conformations of component proteins, which are later compared with a provided electron microscopy (EM) density map to select the ones that fit well into the EM map. The comparison of docking conformations and the EM map is performed using the 3D Zernike descriptor (3DZD), a mathematical series expansion of three-dimensional functions. The 3DZD provides a unified representation of the surface shape of multimeric protein complex models and EM maps, which allows a convenient, fast quantitative comparison of the three dimensional structural data. Out of 19 multimeric complexes tested, near native complex structures with a root mean square deviation of less than 2.5 Å were obtained for 14 cases while medium range resolution structures with correct topology were computed for the additional 5 cases. PMID:22417139
Fitting multimeric protein complexes into electron microscopy maps using 3D Zernike descriptors.
Esquivel-Rodríguez, Juan; Kihara, Daisuke
2012-06-14
A novel computational method for fitting high-resolution structures of multiple proteins into a cryoelectron microscopy map is presented. The method named EMLZerD generates a pool of candidate multiple protein docking conformations of component proteins, which are later compared with a provided electron microscopy (EM) density map to select the ones that fit well into the EM map. The comparison of docking conformations and the EM map is performed using the 3D Zernike descriptor (3DZD), a mathematical series expansion of three-dimensional functions. The 3DZD provides a unified representation of the surface shape of multimeric protein complex models and EM maps, which allows a convenient, fast quantitative comparison of the three-dimensional structural data. Out of 19 multimeric complexes tested, near native complex structures with a root-mean-square deviation of less than 2.5 Å were obtained for 14 cases while medium range resolution structures with correct topology were computed for the additional 5 cases.
A Protein in the Palm of Your Hand through Augmented Reality
ERIC Educational Resources Information Center
Berry, Colin; Board, Jason
2014-01-01
Understanding of proteins and other biological macromolecules must be based on an appreciation of their 3-dimensional shape and the fine details of their structure. Conveying these details in a clear and stimulating fashion can present challenges using conventional approaches and 2-dimensional monitors and projectors. Here we describe a method for…
PSPP: A Protein Structure Prediction Pipeline for Computing Clusters
2009-07-01
Evanseck JD, et al. (1998) All-atom empirical potential for molecular modeling and dynamics studies of proteins. Journal of Physical Chemistry B 102...dimensional (3-D) protein structures are critical for the understanding of molecular mechanisms of living systems. Traditionally, X-ray crystallography...disordered proteins are often responsible for molecular recognition, molecular assembly, protein modification, and entropic chain activities in organisms [26
NASA Technical Reports Server (NTRS)
2000-01-01
Dr. Marc Pusey (seated) and Dr. Craig Kundrot use computers to analyze x-ray maps and generate three-dimensional models of protein structures. With this information, scientists at Marshall Space Flight Center can learn how proteins are made and how they work. The computer screen depicts a protein structure as a ball-and-stick model. Other models depict the actual volume occupied by the atoms, or the ribbon-like structures that are crucial to a protein's function.
Kemege, Kyle E.; Hickey, John M.; Lovell, Scott; Battaile, Kevin P.; Zhang, Yang; Hefty, P. Scott
2011-01-01
Chlamydia trachomatis is a medically important pathogen that encodes a relatively high percentage of proteins with unknown function. The three-dimensional structure of a protein can be very informative regarding the protein's functional characteristics; however, determining protein structures experimentally can be very challenging. Computational methods that model protein structures with sufficient accuracy to facilitate functional studies have had notable successes. To evaluate the accuracy and potential impact of computational protein structure modeling of hypothetical proteins encoded by Chlamydia, a successful computational method termed I-TASSER was utilized to model the three-dimensional structure of a hypothetical protein encoded by open reading frame (ORF) CT296. CT296 has been reported to exhibit functional properties of a divalent cation transcription repressor (DcrA), with similarity to the Escherichia coli iron-responsive transcriptional repressor, Fur. Unexpectedly, the I-TASSER model of CT296 exhibited no structural similarity to any DNA-interacting proteins or motifs. To validate the I-TASSER-generated model, the structure of CT296 was solved experimentally using X-ray crystallography. Impressively, the ab initio I-TASSER-generated model closely matched (2.72-Å Cα root mean square deviation [RMSD]) the high-resolution (1.8-Å) crystal structure of CT296. Modeled and experimentally determined structures of CT296 share structural characteristics of non-heme Fe(II) 2-oxoglutarate-dependent enzymes, although key enzymatic residues are not conserved, suggesting a unique biochemical process is likely associated with CT296 function. Additionally, functional analyses did not support prior reports that CT296 has properties shared with divalent cation repressors such as Fur. PMID:21965559
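The Cα root mean square deviation (RMSD) quoted above is a standard measure of how closely a model matches a reference structure. The following is a minimal sketch of that calculation, assuming the two coordinate lists contain matched Cα atoms in the same order and have already been superimposed; the superposition step itself is not shown, and the function name `rmsd` is illustrative rather than taken from I-TASSER or any other tool.

```python
import math

def rmsd(coords_a, coords_b):
    """Root mean square deviation between two matched, pre-superimposed
    coordinate lists (each a sequence of (x, y, z) tuples, e.g. in Å)."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    # Sum of squared distances between corresponding atoms
    total = sum(math.dist(a, b) ** 2 for a, b in zip(coords_a, coords_b))
    return math.sqrt(total / len(coords_a))

# Hypothetical coordinates: every atom of the model is displaced by 1 Å along z
model = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
reference = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
print(rmsd(model, reference))
```

Because no optimal superposition is performed here, the value depends on how the two structures were placed in space beforehand.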
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kemege, Kyle E.; Hickey, John M.; Lovell, Scott
2012-02-13
Chlamydia trachomatis is a medically important pathogen that encodes a relatively high percentage of proteins with unknown function. The three-dimensional structure of a protein can be very informative regarding the protein's functional characteristics; however, determining protein structures experimentally can be very challenging. Computational methods that model protein structures with sufficient accuracy to facilitate functional studies have had notable successes. To evaluate the accuracy and potential impact of computational protein structure modeling of hypothetical proteins encoded by Chlamydia, a successful computational method termed I-TASSER was utilized to model the three-dimensional structure of a hypothetical protein encoded by open reading frame (ORF) CT296. CT296 has been reported to exhibit functional properties of a divalent cation transcription repressor (DcrA), with similarity to the Escherichia coli iron-responsive transcriptional repressor, Fur. Unexpectedly, the I-TASSER model of CT296 exhibited no structural similarity to any DNA-interacting proteins or motifs. To validate the I-TASSER-generated model, the structure of CT296 was solved experimentally using X-ray crystallography. Impressively, the ab initio I-TASSER-generated model closely matched (2.72-Å Cα root mean square deviation [RMSD]) the high-resolution (1.8-Å) crystal structure of CT296. Modeled and experimentally determined structures of CT296 share structural characteristics of non-heme Fe(II) 2-oxoglutarate-dependent enzymes, although key enzymatic residues are not conserved, suggesting a unique biochemical process is likely associated with CT296 function. Additionally, functional analyses did not support prior reports that CT296 has properties shared with divalent cation repressors such as Fur.
The Ramachandran Number: An Order Parameter for Protein Geometry
Mannige, Ranjan V.; Kundu, Joyjit; Whitelam, Stephen; ...
2016-08-04
Three-dimensional protein structures usually contain regions of local order, called secondary structure, such as α-helices and β-sheets. Secondary structure is characterized by the local rotational state of the protein backbone, quantified by two dihedral angles called φ and ψ. Particular types of secondary structure can generally be described by a single (diffuse) location on a two-dimensional plot drawn in the space of the angles φ and ψ, called a Ramachandran plot. By contrast, a recently-discovered nanomaterial made from peptoids, structural isomers of peptides, displays a secondary-structure motif corresponding to two regions on the Ramachandran plot [Mannige et al., Nature 526, 415 (2015)]. In order to describe such 'higher-order' secondary structure in a compact way we introduce here a means of describing regions on the Ramachandran plot in terms of a single Ramachandran number, R, which is a structurally meaningful combination of φ and ψ. We show that the potential applications of R are numerous: it can be used to describe the geometric content of protein structures, and can be used to draw diagrams that reveal, at a glance, the frequency of occurrence of regular secondary structures and disordered regions in large protein datasets. We propose that R might be used as an order parameter for protein geometry for a wide range of applications.
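The two backbone dihedral angles that define a point on the Ramachandran plot are each computed from four consecutive backbone atoms. The sketch below shows a generic dihedral-angle calculation using the common atan2 formulation; it assumes atom positions are given as (x, y, z) tuples, the helper names are illustrative, and it does not attempt to compute the Ramachandran number R, whose definition is not given in this abstract.

```python
import math

def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def dihedral_degrees(p0, p1, p2, p3):
    """Signed dihedral angle (degrees, in (-180, 180]) defined by four points."""
    b1, b2, b3 = _sub(p1, p0), _sub(p2, p1), _sub(p3, p2)
    n1 = _cross(b1, b2)  # normal of the plane through p0, p1, p2
    n2 = _cross(b2, b3)  # normal of the plane through p1, p2, p3
    y = math.sqrt(_dot(b2, b2)) * _dot(b1, n2)
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))

# Hypothetical points: a planar cis arrangement and a planar trans arrangement
print(dihedral_degrees((1, 0, 0), (0, 0, 0), (0, 0, 1), (1, 0, 1)))   # cis
print(dihedral_degrees((1, 0, 0), (0, 0, 0), (0, 0, 1), (-1, 0, 1)))  # trans
```

For a protein, one angle would use the atoms C(i-1), N(i), Cα(i), C(i) and the other N(i), Cα(i), C(i), N(i+1); extracting those atoms from a structure file is outside the scope of this sketch.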
The Multiple-Minima Problem in Protein Folding
NASA Astrophysics Data System (ADS)
Scheraga, Harold A.
1991-10-01
The conformational energy surface of a polypeptide or protein has many local minima, and conventional energy minimization procedures reach only a local minimum (near the starting point of the optimization algorithm) instead of the global minimum (the multiple-minima problem). Several procedures have been developed to surmount this problem, the most promising of which are: (a) build up procedure, (b) optimization of electrostatics, (c) Monte Carlo-plus-energy minimization, (d) electrostatically-driven Monte Carlo, (e) inclusion of distance restraints, (f) adaptive importance-sampling Monte Carlo, (g) relaxation of dimensionality, (h) pattern-recognition, and (i) diffusion equation method. These procedures have been applied to a variety of polypeptide structural problems, and the results of such computations are presented. These include the computation of the structures of open-chain and cyclic peptides, fibrous proteins and globular proteins. Present efforts are being devoted to scaling up these procedures from small polypeptides to proteins, to try to compute the three-dimensional structure of a protein from its amino acid sequence.
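The idea behind procedure (c), Monte Carlo-plus-energy minimization, can be illustrated on a toy problem: each random perturbation is followed by a local minimization, and the Metropolis criterion is applied to the minimized energies. The sketch below is an illustration only, not the procedure used in the work summarized above; the one-dimensional `energy` function, the step sizes, and the temperature are all hypothetical choices made for the example.

```python
import math
import random

def energy(x):
    # Toy 1-D surface with several local minima (hypothetical, not a force field)
    return 0.1 * x * x + math.sin(3.0 * x)

def local_minimize(x, step=0.01, iters=2000):
    # Crude gradient descent using a central-difference derivative
    for _ in range(iters):
        grad = (energy(x + 1e-6) - energy(x - 1e-6)) / 2e-6
        x -= step * grad
    return x

def mc_minimize(x0, n_steps=200, kick=1.5, temperature=0.5, seed=0):
    """Monte Carlo with a local minimization after every random perturbation."""
    rng = random.Random(seed)
    x = local_minimize(x0)
    best = x
    for _ in range(n_steps):
        # Perturb the current minimum, then relax to the nearest local minimum
        trial = local_minimize(x + rng.uniform(-kick, kick))
        delta = energy(trial) - energy(x)
        # Metropolis acceptance applied to the minimized energies
        if delta <= 0 or rng.random() < math.exp(-delta / temperature):
            x = trial
            if energy(x) < energy(best):
                best = x
    return best

start = 4.0
print("plain minimization:", energy(local_minimize(start)))
print("MC + minimization: ", energy(mc_minimize(start)))
```

Plain minimization from the same starting point stays in the nearest basin, whereas the combined search can hop between basins; the lowest energy it reports is never higher than the plain result because the search starts from that same local minimum.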
Rational Protein Engineering Guided by Deep Mutational Scanning
Shin, HyeonSeok; Cho, Byung-Kwan
2015-01-01
Sequence–function relationship in a protein is commonly determined by the three-dimensional protein structure followed by various biochemical experiments. However, with the explosive increase in the number of genome sequences, facilitated by recent advances in sequencing technology, the gap between protein sequences available and three-dimensional structures is rapidly widening. A recently developed method termed deep mutational scanning explores the functional phenotype of thousands of mutants via massive sequencing. Coupled with a highly efficient screening system, this approach assesses the phenotypic changes made by the substitution of each amino acid sequence that constitutes a protein. Such an informational resource provides the functional role of each amino acid sequence, thereby providing sufficient rationale for selecting target residues for protein engineering. Here, we discuss the current applications of deep mutational scanning and consider experimental design. PMID:26404267
De Jaco, Antonella; Comoletti, Davide; Dubi, Noga; Camp, Shelley; Taylor, Palmer
2016-01-01
The α/β hydrolase fold family is perhaps the largest group of proteins presenting significant structural homology with divergent functions, ranging from catalytic hydrolysis to heterophilic cell adhesive interactions to chaperones in hormone production. All the proteins of the family share a common three-dimensional core structure containing the α/β-hydrolase fold domain that is crucial for proper protein function. Several mutations associated with congenital diseases or disorders have been reported in conserved residues within the α/β-hydrolase fold domain of cholinesterase-like proteins, neuroligins, butyrylcholinesterase and thyroglobulin. These mutations are known to disrupt the architecture of the common structural domain either globally or locally. Characterization of the natural mutations affecting the α/β-hydrolase fold domain in these proteins has shown that they mainly impair processing and trafficking along the secretory pathway causing retention of the mutant protein in the endoplasmic reticulum. Studying the processing of α/β-hydrolase fold mutant proteins should uncover new functions for this domain, that in some cases require structural integrity for both export of the protein from the ER and for facilitating subunit dimerization. A comparative study of homologous mutations in proteins that are closely related family members, along with the definition of new three-dimensional crystal structures, will identify critical residues for the assembly of the α/β-hydrolase fold. PMID:21933121
Three dimensional electron microscopy and in silico tools for macromolecular structure determination
Borkotoky, Subhomoi; Meena, Chetan Kumar; Khan, Mohammad Wahab; Murali, Ayaluru
2013-01-01
Recently, structural biology has gained a major tool, electron microscopy, for solving the structures of macromolecules in addition to the conventional techniques, X-ray crystallography and nuclear magnetic resonance (NMR). Three-dimensional transmission electron microscopy (3DTEM) is one of the most sophisticated techniques for structure determination of molecular machines. Known to give three-dimensional structures in their native form with practically no upper limit on the size of the macromolecule, this tool does not require crystallization of the protein. By combining the 3DTEM data with in silico tools, one can obtain a better-refined structure of a desired complex. In this review we discuss recent advancements in three-dimensional electron microscopy and the tools associated with it. PMID:27092033
Villanueva, Josep; Villegas, Virtudes; Querol, Enrique; Avilés, Francesc X; Serrano, Luis
2002-09-01
In the post-genomic era, several projects focused on the massive experimental resolution of the three-dimensional structures of all the proteins of different organisms have been initiated. Simultaneously, significant progress has been made in the ab initio prediction of protein three-dimensional structure. One of the keys to the success of such a prediction is the use of local information (i.e. secondary structure). Here we describe a new limited proteolysis methodology, based on the use of unspecific exoproteases coupled with matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS), to map quickly secondary structure elements of a protein from both ends, the N- and C-termini. We show that the proteolytic patterns (mass spectra series) obtained can be interpreted in the light of the conformation and local stability of the analyzed proteins, a direct correlation being observed between the predicted and the experimentally derived protein secondary structure. Further, this methodology can be easily applied to check rapidly the folding state of a protein and characterize mutational effects on protein conformation and stability. Moreover, given global stability information, this methodology allows one to locate the protein regions of increased or decreased conformational stability. All of this can be done with a small fraction of the amount of protein required by most of the other methods for conformational analysis. Thus limited exoproteolysis, together with MALDI-TOF MS, can be a useful tool to achieve quickly the elucidation of protein structure and stability. Copyright 2002 John Wiley & Sons, Ltd.
Hexadecameric structure of an invertebrate gap junction channel.
Oshima, Atsunori; Matsuzawa, Tomohiro; Murata, Kazuyoshi; Tani, Kazutoshi; Fujiyoshi, Yoshinori
2016-03-27
Innexins are invertebrate-specific gap junction proteins with four transmembrane helices. These proteins oligomerize to constitute intercellular channels that allow for the passage of small signaling molecules associated with neural and muscular electrical activity. In contrast to the large number of structural and functional studies of connexin gap junction channels, few structural studies of recombinant innexin channels are reported. Here we show the three-dimensional structure of two-dimensionally crystallized Caenorhabditis elegans innexin-6 (INX-6) gap junction channels. The N-terminal deleted INX-6 proteins are crystallized in lipid bilayers. The three-dimensional reconstruction determined by cryo-electron crystallography reveals that a single INX-6 gap junction channel comprises 16 subunits, a hexadecamer, in contrast to chordate connexin channels, which comprise 12 subunits. The channel pore diameters at the cytoplasmic entrance and extracellular gap region are larger than those of connexin26. Two bulb densities are observed in each hemichannel, one in the pore and the other at the cytoplasmic side of the hemichannel in the channel pore pathway. These findings imply a structural diversity of gap junction channels among multicellular organisms. Copyright © 2016 The Authors. Published by Elsevier Ltd. All rights reserved.
Protein structure-structure alignment with discrete Fréchet distance.
Jiang, Minghui; Xu, Ying; Zhu, Binhai
2008-02-01
Matching two geometric objects in two-dimensional (2D) and three-dimensional (3D) spaces is a central problem in computer vision, pattern recognition, and protein structure prediction. In particular, the problem of aligning two polygonal chains under translation and rotation to minimize their distance has been studied using various distance measures. It is well known that the Hausdorff distance is useful for matching two point sets, and that the Fréchet distance is a superior measure for matching two polygonal chains. The discrete Fréchet distance closely approximates the (continuous) Fréchet distance, and is a natural measure for the geometric similarity of the folded 3D structures of biomolecules such as proteins. In this paper, we present new algorithms for matching two polygonal chains in two dimensions to minimize their discrete Fréchet distance under translation and rotation, and an effective heuristic for matching two polygonal chains in three dimensions. We also describe our empirical results on the application of the discrete Fréchet distance to protein structure-structure alignment.
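For two chains in fixed positions, the discrete Fréchet distance mentioned above can be computed with a standard dynamic program over pairs of vertices. The sketch below shows only that fixed-position calculation, assuming each chain is a list of coordinate tuples (for example Cα positions); it does not implement the optimization over translation and rotation that the paper addresses, and the function name is illustrative.

```python
import math

def discrete_frechet(p, q):
    """Discrete Fréchet distance between two polygonal chains p and q,
    each given as a non-empty list of equal-dimension coordinate tuples."""
    n, m = len(p), len(q)
    if n == 0 or m == 0:
        raise ValueError("both chains must contain at least one point")
    # ca[i][j] holds the coupling distance for the prefixes p[:i+1] and q[:j+1]
    ca = [[0.0] * m for _ in range(n)]
    for i in range(n):
        for j in range(m):
            d = math.dist(p[i], q[j])
            if i == 0 and j == 0:
                ca[i][j] = d
            elif i == 0:
                ca[i][j] = max(ca[0][j - 1], d)
            elif j == 0:
                ca[i][j] = max(ca[i - 1][0], d)
            else:
                # Advance along p, along q, or along both; keep the best option
                ca[i][j] = max(min(ca[i - 1][j], ca[i - 1][j - 1], ca[i][j - 1]), d)
    return ca[n - 1][m - 1]

# Hypothetical chains: two parallel straight segments one unit apart
chain_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
chain_b = [(0.0, 1.0, 0.0), (1.0, 1.0, 0.0), (2.0, 1.0, 0.0)]
print(discrete_frechet(chain_a, chain_b))
```

The table has one entry per pair of vertices, so time and memory grow with the product of the two chain lengths.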
Classification of proteins: available structural space for molecular modeling.
Andreeva, Antonina
2012-01-01
The wealth of available protein structural data provides unprecedented opportunity to study and better understand the underlying principles of protein folding and protein structure evolution. A key to achieving this lies in the ability to analyse these data and to organize them in a coherent classification scheme. Over the past years several protein classifications have been developed that aim to group proteins based on their structural relationships. Some of these classification schemes explore the concept of structural neighbourhood (structural continuum), whereas others utilize the notion of protein evolution and thus provide a discrete rather than continuum view of protein structure space. This chapter presents a strategy for classification of proteins with known three-dimensional structure. Steps in the classification process along with basic definitions are introduced. Examples illustrating some fundamental concepts of protein folding and evolution with a special focus on the exceptions to them are presented.
ERIC Educational Resources Information Center
Ray, Gigi B.; Cook, J. Whitney
2005-01-01
A biochemical molecular modeling project on heme proteins suitable for an introductory Biochemistry I class has been designed with a 2-fold objective: i) to reinforce the correlation between protein three-dimensional structure and function through a discovery oriented project, and ii) to introduce students to the fields of bioinorganic and…
2000-04-19
Dr. Marc Pusey (seated) and Dr. Craig Kundrot use computers to analyze x-ray maps and generate three-dimensional models of protein structures. With this information, scientists at Marshall Space Flight Center can learn how proteins are made and how they work. The computer screen depicts a protein structure as a ball-and-stick model. Other models depict the actual volume occupied by the atoms, or the ribbon-like structures that are crucial to a protein's function.
Protein conformational disorder and enzyme catalysis.
Schulenburg, Cindy; Hilvert, Donald
2013-01-01
Though lacking a well-defined three-dimensional structure, intrinsically unstructured proteins are ubiquitous in nature. These molecules play crucial roles in many cellular processes, especially signaling and regulation. Surprisingly, even enzyme catalysis can tolerate substantial disorder. This observation contravenes conventional wisdom but is relevant to an understanding of how protein dynamics modulates enzyme function. This chapter reviews properties and characteristics of disordered proteins, emphasizing examples of enzymes that lack defined structures, and considers implications of structural disorder for catalytic efficiency and evolution.
Advances in structural and functional analysis of membrane proteins by electron crystallography
Wisedchaisri, Goragot; Reichow, Steve L.; Gonen, Tamir
2011-01-01
Summary Electron crystallography is a powerful technique for the study of membrane protein structure and function in the lipid environment. When well-ordered two-dimensional crystals are obtained the structure of both protein and lipid can be determined and lipid-protein interactions analyzed. Protons and ionic charges can be visualized by electron crystallography and the protein of interest can be captured for structural analysis in a variety of physiologically distinct states. This review highlights the strengths of electron crystallography and the momentum that is building up in automation and the development of high throughput tools and methods for structural and functional analysis of membrane proteins by electron crystallography. PMID:22000511
Advances in structural and functional analysis of membrane proteins by electron crystallography.
Wisedchaisri, Goragot; Reichow, Steve L; Gonen, Tamir
2011-10-12
Electron crystallography is a powerful technique for the study of membrane protein structure and function in the lipid environment. When well-ordered two-dimensional crystals are obtained the structure of both protein and lipid can be determined and lipid-protein interactions analyzed. Protons and ionic charges can be visualized by electron crystallography and the protein of interest can be captured for structural analysis in a variety of physiologically distinct states. This review highlights the strengths of electron crystallography and the momentum that is building up in automation and the development of high throughput tools and methods for structural and functional analysis of membrane proteins by electron crystallography. Copyright © 2011 Elsevier Ltd. All rights reserved.
Grégoire, C; Marco, S; Thimonier, J; Duplan, L; Laurine, E; Chauvin, J P; Michel, B; Peyrot, V; Verdier, J M
2001-07-02
Neurodegenerative diseases are characterized by the presence of filamentous aggregates of proteins. We previously established that lithostathine is a protein overexpressed in the pre-clinical stages of Alzheimer's disease. Furthermore, it is present in the pathognomonic lesions associated with Alzheimer's disease. After self-proteolysis, the N-terminally truncated form of lithostathine leads to the formation of fibrillar aggregates. Here we observed using atomic force microscopy that these aggregates consisted of a network of protofibrils, each of which had a twisted appearance. Electron microscopy and image analysis showed that this twisted protofibril has a quadruple helical structure. Three-dimensional X-ray structural data and the results of biochemical experiments showed that when forming a protofibril, lithostathine was first assembled via lateral hydrophobic interactions into a tetramer. Each tetramer then linked up with another tetramer as the result of longitudinal electrostatic interactions. All these results were used to build a structural model for the lithostathine protofibril called the quadruple-helical filament (QHF-litho). In conclusion, lithostathine strongly resembles the prion protein in its dramatic proteolysis and amyloid proteins in its ability to form fibrils.
Fusion proteins as alternate crystallization paths to difficult structure problems
NASA Technical Reports Server (NTRS)
Carter, Daniel C.; Rueker, Florian; Ho, Joseph X.; Lim, Kap; Keeling, Kim; Gilliland, Gary; Ji, Xinhua
1994-01-01
The three-dimensional structure of a peptide fusion product with glutathione transferase from Schistosoma japonicum (SjGST) has been solved by crystallographic methods to 2.5 Å resolution. Peptides or proteins can be fused to SjGST and expressed in a plasmid for rapid synthesis in Escherichia coli. Fusion proteins created by this commercial method can be purified rapidly by chromatography on immobilized glutathione. The potential utility of using SjGST fusion proteins as alternate paths to the crystallization and structure determination of proteins is demonstrated.
Wang, Edina; Chinni, Suresh; Bhore, Subhash Janardhan
2014-01-01
Background: The fatty-acid profile of a vegetable oil determines its properties and nutritional value. Palm-oil obtained from the African oil-palm [Elaeis guineensis Jacq. (Tenera)] contains 44% palmitic acid (C16:0), but palm-oil obtained from the American oil-palm [Elaeis oleifera] contains only 25% C16:0. In part, the β-ketoacyl-[ACP] synthase II (KASII) [EC: 2.3.1.179] protein is responsible for the high level of C16:0 in palm-oil derived from the African oil-palm. To understand more about E. guineensis KASII (EgKASII) and E. oleifera KASII (EoKASII) proteins, it is essential to know their structures. Hence, this study was undertaken. Objective: The objective of this study was to predict the three-dimensional (3D) structures of EgKASII and EoKASII proteins using molecular modelling tools. Materials and Methods: The amino-acid sequences for KASII proteins were retrieved from the protein database of the National Center for Biotechnology Information (NCBI), USA. The 3D structures were predicted for both proteins using homology modelling and an ab-initio approach to protein structure prediction. Molecular dynamics (MD) simulation was performed to refine the predicted structures. The predicted structure models were evaluated and root mean square deviation (RMSD) and root mean square fluctuation (RMSF) values were calculated. Results: The homology modelling showed that EgKASII and EoKASII proteins are 78% and 74% similar to Streptococcus pneumoniae KASII and Brucella melitensis KASII, respectively. The EgKASII and EoKASII structures predicted using the ab-initio approach show 6% and 9% deviation, respectively, from the structures predicted by homology modelling. The structure refinement and validation confirmed that the predicted structures are accurate. Conclusion: The 3D structures for EgKASII and EoKASII proteins were predicted. However, further research is essential to understand the interaction of EgKASII and EoKASII proteins with their substrates.
PMID:24748752
Wang, Edina; Chinni, Suresh; Bhore, Subhash Janardhan
2014-01-01
The fatty-acid profile of a vegetable oil determines its properties and nutritional value. Palm-oil obtained from the African oil-palm [Elaeis guineensis Jacq. (Tenera)] contains 44% palmitic acid (C16:0), but palm-oil obtained from the American oil-palm [Elaeis oleifera] contains only 25% C16:0. In part, the β-ketoacyl-[ACP] synthase II (KASII) [EC: 2.3.1.179] protein is responsible for the high level of C16:0 in palm-oil derived from the African oil-palm. To understand more about E. guineensis KASII (EgKASII) and E. oleifera KASII (EoKASII) proteins, it is essential to know their structures. Hence, this study was undertaken. The objective of this study was to predict the three-dimensional (3D) structures of EgKASII and EoKASII proteins using molecular modelling tools. The amino-acid sequences for KASII proteins were retrieved from the protein database of the National Center for Biotechnology Information (NCBI), USA. The 3D structures were predicted for both proteins using homology modelling and an ab-initio approach to protein structure prediction. Molecular dynamics (MD) simulation was performed to refine the predicted structures. The predicted structure models were evaluated and root mean square deviation (RMSD) and root mean square fluctuation (RMSF) values were calculated. The homology modelling showed that EgKASII and EoKASII proteins are 78% and 74% similar to Streptococcus pneumoniae KASII and Brucella melitensis KASII, respectively. The EgKASII and EoKASII structures predicted using the ab-initio approach show 6% and 9% deviation, respectively, from the structures predicted by homology modelling. The structure refinement and validation confirmed that the predicted structures are accurate. The 3D structures for EgKASII and EoKASII proteins were predicted. However, further research is essential to understand the interaction of EgKASII and EoKASII proteins with their substrates.
A Review on Structures and Functions of Bcl-2 Family Proteins from Homo sapiens.
Sivakumar, Dakshinamurthy; Sivaraman, Thirunavukkarasu
2016-01-01
Cancer cells evade apoptosis, which is regulated by proteins of the Bcl-2 family in the intrinsic pathways. Numerous experimental three-dimensional (3D) structures of the apoptotic proteins and of the proteins bound to small chemical molecules/peptides/proteins have been reported in the literature. In this review article, the 3D structures of the Bcl-2 family proteins from Homo sapiens, as well as complex structures of the anti-apoptotic proteins bound to small-molecule inhibitors reported in the literature to date, are comprehensively listed and described in detail. Moreover, the molecular mechanisms by which the Bcl-2 family proteins modulate the apoptotic processes and strategies for designing antagonists to anti-apoptotic proteins are concisely discussed.
Lu, Jia-hai; Zhang, Ding-mei; Wang, Guo-ling; Guo, Zhong-min; Zhang, Chuan-hai; Tan, Bing-yan; Ouyang, Li-ping; Lin, Li; Liu, Yi-min; Chen, Wei-qing; Ling, Wen-hua; Yu, Xin-bing; Zhong, Nan-shan
2005-05-05
The rapid transmission and high mortality rate of severe acute respiratory syndrome (SARS) made it a global threat for which no efficacious therapy is currently available. Without sufficient knowledge of the SARS coronavirus (SARS-CoV), it is impossible to define candidate anti-SARS targets. The putative non-structural protein 2 (nsp2) (3CL(pro), following the nomenclature of Gao et al, also known as nsp5 in Snidjer et al) of SARS-CoV plays an important role in viral transcription and replication and is an attractive target for anti-SARS drug development, so we carried out this study to gain insight into the putative polymerase nsp2 of the SARS-CoV Guangdong (GD) strain. The SARS-CoV strain was isolated from a SARS patient in Guangdong, China, and cultured in Vero E6 cells. The nsp2 gene was amplified by reverse transcription-polymerase chain reaction (RT-PCR) and cloned into the eukaryotic expression vector pCI-neo (pCI-neo/nsp2). The recombinant vector pCI-neo/nsp2 was then transfected into COS-7 cells using lipofectin reagent to express the nsp2 protein. The expressed SARS-CoV nsp2 protein was analyzed by 7% sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE). The nucleotide and protein sequences of GD nsp2 were compared with those of other SARS-CoV strains by nucleotide-nucleotide basic local alignment search tool (BLASTN) and protein-protein basic local alignment search tool (BLASTP) to investigate the trend of its variation during transmission. The secondary structures of the GD strain and of other strains were predicted by Garnier-Osguthorpe-Robson (GOR) Secondary Structure Prediction. The Three-dimensional-PSSM Protein Fold Recognition (Threading) Server was employed to construct the three-dimensional model of the nsp2 protein. The putative polymerase nsp2 gene of the GD strain was amplified by RT-PCR. The eukaryotic expression vector (pCI-neo/nsp2) was constructed and the protein was successfully expressed in COS-7 cells. Sequencing and sequence comparison with other SARS-CoV strains showed that the nsp2 gene was relatively conserved during transmission: a total of five base sites mutated in about 100 strains investigated, of which three, in the early and middle phases, caused synonymous mutations, and two, in the late phase, resulted in amino acid substitutions and secondary structure changes. The three-dimensional structure of the nsp2 protein was successfully constructed. The results suggest that polymerase nsp2 was relatively stable during the epidemic. The amino acid and secondary structure changes may be important for viral infection. The fact that the majority of single nucleotide variations (SNVs) are predicted to be synonymous, together with the low mutation rate of the nsp2 gene across the epidemic, indicates that nsp2 is conserved and could be a target for anti-SARS drugs. The three-dimensional structure indicates that the nsp2 protein of the GD strain is highly homologous with 3CL(pro) of the SARS-CoV Urbani strain, 3CL(pro) of transmissible gastroenteritis virus and 3CL(pro) of human coronavirus 229E strain, which further suggests that the nsp2 protein of the GD strain possesses 3CL(pro) activity.
Protein structural similarity search by Ramachandran codes
Lo, Wei-Cheng; Huang, Po-Jung; Chang, Chih-Hung; Lyu, Ping-Chiang
2007-01-01
Background Protein structural data have increased exponentially, so fast and accurate tools are needed for structural similarity searches. To improve the search speed, several methods have been designed to reduce three-dimensional protein structures to one-dimensional text strings that are then analyzed by traditional sequence alignment methods; however, accuracy is usually sacrificed and the speed still cannot match that of sequence similarity search tools. Here, we aimed to improve the linear encoding methodology and develop efficient search tools that can rapidly retrieve structural homologs from large protein databases. Results We propose a new linear encoding method, SARST (Structural similarity search Aided by Ramachandran Sequential Transformation). SARST transforms protein structures into text strings through a Ramachandran map organized by nearest-neighbor clustering and uses a regenerative approach to produce substitution matrices. Then, classical sequence similarity search methods can be applied to the structural similarity search. Its accuracy is similar to that of Combinatorial Extension (CE), and it works over 243,000 times faster, searching 34,000 proteins in 0.34 sec with a 3.2-GHz CPU. SARST provides statistically meaningful expectation values to assess the retrieved information. It has been implemented as a web service and a stand-alone Java program that is able to run on many different platforms. Conclusion As a database search method, SARST can rapidly distinguish high from low similarities and efficiently retrieve homologous structures. It demonstrates that the easily accessible linear encoding methodology has the potential to serve as a foundation for efficient protein structural similarity search tools. These search tools should be applicable to automated and high-throughput functional annotations or predictions for the ever increasing number of published protein structures in this post-genomic era. PMID:17716377
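The idea of reducing a structure to a text string via the Ramachandran map can be sketched as follows. This is not the SARST encoding: SARST derives its map regions by nearest-neighbor clustering, whereas this sketch substitutes a uniform grid of 90-degree bins, and the names `BIN_DEG`, `encode_residue` and `encode_structure` are hypothetical.

```python
import string

BIN_DEG = 90  # hypothetical bin width; SARST uses clustered regions instead

def encode_residue(phi, psi, bin_deg=BIN_DEG):
    """Map one (phi, psi) pair in degrees to a letter of a grid alphabet."""
    bins_per_axis = 360 // bin_deg
    # Shift angles from [-180, 180) to [0, 360) and find the grid cell.
    col = int((phi + 180) // bin_deg) % bins_per_axis
    row = int((psi + 180) // bin_deg) % bins_per_axis
    return string.ascii_uppercase[row * bins_per_axis + col]

def encode_structure(torsions):
    """Encode a list of (phi, psi) pairs as a one-dimensional text string."""
    return "".join(encode_residue(phi, psi) for phi, psi in torsions)

# Toy backbone (hypothetical angles): three helix-like, two strand-like residues.
helix_like = [(-60.0, -45.0)] * 3
strand_like = [(-120.0, 130.0)] * 2
print(encode_structure(helix_like + strand_like))  # FFFMM
```

Once structures are strings, any standard sequence search tool can compare them, which is where the speed-up described in the abstract comes from.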
Electron crystallography of PhoE porin, an outer membrane, channel-forming protein from E. coli
DOE Office of Scientific and Technical Information (OSTI.GOV)
Walian, P.J.
1989-11-01
One approach to studying the structure of membrane proteins is the use of electron crystallography. Dr. Bing Jap has crystallized PhoE pore-forming protein (porin) from the outer membrane of Escherichia coli (E. coli) into monolayer crystals. The findings of this research and those of Jap (1988, 1989) have determined these crystals to be highly ordered, yielding structural information to a resolution of better than 2.8 angstroms. The task of this thesis has been to collect and process the electron diffraction patterns necessary to generate a complete three-dimensional set of high resolution structure factor amplitudes of PhoE porin. Fourier processing of these amplitudes when combined with the corresponding phase data is expected to yield the three-dimensional structure of PhoE porin at better than 3.5 angstroms resolution. 92 refs., 33 figs., 3 tabs. (CBS)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kalbitzer, H.R.; Neidig, K.P.; Hengstenberg, W.
1991-11-19
Complete sequence-specific assignments of the ¹H NMR spectrum of HPr protein from Staphylococcus aureus were obtained by two-dimensional NMR methods. Important secondary structure elements that can be derived from the observed nuclear Overhauser effects are a large antiparallel β-pleated sheet consisting of four strands, A, B, C, D, a segment S_AB consisting of an extended region around the active-center histidine (His-15) and an α-helix, a half-turn between strands B and C, a segment S_CD which shows no typical secondary structure, and the α-helical, C-terminal segment S_term. These general structural features are similar to those found earlier in HPr proteins from different microorganisms such as Escherichia coli, Bacillus subtilis, and Streptococcus faecalis.
Encounter complexes and dimensionality reduction in protein-protein association.
Kozakov, Dima; Li, Keyong; Hall, David R; Beglov, Dmitri; Zheng, Jiefu; Vakili, Pirooz; Schueler-Furman, Ora; Paschalidis, Ioannis Ch; Clore, G Marius; Vajda, Sandor
2014-04-08
An outstanding challenge has been to understand the mechanism whereby proteins associate. We report here the results of exhaustively sampling the conformational space in protein-protein association using a physics-based energy function. The agreement between experimental intermolecular paramagnetic relaxation enhancement (PRE) data and the PRE profiles calculated from the docked structures shows that the method captures both specific and non-specific encounter complexes. To explore the energy landscape in the vicinity of the native structure, the nonlinear manifold describing the relative orientation of two solid bodies is projected onto a Euclidean space in which the shape of low energy regions is studied by principal component analysis. Results show that the energy surface is canyon-like, with a smooth funnel within a two dimensional subspace capturing over 75% of the total motion. Thus, proteins tend to associate along preferred pathways, similar to sliding of a protein along DNA in the process of protein-DNA recognition. DOI: http://dx.doi.org/10.7554/eLife.01370.001.
Imai, Takashi; Kovalenko, Andriy; Hirata, Fumio
2005-04-14
The three-dimensional reference interaction site model (3D-RISM) theory is applied to the analysis of hydration effects on the partial molar volume of proteins. For the native structure of some proteins, the partial molar volume is decomposed into geometric and hydration contributions using the 3D-RISM theory combined with the geometric volume calculation. The hydration contributions are correlated with the surface properties of the protein. The thermal volume, which is the volume of voids around the protein induced by the thermal fluctuation of water molecules, is directly proportional to the accessible surface area of the protein. The interaction volume, which is the contribution of electrostatic interactions between the protein and water molecules, is apparently governed by the charged atomic groups on the protein surface. The polar atomic groups do not make any contribution to the interaction volume. The volume differences between low- and high-pressure structures of lysozyme are also analyzed by the present method.
Barkay-Olami, Hilla; Zilberman, Meital
2016-08-01
Use of naturally derived materials for biomedical applications is steadily increasing. Soy protein has advantages over various types of natural proteins employed for biomedical applications due to its low price, nonanimal origin, and relatively long storage time and stability. In the current study, blends of soy protein with other polymers (gelatin, alginate, pectin, polyvinyl alcohol, and polyethylene glycol) were developed and studied. The mechanical tensile properties of dense films were studied in order to select the best secondary polymer for porous three-dimensional structures. The porous soy-gelatin and soy-alginate structures were then studied for physical properties, degradation behavior, and microstructure. The results show that these blends can be assembled into porous three-dimensional structures by combining chemical crosslinking with freeze-drying. The soy-alginate blends are advantageous over the soy-gelatin blends: they demonstrated better stability and degradation time, along with controlled swelling behavior, owing to more effective crosslinking and higher water uptake. Water vapor transmission rate experiments showed that all porous blend structures were in the desired range for burn treatment [2000-2500 g/(m² d)] and can be controlled by the crosslinking process. We conclude that these novel porous three-dimensional structures have a high potential for use as scaffolds for tissue engineering, especially for skin regeneration applications. © 2015 Wiley Periodicals, Inc. J Biomed Mater Res Part B: Appl Biomater, 104B: 1109-1120, 2016.
Bhardwaj, Anshul; Casjens, Sherwood R; Cingolani, Gino
2014-02-01
Protein fibers are widespread in nature, but only a limited number of high-resolution structures have been determined experimentally. Unlike globular proteins, fibers are usually recalcitrant to form three-dimensional crystals, preventing single-crystal X-ray diffraction analysis. In the absence of three-dimensional crystals, X-ray fiber diffraction is a powerful tool to determine the internal symmetry of a fiber, but it rarely yields atomic resolution structural information on complex protein fibers. An 85-residue-long minimal coiled-coil repeat unit (MiCRU) was previously identified in the trimeric helical core of tail needle gp26, a fibrous protein emanating from the tail apparatus of the bacteriophage P22 virion. Here, evidence is provided that an MiCRU can be inserted in frame inside the gp26 helical core to generate a rationally extended fiber (gp26-2M) which, like gp26, retains a trimeric quaternary structure in solution. The 2.7 Å resolution crystal structure of this engineered fiber, which measures ∼320 Å in length and is only 20-35 Å wide, was determined. This structure, the longest for a trimeric protein fiber to be determined to such a high resolution, reveals the architecture of 22 consecutive trimerization heptads and provides a framework to decipher the structural determinants for protein fiber assembly, stability and flexibility.
Mining protein loops using a structural alphabet and statistical exceptionality
2010-01-01
Background Protein loops encompass 50% of protein residues in available three-dimensional structures. These regions are often involved in protein functions, e.g. binding sites or catalytic pockets. However, the description of protein loops with conventional tools is not an easy task. Regular secondary structures, helices and strands, have been widely studied whereas loops, because they are highly variable in terms of sequence and structure, are difficult to analyze. Due to data sparsity, long loops have rarely been systematically studied. Results We developed a simple and accurate method that allows the description and analysis of the structures of short and long loops using structural motifs without restriction on loop length. This method is based on the structural alphabet HMM-SA. HMM-SA allows the simplification of a three-dimensional protein structure into a one-dimensional string of states, where each state is a four-residue prototype fragment, called structural letter. The difficult task of the structural grouping of huge data sets is thus easily accomplished by handling structural letter strings as in conventional protein sequence analysis. We systematically extracted all seven-residue fragments in a bank of 93000 protein loops and grouped them according to the structural-letter sequence, named structural word. This approach permits a systematic analysis of loops of all sizes since we consider the structural motifs of seven residues rather than complete loops. We focused the analysis on highly recurrent words of loops (observed more than 30 times). Our study reveals that 73% of loop-lengths are covered by only 3310 highly recurrent structural words (out of 28274 observed words). These structural words have low structural variability (mean RMSd of 0.85 Å). As expected, half of these motifs display a flanking-region preference but interestingly, two thirds are shared by short (less than 12 residues) and long loops.
Moreover, half of recurrent motifs exhibit a significant level of amino-acid conservation with at least four significant positions and 87% of long loops contain at least one such word. We complement our analysis with the detection of statistically over-represented patterns of structural letters as in conventional DNA sequence analysis. About 30% (930) of structural words are over-represented, and cover about 40% of loop lengths. Interestingly, these words exhibit lower structural variability and higher sequential specificity, suggesting structural or functional constraints. Conclusions We developed a method to systematically decompose and study protein loops using recurrent structural motifs. This method is based on the structural alphabet HMM-SA and not on structural alignment and geometrical parameters. We extracted meaningful structural motifs that are found in both short and long loops. To our knowledge, it is the first time that pattern mining helps to increase the signal-to-noise ratio in protein loops. This finding helps to better describe protein loops and might permit to decrease the complexity of long-loop analysis. Detailed results are available at http://www.mti.univ-paris-diderot.fr/publication/supplementary/2009/ACCLoop/. PMID:20132552
Mining protein loops using a structural alphabet and statistical exceptionality.
Regad, Leslie; Martin, Juliette; Nuel, Gregory; Camproux, Anne-Claude
2010-02-04
Protein loops encompass 50% of protein residues in available three-dimensional structures. These regions are often involved in protein functions, e.g. binding sites or catalytic pockets. However, the description of protein loops with conventional tools is not an easy task. Regular secondary structures, helices and strands, have been widely studied whereas loops, because they are highly variable in terms of sequence and structure, are difficult to analyze. Due to data sparsity, long loops have rarely been systematically studied. We developed a simple and accurate method that allows the description and analysis of the structures of short and long loops using structural motifs without restriction on loop length. This method is based on the structural alphabet HMM-SA. HMM-SA allows the simplification of a three-dimensional protein structure into a one-dimensional string of states, where each state is a four-residue prototype fragment, called structural letter. The difficult task of the structural grouping of huge data sets is thus easily accomplished by handling structural letter strings as in conventional protein sequence analysis. We systematically extracted all seven-residue fragments in a bank of 93000 protein loops and grouped them according to the structural-letter sequence, named structural word. This approach permits a systematic analysis of loops of all sizes since we consider the structural motifs of seven residues rather than complete loops. We focused the analysis on highly recurrent words of loops (observed more than 30 times). Our study reveals that 73% of loop-lengths are covered by only 3310 highly recurrent structural words (out of 28274 observed words). These structural words have low structural variability (mean RMSd of 0.85 Å). As expected, half of these motifs display a flanking-region preference but interestingly, two thirds are shared by short (less than 12 residues) and long loops.
Moreover, half of recurrent motifs exhibit a significant level of amino-acid conservation with at least four significant positions and 87% of long loops contain at least one such word. We complement our analysis with the detection of statistically over-represented patterns of structural letters as in conventional DNA sequence analysis. About 30% (930) of structural words are over-represented, and cover about 40% of loop lengths. Interestingly, these words exhibit lower structural variability and higher sequential specificity, suggesting structural or functional constraints. We developed a method to systematically decompose and study protein loops using recurrent structural motifs. This method is based on the structural alphabet HMM-SA and not on structural alignment and geometrical parameters. We extracted meaningful structural motifs that are found in both short and long loops. To our knowledge, it is the first time that pattern mining helps to increase the signal-to-noise ratio in protein loops. This finding helps to better describe protein loops and might permit to decrease the complexity of long-loop analysis. Detailed results are available at http://www.mti.univ-paris-diderot.fr/publication/supplementary/2009/ACCLoop/.
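The word-extraction step described in this abstract can be sketched in a few lines. The sketch assumes each loop has already been encoded as a string of structural letters, and that a seven-residue fragment corresponds to four consecutive letters (four overlapping four-residue fragments); the names `count_words` and `recurrent_words`, the toy letter strings and the threshold used here are hypothetical.

```python
from collections import Counter

WORD_LEN = 4  # assumption: 7 residues = 4 overlapping four-residue letters

def count_words(loops, word_len=WORD_LEN):
    """Count every structural word (sliding window) across all loops."""
    counts = Counter()
    for letters in loops:
        for i in range(len(letters) - word_len + 1):
            counts[letters[i:i + word_len]] += 1
    return counts

def recurrent_words(counts, min_count):
    """Keep words observed more than min_count times."""
    return {word: n for word, n in counts.items() if n > min_count}

# Toy loops (hypothetical letter strings, not real HMM-SA output).
loops = ["ABCDAB", "ABCD", "XABCDY"]
counts = count_words(loops)
print(recurrent_words(counts, min_count=2))  # {'ABCD': 3}
```

With the study's threshold the call would be `recurrent_words(counts, min_count=30)`; the toy data set is far too small for that, hence the lower value above.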
NIAS-Server: Neighbors Influence of Amino acids and Secondary Structures in Proteins.
Borguesan, Bruno; Inostroza-Ponta, Mario; Dorn, Márcio
2017-03-01
The exponential growth in the number of experimentally determined three-dimensional protein structures provides new and relevant knowledge about the conformation of amino acids in proteins. Only a few probability densities of amino acids are publicly available for use in structure validation and prediction methods. NIAS (Neighbors Influence of Amino acids and Secondary structures) is a web-based tool used to extract information about the conformational preferences of amino acid residues and secondary structures in experimentally determined protein templates. This information is useful, for example, to characterize folds and local motifs in proteins and to study molecular folding, and it can help solve complex problems such as protein structure prediction and protein design, among others. The NIAS-Server and supplementary data are available at http://sbcb.inf.ufrgs.br/nias.
Physical Model of the Genotype-to-Phenotype Map of Proteins
NASA Astrophysics Data System (ADS)
Tlusty, Tsvi; Libchaber, Albert; Eckmann, Jean-Pierre
2017-04-01
How DNA is mapped to functional proteins is a basic question of living matter. We introduce and study a physical model of protein evolution which suggests a mechanical basis for this map. Many proteins rely on large-scale motion to function. We therefore treat protein as learning amorphous matter that evolves towards such a mechanical function: Genes are binary sequences that encode the connectivity of the amino acid network that makes a protein. The gene is evolved until the network forms a shear band across the protein, which allows for long-range, soft modes required for protein function. The evolution reduces the high-dimensional sequence space to a low-dimensional space of mechanical modes, in accord with the observed dimensional reduction between genotype and phenotype of proteins. Spectral analysis of the space of 10^6 solutions shows a strong correspondence between localization around the shear band of both mechanical modes and the sequence structure. Specifically, our model shows how mutations are correlated among amino acids whose interactions determine the functional mode.
Hydrophobic core malleability of a de novo designed three-helix bundle protein.
Walsh, S T; Sukharev, V I; Betz, S F; Vekshin, N L; DeGrado, W F
2001-01-12
De novo protein design provides a tool for testing the principles that stabilize the structures of proteins. Recently, we described the design and structure determination of α3D, a three-helix bundle protein with a well-packed hydrophobic core. Here, we test the malleability and adaptability of this protein's structure by mutating a small Ala residue (A60) in its core to the larger hydrophobic side-chains Leu and Ile. Such changes introduce strain into the structures of natural proteins, and therefore generally destabilize the native state. By contrast, these mutations were slightly stabilizing (approximately 1.5 kcal mol⁻¹) to the tertiary structure of α3D. The value of ΔCp for unfolding of these mutants was not greatly affected relative to wild-type, indicating that the change in solvent accessibility for unfolding was similar. However, two-dimensional heteronuclear single quantum coherence spectra indicate that the protein adjusts to the introduction of steric bulk in different ways. A60L-α3D showed serious erosion in the dispersion of both the amide backbone and the side-chain methyl chemical shifts. By contrast, A60I-α3D showed excellent dispersion of the backbone resonances, and selective changes in dispersion of the aliphatic side-chains proximal to the site of mutation. Together, these data suggest that α3D, although folded into a unique three-dimensional structure, is nevertheless more malleable and flexible than most natural, native proteins. Copyright 2001 Academic Press.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Anna Johnston, SNL 9215
2002-09-01
PDB to AMPL Conversion was written to convert protein data base files to AMPL files. The protein data bases on the internet contain a wealth of information about the structure and makeup of proteins. Each file contains information derived from one or more experiments, including how the experiment was performed, the amino acid building blocks of each chain, and often the three-dimensional structure of the protein extracted from the experiments. The way a protein folds determines much about its function. Thus, studying the three-dimensional structure of the protein is of great interest. Analysing the contact maps is one way to examine the structure. A contact map is a graph which has a linear backbone of amino acids for nodes (i.e., adjacent amino acids are always connected) and edges between non-adjacent nodes if they are close enough to be considered in contact. If the graphs are similar then the folds of the proteins and their functions should also be similar. This software extracts the contact maps from a protein data base file and puts them into AMPL data format. This format is designed for use in AMPL, a programming language for simplifying linear programming formulations.
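The contact-map definition given in this record can be made concrete with a short sketch. It is not the software described above: the input is assumed to be one representative coordinate per residue (for example a Cα position), the distance cutoff is a hypothetical parameter, and no PDB parsing or AMPL output is attempted.

```python
import math

def contact_map(coords, cutoff):
    """Return contact-map edges as a set of (i, j) index pairs with i < j."""
    edges = set()
    n = len(coords)
    # Backbone edges: adjacent residues are always connected.
    for i in range(n - 1):
        edges.add((i, i + 1))
    # Contact edges: non-adjacent residues closer than the cutoff.
    for i in range(n):
        for j in range(i + 2, n):
            if math.dist(coords[i], coords[j]) <= cutoff:
                edges.add((i, j))
    return edges

# Toy chain (hypothetical coordinates): four residues on the corners of a square.
chain = [(0.0, 0.0, 0.0), (5.0, 0.0, 0.0), (5.0, 5.0, 0.0), (0.0, 5.0, 0.0)]
print(sorted(contact_map(chain, cutoff=6.0)))
# [(0, 1), (0, 3), (1, 2), (2, 3)]
```

Here residues 0 and 3 are 5 units apart and so are in contact, while the two diagonals (about 7.07 units) fall outside the cutoff. Comparing two proteins then reduces to comparing their edge sets.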
Researchers at the Frederick National Lab (FNL) have collaborated in solving the three-dimensional structure of a key protein in Alzheimer’s disease, providing new insight into the basic mechanisms that give rise to the devastating illness. The pro
Femtosecond X-ray Diffraction From Two-Dimensional Protein Crystals
DOE Office of Scientific and Technical Information (OSTI.GOV)
Frank, Matthias; Carlson, David B.; Hunter, Mark
2014-02-28
Here we present femtosecond x-ray diffraction patterns from two-dimensional (2-D) protein crystals using an x-ray free electron laser (XFEL). To date it has not been possible to acquire x-ray diffraction from individual 2-D protein crystals due to radiation damage. However, the intense and ultrafast pulses generated by an XFEL permit a new method of collecting diffraction data before the sample is destroyed. Utilizing a diffract-before-destroy methodology at the Linac Coherent Light Source, we observed Bragg diffraction to better than 8.5 Å resolution for two different 2-D protein crystal samples that were maintained at room temperature. These proof-of-principle results show promise for structural analysis of both soluble and membrane proteins arranged as 2-D crystals without requiring cryogenic conditions or the formation of three-dimensional crystals.
Three-dimensional electron diffraction of plant light-harvesting complex
Wang, Da Neng; Kühlbrandt, Werner
1992-01-01
Electron diffraction patterns of two-dimensional crystals of light-harvesting chlorophyll a/b-protein complex (LHC-II) from photosynthetic membranes of pea chloroplasts, tilted at different angles up to 60°, were collected to 3.2 Å resolution at -125°C. The reflection intensities were merged into a three-dimensional data set. The Friedel R-factor and the merging R-factor were 21.8 and 27.6%, respectively. Specimen flatness and crystal size were critical for recording electron diffraction patterns from crystals at high tilts. The principal sources of experimental error were attributed to limitations of the number of unit cells contributing to an electron diffraction pattern, and to the critical electron dose. The distribution of strong diffraction spots indicated that the three-dimensional structure of LHC-II is less regular than that of other known membrane proteins and is not dominated by a particular feature of secondary structure. PMID:19431817
Krissinel, E; Henrick, K
2004-12-01
The present paper describes the SSM algorithm of protein structure comparison in three dimensions, which includes an original procedure of matching graphs built on the protein's secondary-structure elements, followed by an iterative three-dimensional alignment of protein backbone Cα atoms. The SSM results are compared with those obtained from other protein comparison servers, and the advantages and disadvantages of different scores that are used for structure recognition are discussed. A new score, balancing the r.m.s.d. and alignment length Nalign, is proposed. It is found that different servers agree reasonably well on the new score, while showing considerable differences in r.m.s.d. and Nalign.
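The abstract does not give the formula for the score that balances r.m.s.d. and Nalign. The sketch below uses the form in which the SSM Q-score is commonly quoted, Q = Nalign² / ((1 + (r.m.s.d./R0)²) · N1 · N2) with R0 = 3 Å; both the formula and the R0 value are assumptions here and should be checked against the paper before use. The function name `q_score` is hypothetical.

```python
def q_score(n_align, rmsd, n_res1, n_res2, r0=3.0):
    """Quality score rewarding long alignments and penalizing high r.m.s.d.

    Assumed form: Q = n_align^2 / ((1 + (rmsd / r0)^2) * n_res1 * n_res2).
    Q reaches 1.0 only for identical structures (full-length alignment, rmsd 0).
    """
    if n_res1 <= 0 or n_res2 <= 0:
        raise ValueError("both structures must contain at least one residue")
    return (n_align ** 2) / ((1.0 + (rmsd / r0) ** 2) * n_res1 * n_res2)

# Toy comparisons (hypothetical numbers).
print(q_score(100, 0.0, 100, 100))  # 1.0
print(q_score(50, 3.0, 100, 100))   # 0.125
```

The second call shows the balancing effect: halving the aligned length and raising the r.m.s.d. to R0 each reduce the score, so neither quantity can be optimized in isolation.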
Fujimoto, Akihiro; Okada, Yukinori; Boroevich, Keith A; Tsunoda, Tatsuhiko; Taniguchi, Hiroaki; Nakagawa, Hidewaki
2016-05-26
Protein tertiary structure determines molecular function, interaction, and stability of the protein, therefore distribution of mutation in the tertiary structure can facilitate the identification of new driver genes in cancer. To analyze mutation distribution in protein tertiary structures, we applied a novel three dimensional permutation test to the mutation positions. We analyzed somatic mutation datasets of 21 types of cancers obtained from exome sequencing conducted by the TCGA project. Of the 3,622 genes that had ≥3 mutations in the regions with tertiary structure data, 106 genes showed significant skew in mutation distribution. Known tumor suppressors and oncogenes were significantly enriched in these identified cancer gene sets. Physical distances between mutations in known oncogenes were significantly smaller than those of tumor suppressors. Twenty-three genes were detected in multiple cancers. Candidate genes with significant skew of the 3D mutation distribution included kinases (MAPK1, EPHA5, ERBB3, and ERBB4), an apoptosis related gene (APP), an RNA splicing factor (SF1), a miRNA processing factor (DICER1), an E3 ubiquitin ligase (CUL1) and transcription factors (KLF5 and EEF1B2). Our study suggests that systematic analysis of mutation distribution in the tertiary protein structure can help identify cancer driver genes.
Fujimoto, Akihiro; Okada, Yukinori; Boroevich, Keith A.; Tsunoda, Tatsuhiko; Taniguchi, Hiroaki; Nakagawa, Hidewaki
2016-01-01
Protein tertiary structure determines molecular function, interaction, and stability of the protein, therefore distribution of mutation in the tertiary structure can facilitate the identification of new driver genes in cancer. To analyze mutation distribution in protein tertiary structures, we applied a novel three dimensional permutation test to the mutation positions. We analyzed somatic mutation datasets of 21 types of cancers obtained from exome sequencing conducted by the TCGA project. Of the 3,622 genes that had ≥3 mutations in the regions with tertiary structure data, 106 genes showed significant skew in mutation distribution. Known tumor suppressors and oncogenes were significantly enriched in these identified cancer gene sets. Physical distances between mutations in known oncogenes were significantly smaller than those of tumor suppressors. Twenty-three genes were detected in multiple cancers. Candidate genes with significant skew of the 3D mutation distribution included kinases (MAPK1, EPHA5, ERBB3, and ERBB4), an apoptosis related gene (APP), an RNA splicing factor (SF1), a miRNA processing factor (DICER1), an E3 ubiquitin ligase (CUL1) and transcription factors (KLF5 and EEF1B2). Our study suggests that systematic analysis of mutation distribution in the tertiary protein structure can help identify cancer driver genes. PMID:27225414
Takeda, Mitsuhiro; Sugimori, Nozomi; Torizawa, Takuya; Terauchi, Tsutomu; Ono, Akira Mei; Yagi, Hirokazu; Yamaguchi, Yoshiki; Kato, Koichi; Ikeya, Teppei; Jee, JunGoo; Güntert, Peter; Aceti, David J.; Markley, John L.; Kainosho, Masatsune
2009-01-01
The product of gene At3g16450.1 from Arabidopsis thaliana is a 32 kDa, 299-residue protein classified as resembling a myrosinase-binding protein (MyroBP). MyroBPs are found in plants as part of a complex with the glucosinolate-degrading enzyme, myrosinase, and are suspected to play a role in myrosinase-dependent defense against pathogens. Many MyroBPs and MyroBP-related proteins are composed of repeated homologous sequences with unknown structure. We report here the three-dimensional structure of the At3g16450.1 protein from Arabidopsis, which consists of two tandem repeats. Because the size of the protein is larger than that amenable to high-throughput analysis by uniformly 13C/15N labeling methods, we used our stereo-array isotope labeling (SAIL) technology to prepare an optimally 2H/13C/15N-labeled sample. NMR data sets collected with the SAIL-protein enabled us to assign 1H, 13C and 15N chemical shifts to 95.5% of all atoms, even at the low concentration (0.2 mM) of the protein product. We collected additional NOESY data and solved the three-dimensional structure with the CYANA software package. The structure, the first for a MyroBP family member, revealed that the At3g16450.1 protein consists of two independent, but similar, lectin-fold domains composed of three β-sheets. PMID:19021763
Takeda, Mitsuhiro; Sugimori, Nozomi; Torizawa, Takuya; Terauchi, Tsutomu; Ono, Akira M; Yagi, Hirokazu; Yamaguchi, Yoshiki; Kato, Koichi; Ikeya, Teppei; Jee, Jungoo; Güntert, Peter; Aceti, David J; Markley, John L; Kainosho, Masatsune
2008-12-01
The product of gene At3g16450.1 from Arabidopsis thaliana is a 32 kDa, 299-residue protein classified as resembling a myrosinase-binding protein (MyroBP). MyroBPs are found in plants as part of a complex with the glucosinolate-degrading enzyme myrosinase, and are suspected to play a role in myrosinase-dependent defense against pathogens. Many MyroBPs and MyroBP-related proteins are composed of repeated homologous sequences with unknown structure. We report here the three-dimensional structure of the At3g16450.1 protein from Arabidopsis, which consists of two tandem repeats. Because the size of the protein is larger than that amenable to high-throughput analysis by uniform 13C/15N labeling methods, we used stereo-array isotope labeling (SAIL) technology to prepare an optimally 2H/13C/15N-labeled sample. NMR data sets collected using the SAIL protein enabled us to assign 1H, 13C and 15N chemical shifts to 95.5% of all atoms, even at a low concentration (0.2 mM) of protein product. We collected additional NOESY data and determined the three-dimensional structure using the CYANA software package. The structure, the first for a MyroBP family member, revealed that the At3g16450.1 protein consists of two independent but similar lectin-fold domains, each composed of three β-sheets.
Three-dimensional structure of Erwinia carotovora L-asparaginase
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kislitsyn, Yu. A.; Kravchenko, O. V.; Nikonov, S. V.
2006-10-15
Three-dimensional structure of Erwinia carotovora L-asparaginase, which has antitumor activity and is used for the treatment of acute lymphoblastic leukemia, was solved at 3 Å resolution and refined to R_cryst = 20% and R_free = 28%. Crystals of recombinant Erwinia carotovora L-asparaginase were grown by the hanging-drop vapor-diffusion method from protein solutions in a HEPES buffer (pH 6.5) and PEG MME 5000 solutions in a cacodylate buffer (pH 6.5) as the precipitant. Three-dimensional X-ray diffraction data were collected up to 3 Å resolution from one crystal at room temperature. The structure was solved by the molecular replacement method using the coordinates of Erwinia chrysanthemi L-asparaginase as the starting model. The coordinates refined with the use of the CNS program package were deposited in the Protein Data Bank (PDB code 1ZCF).
Hao, Xiao-Hu; Zhang, Gui-Jun; Zhou, Xiao-Gen; Yu, Xu-Feng
2016-01-01
To address the problem of searching the protein conformational space in ab initio protein structure prediction, a novel method using abstract convex underestimation (ACUE) within an evolutionary algorithm framework is proposed. Computing such conformations, essential to associate structural and functional information with gene sequences, is challenging due to the high dimensionality and rugged energy surface of the protein conformational space. As a consequence, the dimension of the protein conformational space should be reduced to a proper level. In this paper, the high-dimensional original conformational space is converted into a feature space whose dimension is considerably reduced by a feature extraction technique, and the underestimate space is constructed according to abstract convex theory. Thus, the entropy effect caused by searching in the high-dimensional conformational space can be avoided through this conversion. Tight lower-bound estimates are obtained to guide the search direction, and invalid search areas in which the global optimal solution is not located can be eliminated in advance. Moreover, instead of expensively calculating the energy of conformations in the original conformational space, the estimated value is used to judge whether a conformation is worth exploring, which reduces the evaluation time, lowers the computational cost, and makes the search more efficient. Additionally, fragment assembly and the Monte Carlo method are combined to generate a series of metastable conformations by sampling in the conformational space. The proposed method provides a novel technique to solve the searching problem of protein conformational space. Twenty small-to-medium structurally diverse proteins were tested, and the proposed ACUE method was compared with ItFix, HEA, Rosetta and the developed method LEDE without underestimate information. Test results show that the ACUE method can more rapidly and more efficiently obtain the near-native protein structure.
Deng, Lei; Fan, Chao; Zeng, Zhiwen
2017-12-28
Direct prediction of the three-dimensional (3D) structures of proteins from one-dimensional (1D) sequences is a challenging problem. Significant structural characteristics such as solvent accessibility and contact number are essential for deriving restraints in modeling protein folding and protein 3D structure. Thus, accurately predicting these features is a critical step for 3D protein structure building. In this study, we present DeepSacon, a computational method that can effectively predict protein solvent accessibility and contact number by using a deep neural network, which is built based on a stacked autoencoder and a dropout method. The results demonstrate that our proposed DeepSacon achieves a significant improvement in the prediction quality compared with the state-of-the-art methods. We obtain 0.70 three-state accuracy for solvent accessibility, 0.33 15-state accuracy and 0.74 Pearson Correlation Coefficient (PCC) for the contact number on the 5729 monomeric soluble globular protein dataset. We also evaluate the performance on the CASP11 benchmark dataset, where DeepSacon achieves 0.68 three-state accuracy and 0.69 PCC for solvent accessibility and contact number, respectively. We have shown that DeepSacon can reliably predict solvent accessibility and contact number with a stacked sparse autoencoder and a dropout approach.
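The two evaluation measures quoted above, N-state accuracy and the Pearson correlation coefficient (PCC), can be sketched as follows. This is a generic illustration of the metrics, not the authors' evaluation code; the state labels (`B`, `I`, `E`) and the toy inputs are assumptions made for the example.

```python
import math


def state_accuracy(predicted, actual):
    """Fraction of residues whose predicted state label matches the true one."""
    if len(predicted) != len(actual):
        raise ValueError("sequences must have the same length")
    return sum(p == a for p, a in zip(predicted, actual)) / len(actual)


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


# Hypothetical per-residue labels: B = buried, I = intermediate, E = exposed.
acc = state_accuracy("BBEE", "BIEE")
# Hypothetical predicted vs. observed contact numbers for four residues.
pcc = pearson([4, 7, 9, 12], [5, 8, 8, 13])
```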
3DProIN: Protein-Protein Interaction Networks and Structure Visualization.
Li, Hui; Liu, Chunmei
2014-06-14
3DProIN is a computational tool to visualize protein-protein interaction networks in both two-dimensional (2D) and three-dimensional (3D) views. It models protein-protein interactions in a graph and explores the biologically relevant features of the tertiary structures of each protein in the network. Properties such as color, shape and name of each node (protein) of the network can be edited in either 2D or 3D views. 3DProIN is implemented using 3D Java and C programming languages. An internet crawling technique is also used to dynamically retrieve and parse protein interactions from the Protein Data Bank (PDB). It is a Java applet component that is embedded in the web page, and it can be used on different platforms including Linux, Mac and Windows using web browsers such as Firefox, Internet Explorer, Chrome and Safari. It was also converted into a Mac app and submitted to the App Store as a free app. Mac users can also download the app from our website. 3DProIN is available for academic research at http://bicompute.appspot.com.
DOE Office of Scientific and Technical Information (OSTI.GOV)
van der Graaf, M.; van Mierlo, C.P.M.; Hemminga, M.A.
1991-06-11
The first 25 amino acids of the coat protein of cowpea chlorotic mottle virus are essential for binding the encapsidated RNA. Although an α-helical conformation has been predicted for this highly positively charged N-terminal region, no experimental evidence for this conformation has been presented so far. In this study, two-dimensional proton NMR experiments were performed on a chemically synthesized pentacosapeptide containing the first 25 amino acids of this coat protein. All resonances could be assigned by a combined use of two-dimensional correlated spectroscopy and nuclear Overhauser enhancement spectroscopy carried out at four different temperatures. Various NMR parameters indicate the presence of a conformational ensemble consisting of helical structures rapidly converting into more extended states. Differences in chemical shifts and nuclear Overhauser effects indicate that lowering the temperature induces a shift of the dynamic equilibrium toward more helical structures. At 10 °C, a perceptible fraction of the conformational ensemble consists of structures with an α-helical conformation between residues 9 and 17, likely starting with a turnlike structure around Thr9 and Arg10. Both the conformation and the position of this helical region agree well with the secondary structure predictions mentioned above.
Controllable assembly and disassembly of nanoparticle systems via protein and DNA agents
Lee, Soo-Kwan; Gang, Oleg; van der Lelie, Daniel
2014-05-20
The invention relates to the use of peptides, proteins, and other oligomers to provide a means by which normally quenched nanoparticle fluorescence may be recovered upon detection of a target molecule. Further, the inventive technology provides a structure and method to carry out detection of target molecules without the need to label the target molecules before detection. In another aspect, a method for forming arbitrarily shaped two- and three-dimensional protein-mediated nanoparticle structures and the resulting structures are described. Proteins mediating structure formation may themselves be functionalized with a variety of useful moieties, including catalytic functional groups.
Molecular Analysis of Protein Assembly in Muscle Development.
ERIC Educational Resources Information Center
Epstein, Henry F.; Fischman, Donald A.
1991-01-01
Advances in the genetics and cell biology of muscle development are discussed. In-vitro analysis of the renaturation, polymerization, and three-dimensional structure of the purified proteins involved is described. (CW)
SAIL--stereo-array isotope labeling.
Kainosho, Masatsune; Güntert, Peter
2009-11-01
Optimal stereospecific and regiospecific labeling of proteins with stable isotopes enhances the nuclear magnetic resonance (NMR) method for the determination of the three-dimensional protein structures in solution. Stereo-array isotope labeling (SAIL) offers sharpened lines, spectral simplification without loss of information and the ability to rapidly collect and automatically evaluate the structural restraints required to solve a high-quality solution structure for proteins up to twice as large as before. This review gives an overview of stable isotope labeling methods for NMR spectroscopy with proteins and provides an in-depth treatment of the SAIL technology.
Bio-Organic Nanotechnology: Using Proteins and Synthetic Polymers for Nanoscale Devices
NASA Technical Reports Server (NTRS)
Molnar, Linda K.; Xu, Ting; Trent, Jonathan D.; Russell, Thomas P.
2003-01-01
While the ability of proteins to self-assemble makes them powerful tools in nanotechnology, in biological systems protein-based structures ultimately depend on the context in which they form. We combine the self-assembling properties of synthetic diblock copolymers and proteins to construct intricately ordered, three-dimensional polymer protein structures with the ultimate goal of forming nano-scale devices. This hybrid approach takes advantage of the capabilities of organic polymer chemistry to build ordered structures and the capabilities of genetic engineering to create proteins that are selective for inorganic or organic substrates. Here, microphase-separated block copolymers coupled with genetically engineered heat shock proteins are used to produce nano-scale patterning that maximizes the potential for both increased structural complexity and integrity.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bhardwaj, Anshul; Casjens, Sherwood R.; Cingolani, Gino, E-mail: gino.cingolani@jefferson.edu
2014-02-01
This study presents the crystal structure of a ∼320 Å long protein fiber generated by in-frame extension of its repeated helical coiled-coil core. Protein fibers are widespread in nature, but only a limited number of high-resolution structures have been determined experimentally. Unlike globular proteins, fibers are usually recalcitrant to form three-dimensional crystals, preventing single-crystal X-ray diffraction analysis. In the absence of three-dimensional crystals, X-ray fiber diffraction is a powerful tool to determine the internal symmetry of a fiber, but it rarely yields atomic resolution structural information on complex protein fibers. An 85-residue-long minimal coiled-coil repeat unit (MiCRU) was previously identified in the trimeric helical core of tail needle gp26, a fibrous protein emanating from the tail apparatus of the bacteriophage P22 virion. Here, evidence is provided that an MiCRU can be inserted in frame inside the gp26 helical core to generate a rationally extended fiber (gp26-2M) which, like gp26, retains a trimeric quaternary structure in solution. The 2.7 Å resolution crystal structure of this engineered fiber, which measures ∼320 Å in length and is only 20–35 Å wide, was determined. This structure, the longest for a trimeric protein fiber to be determined to such a high resolution, reveals the architecture of 22 consecutive trimerization heptads and provides a framework to decipher the structural determinants for protein fiber assembly, stability and flexibility.
SSEP: secondary structural elements of proteins
Shanthi, V.; Selvarani, P.; Kiran Kumar, Ch.; Mohire, C. S.; Sekar, K.
2003-01-01
SSEP is a comprehensive resource for accessing information related to the secondary structural elements present in the 25 and 90% non-redundant protein chains. The database contains 1771 protein chains from 1670 protein structures and 6182 protein chains from 5425 protein structures in 25 and 90% non-redundant protein chains, respectively. The current version provides information about the α-helical segments and β-strand fragments of varying lengths. In addition, it also contains the information about 3₁₀-helix, β- and ν-turns and hairpin loops. The free graphics program RASMOL has been interfaced with the search engine to visualize the three-dimensional structures of the user-queried secondary structural fragment. The database is updated regularly and is available through Bioinformatics web server at http://cluster.physics.iisc.ernet.in/ssep/ or http://144.16.71.148/ssep/. PMID:12824336
Exploring Human Diseases and Biological Mechanisms by Protein Structure Prediction and Modeling.
Wang, Juexin; Luttrell, Joseph; Zhang, Ning; Khan, Saad; Shi, NianQing; Wang, Michael X; Kang, Jing-Qiong; Wang, Zheng; Xu, Dong
2016-01-01
Protein structure prediction and modeling provide a tool for understanding protein functions by computationally constructing protein structures from amino acid sequences and analyzing them. With help from protein prediction tools and web servers, users can obtain the three-dimensional protein structure models and gain knowledge of functions from the proteins. In this chapter, we will provide several examples of such studies. As an example, structure modeling methods were used to investigate the relation between mutation-caused misfolding of protein and human diseases including epilepsy and leukemia. Protein structure prediction and modeling were also applied in nucleotide-gated channels and their interaction interfaces to investigate their roles in brain and heart cells. In molecular mechanism studies of plants, rice salinity tolerance mechanism was studied via structure modeling on crucial proteins identified by systems biology analysis; trait-associated protein-protein interactions were modeled, which sheds some light on the roles of mutations in soybean oil/protein content. In the age of precision medicine, we believe protein structure prediction and modeling will play more and more important roles in investigating biomedical mechanism of diseases and drug design.
The three-dimensional structure of aquaporin-1
NASA Astrophysics Data System (ADS)
Walz, Thomas; Hirai, Teruhisa; Murata, Kazuyoshi; Heymann, J. Bernard; Mitsuoka, Kaoru; Fujiyoshi, Yoshinori; Smith, Barbara L.; Agre, Peter; Engel, Andreas
1997-06-01
The entry and exit of water from cells is a fundamental process of life. Recognition of the high water permeability of red blood cells led to the proposal that specialized water pores exist in the plasma membrane. Expression in Xenopus oocytes and functional studies of an erythrocyte integral membrane protein of relative molecular mass 28,000 identified it as the mercury-sensitive water channel, aquaporin-1 (AQP1). Many related proteins, all belonging to the major intrinsic protein (MIP) family, are found throughout nature. AQP1 is a homotetramer containing four independent aqueous channels. When reconstituted into lipid bilayers, the protein forms two-dimensional lattices with a unit cell containing two tetramers in opposite orientation. Here we present the three-dimensional structure of AQP1 determined at 6 Å resolution by cryo-electron microscopy. Each AQP1 monomer has six tilted, bilayer-spanning α-helices which form a right-handed bundle surrounding a central density. These results, together with functional studies, provide a model that identifies the aqueous pore in the AQP1 molecule and indicates the organization of the tetrameric complex in the membrane.
Protein sectors: evolutionary units of three-dimensional structure
Halabi, Najeeb; Rivoire, Olivier; Leibler, Stanislas; Ranganathan, Rama
2011-01-01
Proteins display a hierarchy of structural features at primary, secondary, tertiary, and higher-order levels, an organization that guides our current understanding of their biological properties and evolutionary origins. Here, we reveal a structural organization distinct from this traditional hierarchy by statistical analysis of correlated evolution between amino acids. Applied to the S1A serine proteases, the analysis indicates a decomposition of the protein into three quasi-independent groups of correlated amino acids that we term “protein sectors”. Each sector is physically connected in the tertiary structure, has a distinct functional role, and constitutes an independent mode of sequence divergence in the protein family. Functionally relevant sectors are evident in other protein families as well, suggesting that they may be general features of proteins. We propose that sectors represent a structural organization of proteins that reflects their evolutionary histories. PMID:19703402
The use of experimental structures to model protein dynamics.
Katebi, Ataur R; Sankar, Kannan; Jia, Kejue; Jernigan, Robert L
2015-01-01
The number of solved protein structures submitted to the Protein Data Bank (PDB) has increased dramatically in recent years. For some specific proteins, this number is very high; for example, there are over 550 solved structures for HIV-1 protease, a protein that is essential for the life cycle of human immunodeficiency virus (HIV), which causes acquired immunodeficiency syndrome (AIDS) in humans. The large number of structures for the same protein and its variants includes a sample of different conformational states of the protein. A rich set of structures solved experimentally for the same protein has information buried within the dataset that can explain the functional dynamics and structural mechanism of the protein. To extract the dynamics information and functional mechanism from the experimental structures, this chapter focuses on two methods: Principal Component Analysis (PCA) and Elastic Network Models (ENM). PCA is a widely used statistical dimensionality reduction technique to classify and visualize high-dimensional data. ENMs, on the other hand, are a well-established, simple biophysical method for modeling the functionally important global motions of proteins. This chapter covers the basics of these two. Moreover, an improved ENM version that utilizes the variations found within a given set of structures for a protein is described. As a practical example, we have extracted the functional dynamics and mechanism of the HIV-1 protease dimeric structure by using a set of 329 PDB structures of this protein. We have described, step by step, how to select a set of protein structures, how to extract the needed information from the PDB files for PCA, how to extract the dynamics information using PCA, how to calculate ENM modes, how to measure the congruency between the dynamics computed from the principal components (PCs) and the ENM modes, and how to compute entropies using the PCs. We provide the computer programs or references to software tools to accomplish each step and show how to use these programs and tools. We also include computer programs to generate movies based on PCs and ENM modes and describe how to visualize them.
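As a rough illustration of the PCA step described above, the sketch below finds the dominant principal component of a small ensemble by building the covariance matrix and applying power iteration. It is not the chapter's own program: it assumes every structure has already been superimposed and flattened into one coordinate vector, and the function name and toy ensemble are hypothetical.

```python
import math


def principal_component(samples, n_iter=200):
    """Return (variance, unit vector) of the dominant principal component.

    samples: equal-length coordinate vectors, one per pre-aligned structure.
    """
    n, d = len(samples), len(samples[0])
    mean = [sum(s[k] for s in samples) / n for k in range(d)]
    centered = [[s[k] - mean[k] for k in range(d)] for s in samples]
    # Sample covariance matrix of the coordinates.
    cov = [[sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
           for i in range(d)]
    # Power iteration converges to the eigenvector with the largest eigenvalue.
    v = [1.0 / math.sqrt(d)] * d
    for _ in range(n_iter):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    variance = sum(v[i] * sum(cov[i][j] * v[j] for j in range(d))
                   for i in range(d))
    return variance, v


# Toy ensemble: five "structures" that move mostly along the (1, 1, 0) direction.
ensemble = [(-2.0, -2.0, 0.1), (-1.0, -1.0, -0.1), (0.0, 0.0, 0.0),
            (1.0, 1.0, -0.1), (2.0, 2.0, 0.1)]
variance, pc1 = principal_component(ensemble)
```

For real ensembles with thousands of coordinates, a linear algebra library would be the practical choice; plain power iteration is used here only to keep the sketch dependency-free.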
The Use of Experimental Structures to Model Protein Dynamics
Katebi, Ataur R.; Sankar, Kannan; Jia, Kejue; Jernigan, Robert L.
2014-01-01
The number of solved protein structures submitted to the Protein Data Bank (PDB) has increased dramatically in recent years. For some specific proteins, this number is very high; for example, there are over 550 solved structures for HIV-1 protease, a protein that is essential for the life cycle of human immunodeficiency virus (HIV), which causes acquired immunodeficiency syndrome (AIDS) in humans. The large number of structures for the same protein and its variants includes a sample of different conformational states of the protein. A rich set of structures solved experimentally for the same protein has information buried within the dataset that can explain the functional dynamics and structural mechanism of the protein. To extract the dynamics information and functional mechanism from the experimental structures, this chapter focuses on two methods: Principal Component Analysis (PCA) and Elastic Network Models (ENM). PCA is a widely used statistical dimensionality reduction technique to classify and visualize high-dimensional data. ENMs, on the other hand, are a well-established, simple biophysical method for modeling the functionally important global motions of proteins. This chapter covers the basics of these two. Moreover, an improved ENM version that utilizes the variations found within a given set of structures for a protein is described. As a practical example, we have extracted the functional dynamics and mechanism of the HIV-1 protease dimeric structure by using a set of 329 PDB structures of this protein. We have described, step by step, how to select a set of protein structures, how to extract the needed information from the PDB files for PCA, how to extract the dynamics information using PCA, how to calculate ENM modes, how to measure the congruency between the dynamics computed from the principal components (PCs) and the ENM modes, and how to compute entropies using the PCs. We provide the computer programs or references to software tools to accomplish each step and show how to use these programs and tools. We also include computer programs to generate movies based on PCs and ENM modes and describe how to visualize them. PMID:25330965
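The abstract does not say which elastic network variant the chapter uses, so the sketch below assumes the simplest one, a Gaussian Network Model, and only builds its connectivity (Kirchhoff) matrix from C-alpha coordinates. The 7 Å cutoff is a commonly used value taken here as an assumption, and the function name and toy chain are hypothetical.

```python
import math


def kirchhoff_matrix(ca_coords, cutoff=7.0):
    """Connectivity matrix of a Gaussian Network Model.

    Off-diagonal entries are -1 for residue pairs closer than `cutoff`
    (in Å) and 0 otherwise; each diagonal entry counts the contacts of
    that residue, so every row sums to zero.
    """
    n = len(ca_coords)
    gamma = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if math.dist(ca_coords[i], ca_coords[j]) <= cutoff:
                gamma[i][j] = gamma[j][i] = -1
                gamma[i][i] += 1
                gamma[j][j] += 1
    return gamma


# Toy chain: five C-alpha atoms on a line, 3.8 Å apart.
chain = [(3.8 * i, 0.0, 0.0) for i in range(5)]
gamma = kirchhoff_matrix(chain)
```

The slowest nonzero eigenvectors of this matrix are the global modes that would then be compared with the principal components of the experimental ensemble.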
Sinz, Andrea
2018-05-28
Structural mass spectrometry (MS) is gaining increasing importance for deriving valuable three-dimensional structural information on proteins and protein complexes, and it complements existing techniques, such as NMR spectroscopy and X-ray crystallography. Structural MS unites different MS-based techniques, such as hydrogen/deuterium exchange, native MS, ion-mobility MS, protein footprinting, and chemical cross-linking/MS, and it allows fundamental questions in structural biology to be addressed. In this Minireview, I will focus on the cross-linking/MS strategy. This method not only delivers tertiary structural information on proteins, but is also increasingly being used to decipher protein interaction networks, both in vitro and in vivo. Cross-linking/MS is currently one of the most promising MS-based approaches to derive structural information on very large and transient protein assemblies and intrinsically disordered proteins. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
ERIC Educational Resources Information Center
Roy, Urmi
2016-01-01
This work presents a three-dimensional (3D) modeling exercise for undergraduate students in chemistry and health sciences disciplines, focusing on a protein-group linked to immune system regulation. Specifically, the exercise involves molecular modeling and structural analysis of tumor necrosis factor (TNF) proteins, both wild type and mutant. The…
Grégoire, Catherine; Marco, Sergio; Thimonier, Jean; Duplan, Laure; Laurine, Emmanuelle; Chauvin, Jean-Paul; Michel, Bernard; Peyrot, Vincent; Verdier, Jean-Michel
2001-01-01
Neurodegenerative diseases are characterized by the presence of filamentous aggregates of proteins. We previously established that lithostathine is a protein overexpressed in the pre-clinical stages of Alzheimer’s disease. Furthermore, it is present in the pathognomonic lesions associated with Alzheimer’s disease. After self-proteolysis, the N-terminally truncated form of lithostathine leads to the formation of fibrillar aggregates. Here we observed using atomic force microscopy that these aggregates consisted of a network of protofibrils, each of which had a twisted appearance. Electron microscopy and image analysis showed that this twisted protofibril has a quadruple helical structure. Three-dimensional X-ray structural data and the results of biochemical experiments showed that when forming a protofibril, lithostathine was first assembled via lateral hydrophobic interactions into a tetramer. Each tetramer then linked up with another tetramer as the result of longitudinal electrostatic interactions. All these results were used to build a structural model for the lithostathine protofibril called the quadruple-helical filament (QHF-litho). In conclusion, lithostathine strongly resembles the prion protein in its dramatic proteolysis and amyloid proteins in its ability to form fibrils. PMID:11432819
Encounter complexes and dimensionality reduction in protein–protein association
Kozakov, Dima; Li, Keyong; Hall, David R; Beglov, Dmitri; Zheng, Jiefu; Vakili, Pirooz; Schueler-Furman, Ora; Paschalidis, Ioannis Ch; Clore, G Marius; Vajda, Sandor
2014-01-01
An outstanding challenge has been to understand the mechanism whereby proteins associate. We report here the results of exhaustively sampling the conformational space in protein–protein association using a physics-based energy function. The agreement between experimental intermolecular paramagnetic relaxation enhancement (PRE) data and the PRE profiles calculated from the docked structures shows that the method captures both specific and non-specific encounter complexes. To explore the energy landscape in the vicinity of the native structure, the nonlinear manifold describing the relative orientation of two solid bodies is projected onto a Euclidean space in which the shape of low energy regions is studied by principal component analysis. Results show that the energy surface is canyon-like, with a smooth funnel within a two dimensional subspace capturing over 75% of the total motion. Thus, proteins tend to associate along preferred pathways, similar to sliding of a protein along DNA in the process of protein-DNA recognition. DOI: http://dx.doi.org/10.7554/eLife.01370.001 PMID:24714491
Ruller, Roberto; Silva-Rocha, Rafael; Silva, Artur; Cruz Schneider, Maria Paula; Ward, Richard John
2011-01-01
Protein engineering is a powerful tool, which correlates protein structure with specific functions, both in applied biotechnology and in basic research. Here, we present a practical teaching course for engineering the green fluorescent protein (GFP) from Aequorea victoria by a random mutagenesis strategy using error-prone polymerase chain reaction. Screening of bacterial colonies transformed with random mutant libraries identified GFP variants with increased fluorescence yields. Mapping the three-dimensional structure of these mutants demonstrated how alterations in structural features such as the environment around the fluorophore and properties of the protein surface can influence functional properties such as the intensity of fluorescence and protein solubility. Copyright © 2011 Wiley Periodicals, Inc.
Tonal Interface to MacroMolecules (TIMMol): A Textual and Tonal Tool for Molecular Visualization
ERIC Educational Resources Information Center
Cordes, Timothy J.; Carlson, C. Britt; Forest, Katrina T.
2008-01-01
We developed the three-dimensional visualization software, Tonal Interface to MacroMolecules or TIMMol, for studying atomic coordinates of protein structures. Key features include audio tones indicating x, y, z location, identification of the cursor location in one-dimensional and three-dimensional space, textual output that can be easily linked…
Protein space: a natural method for realizing the nature of protein universe.
Yu, Chenglong; Deng, Mo; Cheng, Shiu-Yuen; Yau, Shek-Chung; He, Rong L; Yau, Stephen S-T
2013-02-07
Current methods cannot tell us concretely what the nature of the protein universe is. They are based on different models of amino acid substitution and on multiple sequence alignment, which is an NP-hard problem and requires manual intervention. Protein structural analysis also gives a direction for mapping the protein universe. Unfortunately, only a minuscule fraction of proteins' 3-dimensional structures are currently known. Furthermore, the phylogenetic tree representations are not unique for any existing tree construction method. Here we develop a novel method to realize the nature of the protein universe. We show that the protein universe can be realized as a protein space in 60-dimensional Euclidean space using a distance based on a normalized distribution of amino acids. Every protein is in one-to-one correspondence with a point in protein space, where proteins with similar properties stay close together. Thus the distance between two points in protein space represents the biological distance of the corresponding two proteins. We also propose a natural graphical representation for inferring phylogenies. The representation is natural and unique based on the biological distances of proteins in protein space. This will solve the fundamental question of how proteins are distributed in the protein universe. Copyright © 2012 Elsevier Ltd. All rights reserved.
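The abstract states only that each protein maps to a point in 60-dimensional Euclidean space built from a normalized distribution of amino acids. The sketch below assumes one plausible construction: for each of the 20 amino acids, its count, its mean position in the sequence, and a normalized second central moment of its positions (3 × 20 = 60 numbers). The exact normalization and all names are assumptions of this example, not details confirmed by the source.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def protein_vector(seq):
    """Map a sequence to 60 numbers: counts, mean positions, normalized moments."""
    n = len(seq)
    counts, means, moments = [], [], []
    for aa in AMINO_ACIDS:
        positions = [i + 1 for i, c in enumerate(seq) if c == aa]
        n_k = len(positions)
        mu = sum(positions) / n_k if n_k else 0.0
        # Second central moment, scaled by count and sequence length (assumed).
        d2 = sum((p - mu) ** 2 for p in positions) / (n_k * n) if n_k else 0.0
        counts.append(float(n_k))
        means.append(mu)
        moments.append(d2)
    return counts + means + moments


def protein_distance(seq_a, seq_b):
    """Euclidean distance between two sequences in the 60-dimensional space."""
    return math.dist(protein_vector(seq_a), protein_vector(seq_b))


vec = protein_vector("ACCA")
```

Because the vector is computed directly from the sequence, no alignment step is needed, which matches the motivation given in the abstract.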
Weyand, Simone; Kefala, Georgia; Svergun, Dmitri I; Weiss, Manfred S
2009-09-01
The three-dimensional structure of the enzyme diaminopimelate decarboxylase from Mycobacterium tuberculosis has been determined in a new crystal form and refined to a resolution of 2.33 Å. The monoclinic crystals contain one tetramer exhibiting D2 symmetry in the asymmetric unit. The tetramer exhibits a donut-like structure with a hollow interior. All four active sites are accessible only from the interior of the tetrameric assembly. Small-angle X-ray scattering indicates that in solution the predominant oligomeric species of the protein is a dimer, but also that higher oligomers exist at higher protein concentrations. The observed scattering data are best explained by assuming a dimer-tetramer equilibrium with about 7% tetramers present in solution. Consequently, at the elevated protein concentrations in the crowded environment inside the cell the observed tetramer may constitute the biologically relevant functional unit of the enzyme.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Coughlan, H. D.; Darmanin, C.; Kirkwood, H. J.
2016-03-14
Three-dimensional imaging of protein crystals during X-ray diffraction experiments opens up a range of possibilities for optimising crystal quality and gaining new insights into the fundamental processes that drive radiation damage. Obtaining this information at the appropriate lengthscales, however, is extremely challenging. One approach that has been recently demonstrated as a promising avenue for characterising the size and shape of protein crystals at nanometre lengthscales is Bragg Coherent Diffractive Imaging (BCDI). BCDI is a recently developed technique that is able to recover the phase of the continuous diffraction intensity signal around individual Bragg peaks. When data are collected at multiple points on a rocking curve, a Reciprocal Space Map (RSM) can be assembled and then inverted using BCDI to obtain a three-dimensional image of the crystal. The first demonstration of two-dimensional BCDI of protein crystals was reported by Boutet et al.; recently this work was extended to the study of radiation damage of micron-sized crystals. Here we present the first three-dimensional reconstructions of a Lysozyme protein crystal using BCDI. The results are validated against RSM and TEM data and have implications for both radiation damage studies and for developing new approaches to structure retrieval from micron-sized protein crystals.
Predicting the helix packing of globular proteins by self-correcting distance geometry.
Mumenthaler, C; Braun, W
1995-05-01
A new self-correcting distance geometry method for predicting the three-dimensional structure of small globular proteins was assessed with a test set of 8 helical proteins. With the knowledge of the amino acid sequence and the helical segments, our completely automated method calculated the correct backbone topology of six proteins. The accuracy of the predicted structures ranged from 2.3 Å to 3.1 Å for the helical segments compared to the experimentally determined structures. For two proteins, the predicted constraints were not restrictive enough to yield a conclusive prediction. The method can be applied to all small globular proteins, provided the secondary structure is known from NMR analysis or can be predicted with high reliability.
Chemical cross-linking and native mass spectrometry: A fruitful combination for structural biology
Sinz, Andrea; Arlt, Christian; Chorev, Dror; Sharon, Michal
2015-01-01
Mass spectrometry (MS) is becoming increasingly popular in the field of structural biology for analyzing protein three-dimensional-structures and for mapping protein–protein interactions. In this review, the specific contributions of chemical crosslinking and native MS are outlined to reveal the structural features of proteins and protein assemblies. Both strategies are illustrated based on the examples of the tetrameric tumor suppressor protein p53 and multisubunit vinculin-Arp2/3 hybrid complexes. We describe the distinct advantages and limitations of each technique and highlight synergistic effects when both techniques are combined. Integrating both methods is especially useful for characterizing large protein assemblies and for capturing transient interactions. We also point out the future directions we foresee for a combination of in vivo crosslinking and native MS for structural investigation of intact protein assemblies. PMID:25970732
Bhaskara, Ramachandra M; Padhi, Amrita; Srinivasan, Narayanaswamy
2014-07-01
With the preponderance of multidomain proteins in eukaryotic genomes, it is essential to recognize the constituent domains and their functions. Often function involves communications across the domain interfaces, and the knowledge of the interacting sites is essential to our understanding of the structure-function relationship. Using evolutionary information extracted from homologous domains in at least two diverse domain architectures (single and multidomain), we predict the interface residues corresponding to domains from the two-domain proteins. We also use information from the three-dimensional structures of individual domains of two-domain proteins to train naïve Bayes classifier model to predict the interfacial residues. Our predictions are highly accurate (∼85%) and specific (∼95%) to the domain-domain interfaces. This method is specific to multidomain proteins which contain domains in at least more than one protein architectural context. Using predicted residues to constrain domain-domain interaction, rigid-body docking was able to provide us with accurate full-length protein structures with correct orientation of domains. We believe that these results can be of considerable interest toward rational protein and interaction design, apart from providing us with valuable information on the nature of interactions. © 2013 Wiley Periodicals, Inc.
NASA Astrophysics Data System (ADS)
Finkelstein, A. V.; Galzitskaya, O. V.
2004-04-01
Protein physics is grounded on three fundamental experimental facts: protein, this long heteropolymer, has a well defined compact three-dimensional structure; this structure can spontaneously arise from the unfolded protein chain in appropriate environment; and this structure is separated from the unfolded state of the chain by the “all-or-none” phase transition, which ensures robustness of protein structure and therefore of its action. The aim of this review is to consider modern understanding of physical principles of self-organization of protein structures and to overview such important features of this process, as finding out the unique protein structure among zillions alternatives, nucleation of the folding process and metastable folding intermediates. Towards this end we will consider the main experimental facts and simple, mostly phenomenological theoretical models. We will concentrate on relatively small (single-domain) water-soluble globular proteins (whose structure and especially folding are much better studied and understood than those of large or membrane and fibrous proteins) and consider kinetic and structural aspects of transition of initially unfolded protein chains into their final solid (“native”) 3D structures.
Comparative Protein Structure Modeling Using MODELLER
Webb, Benjamin; Sali, Andrej
2016-01-01
Comparative protein structure modeling predicts the three-dimensional structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and how to use the ModBase database of such models, and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described. PMID:27322406
A Protein in the palm of your hand through augmented reality.
Berry, Colin; Board, Jason
2014-01-01
Understanding of proteins and other biological macromolecules must be based on an appreciation of their 3-dimensional shape and the fine details of their structure. Conveying these details in a clear and stimulating fashion can present challenges using conventional approaches and 2-dimensional monitors and projectors. Here we describe a method for the production of 3-D interactive images of protein structures that can be manipulated in real time through the use of augmented reality software. Users first see a real-time image of themselves using the computer's camera, then, when they hold up a trigger image, a model of a molecule appears automatically in the video. This model rotates and translates in space in response to movements of the trigger card. The system described has been optimized to allow customization for the display of user-selected structures to create engaging, educational visualizations to explore 3-D structures. Copyright © 2014 The International Union of Biochemistry and Molecular Biology.
Prototype Protein-Based Three-Dimensional Memory
2003-01-01
9 Figure 3.2: Hypothetical mutational landscape ...to explore the genetic mutational landscape of a protein without any a priori knowledge of structure- function relationships. As such, it explores...native organism, Halobacterium salinarum, the protein acts as a photosynthetic sunlight to chemical energy transducer. Through several billion years of
Examining a Thermodynamic Order Parameter of Protein Folding.
Chong, Song-Ho; Ham, Sihyun
2018-05-08
Dimensionality reduction with a suitable choice of order parameters or reaction coordinates is commonly used for analyzing high-dimensional time-series data generated by atomistic biomolecular simulations. So far, geometric order parameters, such as the root mean square deviation, fraction of native amino acid contacts, and collective coordinates that best characterize rare or large conformational transitions, have been prevailing in protein folding studies. Here, we show that the solvent-averaged effective energy, which is a thermodynamic quantity but unambiguously defined for individual protein conformations, serves as a good order parameter of protein folding. This is illustrated through the application to the folding-unfolding simulation trajectory of villin headpiece subdomain. We rationalize the suitability of the effective energy as an order parameter by the funneledness of the underlying protein free energy landscape. We also demonstrate that an improved conformational space discretization is achieved by incorporating the effective energy. The most distinctive feature of this thermodynamic order parameter is that it works in pointing to near-native folded structures even when the knowledge of the native structure is lacking, and the use of the effective energy will also find applications in combination with methods of protein structure prediction.
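For contrast with the thermodynamic order parameter, one of the geometric order parameters named above, the fraction of native contacts, can be sketched as follows. The contact definition used here (one coordinate per residue, an 8 Å cutoff, and a minimum sequence separation of three residues) is an assumption for illustration and is not taken from the paper.

```python
from math import dist


def contact_pairs(coords, cutoff=8.0, min_separation=3):
    """Return residue index pairs (i, j) whose coordinates lie within `cutoff`.

    `coords` holds one (x, y, z) tuple per residue, for example C-alpha atoms.
    Pairs closer than `min_separation` along the chain are ignored.
    """
    n = len(coords)
    return {
        (i, j)
        for i in range(n)
        for j in range(i + min_separation, n)
        if dist(coords[i], coords[j]) <= cutoff
    }


def fraction_native_contacts(native, conformation, cutoff=8.0, min_separation=3):
    """Fraction of the native contacts that are also present in `conformation`."""
    native_pairs = contact_pairs(native, cutoff, min_separation)
    if not native_pairs:
        raise ValueError("native structure has no contacts under this definition")
    current_pairs = contact_pairs(conformation, cutoff, min_separation)
    return len(native_pairs & current_pairs) / len(native_pairs)
```

This quantity cannot be evaluated without the native coordinates, which is the limitation the abstract says the effective energy avoids.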
Advancements of two dimensional correlation spectroscopy in protein researches
NASA Astrophysics Data System (ADS)
Tao, Yanchun; Wu, Yuqing; Zhang, Liping
2018-05-01
Developments in the application of two-dimensional correlation spectroscopy (2DCOS) to protein studies are discussed, with a focus on the past two decades. The use of 2DCOS in combination with various analytical techniques in protein studies is summarized. The emphasis is on vibrational spectroscopic techniques including IR, NIR, Raman and Raman optical activity (ROA), as well as vibrational circular dichroism (VCD) and fluorescence spectroscopy. In addition, some new developments, such as hetero-spectral 2DCOS, moving-window correlation, and model-based correlation, are also reviewed for their utility in investigating the secondary structure, denaturation, folding and unfolding of proteins. Finally, the new possibilities and challenges of 2DCOS in protein research are highlighted.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Timofeev, V. I.; Abramchik, Yu. A.; Zhukhlistova, N. E.
2016-01-15
Phosphoribosyl pyrophosphate synthetase from Escherichia coli was cloned, purified, and crystallized. Single crystals of the enzyme were grown under microgravity. The X-ray diffraction data set was collected at the Spring-8 synchrotron facility and used to determine the three-dimensional structure of the enzyme by the molecular-replacement method at 2.71 Å resolution. The active and regulatory sites in the molecule of E. coli phosphoribosyl pyrophosphate synthetase were revealed by comparison with the homologous protein from Bacillus subtilis, the structure of which was determined in a complex with functional ligands. The conformations of polypeptide-chain fragments surrounding and composing the active and regulatory sites were shown to be identical in both proteins.
Reddy, Jithender G; Kumar, Dinesh; Hosur, Ramakrishna V
2015-02-01
Protein NMR spectroscopy has expanded dramatically over the last decade into a powerful tool for the study of their structure, dynamics, and interactions. The primary requirement for all such investigations is sequence-specific resonance assignment. The demand now is to obtain this information as rapidly as possible and in all types of protein systems, stable/unstable, soluble/insoluble, small/big, structured/unstructured, and so on. In this context, we introduce here two reduced dimensionality experiments – (3,2)D-hNCOcanH and (3,2)D-hNcoCAnH – which enhance the previously described 2D NMR-based assignment methods quite significantly. Both the experiments can be recorded in just about 2-3 h each and hence would be of immense value for high-throughput structural proteomics and drug discovery research. The applicability of the method has been demonstrated using alpha-helical bovine apo calbindin-D9k P43M mutant (75 aa) protein. Automated assignment of this data using AUTOBA has been presented, which enhances the utility of these experiments. The backbone resonance assignments so derived are utilized to estimate secondary structures and the backbone fold using Web-based algorithms. Taken together, we believe that the method and the protocol proposed here can be used for routine high-throughput structural studies of proteins. Copyright © 2014 John Wiley & Sons, Ltd.
The PYRIN domain: A member of the death domain-fold superfamily
Fairbrother, Wayne J.; Gordon, Nathaniel C.; Humke, Eric W.; O'Rourke, Karen M.; Starovasnik, Melissa A.; Yin, Jian-Ping; Dixit, Vishva M.
2001-01-01
PYRIN domains were identified recently as putative protein–protein interaction domains at the N-termini of several proteins thought to function in apoptotic and inflammatory signaling pathways. The ∼95 residue PYRIN domains have no statistically significant sequence homology to proteins with known three-dimensional structure. Using secondary structure prediction and potential-based fold recognition methods, however, the PYRIN domain is predicted to be a member of the six-helix bundle death domain-fold superfamily that includes death domains (DDs), death effector domains (DEDs), and caspase recruitment domains (CARDs). Members of the death domain-fold superfamily are well established mediators of protein–protein interactions found in many proteins involved in apoptosis and inflammation, indicating further that the PYRIN domains serve a similar function. An homology model of the PYRIN domain of CARD7/DEFCAP/NAC/NALP1, a member of the Apaf-1/Ced-4 family of proteins, was constructed using the three-dimensional structures of the FADD and p75 neurotrophin receptor DDs, and of the Apaf-1 and caspase-9 CARDs, as templates. Validation of the model using a variety of computational techniques indicates that the fold prediction is consistent with the sequence. Comparison of a circular dichroism spectrum of the PYRIN domain of CARD7/DEFCAP/NAC/NALP1 with spectra of several proteins known to adopt the death domain-fold provides experimental support for the structure prediction. PMID:11514682
ERIC Educational Resources Information Center
Bethel, Casey M.; Lieberman, Raquel L.
2014-01-01
Here we present a multidisciplinary educational unit intended for general, advanced placement, or international baccalaureate-level high school science, focused on the three-dimensional structure of proteins and their connection to function and disease. The lessons are designed within the framework of the Next Generation Science Standards to make…
Nune, K C; Kumar, A; Murr, L E; Misra, R D K
2016-02-01
Three-dimensional cellular scaffolds are receiving significant attention in bone tissue engineering to treat segmental bone defects. However, there are indications of lack of significant osteoinductive ability of three-dimensional cellular scaffolds. In this regard, the objective of the study is to elucidate the interplay between bone morphogenetic protein (BMP-2) and osteoblast functions on 3D mesh structures with different porosities and pore size that were fabricated by electron beam melting. Self-assembled dendritic microstructure with interconnected cellular-type morphology of BMP-2 on 3D scaffolds stimulated osteoblast functions including adhesion, proliferation, and mineralization, with prominent effect on 2-mm mesh. Furthermore, immunofluorescence studies demonstrated higher density and viability of osteoblasts on lower porosity mesh structure (2 mm) as compared to 3- and 4-mm mesh structures. Enhanced filopodia cellular extensions with extensive cell spreading was observed on BMP-2 treated mesh structures, a behavior that is attributed to the unique self-assembled structure of BMP-2 that effectively communicates with the cells. The study underscores the potential of BMP-2 in imparting osteoinductive capability to the 3D printed scaffolds. © 2015 Wiley Periodicals, Inc.
NASA Astrophysics Data System (ADS)
De Marco, Luigi; Haky, Andrew; Tokmakoff, Andrei
Two-dimensional infrared (2D IR) spectroscopy has proven itself an indispensable tool for studying molecular dynamics and intermolecular interactions on ultrafast timescales. Using a novel source of broadband mid-IR pulses, we have collected 2D IR spectra of protein films at varying levels of hydration. With 2D IR, we can directly observe coupling between water's motions and the protein's. Protein films provide us with the ability to discriminate hydration waters from bulk water and thus give us access to studying water dynamics along the protein backbone, fluctuations in the protein structure, and the interplay between the molecular dynamics of the two. We present two representative protein films: poly-L-proline (PLP) and hen egg-white lysozyme (HEWL). Having no N-H groups, PLP allows us to look at water dynamics without interference from resonant energy transfer between the protein N-H stretch and the water O-H stretch. We conclude that at low hydration levels water-protein interactions dominate, and the water's dynamics are tied to those of the protein. In HEWL films, we take advantage of the robust secondary structure to partially deuterate the film, allowing us to spectrally distinguish the protein core from the exterior. From this, we show that resonant energy transfer to water provides an effective means of dissipating excess energy within the protein, while maintaining the structure. These methods are general and can easily be extended to studying specific protein-water interactions.
How Molecular Size Impacts RMSD Applications in Molecular Dynamics Simulations.
Sargsyan, Karen; Grauffel, Cédric; Lim, Carmay
2017-04-11
The root-mean-square deviation (RMSD) is a similarity measure widely used in analysis of macromolecular structures and dynamics. As increasingly larger macromolecular systems are being studied, dimensionality effects such as the "curse of dimensionality" (a diminishing ability to discriminate pairwise differences between conformations with increasing system size) may exist and significantly impact RMSD-based analyses. For such large bimolecular systems, whether the RMSD or other alternative similarity measures might suffer from this "curse" and lose the ability to discriminate different macromolecular structures had not been explicitly addressed. Here, we show such dimensionality effects for both weighted and nonweighted RMSD schemes. We also provide a mechanism for the emergence of the "curse of dimensionality" for RMSD from the law of large numbers by showing that the conformational distributions from which RMSDs are calculated become increasingly similar as the system size increases. Our findings suggest the use of weighted RMSD schemes for small proteins (less than 200 residues) and nonweighted RMSD for larger proteins when analyzing molecular dynamics trajectories.
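The weighted and nonweighted RMSD schemes compared above can be sketched as a single function. This is a minimal illustration that assumes the two conformations have already been superimposed and contain the same atoms in the same order; the weighting scheme itself is left to the caller, since the abstract does not specify one.

```python
from math import sqrt


def rmsd(coords_a, coords_b, weights=None):
    """Root-mean-square deviation between two pre-superimposed conformations.

    coords_a, coords_b: equal-length sequences of (x, y, z) tuples.
    weights: optional per-atom weights; None gives the nonweighted RMSD.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("conformations must be non-empty and of equal length")
    if weights is None:
        weights = [1.0] * len(coords_a)
    if len(weights) != len(coords_a):
        raise ValueError("one weight per atom is required")
    total_weight = sum(weights)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    # Weighted sum of squared per-atom displacements.
    weighted_sq = sum(
        w * sum((p - q) ** 2 for p, q in zip(a, b))
        for w, a, b in zip(weights, coords_a, coords_b)
    )
    return sqrt(weighted_sq / total_weight)
```

With `weights=None` every atom contributes equally; passing larger weights for selected atoms makes the measure more sensitive to displacements of those atoms.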
Casadei, Cecilia M.; Tsai, Ching-Ju; Barty, Anton; ...
2018-01-01
Previous proof-of-concept measurements on single-layer two-dimensional membrane-protein crystals performed at X-ray free-electron lasers (FELs) have demonstrated that the collection of meaningful diffraction patterns, which is not possible at synchrotrons because of radiation-damage issues, is feasible. Here, the results obtained from the analysis of a thousand single-shot, room-temperature X-ray FEL diffraction images from two-dimensional crystals of a bacteriorhodopsin mutant are reported in detail. The high redundancy in the measurements boosts the intensity signal-to-noise ratio, so that the values of the diffracted intensities can be reliably determined down to the detector-edge resolution of 4 Å. The results show that two-dimensional serial crystallography at X-ray FELs is a suitable method to study membrane proteins to near-atomic length scales at ambient temperature. The method presented here can be extended to pump–probe studies of optically triggered structural changes on submillisecond timescales in two-dimensional crystals, which allow functionally relevant large-scale motions that may be quenched in three-dimensional crystals.
Structural properties of matrix metalloproteinases.
Bode, W; Fernandez-Catalan, C; Tschesche, H; Grams, F; Nagase, H; Maskos, K
1999-04-01
Matrix metalloproteinases (MMPs) are involved in extracellular matrix degradation. Their proteolytic activity must be precisely regulated by their endogenous protein inhibitors, the tissue inhibitors of metalloproteinases (TIMPs). Disruption of this balance results in serious diseases such as arthritis, tumour growth and metastasis. Knowledge of the tertiary structures of the proteins involved is crucial for understanding their functional properties and interference with associated dysfunctions. Within the last few years, several three-dimensional MMP and MMP-TIMP structures became available, showing the domain organization, polypeptide fold and main specificity determinants. Complexes of the catalytic MMP domains with various synthetic inhibitors enabled the structure-based design and improvement of high-affinity ligands, which might be elaborated into drugs. A multitude of reviews surveying work done on all aspects of MMPs have appeared in recent years, but none of them has focused on the three-dimensional structures. This review was written to close the gap.
A global optimization algorithm for protein surface alignment
2010-01-01
Background A relevant problem in drug design is the comparison and recognition of protein binding sites. Binding sites recognition is generally based on geometry often combined with physico-chemical properties of the site since the conformation, size and chemical composition of the protein surface are all relevant for the interaction with a specific ligand. Several matching strategies have been designed for the recognition of protein-ligand binding sites and of protein-protein interfaces but the problem cannot be considered solved. Results In this paper we propose a new method for local structural alignment of protein surfaces based on continuous global optimization techniques. Given the three-dimensional structures of two proteins, the method finds the isometric transformation (rotation plus translation) that best superimposes active regions of two structures. We draw our inspiration from the well-known Iterative Closest Point (ICP) method for three-dimensional (3D) shapes registration. Our main contribution is in the adoption of a controlled random search as a more efficient global optimization approach along with a new dissimilarity measure. The reported computational experience and comparison show viability of the proposed approach. Conclusions Our method performs well to detect similarity in binding sites when this in fact exists. In the future we plan to do a more comprehensive evaluation of the method by considering large datasets of non-redundant proteins and applying a clustering technique to the results of all comparisons to classify binding sites. PMID:20920230
Morris, Garrett M; Lim-Wilby, Marguerita
2008-01-01
Molecular docking is a key tool in structural molecular biology and computer-assisted drug design. The goal of ligand-protein docking is to predict the predominant binding mode(s) of a ligand with a protein of known three-dimensional structure. Successful docking methods search high-dimensional spaces effectively and use a scoring function that correctly ranks candidate dockings. Docking can be used to perform virtual screening on large libraries of compounds, rank the results, and propose structural hypotheses of how the ligands inhibit the target, which is invaluable in lead optimization. The setting up of the input structures for the docking is just as important as the docking itself, and analyzing the results of stochastic search methods can sometimes be unclear. This chapter discusses the background and theory of molecular docking software, and covers the usage of some of the most-cited docking software.
Structure of synaptophysin: a hexameric MARVEL-domain channel protein.
Arthur, Christopher P; Stowell, Michael H B
2007-06-01
Synaptophysin I (SypI) is an archetypal member of the MARVEL-domain family of integral membrane proteins and one of the first synaptic vesicle proteins to be identified and cloned. Most all MARVEL-domain proteins are involved in membrane apposition and vesicle-trafficking events, but their precise role in these processes is unclear. We have purified mammalian SypI and determined its three-dimensional (3D) structure by using electron microscopy and single-particle 3D reconstruction. The hexameric structure resembles an open basket with a large pore and tenuous interactions within the cytosolic domain. The structure suggests a model for Synaptophysin's role in fusion and recycling that is regulated by known interactions with the SNARE machinery. This 3D structure of a MARVEL-domain protein provides a structural foundation for understanding the role of these important proteins in a variety of biological processes.
Making the Bend: DNA Tertiary Structure and Protein-DNA Interactions
Harteis, Sabrina; Schneider, Sabine
2014-01-01
DNA structure functions as an overlapping code to the DNA sequence. Rapid progress has been made in understanding the role of DNA structure in gene regulation, DNA damage recognition and genome stability. The three-dimensional structure of both proteins and DNA plays a crucial role in their specific interaction, and proteins can recognise the chemical signature of the DNA sequence (“base readout”) as well as the intrinsic DNA structure (“shape recognition”). These recognition mechanisms do not exist in isolation but, depending on the individual interaction partners, are combined to various extents. The driving force for the interaction between protein and DNA remains the unique thermodynamics of each individual DNA-protein pair. In this review we focus on the structures and conformations adopted by DNA, both influenced by and influencing the specific interaction with the corresponding protein binding partner, as well as their underlying thermodynamics. PMID:25026169
Using Symmetry to Design Self-Assembling Protein Cages and Nanomaterials on the Mid-Nanometer Scale
NASA Astrophysics Data System (ADS)
Yeates, Todd
Self-assembling molecular structures having diverse cellular functions are widespread in nature. Some of the largest and most sophisticated types are built from many copies of the same or similar protein molecules arranged following principles of symmetry. A long-standing engineering goal has been to design novel protein molecules to self-assemble into geometrically specific structures similar to the extraordinary structures that have evolved in Nature. Practical routes to this goal have been developed by using ideas in symmetry to articulate the minimum design requirements for achieving various types of symmetric architectures, including cages, extended two-dimensional layers, and three-dimensional crystalline materials. The key requirement is that two distinct self-associating interfaces, each conferring one element of rotational symmetry, have to be engineered into the protein molecule (or molecules), following particular geometric specifications. The main principle is that combining two separate symmetry elements into a single molecular entity produces a molecule that necessarily assembles into an architecture dictated by a symmetry group that is the product of the two simpler contributing symmetries. Recent experiments have demonstrated success using a variety of symmetry-based strategies. Strategic variations are emerging that differ from each other with respect to biophysical features such as flexibility vs rigidity in the assembled structures, and with respect to design aspects such as whether the protein interfaces are inherited from natural oligomeric proteins or are designed de novo by advanced computational methods. The success of these strategies has been proven by determining crystal structures of several giant, self-assembling protein cages and clusters (10-25 nm in diameter), created by design. 
The ability to create sophisticated supramolecular structures from designed protein subunits opens the way to broad applications in synthetic biology and nanotechnology.
New paradigm in ankyrin repeats: Beyond protein-protein interaction module.
Islam, Zeyaul; Nagampalli, Raghavendra Sashi Krishna; Fatima, Munazza Tamkeen; Ashraf, Ghulam Md
2018-04-01
Classically, ankyrin repeat (ANK) proteins are built from tandems of two or more repeats and form curved solenoid structures that are associated with protein-protein interactions. The ankyrin repeat is a short, widespread structural motif of around 33 amino acids that occurs in tandem, has a canonical helix-loop-helix fold, and is found alone or in combination with other domains. The multiplicity of structural patterns enables these proteins to form assemblies of diverse sizes, as required for their multiple binding and structural roles. Three-dimensional structures of these repeats determined to date reveal a degree of structural variability that translates into the considerable functional versatility of this protein superfamily. Recent work on ANK proteins has provided novel structural information, especially on protein-lipid, protein-sugar and protein-protein interactions. Self-assembly of these repeats was also shown to prevent the associated protein from forming filaments. In this review, we summarize the latest findings and how the new structural information has increased our understanding of the structural determinants of ANK proteins. We discuss the latest findings on how these proteins participate in various interactions to diversify the ANK roles in numerous biological processes, and explore the emerging and evolving field of designer ankyrins and its framework for protein engineering, with emphasis on biotechnological applications. Copyright © 2017 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Ploetz, Evelyn; Lerner, Eitan; Husada, Florence; Roelfs, Martin; Chung, Sangyoon; Hohlbein, Johannes; Weiss, Shimon; Cordes, Thorben
2016-09-01
Advanced microscopy methods allow obtaining information on (dynamic) conformational changes in biomolecules via measuring a single molecular distance in the structure. It is, however, extremely challenging to capture the full depth of a three-dimensional biochemical state, binding-related structural changes or conformational cross-talk in multi-protein complexes using one-dimensional assays. In this paper we address this fundamental problem by extending the standard molecular ruler based on Förster resonance energy transfer (FRET) into a two-dimensional assay via its combination with protein-induced fluorescence enhancement (PIFE). We show that donor brightness (via PIFE) and energy transfer efficiency (via FRET) can simultaneously report on e.g., the conformational state of double stranded DNA (dsDNA) following its interaction with unlabelled proteins (BamHI, EcoRV, and T7 DNA polymerase gp5/trx). The PIFE-FRET assay uses established labelling protocols and single molecule fluorescence detection schemes (alternating-laser excitation, ALEX). Besides quantitative studies of PIFE and FRET ruler characteristics, we outline possible applications of ALEX-based PIFE-FRET for single-molecule studies with diffusing and immobilized molecules. Finally, we study transcription initiation and scrunching of E. coli RNA-polymerase with PIFE-FRET and provide direct evidence for the physical presence and vicinity of the polymerase that causes structural changes and scrunching of the transcriptional DNA bubble. PMID:27641327
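The "molecular ruler" mentioned in the abstract above rests on the standard Förster relation between transfer efficiency and donor-acceptor distance, E = 1 / (1 + (r/R0)^6). A minimal Python sketch of that relation follows. The function names and the Förster radius of 5.0 nm are illustrative assumptions, not values taken from the paper, and the PIFE dimension of the assay is not modelled here.

```python
def fret_efficiency(r_nm, r0_nm):
    """Standard Förster relation: E = 1 / (1 + (r / R0)^6)."""
    if r_nm < 0 or r0_nm <= 0:
        raise ValueError("distance must be >= 0 and R0 must be > 0")
    return 1.0 / (1.0 + (r_nm / r0_nm) ** 6)


def distance_from_efficiency(efficiency, r0_nm):
    """Invert the Förster relation to recover r from a measured E (0 < E < 1)."""
    if not 0.0 < efficiency < 1.0:
        raise ValueError("efficiency must lie strictly between 0 and 1")
    return r0_nm * (1.0 / efficiency - 1.0) ** (1.0 / 6.0)


if __name__ == "__main__":
    R0 = 5.0  # hypothetical Förster radius in nm (assumption for illustration)
    for r in (3.0, 5.0, 7.0):
        print(f"r = {r:.1f} nm -> E = {fret_efficiency(r, R0):.3f}")
```

Because of the sixth-power dependence, the efficiency changes most steeply near r = R0, which is why a single FRET pair reports reliably on only one distance range.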
Fang, Jing; Nevin, Philip; Kairys, Visvaldas; Venclovas, Česlovas; Engen, John R; Beuning, Penny J
2014-04-08
The relationship between protein sequence, structure, and dynamics has been elusive. Here, we report a comprehensive analysis using an in-solution experimental approach to study how the conservation of tertiary structure correlates with protein dynamics. Hydrogen exchange measurements of eight processivity clamp proteins from different species revealed that, despite highly similar three-dimensional structures, clamp proteins display a wide range of dynamic behavior. Differences were apparent both for structurally similar domains within proteins and for corresponding domains of different proteins. Several of the clamps contained regions that underwent local unfolding with different half-lives. We also observed a conserved pattern of alternating dynamics of the α helices lining the inner pore of the clamps as well as a correlation between dynamics and the number of salt bridges in these α helices. Our observations reveal that tertiary structure and dynamics are not directly correlated and that primary structure plays an important role in dynamics. Copyright © 2014 Elsevier Ltd. All rights reserved.
Núñez-Vivanco, Gabriel; Valdés-Jiménez, Alejandro; Besoaín, Felipe; Reyes-Parada, Miguel
2016-01-01
Since the structure of proteins is more conserved than the sequence, the identification of conserved three-dimensional (3D) patterns among a set of proteins, can be important for protein function prediction, protein clustering, drug discovery and the establishment of evolutionary relationships. Thus, several computational applications to identify, describe and compare 3D patterns (or motifs) have been developed. Often, these tools consider a 3D pattern as that described by the residues surrounding co-crystallized/docked ligands available from X-ray crystal structures or homology models. Nevertheless, many of the protein structures stored in public databases do not provide information about the location and characteristics of ligand binding sites and/or other important 3D patterns such as allosteric sites, enzyme-cofactor interaction motifs, etc. This makes necessary the development of new ligand-independent methods to search and compare 3D patterns in all available protein structures. Here we introduce Geomfinder, an intuitive, flexible, alignment-free and ligand-independent web server for detailed estimation of similarities between all pairs of 3D patterns detected in any two given protein structures. We used around 1100 protein structures to form pairs of proteins which were assessed with Geomfinder. In these analyses each protein was considered in only one pair (e.g. in a subset of 100 different proteins, 50 pairs of proteins can be defined). 
Thus: (a) Geomfinder detected identical pairs of 3D patterns in a series of monoamine oxidase-B structures, which corresponded to the effectively similar ligand binding sites in these proteins; (b) we identified structural similarities among pairs of protein structures which are targets of compounds such as acarbose, benzamidine, adenosine triphosphate and pyridoxal phosphate; these similar 3D patterns are not detected using sequence-based methods; (c) the detailed evaluation of three specific cases showed the versatility of Geomfinder, which was able to discriminate between similar and different 3D patterns related to binding sites of common substrates in a range of diverse proteins. Geomfinder allows detecting similar 3D patterns between any pair of protein structures, regardless of the divergence between their amino acid sequences. Although the software is not intended for simultaneous multiple comparisons in a large number of proteins, it can be particularly useful in cases such as the structure-based design of multitarget drugs, where a detailed analysis of 3D pattern similarities between a few selected protein targets is essential.
Ejnik, John W; Muñoz, Amalia; DeRose, Eugene; Shaw, C Frank; Petering, David H
2003-07-22
The NMR determination of the structure of Cd(7)-metallothionein was done previously using a relatively large protein concentration that favors dimer formation. The reactivity of the protein is also affected under this condition. To examine the influence of protein concentration on metallothionein conformation, the isolated Cd(4)-alpha-domain was prepared from rabbit metallothionein-2 (MT 2), and its three-dimensional structure was determined by heteronuclear, (1)H-(111)Cd, and homonuclear, (1)H-(1)H NMR, correlation experiments. The three-dimensional structure was refined using distance and angle constraints derived from these two-dimensional NMR data sets and a distance geometry/simulated annealing protocol. The backbone superposition of the alpha-domain from rabbit holoprotein Cd(7)-MT 2 and the isolated rabbit Cd(4)-alpha was measured at a RMSD of 2.0 Å. Nevertheless, the conformations of the two Cd-thiolate clusters were distinctly different at two of the cadmium centers. In addition, solvent access to the sulfhydryl ligands of the isolated Cd(4)-alpha cluster was 130% larger due to this small change in cluster geometry. To probe whether these differences were an artifact of the structure calculation, the Cd(4)-alpha-domain structure in rabbit Cd(7)-MT 2 was redetermined, using the previously defined set of NOEs and the present calculation protocol. All calculations employed the same ionic radius for Cd(2+) and same cadmium-thiolate bond distance. The newly calculated structure matched the original with an RMSD of 1.24 Å. It is hypothesized that differences in the two alpha-domain structures result from a perturbation of the holoprotein structure because of head-to-tail dimerization under the conditions of the NMR experiments.
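The RMSD values quoted in the abstract above compare matched backbone atoms of two structures. As a minimal sketch, the Python function below computes the root-mean-square deviation between two equally long lists of (x, y, z) coordinates. It assumes the structures have already been superposed; the optimal-superposition step (for example the Kabsch algorithm) is deliberately left out, and the toy coordinates are invented for illustration.

```python
import math


def rmsd(coords_a, coords_b):
    """RMSD between matched atoms of two pre-superposed structures.

    coords_a, coords_b: sequences of (x, y, z) tuples in the same units
    (e.g. angstroms), listed in the same atom order.
    """
    if len(coords_a) != len(coords_b) or len(coords_a) == 0:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(coords_a))


if __name__ == "__main__":
    # Invented toy coordinates, not data from the paper.
    reference = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0)]
    displaced = [(x + 2.0, y, z) for x, y, z in reference]
    print(rmsd(reference, displaced))
```

Without the superposition step the result depends on how the two coordinate sets are placed in space, so the value is only meaningful once both structures share a common frame.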
NASA Astrophysics Data System (ADS)
Oda, Akifumi; Fukuyoshi, Shuichi
2015-06-01
The GADV hypothesis is a form of the protein world hypothesis, which suggests that life originated from proteins (Lacey et al. 1999; Ikehara 2002; Andras 2006). In the GADV hypothesis, life is thought to have originated from primitive proteins constructed of only glycine, alanine, aspartic acid, and valine ([GADV]-proteins). In this study, the three-dimensional (3D) conformations of randomly generated short [GADV]-peptides were computationally investigated using replica-exchange molecular dynamics (REMD) simulations (Sugita and Okamoto 1999). Because the peptides used in this study consisted of only 20 residues each, they could not form certain 3D structures. However, the conformational tendencies of the peptides were elucidated by analyzing the conformational ensembles generated by REMD simulations. The results indicate that secondary structures can be formed in several randomly generated [GADV]-peptides. A long helical structure was found in one of the hydrophobic peptides, supporting the conjecture of the GADV hypothesis that many peptides aggregated to form peptide multimers with enzymatic activity in the primordial soup. In addition, these results indicate that REMD simulations can be used for the structural investigation of short peptides.
Lorenzo, J Ramiro; Alonso, Leonardo G; Sánchez, Ignacio E
2015-01-01
Asparagine residues in proteins undergo spontaneous deamidation, a post-translational modification that may act as a molecular clock for the regulation of protein function and turnover. Asparagine deamidation is modulated by protein local sequence, secondary structure and hydrogen bonding. We present NGOME, an algorithm able to predict non-enzymatic deamidation of internal asparagine residues in proteins in the absence of structural data, using sequence-based predictions of secondary structure and intrinsic disorder. Compared to previous algorithms, NGOME does not require three-dimensional structures yet yields better predictions than available sequence-only methods. Four case studies of specific proteins show how NGOME may help the user identify deamidation-prone asparagine residues, often related to protein gain of function, protein degradation or protein misfolding in pathological processes. A fifth case study applies NGOME at a proteomic scale and unveils a correlation between asparagine deamidation and protein degradation in yeast. NGOME is freely available as a webserver at the National EMBnet node Argentina, URL: http://www.embnet.qb.fcen.uba.ar/ in the subpage "Protein and nucleic acid structure and sequence analysis".
Cole, Jason C.
2017-01-01
The Cambridge Structural Database (CSD) is the worldwide resource for the dissemination of all published three-dimensional structures of small-molecule organic and metal–organic compounds. This paper briefly describes how this collection of crystal structures can be used en masse in the context of macromolecular crystallography. Examples highlight how the CSD and associated software aid protein–ligand complex validation, and show how the CSD could be further used in the generation of geometrical restraints for protein structure refinement. PMID:28291758
Packaging DNA Origami into Viral Protein Cages.
Linko, Veikko; Mikkilä, Joona; Kostiainen, Mauri A
2018-01-01
The DNA origami technique is a widely used method to create customized, complex, spatially well-defined two-dimensional (2D) and three-dimensional (3D) DNA nanostructures. These structures have huge potential to serve as smart drug-delivery vehicles and molecular devices in various nanomedical and biotechnological applications. However, so far only little is known about the behavior of these novel structures in living organisms or in cell culture/tissue models. Moreover, enhancing pharmacokinetic bioavailability and transfection properties of such structures still remains a challenge. One intriguing approach to overcome these issues is to coat DNA origami nanostructures with proteins or lipid membranes. Here, we show how cowpea chlorotic mottle virus (CCMV) capsid proteins (CPs) can be used for coating DNA origami nanostructures. We present a method for disassembling native CCMV particles and isolating the pure CP dimers, which can further bind and encapsulate a rectangular DNA origami shape. Owing to the highly programmable nature of DNA origami, packaging of DNA nanostructures into viral protein cages could find imminent uses in enhanced targeting and cellular delivery of various active nano-objects, such as enzymes and drug molecules.
NASA Astrophysics Data System (ADS)
Illing, Gerd; Saenger, Wolfram; Heinemann, Udo
2000-06-01
The Protein Structure Factory will be established to characterize proteins encoded by human genes or cDNAs, which will be selected by criteria of potential structural novelty or medical or biotechnological usefulness. It represents an integrative approach to structure analysis combining bioinformatics techniques, automated gene expression and purification of gene products, generation of a biophysical fingerprint of the proteins and the determination of their three-dimensional structures either by NMR spectroscopy or by X-ray diffraction. The use of synchrotron radiation will be crucial to the Protein Structure Factory: high brilliance and tunable wavelengths are prerequisites for fast data collection, the use of small crystals and multiwavelength anomalous diffraction (MAD) phasing. With the opening of BESSY II, direct access to a third-generation XUV storage ring source with excellent conditions is available nearby. An insertion device with two MAD beamlines and one constant energy station will be set up by 2001.
Crystal Structure of a Plant Multidrug and Toxic Compound Extrusion Family Protein.
Tanaka, Yoshiki; Iwaki, Shigehiro; Tsukazaki, Tomoya
2017-09-05
The multidrug and toxic compound extrusion (MATE) family of proteins consists of transporters responsible for multidrug resistance in prokaryotes. In plants, a number of MATE proteins were identified by recent genomic and functional studies, which imply that the proteins have substrate-specific transport functions instead of multidrug extrusion. The three-dimensional structure of eukaryotic MATE proteins, including those of plants, has not been reported, preventing a better understanding of the molecular mechanism of these proteins. Here, we describe the crystal structure of a MATE protein from the plant Camelina sativa at 2.9 Å resolution. Two sets of six transmembrane α helices, assembled pseudo-symmetrically, possess a negatively charged internal pocket with an outward-facing shape. The crystal structure provides insight into the diversity of plant MATE proteins and their substrate recognition and transport through the membrane. Copyright © 2017 Elsevier Ltd. All rights reserved.
Sequence co-evolution gives 3D contacts and structures of protein complexes
Hopf, Thomas A; Schärfe, Charlotta P I; Rodrigues, João P G L M; Green, Anna G; Kohlbacher, Oliver; Sander, Chris; Bonvin, Alexandre M J J; Marks, Debora S
2014-01-01
Protein–protein interactions are fundamental to many biological processes. Experimental screens have identified tens of thousands of interactions, and structural biology has provided detailed functional insight for select 3D protein complexes. An alternative rich source of information about protein interactions is the evolutionary sequence record. Building on earlier work, we show that analysis of correlated evolutionary sequence changes across proteins identifies residues that are close in space with sufficient accuracy to determine the three-dimensional structure of the protein complexes. We evaluate prediction performance in blinded tests on 76 complexes of known 3D structure, predict protein–protein contacts in 32 complexes of unknown structure, and demonstrate how evolutionary couplings can be used to distinguish between interacting and non-interacting protein pairs in a large complex. With the current growth of sequences, we expect that the method can be generalized to genome-wide elucidation of protein–protein interaction networks and used for interaction predictions at residue resolution. DOI: http://dx.doi.org/10.7554/eLife.03430.001 PMID:25255213
DOE Office of Scientific and Technical Information (OSTI.GOV)
South, T.L.; Blake, P.R.; Hare, D.R.
Two-dimensional NMR spectroscopic and computational methods were employed for the structure determination of an 18-residue peptide with the amino acid sequence of the C-terminal retroviral-type (r.t.) zinc finger domain from the nucleocapsid protein (NCP) of HIV-1 (Zn(HIV1-F2)). Unlike results obtained for the first retroviral-type zinc finger peptide, Zn(HIV1-F1), broad signals indicative of conformational lability were observed in the ¹H NMR spectrum of Zn(HIV1-F2) at 25 °C. The NMR signals narrowed upon cooling to −2 °C, enabling complete ¹H NMR signal assignment via standard two-dimensional (2D) NMR methods. Distance restraints obtained from qualitative analysis of 2D nuclear Overhauser effect (NOESY) data were used to generate 30 distance geometry (DG) structures with penalties in the range 0.02-0.03 Å². All structures were qualitatively consistent with the experimental NOESY spectrum based on comparisons with 2D NOESY back-calculated spectra. These results indicate that the r.t. zinc finger sequences observed in retroviral NCPs, simple plant virus coat proteins, and in a human single-stranded nucleic acid binding protein share a common structural motif.
Real-time ligand binding pocket database search using local surface descriptors.
Chikhi, Rayan; Sael, Lee; Kihara, Daisuke
2010-07-01
Because of the increasing number of structures of unknown function accumulated by ongoing structural genomics projects, there is an urgent need for computational methods for characterizing protein tertiary structures. As functions of many of these proteins are not easily predicted by conventional sequence database searches, a legitimate strategy is to utilize structure information in function characterization. Of particular interest is prediction of ligand binding to a protein, as ligand molecule recognition is a major part of molecular function of proteins. Predicting whether a ligand molecule binds a protein is a complex problem due to the physical nature of protein-ligand interactions and the flexibility of both binding sites and ligand molecules. However, geometric and physicochemical complementarity is observed between the ligand and its binding site in many cases. Therefore, ligand molecules which bind to a local surface site in a protein can be predicted by finding similar local pockets of known binding ligands in the structure database. Here, we present two representations of ligand binding pockets and utilize them for ligand binding prediction by pocket shape comparison. These representations are based on mapping of surface properties of binding pockets, which are compactly described either by the two-dimensional pseudo-Zernike moments or the three-dimensional Zernike descriptors. These compact representations allow a fast real-time pocket searching against a database. Thorough benchmark studies employing two different datasets show that our representations are competitive with the other existing methods. Limitations and potentials of the shape-based methods as well as possible improvements are discussed.
Tolkatchev, Dmitri; Shaykhutdinov, Rustem; Xu, Ping; Plamondon, Josée; Watson, David C; Young, N Martin; Ni, Feng
2006-10-01
A putative low molecular weight protein tyrosine phosphatase (LMW-PTP) was identified in the genome sequence of the bacterial pathogen, Campylobacter jejuni. This novel gene, cj1258, has sequence homology with a distinctive class of phosphatases widely distributed among prokaryotes and eukaryotes. We report here the solution structure of Cj1258 established by high-resolution NMR spectroscopy using NOE-derived distance restraints, hydrogen bond data, and torsion angle restraints. The three-dimensional structure consists of a central four-stranded parallel beta-sheet flanked by five alpha-helices, revealing an overall structural topology similar to those of the eukaryotic LMW-PTPs, such as human HCPTP-A, bovine BPTP, and Saccharomyces cerevisiae LTP1, and to those of the bacterial LMW-PTPs MPtpA from Mycobacterium tuberculosis and YwlE from Bacillus subtilis. The active site of the enzyme is flexible in solution and readily adapts to the binding of ligands, such as the phosphate ion. An NMR-based screen was carried out against a number of potential inhibitors and activators, including phosphonomethylphenylalanine, derivatives of the cinnamic acid, 2-hydroxy-5-nitrobenzaldehyde, cinnamaldehyde, adenine, and hypoxanthine. Despite its bacterial origin, both the three-dimensional structure and ligand-binding properties of Cj1258 suggest that this novel phosphatase may have functional roles close to those of eukaryotic and mammalian tyrosine phosphatases. The three-dimensional structure along with mapping of small-molecule binding will be discussed in the context of developing high-affinity inhibitors of this novel LMW-PTP.
Gorai, Biswajit; Prabhavadhni, Arasu; Sivaraman, Thirunavukkarasu
2015-09-01
Unfolding stabilities of two homologous proteins, cardiotoxin III and short-neurotoxin (SNTX) belonging to three-finger toxin (TFT) superfamily, have been probed by means of molecular dynamics (MD) simulations. Combined analysis of data obtained from steered MD and all-atom MD simulations at various temperatures in near physiological conditions on the proteins suggested that overall structural stabilities of the two proteins were different from each other and the MD results are consistent with experimental data of the proteins reported in the literature. Rationalization for the differential structural stabilities of the structurally similar proteins has been chiefly attributed to the differences in the structural contacts between C- and N-termini regions in their three-dimensional structures, and the findings endorse the 'CN network' hypothesis proposed to qualitatively analyse the thermodynamic stabilities of proteins belonging to TFT superfamily of snake venoms. Moreover, the 'CN network' hypothesis has been revisited and the present study suggested that 'CN network' should be accounted in terms of 'structural contacts' and 'structural strengths' in order to precisely describe order of structural stabilities of TFTs.
2013-01-01
Chemical cross-linking of proteins combined with mass spectrometry provides an attractive and novel method for the analysis of native protein structures and protein complexes. Analysis of the data however is complex. Only a small number of cross-linked peptides are produced during sample preparation and must be identified against a background of more abundant native peptides. To facilitate the search and identification of cross-linked peptides, we have developed a novel software suite, named Hekate. Hekate is a suite of tools that address the challenges involved in analyzing protein cross-linking experiments when combined with mass spectrometry. The software is an integrated pipeline for the automation of the data analysis workflow and provides a novel scoring system based on principles of linear peptide analysis. In addition, it provides a tool for the visualization of identified cross-links using three-dimensional models, which is particularly useful when combining chemical cross-linking with other structural techniques. Hekate was validated by the comparative analysis of cytochrome c (bovine heart) against previously reported data. Further validation was carried out on known structural elements of DNA polymerase III, the catalytic α-subunit of the Escherichia coli DNA replisome along with new insight into the previously uncharacterized C-terminal domain of the protein. PMID:24010795
Comparative Protein Structure Modeling Using MODELLER.
Webb, Benjamin; Sali, Andrej
2014-09-08
Functional characterization of a protein sequence is one of the most frequent problems in biology. This task is usually facilitated by accurate three-dimensional (3-D) structure of the studied protein. In the absence of an experimentally determined structure, comparative or homology modeling can sometimes provide a useful 3-D model for a protein that is related to at least one known protein structure. Comparative modeling predicts the 3-D structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described. Copyright © 2014 John Wiley & Sons, Inc.
Komives, Elizabeth A.; Wolynes, Peter G.
2008-01-01
Repeat-proteins are made up of near repetitions of 20– to 40–amino acid stretches. These polypeptides usually fold up into non-globular, elongated architectures that are stabilized by the interactions within each repeat and those between adjacent repeats, but that lack contacts between residues distant in sequence. The inherent symmetries both in primary sequence and three-dimensional structure are reflected in a folding landscape that may be analyzed as a quasi–one-dimensional problem. We present a general description of repeat-protein energy landscapes based on a formal Ising-like treatment of the elementary interaction energetics in and between foldons, whose collective ensemble are treated as spin variables. The overall folding properties of a complete “domain” (the stability and cooperativity of the repeating array) can be derived from this microscopic description. The one-dimensional nature of the model implies there are simple relations for the experimental observables: folding free-energy (ΔGwater) and the cooperativity of denaturation (m-value), which do not ordinarily apply for globular proteins. We show how the parameters for the “coarse-grained” description in terms of foldon spin variables can be extracted from more detailed folding simulations on perfectly funneled landscapes. To illustrate the ideas, we present a case-study of a family of tetratricopeptide (TPR) repeat proteins and quantitatively relate the results to the experimentally observed folding transitions. Based on the dramatic effect that single point mutations exert on the experimentally observed folding behavior, we speculate that natural repeat proteins are “poised” at particular ratios of inter- and intra-element interaction energetics that allow them to readily undergo structural transitions in physiologically relevant conditions, which may be intrinsically related to their biological functions. PMID:18483553
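The Ising-like picture described in the abstract above can be illustrated with a toy one-dimensional model in which each repeat is either folded or unfolded. The Python sketch below is not the authors' model or parameterization: the energy function (one intrinsic term per folded repeat plus one interface term per pair of adjacent folded repeats), the function name, and all numerical values are assumptions chosen only to show how a nearest-neighbour coupling produces cooperativity.

```python
import math
from itertools import product

GAS_CONSTANT = 0.0019872  # kcal / (mol K)


def fraction_folded(n_repeats, intrinsic, interface, temperature=298.0):
    """Mean fraction of folded repeats in a toy 1D Ising-like model.

    intrinsic: energy (kcal/mol) added for each folded repeat.
    interface: energy (kcal/mol) added for each pair of adjacent folded repeats.
    All 2**n_repeats states are enumerated, so keep n_repeats small.
    """
    beta = 1.0 / (GAS_CONSTANT * temperature)
    partition = 0.0
    weighted_folded = 0.0
    for state in product((0, 1), repeat=n_repeats):
        n_folded = sum(state)
        n_interfaces = sum(a * b for a, b in zip(state, state[1:]))
        energy = intrinsic * n_folded + interface * n_interfaces
        weight = math.exp(-beta * energy)
        partition += weight
        weighted_folded += weight * n_folded / n_repeats
    return weighted_folded / partition


if __name__ == "__main__":
    # Hypothetical energies: folding one repeat alone is unfavourable,
    # while each interface between folded neighbours is stabilising.
    for n in (2, 4, 8):
        print(n, round(fraction_folded(n, intrinsic=1.0, interface=-3.0), 3))
```

With these assumed energies, longer arrays come out more folded than shorter ones, because the number of stabilising interfaces grows with array length while each isolated repeat remains unstable on its own.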
Severcan, Isil; Geary, Cody; Chworos, Arkadiusz; Voss, Neil; Jacovetty, Erica; Jaeger, Luc
2010-09-01
Supramolecular assembly is a powerful strategy used by nature to build nanoscale architectures with predefined sizes and shapes. With synthetic systems, however, numerous challenges remain to be solved before precise control over the synthesis, folding and assembly of rationally designed three-dimensional nano-objects made of RNA can be achieved. Here, using the transfer RNA molecule as a structural building block, we report the design, efficient synthesis and structural characterization of stable, modular three-dimensional particles adopting the polyhedral geometry of a non-uniform square antiprism. The spatial control within the final architecture allows the precise positioning and encapsulation of proteins. This work demonstrates that a remarkable degree of structural control can be achieved with RNA structural motifs for the construction of thermostable three-dimensional nano-architectures that do not rely on helix bundles or tensegrity. RNA three-dimensional particles could potentially be used as carriers or scaffolds in nanomedicine and synthetic biology.
Quantifying the relationship between sequence and three-dimensional structure conservation in RNA
2010-01-01
Background: In recent years, the number of available RNA structures has rapidly grown, reflecting the increased interest in RNA biology. Similarly to the studies carried out two decades ago for proteins, which gave the fundamental grounds for developing comparative protein structure prediction methods, we are now able to quantify the relationship between sequence and structure conservation in RNA. Results: Here we introduce an all-against-all sequence- and three-dimensional (3D) structure-based comparison of a representative set of RNA structures, which has allowed us to quantitatively confirm that: (i) there is a measurable relationship between sequence and structure conservation that weakens for alignments resulting in below 60% sequence identity, (ii) evolution tends to conserve more RNA structure than sequence, and (iii) there is a twilight zone for RNA homology detection. Discussion: The computational analysis presented here quantitatively describes the relationship between sequence and structure for RNA molecules and defines a twilight zone region for detecting RNA homology. Our work could represent the theoretical basis and limitations for future developments in comparative RNA 3D structure prediction. PMID:20550657
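The 60% sequence identity figure in the abstract above refers to identity computed over a pairwise alignment. The Python sketch below shows one common way to compute it: matches divided by the number of aligned, non-gap positions. The abstract does not say which denominator the authors used, so that convention, the function name, and the toy sequences are assumptions.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over aligned positions where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip columns containing a gap in either sequence
        compared += 1
        if a.upper() == b.upper():
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Invented toy alignment, not data from the paper.
    identity = percent_identity("GGC-AUCC", "GGCUAACC")
    print(f"{identity:.1f}% identity; below 60%: {identity < 60.0}")
```

Other conventions divide by the alignment length or by the length of the shorter sequence, and these can give noticeably different values for gapped alignments, so a threshold is only comparable when the same convention is used.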
[Three-dimensional genome organization: a lesson from the Polycomb-Group proteins].
Bantignies, Frédéric
2013-01-01
As more and more genomes are being explored and annotated, important features of three-dimensional (3D) genome organization are just being uncovered. In the light of what we know about Polycomb group (PcG) proteins, we will present the latest findings on this topic. The PcG proteins are well-conserved chromatin factors that repress transcription of numerous target genes. They bind the genome at specific sites, forming chromatin domains of associated histone modifications as well as higher-order chromatin structures. These 3D chromatin structures involve the interactions between PcG-bound regulatory regions at short- and long-range distances, and may significantly contribute to PcG function. Recent high throughput "Chromosome Conformation Capture" (3C) analyses have revealed many other higher order structures along the chromatin fiber, partitioning the genomes into well demarcated topological domains. This revealed an unprecedented link between linear epigenetic domains and chromosome architecture, which might be intimately connected to genome function. © Société de Biologie, 2013.
Three-dimensional organization of three-domain copper oxidases: A review
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhukhlistova, N. E., E-mail: amm@ns.crys.ras.ru; Zhukova, Yu. N.; Lyashenko, A. V.
2008-01-15
'Blue' copper-containing proteins are multidomain proteins that utilize a unique redox property of copper ions. Among other blue multicopper oxidases, three-domain oxidases belong to the group of proteins that exhibit a wide variety of compositions in amino acid sequences, functions, and occurrences in organisms. This paper presents a review of the data obtained from X-ray diffraction investigations of the three-dimensional structures of three-domain multicopper oxidases, such as the ascorbate oxidase catalyzing oxidation of ascorbate to dehydroascorbate and its three derivatives; the multicopper oxidase CueO (the laccase homologue); the laccases isolated from the basidiomycetes Coprinus cinereus, Trametes versicolor, Coriolus zonatus, Cerrena maxima, and Rigidoporus lignosus and the ascomycete Melanocarpus albomyces; and the bacterial laccases CotA from the endospore coats of Bacillus subtilis. A comparison of the molecular structures of the laccases of different origins demonstrates that, structurally, these objects are highly conservative. This obviously indicates that the catalytic activity of the enzymes under consideration is characterized by similar mechanisms.
Three-dimensional organization of three-domain copper oxidases: A review
NASA Astrophysics Data System (ADS)
Zhukhlistova, N. E.; Zhukova, Yu. N.; Lyashenko, A. V.; Zaĭtsev, V. N.; Mikhaĭlov, A. M.
2008-01-01
“Blue” copper-containing proteins are multidomain proteins that utilize a unique redox property of copper ions. Among other blue multicopper oxidases, three-domain oxidases belong to the group of proteins that exhibit a wide variety of compositions in amino acid sequences, functions, and occurrences in organisms. This paper presents a review of the data obtained from X-ray diffraction investigations of the three-dimensional structures of three-domain multicopper oxidases, such as the ascorbate oxidase catalyzing oxidation of ascorbate to dehydroascorbate and its three derivatives; the multicopper oxidase CueO (the laccase homologue); the laccases isolated from the basidiomycetes Coprinus cinereus, Trametes versicolor, Coriolus zonatus, Cerrena maxima, and Rigidoporus lignosus and the ascomycete Melanocarpus albomyces; and the bacterial laccases CotA from the endospore coats of Bacillus subtilis. A comparison of the molecular structures of the laccases of different origins demonstrates that, structurally, these objects are highly conservative. This obviously indicates that the catalytic activity of the enzymes under consideration is characterized by similar mechanisms.
ChemPreview: an augmented reality-based molecular interface.
Zheng, Min; Waller, Mark P
2017-05-01
Human computer interfaces make computational science more comprehensible and impactful. Complex 3D structures such as proteins or DNA are magnified by digital representations and displayed on two-dimensional monitors. Augmented reality has recently opened another door to access the virtual three-dimensional world. Herein, we present an augmented reality application called ChemPreview with the potential to manipulate bio-molecular structures at an atomistic level. ChemPreview is available at https://github.com/wallerlab/chem-preview/releases, and is built on top of the Meta 1 platform https://www.metavision.com/. ChemPreview can be used to interact with a protein in an intuitive way using natural hand gestures, thereby making it appealing to computational chemists or structural biologists. The ability to manipulate atoms in real world could eventually provide new and more efficient ways of extracting structural knowledge, or designing new molecules in silico. Copyright © 2017 Elsevier Inc. All rights reserved.
Inorganic pyrophosphatases: structural diversity serving the function
NASA Astrophysics Data System (ADS)
Samygina, V. R.
2016-05-01
The review is devoted to ubiquitous enzymes, inorganic pyrophosphatases, which are essential in all living organisms. Despite the long history of investigations, these enzymes continue to attract interest. The review focuses on the three-dimensional structures of various representatives of this class of proteins. The structural diversity, the relationship between the structure and some properties of pyrophosphatases and various mechanisms of enzyme action related to the structural diversity of these enzymes are discussed. Interactions of pyrophosphatase with other proteins and possible practical applications are considered. The bibliography includes 56 references.
Gallat, F.-X.; Laganowsky, A.; Wood, K.; Gabel, F.; van Eijck, L.; Wuttke, J.; Moulin, M.; Härtlein, M.; Eisenberg, D.; Colletier, J.-P.; Zaccai, G.; Weik, M.
2012-01-01
Hydration water is vital for various macromolecular biological activities, such as specific ligand recognition, enzyme activity, response to receptor binding, and energy transduction. Without hydration water, proteins would not fold correctly and would lack the conformational flexibility that animates their three-dimensional structures. Motions in globular, soluble proteins are thought to be governed to a certain extent by hydration-water dynamics, yet it is not known whether this relationship holds true for other protein classes in general and whether, in turn, the structural nature of a protein also influences water motions. Here, we provide insight into the coupling between hydration-water dynamics and atomic motions in intrinsically disordered proteins (IDP), a largely unexplored class of proteins that, in contrast to folded proteins, lack a well-defined three-dimensional structure. We investigated the human IDP tau, which is involved in the pathogenic processes accompanying Alzheimer disease. Combining neutron scattering and protein perdeuteration, we found similar atomic mean-square displacements over a large temperature range for the tau protein and its hydration water, indicating intimate coupling between them. This is in contrast to the behavior of folded proteins of similar molecular weight, such as the globular, soluble maltose-binding protein and the membrane protein bacteriorhodopsin, which display moderate to weak coupling, respectively. The extracted mean square displacements also reveal a greater motional flexibility of IDP compared with globular, folded proteins and more restricted water motions on the IDP surface. The results provide evidence that protein and hydration-water motions mutually affect and shape each other, and that there is a gradient of coupling across different protein classes that may play a functional role in macromolecular activity in a cellular context. PMID:22828339
Petti, Megan K; Lomont, Justin P; Maj, Michał; Zanni, Martin T
2018-02-15
Two-dimensional spectroscopy is a powerful tool for extracting structural and dynamic information from a wide range of chemical systems. We provide a brief overview of the ways in which two-dimensional visible and infrared spectroscopies are being applied to elucidate fundamental details of important processes in biological and materials science. The topics covered include amyloid proteins, photosynthetic complexes, ion channels, photovoltaics, batteries, as well as a variety of promising new methods in two-dimensional spectroscopy.
A Stochastic Evolutionary Model for Protein Structure Alignment and Phylogeny
Challis, Christopher J.; Schmidler, Scott C.
2012-01-01
We present a stochastic process model for the joint evolution of protein primary and tertiary structure, suitable for use in alignment and estimation of phylogeny. Indels arise from a classic Links model, and mutations follow a standard substitution matrix, whereas backbone atoms diffuse in three-dimensional space according to an Ornstein–Uhlenbeck process. The model allows for simultaneous estimation of evolutionary distances, indel rates, structural drift rates, and alignments, while fully accounting for uncertainty. The inclusion of structural information enables phylogenetic inference on time scales not previously attainable with sequence evolution models. The model also provides a tool for testing evolutionary hypotheses and improving our understanding of protein structural evolution. PMID:22723302
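The Ornstein-Uhlenbeck process named in the abstract above is a mean-reverting diffusion, commonly written as dx = theta (mu - x) dt + sigma dW. The sketch below simulates a single coordinate with an Euler-Maruyama step. It is an illustration only: the parameter names (`theta`, `mu`, `sigma`) and the one-dimensional setup are assumptions made here, not the parameterization used in the paper.

```python
import math
import random


def simulate_ou(x0, mu, theta, sigma, dt, n_steps, rng=None):
    """Euler-Maruyama path of dx = theta * (mu - x) dt + sigma dW.

    Returns a list of n_steps + 1 values starting at x0.
    All parameter names are illustrative assumptions.
    """
    if rng is None:
        rng = random.Random()
    x = x0
    path = [x]
    for _ in range(n_steps):
        noise = rng.gauss(0.0, 1.0)
        # Drift pulls x toward mu; diffusion adds Gaussian noise scaled by sqrt(dt).
        x = x + theta * (mu - x) * dt + sigma * math.sqrt(dt) * noise
        path.append(x)
    return path
```

With `sigma=0` the noise term vanishes and the path simply decays toward `mu`, which is a convenient sanity check of the drift term.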
Challenges in NMR-based structural genomics
NASA Astrophysics Data System (ADS)
Sue, Shih-Che; Chang, Chi-Fon; Huang, Yao-Te; Chou, Ching-Yu; Huang, Tai-huang
2005-05-01
Understanding the functions of the vast number of proteins encoded in many genomes that have been completely sequenced recently is the main challenge for biologists in the post-genomics era. Since the function of a protein is determined by its exact three-dimensional structure it is paramount to determine the 3D structures of all proteins. This need has driven structural biologists to undertake the structural genomics project aimed at determining the structures of all known proteins. Several centers for structural genomics studies have been established throughout the world. Nuclear magnetic resonance (NMR) spectroscopy has played a major role in determining protein structures in atomic details and in a physiologically relevant solution state. Since the number of new genes being discovered daily far exceeds the number of structures determined by both NMR and X-ray crystallography, a high-throughput method for speeding up the process of protein structure determination is essential for the success of the structural genomics effort. In this article we will describe NMR methods currently being employed for protein structure determination. We will also describe methods under development which may drastically increase the throughput, as well as point out areas where opportunities exist for biophysicists to make significant contribution in this important field.
Tsai, Keng-Chang; Jian, Jhih-Wei; Yang, Ei-Wen; Hsu, Po-Chiang; Peng, Hung-Pin; Chen, Ching-Tai; Chen, Jun-Bo; Chang, Jeng-Yih; Hsu, Wen-Lian; Yang, An-Suei
2012-01-01
Non-covalent protein-carbohydrate interactions mediate molecular targeting in many biological processes. Prediction of non-covalent carbohydrate binding sites on protein surfaces not only provides insights into the functions of the query proteins; information on key carbohydrate-binding residues could suggest site-directed mutagenesis experiments, design therapeutics targeting carbohydrate-binding proteins, and provide guidance in engineering protein-carbohydrate interactions. In this work, we show that non-covalent carbohydrate binding sites on protein surfaces can be predicted with relatively high accuracy when the query protein structures are known. The prediction capabilities were based on a novel encoding scheme of the three-dimensional probability density maps describing the distributions of 36 non-covalent interacting atom types around protein surfaces. One machine learning model was trained for each of the 30 protein atom types. The machine learning algorithms predicted tentative carbohydrate binding sites on query proteins by recognizing the characteristic interacting atom distribution patterns specific for carbohydrate binding sites from known protein structures. The prediction results for all protein atom types were integrated into surface patches as tentative carbohydrate binding sites based on normalized prediction confidence level. The prediction capabilities of the predictors were benchmarked by a 10-fold cross validation on 497 non-redundant proteins with known carbohydrate binding sites. The predictors were further tested on an independent test set with 108 proteins. The residue-based Matthews correlation coefficient (MCC) for the independent test was 0.45, with prediction precision and sensitivity (or recall) of 0.45 and 0.49 respectively. In addition, 111 unbound carbohydrate-binding protein structures for which the structures were determined in the absence of the carbohydrate ligands were predicted with the trained predictors. 
The overall prediction MCC was 0.49. Independent tests on anti-carbohydrate antibodies showed that the carbohydrate antigen binding sites were predicted with comparable accuracy. These results demonstrate that the predictors are among the best in carbohydrate binding site predictions to date. PMID:22848404
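The abstract above reports a residue-based Matthews correlation coefficient together with precision and sensitivity. As a reminder of how these quantities relate, here is a minimal sketch that computes all three from the counts of a binary confusion matrix. The function name `binary_metrics` is hypothetical, and the counts in the usage note are made up for illustration; they are not data from the study.

```python
import math


def binary_metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Precision, recall (sensitivity) and MCC from confusion-matrix counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention MCC is set to 0 when any marginal sum is zero.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"precision": precision, "recall": recall, "mcc": mcc}
```

For instance, `binary_metrics(tp=6, fp=2, tn=10, fn=2)` gives a precision and recall of 0.75 each and an MCC of 56/96. Unlike precision and recall, MCC uses the true negatives as well, which matters when binding residues are a small fraction of the surface.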
Confinement and Structural Changes in Vertically Aligned Dust Structures
NASA Astrophysics Data System (ADS)
Hyde, Truell
2013-10-01
In physics, confinement is known to influence collective system behavior. Examples include Coulomb crystal variants such as those formed from ions or dust particles (classical), electrons in quantum dots (quantum) and the structural changes observed in vertically aligned dust particle systems formed within a glass box placed on the lower electrode of a Gaseous Electronics Conference (GEC) rf reference cell. Recent experimental studies have expanded the above to include the biological domain by showing that the stability and dynamics of proteins confined through encapsulation and enzyme molecules placed in inorganic cavities such as those found in biosensors are also directly influenced by their confinement. In this paper, the self-assembly and subsequent collective behavior of structures formed from n charged dust particles interacting with one another and located within a glass box placed on the lower, powered electrode of a GEC rf reference cell is discussed. Self-organized formation of vertically aligned one-dimensional chains, two-dimensional zigzag structures, and three-dimensional helical structures of triangular, quadrangular, pentagonal, hexagonal, and heptagonal symmetries are shown to occur. System evolution is shown to progress from one-dimensional chain structures, through a zigzag transition to two-dimensional, spindle-like structures, and then to various three-dimensional, helical structures exhibiting various symmetries. Stable configurations are shown to be strongly dependent upon system confinement. The critical conditions for structural transitions as well as the basic symmetry exhibited by the one-, two-, and three-dimensional structures that subsequently develop will be shown to be in good agreement with molecular dynamics simulations.
Protein Structure Determination from Pseudocontact Shifts Using ROSETTA
Schmitz, Christophe; Vernon, Robert; Otting, Gottfried; Baker, David; Huber, Thomas
2013-01-01
Paramagnetic metal ions generate pseudocontact shifts (PCSs) in nuclear magnetic resonance spectra that are manifested as easily measurable changes in chemical shifts. Metals can be incorporated into proteins through metal binding tags, and PCS data constitute powerful long-range restraints on the positions of nuclear spins relative to the coordinate system of the magnetic susceptibility anisotropy tensor (Δχ-tensor) of the metal ion. We show that three-dimensional structures of proteins can reliably be determined using PCS data from a single metal binding site combined with backbone chemical shifts. The program PCS-ROSETTA automatically determines the Δχ-tensor and metal position from the PCS data during the structure calculations, without any prior knowledge of the protein structure. The program can determine structures accurately for proteins of up to 150 residues, offering a powerful new approach to protein structure determination that relies exclusively on readily measurable backbone chemical shifts and easily discriminates between correctly and incorrectly folded conformations. PMID:22285518
RaptorX server: a resource for template-based protein structure modeling.
Källberg, Morten; Margaryan, Gohar; Wang, Sheng; Ma, Jianzhu; Xu, Jinbo
2014-01-01
Assigning functional properties to a newly discovered protein is a key challenge in modern biology. To this end, computational modeling of the three-dimensional atomic arrangement of the amino acid chain is often crucial in determining the role of the protein in biological processes. We present a community-wide web-based protocol, RaptorX server ( http://raptorx.uchicago.edu ), for automated protein secondary structure prediction, template-based tertiary structure modeling, and probabilistic alignment sampling. Given a target sequence, RaptorX server is able to detect even remotely related template sequences by means of a novel nonlinear context-specific alignment potential and probabilistic consistency algorithm. Using the protocol presented here it is thus possible to obtain high-quality structural models for many target protein sequences when only distantly related protein domains have experimentally solved structures. At present, RaptorX server can perform secondary and tertiary structure prediction of a 200 amino acid target sequence in approximately 30 min.
Jones, Peter P.; Meng, Xing; Xiao, Bailong; Cai, Shitian; Bolstad, Jeff; Wagenknecht, Terence; Liu, Zheng; Chen, S. R. Wayne
2009-01-01
Protein kinase A (PKA)-dependent phosphorylation of the cardiac Ca2+ release channel/ryanodine receptor (RyR2) is believed to directly dissociate FKBP12.6 from the channel, causing abnormal channel activation and Ca2+ release. To gain insight into the structural basis of the regulation of RyR2 by PKA, we determined the three-dimensional location of the PKA site S2030. Green fluorescent protein (GFP) was inserted into the wild type (wt) RyR2 and RyR2 mutant, A4860G, after T2023. The resultant GFP-RyR2 fusion proteins, RyR2T2023-GFP and RyR2(A4860G)T2023-GFP, were expressed in HEK293 cells and functionally characterized. Ca2+ release assays revealed that both GFP-RyR2 fusion proteins formed caffeine- and ryanodine-sensitive Ca2+ release channels. Further analyses using [3H]ryanodine binding demonstrated that the insertion of GFP into RyR2 wt after T2023 reduced the sensitivity of the channel to activation by Ca2+ or caffeine. RyR2(A4860G)T2023-GFP was found to be structurally more stable than RyR2T2023-GFP and was subsequently used as a basis for three-dimensional reconstruction. Cryo-electron microscopy and single particle image processing of the purified RyR2(A4860G)T2023-GFP protein revealed the location of the inserted GFP, and hence the S2030 PKA site in domain 4, a region that may be involved in signal transduction between the transmembrane and cytoplasmic domains. Like the S2808 PKA site reported previously, the S2030 site is not located close to the FKBP12.6 binding site mapped previously, indicating that neither of these PKA sites is directly involved in FKBP12.6 binding. Based on the three-dimensional localizations of a number of residues or regions, a model for the subunit organization in the structure of RyR2 is proposed. PMID:17967164
Bhagavat, Raghu; Sankar, Santhosh; Srinivasan, Narayanaswamy; Chandra, Nagasuma
2018-03-06
Protein-ligand interactions form the basis of most cellular events. Identifying ligand binding pockets in proteins will greatly facilitate rationalizing and predicting protein function. Ligand binding sites are unknown for many proteins of known three-dimensional (3D) structure, creating a gap in our understanding of protein structure-function relationships. To bridge this gap, we detect pockets in proteins of known 3D structures, using computational techniques. This augmented pocketome (PocketDB) consists of 249,096 pockets, which is about seven times larger than what is currently known. We deduce possible ligand associations for about 46% of the newly identified pockets. The augmented pocketome, when subjected to clustering based on similarities among pockets, yielded 2,161 site types, which are associated with 1,037 ligand types, together providing fold-site-type-ligand-type associations. The PocketDB resource facilitates a structure-based function annotation, delineation of the structural basis of ligand recognition, and provides functional clues for domains of unknown functions, allosteric proteins, and druggable pockets. Copyright © 2018 Elsevier Ltd. All rights reserved.
2017-01-01
Although deep learning approaches have had tremendous success in image, video and audio processing, computer vision, and speech recognition, their applications to three-dimensional (3D) biomolecular structural data sets have been hindered by the geometric and biological complexity. To address this problem we introduce the element-specific persistent homology (ESPH) method. ESPH represents 3D complex geometry by one-dimensional (1D) topological invariants and retains important biological information via a multichannel image-like representation. This representation reveals hidden structure-function relationships in biomolecules. We further integrate ESPH and deep convolutional neural networks to construct a multichannel topological neural network (TopologyNet) for the predictions of protein-ligand binding affinities and protein stability changes upon mutation. To overcome the deep learning limitations from small and noisy training sets, we propose a multi-task multichannel topological convolutional neural network (MM-TCNN). We demonstrate that TopologyNet outperforms the latest methods in the prediction of protein-ligand binding affinities, mutation induced globular protein folding free energy changes, and mutation induced membrane protein folding free energy changes. Availability: weilab.math.msu.edu/TDL/ PMID:28749969
Can natural proteins designed with 'inverted' peptide sequences adopt native-like protein folds?
Sridhar, Settu; Guruprasad, Kunchur
2014-01-01
We have carried out a systematic computational analysis on a representative dataset of proteins of known three-dimensional structure, in order to evaluate whether it would be possible to 'swap' certain short peptide sequences in naturally occurring proteins with their corresponding 'inverted' peptides and generate 'artificial' proteins that are predicted to retain a native-like protein fold. The analysis of 3,967 representative proteins from the Protein Data Bank revealed 102,677 unique identical inverted peptide sequence pairs that vary in sequence length between 5-12 and 18 amino acid residues. Our analysis illustrates with examples that such 'artificial' proteins may be generated by identifying peptides with 'similar structural environment' and by using comparative protein modeling and validation studies. Our analysis suggests that natural proteins may be tolerant to accommodating such peptides.
Visualization of molecular structures using HoloLens-based augmented reality
Hoffman, MA; Provance, JB
2017-01-01
Biological molecules and biologically active small molecules are complex three dimensional structures. Current flat screen monitors are limited in their ability to convey the full three dimensional characteristics of these molecules. Augmented reality devices, including the Microsoft HoloLens, offer an immersive platform to change how we interact with molecular visualizations. We describe a process to incorporate the three dimensional structures of small molecules and complex proteins into the Microsoft HoloLens using aspirin and the human leukocyte antigen (HLA) as examples. Small molecular structures can be introduced into the HoloStudio application, which provides native support for rotating, resizing and performing other interactions with these molecules. Larger molecules can be imported through the Unity gaming development platform and then Microsoft Visual Developer. The processes described here can be modified to import a wide variety of molecular structures into augmented reality systems and improve our comprehension of complex structural features. PMID:28815109
Analysis of self-assembly of S-layer protein slp-B53 from Lysinibacillus sphaericus.
Liu, Jun; Falke, Sven; Drobot, Bjoern; Oberthuer, Dominik; Kikhney, Alexey; Guenther, Tobias; Fahmy, Karim; Svergun, Dmitri; Betzel, Christian; Raff, Johannes
2017-01-01
The formation of stable and functional surface layers (S-layers) via self-assembly of surface-layer proteins on the cell surface is a dynamic and complex process. S-layers facilitate a number of important biological functions, e.g., providing protection and mediating selective exchange of molecules and thereby functioning as molecular sieves. Furthermore, S-layers selectively bind several metal ions including uranium, palladium, gold, and europium, some of them with high affinity. Most current research on surface layers focuses on investigating crystalline arrays of protein subunits in Archaea and bacteria. In this work, several complementary analytical techniques and methods have been applied to examine structure-function relationships and dynamics for assembly of S-layer protein slp-B53 from Lysinibacillus sphaericus: (1) The secondary structure of the S-layer protein was analyzed by circular dichroism spectroscopy; (2) Small-angle X-ray scattering was applied to gain insights into the three-dimensional structure in solution; (3) The interaction with bivalent cations was followed by differential scanning calorimetry; (4) The dynamics and time-dependent assembly of S-layers were followed by applying dynamic light scattering; (5) The two-dimensional structure of the paracrystalline S-layer lattice was examined by atomic force microscopy. The data obtained provide essential structural insights into the mechanism of S-layer self-assembly, particularly with respect to binding of bivalent cations, i.e., Mg2+ and Ca2+. Furthermore, the results obtained highlight potential applications of S-layers in the fields of micromaterials and nanobiotechnology by providing engineered or individual symmetric thin protein layers, e.g., for protective, antimicrobial, or otherwise functionalized surfaces.
Thermal perturbation correlation of calcium binding Human centrin 3 and its structural changes
NASA Astrophysics Data System (ADS)
Pastrana-Rios, Belinda
2014-07-01
Perturbation-correlation moving-window two-dimensional (PCMW2D) correlation spectroscopy was applied for the determination of the individual transition temperatures of different vibrational modes located within structural components of a calcium binding protein known as Human centrin 3. This crucial information served to understand the contribution individual calcium binding sites made towards the stability of the EF-hand and therefore the protein without the use of probes. We are convinced that the general application of PCMW2D correlation spectroscopy can be applied to the study of proteins in general to ascertain the differences in the stability of structural motifs within proteins and its relationship to the actual transition temperature of unfolding.
Improved in-cell structure determination of proteins at near-physiological concentration
Ikeya, Teppei; Hanashima, Tomomi; Hosoya, Saori; Shimazaki, Manato; Ikeda, Shiro; Mishima, Masaki; Güntert, Peter; Ito, Yutaka
2016-01-01
Investigating three-dimensional (3D) structures of proteins in living cells by in-cell nuclear magnetic resonance (NMR) spectroscopy opens an avenue towards understanding the structural basis of their functions and physical properties under physiological conditions inside cells. In-cell NMR provides data at atomic resolution non-invasively, and has been used to detect protein-protein interactions, thermodynamics of protein stability, the behavior of intrinsically disordered proteins, etc. in cells. However, so far only a single de novo 3D protein structure could be determined based on data derived only from in-cell NMR. Here we introduce methods that enable in-cell NMR protein structure determination for a larger number of proteins at concentrations that approach physiological ones. The new methods comprise (1) advances in the processing of non-uniformly sampled NMR data, which reduces the measurement time for the intrinsically short-lived in-cell NMR samples, (2) automatic chemical shift assignment for obtaining an optimal resonance assignment, and (3) structure refinement with Bayesian inference, which makes it possible to calculate accurate 3D protein structures from sparse data sets of conformational restraints. As an example application we determined the structure of the B1 domain of protein G at about 250 μM concentration in living E. coli cells. PMID:27910948
Meyer, Philippe; Liger, Dominique; Leulliot, Nicolas; Quevillon-Cheruel, Sophie; Zhou, Cong-Zhao; Borel, Franck; Ferrer, Jean-Luc; Poupon, Anne; Janin, Joël; van Tilbeurgh, Herman
2005-12-01
We have determined the three-dimensional crystal structure of the protein encoded by the open reading frame YFL030w from Saccharomyces cerevisiae to a resolution of 2.6 Å using single wavelength anomalous diffraction. YFL030w is a 385 amino-acid protein with sequence similarity to the aminotransferase family. The structure of the protein reveals a homodimer adopting the fold-type I of pyridoxal 5'-phosphate (PLP)-dependent aminotransferases. The PLP co-factor is covalently bound to the active site in the crystal structure. The protein shows close structural resemblance with the human alanine:glyoxylate aminotransferase (EC 2.6.1.44), an enzyme involved in the hereditary kidney stone disease primary hyperoxaluria type 1. In this paper we show that YFL030w codes for an alanine:glyoxylate aminotransferase, highly specific for its amino donor and acceptor substrates.
Cytocompatible and water stable ultrafine protein fibers for tissue engineering
NASA Astrophysics Data System (ADS)
Jiang, Qiuran
This dissertation proposal focuses on the development of cytocompatible and water stable protein ultrafine fibers for tissue engineering. The protein-based ultrafine fibers have the potential to be used for biomedicine, due to their biocompatibility, biodegradability, similarity to natural extracellular matrix (ECM) in physical structure and chemical composition, and superior adsorption properties due to their high surface to volume ratio. However, the current technologies to produce the protein-based ultrafine fibers for biomedical applications still have several problems. For instance, the current electrospinning and phase separation technologies generate scaffolds composed of densely compacted ultrafine fibers, so cells can spread only on the surface of the fiber bulk and hardly penetrate into the inner sections of the scaffolds. Thus, these scaffolds can emulate the ECM only as a two-dimensional basement membrane and can hardly mimic the three-dimensional ECM stroma. Moreover, the protein-based ultrafine fibers do not possess sufficient water stability and strength for biomedical applications, and need modifications such as crosslinking. However, current crosslinking methods are either high in toxicity or low in crosslinking efficiency. To solve the problems mentioned above, zein, collagen, and gelatin were selected as the raw materials to represent plant proteins, animal proteins, and denatured proteins in this dissertation. A benign solvent system was developed specifically for the fabrication of collagen ultrafine fibers. In addition, the gelatin scaffolds with a loose fibrous structure, high cell-accessibility and cell viability were produced by a novel ultralow concentration phase separation method aiming to simulate the structure of three-dimensional (3D) ECM stroma. 
Non-toxic crosslinking methods using citric acid as the crosslinker were also developed for electrospun or phase separated scaffolds from these three proteins, and proved to be efficient to enhance the strength and water stability of scaffolds. The crosslinked protein scaffolds showed higher cytocompatibility than the polylactic acid scaffolds and the fibers crosslinked by glutaraldehyde. The potential of using these protein-based ultrafine fibers crosslinked by citric acid for tissue engineering has been proved in this dissertation.
Zhang, Gaihua; Su, Zhen
2012-01-01
Work on protein structure prediction is very useful in biological research. To evaluate their accuracy, experimental protein structures or their derived data are used as the 'gold standard'. However, as proteins are dynamic molecular machines with structural flexibility such a standard may be unreliable. To investigate the influence of the structure flexibility, we analysed 3,652 protein structures of 137 unique sequences from 24 protein families. The results showed that (1) the three-dimensional (3D) protein structures were not rigid: the root-mean-square deviation (RMSD) of the backbone Cα of structures with identical sequences was relatively large, with the average of the maximum RMSD from each of the 137 sequences being 1.06 Å; (2) the derived data of the 3D structure was not constant, e.g. the highest ratio of the secondary structure wobble site was 60.69%, with the sequence alignments from structural comparisons of two proteins in the same family sometimes being completely different. Proteins may have several stable conformations and the data derived from resolved structures as a 'gold standard' should be optimized before being utilized as criteria to evaluate the prediction methods, e.g. sequence alignment from structural comparison. Helix/β-sheet transition exists in normal free proteins. The coil ratio of the 3D structure could affect its resolution as determined by X-ray crystallography.
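The RMSD values quoted in the abstract above measure the average displacement between matched backbone Cα atoms of two structures. The following minimal sketch shows the calculation only; it assumes the two coordinate sets are already optimally superposed and listed in the same atom order, and it does not perform the superposition step (for example a Kabsch fit) that a structure comparison would normally apply first. The function name `rmsd` and the tuple-based coordinate format are assumptions for illustration.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two matched lists of (x, y, z) points.

    Assumes the structures are already superposed; no fitting is done here.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    total = 0.0
    for (ax, ay, az), (bx, by, bz) in zip(coords_a, coords_b):
        # Squared Euclidean distance between the matched atom pair.
        total += (ax - bx) ** 2 + (ay - by) ** 2 + (az - bz) ** 2
    return math.sqrt(total / len(coords_a))
```

As a quick check, two points each shifted by 1 Å along z give an RMSD of 1 Å: `rmsd([(0, 0, 0), (1, 0, 0)], [(0, 0, 1), (1, 0, 1)])`.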
Fast protein tertiary structure retrieval based on global surface shape similarity.
Sael, Lee; Li, Bin; La, David; Fang, Yi; Ramani, Karthik; Rustamov, Raif; Kihara, Daisuke
2008-09-01
Characterization and identification of similar tertiary structures of proteins provide rich information for investigating function and evolution. The importance of structure similarity searches is increasing as structure databases continue to expand, partly due to the structural genomics projects. A crucial drawback of conventional protein structure comparison methods, which compare structures by their main-chain orientation or the spatial arrangement of secondary structure, is that a database search is too slow to be done in real time. Here we introduce a global surface shape representation by three-dimensional (3D) Zernike descriptors, which represent a protein structure compactly as a series expansion of 3D functions. With this simplified representation, a search against a few thousand structures takes less than a minute. To investigate the agreement between the surface representation defined by the 3D Zernike descriptor and the conventional main-chain-based representation, a benchmark was performed against a protein classification generated by the combinatorial extension algorithm. Despite the different representation, the 3D Zernike descriptor retrieved proteins of the same conformation defined by combinatorial extension in 89.6% of the cases within the top five closest structures. The real-time protein structure search by the 3D Zernike descriptor will open up new possibilities for large-scale global and local protein surface shape comparison. 2008 Wiley-Liss, Inc.
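Once each structure is reduced to a fixed-length descriptor vector, a database search becomes a nearest-neighbour ranking, which is why it can run in real time. The sketch below illustrates only that ranking step. It assumes Euclidean distance as the dissimilarity measure and uses made-up three-number vectors and entry names; computing actual 3D Zernike descriptors is outside its scope.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two descriptor vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def top_k(query, database, k=5):
    """Return the names of the k database entries closest to the query."""
    ranked = sorted(database.items(), key=lambda item: euclidean(query, item[1]))
    return [name for name, _ in ranked[:k]]


# Hypothetical descriptors; real descriptors are much longer vectors.
database = {
    "protA": [1.0, 0.0, 0.5],
    "protB": [0.9, 0.1, 0.5],
    "protC": [0.0, 1.0, 0.0],
}
query = [1.0, 0.0, 0.4]

print(top_k(query, database, k=2))
```

Because no structural alignment is needed at query time, the cost per comparison is a single pass over two short vectors.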
Modeling of protein binary complexes using structural mass spectrometry data
Kamal, J.K. Amisha; Chance, Mark R.
2008-01-01
In this article, we describe a general approach to modeling the structure of binary protein complexes using structural mass spectrometry data combined with molecular docking. In the first step, hydroxyl radical mediated oxidative protein footprinting is used to identify residues that experience conformational reorganization due to binding or participate in the binding interface. In the second step, a three-dimensional atomic structure of the complex is derived by computational modeling. Homology modeling approaches are used to define the structures of the individual proteins if footprinting detects significant conformational reorganization as a function of complex formation. A three-dimensional model of the complex is constructed from these binary partners using the ClusPro program, which is composed of docking, energy filtering, and clustering steps. Footprinting data are used to incorporate constraints—positive and/or negative—in the docking step and are also used to decide the type of energy filter—electrostatics or desolvation—in the successive energy-filtering step. By using this approach, we examine the structure of a number of binary complexes of monomeric actin and compare the results to crystallographic data. Based on docking alone, a number of competing models with widely varying structures are observed, one of which is likely to agree with crystallographic data. When the docking steps are guided by footprinting data, accurate models emerge as top scoring. We demonstrate this method with the actin/gelsolin segment-1 complex. We also provide a structural model for the actin/cofilin complex using this approach which does not have a crystal or NMR structure. PMID:18042684
Crystallization of PTP Domains.
Levy, Colin; Adams, James; Tabernero, Lydia
2016-01-01
Protein crystallography is the most powerful method to obtain atomic resolution information on the three-dimensional structure of proteins. An essential step towards determining the crystallographic structure of a protein is to produce good quality crystals from a concentrated sample of purified protein. These crystals are then used to obtain X-ray diffraction data necessary to determine the 3D structure by direct phasing or molecular replacement if the model of a homologous protein is available. Here, we describe the main approaches and techniques to obtain suitable crystals for X-ray diffraction. We include tools and guidance on how to evaluate and design the protein construct, how to prepare Se-methionine derivatized protein, how to assess the stability and quality of the sample, and how to crystallize and prepare crystals for diffraction experiments. While general strategies for protein crystallization are summarized, specific examples of the application of these strategies to the crystallization of PTP domains are discussed.
Lu, Hui-Meng; Yin, Da-Chuan; Ye, Ya-Jing; Luo, Hui-Min; Geng, Li-Qiang; Li, Hai-Sheng; Guo, Wei-Hong; Shang, Peng
2009-01-01
As the most widely utilized technique to determine the 3-dimensional structure of protein molecules, X-ray crystallography can provide structures of the highest resolution among the developed techniques. The resolution obtained via X-ray crystallography is known to be influenced by many factors, such as crystal quality, diffraction techniques, and X-ray sources. In this paper, we found that the protein sequence could also be one of the factors. We extracted the resolution and sequence of proteins from the Protein Data Bank (PDB), classified the proteins into different clusters according to sequence similarity, and statistically analyzed the relationship between sequence similarity and the best resolution obtained. The results showed that there was a pronounced correlation between sequence similarity and the obtained resolution. These results indicate that the protein structure itself is one variable that may affect resolution when X-ray crystallography is used.
A Library of the Nanoscale Self-Assembly of Amino Acids on Metal Surfaces
NASA Astrophysics Data System (ADS)
Iski, Erin; Yitamben, Esmeralda; Guisinger, Nathan
2012-02-01
The investigation of the hierarchical self-assembly of amino acids on surfaces represents a unique test-bed for the origin of enantio-favoritism in biology and the transmission of chirality from single molecules to complete surface layers. These chiral systems, in particular the assembly of isoleucine and alanine on Cu(111), represent a direct link to the understanding of certain biological processes, specifically the preference for some amino acids to form alpha helices vs. beta-pleated sheets in the secondary structure of proteins. Low temperature, ultra-high vacuum, scanning tunneling microscopy (LT UHV-STM) is used to study the hierarchical self-assembly of different amino acids on a Cu(111) single crystal in an effort to build a library of their two-dimensional structure with molecular-scale resolution for enhanced protein and peptide studies. Both enantiopure and racemic structures are studied in order to elucidate how chirality can affect the self-assembly of the amino acids. In some cases, density functional theory (DFT) models can be used to confirm the experimental structure. The advent of such a library with fully resolved, two-dimensional structures at different molecular coverages would address some of the complex questions surrounding the preferential formation of alpha helices vs. beta-pleated sheets in proteins and lead to a better understanding of the key role played by these amino acids in protein sequencing.
Roles of water in protein structure and function studied by molecular liquid theory.
Imai, Takashi
2009-01-01
The roles of water in the structure and function of proteins have not been completely elucidated. Although molecular simulation has been widely used for the investigation of protein structure and function, it is not always useful for elucidating the roles of water because the effect of water ranges from atomic to thermodynamic level. The three-dimensional reference interaction site model (3D-RISM) theory, which is a statistical-mechanical theory of molecular liquids, can yield the solvation structure at the atomic level and calculate the thermodynamic quantities from the intermolecular potentials. In the last few years, the author and coworkers have succeeded in applying the 3D-RISM theory to protein aqueous solution systems and demonstrated that the theory is useful for investigating the roles of water. This article reviews some of the recent applications and findings, which are concerned with molecular recognition by protein, protein folding, and the partial molar volume of protein which is related to the pressure effect on protein.
Cloning, production, and purification of proteins for a medium-scale structural genomics project.
Quevillon-Cheruel, Sophie; Collinet, Bruno; Trésaugues, Lionel; Minard, Philippe; Henckes, Gilles; Aufrère, Robert; Blondeau, Karine; Zhou, Cong-Zhao; Liger, Dominique; Bettache, Nabila; Poupon, Anne; Aboulfath, Ilham; Leulliot, Nicolas; Janin, Joël; van Tilbeurgh, Herman
2007-01-01
The South-Paris Yeast Structural Genomics Pilot Project (http://www.genomics.eu.org) aims at systematically expressing, purifying, and determining the three-dimensional structures of Saccharomyces cerevisiae proteins. We have already cloned 240 yeast open reading frames in the Escherichia coli pET system. Eighty-two percent of the targets can be expressed in E. coli, and 61% yield soluble protein. We have currently purified 58 proteins. Twelve X-ray structures have been solved, six are in progress, and six other proteins gave crystals. In this chapter, we present the general experimental flowchart applied for this project. One of the main difficulties encountered in this pilot project was the low solubility of a great number of target proteins. We have developed parallel strategies to recover these proteins from inclusion bodies, including refolding, coexpression with chaperones, and an in vitro expression system. A limited proteolysis protocol, developed to localize flexible regions in proteins that could hinder crystallization, is also described.
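The pipeline figures in this abstract can be turned into approximate target counts. The sketch below is arithmetic only. The abstract does not say whether the 61% soluble figure refers to all cloned targets or only to the expressed ones, so both readings are computed, and the variable names are illustrative.

```python
cloned_orfs = 240

# 82% of targets could be expressed in E. coli.
expressed = round(cloned_orfs * 0.82)

# 61% yield soluble protein: two possible readings of the base.
soluble_if_of_all_targets = round(cloned_orfs * 0.61)
soluble_if_of_expressed = round(expressed * 0.61)

purified = 58
solved_structures = 12

purified_fraction = purified / cloned_orfs
solved_fraction = solved_structures / cloned_orfs

print(expressed, soluble_if_of_all_targets, soluble_if_of_expressed)
print(f"purified: {purified_fraction:.1%}, solved: {solved_fraction:.1%}")
```

On either reading, most of the attrition happens before purification, which fits the abstract's emphasis on low solubility as the main difficulty.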
von Grotthuss, Marcin; Plewczynski, Dariusz; Ginalski, Krzysztof; Rychlewski, Leszek; Shakhnovich, Eugene I
2006-02-06
The number of protein structures from structural genomics centers in the Protein Data Bank (PDB) is increasing dramatically. Many of these structures are functionally unannotated because they have no sequence similarity to proteins of known function. However, it is possible to successfully infer function using only structural similarity. Here we present the PDB-UF database, a web-accessible collection of predictions of enzymatic properties based on structure-function relationships. The assignments were conducted for three-dimensional protein structures of unknown function that come from structural genomics initiatives. We show that 4 hypothetical proteins (with PDB accession codes: 1VH0, 1NS5, 1O6D, and 1TO0), for which standard BLAST tools such as PSI-BLAST or RPS-BLAST failed to assign any function, are probably methyltransferase enzymes. We suggest that the structure-based prediction of an EC number should use a different similarity score cutoff for different protein folds. Moreover, performing the annotation using two different algorithms can reduce the rate of false positive assignments. We believe that the presented web-based repository will help to decrease the number of protein structures that have functions marked as "unknown" in the PDB file. http://paradox.harvard.edu/PDB-UF and http://bioinfo.pl/PDB-UF.
Protein crystal growth in space
NASA Technical Reports Server (NTRS)
Bugg, C. E.; Clifford, D. W.
1987-01-01
The advantages of protein crystallization in space, and the applications of protein crystallography to drug design, protein engineering, and the design of synthetic vaccines are examined. The steps involved in using protein crystallography to determine the three-dimensional structure of a protein are discussed. The growth chamber design and the hand-held apparatus developed for protein crystal growth by vapor diffusion techniques (hanging-drop method) are described; the experimental data from the four Shuttle missions are utilized to develop hardware for protein crystal growth in space and to evaluate the effects of gravity on protein crystal growth.
Modeling the formation of cell-matrix adhesions on a single 3D matrix fiber.
Escribano, J; Sánchez, M T; García-Aznar, J M
2015-11-07
Cell-matrix adhesions are crucial in different biological processes such as tissue morphogenesis, cell motility, and extracellular matrix remodeling. These interactions, which link the cell cytoskeleton and matrix fibers, are built through protein clutches, generally known as adhesion complexes. The adhesion formation process has been extensively studied in two-dimensional (2D) cases; however, knowledge is limited for three-dimensional (3D) cases. In this work, we simulate different local extracellular matrix properties in order to unravel the fundamental mechanisms that regulate the formation of cell-matrix adhesions in 3D. We aim to study the mechanical interaction of these biological structures through a three-dimensional discrete approach, reproducing the force transmission pattern between the cytoskeleton and a single extracellular matrix fiber. This numerical model provides a discrete analysis of the proteins involved, including their spatial distribution, the interactions between them, and phenomena such as protein clutch unbinding or protein unfolding. Copyright © 2015 Elsevier Ltd. All rights reserved.
7 Å projection map of the S-layer protein sbpA obtained with trehalose-embedded monolayer crystals.
Norville, Julie E; Kelly, Deborah F; Knight, Thomas F; Belcher, Angela M; Walz, Thomas
2007-12-01
Two-dimensional crystallization on lipid monolayers is a versatile tool to obtain structural information on proteins by electron microscopy. An inherent problem with this approach is to prepare samples in a way that preserves the crystalline order of the protein array and produces specimens that are sufficiently flat for high-resolution data collection at high tilt angles. As a test specimen to optimize the preparation of lipid monolayer crystals for electron microscopy imaging, we used the S-layer protein sbpA, a protein with potential for designing arrays of both biological and inorganic materials with engineered properties for a variety of nanotechnology applications. Sugar embedding is currently considered the best method to prepare two-dimensional crystals of membrane proteins reconstituted into lipid bilayers. We found that using a loop to transfer lipid monolayer crystals to an electron microscopy grid followed by embedding in trehalose and quick-freezing in liquid ethane also yielded the highest resolution images for sbpA lipid monolayer crystals. Using images of specimens prepared in this way we could calculate a projection map of sbpA at 7 Å resolution, one of the highest resolution projection structures obtained with lipid monolayer crystals to date.
The Role of High-Dimensional Diffusive Search, Stabilization, and Frustration in Protein Folding
Rimratchada, Supreecha; McLeish, Tom C.B.; Radford, Sheena E.; Paci, Emanuele
2014-01-01
Proteins are polymeric molecules with many degrees of conformational freedom whose internal energetic interactions are typically screened to small distances. Therefore, in the high-dimensional conformation space of a protein, the energy landscape is locally relatively flat, in contrast to low-dimensional representations, where, because of the induced entropic contribution to the full free energy, it appears funnel-like. Proteins explore the conformation space by searching these flat subspaces to find a narrow energetic alley that we call a hypergutter and then explore the next, lower-dimensional, subspace. Such a framework provides an effective representation of the energy landscape and folding kinetics that does justice to the essential characteristic of high-dimensionality of the search-space. It also illuminates the important role of nonnative interactions in defining folding pathways. This principle is here illustrated using a coarse-grained model of a family of three-helix bundle proteins whose conformations, once secondary structure has formed, can be defined by six rotational degrees of freedom. Two folding mechanisms are possible, one of which involves an intermediate. The stabilization of intermediate subspaces (or states in low-dimensional projection) in protein folding can either speed up or slow down the folding rate depending on the amount of native and nonnative contacts made in those subspaces. The folding rate increases due to reduced-dimension pathways arising from the mere presence of intermediate states, but decreases if the contacts in the intermediate are very stable and introduce sizeable topological or energetic frustration that needs to be overcome. Remarkably, the hypergutter framework, although depending on just a few physically meaningful parameters, can reproduce all the types of experimentally observed curvature in chevron plots for realizations of this fold. PMID:24739172
Defining an essence of structure determining residue contacts in proteins.
Sathyapriya, R; Duarte, Jose M; Stehr, Henning; Filippis, Ioannis; Lappe, Michael
2009-12-01
The network of native non-covalent residue contacts determines the three-dimensional structure of a protein. However, not all contacts are of equal structural significance, and little knowledge exists about a minimal, yet sufficient, subset required to define the global features of a protein. Characterisation of this "structural essence" has remained elusive so far: no algorithmic strategy has been devised to date that could outperform a random selection in terms of 3D reconstruction accuracy (measured as the Cα RMSD). It is not only of theoretical interest (i.e., for design of advanced statistical potentials) to identify the number and nature of essential native contacts; such a subset of spatial constraints is very useful in a number of novel experimental methods (like EPR) which rely heavily on constraint-based protein modelling. To derive accurate three-dimensional models from distance constraints, we implemented a reconstruction pipeline using distance geometry. We selected a test-set of 12 protein structures from the four major SCOP fold classes and performed our reconstruction analysis. As a reference set, series of random subsets (ranging from 10% to 90% of native contacts) are generated for each protein, and the reconstruction accuracy is computed for each subset. We have developed a rational strategy, termed "cone-peeling" that combines sequence features and network descriptors to select minimal subsets that outperform the reference sets. We present, for the first time, a rational strategy to derive a structural essence of residue contacts and provide an estimate of the size of this minimal subset. Our algorithm computes sparse subsets capable of determining the tertiary structure at approximately 4.8 Å Cα RMSD with as little as 8% of the native contacts (Cα-Cα and Cβ-Cβ). At the same time, a randomly chosen subset of native contacts needs about twice as many contacts to reach the same level of accuracy. This "structural essence" opens new avenues in the fields of structure prediction, empirical potentials and docking.
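The random reference sets described in this abstract can be sketched as follows. This is an illustration of the baseline only, not of the "cone-peeling" selection or the distance-geometry reconstruction. The 8 Å Cα-Cα cutoff, the function names, and the toy coordinates are assumptions made for the example, since the abstract does not give the contact definition.

```python
import math
import random


def native_contacts(ca_coords, cutoff=8.0):
    """List residue index pairs (i, j), i < j, whose C-alpha atoms lie within cutoff.

    The 8 angstrom cutoff is an assumed, commonly used value.
    """
    contacts = []
    for i in range(len(ca_coords)):
        for j in range(i + 1, len(ca_coords)):
            if math.dist(ca_coords[i], ca_coords[j]) <= cutoff:
                contacts.append((i, j))
    return contacts


def random_subset(contacts, fraction, seed=0):
    """Draw a reproducible random subset holding the given fraction of contacts."""
    rng = random.Random(seed)
    size = int(fraction * len(contacts))
    return sorted(rng.sample(contacts, size))


# Toy chain: six residues on a straight line, 3.8 angstroms apart (hypothetical).
chain = [(3.8 * i, 0.0, 0.0) for i in range(6)]
contacts = native_contacts(chain)
subset = random_subset(contacts, 0.5)

print(len(contacts), len(subset))
```

In a study of this kind, each subset would then be passed to a reconstruction step and scored by Cα RMSD against the native structure, which is the comparison the selected subsets have to beat.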
Defining an Essence of Structure Determining Residue Contacts in Proteins
Sathyapriya, R.; Duarte, Jose M.; Stehr, Henning; Filippis, Ioannis; Lappe, Michael
2009-01-01
The network of native non-covalent residue contacts determines the three-dimensional structure of a protein. However, not all contacts are of equal structural significance, and little knowledge exists about a minimal, yet sufficient, subset required to define the global features of a protein. Characterisation of this “structural essence” has remained elusive so far: no algorithmic strategy has been devised to date that could outperform a random selection in terms of 3D reconstruction accuracy (measured as the Cα RMSD). It is not only of theoretical interest (i.e., for design of advanced statistical potentials) to identify the number and nature of essential native contacts; such a subset of spatial constraints is very useful in a number of novel experimental methods (like EPR) which rely heavily on constraint-based protein modelling. To derive accurate three-dimensional models from distance constraints, we implemented a reconstruction pipeline using distance geometry. We selected a test-set of 12 protein structures from the four major SCOP fold classes and performed our reconstruction analysis. As a reference set, series of random subsets (ranging from 10% to 90% of native contacts) are generated for each protein, and the reconstruction accuracy is computed for each subset. We have developed a rational strategy, termed “cone-peeling” that combines sequence features and network descriptors to select minimal subsets that outperform the reference sets. We present, for the first time, a rational strategy to derive a structural essence of residue contacts and provide an estimate of the size of this minimal subset. Our algorithm computes sparse subsets capable of determining the tertiary structure at approximately 4.8 Å Cα RMSD with as little as 8% of the native contacts (Cα-Cα and Cβ-Cβ). At the same time, a randomly chosen subset of native contacts needs about twice as many contacts to reach the same level of accuracy. This “structural essence” opens new avenues in the fields of structure prediction, empirical potentials and docking. PMID:19997489
Metamorphic Proteins: Emergence of Dual Protein Folds from One Primary Sequence.
Lella, Muralikrishna; Mahalakshmi, Radhakrishnan
2017-06-20
Every amino acid exhibits a different propensity for distinct structural conformations. Hence, the way in which the primary amino acid sequence undergoes the transition to a defined secondary structure and its final three-dimensional fold is presently considered predictable with reasonable certainty. However, protein sequences that defy the first principles of secondary structure prediction (they attain two different folds) have recently been discovered. Such proteins, aptly named metamorphic proteins, decrease the conformational constraint by increasing flexibility in the secondary structure and thereby result in efficient functionality. In this review, we discuss the major factors driving the conformational switch related both to protein sequence and to structure using illustrative examples. We discuss the concept of an evolutionary transition in sequence and structure, the functional impact of the tertiary fold, and the pressure of intrinsic and external factors that give rise to metamorphic proteins. We mainly focus on the major components of protein architecture, namely, the α-helix and β-sheet segments, which are involved in conformational switching within the same or highly similar sequences. These chameleonic sequences are widespread in both cytosolic and membrane proteins, and these folds are equally important for protein structure and function. We discuss the implications of metamorphic proteins and chameleonic peptide sequences in de novo peptide design.
Protein 3D Structure and Electron Microscopy Map Retrieval Using 3D-SURFER2.0 and EM-SURFER.
Han, Xusi; Wei, Qing; Kihara, Daisuke
2017-12-08
With the rapid growth in the number of solved protein structures stored in the Protein Data Bank (PDB) and the Electron Microscopy Data Bank (EMDB), it is essential to develop tools to perform real-time structure similarity searches against the entire structure database. Since conventional structure alignment methods need to sample different orientations of proteins in the three-dimensional space, they are time consuming and unsuitable for rapid, real-time database searches. To this end, we have developed 3D-SURFER and EM-SURFER, which utilize 3D Zernike descriptors (3DZD) to conduct high-throughput protein structure comparison, visualization, and analysis. Taking an atomic structure or an electron microscopy map of a protein or a protein complex as input, the 3DZD of a query protein is computed and compared with the 3DZD of all other proteins in PDB or EMDB. In addition, local geometrical characteristics of a query protein can be analyzed using VisGrid and LIGSITE CSC in 3D-SURFER. This article describes how to use 3D-SURFER and EM-SURFER to carry out protein surface shape similarity searches, local geometric feature analysis, and interpretation of the search results. © 2017 by John Wiley & Sons, Inc.
Supra-domains: evolutionary units larger than single protein domains.
Vogel, Christine; Berzuini, Carlo; Bashton, Matthew; Gough, Julian; Teichmann, Sarah A
2004-02-20
Domains are the evolutionary units that comprise proteins, and most proteins are built from more than one domain. Domains can be shuffled by recombination to create proteins with new arrangements of domains. Using structural domain assignments, we examined the combinations of domains in the proteins of 131 completely sequenced organisms. We found two-domain and three-domain combinations that recur in different protein contexts with different partner domains. The domains within these combinations have a particular functional and spatial relationship. These units are larger than individual domains and we term them "supra-domains". Amongst the supra-domains, we identified some 1400 (1203 two-domain and 166 three-domain) combinations that are statistically significantly over-represented relative to the occurrence and versatility of the individual component domains. Over one-third of all structurally assigned multi-domain proteins contain these over-represented supra-domains. This means that investigation of the structural and functional relationships of the domains forming these popular combinations would be particularly useful for an understanding of multi-domain protein function and evolution as well as for genome annotation. These and other supra-domains were analysed for their versatility, duplication, their distribution across the three kingdoms of life and their functional classes. By examining the three-dimensional structures of several examples of supra-domains in different biological processes, we identify two basic types of spatial relationships between the component domains: the combined function of the two domains is such that either the geometry of the two domains is crucial and there is a tight constraint on the interface, or the precise orientation of the domains is less important and they are spatially separate. Frequently, the role of the supra-domain becomes clear only once the three-dimensional structure is known. 
Since this is the case for only a quarter of the supra-domains, we provide a list of the most important unknown supra-domains as potential targets for structural genomics projects.
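The first step described in this abstract, finding domain combinations that recur with different partner domains, can be sketched as a counting exercise over domain architectures. The code below is a simplified illustration: it handles adjacent two-domain combinations only and omits the statistical over-representation test. The protein and domain names are hypothetical.

```python
from collections import defaultdict


def adjacent_pair_contexts(proteins):
    """Map each adjacent domain pair to the set of (left, right) flanking domains.

    `proteins` maps a protein name to its ordered list of domain families.
    A missing neighbour at either terminus is recorded as None.
    """
    contexts = defaultdict(set)
    for domains in proteins.values():
        for i in range(len(domains) - 1):
            pair = (domains[i], domains[i + 1])
            left = domains[i - 1] if i > 0 else None
            right = domains[i + 2] if i + 2 < len(domains) else None
            contexts[pair].add((left, right))
    return contexts


def recurring_pairs(proteins, min_contexts=2):
    """Pairs seen in at least min_contexts distinct flanking contexts."""
    contexts = adjacent_pair_contexts(proteins)
    return sorted(pair for pair, seen in contexts.items() if len(seen) >= min_contexts)


# Hypothetical domain architectures.
proteins = {
    "p1": ["A", "B", "C"],
    "p2": ["D", "A", "B"],
    "p3": ["A", "B", "C"],
    "p4": ["C", "E"],
}

print(recurring_pairs(proteins))
```

Here the pair A-B appears both before C and after D, so it is the only candidate; a full analysis would next ask whether such a pair occurs more often than the abundance and versatility of its component domains would predict.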
3D structure of eukaryotic flagella/cilia by cryo-electron tomography.
Ishikawa, Takashi
2013-01-01
Flagella/cilia are motile organelles with more than 400 proteins. To understand the mechanism of such complex systems, we need methods to describe molecular arrangements and conformations three-dimensionally in vivo. Cryo-electron tomography has enabled such 3D structural analysis. Our group has been working on the 3D structure of flagella/cilia using this method and has revealed a highly ordered and beautifully organized molecular arrangement. The 3D structure gave us insights into the mechanism that generates bending motion with well-defined waveforms. In this review, I summarize our recent structural studies on flagella/cilia by cryo-electron tomography, mainly focusing on dynein microtubule-based ATPase motor proteins and the radial spoke, a regulatory protein complex.
3D structure of eukaryotic flagella/cilia by cryo-electron tomography
Ishikawa, Takashi
2013-01-01
Flagella/cilia are motile organelles with more than 400 proteins. To understand the mechanism of such complex systems, we need methods to describe molecular arrangements and conformations three-dimensionally in vivo. Cryo-electron tomography has enabled such 3D structural analysis. Our group has been working on the 3D structure of flagella/cilia using this method and has revealed a highly ordered and beautifully organized molecular arrangement. The 3D structure gave us insights into the mechanism that generates bending motion with well-defined waveforms. In this review, I summarize our recent structural studies on flagella/cilia by cryo-electron tomography, mainly focusing on dynein microtubule-based ATPase motor proteins and the radial spoke, a regulatory protein complex. PMID:27493552
Zook, James D.; Molugu, Trivikram R.; Jacobsen, Neil E.; Lin, Guangxin; Soll, Jürgen; Cherry, Brian R.; Brown, Michael F.; Fromme, Petra
2013-01-01
Solving high-resolution structures for membrane proteins continues to be a daunting challenge in the structural biology community. In this study we report our high-resolution NMR results for a transmembrane protein, outer envelope protein of molar mass 16 kDa (OEP16), an amino acid transporter from the outer membrane of chloroplasts. Three-dimensional, high-resolution NMR experiments on the 13C, 15N, 2H-triply-labeled protein were used to assign protein backbone resonances and to obtain secondary structure information. The results yield over 95% assignment of N, HN, CO, Cα, and Cβ chemical shifts, which is essential for obtaining a high-resolution structure from NMR data. Chemical shift analysis of the assignment data provides the first experimental evidence for the location of the secondary structure elements on a per-residue basis. In addition, T1Z and T2 relaxation experiments were performed in order to better understand the protein dynamics. Arginine titration experiments yield insight into the amino acid residues responsible for protein transporter function. The results provide the necessary basis for high-resolution structural determination of this important plant membrane protein. PMID:24205117
Fantini, Jacques; Garmy, Nicolas; Yahi, Nouara
2006-09-12
Protein-glycolipid interactions mediate the attachment of various pathogens to the host cell surface as well as the association of numerous cellular proteins with lipid rafts. Thus, it is of primary importance to identify the protein domains involved in glycolipid recognition. Using structure similarity searches, we could identify a common glycolipid-binding domain in the three-dimensional structure of several proteins known to interact with lipid rafts. Yet the three-dimensional structure of most raft-targeted proteins is still unknown. In the present study, we have identified a glycolipid-binding domain in the amino acid sequence of a bacterial adhesin (Helicobacter pylori adhesin A, HpaA). The prediction was based on the major properties of the glycolipid-binding domains previously characterized by structural searches. A short (15-mer) synthetic peptide corresponding to this putative glycolipid-binding domain was synthesized, and we studied its interaction with glycolipid monolayers at the air-water interface. The synthetic HpaA peptide recognized LacCer but not Gb3. This glycolipid specificity was in line with that of the whole bacterium. Molecular modeling studies gave some insights into this high selectivity of interaction. They also suggested that Phe147 in HpaA played a key role in LacCer recognition, through sugar-aromatic CH-pi stacking interactions with the hydrophobic side of the galactose ring of LacCer. Correspondingly, the replacement of Phe147 with Ala strongly affected LacCer recognition, whereas substitution with Trp did not. Our method could be used to identify glycolipid-binding domains in microbial and cellular proteins interacting with lipid shells, rafts, and other specialized membrane microdomains.
Directed self-assembly of proteins into discrete radial patterns
Thakur, Garima; Prashanthi, Kovur; Thundat, Thomas
2013-01-01
Unlike physical patterning of materials at the nanometer scale, manipulating soft matter such as biomolecules into patterns is still in its infancy. A self-assembled monolayer (SAM) with a surface density gradient can drive biomolecules in specific directions to create hierarchical and discrete structures. Here, we report on a two-step process of self-assembly of the human serum albumin (HSA) protein into discrete ring structures based on the density gradient of the SAM. The methodology involves first creating two-dimensional (2D) polyethylene glycol (PEG) islands with responsive carboxyl functionalities. Incubation of proteins on such pre-patterned surfaces results in direct self-assembly of protein molecules around the PEG islands. Immobilization and adsorption of protein on such structures evolve over time into the self-assembled patterns. PMID:23719678
Shao, W; Fernandez, E; Sachpatzidis, A; Wilken, J; Thompson, D A; Schweitzer, B I; Lolis, E
2001-05-01
Human herpesvirus-8 (HHV-8) is the infectious agent responsible for Kaposi's sarcoma and encodes a protein, macrophage inflammatory protein-II (vMIP-II), which shows sequence similarity to the human CC chemokines. vMIP-II has broad receptor specificity that crosses chemokine receptor subfamilies, and inhibits HIV-1 viral entry mediated by numerous chemokine receptors. In this study, the solution structure of chemically synthesized vMIP-II was determined by nuclear magnetic resonance. The protein is a monomer and possesses the chemokine fold consisting of a flexible N-terminus, three antiparallel beta strands, and a C-terminal alpha helix. Except for the N-terminal residues (residues 1-13) and the last two C-terminal residues (residues 73-74), the structure of vMIP-II is well-defined, exhibiting average rmsd of 0.35 and 0.90 Å for the backbone heavy atoms and all heavy atoms of residues 14-72, respectively. Taking into account the sequence differences between the various CC chemokines and comparing their three-dimensional structures allows us to implicate residues that influence the quaternary structure and receptor binding and activation of these proteins in solution. The analysis of the sequence and three-dimensional structure of vMIP-II indicates the presence of epitopes involved in binding two receptors, CCR2 and CCR5. We propose that vMIP-II was initially specific for CCR5 and acquired receptor-binding properties to CCR2 and other chemokine receptors.
Sequence diagrams and the presentation of structural and evolutionary relationships among proteins.
Thomas, B R
1975-01-01
Protein sequences mapped on two-dimensional diagrams show characteristic patterns that should be of value in visualising sequence information and in distinguishing simpler structures. A convenient map form for comparative purposes is the alpha-helix diagram with amino acid distribution analogous to the surface of an alpha-helix oriented so that an alpha-helix structure corresponds on the diagram to a vertical band 3.6 residues wide. The sequence diagram for an alpha-keratin, high-sulphur protein suggests a new form of polypeptide helix based on a repeating unit of five which may be an important component of alpha-keratin fibres.
Use of conserved key amino acid positions to morph protein folds.
Reddy, Boojala V B; Li, Wilfred W; Bourne, Philip E
2002-07-15
By using three-dimensional (3D) structure alignments and a previously published method to determine Conserved Key Amino Acid Positions (CKAAPs) we propose a theoretical method to design mutations that can be used to morph the protein folds. The original Paracelsus challenge, met by several groups, called for the engineering of a stable but different structure by modifying less than 50% of the amino acid residues. We have used the sequences from the Protein Data Bank (PDB) identifiers 1ROP, and 2CRO, which were previously used in the Paracelsus challenge by those groups, and suggest mutation to CKAAPs to morph the protein fold. The total number of mutations suggested is less than 40% of the starting sequence theoretically improving the challenge results. From secondary structure prediction experiments of the proposed mutant sequence structures, we observe that each of the suggested mutant protein sequences likely folds to a different, non-native potentially stable target structure. These results are an early indicator that analyses using structure alignments leading to CKAAPs of a given structure are of value in protein engineering experiments. Copyright 2002 Wiley Periodicals, Inc.
Optimal contact definition for reconstruction of contact maps.
Duarte, Jose M; Sathyapriya, Rajagopal; Stehr, Henning; Filippis, Ioannis; Lappe, Michael
2010-05-27
Contact maps have been extensively used as a simplified representation of protein structures. They capture most important features of a protein's fold, being preferred by a number of researchers for the description and study of protein structures. Inspired by the model's simplicity many groups have dedicated a considerable amount of effort towards contact prediction as a proxy for protein structure prediction. However a contact map's biological interest is subject to the availability of reliable methods for the 3-dimensional reconstruction of the structure. We use an implementation of the well-known distance geometry protocol to build realistic protein 3-dimensional models from contact maps, performing an extensive exploration of many of the parameters involved in the reconstruction process. We try to address the questions: a) to what accuracy does a contact map represent its corresponding 3D structure, b) what is the best contact map representation with regard to reconstructability and c) what is the effect of partial or inaccurate contact information on the 3D structure recovery. Our results suggest that contact maps derived from the application of a distance cutoff of 9 to 11 Å around the Cbeta atoms constitute the most accurate representation of the 3D structure. The reconstruction process does not provide a single solution to the problem but rather an ensemble of conformations that are within 2 Å RMSD of the crystal structure and with lower values for the pairwise average ensemble RMSD. Interestingly it is still possible to recover a structure with partial contact information, although wrong contacts can lead to dramatic loss in reconstruction fidelity. Thus contact maps represent a valid approximation to the structures with an accuracy comparable to that of experimental methods.
The optimal contact definitions constitute key guidelines for methods based on contact maps such as structure prediction through contacts and structural alignments based on maximum contact map overlap.
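As an illustration of the contact definition discussed in this abstract, the following minimal sketch builds a contact map from Cβ coordinates with a distance cutoff. The function name `contact_map`, the `min_seq_sep` parameter and the toy coordinates are assumptions made for the example; they are not taken from the authors' software, and the distance-geometry reconstruction step is not shown.

```python
import math

def contact_map(coords, cutoff=9.0, min_seq_sep=1):
    """Return a symmetric boolean contact map.

    coords      -- list of (x, y, z) tuples, one per residue (e.g. Cβ atoms), in Å
    cutoff      -- distance cutoff in Å (the abstract reports 9 to 11 Å as optimal)
    min_seq_sep -- minimum sequence separation for a pair to be considered
                   (hypothetical parameter, not from the abstract)
    """
    n = len(coords)
    cmap = [[False] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + min_seq_sep, n):
            # Two residues are in contact when their atoms lie within the cutoff.
            if math.dist(coords[i], coords[j]) <= cutoff:
                cmap[i][j] = cmap[j][i] = True
    return cmap

# Hypothetical Cβ coordinates (Å) for a four-residue toy chain.
toy_cb = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (20.0, 0.0, 0.0)]
toy_map = contact_map(toy_cb, cutoff=9.0)
```

In this toy chain the first three residues are mutually in contact, while the fourth lies beyond the cutoff from all others. Raising `cutoff` toward 11 Å adds contacts; which value reconstructs best is the question the abstract addresses.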
Optimal contact definition for reconstruction of Contact Maps
2010-01-01
Background Contact maps have been extensively used as a simplified representation of protein structures. They capture most important features of a protein's fold, being preferred by a number of researchers for the description and study of protein structures. Inspired by the model's simplicity many groups have dedicated a considerable amount of effort towards contact prediction as a proxy for protein structure prediction. However a contact map's biological interest is subject to the availability of reliable methods for the 3-dimensional reconstruction of the structure. Results We use an implementation of the well-known distance geometry protocol to build realistic protein 3-dimensional models from contact maps, performing an extensive exploration of many of the parameters involved in the reconstruction process. We try to address the questions: a) to what accuracy does a contact map represent its corresponding 3D structure, b) what is the best contact map representation with regard to reconstructability and c) what is the effect of partial or inaccurate contact information on the 3D structure recovery. Our results suggest that contact maps derived from the application of a distance cutoff of 9 to 11Å around the Cβ atoms constitute the most accurate representation of the 3D structure. The reconstruction process does not provide a single solution to the problem but rather an ensemble of conformations that are within 2Å RMSD of the crystal structure and with lower values for the pairwise average ensemble RMSD. Interestingly it is still possible to recover a structure with partial contact information, although wrong contacts can lead to dramatic loss in reconstruction fidelity. Conclusions Thus contact maps represent a valid approximation to the structures with an accuracy comparable to that of experimental methods. 
The optimal contact definitions constitute key guidelines for methods based on contact maps such as structure prediction through contacts and structural alignments based on maximum contact map overlap. PMID:20507547
Characterization of structural proteins of hirame rhabdovirus, HRV
Nishizawa, Toyohiko; Yoshimizu, Mamoru; Winton, James; Ahne, Winfried; Kimura, Takahisa
1991-01-01
Structural proteins of hirame rhabdovirus (HRV) were analyzed by SDS-polyacrylamide gel electrophoresis, western blotting, 2-dimensional gel electrophoresis, and Triton X-100 treatment. Purified HRV virions were composed of: polymerase (L), glycoprotein (G), nucleoprotein (N), and 2 matrix proteins (M1 and M2). Based upon their relative mobilities, the estimated molecular weights of the proteins were: L, 156 kDa; G, 68 kDa; N, 46.4 kDa; M1, 26.4 kDa; and M2, 19.9 kDa. The electrophoretic pattern formed by the structural proteins of HRV was clearly different from that formed by pike fry rhabdovirus, spring viremia of carp virus, eel virus of America, and eel virus European X which belong to the Vesiculovirus genus; however, it resembled the pattern formed by structural proteins of viral hemorrhagic septicemia virus (VHSV) and infectious hematopoietic necrosis virus (IHNV) which are members of the Lyssavirus genus. Among HRV, IHNV, and VHSV, differences were observed in the relative mobilities of the G, N, M1, and M2 proteins. Western blot analysis revealed that the G, N, and M2 proteins of HRV shared antigenic determinants with IHNV and VHSV, but not with any of the 4 fish vesiculoviruses tested. Cross-reactions between the M1 proteins of HRV, IHNV, or VHSV were not detected in this assay. Two-dimensional gel electrophoresis was used to show that HRV differed from IHNV or VHSV in the isoelectric point (pI) of the M1 and M2 proteins. In this system, 2 forms of the M1 protein of HRV and IHNV were observed. These subspecies of M1 had the same relative mobility but different pI values. Treatment of purified virions with 2% Triton X-100 in Tris buffer containing NaCl removed the G, M1, and M2 proteins of IHNV, but HRV virions were more stable under these conditions.
An ambiguity principle for assigning protein structural domains.
Postic, Guillaume; Ghouzam, Yassine; Chebrek, Romain; Gelly, Jean-Christophe
2017-01-01
Ambiguity is the quality of being open to several interpretations. For an image, it arises when the contained elements can be delimited in two or more distinct ways, which may cause confusion. We postulate that it also applies to the analysis of protein three-dimensional structure, which consists in dividing the molecule into subunits called domains. Because different definitions of what constitutes a domain can be used to partition a given structure, the same protein may have different but equally valid domain annotations. However, knowledge and experience generally displace our ability to accept more than one way to decompose the structure of an object, in this case a protein. This human bias in structure analysis is particularly harmful because it leads to ignoring potential avenues of research. We present an automated method capable of producing multiple alternative decompositions of protein structure (web server and source code available at www.dsimb.inserm.fr/sword/). Our innovative algorithm assigns structural domains through the hierarchical merging of protein units, which are evolutionarily preserved substructures that describe protein architecture at an intermediate level, between domain and secondary structure. To validate the use of these protein units for decomposing protein structures into domains, we set up an extensive benchmark made of expert annotations of structural domains and including state-of-the-art domain parsing algorithms. The relevance of our "multipartitioning" approach is shown through numerous examples of applications covering protein function, evolution, folding, and structure prediction. Finally, we introduce a measure for the structural ambiguity of protein molecules.
Bioinformatics and variability in drug response: a protein structural perspective
Lahti, Jennifer L.; Tang, Grace W.; Capriotti, Emidio; Liu, Tianyun; Altman, Russ B.
2012-01-01
Marketed drugs frequently perform worse in clinical practice than in the clinical trials on which their approval is based. Many therapeutic compounds are ineffective for a large subpopulation of patients to whom they are prescribed; worse, a significant fraction of patients experience adverse effects more severe than anticipated. The unacceptable risk–benefit profile for many drugs mandates a paradigm shift towards personalized medicine. However, prior to adoption of patient-specific approaches, it is useful to understand the molecular details underlying variable drug response among diverse patient populations. Over the past decade, progress in structural genomics led to an explosion of available three-dimensional structures of drug target proteins while efforts in pharmacogenetics offered insights into polymorphisms correlated with differential therapeutic outcomes. Together these advances provide the opportunity to examine how altered protein structures arising from genetic differences affect protein–drug interactions and, ultimately, drug response. In this review, we first summarize structural characteristics of protein targets and common mechanisms of drug interactions. Next, we describe the impact of coding mutations on protein structures and drug response. Finally, we highlight tools for analysing protein structures and protein–drug interactions and discuss their application for understanding altered drug responses associated with protein structural variants. PMID:22552919
Li de La Sierra-Gallay, Ines; Collinet, Bruno; Graille, Marc; Quevillon-Cheruel, Sophie; Liger, Dominique; Minard, Philippe; Blondeau, Karine; Henckes, Gilles; Aufrère, Robert; Leulliot, Nicolas; Zhou, Cong-Zhao; Sorel, Isabelle; Ferrer, Jean-Luc; Poupon, Anne; Janin, Joël; van Tilbeurgh, Herman
2004-03-01
The protein product of the YGR205w gene of Saccharomyces cerevisiae was targeted as part of our yeast structural genomics project. YGR205w codes for a small (290 amino acids) protein with unknown structure and function. The only recognizable sequence feature is the presence of a Walker A motif (P loop) indicating a possible nucleotide binding/converting function. We determined the three-dimensional crystal structure of Se-methionine substituted protein using multiple anomalous diffraction. The structure revealed a well known mononucleotide fold and strong resemblance to the structure of small metabolite phosphorylating enzymes such as pantothenate and phosphoribulo kinase. Biochemical experiments show that YGR205w binds specifically ATP and, less tightly, ADP. The structure also revealed the presence of two bound sulphate ions, occupying opposite niches in a canyon that corresponds to the active site of the protein. One sulphate is bound to the P-loop in a position that corresponds to the position of beta-phosphate in mononucleotide protein ATP complex, suggesting the protein is indeed a kinase. The nature of the phosphate accepting substrate remains to be determined. Copyright 2004 Wiley-Liss, Inc.
From protein sequence to dynamics and disorder with DynaMine.
Cilia, Elisa; Pancsa, Rita; Tompa, Peter; Lenaerts, Tom; Vranken, Wim F
2013-01-01
Protein function and dynamics are closely related; however, accurate dynamics information is difficult to obtain. Here based on a carefully assembled data set derived from experimental data for proteins in solution, we quantify backbone dynamics properties on the amino-acid level and develop DynaMine--a fast, high-quality predictor of protein backbone dynamics. DynaMine uses only protein sequence information as input and shows great potential in distinguishing regions of different structural organization, such as folded domains, disordered linkers, molten globules and pre-structured binding motifs of different sizes. It also identifies disordered regions within proteins with an accuracy comparable to the most sophisticated existing predictors, without depending on prior disorder knowledge or three-dimensional structural information. DynaMine provides molecular biologists with an important new method that grasps the dynamical characteristics of any protein of interest, as we show here for human p53 and E1A from human adenovirus 5.
Yu, Isseki; Mori, Takaharu; Ando, Tadashi; Harada, Ryuhei; Jung, Jaewoon; Sugita, Yuji; Feig, Michael
2016-11-01
Biological macromolecules function in highly crowded cellular environments. The structure and dynamics of proteins and nucleic acids are well characterized in vitro, but in vivo crowding effects remain unclear. Using molecular dynamics simulations of a comprehensive atomistic model cytoplasm we found that protein-protein interactions may destabilize native protein structures, whereas metabolite interactions may induce more compact states due to electrostatic screening. Protein-protein interactions also resulted in significant variations in reduced macromolecular diffusion under crowded conditions, while metabolites exhibited significant two-dimensional surface diffusion and altered protein-ligand binding that may reduce the effective concentration of metabolites and ligands in vivo. Metabolic enzymes showed weak non-specific association in cellular environments attributed to solvation and entropic effects. These effects are expected to have broad implications for the in vivo functioning of biomolecules. This work is a first step towards physically realistic in silico whole-cell models that connect molecular with cellular biology.
Alignment hierarchies: engineering architecture from the nanometre to the micrometre scale.
Kureshi, Alvena; Cheema, Umber; Alekseeva, Tijna; Cambrey, Alison; Brown, Robert
2010-12-06
Natural tissues are built of metabolites, soluble proteins and solid extracellular matrix components (largely fibrils) together with cells. These are configured in highly organized hierarchies of structure across length scales from nanometre to millimetre, with alignments that are dominated by anisotropies in their fibrillar matrix. If we are to successfully engineer tissues, these hierarchies need to be mimicked with an understanding of the interaction between them. In particular, the movement of different elements of the tissue (e.g. molecules, cells and bulk fluids) is controlled by matrix structures at distinct scales. We present three novel systems to introduce alignment of collagen fibrils, cells and growth factor gradients within a three-dimensional collagen scaffold using fluid flow, embossing and layering of construct. Importantly, these can be seen as different parts of the same hierarchy of three-dimensional structure, as they are all formed into dense collagen gels. Fluid flow aligns collagen fibrils at the nanoscale, embossed topographical features provide alignment cues at the microscale and introducing layered configuration to three-dimensional collagen scaffolds provides microscale- and mesoscale-aligned pathways for protein factor delivery as well as barriers to confine protein diffusion to specific spatial directions. These seemingly separate methods can be employed to increase complexity of simple extracellular matrix scaffolds, providing insight into new approaches to directly fabricate complex physical and chemical cues at different hierarchical scales, similar to those in natural tissues.
NASA Astrophysics Data System (ADS)
Schultz, Christian P.; Bârzu, Octavian; Mantsch, Henry H.
2000-03-01
The functional role of CMP kinases is to regenerate mono-phosphate nucleotides in cells by transferring phosphate residues from tri-phosphorylated nucleotides to monophosphorylated nucleotides. These enzymes possess two binding sites and maintain a highly conserved secondary structure. They are essential for cell survival. Herein we compare the infrared spectra of two similar, but not identical enzymes, the CMP kinases from Escherichia coli and Bacillus subtilis. A two-dimensional cross correlation analysis of the infrared spectra reveals differences in the denaturation behavior of the two proteins. Different secondary structure elements show different time-delayed or advanced unfolding events in the two enzymes. When bound to the active sites, the two nucleotide-substrates CMP and ATP exert a stabilizing effect on the structure of both proteins. The changes observed upon thermal denaturation are different for the two enzymes. Model 2D correlations are used to simulate the different denaturation of the two enzymes. Thermal denaturation and aggregation can be distinguished as two processes separated in time.
Pillai, Harikrishna; Yadav, Brijesh Singh; Chaturvedi, Navaneet; Jan, Arif Tasleem; Gupta, Girish Kumar; Baig, Mohammad Hassan; Bhure, Sanjeev Kumar
2017-01-01
Regucalcin (RGN), a calcium regulating protein having anti-prolific, antiapoptotic functions, plays an important part in the biosynthesis of ascorbic acid. It is a highly conserved protein that has been reported from many tissue types of various vertebrate species. Because RGN regulates enzyme activities through reaction with the sulfhydryl group (-SH) and calcium, a structural-level study, believed to offer a better understanding of the binding properties and regulatory mechanisms of RGN, was performed. Using a sample from the testis of Bubalus bubalis, the amplified regucalcin (RGN) gene was characterized by digestion with different restriction endonucleases (RE). Alongside, cDNA was cloned into the pPICZαC vector and transformed into the DH5α host for custom sequencing. To get a better insight into its structural characteristics, a three-dimensional (3D) structure of the protein sequence was generated using an in silico molecular modelling approach. The full trajectory analysis of the structure was achieved by Molecular Dynamics (MD), which explains the stability, flexibility and robustness of the protein during a 50 ns simulation. Molecular docking against 1,5-anhydrosorbitol was performed for functional characterization of RGN. Preliminary screening of amplified products on agarose gel showed the expected size of ~893 bp for the PCR product corresponding to RGN. Following sequencing, a BLASTp search of the target sequence revealed that it shares a 91% similarity score with human senescence marker protein-30 (pdb id: 3G4E). Molecular docking of 1,5-anhydrosorbitol reveals information regarding important binding site residues of RGN. 1,5-anhydrosorbitol was found to interact with a binding free energy of -6.01 kcal/mol. RMSD calculation of subunits A, B and D-F might be responsible for functional and conserved regions of modeled protein.
The three-dimensional structure of RGN was generated, and its interactions with 1,5-anhydrosorbitol demonstrate the role of key binding residues. Until now, no structural details were available for buffalo RGN proteins; hence this study will broaden the horizon towards understanding the structural and functional aspects of different proteins in cattle. Copyright © Bentham Science Publishers.
Horejs, Christine; Pum, Dietmar; Sleytr, Uwe B; Peterlik, Herwig; Jungbauer, Alois; Tscheliessnig, Rupert
2010-11-07
Surface layers (S-layers) are the most commonly observed cell surface structure of prokaryotic organisms. They are made up of proteins that spontaneously self-assemble into functional crystalline lattices in solution, on various solid surfaces, and interfaces. While classical experimental techniques failed to recover a complete structural model of an unmodified S-layer protein, small angle x-ray scattering (SAXS) provides an opportunity to study the structure of S-layer monomers in solution and of self-assembled two-dimensional sheets. For the protein under investigation we recently suggested an atomistic structural model by the use of molecular dynamics simulations. This structural model is now refined on the basis of SAXS data together with a fractal assembly approach. Here we show that a nondiluted critical system of proteins, which crystallize into monomolecular structures, might be analyzed by SAXS if protein-protein interactions are taken into account by relating a fractal local density distribution to a fractal local mean potential, which has to fulfill the Poisson equation. The present work demonstrates an important step into the elucidation of the structure of S-layers and offers a tool to analyze the structure of self-assembling systems in solution by means of SAXS and computer simulations.
BAYESIAN PROTEIN STRUCTURE ALIGNMENT.
Rodriguez, Abel; Schmidler, Scott C
The analysis of the three-dimensional structure of proteins is an important topic in molecular biochemistry. Structure plays a critical role in defining the function of proteins and is more strongly conserved than amino acid sequence over evolutionary timescales. A key challenge is the identification and evaluation of structural similarity between proteins; such analysis can aid in understanding the role of newly discovered proteins and help elucidate evolutionary relationships between organisms. Computational biologists have developed many clever algorithmic techniques for comparing protein structures; however, all are based on heuristic optimization criteria, making statistical interpretation somewhat difficult. Here we present a fully probabilistic framework for pairwise structural alignment of proteins. Our approach has several advantages, including the ability to capture alignment uncertainty and to estimate key "gap" parameters which critically affect the quality of the alignment. We show that several existing alignment methods arise as maximum a posteriori estimates under specific choices of prior distributions and error models. Our probabilistic framework is also easily extended to incorporate additional information, which we demonstrate by including primary sequence information to generate simultaneous sequence-structure alignments that can resolve ambiguities obtained using structure alone. This combined model also provides a natural approach for the difficult task of estimating evolutionary distance based on structural alignments. The model is illustrated by comparison with well-established methods on several challenging protein alignment examples.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Horejs, Christine; Pum, Dietmar; Sleytr, Uwe B.
2010-11-07
Surface layers (S-layers) are the most commonly observed cell surface structure of prokaryotic organisms. They are made up of proteins that spontaneously self-assemble into functional crystalline lattices in solution, on various solid surfaces, and interfaces. While classical experimental techniques failed to recover a complete structural model of an unmodified S-layer protein, small angle x-ray scattering (SAXS) provides an opportunity to study the structure of S-layer monomers in solution and of self-assembled two-dimensional sheets. For the protein under investigation we recently suggested an atomistic structural model by the use of molecular dynamics simulations. This structural model is now refined on the basis of SAXS data together with a fractal assembly approach. Here we show that a nondiluted critical system of proteins, which crystallize into monomolecular structures, might be analyzed by SAXS if protein-protein interactions are taken into account by relating a fractal local density distribution to a fractal local mean potential, which has to fulfill the Poisson equation. The present work demonstrates an important step into the elucidation of the structure of S-layers and offers a tool to analyze the structure of self-assembling systems in solution by means of SAXS and computer simulations.
Rydzewski, J; Nowak, W
2016-04-12
In this work we propose an application of a nonlinear dimensionality reduction method to represent the high-dimensional configuration space of the ligand-protein dissociation process in a manner facilitating interpretation. Rugged ligand expulsion paths are mapped into 2-dimensional space. The mapping retains the main structural changes occurring during the dissociation. The topological similarity of the reduced paths may be easily studied using the Fréchet distances, and we show that this measure facilitates machine learning classification of the diffusion pathways. Further, low-dimensional configuration space allows for identification of residues active in transport during the ligand diffusion from a protein. The utility of this approach is illustrated by examination of the configuration space of cytochrome P450cam involved in expulsing camphor by means of enhanced all-atom molecular dynamics simulations. The expulsion trajectories are sampled and constructed on-the-fly during molecular dynamics simulations using the recently developed memetic algorithms [Rydzewski, J.; Nowak, W. J. Chem. Phys. 2015, 143 (12), 124101]. We show that the memetic algorithms are effective for enforcing the ligand diffusion and cavity exploration in the P450cam-camphor complex. Furthermore, we demonstrate that machine learning techniques are helpful in inspecting ligand diffusion landscapes and provide useful tools to examine structural changes accompanying rare events.
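To make the path-similarity measure mentioned in this abstract concrete, here is a minimal sketch of the discrete Fréchet distance between two polylines, computed by dynamic programming. The abstract does not say which Fréchet variant or implementation the authors used, so the discrete form, the function name `discrete_frechet` and the toy 2D paths are assumptions for illustration only.

```python
import math

def discrete_frechet(p, q):
    """Discrete Fréchet distance between two non-empty point sequences.

    p, q -- lists of coordinate tuples of equal dimension (e.g. 2D reduced paths).
    ca[i][j] holds the smallest achievable "leash length" for walking
    p[0..i] and q[0..j] monotonically from start to end.
    """
    n, m = len(p), len(q)
    ca = [[0.0] * m for _ in range(n)]
    for i in range(n):
        for j in range(m):
            d = math.dist(p[i], q[j])
            if i == 0 and j == 0:
                ca[i][j] = d
            elif i == 0:
                ca[i][j] = max(ca[0][j - 1], d)
            elif j == 0:
                ca[i][j] = max(ca[i - 1][0], d)
            else:
                # Advance along p, along q, or along both; keep the best option.
                best_prev = min(ca[i - 1][j], ca[i - 1][j - 1], ca[i][j - 1])
                ca[i][j] = max(best_prev, d)
    return ca[-1][-1]

# Hypothetical reduced paths: two parallel straight lines one unit apart.
path_a = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
path_b = [(0.0, 1.0), (1.0, 1.0), (2.0, 1.0)]
frechet_ab = discrete_frechet(path_a, path_b)
```

A matrix of such pairwise distances between reduced expulsion paths could then serve as input to a clustering or classification step, which is the kind of use the abstract describes.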
Ligation site in proteins recognized in silico
Brylinski, Michal; Konieczny, Leszek; Roterman, Irena
2006-01-01
Recognition of a ligation site in a protein molecule is important for identifying its biological activity. The model for in silico recognition of ligation sites in proteins is presented. The idealized hydrophobic core stabilizing protein structure is represented by a three-dimensional Gaussian function. The experimentally observed distribution of hydrophobicity compared with the theoretical distribution reveals differences. The area of high differences indicates the ligation site. Availability http://bioinformatics.cm-uj.krakow.pl/activesite PMID:17597871
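The comparison described in this abstract can be sketched as follows: an idealized hydrophobicity profile is taken from a three-dimensional Gaussian centred on the molecule, normalized, and subtracted from a normalized observed profile. This is a rough sketch under stated assumptions. The function names, the Gaussian widths `sigmas`, the use of the coordinate centroid as the centre, and the toy observed values are all hypothetical, and the way the authors derive the observed distribution is not given in the abstract.

```python
import math

def gaussian_profile(coords, sigmas):
    """Idealized hydrophobicity per residue from a 3D Gaussian, normalized to sum to 1.

    coords -- list of (x, y, z) residue positions in Å
    sigmas -- (sx, sy, sz) Gaussian widths in Å (assumed values, chosen by the user)
    """
    n = len(coords)
    center = [sum(c[k] for c in coords) / n for k in range(3)]
    raw = [
        math.exp(-sum((c[k] - center[k]) ** 2 / (2.0 * sigmas[k] ** 2) for k in range(3)))
        for c in coords
    ]
    total = sum(raw)
    return [r / total for r in raw]

def hydrophobicity_difference(coords, observed, sigmas):
    """Theoretical minus observed (both normalized) hydrophobicity per residue."""
    theoretical = gaussian_profile(coords, sigmas)
    total_obs = sum(observed)
    return [t - o / total_obs for t, o in zip(theoretical, observed)]

# Hypothetical toy molecule: one central residue and four peripheral ones.
toy_coords = [(0.0, 0.0, 0.0), (2.0, 0.0, 0.0), (-2.0, 0.0, 0.0),
              (0.0, 2.0, 0.0), (0.0, -2.0, 0.0)]
# Hypothetical observed hydrophobicity: the central residue is far less
# hydrophobic than the idealized core predicts.
toy_observed = [0.1, 1.0, 1.0, 1.0, 1.0]
toy_diff = hydrophobicity_difference(toy_coords, toy_observed, sigmas=(2.0, 2.0, 2.0))
largest = max(range(len(toy_diff)), key=lambda i: abs(toy_diff[i]))
```

Here the central residue shows the largest deviation from the idealized core, so under the abstract's criterion it would be flagged as a candidate ligation-site position.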
Automated multi-dimensional purification of tagged proteins.
Sigrell, Jill A; Eklund, Pär; Galin, Markus; Hedkvist, Lotta; Liljedahl, Pia; Johansson, Christine Markeland; Pless, Thomas; Torstenson, Karin
2003-01-01
The capacity for high throughput purification (HTP) is essential in fields such as structural genomics where large numbers of protein samples are routinely characterized in, for example, studies of structural determination, functionality and drug development. Proteins required for such analysis must be pure and homogenous and available in relatively large amounts. AKTA 3D system is a powerful automated protein purification system, which minimizes preparation, run-time and repetitive manual tasks. It has the capacity to purify up to 6 different His6- or GST-tagged proteins per day and can produce 1-50 mg protein per run at >90% purity. The success of automated protein purification increases with careful experimental planning. Protocol, columns and buffers need to be chosen with the final application area for the purified protein in mind.
7 Å resolution in protein two-dimensional-crystal X-ray diffraction at Linac Coherent Light Source
Pedrini, Bill; Tsai, Ching-Ju; Capitani, Guido; Padeste, Celestino; Hunter, Mark S.; Zatsepin, Nadia A.; Barty, Anton; Benner, W. Henry; Boutet, Sébastien; Feld, Geoffrey K.; Hau-Riege, Stefan P.; Kirian, Richard A.; Kupitz, Christopher; Messerschmitt, Marc; Ogren, John I.; Pardini, Tommaso; Segelke, Brent; Williams, Garth J.; Spence, John C. H.; Abela, Rafael; Coleman, Matthew; Evans, James E.; Schertler, Gebhard F. X.; Frank, Matthias; Li, Xiao-Dan
2014-01-01
Membrane proteins arranged as two-dimensional crystals in the lipid environment provide close-to-physiological structural information, which is essential for understanding the molecular mechanisms of protein function. Previously, X-ray diffraction from individual two-dimensional crystals did not represent a suitable investigational tool because of radiation damage. The recent availability of ultrashort pulses from X-ray free-electron lasers (XFELs) has now provided a means to outrun the damage. Here, we report on measurements performed at the Linac Coherent Light Source XFEL on bacteriorhodopsin two-dimensional crystals mounted on a solid support and kept at room temperature. By merging data from about a dozen single crystal diffraction images, we unambiguously identified the diffraction peaks to a resolution of 7 Å, thus improving the observable resolution with respect to that achievable from a single pattern alone. This indicates that a larger dataset will allow for reliable quantification of peak intensities, and in turn a corresponding increase in the resolution. The presented results pave the way for further XFEL studies on two-dimensional crystals, which may include pump–probe experiments at subpicosecond time resolution. PMID:24914166
Zhang, Songyan; Gao, Jiuxiang; Lu, Yiling; Cai, Shasha; Qiao, Xue; Wang, Yipeng; Yu, Haining
2013-08-01
Antifreeze proteins (AFPs) refer to a class of polypeptides that are produced by certain vertebrates, plants, fungi, and bacteria and which permit their survival in subzero environments. In this study, we report the molecular cloning, sequence analysis and three-dimensional structure of the axolotl antifreeze-like protein (AFLP) by homology modeling of the first caudate amphibian AFLP. We constructed a full-length spleen cDNA library of axolotl (Ambystoma mexicanum). An EST having highest similarity (∼42%) with freeze-responsive liver protein Li16 from Rana sylvatica was identified, and the full-length cDNA was subsequently obtained by RACE-PCR. The axolotl antifreeze-like protein sequence represents an open reading frame for a putative signal peptide and the mature protein composed of 93 amino acids. The calculated molecular mass and the theoretical isoelectric point (pI) of this mature protein were 10128.6 Da and 8.97, respectively. The molecular characterization of this gene and its deduced protein were further performed by detailed bioinformatics analysis. The three-dimensional structure of the current AFLP was predicted by homology modeling, and the conserved residues required for functionality were identified. The homology model constructed could be of use for effective drug design. This is the first report of an antifreeze-like protein identified from a caudate amphibian.
Kavianpour, Hamidreza; Vasighi, Mahdi
2017-02-01
Nowadays, knowledge about cellular attributes of proteins has an important role in pharmacy, medical science and molecular biology. These attributes are closely correlated with the function and three-dimensional structure of proteins. Knowledge of protein structural class is used by various methods for better understanding of protein functionality and folding patterns. Computational methods and intelligent systems can have an important role in performing structural classification of proteins. Most protein sequences are stored in databanks as characters and strings, and a numerical representation is essential for applying machine learning methods. In this work, a binary representation of protein sequences is introduced based on reduced amino acid alphabets according to the surrounding hydrophobicity index. Many important features which are hidden in these long binary sequences can be clearly displayed through their cellular automata images. The features extracted from these images are used to build a classification model with a support vector machine. Compared with previous studies on several benchmark datasets, the promising classification rates obtained by tenfold cross-validation imply that the current approach can help in revealing some inherent features deeply hidden in protein sequences and improve the quality of predicting protein structural class.
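The pipeline outlined above (binary encoding of a sequence, then a cellular automaton image) might look roughly like the sketch below. It is a deliberately simplified illustration: the one-bit-per-residue encoding, the `HYDROPHOBIC` grouping, the choice of an elementary (one-dimensional, two-state) automaton with periodic boundaries, and the default rule number are all assumptions made here. The abstract does not specify the reduced alphabet, the bit width, or the automaton rule the authors used.

```python
# Hypothetical reduced alphabet: residues treated as hydrophobic map to 1.
HYDROPHOBIC = set("AVLIMFWC")


def encode(seq):
    """Map a protein sequence to a list of bits (assumed 1 bit per residue)."""
    return [1 if residue in HYDROPHOBIC else 0 for residue in seq.upper()]


def ca_step(cells, rule):
    """Apply one step of an elementary cellular automaton (periodic edges)."""
    n = len(cells)
    out = []
    for i in range(n):
        left, centre, right = cells[i - 1], cells[i], cells[(i + 1) % n]
        index = (left << 2) | (centre << 1) | right
        out.append((rule >> index) & 1)  # look up the rule's bit for this pattern
    return out


def ca_image(seq, rule=90, steps=8):
    """Return the automaton's space-time image as a list of bit rows."""
    row = encode(seq)
    image = [row]
    for _ in range(steps):
        row = ca_step(row, rule)
        image.append(row)
    return image
```

Features for a classifier (for example a support vector machine, as in the abstract) would then be extracted from the rows of `ca_image(...)`; how the authors did that extraction is not stated here.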
Johnson, Derrick E.; Xue, Bin; Sickmeier, Megan D.; Meng, Jingwei; Cortese, Marc S.; Oldfield, Christopher J.; Le Gall, Tanguy; Dunker, A. Keith; Uversky, Vladimir N.
2012-01-01
The identification of intrinsically disordered proteins (IDPs) among the targets that fail to form satisfactory crystal structures in the Protein Structure Initiative represents a key to reducing the costs and time for determining three-dimensional structures of proteins. To help in this endeavor, several Protein Structure Initiative Centers were asked to send samples of both crystallizable proteins and proteins that failed to crystallize. The abundance of intrinsic disorder in these proteins was evaluated via computational analysis using Predictors of Natural Disordered Regions (PONDR®) and the potential cleavage sites and corresponding fragments were determined. Then, the target proteins were analyzed for intrinsic disorder by their resistance to limited proteolysis. The rates of tryptic digestion of sample target proteins were compared to those of lysozyme/myoglobin, apo-myoglobin and α-casein as standards of ordered, partially disordered and completely disordered proteins, respectively. At the next stage, the protein samples were subjected to both far-UV and near-UV circular dichroism (CD) analysis. For most of the samples, a good agreement between CD data, predictions of disorder and the rates of limited tryptic digestion was established. Further experimentation is being performed on a smaller subset of these samples in order to obtain more detailed information on the ordered/disordered nature of the proteins. PMID:22651963
NASA Astrophysics Data System (ADS)
Tavenor, Nathan Albert
Protein-based supramolecular polymers (SMPs) are a class of biomaterials which draw inspiration from and expand upon the many examples of complex protein quaternary structures observed in nature: collagen, microtubules, viral capsids, etc. Designing synthetic supramolecular protein scaffolds both increases our understanding of natural superstructures and allows for the creation of novel materials. Similar to small-molecule SMPs, protein-based SMPs form due to self-assembly driven by intermolecular interactions between monomers, and monomer structure determines the properties of the overall material. Using protein-based monomers takes advantage of the self-assembly and highly specific molecular recognition properties encodable in polypeptide sequences to rationally design SMP architectures. The central hypothesis underlying our work is that alpha-helical coiled coils, a well-studied protein quaternary folding motif, are well-suited to SMP design through the addition of synthetic linkers at solvent-exposed sites. Through small changes in the structures of the cross-links and/or peptide sequence, we have been able to control both the nanoscale organization and the macroscopic properties of the SMPs. Changes to the linker and hydrophobic core of the peptide can be used to control polymer rigidity, stability, and dimensionality. The gaps in knowledge that this thesis sought to fill on this project were 1) the relationship between the molecular structure of the cross-linked polypeptides and the macroscopic properties of the SMPs and 2) a means of creating materials exhibiting multi-dimensional net or framework topologies. Separate from the above efforts on supramolecular architectures was work on improving backbone modification strategies for an alpha-helix in the context of a complex protein tertiary fold. 
Earlier work in our lab had successfully incorporated unnatural building blocks into every major secondary structure (beta-sheet, alpha-helix, loops and beta-turns) of a small protein with a tertiary fold. Although the tertiary fold of the native sequence was mimicked by the resulting artificial protein, the thermodynamic stability was greatly compromised. Most of this energetic penalty derived from the modifications present in the alpha-helix. The contribution within this thesis was direct comparison of several alpha-helical design strategies and establishment of the thermodynamic consequences of each.
Predicting nucleic acid binding interfaces from structural models of proteins
Dror, Iris; Shazman, Shula; Mukherjee, Srayanta; Zhang, Yang; Glaser, Fabian; Mandel-Gutfreund, Yael
2011-01-01
The function of DNA- and RNA-binding proteins can be inferred from the characterization and accurate prediction of their binding interfaces. However, the main pitfall of various structure-based methods for predicting nucleic acid binding function is that they are all limited to a relatively small number of proteins for which high-resolution three-dimensional structures are available. In this study, we developed a pipeline for extracting functional electrostatic patches from surfaces of protein structural models, obtained using the I-TASSER protein structure predictor. The largest positive patches are extracted from the protein surface using the patchfinder algorithm. We show that functional electrostatic patches extracted from an ensemble of structural models highly overlap the patches extracted from high-resolution structures. Furthermore, by testing our pipeline on a set of 55 known nucleic acid binding proteins for which I-TASSER produces high-quality models, we show that the method accurately identifies the nucleic acid binding interface on structural models of proteins. Employing a combined patch approach, we show that patches extracted from an ensemble of models better predict the real nucleic acid binding interfaces compared to patches extracted from independent models. Overall, these results suggest that combining information from a collection of low-resolution structural models could be a valuable approach for functional annotation. We suggest that our method will be further applicable for predicting other functional surfaces of proteins with unknown structure. PMID:22086767
Recent advances in racemic protein crystallography.
Yan, Bingjia; Ye, Linzhi; Xu, Weiliang; Liu, Lei
2017-09-15
Solution of the three-dimensional structures of proteins is a critical step in deciphering the molecular mechanisms of their bioactivities. Among the many approaches for obtaining protein crystals, racemic protein crystallography has been developed as a unique method to solve the structures of an increasing number of proteins. Exploiting unnatural protein enantiomers in crystallization and resolution, racemic protein crystallography manifests two major advantages that are 1) to increase the success rate of protein crystallization, and 2) to obviate the phase problem in X-ray diffraction. The requirement of unnatural protein enantiomers in racemic protein crystallography necessitates chemical protein synthesis, which is hitherto accomplished through solid phase peptide synthesis and chemical ligation reactions. This review highlights the fundamental ideas of racemic protein crystallography and surveys the harvests in the field of racemic protein crystallography over the last five years from early 2012 to late 2016. Copyright © 2017. Published by Elsevier Ltd.
Kobayashi, Ayaho; Kanaba, Teppei; Satoh, Ryosuke; Ito, Yutaka; Sugiura, Reiko; Mishima, Masaki
2017-10-01
Negative regulator differentiation 1 (Nrd1), a fission yeast RNA binding protein, modulates cytokinesis and sexual development and contributes to stress granule formation in response to environmental stresses. Nrd1 comprises four RRM domains and binds and stabilizes Cdc4 mRNA that encodes the myosin II light chain. Nrd1 binds the Cpc2 fission-yeast RACK1 homolog, and the interaction promotes Nrd1 localization to stress granules. Interestingly, Pmk1 mitogen-activated protein kinase phosphorylates Thr40 in the unstructured N-terminal region and Thr126 in the first RRM domain of Nrd1. Phosphorylation significantly reduces RNA-binding activity and likely modulates Nrd1 function. To reveal the relationship between the structure and function of Nrd1 and how phosphorylation affects structure, we used heteronuclear NMR techniques to investigate the three-dimensional structure of Nrd1. Here we report the ¹H, ¹³C, and ¹⁵N resonance assignments of RRM1-RRM2 (residues 108-284) comprising the first and second RRMs obtained using heteronuclear NMR techniques. Secondary structures derived from the chemical shifts are reported. These data should contribute to the understanding of the three-dimensional structure of the RRM1-RRM2 region of Nrd1 and the perturbation caused by phosphorylation.
Zebrafish Cardiac Muscle Thick Filaments: Isolation Technique and Three-Dimensional Structure
González-Solá, Maryví; AL-Khayat, Hind A.; Behra, Martine; Kensler, Robert W.
2014-01-01
To understand how mutations in thick filament proteins such as cardiac myosin binding protein-C or titin cause familial hypertrophic cardiomyopathies, it is important to determine the structure of the cardiac thick filament. Techniques for the genetic manipulation of the zebrafish are well established and it has become a major model for the study of the cardiovascular system. Our goal is to develop zebrafish as an alternative system to the mammalian heart model for the study of the structure of the cardiac thick filaments and the proteins that form it. We have successfully isolated thick filaments from zebrafish cardiac muscle, using a procedure similar to those for mammalian heart, and analyzed their structure by negative-staining and electron microscopy. The isolated filaments appear well ordered with the characteristic 42.9 nm quasi-helical repeat of the myosin heads expected from x-ray diffraction. We have performed single particle image analysis on the collected electron microscopy images for the C-zone region of these filaments and obtained a three-dimensional reconstruction at 3.5 nm resolution. This reconstruction reveals structure similar to the mammalian thick filament, and demonstrates that zebrafish may provide a useful model for the study of the changes in the cardiac thick filament associated with disease processes. PMID:24739166
Structural Insights into the Degradation of Mcl-1 Induced by BH3 Domains
DOE Office of Scientific and Technical Information (OSTI.GOV)
Czabotar, P.; Lee, E.; van Delft, M.
2007-01-01
Apoptosis is held in check by prosurvival proteins of the Bcl-2 family. The distantly related BH3-only proteins bind to and antagonize them, thereby promoting apoptosis. Whereas binding of the BH3-only protein Noxa to prosurvival Mcl-1 induces Mcl-1 degradation by the proteasome, binding of another BH3-only ligand, Bim, elevates Mcl-1 protein levels. We compared the three-dimensional structures of the complexes formed between BH3 peptides of both Bim and Noxa, and we show that a discrete C-terminal sequence of the Noxa BH3 is necessary to instigate Mcl-1 degradation.
Which strategy for a protein crystallization project?
NASA Technical Reports Server (NTRS)
Kundrot, C. E.
2004-01-01
The three-dimensional, atomic-resolution protein structures produced by X-ray crystallography over the past 50+ years have led to tremendous chemical understanding of fundamental biochemical processes. The pace of discovery in protein crystallography has increased greatly with advances in molecular biology, crystallization techniques, cryocrystallography, area detectors, synchrotrons and computing. While the methods used to produce single, well-ordered crystals have also evolved over the years in response to increased understanding and advancing technology, crystallization strategies continue to be rooted in trial-and-error approaches. This review summarizes the current approaches in protein crystallization and surveys the first results to emerge from the structural genomics efforts.
Which Strategy for a Protein Crystallization Project?
NASA Technical Reports Server (NTRS)
Kundrot, Craig E.
2003-01-01
The three-dimensional, atomic-resolution protein structures produced by X-ray crystallography over the past 50+ years have led to tremendous chemical understanding of fundamental biochemical processes. The pace of discovery in protein crystallography has increased greatly with advances in molecular biology, crystallization techniques, cryo-crystallography, area detectors, synchrotrons and computing. While the methods used to produce single, well-ordered crystals have also evolved over the years in response to increased understanding and advancing technology, crystallization strategies continue to be rooted in trial-and-error approaches. This review summarizes the current approaches in protein crystallization and surveys the first results to emerge from the structural genomics efforts.
Two-dimensional protein crystals (S-layers): fundamentals and applications.
Sleytr, U B; Sára, M; Messner, P; Pum, D
1994-10-01
Two-dimensional crystalline surface layers (S-layers) composed of protein or glycoprotein subunits are one of the most commonly observed prokaryotic cell envelope structures. Isolated S-layer subunits are endowed with the ability to assemble into monomolecular arrays in suspension, on surfaces or interfaces by an entropy-driven process. S-layer lattices are isoporous structures with functional groups located on the surface in an identical position and orientation. These characteristic features have already led to applications of S-layers as (1) ultrafiltration membranes with well-defined molecular weight cut-offs and excellent antifouling characteristics, (2) immobilization matrices for functional molecules as required for affinity and enzyme membranes, affinity microcarriers and biosensors, (3) conjugate vaccines, (4) carriers for Langmuir-Blodgett films and reconstituted biological membranes, and (5) patterning elements in molecular nanotechnology.
Understand protein functions by comparing the similarity of local structural environments.
Chen, Jiawen; Xie, Zhong-Ru; Wu, Yinghao
2017-02-01
The three-dimensional structures of proteins play an essential role in regulating binding between proteins and their partners, offering a direct relationship between structures and functions of proteins. It is widely accepted that the function of a protein can be determined if its structure is similar to other proteins whose functions are known. However, it is also observed that proteins with similar global structures do not necessarily correspond to the same function, while proteins with very different folds can share similar functions. This indicates that function similarity originates from the local structural information of proteins instead of their global shapes. We assume that proteins with similar local environments prefer binding to similar types of molecular targets. In order to test this assumption, we designed a new structural indicator to define the similarity of local environment between residues in different proteins. This indicator was further used to calculate the probability that a given residue binds to a specific type of structural neighbors, including DNA, RNA, small molecules and proteins. After applying the method to a large-scale non-redundant database of proteins, we show that the positive signal of binding probability calculated from the local structural indicator is statistically meaningful. In summary, our studies suggested that the local environment of residues in a protein is a good indicator to recognize specific binding partners of the protein. The new method could be a potential addition to a suite of existing template-based approaches for protein function prediction. Copyright © 2016 Elsevier B.V. All rights reserved.
High Resolution Crystal Structure of the Catalytic Domain of ADAMTS-5 (Aggrecanase-2)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shieh, Huey-Sheng; Mathis, Karl J.; Williams, Jennifer M.
Aggrecanase-2 (a disintegrin and metalloproteinase with thrombospondin motifs-5 (ADAMTS-5)), a member of the ADAMTS protein family, is critically involved in arthritic diseases because of its direct role in cleaving the cartilage component aggrecan. The catalytic domain of aggrecanase-2 has been refolded, purified, and crystallized, and its three-dimensional structure determined to 1.4 Å resolution in the presence of an inhibitor. A high resolution structure of an ADAMTS/aggrecanase protein provides an opportunity for the development of therapeutics to treat osteoarthritis.
Potrzebowski, Wojciech; André, Ingemar
2015-07-01
For highly oriented fibrillar molecules, three-dimensional structures can often be determined from X-ray fiber diffraction data. However, because of limited information content, structure determination and validation can be challenging. We demonstrate that automated structure determination of protein fibers can be achieved by guiding the building of macromolecular models with fiber diffraction data. We illustrate the power of our approach by determining the structures of six bacteriophage viruses de novo using fiber diffraction data alone and together with solid-state NMR data. Furthermore, we demonstrate the feasibility of molecular replacement from monomeric and fibrillar templates by solving the structure of a plant virus using homology modeling and protein-protein docking. The generated models explain the experimental data to the same degree as deposited reference structures but with improved structural quality. We also developed a cross-validation method for model selection. The results highlight the power of fiber diffraction data as structural constraints.
Functional Evolution of PLP-dependent Enzymes based on Active-Site Structural Similarities
Catazaro, Jonathan; Caprez, Adam; Guru, Ashu; Swanson, David; Powers, Robert
2014-01-01
Families of distantly related proteins typically have very low sequence identity, which hinders evolutionary analysis and functional annotation. Slowly evolving features of proteins, such as an active site, are therefore valuable for annotating putative and distantly related proteins. To date, a complete evolutionary analysis of the functional relationship of an entire enzyme family based on active-site structural similarities has not yet been undertaken. Pyridoxal-5’-phosphate (PLP) dependent enzymes are primordial enzymes that diversified in the last universal ancestor. Using the Comparison of Protein Active Site Structures (CPASS) software and database, we show that the active site structures of PLP-dependent enzymes can be used to infer evolutionary relationships based on functional similarity. The enzymes successfully clustered together based on substrate specificity, function, and three-dimensional fold. This study demonstrates the value of using active site structures for functional evolutionary analysis and the effectiveness of CPASS. PMID:24920327
Functional evolution of PLP-dependent enzymes based on active-site structural similarities.
Catazaro, Jonathan; Caprez, Adam; Guru, Ashu; Swanson, David; Powers, Robert
2014-10-01
Families of distantly related proteins typically have very low sequence identity, which hinders evolutionary analysis and functional annotation. Slowly evolving features of proteins, such as an active site, are therefore valuable for annotating putative and distantly related proteins. To date, a complete evolutionary analysis of the functional relationship of an entire enzyme family based on active-site structural similarities has not yet been undertaken. Pyridoxal-5'-phosphate (PLP) dependent enzymes are primordial enzymes that diversified in the last universal ancestor. Using the comparison of protein active site structures (CPASS) software and database, we show that the active site structures of PLP-dependent enzymes can be used to infer evolutionary relationships based on functional similarity. The enzymes successfully clustered together based on substrate specificity, function, and three-dimensional-fold. This study demonstrates the value of using active site structures for functional evolutionary analysis and the effectiveness of CPASS. © 2014 Wiley Periodicals, Inc.
ERIC Educational Resources Information Center
Douglas, Kristin R.
2008-01-01
Prerequisites for the Developmental Biology course at Augustana College are introductory courses in zoology and cell biology. After introductory courses students appreciate the fact that proteins have three-dimensional structures; however, they often fail to recognize how protein interactions with other cellular components can lead to specific…
NASA Astrophysics Data System (ADS)
Jiang, Zhou-Ting; Zhang, Lin-Xi; Sun, Ting-Ting; Wu, Tai-Quan
2009-10-01
The tendency to form long-range contacts strongly affects the three-dimensional structure of globular proteins. Because the 20 types of amino acids and the 4 categories of globular proteins differ in their ability to form long-range contacts, the statistical properties are thoroughly discussed in this paper. Two parameters, NC and ND, are defined to confine the valid residues in detail. The relationship between hydrophobicity scales and the valid residue percentage of each amino acid is given in the present work, and linear functions are shown in our statistical results. It is concluded that the hydrophobicity scale defined by chemical derivatives of the amino acids and the nonpolar phase of large unilamellar vesicle membranes is the most effective for characterising the hydrophobic behavior of amino acid residues. Meanwhile, the residue percentage Pi and sequential residue length Li of a given protein i are calculated under different conditions. The statistical results show that the average values of Pi and Li are lowest for all-α proteins among these 4 classes of globular proteins, indicating that all-α proteins are hardly capable of forming long-range contacts one by one along their linear amino acid sequences. All-β proteins have a higher tendency to form long-range contacts along their primary sequences, related to their secondary configurations, i.e. parallel and anti-parallel configurations of β sheets. The investigation of the interior properties of globular proteins gives us the connection between the three-dimensional structure and its primary sequence data or secondary configurations, and helps us to understand protein structure and the folding process.
Glusman, Gustavo; Rose, Peter W; Prlić, Andreas; Dougherty, Jennifer; Duarte, José M; Hoffman, Andrew S; Barton, Geoffrey J; Bendixen, Emøke; Bergquist, Timothy; Bock, Christian; Brunk, Elizabeth; Buljan, Marija; Burley, Stephen K; Cai, Binghuang; Carter, Hannah; Gao, JianJiong; Godzik, Adam; Heuer, Michael; Hicks, Michael; Hrabe, Thomas; Karchin, Rachel; Leman, Julia Koehler; Lane, Lydie; Masica, David L; Mooney, Sean D; Moult, John; Omenn, Gilbert S; Pearl, Frances; Pejaver, Vikas; Reynolds, Sheila M; Rokem, Ariel; Schwede, Torsten; Song, Sicheng; Tilgner, Hagen; Valasatava, Yana; Zhang, Yang; Deutsch, Eric W
2017-12-18
The translation of personal genomics to precision medicine depends on the accurate interpretation of the multitude of genetic variants observed for each individual. However, even when genetic variants are predicted to modify a protein, their functional implications may be unclear. Many diseases are caused by genetic variants affecting important protein features, such as enzyme active sites or interaction interfaces. The scientific community has catalogued millions of genetic variants in genomic databases and thousands of protein structures in the Protein Data Bank. Mapping mutations onto three-dimensional (3D) structures enables atomic-level analyses of protein positions that may be important for the stability or formation of interactions; these may explain the effect of mutations and in some cases even open a path for targeted drug development. To accelerate progress in the integration of these data types, we held a two-day Gene Variation to 3D (GVto3D) workshop to report on the latest advances and to discuss unmet needs. The overarching goal of the workshop was to address the question: what can be done together as a community to advance the integration of genetic variants and 3D protein structures that could not be done by a single investigator or laboratory? Here we describe the workshop outcomes, review the state of the field, and propose the development of a framework with which to promote progress in this arena. The framework will include a set of standard formats, common ontologies, a common application programming interface to enable interoperation of the resources, and a Tool Registry to make it easy to find and apply the tools to specific analysis problems. Interoperability will enable integration of diverse data sources and tools and collaborative development of variant effect prediction methods.
Computational analysis of sequence selection mechanisms.
Meyerguz, Leonid; Grasso, Catherine; Kleinberg, Jon; Elber, Ron
2004-04-01
Mechanisms leading to gene variations are responsible for the diversity of species and are important components of the theory of evolution. One constraint on gene evolution is that of protein foldability; the three-dimensional shapes of proteins must be thermodynamically stable. We explore the impact of this constraint and calculate properties of foldable sequences using 3660 structures from the Protein Data Bank. We seek a selection function that receives sequences as input, and outputs survival probability based on sequence fitness to structure. We compute the number of sequences that match a particular protein structure with energy lower than the native sequence, the density of the number of sequences, the entropy, and the "selection" temperature. The mechanism of structure selection for sequences longer than 200 amino acids is approximately universal. For shorter sequences, it is not. We speculate on concrete evolutionary mechanisms that show this behavior.
Interaction of sucralose with whey protein: Experimental and molecular modeling studies
NASA Astrophysics Data System (ADS)
Zhang, Hongmei; Sun, Shixin; Wang, Yanqing; Cao, Jian
2017-12-01
The objective of this research was to study the interactions of sucralose with whey protein isolate (WPI) using three-dimensional fluorescence spectroscopy, circular dichroism spectroscopy and molecular modeling. The results showed that the peptide strand structure of WPI had been changed by sucralose. Sucralose binding induced secondary structural changes and increased the content of aperiodic structure in WPI. Sucralose decreased the thermal stability of WPI and acted as a structure destabilizer during the thermal unfolding process of the protein. In addition, the presence of sucralose decreased the reversibility of the unfolding of WPI. Nonetheless, the sucralose-WPI complex was less stable than the protein alone. The molecular modeling result showed that van der Waals and hydrogen bonding interactions contribute to the complexation free binding energy. There is more than one possible binding site on WPI for sucralose in the surface binding mode.
ProteinShader: illustrative rendering of macromolecules
Weber, Joseph R
2009-01-01
Background Cartoon-style illustrative renderings of proteins can help clarify structural features that are obscured by space filling or balls and sticks style models, and recent advances in programmable graphics cards offer many new opportunities for improving illustrative renderings. Results The ProteinShader program, a new tool for macromolecular visualization, uses information from Protein Data Bank files to produce illustrative renderings of proteins that approximate what an artist might create by hand using pen and ink. A combination of Hermite and spherical linear interpolation is used to draw smooth, gradually rotating three-dimensional tubes and ribbons with a repeating pattern of texture coordinates, which allows the application of texture mapping, real-time halftoning, and smooth edge lines. This free platform-independent open-source program is written primarily in Java, but also makes extensive use of the OpenGL Shading Language to modify the graphics pipeline. Conclusion By programming to the graphics processor unit, ProteinShader is able to produce high quality images and illustrative rendering effects in real-time. The main feature that distinguishes ProteinShader from other free molecular visualization tools is its use of texture mapping techniques that allow two-dimensional images to be mapped onto the curved three-dimensional surfaces of ribbons and tubes with minimum distortion of the images. PMID:19331660
Chromosome structure inside the nucleus.
Swedlow, J R; Agard, D A; Sedat, J W
1993-06-01
Recent in situ three-dimensional structural studies have provided a new model for the 30 nm chromatin fiber. In addition, research during the past year has revealed some of the molecular complexity of non-histone chromosomal proteins. Still to come is the unification of molecular insights with chromosomal architecture.
Desideri, A; Falconi, M; Polticelli, F; Bolognesi, M; Djinovic, K; Rotilio, G
1992-01-05
Equipotential lines were calculated, using the Poisson-Boltzmann equation, for six Cu,Zn superoxide dismutases with different protein electric charge and various degrees of sequence homology, namely those from ox, pig, sheep, yeast, and the isoenzymes A and B from the amphibian Xenopus laevis. The three-dimensional structures of the porcine and ovine superoxide dismutases were obtained by molecular modelling reconstruction using the structure of the highly homologous bovine enzyme as a template. The three-dimensional structure of the evolutionarily distant yeast Cu,Zn superoxide dismutase was recently resolved by us, while computer-modelled structures are available for X. laevis isoenzymes. The six proteins display large differences in the net protein charge and distribution of electrically charged surface residues but the trend of the equipotential lines in the proximity of the active sites was found to be constant in all cases. These results are in line with the very similar catalytic rate constants experimentally measured for the corresponding enzyme activities. This analysis shows that electrostatic guidance for the enzyme-substrate interaction in Cu,Zn superoxide dismutases is related to a spatial distribution of charges, arranged so as to maintain, in the area surrounding the active sites, an identical electrostatic potential distribution, which is conserved in the evolution of this protein family.
Campagnola, Paul J; Millard, Andrew C; Terasaki, Mark; Hoppe, Pamela E; Malone, Christian J; Mohler, William A
2002-01-01
We find that several key endogenous protein structures give rise to intense second-harmonic generation (SHG), the nonabsorptive frequency doubling of an excitation laser line. Second-harmonic imaging microscopy (SHIM) on a laser-scanning system proves, therefore, to be a powerful and unique tool for high-resolution, high-contrast, three-dimensional studies of live cell and tissue architecture. Unlike fluorescence, SHG suffers no inherent photobleaching or toxicity and does not require exogenous labels. Unlike polarization microscopy, SHIM provides intrinsic confocality and deep sectioning in complex tissues. In this study, we demonstrate the clarity of SHIM optical sectioning within unfixed, unstained thick specimens. SHIM and two-photon excited fluorescence (TPEF) were combined in a dual-mode nonlinear microscopy to elucidate the molecular sources of SHG in live cells and tissues. SHG arose not only from coiled-coil complexes within connective tissues and muscle thick filaments, but also from microtubule arrays within interphase and mitotic cells. Both polarization dependence and a local symmetry cancellation effect of SHG allowed the signal from species generating the second harmonic to be decoded, by ratiometric correlation with TPEF, to yield information on local structure below optical resolution. The physical origin of SHG within these tissues is addressed and is attributed to the laser interaction with dipolar protein structures that is enhanced by the intrinsic chirality of the protein helices. PMID:11751336
Three-dimensional structure of the human immunodeficiency virus type 1 matrix protein.
Massiah, M A; Starich, M R; Paschall, C; Summers, M F; Christensen, A M; Sundquist, W I
1994-11-25
The HIV-1 matrix protein forms an icosahedral shell associated with the inner membrane of the mature virus. Genetic analyses have indicated that the protein performs important functions throughout the viral life-cycle, including anchoring the transmembrane envelope protein on the surface of the virus, assisting in viral penetration, transporting the proviral integration complex across the nuclear envelope, and localizing the assembling virion to the cell membrane. We now report the three-dimensional structure of recombinant HIV-1 matrix protein, determined at high resolution by nuclear magnetic resonance (NMR) methods. The HIV-1 matrix protein is the first retroviral matrix protein to be characterized structurally and only the fourth HIV-1 protein of known structure. NMR signal assignments required recently developed triple-resonance (1H, 13C, 15N) NMR methodologies because signals for 91% of 132 assigned H alpha protons and 74% of the 129 assignable backbone amide protons resonate within chemical shift ranges of 0.8 p.p.m. and 1 p.p.m., respectively. A total of 636 nuclear Overhauser effect-derived distance restraints were employed for distance geometry-based structure calculations, affording an average of 13.0 NMR-derived distance restraints per residue for the experimentally constrained amino acids. An ensemble of 25 refined distance geometry structures with penalties (sum of the squares of the distance violations) of 0.32 Å² or less and individual distance violations under 0.06 Å was generated; best-fit superposition of ordered backbone heavy atoms relative to mean atom positions afforded root-mean-square deviations of 0.50 (± 0.08) Å. The folded HIV-1 matrix protein structure is composed of five alpha-helices, a short 3(10) helical stretch, and a three-strand mixed beta-sheet. Helices I to III and the 3(10) helix pack about a central helix (IV) to form a compact globular domain that is capped by the beta-sheet.
The C-terminal helix (helix V) projects away from the beta-sheet to expose carboxyl-terminal residues essential for early steps in the HIV-1 infectious cycle. Basic residues implicated in membrane binding and nuclear localization functions cluster about an extruded cationic loop that connects beta-strands 1 and 2. The structure suggests that both membrane binding and nuclear localization may be mediated by complex tertiary structures rather than simple linear determinants.
Protein-Protein Docking in Drug Design and Discovery.
Kaczor, Agnieszka A; Bartuzi, Damian; Stępniewski, Tomasz Maciej; Matosiuk, Dariusz; Selent, Jana
2018-01-01
Protein-protein interactions (PPIs) are responsible for a number of key physiological processes in the living cells and underlie the pathomechanism of many diseases. Nowadays, along with the concept of so-called "hot spots" in protein-protein interactions, which are well-defined interface regions responsible for most of the binding energy, these interfaces can be targeted with modulators. In order to apply structure-based design techniques to design PPIs modulators, a three-dimensional structure of protein complex has to be available. In this context in silico approaches, in particular protein-protein docking, are a valuable complement to experimental methods for elucidating 3D structure of protein complexes. Protein-protein docking is easy to use and does not require significant computer resources and time (in contrast to molecular dynamics) and it results in 3D structure of a protein complex (in contrast to sequence-based methods of predicting binding interfaces). However, protein-protein docking cannot address all the aspects of protein dynamics, in particular the global conformational changes during protein complex formation. In spite of this fact, protein-protein docking is widely used to model complexes of water-soluble proteins and less commonly to predict structures of transmembrane protein assemblies, including dimers and oligomers of G protein-coupled receptors (GPCRs). In this chapter we review the principles of protein-protein docking, available algorithms and software and discuss the recent examples, benefits, and drawbacks of protein-protein docking application to water-soluble proteins, membrane anchoring and transmembrane proteins, including GPCRs.
A Generative Angular Model of Protein Structure Evolution
Golden, Michael; García-Portugués, Eduardo; Sørensen, Michael; Mardia, Kanti V.; Hamelryck, Thomas; Hein, Jotun
2017-01-01
Recently described stochastic models of protein evolution have demonstrated that the inclusion of structural information in addition to amino acid sequences leads to a more reliable estimation of evolutionary parameters. We present a generative, evolutionary model of protein structure and sequence that is valid on a local length scale. The model concerns the local dependencies between sequence and structure evolution in a pair of homologous proteins. The evolutionary trajectory between the two structures in the protein pair is treated as a random walk in dihedral angle space, which is modeled using a novel angular diffusion process on the two-dimensional torus. Coupling sequence and structure evolution in our model allows for modeling both “smooth” conformational changes and “catastrophic” conformational jumps, conditioned on the amino acid changes. The model has interpretable parameters and is comparatively more realistic than previous stochastic models, providing new insights into the relationship between sequence and structure evolution. For example, using the trained model we were able to identify an apparent sequence–structure evolutionary motif present in a large number of homologous protein pairs. The generative nature of our model enables us to evaluate its validity and its ability to simulate aspects of protein evolution conditioned on an amino acid sequence, a related amino acid sequence, a related structure or any combination thereof. PMID:28453724
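The idea of a random walk in dihedral angle space on the two-dimensional torus can be illustrated with a minimal sketch. It assumes independent Gaussian steps wrapped back onto the torus, which is a deliberately simple stand-in and not the angular diffusion process of the paper; the names `wrap`, `torus_walk` and the parameter `sigma` are hypothetical.

```python
import math
import random


def wrap(angle):
    """Map an angle in radians back onto the circle, into [-pi, pi]."""
    return (angle + math.pi) % (2.0 * math.pi) - math.pi


def torus_walk(phi, psi, steps, sigma, rng):
    """Simulate a (phi, psi) trajectory on the torus.

    Each step adds independent Gaussian noise (standard deviation `sigma`,
    an illustrative parameter) to both angles and wraps the result, so the
    walk never leaves the periodic dihedral space.
    """
    path = [(phi, psi)]
    for _ in range(steps):
        phi = wrap(phi + rng.gauss(0.0, sigma))
        psi = wrap(psi + rng.gauss(0.0, sigma))
        path.append((phi, psi))
    return path


# Start near an alpha-helical conformation (about -60, -45 degrees).
trajectory = torus_walk(math.radians(-60), math.radians(-45),
                        steps=100, sigma=0.2, rng=random.Random(0))
```

Wrapping after every step is what distinguishes this from a walk in the plane: a small step across the ±180° boundary stays a small conformational change instead of appearing as a jump of nearly 360°.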
Web3DMol: interactive protein structure visualization based on WebGL.
Shi, Maoxiang; Gao, Juntao; Zhang, Michael Q
2017-07-03
A growing number of web-based databases and tools for protein research are being developed. There is now a widespread need for visualization tools to present the three-dimensional (3D) structure of proteins in web browsers. Here, we introduce our 3D modeling program, Web3DMol, a web application focusing on protein structure visualization in modern web browsers. Users submit a PDB identification code or select a PDB archive from their local disk, and Web3DMol will display and allow interactive manipulation of the 3D structure. Featured functions, such as sequence plot, fragment segmentation, measure tool and meta-information display, are offered for users to gain a better understanding of protein structure. Easy-to-use APIs are available for developers to reuse and extend Web3DMol. Web3DMol can be freely accessed at http://web3dmol.duapp.com/, and the source code is distributed under the MIT license. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Ukleja, Marta; Valpuesta, José María; Dziembowski, Andrzej; Cuellar, Jorge
2016-10-01
Large protein assemblies are usually the effectors of major cellular processes. The intricate cell homeostasis network is divided into numerous interconnected pathways, each controlled by a set of protein machines. One of these master regulators is the CCR4-NOT complex, which ultimately controls protein expression levels. This multisubunit complex assembles around a scaffold platform, which enables a wide variety of well-studied functions from mRNA synthesis to transcript decay, as well as other tasks still being identified. Solving the structure of the entire CCR4-NOT complex will help to define the distribution of its functions. The recently published three-dimensional reconstruction of the complex, in combination with the known crystal structures of some of the components, has begun to address this. Methodological improvements in structural biology, especially in cryoelectron microscopy, encourage further structural and protein-protein interaction studies, which will advance our comprehension of the gene expression machinery. © 2016 WILEY Periodicals, Inc.
An Amino Acid Code to Define a Protein’s Tertiary Packing Surface
Fraga, Keith J.; Joo, Hyun; Tsai, Jerry
2015-01-01
One difficult aspect of the protein-folding problem is characterizing the non-specific interactions that define packing in protein tertiary structure. To better understand tertiary structure, this work extends the knob-socket model by classifying the interactions of a single knob residue packed into a set of contiguous sockets, or a pocket made up of 4 or more residues. The knob-socket construct allows for a symbolic two-dimensional mapping of pockets. The two-dimensional mapping of pockets provides a simple method to investigate the variety of pocket shapes in order to understand the geometry of protein tertiary surfaces. The diversity of pocket geometries can be organized into groups of pockets that share a common core, which suggests that some interactions in pockets are ancillary to packing. Further analysis of pocket geometries displays a preferred configuration that is right-handed in α-helices and left-handed in β-sheets. The amino acid composition of pockets illustrates the importance of non-polar amino acids in packing as well as position specificity. As expected, all pocket shapes prefer to pack with hydrophobic knobs; however, knobs are not selective for the pockets they pack. Investigating side-chain rotamer preferences for certain pocket shapes uncovers no strong correlations. These findings allow a simple vocabulary based on knobs and sockets to describe protein tertiary packing that supports improved analysis, design and prediction of protein structure. PMID:26575337
BiGGER: a new (soft) docking algorithm for predicting protein interactions.
Palma, P N; Krippahl, L; Wampler, J E; Moura, J J
2000-06-01
A new computationally efficient and automated "soft docking" algorithm is described to assist the prediction of the mode of binding between two proteins, using the three-dimensional structures of the unbound molecules. The method is implemented in a software package called BiGGER (Bimolecular Complex Generation with Global Evaluation and Ranking) and works in two sequential steps: first, the complete 6-dimensional binding space of both molecules is systematically searched. A population of candidate protein-protein docked geometries is thus generated and selected on the basis of the geometric complementarity and amino acid pairwise affinities between the two molecular surfaces. Most of the conformational changes observed during protein association are treated in an implicit way and test results are equally satisfactory, regardless of starting from the bound or the unbound forms of known structures of the interacting proteins. In contrast to other methods, the entire molecular surfaces are searched during the simulation, using absolutely no additional information regarding the binding sites. In a second step, an interaction scoring function is used to rank the putative docked structures. The function incorporates interaction terms that are thought to be relevant to the stabilization of protein complexes. These include: geometric complementarity of the surfaces, explicit electrostatic interactions, desolvation energy, and pairwise propensities of the amino acid side chains to contact across the molecular interface. The relative functional contribution of each of these interaction terms to the global scoring function has been empirically adjusted through a neural network optimizer using a learning set of 25 protein-protein complexes of known crystallographic structures.
In 22 out of 25 protein-protein complexes tested, near-native docked geometries were found with C(alpha) RMS deviations ≤ 4.0 Å from the experimental structures, of which 14 were found within the 20 top ranking solutions. The program works on widely available personal computers and takes 2 to 8 hours of CPU time to run any of the docking tests herein presented. Finally, the value and limitations of the method for the study of macromolecular interactions, not yet revealed by experimental techniques, are discussed.
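The two quantities this abstract relies on, a global score that combines several interaction terms and a C(alpha) RMS deviation against the experimental structure, can be sketched as follows. This is only an illustration under stated assumptions: the score is taken to be a plain weighted sum (the abstract says only that the relative contributions were tuned with a neural network optimizer), the coordinates are assumed to be already superposed, and the function names, term names and weights are hypothetical, not part of BiGGER.

```python
import math


def ca_rmsd(model, reference):
    """RMS deviation between two equal-length lists of (x, y, z) C-alpha
    coordinates, assumed to be already in the same frame of reference."""
    if len(model) != len(reference) or not model:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    total = 0.0
    for (x1, y1, z1), (x2, y2, z2) in zip(model, reference):
        total += (x1 - x2) ** 2 + (y1 - y2) ** 2 + (z1 - z2) ** 2
    return math.sqrt(total / len(model))


def global_score(terms, weights):
    """Weighted sum of interaction terms (illustrative assumption)."""
    return sum(weights[name] * value for name, value in terms.items())


def rank_candidates(candidates, weights):
    """Order candidate docked geometries from highest to lowest global score."""
    return sorted(candidates,
                  key=lambda c: global_score(c["terms"], weights),
                  reverse=True)


# Hypothetical weights and two hypothetical candidate geometries.
weights = {"geometric": 1.0, "electrostatic": 0.5,
           "desolvation": 0.5, "side_chain_contacts": 0.25}
candidates = [
    {"name": "pose_1", "terms": {"geometric": 2.0, "electrostatic": 1.0,
                                 "desolvation": 0.0, "side_chain_contacts": 4.0}},
    {"name": "pose_2", "terms": {"geometric": 3.0, "electrostatic": 2.0,
                                 "desolvation": 1.0, "side_chain_contacts": 0.0}},
]
ranked = rank_candidates(candidates, weights)
```

A candidate would count as near-native in the sense used above when its `ca_rmsd` against the experimental complex is at most 4.0 Å.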
An ambiguity principle for assigning protein structural domains
Postic, Guillaume; Ghouzam, Yassine; Chebrek, Romain; Gelly, Jean-Christophe
2017-01-01
Ambiguity is the quality of being open to several interpretations. For an image, it arises when the contained elements can be delimited in two or more distinct ways, which may cause confusion. We postulate that it also applies to the analysis of protein three-dimensional structure, which consists in dividing the molecule into subunits called domains. Because different definitions of what constitutes a domain can be used to partition a given structure, the same protein may have different but equally valid domain annotations. However, knowledge and experience generally displace our ability to accept more than one way to decompose the structure of an object—in this case, a protein. This human bias in structure analysis is particularly harmful because it leads to ignoring potential avenues of research. We present an automated method capable of producing multiple alternative decompositions of protein structure (web server and source code available at www.dsimb.inserm.fr/sword/). Our innovative algorithm assigns structural domains through the hierarchical merging of protein units, which are evolutionarily preserved substructures that describe protein architecture at an intermediate level, between domain and secondary structure. To validate the use of these protein units for decomposing protein structures into domains, we set up an extensive benchmark made of expert annotations of structural domains and including state-of-the-art domain parsing algorithms. The relevance of our “multipartitioning” approach is shown through numerous examples of applications covering protein function, evolution, folding, and structure prediction. Finally, we introduce a measure for the structural ambiguity of protein molecules. PMID:28097215
Ordered nanoparticle arrays formed on engineered chaperonin protein templates
NASA Technical Reports Server (NTRS)
McMillan, R. Andrew; Paavola, Chad D.; Howard, Jeanie; Chan, Suzanne L.; Zaluzec, Nestor J.; Trent, Jonathan D.
2002-01-01
Traditional methods for fabricating nanoscale arrays are usually based on lithographic techniques. Alternative new approaches rely on the use of nanoscale templates made of synthetic or biological materials. Some proteins, for example, have been used to form ordered two-dimensional arrays. Here, we fabricated nanoscale ordered arrays of metal and semiconductor quantum dots by binding preformed nanoparticles onto crystalline protein templates made from genetically engineered hollow double-ring structures called chaperonins. Using structural information as a guide, a thermostable recombinant chaperonin subunit was modified to assemble into chaperonins with either 3 nm or 9 nm apical pores surrounded by chemically reactive thiols. These engineered chaperonins were crystallized into two-dimensional templates up to 20 μm in diameter. The periodic solvent-exposed thiols within these crystalline templates were used to size-selectively bind and organize either gold (1.4, 5 or 10 nm) or CdSe-ZnS semiconductor (4.5 nm) quantum dots into arrays. The order within the arrays was defined by the lattice of the underlying protein crystal. By combining the self-assembling properties of chaperonins with mutations guided by structural modelling, we demonstrate that quantum dots can be manipulated using modified chaperonins and organized into arrays for use in next-generation electronic and photonic devices.
Sharma, Alok K; Krieger, Tobias; Rigby, Alan C; Zelikovic, Israel; Alper, Seth L
2016-12-01
Mutations in the human SLC26A4/Pendrin polypeptide (hPDS) cause Pendred Syndrome/DFNB4, syndromic deafness with enlargement of the vestibular aqueduct and low-penetrance goiter. Here we present data on cloning, protein overexpression and purification, refolding, and biophysical characterization of the recombinant hPDS STAS domain lacking its intrinsic variable sequence (STAS-ΔIVS). We report a reproducible protein refolding protocol enabling milligram scale expression and purification of uniformly 15N- and 13C/15N-enriched hPDS STAS-ΔIVS domain suitable for structural characterization by solution NMR. Circular dichroism, one-dimensional 1H, two-dimensional 1H-15N HSQC, and 1H-13C HSQC NMR spectra confirmed the well-folded state of purified hPDS STAS-ΔIVS in solution. Heteronuclear NMR chemical shift perturbation of select STAS-ΔIVS residues by GDP was observed at fast-to-intermediate NMR time scales. Intrinsic tryptophan fluorescence quench experiments demonstrated GDP binding to hPDS STAS-ΔIVS with Kd of 178 μM. These results are useful for structure/function characterization of hPDS STAS, the cytoplasmic subdomain of the congenital deafness protein, pendrin, as well as for studies of other mammalian STAS domains.
SA-Mot: a web server for the identification of motifs of interest extracted from protein loops
Regad, Leslie; Saladin, Adrien; Maupetit, Julien; Geneix, Colette; Camproux, Anne-Claude
2011-01-01
The detection of functional motifs is an important step for the determination of protein functions. We present here a new web server SA-Mot (Structural Alphabet Motif) for the extraction and location of structural motifs of interest from protein loops. Contrary to other methods, SA-Mot does not focus only on functional motifs, but it extracts recurrent and conserved structural motifs involved in structural redundancy of loops. SA-Mot uses the structural word notion to extract all structural motifs from uni-dimensional sequences corresponding to loop structures. Then, SA-Mot provides a description of these structural motifs using statistics computed in the loop data set and in SCOP superfamily, sequence and structural parameters. SA-Mot results correspond to an interactive table listing all structural motifs extracted from a target structure and their associated descriptors. Using this information, the users can easily locate loop regions that are important for the protein folding and function. The SA-Mot web server is available at http://sa-mot.mti.univ-paris-diderot.fr. PMID:21665924
Długosz, Maciej; Trylska, Joanna
2008-01-01
We present a method for describing and comparing global electrostatic properties of biomolecules based on the spherical harmonic decomposition of electrostatic potential data. Unlike other approaches, our method does not require any prior three-dimensional structural alignment. The electrostatic potential, given as a volumetric data set from a numerical solution of the Poisson or Poisson–Boltzmann equation, is represented with descriptors that are rotation invariant. The method can be applied to large and structurally diverse sets of biomolecules, allowing them to be clustered according to their electrostatic features. PMID:18624502
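One standard way to obtain rotation-invariant descriptors from a spherical harmonic expansion is to sum the squared magnitudes of the coefficients within each degree l. The sketch below illustrates that idea; it assumes the expansion coefficients c(l, m) are already available and does not claim to reproduce the exact descriptors of the paper. The function names are hypothetical.

```python
import cmath
from collections import defaultdict


def degree_power(coeffs):
    """Per-degree power P_l = sum over m of |c_lm|^2.

    `coeffs` maps (l, m) to a complex expansion coefficient. A rotation only
    mixes coefficients that share the same l, and does so unitarily, so each
    P_l is unchanged by rotating the molecule.
    """
    power = defaultdict(float)
    for (l, _m), c in coeffs.items():
        power[l] += abs(c) ** 2
    return dict(power)


def rotate_about_z(coeffs, alpha):
    """Coefficients after rotating the function by `alpha` about the z axis:
    each c_lm only acquires the phase factor exp(-i * m * alpha)."""
    return {(l, m): c * cmath.exp(-1j * m * alpha)
            for (l, m), c in coeffs.items()}


# Hypothetical coefficients up to degree 1.
coeffs = {(0, 0): 1.0 + 0.0j, (1, -1): 0.5 - 0.2j,
          (1, 0): 0.3 + 0.0j, (1, 1): -0.1 + 0.4j}
descriptor = degree_power(coeffs)
descriptor_rotated = degree_power(rotate_about_z(coeffs, 0.7))
```

Because the two descriptors agree, molecules can be compared through their per-degree powers without first superposing them, which matches the alignment-free property described above.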
Buried and accessible surface area control intrinsic protein flexibility.
Marsh, Joseph A
2013-09-09
Proteins experience a wide variety of conformational dynamics that can be crucial for facilitating their diverse functions. How is the intrinsic flexibility required for these motions encoded in their three-dimensional structures? Here, the overall flexibility of a protein is demonstrated to be tightly coupled to the total amount of surface area buried within its fold. A simple proxy for this, the relative solvent-accessible surface area (Arel), therefore shows excellent agreement with independent measures of global protein flexibility derived from various experimental and computational methods. Application of Arel on a large scale demonstrates its utility by revealing unique sequence and structural properties associated with intrinsic flexibility. In particular, flexibility as measured by Arel shows little correspondence with intrinsic disorder, but instead tends to be associated with multiple domains and increased α-helical structure. Furthermore, the apparent flexibility of monomeric proteins is found to be useful for identifying quaternary-structure errors in published crystal structures. There is also a strong tendency for the crystal structures of more flexible proteins to be solved to lower resolutions. Finally, local solvent accessibility is shown to be a primary determinant of local residue flexibility. Overall, this work provides both fundamental mechanistic insight into the origin of protein flexibility and a simple, practical method for predicting flexibility from protein structures. © 2013 Elsevier Ltd. All rights reserved.
Protein Structure Classification and Loop Modeling Using Multiple Ramachandran Distributions.
Najibi, Seyed Morteza; Maadooliat, Mehdi; Zhou, Lan; Huang, Jianhua Z; Gao, Xin
2017-01-01
Recently, the study of protein structures using angular representations has attracted much attention among structural biologists. The main challenge is how to efficiently model the continuous conformational space of the protein structures based on the differences and similarities between different Ramachandran plots. Despite the presence of statistical methods for modeling angular data of proteins, there is still a substantial need for more sophisticated and faster statistical tools to model the large-scale circular datasets. To address this need, we have developed a nonparametric method for collective estimation of multiple bivariate density functions for a collection of populations of protein backbone angles. The proposed method takes into account the circular nature of the angular data using trigonometric spline which is more efficient compared to existing methods. This collective density estimation approach is widely applicable when there is a need to estimate multiple density functions from different populations with common features. Moreover, the coefficients of adaptive basis expansion for the fitted densities provide a low-dimensional representation that is useful for visualization, clustering, and classification of the densities. The proposed method provides a novel and unique perspective to two important and challenging problems in protein structure research: structure-based protein classification and angular-sampling-based protein loop structure prediction.
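Estimating a bivariate density over backbone angles has to respect the circular nature of the data. The sketch below uses a product von Mises kernel density estimator as a simple, well-known stand-in; it is not the trigonometric spline method of the paper, and the function names and the concentration parameter `kappa` are illustrative assumptions.

```python
import math


def bessel_i0(x, terms=60):
    """Modified Bessel function of the first kind, order 0, by power series.
    Adequate for the moderate concentrations used in this sketch."""
    total, term = 1.0, 1.0
    for k in range(1, terms):
        term *= (x / 2.0) ** 2 / (k * k)
        total += term
    return total


def ramachandran_density(phi, psi, samples, kappa):
    """Periodic kernel density estimate at (phi, psi), angles in radians.

    `samples` is a list of observed (phi_i, psi_i) pairs. Each observation
    contributes a product of two von Mises kernels with concentration
    `kappa`, so the estimate is 2*pi-periodic in both angles.
    """
    norm = (2.0 * math.pi * bessel_i0(kappa)) ** 2
    total = 0.0
    for phi_i, psi_i in samples:
        total += math.exp(kappa * (math.cos(phi - phi_i) + math.cos(psi - psi_i)))
    return total / (norm * len(samples))


# Hypothetical observations clustered near helical and extended regions.
samples = [(-1.05, -0.79), (-1.10, -0.70), (-2.09, 2.36)]
value = ramachandran_density(-1.0, -0.8, samples, kappa=2.0)
```

Evaluating such a density on a fixed grid for each population gives one vector per Ramachandran plot, which is one simple route to the kind of low-dimensional comparison and clustering the abstract describes.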
Direct Calculation of Protein Fitness Landscapes through Computational Protein Design
Au, Loretta; Green, David F.
2016-01-01
Naturally selected amino-acid sequences or experimentally derived ones are often the basis for understanding how protein three-dimensional conformation and function are determined by primary structure. Such sequences for a protein family comprise only a small fraction of all possible variants, however, representing the fitness landscape with limited scope. Explicitly sampling and characterizing alternative, unexplored protein sequences would directly identify fundamental reasons for sequence robustness (or variability), and we demonstrate that computational methods offer an efficient mechanism toward this end, on a large scale. The dead-end elimination and A* search algorithms were used here to find all low-energy single mutant variants, and corresponding structures of a G-protein heterotrimer, to measure changes in structural stability and binding interactions to define a protein fitness landscape. We established consistency between these algorithms with known biophysical and evolutionary trends for amino-acid substitutions, and could thus recapitulate known protein side-chain interactions and predict novel ones. PMID:26745411
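Dead-end elimination prunes side-chain rotamers that cannot be part of the lowest-energy conformation. A minimal sketch of one pass of the original elimination criterion follows: rotamer r at a position is discarded when its best possible energy is still worse than the worst possible energy of some competitor t at the same position. The data layout and the name `dead_end_eliminate` are hypothetical and are not taken from the study, which does not specify its implementation in the abstract.

```python
def dead_end_eliminate(self_energy, pair_energy):
    """Return the set of (position, rotamer) pairs removed by one pass of
    the original dead-end elimination criterion.

    self_energy: {position: {rotamer: energy}}
    pair_energy: callable (pos_i, rot_r, pos_j, rot_s) -> interaction energy
    """
    eliminated = set()
    for i, rotamers in self_energy.items():
        others = [j for j in self_energy if j != i]
        for r in rotamers:
            # Best case for r: most favourable partner at every other position.
            best_r = self_energy[i][r] + sum(
                min(pair_energy(i, r, j, s) for s in self_energy[j])
                for j in others)
            for t in rotamers:
                if t == r:
                    continue
                # Worst case for t: least favourable partner everywhere.
                worst_t = self_energy[i][t] + sum(
                    max(pair_energy(i, t, j, s) for s in self_energy[j])
                    for j in others)
                if best_r > worst_t:
                    eliminated.add((i, r))
                    break
    return eliminated
```

In practice the pass is repeated until no further rotamers are removed, and a search such as A* then enumerates the low-energy combinations that remain.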
Predicting Real-Valued Protein Residue Fluctuation Using FlexPred.
Peterson, Lenna; Jamroz, Michal; Kolinski, Andrzej; Kihara, Daisuke
2017-01-01
The conventional view of a protein structure as static provides only a limited picture. There is increasing evidence that protein dynamics are often vital to protein function including interaction with partners such as other proteins, nucleic acids, and small molecules. Considering flexibility is also important in applications such as computational protein docking and protein design. While residue flexibility is partially indicated by experimental measures such as the B-factor from X-ray crystallography and ensemble fluctuation from nuclear magnetic resonance (NMR) spectroscopy as well as computational molecular dynamics (MD) simulation, these techniques are resource-intensive. In this chapter, we describe the web server and stand-alone version of FlexPred, which rapidly predicts absolute per-residue fluctuation from a three-dimensional protein structure. On a set of 592 nonredundant structures, comparing the fluctuations predicted by FlexPred to the observed fluctuations in MD simulations showed an average correlation coefficient of 0.669 and an average root mean square error of 1.07 Å. FlexPred is available at http://kiharalab.org/flexPred/.
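The two agreement measures quoted above, the correlation coefficient and the root mean square error between predicted and observed per-residue fluctuations, can be computed as in the following sketch. It assumes the Pearson correlation is meant, uses only the standard library, and the example values are hypothetical rather than FlexPred output.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def rmse(predicted, observed):
    """Root mean square error, in the same units as the inputs."""
    n = len(predicted)
    return math.sqrt(sum((p - o) ** 2 for p, o in zip(predicted, observed)) / n)


# Hypothetical per-residue fluctuations in angstroms for one structure.
predicted = [0.8, 1.1, 2.4, 1.9, 0.7]
observed = [0.9, 1.0, 3.0, 1.5, 0.6]
r = pearson(predicted, observed)
error = rmse(predicted, observed)
```

The two measures answer different questions: the correlation reports whether flexible and rigid residues are ordered correctly, while the RMSE reports how far the absolute fluctuation values are off, which matters here because absolute fluctuations are being predicted.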
Domain organizations of modular extracellular matrix proteins and their evolution.
Engel, J
1996-11-01
Multidomain proteins which are composed of modular units are a rather recent invention of evolution. Domains are defined as autonomously folding regions of a protein, and many of them are similar in sequence and structure, indicating common ancestry. Their modular nature is emphasized by frequent repetitions in identical or in different proteins and by a large number of different combinations with other domains. The extracellular matrix is perhaps the largest biological system composed of modular mosaic proteins, and its astonishing complexity and diversity are based on them. A cluster of minireviews on modular proteins is being published in Matrix Biology. These deal with the evolution of modular proteins, the three-dimensional structure of domains and the ways in which these interact in a multidomain protein. They discuss structure-function relationships in calcium binding domains, collagen helices, alpha-helical coiled-coil domains and C-lectins. The present minireview is focused on some general aspects and serves as an introduction to the cluster.
The role of protein structural analysis in the next generation sequencing era.
Yue, Wyatt W; Froese, D Sean; Brennan, Paul E
2014-01-01
Proteins are macromolecules that serve a cell's myriad processes and functions in all living organisms via dynamic interactions with other proteins, small molecules and cellular components. Genetic variations in the protein-encoding regions of the human genome account for >85% of all known Mendelian diseases, and play an influential role in shaping complex polygenic diseases. Proteins also serve as the predominant target class for the design of small molecule drugs to modulate their activity. Knowledge of the shape and form of proteins, by means of their three-dimensional structures, is therefore instrumental to understanding their roles in disease and their potentials for drug development. In this chapter we outline, with the wide readership of non-structural biologists in mind, the various experimental and computational methods available for protein structure determination. We summarize how the wealth of structure information, contributed to a large extent by the technological advances in structure determination to date, serves as a useful tool to decipher the molecular basis of genetic variations for disease characterization and diagnosis, particularly in the emerging era of genomic medicine, and becomes an integral component in the modern day approach towards rational drug development.
TIPdb-3D: the three-dimensional structure database of phytochemicals from Taiwan indigenous plants.
Tung, Chun-Wei; Lin, Ying-Chi; Chang, Hsun-Shuo; Wang, Chia-Chi; Chen, Ih-Sheng; Jheng, Jhao-Liang; Li, Jih-Heng
2014-01-01
The rich indigenous and endemic plants in Taiwan serve as a resourceful bank for biologically active phytochemicals. Based on our TIPdb database curating bioactive phytochemicals from Taiwan indigenous plants, this study presents a three-dimensional (3D) chemical structure database named TIPdb-3D to support the discovery of novel pharmacologically active compounds. The Merck Molecular Force Field (MMFF94) was used to generate 3D structures of phytochemicals in TIPdb. The 3D structures could facilitate the analysis of 3D quantitative structure-activity relationship, the exploration of chemical space and the identification of potential pharmacologically active compounds using protein-ligand docking. Database URL: http://cwtung.kmu.edu.tw/tipdb. © The Author(s) 2014. Published by Oxford University Press.
Murata, Michio; Sugiyama, Shigeru; Matsuoka, Shigeru; Matsumori, Nobuaki
2015-08-01
Determining the bioactive structure of membrane lipids is a new concept, which aims to examine the functions of lipids with respect to their three-dimensional structures. As lipids are dynamic by nature, their "structure" does not refer solely to a static picture but also to the local and global motions of the lipid molecules. We consider that interactions with lipids, which are completely defined by their structures, are controlled by the chemical, functional, and conformational matching between lipids and between lipid and protein. In this review, we describe recent advances in understanding the bioactive structures of membrane lipids bound to proteins and related molecules, including some of our recent results. By examining recent works on lipid-raft-related molecules, lipid-protein interactions, and membrane-active natural products, we discuss current perspectives on membrane structural biology. © 2015 The Chemical Society of Japan & Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Dong, Zheng; Zhou, Hongyu; Tao, Peng
2018-02-01
PAS domains are widespread in archaea, bacteria, and eukaryota, and play important roles in various functions. In this study, we aim to explore the functional evolutionary relationships among proteins in the PAS domain superfamily in view of the sequence-structure-dynamics-function relationship. We collected protein sequences and crystal structure data from the RCSB Protein Data Bank for PAS domain superfamily members belonging to three biological functions (nucleotide binding, photoreceptor activity, and transferase activity). Protein sequences were aligned and then used to select sequence-conserved residues and build a phylogenetic tree. Three-dimensional structure alignment was also applied to obtain structure-conserved residues. The protein dynamics were analyzed using an elastic network model (ENM) and validated by molecular dynamics (MD) simulation. The results showed that proteins with the same function could be grouped by sequence similarity, and proteins in different functional groups displayed statistically significant differences in their vibrational patterns. Interestingly, in all three functional groups, conserved amino acid residues identified by sequence and structure conservation analysis generally have a lower fluctuation than other residues. In addition, the fluctuation of conserved residues in each biological function group was strongly correlated with the corresponding biological function. This research suggested a direct connection in which the protein sequences were related to various functions through structural dynamics. This is a new attempt to delineate functional evolution of proteins using the integrated information of sequence, structure, and dynamics. © 2017 The Protein Society.
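As a rough illustration of how an elastic network model yields per-residue fluctuations, the sketch below uses the Gaussian network model (GNM), one common ENM variant. The abstract does not say which ENM variant or cutoff was used, so the 7.0 Å cutoff, the function name, and the toy coordinates are all assumptions. Relative fluctuations are taken from the diagonal of the pseudo-inverse of the contact (Kirchhoff) matrix, in arbitrary units.

```python
import numpy as np

def gnm_fluctuations(coords, cutoff=7.0):
    """Relative mean-square fluctuations from a Gaussian network model.

    coords: (N, 3) array of C-alpha positions in angstroms.
    cutoff: contact distance in angstroms (assumed value, not from the study).
    """
    coords = np.asarray(coords, dtype=float)
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    contact = dist <= cutoff
    np.fill_diagonal(contact, False)          # a residue is not its own neighbor
    kirchhoff = -contact.astype(float)        # off-diagonal: -1 for each contact
    np.fill_diagonal(kirchhoff, contact.sum(axis=1))  # diagonal: contact count
    # The Kirchhoff matrix is singular, so the pseudo-inverse is used.
    return np.diag(np.linalg.pinv(kirchhoff))

# Toy example: five pseudo-residues on a straight line, 3.8 angstroms apart.
toy = [[3.8 * i, 0.0, 0.0] for i in range(5)]
print(gnm_fluctuations(toy, cutoff=4.0))
```

In this toy chain the terminal positions come out more mobile than the central one, which mirrors the general observation that well-connected residues fluctuate less.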
Studies of Single Biomolecules, DNA Conformational Dynamics, and Protein Binding
2008-07-11
Nucleotide Base pairs Hydrogen bonds FIG. 1: Ladder structure of DNA showing the Watson - Crick bonding of the bases A, T, G, and C which are suspended by a...protected against unwanted action of chemicals and proteins. The three-dimensional structure of DNA is the famed Watson - Crick double-helix, the equilibrium...quantitative analysis [88]. [1] A. Kornberg and T. A. Baker, DNA Replication (W. H. Freeman, New York, 1992). [2] J. D. Watson and F. H. C. Crick
ERIC Educational Resources Information Center
Rossi, Sergio; Benaglia, Maurizio; Brenna, Davide; Porta, Riccardo; Orlandi, Manuel
2015-01-01
A simple procedure to convert protein data bank files (.pdb) into a stereolithography file (.stl) using VMD software (Visual Molecular Dynamics) is reported. This tutorial allows generating, with a very simple protocol, three-dimensional customized structures that can be printed by a low-cost 3D-printer, and used for teaching chemical education…
Matching organic libraries with protein-substructures
NASA Astrophysics Data System (ADS)
Preissner, R.; Goede, A.; Rother, K.; Osterkamp, F.; Koert, U.; Froemmel, C.
2001-09-01
We present a general approach which allows automatic identification of sub-structures in proteins that resemble given three-dimensional templates. This paper documents its success with non-peptide templates such as β-turn mimetics. We considered well-tested turn-mimetics such as the bicyclic turned dipeptide (BTD), spiro lactam (Spiro) and the 2,5-disubstituted tetrahydrofuran (THF), a new furan-derivative which was recently developed and characterized. The detected geometric similarity between the templates and the protein patches corresponds to r.m.s.-values of 0.3 Å for more than 80% of the constituting atoms, which is typical for active site comparisons of homologous proteins. This fast automatic procedure might be of biomedical value for finding special mimicking leads for particular protein sub-structures as well as for template-assembled synthetic protein (TASP) design.
Papillomavirus E6 oncoproteins
Vande Pol, Scott B.; Klingelhutz, Aloysius J.
2013-01-01
Papillomaviruses induce benign and malignant epithelial tumors, and the viral E6 oncoprotein is essential for full transformation. E6 contributes to transformation by associating with cellular proteins, docking on specific acidic LXXLL peptide motifs found on the associated cellular proteins. This review examines insights from recent studies of human and animal E6 proteins that determine the three-dimensional structure of E6 when bound to acidic LXXLL peptides. The structure of E6 is related to recent advances in the purification and identification of E6 associated protein complexes. These E6 protein-complexes, together with other proteins that bind to E6, alter a broad array of biological outcomes including modulation of cell survival, cellular transcription, host cell differentiation, growth factor dependence, DNA damage responses, and cell cycle progression. PMID:23711382
Structure and interactions in biomaterials based on membrane-biopolymer self-assembly
NASA Astrophysics Data System (ADS)
Koltover, Ilya
Physical and chemical properties of artificial pure lipid membranes have been extensively studied during the last two decades and are relatively well understood. However, most real membrane systems of biological and biotechnological importance incorporate macromolecules either embedded into the membranes or adsorbed onto their surfaces. We have investigated three classes of self-assembled membrane-biopolymer biomaterials: (i) Structure, interactions and stability of the two-dimensional crystals of the integral membrane protein bacteriorhodopsin (bR). We have conducted a synchrotron x-ray diffraction study of oriented bR multilayers. The important findings were as follows: (1) the protein 2D lattice exhibited diffraction patterns characteristic of a 2D solid with power-law decay of in-plane positional correlations, which allowed measurement of the elastic constants of the protein crystal; (2) the crystal melting temperature was a function of the multilayer hydration, reflecting the effect of inter-membrane repulsion on the stability of the protein lattice; (3) preparation of nearly perfect (mosaicity < 0.04°) multilayers of fused bR membranes permitted, for the first time, application of powerful interface-sensitive x-ray scattering techniques to a membrane-protein system. (ii) Interactions between particles chemically attached to or adsorbed onto the surfaces of flexible giant phospholipid vesicles. Using video-enhanced light microscopy we have observed a membrane-distortion-induced attraction between the particles, with an interaction range of the order of the particle diameter. Fluid membranes decorated with many particles exhibited (a) finite-sized two-dimensional close-packed aggregates and (b) one-dimensional ring-like aggregates. (iii) Structure, stability and interactions in the cationic lipid-DNA complexes. Cationic liposomes complexed with DNA are among the most promising synthetic non-viral carriers of DNA vectors currently used in gene therapy applications. 
We have established that DNA complexes with a cationic lipid (DOTAP) and a neutral lipid (DOPC) have a compact multilayer liquid crystalline structure (L_α^c) with DNA intercalated between the lipid bilayers in a periodic 2D smectic phase. Furthermore, a different 2D columnar phase of complexes was found in mixtures with the transfection-enhancing lipid DOPE. This structure (H_II^c), derived from synchrotron x-ray diffraction, consists of DNA coated by cationic lipid monolayers and arranged on a two-dimensional hexagonal lattice. Optical microscopy revealed that the L_α^c complexes bind stably to anionic vesicles (models of cellular membranes), whereas the more transfectant H_II^c complexes are unstable, rapidly fusing and releasing DNA upon adhering to anionic vesicles.
NASA Astrophysics Data System (ADS)
Rauf, Muhammad; Saeed, Nasir A.; Habib, Imran; Ahmed, Moddassir; Shahzad, Khurram; Mansoor, Shahid; Ali, Rashid
2017-02-01
Structure prediction can provide information about the function and active sites of a protein, which helps in designing new functional proteins. H+-pyrophosphatase is a transmembrane protein involved in establishing the proton motive force for active transport of Na+ across the membrane by Na+/H+ antiporters. A full-length novel H+-pyrophosphatase gene was isolated from the halophytic grass Leptochloa fusca using RT-PCR and RACE methods. The full-length LfVP1 gene sequence of 2292 nucleotides encodes a protein of 764 amino acids. DNA and protein sequences were characterized using bioinformatics tools. Various important potential sites were predicted by the PROSITE webserver. Primary structural analysis showed LfVP1 to be a stable protein, and the grand average of hydropathy (GRAVY) indicated that the LfVP1 protein has good hydrosolubility. Secondary structure analysis showed that the LfVP1 protein sequence contains a significant proportion of alpha helix and random coil. Protein membrane topology suggested the presence of 14 transmembrane domains and of a catalytic domain in TM3. The three-dimensional structure predicted from the LfVP1 protein sequence also indicated the presence of 14 transmembrane domains, and a hydrophobicity surface model showed amino acid hydrophobicity. The Ramachandran plot showed that 98% of amino acid residues were predicted to lie in the favored region.
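For readers unfamiliar with the GRAVY index mentioned above, it is conventionally the mean Kyte-Doolittle hydropathy value over all residues of a sequence. The sketch below assumes that convention; the abstract does not say which tool computed the value, and the example peptides are invented rather than taken from LfVP1.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def gravy(sequence):
    """Grand average of hydropathy: mean Kyte-Doolittle value per residue.

    Raises KeyError for non-standard residue codes.
    """
    seq = sequence.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)

# Hypothetical peptides, not taken from the LfVP1 sequence.
print(gravy("LLVIA"))   # hydrophobic stretch, positive score
print(gravy("KDERS"))   # charged/polar stretch, negative score
```

On this scale positive values indicate net hydrophobic character and negative values net hydrophilic character.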
Predicting nucleic acid binding interfaces from structural models of proteins.
Dror, Iris; Shazman, Shula; Mukherjee, Srayanta; Zhang, Yang; Glaser, Fabian; Mandel-Gutfreund, Yael
2012-02-01
The function of DNA- and RNA-binding proteins can be inferred from the characterization and accurate prediction of their binding interfaces. However, the main pitfall of various structure-based methods for predicting nucleic acid binding function is that they are all limited to a relatively small number of proteins for which high-resolution three-dimensional structures are available. In this study, we developed a pipeline for extracting functional electrostatic patches from surfaces of protein structural models, obtained using the I-TASSER protein structure predictor. The largest positive patches are extracted from the protein surface using the patchfinder algorithm. We show that functional electrostatic patches extracted from an ensemble of structural models highly overlap the patches extracted from high-resolution structures. Furthermore, by testing our pipeline on a set of 55 known nucleic acid binding proteins for which I-TASSER produces high-quality models, we show that the method accurately identifies the nucleic acid binding interface on structural models of proteins. Employing a combined patch approach, we show that patches extracted from an ensemble of models better predict the real nucleic acid binding interfaces compared with patches extracted from independent models. Overall, these results suggest that combining information from a collection of low-resolution structural models could be a valuable approach for functional annotation. We suggest that our method will be further applicable for predicting other functional surfaces of proteins with unknown structure. Copyright © 2011 Wiley Periodicals, Inc.
Three-dimensional Architecture of Hair-bundle Linkages Revealed by Electron-microscopic Tomography
Auer, Manfred; Koster, Abrahram J.; Ziese, Ulrike; Bajaj, Chandrajit; Volkmann, Niels; Wang, Da Neng
2008-01-01
The senses of hearing and balance rest upon mechanoelectrical transduction by the hair bundles of hair cells in the inner ear. Located at the apical cellular surface, each hair bundle comprises several tens of stereocilia and a single kinocilium that are interconnected by extracellular proteinaceous links. Using electron-microscopic tomography of bullfrog saccular sensory epithelia, we examined the three-dimensional structures of basal links, kinociliary links, and tip links. We observed significant differences in the appearances and dimensions of these three structures and found two distinct populations of tip links suggestive of the involvement of different proteins, splice variants, or protein–protein interactions. We noted auxiliary links connecting the upper portions of tip links to the taller stereocilia. Tip links and auxiliary links show a tendency to adopt a globular conformation when disconnected from the membrane surface. PMID:18421501
Muraki, Michiro
2016-01-01
Human Fas ligand extracellular domain has been investigated as an important target protein in the field of medical biotechnology. In a recent study, the author developed an effective method to produce biologically active human Fas ligand extracellular domain derivatives using site-specific chemical modifications. A human Fas ligand extracellular domain derivative containing a reactive cysteine residue within its N-terminal tag sequence, which is located away from the binding interface between the ligand and the receptor in the three-dimensional structure, was modified by Fluorescein-5-Maleimide without impairing the specific binding activity toward human Fas receptor extracellular domain. The purified protein sample free of low molecular-weight contaminants showed a characteristic fluorescence spectrum derived from the attached Fluorescein moieties, and formed a stable binding complex with human Fas receptor extracellular domain-human IgG1 Fc domain fusion protein in solution. The conjugation number of the fluorochrome was estimated to be 2.5 per single human Fas ligand extracellular domain trimer from the ratio of the absorbance value at 280 nm to that at 495 nm. A functional fluorescent human Fas ligand extracellular domain derivative was prepared via a site-specific conjugation of fluorochrome, which was guided by the three-dimensional structure information on the ligand-receptor complex. Fluorescent derivatives created by this method may contribute to the development of an improved diagnosis system for the diseases related to Fas receptor.
In situ structural analysis of the Yersinia enterocolitica injectisome
Kudryashev, Mikhail; Stenta, Marco; Schmelz, Stefan; Amstutz, Marlise; Wiesand, Ulrich; Castaño-Díez, Daniel; Degiacomi, Matteo T; Münnich, Stefan; Bleck, Christopher KE; Kowal, Julia; Diepold, Andreas; Heinz, Dirk W; Dal Peraro, Matteo; Cornelis, Guy R; Stahlberg, Henning
2013-01-01
Injectisomes are multi-protein transmembrane machines allowing pathogenic bacteria to inject effector proteins into eukaryotic host cells, a process called type III secretion. Here we present the first three-dimensional structure of Yersinia enterocolitica and Shigella flexneri injectisomes in situ and the first structural analysis of the Yersinia injectisome. Unexpectedly, basal bodies of injectisomes inside the bacterial cells showed length variations of 20%. The in situ structures of the Y. enterocolitica and S. flexneri injectisomes had similar dimensions and were significantly longer than the isolated structures of related injectisomes. The crystal structure of the inner membrane injectisome component YscD appeared elongated compared to a homologous protein, and molecular dynamics simulations documented its elongation elasticity. The ring-shaped secretin YscC at the outer membrane was stretched by 30–40% in situ, compared to its isolated liposome-embedded conformation. We suggest that elasticity is critical for some two-membrane spanning protein complexes to cope with variations in the intermembrane distance. DOI: http://dx.doi.org/10.7554/eLife.00792.001 PMID:23908767
Leite, Wellington C; Galvão, Carolina W; Saab, Sérgio C; Iulek, Jorge; Etto, Rafael M; Steffens, Maria B R; Chitteni-Pattu, Sindhu; Stanage, Tyler; Keck, James L; Cox, Michael M
2016-01-01
The bacterial RecA protein plays a role in the complex system of DNA damage repair. Here, we report the functional and structural characterization of the Herbaspirillum seropedicae RecA protein (HsRecA). HsRecA protein is more efficient at displacing SSB protein from ssDNA than Escherichia coli RecA protein. HsRecA also promotes DNA strand exchange more efficiently. The three-dimensional structure of the HsRecA-ADP/ATP complex has been solved to 1.7 Å resolution. HsRecA protein contains a small N-terminal domain, a central core ATPase domain and a large C-terminal domain, which are similar to those of homologous bacterial RecA proteins. Comparative structural analysis showed that the N-terminal polymerization motif of archaeal and eukaryotic RecA family proteins is also present in bacterial RecAs. Reconstruction of electrostatic potential from the hexameric structure of HsRecA-ADP/ATP revealed a high positive charge along the inner side, where ssDNA is bound inside the filament. The properties of this surface may explain the greater capacity of HsRecA protein to bind ssDNA, forming a contiguous nucleoprotein filament, displace SSB and promote DNA exchange relative to EcRecA. Our functional and structural analyses provide insight into the molecular mechanisms of polymerization of bacterial RecA as a helical nucleoprotein filament.
Bayesian Peak Picking for NMR Spectra
Cheng, Yichen; Gao, Xin; Liang, Faming
2013-01-01
Protein structure determination is a very important topic in structural genomics, which helps in understanding a variety of biological functions such as protein-protein interactions, protein–DNA interactions and so on. Nowadays, nuclear magnetic resonance (NMR) has often been used to determine the three-dimensional structures of protein in vivo. This study aims to automate the peak picking step, the most important and tricky step in NMR structure determination. We propose to model the NMR spectrum by a mixture of bivariate Gaussian densities and use the stochastic approximation Monte Carlo algorithm as the computational tool to solve the problem. Under the Bayesian framework, the peak picking problem is cast as a variable selection problem. The proposed method can automatically distinguish true peaks from false ones without preprocessing the data. To the best of our knowledge, this is the first effort in the literature that tackles the peak picking problem for NMR spectrum data using a Bayesian method. PMID:24184964
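The modeling idea above, a spectrum represented as a mixture of bivariate Gaussian densities, can be sketched as follows. This shows only the forward model (intensity at a point given a set of candidate peaks); the stochastic approximation Monte Carlo sampling and the Bayesian variable selection are not shown. The parameterization, function names, and peak values are hypothetical and are not taken from the paper.

```python
import math

def bivariate_gaussian(x, y, mx, my, sx, sy, rho=0.0):
    """Bivariate normal density with means (mx, my), standard deviations
    (sx, sy), and correlation rho, evaluated at (x, y)."""
    dx = (x - mx) / sx
    dy = (y - my) / sy
    one_minus_r2 = 1.0 - rho * rho
    norm = 2.0 * math.pi * sx * sy * math.sqrt(one_minus_r2)
    expo = -(dx * dx - 2.0 * rho * dx * dy + dy * dy) / (2.0 * one_minus_r2)
    return math.exp(expo) / norm

def spectrum_model(x, y, peaks):
    """Modeled intensity at (x, y): amplitude-weighted sum over candidate peaks."""
    return sum(
        p["amp"] * bivariate_gaussian(x, y, p["mx"], p["my"],
                                      p["sx"], p["sy"], p.get("rho", 0.0))
        for p in peaks
    )

# Two hypothetical peaks on a 2D chemical-shift plane (ppm values are made up).
peaks = [
    {"amp": 5.0, "mx": 8.2, "my": 120.0, "sx": 0.02, "sy": 0.3},
    {"amp": 3.0, "mx": 7.9, "my": 115.5, "sx": 0.03, "sy": 0.4, "rho": 0.1},
]
print(spectrum_model(8.2, 120.0, peaks))
```

In a variable selection formulation, each candidate peak would carry an inclusion indicator, and picking peaks amounts to inferring which indicators are switched on.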
Trujillo, Uldaeliz; Vázquez-Rosa, Edwin; Oyola-Robles, Delise; Stagg, Loren J.; Vassallo, David A.; Vega, Irving E.; Arold, Stefan T.; Baerga-Ortiz, Abel
2013-01-01
The polyunsaturated fatty acid (PUFA) synthases from deep-sea bacteria invariably contain multiple acyl carrier protein (ACP) domains in tandem. This conserved tandem arrangement has been implicated in both amplification of fatty acid production (additive effect) and in structural stabilization of the multidomain protein (synergistic effect). While the more accepted model is one in which domains act independently, recent reports suggest that ACP domains may form higher oligomers. Elucidating the three-dimensional structure of tandem arrangements may therefore give important insights into the functional relevance of these structures, and hence guide bioengineering strategies. In an effort to elucidate the three-dimensional structure of tandem repeats from deep-sea anaerobic bacteria, we have expressed and purified a fragment consisting of five tandem ACP domains from the PUFA synthase from Photobacterium profundum. Analysis of the tandem ACP fragment by analytical gel filtration chromatography showed a retention time suggestive of a multimeric protein. However, small angle X-ray scattering (SAXS) revealed that the multi-ACP fragment is an elongated monomer which does not form a globular unit. Stokes radii calculated from atomic monomeric SAXS models were comparable to those measured by analytical gel filtration chromatography, showing that in the gel filtration experiment, the molecular weight was overestimated due to the elongated protein shape. Thermal denaturation monitored by circular dichroism showed that unfolding of the tandem construct was not cooperative, and that the tandem arrangement did not stabilize the protein. Taken together, these data are consistent with an elongated beads-on-a-string arrangement of the tandem ACP domains in PUFA synthases, and speak against synergistic biocatalytic effects promoted by quaternary structuring. 
Thus, it is possible to envision bioengineering strategies which simply involve the artificial linking of multiple ACP domains for increasing the yield of fatty acids in bacterial cultures. PMID:23469090
Getzenberg, R H; Coffey, D S
1990-09-01
The DNA of interphase nuclei has a very specific three-dimensional organization that differs between cell types, and it is possible that this varying DNA organization is responsible for the tissue specificity of gene expression. The nuclear matrix organizes the three-dimensional structure of the DNA and is believed to be involved in the control of gene expression. This study compares the nuclear structural proteins between two sex accessory tissues in the same animal responding to the same androgen stimulation by the differential expression of major tissue-specific secretory proteins. We demonstrate here that the nuclear matrix is tissue specific in the rat ventral prostate and seminal vesicle, and undergoes characteristic alterations in its protein composition upon androgen withdrawal. Three types of nuclear matrix proteins were observed: 1) nuclear matrix proteins that are different and tissue specific in the rat ventral prostate and seminal vesicle, 2) a set of nuclear matrix proteins that either appear or disappear upon androgen withdrawal, and 3) a set of proteins that are common to both the ventral prostate and seminal vesicle and do not change with the hormonal state of the animal. Since the nuclear matrix is known to bind androgen receptors in a tissue- and steroid-specific manner, we propose that the tissue specificity of the nuclear matrix arranges the DNA in a unique conformation, which may be involved in the specific interaction of transcription factors with DNA sequences, resulting in tissue-specific patterns of secretory protein expression.
Reed, Benjamin J.; Locke, Melissa N.; Gardner, Richard G.
2015-01-01
In the canonical view of protein function, it is generally accepted that the three-dimensional structure of a protein determines its function. However, the past decade has seen a dramatic growth in the identification of proteins with extensive intrinsically disordered regions (IDRs), which are conformationally plastic and do not appear to adopt single three-dimensional structures. One current paradigm for IDR function is that disorder enables IDRs to adopt multiple conformations, expanding the ability of a protein to interact with a wide variety of disparate proteins. The capacity for many interactions is an important feature of proteins that occupy the hubs of protein networks, in particular protein-modifying enzymes that usually have a broad spectrum of substrates. One such protein modification is ubiquitination, where ubiquitin is attached to proteins through ubiquitin ligases (E3s) and removed through deubiquitinating enzymes. Numerous proteomic studies have found that thousands of proteins are dynamically regulated by cycles of ubiquitination and deubiquitination. Thus, how these enzymes target their wide array of substrates is of considerable importance for understanding the function of the cell's diverse ubiquitination networks. Here, we characterize a yeast deubiquitinating enzyme, Ubp10, that possesses IDRs flanking its catalytic protease domain. We show that Ubp10 possesses multiple, distinct binding modules within its IDRs that are necessary and sufficient for directing protein interactions important for Ubp10's known roles in gene silencing and ribosome biogenesis. The human homolog of Ubp10, USP36, also has IDRs flanking its catalytic domain, and these IDRs similarly contain binding modules important for protein interactions. This work highlights the significant protein interaction scaffolding abilities of IDRs in the regulation of dynamic protein ubiquitination. PMID:26149687
NASA Astrophysics Data System (ADS)
Hikosaka, Ryouichi; Nagata, Fukue; Tomita, Masahiro; Kato, Katsuya
2016-10-01
Antibodies have received significant attention for use as antibody drugs, because they bind the objective protein (antigen) via antigen-antibody reactions. Recently, many reports have appeared on various monoclonal antibodies that recognize a single antigen. In this study, monoclonal antibodies are used as adsorbates on mesoporous silica (MPS) for affinity chromatography. MPS has high surface area and large pore volume; moreover, pore diameter, pore structure, and particle morphology are relatively easy to tune by adjusting the conditions of synthesis. The pore structure (two-dimensional (2D) hexagonal and three-dimensional cubic) and particle morphology (spherical and polyhedral) of MPS are optimized for use in a monoclonal antibody/MPS composite. When anti-IgG (one of the monoclonal antibodies) adsorbs on the MPS material and IgG (antigen) binds to anti-IgG/MPS composites, MCM-41p with a 2D-hexagonal pore structure and polyhedral particle morphology has the highest IgG binding efficiency. In addition, the antibody/MPS composites remain stable in chaotropic and low-pH solutions and can be cycled at least five times without decreasing IgG elution. In purification and removal tests, the use of the antibody/MPS composites allows only the objective protein from protein mixtures to be bound and eluted.
Albert, Armando; Yunta, Cristina; Arranz, Rocío; Peña, Álvaro; Salido, Eduardo; Valpuesta, José María; Martín-Benito, Jaime
2010-01-01
Primary hyperoxaluria type 1 is a rare autosomal recessive disease caused by mutations in the alanine glyoxylate aminotransferase gene (AGXT). We have previously shown that the P11L and I340M polymorphisms together with the I244T mutation (AGXT-LTM) represent a conformational disease that could be amenable to pharmacological intervention. Thus, the study of the folding mechanism of AGXT is crucial to understand the molecular basis of the disease. Here, we provide biochemical and structural data showing that AGXT-LTM is able to form non-native folding intermediates. The three-dimensional structure of a complex between the bacterial chaperonin GroEL and a folding intermediate of the AGXT-LTM mutant has been solved by cryoelectron microscopy. The electron density map shows the protein substrate in a non-native extended conformation that crosses the GroEL central cavity. Addition of ATP to the complex induces conformational changes in the chaperonin and the internalization of the protein substrate into the folding cavity. The structure provides a three-dimensional picture of an in vivo early ATP-dependent step of the folding reaction cycle of the chaperonin and supports a GroEL functional model in which the chaperonin promotes folding of the AGXT-LTM mutant protein through a forced unfolding mechanism. PMID:20056599
Albert, Armando; Yunta, Cristina; Arranz, Rocío; Peña, Alvaro; Salido, Eduardo; Valpuesta, José María; Martín-Benito, Jaime
2010-02-26
Primary hyperoxaluria type 1 is a rare autosomal recessive disease caused by mutations in the alanine glyoxylate aminotransferase gene (AGXT). We have previously shown that the P11L and I340M polymorphisms together with the I244T mutation (AGXT-LTM) represent a conformational disease that could be amenable to pharmacological intervention. Thus, the study of the folding mechanism of AGXT is crucial to understand the molecular basis of the disease. Here, we provide biochemical and structural data showing that AGXT-LTM is able to form non-native folding intermediates. The three-dimensional structure of a complex between the bacterial chaperonin GroEL and a folding intermediate of the AGXT-LTM mutant has been solved by cryoelectron microscopy. The electron density map shows the protein substrate in a non-native extended conformation that crosses the GroEL central cavity. Addition of ATP to the complex induces conformational changes in the chaperonin and the internalization of the protein substrate into the folding cavity. The structure provides a three-dimensional picture of an in vivo early ATP-dependent step of the folding reaction cycle of the chaperonin and supports a GroEL functional model in which the chaperonin promotes folding of the AGXT-LTM mutant protein through a forced unfolding mechanism.
Feiten, Mirian Cristina; Di Luccio, Marco; Santos, Karine F; de Oliveira, Débora; Oliveira, J Vladimir
2017-06-01
The study of enzyme function often involves a multi-disciplinary approach. Several techniques are documented in the literature towards determining secondary and tertiary structures of enzymes, and X-ray crystallography is the most explored technique for obtaining three-dimensional structures of proteins. Knowledge of three-dimensional structures is essential to understand reaction mechanisms at the atomic level. Additionally, structures can be used to modulate or improve functional activity of enzymes by the production of small molecules that act as substrates/cofactors or by engineering selected mutants with enhanced biological activity. This paper presents a short overview on how to streamline sample preparation for crystallographic studies of treated enzymes. We additionally review recent developments on the effects of pressurized fluid treatment on activity and stability of commercial enzymes. Future directions and perspectives on the role of crystallography as a tool to access the molecular mechanisms underlying enzymatic activity modulation upon treatment in pressurized fluids are also addressed.
Conserved thioredoxin fold is present in Pisum sativum L. sieve element occlusion-1 protein
Umate, Pavan; Tuteja, Renu
2010-01-01
A homology-based three-dimensional model for the Pisum sativum sieve element occlusion 1 (Ps.SEO1) (forisome) protein was constructed. A stretch of amino acids (residues 320 to 456) which is well conserved in all known members of the forisome proteins was used to model the 3D structure of Ps.SEO1. The structural prediction was done using the Protein Homology/analogY Recognition Engine (PHYRE) web server. Based on studies of local sequence alignment, the thioredoxin-fold containing protein [Structural Classification of Proteins (SCOP) code d1o73a_], a member of the glutathione peroxidase family, was selected as a template for modeling the spatial structure of Ps.SEO1. Selection was based on comparison of primary sequence, higher match quality and alignment accuracy. Motif 1 (EVF) is conserved in Ps.SEO1, Vicia faba (Vf.For1) and Medicago truncatula (MT.SEO3); motif 2 (KKED) is well conserved across all forisome proteins and motif 3 (IGYIGNP) is conserved in Ps.SEO1 and Vf.For1. PMID:20404566
Sixty-five years of the long march in protein secondary structure prediction: the final stretch?
Yang, Yuedong; Gao, Jianzhao; Wang, Jihua; Heffernan, Rhys; Hanson, Jack; Paliwal, Kuldip; Zhou, Yaoqi
2018-01-01
Protein secondary structure prediction began in 1951 when Pauling and Corey predicted helical and sheet conformations for protein polypeptide backbone even before the first protein structure was determined. Sixty-five years later, powerful new methods breathe new life into this field. The highest three-state accuracy without relying on structure templates is now at 82–84%, a number unthinkable just a few years ago. These improvements came from increasingly larger databases of protein sequences and structures for training, the use of template secondary structure information and more powerful deep learning techniques. As we are approaching the theoretical limit of three-state prediction (88–90%), alternatives to secondary structure prediction (prediction of backbone torsion angles and Cα-atom-based angles and torsion angles) not only have more room for further improvement but also allow direct prediction of three-dimensional fragment structures with constantly improved accuracy. About 20% of all 40-residue fragments in a database of 1199 non-redundant proteins have <6 Å root-mean-squared distance from the native conformations by SPIDER2. More powerful deep learning methods with improved capability of capturing long-range interactions begin to emerge as the next generation of techniques for secondary structure prediction. The time has come to finish off the final stretch of the long march towards protein secondary structure prediction. PMID:28040746
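The three-state accuracy quoted in this abstract is commonly computed as the fraction of residues whose predicted label (helix H, strand E, coil C) matches the observed one. The sketch below illustrates that calculation only; the function name `q3_accuracy` and the two example strings are hypothetical and are not taken from the paper.

```python
def q3_accuracy(predicted: str, observed: str) -> float:
    """Return the fraction of residues with matching three-state labels."""
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")
    if not observed:
        raise ValueError("inputs must not be empty")
    allowed = set("HEC")
    if (set(predicted) | set(observed)) - allowed:
        raise ValueError("labels must be H (helix), E (strand) or C (coil)")
    matches = sum(p == o for p, o in zip(predicted, observed))
    return matches / len(observed)


# Hypothetical 18-residue example: two residues at segment ends are mislabeled.
observed = "CCHHHHHHCCEEEEECCC"
predicted = "CCHHHHHCCCEEEECCCC"
score = q3_accuracy(predicted, observed)
print(f"Q3 = {score:.3f}")
```

In this toy case 16 of 18 residues agree; the errors sit at the ends of the helix and the strand, which is where three-state predictors typically disagree with the reference assignment.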
Damberger, F. F.; Pelton, J. G.; Harrison, C. J.; Nelson, H. C.; Wemmer, D. E.
1994-01-01
The solution structure of the 92-residue DNA-binding domain of the heat shock transcription factor from Kluyveromyces lactis has been determined using multidimensional NMR methods. Three-dimensional (3D) triple resonance, 1H-13C-13C-1H total correlation spectroscopy, and 15N-separated total correlation spectroscopy-heteronuclear multiple quantum correlation experiments were used along with various 2D spectra to make nearly complete assignments for the backbone and side-chain 1H, 15N, and 13C resonances. Five-hundred eighty-three NOE constraints identified in 3D 13C- and 15N-separated NOE spectroscopy (NOESY)-heteronuclear multiple quantum correlation spectra and a 4-dimensional 13C/13C-edited NOESY spectrum, along with 35 phi, 9 chi 1, and 30 hydrogen bond constraints, were used to calculate 30 structures by a hybrid distance geometry/simulated annealing protocol, of which 24 were used for structural comparison. The calculations revealed that a 3-helix bundle packs against a small 4-stranded antiparallel beta-sheet. The backbone RMS deviation (RMSD) for the family of structures was 1.03 ± 0.19 Å with respect to the average structure. The topology is analogous to that of the C-terminal domain of the catabolite gene activator protein and appears to be in the helix-turn-helix family of DNA-binding proteins. The overall fold determined by the NMR data is consistent with recent crystallographic work on this domain (Harrison CJ, Bohm AA, Nelson HCM, 1994, Science 263:224) as evidenced by RMSD between backbone atoms in the NMR and X-ray structures of 1.77 ± 0.20 Å. Several differences were identified, some of which may be due to protein-protein interactions in the crystal. PMID:7849597
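A value such as "RMSD with respect to the average structure" can be illustrated with a short sketch. It assumes the models have already been superimposed (the fitting step is deliberately omitted), and the helper names and the toy three-model ensemble are hypothetical rather than the authors' procedure.

```python
import math
from statistics import mean, stdev

Coords = list[tuple[float, float, float]]


def mean_structure(ensemble: list[Coords]) -> Coords:
    """Average each atom position over all (pre-superimposed) models."""
    n_models = len(ensemble)
    n_atoms = len(ensemble[0])
    return [
        tuple(sum(model[i][k] for model in ensemble) / n_models for k in range(3))
        for i in range(n_atoms)
    ]


def rmsd(a: Coords, b: Coords) -> float:
    """Root-mean-square deviation between two equally sized coordinate sets."""
    if len(a) != len(b):
        raise ValueError("coordinate sets must contain the same number of atoms")
    squared = sum(
        (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2 + (p[2] - q[2]) ** 2
        for p, q in zip(a, b)
    )
    return math.sqrt(squared / len(a))


def ensemble_rmsd(ensemble: list[Coords]) -> tuple[float, float]:
    """Mean and sample standard deviation of each model's RMSD to the average."""
    average = mean_structure(ensemble)
    values = [rmsd(model, average) for model in ensemble]
    return mean(values), stdev(values)


# Toy ensemble: three two-atom "models" displaced along y (coordinates in Å).
toy_ensemble = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 1.0, 0.0), (1.0, 1.0, 0.0)],
    [(0.0, 2.0, 0.0), (1.0, 2.0, 0.0)],
]
avg_rmsd, spread = ensemble_rmsd(toy_ensemble)
print(f"{avg_rmsd:.2f} +/- {spread:.2f} A")
```

For real NMR ensembles the models would first be fitted onto a common frame (for example with a least-squares superposition) and only backbone atoms would be passed in.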
Yu, Isseki; Mori, Takaharu; Ando, Tadashi; Harada, Ryuhei; Jung, Jaewoon; Sugita, Yuji; Feig, Michael
2016-01-01
Biological macromolecules function in highly crowded cellular environments. The structure and dynamics of proteins and nucleic acids are well characterized in vitro, but in vivo crowding effects remain unclear. Using molecular dynamics simulations of a comprehensive atomistic model cytoplasm we found that protein-protein interactions may destabilize native protein structures, whereas metabolite interactions may induce more compact states due to electrostatic screening. Protein-protein interactions also resulted in significant variations in reduced macromolecular diffusion under crowded conditions, while metabolites exhibited significant two-dimensional surface diffusion and altered protein-ligand binding that may reduce the effective concentration of metabolites and ligands in vivo. Metabolic enzymes showed weak non-specific association in cellular environments attributed to solvation and entropic effects. These effects are expected to have broad implications for the in vivo functioning of biomolecules. This work is a first step towards physically realistic in silico whole-cell models that connect molecular with cellular biology. DOI: http://dx.doi.org/10.7554/eLife.19274.001 PMID:27801646
Solution structure and interactions of the Escherichia coli cell division activator protein CedA.
Chen, Ho An; Simpson, Peter; Huyton, Trevor; Roper, David; Matthews, Stephen
2005-05-10
CedA is a protein that is postulated to be involved in the regulation of cell division in Escherichia coli and related organisms; however, little biological data about its possible mode of action are available. Here we present a three-dimensional structure of this protein as determined by NMR spectroscopy. The protein is made up of four antiparallel beta-strands, an alpha-helix, and a large unstructured stretch of residues at the N-terminus. It shows structural similarity to a family of DNA-binding proteins which interact with dsDNA via a three-stranded beta-sheet, suggesting that CedA may be a DNA-binding protein. The putative binding surface of CedA is predominantly positively charged with a number of basic residues surrounding a groove largely dominated by aromatic residues. NMR chemical shift perturbations and gel-shift experiments performed with CedA confirm that the protein binds dsDNA, and its interaction is mediated primarily via the beta-sheet.
Meeting Report: Structural Determination of Environmentally Responsive Proteins
Reinlib, Leslie
2005-01-01
The three-dimensional structure of gene products continues to be a missing lynchpin between linear genome sequences and our understanding of the normal and abnormal function of proteins and pathways. Enhanced activity in this area is likely to lead to better understanding of how discrete changes in molecular patterns and conformation underlie functional changes in protein complexes and, with it, sensitivity of an individual to an exposure. The National Institute of Environmental Health Sciences convened a workshop of experts in structural determination and environmental health to solicit advice for future research in structural resolution relative to environmentally responsive proteins and pathways. The highest priorities recommended by the workshop were to support studies of structure, analysis, control, and design of conformational and functional states at molecular resolution for environmentally responsive molecules and complexes; promote understanding of dynamics, kinetics, and ligand responses; investigate the mechanisms and steps in posttranslational modifications, protein partnering, impact of genetic polymorphisms on structure/function, and ligand interactions; and encourage integrated experimental and computational approaches. The workshop participants also saw value in improving the throughput and purity of protein samples and macromolecular assemblies; developing optimal processes for design, production, and assembly of macromolecular complexes; encouraging studies on protein–protein and macromolecular interactions; and examining assemblies of individual proteins and their functions in pathways of interest for environmental health. PMID:16263521
Protein-directed assembly of arbitrary three-dimensional nanoporous silica architectures.
Khripin, Constantine Y; Pristinski, Denis; Dunphy, Darren R; Brinker, C Jeffrey; Kaehr, Bryan
2011-02-22
Through precise control of nanoscale building blocks, such as proteins and polyamines, silica condensing microorganisms are able to create intricate mineral structures displaying hierarchical features from nano- to millimeter-length scales. The creation of artificial structures of similar characteristics is facilitated through biomimetic approaches, for instance, by first creating a bioscaffold comprised of silica condensing moieties which, in turn, govern silica deposition into three-dimensional (3D) structures. In this work, we demonstrate a protein-directed approach to template silica into true arbitrary 3D architectures by employing cross-linked protein hydrogels to controllably direct silica condensation. Protein hydrogels are fabricated using multiphoton lithography, which enables user-defined control over template features in three dimensions. Silica deposition, under acidic conditions, proceeds throughout protein hydrogel templates via flocculation of silica nanoparticles by protein molecules, as indicated by dynamic light scattering (DLS) and time-dependent measurements of elastic modulus. Following silica deposition, the protein template can be removed using mild thermal processing yielding high surface area (625 m²/g) porous silica replicas that do not undergo significant volume change compared to the starting template. We demonstrate the capabilities of this approach to create bioinspired silica microstructures displaying hierarchical features over broad length scales and the infiltration/functionalization capabilities of the nanoporous silica matrix by laser printing a 3D gold image within a 3D silica matrix. This work provides a foundation to potentially understand and mimic biogenic silica condensation under the constraints of user-defined biotemplates and further should enable a wide range of complex inorganic architectures to be explored using silica transformational chemistries, for instance silica to silicon, as demonstrated herein.
Hati, Sanchita; Bhattacharyya, Sudeep
2016-01-01
A project-based biophysical chemistry laboratory course, which is offered to the biochemistry and molecular biology majors in their senior year, is described. In this course, the classroom study of the structure-function of biomolecules is integrated with the discovery-guided laboratory study of these molecules using computer modeling and simulations. In particular, modern computational tools are employed to elucidate the relationship between structure, dynamics, and function in proteins. Computer-based laboratory protocols that we introduced in three modules allow students to visualize the secondary, super-secondary, and tertiary structures of proteins, analyze non-covalent interactions in protein-ligand complexes, develop three-dimensional structural models (homology models) for new protein sequences and evaluate their structural qualities, and study proteins' intrinsic dynamics to understand their functions. In the fourth module, students are assigned to an authentic research problem, where they apply their laboratory skills (acquired in modules 1-3) to answer conceptual biophysical questions. Through this process, students gain in-depth understanding of protein dynamics, the missing link between structure and function. Additionally, the requirement of term papers sharpens students' writing and communication skills. Finally, these projects result in new findings that are communicated in peer-reviewed journals. © 2016 The International Union of Biochemistry and Molecular Biology.
Patent protection for structural genomics-related inventions.
Vinarov, Sara D
2003-01-01
Recently there have been some important developments with respect to the patentability of inventions in the field of structural genomics. The leaders of the European Patent Office (EPO), Japan Patent Office (JPO) and the United States Patent Office (USPTO) came together for a trilateral meeting to conduct a comparative study on protein 3-dimensional (3-D) structure related claims in an effort to come to a mutual understanding about the examination of such inventions. The three patent offices were presented with eight different cases: 1) 3-D structural data of a protein per se; 2) computer-readable storage medium encoded with structural data of a protein; 3) protein defined by its tertiary structure; 4) crystals of known proteins; 5) binding pockets and protein domains; 6) and 7) are both directed to in silico screening methods directed to a specific protein; and 8) pharmacophores. The preliminary conclusions reached at the trilateral meeting provide clarity regarding the types of inventions that may be patentable given a specific set of scientific facts in a patent application. Therefore, the guidance provided by this study will help inventors, attorneys and other patent practitioners who file for patent protection on structural genomics-based inventions both here and abroad comply with the patentability requirements of each office.
Zebrafish cardiac muscle thick filaments: isolation technique and three-dimensional structure.
González-Solá, Maryví; Al-Khayat, Hind A; Behra, Martine; Kensler, Robert W
2014-04-15
To understand how mutations in thick filament proteins, such as cardiac myosin binding protein-C or titin, cause familial hypertrophic cardiomyopathies, it is important to determine the structure of the cardiac thick filament. Techniques for the genetic manipulation of the zebrafish are well established and it has become a major model for the study of the cardiovascular system. Our goal is to develop zebrafish as an alternative system to the mammalian heart model for the study of the structure of the cardiac thick filaments and the proteins that form it. We have successfully isolated thick filaments from zebrafish cardiac muscle, using a procedure similar to those for mammalian heart, and analyzed their structure by negative-staining and electron microscopy. The isolated filaments appear well ordered with the characteristic 42.9 nm quasi-helical repeat of the myosin heads expected from x-ray diffraction. We have performed single particle image analysis on the collected electron microscopy images for the C-zone region of these filaments and obtained a three-dimensional reconstruction at 3.5 nm resolution. This reconstruction reveals structure similar to the mammalian thick filament, and demonstrates that zebrafish may provide a useful model for the study of the changes in the cardiac thick filament associated with disease processes. Copyright © 2014 Biophysical Society. Published by Elsevier Inc. All rights reserved.
D'auria, S; Barone, R; Rossi, M; Nucci, R; Barone, G; Fessas, D; Bertoli, E; Tanfani, F
1997-01-01
The effects of temperature and SDS on the three-dimensional organization and secondary structure of beta-glycosidase from the thermophilic archaeon Sulfolobus solfataricus were investigated by CD, IR spectroscopy and differential scanning calorimetry. CD spectra in the near UV region showed that the detergent caused a remarkable change in the protein tertiary structure, and far-UV CD analysis revealed only a slight effect on secondary structure. Infrared spectroscopy showed that low concentrations of the detergent (up to 0.02%) induced slight changes in the enzyme secondary structure, whereas high concentrations caused the alpha-helix content to increase at high temperatures and prevented protein aggregation. PMID:9169619
High-throughput Cloning and Expression of Integral Membrane Proteins in Escherichia coli
Bruni, Renato
2014-01-01
Recently, several structural genomics centers have been established and a remarkable number of three-dimensional structures of soluble proteins have been solved. For membrane proteins, the number of structures solved has been significantly trailing those for their soluble counterparts, not least because over-expression and purification of membrane proteins is a much more arduous process. By using high throughput technologies, a large number of membrane protein targets can be screened simultaneously and a greater number of expression and purification conditions can be employed, leading to a higher probability of successfully determining the structure of membrane proteins. This unit describes the cloning, expression and screening of membrane proteins using high throughput methodologies developed in our laboratory. Basic Protocol 1 deals with the cloning of inserts into expression vectors by ligation-independent cloning. Basic Protocol 2 describes the expression and purification of the target proteins on a miniscale. Lastly, for the targets that express at the miniscale, basic protocols 3 and 4 outline the methods employed for the expression and purification of targets at the midi-scale, as well as a procedure for detergent screening and identification of detergent(s) in which the target protein is stable. PMID:24510647
Bayesian peak picking for NMR spectra.
Cheng, Yichen; Gao, Xin; Liang, Faming
2014-02-01
Protein structure determination is a very important topic in structural genomics, which helps people to understand a variety of biological functions such as protein-protein interactions, protein-DNA interactions and so on. Nowadays, nuclear magnetic resonance (NMR) is often used to determine the three-dimensional structures of proteins in vivo. This study aims to automate the peak picking step, the most important and tricky step in NMR structure determination. We propose to model the NMR spectrum by a mixture of bivariate Gaussian densities and use the stochastic approximation Monte Carlo algorithm as the computational tool to solve the problem. Under the Bayesian framework, the peak picking problem is cast as a variable selection problem. The proposed method can automatically distinguish true peaks from false ones without preprocessing the data. To the best of our knowledge, this is the first effort in the literature that tackles the peak picking problem for NMR spectrum data using a Bayesian method. Copyright © 2013. Production and hosting by Elsevier Ltd.
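The modeling idea in this abstract, a spectrum written as a mixture of bivariate Gaussian densities, can be sketched as a forward model. The code below is only an illustration: it assumes axis-aligned (uncorrelated) Gaussians for brevity, the `Peak` class and the peak values are hypothetical, and the stochastic approximation Monte Carlo sampling and Bayesian variable selection steps are not shown.

```python
import math
from dataclasses import dataclass


@dataclass
class Peak:
    amplitude: float  # mixture weight (peak volume)
    cx: float         # peak centre along the first axis (e.g. ppm)
    cy: float         # peak centre along the second axis (e.g. ppm)
    sx: float         # standard deviation along the first axis
    sy: float         # standard deviation along the second axis


def peak_density(peak: Peak, x: float, y: float) -> float:
    """Bivariate Gaussian density with a diagonal covariance matrix."""
    zx = (x - peak.cx) / peak.sx
    zy = (y - peak.cy) / peak.sy
    norm = 1.0 / (2.0 * math.pi * peak.sx * peak.sy)
    return norm * math.exp(-0.5 * (zx * zx + zy * zy))


def spectrum(peaks: list[Peak], x: float, y: float) -> float:
    """Model intensity at (x, y) as the weighted sum of all peak densities."""
    return sum(p.amplitude * peak_density(p, x, y) for p in peaks)


# Two hypothetical, well-separated peaks.
example_peaks = [
    Peak(amplitude=2.0, cx=8.1, cy=120.0, sx=0.02, sy=0.3),
    Peak(amplitude=1.0, cx=7.6, cy=115.0, sx=0.02, sy=0.3),
]
print(spectrum(example_peaks, 8.1, 120.0))
```

Peak picking then amounts to deciding which candidate components belong in the mixture, which is the variable selection problem the abstract describes.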
Template-based protein structure modeling using the RaptorX web server.
Källberg, Morten; Wang, Haipeng; Wang, Sheng; Peng, Jian; Wang, Zhiyong; Lu, Hui; Xu, Jinbo
2012-07-19
A key challenge of modern biology is to uncover the functional role of the protein entities that compose cellular proteomes. To this end, the availability of reliable three-dimensional atomic models of proteins is often crucial. This protocol presents a community-wide web-based method using RaptorX (http://raptorx.uchicago.edu/) for protein secondary structure prediction, template-based tertiary structure modeling, alignment quality assessment and sophisticated probabilistic alignment sampling. RaptorX distinguishes itself from other servers by the quality of the alignment between a target sequence and one or multiple distantly related template proteins (especially those with sparse sequence profiles) and by a novel nonlinear scoring function and a probabilistic-consistency algorithm. Consequently, RaptorX delivers high-quality structural models for many targets with only remote templates. At present, it takes RaptorX ~35 min to finish processing a sequence of 200 amino acids. Since its official release in August 2011, RaptorX has processed ~6,000 sequences submitted by ~1,600 users from around the world.
Template-based protein structure modeling using the RaptorX web server
Källberg, Morten; Wang, Haipeng; Wang, Sheng; Peng, Jian; Wang, Zhiyong; Lu, Hui; Xu, Jinbo
2016-01-01
A key challenge of modern biology is to uncover the functional role of the protein entities that compose cellular proteomes. To this end, the availability of reliable three-dimensional atomic models of proteins is often crucial. This protocol presents a community-wide web-based method using RaptorX (http://raptorx.uchicago.edu/) for protein secondary structure prediction, template-based tertiary structure modeling, alignment quality assessment and sophisticated probabilistic alignment sampling. RaptorX distinguishes itself from other servers by the quality of the alignment between a target sequence and one or multiple distantly related template proteins (especially those with sparse sequence profiles) and by a novel nonlinear scoring function and a probabilistic-consistency algorithm. Consequently, RaptorX delivers high-quality structural models for many targets with only remote templates. At present, it takes RaptorX ~35 min to finish processing a sequence of 200 amino acids. Since its official release in August 2011, RaptorX has processed ~6,000 sequences submitted by ~1,600 users from around the world. PMID:22814390
Li, Congmin; Lim, Sunghyuk; Braunewell, Karl H; Ames, James B
2016-01-01
Visinin-like protein 3 (VILIP-3) belongs to a family of Ca2+-myristoyl switch proteins that regulate signal transduction in the brain and retina. Here we analyze Ca2+ binding, characterize Ca2+-induced conformational changes, and determine the NMR structure of myristoylated VILIP-3. Three Ca2+ bind cooperatively to VILIP-3 at EF2, EF3 and EF4 (KD = 0.52 μM and Hill slope of 1.8). NMR assignments, mutagenesis and structural analysis indicate that the covalently attached myristoyl group is solvent exposed in Ca2+-bound VILIP-3, whereas Ca2+-free VILIP-3 contains a sequestered myristoyl group that interacts with protein residues (E26, Y64, V68), which are distinct from myristate contacts seen in other Ca2+-myristoyl switch proteins. The myristoyl group in VILIP-3 forms an unusual L-shaped structure that places the C14 methyl group inside a shallow protein groove, in contrast to the much deeper myristoyl binding pockets observed for recoverin, NCS-1 and GCAP1. Thus, the myristoylated VILIP-3 protein structure determined in this study is quite different from those of other known myristoyl switch proteins (recoverin, NCS-1, and GCAP1). We propose that myristoylation serves to fine tune the three-dimensional structures of neuronal calcium sensor proteins as a means of generating functional diversity.
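The cooperative binding parameters reported here can be read through the Hill equation, in which fractional saturation is [Ca]^n / (K^n + [Ca]^n). The sketch below assumes the reported KD of 0.52 μM acts as the half-saturation concentration and uses the reported Hill slope of 1.8; the function name and the sampled concentrations are hypothetical.

```python
def hill_fraction(ca_um: float, k_half_um: float = 0.52, n: float = 1.8) -> float:
    """Fractional saturation from the Hill equation (concentrations in μM).

    Assumes the reported KD is the half-saturation concentration.
    """
    if ca_um < 0:
        raise ValueError("concentration must be non-negative")
    if ca_um == 0:
        return 0.0
    return ca_um ** n / (k_half_um ** n + ca_um ** n)


# Sample the curve over a range of free Ca2+ concentrations.
for ca in (0.1, 0.52, 1.0, 5.0):
    print(f"[Ca2+] = {ca:4.2f} uM -> saturation = {hill_fraction(ca):.2f}")
```

A Hill slope above 1 makes the transition around the half-saturation point steeper than for independent sites, which is what "bind cooperatively" means in practice.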
Structural Integrity of Proteins under Applied Bias during Solid-State Nanopore Translocation
NASA Astrophysics Data System (ADS)
Hasan, Mohammad R.; Khanzada, Raja Raheel; Mahmood, Mohammed A. I.; Ashfaq, Adnan; Iqbal, Samir M.
2015-03-01
The translocation behavior of proteins through solid-state nanopores can be used as a new way to detect and identify proteins. The ionic current that flows through a nanopore under applied bias is perturbed when a biomolecule traverses the nanopore. It is important for a protein detection scheme to know of any changes in the three-dimensional structure of the molecule during the process. Here we report data on the structural integrity of a protein during translocation through a nanopore under different applied biases. Nanoscale Molecular Dynamics was used to establish a framework to study the changes in protein structure as the protein travelled across the nanopore. The analysis revealed the contributions of structural changes of the protein to its ionic current signature. As a model, the crystal structure of the thrombin protein was imported and positioned inside a 6 nm diameter pore in a 6 nm thick silicon nitride membrane. The protein was solvated in 1 M KCl at 295 K and the system was equilibrated for 20 ns to attain its minimum energy state. The simulation was performed at different electric fields from 0 to 1 kcal/(mol·Å·e). RMSD, radial distribution function, movement of the center of mass and velocity of the protein were calculated. The results showed linear increments in the velocity and perturbations in the ionic current profile with increasing electric potential. Support acknowledged from NSF through ECCS-1201878.
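The field unit used in this abstract, kcal/(mol·Å·e), can be converted to volts per ångström with standard constants. The sketch below does that arithmetic and estimates the bias across the 6 nm membrane; it assumes a uniform field over the membrane thickness and the thermochemical calorie, and the function names are hypothetical.

```python
AVOGADRO = 6.02214076e23             # 1/mol (exact SI value)
ELEMENTARY_CHARGE = 1.602176634e-19  # C (exact SI value)
JOULES_PER_KCAL = 4184.0             # thermochemical kilocalorie


def field_to_volts_per_angstrom(field_kcal_mol_a_e: float) -> float:
    """Convert a field in kcal/(mol·Å·e) to V/Å."""
    return field_kcal_mol_a_e * JOULES_PER_KCAL / (AVOGADRO * ELEMENTARY_CHARGE)


def bias_across_membrane(field_kcal_mol_a_e: float, thickness_nm: float) -> float:
    """Potential drop in volts, assuming a uniform field across the membrane."""
    thickness_angstrom = thickness_nm * 10.0
    return field_to_volts_per_angstrom(field_kcal_mol_a_e) * thickness_angstrom


# Upper end of the field range stated in the abstract, over the 6 nm membrane.
print(field_to_volts_per_angstrom(1.0))
print(bias_across_membrane(1.0, 6.0))
```

Under these assumptions the largest field in the stated range corresponds to roughly 0.043 V/Å, or about 2.6 V across the membrane.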
Hu, Ben; Kuang, Zheng-Kun; Feng, Shi-Yu; Wang, Dong; He, Song-Bing; Kong, De-Xin
2016-11-17
The crystallized ligands in the Protein Data Bank (PDB) can be treated as the inverse shapes of the active sites of corresponding proteins. Therefore, the shape similarity between a molecule and PDB ligands indicates the possibility of the molecule binding to the targets. In this paper, we propose a shape similarity profile that can be used as a molecular descriptor for ligand-based virtual screening. First, through three-dimensional (3D) structural clustering, 300 diverse ligands were extracted from the druggable protein-ligand database, sc-PDB. Then, each of the molecules under scrutiny was flexibly superimposed onto the 300 ligands. Superimpositions were scored by shape overlap and property similarity, producing a 300-dimensional similarity array termed the "Three-Dimensional Biologically Relevant Spectrum (BRS-3D)". Finally, quantitative or discriminant models were developed with the 300-dimensional descriptor using machine learning methods (support vector machine). The effectiveness of this approach was evaluated using 42 benchmark data sets from the G protein-coupled receptor (GPCR) ligand library and the GPCR decoy database (GLL/GDD). We compared the performance of BRS-3D with other 2D and 3D state-of-the-art molecular descriptors. The results showed that models built with BRS-3D performed best for most GLL/GDD data sets. We also applied BRS-3D in histone deacetylase 1 inhibitors screening and GPCR subtype selectivity prediction. The advantages and disadvantages of this approach are discussed.
Membrane-spanning α-helical barrels as tractable protein-design targets.
Niitsu, Ai; Heal, Jack W; Fauland, Kerstin; Thomson, Andrew R; Woolfson, Derek N
2017-08-05
The rational (de novo) design of membrane-spanning proteins lags behind that for water-soluble globular proteins. This is due to gaps in our knowledge of membrane-protein structure, and experimental difficulties in studying such proteins compared to water-soluble counterparts. One limiting factor is the small number of experimentally determined three-dimensional structures for transmembrane proteins. By contrast, many tens of thousands of globular protein structures provide a rich source of 'scaffolds' for protein design, and the means to garner sequence-to-structure relationships to guide the design process. The α-helical coiled coil is a protein-structure element found in both globular and membrane proteins, where it cements a variety of helix-helix interactions and helical bundles. Our deep understanding of coiled coils has enabled a large number of successful de novo designs. For one class, the α-helical barrels (that is, symmetric bundles of five or more helices with central accessible channels), there are both water-soluble and membrane-spanning examples. Recent computational designs of water-soluble α-helical barrels with five to seven helices have advanced the design field considerably. Here we identify and classify analogous and more complicated membrane-spanning α-helical barrels from the Protein Data Bank. These provide tantalizing but tractable targets for protein engineering and de novo protein design. This article is part of the themed issue 'Membrane pores: from structure and assembly, to medicine and technology'. © 2017 The Author(s).
Structural modeling of G-protein coupled receptors: An overview on automatic web-servers.
Busato, Mirko; Giorgetti, Alejandro
2016-08-01
Despite the significant efforts and discoveries during the last few years in G protein-coupled receptor (GPCR) expression and crystallization, the receptors with known structures to date are limited to a small fraction of human GPCRs. The lack of experimental three-dimensional structures of the receptors represents a strong limitation that hampers a deep understanding of their function. Computational techniques are thus a valid alternative strategy to model three-dimensional structures. Indeed, recent advances in the field, together with extraordinary developments in crystallography, in particular due to its ability to capture GPCRs in different activation states, have led to encouraging results in the generation of accurate models. This prompted the community of modelers to render their methods publicly available through dedicated databases and web-servers. Here, we present an extensive overview of these services, focusing on their advantages, drawbacks and their role in successful applications. Future challenges in the field of GPCR modeling, such as the prediction of long loop regions and the modeling of receptor activation states, are presented as well. Copyright © 2016 Elsevier Ltd. All rights reserved.
Exploration of the relationship between topology and designability of conformations
NASA Astrophysics Data System (ADS)
Leelananda, Sumudu P.; Towfic, Fadi; Jernigan, Robert L.; Kloczkowski, Andrzej
2011-06-01
Protein structures are evolutionarily more conserved than sequences, and sequences with very low sequence identity frequently share the same fold. This leads to the concept of protein designability: some folds are more designable than others, in that many sequences can assume them. Elucidating the relationship between protein sequence and the three-dimensional (3D) structure that the sequence folds into is an important problem in computational structural biology. Lattice models have been utilized in numerous studies to model protein folds and predict the designability of certain folds. In this study, all possible compact conformations within a set of two-dimensional and 3D lattice spaces are explored. Complementary interaction graphs are then generated for each conformation and are described using a set of graph features. The full HP sequence space for each lattice model is generated and contact energies are calculated by threading each sequence onto all the possible conformations. The unique conformation giving the minimum energy is identified for each sequence, and the number of sequences folding to each conformation (its designability) is obtained. Machine learning algorithms are used to predict the designability of each conformation. We find that highly designable structures can be distinguished from non-designable conformations based on certain graphical geometric features of the interactions. This finding confirms that the topology of a conformation is an important determinant of the extent of its designability and suggests that the interactions themselves are important for determining the designability.
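The enumeration described in this abstract can be illustrated with a minimal sketch. It is not the authors' code. It assumes a 3 x 3 square lattice, an energy of -1 per non-bonded H-H contact, and that conformations sharing the same contact map are treated as one; the lattice size, the energy function, and all function names are illustrative choices.

```python
from itertools import product


def compact_paths(n):
    """Enumerate all directed self-avoiding walks that fill an n x n lattice."""
    paths = []

    def extend(path, visited):
        if len(path) == n * n:
            paths.append(tuple(path))
            return
        x, y = path[-1]
        for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nxt = (x + dx, y + dy)
            if 0 <= nxt[0] < n and 0 <= nxt[1] < n and nxt not in visited:
                path.append(nxt)
                visited.add(nxt)
                extend(path, visited)
                path.pop()
                visited.remove(nxt)

    for start in product(range(n), repeat=2):
        extend([start], {start})
    return paths


def contact_map(path):
    """Pairs (i, j) of residues adjacent on the lattice but not along the chain."""
    index = {site: i for i, site in enumerate(path)}
    contacts = set()
    for (x, y), i in index.items():
        for neighbour in ((x + 1, y), (x, y + 1)):
            j = index.get(neighbour)
            if j is not None and abs(i - j) > 1:
                contacts.add((min(i, j), max(i, j)))
    return frozenset(contacts)


def designability(n):
    """Count, per distinct contact map, the HP sequences with that unique ground state."""
    maps = {contact_map(p) for p in compact_paths(n)}
    counts = {m: 0 for m in maps}
    for seq in product("HP", repeat=n * n):
        # Assumed energy model: -1 for every H-H contact, 0 otherwise.
        energies = {
            m: -sum(seq[i] == "H" and seq[j] == "H" for i, j in m) for m in maps
        }
        best = min(energies.values())
        winners = [m for m, e in energies.items() if e == best]
        if len(winners) == 1:  # sequences with degenerate ground states are skipped
            counts[winners[0]] += 1
    return counts


if __name__ == "__main__":
    for cmap, n_seq in sorted(designability(3).items(), key=lambda kv: -kv[1]):
        print(sorted(cmap), n_seq)
```

Grouping by contact map removes rotated and mirrored copies of the same fold, which would otherwise make every ground state look degenerate.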
An Evolution-Based Approach to De Novo Protein Design and Case Study on Mycobacterium tuberculosis
Brender, Jeffrey R.; Czajka, Jeff; Marsh, David; Gray, Felicia; Cierpicki, Tomasz; Zhang, Yang
2013-01-01
Computational protein design is a reverse procedure of protein folding and structure prediction, where constructing structures from evolutionarily related proteins has been demonstrated to be the most reliable method for protein 3-dimensional structure prediction. Following this spirit, we developed a novel method to design new protein sequences based on evolutionarily related protein families. For a given target structure, a set of proteins having similar fold are identified from the PDB library by structural alignments. A structural profile is then constructed from the protein templates and used to guide the conformational search of amino acid sequence space, where physicochemical packing is accommodated by single-sequence based solvation, torsion angle, and secondary structure predictions. The method was tested on a computational folding experiment based on a large set of 87 protein structures covering different fold classes, which showed that the evolution-based design significantly enhances the foldability and biological functionality of the designed sequences compared to the traditional physics-based force field methods. Without using homologous proteins, the designed sequences can be folded with an average root-mean-square-deviation of 2.1 Å to the target. As a case study, the method is extended to redesign all 243 structurally resolved proteins in the pathogenic bacteria Mycobacterium tuberculosis, which is the second leading cause of death from infectious disease. On a smaller scale, five sequences were randomly selected from the design pool and subjected to experimental validation. The results showed that all the designed proteins are soluble with distinct secondary structure and three have well ordered tertiary structure, as demonstrated by circular dichroism and NMR spectroscopy. 
Together, these results demonstrate a new avenue in computational protein design that uses knowledge of evolutionary conservation from protein structural families to engineer new protein molecules of improved fold stability and biological functionality. PMID:24204234
Three-dimensional structural analysis of eukaryotic flagella/cilia by electron cryo-tomography
Bui, Khanh Huy; Pigino, Gaia; Ishikawa, Takashi
2011-01-01
Electron cryo-tomography is a potential approach to analyzing the three-dimensional conformation of frozen hydrated biological macromolecules using electron microscopy. Since projections of each individual object illuminated from different orientations are merged, electron tomography is capable of structural analysis of such heterogeneous environments as in vivo or with polymorphism, although radiation damage and the missing wedge are severe problems. Here, recent results on the structure of eukaryotic flagella, which is an ATP-driven bending organelle, from green algae Chlamydomonas are presented. Tomographic analysis reveals asymmetric molecular arrangements, especially that of the dynein motor proteins, in flagella, giving insight into the mechanism of planar asymmetric bending motion. Methodological challenges to obtaining higher-resolution structures from this technique are also discussed. PMID:21169680
Membrane Topology and Insertion of Membrane Proteins: Search for Topogenic Signals
van Geest, Marleen; Lolkema, Juke S.
2000-01-01
Integral membrane proteins are found in all cellular membranes and carry out many of the functions that are essential to life. The membrane-embedded domains of integral membrane proteins are structurally quite simple, allowing the use of various prediction methods and biochemical methods to obtain structural information about membrane proteins. A critical step in the biosynthetic pathway leading to the folded protein in the membrane is its insertion into the lipid bilayer. Understanding of the fundamentals of the insertion and folding processes will significantly improve the methods used to predict the three-dimensional membrane protein structure from the amino acid sequence. In the first part of this review, biochemical approaches to elucidate membrane protein topology are reviewed and evaluated, and in the second part, the use of similar techniques to study membrane protein insertion is discussed. The latter studies search for signals in the polypeptide chain that direct the insertion process. Knowledge of the topogenic signals in the nascent chain of a membrane protein is essential for the evaluation of membrane topology studies. PMID:10704472
PACSY, a relational database management system for protein structure and chemical shift analysis.
Lee, Woonghee; Yu, Wookyung; Kim, Suhkmann; Chang, Iksoo; Lee, Weontae; Markley, John L
2012-10-01
PACSY (Protein structure And Chemical Shift NMR spectroscopY) is a relational database management system that integrates information from the Protein Data Bank, the Biological Magnetic Resonance Data Bank, and the Structural Classification of Proteins database. PACSY provides three-dimensional coordinates and chemical shifts of atoms along with derived information such as torsion angles, solvent accessible surface areas, and hydrophobicity scales. PACSY consists of six relational table types linked to one another for coherence by key identification numbers. Database queries are enabled by advanced search functions supported by an RDBMS server such as MySQL or PostgreSQL. PACSY enables users to search for combinations of information from different database sources in support of their research. Two software packages, PACSY Maker for database creation and PACSY Analyzer for database analysis, are available from http://pacsy.nmrfam.wisc.edu.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bae, Euiyoung; Bingman, Craig A.; Aceti, David J.
LOC79017 (MW 21.0 kDa, residues 1-188) was annotated as a hypothetical protein encoded by Homo sapiens chromosome 7 open reading frame 24. It was selected as a target by the Center for Eukaryotic Structural Genomics (CESG) because it did not share more than 30% sequence identity with any protein for which the three-dimensional structure is known. The biological function of the protein has not been established yet. Parts of LOC79017 were identified as members of uncharacterized Pfam families (residues 1-95 as PB006073 and residues 104-180 as PB031696). BLAST searches revealed homologues of LOC79017 in many eukaryotes, but none of them have been functionally characterized. Here, we report the crystal structure of H. sapiens protein LOC79017 (UniGene code Hs.530024, UniProt code O75223, CESG target number go.35223).
Crystal structure of YHI9, the yeast member of the phenazine biosynthesis PhzF enzyme superfamily.
Liger, Dominique; Quevillon-Cheruel, Sophie; Sorel, Isabelle; Bremang, Michael; Blondeau, Karine; Aboulfath, Ilham; Janin, Joël; van Tilbeurgh, Herman; Leulliot, Nicolas
2005-09-01
In the Pseudomonas bacterial genomes, the PhzF proteins are involved in the production of phenazine derivative antibiotic and antifungal compounds. The PhzF superfamily however also encompasses proteins in all genomes from bacteria to eukaryotes, for which no function has been assigned. We have determined the three-dimensional crystal structure at 2.05 Å resolution of YHI9, the yeast member of the PhzF family. YHI9 has a fold similar to bacterial diaminopimelate epimerase, revealing a bimodular structure with an internal symmetry. Residue conservation identifies a putative active site at the interface between the two domains. Evolution of this protein by gene duplication, gene fusion and domain swapping from an ancestral gene containing the "hot dog" fold, identifies the protein as a "kinked double hot dog" fold. Copyright 2005 Wiley-Liss, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Metrick, Claire M.; Heldwein, Ekaterina E.; Sandri-Goldin, R. M.
Proteins forming the tegument layers of herpesviral virions mediate many essential processes in the viral replication cycle, yet few have been characterized in detail. UL21 is one such multifunctional tegument protein and is conserved among alphaherpesviruses. While UL21 has been implicated in many processes in viral replication, ranging from nuclear egress to virion morphogenesis to cell-cell spread, its precise roles remain unclear. Here we report the 2.7-Å crystal structure of the C-terminal domain of herpes simplex virus 1 (HSV-1) UL21 (UL21C), which has a unique α-helical fold resembling a dragonfly. Analysis of evolutionary conservation patterns and surface electrostatics pinpointed four regions of potential functional importance on the surface of UL21C to be pursued by mutagenesis. In combination with the previously determined structure of the N-terminal domain of UL21, the structure of UL21C provides a 3-dimensional framework for targeted exploration of the multiple roles of UL21 in the replication and pathogenesis of alphaherpesviruses. Additionally, we describe an unanticipated ability of UL21 to bind RNA, which may hint at a yet unexplored function. IMPORTANCE: Due to the limited genomic coding capacity of viruses, viral proteins are often multifunctional, which makes them attractive antiviral targets. Such multifunctionality, however, complicates their study, which often involves constructing and characterizing null mutant viruses. Systematic exploration of these multifunctional proteins requires detailed road maps in the form of 3-dimensional structures. In this work, we determined the crystal structure of the C-terminal domain of UL21, a multifunctional tegument protein that is conserved among alphaherpesviruses. Structural analysis pinpointed surface areas of potential functional importance that provide a starting point for mutagenesis. In addition, the unexpected RNA-binding ability of UL21 may expand its functional repertoire. 
The structure of UL21C and the observation of its RNA-binding ability are the latest additions to the navigational chart that can guide the exploration of the multiple functions of UL21.
Symmetry based assembly of a 2 dimensional protein lattice
DOE Office of Scientific and Technical Information (OSTI.GOV)
Poulos, Sandra; Agah, Sayeh; Jallah, Nikardi
2017-04-18
The design of proteins that self-assemble into higher order architectures is of great interest due to their potential application in nanotechnology. Specifically, the self-assembly of proteins into ordered lattices is of special interest to the field of structural biology. Here we designed a 2-dimensional (2D) protein lattice using a fusion of a tandem repeat of three TelSAM domains (TTT) to the Ferric uptake regulator (FUR) domain. We determined the structure of the designed (TTT-FUR) fusion protein to 2.3 Å by X-ray crystallographic methods. In agreement with the design, a 2D lattice composed of TelSAM fibers interdigitated by the FUR domain was observed. As expected, the fusion of a tandem repeat of three TelSAM domains formed a 2₁ screw axis, and the self-assembly of the ordered oligomer was under pH control. We demonstrated that the fusion of TTT to a domain having a 2-fold symmetry, such as the FUR domain, can produce an ordered 2D lattice. The TTT-FUR system combines features from the rotational symmetry matching approach with the oligomer driven crystallization method. This TTT-FUR fusion was amenable to X-ray crystallographic methods, and is a promising crystallization chaperone.
Fully Mechanically Controlled Automated Electron Microscopic Tomography
Liu, Jinxin; Li, Hongchang; Zhang, Lei; ...
2016-07-11
Knowledge of the three-dimensional (3D) structures of individual particles of asymmetric and flexible proteins is essential for understanding those proteins' functions, but their structures are difficult to determine. Electron tomography (ET) provides a tool for imaging a single and unique biological object from a series of tilted angles, but it is challenging to image a single protein for 3D reconstruction due to the imperfect mechanical control capability of the specimen goniometer under both a medium to high magnification (approximately 50,000-160,000×) and an optimized beam coherence condition. Here, we report a fully mechanical control method for automating ET data acquisition without using beam tilt/shift processes. This method could reduce the accumulation of beam tilt/shift, which is otherwise used to compensate for errors in the mechanical control but degrades the beam coherence. Our method was developed by minimizing the error of the target object center during the tilting process through a closed-loop proportional-integral (PI) control algorithm. Validation by both negative staining (NS) and cryo-electron microscopy (cryo-EM) suggests that this method has a comparable capability to other ET methods in tracking target proteins while maintaining optimized beam coherence conditions for imaging.
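The closed-loop proportional-integral (PI) idea mentioned in this abstract can be sketched in a few lines. This is a toy one-dimensional simulation under stated assumptions, not the authors' implementation: the constant drift per tilt step, the gains kp and ki, and the function name are all hypothetical.

```python
def track_with_pi(drift_per_tilt, n_tilts, kp=0.5, ki=0.2):
    """Simulate keeping a target centered while a stage drifts at every tilt step.

    drift_per_tilt: assumed constant mechanical drift added per step (arbitrary units).
    kp, ki: illustrative proportional and integral gains, not values from the paper.
    Returns the target offset from the image center after each step.
    """
    offset = 0.0    # current displacement of the target from the image center
    integral = 0.0  # running sum of past errors (the integral term)
    history = []
    for _ in range(n_tilts):
        error = offset                           # measured displacement this step
        integral += error
        correction = kp * error + ki * integral  # PI control law
        offset = offset + drift_per_tilt - correction
        history.append(offset)
    return history


if __name__ == "__main__":
    controlled = track_with_pi(5.0, 60)
    uncontrolled = track_with_pi(5.0, 60, kp=0.0, ki=0.0)
    print(f"final offset with PI control: {controlled[-1]:.2e}")
    print(f"final offset without control: {uncontrolled[-1]:.1f}")
```

The integral term is what removes the steady-state offset: a purely proportional correction would settle at a constant non-zero displacement under constant drift.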
Structural Genomics and Drug Discovery for Infectious Diseases
DOE Office of Scientific and Technical Information (OSTI.GOV)
Anderson, W.F.
The application of structural genomics methods and approaches to proteins from organisms causing infectious diseases is making available the three dimensional structures of many proteins that are potential drug targets and laying the groundwork for structure aided drug discovery efforts. There are a number of structural genomics projects with a focus on pathogens that have been initiated worldwide. The Center for Structural Genomics of Infectious Diseases (CSGID) was recently established to apply state-of-the-art high throughput structural biology technologies to the characterization of proteins from the National Institute for Allergy and Infectious Diseases (NIAID) category A-C pathogens and organisms causing emerging or re-emerging infectious diseases. The target selection process emphasizes potential biomedical benefits. Selected proteins include known drug targets and their homologs, essential enzymes, virulence factors and vaccine candidates. The Center also provides a structure determination service for the infectious disease scientific community. The ultimate goal is to generate a library of structures that are available to the scientific community and can serve as a starting point for further research and structure aided drug discovery for infectious diseases. To achieve this goal, the CSGID will determine protein crystal structures of 400 proteins and protein-ligand complexes using proven, rapid, highly integrated, and cost-effective methods for such determination, primarily by X-ray crystallography. High throughput crystallographic structure determination is greatly aided by frequent, convenient access to high-performance beamlines at third-generation synchrotron X-ray sources.
A three-dimensional movie of structural changes in bacteriorhodopsin.
Nango, Eriko; Royant, Antoine; Kubo, Minoru; Nakane, Takanori; Wickstrand, Cecilia; Kimura, Tetsunari; Tanaka, Tomoyuki; Tono, Kensuke; Song, Changyong; Tanaka, Rie; Arima, Toshi; Yamashita, Ayumi; Kobayashi, Jun; Hosaka, Toshiaki; Mizohata, Eiichi; Nogly, Przemyslaw; Sugahara, Michihiro; Nam, Daewoong; Nomura, Takashi; Shimamura, Tatsuro; Im, Dohyun; Fujiwara, Takaaki; Yamanaka, Yasuaki; Jeon, Byeonghyun; Nishizawa, Tomohiro; Oda, Kazumasa; Fukuda, Masahiro; Andersson, Rebecka; Båth, Petra; Dods, Robert; Davidsson, Jan; Matsuoka, Shigeru; Kawatake, Satoshi; Murata, Michio; Nureki, Osamu; Owada, Shigeki; Kameshima, Takashi; Hatsui, Takaki; Joti, Yasumasa; Schertler, Gebhard; Yabashi, Makina; Bondar, Ana-Nicoleta; Standfuss, Jörg; Neutze, Richard; Iwata, So
2016-12-23
Bacteriorhodopsin (bR) is a light-driven proton pump and a model membrane transport protein. We used time-resolved serial femtosecond crystallography at an x-ray free electron laser to visualize conformational changes in bR from nanoseconds to milliseconds following photoactivation. An initially twisted retinal chromophore displaces a conserved tryptophan residue of transmembrane helix F on the cytoplasmic side of the protein while dislodging a key water molecule on the extracellular side. The resulting cascade of structural changes throughout the protein shows how motions are choreographed as bR transports protons uphill against a transmembrane concentration gradient. Copyright © 2016, American Association for the Advancement of Science.
The First Mammalian Aldehyde Oxidase Crystal Structure
Coelho, Catarina; Mahro, Martin; Trincão, José; Carvalho, Alexandra T. P.; Ramos, Maria João; Terao, Mineko; Garattini, Enrico; Leimkühler, Silke; Romão, Maria João
2012-01-01
Aldehyde oxidases (AOXs) are homodimeric proteins belonging to the xanthine oxidase family of molybdenum-containing enzymes. Each 150-kDa monomer contains a FAD redox cofactor, two spectroscopically distinct [2Fe-2S] clusters, and a molybdenum cofactor located within the protein active site. AOXs are characterized by broad range substrate specificity, oxidizing different aldehydes and aromatic N-heterocycles. Despite increasing recognition of its role in the metabolism of drugs and xenobiotics, the physiological function of the protein is still largely unknown. We have crystallized and solved the crystal structure of mouse liver aldehyde oxidase 3 to 2.9 Å. This is the first mammalian AOX whose structure has been solved. The structure provides important insights into the protein active center and further evidence on the catalytic differences characterizing AOX and xanthine oxidoreductase. The mouse liver aldehyde oxidase 3 three-dimensional structure combined with kinetic, mutagenesis data, molecular docking, and molecular dynamics studies make a decisive contribution to understand the molecular basis of its rather broad substrate specificity. PMID:23019336
ePlant and the 3D data display initiative: integrative systems biology on the world wide web.
Fucile, Geoffrey; Di Biase, David; Nahal, Hardeep; La, Garon; Khodabandeh, Shokoufeh; Chen, Yani; Easley, Kante; Christendat, Dinesh; Kelley, Lawrence; Provart, Nicholas J
2011-01-10
Visualization tools for biological data are often limited in their ability to interactively integrate data at multiple scales. These computational tools are also typically limited by two-dimensional displays and programmatic implementations that require separate configurations for each of the user's computing devices and recompilation for functional expansion. Towards overcoming these limitations we have developed "ePlant" (http://bar.utoronto.ca/eplant) - a suite of open-source world wide web-based tools for the visualization of large-scale data sets from the model organism Arabidopsis thaliana. These tools display data spanning multiple biological scales on interactive three-dimensional models. Currently, ePlant consists of the following modules: a sequence conservation explorer that includes homology relationships and single nucleotide polymorphism data, a protein structure model explorer, a molecular interaction network explorer, a gene product subcellular localization explorer, and a gene expression pattern explorer. The ePlant's protein structure explorer module represents experimentally determined and theoretical structures covering >70% of the Arabidopsis proteome. The ePlant framework is accessed entirely through a web browser, and is therefore platform-independent. It can be applied to any model organism. To facilitate the development of three-dimensional displays of biological data on the world wide web we have established the "3D Data Display Initiative" (http://3ddi.org).
Protein Assembly and Building Blocks: Beyond the Limits of the LEGO Brick Metaphor.
Levy, Yaakov
2017-09-26
Proteins, like other biomolecules, have a modular and hierarchical structure. Various building blocks are used to construct proteins of high structural complexity and diverse functionality. In multidomain proteins, for example, domains are fused to each other in different combinations to achieve different functions. Although the LEGO brick metaphor is justified as a means of simplifying the complexity of three-dimensional protein structures, several fundamental properties (such as allostery or the induced-fit mechanism) make deviation from it necessary to respect the plasticity, softness, and cross-talk that are essential to protein function. In this work, we illustrate recently reported protein behavior in multidomain proteins that deviates from the LEGO brick analogy. While earlier studies showed that a protein domain is often unaffected by being fused to another domain or becomes more stable following the formation of a new interface between the tethered domains, destabilization due to tethering has been reported for several systems. We illustrate that tethering may sometimes result in a multidomain protein behaving as "less than the sum of its parts". We survey these cases for which structure additivity does not guarantee thermodynamic additivity. Protein destabilization due to fusion to other domains may be linked in some cases to biological function and should be taken into account when designing large assemblies.
Eichmann, Cédric; Orts, Julien; Tzitzilonis, Christos; Vögeli, Beat; Smrt, Sean; Lorieau, Justin; Riek, Roland
2014-12-11
The interaction between membrane proteins and lipids or lipid mimetics such as detergents is key for the three-dimensional structure and dynamics of membrane proteins. In NMR-based structural studies of membrane proteins, qualitative analysis of intermolecular nuclear Overhauser enhancements (NOEs) or paramagnetic resonance enhancement are used in general to identify the transmembrane segments of a membrane protein. Here, we employed a quantitative characterization of intermolecular NOEs between (1)H of the detergent and (1)H(N) of (2)H-perdeuterated, (15)N-labeled α-helical membrane protein-detergent complexes following the exact NOE (eNOE) approach. Structural considerations suggest that these intermolecular NOEs should show a helical-wheel-type behavior along a transmembrane helix or a membrane-attached helix within a membrane protein as experimentally demonstrated for the complete influenza hemagglutinin fusion domain HAfp23. The partial absence of such a NOE pattern along the amino acid sequence as shown for a truncated variant of HAfp23 and for the Escherichia coli inner membrane protein YidH indicates the presence of large tertiary structure fluctuations such as an opening between helices or the presence of large rotational dynamics of the helices. Detergent-protein NOEs thus appear to be a straightforward probe for a qualitative characterization of structural and dynamical properties of membrane proteins embedded in detergent micelles.
Direct folding simulation of helical proteins using an effective polarizable bond force field.
Duan, Lili; Zhu, Tong; Ji, Changge; Zhang, Qinggang; Zhang, John Z H
2017-06-14
We report a direct folding study of seven helical proteins (, Trpcage, , C34, N36, , ) ranging from 17 to 53 amino acids through standard molecular dynamics simulations using a recently developed polarizable force field, the Effective Polarizable Bond (EPB) method. The backbone RMSDs, radii of gyration, native contacts and native helix content are in good agreement with the experimental results. Cluster analysis has also verified that the folded structures with the highest population are in good agreement with the corresponding native structures of these proteins. In addition, the free energy landscapes of the seven proteins in the two-dimensional space spanned by RMSD and radius of gyration show that these folded structures are indeed the lowest-energy conformations. However, when the corresponding simulations were performed using the standard (nonpolarizable) AMBER force fields, no stable folded structures were observed for these proteins. Comparison of the simulation results based on the polarizable EPB force field and a nonpolarizable AMBER force field clearly demonstrates the importance of polarization in the folding of stable helical structures.
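A free energy landscape over RMSD and radius of gyration is commonly estimated from how often a trajectory visits each region, using F = -kT ln(P / P_max). The sketch below shows that standard estimate on a plain 2D histogram. It is not the authors' analysis code; the bin width, the temperature, and the function name are assumptions.

```python
import math
from collections import Counter

KB = 0.0019872041  # Boltzmann constant in kcal/(mol K)


def free_energy_surface(rmsd, rg, bin_width=0.5, temperature=300.0):
    """Estimate F(RMSD, Rg) = -kT ln(P / P_max) from per-frame values.

    rmsd, rg: equal-length sequences of per-frame values (same length unit, e.g. Å).
    bin_width, temperature: illustrative defaults, not taken from the paper.
    Returns a dict mapping (rmsd_bin, rg_bin) to free energy in kcal/mol,
    with the most populated bin defined as zero.
    """
    if len(rmsd) != len(rg):
        raise ValueError("rmsd and rg must have the same number of frames")
    counts = Counter(
        (math.floor(r / bin_width), math.floor(g / bin_width))
        for r, g in zip(rmsd, rg)
    )
    peak = max(counts.values())
    kt = KB * temperature
    return {bin_id: -kt * math.log(n / peak) for bin_id, n in counts.items()}


if __name__ == "__main__":
    # Synthetic data: 100 frames near a compact state, 10 frames in an extended one.
    rmsd = [0.1] * 100 + [3.1] * 10
    rg = [9.1] * 100 + [12.1] * 10
    for bin_id, f in sorted(free_energy_surface(rmsd, rg).items()):
        print(bin_id, round(f, 3))
```

Unvisited bins are simply absent from the result, since their free energy is undefined at this level of sampling.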
NASA Astrophysics Data System (ADS)
Paulino, M.; Esteves, A.; Vega, M.; Tabares, G.; Ehrlich, R.; Tapia, O.
1998-07-01
EgDf1 is a developmentally regulated protein from the parasite Echinococcus granulosus related to a family of hydrophobic ligand binding proteins. This protein could play a crucial role during the parasite life cycle, since this organism is unable to synthesize most of its own lipids de novo. Furthermore, it has been shown that two related proteins from other parasitic platyhelminths (Fh15 from Fasciola hepatica and Sm14 from Schistosoma mansoni) are able to confer protective immunity against experimental infection in animal models. A three-dimensional structure would help in establishing structure/function relationships in a knowledge-based manner. 3D structures for the EgDf1 protein were modelled by using myelin P2 (mP2) and intestine fatty acid binding protein (I-FABP) as templates. Molecular dynamics techniques were used to validate the models. Template mP2 yielded the best 3D structure for EgDf1. Palmitic and oleic acids were docked inside EgDf1. The present theoretical results suggest definite locations in the secondary structure for the epitopic regions and consensus phosphorylation motifs, and point to oleic acid as a good ligand candidate for EgDf1. This protein might well be involved in the process of supplying hydrophobic metabolites for membrane biosynthesis and for signaling pathways.
Coarse-grained mechanics of viral shells
NASA Astrophysics Data System (ADS)
Klug, William S.; Gibbons, Melissa M.
2008-03-01
We present an approach for creating three-dimensional finite element models of viral capsids from atomic-level structural data (X-ray or cryo-EM). The models capture heterogeneous geometric features and are used in conjunction with three-dimensional nonlinear continuum elasticity to simulate nanoindentation experiments as performed using atomic force microscopy. The method is extremely flexible; able to capture varying levels of detail in the three-dimensional structure. Nanoindentation simulations are presented for several viruses: Hepatitis B, CCMV, HK97, and φ29. In addition to purely continuum elastic models a multiscale technique is developed that combines finite-element kinematics with MD energetics such that large-scale deformations are facilitated by a reduction in degrees of freedom. Simulations of these capsid deformation experiments provide a testing ground for the techniques, as well as insight into the strength-determining mechanisms of capsid deformation. These methods can be extended as a framework for modeling other proteins and macromolecular structures in cell biology.
Can misfolded proteins be beneficial? The HAMLET case.
Pettersson-Kastberg, Jenny; Aits, Sonja; Gustafsson, Lotta; Mossberg, Anki; Storm, Petter; Trulsson, Maria; Persson, Filip; Mok, K Hun; Svanborg, Catharina
2009-01-01
By changing the three-dimensional structure, a protein can attain new functions, distinct from those of the native protein. Amyloid-forming proteins are one example, in which conformational change may lead to fibril formation and, in many cases, neurodegenerative disease. We have proposed that partial unfolding provides a mechanism to generate new and useful functional variants from a given polypeptide chain. Here we present HAMLET (Human Alpha-lactalbumin Made LEthal to Tumor cells) as an example where partial unfolding and the incorporation of cofactor create a complex with new, beneficial properties. Native alpha-lactalbumin functions as a substrate specifier in lactose synthesis, but when partially unfolded the protein binds oleic acid and forms the tumoricidal HAMLET complex. When the properties of HAMLET were first described they were surprising, as protein folding intermediates and especially amyloid-forming protein intermediates had been regarded as toxic conformations, but since then structural studies have supported functional diversity arising from a change in fold. The properties of HAMLET suggest a mechanism of structure-function variation, which might help the limited number of human protein genes to generate sufficient structural diversity to meet the diverse functional demands of complex organisms.
(PS)2: protein structure prediction server version 3.0.
Huang, Tsun-Tsao; Hwang, Jenn-Kang; Chen, Chu-Huang; Chu, Chih-Sheng; Lee, Chi-Wen; Chen, Chih-Chieh
2015-07-01
Protein complexes are involved in many biological processes. Examining coupling between subunits of a complex would be useful to understand the molecular basis of protein function. Here, our updated (PS)(2) web server predicts the three-dimensional structures of protein complexes based on comparative modeling; furthermore, this server examines the coupling between subunits of the predicted complex by combining structural and evolutionary considerations. The predicted complex structure could be indicated and visualized by Java-based 3D graphics viewers and the structural and evolutionary profiles are shown and compared chain-by-chain. For each subunit, considerations with or without the packing contribution of other subunits cause the differences in similarities between structural and evolutionary profiles, and these differences imply which form, complex or monomeric, is preferred in the biological condition for the subunit. We believe that the (PS)(2) server would be a useful tool for biologists who are interested not only in the structures of protein complexes but also in the coupling between subunits of the complexes. The (PS)(2) is freely available at http://ps2v3.life.nctu.edu.tw/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Small Scaffolds, Big Potential: Developing Miniature Proteins as Therapeutic Agents.
Holub, Justin M
2017-09-01
Preclinical Research. Miniature proteins are a class of oligopeptide characterized by their short sequence lengths and ability to adopt well-folded, three-dimensional structures. Because of their biomimetic nature and synthetic tractability, miniature proteins have been used to study a range of biochemical processes including fast protein folding, signal transduction, catalysis and molecular transport. Recently, miniature proteins have been gaining traction as potential therapeutic agents because their small size and ability to fold into defined tertiary structures facilitate their development as protein-based drugs. This research overview discusses emerging developments involving the use of miniature proteins as scaffolds to design novel therapeutics for the treatment and study of human disease. Specifically, this review will explore strategies to: (i) stabilize miniature protein tertiary structure; (ii) optimize biomolecular recognition by grafting functional epitopes onto miniature protein scaffolds; and (iii) enhance cytosolic delivery of miniature proteins through the use of cationic motifs that facilitate endosomal escape. These objectives are discussed not only to address challenges in developing effective miniature protein-based drugs, but also to highlight the tremendous potential miniature proteins hold for combating and understanding human disease. Drug Dev Res 78: 268-282, 2017. © 2017 Wiley Periodicals, Inc.
G23D: Online tool for mapping and visualization of genomic variants on 3D protein structures.
Solomon, Oz; Kunik, Vered; Simon, Amos; Kol, Nitzan; Barel, Ortal; Lev, Atar; Amariglio, Ninette; Somech, Raz; Rechavi, Gidi; Eyal, Eran
2016-08-26
Evaluation of the possible implications of genomic variants is an increasingly important task in the current high throughput sequencing era. Structural information however is still not routinely exploited during this evaluation process. The main reasons can be attributed to the partial structural coverage of the human proteome and the lack of tools which conveniently convert genomic positions, which are the frequent output of genomic pipelines, to proteins and structure coordinates. We present G23D, a tool for conversion of human genomic coordinates to protein coordinates and protein structures. G23D allows mapping of genomic positions/variants on evolutionary related (and not only identical) protein three dimensional (3D) structures as well as on theoretical models. By doing so it significantly extends the space of variants for which structural insight is feasible. To facilitate interpretation of the variant consequence, pathogenic variants, functional sites and polymorphism sites are displayed on protein sequence and structure diagrams alongside the input variants. G23D also provides modeling of the mutant structure, analysis of intra-protein contacts and instant access to functional predictions and predictions of thermo-stability changes. G23D is available at http://www.sheba-cancer.org.il/G23D . G23D extends the fraction of variants for which structural analysis is applicable and provides better and faster accessibility for structural data to biologists and geneticists who routinely work with genomic information.
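The conversion from genomic positions to protein coordinates described above can be illustrated with a minimal sketch. It is not G23D's implementation: the function name `genomic_to_residue`, the forward-strand assumption, and the exon representation are all hypothetical choices made for illustration.

```python
def genomic_to_residue(pos, cds_exons):
    """Map a 1-based genomic position to a 1-based residue number.

    cds_exons: coding intervals as (start, end), 1-based inclusive,
    listed in transcript order on the forward strand (an assumption;
    reverse-strand genes would need the arithmetic mirrored).
    Returns None when the position is not inside any coding interval.
    """
    offset = 0  # number of coding bases in the exons already passed
    for start, end in cds_exons:
        if start <= pos <= end:
            cds_index = offset + (pos - start)  # 0-based index within the CDS
            return cds_index // 3 + 1           # three bases per codon
        offset += end - start + 1
    return None


# Hypothetical two-exon gene: 5 coding bases, an intron, then 11 more.
exons = [(100, 104), (200, 210)]
print(genomic_to_residue(201, exons))  # first base of the third codon
```

A residue number obtained this way would still have to be aligned to the numbering used in a structure file, which often differs from the sequence numbering.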
Regad, Leslie; Martin, Juliette; Camproux, Anne-Claude
2011-06-20
One of the strategies for protein function annotation is to search particular structural motifs that are known to be shared by proteins with a given function. Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet), which allows simplification of protein structures into uni-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i) ubiquitous motifs, shared by several superfamilies and (ii) superfamily-specific motifs, over-represented in few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in the binding sites of small ligands, such as ATP/GTP, NAD(P) and SAH/SAM. Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins.
2011-01-01
Background One of the strategies for protein function annotation is to search particular structural motifs that are known to be shared by proteins with a given function. Results Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet), which allows simplification of protein structures into uni-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i) ubiquitous motifs, shared by several superfamilies and (ii) superfamily-specific motifs, over-represented in few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in the binding sites of small ligands, such as ATP/GTP, NAD(P) and SAH/SAM. Conclusions Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins. PMID:21689388
In silico modeling of the Moniliophthora perniciosa Atg8 protein.
Pereira, A C F; Cardoso, T H S; Brendel, M; Pungartnik, C
2013-12-11
Autophagy is defined as an intracellular system of lysosomal degradation in eukaryotic cells, and the genes involved in this process are conserved from yeast to humans. Among these genes, ATG8 encodes a ubiquitin-like protein that is conjugated to a phosphatidylethanolamine (PE) membrane by the ubiquitination system. The Atg8p-PE complex is important in initiating the formation of the autophagosome and thus plays a critical role in autophagy. In silico modeling of Atg8p of Moniliophthora perniciosa revealed its three-dimensional structure and enabled comparison with its Saccharomyces cerevisiae homologue ScAtg8p. Some common and distinct features were observed between these two proteins, including the conservation of residues required to allow the interaction of α-helix1 with the ubiquitin core. However, the electrostatic potential surfaces of these helices differ, implying particular roles in selecting specific binding partners. The proposed structure was validated by the programs PROCHECK 3.4, ANOLEA, and QMEAN, which demonstrated 100% of amino acids located in favorable regions with low total energy. Our results showed that MpAtg8p contains the same functional domains (3 α-helices and 4 β-sheets) and is similar in structure as the ScAtg8p yeast. Both proteins have many conserved sequences in common, and therefore, their proposed three-dimensional models show similar configuration.
Paquet, M J; Laviolette, M; Pézolet, M; Auger, M
2001-01-01
Two-dimensional infrared correlation spectroscopy (2D-IR) was used in this study to investigate the aggregation of cytochrome c in the presence of dimyristoylphosphatidylglycerol. The influence of temperature on the aggregation has been evaluated by monitoring the intensity of a band at 1616 cm(-1), which is characteristic of aggregated proteins, and the 2D-IR analysis has been used to determine the various secondary structure components of cytochrome c involved before and during its aggregation. The 2D-IR correlation analysis clearly reveals for the first time that aggregation starts to occur between nearly native proteins, which then unfold, yielding to further aggregation of the protein. Later in the aggregation process, the formation of intermolecular bonds and unfolding of the alpha-helices appear to be simultaneous. These results lead us to propose a two-step aggregation process. Finally, the results obtained during the heating period clearly indicate that before the protein starts to aggregate, there is a loosening of the tertiary structure of cytochrome c, resulting in a decrease of the beta-sheet content and an increase of the amount of beta-turns. This study clearly demonstrates the potential of 2D-IR spectroscopy to investigate the aggregation of proteins and this technique could therefore be applied to other proteins such as those involved in fibrilogenesis. PMID:11423415
Hensen, Ulf; Meyer, Tim; Haas, Jürgen; Rex, René; Vriend, Gert; Grubmüller, Helmut
2012-01-01
Proteins are usually described and classified according to amino acid sequence, structure or function. Here, we develop a minimally biased scheme to compare and classify proteins according to their internal mobility patterns. This approach is based on the notion that proteins not only fold into recurring structural motifs but might also be carrying out only a limited set of recurring mobility motifs. The complete set of these patterns, which we tentatively call the dynasome, spans a multi-dimensional space with axes, the dynasome descriptors, characterizing different aspects of protein dynamics. The unique dynamic fingerprint of each protein is represented as a vector in the dynasome space. The difference between any two vectors, consequently, gives a reliable measure of the difference between the corresponding protein dynamics. We characterize the properties of the dynasome by comparing the dynamics fingerprints obtained from molecular dynamics simulations of 112 proteins but our approach is, in principle, not restricted to any specific source of data of protein dynamics. We conclude that: 1. the dynasome consists of a continuum of proteins, rather than well separated classes. 2. For the majority of proteins we observe strong correlations between structure and dynamics. 3. Proteins with similar function carry out similar dynamics, which suggests a new method to improve protein function annotation based on protein dynamics. PMID:22606222
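The idea of representing each protein as a vector and comparing proteins by the difference between vectors can be sketched as follows. The descriptor values, the use of a plain Euclidean distance, and the names `dynamics_distance` and `nearest_protein` are assumptions for illustration; the abstract does not specify the metric or any normalization of the descriptors.

```python
import math


def dynamics_distance(fp_a, fp_b):
    """Euclidean distance between two dynamics fingerprints (assumed metric)."""
    if len(fp_a) != len(fp_b):
        raise ValueError("fingerprints must have the same number of descriptors")
    return math.dist(fp_a, fp_b)


def nearest_protein(query, fingerprints):
    """Return the name of the stored fingerprint closest to the query."""
    return min(fingerprints,
               key=lambda name: dynamics_distance(query, fingerprints[name]))


# Made-up two-descriptor fingerprints, for illustration only.
library = {"protein_A": [0.0, 0.0], "protein_B": [3.0, 4.0]}
print(nearest_protein([2.5, 3.5], library))
```

In practice the descriptors would probably need to be scaled to comparable ranges before distances are meaningful.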
Adaptive compressive learning for prediction of protein-protein interactions from primary sequence.
Zhang, Ya-Nan; Pan, Xiao-Yong; Huang, Yan; Shen, Hong-Bin
2011-08-21
Protein-protein interactions (PPIs) play an important role in biological processes. Although much effort has been devoted to identifying novel PPIs by integrating experimental biological knowledge, many difficulties remain because protein structural and functional information is often insufficient. Methods that predict PPIs from amino acid sequences alone are therefore highly desirable. However, sequence-based predictors often struggle with high dimensionality, which causes over-fitting and high computational complexity, as well as with redundancy in the sequential feature vectors. In this paper, a novel computational approach based on compressed sensing theory is proposed to predict yeast Saccharomyces cerevisiae PPIs from primary sequence, and it achieves promising results. The key advantage of the proposed compressed sensing algorithm is that it can compress the original high-dimensional protein sequential feature vector into a much lower-dimensional but more condensed space by taking the sparsity of the original signal into account. What makes compressed sensing especially attractive for protein sequence analysis is that a sparse signal can be reconstructed from far fewer measurements than traditional Nyquist sampling theory usually considers necessary. Experimental results demonstrate that the proposed compressed sensing method is powerful for analyzing noisy biological data and reducing redundancy in feature vectors. The proposed method represents a new strategy for dealing with high-dimensional discrete protein models and has great potential to be extended to many other complicated biological systems. Copyright © 2011 Elsevier Ltd. All rights reserved.
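The compression step of compressed sensing, projecting a high-dimensional feature vector onto far fewer random measurements, can be sketched as below. This shows only a generic Gaussian random projection; the feature encoding, the measurement matrix, and the reconstruction or classification stage used in the paper are not given in the abstract, so `measurement_matrix` and `compress` are hypothetical names and choices.

```python
import math
import random


def measurement_matrix(m, n, seed=0):
    """Build an m x n Gaussian random matrix, scaled by 1/sqrt(m)."""
    rng = random.Random(seed)  # fixed seed so the projection is reproducible
    scale = 1.0 / math.sqrt(m)
    return [[rng.gauss(0.0, 1.0) * scale for _ in range(n)] for _ in range(m)]


def compress(phi, x):
    """Compute the m compressed measurements y = phi * x."""
    return [sum(p * v for p, v in zip(row, x)) for row in phi]


# A sparse 10-dimensional feature vector compressed to 4 measurements.
features = [0.0, 0.0, 1.5, 0.0, 0.0, 0.0, -2.0, 0.0, 0.0, 0.0]
phi = measurement_matrix(4, 10, seed=1)
print(len(compress(phi, features)))
```

Recovering the original vector from the measurements would additionally require a sparse solver, which is omitted here.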
A method for fast energy estimation and visualization of protein-ligand interaction
NASA Astrophysics Data System (ADS)
Tomioka, Nobuo; Itai, Akiko; Iitaka, Yoichi
1987-10-01
A new computational and graphical method for facilitating ligand-protein docking studies is developed on a three-dimensional computer graphics display. Various physical and chemical properties inside the ligand binding pocket of a receptor protein, whose structure is elucidated by X-ray crystal analysis, are calculated on three-dimensional grid points and are stored in advance. By utilizing those tabulated data, it is possible to estimate the non-bonded and electrostatic interaction energy and the number of possible hydrogen bonds between protein and ligand molecules in real time during an interactive docking operation. The method also provides a comprehensive visualization of the local environment inside the binding pocket. With this method, it becomes easier to find a roughly stable geometry of ligand molecules, and one can therefore make a rapid survey of the binding capability of many drug candidates. The method will be useful for drug design as well as for the examination of protein-ligand interactions.
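The tabulate-then-look-up scheme described above can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' method: only a bare Coulomb-like term in arbitrary units is tabulated, nearest-grid-point lookup replaces any interpolation, and the function names and the `min_dist` clamp are hypothetical.

```python
import math


def build_potential_grid(protein_atoms, origin, spacing, shape, min_dist=0.5):
    """Precompute a Coulomb-like potential from protein atoms on a 3D grid.

    protein_atoms: iterable of (x, y, z, charge).
    min_dist clamps very short distances to avoid division by zero.
    """
    nx, ny, nz = shape
    grid = [[[0.0] * nz for _ in range(ny)] for _ in range(nx)]
    for i in range(nx):
        for j in range(ny):
            for k in range(nz):
                point = (origin[0] + i * spacing,
                         origin[1] + j * spacing,
                         origin[2] + k * spacing)
                grid[i][j][k] = sum(
                    q / max(math.dist(point, (x, y, z)), min_dist)
                    for x, y, z, q in protein_atoms)
    return grid


def ligand_energy(grid, origin, spacing, ligand_atoms):
    """Estimate the interaction energy by table lookup at the nearest grid point."""
    dims = (len(grid), len(grid[0]), len(grid[0][0]))
    energy = 0.0
    for x, y, z, q in ligand_atoms:
        idx = [round((c - o) / spacing) for c, o in zip((x, y, z), origin)]
        if any(i < 0 or i >= n for i, n in zip(idx, dims)):
            raise ValueError("ligand atom lies outside the tabulated grid")
        energy += q * grid[idx[0]][idx[1]][idx[2]]
    return energy


# One unit charge at the origin; grid points at x = 1, 2, 3.
grid = build_potential_grid([(0.0, 0.0, 0.0, 1.0)], (1.0, 0.0, 0.0), 1.0, (3, 1, 1))
print(ligand_energy(grid, (1.0, 0.0, 0.0), 1.0, [(2.0, 0.0, 0.0, -1.0)]))
```

Because the expensive sum over protein atoms is done once, each re-evaluation during interactive docking costs only one lookup per ligand atom, which is what makes a real-time estimate plausible.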
Small Artery Elastin Distribution and Architecture-Focus on Three Dimensional Organization.
Hill, Michael A; Nourian, Zahra; Ho, I-Lin; Clifford, Philip S; Martinez-Lemus, Luis; Meininger, Gerald A
2016-11-01
The distribution of ECM proteins within the walls of resistance vessels is complex both in variety of proteins and structural arrangement. In particular, elastin exists as discrete fibers varying in orientation across the adventitia and media as well as often resembling a sheet-like structure in the case of the IEL. Adding to the complexity is the tissue heterogeneity that exists in these structural arrangements. For example, small intracranial cerebral arteries lack adventitial elastin while similar sized arteries from skeletal muscle and intestinal mesentery exhibit a complex adventitial network of elastin fibers. With regard to the IEL, several vascular beds exhibit an elastin sheet with punctate holes/fenestrae while in others the IEL is discontinuous and fibrous in appearance. Importantly, these structural patterns likely sub-serve specific functional properties, including mechanosensing, control of external forces, mechanical properties of the vascular wall, cellular positioning, and communication between cells. Of further significance, these processes are altered in vascular disorders such as hypertension and diabetes mellitus where there is modification of ECM. This brief report focuses on the three-dimensional wall structure of small arteries and considers possible implications with regard to mechanosensing under physiological and pathophysiological conditions. © 2016 John Wiley & Sons Ltd.
Hartl, F Ulrich
2017-06-20
The majority of protein molecules must fold into defined three-dimensional structures to acquire functional activity. However, protein chains can adopt a multitude of conformational states, and their biologically active conformation is often only marginally stable. Metastable proteins tend to populate misfolded species that are prone to forming toxic aggregates, including soluble oligomers and fibrillar amyloid deposits, which are linked with neurodegeneration in Alzheimer and Parkinson disease, and many other pathologies. To prevent or regulate protein aggregation, all cells contain an extensive protein homeostasis (or proteostasis) network comprising molecular chaperones and other factors. These defense systems tend to decline during aging, facilitating the manifestation of aggregate deposition diseases. This volume of the Annual Review of Biochemistry contains a set of three articles addressing our current understanding of the structures of pathological protein aggregates and their associated disease mechanisms. These articles also discuss recent insights into the strategies cells have evolved to neutralize toxic aggregates by sequestering them in specific cellular locations.
Zhang, Zhe; Schindler, Christina E. M.; Lange, Oliver F.; Zacharias, Martin
2015-01-01
The high-resolution refinement of docked protein-protein complexes can provide valuable structural and mechanistic insight into protein complex formation complementing experiment. Monte Carlo (MC) based approaches are frequently applied to sample putative interaction geometries of proteins including also possible conformational changes of the binding partners. In order to explore efficiency improvements of the MC sampling, several enhanced sampling techniques, including temperature or Hamiltonian replica exchange and well-tempered ensemble approaches, have been combined with the MC method and were evaluated on 20 protein complexes using unbound partner structures. The well-tempered ensemble method combined with a 2-dimensional temperature and Hamiltonian replica exchange scheme (WTE-H-REMC) was identified as the most efficient search strategy. Comparison with prolonged MC searches indicates that the WTE-H-REMC approach requires approximately 5 times fewer MC steps to identify near native docking geometries compared to conventional MC searches. PMID:26053419
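For readers unfamiliar with replica exchange, the standard Metropolis criterion for swapping configurations between two temperature replicas is sketched below. This is the textbook temperature-exchange rule only; the Hamiltonian exchange and well-tempered ensemble components of the WTE-H-REMC scheme are not reproduced, and the function name and the reduced units (`k_b=1.0`) are assumptions.

```python
import math


def swap_acceptance(energy_i, energy_j, temp_i, temp_j, k_b=1.0):
    """Probability of accepting a swap between two temperature replicas.

    Replica i currently holds a configuration of energy energy_i at temp_i,
    and likewise for replica j. The standard criterion is
    min(1, exp((beta_i - beta_j) * (energy_i - energy_j))).
    """
    beta_i = 1.0 / (k_b * temp_i)
    beta_j = 1.0 / (k_b * temp_j)
    delta = (beta_i - beta_j) * (energy_i - energy_j)
    return 1.0 if delta >= 0.0 else math.exp(delta)


# The cold replica holds the higher-energy configuration: always swap.
print(swap_acceptance(5.0, 1.0, temp_i=1.0, temp_j=2.0))
```

In a sampling loop, the swap would be performed when a uniform random number in [0, 1) falls below the returned probability.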
The role of stabilization centers in protein thermal stability
DOE Office of Scientific and Technical Information (OSTI.GOV)
Magyar, Csaba; Gromiha, M. Michael; Sávoly, Zoltán
2016-02-26
The definition of stabilization centers was introduced almost two decades ago. They are centers of noncovalent long range interaction clusters, believed to have a role in maintaining the three-dimensional structure of proteins by preventing their decay due to their cooperative long range interactions. Here, this hypothesis is investigated from the viewpoint of thermal stability for the first time, using a large protein thermodynamics database. The positions of amino acids belonging to stabilization centers are correlated with available experimental thermodynamic data on protein thermal stability. Our analysis suggests that stabilization centers, especially solvent exposed ones, do contribute to the thermal stabilization of proteins. Highlights: • Stabilization centers contribute to thermal stabilization of protein structures. • Stabilization center content correlates with melting temperature of proteins. • Exposed stabilization center content correlates with stability even in hyperthermophiles. • Stability changing mutations are frequently found at stabilization centers.
Imai, Takashi; Ohyama, Shusaku; Kovalenko, Andriy; Hirata, Fumio
2007-01-01
The partial molar volume (PMV) change associated with the pressure-induced structural transition of ubiquitin is analyzed by the three-dimensional reference interaction site model (3D-RISM) theory of molecular solvation. The theory predicts that the PMV decreases upon the structural transition, which is consistent with the experimental observation. The volume decomposition analysis demonstrates that the PMV reduction is primarily caused by the decrease in the volume of structural voids in the protein, which is partially canceled by the volume expansion due to the hydration effects. It is found from further analysis that the PMV reduction is ascribed substantially to the penetration of water molecules into a specific part of the protein. Based on the thermodynamic relation, this result implies that the water penetration causes the pressure-induced structural transition. It supports the water penetration model of pressure denaturation of proteins proposed earlier. PMID:17660257
Imai, Takashi; Ohyama, Shusaku; Kovalenko, Andriy; Hirata, Fumio
2007-09-01
The partial molar volume (PMV) change associated with the pressure-induced structural transition of ubiquitin is analyzed by the three-dimensional reference interaction site model (3D-RISM) theory of molecular solvation. The theory predicts that the PMV decreases upon the structural transition, which is consistent with the experimental observation. The volume decomposition analysis demonstrates that the PMV reduction is primarily caused by the decrease in the volume of structural voids in the protein, which is partially canceled by the volume expansion due to the hydration effects. It is found from further analysis that the PMV reduction is ascribed substantially to the penetration of water molecules into a specific part of the protein. Based on the thermodynamic relation, this result implies that the water penetration causes the pressure-induced structural transition. It supports the water penetration model of pressure denaturation of proteins proposed earlier.
Kihara, Daisuke; Sael, Lee; Chikhi, Rayan; Esquivel-Rodriguez, Juan
2011-09-01
The tertiary structures of proteins have been solved at an increasing pace in recent years. To capitalize on the enormous effort invested in accumulating structure data, efficient and effective computational methods need to be developed for comparing, searching, and investigating interactions of protein structures. We introduce the 3D Zernike descriptor (3DZD), an emerging technique to describe molecular surfaces. The 3DZD is a series expansion of a mathematical three-dimensional function, and thus a tertiary structure is represented compactly by a vector of coefficients of the terms in the series. A strong advantage of the 3DZD is that it is invariant to rotation of the target object being represented. These two characteristics of the 3DZD allow rapid comparison of surface shapes, which is sufficient for real-time structure database screening. In this article, we review various applications of the 3DZD that have been recently proposed.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leite, Wellington C.; Galvão, Carolina W.; Saab, Sérgio C.
The bacterial RecA protein plays a role in the complex system of DNA damage repair. Here, we report the functional and structural characterization of the Herbaspirillum seropedicae RecA protein (HsRecA). HsRecA protein is more efficient at displacing SSB protein from ssDNA than Escherichia coli RecA protein. HsRecA also promotes DNA strand exchange more efficiently. The three dimensional structure of HsRecA-ADP/ATP complex has been solved to 1.7 Å resolution. HsRecA protein contains a small N-terminal domain, a central core ATPase domain and a large C-terminal domain, that are similar to homologous bacterial RecA proteins. Comparative structural analysis showed that the N-terminal polymerization motif of archaeal and eukaryotic RecA family proteins are also present in bacterial RecAs. Reconstruction of electrostatic potential from the hexameric structure of HsRecA-ADP/ATP revealed a high positive charge along the inner side, where ssDNA is bound inside the filament. The properties of this surface may explain the greater capacity of HsRecA protein to bind ssDNA, forming a contiguous nucleoprotein filament, displace SSB and promote DNA exchange relative to EcRecA. In conclusion, our functional and structural analyses provide insight into the molecular mechanisms of polymerization of bacterial RecA as a helical nucleoprotein filament.
Galvão, Carolina W.; Saab, Sérgio C.; Iulek, Jorge; Etto, Rafael M.; Steffens, Maria B. R.; Chitteni-Pattu, Sindhu; Stanage, Tyler; Keck, James L.; Cox, Michael M.
2016-01-01
The bacterial RecA protein plays a role in the complex system of DNA damage repair. Here, we report the functional and structural characterization of the Herbaspirillum seropedicae RecA protein (HsRecA). HsRecA protein is more efficient at displacing SSB protein from ssDNA than Escherichia coli RecA protein. HsRecA also promotes DNA strand exchange more efficiently. The three dimensional structure of HsRecA-ADP/ATP complex has been solved to 1.7 Å resolution. HsRecA protein contains a small N-terminal domain, a central core ATPase domain and a large C-terminal domain, that are similar to homologous bacterial RecA proteins. Comparative structural analysis showed that the N-terminal polymerization motif of archaeal and eukaryotic RecA family proteins are also present in bacterial RecAs. Reconstruction of electrostatic potential from the hexameric structure of HsRecA-ADP/ATP revealed a high positive charge along the inner side, where ssDNA is bound inside the filament. The properties of this surface may explain the greater capacity of HsRecA protein to bind ssDNA, forming a contiguous nucleoprotein filament, displace SSB and promote DNA exchange relative to EcRecA. Our functional and structural analyses provide insight into the molecular mechanisms of polymerization of bacterial RecA as a helical nucleoprotein filament. PMID:27447485
NASA Astrophysics Data System (ADS)
Zhang, Lei; Lei, Dongsheng; Smith, Jessica M.; Zhang, Meng; Tong, Huimin; Zhang, Xing; Lu, Zhuoyang; Liu, Jiankang; Alivisatos, A. Paul; Ren, Gang
2016-03-01
DNA base pairing has been used for many years to direct the arrangement of inorganic nanocrystals into small groupings and arrays with tailored optical and electrical properties. The control of DNA-mediated assembly depends crucially on a better understanding of three-dimensional structure of DNA-nanocrystal-hybridized building blocks. Existing techniques do not allow for structural determination of these flexible and heterogeneous samples. Here we report cryo-electron microscopy and negative-staining electron tomography approaches to image, and three-dimensionally reconstruct a single DNA-nanogold conjugate, an 84-bp double-stranded DNA with two 5-nm nanogold particles for potential substrates in plasmon-coupling experiments. By individual-particle electron tomography reconstruction, we obtain 14 density maps at ~2-nm resolution. Using these maps as constraints, we derive 14 conformations of dsDNA by molecular dynamics simulations. The conformational variation is consistent with that from liquid solution, suggesting that individual-particle electron tomography could be an expected approach to study DNA-assembling and flexible protein structure and dynamics.
DOE Office of Scientific and Technical Information (OSTI.GOV)
De Re, Eleonora; Schlau-Cohen, Gabriela S.; Leverenz, Ryan L.
Carotenoids play an essential role in photoprotection, interacting with other pigments to safely dissipate excess absorbed energy as heat. In cyanobacteria, the short time scale photoprotective mechanisms involve the photoactive orange carotenoid protein (OCP), which binds a single carbonyl carotenoid. Blue-green light induces the photoswitching of OCP from its ground state form (OCPO) to a metastable photoproduct (OCPR). OCPR can bind to the phycobilisome antenna and induce fluorescence quenching. The photoswitching is accompanied by structural and functional changes at the level of the protein and of the bound carotenoid. In this study, we use broadband two-dimensional electronic spectroscopy to look at the differences in excited state dynamics of the carotenoid in the two forms of OCP. Our results provide insight into the origin of the pronounced vibrational lineshape and oscillatory dynamics observed in linear absorption and 2D electronic spectroscopy of OCPO and the large inhomogeneous broadening in OCPR, with consequences for the chemical function of the two forms.
The Role of Protein Loops and Linkers in Conformational Dynamics and Allostery.
Papaleo, Elena; Saladino, Giorgio; Lambrughi, Matteo; Lindorff-Larsen, Kresten; Gervasio, Francesco Luigi; Nussinov, Ruth
2016-06-08
Proteins are dynamic entities that undergo a plethora of conformational changes that may take place on a wide range of time scales. These changes can be as small as the rotation of one or a few side-chain dihedral angles or involve concerted motions in larger portions of the three-dimensional structure; both kinds of motions can be important for biological function and allostery. It is becoming increasingly evident that "connector regions" are important components of the dynamic personality of protein structures. These regions may be either disordered loops, i.e., poorly structured regions connecting secondary structural elements, or linkers that connect entire protein domains. Experimental and computational studies have, however, revealed that these regions are not mere connectors, and their role in allostery and conformational changes has been emerging in the last few decades. Here we provide a detailed overview of the structural properties and classification of loops and linkers, as well as a discussion of the main computational methods employed to investigate their function and dynamical properties. We also describe their importance for protein dynamics and allostery using as examples key proteins in cellular biology and human diseases such as kinases, ubiquitinating enzymes, and transcription factors.
Covering complete proteomes with X-ray structures: A current snapshot
Mizianty, Marcin J.; Fan, Xiao; Yan, Jing; ...
2014-10-23
Structural genomics programs have developed and applied structure-determination pipelines to a wide range of protein targets, facilitating the visualization of macromolecular interactions and the understanding of their molecular and biochemical functions. The fundamental question of whether three-dimensional structures of all proteins and all functional annotations can be determined using X-ray crystallography is investigated. A first-of-its-kind large-scale analysis of crystallization propensity for all proteins encoded in 1953 fully sequenced genomes was performed. It is shown that current X-ray crystallographic knowhow combined with homology modeling can provide structures for 25% of modeling families (protein clusters for which structural models can be obtained through homology modeling), with at least one structural model produced for each Gene Ontology functional annotation. The coverage varies between superkingdoms, with 19% for eukaryotes, 35% for bacteria and 49% for archaea, and with those of viruses following the coverage values of their hosts. It is shown that the crystallization propensities of proteomes from the taxonomic superkingdoms are distinct. The use of knowledge-based target selection is shown to substantially increase the ability to produce X-ray structures. It is demonstrated that the human proteome has one of the highest attainable coverage values among eukaryotes, and GPCR membrane proteins suitable for X-ray structure determination were determined.
Shin, Jae-Min; Cho, Doo-Ho
2005-01-01
PDB-Ligand (http://www.idrtech.com/PDB-Ligand/) is a three-dimensional structure database of small molecular ligands that are bound to larger biomolecules deposited in the Protein Data Bank (PDB). It is also a database tool that allows one to browse, classify, superimpose and visualize these structures. As of May 2004, there are about 4870 types of small molecular ligands, experimentally determined as a complex with protein or DNA in the PDB. The proteins that a given ligand binds are often homologous and present the same binding structure to the ligand. However, there are also many instances wherein a given ligand binds to two or more unrelated proteins, or to the same or homologous protein in different binding environments. PDB-Ligand serves as an interactive structural analysis and clustering tool for all the ligand-binding structures in the PDB. PDB-Ligand also provides an easier way to obtain a number of different structure alignments of many related ligand-binding structures based on a simple and flexible ligand clustering method. PDB-Ligand will be a good resource for both a better interpretation of ligand-binding structures and the development of better scoring functions to be used in many drug discovery applications.
Comparative proteome analysis of monolayer and spheroid culture of canine osteosarcoma cells.
Gebhard, Christiane; Miller, Ingrid; Hummel, Karin; Neschi Née Ondrovics, Martina; Schlosser, Sarah; Walter, Ingrid
2018-04-15
Osteosarcoma is an aggressive bone tumor with high metastasis rate in the lungs and affects both humans and dogs in a similar way. Three-dimensional tumor cell cultures mimic the in vivo situation of micro-tumors and metastases and are therefore better experimental in vitro models than the often applied two-dimensional monolayer cultures. The aim of the present study was to perform comparative proteomics of standard monolayer cultures of canine osteosarcoma cells (D17) and three-dimensional spheroid cultures, to better characterize the 3D model before starting with experiments like migration assays. Using DIGE in combination with MALDI-TOF/TOF we found 27 unique canine proteins differently represented between these two culture systems, most of them being part of a functional network including mainly chaperones, structural proteins, stress-related proteins, proteins of the glycolysis/gluconeogenesis pathway and oxidoreductases. In monolayer cells, a noticeable shift to more acidic pI values was noticed for several proteins of medium to high abundance; two proteins (protein disulfide isomerase A3, stress-induced-phosphoprotein 1) showed an increase of phosphorylated protein species. Protein distribution within the cells, as detected by immunohistochemistry, displayed a switch of stress-induced-phosphoprotein 1 from the cytoplasm (in monolayer cultures) to the nucleus (in spheroid cultures). Additionally, Western blot testing revealed upregulated concentrations of metastasin (S100A4), triosephosphate isomerase 1 and septin 2 in spheroid cultures, in contrast to decreased concentrations of CCT2, a subunit of the T-complex. Results indicate regulation of stress proteins in the process of three-dimensional organization characterized by a hypoxic and nutrient-deficient environment comparable to tumor micro-metastases. Osteosarcoma is an aggressive bone tumor that early spreads to the lungs. 
Three-dimensional tumor cell cultures represent the avascular stage of micro-tumors and metastases, and should therefore represent a better experimental in vitro model compared to two-dimensional monolayer cultures. Significant differences have been reported in response to drug and radiation treatment between these two culture systems. A gel-based proteomic investigation was performed to compare protein patterns of a canine osteosarcoma cell line cultivated under those two conditions, to learn more about altered cell composition and its impact on cell behaviour. Due to the fact that the canine osteosarcoma is an accepted model for the human disease, results will be relevant for the human species as well. Copyright © 2018 Elsevier B.V. All rights reserved.
How Community Has Shaped the Protein Data Bank
Berman, Helen M.; Kleywegt, Gerard J.; Nakamura, Haruki; Markley, John L.
2015-01-01
Following several years of community discussion, the Protein Data Bank (PDB) was established in 1971 as a public repository for the coordinates of three-dimensional models of biological macromolecules. Since then, the number, size, and complexity of structural models have continued to grow, reflecting the productivity of structural biology. Managed by the Worldwide PDB organization, the PDB has been able to meet increasing demands for the quantity of structural information and of quality. In addition to providing unrestricted access to structural information, the PDB also works to promote data standards and to raise the profile of structural biology with broader audiences. In this perspective, we describe the history of PDB and the many ways in which the community continues to shape the archive. PMID:24010707
Liang, H; Olejniczak, E T; Mao, X; Nettesheim, D G; Yu, L; Thompson, C B; Fesik, S W
1994-01-01
The ets family of eukaryotic transcription factors is characterized by a conserved DNA-binding domain of approximately 85 amino acids for which the three-dimensional structure is not known. By using multidimensional NMR spectroscopy, we have determined the secondary structure of the ets domain of one member of this gene family, human Fli-1, both in the free form and in a complex with a 16-bp cognate DNA site. The secondary structure of the Fli-1 ets domain consists of three alpha-helices and a short four-stranded antiparallel beta-sheet. This secondary structure arrangement resembles that of the DNA-binding domain of the catabolite gene activator protein of Escherichia coli, as well as those of several eukaryotic DNA-binding proteins including histone H5, HNF-3/fork head, and the heat shock transcription factor. Differences in chemical shifts of backbone resonances and amide exchange rates between the DNA-bound and free forms of the Fli-1 ets domain suggest that the third helix is the DNA recognition helix, as in the catabolite gene activator protein and other structurally related proteins. These results suggest that the ets domain is structurally similar to the catabolite gene activator protein family of helix-turn-helix DNA-binding proteins. PMID:7972119
Kuzu, Guray; Keskin, Ozlem; Nussinov, Ruth; Gursoy, Attila
2016-10-01
The structures of protein assemblies are important for elucidating cellular processes at the molecular level. Three-dimensional electron microscopy (3DEM) is a powerful method to identify the structures of assemblies, especially those that are challenging to study by crystallography. Here, a new approach, PRISM-EM, is reported to computationally generate plausible structural models using a procedure that combines crystallographic structures and density maps obtained from 3DEM. The predictions are validated against seven available structurally different crystallographic complexes. The models display mean deviations in the backbone of <5 Å. PRISM-EM was further tested on different benchmark sets; the accuracy was evaluated with respect to the structure of the complex, and the correlation with EM density maps and interface predictions were evaluated and compared with those obtained using other methods. PRISM-EM was then used to predict the structure of the ternary complex of the HIV-1 envelope glycoprotein trimer, the ligand CD4 and the neutralizing protein m36.
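As a rough illustration of how a backbone deviation such as the one quoted above can be measured, the sketch below superimposes two coordinate sets with the Kabsch algorithm and reports the root-mean-square deviation. It is not the PRISM-EM procedure; the function name `kabsch_rmsd` and the toy coordinates are assumptions made purely for illustration.

```python
import numpy as np

def kabsch_rmsd(P, Q):
    """RMSD between two (N, 3) coordinate arrays after optimal superposition."""
    P = np.asarray(P, dtype=float)
    Q = np.asarray(Q, dtype=float)
    # Remove translation by centering both sets on their centroids.
    P = P - P.mean(axis=0)
    Q = Q - Q.mean(axis=0)
    # Covariance matrix and its singular value decomposition.
    H = P.T @ Q
    U, _, Vt = np.linalg.svd(H)
    # Guard against an improper rotation (a reflection).
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    diff = P @ R.T - Q
    return float(np.sqrt((diff ** 2).sum(axis=1).mean()))

# Toy data (hypothetical): a rotated and shifted copy of a random "backbone".
rng = np.random.default_rng(0)
coords = rng.normal(size=(20, 3))
theta = 0.7
rot_z = np.array([[np.cos(theta), -np.sin(theta), 0.0],
                  [np.sin(theta),  np.cos(theta), 0.0],
                  [0.0, 0.0, 1.0]])
moved = coords @ rot_z.T + np.array([5.0, -2.0, 1.0])
print(kabsch_rmsd(coords, moved))  # close to zero: same shape, different pose
```

In practice the two arrays would hold matched backbone atoms (for example C-alpha positions) from a predicted model and a reference crystal structure, in the same residue order.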
Maintenance of a Protein Structure in the Dynamic Evolution of TIMPs over 600 Million Years
Nicosia, Aldo; Maggio, Teresa; Costa, Salvatore; Salamone, Monica; Tagliavia, Marcello; Mazzola, Salvatore; Gianguzza, Fabrizio; Cuttitta, Angela
2016-01-01
Deciphering the events leading to protein evolution represents a challenge, especially for protein families showing complex evolutionary history. Among them, TIMPs represent an ancient eukaryotic protein family widely distributed in the animal kingdom. They are known to control the turnover of the extracellular matrix and are considered to arise early during metazoan evolution, arguably tuning essential features of tissue and epithelial organization. To probe the structure and molecular evolution of TIMPs within metazoans, we report the mining and structural characterization of a large data set of TIMPs over approximately 600 Myr. The TIMPs repertoire was explored starting from the Cnidaria phylum, coeval with the origins of connective tissue, to great apes and humans. Despite dramatic sequence differences compared with highest metazoans, the ancestral proteins displayed the canonical TIMP fold. Only small structural changes, represented by an α-helix located in the N-domain, have occurred over the course of evolution. Both the occurrence of such secondary structure elements and the relative solvent accessibility of the corresponding residues in the three-dimensional structures raise the possibility that these sites represent unconserved elements prone to accept variations. PMID:26957029
Koharudin, Leonardus M I; Kollipara, Sireesha; Aiken, Christopher; Gronenborn, Angela M
2012-09-28
Oscillatoria agardhii agglutinin homolog (OAAH) proteins belong to a recently discovered lectin family. All members contain a sequence repeat of ~66 amino acids, with the number of repeats varying among different family members. Apart from data for the founding member OAA, neither three-dimensional structures, information about carbohydrate binding specificities, nor antiviral activity data have been available up to now for any other members of the OAAH family. To elucidate the structural basis for the antiviral mechanism of OAAHs, we determined the crystal structures of Pseudomonas fluorescens and Myxococcus xanthus lectins. Both proteins exhibit the same fold, resembling the founding family member, OAA, with minor differences in loop conformations. Carbohydrate binding studies by NMR and x-ray structures of glycan-lectin complexes reveal that the number of sugar binding sites corresponds to the number of sequence repeats in each protein. As for OAA, tight and specific binding to α3,α6-mannopentaose was observed. All the OAAH proteins described here exhibit potent anti-HIV activity at comparable levels. Altogether, our results provide structural details of the protein-carbohydrate interaction for this novel lectin family and insights into the molecular basis of their HIV inactivation properties.
Grandison, Scott; Roberts, Carl; Morris, Richard J
2009-03-01
Protein structures are not static entities consisting of equally well-determined atomic coordinates. Proteins undergo continuous motion, and as catalytic machines, these movements can be of high relevance for understanding function. In addition to this strong biological motivation for considering shape changes is the necessity to correctly capture different levels of detail and error in protein structures. Some parts of a structural model are often poorly defined, and the atomic displacement parameters provide an excellent means to characterize the confidence in an atom's spatial coordinates. A mathematical framework for studying these shape changes, and handling positional variance is therefore of high importance. We present an approach for capturing various protein structure properties in a concise mathematical framework that allows us to compare features in a highly efficient manner. We demonstrate how three-dimensional Zernike moments can be employed to describe functions, not only on the surface of a protein but throughout the entire molecule. A number of proof-of-principle examples are given which demonstrate how this approach may be used in practice for the representation of movement and uncertainty.
3D structural fluctuation of IgG1 antibody revealed by individual particle electron tomography
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Xing; Zhang, Lei; Tong, Huimin
2015-05-05
Commonly used methods for determining protein structure, including X-ray crystallography and single-particle reconstruction, often provide a single and unique three-dimensional (3D) structure. However, in these methods, the protein dynamics and flexibility/fluctuation remain mostly unknown. Here, we utilized advances in electron tomography (ET) to study the antibody flexibility and fluctuation through structural determination of individual antibody particles rather than averaging multiple antibody particles together. Through individual-particle electron tomography (IPET) 3D reconstruction from negatively-stained ET images, we obtained 120 ab-initio 3D density maps at an intermediate resolution (~1–3 nm) from 120 individual IgG1 antibody particles. Using these maps as a constraint, we derived 120 conformations of the antibody via structural flexible docking of the crystal structure to these maps by targeted molecular dynamics simulations. Statistical analysis of the various conformations disclosed the antibody 3D conformational flexibility through the distribution of its domain distances and orientations. This blueprint approach, if extended to other flexible proteins, may serve as a useful methodology towards understanding protein dynamics and functions.
Protein Denaturation on p-T Axes--Thermodynamics and Analysis.
Smeller, László
2015-01-01
Proteins are essential players in the vast majority of molecular level life processes. Since their structure is in most cases substantial for their correct function, study of their structural changes has attracted great interest in the past decades. The three-dimensional structure of proteins is influenced by several factors including temperature, pH, presence of chaotropic and cosmotropic agents, or presence of denaturants. Although pressure is an equally important thermodynamic parameter as temperature, pressure studies are considerably less frequent in the literature, probably due to the technical difficulties associated with such studies. Although the first steps in high-pressure protein studies were taken 100 years ago with Bridgman's groundbreaking work, the field was silent until modern spectroscopic techniques allowed the characterization of protein structural changes while the protein was under pressure. Recently a number of proteins were studied under pressure, and complete pressure-temperature phase diagrams were determined for several of them. This review summarizes the thermodynamic background of the typical elliptic p-T phase diagram, its limitations and the possible reasons for deviations of the experimental diagrams from the theoretical one. Finally we show some examples of experimentally determined pressure-temperature phase diagrams.
NASA Astrophysics Data System (ADS)
Park, GwangSik; Shin, SeungWoo; Kim, Kyoohyun; Park, YongKeun
2017-02-01
Optical diffraction tomography (ODT) has been an emerging optical technique for label-free imaging of three-dimensional (3-D) refractive index (RI) distribution of biological samples. ODT employs interferometric microscopy for measuring multiple holograms of samples with various incident angles, from which the Fourier diffraction theorem reconstructs the 3-D RI distribution of samples from retrieved complex optical fields. Since the RI value is linearly proportional to the protein concentration of biological samples, where the proportionality coefficient is called the refractive index increment (RII), reconstructed 3-D RI tomograms provide precise structural and biochemical information of individual biological samples. Because most proteins have similar RII values, however, ODT has limited molecular specificity, especially for imaging eukaryotic cells having various types of proteins and subcellular organelles. Here, we present an ODT system combined with structured illumination microscopy which can measure the 3-D RI distribution of biological samples as well as 3-D super-resolution fluorescent images in the same optical setup. A digital micromirror device (DMD) controls the incident angle of the illumination beam for tomogram reconstruction, and the same DMD modulates the structured illumination pattern of the excitation beam for super-resolution fluorescent imaging. We first validate the proposed method for simultaneous optical diffraction tomographic imaging and super-resolution fluorescent imaging of fluorescent beads. The proposed method is also exploited for various biological samples.
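The linear relation between refractive index and protein concentration mentioned above can be written as n = n_medium + RII × C. The sketch below simply inverts that relation. The default RII of 0.19 mL/g and the example RI values are assumptions chosen for illustration; none of them is taken from the abstract.

```python
def protein_concentration(ri_sample, ri_medium, rii=0.19):
    """Protein concentration in g/mL from n = n_medium + rii * C.

    rii is the refractive index increment in mL/g; 0.19 is an assumed,
    commonly quoted ballpark value, not a number from the abstract.
    """
    if rii <= 0:
        raise ValueError("rii must be positive")
    return (ri_sample - ri_medium) / rii

# Hypothetical voxel: RI of 1.375 inside the sample, 1.337 in the medium.
print(protein_concentration(1.375, 1.337))  # concentration in g/mL
```

Applied voxel by voxel to a reconstructed RI tomogram, the same arithmetic would give a concentration map, which is also why proteins with similar RII values cannot be told apart by RI alone.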
Solution structure of the C-terminal domain of Ole e 9, a major allergen of olive pollen
Treviño, Miguel Á.; Palomares, Oscar; Castrillo, Inés; Villalba, Mayte; Rodríguez, Rosalía; Rico, Manuel; Santoro, Jorge; Bruix, Marta
2008-01-01
Ole e 9 is an olive pollen allergen belonging to group 2 of pathogenesis-related proteins. The protein is composed of two immunological independent domains: an N-terminal domain (NtD) with 1,3-β-glucanase activity, and a C-terminal domain (CtD) that binds 1,3-β-glucans. We have determined the three-dimensional structure of CtD-Ole e 9 (101 amino acids), which consists of two parallel α-helices forming an angle of ∼55°, a small antiparallel β-sheet with two short strands, and a 3–10 helix turn, all connected by long coil segments, resembling a novel type of folding among allergens. Two regions surrounded by aromatic residues (F49, Y60, F96, Y91 and Y31, H68, Y65, F78) have been localized on the protein surface, and a role for sugar binding is suggested. The epitope mapping of CtD-Ole e 9 shows that B-cell epitopes are mainly located on loops, although some of them are contained in secondary structural elements. Interestingly, the IgG and IgE epitopes are contiguous or overlapped, rather than coincident. The three-dimensional structure of CtD-Ole e 9 might help to understand the underlying mechanism of its biochemical function and to determine possible structure–allergenicity relationships. PMID:18096638
PACSY, a relational database management system for protein structure and chemical shift analysis
Lee, Woonghee; Yu, Wookyung; Kim, Suhkmann; Chang, Iksoo
2012-01-01
PACSY (Protein structure And Chemical Shift NMR spectroscopY) is a relational database management system that integrates information from the Protein Data Bank, the Biological Magnetic Resonance Data Bank, and the Structural Classification of Proteins database. PACSY provides three-dimensional coordinates and chemical shifts of atoms along with derived information such as torsion angles, solvent accessible surface areas, and hydrophobicity scales. PACSY consists of six relational table types linked to one another for coherence by key identification numbers. Database queries are enabled by advanced search functions supported by an RDBMS server such as MySQL or PostgreSQL. PACSY enables users to search for combinations of information from different database sources in support of their research. Two software packages, PACSY Maker for database creation and PACSY Analyzer for database analysis, are available from http://pacsy.nmrfam.wisc.edu. PMID:22903636
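To make the idea of relational tables linked by key identification numbers concrete, here is a minimal sketch using Python's built-in `sqlite3` module. The table names, column names, and values are hypothetical and do not reproduce PACSY's actual schema or data; the abstract names MySQL and PostgreSQL as supported servers, and SQLite is used here only so the example is self-contained.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Hypothetical tables: one for coordinates, one for chemical shifts,
# linked by a shared entry key and atom number.
cur.execute("""CREATE TABLE coord (
    key_id INTEGER, atom_id INTEGER, res_name TEXT, atom_name TEXT,
    x REAL, y REAL, z REAL)""")
cur.execute("""CREATE TABLE shift (
    key_id INTEGER, atom_id INTEGER, c_shift REAL)""")

# Made-up rows for illustration only.
cur.executemany("INSERT INTO coord VALUES (?, ?, ?, ?, ?, ?, ?)", [
    (1, 1, "ALA", "CA", 1.0, 2.0, 3.0),
    (1, 2, "ALA", "CB", 1.5, 2.5, 3.5),
    (1, 3, "GLY", "CA", 4.0, 5.0, 6.0),
])
cur.executemany("INSERT INTO shift VALUES (?, ?, ?)", [
    (1, 1, 52.5),
    (1, 2, 19.0),
    (1, 3, 45.1),
])

# Combine structural and chemical shift information through the shared keys.
rows = cur.execute("""
    SELECT c.res_name, c.atom_name, s.c_shift
    FROM coord AS c
    JOIN shift AS s ON c.key_id = s.key_id AND c.atom_id = s.atom_id
    WHERE c.atom_name = 'CA'
    ORDER BY c.atom_id
""").fetchall()
print(rows)
conn.close()
```

The join on the shared key columns is the part that corresponds to the linking by key identification numbers described in the abstract.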
Cooperative Subunit Refolding of a Light-Harvesting Protein through a Self-Chaperone Mechanism.
Laos, Alistair J; Dean, Jacob C; Toa, Zi S D; Wilk, Krystyna E; Scholes, Gregory D; Curmi, Paul M G; Thordarson, Pall
2017-07-10
The fold of a protein is encoded by its amino acid sequence, but how complex multimeric proteins fold and assemble into functional quaternary structures remains unclear. Here we show that two structurally different phycobiliproteins refold and reassemble in a cooperative manner from their unfolded polypeptide subunits, without biological chaperones. Refolding was confirmed by ultrafast broadband transient absorption and two-dimensional electronic spectroscopy to probe internal chromophores as a marker of quaternary structure. Our results demonstrate a cooperative, self-chaperone refolding mechanism, whereby the β-subunits independently refold, thereby templating the folding of the α-subunits, which then chaperone the assembly of the native complex, quantitatively returning all coherences. Our results indicate that subunit self-chaperoning is a robust mechanism for heteromeric protein folding and assembly that could also be applied in self-assembled synthetic hierarchical systems. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Hanson, Jack; Paliwal, Kuldip; Litfin, Thomas; Yang, Yuedong; Zhou, Yaoqi
2018-06-19
Accurate prediction of a protein contact map depends greatly on capturing as much contextual information as possible from surrounding residues for a target residue pair. Recently, ultra-deep residual convolutional networks were found to be state-of-the-art in the latest Critical Assessment of Structure Prediction techniques (CASP12, (Schaarschmidt et al., 2018)) for protein contact map prediction by attempting to provide a protein-wide context at each residue pair. Recurrent neural networks have seen great success in recent protein residue classification problems due to their ability to propagate information through long protein sequences, especially Long Short-Term Memory (LSTM) cells. Here we propose a novel protein contact map prediction method by stacking residual convolutional networks with two-dimensional residual bidirectional recurrent LSTM networks, and using both one-dimensional sequence-based and two-dimensional evolutionary coupling-based information. We show that the proposed method achieves a robust performance over validation and independent test sets with the Area Under the receiver operating characteristic Curve (AUC)>0.95 in all tests. When compared to several state-of-the-art methods for independent testing of 228 proteins, the method yields an AUC value of 0.958, whereas the next-best method obtains an AUC of 0.909. More importantly, the improvement is over contacts at all sequence-position separations. Specifically, 8.95%, 5.65% and 2.84% increases in precision were observed for the top L∕10 predictions over the next best for short, medium and long-range contacts, respectively. This confirms the usefulness of ResNets to congregate the short-range relations and 2D-BRLSTM to propagate the long-range dependencies throughout the entire protein contact map 'image'. SPOT-Contact server url: http://sparks-lab.org/jack/server/SPOT-Contact/. Supplementary data is available at Bioinformatics online.
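The "top L/10" precision quoted above can be sketched as follows: rank residue pairs within a sequence-separation window by predicted score, keep the best L/10, and count the fraction that are true contacts. The function name and the toy matrices are assumptions. A long-range window of |i − j| ≥ 24 is a common convention, but the abstract does not state the exact thresholds used.

```python
import numpy as np

def top_precision(pred, true, min_sep, max_sep=None, frac=10):
    """Precision of the top L/frac predicted pairs with min_sep <= j - i < max_sep."""
    L = pred.shape[0]
    i, j = np.triu_indices(L, k=min_sep)       # pairs with j - i >= min_sep
    if max_sep is not None:
        keep = (j - i) < max_sep
        i, j = i[keep], j[keep]
    if i.size == 0:
        raise ValueError("no residue pairs in the requested separation range")
    k = max(1, L // frac)
    # Highest-scoring pairs first.
    order = np.argsort(-pred[i, j], kind="stable")[:k]
    return float(np.mean(true[i[order], j[order]]))

# Toy example (hypothetical): a 40-residue protein whose only contacts
# are the ten pairs at separation 30.
L = 40
true = np.zeros((L, L), dtype=int)
for a in range(L - 30):
    true[a, a + 30] = true[a + 30, a] = 1

perfect = true.astype(float)        # a predictor that scores contacts highest
inverted = 1.0 - perfect            # a predictor that scores them lowest
print(top_precision(perfect, true, min_sep=24))
print(top_precision(inverted, true, min_sep=24))
```

Short- and medium-range precision would be obtained the same way by passing both `min_sep` and `max_sep`.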
Huenges, M; Rölz, C; Gschwind, R; Peteranderl, R; Berglechner, F; Richter, G; Bacher, A; Kessler, H; Gemmecker, G
1998-01-01
The NusB protein of Escherichia coli is involved in the regulation of rRNA biosynthesis by transcriptional antitermination. In cooperation with several other proteins, it binds to a dodecamer motif designated rrn boxA on the nascent rRNA. The antitermination proteins of E.coli are recruited in the replication cycle of bacteriophage lambda, where they play an important role in switching from the lysogenic to the lytic cycle. Multidimensional heteronuclear NMR experiments were performed with recombinant NusB protein labelled with 13C, 15N and 2H. The three-dimensional structure of the protein was solved from 1926 NMR-derived distances and 80 torsion angle restraints. The protein folds into an alpha/alpha-helical topology consisting of six helices; the arginine-rich N-terminus appears to be disordered. Complexation of the protein with an RNA dodecamer equivalent to the rrn boxA site results in chemical shift changes of numerous amide signals. The overall packing of the protein appears to be conserved, but the flexible N-terminus adopts a more rigid structure upon RNA binding, indicating that the N-terminus functions as an arginine-rich RNA-binding motif (ARM). PMID:9670024
A minimalist model protein with multiple folding funnels
Locker, C. Rebecca; Hernandez, Rigoberto
2001-01-01
Kinetic and structural studies of wild-type proteins such as prions and amyloidogenic proteins provide suggestive evidence that proteins may adopt multiple long-lived states in addition to the native state. All of these states differ structurally because they lie far apart in configuration space, but their stability is not necessarily caused by cooperative (nucleation) effects. In this study, a minimalist model protein is designed to exhibit multiple long-lived states to explore the dynamics of the corresponding wild-type proteins. The minimalist protein is modeled as a 27-monomer sequence confined to a cubic lattice with three different monomer types. An order parameter—the winding index—is introduced to characterize the extent of folding. The winding index has several advantages over other commonly used order parameters like the number of native contacts. It can distinguish between enantiomers, its calculation requires less computational time than the number of native contacts, and reduced-dimensional landscapes can be developed when the native state structure is not known a priori. The results for the designed model protein prove by existence that the rugged energy landscape picture of protein folding can be generalized to include protein “misfolding” into long-lived states. PMID:11470921
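For comparison with the winding index, the conventional order parameter mentioned above, the number of native contacts, can be sketched for a 27-monomer chain on a cubic lattice. The abstract does not define the winding index, so it is not attempted here. The helper names and the compact "snake" conformation used as a stand-in native state are assumptions.

```python
def contact_set(path):
    """Set of non-bonded nearest-neighbour pairs (k, m), k < m, of a lattice chain."""
    if len(set(path)) != len(path):
        raise ValueError("chain is not self-avoiding")
    index = {site: k for k, site in enumerate(path)}
    contacts = set()
    for k, (x, y, z) in enumerate(path):
        # Looking only in the +x, +y, +z directions visits each pair once.
        for dx, dy, dz in ((1, 0, 0), (0, 1, 0), (0, 0, 1)):
            m = index.get((x + dx, y + dy, z + dz))
            if m is not None and abs(m - k) > 1:   # skip chain neighbours
                contacts.add((min(k, m), max(k, m)))
    return contacts

def native_contacts(path, native):
    """Number of contacts of `path` that are also present in `native`."""
    return len(contact_set(path) & contact_set(native))

def snake_cube(n=3):
    """A compact n x n x n conformation traced as a back-and-forth snake."""
    path = []
    for z in range(n):
        ys = range(n) if z % 2 == 0 else range(n - 1, -1, -1)
        for yi, y in enumerate(ys):
            row = z * n + yi
            xs = range(n) if row % 2 == 0 else range(n - 1, -1, -1)
            path.extend((x, y, z) for x in xs)
    return path

native = snake_cube(3)                       # hypothetical native state
extended = [(k, 0, 0) for k in range(27)]    # fully stretched chain
print(native_contacts(native, native), native_contacts(extended, native))
```

Counting contacts requires knowing the native structure in advance, which is one of the limitations the abstract says the winding index avoids.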
Crystallization, structure and dynamics of the proton-translocating P-type ATPase.
Scarborough, G A
2000-01-01
Large single three-dimensional crystals of the dodecylmaltoside complex of the Neurospora crassa plasma membrane H(+)-ATPase (H(+) P-ATPase) can be grown in polyethylene-glycol-containing solutions optimized for moderate supersaturation of both the protein surfaces and detergent micellar region. Large two-dimensional H(+) P-ATPase crystals also grow on the surface of such mixtures and on carbon films located at such surfaces. Electron crystallographic analysis of the two-dimensional crystals grown on carbon films has recently elucidated the structure of the H(+) P-ATPase at a resolution of 0.8 nm in the membrane plane. The two-dimensional crystals comprise two offset layers of ring-shaped ATPase hexamers with their exocytoplasmic surfaces face to face. Side-to-side interactions between the cytoplasmic regions of the hexamers in each layer can be seen, and an interaction between identical exocytoplasmic loops in opposing hexamer layers holds the two layers together. Detergent rings around the membrane-embedded region of the hexamers are clearly visible, and detergent-detergent interactions between the rings are also apparent. The crystal packing forces thus comprise both protein-protein and detergent-detergent interactions, supporting the validity of the original crystallization strategy. Ten transmembrane helices in each ATPase monomer are well-defined in the structure map. They are all relatively straight, closely packed, moderately tilted at various angles with respect to a plane normal to the membrane surface and average approximately 3.5 nm in length. The transmembrane helix region is connected in at least three places to the larger cytoplasmic region, which comprises several discrete domains separated by relatively wide, deep clefts. Previous work has shown that the H(+) P-ATPase undergoes substantial conformational changes during its catalytic cycle that are not changes in secondary structure. 
Importantly, the results of hydrogen/deuterium exchange experiments indicate that these conformational changes are probably rigid-body interdomain movements that lead to cleft closure. When interpreted within the framework of established principles of enzyme catalysis, this information on the structure and dynamics of the H(+) P-ATPase molecule provides the basis of a rational model for the sequence of events that occurs as the ATPase proceeds through its transport cycle. The forces that drive the sequence can also be clearly stipulated. However, an understanding of the molecular mechanism of ion transport catalyzed by the H(+) P-ATPase awaits an atomic resolution structure.
Veluraja, Kasinadar; Selvin, Jeyasigamani F A; Venkateshwari, Selvakumar; Priyadarzini, Thanu R K
2010-09-23
The inherent flexibility and lack of strong intramolecular interactions of oligosaccharides demand the use of theoretical methods for their structural elucidation. In spite of the developments of theoretical methods, little glycoinformatics research has been done so far compared with bioinformatics research on proteins and nucleic acids. We have developed a three-dimensional structural database for sialic acid-containing carbohydrates (3DSDSCAR). This is an open-access database that provides 3D structural models of a given sialic acid-containing carbohydrate. At present, 3DSDSCAR contains 60 conformational models, belonging to 14 different sialic acid-containing carbohydrates, deduced through 10 ns molecular dynamics (MD) simulations. The database is available at the URL: http://www.3dsdscar.org. Copyright 2010 Elsevier Ltd. All rights reserved.
Joseph, Agnel Praveen; Srinivasan, Narayanaswamy; de Brevern, Alexandre G
2012-09-01
Comparison of multiple protein structures has a broad range of applications in the analysis of protein structure, function and evolution. Multiple structure alignment tools (MSTAs) are necessary to obtain a simultaneous comparison of a family of related folds. In this study, we have developed a method for multiple structure comparison largely based on sequence alignment techniques. A widely used Structural Alphabet named Protein Blocks (PBs) was used to transform the information on 3D protein backbone conformation into a 1D sequence string. A progressive alignment strategy similar to CLUSTALW was adopted for multiple PB sequence alignment (mulPBA). Highly similar stretches identified by the pairwise alignments are given higher weights during the alignment. The residue equivalences from PB based alignments are used to obtain a three-dimensional fit of the structures followed by an iterative refinement of the structural superposition. Systematic comparisons using benchmark datasets of MSTAs underline that the alignment quality is better than MULTIPROT, MUSTANG and the alignments in HOMSTRAD, in more than 85% of the cases. Comparison with other rigid-body and flexible MSTAs also indicates that mulPBA alignments are superior to most of the rigid-body MSTAs and highly comparable to the flexible alignment methods. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
Phosphorylation of the budgerigar fledgling disease virus major capsid protein VP1
NASA Technical Reports Server (NTRS)
Haynes, J. I. 2nd; Consigli, R. A.; Spooner, B. S. (Principal Investigator)
1992-01-01
The structural proteins of the budgerigar fledgling disease virus, the first known nonmammalian polyomavirus, were analyzed by isoelectric focusing and sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE). The major capsid protein VP1 was found to be composed of at least five distinct species having isoelectric points ranging from pH 6.45 to 5.85. By analogy with the murine polyomavirus, these species apparently result from different modifications of an initial translation product. Primary chicken embryo cells were infected in the presence of 32Pi to determine whether the virus structural proteins were modified by phosphorylation. SDS-PAGE of the purified virus structural proteins demonstrated that VP1 (along with both minor capsid proteins) was phosphorylated. Two-dimensional analysis of the radiolabeled virus showed phosphorylation of only the two most acidic isoelectric species of VP1, indicating that this posttranslational modification contributes to VP1 species heterogeneity. Phosphoamino acid analysis of 32P-labeled VP1 revealed that phosphoserine is the only phosphoamino acid present in the VP1 protein.
Laaser, Jennifer E.; Skoff, David R.; Ho, Jia-Jung; Joo, Yongho; Serrano, Arnaldo L.; Steinkruger, Jay D.; Gopalan, Padma; Gellman, Samuel H.; Zanni, Martin T.
2014-01-01
Surface-bound polypeptides and proteins are increasingly used to functionalize inorganic interfaces such as electrodes, but their structural characterization is exceedingly difficult with standard technologies. In this paper, we report the first two-dimensional sum-frequency generation (2D SFG) spectra of a peptide monolayer, which is collected by adding a mid-IR pulse shaper to a standard femtosecond SFG spectrometer. On a gold surface, standard FTIR spectroscopy is inconclusive about the peptide structure because of solvation-induced frequency shifts, but the 2D lineshapes, anharmonic shifts, and lifetimes obtained from 2D SFG reveal that the peptide is largely α-helical and upright. Random coil residues are also observed, which do not themselves appear in SFG spectra due to their isotropic structural distribution, but which still absorb infrared light and so can be detected by cross-peaks in 2D SFG spectra. We discuss these results in the context of peptide design. Because of the similar way in which the spectra are collected, these 2D SFG spectra can be directly compared to 2D IR spectra, thereby enabling structural interpretations of surface-bound peptides and biomolecules based on the well-studied structure/2D IR spectra relationships established from soluble proteins. PMID:24372101
Uncluttered Single-Image Visualization of Vascular Structures using GPU and Integer Programming
Won, Joong-Ho; Jeon, Yongkweon; Rosenberg, Jarrett; Yoon, Sungroh; Rubin, Geoffrey D.; Napel, Sandy
2013-01-01
Direct projection of three-dimensional branching structures, such as networks of cables, blood vessels, or neurons onto a 2D image creates the illusion of intersecting structural parts and creates challenges for understanding and communication. We present a method for visualizing such structures, and demonstrate its utility in visualizing the abdominal aorta and its branches, whose tomographic images might be obtained by computed tomography or magnetic resonance angiography, in a single two-dimensional stylistic image, without overlaps among branches. The visualization method, termed uncluttered single-image visualization (USIV), involves optimization of geometry. This paper proposes a novel optimization technique that utilizes an interesting connection of the optimization problem regarding USIV to the protein structure prediction problem. Adopting the integer linear programming-based formulation for the protein structure prediction problem, we tested the proposed technique using 30 visualizations produced from five patient scans with representative anatomical variants in the abdominal aortic vessel tree. The novel technique can exploit commodity-level parallelism, enabling use of general-purpose graphics processing unit (GPGPU) technology that yields a significant speedup. Comparison of the results with the other optimization technique previously reported elsewhere suggests that, in most aspects, the quality of the visualization is comparable to that of the previous one, with a significant gain in the computation time of the algorithm. PMID:22291148
Accurate Prediction of Contact Numbers for Multi-Spanning Helical Membrane Proteins
Li, Bian; Mendenhall, Jeffrey; Nguyen, Elizabeth Dong; Weiner, Brian E.; Fischer, Axel W.; Meiler, Jens
2017-01-01
Prediction of the three-dimensional (3D) structures of proteins by computational methods is acknowledged as an unsolved problem. Accurate prediction of important structural characteristics such as contact number is expected to accelerate the otherwise slow progress being made in the prediction of 3D structure of proteins. Here, we present a dropout neural network-based method, TMH-Expo, for predicting the contact number of transmembrane helix (TMH) residues from sequence. Neuronal dropout is a strategy where certain neurons of the network are excluded from back-propagation to prevent co-adaptation of hidden-layer neurons. By using neuronal dropout, overfitting was significantly reduced and performance was noticeably improved. For multi-spanning helical membrane proteins, TMH-Expo achieved a remarkable Pearson correlation coefficient of 0.69 between predicted and experimental values and a mean absolute error of only 1.68. In addition, among those membrane protein–membrane protein interface residues, 76.8% were correctly predicted. Mapping of predicted contact numbers onto structures indicates that contact numbers predicted by TMH-Expo reflect the exposure patterns of TMHs and reveal membrane protein–membrane protein interfaces, reinforcing the potential of predicted contact numbers to be used as restraints for 3D structure prediction and protein–protein docking. TMH-Expo can be accessed via a Web server at www.meilerlab.org. PMID:26804342
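The evaluation quantities quoted above can be illustrated with a short sketch: a contact number computed from coordinates, and the Pearson correlation and mean absolute error between predicted and reference values. The cutoff-based definition of contact number, the 10 Å cutoff, and the toy data are simplifying assumptions; the exact definition used by TMH-Expo is not given in the abstract.

```python
import numpy as np

def contact_numbers(coords, cutoff=10.0):
    """Per-residue count of other residues within `cutoff` (assumed definition)."""
    coords = np.asarray(coords, dtype=float)
    dist = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)
    return (dist < cutoff).sum(axis=1) - 1      # subtract the residue itself

def pearson_and_mae(predicted, observed):
    """Pearson correlation coefficient and mean absolute error."""
    predicted = np.asarray(predicted, dtype=float)
    observed = np.asarray(observed, dtype=float)
    r = float(np.corrcoef(predicted, observed)[0, 1])
    mae = float(np.mean(np.abs(predicted - observed)))
    return r, mae

# Toy chain (hypothetical): five residues on a line, 6 Å apart.
toy = [(6.0 * k, 0.0, 0.0) for k in range(5)]
observed = contact_numbers(toy)
predicted = observed + np.array([0.5, -0.5, 0.5, -0.5, 0.5])  # made-up predictions
print(observed, pearson_and_mae(predicted, observed))
```

In a real evaluation `observed` would come from experimental structures and `predicted` from the network output for the same residues.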
Liu, Mengjie; Duan, Liangwei; Wang, Meifang; Zeng, Hongmei; Liu, Xinqi; Qiu, Dewen
2016-01-01
The protein elicitor MoHrip2, which was extracted from Magnaporthe oryzae as an exocrine protein, triggers the tobacco immune system and enhances blast resistance in rice. However, the detailed mechanisms by which MoHrip2 acts as an elicitor remain unclear. Here, we investigated the structure of MoHrip2 to elucidate its functions based on molecular structure. The three-dimensional structure of MoHrip2 was obtained. Overall, the crystal structure formed a β-barrel structure and showed high similarity to the pathogenesis-related (PR) thaumatin superfamily protein thaumatin-like xylanase inhibitor (TL-XI). To investigate the functional regions responsible for MoHrip2 elicitor activities, the full-length protein and eight truncated proteins were expressed in Escherichia coli and were evaluated for elicitor activity in tobacco. Biological function analysis showed that MoHrip2 triggered the defense system against Botrytis cinerea in tobacco. Moreover, only MoHrip2M14 and other fragments containing the 14 amino acid residues in the middle region of the protein showed the elicitor activity of inducing a hypersensitive response and resistance-related pathways, similar to that of full-length MoHrip2. These results revealed that the central 14 amino acid residues were essential for anti-pathogenic activity.
Cocco, Simona; Monasson, Remi; Weigt, Martin
2013-01-01
Various approaches have explored the covariation of residues in multiple-sequence alignments of homologous proteins to extract functional and structural information. Among those are principal component analysis (PCA), which identifies the most correlated groups of residues, and direct coupling analysis (DCA), a global inference method based on the maximum entropy principle, which aims at predicting residue-residue contacts. In this paper, inspired by the statistical physics of disordered systems, we introduce the Hopfield-Potts model to naturally interpolate between these two approaches. The Hopfield-Potts model allows us to identify relevant ‘patterns’ of residues from the knowledge of the eigenmodes and eigenvalues of the residue-residue correlation matrix. We show how the computation of such statistical patterns makes it possible to accurately predict residue-residue contacts with a much smaller number of parameters than DCA. This dimensional reduction allows us to avoid overfitting and to extract contact information from multiple-sequence alignments of reduced size. In addition, we show that low-eigenvalue correlation modes, discarded by PCA, are important to recover structural information: the corresponding patterns are highly localized, that is, they are concentrated in few sites, which we find to be in close contact in the three-dimensional protein fold. PMID:23990764
Zhang, Honghu; Liu, Xunpei; Feng, Shuren; ...
2015-02-10
In this study, magnetotactic bacteria that produce magnetic nanocrystals of uniform size and well-defined morphologies have inspired the use of biomineralization protein Mms6 to promote formation of uniform magnetic nanocrystals in vitro. Small angle X-ray scattering (SAXS) studies in physiological solutions reveal that Mms6 forms compact globular three-dimensional (3D) micelles (approximately 10 nm in diameter) that are, to a large extent, independent of concentration. In the presence of iron ions in the solutions, the general micellar morphology is preserved, however, with associations among micelles that are induced by iron ions. Compared with Mms6, the m2Mms6 mutant (with the sequence of hydroxyl/carboxyl containing residues in the C-terminal domain shuffled) exhibits subtle morphological changes in the presence of iron ions in solutions. The analysis of the SAXS data is consistent with a hierarchical core–corona micellar structure similar to that found in amphiphilic polymers. The addition of ferric and ferrous iron ions to the protein solution induces morphological changes in the micellar structure by transforming the 3D micelles into objects of reduced dimensionality of 2, with fractal-like characteristics (including Gaussian-chain-like) or, alternatively, platelet-like structures.
Nature of the protein universe
Levitt, Michael
2009-01-01
The protein universe is the set of all proteins of all organisms. Here, all currently known sequences are analyzed in terms of families that have single-domain or multidomain architectures and whether they have a known three-dimensional structure. Growth of new single-domain families is very slow: Almost all growth comes from new multidomain architectures that are combinations of domains characterized by ≈15,000 sequence profiles. Single-domain families are mostly shared by the major groups of organisms, whereas multidomain architectures are specific and account for species diversity. There are known structures for a quarter of the single-domain families, and >70% of all sequences can be partially modeled thanks to their membership in these families. PMID:19541617
Dal Palù, Alessandro; Dovier, Agostino; Pontelli, Enrico
2010-01-01
Crystal lattices are discrete models of the three-dimensional space that have been effectively employed to facilitate the task of determining proteins' natural conformation. This paper investigates alternative global constraints that can be introduced in a constraint solver over discrete crystal lattices. The objective is to enhance the efficiency of lattice solvers in dealing with the construction of approximate solutions of the protein structure determination problem. Some of them (e.g., self-avoiding-walk) have been explicitly or implicitly already used in previous approaches, while others (e.g., the density constraint) are new. The intrinsic complexities of all of them are studied and preliminary experimental results are discussed.
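The self-avoiding-walk constraint mentioned in the abstract above can be illustrated with a small checker. This is a minimal sketch that assumes a simple cubic lattice with unit steps along one axis; the paper works with crystal lattices in a constraint solver, so the lattice choice and the function name here are assumptions made only for illustration.

```python
def is_self_avoiding_walk(points):
    """Check a chain of integer 3D lattice points.

    Returns True when every consecutive pair is one unit step apart along a
    single axis (simple cubic lattice) and no lattice point is visited twice.
    """
    seen = set()
    previous = None
    for point in points:
        if point in seen:
            return False  # the chain revisits a lattice site
        if previous is not None:
            # On an integer grid, Manhattan distance 1 means one unit step
            # along exactly one axis.
            step = sum(abs(a - b) for a, b in zip(point, previous))
            if step != 1:
                return False  # consecutive residues are not lattice neighbours
        seen.add(point)
        previous = point
    return True


print(is_self_avoiding_walk([(0, 0, 0), (1, 0, 0), (1, 1, 0), (0, 1, 0)]))
```

A constraint solver would propagate this condition incrementally while placing residues rather than checking a finished chain, but the condition being enforced is the same.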
Membrane protein structure determination — The next generation
Moraes, Isabel; Evans, Gwyndaf; Sanchez-Weatherby, Juan; Newstead, Simon; Stewart, Patrick D. Shaw
2014-01-01
The field of Membrane Protein Structural Biology has grown significantly since its first landmark in 1985 with the first three-dimensional atomic resolution structure of a membrane protein. Nearly twenty-six years later, the crystal structure of the beta2 adrenergic receptor in complex with G protein has contributed to another landmark in the field leading to the 2012 Nobel Prize in Chemistry. At present, more than 350 unique membrane protein structures solved by X-ray crystallography (http://blanco.biomol.uci.edu/mpstruc/exp/list, Stephen White Lab at UC Irvine) are available in the Protein Data Bank. The advent of genomics and proteomics initiatives combined with high-throughput technologies, such as automation, miniaturization, integration and third-generation synchrotrons, has enhanced membrane protein structure determination rate. X-ray crystallography is still the only method capable of providing detailed information on how ligands, cofactors, and ions interact with proteins, and is therefore a powerful tool in biochemistry and drug discovery. Yet the growth of membrane protein crystals suitable for X-ray diffraction studies amazingly remains a fine art and a major bottleneck in the field. It is often necessary to apply as many innovative approaches as possible. In this review we draw attention to the latest methods and strategies for the production of suitable crystals for membrane protein structure determination. In addition we also highlight the impact that third-generation synchrotron radiation has made in the field, summarizing the latest strategies used at synchrotron beamlines for screening and data collection from such demanding crystals. This article is part of a Special Issue entitled: Structural and biophysical characterisation of membrane protein-ligand binding. PMID:23860256
Modularity of Protein Folds as a Tool for Template-Free Modeling of Structures.
Vallat, Brinda; Madrid-Aliste, Carlos; Fiser, Andras
2015-08-01
Predicting the three-dimensional structure of proteins from their amino acid sequences remains a challenging problem in molecular biology. While the current structural coverage of proteins is almost exclusively provided by template-based techniques, the modeling of the rest of the protein sequences increasingly requires template-free methods. However, template-free modeling methods are much less reliable and are usually applicable only to smaller proteins, leaving much room for improvement. We present here a novel computational method that uses a library of supersecondary structure fragments, known as Smotifs, to model protein structures. The library of Smotifs has saturated over time, providing a theoretical foundation for efficient modeling. The method relies on weak sequence signals from remotely related protein structures to create a library of Smotif fragments specific to the target protein sequence. This Smotif library is exploited in a fragment assembly protocol to sample decoys, which are assessed by a composite scoring function. Since the Smotif fragments are larger than the ones used in other fragment-based methods, the proposed modeling algorithm, SmotifTF, can employ exhaustive sampling during decoy assembly. SmotifTF successfully predicts the overall fold of the target proteins in about 50% of the test cases and performs competitively when compared to other state-of-the-art prediction methods, especially when the sequence signal to remote homologs is diminishing. Smotif-based modeling is complementary to current prediction methods and provides a promising direction in addressing the structure prediction problem, especially when targeting larger proteins for modeling.
Thoden, James B; Holden, Hazel M
2014-06-01
Unusual di- and trideoxysugars are often found on the O-antigens of Gram-negative bacteria, on the S-layers of Gram-positive bacteria, and on various natural products. One such sugar is 3-acetamido-3,6-dideoxy-D-glucose. A key step in its biosynthesis, catalyzed by a 3,4-ketoisomerase, is the conversion of thymidine diphosphate (dTDP)-4-keto-6-deoxyglucose to dTDP-3-keto-6-deoxyglucose. Here we report an X-ray analysis of a 3,4-ketoisomerase from Thermoanaerobacterium thermosaccharolyticum. For this investigation, the wild-type enzyme, referred to as QdtA, was crystallized in the presence of dTDP and its structure solved to 2.0-Å resolution. The dimeric enzyme adopts a three-dimensional architecture that is characteristic for proteins belonging to the cupin superfamily. In order to trap the dTDP-4-keto-6-deoxyglucose substrate into the active site, a mutant protein, H51N, was subsequently constructed, and the structure of this protein in complex with the dTDP-sugar ligand was solved to 1.9-Å resolution. Taken together, the structures suggest that His 51 serves as a catalytic base, that Tyr 37 likely functions as a catalytic acid, and that His 53 provides a proton shuttle between the C-3' hydroxyl and the C-4' keto group of the hexose. This study reports the first three-dimensional structure of a 3,4-ketoisomerase in complex with its dTDP-sugar substrate and thus sheds new molecular insight into this fascinating class of enzymes. © 2014 The Protein Society.
Probing Protein Fold Space with a Simplified Model
Minary, Peter; Levitt, Michael
2008-01-01
We probe the stability and near-native energy landscape of protein fold space using powerful conformational sampling methods together with simple reduced models and statistical potentials. Fold space is represented by a set of 280 protein domains spanning all topological classes and having a wide range of lengths (0-300 residues), amino acid composition, and number of secondary structural elements. The degrees of freedom are taken as the loop torsion angles. This choice preserves the native secondary structure but allows the tertiary structure to change. The proteins are represented by three-point per residue, three-dimensional models with statistical potentials derived from a knowledge-based study of known protein structures. When this space is sampled by a combination of Parallel Tempering and Equi-Energy Monte Carlo, we find that the three-point model captures the known stability of protein native structures with stable energy basins that are near-native (all-α: 4.77 Å, all-β: 2.93 Å, α/β: 3.09 Å, α+β: 4.89 Å on average and within 6 Å for 71.41 %, 92.85 %, 94.29 % and 64.28 % for all-α, all-β, α/β and α+β, classes respectively). Denatured structures also occur and these have interesting structural properties that shed light on the different landscape characteristics of α and β folds. We find that α/β proteins with alternating α and β segments (such as the beta-barrel) are more stable than proteins in other fold classes. PMID:18054792
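The abstract above samples conformations with Parallel Tempering among other methods. As a rough illustration of that technique only, the sketch below gives the standard Metropolis acceptance probability for exchanging configurations between two replicas held at different inverse temperatures. The function and parameter names are assumptions, and the Equi-Energy moves and the authors' actual implementation are not represented.

```python
import math


def swap_acceptance(beta_i, beta_j, energy_i, energy_j):
    """Probability of accepting a replica exchange in parallel tempering.

    Uses the standard criterion min(1, exp((beta_i - beta_j) * (E_i - E_j))),
    where beta = 1 / (k_B * T) for each replica and E is its current energy.
    """
    exponent = (beta_i - beta_j) * (energy_i - energy_j)
    if exponent >= 0.0:
        return 1.0  # the swap is always accepted
    return math.exp(exponent)


# Cold replica (beta = 1.0) currently holds the higher-energy configuration,
# so handing it to the hot replica (beta = 0.5) is always accepted.
print(swap_acceptance(1.0, 0.5, 2.0, 1.0))
# The reverse situation is accepted only with probability exp(-0.5).
print(swap_acceptance(1.0, 0.5, 1.0, 2.0))
```

Exchanges of this kind let low-temperature replicas escape local energy basins by borrowing configurations found at higher temperatures.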
NASA Astrophysics Data System (ADS)
Hong, Mei
1999-08-01
We describe an approach to efficiently determine the backbone conformation of solid proteins that utilizes selective and extensive 13C labeling in conjunction with two-dimensional magic-angle-spinning NMR. The selective 13C labeling approach aims to reduce line broadening and other multispin complications encountered in solid-state NMR of uniformly labeled proteins while still enhancing the sensitivity of NMR spectra. It is achieved by using specifically labeled glucose or glycerol as the sole carbon source in the protein expression medium. For amino acids synthesized in the linear part of the biosynthetic pathways, [1-13C]glucose preferentially labels the ends of the side chains, while [2-13C]glycerol labels the Cα of these residues. Amino acids produced from the citric-acid cycle are labeled in a more complex manner. Information on the secondary structure of such a labeled protein was obtained by measuring multiple backbone torsion angles φ simultaneously, using an isotropic-anisotropic 2D correlation technique, the HNCH experiment. Initial experiments for resonance assignment of a selectively 13C labeled protein were performed using 15N-13C 2D correlation spectroscopy. From the time dependence of the 15N-13C dipolar coherence transfer, both intraresidue and interresidue connectivities can be observed, thus yielding partial sequential assignment. We demonstrate the selective 13C labeling and these 2D NMR experiments on an 8.5-kDa model protein, ubiquitin. This isotope-edited NMR approach is expected to facilitate the structure determination of proteins in the solid state.
Solution structure of the strawberry allergen Fra a 1
Seutter von Loetzen, Christian; Schweimer, Kristian; Schwab, Wilfried; Rösch, Paul; Hartl-Spiegelhauer, Olivia
2012-01-01
The PR10 family protein Fra a 1E from strawberry (Fragaria x ananassa) is down-regulated in white strawberry mutants, and transient RNAi (RNA interference)-mediated silencing experiments confirmed that Fra a 1 is involved in fruit pigment synthesis. In the present study, we determined the solution structure of Fra a 1E. The protein fold is identical with that of other members of the PR10 protein family and consists of a seven-stranded antiparallel β-sheet, two short V-shaped α-helices and a long C-terminal α-helix that encompass a hydrophobic pocket. Whereas Fra a 1E contains the glycine-rich loop that is highly conserved throughout the protein family, the volume of the hydrophobic pocket and the size of its entrance are much larger than expected. The three-dimensional structure may shed some light on its physiological function and may help to further understand the role of PR10 proteins in plants. PMID:22913709
Lead discovery and in silico 3D structure modeling of tumorigenic FAM72A (p17).
Pramanik, Subrata; Kutzner, Arne; Heese, Klaus
2015-01-01
FAM72A (p17) is a novel neuronal protein that has been linked to tumorigenic effects in non-neuronal tissue. Using state of the art in silico physicochemical analyses (e.g., I-TASSER, RaptorX, and Modeller), we determined the three-dimensional (3D) protein structure of FAM72A and further identified potential ligand-protein interactions. Our data indicate a Zn(2+)/Fe(3+)-containing 3D protein structure, based on a 3GA3_A model template, which potentially interacts with the organic molecule RSM ((2s)-2-(acetylamino)-N-methyl-4-[(R)-methylsulfinyl] butanamide). The discovery of RSM may serve as potential lead for further anti-FAM72A drug screening tests in the pharmaceutical industry because interference with FAM72A's activities via RSM-related molecules might be a novel option to influence the tumor suppressor protein p53 signaling pathways for the treatment of various types of cancers.
The Physics of Amyloid Aggregation and Templating in Prions
NASA Astrophysics Data System (ADS)
Cox, Daniel
2012-02-01
The problem of self-assembled amyloid aggregation of proteins in structures with beta-strands perpendicular to a one-dimensional growth axis is interesting at a fundamental level (is this the most generic end state of proteins?), from a biological level (if the self-assembly can be regulated it is of use in contexts like spider silk and bacterial colony formation), for human public health (unregulated aggregation induces diseases like mad cow and Alzheimer's), and for possible materials applications (e.g., in tissue scaffolding). In this presentation, I will review the work of my group in examining the possibility that the left-handed beta helix (LHBH) structure can be the building block of the aggregates of mammalian prion and yeast prion proteins. I will also discuss our efforts to assess the possibility of a novel pH-driven structural switch between LHBH and alpha-helical forms in the ordered half of the mammalian prion protein, and how the possibly pH-stabilized LHBH structure can template aggregate growth of the disordered half of the protein, identified in numerous experimental studies as most relevant to disease.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Soriano, Erika V.; McCloskey, Diane E.; Kinsland, Cynthia
2008-04-01
The crystal structures of two arginine decarboxylase mutant proteins provide insights into the mechanisms of pyruvoyl-group formation and the decarboxylation reaction. Pyruvoyl-dependent arginine decarboxylase (PvlArgDC) catalyzes the first step of the polyamine-biosynthetic pathway in plants and some archaebacteria. The pyruvoyl group of PvlArgDC is generated by an internal autoserinolysis reaction at an absolutely conserved serine residue in the proenzyme, resulting in two polypeptide chains. Based on the native structure of PvlArgDC from Methanococcus jannaschii, the conserved residues Asn47 and Glu109 were proposed to be involved in the decarboxylation and autoprocessing reactions. N47A and E109Q mutant proteins were prepared and the three-dimensional structure of each protein was determined at 2.0 Å resolution. The N47A and E109Q mutant proteins showed reduced decarboxylation activity compared with the wild-type PvlArgDC. These residues may also be important for the autoprocessing reaction, which utilizes a mechanism similar to that of the decarboxylation reaction.
The neuronal porosome complex in health and disease
Naik, Akshata R; Lewis, Kenneth T
2015-01-01
Cup-shaped secretory portals at the cell plasma membrane called porosomes mediate the precision release of intravesicular material from cells. Membrane-bound secretory vesicles transiently dock and fuse at the base of porosomes facing the cytosol to expel pressurized intravesicular contents from the cell during secretion. The structure, isolation, composition, and functional reconstitution of the neuronal porosome complex have greatly progressed, providing a molecular understanding of its function in health and disease. Neuronal porosomes are 15 nm cup-shaped lipoprotein structures composed of nearly 40 proteins, compared to the 120 nm nuclear pore complex composed of >500 protein molecules. Membrane proteins compose the porosome complex, making it practically impossible to solve its atomic structure. However, atomic force microscopy and small-angle X-ray solution scattering studies have provided three-dimensional structural details of the native neuronal porosome at sub-nanometer resolution, providing insights into the molecular mechanism of its function. The participation of several porosome proteins previously implicated in neurotransmission and neurological disorders, further attest to the crosstalk between porosome proteins and their coordinated involvement in release of neurotransmitter at the synapse. PMID:26264442
Brown, Peter; Pullan, Wayne; Yang, Yuedong; Zhou, Yaoqi
2016-02-01
The three-dimensional tertiary structure of a protein at near-atomic resolution provides insight into its function and evolution. Because a protein's structure determines its function, similarity in structure usually implies similarity in function. As such, structure alignment techniques are often useful in the classification of protein function. Given the rapidly growing rate of new, experimentally determined structures being made available from repositories such as the Protein Data Bank, fast and accurate computational structure comparison tools are required. This paper presents SPalignNS, a non-sequential protein structure alignment tool using a novel asymmetrical greedy search technique. The performance of SPalignNS was evaluated against existing sequential and non-sequential structure alignment methods by performing trials with commonly used datasets. These benchmark datasets used to gauge alignment accuracy include (i) 9538 pairwise alignments implied by the HOMSTRAD database of homologous proteins; (ii) a subset of 64 difficult alignments from set (i) that have low structure similarity; (iii) 199 pairwise alignments of proteins with similar structure but different topology; and (iv) a subset of 20 pairwise alignments from the RIPC set. SPalignNS is shown to achieve greater alignment accuracy (lower or comparable root-mean-squared distance with increased structure overlap coverage) for all datasets, and the highest agreement with reference alignments from the challenging dataset (iv) above, when compared with both sequentially constrained alignments and other non-sequential alignments. SPalignNS was implemented in C++. The source code, binary executable, and a web server version are freely available at: http://sparks-lab.org. Contact: yaoqi.zhou@griffith.edu.au. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
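The alignment accuracy in the abstract above is reported partly as a root-mean-squared distance over aligned residue pairs. The following minimal sketch shows that measure for two coordinate lists that are assumed to be already paired residue by residue and already superposed in a common frame; it does not perform the superposition or the non-sequential search that an alignment tool such as SPalignNS carries out, and the function name is an assumption.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-squared distance between paired 3D points.

    coords_a[k] and coords_b[k] are the (x, y, z) positions of the k-th
    aligned residue pair, both expressed in the same reference frame.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


# Hypothetical coordinates in angstroms, for illustration only.
structure_a = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0)]
structure_b = [(0.0, 0.0, 1.0), (3.8, 0.0, 1.0)]
print(rmsd(structure_a, structure_b))
```

Because the distance can be lowered simply by aligning fewer residues, it is normally read together with the coverage of the alignment, which is why the abstract quotes both.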
High-throughput methods for electron crystallography.
Stokes, David L; Ubarretxena-Belandia, Iban; Gonen, Tamir; Engel, Andreas
2013-01-01
Membrane proteins play a tremendously important role in cell physiology and serve as a target for an increasing number of drugs. Structural information is key to understanding their function and for developing new strategies for combating disease. However, the complex physical chemistry associated with membrane proteins has made them more difficult to study than their soluble cousins. Electron crystallography has historically been a successful method for solving membrane protein structures and has the advantage of providing a native lipid environment for these proteins. Specifically, when membrane proteins form two-dimensional arrays within a lipid bilayer, electron microscopy can be used to collect images and diffraction patterns, and the corresponding data can be combined to produce a three-dimensional reconstruction, which under favorable conditions can extend to atomic resolution. As in X-ray crystallography, the quality of the structures is very much dependent on the order and size of the crystals. However, unlike X-ray crystallography, high-throughput methods for screening crystallization trials for electron crystallography are not in general use. In this chapter, we describe two alternative methods for high-throughput screening of membrane protein crystallization within the lipid bilayer. The first method relies on the conventional use of dialysis for removing detergent and thus reconstituting the bilayer; an array of dialysis wells in the standard 96-well format allows the use of a liquid-handling robot and greatly increases throughput. The second method relies on titration of cyclodextrin as a chelating agent for detergent; a specialized pipetting robot has been designed not only to add cyclodextrin in a systematic way, but to use light scattering to monitor the reconstitution process. In addition, the use of liquid-handling robots for making negatively stained grids and methods for automatically imaging samples in the electron microscope are described.
Identification of DNA-Binding Proteins Using Structural, Electrostatic and Evolutionary Features
Nimrod, Guy; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir
2009-01-01
DNA binding proteins (DBPs) often take part in various crucial processes of the cell's life cycle. Therefore, the identification and characterization of these proteins are of great importance. We present here a random forests classifier for identifying DBPs among proteins with known three-dimensional structures. First, clusters of evolutionarily conserved regions (patches) on the protein's surface are detected using the PatchFinder algorithm; previous studies showed that these regions are typically the proteins' functionally important regions. Next, we train a classifier using features like the electrostatic potential, cluster-based amino acid conservation patterns and the secondary structure content of the patches, as well as features of the whole protein including its dipole moment. Using 10-fold cross validation on a dataset of 138 DNA-binding proteins and 110 proteins which do not bind DNA, the classifier achieved a sensitivity and a specificity of 0.90, which is overall better than the performance of previously published methods. Furthermore, when we tested 5 different methods on 11 new DBPs which did not appear in the original dataset, only our method annotated all correctly. The resulting classifier was applied to a collection of 757 proteins of known structure and unknown function. Of these proteins, 218 were predicted to bind DNA, and we anticipate that some of them interact with DNA using new structural motifs. The use of complementary computational tools supports the notion that at least some of them do bind DNA. PMID:19233205
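The classifier in the abstract above is summarized by its sensitivity and specificity. As a reminder of how those two figures are defined, here is a minimal sketch that derives them from confusion-matrix counts. The counts in the example are hypothetical toy values and are not the study's results; the function name is likewise an assumption.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from confusion-matrix counts.

    sensitivity = TP / (TP + FN): fraction of true binders that are detected.
    specificity = TN / (TN + FP): fraction of non-binders correctly rejected.
    """
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("each class needs at least one example")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical counts: 10 binders (9 found), 10 non-binders (8 rejected).
sens, spec = sensitivity_specificity(tp=9, fn=1, tn=8, fp=2)
print(sens, spec)
```

In a 10-fold cross-validation such as the one described, these counts would be accumulated over the held-out fold of each round before the ratios are taken.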
The alphabet of intrinsic disorder
Theillet, Francois-Xavier; Kalmar, Lajos; Tompa, Peter; Han, Kyou-Hoon; Selenko, Philipp; Dunker, A. Keith; Daughdrill, Gary W.; Uversky, Vladimir N
2013-01-01
A significant fraction of every proteome is occupied by biologically active proteins that do not form unique three-dimensional structures. These intrinsically disordered proteins (IDPs) and IDP regions (IDPRs) have essential biological functions and are characterized by extensive structural plasticity. Such structural and functional behavior is encoded in the amino acid sequences of IDPs/IDPRs, which are enriched in disorder-promoting residues and depleted in order-promoting residues. In fact, amino acid residues can be arranged according to their disorder-promoting tendency to form an alphabet of intrinsic disorder that defines the structural complexity and diversity of IDPs/IDPRs. This review is the first in a series of publications dedicated to the roles that different amino acid residues play in defining the phenomenon of protein intrinsic disorder. We start with proline because data suggests that of the 20 common amino acid residues, this one is the most disorder-promoting. PMID:28516008
An improved stochastic fractal search algorithm for 3D protein structure prediction.
Zhou, Changjun; Sun, Chuan; Wang, Bin; Wang, Xiaojun
2018-05-03
Protein structure prediction (PSP) is a significant area for biological information research, disease treatment, drug development and so on. In this paper, three-dimensional structures of proteins are predicted based on the known amino acid sequences, and the structure prediction problem is transformed into a typical NP problem by an AB off-lattice model. This work applies a novel improved Stochastic Fractal Search algorithm (ISFS) to solve the problem. The Stochastic Fractal Search algorithm (SFS) is an effective evolutionary algorithm that performs well in exploring the search space but sometimes falls into local minima. To overcome this weakness, Lévy flight and internal feedback information are introduced in ISFS. In the experiments, simulations are conducted with the ISFS algorithm on Fibonacci sequences and real peptide sequences. Experimental results show that ISFS performs more efficiently and robustly in terms of finding the global minimum and avoiding getting stuck in local minima.
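The abstract above does not state the energy function it minimizes. As an illustration, the sketch below evaluates the form most commonly cited for the AB off-lattice model: a bending term of (1 - cos θ)/4 per interior residue plus a Lennard-Jones-like term 4(r^-12 - C r^-6) over non-adjacent pairs, with C = 1 for A-A, 0.5 for B-B and -0.5 for A-B. Treating this as the paper's objective is an assumption, as are the function names; some three-dimensional variants add a torsion term that is not included here.

```python
import math


def interaction_coefficient(a, b):
    """Pair coefficient C: 1 for A-A, 0.5 for B-B, -0.5 for mixed pairs."""
    if a == "A" and b == "A":
        return 1.0
    if a == "B" and b == "B":
        return 0.5
    return -0.5


def ab_energy(sequence, coords):
    """Energy of a chain of 'A'/'B' residues at the given 3D coordinates."""
    n = len(sequence)
    if n != len(coords):
        raise ValueError("sequence and coords must have the same length")
    bend = 0.0
    for i in range(1, n - 1):
        # Bend angle between the two bond vectors meeting at residue i.
        u = [coords[i][k] - coords[i - 1][k] for k in range(3)]
        v = [coords[i + 1][k] - coords[i][k] for k in range(3)]
        dot = sum(a * b for a, b in zip(u, v))
        cos_theta = dot / (math.hypot(*u) * math.hypot(*v))
        bend += 0.25 * (1.0 - cos_theta)
    pair = 0.0
    for i in range(n - 2):
        for j in range(i + 2, n):  # skip bonded neighbours
            r = math.dist(coords[i], coords[j])
            c = interaction_coefficient(sequence[i], sequence[j])
            pair += 4.0 * (r ** -12 - c * r ** -6)
    return bend + pair


# Straight three-residue chain with unit bonds: no bending energy, one pair.
print(ab_energy("AAA", [(0, 0, 0), (1, 0, 0), (2, 0, 0)]))
```

A search method such as SFS or ISFS would treat the coordinates (or the bend and torsion angles that generate them) as the decision variables and call a function like this as the objective.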
Electron-transfer dynamics of photosynthetic reaction centers in thermoresponsive soft materials.
Laible, Philip D; Kelley, Richard F; Wasielewski, Michael R; Firestone, Millicent A
2005-12-15
Poly(ethylene glycol)-grafted, lipid-based, thermoresponsive, soft nanostructures are shown to serve as scaffolding into which reconstituted integral membrane proteins, such as the bacterial photosynthetic reaction centers (RCs) can be stabilized, and their packing arrangement, and hence photophysical properties, can be controlled. The self-assembled nanostructures exist in two distinct states: a liquid-crystalline gel phase at temperatures above 21 degrees C and a non-birefringent, reduced viscosity state at lower temperatures. Characterization of the effect of protein introduction on the mesoscopic structure of the materials by 31P NMR and small-angle X-ray scattering shows that the expanded lamellar structure of the protein-free material is retained. At reduced temperatures, however, the aggregate structure is found to convert from a two-dimensional normal hexagonal structure to a three-dimensional cubic phase upon introduction of the RCs. Structural and functional characteristics of the RCs were determined by ground-state and femtosecond transient absorption spectroscopy. Time-resolved results indicate that the kinetics of primary electron transfer for the RCs in the low-viscosity cold phase of the self-assembled nanostructures are identical to those observed in a detergent-solubilized state in buffered aqueous solutions (approximately 4 ps) over a wide range of protein concentrations and experimental conditions. This is also true for RCs held within the lamellar gel phase at low protein concentrations and at short sample storage times. In contrast are kinetics from samples that are prepared with high RC concentrations and stored for several hours, which display additional kinetic components with extended electron-transfer times (approximately 10-12 ps). This observation is tentatively attributed to energy transfer between RCs that have laterally (in-plane) organized within the lipid bilayers of the lamellar gel phase prior to charge separation. These results not only demonstrate the use of soft nanostructures as a matrix in which to stabilize and organize membrane proteins but also suggest the possibility of using them to control the interactions between proteins and thus to tune their collective optical/electronic properties.
Algorithm to find distant repeats in a single protein sequence
Banerjee, Nirjhar; Sarani, Rangarajan; Ranjani, Chellamuthu Vasuki; Sowmiya, Govindaraj; Michael, Daliah; Balakrishnan, Narayanasamy; Sekar, Kanagaraj
2008-01-01
Distant repeats in protein sequences play an important role in various aspects of protein analysis. A careful analysis of distant repeats would make it possible to relate the repeats to their function and three-dimensional structure over the course of evolution. Further, it sheds light on the diversity of duplication during evolution. To this end, an algorithm has been developed to find all distant repeats in a protein sequence. The scores from the Point Accepted Mutation (PAM) matrix have been deployed for the identification of amino acid substitutions while detecting the distant repeats. Due to the biological importance of distant repeats, the proposed algorithm will be of importance to structural biologists, molecular biologists, biochemists and researchers involved in phylogenetic and evolutionary studies. PMID:19052663
Open-Porous Hydroxyapatite Scaffolds for Three-Dimensional Culture of Human Adult Liver Cells
Schmelzer, Eva; Over, Patrick; Nettleship, Ian; Gerlach, Joerg C.
2016-01-01
Liver cell culture within three-dimensional structures provides an improved culture system for various applications in basic research, pharmacological screening, and implantable or extracorporeal liver support. Biodegradable calcium-based scaffolds in such systems could enhance liver cell functionality by providing endothelial and hepatic cell support through locally elevated calcium levels, increased surface area for cell attachment, and allowing three-dimensional tissue restructuring. Open-porous hydroxyapatite scaffolds were fabricated and seeded with primary adult human liver cells, which were cultured either with or without embedding in gels of the extracellular matrix protein collagen-1 or hyaluronan. Metabolic functions were assessed after 5, 15, and 28 days. Longer-term cultures exhibited the highest cell numbers and liver-specific gene expression when cultured on hydroxyapatite scaffolds in collagen-1. Endothelial gene expression was induced in cells cultured on scaffolds without extracellular matrix proteins. Hydroxyapatite induced gene expression for cytokeratin-19 when cells were cultured in collagen-1 gel, while culture in hyaluronan increased cytokeratin-19 gene expression independent of the use of a scaffold in long-term culture. The implementation of hydroxyapatite composites with extracellular matrices affected liver cell cultures and cell differentiation depending on the type of matrix protein and the presence of a scaffold. The hydroxyapatite scaffolds enable scale-up of hepatic three-dimensional culture models for regenerative medicine applications. PMID:27403430
Present and future of membrane protein structure determination by electron crystallography.
Ubarretxena-Belandia, Iban; Stokes, David L
2010-01-01
Membrane proteins are critical to cell physiology, playing roles in signaling, trafficking, transport, adhesion, and recognition. Despite their relative abundance in the proteome and their prevalence as targets of therapeutic drugs, structural information about membrane proteins is in short supply. This chapter describes the use of electron crystallography as a tool for determining membrane protein structures. Electron crystallography offers distinct advantages relative to the alternatives of X-ray crystallography and NMR spectroscopy. Namely, membrane proteins are placed in their native membranous environment, which is likely to favor a native conformation and allow changes in conformation in response to physiological ligands. Nevertheless, there are significant logistical challenges in finding appropriate conditions for inducing membrane proteins to form two-dimensional arrays within the membrane and in using electron cryo-microscopy to collect the data required for structure determination. A number of developments are described for high-throughput screening of crystallization trials and for automated imaging of crystals with the electron microscope. These tools are critical for exploring the necessary range of factors governing the crystallization process. There have also been recent software developments to facilitate the process of structure determination. However, further innovations in the algorithms used for processing images and electron diffraction are necessary to improve throughput and to make electron crystallography truly viable as a method for determining atomic structures of membrane proteins. Copyright © 2010 Elsevier Inc. All rights reserved.
Present and future of membrane protein structure determination by electron crystallography
Ubarretxena-Belandia, Iban; Stokes, David L.
2011-01-01
Membrane proteins are critical to cell physiology, playing roles in signaling, trafficking, transport, adhesion, and recognition. Despite their relative abundance in the proteome and their prevalence as targets of therapeutic drugs, structural information about membrane proteins is in short supply. This review describes the use of electron crystallography as a tool for determining membrane protein structures. Electron crystallography offers distinct advantages relative to the alternatives of X-ray crystallography and NMR spectroscopy. Namely, membrane proteins are placed in their native membranous environment, which is likely to favor a native conformation and allow changes in conformation in response to physiological ligands. Nevertheless, there are significant logistical challenges in finding appropriate conditions for inducing membrane proteins to form two-dimensional arrays within the membrane and in using electron cryo-microscopy to collect the data required for structure determination. A number of developments are described for high-throughput screening of crystallization trials and for automated imaging of crystals with the electron microscope. These tools are critical for exploring the necessary range of factors governing the crystallization process. There have also been recent software developments to facilitate the process of structure determination. However, further innovations in the algorithms used for processing images and electron diffraction are necessary to improve throughput and to make electron crystallography truly viable as a method for determining atomic structures of membrane proteins. PMID:21115172
Expression and Purification of Rat Glucose Transporter 1 in Pichia pastoris.
Venskutonytė, Raminta; Elbing, Karin; Lindkvist-Petersson, Karin
2018-01-01
Large amounts of pure and homogenous protein are a prerequisite for several biochemical and biophysical analyses, in particular when the aim is to resolve the three-dimensional protein structure. Here we describe the production of the rat glucose transporter 1 (GLUT1), a membrane protein facilitating the transport of glucose in cells. The protein is recombinantly expressed in the yeast Pichia pastoris. This yeast is easily maintained, and large-scale protein production in shaker flasks, as commonly performed in academic research laboratories, results in relatively high yields of membrane protein. The purification protocol describes all steps needed to obtain a pure and homogenous GLUT1 protein solution, including cell growth, membrane isolation, and chromatographic purification methods.
Koromyslova, Anna D; Chugunov, Anton O; Efremov, Roman G
2014-04-28
Molecular surfaces are the key players in biomolecular recognition and interactions. Nowadays, it is trivial to visualize a molecular surface and surface-distributed properties in three-dimensional space. However, such a representation tends to be biased and ambiguous when a thorough analysis is required. We present a new method to create 2D spherical projection maps of entire protein surfaces and to manipulate them: protein surface topography (PST). It permits visualization and thoughtful analysis of surface properties. PST helps to easily portray conformational transitions, analyze proteins' properties and their dynamic behavior, improve docking performance, and reveal common patterns and dissimilarities in molecular surfaces of related bioactive peptides. This paper describes basic usage of PST with the examples of conformational transitions in small G-proteins, mapping of the caspase-1 intersubunit interface, and intrinsic "complementarity" in the conotoxin-acetylcholine binding protein complex. We suggest that PST is a beneficial approach for structure-function studies of bioactive peptides and small proteins.
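The idea of a 2D spherical projection map can be illustrated with a generic longitude/latitude mapping of surface points about their centroid. This is a hedged sketch only: the abstract does not state which projection PST uses, and the function name `spherical_projection` and the sample points are assumptions made for illustration.

```python
import math


def spherical_projection(points):
    """Map 3D points to (longitude, latitude, radius) about their centroid.

    Longitude and latitude are in degrees. This is a generic
    equirectangular-style mapping, not necessarily the one used by PST.
    """
    n = len(points)
    cx = sum(p[0] for p in points) / n
    cy = sum(p[1] for p in points) / n
    cz = sum(p[2] for p in points) / n
    projected = []
    for x, y, z in points:
        dx, dy, dz = x - cx, y - cy, z - cz
        r = math.sqrt(dx * dx + dy * dy + dz * dz)
        lon = math.degrees(math.atan2(dy, dx))
        # Clamp the ratio to guard against rounding slightly outside [-1, 1].
        lat = math.degrees(math.asin(max(-1.0, min(1.0, dz / r)))) if r > 0 else 0.0
        projected.append((lon, lat, r))
    return projected


# Hypothetical surface points: the six vertices of a unit octahedron.
surface = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]
projection = spherical_projection(surface)
for lon, lat, r in projection:
    print(round(lon, 1), round(lat, 1), round(r, 2))
```

A surface property such as hydrophobicity could then be stored per (longitude, latitude) cell, with the radius kept as a separate relief map; that pairing is an illustrative choice rather than something the abstract specifies.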
Network representation of protein interactions: Theory of graph description and analysis.
Kurzbach, Dennis
2016-09-01
A methodological framework is presented for the graph theoretical interpretation of NMR data of protein interactions. The proposed analysis generalizes the idea of network representations of protein structures by expanding it to protein interactions. This approach is based on regularization of residue-resolved NMR relaxation times and chemical shift data and subsequent construction of an adjacency matrix that represents the underlying protein interaction as a graph or network. The network nodes represent protein residues. Two nodes are connected if two residues are functionally correlated during the protein interaction event. The analysis of the resulting network enables the quantification of the importance of each amino acid of a protein for its interactions. Furthermore, the determination of the pattern of correlations between residues yields insights into the functional architecture of an interaction. This is of special interest for intrinsically disordered proteins, since the structural (three-dimensional) architecture of these proteins and their complexes is difficult to determine. The power of the proposed methodology is demonstrated using the example of the interaction between the intrinsically disordered protein osteopontin and its natural ligand heparin. © 2016 The Protein Society.
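The step from residue correlations to a network can be sketched as follows. The abstract describes a regularization of NMR data before the adjacency matrix is built; the simple absolute-value threshold used here is an assumption standing in for that step, and the correlation values are invented for illustration.

```python
def adjacency_from_correlation(corr, threshold):
    """Connect residues i and j when |corr[i][j]| >= threshold (no self-loops)."""
    n = len(corr)
    return [
        [1 if i != j and abs(corr[i][j]) >= threshold else 0 for j in range(n)]
        for i in range(n)
    ]


def degree(adj):
    """Number of connections per residue, a simple importance measure."""
    return [sum(row) for row in adj]


# Hypothetical symmetric correlation matrix for four residues.
corr = [
    [1.0, 0.9, 0.2, 0.8],
    [0.9, 1.0, 0.1, 0.3],
    [0.2, 0.1, 1.0, 0.7],
    [0.8, 0.3, 0.7, 1.0],
]
adj = adjacency_from_correlation(corr, threshold=0.6)
deg = degree(adj)
print(adj)
print(deg)
```

Degree is only one possible way to quantify the importance of a residue; other centrality measures could be computed on the same adjacency matrix.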
Protein structure shapes immunodominance in the CD4 T cell response to yellow fever vaccination.
Koblischke, Maximilian; Mackroth, Maria S; Schwaiger, Julia; Fae, Ingrid; Fischer, Gottfried; Stiasny, Karin; Heinz, Franz X; Aberle, Judith H
2017-08-21
The live attenuated yellow fever (YF) vaccine is a highly effective human vaccine and induces long-term protective neutralizing antibodies directed against the viral envelope protein E. The generation of such antibodies requires the help of CD4 T cells which recognize peptides derived from proteins in virus particles internalized and processed by E-specific B cells. The CD4 T helper cell response is restricted to few immunodominant epitopes, but the mechanisms of their selection are largely unknown. Here, we report that CD4 T cell responses elicited by the YF-17D vaccine are focused to hotspots of two helices of the viral capsid protein and to exposed strands and loops of E. We found that the locations of immunodominant epitopes within three-dimensional protein structures exhibit a high degree of overlap between YF virus and the structurally homologous flavivirus tick-borne encephalitis virus, although amino acid sequence identity of the epitope regions is only 15-45%. The restriction of epitopes to exposed E protein surfaces and their strikingly similar positioning within proteins of distantly related flaviviruses are consistent with a strong influence of protein structure that shapes CD4 T cell responses and provide leads for a rational design of immunogens for vaccination.
3dRPC: a web server for 3D RNA-protein structure prediction.
Huang, Yangyu; Li, Haotian; Xiao, Yi
2018-04-01
RNA-protein interactions occur in many biological processes. To understand the mechanism of these interactions one needs to know three-dimensional (3D) structures of RNA-protein complexes. 3dRPC is an algorithm for prediction of 3D RNA-protein complex structures and consists of a docking algorithm RPDOCK and a scoring function 3dRPC-Score. RPDOCK is used to sample possible complex conformations of an RNA and a protein by calculating the geometric and electrostatic complementarities and stacking interactions at the RNA-protein interface according to the features of atom packing of the interface. 3dRPC-Score is a knowledge-based potential that uses the conformations of nucleotide-amino-acid pairs as statistical variables and that is used to choose the near-native complex conformations obtained from the docking method above. Recently, we built a web server for 3dRPC. Users can run 3dRPC without installing it locally. RNA and protein structures in PDB (Protein Data Bank) format are the only input files needed. The server can also incorporate information on interface residues or residue pairs obtained from experiments or theoretical predictions to improve the prediction. The address of the 3dRPC web server is http://biophy.hust.edu.cn/3dRPC. Contact: yxiao@hust.edu.cn.
Proposed structure of putative glucose channel in GLUT1 facilitative glucose transporter.
Zeng, H; Parthasarathy, R; Rampal, A L; Jung, C Y
1996-01-01
A family of structurally related intrinsic membrane proteins (facilitative glucose transporters) catalyzes the movement of glucose across the plasma membrane of animal cells. Evidence indicates that these proteins show a common structural motif where approximately 50% of the mass is embedded in lipid bilayer (transmembrane domain) in 12 alpha-helices (transmembrane helices; TMHs) and accommodates a water-filled channel for substrate passage (glucose channel) whose tertiary structure is currently unknown. Using recent advances in protein structure prediction algorithms, we propose here two three-dimensional structural models for the transmembrane glucose channel of the GLUT1 glucose transporter. Our models emphasize the physical dimension and water accessibility of the channel, loop lengths between TMHs, the macrodipole orientation in the four-helix bundle motif, and helix packing energy. Our models predict that five TMHs, either TMHs 3, 4, 7, 8, 11 (Model 1) or TMHs 2, 5, 11, 8, 7 (Model 2), line the channel, and the remaining TMHs surround these channel-lining TMHs. We discuss how our models are compatible with the experimental data obtained with this protein, and how they can be used in designing new biochemical and molecular biological experiments to elucidate the structural basis of this important protein function. PMID:8770183
Identify High-Quality Protein Structural Models by Enhanced K-Means.
Wu, Hongjie; Li, Haiou; Jiang, Min; Chen, Cheng; Lv, Qiang; Wu, Chuang
2017-01-01
Background. One critical issue in protein three-dimensional structure prediction using either ab initio or comparative modeling involves identification of high-quality protein structural models from generated decoys. Currently, clustering algorithms are widely used to identify near-native models; however, their performance is dependent upon different conformational decoys, and, for some algorithms, the accuracy declines when the decoy population increases. Results. Here, we proposed two enhanced K-means clustering algorithms capable of robustly identifying high-quality protein structural models. The first one employs the clustering algorithm SPICKER to determine the initial centroids for basic K-means clustering (SK-means), whereas the other employs squared distance to optimize the initial centroids (K-means++). Our results showed that SK-means and K-means++ were more robust as compared with SPICKER alone, detecting 33 (59%) and 42 (75%) of 56 targets, respectively, with template modeling scores better than or equal to those of SPICKER. Conclusions. We observed that the classic K-means algorithm showed a similar performance to that of SPICKER, which is a widely used algorithm for protein-structure identification. Both SK-means and K-means++ demonstrated substantial improvements relative to results from SPICKER and classical K-means. PMID:28421198
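The squared-distance seeding mentioned above (K-means++) can be sketched in a few lines. This is an illustrative sketch, not the authors' implementation: real decoy clustering would compare structures (for example by RMSD), whereas here each decoy is assumed to be reduced to a short feature vector, and the sample points, seed and iteration count are hypothetical.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeanspp_init(points, k, rng):
    """Pick k initial centroids; later picks are weighted by squared distance."""
    centroids = [rng.choice(points)]
    while len(centroids) < k:
        d2 = [min(sq_dist(p, c) for c in centroids) for p in points]
        centroids.append(rng.choices(points, weights=d2, k=1)[0])
    return centroids


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd iterations starting from K-means++ seeds."""
    rng = random.Random(seed)
    centroids = kmeanspp_init(points, k, rng)
    clusters = []
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: sq_dist(p, centroids[c]))
            clusters[nearest].append(p)
        # Keep the old centroid if a cluster ends up empty.
        centroids = [
            tuple(sum(v) / len(cl) for v in zip(*cl)) if cl else centroids[i]
            for i, cl in enumerate(clusters)
        ]
    return centroids, clusters


# Hypothetical 2D feature vectors forming two well-separated groups of decoys.
decoys = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
centroids, clusters = kmeans(decoys, k=2)
print(sorted(centroids))
```

Weighting the choice of later seeds by squared distance makes it unlikely that two seeds start in the same dense group, which is the property the abstract credits for the more robust model selection.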
Dzurová, Lenka; Forneris, Federico; Savino, Simone; Galuszka, Petr; Vrabka, Josef; Frébort, Ivo
2015-08-01
The recently discovered cytokinin (CK)-specific phosphoribohydrolase "Lonely Guy" (LOG) is a key enzyme of CK biosynthesis, converting inactive CK nucleotides into biologically active free bases. We have determined the crystal structures of LOG from Claviceps purpurea (cpLOG) and its complex with the enzymatic product phosphoribose. The structures reveal a dimeric arrangement of Rossmann folds, with the ligands bound to large pockets at the interface between cpLOG monomers. Structural comparisons highlight the homology of cpLOG to putative lysine decarboxylases. Extended sequence analysis enabled identification of a distinguishing LOG sequence signature. Taken together, our data suggest phosphoribohydrolase activity for several proteins of unknown function. © 2015 Wiley Periodicals, Inc.
Proteomics Analysis of the Nucleolus in Adenovirus-infected Cells
Lam, Yun W.; Evans, Vanessa C.; Heesom, Kate J.; Lamond, Angus I.; Matthews, David A.
2010-01-01
Adenoviruses replicate primarily in the host cell nucleus, and it is well established that adenovirus infection affects the structure and function of host cell nucleoli in addition to coding for a number of nucleolar targeted viral proteins. Here we used unbiased proteomics methods, including high throughput mass spectrometry coupled with stable isotope labeling by amino acids in cell culture (SILAC) and traditional two-dimensional gel electrophoresis, to identify quantitative changes in the protein composition of the nucleolus during adenovirus infection. Two-dimensional gel analysis revealed changes in six proteins. By contrast, SILAC-based approaches identified 351 proteins with 24 proteins showing at least a 2-fold change after infection. Of those, four were previously reported to have aberrant localization and/or functional relevance during adenovirus infection. In total, 15 proteins identified as changing in amount by proteomics methods were examined in infected cells using confocal microscopy. Eleven of these proteins showed altered patterns of localization in adenovirus-infected cells. Comparing our data with the effects of actinomycin D on the nucleolar proteome revealed that adenovirus infection apparently specifically targets a relatively small subset of nucleolar antigens at the time point examined. PMID:19812395
Antunes, Deborah; Jorge, Natasha A. N.; Caffarena, Ernesto R.; Passetti, Fabio
2018-01-01
RNA molecules are essential players in many fundamental biological processes. Prokaryotes and eukaryotes have distinct RNA classes with specific structural features and functional roles. Computational prediction of protein structures is a research field in which high-confidence three-dimensional protein models can be proposed based on the sequence alignment between target and templates. However, to date, only a few approaches have been developed for the computational prediction of RNA structures. Similar to proteins, RNA structures may be altered due to the interaction with various ligands, including proteins, other RNAs, and metabolites. A riboswitch is a molecular mechanism, found in the three kingdoms of life, in which the RNA structure is modified by the binding of a metabolite. It can regulate multiple gene expression mechanisms, such as transcription, translation initiation, and mRNA splicing and processing. Due to their nature, these entities also act on the regulation of gene expression and detection of small metabolites and have the potential to help in the discovery of new classes of antimicrobial agents. In this review, we describe software and web servers currently available for riboswitch aptamer identification and secondary and tertiary structure prediction, including applications. PMID:29403526
Gibbons, Don L.; Reilly, Brigid; Ahn, Anna; Vaney, Marie-Christine; Vigouroux, Armelle; Rey, Felix A.; Kielian, Margaret
2004-01-01
The fusion proteins of the alphaviruses and flaviviruses have a similar native structure and convert to a highly stable homotrimer conformation during the fusion of the viral and target membranes. The properties of the alpha- and flavivirus fusion proteins distinguish them from the class I viral fusion proteins, such as influenza virus hemagglutinin, and establish them as the first members of the class II fusion proteins. Understanding how this new class carries out membrane fusion will require analysis of the structural basis for both the interaction of the protein subunits within the homotrimer and their interaction with the viral and target membranes. To this end we report a purification method for the E1 ectodomain homotrimer from the alphavirus Semliki Forest virus. The purified protein is trimeric, detergent soluble, retains the characteristic stability of the starting homotrimer, and is free of lipid and other contaminants. In contrast to the postfusion structures that have been determined for the class I proteins, the E1 homotrimer contains the fusion peptide region responsible for interaction with target membranes. This E1 trimer preparation is an excellent candidate for structural studies of the class II viral fusion proteins, and we report conditions that generate three-dimensional crystals suitable for analysis by X-ray diffraction. Determination of the structure will provide our first high-resolution views of both the low-pH-induced trimeric conformation and the target membrane-interacting region of the alphavirus fusion protein. PMID:15016874
Binding Mechanisms of Intrinsically Disordered Proteins: Theory, Simulation, and Experiment
Mollica, Luca; Bessa, Luiza M.; Hanoulle, Xavier; Jensen, Malene Ringkjøbing; Blackledge, Martin; Schneider, Robert
2016-01-01
In recent years, protein science has been revolutionized by the discovery of intrinsically disordered proteins (IDPs). In contrast to the classical paradigm that a given protein sequence corresponds to a defined structure and an associated function, we now know that proteins can be functional in the absence of a stable three-dimensional structure. In many cases, disordered proteins or protein regions become structured, at least locally, upon interacting with their physiological partners. Many, sometimes conflicting, hypotheses have been put forward regarding the interaction mechanisms of IDPs and the potential advantages of disorder for protein-protein interactions. Whether disorder may increase, as proposed, e.g., in the “fly-casting” hypothesis, or decrease binding rates, increase or decrease binding specificity, or what role pre-formed structure might play in interactions involving IDPs (conformational selection vs. induced fit), are subjects of intense debate. Experimentally, these questions remain difficult to address. Here, we review experimental studies of binding mechanisms of IDPs using NMR spectroscopy and transient kinetic techniques, as well as the underlying theoretical concepts and numerical methods that can be applied to describe these interactions at the atomic level. The available literature suggests that the kinetic and thermodynamic parameters characterizing interactions involving IDPs can vary widely and that there may be no single common mechanism that can explain the different binding modes observed experimentally. Rather, disordered proteins appear to make combined use of features such as pre-formed structure and flexibility, depending on the individual system and the functional context. PMID:27668217
Mahalingam, Rajasekaran; Peng, Hung-Pin; Yang, An-Suei
2014-08-01
Protein-fatty acid interaction is vital for many cellular processes, and understanding this interaction is important for functional annotation as well as drug discovery. In this work, we present a method for predicting fatty acid (FA)-binding residues by using three-dimensional probability density distributions of interacting atoms of FAs on protein surfaces, which are derived from the known protein-FA complex structures. A machine learning algorithm was established to learn the characteristic patterns of the probability density maps specific to the FA-binding sites. The predictor was trained with five-fold cross validation on a non-redundant training set and then evaluated with an independent test set as well as on a dataset of holo-apo pairs. The results showed good accuracy in predicting the FA-binding residues. Further, the predictor developed in this study is implemented as an online server which is freely accessible at the following website, http://ismblab.genomics.sinica.edu.tw/. Copyright © 2014 Elsevier B.V. All rights reserved.
Computational mining for hypothetical patterns of amino acid side chains in protein data bank (PDB)
NASA Astrophysics Data System (ADS)
Ghani, Nur Syatila Ab; Firdaus-Raih, Mohd
2018-04-01
The three-dimensional structure of a protein can provide insights regarding its function. Functional relationships between proteins can be inferred from fold and sequence similarities. In certain cases, sequence or fold comparison fails to establish homology between proteins with a similar mechanism. Since the structure is more conserved than the sequence, a constellation of functional residues can be similarly arranged among proteins of similar mechanism. Local structural similarity searches are able to detect such constellations of amino acids among distinct proteins, which can be useful to annotate proteins of unknown function. Detection of such patterns of amino acids on a large scale can increase the repertoire of important 3D motifs, since the currently known 3D motifs cannot keep pace with the ever-increasing number of uncharacterized proteins to be annotated. Here, a computational platform for automated detection of 3D motifs is described. A fuzzy-pattern searching algorithm derived from IMagine an Amino Acid 3D Arrangement search EnGINE (IMAAAGINE) was implemented to develop an automated method for searching for hypothetical patterns of amino acid side chains in the Protein Data Bank (PDB), without the need for prior knowledge of the related sequence or structure of the pattern of interest. We present an example of the searches, namely the detection of a hypothetical pattern derived from the known C2H2 structural motif of zinc fingers. The conservation of particular patterns of amino acid side chains in unrelated proteins is highlighted. This approach can act as a complementary method for available structure- and sequence-based platforms and may contribute to improving functional association between proteins.
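A local structural pattern search of this kind can be illustrated by matching residue types and pairwise distances within a tolerance. The sketch below is an assumption-laden toy, not the IMAAAGINE algorithm: each residue is reduced to a single hypothetical coordinate, the tolerance is arbitrary, and the function name `find_pattern` is invented for illustration.

```python
import math
from itertools import product


def find_pattern(query, structure, tol=0.5):
    """Find residue index tuples in `structure` that match `query`.

    Both arguments are lists of (residue_name, (x, y, z)). A match needs the
    same residue names and all pairwise distances equal within `tol`.
    """
    candidates = [
        [i for i, (name, _) in enumerate(structure) if name == qname]
        for qname, _ in query
    ]
    matches = []
    for combo in product(*candidates):
        if len(set(combo)) < len(combo):  # each residue may be used only once
            continue
        ok = all(
            abs(
                math.dist(query[a][1], query[b][1])
                - math.dist(structure[combo[a]][1], structure[combo[b]][1])
            )
            <= tol
            for a in range(len(query))
            for b in range(a + 1, len(query))
        )
        if ok:
            matches.append(combo)
    return matches


# Hypothetical three-residue pattern and a made-up structure containing a
# shifted, slightly distorted copy of it plus unrelated residues.
query = [("CYS", (0, 0, 0)), ("CYS", (4, 0, 0)), ("HIS", (0, 7, 0))]
structure = [
    ("GLY", (9, 9, 9)),
    ("CYS", (10, 10, 10)),
    ("CYS", (14.2, 10, 10)),
    ("HIS", (10, 17.1, 10)),
    ("HIS", (30, 30, 30)),
]
matches = find_pattern(query, structure)
print(matches)
```

Comparing distances rather than coordinates makes the search independent of how the two structures are oriented, at the cost of not distinguishing a pattern from its mirror image.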
How community has shaped the Protein Data Bank.
Berman, Helen M; Kleywegt, Gerard J; Nakamura, Haruki; Markley, John L
2013-09-03
Following several years of community discussion, the Protein Data Bank (PDB) was established in 1971 as a public repository for the coordinates of three-dimensional models of biological macromolecules. Since then, the number, size, and complexity of structural models have continued to grow, reflecting the productivity of structural biology. Managed by the Worldwide PDB organization, the PDB has been able to meet increasing demands for both the quantity and the quality of structural information. In addition to providing unrestricted access to structural information, the PDB also works to promote data standards and to raise the profile of structural biology with broader audiences. In this perspective, we describe the history of the PDB and the many ways in which the community continues to shape the archive. Copyright © 2013 Elsevier Ltd. All rights reserved.
How does symmetry impact the flexibility of proteins?
Schulze, Bernd; Sljoka, Adnan; Whiteley, Walter
2014-02-13
It is well known that (i) the flexibility and rigidity of proteins are central to their function, (ii) a number of oligomers with several copies of individual protein chains assemble with symmetry in the native state and (iii) added symmetry sometimes leads to added flexibility in structures. We observe that the most common symmetry classes of protein oligomers are also the symmetry classes that lead to increased flexibility in certain three-dimensional structures, and we investigate the possible significance of this coincidence. This builds on the well-developed theory of generic rigidity of body-bar frameworks, which permits an analysis of the rigidity and flexibility of molecular structures such as proteins via fast combinatorial algorithms. In particular, we outline some very simple counting rules and possible algorithmic extensions that allow us to predict continuous symmetry-preserving motions in body-bar frameworks that possess non-trivial point-group symmetry. For simplicity, we focus on dimers, which typically assemble with twofold rotational axes, and often have allosteric function that requires motions to link distant sites on the two protein chains.
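For orientation, the basic non-symmetric count for body-bar frameworks in three dimensions can be written down directly: each rigid body contributes 6 degrees of freedom, each bar removes at most one, and 6 rigid motions of the whole framework are trivial. The sketch below implements only this standard generic count together with a brute-force check of the matching subset condition; it does not reproduce the symmetry-adapted counting rules of the paper, and the function names and example graphs are hypothetical.

```python
from itertools import combinations


def internal_dof_lower_bound(n_bodies, n_bars):
    """Lower bound on internal degrees of freedom: 6n - 6 - b, floored at 0."""
    return max(0, 6 * n_bodies - 6 - n_bars)


def is_independent(n_bodies, bars):
    """Check that every subset of k >= 2 bodies spans at most 6k - 6 bars.

    `bars` is a list of (u, v) body-index pairs; repeated pairs are allowed.
    Brute force over subsets, so only suitable for very small examples.
    """
    for size in range(2, n_bodies + 1):
        for subset in combinations(range(n_bodies), size):
            members = set(subset)
            induced = sum(1 for u, v in bars if u in members and v in members)
            if induced > 6 * size - 6:
                return False
    return True


# Hypothetical dimer: two bodies (chains) joined by 5 bars, like a hinge.
hinge_bars = [(0, 1)] * 5
print(internal_dof_lower_bound(2, len(hinge_bars)))  # at least one internal motion
print(is_independent(2, hinge_bars))
```

A framework that meets the count with equality and passes the subset check is generically minimally rigid in the non-symmetric setting; the point of the abstract is that point-group symmetry can add motions beyond what this count predicts.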
Ambrosi, Emmanuele; Capaldi, Stefano; Bovi, Michele; Saccomani, Gianmaria; Perduca, Massimiliano; Monaco, Hugo L.
2011-01-01
The SOUL protein is known to induce apoptosis by provoking the mitochondrial permeability transition, and a sequence homologous with the BH3 (Bcl-2 homology 3) domains has recently been identified in the protein, thus making it a potential new member of the BH3-only protein family. In the present study, we provide NMR, SPR (surface plasmon resonance) and crystallographic evidence that a peptide spanning residues 147–172 in SOUL interacts with the anti-apoptotic protein Bcl-xL. We have crystallized SOUL alone and the complex of its BH3 domain peptide with Bcl-xL, and solved their three-dimensional structures. The SOUL monomer is a single domain organized as a distorted β-barrel with eight anti-parallel strands and two α-helices. The BH3 domain extends across 15 residues at the end of the second helix and eight amino acids in the chain following it. There are important structural differences between the BH3 domain in the intact SOUL molecule and the same sequence bound to Bcl-xL. PMID:21639858
Ripoche, Hugues; Laine, Elodie; Ceres, Nicoletta; Carbone, Alessandra
2017-01-04
The database JET2 Viewer, openly accessible at http://www.jet2viewer.upmc.fr/, reports putative protein binding sites for all three-dimensional (3D) structures available in the Protein Data Bank (PDB). This knowledge base was generated by applying the computational method JET 2 at large scale on more than 20 000 chains. The JET 2 strategy yields very precise predictions of interacting surfaces and unravels their evolutionary process and complexity. JET2 Viewer provides an online intelligent display, including interactive 3D visualization of the binding sites mapped onto PDB structures and suitable files recording JET 2 analyses. Predictions were evaluated on more than 15 000 experimentally characterized protein interfaces. This is, to our knowledge, the largest evaluation of a protein binding site prediction method. The overall performance of JET 2 on all interfaces is: Sen = 52.52, PPV = 51.24, Spe = 80.05, Acc = 75.89. The data can be used to foster new strategies for protein-protein interactions modulation and interaction surface redesign. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
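The four scores quoted above (Sen, PPV, Spe, Acc) are standard confusion-matrix quantities. The sketch below shows how they are usually computed from counts of surface residues predicted inside or outside an interface. The function name and the example counts are hypothetical and are not taken from the JET 2 evaluation.

```python
def interface_metrics(tp, fp, tn, fn):
    """Return sensitivity, PPV, specificity and accuracy as percentages.

    tp/fp/tn/fn are counts of residues: true positives, false positives,
    true negatives and false negatives.
    """
    return {
        "Sen": 100.0 * tp / (tp + fn),   # fraction of real interface residues found
        "PPV": 100.0 * tp / (tp + fp),   # fraction of predictions that are correct
        "Spe": 100.0 * tn / (tn + fp),   # fraction of non-interface residues rejected
        "Acc": 100.0 * (tp + tn) / (tp + fp + tn + fn),
    }

# Hypothetical counts for a single protein surface of 100 residues.
metrics = interface_metrics(tp=20, fp=10, tn=60, fn=10)
```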
Ryu, Hyojung; Lim, GyuTae; Sung, Bong Hyun; Lee, Jinhyuk
2016-02-15
Protein structure refinement is a necessary step for the study of protein function. In particular, some nuclear magnetic resonance (NMR) structures are of lower quality than X-ray crystallographic structures. Here, we present NMRe, a web-based server for NMR structure refinement. The previously developed knowledge-based energy function STAP (Statistical Torsion Angle Potential) was used for NMRe refinement. With STAP, NMRe provides two refinement protocols using two types of distance restraints. If a user provides NOE (Nuclear Overhauser Effect) data, the refinement is performed with the NOE distance restraints as a conventional NMR structure refinement. Additionally, NMRe generates NOE-like distance restraints based on the inter-hydrogen distances derived from the input structure. The efficiency of NMRe refinement was validated on 20 NMR structures. Most of the quality assessment scores of the refined NMR structures were better than those of the original structures. The refinement results are provided as a three-dimensional structure view, a secondary structure scheme, and numerical and graphical structure validation scores. NMRe is available at http://psb.kobic.re.kr/nmre/. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Forbes-Lorman, Robin M; Harris, Michelle A; Chang, Wesley S; Dent, Erik W; Nordheim, Erik V; Franzen, Margaret A
2016-07-08
Understanding how basic structural units influence function is identified as a foundational/core concept for undergraduate biological and biochemical literacy. It is essential for students to understand this concept at all size scales, but it is often more difficult for students to understand structure-function relationships at the molecular level, which they cannot as effectively visualize. Students need to develop accurate, 3-dimensional mental models of biomolecules to understand how biomolecular structure affects cellular functions at the molecular level, yet most traditional curricular tools such as textbooks include only 2-dimensional representations. We used a controlled, backward design approach to investigate how hand-held physical molecular model use affected students' ability to logically predict structure-function relationships. Brief (one class period) physical model use increased quiz score for females, whereas there was no significant increase in score for males using physical models. Females also self-reported higher learning gains in their understanding of context-specific protein function. Gender differences in spatial visualization may explain the gender-specific benefits of physical model use observed. © 2016 The Authors Biochemistry and Molecular Biology Education published by Wiley Periodicals, Inc. on behalf of International Union of Biochemistry and Molecular Biology, 44(4):326-335, 2016. © 2016 The International Union of Biochemistry and Molecular Biology.
MEGADOCK: An All-to-All Protein-Protein Interaction Prediction System Using Tertiary Structure Data
Ohue, Masahito; Matsuzaki, Yuri; Uchikoga, Nobuyuki; Ishida, Takashi; Akiyama, Yutaka
2014-01-01
The elucidation of protein-protein interaction (PPI) networks is important for understanding cellular structure and function and structure-based drug design. However, the development of an effective method to conduct exhaustive PPI screening represents a computational challenge. We have been investigating a protein docking approach based on shape complementarity and physicochemical properties. We describe here the development of the protein-protein docking software package “MEGADOCK” that samples an extremely large number of protein dockings at high speed. MEGADOCK reduces the calculation time required for docking by using several techniques such as a novel scoring function called the real Pairwise Shape Complementarity (rPSC) score. We showed that MEGADOCK is capable of exhaustive PPI screening by completing docking calculations 7.5 times faster than the conventional docking software, ZDOCK, while maintaining an acceptable level of accuracy. When MEGADOCK was applied to a subset of a general benchmark dataset to predict 120 relevant interacting pairs from 120 x 120 = 14,400 combinations of proteins, an F-measure value of 0.231 was obtained. Further, we showed that MEGADOCK can be applied to a large-scale protein-protein interaction-screening problem with accuracy better than random. When our approach is combined with parallel high-performance computing systems, it is now feasible to search and analyze protein-protein interactions while taking into account three-dimensional structures at the interactome scale. MEGADOCK is freely available at http://www.bi.cs.titech.ac.jp/megadock. PMID:23855673
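As a rough illustration of the all-to-all benchmark described above, the sketch below enumerates the 120 x 120 receptor-ligand combinations and computes an F-measure from hit counts. The counts passed to the function are hypothetical and chosen only to show the arithmetic; they are not MEGADOCK results.

```python
from itertools import product

def f_measure(tp, fp, fn):
    """Harmonic mean of precision and recall, computed from raw counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Every receptor is docked against every ligand: 120 x 120 combinations.
n_pairs = sum(1 for _ in product(range(120), range(120)))

# Hypothetical outcome: 130 pairs predicted, 30 of them among the 120 true pairs.
example_f = f_measure(tp=30, fp=100, fn=90)
```

Because only 120 of the 14,400 combinations are true interactions, even a modest F-measure on such a set sits well above what random selection of pairs would give.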
GenProBiS: web server for mapping of sequence variants to protein binding sites.
Konc, Janez; Skrlj, Blaz; Erzen, Nika; Kunej, Tanja; Janezic, Dusanka
2017-07-03
Discovery of potentially deleterious sequence variants is important and has wide implications for research and generation of new hypotheses in human and veterinary medicine, and drug discovery. The GenProBiS web server maps sequence variants to protein structures from the Protein Data Bank (PDB), and further to protein-protein, protein-nucleic acid, protein-compound, and protein-metal ion binding sites. The concept of a protein-compound binding site is understood in the broadest sense, which includes glycosylation and other post-translational modification sites. Binding sites were defined by local structural comparisons of whole protein structures using the Protein Binding Sites (ProBiS) algorithm and transposition of ligands from the similar binding sites found to the query protein using the ProBiS-ligands approach with new improvements introduced in GenProBiS. Binding site surfaces were generated as three-dimensional grids encompassing the space occupied by predicted ligands. The server allows intuitive visual exploration of comprehensively mapped variants, such as human somatic mis-sense mutations related to cancer and non-synonymous single nucleotide polymorphisms from 21 species, within the predicted binding sites regions for about 80 000 PDB protein structures using fast WebGL graphics. The GenProBiS web server is open and free to all users at http://genprobis.insilab.org. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Liang, Yunyun; Liu, Sanyang; Zhang, Shengli
2015-01-01
Prediction of protein structural classes for low-similarity sequences is useful for understanding fold patterns, regulation, functions, and interactions of proteins. It is well known that feature extraction is significant to prediction of protein structural class and it mainly uses protein primary sequence, predicted secondary structure sequence, and position-specific scoring matrix (PSSM). Currently, prediction solely based on the PSSM has played a key role in improving the prediction accuracy. In this paper, we propose a novel method called CSP-SegPseP-SegACP by fusing consensus sequence (CS), segmented PsePSSM, and segmented autocovariance transformation (ACT) based on PSSM. Three widely used low-similarity datasets (1189, 25PDB, and 640) are adopted in this paper. Then a 700-dimensional (700D) feature vector is constructed and the dimension is decreased to 224D by using principal component analysis (PCA). To verify the performance of our method, rigorous jackknife cross-validation tests are performed on the 1189, 25PDB, and 640 datasets. Comparison of our results with the existing PSSM-based methods demonstrates that our method achieves favorable and competitive performance. This will offer an important complement to other PSSM-based methods for prediction of protein structural classes for low-similarity sequences.
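To make the autocovariance transformation concrete, here is a minimal sketch of the commonly used definition: for each PSSM column, average the product of mean-centred scores at positions separated by a fixed lag. This is an assumption about the general technique, not the segmented variant or the exact normalisation used in the paper. The function name and the toy matrix are hypothetical.

```python
def autocovariance(pssm, lag):
    """Autocovariance of each PSSM column at the given lag.

    pssm is a list of rows (one per residue); each row holds one score
    per column (20 columns in a real PSSM, 2 in the toy example below).
    """
    length = len(pssm)
    if not 0 < lag < length:
        raise ValueError("lag must be between 1 and sequence length - 1")
    n_cols = len(pssm[0])
    means = [sum(row[j] for row in pssm) / length for j in range(n_cols)]
    features = []
    for j in range(n_cols):
        total = sum(
            (pssm[i][j] - means[j]) * (pssm[i + lag][j] - means[j])
            for i in range(length - lag)
        )
        features.append(total / (length - lag))
    return features

# Toy 4-residue, 2-column matrix: column 0 alternates, column 1 is constant.
toy_pssm = [[1.0, 0.0], [3.0, 0.0], [1.0, 0.0], [3.0, 0.0]]
ac_lag1 = autocovariance(toy_pssm, lag=1)
```

The alternating column gives a negative lag-1 value and the constant column gives zero, which is the kind of sequence-order signal these features are meant to capture.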
Chaudhary, Nitika; Sandhu, Padmani; Ahmed, Mushtaq; Akhter, Yusuf
2017-02-01
Trichothecenes are the sesquiterpenes secreted by Trichoderma spp. residing in the rhizosphere. These compounds have been reported to act as plant growth promoters and bio-control agents. The structural knowledge for the transporter proteins of their efflux remained limited. In this study, the three-dimensional structure of the Thmfs1 protein, a trichothecene transporter from Trichoderma harzianum, was homology modelled and further Molecular Dynamics (MD) simulations were used to decipher its mechanism. Fourteen transmembrane helices of the Thmfs1 protein are observed contributing to an inward-open conformation. The transport channel and ligand binding sites in Thmfs1 are identified based on a heuristic, iterative algorithm and structural alignment with homologous proteins. MD simulations were performed to reveal the differential structural behaviour occurring in the ligand free and ligand bound forms. We found that two discrete trichothecene binding sites are located on either side of the central transport tunnel running from the cytoplasmic side to the extracellular side across the Thmfs1 protein. Detailed analysis of the MD trajectories showed an alternative access mechanism between N and C-terminal domains contributing to its function. These results also demonstrate that the transport of trichodermin occurs via a hopping mechanism in which the substrate molecule jumps from one binding site to another lining the transport tunnel. Copyright © 2016 Elsevier B.V. All rights reserved.
Al Nasr, Kamal; Ranjan, Desh; Zubair, Mohammad; Chen, Lin; He, Jing
2014-01-01
Electron cryomicroscopy is becoming a major experimental technique in solving the structures of large molecular assemblies. More and more three-dimensional images have been obtained at the medium resolutions between 5 and 10 Å. At this resolution range, major α-helices can be detected as cylindrical sticks and β-sheets can be detected as plane-like regions. A critical question in de novo modeling from cryo-EM images is to determine the match between the detected secondary structures from the image and those on the protein sequence. We formulate this matching problem into a constrained graph problem and present an O(Δ^2 N^2 2^N) algorithm to this NP-Hard problem. The algorithm incorporates the dynamic programming approach into a constrained K-shortest path algorithm. Our method, DP-TOSS, has been tested using α-proteins with maximum 33 helices and α-β proteins up to five helices and 12 β-strands. The correct match was ranked within the top 35 for 19 of the 20 α-proteins and all nine α-β proteins tested. The results demonstrate that DP-TOSS improves accuracy, time and memory space in deriving the topologies of the secondary structure elements for proteins with a large number of secondary structures and a complex skeleton.
Structural bioinformatics: methods, concepts and applications to blood coagulation proteins.
Villoutreix, Bruno O
2002-06-01
Structural and theoretical analyses of proteins are central to the understanding of complex molecular mechanisms and are fundamental to the drug discovery process. Computational techniques yield useful insights into an ever-wider range of biomolecular systems. Protein three-dimensional structures and molecular functions can be predicted in some circumstances, while experimental structures can be analyzed in depth via such computational approaches. Non-covalent binding of biomolecules can be understood by considering structural, thermodynamic and kinetic issues, and theoretical simulations of such events can be attempted. The central role of electrostatic interactions with regard to protein function, structure and stability has been investigated and some electrostatic properties can be modeled theoretically. Computer methods thus help to prioritize, design, analyze and rationalize biochemical experiments. Cardiovascular diseases and associated blood coagulation disorders are leading causes of death worldwide. Blood coagulation involves more than 30 proteins that interact specifically with various degrees of affinity. Many of these molecules can also bind transiently to phospholipid surfaces. Numerous point mutations in the genes of coagulation proteins and regulators have been identified. Understanding the coagulation cascade, its regulation and the impact of mutations is required for the development of new therapies and diagnostic tools. In this review, we describe concepts and methods pertaining to the field of structural bioinformatics. We provide examples of applications of these approaches to blood coagulation proteins and show that such studies can give insights about molecular mechanisms contributing to cardiovascular disease susceptibility.
A review on the effects of supercritical carbon dioxide on enzyme activity.
Wimmer, Zdenek; Zarevúcka, Marie
2010-01-19
Different types of enzymes such as lipases, several phosphatases, dehydrogenases, oxidases, amylases and others are well suited for the reactions in SC-CO2. The stability and the activity of enzymes exposed to carbon dioxide under high pressure depend on enzyme species, water content in the solution and on the pressure and temperature of the reaction system. The three-dimensional structure of enzymes may be significantly altered under extreme conditions, causing their denaturation and consequent loss of activity. If the conditions are less adverse, the protein structure may be largely retained. Minor structural changes may induce an alternative active protein state with altered enzyme activity, specificity and stability.
A Review on the Effects of Supercritical Carbon Dioxide on Enzyme Activity
Wimmer, Zdeněk; Zarevúcka, Marie
2010-01-01
Different types of enzymes such as lipases, several phosphatases, dehydrogenases, oxidases, amylases and others are well suited for the reactions in SC-CO2. The stability and the activity of enzymes exposed to carbon dioxide under high pressure depend on enzyme species, water content in the solution and on the pressure and temperature of the reaction system. The three-dimensional structure of enzymes may be significantly altered under extreme conditions, causing their denaturation and consequent loss of activity. If the conditions are less adverse, the protein structure may be largely retained. Minor structural changes may induce an alternative active protein state with altered enzyme activity, specificity and stability. PMID:20162013
Schlessinger, J
1994-02-01
SH2 and SH3 domains are small protein modules that mediate protein-protein interactions in signal transduction pathways that are activated by protein tyrosine kinases. SH2 domains bind to short phosphotyrosine-containing sequences in growth factor receptors and other phosphoproteins. SH3 domains bind to target proteins through sequences containing proline and hydrophobic amino acids. SH2 and SH3 domain containing proteins, such as Grb2 and phospholipase C gamma, utilize these modules in order to link receptor and cytoplasmic protein tyrosine kinases to the Ras signaling pathway and to phosphatidylinositol hydrolysis, respectively. The three-dimensional structures of several SH2 and SH3 domains have been determined by NMR and X-ray crystallography, and the molecular basis of their specificity is beginning to be unveiled.
Syntactic structures in languages and biology.
Horn, David
2008-08-01
Both natural languages and cell biology make use of one-dimensional encryption. Their investigation calls for syntactic deciphering of the text and semantic understanding of the resulting structures. Here we discuss recently published algorithms that allow for such searches: automatic distillation of structure (ADIOS) that is successful in discovering syntactic structures in linguistic texts and its motif extraction (MEX) component that can be used for uncovering motifs in DNA and protein sequences. The underlying principles of these syntactic algorithms and some of their results will be described.
Waiwijit, Uraiwan; Maturos, Thitima; Pakapongpan, Saithip; Phokharatkul, Ditsayut; Wisitsoraat, Anurat; Tuantranont, Adisorn
2016-08-01
Recently, three-dimensional graphene interconnected network has attracted great interest as a scaffold structure for tissue engineering due to its high biocompatibility, high electrical conductivity, high specific surface area and high porosity. However, free-standing three-dimensional graphene exhibits poor flexibility and stability due to ease of disintegration during processing. In this work, three-dimensional graphene is composited with polydimethylsiloxane to improve the structural flexibility and stability by a new simple two-step process comprising dip coating of polydimethylsiloxane on chemical vapor deposited graphene/Ni foam and wet etching of nickel foam. Structural characterizations confirmed an interconnected three-dimensional multi-layer graphene structure with thin polydimethylsiloxane scaffold. The composite was employed as a substrate for culture of L929 fibroblast cells and its cytocompatibility was evaluated by cell viability (Alamar blue assay), reactive oxygen species production and vinculin immunofluorescence imaging. The result revealed that cell viability on three-dimensional graphene/polydimethylsiloxane composite increased with increasing culture time and was slightly different from a polystyrene substrate (control). Moreover, cells cultured on three-dimensional graphene/polydimethylsiloxane composite generated less ROS than the control at culture times of 3-6 h. The results of immunofluorescence staining demonstrated that fibroblast cells expressed adhesion protein (vinculin) and adhered well on three-dimensional graphene/polydimethylsiloxane surface. Good cell adhesion could be attributed to suitable surface properties of three-dimensional graphene/polydimethylsiloxane with moderate contact angle and small negative zeta potential in culture solution. 
The results of electrochemical study by cyclic voltammetry showed that an oxidation current signal with no apparent peak was induced by fibroblast cells and the oxidation current at an oxidation potential of +0.9 V increased linearly with increasing cell number. Therefore, the three-dimensional graphene/polydimethylsiloxane composite exhibits high cytocompatibility and can potentially be used as a conductive substrate for cell-based electrochemical sensing. © The Author(s) 2016.
Wang, Shunfang; Liu, Shuhui
2015-12-19
An effective representation of a protein sequence plays a crucial role in protein sub-nuclear localization. The existing representations, such as dipeptide composition (DipC), pseudo-amino acid composition (PseAAC) and position specific scoring matrix (PSSM), are insufficient to represent protein sequence due to their single perspectives. Thus, this paper proposes two fusion feature representations of DipPSSM and PseAAPSSM to integrate PSSM with DipC and PseAAC, respectively. When constructing each fusion representation, we introduce the balance factors to value the importance of its components. The optimal values of the balance factors are sought by genetic algorithm. Due to the high dimensionality of the proposed representations, linear discriminant analysis (LDA) is used to find its important low dimensional structure, which is essential for classification and location prediction. The numerical experiments on two public datasets with KNN classifier and cross-validation tests showed that in terms of the common indexes of sensitivity, specificity, accuracy and MCC, the proposed fusing representations outperform the traditional representations in protein sub-nuclear localization, and the representation treated by LDA outperforms the untreated one.
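The dipeptide composition (DipC) mentioned above is the 400-dimensional vector of frequencies of adjacent residue pairs. The sketch below computes it and then shows one plausible way to fuse two feature vectors with a balance factor. The weighted concatenation is an assumption, since the abstract does not give the exact fusion formula, and all names here are hypothetical.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]  # 400 pairs

def dipeptide_composition(sequence):
    """Frequency of each of the 400 adjacent residue pairs in the sequence."""
    counts = dict.fromkeys(DIPEPTIDES, 0)
    total = 0
    for first, second in zip(sequence, sequence[1:]):
        pair = first + second
        if pair in counts:  # skip pairs containing non-standard residues
            counts[pair] += 1
            total += 1
    return [counts[d] / total if total else 0.0 for d in DIPEPTIDES]

def fuse(features_a, features_b, balance):
    """Assumed fusion: weight the two blocks by balance and 1 - balance,
    then concatenate them into a single vector."""
    return [balance * x for x in features_a] + [(1 - balance) * x for x in features_b]

dipc = dipeptide_composition("ACACA")      # pairs: AC, CA, AC, CA
fused = fuse([1.0, 2.0], [4.0], balance=0.25)
```

In a setting like the one described, the balance value would be a tunable parameter, which is where a search procedure such as a genetic algorithm could come in.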
SLLE for predicting membrane protein types.
Wang, Meng; Yang, Jie; Xu, Zhi-Jie; Chou, Kuo-Chen
2005-01-07
Introduction of the concept of pseudo amino acid composition (PROTEINS: Structure, Function, and Genetics 43 (2001) 246; Erratum: ibid. 44 (2001) 60) has made it possible to incorporate a considerable amount of sequence-order effects by representing a protein sample in terms of a set of discrete numbers, and hence can significantly enhance the prediction quality of membrane protein type. As a continuous effort along such a line, the Supervised Locally Linear Embedding (SLLE) technique for nonlinear dimensionality reduction is introduced (Science 22 (2000) 2323). The advantage of using SLLE is that it can reduce the operational space by extracting the essential features from the high-dimensional pseudo amino acid composition space, and that the cluster-tolerant capacity can be increased accordingly. As a consequence by combining these two approaches, high success rates have been observed during the tests of self-consistency, jackknife and independent data set, respectively, by using the simplest nearest neighbour classifier. The current approach represents a new strategy to deal with the problems of protein attribute prediction, and hence may become a useful vehicle in the area of bioinformatics and proteomics.
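The jackknife test with a nearest neighbour classifier mentioned above can be sketched in a few lines: each sample is held out in turn and labelled by its closest remaining sample. This is a generic illustration with Euclidean distance on toy two-dimensional points. It does not include the SLLE embedding step, and the data and names are hypothetical.

```python
import math

def nearest_neighbour_label(query, samples, labels):
    """Label of the sample closest to query (Euclidean distance)."""
    best = min(range(len(samples)), key=lambda i: math.dist(query, samples[i]))
    return labels[best]

def jackknife_accuracy(samples, labels):
    """Leave each sample out once and predict it from all the others."""
    correct = 0
    for i in range(len(samples)):
        rest_x = samples[:i] + samples[i + 1:]
        rest_y = labels[:i] + labels[i + 1:]
        if nearest_neighbour_label(samples[i], rest_x, rest_y) == labels[i]:
            correct += 1
    return correct / len(samples)

# Toy feature vectors; the last point lies closer to the "B" cluster.
toy_x = [(0.0, 0.0), (0.1, 0.0), (1.0, 1.0), (1.1, 1.0), (0.5, 0.6)]
toy_y = ["A", "A", "B", "B", "A"]
toy_accuracy = jackknife_accuracy(toy_x, toy_y)
```

Four of the five toy points are recovered correctly, so the jackknife success rate in this example is 0.8.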
Wang, Shunfang; Liu, Shuhui
2015-01-01
An effective representation of a protein sequence plays a crucial role in protein sub-nuclear localization. The existing representations, such as dipeptide composition (DipC), pseudo-amino acid composition (PseAAC) and position specific scoring matrix (PSSM), are insufficient to represent protein sequence due to their single perspectives. Thus, this paper proposes two fusion feature representations of DipPSSM and PseAAPSSM to integrate PSSM with DipC and PseAAC, respectively. When constructing each fusion representation, we introduce the balance factors to value the importance of its components. The optimal values of the balance factors are sought by genetic algorithm. Due to the high dimensionality of the proposed representations, linear discriminant analysis (LDA) is used to find its important low dimensional structure, which is essential for classification and location prediction. The numerical experiments on two public datasets with KNN classifier and cross-validation tests showed that in terms of the common indexes of sensitivity, specificity, accuracy and MCC, the proposed fusing representations outperform the traditional representations in protein sub-nuclear localization, and the representation treated by LDA outperforms the untreated one. PMID:26703574
The Proteome Folding Project: Proteome-scale prediction of structure and function
Drew, Kevin; Winters, Patrick; Butterfoss, Glenn L.; Berstis, Viktors; Uplinger, Keith; Armstrong, Jonathan; Riffle, Michael; Schweighofer, Erik; Bovermann, Bill; Goodlett, David R.; Davis, Trisha N.; Shasha, Dennis; Malmström, Lars; Bonneau, Richard
2011-01-01
The incompleteness of proteome structure and function annotation is a critical problem for biologists and, in particular, severely limits interpretation of high-throughput and next-generation experiments. We have developed a proteome annotation pipeline based on structure prediction, where function and structure annotations are generated using an integration of sequence comparison, fold recognition, and grid-computing-enabled de novo structure prediction. We predict protein domain boundaries and three-dimensional (3D) structures for protein domains from 94 genomes (including human, Arabidopsis, rice, mouse, fly, yeast, Escherichia coli, and worm). De novo structure predictions were distributed on a grid of more than 1.5 million CPUs worldwide (World Community Grid). We generated significant numbers of new confident fold annotations (9% of domains that are otherwise unannotated in these genomes). We demonstrate that predicted structures can be combined with annotations from the Gene Ontology database to predict new and more specific molecular functions. PMID:21824995
Pesavento, Joseph B.; Billingsley, Angela M.; Roberts, Ed J.; Ramig, Robert F.; Prasad, B. V. Venkataram
2003-01-01
Numerous prior studies have indicated that viable rotavirus reassortants containing structural proteins of heterologous parental origin may express unexpected phenotypes, such as changes in infectivity and immunogenicity. To provide a structural basis for alterations in phenotypic expression, a three-dimensional structural analysis of these reassortants was conducted. The structures of the reassortants show that while VP4 generally maintains the parental structure when moved to a heterologous protein background, in certain reassortants, there are subtle alterations in the conformation of VP4. The alterations in VP4 conformation correlated with expression of unexpected VP4-associated phenotypes. Interactions between heterologous VP4 and VP7 in reassortants expressing unexpected phenotypes appeared to induce the conformational alterations seen in VP4. PMID:12584352
Solution Model of the Intrinsically Disordered Polyglutamine Tract-Binding Protein-1
Rees, Martin; Gorba, Christian; de Chiara, Cesira; Bui, Tam T.T.; Garcia-Maya, Mitla; Drake, Alex F.; Okazawa, Hitoshi; Pastore, Annalisa; Svergun, Dmitri; Chen, Yu Wai
2012-01-01
Polyglutamine tract-binding protein-1 (PQBP-1) is a 265-residue nuclear protein that is involved in transcriptional regulation. In addition to its role in the molecular pathology of the polyglutamine expansion diseases, mutations of the protein are associated with X-linked mental retardation. PQBP-1 binds specifically to glutamine repeat sequences and proline-rich regions, and interacts with RNA polymerase II and the spliceosomal protein U5-15kD. In this work, we obtained a biophysical characterization of this protein by employing complementary structural methods. PQBP-1 is shown to be a moderately compact but largely disordered molecule with an elongated shape, having a Stokes radius of 3.7 nm and a maximum molecular dimension of 13 nm. The protein is monomeric in solution, has residual β-structure, and is in a premolten globule state that is unaffected by natural osmolytes. Using small-angle x-ray scattering data, we were able to generate a low-resolution, three-dimensional model of PQBP-1. PMID:22500761
Feltes, Bruno César; Bonatto, Diego
2015-01-01
The xeroderma pigmentosum complementation group proteins (XPs), which include XPA through XPG, play a critical role in coordinating and promoting global genome and transcription-coupled nucleotide excision repair (GG-NER and TC-NER, respectively) pathways in eukaryotic cells. GG-NER and TC-NER are both required for the repair of bulky DNA lesions, such as those induced by UV radiation. Mutations in genes that encode XPs lead to the clinical condition xeroderma pigmentosum (XP). Although the roles of XPs in the GG-NER/TC-NER subpathways have been extensively studied, complete knowledge of their three-dimensional structure is only beginning to emerge. Hence, this review aims to summarize the current knowledge of mapped mutations and other structural information on XP proteins that influence their function and protein-protein interactions. We also review the possible post-translational modifications for each protein and the impact of these modifications on XP protein functions. Copyright © 2014 Elsevier B.V. All rights reserved.
Wang, Ruiwu; Chen, Wenqian; Cai, Shitian; Zhang, Jing; Bolstad, Jeff; Wagenknecht, Terence; Liu, Zheng; Chen, S. R. Wayne
2009-01-01
A region between residues 414 and 466 in the cardiac ryanodine receptor (RyR2) harbors more than half of the known NH2-terminal mutations associated with cardiac arrhythmias and sudden death. To gain insight into the structural basis of this NH2-terminal mutation hotspot, we have determined its location in the three-dimensional structure of RyR2. Green fluorescent protein (GFP), used as a structural marker, was inserted into the middle of this mutation hotspot after Ser-437 in the RyR2 sequence. The resultant GFP-RyR2 fusion protein, RyR2S437-GFP, was expressed in HEK293 cells and characterized using Ca2+ release, [3H]ryanodine binding, and single cell Ca2+ imaging studies. These functional analyses revealed that RyR2S437-GFP forms a caffeine- and ryanodine-sensitive Ca2+ release channel that possesses Ca2+- and caffeine-dependence of activation indistinguishable from that of wild type (wt) RyR2. HEK293 cells expressing RyR2S437-GFP displayed a propensity for store-overload induced Ca2+ release similar to that in cells expressing RyR2-wt. The three-dimensional structure of the purified RyR2S437-GFP was reconstructed using cryo-electron microscopy and single particle image processing. Subtraction of the three-dimensional reconstructions of RyR2-wt and RyR2S437-GFP revealed the location of the inserted GFP, and hence the NH2-terminal mutation hotspot, in a region between domains 5 and 9 in the clamp-shaped structure. This location is close to a previously mapped central disease-causing mutation site located in a region between domains 5 and 6. These results, together with findings from previous studies, suggest that the proposed interactions between the NH2-terminal and central regions of RyR2 are likely to take place between domains 5 and 6, and that the clamp-shaped structure, which shows substantial conformational differences between the closed and open states, is highly susceptible to disease-causing mutations. PMID:17452324
Yamaguchi, Akihiro; Go, Mitiko
2006-01-01
We have been developing FAMSBASE, a protein homology-modeling database of whole ORFs predicted from genome sequences. The latest update of FAMSBASE (http://daisy.nagahama-i-bio.ac.jp/Famsbase/), which is based on the protein three-dimensional (3D) structures released by November 2003, contains modeled 3D structures for 368,724 open reading frames (ORFs) derived from genomes of 276 species, namely 17 archaebacterial, 130 eubacterial, 18 eukaryotic and 111 phage genomes. Those 276 genomes are predicted to have 734,193 ORFs in total, and the current FAMSBASE contains protein 3D structures for approximately 50% of the ORF products. However, cases in which a modeled 3D structure covers the whole of an ORF product are rare. When the portion of an ORF covered by a 3D structure is compared across the three kingdoms of life, approximately 60% of the ORFs in archaebacteria and eubacteria have modeled 3D structures covering almost the entire amino acid sequence, whereas the percentage falls to about 30% in eukaryotes. When annual differences in the number of ORFs with modeled 3D structures are calculated, the fraction of modeled 3D structures of soluble proteins has increased by 5% for archaebacteria and by 7% for eubacteria in the last 3 years. Assuming that this rate is maintained and that determination of 3D structures for predicted disordered regions is unattainable, whole soluble protein model structures of prokaryotes without the putative disordered regions will be in hand within 15 years. For eukaryotic proteins, they will be in hand within 25 years. The 3D structures we will have at those times are not the 3D structures of the entire proteins encoded in single ORFs, but the 3D structures of separate structural domains. Measuring or predicting spatial arrangements of structural domains in an ORF will then be a coming issue of structural genomics. PMID:17146617
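The headline numbers in this abstract can be cross-checked with simple arithmetic. The short sketch below uses only the figures quoted in the text; the variable names are illustrative.

```python
# Figures quoted in the abstract.
modelled_orfs = 368_724
total_orfs = 734_193
genome_counts = {
    "archaebacterial": 17,
    "eubacterial": 130,
    "eukaryotic": 18,
    "phage": 111,
}

# Fraction of predicted ORFs that have at least a partial 3D model.
coverage = modelled_orfs / total_orfs

# The per-kingdom genome counts should add up to the stated 276 species.
total_genomes = sum(genome_counts.values())

print(f"coverage = {coverage:.1%}, genomes = {total_genomes}")
```

The ratio comes out at just over one half, consistent with the "approximately 50%" stated in the abstract, and the genome counts sum to 276.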
Lou, Yan-Ru; Kanninen, Liisa; Kaehr, Bryan; ...
2015-09-01
Three-dimensional (3D) cell cultures produce more in vivo-like multicellular structures such as spheroids that cannot be obtained in two-dimensional (2D) cell cultures. Thus, they are increasingly employed as models for cancer and drug research, as well as tissue engineering. It has proven challenging to stabilize spheroid architectures for detailed morphological examination. Here we overcome this issue using a silica bioreplication (SBR) process employed on spheroids formed from human pluripotent stem cells (hPSCs) and hepatocellular carcinoma HepG2 cells cultured in the nanofibrillar cellulose (NFC) hydrogel. The cells in the spheroids are more round and tightly interacting with each other than those in 2D cultures, and they develop microvilli-like structures on the cell membranes as seen in 2D cultures. Furthermore, SBR preserves extracellular matrix-like materials and cellular proteins. In conclusion, these findings provide the first evidence of intact hPSC spheroid architectures and similar fine structures to 2D-cultured cells, providing a pathway to enable our understanding of morphogenesis in 3D cultures.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lou, Yan-Ru; Kanninen, Liisa; Kaehr, Bryan
Three-dimensional (3D) cell cultures produce more in vivo-like multicellular structures such as spheroids that cannot be obtained in two-dimensional (2D) cell cultures. Thus, they are increasingly employed as models for cancer and drug research, as well as tissue engineering. It has proven challenging to stabilize spheroid architectures for detailed morphological examination. Here we overcome this issue using a silica bioreplication (SBR) process employed on spheroids formed from human pluripotent stem cells (hPSCs) and hepatocellular carcinoma HepG2 cells cultured in the nanofibrillar cellulose (NFC) hydrogel. The cells in the spheroids are more round and tightly interacting with each other than those in 2D cultures, and they develop microvilli-like structures on the cell membranes as seen in 2D cultures. Furthermore, SBR preserves extracellular matrix-like materials and cellular proteins. In conclusion, these findings provide the first evidence of intact hPSC spheroid architectures and similar fine structures to 2D-cultured cells, providing a pathway to enable our understanding of morphogenesis in 3D cultures.
Zhang, Lei; Lei, Dongsheng; Smith, Jessica M.; ...
2016-03-30
DNA base pairing has been used for many years to direct the arrangement of inorganic nanocrystals into small groupings and arrays with tailored optical and electrical properties. The control of DNA-mediated assembly depends crucially on a better understanding of three-dimensional structure of DNA-nanocrystal-hybridized building blocks. Existing techniques do not allow for structural determination of these flexible and heterogeneous samples. Here we report cryo-electron microscopy and negative-staining electron tomography approaches to image, and three-dimensionally reconstruct a single DNA-nanogold conjugate, an 84-bp double-stranded DNA with two 5-nm nanogold particles for potential substrates in plasmon-coupling experiments. By individual-particle electron tomography reconstruction, we obtain 14 density maps at ~2-nm resolution. Using these maps as constraints, we derive 14 conformations of dsDNA by molecular dynamics simulations. The conformational variation is consistent with that from liquid solution, suggesting that individual-particle electron tomography could be an expected approach to study DNA-assembling and flexible protein structure and dynamics.
Lou, Yan-Ru; Kanninen, Liisa; Kaehr, Bryan; Townson, Jason L; Niklander, Johanna; Harjumäki, Riina; Jeffrey Brinker, C; Yliperttula, Marjo
2015-09-01
Three-dimensional (3D) cell cultures produce more in vivo-like multicellular structures such as spheroids that cannot be obtained in two-dimensional (2D) cell cultures. Thus, they are increasingly employed as models for cancer and drug research, as well as tissue engineering. It has proven challenging to stabilize spheroid architectures for detailed morphological examination. Here we overcome this issue using a silica bioreplication (SBR) process employed on spheroids formed from human pluripotent stem cells (hPSCs) and hepatocellular carcinoma HepG2 cells cultured in the nanofibrillar cellulose (NFC) hydrogel. The cells in the spheroids are more round and tightly interacting with each other than those in 2D cultures, and they develop microvilli-like structures on the cell membranes as seen in 2D cultures. Furthermore, SBR preserves extracellular matrix-like materials and cellular proteins. These findings provide the first evidence of intact hPSC spheroid architectures and similar fine structures to 2D-cultured cells, providing a pathway to enable our understanding of morphogenesis in 3D cultures.
Worldwide Protein Data Bank validation information: usage and trends.
Smart, Oliver S; Horský, Vladimír; Gore, Swanand; Svobodová Vařeková, Radka; Bendová, Veronika; Kleywegt, Gerard J; Velankar, Sameer
2018-03-01
Realising the importance of assessing the quality of the biomolecular structures deposited in the Protein Data Bank (PDB), the Worldwide Protein Data Bank (wwPDB) partners established Validation Task Forces to obtain advice on the methods and standards to be used to validate structures determined by X-ray crystallography, nuclear magnetic resonance spectroscopy and three-dimensional electron cryo-microscopy. The resulting wwPDB validation pipeline is an integral part of the wwPDB OneDep deposition, biocuration and validation system. The wwPDB Validation Service webserver (https://validate.wwpdb.org) can be used to perform checks prior to deposition. Here, it is shown how validation metrics can be combined to produce an overall score that allows the ranking of macromolecular structures and domains in search results. The ValTrendsDB database provides users with a convenient way to access and analyse validation information and other properties of X-ray crystal structures in the PDB, including investigating trends in and correlations between different structure properties and validation metrics.
Worldwide Protein Data Bank validation information: usage and trends
Horský, Vladimír; Gore, Swanand; Svobodová Vařeková, Radka; Bendová, Veronika
2018-01-01
Realising the importance of assessing the quality of the biomolecular structures deposited in the Protein Data Bank (PDB), the Worldwide Protein Data Bank (wwPDB) partners established Validation Task Forces to obtain advice on the methods and standards to be used to validate structures determined by X-ray crystallography, nuclear magnetic resonance spectroscopy and three-dimensional electron cryo-microscopy. The resulting wwPDB validation pipeline is an integral part of the wwPDB OneDep deposition, biocuration and validation system. The wwPDB Validation Service webserver (https://validate.wwpdb.org) can be used to perform checks prior to deposition. Here, it is shown how validation metrics can be combined to produce an overall score that allows the ranking of macromolecular structures and domains in search results. The ValTrendsDB database provides users with a convenient way to access and analyse validation information and other properties of X-ray crystal structures in the PDB, including investigating trends in and correlations between different structure properties and validation metrics. PMID:29533231
Jo, Sunhwan; Song, Kevin C.; Desaire, Heather; MacKerell, Alexander D.; Im, Wonpil
2011-01-01
Understanding how glycosylation affects protein structure, dynamics, and function is an emerging and challenging problem in biology. As a first step toward glycan modeling in the context of structural glycobiology, we have developed Glycan Reader and integrated it into the CHARMM-GUI, http://www.charmm-gui.org/input/glycan. Glycan Reader greatly simplifies the reading of PDB structure files containing glycans through (i) detection of carbohydrate molecules, (ii) automatic annotation of carbohydrates based on their three-dimensional structures, (iii) recognition of glycosidic linkages between carbohydrates as well as N-/O-glycosidic linkages to proteins, and (iv) generation of inputs for the biomolecular simulation program CHARMM with the proper glycosidic linkage setup. In addition, Glycan Reader is linked to other functional modules in CHARMM-GUI, allowing users to easily generate carbohydrate or glycoprotein molecular simulation systems in solution or membrane environments and visualize the electrostatic potential on glycoprotein surfaces. These tools are useful for studying the impact of glycosylation on protein structure and dynamics. PMID:21815173
Adaptive Covariation between the Coat and Movement Proteins of Prunus Necrotic Ringspot Virus
Codoñer, Francisco M.; Fares, Mario A.; Elena, Santiago F.
2006-01-01
The relative functional and/or structural importance of different amino acid sites in a protein can be assessed by evaluating the selective constraints to which they have been subjected during the course of evolution. Here we explore such constraints at the linear and three-dimensional levels for the movement protein (MP) and coat protein (CP) encoded by RNA 3 of prunus necrotic ringspot ilarvirus (PNRSV). By a maximum-parsimony approach, the nucleotide sequences from 46 isolates of PNRSV varying in symptomatology, host tree, and geographic origin have been analyzed and sites under different selective pressures have been identified in both proteins. We have also performed covariation analyses to explore whether changes in certain amino acid sites condition subsequent variation in other sites of the same protein or the other protein. These covariation analyses shed light on which particular amino acids should be involved in the physical and functional interaction between MP and CP. Finally, we discuss these findings in the light of what is already known about the implication of certain sites and domains in structure and protein-protein and RNA-protein interactions. PMID:16731922
Adaptive covariation between the coat and movement proteins of prunus necrotic ringspot virus.
Codoñer, Francisco M; Fares, Mario A; Elena, Santiago F
2006-06-01
The relative functional and/or structural importance of different amino acid sites in a protein can be assessed by evaluating the selective constraints to which they have been subjected during the course of evolution. Here we explore such constraints at the linear and three-dimensional levels for the movement protein (MP) and coat protein (CP) encoded by RNA 3 of prunus necrotic ringspot ilarvirus (PNRSV). By a maximum-parsimony approach, the nucleotide sequences from 46 isolates of PNRSV varying in symptomatology, host tree, and geographic origin have been analyzed and sites under different selective pressures have been identified in both proteins. We have also performed covariation analyses to explore whether changes in certain amino acid sites condition subsequent variation in other sites of the same protein or the other protein. These covariation analyses shed light on which particular amino acids should be involved in the physical and functional interaction between MP and CP. Finally, we discuss these findings in the light of what is already known about the implication of certain sites and domains in structure and protein-protein and RNA-protein interactions.
Nagpal, Suhani; Tiwari, Satyam; Mapa, Koyeli; Thukral, Lipi
2015-01-01
Many proteins comprising complex topologies require molecular chaperones to achieve their unique three-dimensional folded structure. The E. coli chaperone GroEL binds a large number of unfolded and partially folded proteins to facilitate proper folding and prevent misfolding and aggregation. Although the major structural components of GroEL are well defined, scaffolds of the non-native substrates that determine chaperone-mediated folding have been difficult to recognize. Here we performed all-atomistic and replica-exchange molecular dynamics simulations to dissect the non-native ensemble of an obligate GroEL folder, DapA. Thermodynamic analyses of unfolding simulations revealed populated intermediates with distinct structural characteristics. We found that surface-exposed hydrophobic patches are significantly increased, primarily contributed by native and non-native β-sheet elements. We validate the structural properties of these conformers using experimental data, including circular dichroism (CD), 1-anilinonaphthalene-8-sulfonic acid (ANS) binding measurements and previously reported hydrogen-deuterium exchange coupled to mass spectrometry (HDX-MS). Further, we constructed network graphs to elucidate long-range intra-protein connectivity of native and intermediate topologies, demonstrating regions that serve as central "hubs". Overall, our results imply that genomic variations (or mutations) in the distinct regions of protein structures might disrupt these topological signatures, disabling chaperone-mediated folding and leading to formation of aggregates.
NASA Astrophysics Data System (ADS)
Krokhotin, Andrey; Dokholyan, Nikolay V.
2017-07-01
Most proteins fold into unique three-dimensional (3D) structures that determine their biological functions, such as catalytic activity or macromolecular binding. Misfolded proteins can pose a threat through aberrant interactions with other proteins, leading to a number of diseases including Alzheimer's disease, Parkinson's disease, and amyotrophic lateral sclerosis [1,2]. What determines the 3D structure of proteins? The first clue to this question came more than fifty years ago when Anfinsen demonstrated that unfolded proteins can spontaneously fold to their native 3D structures [3,4]. Anfinsen's experiments led to the conclusion that proteins fold to a unique native structure corresponding to the stable and kinetically accessible free energy minimum, and that a protein's native structure is solely determined by its amino acid sequence. The question of how exactly proteins find their free energy minimum proved to be a difficult problem. One of the puzzles, initially pointed out by Levinthal, was an inconsistency between observed protein folding times and theoretical estimates. A self-avoiding polymer model of a globular protein of 100-residue length on a cubic lattice can sample at least 10^47 states. Based on the assumption that conformational sampling occurs at the highest vibrational mode of proteins (∼picoseconds), the folding time predicted by searching among all the possible conformations is ∼10^27 years (much larger than the age of the universe) [5]. In contrast, observed protein folding times range from microseconds to minutes. Owing to the tremendous theoretical progress achieved in the protein folding field in past decades, the source of this inconsistency is now understood, and it is thoroughly described in the review by Finkelstein et al. [6].
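The Levinthal estimate in this abstract can be checked with a few lines of arithmetic. The sketch below reads the abstract's state count as 10^47 and its folding-time estimate as a power of ten; it assumes exactly one conformation is sampled per picosecond and a 365.25-day year. Both values are illustrative assumptions, since the abstract only says "∼picoseconds".

```python
# Back-of-envelope check of the Levinthal estimate quoted in the abstract.
# Assumptions (not from the source): exactly 1 ps per sampled conformation,
# and a 365.25-day year.
N_STATES = 1e47                       # lattice-model state count from the abstract
SAMPLING_TIME_S = 1e-12               # assumed time per conformation (1 ps)
SECONDS_PER_YEAR = 365.25 * 24 * 3600


def exhaustive_search_years(n_states, seconds_per_state):
    """Time in years needed to visit every conformation once."""
    return n_states * seconds_per_state / SECONDS_PER_YEAR


years = exhaustive_search_years(N_STATES, SAMPLING_TIME_S)
print(f"{years:.1e} years")  # prints "3.2e+27 years"
```

Under these assumptions the result is a few times 10^27 years, which matches the order of magnitude given in the abstract.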
Comparison of intrinsic dynamics of cytochrome p450 proteins using normal mode analysis
Dorner, Mariah E; McMunn, Ryan D; Bartholow, Thomas G; Calhoon, Brecken E; Conlon, Michelle R; Dulli, Jessica M; Fehling, Samuel C; Fisher, Cody R; Hodgson, Shane W; Keenan, Shawn W; Kruger, Alyssa N; Mabin, Justin W; Mazula, Daniel L; Monte, Christopher A; Olthafer, Augustus; Sexton, Ashley E; Soderholm, Beatrice R; Strom, Alexander M; Hati, Sanchita
2015-01-01
Cytochrome P450 enzymes are hemeproteins that catalyze the monooxygenation of a wide range of structurally diverse substrates of endogenous and exogenous origin. These heme monooxygenases receive electrons from NADH/NADPH via electron transfer proteins. The cytochrome P450 enzymes, which constitute a diverse superfamily of more than 8,700 proteins, share a common tertiary fold but < 25% sequence identity. Based on their electron transfer protein partner, cytochrome P450 proteins are classified into six broad classes. Traditional methods of pro are based on the canonical paradigm that attributes proteins' function to their three-dimensional structure, which is determined by their primary structure, that is, the amino acid sequence. It is increasingly recognized that protein dynamics play an important role in molecular recognition and catalytic activity. As the mobility of a protein is an intrinsic property that is encrypted in its primary structure, we examined whether different classes of cytochrome P450 enzymes display any unique patterns of intrinsic mobility. Normal mode analysis was performed to characterize the intrinsic dynamics of five classes of cytochrome P450 proteins. The present study revealed that cytochrome P450 enzymes share a strong dynamic similarity (root mean squared inner product > 55% and Bhattacharyya coefficient > 80%), despite the low sequence identity (< 25%) and sequence similarity (< 50%) across the cytochrome P450 superfamily. Noticeable differences in Cα atom fluctuations of structural elements responsible for substrate binding were observed. These differences in residue fluctuations might be crucial for substrate selectivity in these enzymes. PMID:26130403
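The abstract reports a root mean squared inner product (RMSIP) but does not give its formula. The sketch below uses the commonly cited definition, RMSIP = sqrt((1/n) Σ_i Σ_j (u_i · v_j)^2) over two sets of n orthonormal mode vectors. Whether the authors used exactly this form, and how many modes they compared, is an assumption, and the toy vectors are hypothetical.

```python
import math


def dot(u, v):
    """Plain dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def rmsip(modes_a, modes_b):
    """Root mean squared inner product between two sets of orthonormal modes.

    Returns 1.0 when both sets span the same subspace and 0.0 when the
    subspaces are orthogonal.
    """
    n = len(modes_a)
    total = sum(dot(u, v) ** 2 for u in modes_a for v in modes_b)
    return math.sqrt(total / n)


# Hypothetical toy modes in a 3-dimensional space (real normal modes have
# 3N components for N atoms).
modes_a = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
modes_same = [[0.0, 1.0, 0.0], [1.0, 0.0, 0.0]]      # same subspace, reordered
modes_partial = [[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]]   # shares one direction

print(rmsip(modes_a, modes_same))     # identical subspaces
print(rmsip(modes_a, modes_partial))  # half of the subspace shared
```

Because the measure depends only on the subspace spanned, reordering the modes within a set does not change the score.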
NASA Astrophysics Data System (ADS)
Zhang, Xuekai; Lu, Gang; Sun, Meng; Mahankali, Madhu; Ma, Yanfei; Zhang, Mingming; Hua, Wangde; Hu, Yuting; Wang, Qingbing; Chen, Jinghuo; He, Gang; Qi, Xiangbing; Shen, Weijun; Liu, Peng; Chen, Gong
2018-05-01
New methods capable of effecting cyclization, and forming novel three-dimensional structures while maintaining favourable physicochemical properties are needed to facilitate the development of cyclic peptide-based drugs that can engage challenging biological targets, such as protein-protein interactions. Here, we report a highly efficient and generally applicable strategy for constructing new types of peptide macrocycles using palladium-catalysed intramolecular C(sp3)-H arylation reactions. Easily accessible linear peptide precursors of simple and versatile design can be selectively cyclized at the side chains of either aromatic or modified non-aromatic amino acid units to form various cyclophane-braced peptide cycles. This strategy provides a powerful tool to address the long-standing challenge of size- and composition-dependence in peptide macrocyclization, and generates novel peptide macrocycles with uniquely buttressed backbones and distinct loop-type three-dimensional structures. Preliminary cell proliferation screening of the pilot library revealed a potent lead compound with selective cytotoxicity toward proliferative Myc-dependent cancer cell lines.
On-column refolding of recombinant human interleukin-4 from inclusion bodies.
Razeghifard, M Reza
2004-09-01
Interleukin-4 (IL4) is a multifunctional cytokine which plays a key role in the immune system. Several antagonists/agonists of IL4 are reported through mutagenesis studies, but their solution structural studies using nuclear magnetic resonance (NMR) spectroscopy are hindered as milligram quantities of isotopically labeled protein are required for structural refinements. In this work, a His-tagged recombinant form of human IL4 was overexpressed in Escherichia coli under the control of a T7 promoter. The resulting inclusion bodies were separated from cellular debris by centrifugation and solubilized by 6 M guanidine-HCl in the presence of reducing agents. The denatured IL4 was immobilized on Ni2+-fractogel beads and refolded in a single chromatographic step by gradual removal of denaturant. This protocol yielded 15-20 mg of isotope-enriched protein from 1 L of culture grown in minimal medium. The refolded protein was highly pure and was correctly folded as judged by its two-dimensional NMR spectrum. To show the successful application of this refolding protocol to IL4 variants, 15N-labeled Y124D-IL4 was also prepared and its first two-dimensional NMR spectrum was presented.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Borbulevych, Oleg; Kumarasiri, Malika; Wilson, Brian
The integral membrane protein BlaR1 of methicillin-resistant Staphylococcus aureus senses the presence of β-lactam antibiotics in the milieu and transduces the information to the cytoplasm, where the biochemical events that unleash induction of antibiotic resistance mechanisms take place. We report herein by two-dimensional and three-dimensional NMR experiments of the sensor domain of BlaR1 in solution and by determination of an x-ray structure for the apo protein that Lys-392 of the antibiotic-binding site is posttranslationally modified by Nζ-carboxylation. Additional crystallographic and NMR data reveal that on acylation of Ser-389 by antibiotics, Lys-392 experiences Nζ-decarboxylation. This unique process, termed the lysine Nζ-decarboxylation switch, arrests the sensor domain in the activated ('on') state, necessary for signal transduction and all the subsequent biochemical processes. We present structural information on how this receptor activation process takes place, imparting longevity to the antibiotic-receptor complex that is needed for the induction of the antibiotic-resistant phenotype in methicillin-resistant S. aureus.
Characterization of Protein-Carbohydrate Interactions by NMR Spectroscopy.
Grondin, Julie M; Langelaan, David N; Smith, Steven P
2017-01-01
Solution-state nuclear magnetic resonance (NMR) spectroscopy can be used to monitor protein-carbohydrate interactions. Two-dimensional 1H-15N heteronuclear single quantum coherence (HSQC)-based techniques described in this chapter can be used quickly and effectively to screen a set of possible carbohydrate binding partners, to quantify the dissociation constant (Kd) of any identified interactions, and to map the carbohydrate binding site on the structure of the protein. Here, we describe the titration of a family 32 carbohydrate binding module from Clostridium perfringens (CpCBM32) with the monosaccharide N-acetylgalactosamine (GalNAc), in which we calculate the apparent dissociation constant of the interaction, and map the GalNAc binding site onto the structure of CpCBM32.
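To illustrate how a dissociation constant can be extracted from an HSQC titration, the sketch below fits the standard single-site, fast-exchange binding isotherm to chemical shift perturbations. The isotherm is a textbook model rather than the chapter's stated protocol. All concentrations and shifts are synthetic, hypothetical values generated from a known Kd, and the grid search is a deliberately simple stand-in for a proper nonlinear least-squares fit.

```python
import math


def fraction_bound(p_total, l_total, kd):
    """Fraction of protein bound for 1:1 binding (all concentrations in mM)."""
    s = p_total + l_total + kd
    return (s - math.sqrt(s * s - 4.0 * p_total * l_total)) / (2.0 * p_total)


def fit_kd(p_total, ligand_concs, shifts, kd_grid):
    """Scan candidate Kd values; for each one, the best maximal shift follows
    from linear least squares. Returns the (kd, max_shift) pair with the
    lowest squared error."""
    best = None
    for kd in kd_grid:
        f = [fraction_bound(p_total, l, kd) for l in ligand_concs]
        dmax = sum(y * x for y, x in zip(shifts, f)) / sum(x * x for x in f)
        sse = sum((y - dmax * x) ** 2 for y, x in zip(shifts, f))
        if best is None or sse < best[0]:
            best = (sse, kd, dmax)
    return best[1], best[2]


# Synthetic, noise-free titration (hypothetical values, not from the chapter).
P_TOTAL = 0.1                                   # mM protein
LIGAND = [0.05, 0.1, 0.2, 0.5, 1.0, 2.0, 5.0]   # mM carbohydrate
TRUE_KD, TRUE_DMAX = 0.5, 0.12                  # mM, ppm
SHIFTS = [TRUE_DMAX * fraction_bound(P_TOTAL, l, TRUE_KD) for l in LIGAND]

KD_GRID = [10 ** (e / 50.0) for e in range(-150, 101)]  # 0.001 to 100 mM
kd_fit, dmax_fit = fit_kd(P_TOTAL, LIGAND, SHIFTS, KD_GRID)
print(f"Kd ~ {kd_fit:.2f} mM, max shift ~ {dmax_fit:.3f} ppm")
```

With real data, shifts from several well-resolved peaks would typically be fitted, and the uncertainty in Kd estimated from the spread between them.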
Domain atrophy creates rare cases of functional partial protein domains.
Prakash, Ananth; Bateman, Alex
2015-04-30
Protein domains display a range of structural diversity, with numerous additions and deletions of secondary structural elements between related domains. We have observed a small number of cases of surprising large-scale deletions of core elements of structural domains. We propose a new concept called domain atrophy, where protein domains lose a significant number of core structural elements. Here, we implement a new pipeline to systematically identify new cases of domain atrophy across all known protein sequences. The output of this pipeline was carefully checked by hand, which filtered out partial domain instances that were unlikely to represent true domain atrophy due to misannotations or un-annotated sequence fragments. We identify 75 cases of domain atrophy, of which eight cases are found in a three-dimensional protein structure and 67 cases have been inferred based on mapping to a known homologous structure. Domains with structural variations include ancient folds such as the TIM-barrel and Rossmann folds. Most of these domains are observed to show structural loss that does not affect their functional sites. Our analysis has significantly increased the known cases of domain atrophy. We discuss specific instances of domain atrophy and see that there has often been a compensatory mechanism that helps to maintain the stability of the partial domain. Our study indicates that although domain atrophy is an extremely rare phenomenon, protein domains under certain circumstances can tolerate extreme mutations giving rise to partial, but functional, domains.
Ikeya, Teppei; Takeda, Mitsuhiro; Yoshida, Hitoshi; Terauchi, Tsutomu; Jee, Jun-Goo; Kainosho, Masatsune; Güntert, Peter
2009-08-01
Stereo-array isotope labeling (SAIL) has been combined with the fully automated NMR structure determination algorithm FLYA to determine the three-dimensional structure of the protein ubiquitin from different sets of input NMR spectra. SAIL provides a complete stereo- and regio-specific pattern of stable isotopes that results in sharper resonance lines and reduced signal overlap, without information loss. Here we show that as a result of the superior quality of the SAIL NMR spectra, reliable, fully automated analyses of the NMR spectra and structure calculations are possible using fewer input spectra than with conventional uniformly 13C/15N-labeled proteins. FLYA calculations with SAIL ubiquitin, using a single three-dimensional "through-bond" spectrum (and 2D HSQC spectra) in addition to the 13C-edited and 15N-edited NOESY spectra for conformational restraints, yielded structures with an accuracy of 0.83-1.15 Å for the backbone RMSD to the conventionally determined solution structure of SAIL ubiquitin. NMR structures can thus be determined almost exclusively from the NOESY spectra that yield the conformational restraints, without the need to record many spectra only for determining intermediate, auxiliary data of the chemical shift assignments. The FLYA calculations for this report resulted in 252 ubiquitin structure bundles, obtained with different input data but identical structure calculation and refinement methods. These structures cover the entire range from highly accurate structures to seriously, but not trivially, wrong structures, and thus constitute a valuable database for the substantiation of structure validation methods.
Pokkuluri, P Raj; Dwulit-Smith, Jeff; Duke, Norma E; Wilton, Rosemarie; Mack, Jamey C; Bearden, Jessica; Rakowski, Ella; Babnigg, Gyorgy; Szurmant, Hendrik; Joachimiak, Andrzej; Schiffer, Marianne
2013-01-01
Anaeromyxobacter dehalogenans is a δ-proteobacterium found in diverse soils and sediments. It is of interest in bioremediation efforts due to its dechlorination and metal-reducing capabilities. To gain an understanding on A. dehalogenans' abilities to adapt to diverse environments we analyzed its signal transduction proteins. The A. dehalogenans genome codes for a large number of sensor histidine kinases (HK) and methyl-accepting chemotaxis proteins (MCP); among these 23 HK and 11 MCP proteins have a sensor domain in the periplasm. These proteins most likely contribute to adaptation to the organism's surroundings. We predicted their three-dimensional folds and determined the structures of two of the periplasmic sensor domains by X-ray diffraction. Most of the domains are predicted to have either PAS-like or helical bundle structures, with two predicted to have solute-binding protein fold, and another predicted to have a 6-phosphogluconolactonase like fold. Atomic structures of two sensor domains confirmed the respective fold predictions. The Adeh_2942 sensor (HK) was found to have a helical bundle structure, and the Adeh_3718 sensor (MCP) has a PAS-like structure. Interestingly, the Adeh_3718 sensor has an acetate moiety bound in a binding site typical for PAS-like domains. Future work is needed to determine whether Adeh_3718 is involved in acetate sensing by A. dehalogenans. PMID:23897711
NASA Astrophysics Data System (ADS)
Marchetti, S.; Sbrana, F.; Toscano, A.; Fratini, E.; Carlà, M.; Vassalli, M.; Tiribilli, B.; Pacini, A.; Gambi, C. M. C.
2011-05-01
The three-dimensional structure and the mechanical properties of a β-connectin fragment from human cardiac muscle, belonging to the I band, from I27 to I34, were investigated by small-angle x-ray scattering (SAXS) and single-molecule force spectroscopy (SMFS). This molecule presents an entropic elasticity behavior, associated with globular domain unfolding, that has been widely studied in the last 10 years. In addition, atomic force microscopy based SMFS experiments suggest that this molecule has an additional elastic regime, for low forces, probably associated with tertiary structure remodeling. From a structural point of view, this behavior indicates that the eight domains in the I27-I34 fragment are not independent and that they organize in solution, assuming a well-defined three-dimensional structure. This hypothesis has been confirmed by SAXS, both on a diluted and a concentrated sample. Two different models were used to fit the SAXS curves: one assuming a globular shape and one corresponding to an elongated conformation, both coupled with a Coulomb repulsion potential to take into account the protein-protein interaction. Due to the predominance of the structure factor, the effective shape of the protein in solution could not be clearly disclosed. By performing SMFS by atomic force microscopy, mechanical unfolding properties were investigated. Typical sawtooth profiles were obtained and the rupture force of each unfolding domain was estimated. By fitting a wormlike chain model to each peak of the sawtooth profile, the entropic elasticity of the octamer was described.
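The abstract names a wormlike chain (WLC) model but does not state which form was fitted. The sketch below assumes the widely used Marko-Siggia interpolation, F(x) = (kBT/p) [1/(4(1 - x/Lc)^2) - 1/4 + x/Lc], and evaluates it for one hypothetical set of parameters. The persistence length, contour length and temperature are illustrative values, not results from the study.

```python
KB_T_PN_NM = 4.11  # thermal energy kB*T near 298 K, in pN*nm (assumed temperature)


def wlc_force(extension_nm, contour_length_nm, persistence_length_nm,
              kbt=KB_T_PN_NM):
    """Marko-Siggia interpolation formula for the wormlike chain.

    Returns the restoring force in pN at a given end-to-end extension.
    The formula diverges as the extension approaches the contour length.
    """
    if not 0.0 <= extension_nm < contour_length_nm:
        raise ValueError("extension must satisfy 0 <= x < contour length")
    r = extension_nm / contour_length_nm
    return (kbt / persistence_length_nm) * (0.25 / (1.0 - r) ** 2 - 0.25 + r)


# Hypothetical parameters for one sawtooth peak: 0.4 nm persistence length
# and 30 nm contour length of unfolded chain.
for x in (0.0, 15.0, 27.0):
    print(f"x = {x:5.1f} nm -> F = {wlc_force(x, 30.0, 0.4):7.2f} pN")
```

In a fit to a sawtooth profile, each peak would get its own contour length, and the increment between successive peaks reflects the chain length released by one unfolding domain.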
Antibacterial peptides from plants: what they are and how they probably work.
Barbosa Pelegrini, Patrícia; Del Sarto, Rafael Perseghini; Silva, Osmar Nascimento; Franco, Octávio Luiz; Grossi-de-Sa, Maria Fátima
2011-01-01
Plant antibacterial peptides have been isolated from a wide variety of species. They consist of several protein groups with different features, such as the overall charge of the molecule, the content of disulphide bonds, and structural stability under environmental stress. Although the three-dimensional structures of several classes of plant peptides are well determined, the mechanism of action of some of these molecules is still not well defined. However, further studies may provide new evidence for their function on the bacterial cell wall. Therefore, this paper focuses on plant peptides that show activity against plant-pathogenic and human-pathogenic bacteria. Furthermore, we describe the folding of several peptides and similarities among their three-dimensional structures. Some hypotheses for their mechanisms of action and attack on the bacterial membrane surface are also proposed.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gabdulkhakov, A. G., E-mail: azat@vega.protes.ru; Dontsova, M. V.; Saenger, W.
Photosystem II is a key component of the photosynthetic pathway producing oxygen at the thylakoid membrane of cyanobacteria, green algae, and plants. The three-dimensional structure of photosystem II from the cyanobacterium Thermosynechococcus elongatus in a complex with herbicide terbutryn (a photosynthesis inhibitor) was determined for the first time by X-ray diffraction and refined at 3.2 Å resolution (Rfactor = 26.9%, Rfree = 29.9%, rmsd for bond lengths is 0.013 Å, and rmsd for bond angles is 2.2°). The terbutryn molecule was located in the binding pocket of the mobile plastoquinone. The atomic coordinates of the refined structure of photosystem II in a complex with terbutryn were deposited in the Protein Data Bank.
Inverse statistical physics of protein sequences: a key issues review.
Cocco, Simona; Feinauer, Christoph; Figliuzzi, Matteo; Monasson, Rémi; Weigt, Martin
2018-03-01
In the course of evolution, proteins undergo important changes in their amino acid sequences, while their three-dimensional folded structure and their biological function remain remarkably conserved. Thanks to modern sequencing techniques, sequence data accumulate at an unprecedented pace. This provides large sets of so-called homologous, i.e. evolutionarily related, protein sequences, to which methods of inverse statistical physics can be applied. Using sequence data as the basis for the inference of Boltzmann distributions from samples of microscopic configurations or observables, it is possible to extract information about evolutionary constraints and thus protein function and structure. Here we give an overview of some biologically important questions, and how statistical-mechanics-inspired modeling approaches can help to answer them. Finally, we discuss some open questions, which we expect to be addressed over the next years.
Collision induced unfolding of isolated proteins in the gas phase: past, present, and future.
Dixit, Sugyan M; Polasky, Daniel A; Ruotolo, Brandon T
2018-02-01
Rapidly characterizing the three-dimensional structures of proteins and the multimeric machines they form remains one of the great challenges facing modern biological and medical sciences. Ion mobility-mass spectrometry based techniques are playing an expanding role in characterizing these functional complexes, especially in drug discovery and development workflows. Despite this expansion, ion mobility-mass spectrometry faces many challenges, especially in the context of detecting small differences in protein tertiary structure that bear functional consequences. Collision induced unfolding is an ion mobility-mass spectrometry method that enables the rapid differentiation of subtly-different protein isoforms based on their unfolding patterns and stabilities. In this review, we summarize the modern implementation of such gas-phase unfolding experiments and provide an overview of recent developments in both methods and applications. Copyright © 2017 Elsevier Ltd. All rights reserved.
Sweetness determinant sites of brazzein, a small, heat-stable, sweet-tasting protein.
Assadi-Porter, F M; Aceti, D J; Markley, J L
2000-04-15
Brazzein, originally isolated from the fruit of the African plant Pentadiplandra brazzeana Baillon, is the smallest, most heat-stable and pH-stable member of the set of proteins known to have intrinsic sweetness. These properties make brazzein an ideal system for investigating the chemical and structural requirements of a sweet-tasting protein. We have used the three-dimensional structure of the protein (J. E. Caldwell et al. (1998) Nat. Struct. Biol. 5, 427-431) as a guide in designing 15 synthetic genes in expression constructs aimed at delineating the sweetness determinants of brazzein. Protein was produced heterologously in Escherichia coli, isolated, and purified as described in the companion paper (Assadi-Porter, F. M., Aceti, D., Cheng, H., and Markley, J. L., this issue). Analysis by one-dimensional ¹H NMR spectroscopy indicated that all but one of these variants had folded properly under the conditions used. A taste panel compared the gustatory properties of solutions of these proteins to those of sucrose and brazzein isolated from fruit. Of the 14 mutations in the des-pGlu1-brazzein background, four exhibited almost no sweetness, six had significantly reduced sweetness, two had taste properties equivalent to des-pGlu1-brazzein (two times as sweet as the major form of brazzein isolated from fruit, which contains pGlu1), and two were about twice as sweet as des-pGlu1-brazzein. Overall, the results suggest that two regions of the protein are critical for the sweetness of brazzein: a region that includes the N- and C-termini of the protein, which are located close to one another, and a region that includes the flexible loop around Arg43. Copyright 2000 Academic Press.
Lipid nanotechnologies for structural studies of membrane-associated proteins.
Stoilova-McPhie, Svetla; Grushin, Kirill; Dalm, Daniela; Miller, Jaimy
2014-11-01
We present a methodology of lipid nanotube (LNT) and nanodisk technologies optimized in our laboratory for structural studies of membrane-associated proteins at close to physiological conditions. The application of these lipid nanotechnologies for structure determination by cryo-electron microscopy (cryo-EM) is fundamental for understanding and modulating their function. The LNTs in our studies are single-bilayer galactosylceramide-based nanotubes of ∼20 nm inner diameter and a few microns in length that self-assemble in aqueous solutions. The lipid nanodisks (NDs) are self-assembled discoid lipid bilayers of ∼10 nm diameter, which are stabilized in aqueous solutions by a belt of amphipathic helical scaffold proteins. By combining LNT and ND technologies, we can examine structurally how the membrane curvature and lipid composition modulate the function of the membrane-associated proteins. As proof of principle, we have engineered these lipid nanotechnologies to mimic the activated platelet's phosphatidylserine-rich membrane and have successfully assembled functional membrane-bound coagulation factor VIII in vitro for structure determination by cryo-EM. The macromolecular organization of the proteins bound to ND and LNT is further defined by fitting the known atomic structures within the calculated three-dimensional maps. The combination of LNT and ND technologies offers a means to control the design and assembly of a wide range of functional membrane-associated proteins and complexes for structural studies by cryo-EM. The presented results confirm the suitability of the developed methodology for studying the functional structure of membrane-associated proteins, such as the coagulation factors, in a close to physiological environment. © 2014 Wiley Periodicals, Inc.
2014-07-01
coordinates of the EscN protein (Zarivach et al., 2007) were downloaded in pdb file format from the Research Collaboratory for Structural Biology...catalytic activity. Two structurally related compounds were observed to adopt extended conformations in the active-site cleft and essentially...adopt a very compact conformation that occupied only one side of the cleft. Our goal was to determine the three-dimensional structures of the
Gromiha, M Michael; Anoosha, P; Huang, Liang-Tsung
2016-01-01
Protein stability is the free energy difference between the unfolded and folded states of a protein, which lies in the range of 5-25 kcal/mol. Experimentally, protein stability is measured with circular dichroism, differential scanning calorimetry, and fluorescence spectroscopy using thermal and denaturant denaturation methods. These experimental data have been accumulated in the form of a database, ProTherm, a thermodynamic database for proteins and mutants. It also contains sequence and structure information of a protein, experimental methods and conditions, and literature information. Different features such as search, display, and sorting options and visualization tools have been incorporated in the database. ProTherm is a valuable resource for understanding/predicting the stability of proteins and it can be accessed at http://www.abren.net/protherm/. ProTherm has been effectively used to examine the relationship among thermodynamics, structure, and function of proteins. We describe the recent progress on the development of methods for understanding/predicting protein stability, such as (1) general trends on mutational effects on stability, (2) relationship between the stability of protein mutants and amino acid properties, (3) applications of protein three-dimensional structures for predicting their stability upon point mutations, (4) prediction of protein stability upon single mutations from amino acid sequence, and (5) prediction methods for addressing double mutants. A list of online resources for predicting protein stability has also been provided.
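As a rough illustration of what a stability of 5-25 kcal/mol means in practice, the sketch below converts an unfolding free energy into the equilibrium fraction of folded molecules. It assumes a simple two-state (folded/unfolded) model, which the abstract does not state, and the function names `fraction_folded` and `ddg` as well as the sign convention for ΔΔG are illustrative choices, not part of ProTherm.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)


def fraction_folded(delta_g_unfold, temperature=298.15):
    """Fraction of folded molecules under an assumed two-state model.

    delta_g_unfold is G(unfolded) - G(folded) in kcal/mol, so a positive
    value means the folded state is favoured.
    """
    # Equilibrium constant for unfolding: K = [unfolded] / [folded]
    k_unfold = math.exp(-delta_g_unfold / (R_KCAL * temperature))
    return 1.0 / (1.0 + k_unfold)


def ddg(delta_g_mutant, delta_g_wild_type):
    """Stability change upon mutation (assumed convention: mutant minus wild type)."""
    return delta_g_mutant - delta_g_wild_type


# Lower and upper ends of the range quoted in the abstract.
for dg in (5.0, 25.0):
    print(dg, fraction_folded(dg))

# Hypothetical mutant that loses 2 kcal/mol of stability relative to wild type.
print(ddg(3.0, 5.0))
```

Under this convention a negative ΔΔG marks a destabilizing mutation; some published data sets use the opposite sign, so the convention should be checked before comparing values.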
Hikone, Yuya; Hirai, Go; Mishima, Masaki; Inomata, Kohsuke; Ikeya, Teppei; Arai, Souichiro; Shirakawa, Masahiro; Sodeoka, Mikiko; Ito, Yutaka
2016-10-01
Structural analyses of proteins under macromolecular crowding inside human cultured cells by in-cell NMR spectroscopy are crucial not only for explicit understanding of their cellular functions but also for applications in medical and pharmaceutical sciences. In-cell NMR experiments using human cultured cells, however, suffer from low sensitivity; thus pseudocontact shifts (PCSs) from protein-tagged paramagnetic lanthanoid ions, analysed using sensitive heteronuclear two-dimensional correlation NMR spectra, offer a huge potential advantage in obtaining structural information over conventional NOE-based approaches. We synthesised a new lanthanoid-chelating tag (M8-CAM-I), in which the eight-fold, stereospecifically methylated DOTA (M8) scaffold was retained, while a stable carbamidemethyl (CAM) group was introduced as the functional group connecting to proteins. M8-CAM-I successfully fulfilled the requirements for in-cell NMR: high affinity for lanthanoid, low cytotoxicity and stability under the reducing conditions inside cells. Large PCSs for backbone N-H resonances observed for M8-CAM-tagged human ubiquitin mutant proteins, which were introduced into HeLa cells by electroporation, demonstrated that this approach readily provides useful information enabling the determination of protein structures, relative orientations of domains and protein complexes within human cultured cells.
PASS2: an automated database of protein alignments organised as structural superfamilies.
Bhaduri, Anirban; Pugalenthi, Ganesan; Sowdhamini, Ramanathan
2004-04-02
The functional selection and three-dimensional structural constraints of proteins in nature often relate to the retention of significant sequence similarity between proteins of similar fold and function despite poor sequence identity. Organization of structure-based sequence alignments for distantly related proteins provides a map of the conserved and critical regions of the protein universe that is useful for the analysis of folding principles, for the evolutionary unification of protein families and for maximizing the information return from experimental structure determination. The Protein Alignment organised as Structural Superfamily (PASS2) database represents continuously updated structural alignments for evolutionarily related, sequentially distant proteins. An automated and updated version of PASS2 is in direct correspondence with SCOP 1.63 and consists of sequences with less than 40% identity among themselves. Protein domains have been grouped into 628 multi-member superfamilies and 566 single-member superfamilies. Structure-based sequence alignments for the superfamilies have been obtained using COMPARER, while initial equivalencies have been derived from a preliminary superposition using LSQMAN or STAMP 4.0. The final sequence alignments have been annotated for structural features using JOY4.0. The database is supplemented with sequence relatives belonging to different genomes, conserved spatially interacting and structural motifs, probabilistic hidden Markov models of superfamilies based on the alignments and useful links to other databases. Probabilistic models and sensitive position-specific profiles obtained from reliable superfamily alignments aid annotation of remote homologues and are useful tools in structural and functional genomics. PASS2 presents the phylogeny of its members based on both sequence and structural dissimilarities. Clustering of members allows us to understand diversification of the family members.
The search engine has been improved for simpler browsing of the database. The database resolves alignments among the structural domains consisting of an evolutionarily diverged set of sequences. Availability of reliable sequence alignments of distantly related proteins despite poor sequence identity and single-member superfamilies permits better sampling of structures in libraries for fold recognition of new sequences and for the understanding of protein structure-function relationships of individual superfamilies. PASS2 is accessible at http://www.ncbs.res.in/~faculty/mini/campass/pass2.html
Dissimilar sweet proteins from plants: oddities or normal components?
Picone, Delia; Temussi, Piero Andrea
2012-10-01
The fruits of a few tropical plants contain intensely sweet proteins. Their common property points to a protein family. Generally, proteins belonging to the same family share similar folds, similar sequences and, at least in part, similar function but sweet proteins constitute an exception to this rule. Apart from sharing the rather unusual taste function, they show no obvious similarities either in their sequences or in three-dimensional structures. In this review we describe the nature, structure and mechanism of action of the best known sweet tasting proteins, including two taste modifying proteins. Sweet proteins stand out among sweet molecules because their volume is not compatible with an interaction with orthosteric active sites of the sweet taste receptor. The best explanation of their mechanism of action is the interaction with the external surface of the sweet taste receptor, according to a model that has been named "wedge model". It is hypothesized that this mode of action may be related to the ability of other members of their protein families to inhibit different enzymes. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Statistical inference of protein structural alignments using information and compression.
Collier, James H; Allison, Lloyd; Lesk, Arthur M; Stuckey, Peter J; Garcia de la Banda, Maria; Konagurthu, Arun S
2017-04-01
Structural molecular biology depends crucially on computational techniques that compare protein three-dimensional structures and generate structural alignments (the assignment of one-to-one correspondences between subsets of amino acids based on atomic coordinates). Despite its importance, the structural alignment problem has not been formulated, much less solved, in a consistent and reliable way. To overcome these difficulties, we present here a statistical framework for the precise inference of structural alignments, built on the Bayesian and information-theoretic principle of Minimum Message Length (MML). The quality of any alignment is measured by its explanatory power, that is, the amount of lossless compression achieved to explain the protein coordinates using that alignment. We have implemented this approach in MMLigner, the first program able to infer statistically significant structural alignments. We also demonstrate the reliability of MMLigner's alignment results when compared with the state of the art. Importantly, MMLigner can also discover different structural alignments of comparable quality, a challenging problem for oligomers and protein complexes. Source code, binaries and an interactive web version are available at http://lcb.infotech.monash.edu.au/mmligner. Contact: arun.konagurthu@monash.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Zykwinska, Agata; Pihet, Marc; Radji, Sadia; Bouchara, Jean-Philippe; Cuenot, Stéphane
2014-06-01
Hydrophobins are small surface-active proteins that fulfil a wide spectrum of functions in fungal growth and development. The human fungal pathogen Aspergillus fumigatus expresses RodA hydrophobins that self-assemble on the outer conidial surface into tightly organized nanorods known as rodlets. AFM investigation of the conidial surface provides evidence that RodA hydrophobins self-assemble into rodlets through bilayers. Within bilayers, hydrophilic domains of hydrophobins point inward, thus making a hydrophilic core, while hydrophobic domains point outward. AFM measurements reveal that several rodlet bilayers are present on the conidial surface, thus showing that the proteins self-assemble into a complex three-dimensional multilayer system. The self-assembly of RodA hydrophobins into rodlets results from attractive interactions between stacked β-sheets, which lead to a final linear cross-β spine structure. A Monte Carlo simulation shows that anisotropic interactions are the main driving forces leading the hydrophobins to self-assemble into parallel rodlets, which are further structured in nanodomains. Taken together, these findings allow us to propose a mechanism that leads RodA hydrophobins to a highly ordered rodlet structure. The mechanism of hydrophobin assembly into rodlets offers new prospects for the development of more efficient strategies leading to disruption of rodlet formation, allowing a rapid detection of the fungus by the immune system. Copyright © 2014 Elsevier B.V. All rights reserved.
Probing the mechanism of fusion in a two-dimensional computer simulation.
Chanturiya, Alexandr; Scaria, Puthurapamil; Kuksenok, Oleksandr; Woodle, Martin C
2002-01-01
A two-dimensional (2D) model of lipid bilayers was developed and used to investigate a possible role of membrane lateral tension in membrane fusion. We found that an increase of lateral tension in contacting monolayers of 2D analogs of liposomes and planar membranes could cause not only hemifusion, but also complete fusion when internal pressure is introduced in the model. With a certain set of model parameters it was possible to induce hemifusion-like structural changes by a tension increase in only one of the two contacting bilayers. The effect of lysolipids was modeled as an insertion of a small number of extra molecules into the cis or trans side of the interacting bilayers at different stages of simulation. It was found that cis insertion arrests fusion and trans insertion has no inhibitory effect on fusion. The possibility of protein participation in tension-driven fusion was tested in simulation, with one of two model liposomes containing a number of structures capable of reducing the area occupied by them in the outer monolayer. It was found that condensation of these structures was sufficient to produce membrane reorganization similar to that observed in simulations with "protein-free" bilayers. These data support the hypothesis that changes in membrane lateral tension may be responsible for fusion in both model phospholipid membranes and in biological protein-mediated fusion. PMID:12023230
On the problem of resonance assignments in solid state NMR of uniformly 15N, 13C-labeled proteins
NASA Astrophysics Data System (ADS)
Tycko, Robert
2015-04-01
Determination of accurate resonance assignments from multidimensional chemical shift correlation spectra is one of the major problems in biomolecular solid state NMR, particularly for relatively large proteins with less-than-ideal NMR linewidths. This article investigates the difficulty of resonance assignment, using a computational Monte Carlo/simulated annealing (MCSA) algorithm to search for assignments from artificial three-dimensional spectra that are constructed from the reported isotropic 15N and 13C chemical shifts of two proteins whose structures have been determined by solution NMR methods. The results demonstrate how assignment simulations can provide new insights into factors that affect the assignment process, which can then help guide the design of experimental strategies. Specifically, simulations are performed for the catalytic domain of SrtC (147 residues, primarily β-sheet secondary structure) and the N-terminal domain of MLKL (166 residues, primarily α-helical secondary structure). Assuming unambiguous residue-type assignments and four ideal three-dimensional data sets (NCACX, NCOCX, CONCA, and CANCA), uncertainties in chemical shifts must be less than 0.4 ppm for assignments for SrtC to be unique, and less than 0.2 ppm for MLKL. Eliminating CANCA data has no significant effect, but additionally eliminating CONCA data leads to more stringent requirements for chemical shift precision. Introducing moderate ambiguities in residue-type assignments does not have a significant effect.
Systematic Comparison of Crystal and NMR Protein Structures Deposited in the Protein Data Bank
Sikic, Kresimir; Tomic, Sanja; Carugo, Oliviero
2010-01-01
Nearly all the macromolecular three-dimensional structures deposited in the Protein Data Bank were determined by either crystallographic (X-ray) or Nuclear Magnetic Resonance (NMR) spectroscopic methods. This paper reports a systematic comparison of the crystallographic and NMR results deposited in the files of the Protein Data Bank, in order to find out to what extent this information can be aggregated in bioinformatics. A non-redundant data set containing 109 NMR – X-ray structure pairs of nearly identical proteins was derived from the Protein Data Bank. A series of comparisons was performed, focusing on both global features and local details. It was observed that: (1) the RMSD values between NMR and crystal structures range from about 1.5 Å to about 2.5 Å; (2) the correlation between conformational deviations and residue type reveals that hydrophobic amino acids are more similar in crystal and NMR structures than hydrophilic amino acids; (3) the correlation between solvent accessibility of the residues and their conformational variability in solid state and in solution is relatively modest (correlation coefficient = 0.462); (4) beta strands on average match better between NMR and crystal structures than helices and loops; (5) conformational differences between loops are independent of crystal packing interactions in the solid state; (6) very seldom, side chains buried in the protein interior are observed to adopt different orientations in the solid state and in solution. PMID:21293729
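The RMSD quoted in the abstract above is the root-mean-square deviation between equivalent atoms of two structures. A minimal sketch of that calculation follows; it assumes the two coordinate sets have already been superposed and list the same atoms in the same order (the superposition step itself is not shown), and the coordinates are made-up values, not data from the paper.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equally long lists of (x, y, z) points."""
    if len(coords_a) != len(coords_b):
        raise ValueError("coordinate sets must have the same number of atoms")
    if not coords_a:
        raise ValueError("coordinate sets must not be empty")
    # Sum of squared distances between corresponding atoms.
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


# Hypothetical, already superposed C-alpha positions in angstroms (not real data).
crystal = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
nmr = [(0.0, 2.0, 0.0), (3.8, 0.0, 2.0), (7.6, 0.0, 0.0)]
print(round(rmsd(crystal, nmr), 3))
```

Because the value depends on which atoms are compared and how the structures are superposed, RMSD figures from different studies are only comparable when those choices match.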
Chen, Fu; Sun, Huiyong; Wang, Junmei; Zhu, Feng; Liu, Hui; Wang, Zhe; Lei, Tailong; Li, Youyong; Hou, Tingjun
2018-06-21
Molecular docking provides a computationally efficient way to predict the atomic structural details of protein-RNA interactions (PRI), but accurate prediction of the three-dimensional structures and binding affinities for PRI is still notoriously difficult, partly due to the unreliability of the existing scoring functions for PRI. MM/PBSA and MM/GBSA are more theoretically rigorous than most scoring functions for protein-RNA docking, but their prediction performance for protein-RNA systems remains unclear. Here, we systematically evaluated the capability of MM/PBSA and MM/GBSA to predict the binding affinities and recognize the near-native binding structures for protein-RNA systems with different solvent models and interior dielectric constants (ε_in). For predicting the binding affinities, the predictions given by MM/GBSA based on the minimized structures in explicit solvent and the GBGBn1 model with ε_in = 2 yielded the highest correlation with the experimental data. Moreover, the MM/GBSA calculations based on the minimized structures in implicit solvent and the GBGBn1 model distinguished the near-native binding structures within the top 10 decoys for 118 out of the 149 protein-RNA systems (79.2%). This performance is better than that of all docking scoring functions studied here. Therefore, MM/GBSA rescoring is an efficient way to improve the prediction capability of scoring functions for protein-RNA systems. Published by Cold Spring Harbor Laboratory Press for the RNA Society.
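The "top 10" figure in the abstract above is a success rate: the share of systems for which a near-native structure is ranked among the ten best-scoring decoys. The sketch below shows one way such a rate could be computed; the function name `top_n_success_rate`, the use of `None` for systems without a near-native decoy, and the example ranks are all assumptions for illustration, not the authors' procedure.

```python
def top_n_success_rate(best_near_native_ranks, n=10):
    """Percentage of systems whose best-ranked near-native pose is within the top n.

    Each entry is the rank (1 = best score) of the best near-native decoy for
    one system, or None if that system has no near-native decoy at all.
    """
    if not best_near_native_ranks:
        raise ValueError("need at least one system")
    hits = sum(
        1 for rank in best_near_native_ranks if rank is not None and rank <= n
    )
    return 100.0 * hits / len(best_near_native_ranks)


# Hypothetical ranks for five systems (not data from the paper).
ranks = [1, 3, 12, None, 7]
print(top_n_success_rate(ranks))

# The abstract's own numbers: 118 successes out of 149 systems.
print(round(100.0 * 118 / 149, 1))
```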