ERIC Educational Resources Information Center
Giron, Maria D.; Salto, Rafael
2011-01-01
Structure-function relationship studies in proteins are essential in modern Cell Biology. Laboratory exercises that allow students to familiarize themselves with basic mutagenesis techniques are essential in all Genetic Engineering courses to teach the relevance of protein structure. We have implemented a laboratory course based on the…
Whitmire, Jeannette M; Merrell, D Scott
2017-01-01
Mutagenesis is a valuable tool to examine the structure-function relationships of bacterial proteins. As such, a wide variety of mutagenesis techniques and strategies have been developed. This chapter details a selection of random mutagenesis methods and site-directed mutagenesis procedures that can be applied to an array of bacterial species. Additionally, the direct application of the techniques to study the Helicobacter pylori Ferric Uptake Regulator (Fur) protein is described. The varied approaches illustrated herein allow the robust investigation of the structural-functional relationships within a protein of interest.
-6223 Research Interests Molecular mechanisms of cellulose-degrading enzymes Structure-function relationships of biomass-derived polymers Structure-function relationships in glycoside hydrolases Methane potential protein engineering targets. Structure-Function Relationships of Biomass-Derived Polymers
Visualizing and Clustering Protein Similarity Networks: Sequences, Structures, and Functions.
Mai, Te-Lun; Hu, Geng-Ming; Chen, Chi-Ming
2016-07-01
Research in the recent decade has demonstrated the usefulness of protein network knowledge in furthering the study of molecular evolution of proteins, understanding the robustness of cells to perturbation, and annotating new protein functions. In this study, we aimed to provide a general clustering approach to visualize the sequence-structure-function relationship of protein networks, and investigate possible causes for inconsistency in the protein classifications based on sequences, structures, and functions. Such visualization of protein networks could facilitate our understanding of the overall relationship among proteins and help researchers comprehend various protein databases. As a demonstration, we clustered 1437 enzymes by their sequences and structures using the minimum span clustering (MSC) method. The general structure of this protein network was delineated at two clustering resolutions, and the second level MSC clustering was found to be highly similar to existing enzyme classifications. The clustering of these enzymes based on sequence, structure, and function information is consistent with each other. For proteases, the Jaccard's similarity coefficient is 0.86 between sequence and function classifications, 0.82 between sequence and structure classifications, and 0.78 between structure and function classifications. From our clustering results, we discussed possible examples of divergent evolution and convergent evolution of enzymes. Our clustering approach provides a panoramic view of the sequence-structure-function network of proteins, helps visualize the relation between related proteins intuitively, and is useful in predicting the structure and function of newly determined protein sequences.
A Web-Accessible Protein Structure Prediction Pipeline
2009-06-01
Abstract Proteins are the molecular basis of nearly all structural, catalytic, sensory, and regulatory functions in living organisms. The biological...sensory, and regulatory functions in living organisms. The structure of a protein is essential in understanding its function at the molecular level...Characterizing sequence-structure and structure-function relationships have been the goals of molecular biology for more than three decades
Dong, Zheng; Zhou, Hongyu; Tao, Peng
2018-02-01
PAS domains are widespread in archaea, bacteria, and eukaryota, and play important roles in various functions. In this study, we aim to explore functional evolutionary relationship among proteins in the PAS domain superfamily in view of the sequence-structure-dynamics-function relationship. We collected protein sequences and crystal structure data from RCSB Protein Data Bank of the PAS domain superfamily belonging to three biological functions (nucleotide binding, photoreceptor activity, and transferase activity). Protein sequences were aligned and then used to select sequence-conserved residues and build phylogenetic tree. Three-dimensional structure alignment was also applied to obtain structure-conserved residues. The protein dynamics were analyzed using elastic network model (ENM) and validated by molecular dynamics (MD) simulation. The result showed that the proteins with same function could be grouped by sequence similarity, and proteins in different functional groups displayed statistically significant difference in their vibrational patterns. Interestingly, in all three functional groups, conserved amino acid residues identified by sequence and structure conservation analysis generally have a lower fluctuation than other residues. In addition, the fluctuation of conserved residues in each biological function group was strongly correlated with the corresponding biological function. This research suggested a direct connection in which the protein sequences were related to various functions through structural dynamics. This is a new attempt to delineate functional evolution of proteins using the integrated information of sequence, structure, and dynamics. © 2017 The Protein Society.
Scavuzzo-Duggan, Tess R.; Chaves, Arielle M.; Roberts, Alison W.
2015-07-14
Here, a method for rapid in vivo functional analysis of engineered proteins was developed using Physcomitrella patens. A complementation assay was designed for testing structure/function relationships in cellulose synthase (CESA) proteins. The components of the assay include (1) construction of test vectors that drive expression of epitope-tagged PpCESA5 carrying engineered mutations, (2) transformation of a ppcesa5 knockout line that fails to produce gametophores with test and control vectors, (3) scoring the stable transformants for gametophore production, (4) statistical analysis comparing complementation rates for test vectors to positive and negative control vectors, and (5) analysis of transgenic protein expression by Westernmore » blotting. The assay distinguished mutations that generate fully functional, nonfunctional, and partially functional proteins. In conclusion, compared with existing methods for in vivo testing of protein function, this complementation assay provides a rapid method for investigating protein structure/function relationships in plants.« less
Taylor, Gregory K.; Stoddard, Barry L.
2012-01-01
Homing endonucleases (HEs) are highly specific DNA-cleaving enzymes that are encoded by invasive DNA elements (usually mobile introns or inteins) within the genomes of phage, bacteria, archea, protista and eukaryotic organelles. Six unique structural HE families, that collectively span four distinct nuclease catalytic motifs, have been characterized to date. Members of each family display structural homology and functional relationships to a wide variety of proteins from various organisms. The biological functions of those proteins are highly disparate and include non-specific DNA-degradation enzymes, restriction endonucleases, DNA-repair enzymes, resolvases, intron splicing factors and transcription factors. These relationships suggest that modern day HEs share common ancestors with proteins involved in genome fidelity, maintenance and gene expression. This review summarizes the results of structural studies of HEs and corresponding proteins from host organisms that have illustrated the manner in which these factors are related. PMID:22406833
Dynamics of endoglucanase catalytic domains: implications towards thermostability
USDA-ARS?s Scientific Manuscript database
The function of proteins is controlled by their dynamics inherently determined by their structure. Exploring the protein structure-dynamics relationship is important to develop an understanding of protein function that allows tapping the potential of economically important proteins, such as endogluc...
Functional Evolution of PLP-dependent Enzymes based on Active-Site Structural Similarities
Catazaro, Jonathan; Caprez, Adam; Guru, Ashu; Swanson, David; Powers, Robert
2014-01-01
Families of distantly related proteins typically have very low sequence identity, which hinders evolutionary analysis and functional annotation. Slowly evolving features of proteins, such as an active site, are therefore valuable for annotating putative and distantly related proteins. To date, a complete evolutionary analysis of the functional relationship of an entire enzyme family based on active-site structural similarities has not yet been undertaken. Pyridoxal-5’-phosphate (PLP) dependent enzymes are primordial enzymes that diversified in the last universal ancestor. Using the Comparison of Protein Active Site Structures (CPASS) software and database, we show that the active site structures of PLP-dependent enzymes can be used to infer evolutionary relationships based on functional similarity. The enzymes successfully clustered together based on substrate specificity, function, and three-dimensional fold. This study demonstrates the value of using active site structures for functional evolutionary analysis and the effectiveness of CPASS. PMID:24920327
Functional evolution of PLP-dependent enzymes based on active-site structural similarities.
Catazaro, Jonathan; Caprez, Adam; Guru, Ashu; Swanson, David; Powers, Robert
2014-10-01
Families of distantly related proteins typically have very low sequence identity, which hinders evolutionary analysis and functional annotation. Slowly evolving features of proteins, such as an active site, are therefore valuable for annotating putative and distantly related proteins. To date, a complete evolutionary analysis of the functional relationship of an entire enzyme family based on active-site structural similarities has not yet been undertaken. Pyridoxal-5'-phosphate (PLP) dependent enzymes are primordial enzymes that diversified in the last universal ancestor. Using the comparison of protein active site structures (CPASS) software and database, we show that the active site structures of PLP-dependent enzymes can be used to infer evolutionary relationships based on functional similarity. The enzymes successfully clustered together based on substrate specificity, function, and three-dimensional-fold. This study demonstrates the value of using active site structures for functional evolutionary analysis and the effectiveness of CPASS. © 2014 Wiley Periodicals, Inc.
Xie, Hongbo; Vucetic, Slobodan; Iakoucheva, Lilia M.; Oldfield, Christopher J.; Dunker, A. Keith; Uversky, Vladimir N.; Obradovic, Zoran
2008-01-01
Identifying relationships between function, amino acid sequence and protein structure represents a major challenge. In this study we propose a bioinformatics approach that identifies functional keywords in the Swiss-Prot database that correlate with intrinsic disorder. A statistical evaluation is employed to rank the significance of these correlations. Protein sequence data redundancy and the relationship between protein length and protein structure were taken into consideration to ensure the quality of the statistical inferences. Over 200,000 proteins from Swiss-Prot database were analyzed using this approach. The predictions of intrinsic disorder were carried out using PONDR VL3E predictor of long disordered regions that achieves an accuracy of above 86%. Overall, out of the 710 Swiss-Prot functional keywords that were each associated with at least 20 proteins, 238 were found to be strongly positively correlated with predicted long intrinsically disordered regions, whereas 302 were strongly negatively correlated with such regions. The remaining 170 keywords were ambiguous without strong positive or negative correlation with the disorder predictions. These functions cover a large variety of biological activities and imply that disordered regions are characterized by a wide functional repertoire. Our results agree well with literature findings, as we were able to find at least one illustrative example of functional disorder or order shown experimentally for the vast majority of keywords showing the strongest positive or negative correlation with intrinsic disorder. This work opens a series of three papers, which enriches the current view of protein structure-function relationships, especially with regards to functionalities of intrinsically disordered proteins and provides researchers with a novel tool that could be used to improve the understanding of the relationships between protein structure and function. The first paper of the series describes our statistical approach, outlines the major findings and provides illustrative examples of biological processes and functions positively and negatively correlated with intrinsic disorder. PMID:17391014
Xie, Hongbo; Vucetic, Slobodan; Iakoucheva, Lilia M; Oldfield, Christopher J; Dunker, A Keith; Uversky, Vladimir N; Obradovic, Zoran
2007-05-01
Identifying relationships between function, amino acid sequence, and protein structure represents a major challenge. In this study, we propose a bioinformatics approach that identifies functional keywords in the Swiss-Prot database that correlate with intrinsic disorder. A statistical evaluation is employed to rank the significance of these correlations. Protein sequence data redundancy and the relationship between protein length and protein structure were taken into consideration to ensure the quality of the statistical inferences. Over 200,000 proteins from the Swiss-Prot database were analyzed using this approach. The predictions of intrinsic disorder were carried out using PONDR VL3E predictor of long disordered regions that achieves an accuracy of above 86%. Overall, out of the 710 Swiss-Prot functional keywords that were each associated with at least 20 proteins, 238 were found to be strongly positively correlated with predicted long intrinsically disordered regions, whereas 302 were strongly negatively correlated with such regions. The remaining 170 keywords were ambiguous without strong positive or negative correlation with the disorder predictions. These functions cover a large variety of biological activities and imply that disordered regions are characterized by a wide functional repertoire. Our results agree well with literature findings, as we were able to find at least one illustrative example of functional disorder or order shown experimentally for the vast majority of keywords showing the strongest positive or negative correlation with intrinsic disorder. This work opens a series of three papers, which enriches the current view of protein structure-function relationships, especially with regards to functionalities of intrinsically disordered proteins, and provides researchers with a novel tool that could be used to improve the understanding of the relationships between protein structure and function. The first paper of the series describes our statistical approach, outlines the major findings, and provides illustrative examples of biological processes and functions positively and negatively correlated with intrinsic disorder.
Mudgal, Richa; Srinivasan, Narayanaswamy; Chandra, Nagasuma
2017-07-01
Functional annotation is seldom straightforward with complexities arising due to functional divergence in protein families or functional convergence between non-homologous protein families, leading to mis-annotations. An enzyme may contain multiple domains and not all domains may be involved in a given function, adding to the complexity in function annotation. To address this, we use binding site information from bound cognate ligands and catalytic residues, since it can help in resolving fold-function relationships at a finer level and with higher confidence. A comprehensive database of 2,020 fold-function-binding site relationships has been systematically generated. A network-based approach is employed to capture the complexity in these relationships, from which different types of associations are deciphered, that identify versatile protein folds performing diverse functions, same function associated with multiple folds and one-to-one relationships. Binding site similarity networks integrated with fold, function, and ligand similarity information are generated to understand the depth of these relationships. Apart from the observed continuity in the functional site space, network properties of these revealed versatile families with topologically different or dissimilar binding sites and structural families that perform very similar functions. As a case study, subtle changes in the active site of a set of evolutionarily related superfamilies are studied using these networks. Tracing of such similarities in evolutionarily related proteins provide clues into the transition and evolution of protein functions. Insights from this study will be helpful in accurate and reliable functional annotations of uncharacterized proteins, poly-pharmacology, and designing enzymes with new functional capabilities. Proteins 2017; 85:1319-1335. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Nakai, S; Li-Chan, E
1985-10-01
According to the original idea of quantitative structure-activity relationship, electric, hydrophobic, and structural parameters should be taken into consideration for elucidating functionality. Changes in these parameters are reflected in the property of protein solubility upon modification of whey proteins by heating. Although solubility is itself a functional property, it has been utilized to explain other functionalities of proteins. However, better correlations were obtained when hydrophobic parameters of the proteins were used in conjunction with solubility. Various treatments reported in the literature were applied to whey protein concentrate in an attempt to obtain whipping and gelling properties similar to those of egg white. Mapping simplex optimization was used to search for the best results. Improvement in whipping properties by pepsin hydrolysis may have been due to higher protein solubility, and good gelling properties resulting from polyphosphate treatment may have been due to an increase in exposable hydrophobicity. However, the results of angel food cake making were still unsatisfactory.
Protein crystallization X-ray diffraction data collection Protein structure determination Obtaining structures of protein-ligand complexes Site-directed mutagenesis Structure-function relationship Enzymatic CelA," Science (2013) "Sequence, Structure, and Evolution of Cellulases in Glycoside
Ogawa, Seiji; Watanabe, Toshihide; Moriyuki, Kazumi; Goto, Yoshikazu; Yamane, Shinsaku; Watanabe, Akio; Tsuboi, Kazuma; Kinoshita, Atsushi; Okada, Takuya; Takeda, Hiroyuki; Tani, Kousuke; Maruyama, Toru
2016-05-15
The modification of the novel G protein-biased EP2 agonist 1 has been investigated to improve its G protein activity and develop a better understanding of its structure-functional selectivity relationship (SFSR). The optimization of the substituents on the phenyl ring of 1, followed by the inversion of the hydroxyl group on the cyclopentane moiety led to compound 9, which showed a 100-fold increase in its G protein activity compared with 1 without any increase in β-arrestin recruitment. Furthermore, SFSR studies revealed that the combination of meta and para substituents on the phenyl moiety was crucial to the functional selectivity. Copyright © 2016 Elsevier Ltd. All rights reserved.
Hati, Sanchita; Bhattacharyya, Sudeep
2016-01-01
A project-based biophysical chemistry laboratory course, which is offered to the biochemistry and molecular biology majors in their senior year, is described. In this course, the classroom study of the structure-function of biomolecules is integrated with the discovery-guided laboratory study of these molecules using computer modeling and simulations. In particular, modern computational tools are employed to elucidate the relationship between structure, dynamics, and function in proteins. Computer-based laboratory protocols that we introduced in three modules allow students to visualize the secondary, super-secondary, and tertiary structures of proteins, analyze non-covalent interactions in protein-ligand complexes, develop three-dimensional structural models (homology model) for new protein sequences and evaluate their structural qualities, and study proteins' intrinsic dynamics to understand their functions. In the fourth module, students are assigned to an authentic research problem, where they apply their laboratory skills (acquired in modules 1-3) to answer conceptual biophysical questions. Through this process, students gain in-depth understanding of protein dynamics-the missing link between structure and function. Additionally, the requirement of term papers sharpens students' writing and communication skills. Finally, these projects result in new findings that are communicated in peer-reviewed journals. © 2016 The International Union of Biochemistry and Molecular Biology.
Dynamic New World: Refining Our View of Protein Structure, Function and Evolution
Mannige, Ranjan V.
2014-01-01
Proteins are crucial to the functioning of all lifeforms. Traditional understanding posits that a single protein occupies a single structure (“fold”), which performs a single function. This view is radically challenged with the recognition that high structural dynamism—the capacity to be extra “floppy”—is more prevalent in functional proteins than previously assumed. As reviewed here, this dynamic take on proteins affects our understanding of protein “structure”, function, and evolution, and even gives us a glimpse into protein origination. Specifically, this review will discuss historical developments concerning protein structure, and important new relationships between dynamism and aspects of protein sequence, structure, binding modes, binding promiscuity, evolvability, and origination. Along the way, suggestions will be provided for how key parts of textbook definitions—that so far have excluded membership to intrinsically disordered proteins (IDPs)—could be modified to accommodate our more dynamic understanding of proteins. PMID:28250374
Zebra: a web server for bioinformatic analysis of diverse protein families.
Suplatov, Dmitry; Kirilin, Evgeny; Takhaveev, Vakil; Svedas, Vytas
2014-01-01
During evolution of proteins from a common ancestor, one functional property can be preserved while others can vary leading to functional diversity. A systematic study of the corresponding adaptive mutations provides a key to one of the most challenging problems of modern structural biology - understanding the impact of amino acid substitutions on protein function. The subfamily-specific positions (SSPs) are conserved within functional subfamilies but are different between them and, therefore, seem to be responsible for functional diversity in protein superfamilies. Consequently, a corresponding method to perform the bioinformatic analysis of sequence and structural data has to be implemented in the common laboratory practice to study the structure-function relationship in proteins and develop novel protein engineering strategies. This paper describes Zebra web server - a powerful remote platform that implements a novel bioinformatic analysis algorithm to study diverse protein families. It is the first application that provides specificity determinants at different levels of functional classification, therefore addressing complex functional diversity of large superfamilies. Statistical analysis is implemented to automatically select a set of highly significant SSPs to be used as hotspots for directed evolution or rational design experiments and analyzed studying the structure-function relationship. Zebra results are provided in two ways - (1) as a single all-in-one parsable text file and (2) as PyMol sessions with structural representation of SSPs. Zebra web server is available at http://biokinet.belozersky.msu.ru/zebra .
ERIC Educational Resources Information Center
Forbes-Lorman, Robin M.; Harris, Michelle A.; Chang, Wesley S.; Dent, Erik W.; Nordheim, Erik V.; Franzen, Margaret A.
2016-01-01
Understanding how basic structural units influence function is identified as a foundational/core concept for undergraduate biological and biochemical literacy. It is essential for students to understand this concept at all size scales, but it is often more difficult for students to understand structure-function relationships at the molecular…
PROFESS: a PROtein Function, Evolution, Structure and Sequence database
Triplet, Thomas; Shortridge, Matthew D.; Griep, Mark A.; Stark, Jaime L.; Powers, Robert; Revesz, Peter
2010-01-01
The proliferation of biological databases and the easy access enabled by the Internet is having a beneficial impact on biological sciences and transforming the way research is conducted. There are ∼1100 molecular biology databases dispersed throughout the Internet. To assist in the functional, structural and evolutionary analysis of the abundant number of novel proteins continually identified from whole-genome sequencing, we introduce the PROFESS (PROtein Function, Evolution, Structure and Sequence) database. Our database is designed to be versatile and expandable and will not confine analysis to a pre-existing set of data relationships. A fundamental component of this approach is the development of an intuitive query system that incorporates a variety of similarity functions capable of generating data relationships not conceived during the creation of the database. The utility of PROFESS is demonstrated by the analysis of the structural drift of homologous proteins and the identification of potential pancreatic cancer therapeutic targets based on the observation of protein–protein interaction networks. Database URL: http://cse.unl.edu/∼profess/ PMID:20624718
Nagata, Koji
2010-01-01
Peptides and proteins with similar amino acid sequences can have different biological functions. Knowledge of their three-dimensional molecular structures is critically important in identifying their functional determinants. In this review, I describe the results of our and other groups' structure-based functional characterization of insect insulin-like peptides, a crustacean hyperglycemic hormone-family peptide, a mammalian epidermal growth factor-family protein, and an intracellular signaling domain that recognizes proline-rich sequence.
Hidden relationships between metalloproteins unveiled by structural comparison of their metal sites
NASA Astrophysics Data System (ADS)
Valasatava, Yana; Andreini, Claudia; Rosato, Antonio
2015-03-01
Metalloproteins account for a substantial fraction of all proteins. They incorporate metal atoms, which are required for their structure and/or function. Here we describe a new computational protocol to systematically compare and classify metal-binding sites on the basis of their structural similarity. These sites are extracted from the MetalPDB database of minimal functional sites (MFSs) in metal-binding biological macromolecules. Structural similarity is measured by the scoring function of the available MetalS2 program. Hierarchical clustering was used to organize MFSs into clusters, for each of which a representative MFS was identified. The comparison of all representative MFSs provided a thorough structure-based classification of the sites analyzed. As examples, the application of the proposed computational protocol to all heme-binding proteins and zinc-binding proteins of known structure highlighted the existence of structural subtypes, validated known evolutionary links and shed new light on the occurrence of similar sites in systems at different evolutionary distances. The present approach thus makes available an innovative viewpoint on metalloproteins, where the functionally crucial metal sites effectively lead the discovery of structural and functional relationships in a largely protein-independent manner.
Dawson, Natalie L; Sillitoe, Ian; Lees, Jonathan G; Lam, Su Datt; Orengo, Christine A
2017-01-01
This chapter describes the generation of the data in the CATH-Gene3D online resource and how it can be used to study protein domains and their evolutionary relationships. Methods will be presented for: comparing protein structures, recognizing homologs, predicting domain structures within protein sequences, and subclassifying superfamilies into functionally pure families, together with a guide on using the webpages.
Thapliyal, Charu; Jain, Neha; Chaudhuri, Pratima
2015-01-01
A protein, differing in origin, may exhibit variable physicochemical behaviour, difference in sequence homology, fold and function. Thus studying structure-function relationship of proteins from altered sources is meaningful in the sense that it may give rise to comparative aspects of their sequence-structure-function relationship. Dihydrofolate reductase is an enzyme involved in cell cycle regulation. It is a significant enzyme as.a target for developing anticancer drugs. Hence, detailed understanding of structure-function relationships of wide variants of the enzyme dihydrofolate reductase would be important for developing an inhibitor or an antagonist against the enzyme involved in the cellular developmental processes. In this communication, we have reported the comparative structure-function relationship between E. coli and human dihydrofolate reductase. The differences in the unfolding behaviour of these two proteins have been investigated to understand various properties of these two proteins like relative' stability differences and variation in conformational changes under identical denaturing conditions. The equilibrium unfolding mechanism of dihydrofolate reductase proteins using guanidine hydrochloride as a denaturant in the presence of various types of osmolytes has been monitored using loss in enzymatic activity, intrinsic tryptophan fluorescence and an extrinsic fluorophore 8-anilino-1-naphthalene-sulfonic acid as probes. It has been observed that osmolytes, such as 1M sucrose, and 30% glycerol, provided enhanced stability to both variants of dihydrofolate reductase. Their level of stabilisation has been observed to be dependent on intrinsic protein stability. It was observed that 100 mM proline does not show any 'significant stabilisation to either of dihydrofolate reductases. In the present study, it has been observed that the human protein is relatively less stable than the E.coli counterpart.
Esque, Jérémy; Urbain, Aurélie; Etchebest, Catherine; de Brevern, Alexandre G
2015-11-01
Transmembrane proteins (TMPs) are major drug targets, but the knowledge of their precise topology structure remains highly limited compared with globular proteins. In spite of the difficulties in obtaining their structures, an important effort has been made these last years to increase their number from an experimental and computational point of view. In view of this emerging challenge, the development of computational methods to extract knowledge from these data is crucial for the better understanding of their functions and in improving the quality of structural models. Here, we revisit an efficient unsupervised learning procedure, called Hybrid Protein Model (HPM), which is applied to the analysis of transmembrane proteins belonging to the all-α structural class. HPM method is an original classification procedure that efficiently combines sequence and structure learning. The procedure was initially applied to the analysis of globular proteins. In the present case, HPM classifies a set of overlapping protein fragments, extracted from a non-redundant databank of TMP 3D structure. After fine-tuning of the learning parameters, the optimal classification results in 65 clusters. They represent at best similar relationships between sequence and local structure properties of TMPs. Interestingly, HPM distinguishes among the resulting clusters two helical regions with distinct hydrophobic patterns. This underlines the complexity of the topology of these proteins. The HPM classification enlightens unusual relationship between amino acids in TMP fragments, which can be useful to elaborate new amino acids substitution matrices. Finally, two challenging applications are described: the first one aims at annotating protein functions (channel or not), the second one intends to assess the quality of the structures (X-ray or models) via a new scoring function deduced from the HPM classification.
From Sequence and Forces to Structure, Function and Evolution of Intrinsically Disordered Proteins
Forman-Kay, Julie D.; Mittag, Tanja
2015-01-01
Intrinsically disordered proteins (IDPs), which lack persistent structure, are a challenge to structural biology due to the inapplicability of standard methods for characterization of folded proteins as well as their deviation from the dominant structure/function paradigm. Their widespread presence and involvement in biological function, however, has spurred the growing acceptance of the importance of IDPs and the development of new tools for studying their structure, dynamics and function. The interplay of folded and disordered domains or regions for function and the existence of a continuum of protein states with respect to conformational energetics, motional timescales and compactness is shaping a unified understanding of structure-dynamics-disorder/function relationships. On the 20th anniversary of this journal, Structure, we provide a historical perspective on the investigation of IDPs and summarize the sequence features and physical forces that underlie their unique structural, functional and evolutionary properties. PMID:24010708
From sequence and forces to structure, function, and evolution of intrinsically disordered proteins.
Forman-Kay, Julie D; Mittag, Tanja
2013-09-03
Intrinsically disordered proteins (IDPs), which lack persistent structure, are a challenge to structural biology due to the inapplicability of standard methods for characterization of folded proteins as well as their deviation from the dominant structure/function paradigm. Their widespread presence and involvement in biological function, however, has spurred the growing acceptance of the importance of IDPs and the development of new tools for studying their structure, dynamics, and function. The interplay of folded and disordered domains or regions for function and the existence of a continuum of protein states with respect to conformational energetics, motional timescales, and compactness are shaping a unified understanding of structure-dynamics-disorder/function relationships. In the 20(th) anniversary of Structure, we provide a historical perspective on the investigation of IDPs and summarize the sequence features and physical forces that underlie their unique structural, functional, and evolutionary properties. Copyright © 2013 Elsevier Ltd. All rights reserved.
ERIC Educational Resources Information Center
Lawrence, Sarah H.; Jaffe, Eileen K.
2008-01-01
A morpheein is a homo-oligomeric protein that can exist as an ensemble of physiologically significant and functionally distinct alternate quaternary assemblies. Morpheeins exist in nature and use conformational equilibria between different tertiary structures to form distinct oligomers as a means of regulating their function. Notably, alternate…
ECOD: An Evolutionary Classification of Protein Domains
Kinch, Lisa N.; Pei, Jimin; Shi, Shuoyong; Kim, Bong-Hyun; Grishin, Nick V.
2014-01-01
Understanding the evolution of a protein, including both close and distant relationships, often reveals insight into its structure and function. Fast and easy access to such up-to-date information facilitates research. We have developed a hierarchical evolutionary classification of all proteins with experimentally determined spatial structures, and presented it as an interactive and updatable online database. ECOD (Evolutionary Classification of protein Domains) is distinct from other structural classifications in that it groups domains primarily by evolutionary relationships (homology), rather than topology (or “fold”). This distinction highlights cases of homology between domains of differing topology to aid in understanding of protein structure evolution. ECOD uniquely emphasizes distantly related homologs that are difficult to detect, and thus catalogs the largest number of evolutionary links among structural domain classifications. Placing distant homologs together underscores the ancestral similarities of these proteins and draws attention to the most important regions of sequence and structure, as well as conserved functional sites. ECOD also recognizes closer sequence-based relationships between protein domains. Currently, approximately 100,000 protein structures are classified in ECOD into 9,000 sequence families clustered into close to 2,000 evolutionary groups. The classification is assisted by an automated pipeline that quickly and consistently classifies weekly releases of PDB structures and allows for continual updates. This synchronization with PDB uniquely distinguishes ECOD among all protein classifications. Finally, we present several case studies of homologous proteins not recorded in other classifications, illustrating the potential of how ECOD can be used to further biological and evolutionary studies. PMID:25474468
ECOD: an evolutionary classification of protein domains.
Cheng, Hua; Schaeffer, R Dustin; Liao, Yuxing; Kinch, Lisa N; Pei, Jimin; Shi, Shuoyong; Kim, Bong-Hyun; Grishin, Nick V
2014-12-01
Understanding the evolution of a protein, including both close and distant relationships, often reveals insight into its structure and function. Fast and easy access to such up-to-date information facilitates research. We have developed a hierarchical evolutionary classification of all proteins with experimentally determined spatial structures, and presented it as an interactive and updatable online database. ECOD (Evolutionary Classification of protein Domains) is distinct from other structural classifications in that it groups domains primarily by evolutionary relationships (homology), rather than topology (or "fold"). This distinction highlights cases of homology between domains of differing topology to aid in understanding of protein structure evolution. ECOD uniquely emphasizes distantly related homologs that are difficult to detect, and thus catalogs the largest number of evolutionary links among structural domain classifications. Placing distant homologs together underscores the ancestral similarities of these proteins and draws attention to the most important regions of sequence and structure, as well as conserved functional sites. ECOD also recognizes closer sequence-based relationships between protein domains. Currently, approximately 100,000 protein structures are classified in ECOD into 9,000 sequence families clustered into close to 2,000 evolutionary groups. The classification is assisted by an automated pipeline that quickly and consistently classifies weekly releases of PDB structures and allows for continual updates. This synchronization with PDB uniquely distinguishes ECOD among all protein classifications. Finally, we present several case studies of homologous proteins not recorded in other classifications, illustrating the potential of how ECOD can be used to further biological and evolutionary studies.
Ramakrishnan, Gayatri; Ochoa-Montaño, Bernardo; Raghavender, Upadhyayula S; Mudgal, Richa; Joshi, Adwait G; Chandra, Nagasuma R; Sowdhamini, Ramanathan; Blundell, Tom L; Srinivasan, Narayanaswamy
2015-01-01
The availability of the genome sequence of Mycobacterium tuberculosis H37Rv has encouraged determination of large numbers of protein structures and detailed definition of the biological information encoded therein; yet, the functions of many proteins in M. tuberculosis remain unknown. The emergence of multidrug resistant strains makes it a priority to exploit recent advances in homology recognition and structure prediction to re-analyse its gene products. Here we report the structural and functional characterization of gene products encoded in the M. tuberculosis genome, with the help of sensitive profile-based remote homology search and fold recognition algorithms resulting in an enhanced annotation of the proteome where 95% of the M. tuberculosis proteins were identified wholly or partly with information on structure or function. New information includes association of 244 proteins with 205 domain families and a separate set of new association of folds to 64 proteins. Extending structural information across uncharacterized protein families represented in the M. tuberculosis proteome, by determining superfamily relationships between families of known and unknown structures, has contributed to an enhancement in the knowledge of structural content. In retrospect, such superfamily relationships have facilitated recognition of probable structure and/or function for several uncharacterized protein families, eventually aiding recognition of probable functions for homologous proteins corresponding to such families. Gene products unique to mycobacteria for which no functions could be identified are 183. Of these 18 were determined to be M. tuberculosis specific. Such pathogen-specific proteins are speculated to harbour virulence factors required for pathogenesis. A re-annotated proteome of M. tuberculosis, with greater completeness of annotated proteins and domain assigned regions, provides a valuable basis for experimental endeavours designed to obtain a better understanding of pathogenesis and to accelerate the process of drug target discovery. Copyright © 2014 Elsevier Ltd. All rights reserved.
von Grotthuss, Marcin; Plewczynski, Dariusz; Ginalski, Krzysztof; Rychlewski, Leszek; Shakhnovich, Eugene I
2006-02-06
The number of protein structures from structural genomics centers dramatically increases in the Protein Data Bank (PDB). Many of these structures are functionally unannotated because they have no sequence similarity to proteins of known function. However, it is possible to successfully infer function using only structural similarity. Here we present the PDB-UF database, a web-accessible collection of predictions of enzymatic properties using structure-function relationship. The assignments were conducted for three-dimensional protein structures of unknown function that come from structural genomics initiatives. We show that 4 hypothetical proteins (with PDB accession codes: 1VH0, 1NS5, 1O6D, and 1TO0), for which standard BLAST tools such as PSI-BLAST or RPS-BLAST failed to assign any function, are probably methyltransferase enzymes. We suggest that the structure-based prediction of an EC number should be conducted having the different similarity score cutoff for different protein folds. Moreover, performing the annotation using two different algorithms can reduce the rate of false positive assignments. We believe, that the presented web-based repository will help to decrease the number of protein structures that have functions marked as "unknown" in the PDB file. http://paradox.harvard.edu/PDB-UF and http://bioinfo.pl/PDB-UF.
Motomura, Kenta; Nakamura, Morikazu; Otaki, Joji M.
2013-01-01
Protein structure and function information is coded in amino acid sequences. However, the relationship between primary sequences and three-dimensional structures and functions remains enigmatic. Our approach to this fundamental biochemistry problem is based on the frequencies of short constituent sequences (SCSs) or words. A protein amino acid sequence is considered analogous to an English sentence, where SCSs are equivalent to words. Availability scores, which are defined as real SCS frequencies in the non-redundant amino acid database relative to their probabilistically expected frequencies, demonstrate the biological usage bias of SCSs. As a result, this frequency-based linguistic approach is expected to have diverse applications, such as secondary structure specifications by structure-specific SCSs and immunological adjuvants with rare or non-existent SCSs. Linguistic similarities (e.g., wide ranges of scale-free distributions) and dissimilarities (e.g., behaviors of low-rank samples) between proteins and the natural English language have been revealed in the rank-frequency relationships of SCSs or words. We have developed a web server, the SCS Package, which contains five applications for analyzing protein sequences based on the linguistic concept. These tools have the potential to assist researchers in deciphering structurally and functionally important protein sites, species-specific sequences, and functional relationships between SCSs. The SCS Package also provides researchers with a tool to construct amino acid sequences de novo based on the idiomatic usage of SCSs. PMID:24688703
Motomura, Kenta; Nakamura, Morikazu; Otaki, Joji M
2013-01-01
Protein structure and function information is coded in amino acid sequences. However, the relationship between primary sequences and three-dimensional structures and functions remains enigmatic. Our approach to this fundamental biochemistry problem is based on the frequencies of short constituent sequences (SCSs) or words. A protein amino acid sequence is considered analogous to an English sentence, where SCSs are equivalent to words. Availability scores, which are defined as real SCS frequencies in the non-redundant amino acid database relative to their probabilistically expected frequencies, demonstrate the biological usage bias of SCSs. As a result, this frequency-based linguistic approach is expected to have diverse applications, such as secondary structure specifications by structure-specific SCSs and immunological adjuvants with rare or non-existent SCSs. Linguistic similarities (e.g., wide ranges of scale-free distributions) and dissimilarities (e.g., behaviors of low-rank samples) between proteins and the natural English language have been revealed in the rank-frequency relationships of SCSs or words. We have developed a web server, the SCS Package, which contains five applications for analyzing protein sequences based on the linguistic concept. These tools have the potential to assist researchers in deciphering structurally and functionally important protein sites, species-specific sequences, and functional relationships between SCSs. The SCS Package also provides researchers with a tool to construct amino acid sequences de novo based on the idiomatic usage of SCSs.
NASA Technical Reports Server (NTRS)
Lanyi, J. K.
1986-01-01
The archaebacteria occupy a unique place in phylogenetic trees constructed from analyses of sequences from key informational macromolecules, and their study continues to yield interesting ideas on the early evolution and divergence of biological forms. It is now known that the halobacteria among these species contain various retinal-proteins, resembling eukaryotic rhodopsins, but with different functions. Two of these pigments, located in the cytoplasmic membranes of the bacteria, are bacteriorhodopsin (a light-driven proton pump) and halorhodopsin (a light-driven chloride pump). Comparison of these systems is expected to reveal structure/function relationships in these simple (primitive?) energy transducing membrane components and evolutionary relationships which had produced the structural features which allow the divergent functions. Findings indicate that very different primary structures are needed for these proteins to accomplish their different functions. Indeed, analysis of partial amino acid sequences from halo-opsin shows already that few if any long segments exist which are homologous to bacterio-opsin. Either these proteins diverged a very long time ago to allow for the observed differences, or the evolutionary clock in the halobacteria runs faster than usual.
Supra-domains: evolutionary units larger than single protein domains.
Vogel, Christine; Berzuini, Carlo; Bashton, Matthew; Gough, Julian; Teichmann, Sarah A
2004-02-20
Domains are the evolutionary units that comprise proteins, and most proteins are built from more than one domain. Domains can be shuffled by recombination to create proteins with new arrangements of domains. Using structural domain assignments, we examined the combinations of domains in the proteins of 131 completely sequenced organisms. We found two-domain and three-domain combinations that recur in different protein contexts with different partner domains. The domains within these combinations have a particular functional and spatial relationship. These units are larger than individual domains and we term them "supra-domains". Amongst the supra-domains, we identified some 1400 (1203 two-domain and 166 three-domain) combinations that are statistically significantly over-represented relative to the occurrence and versatility of the individual component domains. Over one-third of all structurally assigned multi-domain proteins contain these over-represented supra-domains. This means that investigation of the structural and functional relationships of the domains forming these popular combinations would be particularly useful for an understanding of multi-domain protein function and evolution as well as for genome annotation. These and other supra-domains were analysed for their versatility, duplication, their distribution across the three kingdoms of life and their functional classes. By examining the three-dimensional structures of several examples of supra-domains in different biological processes, we identify two basic types of spatial relationships between the component domains: the combined function of the two domains is such that either the geometry of the two domains is crucial and there is a tight constraint on the interface, or the precise orientation of the domains is less important and they are spatially separate. Frequently, the role of the supra-domain becomes clear only once the three-dimensional structure is known. Since this is the case for only a quarter of the supra-domains, we provide a list of the most important unknown supra-domains as potential targets for structural genomics projects.
Understand protein functions by comparing the similarity of local structural environments.
Chen, Jiawen; Xie, Zhong-Ru; Wu, Yinghao
2017-02-01
The three-dimensional structures of proteins play an essential role in regulating binding between proteins and their partners, offering a direct relationship between structures and functions of proteins. It is widely accepted that the function of a protein can be determined if its structure is similar to other proteins whose functions are known. However, it is also observed that proteins with similar global structures do not necessarily correspond to the same function, while proteins with very different folds can share similar functions. This indicates that function similarity is originated from the local structural information of proteins instead of their global shapes. We assume that proteins with similar local environments prefer binding to similar types of molecular targets. In order to testify this assumption, we designed a new structural indicator to define the similarity of local environment between residues in different proteins. This indicator was further used to calculate the probability that a given residue binds to a specific type of structural neighbors, including DNA, RNA, small molecules and proteins. After applying the method to a large-scale non-redundant database of proteins, we show that the positive signal of binding probability calculated from the local structural indicator is statistically meaningful. In summary, our studies suggested that the local environment of residues in a protein is a good indicator to recognize specific binding partners of the protein. The new method could be a potential addition to a suite of existing template-based approaches for protein function prediction. Copyright © 2016 Elsevier B.V. All rights reserved.
Bhagavat, Raghu; Sankar, Santhosh; Srinivasan, Narayanaswamy; Chandra, Nagasuma
2018-03-06
Protein-ligand interactions form the basis of most cellular events. Identifying ligand binding pockets in proteins will greatly facilitate rationalizing and predicting protein function. Ligand binding sites are unknown for many proteins of known three-dimensional (3D) structure, creating a gap in our understanding of protein structure-function relationships. To bridge this gap, we detect pockets in proteins of known 3D structures, using computational techniques. This augmented pocketome (PocketDB) consists of 249,096 pockets, which is about seven times larger than what is currently known. We deduce possible ligand associations for about 46% of the newly identified pockets. The augmented pocketome, when subjected to clustering based on similarities among pockets, yielded 2,161 site types, which are associated with 1,037 ligand types, together providing fold-site-type-ligand-type associations. The PocketDB resource facilitates a structure-based function annotation, delineation of the structural basis of ligand recognition, and provides functional clues for domains of unknown functions, allosteric proteins, and druggable pockets. Copyright © 2018 Elsevier Ltd. All rights reserved.
Novel functions of CCM1 delimit the relationship of PTB/PH domains.
Zhang, Jun; Dubey, Pallavi; Padarti, Akhil; Zhang, Aileen; Patel, Rinkal; Patel, Vipulkumar; Cistola, David; Badr, Ahmed
2017-10-01
Three NPXY motifs and one FERM domain in CCM1 makes it a versatile scaffold protein for tethering the signaling components together within the CCM signaling complex (CSC). The cellular role of CCM1 protein remains inadequately expounded. Both phosphotyrosine binding (PTB) and pleckstrin homology (PH) domains were recognized as structurally related but functionally distinct domains. By utilizing molecular cloning, protein binding assays and RT-qPCR to identify novel cellular partners of CCM1 and its cellular expression patterns; by screening candidate PTB/PH proteins and subsequently structurally simulation in combining with current X-ray crystallography and NMR data to defined the essential structure of PTB/PH domain for NPXY-binding and the relationship among PTB, PH and FERM domain(s). We identified a group of 28 novel cellular partners of CCM1, all of which contain either PTB or PH domain(s), and developed a novel classification system for these PTB/PH proteins based on their relationship with different NPXY motifs of CCM1. Our results demonstrated that CCM1 has a wide spectrum of binding to different PTB/PH proteins and perpetuates their specificity to interact with certain PTB/PH domains through selective combination of three NPXY motifs. We also demonstrated that CCM1 can be assembled into oligomers through intermolecular interaction between its F3 lobe in FERM domain and one of the three NPXY motifs. Despite being embedded in FERM domain as F3 lobe, F3 module acts as a fully functional PH domain to interact with NPXY motif. The most salient feature of the study was that both PTB and PH domains are structurally and functionally comparable, suggesting that PTB domain is likely evolved from PH domain with polymorphic structural additions at its N-terminus. A new β1A-strand of the PTB domain was discovered and new minimum structural requirement of PTB/PH domain for NPXY motif-binding was determined. Based on our data, a novel theory of structure, function and relationship of PTB, PH and FERM domains has been proposed, which extends the importance of the NPXY-PTB/PH interaction on the CSC signaling and/or other cell receptors with great potential pointing to new therapeutic strategies. The study provides new insight into the structural characteristics of PTB/PH domains, essential structural elements of PTB/PH domain required for NPXY motif-binding, and function and relationship among PTB, PH and FERM domains. Copyright © 2017 Elsevier B.V. All rights reserved.
The Structure and Function of Non-Collagenous Bone Proteins
NASA Technical Reports Server (NTRS)
Hook, Magnus; McQuillan, David J.
1997-01-01
The research done under the cooperative research agreement for the project titled 'The structure and function of non-collagenous bone proteins' represented the first phase of an ongoing program to define the structural and functional relationships of the principal noncollagenous proteins in bone. An ultimate goal of this research is to enable design and execution of useful pharmacological compounds that will have a beneficial effect in treatment of osteoporosis, both land-based and induced by long-duration space travel. The goals of the now complete first phase were as follows: 1. Establish and/or develop powerful recombinant protein expression systems; 2. Develop and refine isolation and purification of recombinant proteins; 3. Express wild-type non-collagenous bone proteins; 4. Express site-specific mutant proteins and domains of wild-type proteins to enhance likelihood of crystal formation for subsequent solution of structure.
Accounting for epistatic interactions improves the functional analysis of protein structures.
Wilkins, Angela D; Venner, Eric; Marciano, David C; Erdin, Serkan; Atri, Benu; Lua, Rhonald C; Lichtarge, Olivier
2013-11-01
The constraints under which sequence, structure and function coevolve are not fully understood. Bringing this mutual relationship to light can reveal the molecular basis of binding, catalysis and allostery, thereby identifying function and rationally guiding protein redesign. Underlying these relationships are the epistatic interactions that occur when the consequences of a mutation to a protein are determined by the genetic background in which it occurs. Based on prior data, we hypothesize that epistatic forces operate most strongly between residues nearby in the structure, resulting in smooth evolutionary importance across the structure. We find that when residue scores of evolutionary importance are distributed smoothly between nearby residues, functional site prediction accuracy improves. Accordingly, we designed a novel measure of evolutionary importance that focuses on the interaction between pairs of structurally neighboring residues. This measure that we term pair-interaction Evolutionary Trace yields greater functional site overlap and better structure-based proteome-wide functional predictions. Our data show that the structural smoothness of evolutionary importance is a fundamental feature of the coevolution of sequence, structure and function. Mutations operate on individual residues, but selective pressure depends in part on the extent to which a mutation perturbs interactions with neighboring residues. In practice, this principle led us to redefine the importance of a residue in terms of the importance of its epistatic interactions with neighbors, yielding better annotation of functional residues, motivating experimental validation of a novel functional site in LexA and refining protein function prediction. lichtarge@bcm.edu. Supplementary data are available at Bioinformatics online.
Accounting for epistatic interactions improves the functional analysis of protein structures
Wilkins, Angela D.; Venner, Eric; Marciano, David C.; Erdin, Serkan; Atri, Benu; Lua, Rhonald C.; Lichtarge, Olivier
2013-01-01
Motivation: The constraints under which sequence, structure and function coevolve are not fully understood. Bringing this mutual relationship to light can reveal the molecular basis of binding, catalysis and allostery, thereby identifying function and rationally guiding protein redesign. Underlying these relationships are the epistatic interactions that occur when the consequences of a mutation to a protein are determined by the genetic background in which it occurs. Based on prior data, we hypothesize that epistatic forces operate most strongly between residues nearby in the structure, resulting in smooth evolutionary importance across the structure. Methods and Results: We find that when residue scores of evolutionary importance are distributed smoothly between nearby residues, functional site prediction accuracy improves. Accordingly, we designed a novel measure of evolutionary importance that focuses on the interaction between pairs of structurally neighboring residues. This measure that we term pair-interaction Evolutionary Trace yields greater functional site overlap and better structure-based proteome-wide functional predictions. Conclusions: Our data show that the structural smoothness of evolutionary importance is a fundamental feature of the coevolution of sequence, structure and function. Mutations operate on individual residues, but selective pressure depends in part on the extent to which a mutation perturbs interactions with neighboring residues. In practice, this principle led us to redefine the importance of a residue in terms of the importance of its epistatic interactions with neighbors, yielding better annotation of functional residues, motivating experimental validation of a novel functional site in LexA and refining protein function prediction. Contact: lichtarge@bcm.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24021383
Insights from molecular dynamics simulations for computational protein design.
Childers, Matthew Carter; Daggett, Valerie
2017-02-01
A grand challenge in the field of structural biology is to design and engineer proteins that exhibit targeted functions. Although much success on this front has been achieved, design success rates remain low, an ever-present reminder of our limited understanding of the relationship between amino acid sequences and the structures they adopt. In addition to experimental techniques and rational design strategies, computational methods have been employed to aid in the design and engineering of proteins. Molecular dynamics (MD) is one such method that simulates the motions of proteins according to classical dynamics. Here, we review how insights into protein dynamics derived from MD simulations have influenced the design of proteins. One of the greatest strengths of MD is its capacity to reveal information beyond what is available in the static structures deposited in the Protein Data Bank. In this regard simulations can be used to directly guide protein design by providing atomistic details of the dynamic molecular interactions contributing to protein stability and function. MD simulations can also be used as a virtual screening tool to rank, select, identify, and assess potential designs. MD is uniquely poised to inform protein design efforts where the application requires realistic models of protein dynamics and atomic level descriptions of the relationship between dynamics and function. Here, we review cases where MD simulations was used to modulate protein stability and protein function by providing information regarding the conformation(s), conformational transitions, interactions, and dynamics that govern stability and function. In addition, we discuss cases where conformations from protein folding/unfolding simulations have been exploited for protein design, yielding novel outcomes that could not be obtained from static structures.
Insights from molecular dynamics simulations for computational protein design
Childers, Matthew Carter; Daggett, Valerie
2017-01-01
A grand challenge in the field of structural biology is to design and engineer proteins that exhibit targeted functions. Although much success on this front has been achieved, design success rates remain low, an ever-present reminder of our limited understanding of the relationship between amino acid sequences and the structures they adopt. In addition to experimental techniques and rational design strategies, computational methods have been employed to aid in the design and engineering of proteins. Molecular dynamics (MD) is one such method that simulates the motions of proteins according to classical dynamics. Here, we review how insights into protein dynamics derived from MD simulations have influenced the design of proteins. One of the greatest strengths of MD is its capacity to reveal information beyond what is available in the static structures deposited in the Protein Data Bank. In this regard simulations can be used to directly guide protein design by providing atomistic details of the dynamic molecular interactions contributing to protein stability and function. MD simulations can also be used as a virtual screening tool to rank, select, identify, and assess potential designs. MD is uniquely poised to inform protein design efforts where the application requires realistic models of protein dynamics and atomic level descriptions of the relationship between dynamics and function. Here, we review cases where MD simulations was used to modulate protein stability and protein function by providing information regarding the conformation(s), conformational transitions, interactions, and dynamics that govern stability and function. In addition, we discuss cases where conformations from protein folding/unfolding simulations have been exploited for protein design, yielding novel outcomes that could not be obtained from static structures. PMID:28239489
Disorder-function relationships for the cell cycle regulatory proteins p21 and p27.
Mitrea, Diana M; Yoon, Mi-Kyung; Ou, Li; Kriwacki, Richard W
2012-04-01
The classic structure-function paradigm has been challenged by a recently identified class of proteins: intrinsically disordered proteins (IDPs). Despite their lack of stable secondary or tertiary structure, IDPs are prevalent in all forms of life and perform myriad cellular functions, including signaling and regulation. Importantly, disruption of IDP homeostasis is associated with numerous human diseases, including cancer and neurodegeneration. Despite wide recognition of IDPs, the molecular mechanisms underlying their functions are not fully understood. Here we review the structural features and disorder-function relationships for p21 and p27, two cyclin-dependent kinase (Cdk) regulators involved in controlling cell division and fate. Studies of p21 bound to Cdk2/cyclin A revealed that a helix stretching mechanism mediates binding promiscuity. Further, investigations of Tyr88-phosphorylated p27 identified a signaling conduit that controls cell division and is disrupted in certain cancers. These mechanisms rely upon a balance between nascent structure in the free state, induced folding upon binding, and persistent flexibility within functional complexes. Although these disorder-function relationships are likely to be recapitulated in other IDPs, it is also likely that the vocabulary of their mechanisms is much more extensive than is currently understood. Further study of the physical properties of IDPs and elucidation of their links with function are needed to fully understand the mechanistic language of IDPs.
Phylogeny-Based Systematization of Arabidopsis Proteins with Histone H1 Globular Domain1[OPEN
Knizewski, Lukasz; Schmidt, Anja; Ginalski, Krzysztof
2017-01-01
H1 (or linker) histones are basic nuclear proteins that possess an evolutionarily conserved nucleosome-binding globular domain, GH1. They perform critical functions in determining the accessibility of chromatin DNA to trans-acting factors. In most metazoan species studied so far, linker histones are highly heterogenous, with numerous nonallelic variants cooccurring in the same cells. The phylogenetic relationships among these variants as well as their structural and functional properties have been relatively well established. This contrasts markedly with the rather limited knowledge concerning the phylogeny and structural and functional roles of an unusually diverse group of GH1-containing proteins in plants. The dearth of information and the lack of a coherent phylogeny-based nomenclature of these proteins can lead to misunderstandings regarding their identity and possible relationships, thereby hampering plant chromatin research. Based on published data and our in silico and high-throughput analyses, we propose a systematization and coherent nomenclature of GH1-containing proteins of Arabidopsis (Arabidopsis thaliana [L.] Heynh) that will be useful for both the identification and structural and functional characterization of homologous proteins from other plant species. PMID:28298478
Phylogeny-Based Systematization of Arabidopsis Proteins with Histone H1 Globular Domain.
Kotliński, Maciej; Knizewski, Lukasz; Muszewska, Anna; Rutowicz, Kinga; Lirski, Maciej; Schmidt, Anja; Baroux, Célia; Ginalski, Krzysztof; Jerzmanowski, Andrzej
2017-05-01
H1 (or linker) histones are basic nuclear proteins that possess an evolutionarily conserved nucleosome-binding globular domain, GH1. They perform critical functions in determining the accessibility of chromatin DNA to trans-acting factors. In most metazoan species studied so far, linker histones are highly heterogenous, with numerous nonallelic variants cooccurring in the same cells. The phylogenetic relationships among these variants as well as their structural and functional properties have been relatively well established. This contrasts markedly with the rather limited knowledge concerning the phylogeny and structural and functional roles of an unusually diverse group of GH1-containing proteins in plants. The dearth of information and the lack of a coherent phylogeny-based nomenclature of these proteins can lead to misunderstandings regarding their identity and possible relationships, thereby hampering plant chromatin research. Based on published data and our in silico and high-throughput analyses, we propose a systematization and coherent nomenclature of GH1-containing proteins of Arabidopsis ( Arabidopsis thaliana [L.] Heynh) that will be useful for both the identification and structural and functional characterization of homologous proteins from other plant species. © 2017 American Society of Plant Biologists. All Rights Reserved.
Evolution of the arginase fold and functional diversity
Dowling, Daniel P.; Costanzo, Luigi Di; Gennadios, Heather A.; Christianson, David W.
2009-01-01
The large number of protein structures deposited in the Protein Data Bank allows for the identification of novel structural superfamilies based on conservation of fold in addition to conservation of amino acid sequence. Since sequence diverges more rapidly than fold in protein evolution, proteins with little or no significant sequence identity are occasionally observed to adopt similar folds, thereby reflecting unanticipated evolutionary relationships. Here, we review the unique α/β fold first observed in the manganese metalloenzyme rat liver arginase, consisting of a parallel 8 stranded β-sheet surrounded by several helices, and its evolutionary relationship with the zinc-requiring and/or iron-requiring histone deacetylases and acetylpolyamine amidohydrolases. Structural comparisons reveal key features of the core α/β fold that contribute to the divergent metal ion specificity and stoichiometry required for the chemical and biological functions of these enzymes. PMID:18360740
Crystal growth of enzymes in low gravity (L-5)
NASA Technical Reports Server (NTRS)
Morita, Yuhei
1993-01-01
Recent developments in protein engineering have expanded the possibilities of studies of enzymes and other proteins. Now such studies are not limited to the elucidation of the relationship between the structure and function of the protein. They also aim at the production of proteins with new and practical functions, based on results obtained during investigation of structure and function. For continuing research in this field, investigation of the tertiary structure of proteins is important. X-ray diffraction of single crystals of protein is usually used for this purpose. The main difficulty is the preparation of the crystals. The theme of the research is to prepare such crystals at very low gravity, with the main purpose being to obtain large single crystals of proteins suitable for x-ray diffraction studies.
Watching proteins function with picosecond X-ray crystallography and molecular dynamics simulations.
NASA Astrophysics Data System (ADS)
Anfinrud, Philip
2006-03-01
Time-resolved electron density maps of myoglobin, a ligand-binding heme protein, have been stitched together into movies that unveil with < 2-å spatial resolution and 150-ps time-resolution the correlated protein motions that accompany and/or mediate ligand migration within the hydrophobic interior of a protein. A joint analysis of all-atom molecular dynamics (MD) calculations and picosecond time-resolved X-ray structures provides single-molecule insights into mechanisms of protein function. Ensemble-averaged MD simulations of the L29F mutant of myoglobin following ligand dissociation reproduce the direction, amplitude, and timescales of crystallographically-determined structural changes. This close agreement with experiments at comparable resolution in space and time validates the individual MD trajectories, which identify and structurally characterize a conformational switch that directs dissociated ligands to one of two nearby protein cavities. This unique combination of simulation and experiment unveils functional protein motions and illustrates at an atomic level relationships among protein structure, dynamics, and function. In collaboration with Friedrich Schotte and Gerhard Hummer, NIH.
The Classification of Protein Domains.
Dawson, Natalie; Sillitoe, Ian; Marsden, Russell L; Orengo, Christine A
2017-01-01
The significant expansion in protein sequence and structure data that we are now witnessing brings with it a pressing need to bring order to the protein world. Such order enables us to gain insights into the evolution of proteins, their function and the extent to which the functional repertoire can vary across the three kingdoms of life. This has lead to the creation of a wide range of protein family classifications that aim to group proteins based upon their evolutionary relationships.In this chapter we discuss the approaches and methods that are frequently used in the classification of proteins, with a specific emphasis on the classification of protein domains. The construction of both domain sequence and domain structure databases is considered and we show how the use of domain family annotations to assign structural and functional information is enhancing our understanding of genomes.
Centrins in unicellular organisms: functional diversity and specialization.
Zhang, Yu; He, Cynthia Y
2012-07-01
Centrins (also known as caltractins) are conserved, EF hand-containing proteins ubiquitously found in eukaryotes. Similar to calmodulins, the calcium-binding EF hands in centrins fold into two structurally similar domains separated by an alpha-helical linker region, shaping like a dumbbell. The small size (15-22 kDa) and domain organization of centrins and their functional diversity/specialization make them an ideal system to study protein structure-function relationship. Here, we review the work on centrins with a focus on their structures and functions characterized in unicellular organisms.
Form follows function: the importance of endoplasmic reticulum shape.
Westrate, L M; Lee, J E; Prinz, W A; Voeltz, G K
2015-01-01
The endoplasmic reticulum (ER) has a remarkably complex structure, composed of a single bilayer that forms the nuclear envelope, along with a network of sheets and dynamic tubules. Our understanding of the biological significance of the complex architecture of the ER has improved dramatically in the last few years. The identification of proteins and forces required for maintaining ER shape, as well as more advanced imaging techniques, has allowed the relationship between ER shape and function to come into focus. These studies have also revealed unexpected new functions of the ER and novel ER domains regulating alterations in ER dynamics. The importance of ER structure has become evident as recent research has identified diseases linked to mutations in ER-shaping proteins. In this review, we discuss what is known about the maintenance of ER architecture, the relationship between ER structure and function, and diseases associated with defects in ER structure.
Exploring Fold Space Preferences of New-born and Ancient Protein Superfamilies
Edwards, Hannah; Abeln, Sanne; Deane, Charlotte M.
2013-01-01
The evolution of proteins is one of the fundamental processes that has delivered the diversity and complexity of life we see around ourselves today. While we tend to define protein evolution in terms of sequence level mutations, insertions and deletions, it is hard to translate these processes to a more complete picture incorporating a polypeptide's structure and function. By considering how protein structures change over time we can gain an entirely new appreciation of their long-term evolutionary dynamics. In this work we seek to identify how populations of proteins at different stages of evolution explore their possible structure space. We use an annotation of superfamily age to this space and explore the relationship between these ages and a diverse set of properties pertaining to a superfamily's sequence, structure and function. We note several marked differences between the populations of newly evolved and ancient structures, such as in their length distributions, secondary structure content and tertiary packing arrangements. In particular, many of these differences suggest a less elaborate structure for newly evolved superfamilies when compared with their ancient counterparts. We show that the structural preferences we report are not a residual effect of a more fundamental relationship with function. Furthermore, we demonstrate the robustness of our results, using significant variation in the algorithm used to estimate the ages. We present these age estimates as a useful tool to analyse protein populations. In particularly, we apply this in a comparison of domains containing greek key or jelly roll motifs. PMID:24244135
The Significance of G Protein-Coupled Receptor Crystallography for Drug Discovery
Salon, John A.; Lodowski, David T.
2011-01-01
Crucial as molecular sensors for many vital physiological processes, seven-transmembrane domain G protein-coupled receptors (GPCRs) comprise the largest family of proteins targeted by drug discovery. Together with structures of the prototypical GPCR rhodopsin, solved structures of other liganded GPCRs promise to provide insights into the structural basis of the superfamily's biochemical functions and assist in the development of new therapeutic modalities and drugs. One of the greatest technical and theoretical challenges to elucidating and exploiting structure-function relationships in these systems is the emerging concept of GPCR conformational flexibility and its cause-effect relationship for receptor-receptor and receptor-effector interactions. Such conformational changes can be subtle and triggered by relatively small binding energy effects, leading to full or partial efficacy in the activation or inactivation of the receptor system at large. Pharmacological dogma generally dictates that these changes manifest themselves through kinetic modulation of the receptor's G protein partners. Atomic resolution information derived from increasingly available receptor structures provides an entrée to the understanding of these events and practically applying it to drug design. Supported by structure-activity relationship information arising from empirical screening, a unified structural model of GPCR activation/inactivation promises to both accelerate drug discovery in this field and improve our fundamental understanding of structure-based drug design in general. This review discusses fundamental problems that persist in drug design and GPCR structural determination. PMID:21969326
De Filippis, Vincenzo; Acquasaliente, Laura; Pontarollo, Giulia; Peterle, Daniele
2018-01-01
The advent of recombinant DNA technology allowed to site-specifically insert, delete, or mutate almost any amino acid in a given protein, significantly improving our knowledge of protein structure, stability, and function. Nevertheless, a quantitative description of the physical and chemical basis that makes a polypeptide chain to efficiently fold into a stable and functionally active conformation is still elusive. This mainly originates from the fact that nature combined, in a yet unknown manner, different properties (i.e., hydrophobicity, conformational propensity, polarizability, and hydrogen bonding capability) into the 20 standard natural amino acids, thus making difficult, if not impossible, to univocally relate the change in protein stability or function to the alteration of physicochemical properties caused by amino acid exchange(s). In this view, incorporation of noncoded amino acids with tailored side chains, allowing to finely tune the structure at a protein site, would facilitate to dissect the effects of a given mutation in terms of one or a few physicochemical properties, thus much expanding the scope of physical organic chemistry in the study of proteins. In this review, relevant applications from our laboratory will be presented on the use of noncoded amino acids in structure-activity relationships studies of hirudin binding to thrombin. © 2017 International Union of Biochemistry and Molecular Biology, Inc.
Structural Biology for A-Level Students
ERIC Educational Resources Information Center
Philip, Judith
2013-01-01
The relationship between the structure and function of proteins is an important area in biochemistry. Pupils studying A-level Biology are introduced to the four levels of protein structure (primary, secondary, tertiary and quaternary) and how these can be used to describe the progressive folding of a chain of amino acid residues to a final,…
Modelling protein functional domains in signal transduction using Maude
NASA Technical Reports Server (NTRS)
Sriram, M. G.
2003-01-01
Modelling of protein-protein interactions in signal transduction is receiving increased attention in computational biology. This paper describes recent research in the application of Maude, a symbolic language founded on rewriting logic, to the modelling of functional domains within signalling proteins. Protein functional domains (PFDs) are a critical focus of modern signal transduction research. In general, Maude models can simulate biological signalling networks and produce specific testable hypotheses at various levels of abstraction. Developing symbolic models of signalling proteins containing functional domains is important because of the potential to generate analyses of complex signalling networks based on structure-function relationships.
Gold, Nicola D; Jackson, Richard M
2006-02-03
The rapid growth in protein structural data and the emergence of structural genomics projects have increased the need for automatic structure analysis and tools for function prediction. Small molecule recognition is critical to the function of many proteins; therefore, determination of ligand binding site similarity is important for understanding ligand interactions and may allow their functional classification. Here, we present a binding sites database (SitesBase) that given a known protein-ligand binding site allows rapid retrieval of other binding sites with similar structure independent of overall sequence or fold similarity. However, each match is also annotated with sequence similarity and fold information to aid interpretation of structure and functional similarity. Similarity in ligand binding sites can indicate common binding modes and recognition of similar molecules, allowing potential inference of function for an uncharacterised protein or providing additional evidence of common function where sequence or fold similarity is already known. Alternatively, the resource can provide valuable information for detailed studies of molecular recognition including structure-based ligand design and in understanding ligand cross-reactivity. Here, we show examples of atomic similarity between superfamily or more distant fold relatives as well as between seemingly unrelated proteins. Assignment of unclassified proteins to structural superfamiles is also undertaken and in most cases substantiates assignments made using sequence similarity. Correct assignment is also possible where sequence similarity fails to find significant matches, illustrating the potential use of binding site comparisons for newly determined proteins.
Defining functional distance using manifold embeddings of gene ontology annotations
Lerman, Gilad; Shakhnovich, Boris E.
2007-01-01
Although rigorous measures of similarity for sequence and structure are now well established, the problem of defining functional relationships has been particularly daunting. Here, we present several manifold embedding techniques to compute distances between Gene Ontology (GO) functional annotations and consequently estimate functional distances between protein domains. To evaluate accuracy, we correlate the functional distance to the well established measures of sequence, structural, and phylogenetic similarities. Finally, we show that manual classification of structures into folds and superfamilies is mirrored by proximity in the newly defined function space. We show how functional distances place structure–function relationships in biological context resulting in insight into divergent and convergent evolution. The methods and results in this paper can be readily generalized and applied to a wide array of biologically relevant investigations, such as accuracy of annotation transference, the relationship between sequence, structure, and function, or coherence of expression modules. PMID:17595300
A phylogenetic analysis of normal modes evolution in enzymes and its relationship to enzyme function
Lai, Jason; Jin, Jing; Kubelka, Jan; Liberles, David A.
2012-01-01
Since the dynamic nature of protein structures is essential for enzymatic function, it is expected that the functional evolution can be inferred from the changes in the protein dynamics. However, dynamics can also diverge neutrally with sequence substitution between enzymes without changes of function. In this study, a phylogenetic approach is implemented to explore the relationship between enzyme dynamics and function through evolutionary history. Protein dynamics are described by normal mode analysis based on a simplified harmonic potential force field applied to the reduced Cα representation of the protein structure while enzymatic function is described by Enzyme Commission (EC) numbers. Similarity of the binding pocket dynamics at each branch of the protein family’s phylogeny was analyzed in two ways: 1) explicitly by quantifying the normal mode overlap calculated for the reconstructed ancestral proteins at each end and 2) implicitly using a diffusion model to obtain the reconstructed lineage-specific changes in the normal modes. Both explicit and implicit ancestral reconstruction identified generally faster rates of change in dynamics compared with the expected change from neutral evolution at the branches of potential functional divergences for the alpha-amylase, D-isomer specific 2-hydroxyacid dehydrogenase, and copper-containing amine oxidase protein families. Normal modes analysis added additional information over just comparing the RMSD of static structures. However, the branch-specific changes were not statistically significant compared to background function-independent neutral rates of change of dynamic properties and blind application of the analysis would not enable prediction of changes in enzyme specificity. PMID:22651983
Lai, Jason; Jin, Jing; Kubelka, Jan; Liberles, David A
2012-09-21
Since the dynamic nature of protein structures is essential for enzymatic function, it is expected that functional evolution can be inferred from the changes in protein dynamics. However, dynamics can also diverge neutrally with sequence substitution between enzymes without changes of function. In this study, a phylogenetic approach is implemented to explore the relationship between enzyme dynamics and function through evolutionary history. Protein dynamics are described by normal mode analysis based on a simplified harmonic potential force field applied to the reduced C(α) representation of the protein structure while enzymatic function is described by Enzyme Commission numbers. Similarity of the binding pocket dynamics at each branch of the protein family's phylogeny was analyzed in two ways: (1) explicitly by quantifying the normal mode overlap calculated for the reconstructed ancestral proteins at each end and (2) implicitly using a diffusion model to obtain the reconstructed lineage-specific changes in the normal modes. Both explicit and implicit ancestral reconstruction identified generally faster rates of change in dynamics compared with the expected change from neutral evolution at the branches of potential functional divergences for the α-amylase, D-isomer-specific 2-hydroxyacid dehydrogenase, and copper-containing amine oxidase protein families. Normal mode analysis added additional information over just comparing the RMSD of static structures. However, the branch-specific changes were not statistically significant compared to background function-independent neutral rates of change of dynamic properties and blind application of the analysis would not enable prediction of changes in enzyme specificity. Copyright © 2012 Elsevier Ltd. All rights reserved.
Forbes-Lorman, Robin M; Harris, Michelle A; Chang, Wesley S; Dent, Erik W; Nordheim, Erik V; Franzen, Margaret A
2016-07-08
Understanding how basic structural units influence function is identified as a foundational/core concept for undergraduate biological and biochemical literacy. It is essential for students to understand this concept at all size scales, but it is often more difficult for students to understand structure-function relationships at the molecular level, which they cannot as effectively visualize. Students need to develop accurate, 3-dimensional mental models of biomolecules to understand how biomolecular structure affects cellular functions at the molecular level, yet most traditional curricular tools such as textbooks include only 2-dimensional representations. We used a controlled, backward design approach to investigate how hand-held physical molecular model use affected students' ability to logically predict structure-function relationships. Brief (one class period) physical model use increased quiz score for females, whereas there was no significant increase in score for males using physical models. Females also self-reported higher learning gains in their understanding of context-specific protein function. Gender differences in spatial visualization may explain the gender-specific benefits of physical model use observed. © 2016 The Authors Biochemistry and Molecular Biology Education published by Wiley Periodicals, Inc. on behalf of International Union of Biochemistry and Molecular Biology, 44(4):326-335, 2016. © 2016 The International Union of Biochemistry and Molecular Biology.
Structure-Based Characterization of Multiprotein Complexes
Wiederstein, Markus; Gruber, Markus; Frank, Karl; Melo, Francisco; Sippl, Manfred J.
2014-01-01
Summary Multiprotein complexes govern virtually all cellular processes. Their 3D structures provide important clues to their biological roles, especially through structural correlations among protein molecules and complexes. The detection of such correlations generally requires comprehensive searches in databases of known protein structures by means of appropriate structure-matching techniques. Here, we present a high-speed structure search engine capable of instantly matching large protein oligomers against the complete and up-to-date database of biologically functional assemblies of protein molecules. We use this tool to reveal unseen structural correlations on the level of protein quaternary structure and demonstrate its general usefulness for efficiently exploring complex structural relationships among known protein assemblies. PMID:24954616
Thermodynamic database for proteins: features and applications.
Gromiha, M Michael; Sarai, Akinori
2010-01-01
We have developed a thermodynamic database for proteins and mutants, ProTherm, which is a collection of a large number of thermodynamic data on protein stability along with the sequence and structure information, experimental methods and conditions, and literature information. This is a valuable resource for understanding/predicting the stability of proteins, and it can be accessible at http://www.gibk26.bse.kyutech.ac.jp/jouhou/Protherm/protherm.html . ProTherm has several features including various search, display, and sorting options and visualization tools. We have analyzed the data in ProTherm to examine the relationship among thermodynamics, structure, and function of proteins. We describe the progress on the development of methods for understanding/predicting protein stability, such as (i) relationship between the stability of protein mutants and amino acid properties, (ii) average assignment method, (iii) empirical energy functions, (iv) torsion, distance, and contact potentials, and (v) machine learning techniques. The list of online resources for predicting protein stability has also been provided.
Wong, Sienna; Jin, J-P
2017-01-01
Study of folded structure of proteins provides insights into their biological functions, conformational dynamics and molecular evolution. Current methods of elucidating folded structure of proteins are laborious, low-throughput, and constrained by various limitations. Arising from these methods is the need for a sensitive, quantitative, rapid and high-throughput method not only analysing the folded structure of proteins, but also to monitor dynamic changes under physiological or experimental conditions. In this focused review, we outline the foundation and limitations of current protein structure-determination methods prior to discussing the advantages of an emerging antibody epitope analysis for applications in structural, conformational and evolutionary studies of proteins. We discuss the application of this method using representative examples in monitoring allosteric conformation of regulatory proteins and the determination of the evolutionary lineage of related proteins and protein isoforms. The versatility of the method described herein is validated by the ability to modulate a variety of assay parameters to meet the needs of the user in order to monitor protein conformation. Furthermore, the assay has been used to clarify the lineage of troponin isoforms beyond what has been depicted by sequence homology alone, demonstrating the nonlinear evolutionary relationship between primary structure and tertiary structure of proteins. The antibody epitope analysis method is a highly adaptable technique of protein conformation elucidation, which can be easily applied without the need for specialized equipment or technical expertise. When applied in a systematic and strategic manner, this method has the potential to reveal novel and biomedically meaningful information for structure-function relationship and evolutionary lineage of proteins. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Use of designed sequences in protein structure recognition.
Kumar, Gayatri; Mudgal, Richa; Srinivasan, Narayanaswamy; Sandhya, Sankaran
2018-05-09
Knowledge of the protein structure is a pre-requisite for improved understanding of molecular function. The gap in the sequence-structure space has increased in the post-genomic era. Grouping related protein sequences into families can aid in narrowing the gap. In the Pfam database, structure description is provided for part or full-length proteins of 7726 families. For the remaining 52% of the families, information on 3-D structure is not yet available. We use the computationally designed sequences that are intermediately related to two protein domain families, which are already known to share the same fold. These strategically designed sequences enable detection of distant relationships and here, we have employed them for the purpose of structure recognition of protein families of yet unknown structure. We first measured the success rate of our approach using a dataset of protein families of known fold and achieved a success rate of 88%. Next, for 1392 families of yet unknown structure, we made structural assignments for part/full length of the proteins. Fold association for 423 domains of unknown function (DUFs) are provided as a step towards functional annotation. The results indicate that knowledge-based filling of gaps in protein sequence space is a lucrative approach for structure recognition. Such sequences assist in traversal through protein sequence space and effectively function as 'linkers', where natural linkers between distant proteins are unavailable. This article was reviewed by Oliviero Carugo, Christine Orengo and Srikrishna Subramanian.
Relationships between residue Voronoi volume and sequence conservation in proteins.
Liu, Jen-Wei; Cheng, Chih-Wen; Lin, Yu-Feng; Chen, Shao-Yu; Hwang, Jenn-Kang; Yen, Shih-Chung
2018-02-01
Functional and biophysical constraints can cause different levels of sequence conservation in proteins. Previously, structural properties, e.g., relative solvent accessibility (RSA) and packing density of the weighted contact number (WCN), have been found to be related to protein sequence conservation (CS). The Voronoi volume has recently been recognized as a new structural property of the local protein structural environment reflecting CS. However, for surface residues, it is sensitive to water molecules surrounding the protein structure. Herein, we present a simple structural determinant termed the relative space of Voronoi volume (RSV); it uses the Voronoi volume and the van der Waals volume of particular residues to quantify the local structural environment. RSV (range, 0-1) is defined as (Voronoi volume-van der Waals volume)/Voronoi volume of the target residue. The concept of RSV describes the extent of available space for every protein residue. RSV and Voronoi profiles with and without water molecules (RSVw, RSV, VOw, and VO) were compared for 554 non-homologous proteins. RSV (without water) showed better Pearson's correlations with CS than did RSVw, VO, or VOw values. The mean correlation coefficient between RSV and CS was 0.51, which is comparable to the correlation between RSA and CS (0.49) and that between WCN and CS (0.56). RSV is a robust structural descriptor with and without water molecules and can quantitatively reflect evolutionary information in a single protein structure. Therefore, it may represent a practical structural determinant to study protein sequence, structure, and function relationships. Copyright © 2017 Elsevier B.V. All rights reserved.
The proteome: structure, function and evolution
Fleming, Keiran; Kelley, Lawrence A; Islam, Suhail A; MacCallum, Robert M; Muller, Arne; Pazos, Florencio; Sternberg, Michael J.E
2006-01-01
This paper reports two studies to model the inter-relationships between protein sequence, structure and function. First, an automated pipeline to provide a structural annotation of proteomes in the major genomes is described. The results are stored in a database at Imperial College, London (3D-GENOMICS) that can be accessed at www.sbg.bio.ic.ac.uk. Analysis of the assignments to structural superfamilies provides evolutionary insights. 3D-GENOMICS is being integrated with related proteome annotation data at University College London and the European Bioinformatics Institute in a project known as e-protein (http://www.e-protein.org/). The second topic is motivated by the developments in structural genomics projects in which the structure of a protein is determined prior to knowledge of its function. We have developed a new approach PHUNCTIONER that uses the gene ontology (GO) classification to supervise the extraction of the sequence signal responsible for protein function from a structure-based sequence alignment. Using GO we can obtain profiles for a range of specificities described in the ontology. In the region of low sequence similarity (around 15%), our method is more accurate than assignment from the closest structural homologue. The method is also able to identify the specific residues associated with the function of the protein family. PMID:16524832
Computational prediction of hinge axes in proteins
2014-01-01
Background A protein's function is determined by the wide range of motions exhibited by its 3D structure. However, current experimental techniques are not able to reliably provide the level of detail required for elucidating the exact mechanisms of protein motion essential for effective drug screening and design. Computational tools are instrumental in the study of the underlying structure-function relationship. We focus on a special type of proteins called "hinge proteins" which exhibit a motion that can be interpreted as a rotation of one domain relative to another. Results This work proposes a computational approach that uses the geometric structure of a single conformation to predict the feasible motions of the protein and is founded in recent work from rigidity theory, an area of mathematics that studies flexibility properties of general structures. Given a single conformational state, our analysis predicts a relative axis of motion between two specified domains. We analyze a dataset of 19 structures known to exhibit this hinge-like behavior. For 15, the predicted axis is consistent with a motion to a second, known conformation. We present a detailed case study for three proteins whose dynamics have been well-studied in the literature: calmodulin, the LAO binding protein and the Bence-Jones protein. Conclusions Our results show that incorporating rigidity-theoretic analyses can lead to effective computational methods for understanding hinge motions in macromolecules. This initial investigation is the first step towards a new tool for probing the structure-dynamics relationship in proteins. PMID:25080829
A new method to improve network topological similarity search: applied to fold recognition
Lhota, John; Hauptman, Ruth; Hart, Thomas; Ng, Clara; Xie, Lei
2015-01-01
Motivation: Similarity search is the foundation of bioinformatics. It plays a key role in establishing structural, functional and evolutionary relationships between biological sequences. Although the power of the similarity search has increased steadily in recent years, a high percentage of sequences remain uncharacterized in the protein universe. Thus, new similarity search strategies are needed to efficiently and reliably infer the structure and function of new sequences. The existing paradigm for studying protein sequence, structure, function and evolution has been established based on the assumption that the protein universe is discrete and hierarchical. Cumulative evidence suggests that the protein universe is continuous. As a result, conventional sequence homology search methods may be not able to detect novel structural, functional and evolutionary relationships between proteins from weak and noisy sequence signals. To overcome the limitations in existing similarity search methods, we propose a new algorithmic framework—Enrichment of Network Topological Similarity (ENTS)—to improve the performance of large scale similarity searches in bioinformatics. Results: We apply ENTS to a challenging unsolved problem: protein fold recognition. Our rigorous benchmark studies demonstrate that ENTS considerably outperforms state-of-the-art methods. As the concept of ENTS can be applied to any similarity metric, it may provide a general framework for similarity search on any set of biological entities, given their representation as a network. Availability and implementation: Source code freely available upon request Contact: lxie@iscb.org PMID:25717198
Muthu Krishnan, S
2018-05-14
The receptor-associated protein (RAP) is an inhibitor of endocytic receptors that belong to the lipoprotein receptor gene family. In this study, a computational approach was tried to find the evolutionarily related fold of the RAP proteins. Through the structural and sequence-based analysis, found various protein folds that are very close to the RAP folds. Remote homolog datasets were used potentially to develop a different support vector machine (SVM) methods to recognize the homologous RAP fold. This study helps in understanding the relationship of RAP homologs folds based on the structure, function and evolutionary history. Copyright © 2018 Elsevier Ltd. All rights reserved.
Structure-based characterization of multiprotein complexes.
Wiederstein, Markus; Gruber, Markus; Frank, Karl; Melo, Francisco; Sippl, Manfred J
2014-07-08
Multiprotein complexes govern virtually all cellular processes. Their 3D structures provide important clues to their biological roles, especially through structural correlations among protein molecules and complexes. The detection of such correlations generally requires comprehensive searches in databases of known protein structures by means of appropriate structure-matching techniques. Here, we present a high-speed structure search engine capable of instantly matching large protein oligomers against the complete and up-to-date database of biologically functional assemblies of protein molecules. We use this tool to reveal unseen structural correlations on the level of protein quaternary structure and demonstrate its general usefulness for efficiently exploring complex structural relationships among known protein assemblies. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Clustering and visualizing similarity networks of membrane proteins.
Hu, Geng-Ming; Mai, Te-Lun; Chen, Chi-Ming
2015-08-01
We proposed a fast and unsupervised clustering method, minimum span clustering (MSC), for analyzing the sequence-structure-function relationship of biological networks, and demonstrated its validity in clustering the sequence/structure similarity networks (SSN) of 682 membrane protein (MP) chains. The MSC clustering of MPs based on their sequence information was found to be consistent with their tertiary structures and functions. For the largest seven clusters predicted by MSC, the consistency in chain function within the same cluster is found to be 100%. From analyzing the edge distribution of SSN for MPs, we found a characteristic threshold distance for the boundary between clusters, over which SSN of MPs could be properly clustered by an unsupervised sparsification of the network distance matrix. The clustering results of MPs from both MSC and the unsupervised sparsification methods are consistent with each other, and have high intracluster similarity and low intercluster similarity in sequence, structure, and function. Our study showed a strong sequence-structure-function relationship of MPs. We discussed evidence of convergent evolution of MPs and suggested applications in finding structural similarities and predicting biological functions of MP chains based on their sequence information. © 2015 Wiley Periodicals, Inc.
Tighter Ligand Binding Can Compensate for Impaired Stability of an RNA-Binding Protein.
Wallis, Christopher P; Richman, Tara R; Filipovska, Aleksandra; Rackham, Oliver
2018-06-15
It has been widely shown that ligand-binding residues, by virtue of their orientation, charge, and solvent exposure, often have a net destabilizing effect on proteins that is offset by stability conferring residues elsewhere in the protein. This structure-function trade-off can constrain possible adaptive evolutionary changes of function and may hamper protein engineering efforts to design proteins with new functions. Here, we present evidence from a large randomized mutant library screen that, in the case of PUF RNA-binding proteins, this structural relationship may be inverted and that active-site mutations that increase protein activity are also able to compensate for impaired stability. We show that certain mutations in RNA-protein binding residues are not necessarily destabilizing and that increased ligand-binding can rescue an insoluble, unstable PUF protein. We hypothesize that these mutations restabilize the protein via thermodynamic coupling of protein folding and RNA binding.
Zurawski, S M; Zurawski, G
1988-01-01
We have analyzed structure--function relationships of the protein hormone murine interleukin 2 by fine structural deletion mapping. A total of 130 deletion mutant proteins, together with some substitution and insertion mutant proteins, was expressed in Escherichia coli and analyzed for their ability to sustain the proliferation of a cloned murine T cell line. This analysis has permitted a functional map of the protein to be drawn and classifies five segments of the protein, which together contain 48% of the sequence, as unessential to the biological activity of the protein. A further 26% of the protein is classified as important, but not crucial, for the activity. Three regions, consisting of amino acids 32-35, 66-77 and 119-141 contain the remaining 26% of the protein and are critical to the biological activity of the protein. The functional map is discussed in the context of the possible role of the identified critical regions in the structure of the hormone and its binding to the interleukin 2 receptor complex. Images PMID:3261239
Stability and the Evolvability of Function in a Model Protein
Bloom, Jesse D.; Wilke, Claus O.; Arnold, Frances H.; Adami, Christoph
2004-01-01
Functional proteins must fold with some minimal stability to a structure that can perform a biochemical task. Here we use a simple model to investigate the relationship between the stability requirement and the capacity of a protein to evolve the function of binding to a ligand. Although our model contains no built-in tradeoff between stability and function, proteins evolved function more efficiently when the stability requirement was relaxed. Proteins with both high stability and high function evolved more efficiently when the stability requirement was gradually increased than when there was constant selection for high stability. These results show that in our model, the evolution of function is enhanced by allowing proteins to explore sequences corresponding to marginally stable structures, and that it is easier to improve stability while maintaining high function than to improve function while maintaining high stability. Our model also demonstrates that even in the absence of a fundamental biophysical tradeoff between stability and function, the speed with which function can evolve is limited by the stability requirement imposed on the protein. PMID:15111394
Chowdhury, S Roy; Cao, Jin; He, Yufan; Lu, H Peter
2018-03-27
Manipulating protein conformations for exploring protein structure-function relationship has shown great promise. Although protein conformational changes under pulling force manipulation have been extensively studied, protein conformation changes under a compressive force have not been explored quantitatively. The latter is even more biologically significant and relevant in revealing protein functions in living cells associated with protein crowdedness, distribution fluctuations, and cell osmotic stress. Here we report our experimental observations on abrupt ruptures of protein native structures under compressive force, demonstrated and studied by single-molecule AFM-FRET spectroscopic nanoscopy. Our results show that the protein ruptures are abrupt and spontaneous events occurred when the compressive force reaches a threshold of 12-75 pN, a force amplitude accessible from thermal fluctuations in a living cell. The abrupt ruptures are sensitive to local environment, likely a general and important pathway of protein unfolding in living cells.
DWARF – a data warehouse system for analyzing protein families
Fischer, Markus; Thai, Quan K; Grieb, Melanie; Pleiss, Jürgen
2006-01-01
Background The emerging field of integrative bioinformatics provides the tools to organize and systematically analyze vast amounts of highly diverse biological data and thus allows to gain a novel understanding of complex biological systems. The data warehouse DWARF applies integrative bioinformatics approaches to the analysis of large protein families. Description The data warehouse system DWARF integrates data on sequence, structure, and functional annotation for protein fold families. The underlying relational data model consists of three major sections representing entities related to the protein (biochemical function, source organism, classification to homologous families and superfamilies), the protein sequence (position-specific annotation, mutant information), and the protein structure (secondary structure information, superimposed tertiary structure). Tools for extracting, transforming and loading data from public available resources (ExPDB, GenBank, DSSP) are provided to populate the database. The data can be accessed by an interface for searching and browsing, and by analysis tools that operate on annotation, sequence, or structure. We applied DWARF to the family of α/β-hydrolases to host the Lipase Engineering database. Release 2.3 contains 6138 sequences and 167 experimentally determined protein structures, which are assigned to 37 superfamilies 103 homologous families. Conclusion DWARF has been designed for constructing databases of large structurally related protein families and for evaluating their sequence-structure-function relationships by a systematic analysis of sequence, structure and functional annotation. It has been applied to predict biochemical properties from sequence, and serves as a valuable tool for protein engineering. PMID:17094801
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa.
Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A
2015-01-01
Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity.
Bio-mimicking galactose oxidase and hemocyanin, two dioxygen-processing copper proteins.
Gamez, Patrick; Koval, Iryna A; Reedijk, Jan
2004-12-21
The modelling of the active sites of metalloproteins is one of the most challenging tasks in bio-inorganic chemistry. Copper proteins form part of this stimulating field of research as copper enzymes are mainly involved in oxidation bio-reactions. Thus, the understanding of the structure-function relationship of their active sites will allow the design of effective and environmental friendly oxidation catalysts. This perspective illustrates some outstanding structural and functional synthetic models of the active site of copper proteins, with special attention given to models of galactose oxidase and hemocyanin.
Recent developments in structural proteomics for protein structure determination.
Liu, Hsuan-Liang; Hsu, Jyh-Ping
2005-05-01
The major challenges in structural proteomics include identifying all the proteins on the genome-wide scale, determining their structure-function relationships, and outlining the precise three-dimensional structures of the proteins. Protein structures are typically determined by experimental approaches such as X-ray crystallography or nuclear magnetic resonance (NMR) spectroscopy. However, the knowledge of three-dimensional space by these techniques is still limited. Thus, computational methods such as comparative and de novo approaches and molecular dynamic simulations are intensively used as alternative tools to predict the three-dimensional structures and dynamic behavior of proteins. This review summarizes recent developments in structural proteomics for protein structure determination; including instrumental methods such as X-ray crystallography and NMR spectroscopy, and computational methods such as comparative and de novo structure prediction and molecular dynamics simulations.
Prototype Protein-Based Three-Dimensional Memory
2003-01-01
9 Figure 3.2: Hypothetical mutational landscape ...to explore the genetic mutational landscape of a protein without any a priori knowledge of structure- function relationships. As such, it explores...native organism, Halobacterium salinarum, the protein acts as a photosynthetic sunlight to chemical energy transducer. Through several billion years of
Glycan array data management at Consortium for Functional Glycomics.
Venkataraman, Maha; Sasisekharan, Ram; Raman, Rahul
2015-01-01
Glycomics or the study of structure-function relationships of complex glycans has reshaped post-genomics biology. Glycans mediate fundamental biological functions via their specific interactions with a variety of proteins. Recognizing the importance of glycomics, large-scale research initiatives such as the Consortium for Functional Glycomics (CFG) were established to address these challenges. Over the past decade, the Consortium for Functional Glycomics (CFG) has generated novel reagents and technologies for glycomics analyses, which in turn have led to generation of diverse datasets. These datasets have contributed to understanding glycan diversity and structure-function relationships at molecular (glycan-protein interactions), cellular (gene expression and glycan analysis), and whole organism (mouse phenotyping) levels. Among these analyses and datasets, screening of glycan-protein interactions on glycan array platforms has gained much prominence and has contributed to cross-disciplinary realization of the importance of glycomics in areas such as immunology, infectious diseases, cancer biomarkers, etc. This manuscript outlines methodologies for capturing data from glycan array experiments and online tools to access and visualize glycan array data implemented at the CFG.
Mechanism of Resilin Elasticity
Qin, Guokui; Hu, Xiao; Cebe, Peggy; Kaplan, David L.
2012-01-01
Resilin is critical in the flight and jumping systems of insects as a polymeric rubber-like protein with outstanding elasticity. However, insight into the underlying molecular mechanisms responsible for resilin elasticity remains undefined. Here we report the structure and function of resilin from Drosophila CG15920. A reversible beta-turn transition was identified in the peptide encoded by exon III and for full length resilin during energy input and release, features that correlate to the rapid deformation of resilin during functions in vivo. Micellar structures and nano-porous patterns formed after beta-turn structures were present via changes in either the thermal or mechanical inputs. A model is proposed to explain the super elasticity and energy conversion mechanisms of resilin, providing important insight into structure-function relationships for this protein. Further, this model offers a view of elastomeric proteins in general where beta-turn related structures serve as fundamental units of the structure and elasticity. PMID:22893127
Evolution of Enzyme Superfamilies: Comprehensive Exploration of Sequence-Function Relationships.
Baier, F; Copp, J N; Tokuriki, N
2016-11-22
The sequence and functional diversity of enzyme superfamilies have expanded through billions of years of evolution from a common ancestor. Understanding how protein sequence and functional "space" have expanded, at both the evolutionary and molecular level, is central to biochemistry, molecular biology, and evolutionary biology. Integrative approaches that examine protein sequence, structure, and function have begun to provide comprehensive views of the functional diversity and evolutionary relationships within enzyme superfamilies. In this review, we outline the recent advances in our understanding of enzyme evolution and superfamily functional diversity. We describe the tools that have been used to comprehensively analyze sequence relationships and to characterize sequence and function relationships. We also highlight recent large-scale experimental approaches that systematically determine the activity profiles across enzyme superfamilies. We identify several intriguing insights from this recent body of work. First, promiscuous activities are prevalent among extant enzymes. Second, many divergent proteins retain "function connectivity" via enzyme promiscuity, which can be used to probe the evolutionary potential and history of enzyme superfamilies. Finally, we discuss open questions regarding the intricacies of enzyme divergence, as well as potential research directions that will deepen our understanding of enzyme superfamily evolution.
The interface of protein structure, protein biophysics, and molecular evolution
Liberles, David A; Teichmann, Sarah A; Bahar, Ivet; Bastolla, Ugo; Bloom, Jesse; Bornberg-Bauer, Erich; Colwell, Lucy J; de Koning, A P Jason; Dokholyan, Nikolay V; Echave, Julian; Elofsson, Arne; Gerloff, Dietlind L; Goldstein, Richard A; Grahnen, Johan A; Holder, Mark T; Lakner, Clemens; Lartillot, Nicholas; Lovell, Simon C; Naylor, Gavin; Perica, Tina; Pollock, David D; Pupko, Tal; Regan, Lynne; Roger, Andrew; Rubinstein, Nimrod; Shakhnovich, Eugene; Sjölander, Kimmen; Sunyaev, Shamil; Teufel, Ashley I; Thorne, Jeffrey L; Thornton, Joseph W; Weinreich, Daniel M; Whelan, Simon
2012-01-01
Abstract The interface of protein structural biology, protein biophysics, molecular evolution, and molecular population genetics forms the foundations for a mechanistic understanding of many aspects of protein biochemistry. Current efforts in interdisciplinary protein modeling are in their infancy and the state-of-the art of such models is described. Beyond the relationship between amino acid substitution and static protein structure, protein function, and corresponding organismal fitness, other considerations are also discussed. More complex mutational processes such as insertion and deletion and domain rearrangements and even circular permutations should be evaluated. The role of intrinsically disordered proteins is still controversial, but may be increasingly important to consider. Protein geometry and protein dynamics as a deviation from static considerations of protein structure are also important. Protein expression level is known to be a major determinant of evolutionary rate and several considerations including selection at the mRNA level and the role of interaction specificity are discussed. Lastly, the relationship between modeling and needed high-throughput experimental data as well as experimental examination of protein evolution using ancestral sequence resurrection and in vitro biochemistry are presented, towards an aim of ultimately generating better models for biological inference and prediction. PMID:22528593
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chakraborty, Sandeep; Rao, Basuthkar J.; Baker, Nathan A.
2013-04-01
Phylogenetic analysis of proteins using multiple sequence alignment (MSA) assumes an underlying evolutionary relationship in these proteins which occasionally remains undetected due to considerable sequence divergence. Structural alignment programs have been developed to unravel such fuzzy relationships. However, none of these structure based methods have used electrostatic properties to discriminate between spatially equivalent residues. We present a methodology for MSA of a set of related proteins with known structures using electrostatic properties as an additional discriminator (STEEP). STEEP first extracts a profile, then generates a multiple structural superimposition providing a consolidated spatial framework for comparing residues and finally emits themore » MSA. Residues that are aligned differently by including or excluding electrostatic properties can be targeted by directed evolution experiments to transform the enzymatic properties of one protein into another. We have compared STEEP results to those obtained from a MSA program (ClustalW) and a structural alignment method (MUSTANG) for chymotrypsin serine proteases. Subsequently, we used PhyML to generate phylogenetic trees for the serine and metallo-β-lactamase superfamilies from the STEEP generated MSA, and corroborated the accepted relationships in these superfamilies. We have observed that STEEP acts as a functional classifier when electrostatic congruence is used as a discriminator, and thus identifies potential targets for directed evolution experiments. In summary, STEEP is unique among phylogenetic methods for its ability to use electrostatic congruence to specify mutations that might be the source of the functional divergence in a protein family. Based on our results, we also hypothesize that the active site and its close vicinity contains enough information to infer the correct phylogeny for related proteins.« less
Boareto, Marcelo; Yamagishi, Michel E B; Caticha, Nestor; Leite, Vitor B P
2012-10-01
In protein databases there is a substantial number of proteins structurally determined but without function annotation. Understanding the relationship between function and structure can be useful to predict function on a large scale. We have analyzed the similarities in global physicochemical parameters for a set of enzymes which were classified according to the four Enzyme Commission (EC) hierarchical levels. Using relevance theory we introduced a distance between proteins in the space of physicochemical characteristics. This was done by minimizing a cost function of the metric tensor built to reflect the EC classification system. Using an unsupervised clustering method on a set of 1025 enzymes, we obtained no relevant clustering formation compatible with EC classification. The distance distributions between enzymes from the same EC group and from different EC groups were compared by histograms. Such analysis was also performed using sequence alignment similarity as a distance. Our results suggest that global structure parameters are not sufficient to segregate enzymes according to EC hierarchy. This indicates that features essential for function are rather local than global. Consequently, methods for predicting function based on global attributes should not obtain high accuracy in main EC classes prediction without relying on similarities between enzymes from training and validation datasets. Furthermore, these results are consistent with a substantial number of studies suggesting that function evolves fundamentally by recruitment, i.e., a same protein motif or fold can be used to perform different enzymatic functions and a few specific amino acids (AAs) are actually responsible for enzyme activity. These essential amino acids should belong to active sites and an effective method for predicting function should be able to recognize them. Copyright © 2012 Elsevier Ltd. All rights reserved.
Leuthaeuser, Janelle B; Knutson, Stacy T; Kumar, Kiran; Babbitt, Patricia C; Fetrow, Jacquelyn S
2015-09-01
The development of accurate protein function annotation methods has emerged as a major unsolved biological problem. Protein similarity networks, one approach to function annotation via annotation transfer, group proteins into similarity-based clusters. An underlying assumption is that the edge metric used to identify such clusters correlates with functional information. In this contribution, this assumption is evaluated by observing topologies in similarity networks using three different edge metrics: sequence (BLAST), structure (TM-Align), and active site similarity (active site profiling, implemented in DASP). Network topologies for four well-studied protein superfamilies (enolase, peroxiredoxin (Prx), glutathione transferase (GST), and crotonase) were compared with curated functional hierarchies and structure. As expected, network topology differs, depending on edge metric; comparison of topologies provides valuable information on structure/function relationships. Subnetworks based on active site similarity correlate with known functional hierarchies at a single edge threshold more often than sequence- or structure-based networks. Sequence- and structure-based networks are useful for identifying sequence and domain similarities and differences; therefore, it is important to consider the clustering goal before deciding appropriate edge metric. Further, conserved active site residues identified in enolase and GST active site subnetworks correspond with published functionally important residues. Extension of this analysis yields predictions of functionally determinant residues for GST subgroups. These results support the hypothesis that active site similarity-based networks reveal clusters that share functional details and lay the foundation for capturing functionally relevant hierarchies using an approach that is both automatable and can deliver greater precision in function annotation than current similarity-based methods. © 2015 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.
Looking at the Disordered Proteins through the Computational Microscope.
Das, Payel; Matysiak, Silvina; Mittal, Jeetain
2018-05-23
Intrinsically disordered proteins (IDPs) have attracted wide interest over the past decade due to their surprising prevalence in the proteome and versatile roles in cell physiology and pathology. A large selection of IDPs has been identified as potential targets for therapeutic intervention. Characterizing the structure-function relationship of disordered proteins is therefore an essential but daunting task, as these proteins can adapt transient structure, necessitating a new paradigm for connecting structural disorder to function. Molecular simulation has emerged as a natural complement to experiments for atomic-level characterizations and mechanistic investigations of this intriguing class of proteins. The diverse range of length and time scales involved in IDP function requires performing simulations at multiple levels of resolution. In this Outlook, we focus on summarizing available simulation methods, along with a few interesting example applications. We also provide an outlook on how these simulation methods can be further improved in order to provide a more accurate description of IDP structure, binding, and assembly.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hong, Liang; Jain, Nitin; Cheng, Xiaolin
Protein function often depends on global, collective internal motions. However, the simultaneous quantitative experimental determination of the forms, amplitudes, and time scales of these motions has remained elusive. We demonstrate that a complete description of these large-scale dynamic modes can be obtained using coherent neutron-scattering experiments on perdeuterated samples. With this approach, a microscopic relationship between the structure, dynamics, and function in a protein, cytochrome P450cam, is established. The approach developed here should be of general applicability to protein systems.
Hong, Liang; Jain, Nitin; Cheng, Xiaolin; ...
2016-10-14
Protein function often depends on global, collective internal motions. However, the simultaneous quantitative experimental determination of the forms, amplitudes, and time scales of these motions has remained elusive. We demonstrate that a complete description of these large-scale dynamic modes can be obtained using coherent neutron-scattering experiments on perdeuterated samples. With this approach, a microscopic relationship between the structure, dynamics, and function in a protein, cytochrome P450cam, is established. The approach developed here should be of general applicability to protein systems.
Wang, Yong-Cui; Wang, Yong; Yang, Zhi-Xia; Deng, Nai-Yang
2011-06-20
Enzymes are known as the largest class of proteins and their functions are usually annotated by the Enzyme Commission (EC), which uses a hierarchy structure, i.e., four numbers separated by periods, to classify the function of enzymes. Automatically categorizing enzyme into the EC hierarchy is crucial to understand its specific molecular mechanism. In this paper, we introduce two key improvements in predicting enzyme function within the machine learning framework. One is to introduce the efficient sequence encoding methods for representing given proteins. The second one is to develop a structure-based prediction method with low computational complexity. In particular, we propose to use the conjoint triad feature (CTF) to represent the given protein sequences by considering not only the composition of amino acids but also the neighbor relationships in the sequence. Then we develop a support vector machine (SVM)-based method, named as SVMHL (SVM for hierarchy labels), to output enzyme function by fully considering the hierarchical structure of EC. The experimental results show that our SVMHL with the CTF outperforms SVMHL with the amino acid composition (AAC) feature both in predictive accuracy and Matthew's correlation coefficient (MCC). In addition, SVMHL with the CTF obtains the accuracy and MCC ranging from 81% to 98% and 0.82 to 0.98 when predicting the first three EC digits on a low-homologous enzyme dataset. We further demonstrate that our method outperforms the methods which do not take account of hierarchical relationship among enzyme categories and alternative methods which incorporate prior knowledge about inter-class relationships. Our structure-based prediction model, SVMHL with the CTF, reduces the computational complexity and outperforms the alternative approaches in enzyme function prediction. Therefore our new method will be a useful tool for enzyme function prediction community.
Structure-Function-Property-Design Interplay in Biopolymers: Spider Silk
Tokareva, Olena; Jacobsen, Matthew; Buehler, Markus; Wong, Joyce; Kaplan, David L.
2013-01-01
Spider silks have been a focus of research for almost two decades due to their outstanding mechanical and biophysical properties. Recent advances in genetic engineering have led to the synthesis of recombinant spider silks, thus helping to unravel a fundamental understanding of structure-function-property relationships. The relationships between molecular composition, secondary structures, and mechanical properties found in different types of spider silks are described, along with a discussion of artificial spinning of these proteins and their bioapplications, including the role of silks in biomineralization and fabrication of biomaterials with controlled properties. PMID:23962644
Rational Protein Engineering Guided by Deep Mutational Scanning
Shin, HyeonSeok; Cho, Byung-Kwan
2015-01-01
Sequence–function relationship in a protein is commonly determined by the three-dimensional protein structure followed by various biochemical experiments. However, with the explosive increase in the number of genome sequences, facilitated by recent advances in sequencing technology, the gap between protein sequences available and three-dimensional structures is rapidly widening. A recently developed method termed deep mutational scanning explores the functional phenotype of thousands of mutants via massive sequencing. Coupled with a highly efficient screening system, this approach assesses the phenotypic changes made by the substitution of each amino acid sequence that constitutes a protein. Such an informational resource provides the functional role of each amino acid sequence, thereby providing sufficient rationale for selecting target residues for protein engineering. Here, we discuss the current applications of deep mutational scanning and consider experimental design. PMID:26404267
Computational mining for hypothetical patterns of amino acid side chains in protein data bank (PDB)
NASA Astrophysics Data System (ADS)
Ghani, Nur Syatila Ab; Firdaus-Raih, Mohd
2018-04-01
The three-dimensional structure of a protein can provide insights regarding its function. Functional relationship between proteins can be inferred from fold and sequence similarities. In certain cases, sequence or fold comparison fails to conclude homology between proteins with similar mechanism. Since the structure is more conserved than the sequence, a constellation of functional residues can be similarly arranged among proteins of similar mechanism. Local structural similarity searches are able to detect such constellation of amino acids among distinct proteins, which can be useful to annotate proteins of unknown function. Detection of such patterns of amino acids on a large scale can increase the repertoire of important 3D motifs since available known 3D motifs currently, could not compensate the ever-increasing numbers of uncharacterized proteins to be annotated. Here, a computational platform for an automated detection of 3D motifs is described. A fuzzy-pattern searching algorithm derived from IMagine an Amino Acid 3D Arrangement search EnGINE (IMAAAGINE) was implemented to develop an automated method for searching of hypothetical patterns of amino acid side chains in Protein Data Bank (PDB), without the need for prior knowledge on related sequence or structure of pattern of interest. We present an example of the searches, which is the detection of a hypothetical pattern derived from known structural motif of C2H2 structural pattern from zinc fingers. The conservation of particular patterns of amino acid side chains in unrelated proteins is highlighted. This approach can act as a complementary method for available structure- and sequence-based platforms and may contribute in improving functional association between proteins.
NMR relaxation studies on the hydrate layer of intrinsically unstructured proteins.
Bokor, Mónika; Csizmók, Veronika; Kovács, Dénes; Bánki, Péter; Friedrich, Peter; Tompa, Peter; Tompa, Kálmán
2005-03-01
Intrinsically unstructured/disordered proteins (IUPs) exist in a disordered and largely solvent-exposed, still functional, structural state under physiological conditions. As their function is often directly linked with structural disorder, understanding their structure-function relationship in detail is a great challenge to structural biology. In particular, their hydration and residual structure, both closely linked with their mechanism of action, require close attention. Here we demonstrate that the hydration of IUPs can be adequately approached by a technique so far unexplored with respect to IUPs, solid-state NMR relaxation measurements. This technique provides quantitative information on various features of hydrate water bound to these proteins. By freezing nonhydrate (bulk) water out, we have been able to measure free induction decays pertaining to protons of bound water from which the amount of hydrate water, its activation energy, and correlation times could be calculated. Thus, for three IUPs, the first inhibitory domain of calpastatin, microtubule-associated protein 2c, and plant dehydrin early responsive to dehydration 10, we demonstrate that they bind a significantly larger amount of water than globular proteins, whereas their suboptimal hydration and relaxation parameters are correlated with their differing modes of function. The theoretical treatment and experimental approach presented in this article may have general utility in characterizing proteins that belong to this novel structural class.
Single-Molecule Microscopy and Force Spectroscopy of Membrane Proteins
NASA Astrophysics Data System (ADS)
Engel, Andreas; Janovjak, Harald; Fotiadis, Dimtrios; Kedrov, Alexej; Cisneros, David; Müller, Daniel J.
Single-molecule atomic force microscopy (AFM) provides novel ways to characterize the structure-function relationship of native membrane proteins. High-resolution AFM topographs allow observing the structure of single proteins at sub-nanometer resolution as well as their conformational changes, oligomeric state, molecular dynamics and assembly. We will review these feasibilities illustrating examples of membrane proteins in native and reconstituted membranes. Classification of individual topographs of single proteins allows understanding the principles of motions of their extrinsic domains, to learn about their local structural flexibilities and to find the entropy minima of certain conformations. Combined with the visualization of functionally related conformational changes these insights allow understanding why certain flexibilities are required for the protein to function and how structurally flexible regions allow certain conformational changes. Complementary to AFM imaging, single-molecule force spectroscopy (SMFS) experiments detect molecular interactions established within and between membrane proteins. The sensitivity of this method makes it possible to measure interactions that stabilize secondary structures such as transmembrane α-helices, polypeptide loops and segments within. Changes in temperature or protein-protein assembly do not change the locations of stable structural segments, but influence their stability established by collective molecular interactions. Such changes alter the probability of proteins to choose a certain unfolding pathway. Recent examples have elucidated unfolding and refolding pathways of membrane proteins as well as their energy landscapes.
Liu, Lu-Ning; Su, Hai-Nan; Yan, Shi-Gan; Shao, Si-Mi; Xie, Bin-Bin; Chen, Xiu-Lan; Zhang, Xi-Ying; Zhou, Bai-Cheng; Zhang, Yu-Zhong
2009-07-01
Crystal structures of phycobiliproteins have provided valuable information regarding the conformations and amino acid organizations of peptides and chromophores, and enable us to investigate their structural and functional relationships with respect to environmental variations. In this work, we explored the pH-induced conformational and functional dynamics of R-phycoerythrin (R-PE) by means of absorption, fluorescence and circular dichroism spectra, together with analysis of its crystal structure. R-PE presents stronger functional stability in the pH range of 3.5-10 compared to the structural stability. Beyond this range, pronounced functional and structural changes occur. Crystal structure analysis shows that the tertiary structure of R-PE is fixed by several key anchoring points of the protein. With this specific association, the fundamental structure of R-PE is stabilized to present physiological spectroscopic properties, while local variations in protein peptides are also allowed in response to environmental disturbances. The functional stability and relative structural sensitivity of R-PE allow environmental adaptation.
Structure of a group II intron in complex with its reverse transcriptase.
Qu, Guosheng; Kaushal, Prem Singh; Wang, Jia; Shigematsu, Hideki; Piazza, Carol Lyn; Agrawal, Rajendra Kumar; Belfort, Marlene; Wang, Hong-Wei
2016-06-01
Bacterial group II introns are large catalytic RNAs related to nuclear spliceosomal introns and eukaryotic retrotransposons. They self-splice, yielding mature RNA, and integrate into DNA as retroelements. A fully active group II intron forms a ribonucleoprotein complex comprising the intron ribozyme and an intron-encoded protein that performs multiple activities including reverse transcription, in which intron RNA is copied into the DNA target. Here we report cryo-EM structures of an endogenously spliced Lactococcus lactis group IIA intron in its ribonucleoprotein complex form at 3.8-Å resolution and in its protein-depleted form at 4.5-Å resolution, revealing functional coordination of the intron RNA with the protein. Remarkably, the protein structure reveals a close relationship between the reverse transcriptase catalytic domain and telomerase, whereas the active splicing center resembles the spliceosomal Prp8 protein. These extraordinary similarities hint at intricate ancestral relationships and provide new insights into splicing and retromobility.
Molecular Structures and Functional Relationships in Clostridial Neurotoxins
DOE Office of Scientific and Technical Information (OSTI.GOV)
Swaminathan S.
2011-12-01
The seven serotypes of Clostridium botulinum neurotoxins (A-G) are the deadliest poison known to humans. They share significant sequence homology and hence possess similar structure-function relationships. Botulinum neurotoxins (BoNT) act via a four-step mechanism, viz., binding and internalization to neuronal cells, translocation of the catalytic domain into the cytosol and finally cleavage of one of the three soluble N-ethylmaleimide-sensitive factor attachment protein receptors (SNARE) causing blockage of neurotransmitter release leading to flaccid paralysis. Crystal structures of three holotoxins, BoNT/A, B and E, are available to date. Although the individual domains are remarkably similar, their domain organization is different. These structuresmore » have helped in correlating the structural and functional domains. This has led to the determination of structures of individual domains and combinations of them. Crystal structures of catalytic domains of all serotypes and several binding domains are now available. The catalytic domains are zinc endopeptidases and share significant sequence and structural homology. The active site architecture and the catalytic mechanism are similar although the binding mode of individual substrates may be different, dictating substrate specificity and peptide cleavage selectivity. Crystal structures of catalytic domains with substrate peptides provide clues to specificity and selectivity unique to BoNTs. Crystal structures of the receptor domain in complex with ganglioside or the protein receptor have provided information about the binding of botulinum neurotoxin to the neuronal cell. An overview of the structure-function relationship correlating the 3D structures with biochemical and biophysical data and how they can be used for structure-based drug discovery is presented here.« less
Konc, Janez; Cesnik, Tomo; Konc, Joanna Trykowska; Penca, Matej; Janežič, Dušanka
2012-02-27
ProBiS-Database is a searchable repository of precalculated local structural alignments in proteins detected by the ProBiS algorithm in the Protein Data Bank. Identification of functionally important binding regions of the protein is facilitated by structural similarity scores mapped to the query protein structure. PDB structures that have been aligned with a query protein may be rapidly retrieved from the ProBiS-Database, which is thus able to generate hypotheses concerning the roles of uncharacterized proteins. Presented with uncharacterized protein structure, ProBiS-Database can discern relationships between such a query protein and other better known proteins in the PDB. Fast access and a user-friendly graphical interface promote easy exploration of this database of over 420 million local structural alignments. The ProBiS-Database is updated weekly and is freely available online at http://probis.cmm.ki.si/database.
Shahbaaz, Mohd; Ahmad, Faizan; Imtaiyaz Hassan, Md
2015-06-01
Haemophilus influenzae is a small pleomorphic Gram-negative bacteria which causes several chronic diseases, including bacteremia, meningitis, cellulitis, epiglottitis, septic arthritis, pneumonia, and empyema. Here we extensively analyzed the sequenced genome of H. influenzae strain Rd KW20 using protein family databases, protein structure prediction, pathways and genome context methods to assign a precise function to proteins whose functions are unknown. These proteins are termed as hypothetical proteins (HPs), for which no experimental information is available. Function prediction of these proteins would surely be supportive to precisely understand the biochemical pathways and mechanism of pathogenesis of Haemophilus influenzae. During the extensive analysis of H. influenzae genome, we found the presence of eight HPs showing lyase activity. Subsequently, we modeled and analyzed three-dimensional structure of all these HPs to determine their functions more precisely. We found these HPs possess cystathionine-β-synthase, cyclase, carboxymuconolactone decarboxylase, pseudouridine synthase A and C, D-tagatose-1,6-bisphosphate aldolase and aminodeoxychorismate lyase-like features, indicating their corresponding functions in the H. influenzae. Lyases are actively involved in the regulation of biosynthesis of various hormones, metabolic pathways, signal transduction, and DNA repair. Lyases are also considered as a key player for various biological processes. These enzymes are critically essential for the survival and pathogenesis of H. influenzae and, therefore, these enzymes may be considered as a potential target for structure-based rational drug design. Our structure-function relationship analysis will be useful to search and design potential lead molecules based on the structure of these lyases, for drug design and discovery.
Nagasundaram, N; Priya Doss, C George
2011-01-01
Distinguishing the deleterious from the massive number of non-functional nsSNPs that occur within a single genome is a considerable challenge in mutation research. In this approach, we have used the existing in silico methods to explore the mutation-structure-function relationship in the XPAgene. We used the Sorting Intolerant From Tolerant (SIFT), Polymorphism Phenotyping (PolyPhen), I-Mutant 2.0, and the Protein Analysis THrough Evolutionary Relationships methods to predict the effects of deleterious nsSNPs on protein function and evaluated the impact of mutation on protein stability by Molecular Dynamics simulations. By comparing the scores of all the four in silico methods, nsSNP with an ID rs104894131 at position C108F was predicted to be highly deleterious. We extended our Molecular dynamics approach to gain insight into the impact of this non-synonymous polymorphism on structural changes that may affect the activity of the XPAgene. Based on the in silico methods score, potential energy, root-mean-square deviation, and root-mean-square fluctuation, we predict that deleterious nsSNP at position C108F would play a significant role in causing disease by the XPA gene. Our approach would present the application of in silicotools in understanding the functional variation from the perspective of structure, evolution, and phenotype.
On the relationship between residue structural environment and sequence conservation in proteins.
Liu, Jen-Wei; Lin, Jau-Ji; Cheng, Chih-Wen; Lin, Yu-Feng; Hwang, Jenn-Kang; Huang, Tsun-Tsao
2017-09-01
Residues that are crucial to protein function or structure are usually evolutionarily conserved. To identify the important residues in protein, sequence conservation is estimated, and current methods rely upon the unbiased collection of homologous sequences. Surprisingly, our previous studies have shown that the sequence conservation is closely correlated with the weighted contact number (WCN), a measure of packing density for residue's structural environment, calculated only based on the C α positions of a protein structure. Moreover, studies have shown that sequence conservation is correlated with environment-related structural properties calculated based on different protein substructures, such as a protein's all atoms, backbone atoms, side-chain atoms, or side-chain centroid. To know whether the C α atomic positions are adequate to show the relationship between residue environment and sequence conservation or not, here we compared C α atoms with other substructures in their contributions to the sequence conservation. Our results show that C α positions are substantially equivalent to the other substructures in calculations of various measures of residue environment. As a result, the overlapping contributions between C α atoms and the other substructures are high, yielding similar structure-conservation relationship. Take the WCN as an example, the average overlapping contribution to sequence conservation is 87% between C α and all-atom substructures. These results indicate that only C α atoms of a protein structure could reflect sequence conservation at the residue level. © 2017 Wiley Periodicals, Inc.
Cho, Kyung Ho; Bae, Hyoung Eun; Das, Manabendra; Gellman, Samuel H; Chae, Pil Seok
2014-02-01
Membrane proteins are inherently amphipathic and undergo dynamic conformational changes for proper function within native membranes. Maintaining the functional structures of these biomacromolecules in aqueous media is necessary for structural studies but difficult to achieve with currently available tools, thus necessitating the development of novel agents with favorable properties. This study introduces several new glucose-neopentyl glycol (GNG) amphiphiles and reveals some agents that display favorable behaviors for the solubilization and stabilization of a large, multi-subunit membrane protein assembly. Furthermore, a detergent structure-property relationship that could serve as a useful guideline for the design of novel amphiphiles is discussed. Copyright © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Kant, Ravi; Rayaprolu, Vamseedhar; McDonald, Kaitlyn; Bothner, Brian
2018-06-01
The beauty, symmetry, and functionality of icosahedral virus capsids has attracted the attention of biologists, physicists, and mathematicians ever since they were first observed. Viruses and protein cages assemble into functional architectures in a range of sizes, shapes, and symmetries. To fulfill their biological roles, these structures must self-assemble, resist stress, and are often dynamic. The increasing use of icosahedral capsids and cages in materials science has driven the need to quantify them in terms of structural properties such as rigidity, stiffness, and viscoelasticity. In this study, we employed Quartz Crystal Microbalance with Dissipation technology (QCM-D) to characterize and compare the mechanical rigidity of different protein cages and viruses. We attempted to unveil the relationships between rigidity, radius, shell thickness, and triangulation number. We show that the rigidity and triangulation numbers are inversely related to each other and the comparison of rigidity and radius also follows the same trend. Our results suggest that subunit orientation, protein-protein interactions, and protein-nucleic acid interactions are important for the resistance to deformation of these complexes, however, the relationships are complex and need to be explored further. The QCM-D based viscoelastic measurements presented here help us elucidate these relationships and show the future prospect of this technique in the field of physical virology and nano-biotechnology.
Structures composing protein domains.
Kubrycht, Jaroslav; Sigler, Karel; Souček, Pavel; Hudeček, Jiří
2013-08-01
This review summarizes available data concerning intradomain structures (IS) such as functionally important amino acid residues, short linear motifs, conserved or disordered regions, peptide repeats, broadly occurring secondary structures or folds, etc. IS form structural features (units or elements) necessary for interactions with proteins or non-peptidic ligands, enzyme reactions and some structural properties of proteins. These features have often been related to a single structural level (e.g. primary structure) mostly requiring certain structural context of other levels (e.g. secondary structures or supersecondary folds) as follows also from some examples reported or demonstrated here. In addition, we deal with some functionally important dynamic properties of IS (e.g. flexibility and different forms of accessibility), and more special dynamic changes of IS during enzyme reactions and allosteric regulation. Selected notes concern also some experimental methods, still more necessary tools of bioinformatic processing and clinically interesting relationships. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
Thompson, Jared J; Tabatabaei Ghomi, Hamed; Lill, Markus A
2014-12-01
Knowledge-based methods for analyzing protein structures, such as statistical potentials, primarily consider the distances between pairs of bodies (atoms or groups of atoms). Considerations of several bodies simultaneously are generally used to characterize bonded structural elements or those in close contact with each other, but historically do not consider atoms that are not in direct contact with each other. In this report, we introduce an information-theoretic method for detecting and quantifying distance-dependent through-space multibody relationships between the sidechains of three residues. The technique introduced is capable of producing convergent and consistent results when applied to a sufficiently large database of randomly chosen, experimentally solved protein structures. The results of our study can be shown to reproduce established physico-chemical properties of residues as well as more recently discovered properties and interactions. These results offer insight into the numerous roles that residues play in protein structure, as well as relationships between residue function, protein structure, and evolution. The techniques and insights presented in this work should be useful in the future development of novel knowledge-based tools for the evaluation of protein structure. © 2014 Wiley Periodicals, Inc.
Functional Classification of Immune Regulatory Proteins
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rubinstein, Rotem; Ramagopal, Udupi A.; Nathenson, Stanley G.
2013-05-01
Members of the immunoglobulin superfamily (IgSF) control innate and adaptive immunity and are prime targets for the treatment of autoimmune diseases, infectious diseases, and malignancies. We describe a computational method, termed the Brotherhood algorithm, which utilizes intermediate sequence information to classify proteins into functionally related families. This approach identifies functional relationships within the IgSF and predicts additional receptor-ligand interactions. As a specific example, we examine the nectin/nectin-like family of cell adhesion and signaling proteins and propose receptor-ligand interactions within this family. We were guided by the Brotherhood approach and present the high-resolution structural characterization of a homophilic interaction involving themore » class-I MHC-restricted T-cell-associated molecule, which we now classify as a nectin-like family member. The Brotherhood algorithm is likely to have a significant impact on structural immunology by identifying those proteins and complexes for which structural characterization will be particularly informative.« less
Beta-propellers: associated functions and their role in human diseases.
Pons, Tirso; Gómez, Raú; Chinea, Glay; Valencia, Alfonso
2003-03-01
The beta-propeller fold appears as a very fascinating architecture based on four-stranded antiparallel and twisted beta-sheets, radially arranged around a central tunnel. Similar to the alpha/beta-barrel (TIM-barrel) fold, the beta-propeller has a wide range of different functions, and is gaining substantial attention. Some proteins containing beta-propeller domains have been implicated in the pathogenesis of a variety of diseases such as cancer, Alzheimer, Huntington, arthritis, familial hypercholesterolemia, retinitis pigmentosa, osteogenesis, hypertension, and microbial and viral infections. This article reviews some aspects of 3D structure, amino acids sequence regularities, and biological functions of the proteins containing beta-propeller domains. Major emphasis has been laid on beta-propellers whose functions are associated to human diseases. Recent research efforts reported in the fields of protein engineering, drug design, and protein structure-function relationship studies, concerning the beta-propeller architecture, have also been discussed.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Marks, Shawn M.; Lockhart, Samuel N.; Baker, Suzanne L.
Normal aging is associated with a decline in episodic memory and also with aggregation of the β-amyloid (Aβ) and tau proteins and atrophy of medial temporal lobe (MTL) structures crucial to memory formation. Although some evidence suggests that Aβ is associated with aberrant neural activity, the relationships among these two aggregated proteins, neural function, and brain structure are poorly understood. Using in vivo human Aβ and tau imaging, we demonstrate that increased Aβ and tau are both associated with aberrant fMRI activity in the MTL during memory encoding in cognitively normal older adults. This pathological neural activity was in turnmore » associated with worse memory performance and atrophy within the MTL. A mediation analysis revealed that the relationship with regional atrophy was explained by MTL tau. These findings broaden the concept of cognitive aging to include evidence of Alzheimer’s disease-related protein aggregation as an underlying mechanism of age-related memory impairment.« less
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa
2015-01-01
Background Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. Results One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. Conclusions These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity. PMID:26693737
Caveolae, caveolins, and cavins: complex control of cellular signalling and inflammation.
Chidlow, John H; Sessa, William C
2010-05-01
Caveolae are specialized lipid rafts that form flask-shaped invaginations of the plasma membrane. They are involved in cell signalling and transport and have been shown critically regulate vascular reactivity and blood pressure. The organization and functions of caveolae are mediated by coat proteins (caveolins) and support or adapter proteins (cavins). The caveolins, caveolin-1, -2, and -3, form the structural backbone of caveolae. These proteins are also highly integrated into caveolae function and have their own activity independent of caveolae. The cavins, cavins 1-4, are involved in regulation of caveolae and modulate the function of caveolins by promoting the membrane remodelling and trafficking of caveolin-derived structures. The relationships between these different proteins are complex and intersect with many aspects of cell function. Caveolae have also been implicated in chronic inflammatory conditions and other pathologies including atherosclerosis, inflammatory bowel disease, muscular dystrophy, and generalized dyslipidaemia. The pathogenic role of the caveolins is an emerging area, however, the roles of cavins in disease is just beginning to be explored. This review will examine the relationship between caveolins and cavins and explore the role of caveolae in inflammatory signalling mechanisms.
Kaur, Gurmeet; Subramanian, Srikrishna
2016-08-26
Treble clef (TC) zinc fingers constitute a large fold-group of structural zinc-binding protein domains that mediate numerous cellular functions. We have analysed the sequence, structure, and function relationships among all TCs in the Protein Data Bank. This led to the identification of novel TCs, such as lsr2, YggX and TFIIIC τ 60 kDa subunit, and prediction of a nuclease-like function for the DUF1364 family. The structural malleability of TCs is evident from the many examples with variations to the core structural elements of the fold. We observe domains wherein the structural core of the TC fold is circularly permuted, and also some examples where the overall fold resembles both the TC motif and another unrelated fold. All extant TC families do not share a monophyletic origin, as several TC proteins are known to have been present in the last universal common ancestor and the last eukaryotic common ancestor. We identify several TCs where the zinc-chelating site and residues are not merely responsible for structure stabilization but also perform other functions, such as being redox active in C1B domain of protein kinase C, a nucleophilic acceptor in Ada and catalytic in organomercurial lyase, MerB.
NASA Astrophysics Data System (ADS)
Kaur, Gurmeet; Subramanian, Srikrishna
2016-08-01
Treble clef (TC) zinc fingers constitute a large fold-group of structural zinc-binding protein domains that mediate numerous cellular functions. We have analysed the sequence, structure, and function relationships among all TCs in the Protein Data Bank. This led to the identification of novel TCs, such as lsr2, YggX and TFIIIC τ 60 kDa subunit, and prediction of a nuclease-like function for the DUF1364 family. The structural malleability of TCs is evident from the many examples with variations to the core structural elements of the fold. We observe domains wherein the structural core of the TC fold is circularly permuted, and also some examples where the overall fold resembles both the TC motif and another unrelated fold. All extant TC families do not share a monophyletic origin, as several TC proteins are known to have been present in the last universal common ancestor and the last eukaryotic common ancestor. We identify several TCs where the zinc-chelating site and residues are not merely responsible for structure stabilization but also perform other functions, such as being redox active in C1B domain of protein kinase C, a nucleophilic acceptor in Ada and catalytic in organomercurial lyase, MerB.
Protein Delivery into Plant Cells: Toward In vivo Structural Biology
Cedeño, Cesyen; Pauwels, Kris; Tompa, Peter
2017-01-01
Understanding the biologically relevant structural and functional behavior of proteins inside living plant cells is only possible through the combination of structural biology and cell biology. The state-of-the-art structural biology techniques are typically applied to molecules that are isolated from their native context. Although most experimental conditions can be easily controlled while dealing with an isolated, purified protein, a serious shortcoming of such in vitro work is that we cannot mimic the extremely complex intracellular environment in which the protein exists and functions. Therefore, it is highly desirable to investigate proteins in their natural habitat, i.e., within live cells. This is the major ambition of in-cell NMR, which aims to approach structure-function relationship under true in vivo conditions following delivery of labeled proteins into cells under physiological conditions. With a multidisciplinary approach that includes recombinant protein production, confocal fluorescence microscopy, nuclear magnetic resonance (NMR) spectroscopy and different intracellular protein delivery strategies, we explore the possibility to develop in-cell NMR studies in living plant cells. While we provide a comprehensive framework to set-up in-cell NMR, we identified the efficient intracellular introduction of isotope-labeled proteins as the major bottleneck. Based on experiments with the paradigmatic intrinsically disordered proteins (IDPs) Early Response to Dehydration protein 10 and 14, we also established the subcellular localization of ERD14 under abiotic stress. PMID:28469623
Structural studies of G protein-coupled receptors.
Lu, Mengjie; Wu, Beili
2016-11-01
G protein-coupled receptors (GPCRs) comprise the largest membrane protein family. These receptors sense a variety of signaling molecules, activate multiple intracellular signal pathways, and act as the targets of over 40% of marketed drugs. Recent progress on GPCR structural studies provides invaluable insights into the structure-function relationship of the GPCR superfamily, deepening our understanding about the molecular mechanisms of GPCR signal transduction. Here, we review recent breakthroughs on GPCR structure determination and the structural features of GPCRs, and take the structures of chemokine receptor CCR5 and purinergic receptors P2Y 1 R and P2Y 12 R as examples to discuss the importance of GPCR structures on functional studies and drug discovery. In addition, we discuss the prospect of GPCR structure-based drug discovery. © 2016 IUBMB Life, 68(11):894-903, 2016. © 2016 International Union of Biochemistry and Molecular Biology.
Protein domain organisation: adding order.
Kummerfeld, Sarah K; Teichmann, Sarah A
2009-01-29
Domains are the building blocks of proteins. During evolution, they have been duplicated, fused and recombined, to produce proteins with novel structures and functions. Structural and genome-scale studies have shown that pairs or groups of domains observed together in a protein are almost always found in only one N to C terminal order and are the result of a single recombination event that has been propagated by duplication of the multi-domain unit. Previous studies of domain organisation have used graph theory to represent the co-occurrence of domains within proteins. We build on this approach by adding directionality to the graphs and connecting nodes based on their relative order in the protein. Most of the time, the linear order of domains is conserved. However, using the directed graph representation we have identified non-linear features of domain organization that are over-represented in genomes. Recognising these patterns and unravelling how they have arisen may allow us to understand the functional relationships between domains and understand how the protein repertoire has evolved. We identify groups of domains that are not linearly conserved, but instead have been shuffled during evolution so that they occur in multiple different orders. We consider 192 genomes across all three kingdoms of life and use domain and protein annotation to understand their functional significance. To identify these features and assess their statistical significance, we represent the linear order of domains in proteins as a directed graph and apply graph theoretical methods. We describe two higher-order patterns of domain organisation: clusters and bi-directionally associated domain pairs and explore their functional importance and phylogenetic conservation. Taking into account the order of domains, we have derived a novel picture of global protein organization. We found that all genomes have a higher than expected degree of clustering and more domain pairs in forward and reverse orientation in different proteins relative to random graphs with identical degree distributions. While these features were statistically over-represented, they are still fairly rare. Looking in detail at the proteins involved, we found strong functional relationships within each cluster. In addition, the domains tended to be involved in protein-protein interaction and are able to function as independent structural units. A particularly striking example was the human Jak-STAT signalling pathway which makes use of a set of domains in a range of orders and orientations to provide nuanced signaling functionality. This illustrated the importance of functional and structural constraints (or lack thereof) on domain organisation.
Determinants of cation transport selectivity: Equilibrium binding and transport kinetics
2015-01-01
The crystal structures of channels and transporters reveal the chemical nature of ion-binding sites and, thereby, constrain mechanistic models for their transport processes. However, these structures, in and of themselves, do not reveal equilibrium selectivity or transport preferences, which can be discerned only from various functional assays. In this Review, I explore the relationship between cation transport protein structures, equilibrium binding measurements, and ion transport selectivity. The primary focus is on K+-selective channels and nonselective cation channels because they have been extensively studied both functionally and structurally, but the principles discussed are relevant to other transport proteins and molecules. PMID:26078056
Isom, Daniel G; Marguet, Philippe R; Oas, Terrence G; Hellinga, Homme W
2011-04-01
Protein thermodynamic stability is a fundamental physical characteristic that determines biological function. Furthermore, alteration of thermodynamic stability by macromolecular interactions or biochemical modifications is a powerful tool for assessing the relationship between protein structure, stability, and biological function. High-throughput approaches for quantifying protein stability are beginning to emerge that enable thermodynamic measurements on small amounts of material, in short periods of time, and using readily accessible instrumentation. Here we present such a method, fast quantitative cysteine reactivity, which exploits the linkage between protein stability, sidechain protection by protein structure, and structural dynamics to characterize the thermodynamic and kinetic properties of proteins. In this approach, the reaction of a protected cysteine and thiol-reactive fluorogenic indicator is monitored over a gradient of temperatures after a short incubation time. These labeling data can be used to determine the midpoint of thermal unfolding, measure the temperature dependence of protein stability, quantify ligand-binding affinity, and, under certain conditions, estimate folding rate constants. Here, we demonstrate the fQCR method by characterizing these thermodynamic and kinetic properties for variants of Staphylococcal nuclease and E. coli ribose-binding protein engineered to contain single, protected cysteines. These straightforward, information-rich experiments are likely to find applications in protein engineering and functional genomics. Copyright © 2010 Wiley-Liss, Inc.
Looking at the Disordered Proteins through the Computational Microscope
2018-01-01
Intrinsically disordered proteins (IDPs) have attracted wide interest over the past decade due to their surprising prevalence in the proteome and versatile roles in cell physiology and pathology. A large selection of IDPs has been identified as potential targets for therapeutic intervention. Characterizing the structure–function relationship of disordered proteins is therefore an essential but daunting task, as these proteins can adapt transient structure, necessitating a new paradigm for connecting structural disorder to function. Molecular simulation has emerged as a natural complement to experiments for atomic-level characterizations and mechanistic investigations of this intriguing class of proteins. The diverse range of length and time scales involved in IDP function requires performing simulations at multiple levels of resolution. In this Outlook, we focus on summarizing available simulation methods, along with a few interesting example applications. We also provide an outlook on how these simulation methods can be further improved in order to provide a more accurate description of IDP structure, binding, and assembly.
Molecular structures and functional relationships in clostridial neurotoxins.
Swaminathan, Subramanyam
2011-12-01
The seven serotypes of Clostridium botulinum neurotoxins (A-G) are the deadliest poison known to humans. They share significant sequence homology and hence possess similar structure-function relationships. Botulinum neurotoxins (BoNT) act via a four-step mechanism, viz., binding and internalization to neuronal cells, translocation of the catalytic domain into the cytosol and finally cleavage of one of the three soluble N-ethylmaleimide-sensitive factor attachment protein receptors (SNARE) causing blockage of neurotransmitter release leading to flaccid paralysis. Crystal structures of three holotoxins, BoNT/A, B and E, are available to date. Although the individual domains are remarkably similar, their domain organization is different. These structures have helped in correlating the structural and functional domains. This has led to the determination of structures of individual domains and combinations of them. Crystal structures of catalytic domains of all serotypes and several binding domains are now available. The catalytic domains are zinc endopeptidases and share significant sequence and structural homology. The active site architecture and the catalytic mechanism are similar although the binding mode of individual substrates may be different, dictating substrate specificity and peptide cleavage selectivity. Crystal structures of catalytic domains with substrate peptides provide clues to specificity and selectivity unique to BoNTs. Crystal structures of the receptor domain in complex with ganglioside or the protein receptor have provided information about the binding of botulinum neurotoxin to the neuronal cell. An overview of the structure-function relationship correlating the 3D structures with biochemical and biophysical data and how they can be used for structure-based drug discovery is presented here. Journal compilation © 2011 FEBS. No claim to original US government works.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Goldstein, D.A.; Rackovsky, S.R.
1989-08-01
During the initial period of this work we explored the differential geometry results which had been used to explain the structure-function relationships in the set of yeast iso-1-cytochrome c mutants studied under the initial contract. In addition we continued the development of techniques which would permit the structural characterization and comparison of proteins in a very efficient manner. We have expanded the studies based on the characterization of the structural preferences of various residues in a sample of twenty six globular proteins. It has been demonstrated that the overall structural preferences and the amino acid specific preferences seen in themore » analysis carried out at the five alpha carbon level can not be explained by the results of the analysis carried out at the four alpha carbon level. Thus the structural preferences seen must be described by considering groups of five or more residues. We do no yet have enough data to extend the analysis to the six alpha carbon unit level. We have also verified that the yeast/tuna structural analogy which we used before was justified, and have performed a conformational energy minimization of the reduced yeast cytochrome c crystal data in order to have a baseline for the study of mutant proteins. 6 refs.« less
An Algorithm for Protein Helix Assignment Using Helix Geometry
Cao, Chen; Xu, Shutan; Wang, Lincong
2015-01-01
Helices are one of the most common and were among the earliest recognized secondary structure elements in proteins. The assignment of helices in a protein underlies the analysis of its structure and function. Though the mathematical expression for a helical curve is simple, no previous assignment programs have used a genuine helical curve as a model for helix assignment. In this paper we present a two-step assignment algorithm. The first step searches for a series of bona fide helical curves each one best fits the coordinates of four successive backbone Cα atoms. The second step uses the best fit helical curves as input to make helix assignment. The application to the protein structures in the PDB (protein data bank) proves that the algorithm is able to assign accurately not only regular α-helix but also 310 and π helices as well as their left-handed versions. One salient feature of the algorithm is that the assigned helices are structurally more uniform than those by the previous programs. The structural uniformity should be useful for protein structure classification and prediction while the accurate assignment of a helix to a particular type underlies structure-function relationship in proteins. PMID:26132394
Recent advances in MeCP2 structure and function1
Hite, Kristopher C.; Adams, Valerie H.; Hansen, Jeffrey C.
2010-01-01
Mutations in methyl DNA binding protein 2 (MeCP2) cause the neurodevelopmental disorder Rett syndrome (RTT). The mechanism(s) by which the native MeCP2 protein operates in the cell are not well understood. Historically, MeCP2 has been characterized as a proximal gene silencer with 2 functional domains: a methyl DNA binding domain and a transcription repression domain. However, several lines of new data indicate that MeCP2 structure and function relationships are more complex. In this review, we first discuss recent studies that have advanced understanding of the basic structural biochemistry of MeCP2. This is followed by an analysis of cell-based experiments suggesting MeCP2 is a regulator, rather than a strict silencer, of transcription. The new data establish MeCP2 as a multifunctional nuclear protein, with potentially important roles in chromatin architecture, regulation of RNA splicing, and active transcription. We conclude by discussing clinical correlations between domain-specific mutations and RTT pathology to stress that all structural domains of MeCP2 are required to properly mediate cellular function of the intact protein. PMID:19234536
Jaiswal, Mamta; Dvorsky, Radovan; Ahmadian, Mohammad Reza
2013-02-08
The diffuse B-cell lymphoma (Dbl) family of the guanine nucleotide exchange factors is a direct activator of the Rho family proteins. The Rho family proteins are involved in almost every cellular process that ranges from fundamental (e.g. the establishment of cell polarity) to highly specialized processes (e.g. the contraction of vascular smooth muscle cells). Abnormal activation of the Rho proteins is known to play a crucial role in cancer, infectious and cognitive disorders, and cardiovascular diseases. However, the existence of 74 Dbl proteins and 25 Rho-related proteins in humans, which are largely uncharacterized, has led to increasing complexity in identifying specific upstream pathways. Thus, we comprehensively investigated sequence-structure-function-property relationships of 21 representatives of the Dbl protein family regarding their specificities and activities toward 12 Rho family proteins. The meta-analysis approach provides an unprecedented opportunity to broadly profile functional properties of Dbl family proteins, including catalytic efficiency, substrate selectivity, and signaling specificity. Our analysis has provided novel insights into the following: (i) understanding of the relative differences of various Rho protein members in nucleotide exchange; (ii) comparing and defining individual and overall guanine nucleotide exchange factor activities of a large representative set of the Dbl proteins toward 12 Rho proteins; (iii) grouping the Dbl family into functionally distinct categories based on both their catalytic efficiencies and their sequence-structural relationships; (iv) identifying conserved amino acids as fingerprints of the Dbl and Rho protein interaction; and (v) defining amino acid sequences conserved within, but not between, Dbl subfamilies. Therefore, the characteristics of such specificity-determining residues identified the regions or clusters conserved within the Dbl subfamilies.
Designing protein-based biomaterials for medical applications.
Gagner, Jennifer E; Kim, Wookhyun; Chaikof, Elliot L
2014-04-01
Biomaterials produced by nature have been honed through billions of years, evolving exquisitely precise structure-function relationships that scientists strive to emulate. Advances in genetic engineering have facilitated extensive investigations to determine how changes in even a single peptide within a protein sequence can produce biomaterials with unique thermal, mechanical and biological properties. Elastin, a naturally occurring protein polymer, serves as a model protein to determine the relationship between specific structural elements and desirable material characteristics. The modular, repetitive nature of the protein facilitates the formation of well-defined secondary structures with the ability to self-assemble into complex three-dimensional architectures on a variety of length scales. Furthermore, many opportunities exist to incorporate other protein-based motifs and inorganic materials into recombinant protein-based materials, extending the range and usefulness of these materials in potential biomedical applications. Elastin-like polypeptides (ELPs) can be assembled into 3-D architectures with precise control over payload encapsulation, mechanical and thermal properties, as well as unique functionalization opportunities through both genetic and enzymatic means. An overview of current protein-based materials, their properties and uses in biomedicine will be provided, with a focus on the advantages of ELPs. Applications of these biomaterials as imaging and therapeutic delivery agents will be discussed. Finally, broader implications and future directions of these materials as diagnostic and therapeutic systems will be explored. Copyright © 2013 Elsevier Ltd. All rights reserved.
Designing Protein-Based Biomaterials for Medical Applications
Gagner, Jennifer E.; Kim, Wookhyun; Chaikof, Elliot L.
2013-01-01
Biomaterials produced by nature have been honed through billions of years, evolving exquisitely precise structure-function relationships that scientists strive to emulate. Advances in genetic engineering have facilitated extensive investigations to determine how changes in even a single peptide within a protein sequence can produce biomaterials with unique thermal, mechanical and biological properties. Elastin, a naturally occurring protein polymer, serves as a model protein to determine the relationship between specific structural elements and desirable material characteristics. The modular, repetitive nature of the protein facilitates the formation of well-defined secondary structures with the ability to self-assemble into complex three-dimensional architectures on a variety of length scales. Furthermore, many opportunities exist to incorporate other protein-based motifs and inorganic materials into recombinant protein-based materials, extending the range and usefulness of these materials in potential biomedical applications. Elastin-like polypeptides can be assembled into 3D architectures with precise control over payload encapsulation, mechanical and thermal properties, as well as unique functionalization opportunities through both genetic and enzymatic means. An overview of current protein-based materials, their properties and uses in biomedicine will be provided, with a focus on the advantages of elastin-like polypeptides. Applications of these biomaterials as imaging and therapeutic delivery agents will be discussed. Finally, broader implications and future directions of these materials as diagnostic and therapeutic systems will be explored. PMID:24121196
Chiang, Harry; Robinson, Lucy C; Brame, Cynthia J; Messina, Troy C
2013-01-01
Over the past 20 years, the biological sciences have increasingly incorporated chemistry, physics, computer science, and mathematics to aid in the development and use of mathematical models. Such combined approaches have been used to address problems from protein structure-function relationships to the workings of complex biological systems. Computer simulations of molecular events can now be accomplished quickly and with standard computer technology. Also, simulation software is freely available for most computing platforms, and online support for the novice user is ample. We have therefore created a molecular dynamics laboratory module to enhance undergraduate student understanding of molecular events underlying organismal phenotype. This module builds on a previously described project in which students use site-directed mutagenesis to investigate functions of conserved sequence features in members of a eukaryotic protein kinase family. In this report, we detail the laboratory activities of a MD module that provide a complement to phenotypic outcomes by providing a hypothesis-driven and quantifiable measure of predicted structural changes caused by targeted mutations. We also present examples of analyses students may perform. These laboratory activities can be integrated with genetics or biochemistry experiments as described, but could also be used independently in any course that would benefit from a quantitative approach to protein structure-function relationships. Copyright © 2013 Wiley Periodicals, Inc.
Modelling dynamics in protein crystal structures by ensemble refinement
Burnley, B Tom; Afonine, Pavel V; Adams, Paul D; Gros, Piet
2012-01-01
Single-structure models derived from X-ray data do not adequately account for the inherent, functionally important dynamics of protein molecules. We generated ensembles of structures by time-averaged refinement, where local molecular vibrations were sampled by molecular-dynamics (MD) simulation whilst global disorder was partitioned into an underlying overall translation–libration–screw (TLS) model. Modeling of 20 protein datasets at 1.1–3.1 Å resolution reduced cross-validated Rfree values by 0.3–4.9%, indicating that ensemble models fit the X-ray data better than single structures. The ensembles revealed that, while most proteins display a well-ordered core, some proteins exhibit a ‘molten core’ likely supporting functionally important dynamics in ligand binding, enzyme activity and protomer assembly. Order–disorder changes in HIV protease indicate a mechanism of entropy compensation for ordering the catalytic residues upon ligand binding by disordering specific core residues. Thus, ensemble refinement extracts dynamical details from the X-ray data that allow a more comprehensive understanding of structure–dynamics–function relationships. DOI: http://dx.doi.org/10.7554/eLife.00311.001 PMID:23251785
Goonesekere, Nalin Cw
2009-01-01
The large numbers of protein sequences generated by whole genome sequencing projects require rapid and accurate methods of annotation. The detection of homology through computational sequence analysis is a powerful tool in determining the complex evolutionary and functional relationships that exist between proteins. Homology search algorithms employ amino acid substitution matrices to detect similarity between proteins sequences. The substitution matrices in common use today are constructed using sequences aligned without reference to protein structure. Here we present amino acid substitution matrices constructed from the alignment of a large number of protein domain structures from the structural classification of proteins (SCOP) database. We show that when incorporated into the homology search algorithms BLAST and PSI-blast, the structure-based substitution matrices enhance the efficacy of detecting remote homologs.
Harnessing glycomics technologies: integrating structure with function for glycan characterization
Robinson, Luke N.; Artpradit, Charlermchai; Raman, Rahul; Shriver, Zachary H.; Ruchirawat, Mathuros; Sasisekharan, Ram
2013-01-01
Glycans, or complex carbohydrates, are a ubiquitous class of biological molecules which impinge on a variety of physiological processes ranging from signal transduction to tissue development and microbial pathogenesis. In comparison to DNA and proteins, glycans present unique challenges to the study of their structure and function owing to their complex and heterogeneous structures and the dominant role played by multivalency in their sequence-specific biological interactions. Arising from these challenges, there is a need to integrate information from multiple complementary methods to decode structure-function relationships. Focusing on acidic glycans, we describe here key glycomics technologies for characterizing their structural attributes, including linkage, modifications, and topology, as well as for elucidating their role in biological processes. Two cases studies, one involving sialylated branched glycans and the other sulfated glycosaminoglycans, are used to highlight how integration of orthogonal information from diverse datasets enables rapid convergence of glycan characterization for development of robust structure-function relationships. PMID:22522536
Structural Elements Regulating AAA+ Protein Quality Control Machines.
Chang, Chiung-Wen; Lee, Sukyeong; Tsai, Francis T F
2017-01-01
Members of the ATPases Associated with various cellular Activities (AAA+) superfamily participate in essential and diverse cellular pathways in all kingdoms of life by harnessing the energy of ATP binding and hydrolysis to drive their biological functions. Although most AAA+ proteins share a ring-shaped architecture, AAA+ proteins have evolved distinct structural elements that are fine-tuned to their specific functions. A central question in the field is how ATP binding and hydrolysis are coupled to substrate translocation through the central channel of ring-forming AAA+ proteins. In this mini-review, we will discuss structural elements present in AAA+ proteins involved in protein quality control, drawing similarities to their known role in substrate interaction by AAA+ proteins involved in DNA translocation. Elements to be discussed include the pore loop-1, the Inter-Subunit Signaling (ISS) motif, and the Pre-Sensor I insert (PS-I) motif. Lastly, we will summarize our current understanding on the inter-relationship of those structural elements and propose a model how ATP binding and hydrolysis might be coupled to polypeptide translocation in protein quality control machines.
New Frontiers in NanoBiotechnology: Monitoring the Protein Function With Single Protein Resolution
2005-03-29
Protein (GFP) is a spontaneously fluorescent polypeptide of 27 kD from the jellyfish Aequorea victoria that absorbs UV-blue light and emits in the...will have vast applications in science. Relationship between structure and optical properties in Green Fluorescent Proteins : A quantum mechanical study...RESEARCH AND DEVELOPMENT Invited talks Folding, stability and fluorescence efficiency of the Green and Red Fluorescent Proteins Saverio Alberti Lab.
Quality Control Test for Sequence-Phenotype Assignments
Ortiz, Maria Teresa Lara; Rosario, Pablo Benjamín Leon; Luna-Nevarez, Pablo; Gamez, Alba Savin; Martínez-del Campo, Ana; Del Rio, Gabriel
2015-01-01
Relating a gene mutation to a phenotype is a common task in different disciplines such as protein biochemistry. In this endeavour, it is common to find false relationships arising from mutations introduced by cells that may be depurated using a phenotypic assay; yet, such phenotypic assays may introduce additional false relationships arising from experimental errors. Here we introduce the use of high-throughput DNA sequencers and statistical analysis aimed to identify incorrect DNA sequence-phenotype assignments and observed that 10–20% of these false assignments are expected in large screenings aimed to identify critical residues for protein function. We further show that this level of incorrect DNA sequence-phenotype assignments may significantly alter our understanding about the structure-function relationship of proteins. We have made available an implementation of our method at http://bis.ifc.unam.mx/en/software/chispas. PMID:25700273
CABS-flex 2.0: a web server for fast simulations of flexibility of protein structures.
Kuriata, Aleksander; Gierut, Aleksandra Maria; Oleniecki, Tymoteusz; Ciemny, Maciej Pawel; Kolinski, Andrzej; Kurcinski, Mateusz; Kmiecik, Sebastian
2018-05-14
Classical simulations of protein flexibility remain computationally expensive, especially for large proteins. A few years ago, we developed a fast method for predicting protein structure fluctuations that uses a single protein model as the input. The method has been made available as the CABS-flex web server and applied in numerous studies of protein structure-function relationships. Here, we present a major update of the CABS-flex web server to version 2.0. The new features include: extension of the method to significantly larger and multimeric proteins, customizable distance restraints and simulation parameters, contact maps and a new, enhanced web server interface. CABS-flex 2.0 is freely available at http://biocomp.chem.uw.edu.pl/CABSflex2.
Leuthaeuser, Janelle B; Knutson, Stacy T; Kumar, Kiran; Babbitt, Patricia C; Fetrow, Jacquelyn S
2015-01-01
The development of accurate protein function annotation methods has emerged as a major unsolved biological problem. Protein similarity networks, one approach to function annotation via annotation transfer, group proteins into similarity-based clusters. An underlying assumption is that the edge metric used to identify such clusters correlates with functional information. In this contribution, this assumption is evaluated by observing topologies in similarity networks using three different edge metrics: sequence (BLAST), structure (TM-Align), and active site similarity (active site profiling, implemented in DASP). Network topologies for four well-studied protein superfamilies (enolase, peroxiredoxin (Prx), glutathione transferase (GST), and crotonase) were compared with curated functional hierarchies and structure. As expected, network topology differs, depending on edge metric; comparison of topologies provides valuable information on structure/function relationships. Subnetworks based on active site similarity correlate with known functional hierarchies at a single edge threshold more often than sequence- or structure-based networks. Sequence- and structure-based networks are useful for identifying sequence and domain similarities and differences; therefore, it is important to consider the clustering goal before deciding appropriate edge metric. Further, conserved active site residues identified in enolase and GST active site subnetworks correspond with published functionally important residues. Extension of this analysis yields predictions of functionally determinant residues for GST subgroups. These results support the hypothesis that active site similarity-based networks reveal clusters that share functional details and lay the foundation for capturing functionally relevant hierarchies using an approach that is both automatable and can deliver greater precision in function annotation than current similarity-based methods. PMID:26073648
NagaSundaram, N; Priya Doss, C George
2011-01-01
Background: Distinguishing the deleterious from the massive number of non-functional nsSNPs that occur within a single genome is a considerable challenge in mutation research. In this approach, we have used the existing in silico methods to explore the mutation-structure-function relationship in the XPAgene. Materials and Methods: We used the Sorting Intolerant From Tolerant (SIFT), Polymorphism Phenotyping (PolyPhen), I-Mutant 2.0, and the Protein Analysis THrough Evolutionary Relationships methods to predict the effects of deleterious nsSNPs on protein function and evaluated the impact of mutation on protein stability by Molecular Dynamics simulations. Results: By comparing the scores of all the four in silico methods, nsSNP with an ID rs104894131 at position C108F was predicted to be highly deleterious. We extended our Molecular dynamics approach to gain insight into the impact of this non-synonymous polymorphism on structural changes that may affect the activity of the XPAgene. Conclusion: Based on the in silico methods score, potential energy, root-mean-square deviation, and root-mean-square fluctuation, we predict that deleterious nsSNP at position C108F would play a significant role in causing disease by the XPA gene. Our approach would present the application of in silicotools in understanding the functional variation from the perspective of structure, evolution, and phenotype. PMID:22190868
del Sol, Antonio; Araúzo-Bravo, Marcos J; Amoros, Dolors; Nussinov, Ruth
2007-01-01
Background Allosteric communications are vital for cellular signaling. Here we explore a relationship between protein architectural organization and shortcuts in signaling pathways. Results We show that protein domains consist of modules interconnected by residues that mediate signaling through the shortest pathways. These mediating residues tend to be located at the inter-modular boundaries, which are more rigid and display a larger number of long-range interactions than intra-modular regions. The inter-modular boundaries contain most of the residues centrally conserved in the protein fold, which may be crucial for information transfer between amino acids. Our approach to modular decomposition relies on a representation of protein structures as residue-interacting networks, and removal of the most central residue contacts, which are assumed to be crucial for allosteric communications. The modular decomposition of 100 multi-domain protein structures indicates that modules constitute the building blocks of domains. The analysis of 13 allosteric proteins revealed that modules characterize experimentally identified functional regions. Based on the study of an additional functionally annotated dataset of 115 proteins, we propose that high-modularity modules include functional sites and are the basic functional units. We provide examples (the Gαs subunit and P450 cytochromes) to illustrate that the modular architecture of active sites is linked to their functional specialization. Conclusion Our method decomposes protein structures into modules, allowing the study of signal transmission between functional sites. A modular configuration might be advantageous: it allows signaling proteins to expand their regulatory linkages and may elicit a broader range of control mechanisms either via modular combinations or through modulation of inter-modular linkages. PMID:17531094
Scoring functions for protein-protein interactions.
Moal, Iain H; Moretti, Rocco; Baker, David; Fernández-Recio, Juan
2013-12-01
The computational evaluation of protein-protein interactions will play an important role in organising the wealth of data being generated by high-throughput initiatives. Here we discuss future applications, report recent developments and identify areas requiring further investigation. Many functions have been developed to quantify the structural and energetic properties of interacting proteins, finding use in interrelated challenges revolving around the relationship between sequence, structure and binding free energy. These include loop modelling, side-chain refinement, docking, multimer assembly, affinity prediction, affinity change upon mutation, hotspots location and interface design. Information derived from models optimised for one of these challenges can be used to benefit the others, and can be unified within the theoretical frameworks of multi-task learning and Pareto-optimal multi-objective learning. Copyright © 2013 Elsevier Ltd. All rights reserved.
Inorganic pyrophosphatases: structural diversity serving the function
NASA Astrophysics Data System (ADS)
Samygina, V. R.
2016-05-01
The review is devoted to ubiquitous enzymes, inorganic pyrophosphatases, which are essential in all living organisms. Despite the long history of investigations, these enzymes continue to attract interest. The review focuses on the three-dimensional structures of various representatives of this class of proteins. The structural diversity, the relationship between the structure and some properties of pyrophosphatases and various mechanisms of enzyme action related to the structural diversity of these enzymes are discussed. Interactions of pyrophosphatase with other proteins and possible practical applications are considered. The bibliography includes 56 references.
da Fonseca, Néli José; Lima Afonso, Marcelo Querino; Pedersolli, Natan Gonçalves; de Oliveira, Lucas Carrijo; Andrade, Dhiego Souto; Bleicher, Lucas
2017-10-28
Flaviviruses are responsible for serious diseases such as dengue, yellow fever, and zika fever. Their genomes encode a polyprotein which, after cleavage, results in three structural and seven non-structural proteins. Homologous proteins can be studied by conservation and coevolution analysis as detected in multiple sequence alignments, usually reporting positions which are strictly necessary for the structure and/or function of all members in a protein family or which are involved in a specific sub-class feature requiring the coevolution of residue sets. This study provides a complete conservation and coevolution analysis on all flaviviruses non-structural proteins, with results mapped on all well-annotated available sequences. A literature review on the residues found in the analysis enabled us to compile available information on their roles and distribution among different flaviviruses. Also, we provide the mapping of conserved and coevolved residues for all sequences currently in SwissProt as a supplementary material, so that particularities in different viruses can be easily analyzed. Copyright © 2017 Elsevier Inc. All rights reserved.
3D Printing of Protein Models in an Undergraduate Laboratory: Leucine Zippers
ERIC Educational Resources Information Center
Meyer, Scott C.
2015-01-01
An upper-division undergraduate laboratory experiment is described that explores the structure/function relationship of protein domains, namely leucine zippers, through a molecular graphics computer program and physical models fabricated by 3D printing. By generating solvent accessible surfaces and color-coding hydrophobic, basic, and acidic amino…
ERIC Educational Resources Information Center
Hati, Sanchita; Bhattacharyya, Sudeep
2016-01-01
A project-based biophysical chemistry laboratory course, which is offered to the biochemistry and molecular biology majors in their senior year, is described. In this course, the classroom study of the structure-function of biomolecules is integrated with the discovery-guided laboratory study of these molecules using computer modeling and…
FunTree: advances in a resource for exploring and contextualising protein function evolution.
Sillitoe, Ian; Furnham, Nicholas
2016-01-04
FunTree is a resource that brings together protein sequence, structure and functional information, including overall chemical reaction and mechanistic data, for structurally defined domain superfamilies. Developed in tandem with the CATH database, the original FunTree contained just 276 superfamilies focused on enzymes. Here, we present an update of FunTree that has expanded to include 2340 superfamilies including both enzymes and proteins with non-enzymatic functions annotated by Gene Ontology (GO) terms. This allows the investigation of how novel functions have evolved within a structurally defined superfamily and provides a means to analyse trends across many superfamilies. This is done not only within the context of a protein's sequence and structure but also the relationships of their functions. New measures of functional similarity have been integrated, including for enzymes comparisons of overall reactions based on overall bond changes, reaction centres (the local environment atoms involved in the reaction) and the sub-structure similarities of the metabolites involved in the reaction and for non-enzymes semantic similarities based on the GO. To identify and highlight changes in function through evolution, ancestral character estimations are made and presented. All this is accessible through a new re-designed web interface that can be found at http://www.funtree.info. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Unraveling secrets of telomeres: one molecule at a time
Lin, Jiangguo; Kaur, Parminder; Countryman, Preston; Opresko, Patricia L.; Wang, Hong
2016-01-01
Telomeres play important roles in maintaining the stability of linear chromosomes. Telomere maintenance involves dynamic actions of multiple proteins interacting with long repetitive sequences and complex dynamic DNA structures, such as G-quadruplexes, T-loops and t-circles. Given the heterogeneity and complexity of telomeres, single-molecule approaches are essential to fully understand the structure-function relationships that govern telomere maintenance. In this review, we present a brief overview of the principles of single-molecule imaging and manipulation techniques. We then highlight results obtained from applying these single-molecule techniques for studying structure, dynamics and functions of G-quadruplexes, telomerase, and shelterin proteins. PMID:24569170
Unzippers, Resolvers and Sensors: A Structural and Functional Biochemistry Tale of RNA Helicases
Leitão, Ana Lúcia; Costa, Marina C.; Enguita, Francisco J.
2015-01-01
The centrality of RNA within the biological world is an irrefutable fact that currently attracts increasing attention from the scientific community. The panoply of functional RNAs requires the existence of specific biological caretakers, RNA helicases, devoted to maintain the proper folding of those molecules, resolving unstable structures. However, evolution has taken advantage of the specific position and characteristics of RNA helicases to develop new functions for these proteins, which are at the interface of the basic processes for transference of information from DNA to proteins. RNA helicases are involved in many biologically relevant processes, not only as RNA chaperones, but also as signal transducers, scaffolds of molecular complexes, and regulatory elements. Structural biology studies during the last decade, founded in X-ray crystallography, have characterized in detail several RNA-helicases. This comprehensive review summarizes the structural knowledge accumulated in the last two decades within this family of proteins, with special emphasis on the structure-function relationships of the most widely-studied families of RNA helicases: the DEAD-box, RIG-I-like and viral NS3 classes. PMID:25622248
ERIC Educational Resources Information Center
Maza, Johnathan C.; Villa, Jordan K.; Landino, Lisa M.; Young, Douglas D
2016-01-01
The site-specific introduction of unnatural amino acids (UAAs) has been demonstrated to be a useful tool in protein engineering. Moreover, the incorporation of a UAA into a protein has become feasible with the increased commercial availability of UAAs and robust expression plasmids. In addition to the ease of incorporation, the concepts utilized…
Structure-function relationships in the evolutionary framework of spermine oxidase.
Cervelli, Manuela; Salvi, Daniele; Polticelli, Fabio; Amendola, Roberto; Mariottini, Paolo
2013-06-01
Spermine oxidase is a FAD-dependent enzyme that specifically oxidizes spermine, and plays a central role in the highly regulated catabolism of polyamines in vertebrates. The spermine oxidase substrate is specifically spermine, a tetramine that plays mandatory roles in several cell functions, such as DNA synthesis, cellular proliferation, modulation of ion channels function, cellular signalling, nitric oxide synthesis and inhibition of immune responses. The oxidative products of spermine oxidase activity are spermidine, H2O2 and the aldehyde 3-aminopropanal that spontaneously turns into acrolein. In this study the reconstruction of the phylogenetic relationships among spermine oxidase proteins from different vertebrate taxa allowed to infer their molecular evolutionary history, and assisted in elucidating the conservation of structural and functional properties of this enzyme family. The amino acid residues, which have been hypothesized or demonstrated to play a pivotal role in the enzymatic activity, and substrate specificity are here analysed to obtain a comprehensive and updated view of the structure-function relationships in the evolution of spermine oxidase.
Structure-function-property-design interplay in biopolymers: spider silk.
Tokareva, Olena; Jacobsen, Matthew; Buehler, Markus; Wong, Joyce; Kaplan, David L
2014-04-01
Spider silks have been a focus of research for almost two decades due to their outstanding mechanical and biophysical properties. Recent advances in genetic engineering have led to the synthesis of recombinant spider silks, thus helping to unravel a fundamental understanding of structure-function-property relationships. The relationships between molecular composition, secondary structures and mechanical properties found in different types of spider silks are described, along with a discussion of artificial spinning of these proteins and their bioapplications, including the role of silks in biomineralization and fabrication of biomaterials with controlled properties. Copyright © 2013 Acta Materialia Inc. Published by Elsevier Ltd. All rights reserved.
CAB-Align: A Flexible Protein Structure Alignment Method Based on the Residue-Residue Contact Area.
Terashi, Genki; Takeda-Shitaka, Mayuko
2015-01-01
Proteins are flexible, and this flexibility has an essential functional role. Flexibility can be observed in loop regions, rearrangements between secondary structure elements, and conformational changes between entire domains. However, most protein structure alignment methods treat protein structures as rigid bodies. Thus, these methods fail to identify the equivalences of residue pairs in regions with flexibility. In this study, we considered that the evolutionary relationship between proteins corresponds directly to the residue-residue physical contacts rather than the three-dimensional (3D) coordinates of proteins. Thus, we developed a new protein structure alignment method, contact area-based alignment (CAB-align), which uses the residue-residue contact area to identify regions of similarity. The main purpose of CAB-align is to identify homologous relationships at the residue level between related protein structures. The CAB-align procedure comprises two main steps: First, a rigid-body alignment method based on local and global 3D structure superposition is employed to generate a sufficient number of initial alignments. Then, iterative dynamic programming is executed to find the optimal alignment. We evaluated the performance and advantages of CAB-align based on four main points: (1) agreement with the gold standard alignment, (2) alignment quality based on an evolutionary relationship without 3D coordinate superposition, (3) consistency of the multiple alignments, and (4) classification agreement with the gold standard classification. Comparisons of CAB-align with other state-of-the-art protein structure alignment methods (TM-align, FATCAT, and DaliLite) using our benchmark dataset showed that CAB-align performed robustly in obtaining high-quality alignments and generating consistent multiple alignments with high coverage and accuracy rates, and it performed extremely well when discriminating between homologous and nonhomologous pairs of proteins in both single and multi-domain comparisons. The CAB-align software is freely available to academic users as stand-alone software at http://www.pharm.kitasato-u.ac.jp/bmd/bmd/Publications.html.
Chang, Yi-Chien; Hu, Zhenjun; Rachlin, John; Anton, Brian P; Kasif, Simon; Roberts, Richard J; Steffen, Martin
2016-01-04
The COMBREX database (COMBREX-DB; combrex.bu.edu) is an online repository of information related to (i) experimentally determined protein function, (ii) predicted protein function, (iii) relationships among proteins of unknown function and various types of experimental data, including molecular function, protein structure, and associated phenotypes. The database was created as part of the novel COMBREX (COMputational BRidges to EXperiments) effort aimed at accelerating the rate of gene function validation. It currently holds information on ∼ 3.3 million known and predicted proteins from over 1000 completely sequenced bacterial and archaeal genomes. The database also contains a prototype recommendation system for helping users identify those proteins whose experimental determination of function would be most informative for predicting function for other proteins within protein families. The emphasis on documenting experimental evidence for function predictions, and the prioritization of uncharacterized proteins for experimental testing distinguish COMBREX from other publicly available microbial genomics resources. This article describes updates to COMBREX-DB since an initial description in the 2011 NAR Database Issue. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
PASS2: an automated database of protein alignments organised as structural superfamilies.
Bhaduri, Anirban; Pugalenthi, Ganesan; Sowdhamini, Ramanathan
2004-04-02
The functional selection and three-dimensional structural constraints of proteins in nature often relates to the retention of significant sequence similarity between proteins of similar fold and function despite poor sequence identity. Organization of structure-based sequence alignments for distantly related proteins, provides a map of the conserved and critical regions of the protein universe that is useful for the analysis of folding principles, for the evolutionary unification of protein families and for maximizing the information return from experimental structure determination. The Protein Alignment organised as Structural Superfamily (PASS2) database represents continuously updated, structural alignments for evolutionary related, sequentially distant proteins. An automated and updated version of PASS2 is, in direct correspondence with SCOP 1.63, consisting of sequences having identity below 40% among themselves. Protein domains have been grouped into 628 multi-member superfamilies and 566 single member superfamilies. Structure-based sequence alignments for the superfamilies have been obtained using COMPARER, while initial equivalencies have been derived from a preliminary superposition using LSQMAN or STAMP 4.0. The final sequence alignments have been annotated for structural features using JOY4.0. The database is supplemented with sequence relatives belonging to different genomes, conserved spatially interacting and structural motifs, probabilistic hidden markov models of superfamilies based on the alignments and useful links to other databases. Probabilistic models and sensitive position specific profiles obtained from reliable superfamily alignments aid annotation of remote homologues and are useful tools in structural and functional genomics. PASS2 presents the phylogeny of its members both based on sequence and structural dissimilarities. Clustering of members allows us to understand diversification of the family members. The search engine has been improved for simpler browsing of the database. The database resolves alignments among the structural domains consisting of evolutionarily diverged set of sequences. Availability of reliable sequence alignments of distantly related proteins despite poor sequence identity and single-member superfamilies permit better sampling of structures in libraries for fold recognition of new sequences and for the understanding of protein structure-function relationships of individual superfamilies. PASS2 is accessible at http://www.ncbs.res.in/~faculty/mini/campass/pass2.html
Theodoridou, Katerina; Yu, Peiqiang
2013-06-12
Protein quality relies not only on total protein but also on protein inherent structures. The most commonly occurring protein secondary structures (α-helix and β-sheet) may influence protein quality, nutrient utilization, and digestive behavior. The objectives of this study were to reveal the protein molecular structures of canola meal (yellow and brown) and presscake as affected by the heat-processing methods and to investigate the relationship between structure changes and protein rumen degradations kinetics, estimated protein intestinal digestibility, degraded protein balance, and metabolizable protein. Heat-processing conditions resulted in a higher value for α-helix and β-sheet for brown canola presscake compared to brown canola meal. The multivariate molecular spectral analyses (PCA, CLA) showed that there were significant molecular structural differences in the protein amide I and II fingerprint region (ca. 1700-1480 cm(-1)) between the brown canola meal and presscake. The in situ degradation parameters, amide I and II, and α-helix to β-sheet ratio (R_a_β) were positively correlated with the degradable fraction and the degradation rate. Modeling results showed that α-helix was positively correlated with the truly absorbed rumen synthesized microbial protein in the small intestine when using both the Dutch DVE/OEB system and the NRC-2001 model. Concerning the protein profiles, R_a_β was a better predictor for crude protein (79%) and for neutral detergent insoluble crude protein (68%). In conclusion, ATR-FT/IR molecular spectroscopy may be used to rapidly characterize feed structures at the molecular level and also as a potential predictor of feed functionality, digestive behavior, and nutrient utilization of canola feed.
Nanomechanical strength mechanisms of hierarchical biological materials and tissues.
Buehler, Markus J; Ackbarow, Theodor
2008-12-01
Biological protein materials (BPMs), intriguing hierarchical structures formed by assembly of chemical building blocks, are crucial for critical functions of life. The structural details of BPMs are fascinating: They represent a combination of universally found motifs such as alpha-helices or beta-sheets with highly adapted protein structures such as cytoskeletal networks or spider silk nanocomposites. BPMs combine properties like strength and robustness, self-healing ability, adaptability, changeability, evolvability and others into multi-functional materials at a level unmatched in synthetic materials. The ability to achieve these properties depends critically on the particular traits of these materials, first and foremost their hierarchical architecture and seamless integration of material and structure, from nano to macro. Here, we provide a brief review of this field and outline new research directions, along with a review of recent research results in the development of structure-property relationships of biological protein materials exemplified in a study of vimentin intermediate filaments.
Morales, Yalemi; Olsen, Keith J; Bulcher, Jacqueline M; Johnson, Sean J
2018-01-01
The FRH (frequency-interacting RNA helicase) protein is the Neurospora crassa homolog of yeast Mtr4, an essential RNA helicase that plays a central role in RNA metabolism as an activator of the nuclear RNA exosome. FRH is also a required component of the circadian clock, mediating protein interactions that result in the rhythmic repression of gene expression. Here we show that FRH unwinds RNA substrates in vitro with a kinetic profile similar to Mtr4, indicating that while FRH has acquired additional functionality, its core helicase function remains intact. In contrast with the earlier FRH structures, a new crystal form of FRH results in an ATP binding site that is undisturbed by crystal contacts and adopts a conformation consistent with nucleotide binding and hydrolysis. Strikingly, this new FRH structure adopts an arch domain conformation that is dramatically altered from previous structures. Comparison of the existing FRH structures reveals conserved hinge points that appear to facilitate arch motion. Regions in the arch have been previously shown to mediate a variety of protein-protein interactions critical for RNA surveillance and circadian clock functions. The conformational changes highlighted in the FRH structures provide a platform for investigating the relationship between arch dynamics and Mtr4/FRH function.
Automated identification of functional dynamic networks from X-ray crystallography
van den Bedem, Henry; Bhabha, Gira; Yang, Kun; Wright, Peter E.; Fraser, James S.
2013-01-01
Protein function often depends on the exchange between conformational substates. Allosteric ligand binding or distal mutations can stabilize specific active site conformations and consequently alter protein function. In addition to comparing independently determined X-ray crystal structures, alternative conformations observed at low levels of electron density have the potential to provide mechanistic insights into conformational dynamics. Here, we report a new multi-conformer contact network algorithm (CONTACT) that identifies networks of conformationally heterogeneous residues directly from high-resolution X-ray crystallography data. Contact networks in Escherichia coli dihydrofolate reductase (ecDHFR) predict the long-range pattern of NMR chemical shift perturbations of an allosteric mutation. A comparison of contact networks in wild type and mutant ecDHFR suggests how mutations that alter optimized networks of coordinated motions can impair catalytic function. Thus, CONTACT-guided mutagenesis will allow the structure-dynamics-function relationship to be exploited in protein engineering and design. PMID:23913260
Structure to function: Spider silk and human collagen
NASA Astrophysics Data System (ADS)
Rabotyagova, Olena S.
Nature has the ability to assemble a variety of simple molecules into complex functional structures with diverse properties. Collagens, silks and muscles fibers are some examples of fibrous proteins with self-assembling properties. One of the great challenges facing Science is to mimic these designs in Nature to find a way to construct molecules that are capable of organizing into functional supra-structures by self-assembly. In order to do so, a construction kit consisting of molecular building blocks along with a complete understanding on how to form functional materials is required. In this current research, the focus is on spider silk and collagen as fibrous protein-based biopolymers that can shed light on how to generate nanostructures through the complex process of self-assembly. Spider silk in fiber form offers a unique combination of high elasticity, toughness, and mechanical strength, along with biological compatibility and biodegrability. Spider silk is an example of a natural block copolymer, in which hydrophobic and hydrophilic blocks are linked together generating polymers that organize into functional materials with extraordinary properties. Since silks resemble synthetic block copolymer systems, we adopted the principles of block copolymer design from the synthetic polymer literature to build block copolymers based on spider silk sequences. Moreover, we consider spider silk to be an important model with which to study the relationships between structure and properties in our system. Thus, the first part of this work was dedicated to a novel family of spider silk block copolymers, where we generated a new family of functional spider silk-like block copolymers through recombinant DNA technology. To provide fundamental insight into relationships between peptide primary sequence, block composition, and block length and observed morphological and structural features, we used these bioengineered spider silk block copolymers to study secondary structure, morphological features and assembly. Aside from fundamental perspectives, we anticipate that these results will provide a blueprint for the design of precise materials for a range of potential applications such as controlled release devices, functional coatings, components of tissue regeneration materials and environmentally friendly polymers in future studies. In the second part of this work, human collagen type I was studied as another representative of the family of fibrous proteins. Collagen type I is the most abundant extracellular matrix protein in the human body, providing the basis for tissue structure and directing cellular functions. Collagen has a complex structural hierarchy, organized at different length scales, including the characteristic triple helical feature. In the present study we assessed the relationship between collagen structure (native vs. denatured) and sensitivity to UV radiation with a focus on changes in the primary structure, conformation, microstructure and material properties. Free radical reactions are involved in collagen degradation and a mechanism for UV-induced collagen degradation related to structure was proposed. The results from this study demonstrated the role of collagen supramolecular organization (triple helix) in the context of the effects of electromagnetic radiation on extracellular matrices. Owing to the fact that both silks and collagens are proteins that have found widespread interest for biomaterial related needs, we anticipate that the current studies will serve as a foundation for future biomaterial designs with controlled properties. Furthermore, fundamental insight into self-assembly and environmentally-2mediated degradation, will build a foundation for fundamental understanding of the remodeling and functions of these types of fibrous proteins in vivo and in vitro. This type of insight is essential for many areas of scientific inquiry, from drug delivery, to scaffolds for tissue engineering, and to the stability of materials in space.
Structure-guided Protein Transition Modeling with a Probabilistic Roadmap Algorithm.
Maximova, Tatiana; Plaku, Erion; Shehu, Amarda
2016-07-07
Proteins are macromolecules in perpetual motion, switching between structural states to modulate their function. A detailed characterization of the precise yet complex relationship between protein structure, dynamics, and function requires elucidating transitions between functionally-relevant states. Doing so challenges both wet and dry laboratories, as protein dynamics involves disparate temporal scales. In this paper we present a novel, sampling-based algorithm to compute transition paths. The algorithm exploits two main ideas. First, it leverages known structures to initialize its search and define a reduced conformation space for rapid sampling. This is key to address the insufficient sampling issue suffered by sampling-based algorithms. Second, the algorithm embeds samples in a nearest-neighbor graph where transition paths can be efficiently computed via queries. The algorithm adapts the probabilistic roadmap framework that is popular in robot motion planning. In addition to efficiently computing lowest-cost paths between any given structures, the algorithm allows investigating hypotheses regarding the order of experimentally-known structures in a transition event. This novel contribution is likely to open up new venues of research. Detailed analysis is presented on multiple-basin proteins of relevance to human disease. Multiscaling and the AMBER ff14SB force field are used to obtain energetically-credible paths at atomistic detail.
Pukáncsik, Mária; Orbán, Ágnes; Nagy, Kinga; Matsuo, Koichi; Gekko, Kunihiko; Maurin, Damien; Hart, Darren; Kézsmárki, István; Vertessy, Beata G.
2016-01-01
A novel uracil-DNA degrading protein factor (termed UDE) was identified in Drosophila melanogaster with no significant structural and functional homology to other uracil-DNA binding or processing factors. Determination of the 3D structure of UDE is excepted to provide key information on the description of the molecular mechanism of action of UDE catalysis, as well as in general uracil-recognition and nuclease action. Towards this long-term aim, the random library ESPRIT technology was applied to the novel protein UDE to overcome problems in identifying soluble expressing constructs given the absence of precise information on domain content and arrangement. Nine constructs of UDE were chosen to decipher structural and functional relationships. Vacuum ultraviolet circular dichroism (VUVCD) spectroscopy was performed to define the secondary structure content and location within UDE and its truncated variants. The quantitative analysis demonstrated exclusive α-helical content for the full-length protein, which is preserved in the truncated constructs. Arrangement of α-helical bundles within the truncated protein segments suggested new domain boundaries which differ from the conserved motifs determined by sequence-based alignment of UDE homologues. Here we demonstrate that the combination of ESPRIT and VUVCD spectroscopy provides a new structural description of UDE and confirms that the truncated constructs are useful for further detailed functional studies. PMID:27273007
Revealing protein functions based on relationships of interacting proteins and GO terms.
Teng, Zhixia; Guo, Maozu; Liu, Xiaoyan; Tian, Zhen; Che, Kai
2017-09-20
In recent years, numerous computational methods predicted protein function based on the protein-protein interaction (PPI) network. These methods supposed that two proteins share the same function if they interact with each other. However, it is reported by recent studies that the functions of two interacting proteins may be just related. It will mislead the prediction of protein function. Therefore, there is a need for investigating the functional relationship between interacting proteins. In this paper, the functional relationship between interacting proteins is studied and a novel method, called as GoDIN, is advanced to annotate functions of interacting proteins in Gene Ontology (GO) context. It is assumed that the functional difference between interacting proteins can be expressed by semantic difference between GO term and its relatives. Thus, the method uses GO term and its relatives to annotate the interacting proteins separately according to their functional roles in the PPI network. The method is validated by a series of experiments and compared with the concerned method. The experimental results confirm the assumption and suggest that GoDIN is effective on predicting functions of protein. This study demonstrates that: (1) interacting proteins are not equal in the PPI network, and their function may be same or similar, or just related; (2) functional difference between interacting proteins can be measured by their degrees in the PPI network; (3) functional relationship between interacting proteins can be expressed by relationship between GO term and its relatives.
Statistical discovery of site inter-dependencies in sub-molecular hierarchical protein structuring
2012-01-01
Background Much progress has been made in understanding the 3D structure of proteins using methods such as NMR and X-ray crystallography. The resulting 3D structures are extremely informative, but do not always reveal which sites and residues within the structure are of special importance. Recently, there are indications that multiple-residue, sub-domain structural relationships within the larger 3D consensus structure of a protein can be inferred from the analysis of the multiple sequence alignment data of a protein family. These intra-dependent clusters of associated sites are used to indicate hierarchical inter-residue relationships within the 3D structure. To reveal the patterns of associations among individual amino acids or sub-domain components within the structure, we apply a k-modes attribute (aligned site) clustering algorithm to the ubiquitin and transthyretin families in order to discover associations among groups of sites within the multiple sequence alignment. We then observe what these associations imply within the 3D structure of these two protein families. Results The k-modes site clustering algorithm we developed maximizes the intra-group interdependencies based on a normalized mutual information measure. The clusters formed correspond to sub-structural components or binding and interface locations. Applying this data-directed method to the ubiquitin and transthyretin protein family multiple sequence alignments as a test bed, we located numerous interesting associations of interdependent sites. These clusters were then arranged into cluster tree diagrams which revealed four structural sub-domains within the single domain structure of ubiquitin and a single large sub-domain within transthyretin associated with the interface among transthyretin monomers. In addition, several clusters of mutually interdependent sites were discovered for each protein family, each of which appear to play an important role in the molecular structure and/or function. Conclusions Our results demonstrate that the method we present here using a k-modes site clustering algorithm based on interdependency evaluation among sites obtained from a sequence alignment of homologous proteins can provide significant insights into the complex, hierarchical inter-residue structural relationships within the 3D structure of a protein family. PMID:22793672
Statistical discovery of site inter-dependencies in sub-molecular hierarchical protein structuring.
Durston, Kirk K; Chiu, David Ky; Wong, Andrew Kc; Li, Gary Cl
2012-07-13
Much progress has been made in understanding the 3D structure of proteins using methods such as NMR and X-ray crystallography. The resulting 3D structures are extremely informative, but do not always reveal which sites and residues within the structure are of special importance. Recently, there are indications that multiple-residue, sub-domain structural relationships within the larger 3D consensus structure of a protein can be inferred from the analysis of the multiple sequence alignment data of a protein family. These intra-dependent clusters of associated sites are used to indicate hierarchical inter-residue relationships within the 3D structure. To reveal the patterns of associations among individual amino acids or sub-domain components within the structure, we apply a k-modes attribute (aligned site) clustering algorithm to the ubiquitin and transthyretin families in order to discover associations among groups of sites within the multiple sequence alignment. We then observe what these associations imply within the 3D structure of these two protein families. The k-modes site clustering algorithm we developed maximizes the intra-group interdependencies based on a normalized mutual information measure. The clusters formed correspond to sub-structural components or binding and interface locations. Applying this data-directed method to the ubiquitin and transthyretin protein family multiple sequence alignments as a test bed, we located numerous interesting associations of interdependent sites. These clusters were then arranged into cluster tree diagrams which revealed four structural sub-domains within the single domain structure of ubiquitin and a single large sub-domain within transthyretin associated with the interface among transthyretin monomers. In addition, several clusters of mutually interdependent sites were discovered for each protein family, each of which appear to play an important role in the molecular structure and/or function. Our results demonstrate that the method we present here using a k-modes site clustering algorithm based on interdependency evaluation among sites obtained from a sequence alignment of homologous proteins can provide significant insights into the complex, hierarchical inter-residue structural relationships within the 3D structure of a protein family.
VarMod: modelling the functional effects of non-synonymous variants
Pappalardo, Morena; Wass, Mark N.
2014-01-01
Unravelling the genotype–phenotype relationship in humans remains a challenging task in genomics studies. Recent advances in sequencing technologies mean there are now thousands of sequenced human genomes, revealing millions of single nucleotide variants (SNVs). For non-synonymous SNVs present in proteins the difficulties of the problem lie in first identifying those nsSNVs that result in a functional change in the protein among the many non-functional variants and in turn linking this functional change to phenotype. Here we present VarMod (Variant Modeller) a method that utilises both protein sequence and structural features to predict nsSNVs that alter protein function. VarMod develops recent observations that functional nsSNVs are enriched at protein–protein interfaces and protein–ligand binding sites and uses these characteristics to make predictions. In benchmarking on a set of nearly 3000 nsSNVs VarMod performance is comparable to an existing state of the art method. The VarMod web server provides extensive resources to investigate the sequence and structural features associated with the predictions including visualisation of protein models and complexes via an interactive JSmol molecular viewer. VarMod is available for use at http://www.wasslab.org/varmod. PMID:24906884
Structural basis for host membrane remodeling induced by protein 2B of hepatitis A virus.
Vives-Adrián, Laia; Garriga, Damià; Buxaderas, Mònica; Fraga, Joana; Pereira, Pedro José Barbosa; Macedo-Ribeiro, Sandra; Verdaguer, Núria
2015-04-01
The complexity of viral RNA synthesis and the numerous participating factors require a mechanism to topologically coordinate and concentrate these multiple viral and cellular components, ensuring a concerted function. Similarly to all other positive-strand RNA viruses, picornaviruses induce rearrangements of host intracellular membranes to create structures that act as functional scaffolds for genome replication. The membrane-targeting proteins 2B and 2C, their precursor 2BC, and protein 3A appear to be primarily involved in membrane remodeling. Little is known about the structure of these proteins and the mechanisms by which they induce massive membrane remodeling. Here we report the crystal structure of the soluble region of hepatitis A virus (HAV) protein 2B, consisting of two domains: a C-terminal helical bundle preceded by an N-terminally curved five-stranded antiparallel β-sheet that displays striking structural similarity to the β-barrel domain of enteroviral 2A proteins. Moreover, the helicoidal arrangement of the protein molecules in the crystal provides a model for 2B-induced host membrane remodeling during HAV infection. No structural information is currently available for the 2B protein of any picornavirus despite it being involved in a critical process in viral factory formation: the rearrangement of host intracellular membranes. Here we present the structure of the soluble domain of the 2B protein of hepatitis A virus (HAV). Its arrangement, both in crystals and in solution under physiological conditions, can help to understand its function and sheds some light on the membrane rearrangement process, a putative target of future antiviral drugs. Moreover, this first structure of a picornaviral 2B protein also unveils a closer evolutionary relationship between the hepatovirus and enterovirus genera within the Picornaviridae family. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Structural Basis for Host Membrane Remodeling Induced by Protein 2B of Hepatitis A Virus
Vives-Adrián, Laia; Garriga, Damià; Buxaderas, Mònica; Fraga, Joana; Pereira, Pedro José Barbosa
2015-01-01
ABSTRACT The complexity of viral RNA synthesis and the numerous participating factors require a mechanism to topologically coordinate and concentrate these multiple viral and cellular components, ensuring a concerted function. Similarly to all other positive-strand RNA viruses, picornaviruses induce rearrangements of host intracellular membranes to create structures that act as functional scaffolds for genome replication. The membrane-targeting proteins 2B and 2C, their precursor 2BC, and protein 3A appear to be primarily involved in membrane remodeling. Little is known about the structure of these proteins and the mechanisms by which they induce massive membrane remodeling. Here we report the crystal structure of the soluble region of hepatitis A virus (HAV) protein 2B, consisting of two domains: a C-terminal helical bundle preceded by an N-terminally curved five-stranded antiparallel β-sheet that displays striking structural similarity to the β-barrel domain of enteroviral 2A proteins. Moreover, the helicoidal arrangement of the protein molecules in the crystal provides a model for 2B-induced host membrane remodeling during HAV infection. IMPORTANCE No structural information is currently available for the 2B protein of any picornavirus despite it being involved in a critical process in viral factory formation: the rearrangement of host intracellular membranes. Here we present the structure of the soluble domain of the 2B protein of hepatitis A virus (HAV). Its arrangement, both in crystals and in solution under physiological conditions, can help to understand its function and sheds some light on the membrane rearrangement process, a putative target of future antiviral drugs. Moreover, this first structure of a picornaviral 2B protein also unveils a closer evolutionary relationship between the hepatovirus and enterovirus genera within the Picornaviridae family. PMID:25589659
Proteins with Novel Structure, Function and Dynamics
NASA Technical Reports Server (NTRS)
Pohorille, Andrew
2014-01-01
Recently, a small enzyme that ligates two RNA fragments with the rate of 10(exp 6) above background was evolved in vitro (Seelig and Szostak, Nature 448:828-831, 2007). This enzyme does not resemble any contemporary protein (Chao et al., Nature Chem. Biol. 9:81-83, 2013). It consists of a dynamic, catalytic loop, a small, rigid core containing two zinc ions coordinated by neighboring amino acids, and two highly flexible tails that might be unimportant for protein function. In contrast to other proteins, this enzyme does not contain ordered secondary structure elements, such as alpha-helix or beta-sheet. The loop is kept together by just two interactions of a charged residue and a histidine with a zinc ion, which they coordinate on the opposite side of the loop. Such structure appears to be very fragile. Surprisingly, computer simulations indicate otherwise. As the coordinating, charged residue is mutated to alanine, another, nearby charged residue takes its place, thus keeping the structure nearly intact. If this residue is also substituted by alanine a salt bridge involving two other, charged residues on the opposite sides of the loop keeps the loop in place. These adjustments are facilitated by high flexibility of the protein. Computational predictions have been confirmed experimentally, as both mutants retain full activity and overall structure. These results challenge our notions about what is required for protein activity and about the relationship between protein dynamics, stability and robustness. We hypothesize that small, highly dynamic proteins could be both active and fault tolerant in ways that many other proteins are not, i.e. they can adjust to retain their structure and activity even if subjected to mutations in structurally critical regions. This opens the doors for designing proteins with novel functions, structures and dynamics that have not been yet considered.
Godwin, Ryan C; Melvin, Ryan L; Gmeiner, William H; Salsbury, Freddie R
2017-01-31
Zinc-finger proteins are regulators of critical signaling pathways for various cellular functions, including apoptosis and oncogenesis. Here, we investigate how binding site protonation states and zinc coordination influence protein structure, dynamics, and ultimately function, as these pivotal regulatory proteins are increasingly important for protein engineering and therapeutic discovery. To better understand the thermodynamics and dynamics of the zinc finger of NEMO (NF-κB essential modulator), as well as the role of zinc, we present results of 20 μs molecular dynamics trajectories, 5 μs for each of four active site configurations. Consistent with experimental evidence, the zinc ion is essential for mechanical stabilization of the functional, folded conformation. Hydrogen bond motifs are unique for deprotonated configurations yet overlap in protonated cases. Correlated motions and principal component analysis corroborate the similarity of the protonated configurations and highlight unique relationships of the zinc-bound configuration. We hypothesize a potential mechanism for zinc binding from results of the thiol configurations. The deprotonated, zinc-bound configuration alone predominantly maintains its tertiary structure throughout all 5 μs and alludes rare conformations potentially important for (im)proper zinc-finger-related protein-protein or protein-DNA interactions.
D'Antonio, Matteo; Masseroli, Marco
2009-01-01
Background Alternative splicing has been demonstrated to affect most of human genes; different isoforms from the same gene encode for proteins which differ for a limited number of residues, thus yielding similar structures. This suggests possible correlations between alternative splicing and protein structure. In order to support the investigation of such relationships, we have developed the Alternative Splicing and Protein Structure Scrutinizer (PASS), a Web application to automatically extract, integrate and analyze human alternative splicing and protein structure data sparsely available in the Alternative Splicing Database, Ensembl databank and Protein Data Bank. Primary data from these databases have been integrated and analyzed using the Protein Identifier Cross-Reference, BLAST, CLUSTALW and FeatureMap3D software tools. Results A database has been developed to store the considered primary data and the results from their analysis; a system of Perl scripts has been implemented to automatically create and update the database and analyze the integrated data; a Web interface has been implemented to make the analyses easily accessible; a database has been created to manage user accesses to the PASS Web application and store user's data and searches. Conclusion PASS automatically integrates data from the Alternative Splicing Database with protein structure data from the Protein Data Bank. Additionally, it comprehensively analyzes the integrated data with publicly available well-known bioinformatics tools in order to generate structural information of isoform pairs. Further analysis of such valuable information might reveal interesting relationships between alternative splicing and protein structure differences, which may be significantly associated with different functions. PMID:19828075
VarMod: modelling the functional effects of non-synonymous variants.
Pappalardo, Morena; Wass, Mark N
2014-07-01
Unravelling the genotype-phenotype relationship in humans remains a challenging task in genomics studies. Recent advances in sequencing technologies mean there are now thousands of sequenced human genomes, revealing millions of single nucleotide variants (SNVs). For non-synonymous SNVs present in proteins the difficulties of the problem lie in first identifying those nsSNVs that result in a functional change in the protein among the many non-functional variants and in turn linking this functional change to phenotype. Here we present VarMod (Variant Modeller) a method that utilises both protein sequence and structural features to predict nsSNVs that alter protein function. VarMod develops recent observations that functional nsSNVs are enriched at protein-protein interfaces and protein-ligand binding sites and uses these characteristics to make predictions. In benchmarking on a set of nearly 3000 nsSNVs VarMod performance is comparable to an existing state of the art method. The VarMod web server provides extensive resources to investigate the sequence and structural features associated with the predictions including visualisation of protein models and complexes via an interactive JSmol molecular viewer. VarMod is available for use at http://www.wasslab.org/varmod. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Advances in Homology Protein Structure Modeling
Xiang, Zhexin
2007-01-01
Homology modeling plays a central role in determining protein structure in the structural genomics project. The importance of homology modeling has been steadily increasing because of the large gap that exists between the overwhelming number of available protein sequences and experimentally solved protein structures, and also, more importantly, because of the increasing reliability and accuracy of the method. In fact, a protein sequence with over 30% identity to a known structure can often be predicted with an accuracy equivalent to a low-resolution X-ray structure. The recent advances in homology modeling, especially in detecting distant homologues, aligning sequences with template structures, modeling of loops and side chains, as well as detecting errors in a model, have contributed to reliable prediction of protein structure, which was not possible even several years ago. The ongoing efforts in solving protein structures, which can be time-consuming and often difficult, will continue to spur the development of a host of new computational methods that can fill in the gap and further contribute to understanding the relationship between protein structure and function. PMID:16787261
The Structure and Function of Non-Collagenous Bone Proteins
NASA Technical Reports Server (NTRS)
Hook, Magnus
1997-01-01
The long-term goal for this program is to determine the structural and functional relationships of bone proteins and proteins that interact with bone. This information will used to design useful pharmacological compounds that will have a beneficial effect in osteoporotic patients and in the osteoporotic-like effects experienced on long duration space missions. The first phase of this program, funded under a cooperative research agreement with NASA through the Texas Medical Center, aimed to develop powerful recombinant expression systems and purification methods for production of large amounts of target proteins. Proteins expressed in sufficient'amount and purity would be characterized by a variety of structural methods, and made available for crystallization studies. In order to increase the likelihood of crystallization and subsequent high resolution solution of structures, we undertook to develop expression of normal and mutant forms of proteins by bacterial and mammalian cells. In addition to the main goals of this program, we would also be able to provide reagents for other related studies, including development of anti-fibrotic and anti-metastatic therapeutics.
Gaia: automated quality assessment of protein structure models.
Kota, Pradeep; Ding, Feng; Ramachandran, Srinivas; Dokholyan, Nikolay V
2011-08-15
Increasing use of structural modeling for understanding structure-function relationships in proteins has led to the need to ensure that the protein models being used are of acceptable quality. Quality of a given protein structure can be assessed by comparing various intrinsic structural properties of the protein to those observed in high-resolution protein structures. In this study, we present tools to compare a given structure to high-resolution crystal structures. We assess packing by calculating the total void volume, the percentage of unsatisfied hydrogen bonds, the number of steric clashes and the scaling of the accessible surface area. We assess covalent geometry by determining bond lengths, angles, dihedrals and rotamers. The statistical parameters for the above measures, obtained from high-resolution crystal structures enable us to provide a quality-score that points to specific areas where a given protein structural model needs improvement. We provide these tools that appraise protein structures in the form of a web server Gaia (http://chiron.dokhlab.org). Gaia evaluates the packing and covalent geometry of a given protein structure and provides quantitative comparison of the given structure to high-resolution crystal structures. dokh@unc.edu Supplementary data are available at Bioinformatics online.
WEBnm@ v2.0: Web server and services for comparing protein flexibility.
Tiwari, Sandhya P; Fuglebakk, Edvin; Hollup, Siv M; Skjærven, Lars; Cragnolini, Tristan; Grindhaug, Svenn H; Tekle, Kidane M; Reuter, Nathalie
2014-12-30
Normal mode analysis (NMA) using elastic network models is a reliable and cost-effective computational method to characterise protein flexibility and by extension, their dynamics. Further insight into the dynamics-function relationship can be gained by comparing protein motions between protein homologs and functional classifications. This can be achieved by comparing normal modes obtained from sets of evolutionary related proteins. We have developed an automated tool for comparative NMA of a set of pre-aligned protein structures. The user can submit a sequence alignment in the FASTA format and the corresponding coordinate files in the Protein Data Bank (PDB) format. The computed normalised squared atomic fluctuations and atomic deformation energies of the submitted structures can be easily compared on graphs provided by the web user interface. The web server provides pairwise comparison of the dynamics of all proteins included in the submitted set using two measures: the Root Mean Squared Inner Product and the Bhattacharyya Coefficient. The Comparative Analysis has been implemented on our web server for NMA, WEBnm@, which also provides recently upgraded functionality for NMA of single protein structures. This includes new visualisations of protein motion, visualisation of inter-residue correlations and the analysis of conformational change using the overlap analysis. In addition, programmatic access to WEBnm@ is now available through a SOAP-based web service. Webnm@ is available at http://apps.cbu.uib.no/webnma . WEBnm@ v2.0 is an online tool offering unique capability for comparative NMA on multiple protein structures. Along with a convenient web interface, powerful computing resources, and several methods for mode analyses, WEBnm@ facilitates the assessment of protein flexibility within protein families and superfamilies. These analyses can give a good view of how the structures move and how the flexibility is conserved over the different structures.
Discovering rules for protein-ligand specificity using support vector inductive logic programming.
Kelley, Lawrence A; Shrimpton, Paul J; Muggleton, Stephen H; Sternberg, Michael J E
2009-09-01
Structural genomics initiatives are rapidly generating vast numbers of protein structures. Comparative modelling is also capable of producing accurate structural models for many protein sequences. However, for many of the known structures, functions are not yet determined, and in many modelling tasks, an accurate structural model does not necessarily tell us about function. Thus, there is a pressing need for high-throughput methods for determining function from structure. The spatial arrangement of key amino acids in a folded protein, on the surface or buried in clefts, is often the determinants of its biological function. A central aim of molecular biology is to understand the relationship between such substructures or surfaces and biological function, leading both to function prediction and to function design. We present a new general method for discovering the features of binding pockets that confer specificity for particular ligands. Using a recently developed machine-learning technique which couples the rule-discovery approach of inductive logic programming with the statistical learning power of support vector machines, we are able to discriminate, with high precision (90%) and recall (86%) between pockets that bind FAD and those that bind NAD on a large benchmark set given only the geometry and composition of the backbone of the binding pocket without the use of docking. In addition, we learn rules governing this specificity which can feed into protein functional design protocols. An analysis of the rules found suggests that key features of the binding pocket may be tied to conformational freedom in the ligand. The representation is sufficiently general to be applicable to any discriminatory binding problem. All programs and data sets are freely available to non-commercial users at http://www.sbg.bio.ic.ac.uk/svilp_ligand/.
ERIC Educational Resources Information Center
Harris, Michelle A.; Peck, Ronald F.; Colton, Shannon; Morris, Jennifer; Neto, Elias Chaibub; Kallio, Julie
2009-01-01
We conducted a controlled investigation to examine whether a combination of computer imagery and tactile tools helps introductory cell biology laboratory undergraduate students better learn about protein structure/function relationships as compared with computer imagery alone. In all five laboratory sections, students used the molecular imaging…
ERIC Educational Resources Information Center
Harle, Marissa; Towns, Marcy H.
2012-01-01
Research that has focused on external representations in biochemistry has uncovered student difficulties in comprehending and interpreting external representations. This study focuses on students' understanding of three external representations (ribbon diagram, wireframe, and hydrophobic/hydrophilic) of the potassium ion channel protein. Analysis…
Identifying and reducing error in cluster-expansion approximations of protein energies.
Hahn, Seungsoo; Ashenberg, Orr; Grigoryan, Gevorg; Keating, Amy E
2010-12-01
Protein design involves searching a vast space for sequences that are compatible with a defined structure. This can pose significant computational challenges. Cluster expansion is a technique that can accelerate the evaluation of protein energies by generating a simple functional relationship between sequence and energy. The method consists of several steps. First, for a given protein structure, a training set of sequences with known energies is generated. Next, this training set is used to expand energy as a function of clusters consisting of single residues, residue pairs, and higher order terms, if required. The accuracy of the sequence-based expansion is monitored and improved using cross-validation testing and iterative inclusion of additional clusters. As a trade-off for evaluation speed, the cluster-expansion approximation causes prediction errors, which can be reduced by including more training sequences, including higher order terms in the expansion, and/or reducing the sequence space described by the cluster expansion. This article analyzes the sources of error and introduces a method whereby accuracy can be improved by judiciously reducing the described sequence space. The method is applied to describe the sequence-stability relationship for several protein structures: coiled-coil dimers and trimers, a PDZ domain, and T4 lysozyme as examples with computationally derived energies, and SH3 domains in amphiphysin-1 and endophilin-1 as examples where the expanded pseudo-energies are obtained from experiments. Our open-source software package Cluster Expansion Version 1.0 allows users to expand their own energy function of interest and thereby apply cluster expansion to custom problems in protein design. © 2010 Wiley Periodicals, Inc.
De novo design of recombinant spider silk proteins for material applications.
Zheng, Ke; Ling, Shengjie
2018-05-21
Spider silks are well known for their superior mechanical properties that are stronger and tougher than steel despite being assembled at close to ambient conditions and using water as the solvent. However, it is a significant challenge to utilize spider silks for practical applications due to their limited sources. Fortunately, genetic engineering techniques offer a promising approach to produce useable amounts of spider silk variants. Starting from these recombinant spider silk proteins, a series of experiments and simulations strategies were developed to improve the recombinant spider silk proteins (RSSP) material design and fabrication with the aim of biomimicking the structure-property-function relationships of spider silks. Accordingly, in this review, we first introduce the structure-property-function relationship of spider silks. Then, we discuss the recent progress in the genetic synthesis of RSSPs and summarize their related multiscale self-assembly behaviors. Finally, we outline works utilizing multiscale modeling to assist RSSP material design. This article is protected by copyright. All rights reserved.
Protein functional features are reflected in the patterns of mRNA translation speed.
López, Daniel; Pazos, Florencio
2015-07-09
The degeneracy of the genetic code makes it possible for the same amino acid string to be coded by different messenger RNA (mRNA) sequences. These "synonymous mRNAs" may differ largely in a number of aspects related to their overall translational efficiency, such as secondary structure content and availability of the encoded transfer RNAs (tRNAs). Consequently, they may render different yields of the translated polypeptides. These mRNA features related to translation efficiency are also playing a role locally, resulting in a non-uniform translation speed along the mRNA, which has been previously related to some protein structural features and also used to explain some dramatic effects of "silent" single-nucleotide-polymorphisms (SNPs). In this work we perform the first large scale analysis of the relationship between three experimental proxies of mRNA local translation efficiency and the local features of the corresponding encoded proteins. We found that a number of protein functional and structural features are reflected in the patterns of ribosome occupancy, secondary structure and tRNA availability along the mRNA. One or more of these proxies of translation speed have distinctive patterns around the mRNA regions coding for certain protein local features. In some cases the three patterns follow a similar trend. We also show specific examples where these patterns of translation speed point to the protein's important structural and functional features. This support the idea that the genome not only codes the protein functional features as sequences of amino acids, but also as subtle patterns of mRNA properties which, probably through local effects on the translation speed, have some consequence on the final polypeptide. These results open the possibility of predicting a protein's functional regions based on a single genomic sequence, and have implications for heterologous protein expression and fine-tuning protein function.
NASA Astrophysics Data System (ADS)
Hong, Seok Hoon; Kwon, Yong-Chan; Jewett, Michael
2014-06-01
Incorporating non-standard amino acids (NSAAs) into proteins enables new chemical properties, new structures, and new functions. In recent years, improvements in cell-free protein synthesis (CFPS) systems have opened the way to accurate and efficient incorporation of NSAAs into proteins. The driving force behind this development has been three-fold. First, a technical renaissance has enabled high-yielding (>1 g/L) and long-lasting (>10 h in batch operation) CFPS in systems derived from Escherichia coli. Second, the efficiency of orthogonal translation systems has improved. Third, the open nature of the CFPS platform has brought about an unprecedented level of control and freedom of design. Here, we review recent developments in CFPS platforms designed to precisely incorporate NSAAs. In the coming years, we anticipate that CFPS systems will impact efforts to elucidate structure/function relationships of proteins and to make biomaterials and sequence-defined biopolymers for medical and industrial applications.
Ice-binding proteins: a remarkable diversity of structures for stopping and starting ice growth.
Davies, Peter L
2014-11-01
Antifreeze proteins (AFPs) were discovered in marine fishes that need protection from freezing. These ice-binding proteins (IBPs) are widespread across biological kingdoms, and their functions include freeze tolerance and ice adhesion. Consistent with recent independent evolution, AFPs have remarkably diverse folds that rely heavily on hydrogen- and disulfide-bonding. AFP ice-binding sites are typically flat, extensive, relatively hydrophobic, and are thought to organize water into an ice-like arrangement that merges and freezes with the quasi-liquid layer next to the ice lattice. In this article, the roles, properties, and structure-function interactions of IBPs are reviewed, and their relationship to ice nucleation proteins, which promote freezing at high subzero temperatures, is explored. Copyright © 2014 Elsevier Ltd. All rights reserved.
Suplatov, Dmitry; Kirilin, Eugeny; Arbatsky, Mikhail; Takhaveev, Vakil; Svedas, Vytas
2014-07-01
The new web-server pocketZebra implements the power of bioinformatics and geometry-based structural approaches to identify and rank subfamily-specific binding sites in proteins by functional significance, and select particular positions in the structure that determine selective accommodation of ligands. A new scoring function has been developed to annotate binding sites by the presence of the subfamily-specific positions in diverse protein families. pocketZebra web-server has multiple input modes to meet the needs of users with different experience in bioinformatics. The server provides on-site visualization of the results as well as off-line version of the output in annotated text format and as PyMol sessions ready for structural analysis. pocketZebra can be used to study structure-function relationship and regulation in large protein superfamilies, classify functionally important binding sites and annotate proteins with unknown function. The server can be used to engineer ligand-binding sites and allosteric regulation of enzymes, or implemented in a drug discovery process to search for potential molecular targets and novel selective inhibitors/effectors. The server, documentation and examples are freely available at http://biokinet.belozersky.msu.ru/pocketzebra and there are no login requirements. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Structure elucidation of dimeric transmembrane domains of bitopic proteins.
Bocharov, Eduard V; Volynsky, Pavel E; Pavlov, Konstantin V; Efremov, Roman G; Arseniev, Alexander S
2010-01-01
The interaction between transmembrane helices is of great interest because it directly determines biological activity of a membrane protein. Either destroying or enhancing such interactions can result in many diseases related to dysfunction of different tissues in human body. One much studied form of membrane proteins known as bitopic protein is a dimer containing two membrane-spanning helices associating laterally. Establishing structure-function relationship as well as rational design of new types of drugs targeting membrane proteins requires precise structural information about this class of objects. At present time, to investigate spatial structure and internal dynamics of such transmembrane helical dimers, several strategies were developed based mainly on a combination of NMR spectroscopy, optical spectroscopy, protein engineering and molecular modeling. These approaches were successfully applied to homo- and heterodimeric transmembrane fragments of several bitopic proteins, which play important roles in normal and in pathological conditions of human organism.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tsukamoto, Yuta; Kagiwada, Satoshi; Shimazu, Sayuri
The small GTPase Rab5 is reported to regulate various cellular functions, such as vesicular transport and endocytosis. VPS9 domain-containing proteins are thought to activate Rab5(s) by their guanine-nucleotide exchange activities. Numerous VPS9 proteins have been identified and are structurally conserved from yeast to mammalian cells. However, the functional relationships among VPS9 proteins in cells remain unclear. Only one Rab5 and two VPS9 proteins were identified in the Schizosaccharomyces pombe genome. Here, we examined the cellular function of two VPS9 proteins and the relationship between these proteins in cellular functions. Vps901-GFP and Vps902-GFP exhibited dotted signals in vegetative and differentiated cells.more » vps901 deletion mutant (Δvps901) cells exhibited a phenotype deficient in the mating process and responses to high concentrations of ions, such as calcium and metals, and Δvps901Δvps902 double mutant cells exhibited round cell shapes similar to ypt5-909 (Rab5 mutant allele) cells. Deletion of both vps901 and vps902 genes completely abolished the mating process and responses to various stresses. A lack of vacuole formation and aberrant inner cell membrane structures were also observed in Δvps901Δvps902 cells by electron microscopy. These data strongly suggest that Vps901 and Vps902 are cooperatively involved in the regulation of cellular functions, such as cell morphology, sexual development, response to ion stresses, and vacuole formation, via Rab5 signaling pathways in fission yeast cells. - Highlights: • Roles of Rab5 activator VPS9 proteins in cellular functions. • Cooperation between VPS9 proteins in Rab5 signaling pathway. • Roles of each VPS9 protein in Rab5 signaling pathway are discussed.« less
NASA Astrophysics Data System (ADS)
Akhir, Nor Azurah Mat; Nadzirin, Nurul; Mohamed, Rahmah; Firdaus-Raih, Mohd
2015-09-01
Hypothetical proteins of bacterial pathogens represent a large numbers of novel biological mechanisms which could belong to essential pathways in the bacteria. They lack functional characterizations mainly due to the inability of sequence homology based methods to detect functional relationships in the absence of detectable sequence similarity. The dataset derived from this study showed 550 candidates conserved in genomes that has pathogenicity information and only present in the Burkholderiales order. The dataset has been narrowed down to taxonomic clusters. Ten proteins were selected for ORF amplification, seven of them were successfully amplified, and only four proteins were successfully expressed. These proteins will be great candidates in determining the true function via structural biology.
Archaeal Viruses: Diversity, Replication, and Structure.
Dellas, Nikki; Snyder, Jamie C; Bolduc, Benjamin; Young, Mark J
2014-11-01
The Archaea-and their viruses-remain the most enigmatic of life's three domains. Once thought to inhabit only extreme environments, archaea are now known to inhabit diverse environments. Even though the first archaeal virus was described over 40 years ago, only 117 archaeal viruses have been discovered to date. Despite this small number, these viruses have painted a portrait of enormous morphological and genetic diversity. For example, research centered around the various steps of the archaeal virus life cycle has led to the discovery of unique mechanisms employed by archaeal viruses during replication, maturation, and virion release. In many instances, archaeal virus proteins display very low levels of sequence homology to other proteins listed in the public database, and therefore, structural characterization of these proteins has played an integral role in functional assignment. These structural studies have not only provided insights into structure-function relationships but have also identified links between viruses across all three domains of life.
Neshich, Goran; Rocchia, Walter; Mancini, Adauto L.; Yamagishi, Michel E. B.; Kuser, Paula R.; Fileto, Renato; Baudet, Christian; Pinto, Ivan P.; Montagner, Arnaldo J.; Palandrani, Juliana F.; Krauchenco, Joao N.; Torres, Renato C.; Souza, Savio; Togawa, Roberto C.; Higa, Roberto H.
2004-01-01
JavaProtein Dossier (JPD) is a new concept, database and visualization tool providing one of the largest collections of the physicochemical parameters describing proteins' structure, stability, function and interaction with other macromolecules. By collecting as many descriptors/parameters as possible within a single database, we can achieve a better use of the available data and information. Furthermore, data grouping allows us to generate different parameters with the potential to provide new insights into the sequence–structure–function relationship. In JPD, residue selection can be performed according to multiple criteria. JPD can simultaneously display and analyze all the physicochemical parameters of any pair of structures, using precalculated structural alignments, allowing direct parameter comparison at corresponding amino acid positions among homologous structures. In order to focus on the physicochemical (and consequently pharmacological) profile of proteins, visualization tools (showing the structure and structural parameters) also had to be optimized. Our response to this challenge was the use of Java technology with its exceptional level of interactivity. JPD is freely accessible (within the Gold Sting Suite) at http://sms.cbi.cnptia.embrapa.br, http://mirrors.rcsb.org/SMS, http://trantor.bioc.columbia.edu/SMS and http://www.es.embnet.org/SMS/ (Option: JavaProtein Dossier). PMID:15215458
Suplatov, Dmitry; Kirilin, Eugeny; Arbatsky, Mikhail; Takhaveev, Vakil; Švedas, Vytas
2014-01-01
The new web-server pocketZebra implements the power of bioinformatics and geometry-based structural approaches to identify and rank subfamily-specific binding sites in proteins by functional significance, and select particular positions in the structure that determine selective accommodation of ligands. A new scoring function has been developed to annotate binding sites by the presence of the subfamily-specific positions in diverse protein families. pocketZebra web-server has multiple input modes to meet the needs of users with different experience in bioinformatics. The server provides on-site visualization of the results as well as off-line version of the output in annotated text format and as PyMol sessions ready for structural analysis. pocketZebra can be used to study structure–function relationship and regulation in large protein superfamilies, classify functionally important binding sites and annotate proteins with unknown function. The server can be used to engineer ligand-binding sites and allosteric regulation of enzymes, or implemented in a drug discovery process to search for potential molecular targets and novel selective inhibitors/effectors. The server, documentation and examples are freely available at http://biokinet.belozersky.msu.ru/pocketzebra and there are no login requirements. PMID:24852248
Analysis of self-assembly of S-layer protein slp-B53 from Lysinibacillus sphaericus.
Liu, Jun; Falke, Sven; Drobot, Bjoern; Oberthuer, Dominik; Kikhney, Alexey; Guenther, Tobias; Fahmy, Karim; Svergun, Dmitri; Betzel, Christian; Raff, Johannes
2017-01-01
The formation of stable and functional surface layers (S-layers) via self-assembly of surface-layer proteins on the cell surface is a dynamic and complex process. S-layers facilitate a number of important biological functions, e.g., providing protection and mediating selective exchange of molecules and thereby functioning as molecular sieves. Furthermore, S-layers selectively bind several metal ions including uranium, palladium, gold, and europium, some of them with high affinity. Most current research on surface layers focuses on investigating crystalline arrays of protein subunits in Archaea and bacteria. In this work, several complementary analytical techniques and methods have been applied to examine structure-function relationships and dynamics for assembly of S-layer protein slp-B53 from Lysinibacillus sphaericus: (1) The secondary structure of the S-layer protein was analyzed by circular dichroism spectroscopy; (2) Small-angle X-ray scattering was applied to gain insights into the three-dimensional structure in solution; (3) The interaction with bivalent cations was followed by differential scanning calorimetry; (4) The dynamics and time-dependent assembly of S-layers were followed by applying dynamic light scattering; (5) The two-dimensional structure of the paracrystalline S-layer lattice was examined by atomic force microscopy. The data obtained provide essential structural insights into the mechanism of S-layer self-assembly, particularly with respect to binding of bivalent cations, i.e., Mg 2+ and Ca 2+ . Furthermore, the results obtained highlight potential applications of S-layers in the fields of micromaterials and nanobiotechnology by providing engineered or individual symmetric thin protein layers, e.g., for protective, antimicrobial, or otherwise functionalized surfaces.
NASA Astrophysics Data System (ADS)
Hilaire, Mary Rose
Proteins possess unique physical and chemical properties that allow them to carry out a wide variety of biological activities and functions. While it is generally understood that a protein's function is dictated by its structure and dynamics, arriving at a molecule-level understanding of the underlying structure-dynamics-function relationship still poses a challenging task in many cases. This is due, at least in part, to the fact that we lack the ability to take snapshots along the reaction coordinate of proteins with sufficient temporal and structural resolution. Therefore, to improve one's ability to acquire site-specific structural and/or environmental information of proteins via either infrared (IR) or fluorescence spectroscopy, the main focus of this thesis is to develop and characterize amino acid-based spectroscopic probes as well as to use such probes to study important biological questions. Specifically, we show that (1) p-cyanophenylalanine and selenomethionine constitute an efficient fluorophore-quencher pair, useful for characterizing protein conformational changes that occur on a short distance; (2) 4-cyanotryptophan is a novel blue fluorescent amino acid, applicable for biological imaging due to its unique photophysical properties; (3) the dielectric constant inside the hydrophobic interior of staphylococcal nuclease is about 10-15, significantly larger than previously assumed; and (4) a single mutation in a short segment of the protein transthyretin (i.e., 110-115) induces formation of amyloid fibrils consisting of both beta- and alpha-sheets, where the latter is a proposed structure in proteins, but has never been observed previously.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ruskamo, Salla; University of Oulu, Oulu; Yadav, Ravi P.
2014-01-01
The structure of the human myelin peripheral membrane protein P2 has been refined at 0.93 Å resolution. In combination with functional experiments in vitro, in vivo and in silico, the fine details of the structure–function relationships in P2 are emerging. P2 is a fatty acid-binding protein expressed in vertebrate peripheral nerve myelin, where it may function in bilayer stacking and lipid transport. P2 binds to phospholipid membranes through its positively charged surface and a hydrophobic tip, and accommodates fatty acids inside its barrel structure. The structure of human P2 refined at the ultrahigh resolution of 0.93 Å allows detailed structuralmore » analyses, including the full organization of an internal hydrogen-bonding network. The orientation of the bound fatty-acid carboxyl group is linked to the protonation states of two coordinating arginine residues. An anion-binding site in the portal region is suggested to be relevant for membrane interactions and conformational changes. When bound to membrane multilayers, P2 has a preferred orientation and is stabilized, and the repeat distance indicates a single layer of P2 between membranes. Simulations show the formation of a double bilayer in the presence of P2, and in cultured cells wild-type P2 induces membrane-domain formation. Here, the most accurate structural and functional view to date on P2, a major component of peripheral nerve myelin, is presented, showing how it can interact with two membranes simultaneously while going through conformational changes at its portal region enabling ligand transfer.« less
Design of sweet protein based sweeteners: hints from structure-function relationships.
Rega, Michele Fortunato; Di Monaco, Rossella; Leone, Serena; Donnarumma, Federica; Spadaccini, Roberta; Cavella, Silvana; Picone, Delia
2015-04-15
Sweet proteins represent a class of natural molecules, which are extremely interesting regarding their potential use as safe low-calories sweeteners for individuals who need to control sugar intake, such as obese or diabetic subjects. Punctual mutations of amino acid residues of MNEI, a single chain derivative of the natural sweet protein monellin, allow the modulation of its taste. In this study we present a structural and functional comparison between MNEI and a sweeter mutant Y65R, containing an extra positive charge on the protein surface, in conditions mimicking those of typical beverages. Y65R exhibits superior sweetness in all the experimental conditions tested, has a better solubility at mild acidic pH and preserves a significant thermal stability in a wide range of pH conditions, although slightly lower than MNEI. Our findings confirm the advantages of structure-guided protein engineering to design improved low-calorie sweeteners and excipients for food and pharmaceutical preparations. Copyright © 2014 Elsevier Ltd. All rights reserved.
2011-01-01
Background The inorganic (Pi) phosphate transporter (PiT) family comprises known and putative Na+- or H+-dependent Pi-transporting proteins with representatives from all kingdoms. The mammalian members are placed in the outer cell membranes and suggested to supply cells with Pi to maintain house-keeping functions. Alignment of protein sequences representing PiT family members from all kingdoms reveals the presence of conserved amino acids and that bacterial phosphate permeases and putative phosphate permeases from archaea lack substantial parts of the protein sequence when compared to the mammalian PiT family members. Besides being Na+-dependent Pi (NaPi) transporters, the mammalian PiT paralogs, PiT1 and PiT2, also are receptors for gamma-retroviruses. We have here exploited the dual-function of PiT1 and PiT2 to study the structure-function relationship of PiT proteins. Results We show that the human PiT2 histidine, H502, and the human PiT1 glutamate, E70, - both conserved in eukaryotic PiT family members - are critical for Pi transport function. Noticeably, human PiT2 H502 is located in the C-terminal PiT family signature sequence, and human PiT1 E70 is located in ProDom domains characteristic for all PiT family members. A human PiT2 truncation mutant, which consists of the predicted 10 transmembrane (TM) domain backbone without a large intracellular domain (human PiT2ΔR254-V483), was found to be a fully functional Pi transporter. Further truncation of the human PiT2 protein by additional removal of two predicted TM domains together with the large intracellular domain created a mutant that resembles a bacterial phosphate permease and an archaeal putative phosphate permease. This human PiT2 truncation mutant (human PiT2ΔL183-V483) did also support Pi transport albeit at very low levels. Conclusions The results suggest that the overall structure of the Pi-transporting unit of the PiT family proteins has remained unchanged during evolution. Moreover, in combination, our studies of the gene structure of the human PiT1 and PiT2 genes (SLC20A1 and SLC20A2, respectively) and alignment of protein sequences of PiT family members from all kingdoms, along with the studies of the dual functions of the human PiT paralogs show that these proteins are excellent as models for studying the evolution of a protein's structure-function relationship. PMID:21586110
Bøttger, Pernille; Pedersen, Lene
2011-05-17
The inorganic (Pi) phosphate transporter (PiT) family comprises known and putative Na(+)- or H(+)-dependent Pi-transporting proteins with representatives from all kingdoms. The mammalian members are placed in the outer cell membranes and suggested to supply cells with Pi to maintain house-keeping functions. Alignment of protein sequences representing PiT family members from all kingdoms reveals the presence of conserved amino acids and that bacterial phosphate permeases and putative phosphate permeases from archaea lack substantial parts of the protein sequence when compared to the mammalian PiT family members. Besides being Na(+)-dependent P(i) (NaP(i)) transporters, the mammalian PiT paralogs, PiT1 and PiT2, also are receptors for gamma-retroviruses. We have here exploited the dual-function of PiT1 and PiT2 to study the structure-function relationship of PiT proteins. We show that the human PiT2 histidine, H(502), and the human PiT1 glutamate, E(70),--both conserved in eukaryotic PiT family members--are critical for P(i) transport function. Noticeably, human PiT2 H(502) is located in the C-terminal PiT family signature sequence, and human PiT1 E(70) is located in ProDom domains characteristic for all PiT family members.A human PiT2 truncation mutant, which consists of the predicted 10 transmembrane (TM) domain backbone without a large intracellular domain (human PiT2ΔR(254)-V(483)), was found to be a fully functional P(i) transporter. Further truncation of the human PiT2 protein by additional removal of two predicted TM domains together with the large intracellular domain created a mutant that resembles a bacterial phosphate permease and an archaeal putative phosphate permease. This human PiT2 truncation mutant (human PiT2ΔL(183)-V(483)) did also support P(i) transport albeit at very low levels. The results suggest that the overall structure of the P(i)-transporting unit of the PiT family proteins has remained unchanged during evolution. Moreover, in combination, our studies of the gene structure of the human PiT1 and PiT2 genes (SLC20A1 and SLC20A2, respectively) and alignment of protein sequences of PiT family members from all kingdoms, along with the studies of the dual functions of the human PiT paralogs show that these proteins are excellent as models for studying the evolution of a protein's structure-function relationship. © 2011 Bøttger and Pedersen; licensee BioMed Central Ltd.
Bhaskara, Ramachandra M; Padhi, Amrita; Srinivasan, Narayanaswamy
2014-07-01
With the preponderance of multidomain proteins in eukaryotic genomes, it is essential to recognize the constituent domains and their functions. Often function involves communications across the domain interfaces, and the knowledge of the interacting sites is essential to our understanding of the structure-function relationship. Using evolutionary information extracted from homologous domains in at least two diverse domain architectures (single and multidomain), we predict the interface residues corresponding to domains from the two-domain proteins. We also use information from the three-dimensional structures of individual domains of two-domain proteins to train naïve Bayes classifier model to predict the interfacial residues. Our predictions are highly accurate (∼85%) and specific (∼95%) to the domain-domain interfaces. This method is specific to multidomain proteins which contain domains in at least more than one protein architectural context. Using predicted residues to constrain domain-domain interaction, rigid-body docking was able to provide us with accurate full-length protein structures with correct orientation of domains. We believe that these results can be of considerable interest toward rational protein and interaction design, apart from providing us with valuable information on the nature of interactions. © 2013 Wiley Periodicals, Inc.
Molecular structures guide the engineering of chromatin
Tekel, Stefan J.
2017-01-01
Abstract Chromatin is a system of proteins, RNA, and DNA that interact with each other to organize and regulate genetic information within eukaryotic nuclei. Chromatin proteins carry out essential functions: packing DNA during cell division, partitioning DNA into sub-regions within the nucleus, and controlling levels of gene expression. There is a growing interest in manipulating chromatin dynamics for applications in medicine and agriculture. Progress in this area requires the identification of design rules for the chromatin system. Here, we focus on the relationship between the physical structure and function of chromatin proteins. We discuss key research that has elucidated the intrinsic properties of chromatin proteins and how this information informs design rules for synthetic systems. Recent work demonstrates that chromatin-derived peptide motifs are portable and in some cases can be customized to alter their function. Finally, we present a workflow for fusion protein design and discuss best practices for engineering chromatin to assist scientists in advancing the field of synthetic epigenetics. PMID:28609787
Shah, Syed Hussinien H; Kar, Rajiv K; Asmawi, Azren A; Rahman, Mohd Basyaruddin A; Murad, Abdul Munir A; Mahadi, Nor M; Basri, Mahiran; Rahman, Raja Noor Zaliha A; Salleh, Abu B; Chatterjee, Subhrangsu; Tejo, Bimo A; Bhunia, Anirban
2012-01-01
Exotic functions of antifreeze proteins (AFP) and antifreeze glycopeptides (AFGP) have recently been attracted with much interest to develop them as commercial products. AFPs and AFGPs inhibit ice crystal growth by lowering the water freezing point without changing the water melting point. Our group isolated the Antarctic yeast Glaciozyma antarctica that expresses antifreeze protein to assist it in its survival mechanism at sub-zero temperatures. The protein is unique and novel, indicated by its low sequence homology compared to those of other AFPs. We explore the structure-function relationship of G. antarctica AFP using various approaches ranging from protein structure prediction, peptide design and antifreeze activity assays, nuclear magnetic resonance (NMR) studies and molecular dynamics simulation. The predicted secondary structure of G. antarctica AFP shows several α-helices, assumed to be responsible for its antifreeze activity. We designed several peptide fragments derived from the amino acid sequences of α-helical regions of the parent AFP and they also showed substantial antifreeze activities, below that of the original AFP. The relationship between peptide structure and activity was explored by NMR spectroscopy and molecular dynamics simulation. NMR results show that the antifreeze activity of the peptides correlates with their helicity and geometrical straightforwardness. Furthermore, molecular dynamics simulation also suggests that the activity of the designed peptides can be explained in terms of the structural rigidity/flexibility, i.e., the most active peptide demonstrates higher structural stability, lower flexibility than that of the other peptides with lower activities, and of lower rigidity. This report represents the first detailed report of downsizing a yeast AFP into its peptide fragments with measurable antifreeze activities.
Asmawi, Azren A.; Rahman, Mohd Basyaruddin A.; Murad, Abdul Munir A.; Mahadi, Nor M.; Basri, Mahiran; Rahman, Raja Noor Zaliha A.; Salleh, Abu B.; Chatterjee, Subhrangsu; Tejo, Bimo A.; Bhunia, Anirban
2012-01-01
Exotic functions of antifreeze proteins (AFP) and antifreeze glycopeptides (AFGP) have recently been attracted with much interest to develop them as commercial products. AFPs and AFGPs inhibit ice crystal growth by lowering the water freezing point without changing the water melting point. Our group isolated the Antarctic yeast Glaciozyma antarctica that expresses antifreeze protein to assist it in its survival mechanism at sub-zero temperatures. The protein is unique and novel, indicated by its low sequence homology compared to those of other AFPs. We explore the structure-function relationship of G. antarctica AFP using various approaches ranging from protein structure prediction, peptide design and antifreeze activity assays, nuclear magnetic resonance (NMR) studies and molecular dynamics simulation. The predicted secondary structure of G. antarctica AFP shows several α-helices, assumed to be responsible for its antifreeze activity. We designed several peptide fragments derived from the amino acid sequences of α-helical regions of the parent AFP and they also showed substantial antifreeze activities, below that of the original AFP. The relationship between peptide structure and activity was explored by NMR spectroscopy and molecular dynamics simulation. NMR results show that the antifreeze activity of the peptides correlates with their helicity and geometrical straightforwardness. Furthermore, molecular dynamics simulation also suggests that the activity of the designed peptides can be explained in terms of the structural rigidity/flexibility, i.e., the most active peptide demonstrates higher structural stability, lower flexibility than that of the other peptides with lower activities, and of lower rigidity. This report represents the first detailed report of downsizing a yeast AFP into its peptide fragments with measurable antifreeze activities. PMID:23209600
From Split to Sibenik: The Tortuous Pathway in the Cholinesterase Field
Taylor, Palmer
2010-01-01
The interim between the first and tenth International Cholinesterase meetings has seen remarkable advances associated with the applications of structural biology and recombinant DNA methodology to our field. The cloning of the cholinesterase genes led to the identification of a new super family of proteins, termed the α,β–hydrolase fold; members of this family possess a four helix bundle capable of linking structural subunits to the functioning globular protein. Sequence comparisons and three dimensional structural studies revealed unexpected cousins possessing this fold that, in turn, revealed three distinct functions for the α,β-hydrolase proteins. These encompass: (1) a capacity for hydrolytic cleavage of a great variety of substrates, (2) a heterophilic adhesion function that results in trans-synaptic associations in linked neurons, (3) a chaperone function leading to stabilization of nascent protein and its trafficking to an extracellular or secretory storage location. The analysis and modification of structure may go beyond understanding mechanism, since it may be possible to convert the cholinesterases to efficient detoxifying agents of organophosphatases assisted by added oximes. Also, the study of the relationship between the α,β–hydrolase fold proteins and their biosynthesis may yield means by which aberrant trafficking may be corrected, enhancing expression of mutant proteins. Those engaged in cholinesterase research should take great pride in our accomplishments punctuated by the series of ten meetings. The momentum established and initial studies with related proteins all hold great promise for the future. PMID:20493179
Structure and function of enzymes in heme biosynthesis.
Layer, Gunhild; Reichelt, Joachim; Jahn, Dieter; Heinz, Dirk W
2010-06-01
Tetrapyrroles like hemes, chlorophylls, and cobalamin are complex macrocycles which play essential roles in almost all living organisms. Heme serves as prosthetic group of many proteins involved in fundamental biological processes like respiration, photosynthesis, and the metabolism and transport of oxygen. Further, enzymes such as catalases, peroxidases, or cytochromes P450 rely on heme as essential cofactors. Heme is synthesized in most organisms via a highly conserved biosynthetic route. In humans, defects in heme biosynthesis lead to severe metabolic disorders called porphyrias. The elucidation of the 3D structures for all heme biosynthetic enzymes over the last decade provided new insights into their function and elucidated the structural basis of many known diseases. In terms of structure and function several rather unique proteins were revealed such as the V-shaped glutamyl-tRNA reductase, the dipyrromethane cofactor containing porphobilinogen deaminase, or the "Radical SAM enzyme" coproporphyrinogen III dehydrogenase. This review summarizes the current understanding of the structure-function relationship for all heme biosynthetic enzymes and their potential interactions in the cell.
Pattern similarity study of functional sites in protein sequences: lysozymes and cystatins
Nakai, Shuryo; Li-Chan, Eunice CY; Dou, Jinglie
2005-01-01
Background Although it is generally agreed that topography is more conserved than sequences, proteins sharing the same fold can have different functions, while there are protein families with low sequence similarity. An alternative method for profile analysis of characteristic conserved positions of the motifs within the 3D structures may be needed for functional annotation of protein sequences. Using the approach of quantitative structure-activity relationships (QSAR), we have proposed a new algorithm for postulating functional mechanisms on the basis of pattern similarity and average of property values of side-chains in segments within sequences. This approach was used to search for functional sites of proteins belonging to the lysozyme and cystatin families. Results Hydrophobicity and β-turn propensity of reference segments with 3–7 residues were used for the homology similarity search (HSS) for active sites. Hydrogen bonding was used as the side-chain property for searching the binding sites of lysozymes. The profiles of similarity constants and average values of these parameters as functions of their positions in the sequences could identify both active and substrate binding sites of the lysozyme of Streptomyces coelicolor, which has been reported as a new fold enzyme (Cellosyl). The same approach was successfully applied to cystatins, especially for postulating the mechanisms of amyloidosis of human cystatin C as well as human lysozyme. Conclusion Pattern similarity and average index values of structure-related properties of side chains in short segments of three residues or longer were, for the first time, successfully applied for predicting functional sites in sequences. This new approach may be applicable to studying functional sites in un-annotated proteins, for which complete 3D structures are not yet available. PMID:15904486
He, Yi-Ming; Ma, Bin-Guang
2016-01-01
Protein complexes are major forms of protein-protein interactions and implement essential biological functions. The subunit interface in a protein complex is related to its thermostability. Though the roles of interface properties in thermal adaptation have been investigated for protein complexes, the relationship between the interface size and the expression level of the subunits remains unknown. In the present work, we studied this relationship and found a positive correlation in thermophiles rather than mesophiles. Moreover, we found that the protein interaction strength in complexes is not only temperature-dependent but also abundance-dependent. The underlying mechanism for the observed correlation was explored by simulating the evolution of protein interface stability, which highlights the avoidance of misinteraction. Our findings make more complete the picture of the mechanisms for protein complex thermal adaptation and provide new insights into the principles of protein-protein interactions. PMID:27220911
NASA Astrophysics Data System (ADS)
He, Yi-Ming; Ma, Bin-Guang
2016-05-01
Protein complexes are major forms of protein-protein interactions and implement essential biological functions. The subunit interface in a protein complex is related to its thermostability. Though the roles of interface properties in thermal adaptation have been investigated for protein complexes, the relationship between the interface size and the expression level of the subunits remains unknown. In the present work, we studied this relationship and found a positive correlation in thermophiles rather than mesophiles. Moreover, we found that the protein interaction strength in complexes is not only temperature-dependent but also abundance-dependent. The underlying mechanism for the observed correlation was explored by simulating the evolution of protein interface stability, which highlights the avoidance of misinteraction. Our findings make more complete the picture of the mechanisms for protein complex thermal adaptation and provide new insights into the principles of protein-protein interactions.
Machnicka, Magdalena A; Kaminska, Katarzyna H; Dunin-Horkawicz, Stanislaw; Bujnicki, Janusz M
2015-10-23
GmrSD is a modification-dependent restriction endonuclease that specifically targets and cleaves glucosylated hydroxymethylcytosine (glc-HMC) modified DNA. It is encoded either as two separate single-domain GmrS and GmrD proteins or as a single protein carrying both domains. Previous studies suggested that GmrS acts as endonuclease and NTPase whereas GmrD binds DNA. In this work we applied homology detection, sequence conservation analysis, fold recognition and homology modeling methods to study sequence-structure-function relationships in the GmrSD restriction endonucleases family. We also analyzed the phylogeny and genomic context of the family members. Results of our comparative genomics study show that GmrS exhibits similarity to proteins from the ParB/Srx fold which can have both NTPase and nuclease activity. In contrast to the previous studies though, we attribute the nuclease activity also to GmrD as we found it to contain the HNH endonuclease motif. We revealed residues potentially important for structure and function in both domains. Moreover, we found that GmrSD systems exist predominantly as a fused, double-domain form rather than as a heterodimer and that their homologs are often encoded in regions enriched in defense and gene mobility-related elements. Finally, phylogenetic reconstructions of GmrS and GmrD domains revealed that they coevolved and only few GmrSD systems appear to be assembled from distantly related GmrS and GmrD components. Our study provides insight into sequence-structure-function relationships in the yet poorly characterized family of Type IV restriction enzymes. Comparative genomics allowed to propose possible role of GmrD domain in the function of the GmrSD enzyme and possible active sites of both GmrS and GmrD domains. Presented results can guide further experimental characterization of these enzymes.
ERIC Educational Resources Information Center
Chiang, Harry; Robinson, Lucy C.; Brame, Cynthia J.; Messina, Troy C.
2013-01-01
Over the past 20 years, the biological sciences have increasingly incorporated chemistry, physics, computer science, and mathematics to aid in the development and use of mathematical models. Such combined approaches have been used to address problems from protein structure-function relationships to the workings of complex biological systems.…
Lakshminarayanan, Rajamani; Joseph, Jeremiah S; Kini, R Manjunatha; Valiyaveettil, Suresh
2005-01-01
The role of individual matrix proteins in avian eggshell calcification is poorly understood despite numerous attempts to characterize and localize their presence in the eggshell matrix. Ansocalcin, the major matrix protein from goose eggshell, was found to induce the formation of calcite crystal aggregates under in vitro. Owing to its high similarity with the chicken eggshell matrix protein ovocleidin 17 (OC-17), a comparative investigation has been carried out to understand the structure-function relationship. RP-HPLC shows that ansocalcin is the major component in extracts of goose eggshells before and after bleach treatment. However, OC-17 was observed in minute quantities in the extract of bleach-treated chicken eggshells. In vitro crystal growth experiments showed that OC-17 and ansocalcin interact differently with the calcite crystals formed. Circular dichroism, intrinsic tryptophan fluorescence, and dynamic light scattering studies showed that, under the conditions used in our experiments, OC-17 does not aggregate in solution or induce the nucleation of calcite aggregates in the concentration range used. These observations indicate that OC-17 and ansocalcin play different roles in the eggshell calcification. To our knowledge, this is the first report on the comparison of properties of homologous eggshell proteins that belong to the same phylogeny.
Protein machines and self assembly in muscle organization
NASA Technical Reports Server (NTRS)
Barral, J. M.; Epstein, H. F.
1999-01-01
The remarkable order of striated muscle is the result of a complex series of protein interactions at different levels of organization. Within muscle, the thick filament and its major protein myosin are classical examples of functioning protein machines. Our understanding of the structure and assembly of thick filaments and their organization into the regular arrays of the A-band has recently been enhanced by the application of biochemical, genetic, and structural approaches. Detailed studies of the thick filament backbone have shown that the myosins are organized into a tubular structure. Additional protein machines and specific myosin rod sequences have been identified that play significant roles in thick filament structure, assembly, and organization. These include intrinsic filament components, cross-linking molecules of the M-band and constituents of the membrane-cytoskeleton system. Muscle organization is directed by the multistep actions of protein machines that take advantage of well-established self-assembly relationships. Copyright 1999 John Wiley & Sons, Inc.
2014-01-01
Background Bacteroides spp. form a significant part of our gut microbiome and are well known for optimized metabolism of diverse polysaccharides. Initial analysis of the archetypal Bacteroides thetaiotaomicron genome identified 172 glycosyl hydrolases and a large number of uncharacterized proteins associated with polysaccharide metabolism. Results BT_1012 from Bacteroides thetaiotaomicron VPI-5482 is a protein of unknown function and a member of a large protein family consisting entirely of uncharacterized proteins. Initial sequence analysis predicted that this protein has two domains, one on the N- and one on the C-terminal. A PSI-BLAST search found over 150 full length and over 90 half size homologs consisting only of the N-terminal domain. The experimentally determined three-dimensional structure of the BT_1012 protein confirms its two-domain architecture and structural analysis of both domains suggests their specific functions. The N-terminal domain is a putative catalytic domain with significant similarity to known glycoside hydrolases, the C-terminal domain has a beta-sandwich fold typically found in C-terminal domains of other glycosyl hydrolases, however these domains are typically involved in substrate binding. We describe the structure of the BT_1012 protein and discuss its sequence-structure relationship and their possible functional implications. Conclusions Structural and sequence analyses of the BT_1012 protein identifies it as a glycosyl hydrolase, expanding an already impressive catalog of enzymes involved in polysaccharide metabolism in Bacteroides spp. Based on this we have renamed the Pfam families representing the two domains found in the BT_1012 protein, PF13204 and PF12904, as putative glycoside hydrolase and glycoside hydrolase-associated C-terminal domain respectively. PMID:24742328
The Modular Organization of Protein Interactions in Escherichia coli
Peregrín-Alvarez, José M.; Xiong, Xuejian; Su, Chong; Parkinson, John
2009-01-01
Escherichia coli serves as an excellent model for the study of fundamental cellular processes such as metabolism, signalling and gene expression. Understanding the function and organization of proteins within these processes is an important step towards a ‘systems’ view of E. coli. Integrating experimental and computational interaction data, we present a reliable network of 3,989 functional interactions between 1,941 E. coli proteins (∼45% of its proteome). These were combined with a recently generated set of 3,888 high-quality physical interactions between 918 proteins and clustered to reveal 316 discrete modules. In addition to known protein complexes (e.g., RNA and DNA polymerases), we identified modules that represent biochemical pathways (e.g., nitrate regulation and cell wall biosynthesis) as well as batteries of functionally and evolutionarily related processes. To aid the interpretation of modular relationships, several case examples are presented, including both well characterized and novel biochemical systems. Together these data provide a global view of the modular organization of the E. coli proteome and yield unique insights into structural and evolutionary relationships in bacterial networks. PMID:19798435
Jaspard, Emmanuel; Macherel, David; Hunault, Gilles
2012-01-01
Late Embryogenesis Abundant Proteins (LEAPs) are ubiquitous proteins expected to play major roles in desiccation tolerance. Little is known about their structure - function relationships because of the scarcity of 3-D structures for LEAPs. The previous building of LEAPdb, a database dedicated to LEAPs from plants and other organisms, led to the classification of 710 LEAPs into 12 non-overlapping classes with distinct properties. Using this resource, numerous physico-chemical properties of LEAPs and amino acid usage by LEAPs have been computed and statistically analyzed, revealing distinctive features for each class. This unprecedented analysis allowed a rigorous characterization of the 12 LEAP classes, which differed also in multiple structural and physico-chemical features. Although most LEAPs can be predicted as intrinsically disordered proteins, the analysis indicates that LEAP class 7 (PF03168) and probably LEAP class 11 (PF04927) are natively folded proteins. This study thus provides a detailed description of the structural properties of this protein family opening the path toward further LEAP structure - function analysis. Finally, since each LEAP class can be clearly characterized by a unique set of physico-chemical properties, this will allow development of software to predict proteins as LEAPs. PMID:22615859
Gromiha, M Michael; Anoosha, P; Huang, Liang-Tsung
2016-01-01
Protein stability is the free energy difference between unfolded and folded states of a protein, which lies in the range of 5-25 kcal/mol. Experimentally, protein stability is measured with circular dichroism, differential scanning calorimetry, and fluorescence spectroscopy using thermal and denaturant denaturation methods. These experimental data have been accumulated in the form of a database, ProTherm, thermodynamic database for proteins and mutants. It also contains sequence and structure information of a protein, experimental methods and conditions, and literature information. Different features such as search, display, and sorting options and visualization tools have been incorporated in the database. ProTherm is a valuable resource for understanding/predicting the stability of proteins and it can be accessed at http://www.abren.net/protherm/ . ProTherm has been effectively used to examine the relationship among thermodynamics, structure, and function of proteins. We describe the recent progress on the development of methods for understanding/predicting protein stability, such as (1) general trends on mutational effects on stability, (2) relationship between the stability of protein mutants and amino acid properties, (3) applications of protein three-dimensional structures for predicting their stability upon point mutations, (4) prediction of protein stability upon single mutations from amino acid sequence, and (5) prediction methods for addressing double mutants. A list of online resources for predicting has also been provided.
González-Díaz, Humberto; Munteanu, Cristian R; Postelnicu, Lucian; Prado-Prado, Francisco; Gestal, Marcos; Pazos, Alejandro
2012-03-01
Lipid-Binding Proteins (LIBPs) or Fatty Acid-Binding Proteins (FABPs) play an important role in many diseases such as different types of cancer, kidney injury, atherosclerosis, diabetes, intestinal ischemia and parasitic infections. Thus, the computational methods that can predict LIBPs based on 3D structure parameters became a goal of major importance for drug-target discovery, vaccine design and biomarker selection. In addition, the Protein Data Bank (PDB) contains 3000+ protein 3D structures with unknown function. This list, as well as new experimental outcomes in proteomics research, is a very interesting source to discover relevant proteins, including LIBPs. However, to the best of our knowledge, there are no general models to predict new LIBPs based on 3D structures. We developed new Quantitative Structure-Activity Relationship (QSAR) models based on 3D electrostatic parameters of 1801 different proteins, including 801 LIBPs. We calculated these electrostatic parameters with the MARCH-INSIDE software and they correspond to the entire protein or to specific protein regions named core, inner, middle, and surface. We used these parameters as inputs to develop a simple Linear Discriminant Analysis (LDA) classifier to discriminate 3D structure of LIBPs from other proteins. We implemented this predictor in the web server named LIBP-Pred, freely available at , along with other important web servers of the Bio-AIMS portal. The users can carry out an automatic retrieval of protein structures from PDB or upload their custom protein structural models from their disk created with LOMETS server. We demonstrated the PDB mining option performing a predictive study of 2000+ proteins with unknown function. Interesting results regarding the discovery of new Cancer Biomarkers in humans or drug targets in parasites have been discussed here in this sense.
F2Dock: Fast Fourier Protein-Protein Docking
Bajaj, Chandrajit; Chowdhury, Rezaul; Siddavanahalli, Vinay
2009-01-01
The functions of proteins is often realized through their mutual interactions. Determining a relative transformation for a pair of proteins and their conformations which form a stable complex, reproducible in nature, is known as docking. It is an important step in drug design, structure determination and understanding function and structure relationships. In this paper we extend our non-uniform fast Fourier transform docking algorithm to include an adaptive search phase (both translational and rotational) and thereby speed up its execution. We have also implemented a multithreaded version of the adaptive docking algorithm for even faster execution on multicore machines. We call this protein-protein docking code F2Dock (F2 = Fast Fourier). We have calibrated F2Dock based on an extensive experimental study on a list of benchmark complexes and conclude that F2Dock works very well in practice. Though all docking results reported in this paper use shape complementarity and Coulombic potential based scores only, F2Dock is structured to incorporate Lennard-Jones potential and re-ranking docking solutions based on desolvation energy. PMID:21071796
Evolution driven structural changes in CENP-E motor domain.
Kumar, Ambuj; Kamaraj, Balu; Sethumadhavan, Rao; Purohit, Rituraj
2013-06-01
Genetic evolution corresponds to various biochemical changes that are vital development of new functional traits. Phylogenetic analysis has provided an important insight into the genetic closeness among species and their evolutionary relationships. Centromere-associated protein-E (CENP-E) protein is vital for maintaining cell cycle and checkpoint signal mechanisms are vital for recruitment process of other essential kinetochore proteins. In this study we have focussed on the evolution driven structural changes in CENP-E motor domain among primate lineage. Through molecular dynamics simulation and computational chemistry approaches we examined the changes in ATP binding affinity and conformational deviations in human CENP-E motor domain as compared to the other primates. Root mean square deviation (RMSD), Root mean square fluctuation (RMSF), Radius of gyration (Rg) and principle component analysis (PCA) results together suggested a gain in stability level as we move from tarsier towards human. This study provides a significant insight into how the cell cycle proteins and their corresponding biochemical activities are evolving and illustrates the potency of a theoretical approach for assessing, in a single study, the structural, functional, and dynamical aspects of protein evolution.
Lamin-like analogues in plants: the characterization of NMCP1 in Allium cepa
Moreno Díaz de la Espina, Susana
2013-01-01
The nucleoskeleton of plants contains a peripheral lamina (also called plamina) and, even though lamins are absent in plants, their roles are still fulfilled in plant nuclei. One of the most intriguing topics in plant biology concerns the identity of lamin protein analogues in plants. Good candidates to play lamin functions in plants are the members of the NMCP (nuclear matrix constituent protein) family, which exhibit the typical tripartite structure of lamins. This paper describes a bioinformatics analysis and classification of the NMCP family based on phylogenetic relationships, sequence similarity and the distribution of conserved regions in 76 homologues. In addition, NMCP1 in the monocot Allium cepa characterized by its sequence and structure, biochemical properties, and subnuclear distribution and alterations in its expression throughout the root were identified. The results demonstrate that these proteins exhibit many similarities to lamins (structural organization, conserved regions, subnuclear distribution, and solubility) and that they may fulfil the functions of lamins in plants. These findings significantly advance understanding of the structural proteins of the plant lamina and nucleoskeleton and provide a basis for further investigation of the protein networks forming these structures. PMID:23378381
Lamin-like analogues in plants: the characterization of NMCP1 in Allium cepa.
Ciska, Malgorzata; Masuda, Kiyoshi; Moreno Díaz de la Espina, Susana
2013-04-01
The nucleoskeleton of plants contains a peripheral lamina (also called plamina) and, even though lamins are absent in plants, their roles are still fulfilled in plant nuclei. One of the most intriguing topics in plant biology concerns the identity of lamin protein analogues in plants. Good candidates to play lamin functions in plants are the members of the NMCP (nuclear matrix constituent protein) family, which exhibit the typical tripartite structure of lamins. This paper describes a bioinformatics analysis and classification of the NMCP family based on phylogenetic relationships, sequence similarity and the distribution of conserved regions in 76 homologues. In addition, NMCP1 in the monocot Allium cepa characterized by its sequence and structure, biochemical properties, and subnuclear distribution and alterations in its expression throughout the root were identified. The results demonstrate that these proteins exhibit many similarities to lamins (structural organization, conserved regions, subnuclear distribution, and solubility) and that they may fulfil the functions of lamins in plants. These findings significantly advance understanding of the structural proteins of the plant lamina and nucleoskeleton and provide a basis for further investigation of the protein networks forming these structures.
Zhou, Peng; Wang, Congcong; Tian, Feifei; Ren, Yanrong; Yang, Chao; Huang, Jian
2013-01-01
Quantitative structure-activity relationship (QSAR), a regression modeling methodology that establishes statistical correlation between structure feature and apparent behavior for a series of congeneric molecules quantitatively, has been widely used to evaluate the activity, toxicity and property of various small-molecule compounds such as drugs, toxicants and surfactants. However, it is surprising to see that such useful technique has only very limited applications to biomacromolecules, albeit the solved 3D atom-resolution structures of proteins, nucleic acids and their complexes have accumulated rapidly in past decades. Here, we present a proof-of-concept paradigm for the modeling, prediction and interpretation of the binding affinity of 144 sequence-nonredundant, structure-available and affinity-known protein complexes (Kastritis et al. Protein Sci 20:482-491, 2011) using a biomacromolecular QSAR (BioQSAR) scheme. We demonstrate that the modeling performance and predictive power of BioQSAR are comparable to or even better than that of traditional knowledge-based strategies, mechanism-type methods and empirical scoring algorithms, while BioQSAR possesses certain additional features compared to the traditional methods, such as adaptability, interpretability, deep-validation and high-efficiency. The BioQSAR scheme could be readily modified to infer the biological behavior and functions of other biomacromolecules, if their X-ray crystal structures, NMR conformation assemblies or computationally modeled structures are available.
Unraveling protein catalysis through neutron diffraction
NASA Astrophysics Data System (ADS)
Myles, Dean
Neutron scattering and diffraction are exquisitely sensitive to the location, concentration and dynamics of hydrogen atoms in materials and provide a powerful tool for the characterization of structure-function and interfacial relationships in biological systems. Modern neutron scattering facilities offer access to a sophisticated, non-destructive suite of instruments for biophysical characterization that provide spatial and dynamic information spanning from Angstroms to microns and from picoseconds to microseconds, respectively. Applications range from atomic-resolution analysis of individual hydrogen atoms in enzymes, through to multi-scale analysis of hierarchical structures and assemblies in biological complexes, membranes and in living cells. Here we describe how the precise location of protein and water hydrogen atoms using neutron diffraction provides a more complete description of the atomic and electronic structures of proteins, enabling key questions concerning enzyme reaction mechanisms, molecular recognition and binding and protein-water interactions to be addressed. Current work is focused on understanding how molecular structure and dynamics control function in photosynthetic, cell signaling and DNA repair proteins. We will highlight recent studies that provide detailed understanding of the physiochemical mechanisms through which proteins recognize ligands and catalyze reactions, and help to define and understand the key principles involved.
2013-01-01
Background In recent years, various types of cellular networks have penetrated biology and are nowadays used omnipresently for studying eukaryote and prokaryote organisms. Still, the relation and the biological overlap among phenomenological and inferential gene networks, e.g., between the protein interaction network and the gene regulatory network inferred from large-scale transcriptomic data, is largely unexplored. Results We provide in this study an in-depth analysis of the structural, functional and chromosomal relationship between a protein-protein network, a transcriptional regulatory network and an inferred gene regulatory network, for S. cerevisiae and E. coli. Further, we study global and local aspects of these networks and their biological information overlap by comparing, e.g., the functional co-occurrence of Gene Ontology terms by exploiting the available interaction structure among the genes. Conclusions Although the individual networks represent different levels of cellular interactions with global structural and functional dissimilarities, we observe crucial functions of their network interfaces for the assembly of protein complexes, proteolysis, transcription, translation, metabolic and regulatory interactions. Overall, our results shed light on the integrability of these networks and their interfacing biological processes. PMID:23663484
The Structural Basis of IKs Ion-Channel Activation: Mechanistic Insights from Molecular Simulations.
Ramasubramanian, Smiruthi; Rudy, Yoram
2018-06-05
Relating ion channel (iCh) structural dynamics to physiological function remains a challenge. Current experimental and computational techniques have limited ability to explore this relationship in atomistic detail over physiological timescales. A framework associating iCh structure to function is necessary for elucidating normal and disease mechanisms. We formulated a modeling schema that overcomes the limitations of current methods through applications of artificial intelligence machine learning. Using this approach, we studied molecular processes that underlie human IKs voltage-mediated gating. IKs malfunction underlies many debilitating and life-threatening diseases. Molecular components of IKs that underlie its electrophysiological function include KCNQ1 (a pore-forming tetramer) and KCNE1 (an auxiliary subunit). Simulations, using the IKs structure-function model, reproduced experimentally recorded saturation of gating-charge displacement at positive membrane voltages, two-step voltage sensor (VS) movement shown by fluorescence, iCh gating statistics, and current-voltage relationship. Mechanistic insights include the following: 1) pore energy profile determines iCh subconductance; 2) the entire protein structure, not limited to the pore, contributes to pore energy and channel subconductance; 3) interactions with KCNE1 result in two distinct VS movements, causing gating-charge saturation at positive membrane voltages and current activation delay; and 4) flexible coupling between VS and pore permits pore opening at lower VS positions, resulting in sequential gating. The new modeling approach is applicable to atomistic scale studies of other proteins on timescales of physiological function. Copyright © 2018 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Shagin, Dmitry A; Barsova, Ekaterina V; Yanushevich, Yurii G; Fradkov, Arkady F; Lukyanov, Konstantin A; Labas, Yulii A; Semenova, Tatiana N; Ugalde, Juan A; Meyers, Ann; Nunez, Jose M; Widder, Edith A; Lukyanov, Sergey A; Matz, Mikhail V
2004-05-01
Homologs of the green fluorescent protein (GFP), including the recently described GFP-like domains of certain extracellular matrix proteins in Bilaterian organisms, are remarkably similar at the protein structure level, yet they often perform totally unrelated functions, thereby warranting recognition as a superfamily. Here we describe diverse GFP-like proteins from previously undersampled and completely new sources, including hydromedusae and planktonic Copepoda. In hydromedusae, yellow and nonfluorescent purple proteins were found in addition to greens. Notably, the new yellow protein seems to follow exactly the same structural solution to achieving the yellow color of fluorescence as YFP, an engineered yellow-emitting mutant variant of GFP. The addition of these new sequences made it possible to resolve deep-level phylogenetic relationships within the superfamily. Fluorescence (most likely green) must have already existed in the common ancestor of Cnidaria and Bilateria, and therefore GFP-like proteins may be responsible for fluorescence and/or coloration in virtually any animal. At least 15 color diversification events can be inferred following the maximum parsimony principle in Cnidaria. Origination of red fluorescence and nonfluorescent purple-blue colors on several independent occasions provides a remarkable example of convergent evolution of complex features at the molecular level.
Huang, He; Sarai, Akinori
2012-12-01
The evolvability of proteins is not only restricted by functional and structural importance, but also by other factors such as gene duplication, protein stability, and an organism's robustness. Recently, intrinsically disordered proteins (IDPs)/regions (IDRs) have been suggested to play a role in facilitating protein evolution. However, the mechanisms by which this occurs remain largely unknown. To address this, we have systematically analyzed the relationship between the evolvability, stability, and function of IDPs/IDRs. Evolutionary analysis shows that more recently emerged IDRs have higher evolutionary rates with more functional constraints relaxed (or experiencing more positive selection), and that this may have caused accelerated evolution in the flanking regions and in the whole protein. A systematic analysis of observed stability changes due to single amino acid mutations in IDRs and ordered regions shows that while most mutations induce a destabilizing effect in proteins, mutations in IDRs cause smaller stability changes than in ordered regions. The weaker impact of mutations in IDRs on protein stability may have advantages for protein evolvability in the gain of new functions. Interestingly, however, an analysis of functional motifs in the PROSITE and ELM databases showed that motifs in IDRs are more conserved, characterized by smaller entropy and lower evolutionary rate, than in ordered regions. This apparently opposing evolutionary effect may be partly due to the flexible nature of motifs in IDRs, which require some key amino acid residues to engage in tighter interactions with other molecules. Our study suggests that the unique conformational and thermodynamic characteristics of IDPs/IDRs play an important role in the evolvability of proteins to gain new functions. Copyright © 2012 Elsevier Ltd. All rights reserved.
Molloy, Kevin; Shehu, Amarda
2013-01-01
Many proteins tune their biological function by transitioning between different functional states, effectively acting as dynamic molecular machines. Detailed structural characterization of transition trajectories is central to understanding the relationship between protein dynamics and function. Computational approaches that build on the Molecular Dynamics framework are in principle able to model transition trajectories at great detail but also at considerable computational cost. Methods that delay consideration of dynamics and focus instead on elucidating energetically-credible conformational paths connecting two functionally-relevant structures provide a complementary approach. Effective sampling-based path planning methods originating in robotics have been recently proposed to produce conformational paths. These methods largely model short peptides or address large proteins by simplifying conformational space. We propose a robotics-inspired method that connects two given structures of a protein by sampling conformational paths. The method focuses on small- to medium-size proteins, efficiently modeling structural deformations through the use of the molecular fragment replacement technique. In particular, the method grows a tree in conformational space rooted at the start structure, steering the tree to a goal region defined around the goal structure. We investigate various bias schemes over a progress coordinate for balance between coverage of conformational space and progress towards the goal. A geometric projection layer promotes path diversity. A reactive temperature scheme allows sampling of rare paths that cross energy barriers. Experiments are conducted on small- to medium-size proteins of length up to 214 amino acids and with multiple known functionally-relevant states, some of which are more than 13Å apart of each-other. Analysis reveals that the method effectively obtains conformational paths connecting structural states that are significantly different. A detailed analysis on the depth and breadth of the tree suggests that a soft global bias over the progress coordinate enhances sampling and results in higher path diversity. The explicit geometric projection layer that biases the exploration away from over-sampled regions further increases coverage, often improving proximity to the goal by forcing the exploration to find new paths. The reactive temperature scheme is shown effective in increasing path diversity, particularly in difficult structural transitions with known high-energy barriers.
The cystic fibrosis transmembrane conductance regulator (CFTR) and its stability.
Meng, Xin; Clews, Jack; Kargas, Vasileios; Wang, Xiaomeng; Ford, Robert C
2017-01-01
The cystic fibrosis transmembrane conductance regulator (CFTR) is responsible for the disease cystic fibrosis (CF). It is a membrane protein belonging to the ABC transporter family functioning as a chloride/anion channel in epithelial cells around the body. There are over 1500 mutations that have been characterised as CF-causing; the most common of these, accounting for ~70 % of CF cases, is the deletion of a phenylalanine at position 508. This leads to instability of the nascent protein and the modified structure is recognised and then degraded by the ER quality control mechanism. However, even pharmacologically 'rescued' F508del CFTR displays instability at the cell's surface, losing its channel function rapidly and it is rapidly removed from the plasma membrane for lysosomal degradation. This review will, therefore, explore the link between stability and structure/function relationships of membrane proteins and CFTR in particular and how approaches to study CFTR structure depend on its stability. We will also review the application of a fluorescence labelling method for the assessment of the thermostability and the tertiary structure of CFTR.
Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields.
Wang, Sheng; Peng, Jian; Ma, Jianzhu; Xu, Jinbo
2016-01-11
Protein secondary structure (SS) prediction is important for studying protein structure and function. When only the sequence (profile) information is used as input feature, currently the best predictors can obtain ~80% Q3 accuracy, which has not been improved in the past decade. Here we present DeepCNF (Deep Convolutional Neural Fields) for protein SS prediction. DeepCNF is a Deep Learning extension of Conditional Neural Fields (CNF), which is an integration of Conditional Random Fields (CRF) and shallow neural networks. DeepCNF can model not only complex sequence-structure relationship by a deep hierarchical architecture, but also interdependency between adjacent SS labels, so it is much more powerful than CNF. Experimental results show that DeepCNF can obtain ~84% Q3 accuracy, ~85% SOV score, and ~72% Q8 accuracy, respectively, on the CASP and CAMEO test proteins, greatly outperforming currently popular predictors. As a general framework, DeepCNF can be used to predict other protein structure properties such as contact number, disorder regions, and solvent accessibility.
Quantitative theory of hydrophobic effect as a driving force of protein structure
Perunov, Nikolay; England, Jeremy L
2014-01-01
Various studies suggest that the hydrophobic effect plays a major role in driving the folding of proteins. In the past, however, it has been challenging to translate this understanding into a predictive, quantitative theory of how the full pattern of sequence hydrophobicity in a protein shapes functionally important features of its tertiary structure. Here, we extend and apply such a phenomenological theory of the sequence-structure relationship in globular protein domains, which had previously been applied to the study of allosteric motion. In an effort to optimize parameters for the model, we first analyze the patterns of backbone burial found in single-domain crystal structures, and discover that classic hydrophobicity scales derived from bulk physicochemical properties of amino acids are already nearly optimal for prediction of burial using the model. Subsequently, we apply the model to studying structural fluctuations in proteins and establish a means of identifying ligand-binding and protein–protein interaction sites using this approach. PMID:24408023
Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields
NASA Astrophysics Data System (ADS)
Wang, Sheng; Peng, Jian; Ma, Jianzhu; Xu, Jinbo
2016-01-01
Protein secondary structure (SS) prediction is important for studying protein structure and function. When only the sequence (profile) information is used as input feature, currently the best predictors can obtain ~80% Q3 accuracy, which has not been improved in the past decade. Here we present DeepCNF (Deep Convolutional Neural Fields) for protein SS prediction. DeepCNF is a Deep Learning extension of Conditional Neural Fields (CNF), which is an integration of Conditional Random Fields (CRF) and shallow neural networks. DeepCNF can model not only complex sequence-structure relationship by a deep hierarchical architecture, but also interdependency between adjacent SS labels, so it is much more powerful than CNF. Experimental results show that DeepCNF can obtain ~84% Q3 accuracy, ~85% SOV score, and ~72% Q8 accuracy, respectively, on the CASP and CAMEO test proteins, greatly outperforming currently popular predictors. As a general framework, DeepCNF can be used to predict other protein structure properties such as contact number, disorder regions, and solvent accessibility.
Tertiary structural propensities reveal fundamental sequence/structure relationships.
Zheng, Fan; Zhang, Jian; Grigoryan, Gevorg
2015-05-05
Extracting useful generalizations from the continually growing Protein Data Bank (PDB) is of central importance. We hypothesize that the PDB contains valuable quantitative information on the level of local tertiary structural motifs (TERMs). We show that by breaking a protein structure into its constituent TERMs, and querying the PDB to characterize the natural ensemble matching each, we can estimate the compatibility of the structure with a given amino acid sequence through a metric we term "structure score." Considering submissions from recent Critical Assessment of Structure Prediction (CASP) experiments, we found a strong correlation (R = 0.69) between structure score and model accuracy, with poorly predicted regions readily identifiable. This performance exceeds that of leading atomistic statistical energy functions. Furthermore, TERM-based analysis of two prototypical multi-state proteins rapidly produced structural insights fully consistent with prior extensive experimental studies. We thus find that TERM-based analysis should have considerable utility for protein structural biology. Copyright © 2015 Elsevier Ltd. All rights reserved.
Girard, Eric; Marchal, Stéphane; Perez, Javier; Finet, Stéphanie; Kahn, Richard; Fourme, Roger; Marassio, Guillaume; Dhaussy, Anne-Claire; Prangé, Thierry; Giffard, Marion; Dulin, Fabienne; Bonneté, Françoise; Lange, Reinhard; Abraini, Jacques H.; Mezouar, Mohamed; Colloc'h, Nathalie
2010-01-01
Abstract Structure-function relationships in the tetrameric enzyme urate oxidase were investigated using pressure perturbation. As the active sites are located at the interfaces between monomers, enzyme activity is directly related to the integrity of the tetramer. The effect of hydrostatic pressure on the enzyme was investigated by x-ray crystallography, small-angle x-ray scattering, and fluorescence spectroscopy. Enzymatic activity was also measured under pressure and after decompression. A global model, consistent with all measurements, discloses structural and functional details of the pressure-induced dissociation of the tetramer. Before dissociating, the pressurized protein adopts a conformational substate characterized by an expansion of its substrate binding pocket at the expense of a large neighboring hydrophobic cavity. This substate should be adopted by the enzyme during its catalytic mechanism, where the active site has to accommodate larger intermediates and product. The approach, combining several high-pressure techniques, offers a new (to our knowledge) means of exploring structural and functional properties of transient states relevant to protein mechanisms. PMID:20483346
Engin, H. Billur; Guney, Emre; Keskin, Ozlem; Oliva, Baldo; Gursoy, Attila
2013-01-01
Blocking specific protein interactions can lead to human diseases. Accordingly, protein interactions and the structural knowledge on interacting surfaces of proteins (interfaces) have an important role in predicting the genotype-phenotype relationship. We have built the phenotype specific sub-networks of protein-protein interactions (PPIs) involving the relevant genes responsible for lung and brain metastasis from primary tumor in breast cancer. First, we selected the PPIs most relevant to metastasis causing genes (seed genes), by using the “guilt-by-association” principle. Then, we modeled structures of the interactions whose complex forms are not available in Protein Databank (PDB). Finally, we mapped mutations to interface structures (real and modeled), in order to spot the interactions that might be manipulated by these mutations. Functional analyses performed on these sub-networks revealed the potential relationship between immune system-infectious diseases and lung metastasis progression, but this connection was not observed significantly in the brain metastasis. Besides, structural analyses showed that some PPI interfaces in both metastasis sub-networks are originating from microbial proteins, which in turn were mostly related with cell adhesion. Cell adhesion is a key mechanism in metastasis, therefore these PPIs may be involved in similar molecular pathways that are shared by infectious disease and metastasis. Finally, by mapping the mutations and amino acid variations on the interface regions of the proteins in the metastasis sub-networks we found evidence for some mutations to be involved in the mechanisms differentiating the type of the metastasis. PMID:24278371
Pupo, Amaury; Baez-Nieto, David; Martínez, Agustín; Latorre, Ramón; González, Carlos
2014-01-01
Voltage-gated proton channels are integral membrane proteins with the capacity to permeate elementary particles in a voltage and pH dependent manner. These proteins have been found in several species and are involved in various physiological processes. Although their primary topology is known, lack of details regarding their structures in the open conformation has limited analyses toward a deeper understanding of the molecular determinants of their function and regulation. Consequently, the function-structure relationships have been inferred based on homology models. In the present work, we review the existing proton channel models, their assumptions, predictions and the experimental facts that support them. Modeling proton channels is not a trivial task due to the lack of a close homolog template. Hence, there are important differences between published models. This work attempts to critically review existing proton channel models toward the aim of contributing to a better understanding of the structural features of these proteins. PMID:24755912
Measuring and comparing structural fluctuation patterns in large protein datasets.
Fuglebakk, Edvin; Echave, Julián; Reuter, Nathalie
2012-10-01
The function of a protein depends not only on its structure but also on its dynamics. This is at the basis of a large body of experimental and theoretical work on protein dynamics. Further insight into the dynamics-function relationship can be gained by studying the evolutionary divergence of protein motions. To investigate this, we need appropriate comparative dynamics methods. The most used dynamical similarity score is the correlation between the root mean square fluctuations (RMSF) of aligned residues. Despite its usefulness, RMSF is in general less evolutionarily conserved than the native structure. A fundamental issue is whether RMSF is not as conserved as structure because dynamics is less conserved or because RMSF is not the best property to use to study its conservation. We performed a systematic assessment of several scores that quantify the (dis)similarity between protein fluctuation patterns. We show that the best scores perform as well as or better than structural dissimilarity, as assessed by their consistency with the SCOP classification. We conclude that to uncover the full extent of the evolutionary conservation of protein fluctuation patterns, it is important to measure the directions of fluctuations and their correlations between sites. Nathalie.Reuter@mbi.uib.no Supplementary data are available at Bioinformatics Online.
RepeatsDB-lite: a web server for unit annotation of tandem repeat proteins.
Hirsh, Layla; Paladin, Lisanna; Piovesan, Damiano; Tosatto, Silvio C E
2018-05-09
RepeatsDB-lite (http://protein.bio.unipd.it/repeatsdb-lite) is a web server for the prediction of repetitive structural elements and units in tandem repeat (TR) proteins. TRs are a widespread but poorly annotated class of non-globular proteins carrying heterogeneous functions. RepeatsDB-lite extends the prediction to all TR types and strongly improves the performance both in terms of computational time and accuracy over previous methods, with precision above 95% for solenoid structures. The algorithm exploits an improved TR unit library derived from the RepeatsDB database to perform an iterative structural search and assignment. The web interface provides tools for analyzing the evolutionary relationships between units and manually refine the prediction by changing unit positions and protein classification. An all-against-all structure-based sequence similarity matrix is calculated and visualized in real-time for every user edit. Reviewed predictions can be submitted to RepeatsDB for review and inclusion.
Chan, Yvonne H.; Venev, Sergey V.; Zeldovich, Konstantin B.; Matthews, C. Robert
2017-01-01
Sequence divergence of orthologous proteins enables adaptation to environmental stresses and promotes evolution of novel functions. Limits on evolution imposed by constraints on sequence and structure were explored using a model TIM barrel protein, indole-3-glycerol phosphate synthase (IGPS). Fitness effects of point mutations in three phylogenetically divergent IGPS proteins during adaptation to temperature stress were probed by auxotrophic complementation of yeast with prokaryotic, thermophilic IGPS. Analysis of beneficial mutations pointed to an unexpected, long-range allosteric pathway towards the active site of the protein. Significant correlations between the fitness landscapes of distant orthologues implicate both sequence and structure as primary forces in defining the TIM barrel fitness landscape and suggest that fitness landscapes can be translocated in sequence space. Exploration of fitness landscapes in the context of a protein fold provides a strategy for elucidating the sequence-structure-fitness relationships in other common motifs. PMID:28262665
Heras, Begoña; Totsika, Makrina; Peters, Kate M.; Paxman, Jason J.; Gee, Christine L.; Jarrott, Russell J.; Perugini, Matthew A.; Whitten, Andrew E.; Schembri, Mark A.
2014-01-01
Aggregation and biofilm formation are critical mechanisms for bacterial resistance to host immune factors and antibiotics. Autotransporter (AT) proteins, which represent the largest group of outer-membrane and secreted proteins in Gram-negative bacteria, contribute significantly to these phenotypes. Despite their abundance and role in bacterial pathogenesis, most AT proteins have not been structurally characterized, and there is a paucity of detailed information with regard to their mode of action. Here we report the structure–function relationships of Antigen 43 (Ag43a), a prototypic self-associating AT protein from uropathogenic Escherichia coli. The functional domain of Ag43a displays a twisted L-shaped β-helical structure firmly stabilized by a 3D hydrogen-bonded scaffold. Notably, the distinctive Ag43a L shape facilitates self-association and cell aggregation. Combining all our data, we define a molecular “Velcro-like” mechanism of AT-mediated bacterial clumping, which can be tailored to fit different bacterial lifestyles such as the formation of biofilms. PMID:24335802
Chen, Zhong-Yuan; Gao, Xiao-Chan; Zhang, Qi-Ya
2015-08-03
Aquareoviruses are serious pathogens of aquatic animals. Here, genome characterization and functional gene analysis of a novel aquareovirus, largemouth bass Micropterus salmoides reovirus (MsReV), was described. It comprises 11 dsRNA segments (S1-S11) covering 24,024 bp, and encodes 12 putative proteins including the inclusion forming-related protein NS87 and the fusion-associated small transmembrane (FAST) protein NS22. The function of NS22 was confirmed by expression in fish cells. Subsequently, MsReV was compared with two representative aquareoviruses, saltwater fish turbot Scophthalmus maximus reovirus (SMReV) and freshwater fish grass carp reovirus strain 109 (GCReV-109). MsReV NS87 and NS22 genes have the same structure and function with those of SMReV, whereas GCReV-109 is either missing the coiled-coil region in NS79 or the gene-encoding NS22. Significant similarities are also revealed among equivalent genome segments between MsReV and SMReV, but a difference is found between MsReV and GCReV-109. Furthermore, phylogenetic analysis showed that 13 aquareoviruses could be divided into freshwater and saline environments subgroups, and MsReV was closely related to SMReV in saline environments. Consequently, these viruses from hosts in saline environments have more genomic structural similarities than the viruses from hosts in freshwater. This is the first study of the relationships between aquareovirus genomic structure and their host environments.
Thermodynamic and structural characterization of an antibody gel
Esue, Osigwe; Xie, Anna X.; Kamerzell, Tim J.; Patapoff, Thomas W.
2013-01-01
Although extensively studied, protein–protein interactions remain highly elusive and are of increasing interest in drug development. We show the assembly of a monoclonal antibody, using multivalent carboxylate ions, into highly-ordered structures. While the presence and function of similar structures in vivo are not known, the results may present a possible unexplored area of antibody structure-function relationships. Using a variety of tools (e.g., mechanical rheology, electron microscopy, isothermal calorimetry, Fourier transform infrared spectroscopy), we characterized the physical, biochemical, and thermodynamic properties of these structures and found that citrate may interact directly with the amino acid residue histidine, after which the individual protein units assemble into a filamentous network gel exhibiting high elasticity and interfilament interactions. Citrate interacts exothermically with the monoclonal antibody with an association constant that is highly dependent on solution pH and temperature. Secondary structure analysis also reveals involvement of hydrophobic and aromatic residues. PMID:23425660
Virtual Interactomics of Proteins from Biochemical Standpoint
Kubrycht, Jaroslav; Sigler, Karel; Souček, Pavel
2012-01-01
Virtual interactomics represents a rapidly developing scientific area on the boundary line of bioinformatics and interactomics. Protein-related virtual interactomics then comprises instrumental tools for prediction, simulation, and networking of the majority of interactions important for structural and individual reproduction, differentiation, recognition, signaling, regulation, and metabolic pathways of cells and organisms. Here, we describe the main areas of virtual protein interactomics, that is, structurally based comparative analysis and prediction of functionally important interacting sites, mimotope-assisted and combined epitope prediction, molecular (protein) docking studies, and investigation of protein interaction networks. Detailed information about some interesting methodological approaches and online accessible programs or databases is displayed in our tables. Considerable part of the text deals with the searches for common conserved or functionally convergent protein regions and subgraphs of conserved interaction networks, new outstanding trends and clinically interesting results. In agreement with the presented data and relationships, virtual interactomic tools improve our scientific knowledge, help us to formulate working hypotheses, and they frequently also mediate variously important in silico simulations. PMID:22928109
Profiling Synaptic Proteins Identifies Regulators of Insulin Secretion and Lifespan
Kaplan, Joshua M.
2008-01-01
Cells are organized into distinct compartments to perform specific tasks with spatial precision. In neurons, presynaptic specializations are biochemically complex subcellular structures dedicated to neurotransmitter secretion. Activity-dependent changes in the abundance of presynaptic proteins are thought to endow synapses with different functional states; however, relatively little is known about the rules that govern changes in the composition of presynaptic terminals. We describe a genetic strategy to systematically analyze protein localization at Caenorhabditis elegans presynaptic specializations. Nine presynaptic proteins were GFP-tagged, allowing visualization of multiple presynaptic structures. Changes in the distribution and abundance of these proteins were quantified in 25 mutants that alter different aspects of neurotransmission. Global analysis of these data identified novel relationships between particular presynaptic components and provides a new method to compare gene functions by identifying shared protein localization phenotypes. Using this strategy, we identified several genes that regulate secretion of insulin-like growth factors (IGFs) and influence lifespan in a manner dependent on insulin/IGF signaling. PMID:19043554
Das, Debanu; Hervé, Mireille; Feuerhelm, Julie; Farr, Carol L.; Chiu, Hsiu-Ju; Elsliger, Marc-André; Knuth, Mark W.; Klock, Heath E.; Miller, Mitchell D.; Godzik, Adam; Lesley, Scott A.; Deacon, Ashley M.; Mengin-Lecreulx, Dominique; Wilson, Ian A.
2011-01-01
Bacterial cell walls contain peptidoglycan, an essential polymer made by enzymes in the Mur pathway. These proteins are specific to bacteria, which make them targets for drug discovery. MurC, MurD, MurE and MurF catalyze the synthesis of the peptidoglycan precursor UDP-N-acetylmuramoyl-L-alanyl-γ-D-glutamyl-meso-diaminopimelyl-D-alanyl-D-alanine by the sequential addition of amino acids onto UDP-N-acetylmuramic acid (UDP-MurNAc). MurC-F enzymes have been extensively studied by biochemistry and X-ray crystallography. In Gram-negative bacteria, ∼30–60% of the bacterial cell wall is recycled during each generation. Part of this recycling process involves the murein peptide ligase (Mpl), which attaches the breakdown product, the tripeptide L-alanyl-γ-D-glutamyl-meso-diaminopimelate, to UDP-MurNAc. We present the crystal structure at 1.65 Å resolution of a full-length Mpl from the permafrost bacterium Psychrobacter arcticus 273-4 (PaMpl). Although the Mpl structure has similarities to Mur enzymes, it has unique sequence and structure features that are likely related to its role in cell wall recycling, a function that differentiates it from the MurC-F enzymes. We have analyzed the sequence-structure relationships that are unique to Mpl proteins and compared them to MurC-F ligases. We have also characterized the biochemical properties of this enzyme (optimal temperature, pH and magnesium binding profiles and kinetic parameters). Although the structure does not contain any bound substrates, we have identified ∼30 residues that are likely to be important for recognition of the tripeptide and UDP-MurNAc substrates, as well as features that are unique to Psychrobacter Mpl proteins. These results provide the basis for future mutational studies for more extensive function characterization of the Mpl sequence-structure relationships. PMID:21445265
Das, Debanu; Hervé, Mireille; Feuerhelm, Julie; Farr, Carol L; Chiu, Hsiu-Ju; Elsliger, Marc-André; Knuth, Mark W; Klock, Heath E; Miller, Mitchell D; Godzik, Adam; Lesley, Scott A; Deacon, Ashley M; Mengin-Lecreulx, Dominique; Wilson, Ian A
2011-03-18
Bacterial cell walls contain peptidoglycan, an essential polymer made by enzymes in the Mur pathway. These proteins are specific to bacteria, which make them targets for drug discovery. MurC, MurD, MurE and MurF catalyze the synthesis of the peptidoglycan precursor UDP-N-acetylmuramoyl-L-alanyl-γ-D-glutamyl-meso-diaminopimelyl-D-alanyl-D-alanine by the sequential addition of amino acids onto UDP-N-acetylmuramic acid (UDP-MurNAc). MurC-F enzymes have been extensively studied by biochemistry and X-ray crystallography. In gram-negative bacteria, ∼30-60% of the bacterial cell wall is recycled during each generation. Part of this recycling process involves the murein peptide ligase (Mpl), which attaches the breakdown product, the tripeptide L-alanyl-γ-D-glutamyl-meso-diaminopimelate, to UDP-MurNAc. We present the crystal structure at 1.65 Å resolution of a full-length Mpl from the permafrost bacterium Psychrobacter arcticus 273-4 (PaMpl). Although the Mpl structure has similarities to Mur enzymes, it has unique sequence and structure features that are likely related to its role in cell wall recycling, a function that differentiates it from the MurC-F enzymes. We have analyzed the sequence-structure relationships that are unique to Mpl proteins and compared them to MurC-F ligases. We have also characterized the biochemical properties of this enzyme (optimal temperature, pH and magnesium binding profiles and kinetic parameters). Although the structure does not contain any bound substrates, we have identified ∼30 residues that are likely to be important for recognition of the tripeptide and UDP-MurNAc substrates, as well as features that are unique to Psychrobacter Mpl proteins. These results provide the basis for future mutational studies for more extensive function characterization of the Mpl sequence-structure relationships.
Duncan, R; Horne, D; Strong, J E; Leone, G; Pon, R T; Yeung, M C; Lee, P W
1991-06-01
We have been investigating structure-function relationships in the reovirus cell attachment protein sigma 1 using various deletion mutants and protease analysis. In the present study, a series of deletion mutants were constructed which lacked 90, 44, 30, 12, or 4 amino acids from the C-terminus of the 455-amino acid-long reovirus type 3 (T3) sigma 1 protein. The full-length and truncated sigma 1 proteins were expressed in an in vitro transcription/translation system and assayed for L cell binding activity. It was found that the removal of as few as four amino acids from the C-terminus drastically affected the cell binding function of the sigma 1 protein. The C-terminal-truncated proteins were further characterized using trypsin, chymotrypsin, and monoclonal and polyclonal antibodies. Our results indicated that the C-terminal portions of the mutant proteins were misfolded, leading to a loss in cell binding function. The N-terminal fibrous tail of the proteins was unaffected by the deletions as was sigma 1 oligomerization, further illustrating the discrete structural and functional roles of the N- and C-terminal domains of sigma 1. In an attempt to identify smaller, functional peptides, full-length sigma 1 expressed in vitro was digested with trypsin and subsequently with chymotrypsin under various conditions. The results clearly demonstrated the highly stable nature of the C-terminal globular head of sigma 1, even when separated from the N-terminal fibrous tail. We concluded that: (1) the C-terminal globular head of sigma 1 exists as a compact, protease-resistant oligomeric structure; (2) an intact C-terminus is required for proper head folding and generation of the conformationally dependent cell binding domain.
Rapid and Programmable Protein Mutagenesis Using Plasmid Recombineering.
Higgins, Sean A; Ouonkap, Sorel V Y; Savage, David F
2017-10-20
Comprehensive and programmable protein mutagenesis is critical for understanding structure-function relationships and improving protein function. There is thus a need for robust and unbiased molecular biological approaches for the construction of the requisite comprehensive protein libraries. Here we demonstrate that plasmid recombineering is a simple and robust in vivo method for the generation of protein mutants for both comprehensive library generation as well as programmable targeting of sequence space. Using the fluorescent protein iLOV as a model target, we build a complete mutagenesis library and find it to be specific and comprehensive, detecting 99.8% of our intended mutations. We then develop a thermostability screen and utilize our comprehensive mutation data to rapidly construct a targeted and multiplexed library that identifies significantly improved variants, thus demonstrating rapid protein engineering in a simple protocol.
Is the isolated ligand binding domain a good model of the domain in the native receptor?
Deming, Dustin; Cheng, Qing; Jayaraman, Vasanthi
2003-05-16
Numerous studies have used the atomic level structure of the isolated ligand binding domain of the glutamate receptor to elucidate the agonist-induced activation and desensitization processes in this group of proteins. However, no study has demonstrated the structural equivalence of the isolated ligand binding fragments and the protein in the native receptor. In this report, using visible absorption spectroscopy we show that the electronic environment of the antagonist 6-cyano-7-nitro-2,3-dihydroxyquinoxaline is identical for the isolated protein and the native glutamate receptors expressed in cells. Our results hence establish that the local structure of the ligand binding site is the same in the two proteins and validate the detailed structure-function relationships that have been developed based on a comparison of the structure of the isolated ligand binding domain and electrophysiological consequences in the native receptor.
Probing receptor structure/function with chimeric G-protein-coupled receptors.
Yin, Dezhong; Gavi, Shai; Wang, Hsien-yu; Malbon, Craig C
2004-06-01
Owing its name to an image borrowed from Greek mythology, a chimera is seen to represent a new entity created as a composite from existing creatures or, in this case, molecules. Making use of various combinations of three basic domains of the receptors (i.e., exofacial, transmembrane, and cytoplasmic segments) that couple agonist binding into activation of effectors through heterotrimeric G-proteins, molecular pharmacology has probed the basic organization, structure/function relationships of this superfamily of heptahelical receptors. Chimeric G-protein-coupled receptors obviate the need for a particular agonist ligand when the ligand is resistant to purification or, in the case of orphan receptors, is not known. Chimeric receptors created from distant members of the heptahelical receptors enable new strategies in understanding how these receptors transduce agonist binding into receptor activation and may be able to offer insights into the evolution of G-protein-coupled receptors from yeast to humans.
Watching proteins function with 150-ps time-resolved X-ray crystallography
NASA Astrophysics Data System (ADS)
Anfinrud, Philip
2007-03-01
We have used time-resolved Laue crystallography to characterize ligand migration pathways and dynamics in wild-type and several mutant forms of myoglobin (Mb), a ligand-binding heme protein found in muscle tissue. In these pump-probe experiments, which were conducted on the ID09B time-resolved beamline at the European Synchrotron and Radiation Facility, a laser pulse photodissociates CO from an MbCO crystal and a suitably delayed X-ray pulse probes its structure via Laue diffraction. Single-site mutations in the vicinity of the heme pocket docking site were found to have a dramatic effect on ligand migration. To visualize this process, time-resolved electron density maps were stitched together into movies that unveil with <2-å spatial resolution and 150-ps time-resolution the correlated protein motions that accompany and/or mediate ligand migration. These studies help to illustrate at an atomic level relationships between protein structure, dynamics, and function.
Protein intrinsic disorder in plants.
Pazos, Florencio; Pietrosemoli, Natalia; García-Martín, Juan A; Solano, Roberto
2013-09-12
To some extent contradicting the classical paradigm of the relationship between protein 3D structure and function, now it is clear that large portions of the proteomes, especially in higher organisms, lack a fixed structure and still perform very important functions. Proteins completely or partially unstructured in their native (functional) form are involved in key cellular processes underlain by complex networks of protein interactions. The intrinsic conformational flexibility of these disordered proteins allows them to bind multiple partners in transient interactions of high specificity and low affinity. In concordance, in plants this type of proteins has been found in processes requiring these complex and versatile interaction networks. These include transcription factor networks, where disordered proteins act as integrators of different signals or link different transcription factor subnetworks due to their ability to interact (in many cases simultaneously) with different partners. Similarly, they also serve as signal integrators in signaling cascades, such as those related to response to external stimuli. Disordered proteins have also been found in plants in many stress-response processes, acting as protein chaperones or protecting other cellular components and structures. In plants, it is especially important to have complex and versatile networks able to quickly and efficiently respond to changing environmental conditions since these organisms cannot escape and have no other choice than adapting to them. Consequently, protein disorder can play an especially important role in plants, providing them with a fast mechanism to obtain complex, interconnected and versatile molecular networks.
Protein intrinsic disorder in plants
Pazos, Florencio; Pietrosemoli, Natalia; García-Martín, Juan A.; Solano, Roberto
2013-01-01
To some extent contradicting the classical paradigm of the relationship between protein 3D structure and function, now it is clear that large portions of the proteomes, especially in higher organisms, lack a fixed structure and still perform very important functions. Proteins completely or partially unstructured in their native (functional) form are involved in key cellular processes underlain by complex networks of protein interactions. The intrinsic conformational flexibility of these disordered proteins allows them to bind multiple partners in transient interactions of high specificity and low affinity. In concordance, in plants this type of proteins has been found in processes requiring these complex and versatile interaction networks. These include transcription factor networks, where disordered proteins act as integrators of different signals or link different transcription factor subnetworks due to their ability to interact (in many cases simultaneously) with different partners. Similarly, they also serve as signal integrators in signaling cascades, such as those related to response to external stimuli. Disordered proteins have also been found in plants in many stress-response processes, acting as protein chaperones or protecting other cellular components and structures. In plants, it is especially important to have complex and versatile networks able to quickly and efficiently respond to changing environmental conditions since these organisms cannot escape and have no other choice than adapting to them. Consequently, protein disorder can play an especially important role in plants, providing them with a fast mechanism to obtain complex, interconnected and versatile molecular networks. PMID:24062761
Biophysics of protein evolution and evolutionary protein biophysics
Sikosek, Tobias; Chan, Hue Sun
2014-01-01
The study of molecular evolution at the level of protein-coding genes often entails comparing large datasets of sequences to infer their evolutionary relationships. Despite the importance of a protein's structure and conformational dynamics to its function and thus its fitness, common phylogenetic methods embody minimal biophysical knowledge of proteins. To underscore the biophysical constraints on natural selection, we survey effects of protein mutations, highlighting the physical basis for marginal stability of natural globular proteins and how requirement for kinetic stability and avoidance of misfolding and misinteractions might have affected protein evolution. The biophysical underpinnings of these effects have been addressed by models with an explicit coarse-grained spatial representation of the polypeptide chain. Sequence–structure mappings based on such models are powerful conceptual tools that rationalize mutational robustness, evolvability, epistasis, promiscuous function performed by ‘hidden’ conformational states, resolution of adaptive conflicts and conformational switches in the evolution from one protein fold to another. Recently, protein biophysics has been applied to derive more accurate evolutionary accounts of sequence data. Methods have also been developed to exploit sequence-based evolutionary information to predict biophysical behaviours of proteins. The success of these approaches demonstrates a deep synergy between the fields of protein biophysics and protein evolution. PMID:25165599
Molecular structures guide the engineering of chromatin.
Tekel, Stefan J; Haynes, Karmella A
2017-07-27
Chromatin is a system of proteins, RNA, and DNA that interact with each other to organize and regulate genetic information within eukaryotic nuclei. Chromatin proteins carry out essential functions: packing DNA during cell division, partitioning DNA into sub-regions within the nucleus, and controlling levels of gene expression. There is a growing interest in manipulating chromatin dynamics for applications in medicine and agriculture. Progress in this area requires the identification of design rules for the chromatin system. Here, we focus on the relationship between the physical structure and function of chromatin proteins. We discuss key research that has elucidated the intrinsic properties of chromatin proteins and how this information informs design rules for synthetic systems. Recent work demonstrates that chromatin-derived peptide motifs are portable and in some cases can be customized to alter their function. Finally, we present a workflow for fusion protein design and discuss best practices for engineering chromatin to assist scientists in advancing the field of synthetic epigenetics. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Uncovering the structure-function relationship in spider silk
NASA Astrophysics Data System (ADS)
Yarger, Jeffery L.; Cherry, Brian R.; van der Vaart, Arjan
2018-03-01
All spiders produce protein-based biopolymer fibres that we call silk. The most studied of these silks is spider dragline silk, which is very tough and relatively abundant compared with other types of spider silks. Considerable research has been devoted to understanding the relationship between the molecular structure and mechanical properties of spider dragline silks. In this Review, we overview experimental and computational studies that have provided a wealth of detail at the molecular level on the highly conserved repetitive core and terminal regions of spider dragline silk. We also discuss the role of the nanocrystalline β-sheets and amorphous regions in determining the properties of spider silk fibres, endowing them with strength and elasticity. Additionally, we outline imaging techniques and modelling studies that elucidate the importance of the hierarchical structure of silk fibres at the molecular level. These insights into structure-function relationships can guide the reverse engineering of spider silk to enable the production of superior synthetic fibres.
Song, Jiangning; Li, Fuyi; Takemoto, Kazuhiro; Haffari, Gholamreza; Akutsu, Tatsuya; Chou, Kuo-Chen; Webb, Geoffrey I
2018-04-14
Determining the catalytic residues in an enzyme is critical to our understanding the relationship between protein sequence, structure, function, and enhancing our ability to design novel enzymes and their inhibitors. Although many enzymes have been sequenced, and their primary and tertiary structures determined, experimental methods for enzyme functional characterization lag behind. Because experimental methods used for identifying catalytic residues are resource- and labor-intensive, computational approaches have considerable value and are highly desirable for their ability to complement experimental studies in identifying catalytic residues and helping to bridge the sequence-structure-function gap. In this study, we describe a new computational method called PREvaIL for predicting enzyme catalytic residues. This method was developed by leveraging a comprehensive set of informative features extracted from multiple levels, including sequence, structure, and residue-contact network, in a random forest machine-learning framework. Extensive benchmarking experiments on eight different datasets based on 10-fold cross-validation and independent tests, as well as side-by-side performance comparisons with seven modern sequence- and structure-based methods, showed that PREvaIL achieved competitive predictive performance, with an area under the receiver operating characteristic curve and area under the precision-recall curve ranging from 0.896 to 0.973 and from 0.294 to 0.523, respectively. We demonstrated that this method was able to capture useful signals arising from different levels, leveraging such differential but useful types of features and allowing us to significantly improve the performance of catalytic residue prediction. We believe that this new method can be utilized as a valuable tool for both understanding the complex sequence-structure-function relationships of proteins and facilitating the characterization of novel enzymes lacking functional annotations. Copyright © 2018 Elsevier Ltd. All rights reserved.
The effect of oxidation on the mechanical response and microstructure of porcine aortas.
Stephen, Elizabeth A; Venkatasubramaniam, Arundhathi; Good, Theresa A; Topoleski, L D Timmie
2014-09-01
Reactive oxygen species (ROS), a product of many cellular functions, has been implicated in many age-related pathophysiological processes, including cardiovascular disease. The arterial proteins collagen and elastin may also undergo structural and functional changes due to damage caused by ROS. This study examined the effect of oxidation on the mechanical response of porcine aortas and aorta elastin and the associated changes in structural protein ultrastructure as a step in exploring the role of molecular changes in structural proteins with aging on elastic artery function. We examined the change in mechanical properties of aorta samples after various oxidation times as a first step in understanding how the oxidative environment associated with aging could impact mechanical properties of arterial structural proteins. We used confocal microscopy to visualize how the microstructure of isolated elastin changed with oxidation. We find that short term oxidation of elastin isolated from aortas leads to an increase in material stiffness, but also an increase in the fiber diameter, increase in void space in the matrix, and a decrease in the fiber orientation, possibly due to fiber cross-linking. The short term effects of oxidation on arterial collagen is more complex, with increase in material stiffness seen in the collagen region of the stress stretch curve at low extents of oxidation, but not at high levels of oxidation. These results may provide insight into the relationship between oxidative damage to tissue associated with aging and disease, structure of the arterial proteins elastin and collagen, and arterial mechanical properties and function. © 2013 Wiley Periodicals, Inc.
BAYESIAN PROTEIN STRUCTURE ALIGNMENT.
Rodriguez, Abel; Schmidler, Scott C
The analysis of the three-dimensional structure of proteins is an important topic in molecular biochemistry. Structure plays a critical role in defining the function of proteins and is more strongly conserved than amino acid sequence over evolutionary timescales. A key challenge is the identification and evaluation of structural similarity between proteins; such analysis can aid in understanding the role of newly discovered proteins and help elucidate evolutionary relationships between organisms. Computational biologists have developed many clever algorithmic techniques for comparing protein structures, however, all are based on heuristic optimization criteria, making statistical interpretation somewhat difficult. Here we present a fully probabilistic framework for pairwise structural alignment of proteins. Our approach has several advantages, including the ability to capture alignment uncertainty and to estimate key "gap" parameters which critically affect the quality of the alignment. We show that several existing alignment methods arise as maximum a posteriori estimates under specific choices of prior distributions and error models. Our probabilistic framework is also easily extended to incorporate additional information, which we demonstrate by including primary sequence information to generate simultaneous sequence-structure alignments that can resolve ambiguities obtained using structure alone. This combined model also provides a natural approach for the difficult task of estimating evolutionary distance based on structural alignments. The model is illustrated by comparison with well-established methods on several challenging protein alignment examples.
Characterizing Conformational Dynamics of Proteins Using Evolutionary Couplings.
Feng, Jiangyan; Shukla, Diwakar
2018-01-25
Understanding of protein conformational dynamics is essential for elucidating molecular origins of protein structure-function relationship. Traditionally, reaction coordinates, i.e., some functions of protein atom positions and velocities have been used to interpret the complex dynamics of proteins obtained from experimental and computational approaches such as molecular dynamics simulations. However, it is nontrivial to identify the reaction coordinates a priori even for small proteins. Here, we evaluate the power of evolutionary couplings (ECs) to capture protein dynamics by exploring their use as reaction coordinates, which can efficiently guide the sampling of a conformational free energy landscape. We have analyzed 10 diverse proteins and shown that a few ECs are sufficient to characterize complex conformational dynamics of proteins involved in folding and conformational change processes. With the rapid strides in sequencing technology, we expect that ECs could help identify reaction coordinates a priori and enhance the sampling of the slow dynamical process associated with protein folding and conformational change.
Bioinformatics analysis for structure and function of CPR of Plasmodium falciparum.
Fan, Zhigang; Zhang, Lingmin; Yan, Guogang; Wu, Qiang; Gan, Xiufeng; Zhong, Saifeng; Lin, Guifen
2011-02-01
To analyse the structure and function of NADPH-cytochrome p450 reductase (CYPOR or CPR) from Plasmodium falciparum (Pf), and to predict its' drug target and vaccine target. The structure, function, drug target and vaccine target of CPR from Plasmodium falciparum were analyzed and predicted by bioinformatics methods. PfCPR, which was older CPR, had close relationship with the CPR from other Plasmodium species, but it was distant from its hosts, such as Homo sapiens and Anopheles. PfCPR was located in the cellular nucleus of Plasmodium falciparum. 335aa-352aa and 591aa - 608aa were inserted the interior side of the nuclear membrane, while 151aa-265aa was located in the nucleolus organizer regions. PfCPR had 40 function sites and 44 protein-protein binding sites in amino acid sequence. The teriary structure of 1aa-700aa was forcep-shaped with wings. 15 segments of PfCPR had no homology with Homo sapien CPR and most were exposed on the surface of the protein. These segments had 25 protein-protein binding sites. While 13 other segments all possessed function sites. The evolution or genesis of Plasmodium falciparum is earlier than those of Homo sapiens. PfCPR is a possible resistance site of antimalarial drug and may involve immune evasion, which is associated with parasite of sporozoite in hepatocytes. PfCPR is unsuitable as vaccine target, but it has at least 13 ideal drug targets. Copyright © 2011 Hainan Medical College. Published by Elsevier B.V. All rights reserved.
Mutational Analysis of Escherichia coli MoeA: Two Functional Activities Map to the Active Site Cleft
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nichols,J.; Xiang, S.; Schindelin, H.
2007-01-01
The molybdenum cofactor is ubiquitous in nature, and the pathway for Moco biosynthesis is conserved in all three domains of life. Recent work has helped to illuminate one of the most enigmatic steps in Moco biosynthesis, ligation of metal to molybdopterin (the organic component of the cofactor) to form the active cofactor. In Escherichia coli, the MoeA protein mediates ligation of Mo to molybdopterin while the MogA protein enhances this process in an ATP-dependent manner. The X-ray crystal structures for both proteins have been previously described as well as two essential MogA residues, Asp49 and Asp82. Here we describe amore » detailed mutational analysis of the MoeA protein. Variants of conserved residues at the putative active site of MoeA were analyzed for a loss of function in two different, previously described assays, one employing moeA{sup -} crude extracts and the other utilizing a defined system. Oddly, no correlation was observed between the activity in the two assays. In fact, our results showed a general trend toward an inverse relationship between the activity in each assay. Moco binding studies indicated a strong correlation between a variant's ability to bind Moco and its activity in the purified component assay. Crystal structures of the functionally characterized MoeA variants revealed no major structural changes, indicating that the functional differences observed are not due to disruption of the protein structure. On the basis of these results, two different functional areas were assigned to regions at or near the MoeA active site cleft.« less
Structural features that predict real-value fluctuations of globular proteins.
Jamroz, Michal; Kolinski, Andrzej; Kihara, Daisuke
2012-05-01
It is crucial to consider dynamics for understanding the biological function of proteins. We used a large number of molecular dynamics (MD) trajectories of nonhomologous proteins as references and examined static structural features of proteins that are most relevant to fluctuations. We examined correlation of individual structural features with fluctuations and further investigated effective combinations of features for predicting the real value of residue fluctuations using the support vector regression (SVR). It was found that some structural features have higher correlation than crystallographic B-factors with fluctuations observed in MD trajectories. Moreover, SVR that uses combinations of static structural features showed accurate prediction of fluctuations with an average Pearson's correlation coefficient of 0.669 and a root mean square error of 1.04 Å. This correlation coefficient is higher than the one observed in predictions by the Gaussian network model (GNM). An advantage of the developed method over the GNMs is that the former predicts the real value of fluctuation. The results help improve our understanding of relationships between protein structure and fluctuation. Furthermore, the developed method provides a convienient practial way to predict fluctuations of proteins using easily computed static structural features of proteins. Copyright © 2012 Wiley Periodicals, Inc.
Structural features that predict real-value fluctuations of globular proteins
Jamroz, Michal; Kolinski, Andrzej; Kihara, Daisuke
2012-01-01
It is crucial to consider dynamics for understanding the biological function of proteins. We used a large number of molecular dynamics trajectories of non-homologous proteins as references and examined static structural features of proteins that are most relevant to fluctuations. We examined correlation of individual structural features with fluctuations and further investigated effective combinations of features for predicting the real-value of residue fluctuations using the support vector regression. It was found that some structural features have higher correlation than crystallographic B-factors with fluctuations observed in molecular dynamics trajectories. Moreover, support vector regression that uses combinations of static structural features showed accurate prediction of fluctuations with an average Pearson’s correlation coefficient of 0.669 and a root mean square error of 1.04 Å. This correlation coefficient is higher than the one observed for the prediction by the Gaussian network model. An advantage of the developed method over the Gaussian network models is that the former predicts the real-value of fluctuation. The results help improve our understanding of relationships between protein structure and fluctuation. Furthermore, the developed method provides a convienient practial way to predict fluctuations of proteins using easily computed static structural features of proteins. PMID:22328193
The dual role of fragments in fragment-assembly methods for de novo protein structure prediction
Handl, Julia; Knowles, Joshua; Vernon, Robert; Baker, David; Lovell, Simon C.
2013-01-01
In fragment-assembly techniques for protein structure prediction, models of protein structure are assembled from fragments of known protein structures. This process is typically guided by a knowledge-based energy function and uses a heuristic optimization method. The fragments play two important roles in this process: they define the set of structural parameters available, and they also assume the role of the main variation operators that are used by the optimiser. Previous analysis has typically focused on the first of these roles. In particular, the relationship between local amino acid sequence and local protein structure has been studied by a range of authors. The correlation between the two has been shown to vary with the window length considered, and the results of these analyses have informed directly the choice of fragment length in state-of-the-art prediction techniques. Here, we focus on the second role of fragments and aim to determine the effect of fragment length from an optimization perspective. We use theoretical analyses to reveal how the size and structure of the search space changes as a function of insertion length. Furthermore, empirical analyses are used to explore additional ways in which the size of the fragment insertion influences the search both in a simulation model and for the fragment-assembly technique, Rosetta. PMID:22095594
Barber-Zucker, Shiran; Uebe, René; Davidov, Geula; Navon, Yotam; Sherf, Dror; Chill, Jordan H.; Kass, Itamar; Bitton, Ronit; Schüler, Dirk; Zarivach, Raz
2016-01-01
Cation diffusion facilitators (CDF) are highly conserved, metal ion efflux transporters that maintain divalent transition metal cation homeostasis. Most CDF proteins contain two domains, the cation transporting transmembrane domain and the regulatory cytoplasmic C-terminal domain (CTD). MamM is a magnetosome-associated CDF protein essential for the biomineralization of magnetic iron-oxide particles in magnetotactic bacteria. To investigate the structure-function relationship of CDF cytoplasmic domains, we characterized a MamM M250P mutation that is synonymous with the disease-related mutation L349P of the human CDF protein ZnT-10. Our results show that the M250P exchange in MamM causes severe structural changes in its CTD resulting in abnormal reduced function. Our in vivo, in vitro and in silico studies indicate that the CTD fold is critical for CDF proteins’ proper function and support the previously suggested role of the CDF cytoplasmic domain as a CDF regulatory element. Based on our results, we also suggest a mechanism for the effects of the ZnT-10 L349P mutation in human. PMID:27550551
Domain organizations of modular extracellular matrix proteins and their evolution.
Engel, J
1996-11-01
Multidomain proteins which are composed of modular units are a rather recent invention of evolution. Domains are defined as autonomously folding regions of a protein, and many of them are similar in sequence and structure, indicating common ancestry. Their modular nature is emphasized by frequent repetitions in identical or in different proteins and by a large number of different combinations with other domains. The extracellular matrix is perhaps the largest biological system composed of modular mosaic proteins, and its astonishing complexity and diversity are based on them. A cluster of minireviews on modular proteins is being published in Matrix Biology. These deal with the evolution of modular proteins, the three-dimensional structure of domains and the ways in which these interact in a multidomain protein. They discuss structure-function relationships in calcium binding domains, collagen helices, alpha-helical coiled-coil domains and C-lectins. The present minireview is focused on some general aspects and serves as an introduction to the cluster.
Fc-fusion proteins and FcRn: structural insights for longer-lasting and more effective therapeutics
Rath, Timo; Baker, Kristi; Dumont, Jennifer A.; Peters, Robert T.; Jiang, Haiyan; Qiao, Shuo-Wang; Lencer, Wayne I.; Pierce, Glenn F.; Blumberg, Richard S.
2016-01-01
Nearly 350 IgG-based therapeutics are approved for clinical use or are under development for many diseases lacking adequate treatment options. These include molecularly engineered biologicals comprising the IgG Fc-domain fused to various effector molecules (so-called Fc-fusion proteins) that confer the advantages of IgG, including binding to the neonatal Fc receptor (FcRn) to facilitate in vivo stability, and the therapeutic benefit of the specific effector functions. Advances in IgG structure-function relationships and an understanding of FcRn biology have provided therapeutic opportunities for previously unapproachable diseases. This article discusses approved Fc-fusion therapeutics, novel Fc-fusion proteins and FcRn-dependent delivery approaches in development, and how engineering of the FcRn–Fc interaction can generate longer-lasting and more effective therapeutics. PMID:24156398
Dunwell, Jim M.; Khuri, Sawsan; Gane, Paul J.
2000-01-01
This review summarizes the recent discovery of the cupin superfamily (from the Latin term “cupa,” a small barrel) of functionally diverse proteins that initially were limited to several higher plant proteins such as seed storage proteins, germin (an oxalate oxidase), germin-like proteins, and auxin-binding protein. Knowledge of the three-dimensional structure of two vicilins, seed proteins with a characteristic β-barrel core, led to the identification of a small number of conserved residues and thence to the discovery of several microbial proteins which share these key amino acids. In particular, there is a highly conserved pattern of two histidine-containing motifs with a varied intermotif spacing. This cupin signature is found as a central component of many microbial proteins including certain types of phosphomannose isomerase, polyketide synthase, epimerase, and dioxygenase. In addition, the signature has been identified within the N-terminal effector domain in a subgroup of bacterial AraC transcription factors. As well as these single-domain cupins, this survey has identified other classes of two-domain bicupins including bacterial gentisate 1,2-dioxygenases and 1-hydroxy-2-naphthoate dioxygenases, fungal oxalate decarboxylases, and legume sucrose-binding proteins. Cupin evolution is discussed from the perspective of the structure-function relationships, using data from the genomes of several prokaryotes, especially Bacillus subtilis. Many of these functions involve aspects of sugar metabolism and cell wall synthesis and are concerned with responses to abiotic stress such as heat, desiccation, or starvation. Particular emphasis is also given to the oxalate-degrading enzymes from microbes, their biological significance, and their value in a range of medical and other applications. PMID:10704478
Computational approaches for drug discovery.
Hung, Che-Lun; Chen, Chi-Chun
2014-09-01
Cellular proteins are the mediators of multiple organism functions being involved in physiological mechanisms and disease. By discovering lead compounds that affect the function of target proteins, the target diseases or physiological mechanisms can be modulated. Based on knowledge of the ligand-receptor interaction, the chemical structures of leads can be modified to improve efficacy, selectivity and reduce side effects. One rational drug design technology, which enables drug discovery based on knowledge of target structures, functional properties and mechanisms, is computer-aided drug design (CADD). The application of CADD can be cost-effective using experiments to compare predicted and actual drug activity, the results from which can used iteratively to improve compound properties. The two major CADD-based approaches are structure-based drug design, where protein structures are required, and ligand-based drug design, where ligand and ligand activities can be used to design compounds interacting with the protein structure. Approaches in structure-based drug design include docking, de novo design, fragment-based drug discovery and structure-based pharmacophore modeling. Approaches in ligand-based drug design include quantitative structure-affinity relationship and pharmacophore modeling based on ligand properties. Based on whether the structure of the receptor and its interaction with the ligand are known, different design strategies can be seed. After lead compounds are generated, the rule of five can be used to assess whether these have drug-like properties. Several quality validation methods, such as cost function analysis, Fisher's cross-validation analysis and goodness of hit test, can be used to estimate the metrics of different drug design strategies. To further improve CADD performance, multi-computers and graphics processing units may be applied to reduce costs. © 2014 Wiley Periodicals, Inc.
Recombinant protein blends: silk beyond natural design.
Dinjaski, Nina; Kaplan, David L
2016-06-01
Recombinant DNA technology and new material concepts are shaping future directions in biomaterial science for the design and production of the next-generation biomaterial platforms. Aside from conventionally used synthetic polymers, numerous natural biopolymers (e.g., silk, elastin, collagen, gelatin, alginate, cellulose, keratin, chitin, polyhydroxyalkanoates) have been investigated for properties and manipulation via bioengineering. Genetic engineering provides a path to increase structural and functional complexity of these biopolymers, and thereby expand the catalog of available biomaterials beyond that which exists in nature. In addition, the integration of experimental approaches with computational modeling to analyze sequence-structure-function relationships is starting to have an impact in the field by establishing predictive frameworks for determining material properties. Herein, we review advances in recombinant DNA-mediated protein production and functionalization approaches, with a focus on hybrids or combinations of proteins; recombinant protein blends or 'recombinamers'. We highlight the potential biomedical applications of fibrous protein recombinamers, such as Silk-Elastin Like Polypeptides (SELPs) and Silk-Bacterial Collagens (SBCs). We also discuss the possibility for the rationale design of fibrous proteins to build smart, stimuli-responsive biomaterials for diverse applications. We underline current limitations with production systems for these proteins and discuss the main trends in systems/synthetic biology that may improve recombinant fibrous protein design and production. Copyright © 2016. Published by Elsevier Ltd.
Mittag, Tanja; Marsh, Joseph; Grishaev, Alexander; Orlicky, Stephen; Lin, Hong; Sicheri, Frank; Tyers, Mike; Forman-Kay, Julie D.
2010-01-01
Summary Intrinsically disordered proteins can form highly dynamic complexes with partner proteins. One such dynamic complex involves the intrinsically disordered Sic1 with its partner Cdc4 in regulation of yeast cell cycle progression. Phosphorylation of six N-terminal Sic1 sites leads to equilibrium engagement of each phosphorylation site with the primary binding pocket in Cdc4, the substrate recognition subunit of a ubiquitin ligase. ENSEMBLE calculations utilizing experimental NMR and small-angle x-ray scattering data reveal significant transient structure in both phosphorylation states of the isolated ensembles (Sic1 and pSic1) that modulates their electrostatic potential, suggesting a structural basis for the proposed strong contribution of electrostatics to binding. A structural model of the dynamic pSic1-Cdc4 complex demonstrates the spatial arrangements in the ubiquitin ligase complex. These results provide a physical picture of a protein that is predominantly disordered in both its free and bound states, enabling aspects of its structure/function relationship to be elucidated. PMID:20399186
Döring, Clemens; Hussein, Mohamed A; Jekle, Mario; Becker, Thomas
2017-08-15
For rye dough structure, it is hypothesised that the presence of arabinoxylan hinders the proteins from forming a coherent network. This hypothesis was investigated using fluorescent-stained antibodies that bind to the arabinoxylan chains. Image analysis proves that the arabinoxylan surrounds the proteins, negatively affecting protein networking. Further, it is hypothesised that the dosing of xylanase and transglutaminase has a positive impact on rye dough and bread characteristics; the findings in this study evidenced that this increases the protein network by up to 38% accompanied by a higher volume rise of 10.67%, compared to standard rye dough. These outcomes combine a product-oriented and physiochemical design of a recipe, targeting structural and functional relationships, and demonstrate a successful methodology for enhancing rye bread quality. Copyright © 2017 Elsevier Ltd. All rights reserved.
Munteanu, Cristian R; Pedreira, Nieves; Dorado, Julián; Pazos, Alejandro; Pérez-Montoto, Lázaro G; Ubeira, Florencio M; González-Díaz, Humberto
2014-04-01
Lectins (Ls) play an important role in many diseases such as different types of cancer, parasitic infections and other diseases. Interestingly, the Protein Data Bank (PDB) contains +3000 protein 3D structures with unknown function. Thus, we can in principle, discover new Ls mining non-annotated structures from PDB or other sources. However, there are no general models to predict new biologically relevant Ls based on 3D chemical structures. We used the MARCH-INSIDE software to calculate the Markov-Shannon 3D electrostatic entropy parameters for the complex networks of protein structure of 2200 different protein 3D structures, including 1200 Ls. We have performed a Linear Discriminant Analysis (LDA) using these parameters as inputs in order to seek a new Quantitative Structure-Activity Relationship (QSAR) model, which is able to discriminate 3D structure of Ls from other proteins. We implemented this predictor in the web server named LECTINPred, freely available at http://bio-aims.udc.es/LECTINPred.php. This web server showed the following goodness-of-fit statistics: Sensitivity=96.7 % (for Ls), Specificity=87.6 % (non-active proteins), and Accuracy=92.5 % (for all proteins), considering altogether both the training and external prediction series. In mode 2, users can carry out an automatic retrieval of protein structures from PDB. We illustrated the use of this server, in operation mode 1, performing a data mining of PDB. We predicted Ls scores for +2000 proteins with unknown function and selected the top-scored ones as possible lectins. In operation mode 2, LECTINPred can also upload 3D structural models generated with structure-prediction tools like LOMETS or PHYRE2. The new Ls are expected to be of relevance as cancer biomarkers or useful in parasite vaccine design. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Transmembrane Helices Tilt, Bend, Slide, Torque, and Unwind between Functional States of Rhodopsin
Ren, Zhong; Ren, Peter X.; Balusu, Rohith; Yang, Xiaojing
2016-01-01
The seven-helical bundle of rhodopsin and other G-protein coupled receptors undergoes structural rearrangements as the transmembrane receptor protein is activated. These structural changes are known to involve tilting and bending of various transmembrane helices. However, the cause and effect relationship among structural events leading to a cytoplasmic crevasse for G-protein binding is less well defined. Here we present a mathematical model of the protein helix and a simple procedure to determine multiple parameters that offer precise depiction of a helical conformation. A comprehensive survey of bovine rhodopsin structures shows that the helical rearrangements during the activation of rhodopsin involve a variety of angular and linear motions such as torsion, unwinding, and sliding in addition to the previously reported tilting and bending. These hitherto undefined motion components unify the results obtained from different experimental approaches, and demonstrate conformational similarity between the active opsin structure and the photoactivated structures in crystallo near the retinal anchor despite their marked differences. PMID:27658480
Phylogeny-dominant classification of J-proteins in Arabidopsis thaliana and Brassica oleracea.
Zhang, Bin; Qiu, Han-Lin; Qu, Dong-Hai; Ruan, Ying; Chen, Dong-Hong
2018-04-05
Hsp40s or DnaJ/J-proteins are evolutionarily conserved in all organisms as co-chaperones of molecular chaperone HSP70s that mainly participate in maintaining cellular protein homeostasis, such as protein folding, assembly, stabilization, and translocation under normal conditions as well as refolding and degradation under environmental stresses. It has been reported that Arabidopsis J-proteins are classified into four classes (types A-D) according to domain organization, but their phylogenetic relationships are unknown. Here, we identified 129 J-proteins in the world-wide popular vegetable Brassica oleracea, a close relative of the model plant Arabidopsis, and also revised the information of Arabidopsis J-proteins based on the latest online bioresources. According to phylogenetic analysis with domain organization and gene structure as references, the J-proteins from Arabidopsis and B. oleracea were classified into 15 main clades (I-XV) separated by a number of undefined small branches with remote relationship. Based on the number of members, they respectively belong to multigene clades, oligo-gene clades, and mono-gene clades. The J-protein genes from different clades may function together or separately to constitute a complicated regulatory network. This study provides a constructive viewpoint for J-protein classification and an informative platform for further functional dissection and resistant genes discovery related to genetic improvement of crop plants.
NASA Astrophysics Data System (ADS)
Cross, Sarah E.; Kreth, Jens; Zhu, Lin; Qi, Fengxia; Pelling, Andrew E.; Shi, Wenyuan; Gimzewski, James K.
2006-02-01
Atomic force microscopy (AFM) has garnered much interest in recent years for its ability to probe the structure, function and cellular nanomechanics inherent to specific biological cells. In particular, we have used AFM to probe the important structure-function relationships of the bacterium Streptococcus mutans. S. mutans is the primary aetiological agent in human dental caries (tooth decay), and is of medical importance due to the virulence properties of these cells in biofilm initiation and formation, leading to increased tolerance to antibiotics. We have used AFM to characterize the unique surface structures of distinct mutants of S. mutans. These mutations are located in specific genes that encode surface proteins, thus using AFM we have resolved characteristic surface features for mutant strains compared to the wild type. Ultimately, our characterization of surface morphology has shown distinct differences in the local properties displayed by various S. mutans strains on the nanoscale, which is imperative for understanding the collective properties of these cells in biofilm formation.
Protein Conformational Populations and Functionally Relevant Sub-states
DOE Office of Scientific and Technical Information (OSTI.GOV)
Agarwal, Pratul K; Burger, Virginia; Savol, Andrej
2013-01-01
Functioning proteins do not remain fixed in a unique structure, but instead they sample a range of conformations facilitated by motions within the protein. Even in the native state, a protein exists as a collection of interconverting conformations driven by thermodynamic fluctuations. Motions on the fast time scale allow a protein to sample conformations in the nearby area of its conformational landscape, while motions on slower time scales give it access to conformations in distal areas of the landscape. Emerging evidence indicates that protein landscapes contain conformational substates with dynamic and structural features that support the designated function of themore » protein. Nuclear magnetic resonance (NMR) experiments provide information about conformational ensembles of proteins. X-ray crystallography allows researchers to identify the most populated states along the landscape, and computational simulations give atom-level information about the conformational substates of different proteins. This ability to characterize and obtain quantitative information about the conformational substates and the populations of proteins within them is allowing researchers to better understand the relationship between protein structure and dynamics and the mechanisms of protein function. In this Account, we discuss recent developments and challenges in the characterization of functionally relevant conformational populations and substates of proteins. In some enzymes, the sampling of functionally relevant conformational substates is connected to promoting the overall mechanism of catalysis. For example, the conformational landscape of the enzyme dihydrofolate reductase has multiple substates, which facilitate the binding and the release of the cofactor and substrate and catalyze the hydride transfer. For the enzyme cyclophilin A, computational simulations reveal that the long time scale conformational fluctuations enable the enzyme to access conformational substates that allow it to attain the transition state, therefore promoting the reaction mechanism. In the long term, this emerging view of proteins with conformational substates has broad implications for improving our understanding of enzymes, enzyme engineering, and better drug design. Researchers have already used photoactivation to modulate protein conformations as a strategy to develop a hypercatalytic enzyme. In addition, the alteration of the conformational substates through binding of ligands at locations other than the active site provides the basis for the design of new medicines through allosteric modulation.« less
Conformational diversity analysis reveals three functional mechanisms in proteins
Fornasari, María Silvina
2017-01-01
Protein motions are a key feature to understand biological function. Recently, a large-scale analysis of protein conformational diversity showed a positively skewed distribution with a peak at 0.5 Å C-alpha root-mean-square-deviation (RMSD). To understand this distribution in terms of structure-function relationships, we studied a well curated and large dataset of ~5,000 proteins with experimentally determined conformational diversity. We searched for global behaviour patterns studying how structure-based features change among the available conformer population for each protein. This procedure allowed us to describe the RMSD distribution in terms of three main protein classes sharing given properties. The largest of these protein subsets (~60%), which we call “rigid” (average RMSD = 0.83 Å), has no disordered regions, shows low conformational diversity, the largest tunnels and smaller and buried cavities. The two additional subsets contain disordered regions, but with differential sequence composition and behaviour. Partially disordered proteins have on average 67% of their conformers with disordered regions, average RMSD = 1.1 Å, the highest number of hinges and the longest disordered regions. In contrast, malleable proteins have on average only 25% of disordered conformers and average RMSD = 1.3 Å, flexible cavities affected in size by the presence of disordered regions and show the highest diversity of cognate ligands. Proteins in each set are mostly non-homologous to each other, share no given fold class, nor functional similarity but do share features derived from their conformer population. These shared features could represent conformational mechanisms related with biological functions. PMID:28192432
Protein structure similarity from Principle Component Correlation analysis.
Zhou, Xiaobo; Chou, James; Wong, Stephen T C
2006-01-25
Owing to rapid expansion of protein structure databases in recent years, methods of structure comparison are becoming increasingly effective and important in revealing novel information on functional properties of proteins and their roles in the grand scheme of evolutionary biology. Currently, the structural similarity between two proteins is measured by the root-mean-square-deviation (RMSD) in their best-superimposed atomic coordinates. RMSD is the golden rule of measuring structural similarity when the structures are nearly identical; it, however, fails to detect the higher order topological similarities in proteins evolved into different shapes. We propose new algorithms for extracting geometrical invariants of proteins that can be effectively used to identify homologous protein structures or topologies in order to quantify both close and remote structural similarities. We measure structural similarity between proteins by correlating the principle components of their secondary structure interaction matrix. In our approach, the Principle Component Correlation (PCC) analysis, a symmetric interaction matrix for a protein structure is constructed with relationship parameters between secondary elements that can take the form of distance, orientation, or other relevant structural invariants. When using a distance-based construction in the presence or absence of encoded N to C terminal sense, there are strong correlations between the principle components of interaction matrices of structurally or topologically similar proteins. The PCC method is extensively tested for protein structures that belong to the same topological class but are significantly different by RMSD measure. The PCC analysis can also differentiate proteins having similar shapes but different topological arrangements. Additionally, we demonstrate that when using two independently defined interaction matrices, comparison of their maximum eigenvalues can be highly effective in clustering structurally or topologically similar proteins. We believe that the PCC analysis of interaction matrix is highly flexible in adopting various structural parameters for protein structure comparison.
Topology-function conservation in protein-protein interaction networks.
Davis, Darren; Yaveroğlu, Ömer Nebil; Malod-Dognin, Noël; Stojmirovic, Aleksandar; Pržulj, Nataša
2015-05-15
Proteins underlay the functioning of a cell and the wiring of proteins in protein-protein interaction network (PIN) relates to their biological functions. Proteins with similar wiring in the PIN (topology around them) have been shown to have similar functions. This property has been successfully exploited for predicting protein functions. Topological similarity is also used to guide network alignment algorithms that find similarly wired proteins between PINs of different species; these similarities are used to transfer annotation across PINs, e.g. from model organisms to human. To refine these functional predictions and annotation transfers, we need to gain insight into the variability of the topology-function relationships. For example, a function may be significantly associated with specific topologies, while another function may be weakly associated with several different topologies. Also, the topology-function relationships may differ between different species. To improve our understanding of topology-function relationships and of their conservation among species, we develop a statistical framework that is built upon canonical correlation analysis. Using the graphlet degrees to represent the wiring around proteins in PINs and gene ontology (GO) annotations to describe their functions, our framework: (i) characterizes statistically significant topology-function relationships in a given species, and (ii) uncovers the functions that have conserved topology in PINs of different species, which we term topologically orthologous functions. We apply our framework to PINs of yeast and human, identifying seven biological process and two cellular component GO terms to be topologically orthologous for the two organisms. © The Author 2015. Published by Oxford University Press.
[Relationships between venomous function and innate immune function].
Goyffon, Max; Saul, Frederick; Faure, Grazyna
2015-01-01
Venomous function is investigated in relation to innate immune function in two cases selected from scorpion venom and serpent venom. In the first case, structural analysis of scorpion toxins and defensins reveals a close interrelation between both functions (toxic and innate immune system function). In the second case, structural and functional studies of natural inhibitors of toxic snake venom phospholipases A2 reveal homology with components of the innate immune system, leading to a similar conclusion. Although there is a clear functional distinction between neurotoxins, which act by targeting membrane ion channels, and the circulating defensins which protect the organism from pathogens, the scorpion short toxins and defensins share a common protein folding scaffold with a conserved cysteine-stabilized alpha-beta motif of three disulfide bridges linking a short alpha helix and an antiparallel beta sheet. Genomic analysis suggests that these proteins share a common ancestor (long venom toxins were separated from an early gene family which gave rise to separate short toxin and defensin families). Furthermore, a scorpion toxin has been experimentally synthetized from an insect defensin, and an antibacterial scorpion peptide, androctonin (whose structure is similar to that of a cone snail venom toxin), was shown to have a similar high affinity for the postsynaptic acetylcholine receptor of Torpedo sp. Natural inhibitors of phospholipase A2 found in the blood of snakes are associated with the resistance of venomous snakes to their own highly neurotoxic venom proteins. Three classes of phospholipases A2 inhibitors (PLI-α, PLI-β, PLI-γ) have been identified. These inhibitors display diverse structural motifs related to innate immune proteins including carbohydrate recognition domains (CRD), leucine rich repeat domains (found in Toll-like receptors) and three finger domains, which clearly differentiate them from components of the adaptive immune system. Thus, in structure, function and phylogeny, venomous function in both vertebrates and invertebrates are clearly interrelated with innate immune function. © Société de Biologie, 2016.
Lammerding, Jan
2015-01-01
The nucleus is the distinguishing feature of eukaryotic cells. Until recently, it was often considered simply as a unique compartment containing the genetic information of the cell and associated machinery, without much attention to its structure and mechanical properties. This article provides compelling examples that illustrate how specific nuclear structures are associated with important cellular functions, and how defects in nuclear mechanics can cause a multitude of human diseases. During differentiation, embryonic stem cells modify their nuclear envelope composition and chromatin structure, resulting in stiffer nuclei that reflect decreased transcriptional plasticity. In contrast, neutrophils have evolved characteristic lobulated nuclei that increase their physical plasticity, enabling passage through narrow tissue spaces in their response to inflammation. Research on diverse cell types further demonstrates how induced nuclear deformations during cellular compression or stretch can modulate cellular function. Pathological examples of disturbed nuclear mechanics include the many diseases caused by mutations in the nuclear envelope proteins lamin A/C and associated proteins, as well as cancer cells that are often characterized by abnormal nuclear morphology. In this article, we will focus on determining the functional relationship between nuclear mechanics and cellular (dys-)function, describing the molecular changes associated with physiological and pathological examples, the resulting defects in nuclear mechanics, and the effects on cellular function. New insights into the close relationship between nuclear mechanics and cellular organization and function will yield a better understanding of normal biology and will offer new clues into therapeutic approaches to the various diseases associated with defective nuclear mechanics. PMID:23737203
2013-01-01
Background Many proteins tune their biological function by transitioning between different functional states, effectively acting as dynamic molecular machines. Detailed structural characterization of transition trajectories is central to understanding the relationship between protein dynamics and function. Computational approaches that build on the Molecular Dynamics framework are in principle able to model transition trajectories at great detail but also at considerable computational cost. Methods that delay consideration of dynamics and focus instead on elucidating energetically-credible conformational paths connecting two functionally-relevant structures provide a complementary approach. Effective sampling-based path planning methods originating in robotics have been recently proposed to produce conformational paths. These methods largely model short peptides or address large proteins by simplifying conformational space. Methods We propose a robotics-inspired method that connects two given structures of a protein by sampling conformational paths. The method focuses on small- to medium-size proteins, efficiently modeling structural deformations through the use of the molecular fragment replacement technique. In particular, the method grows a tree in conformational space rooted at the start structure, steering the tree to a goal region defined around the goal structure. We investigate various bias schemes over a progress coordinate for balance between coverage of conformational space and progress towards the goal. A geometric projection layer promotes path diversity. A reactive temperature scheme allows sampling of rare paths that cross energy barriers. Results and conclusions Experiments are conducted on small- to medium-size proteins of length up to 214 amino acids and with multiple known functionally-relevant states, some of which are more than 13Å apart of each-other. Analysis reveals that the method effectively obtains conformational paths connecting structural states that are significantly different. A detailed analysis on the depth and breadth of the tree suggests that a soft global bias over the progress coordinate enhances sampling and results in higher path diversity. The explicit geometric projection layer that biases the exploration away from over-sampled regions further increases coverage, often improving proximity to the goal by forcing the exploration to find new paths. The reactive temperature scheme is shown effective in increasing path diversity, particularly in difficult structural transitions with known high-energy barriers. PMID:24565158
Sequence repeats and protein structure
NASA Astrophysics Data System (ADS)
Hoang, Trinh X.; Trovato, Antonio; Seno, Flavio; Banavar, Jayanth R.; Maritan, Amos
2012-11-01
Repeats are frequently found in known protein sequences. The level of sequence conservation in tandem repeats correlates with their propensities to be intrinsically disordered. We employ a coarse-grained model of a protein with a two-letter amino acid alphabet, hydrophobic (H) and polar (P), to examine the sequence-structure relationship in the realm of repeated sequences. A fraction of repeated sequences comprises a distinct class of bad folders, whose folding temperatures are much lower than those of random sequences. Imperfection in sequence repetition improves the folding properties of the bad folders while deteriorating those of the good folders. Our results may explain why nature has utilized repeated sequences for their versatility and especially to design functional proteins that are intrinsically unstructured at physiological temperatures.
Doppelt-Azeroual, Olivia; Delfaud, François; Moriaud, Fabrice; de Brevern, Alexandre G
2010-04-01
Ligand-protein interactions are essential for biological processes, and precise characterization of protein binding sites is crucial to understand protein functions. MED-SuMo is a powerful technology to localize similar local regions on protein surfaces. Its heuristic is based on a 3D representation of macromolecules using specific surface chemical features associating chemical characteristics with geometrical properties. MED-SMA is an automated and fast method to classify binding sites. It is based on MED-SuMo technology, which builds a similarity graph, and it uses the Markov Clustering algorithm. Purine binding sites are well studied as drug targets. Here, purine binding sites of the Protein DataBank (PDB) are classified. Proteins potentially inhibited or activated through the same mechanism are gathered. Results are analyzed according to PROSITE annotations and to carefully refined functional annotations extracted from the PDB. As expected, binding sites associated with related mechanisms are gathered, for example, the Small GTPases. Nevertheless, protein kinases from different Kinome families are also found together, for example, Aurora-A and CDK2 proteins which are inhibited by the same drugs. Representative examples of different clusters are presented. The effectiveness of the MED-SMA approach is demonstrated as it gathers binding sites of proteins with similar structure-activity relationships. Moreover, an efficient new protocol associates structures absent of cocrystallized ligands to the purine clusters enabling those structures to be associated with a specific binding mechanism. Applications of this classification by binding mode similarity include target-based drug design and prediction of cross-reactivity and therefore potential toxic side effects.
Doppelt-Azeroual, Olivia; Delfaud, François; Moriaud, Fabrice; de Brevern, Alexandre G
2010-01-01
Ligand–protein interactions are essential for biological processes, and precise characterization of protein binding sites is crucial to understand protein functions. MED-SuMo is a powerful technology to localize similar local regions on protein surfaces. Its heuristic is based on a 3D representation of macromolecules using specific surface chemical features associating chemical characteristics with geometrical properties. MED-SMA is an automated and fast method to classify binding sites. It is based on MED-SuMo technology, which builds a similarity graph, and it uses the Markov Clustering algorithm. Purine binding sites are well studied as drug targets. Here, purine binding sites of the Protein DataBank (PDB) are classified. Proteins potentially inhibited or activated through the same mechanism are gathered. Results are analyzed according to PROSITE annotations and to carefully refined functional annotations extracted from the PDB. As expected, binding sites associated with related mechanisms are gathered, for example, the Small GTPases. Nevertheless, protein kinases from different Kinome families are also found together, for example, Aurora-A and CDK2 proteins which are inhibited by the same drugs. Representative examples of different clusters are presented. The effectiveness of the MED-SMA approach is demonstrated as it gathers binding sites of proteins with similar structure-activity relationships. Moreover, an efficient new protocol associates structures absent of cocrystallized ligands to the purine clusters enabling those structures to be associated with a specific binding mechanism. Applications of this classification by binding mode similarity include target-based drug design and prediction of cross-reactivity and therefore potential toxic side effects. PMID:20162627
Tamarozzi, Elvira Regina; Giuliatti, Silvana
2018-01-09
Intrinsic disorder is very important in the biological function of several proteins, and is directly linked to their foldability during interaction with their targets. There is a close relationship between the intrinsically disordered proteins and the process of carcinogenesis involving viral pathogens. Among these pathogens, we have highlighted the human papillomavirus (HPV) in this study. HPV is currently among the most common sexually transmitted infections, besides being the cause of several types of cancer. HPVs are divided into two groups, called high- and low-risk, based on their oncogenic potential. The high-risk HPV E6 protein has been the target of much research, in seeking treatments against HPV, due to its direct involvement in the process of cell cycle control. To understand the role of intrinsic disorder of the viral proteins in the oncogenic potential of different HPV types, the structural characteristics of intrinsically disordered regions of high and low-risk HPV E6 proteins were analyzed. In silico analyses of primary sequences, prediction of tertiary structures, and analyses of molecular dynamics allowed the observation of the behavior of such disordered regions in these proteins, thereby proving a direct relationship of structural variation with the degree of oncogenicity of HPVs. The results obtained may contribute to the development of new therapies, targeting the E6 oncoprotein, for the treatment of HPV-associated diseases.
Molecular tandem repeat strategy for elucidating mechanical properties of high-strength proteins
Jung, Huihun; Pena-Francesch, Abdon; Saadat, Alham; Sebastian, Aswathy; Kim, Dong Hwan; Hamilton, Reginald F.; Albert, Istvan; Allen, Benjamin D.; Demirel, Melik C.
2016-01-01
Many globular and structural proteins have repetitions in their sequences or structures. However, a clear relationship between these repeats and their contribution to the mechanical properties remains elusive. We propose a new approach for the design and production of synthetic polypeptides that comprise one or more tandem copies of a single unit with distinct amorphous and ordered regions. Our designed sequences are based on a structural protein produced in squid suction cups that has a segmented copolymer structure with amorphous and crystalline domains. We produced segmented polypeptides with varying repeat number, while keeping the lengths and compositions of the amorphous and crystalline regions fixed. We showed that mechanical properties of these synthetic proteins could be tuned by modulating their molecular weights. Specifically, the toughness and extensibility of synthetic polypeptides increase as a function of the number of tandem repeats. This result suggests that the repetitions in native squid proteins could have a genetic advantage for increased toughness and flexibility. PMID:27222581
NASA Astrophysics Data System (ADS)
Simon, Joseph R.; Carroll, Nick J.; Rubinstein, Michael; Chilkoti, Ashutosh; López, Gabriel P.
2017-06-01
Dynamic protein-rich intracellular structures that contain phase-separated intrinsically disordered proteins (IDPs) composed of sequences of low complexity (SLC) have been shown to serve a variety of important cellular functions, which include signalling, compartmentalization and stabilization. However, our understanding of these structures and our ability to synthesize models of them have been limited. We present design rules for IDPs possessing SLCs that phase separate into diverse assemblies within droplet microenvironments. Using theoretical analyses, we interpret the phase behaviour of archetypal IDP sequences and demonstrate the rational design of a vast library of multicomponent protein-rich structures that ranges from uniform nano-, meso- and microscale puncta (distinct protein droplets) to multilayered orthogonally phase-separated granular structures. The ability to predict and program IDP-rich assemblies in this fashion offers new insights into (1) genetic-to-molecular-to-macroscale relationships that encode hierarchical IDP assemblies, (2) design rules of such assemblies in cell biology and (3) molecular-level engineering of self-assembled recombinant IDP-rich materials.
Reinartz, Michael T; Kälble, Solveig; Littmann, Timo; Ozawa, Takeaki; Dove, Stefan; Kaever, Volkhard; Wainer, Irving W; Seifert, Roland
2015-01-01
Functional selectivity is well established as an underlying concept of ligand-specific signaling via G protein-coupled receptors (GPCRs). Functionally, selective drugs could show greater therapeutic efficacy and fewer adverse effects. Dual coupling of the β2-adrenoceptor (β2AR) triggers a signal transduction via Gsα and Giα proteins. Here, we examined 12 fenoterol stereoisomers in six molecular and cellular assays. Using β2AR-Gsα and β2AR-Giα fusion proteins, (R,S')- and (S,S')-isomers of 4'-methoxy-1-naphthyl-fenoterol were identified as biased ligands with preference for Gs. G protein-independent signaling via β-arrestin-2 was disfavored by these ligands. Isolated human neutrophils constituted an ex vivo model of β2AR signaling and demonstrated functional selectivity through the dissociation of cAMP accumulation and the inhibition of formyl peptide-stimulated production of reactive oxygen species. Ligand bias was calculated using an operational model of agonism and revealed that the fenoterol scaffold constitutes a promising lead structure for the development of Gs-biased β2AR agonists.
Membrane raft association is a determinant of plasma membrane localization.
Diaz-Rohrer, Blanca B; Levental, Kandice R; Simons, Kai; Levental, Ilya
2014-06-10
The lipid raft hypothesis proposes lateral domains driven by preferential interactions between sterols, sphingolipids, and specific proteins as a central mechanism for the regulation of membrane structure and function; however, experimental limitations in defining raft composition and properties have prevented unequivocal demonstration of their functional relevance. Here, we establish a quantitative, functional relationship between raft association and subcellular protein sorting. By systematic mutation of the transmembrane and juxtamembrane domains of a model transmembrane protein, linker for activation of T-cells (LAT), we generated a panel of variants possessing a range of raft affinities. These mutations revealed palmitoylation, transmembrane domain length, and transmembrane sequence to be critical determinants of membrane raft association. Moreover, plasma membrane (PM) localization was strictly dependent on raft partitioning across the entire panel of unrelated mutants, suggesting that raft association is necessary and sufficient for PM sorting of LAT. Abrogation of raft partitioning led to mistargeting to late endosomes/lysosomes because of a failure to recycle from early endosomes. These findings identify structural determinants of raft association and validate lipid-driven domain formation as a mechanism for endosomal protein sorting.
Membrane raft association is a determinant of plasma membrane localization
Diaz-Rohrer, Blanca B.; Levental, Kandice R.; Simons, Kai; Levental, Ilya
2014-01-01
The lipid raft hypothesis proposes lateral domains driven by preferential interactions between sterols, sphingolipids, and specific proteins as a central mechanism for the regulation of membrane structure and function; however, experimental limitations in defining raft composition and properties have prevented unequivocal demonstration of their functional relevance. Here, we establish a quantitative, functional relationship between raft association and subcellular protein sorting. By systematic mutation of the transmembrane and juxtamembrane domains of a model transmembrane protein, linker for activation of T-cells (LAT), we generated a panel of variants possessing a range of raft affinities. These mutations revealed palmitoylation, transmembrane domain length, and transmembrane sequence to be critical determinants of membrane raft association. Moreover, plasma membrane (PM) localization was strictly dependent on raft partitioning across the entire panel of unrelated mutants, suggesting that raft association is necessary and sufficient for PM sorting of LAT. Abrogation of raft partitioning led to mistargeting to late endosomes/lysosomes because of a failure to recycle from early endosomes. These findings identify structural determinants of raft association and validate lipid-driven domain formation as a mechanism for endosomal protein sorting. PMID:24912166
Knowledge Discovery in Variant Databases Using Inductive Logic Programming
Nguyen, Hoan; Luu, Tien-Dao; Poch, Olivier; Thompson, Julie D.
2013-01-01
Understanding the effects of genetic variation on the phenotype of an individual is a major goal of biomedical research, especially for the development of diagnostics and effective therapeutic solutions. In this work, we describe the use of a recent knowledge discovery from database (KDD) approach using inductive logic programming (ILP) to automatically extract knowledge about human monogenic diseases. We extracted background knowledge from MSV3d, a database of all human missense variants mapped to 3D protein structure. In this study, we identified 8,117 mutations in 805 proteins with known three-dimensional structures that were known to be involved in human monogenic disease. Our results help to improve our understanding of the relationships between structural, functional or evolutionary features and deleterious mutations. Our inferred rules can also be applied to predict the impact of any single amino acid replacement on the function of a protein. The interpretable rules are available at http://decrypthon.igbmc.fr/kd4v/. PMID:23589683
Knowledge discovery in variant databases using inductive logic programming.
Nguyen, Hoan; Luu, Tien-Dao; Poch, Olivier; Thompson, Julie D
2013-01-01
Understanding the effects of genetic variation on the phenotype of an individual is a major goal of biomedical research, especially for the development of diagnostics and effective therapeutic solutions. In this work, we describe the use of a recent knowledge discovery from database (KDD) approach using inductive logic programming (ILP) to automatically extract knowledge about human monogenic diseases. We extracted background knowledge from MSV3d, a database of all human missense variants mapped to 3D protein structure. In this study, we identified 8,117 mutations in 805 proteins with known three-dimensional structures that were known to be involved in human monogenic disease. Our results help to improve our understanding of the relationships between structural, functional or evolutionary features and deleterious mutations. Our inferred rules can also be applied to predict the impact of any single amino acid replacement on the function of a protein. The interpretable rules are available at http://decrypthon.igbmc.fr/kd4v/.
Fluorescein isothiocyanate-labeled human plasma fibronectin in extracellular matrix remodeling.
Hoffmann, Celine; Leroy-Dudal, Johanne; Patel, Salima; Gallet, Olivier; Pauthe, Emmanuel
2008-01-01
Fluorescein isothiocyanate (FITC) is a well-known probe for labeling biologically relevant proteins. However, the impact of the labeling procedure on protein structure and biological activities remains unclear. In this work, FITC-labeled human plasma fibronectin (Fn) was developed to gain insight into the dynamic relationship between cells and Fn. The similarities and differences concerning the structure and function between Fn-FITC and standard Fn were evaluated using biochemical as well as cellular approaches. By varying the FITC/Fn ratio, we demonstrated that overlabeling (>10 FITC molecules/Fn molecule) induces probe fluorescence quenching, protein aggregation, and cell growth modifications. A correct balance between reliable fluorescence for detection and no significant modifications to structure and biological function compared with standard Fn was obtained with a final ratio of 3 FITC molecules per Fn molecule (Fn-FITC3). Fn-FITC3, similar to standard Fn, is correctly recruited into the cell matrix network. Also, Fn-FITC3 is proposed to be a powerful molecular tool to investigate Fn organization and cellular behavior concomitantly.
Power law tails in phylogenetic systems.
Qin, Chongli; Colwell, Lucy J
2018-01-23
Covariance analysis of protein sequence alignments uses coevolving pairs of sequence positions to predict features of protein structure and function. However, current methods ignore the phylogenetic relationships between sequences, potentially corrupting the identification of covarying positions. Here, we use random matrix theory to demonstrate the existence of a power law tail that distinguishes the spectrum of covariance caused by phylogeny from that caused by structural interactions. The power law is essentially independent of the phylogenetic tree topology, depending on just two parameters-the sequence length and the average branch length. We demonstrate that these power law tails are ubiquitous in the large protein sequence alignments used to predict contacts in 3D structure, as predicted by our theory. This suggests that to decouple phylogenetic effects from the interactions between sequence distal sites that control biological function, it is necessary to remove or down-weight the eigenvectors of the covariance matrix with largest eigenvalues. We confirm that truncating these eigenvectors improves contact prediction.
ERIC Educational Resources Information Center
Wilder, Anna; Brinkerhoff, Jonathan
2007-01-01
This study assessed the effectiveness of computer-based biomolecular visualization activities on the development of high school biology students' representational competence as a means of understanding and visualizing protein structure/function relationships. Also assessed were students' attitudes toward these activities. Sixty-nine students…
Modulation of electronic structures of bases through DNA recognition of protein.
Hagiwara, Yohsuke; Kino, Hiori; Tateno, Masaru
2010-04-21
The effects of environmental structures on the electronic states of functional regions in a fully solvated DNA·protein complex were investigated using combined ab initio quantum mechanics/molecular mechanics calculations. A complex of a transcriptional factor, PU.1, and the target DNA was used for the calculations. The effects of solvent on the energies of molecular orbitals (MOs) of some DNA bases strongly correlate with the magnitude of masking of the DNA bases from the solvent by the protein. In the complex, PU.1 causes a variation in the magnitude among DNA bases by means of directly recognizing the DNA bases through hydrogen bonds and inducing structural changes of the DNA structure from the canonical one. Thus, the strong correlation found in this study is the first evidence showing the close quantitative relationship between recognition modes of DNA bases and the energy levels of the corresponding MOs. Thus, it has been revealed that the electronic state of each base is highly regulated and organized by the DNA recognition of the protein. Other biological macromolecular systems can be expected to also possess similar modulation mechanisms, suggesting that this finding provides a novel basis for the understanding for the regulation functions of biological macromolecular systems.
Kopitz, Jürgen; Vértesy, Sabine; André, Sabine; Fiedler, Sabine; Schnölzer, Martina; Gabius, Hans-Joachim
2014-09-01
Many human proteins have a modular design with receptor and structural domains. Using adhesion/growth-regulatory galectin-3 as model, we describe an interdisciplinary strategy to define the functional significance of its tail established by nine non-triple helical collagen-like repeats (I-IX) and the N-terminal peptide. Genetic engineering with sophisticated mass spectrometric product analysis provided the tools for biotesting, i.e. eight protein variants with different degrees of tail truncation. Evidently,various aspects of galectin-3 activity (cis binding and cell bridging) are affected by tail shortening in a different manner. Thus, this combined approach reveals an unsuspected complexity of structure-function relationship, encouraging further application beyond this chimera-type galectin. Copyright © 2014 Elsevier Masson SAS. All rights reserved.
Suplatov, Dmitry; Panin, Nikolay; Kirilin, Evgeny; Shcherbakova, Tatyana; Kudryavtsev, Pavel; Svedas, Vytas
2014-01-01
Protein stability provides advantageous development of novel properties and can be crucial in affording tolerance to mutations that introduce functionally preferential phenotypes. Consequently, understanding the determining factors for protein stability is important for the study of structure-function relationship and design of novel protein functions. Thermal stability has been extensively studied in connection with practical application of biocatalysts. However, little work has been done to explore the mechanism of pH-dependent inactivation. In this study, bioinformatic analysis of the Ntn-hydrolase superfamily was performed to identify functionally important subfamily-specific positions in protein structures. Furthermore, the involvement of these positions in pH-induced inactivation was studied. The conformational mobility of penicillin acylase in Escherichia coli was analyzed through molecular modeling in neutral and alkaline conditions. Two functionally important subfamily-specific residues, Gluβ482 and Aspβ484, were found. Ionization of these residues at alkaline pH promoted the collapse of a buried network of stabilizing interactions that consequently disrupted the functional protein conformation. The subfamily-specific position Aspβ484 was selected as a hotspot for mutation to engineer enzyme variant tolerant to alkaline medium. The corresponding Dβ484N mutant was produced and showed 9-fold increase in stability at alkaline conditions. Bioinformatic analysis of subfamily-specific positions can be further explored to study mechanisms of protein inactivation and to design more stable variants for the engineering of homologous Ntn-hydrolases with improved catalytic properties.
Suplatov, Dmitry; Panin, Nikolay; Kirilin, Evgeny; Shcherbakova, Tatyana; Kudryavtsev, Pavel; Švedas, Vytas
2014-01-01
Protein stability provides advantageous development of novel properties and can be crucial in affording tolerance to mutations that introduce functionally preferential phenotypes. Consequently, understanding the determining factors for protein stability is important for the study of structure-function relationship and design of novel protein functions. Thermal stability has been extensively studied in connection with practical application of biocatalysts. However, little work has been done to explore the mechanism of pH-dependent inactivation. In this study, bioinformatic analysis of the Ntn-hydrolase superfamily was performed to identify functionally important subfamily-specific positions in protein structures. Furthermore, the involvement of these positions in pH-induced inactivation was studied. The conformational mobility of penicillin acylase in Escherichia coli was analyzed through molecular modeling in neutral and alkaline conditions. Two functionally important subfamily-specific residues, Gluβ482 and Aspβ484, were found. Ionization of these residues at alkaline pH promoted the collapse of a buried network of stabilizing interactions that consequently disrupted the functional protein conformation. The subfamily-specific position Aspβ484 was selected as a hotspot for mutation to engineer enzyme variant tolerant to alkaline medium. The corresponding Dβ484N mutant was produced and showed 9-fold increase in stability at alkaline conditions. Bioinformatic analysis of subfamily-specific positions can be further explored to study mechanisms of protein inactivation and to design more stable variants for the engineering of homologous Ntn-hydrolases with improved catalytic properties. PMID:24959852
Development of studies of TPO gene and its application in nuclear medicine.
Xing, Y; Kuang, A
2003-08-01
Thyroperoxidase (TPO) is a glycosylated protein bound to the apical plasma membrane of thyrocytes. It is the key enzyme in the synthesis of thyroid hormones. Its gene structure and transcriptional regulation have been studied in detail. This article reviews the structure, function and transcriptional regulation of the TPO gene, and the relationship between TPO, thyroid diseases and radioactive iodide therapy.
Exhaustive comparison and classification of ligand-binding surfaces in proteins
Murakami, Yoichi; Kinoshita, Kengo; Kinjo, Akira R; Nakamura, Haruki
2013-01-01
Many proteins function by interacting with other small molecules (ligands). Identification of ligand-binding sites (LBS) in proteins can therefore help to infer their molecular functions. A comprehensive comparison among local structures of LBSs was previously performed, in order to understand their relationships and to classify their structural motifs. However, similar exhaustive comparison among local surfaces of LBSs (patches) has never been performed, due to computational complexity. To enhance our understanding of LBSs, it is worth performing such comparisons among patches and classifying them based on similarities of their surface configurations and electrostatic potentials. In this study, we first developed a rapid method to compare two patches. We then clustered patches corresponding to the same PDB chemical component identifier for a ligand, and selected a representative patch from each cluster. We subsequently exhaustively as compared the representative patches and clustered them using similarity score, PatSim. Finally, the resultant PatSim scores were compared with similarities of atomic structures of the LBSs and those of the ligand-binding protein sequences and functions. Consequently, we classified the patches into ∼2000 well-characterized clusters. We found that about 63% of these clusters are used in identical protein folds, although about 25% of the clusters are conserved in distantly related proteins and even in proteins with cross-fold similarity. Furthermore, we showed that patches with higher PatSim score have potential to be involved in similar biological processes. PMID:23934772
Medina-Carmona, Encarnación; Fuchs, Julian E; Gavira, Jose A; Mesa-Torres, Noel; Neira, Jose L; Salido, Eduardo; Palomino-Morales, Rogelio; Burgos, Miguel; Timson, David J; Pey, Angel L
2017-09-15
Human proteins are vulnerable towards disease-associated single amino acid replacements affecting protein stability and function. Interestingly, a few studies have shown that consensus amino acids from mammals or vertebrates can enhance protein stability when incorporated into human proteins. Here, we investigate yet unexplored relationships between the high vulnerability of human proteins towards disease-associated inactivation and recent evolutionary site-specific divergence of stabilizing amino acids. Using phylogenetic, structural and experimental analyses, we show that divergence from the consensus amino acids at several sites during mammalian evolution has caused local protein destabilization in two human proteins linked to disease: cancer-associated NQO1 and alanine:glyoxylate aminotransferase, mutated in primary hyperoxaluria type I. We demonstrate that a single consensus mutation (H80R) acts as a disease suppressor on the most common cancer-associated polymorphism in NQO1 (P187S). The H80R mutation reactivates P187S by enhancing FAD binding affinity through local and dynamic stabilization of its binding site. Furthermore, we show how a second suppressor mutation (E247Q) cooperates with H80R in protecting the P187S polymorphism towards inactivation through long-range allosteric communication within the structural ensemble of the protein. Our results support that recent divergence of consensus amino acids may have occurred with neutral effects on many functional and regulatory traits of wild-type human proteins. However, divergence at certain sites may have increased the propensity of some human proteins towards inactivation due to disease-associated mutations and polymorphisms. Consensus mutations also emerge as a potential strategy to identify structural hot-spots in proteins as targets for pharmacological rescue in loss-of-function genetic diseases. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Sinha, Rajeshwari; Khare, Sunil K
2014-01-01
Search for new industrial enzymes having novel properties continues to be a desirable pursuit in enzyme research. The halophilic organisms inhabiting under saline/ hypersaline conditions are considered as promising source of useful enzymes. Their enzymes are structurally adapted to perform efficient catalysis under saline environment wherein n0n-halophilic enzymes often lose their structure and activity. Haloenzymes have been documented to be polyextremophilic and withstand high temperature, pH, organic solvents, and chaotropic agents. However, this stability is modulated by salt. Although vast amount of information have been generated on salt mediated protection and structure function relationship in halophilic proteins, their clear understanding and correct perspective still remain incoherent. Furthermore, understanding their protein architecture may give better clue for engineering stable enzymes which can withstand harsh industrial conditions. The article encompasses the current level of understanding about haloadaptations and analyzes structural basis of their enzyme stability against classical denaturants.
Sinha, Rajeshwari; Khare, Sunil K.
2014-01-01
Search for new industrial enzymes having novel properties continues to be a desirable pursuit in enzyme research. The halophilic organisms inhabiting under saline/ hypersaline conditions are considered as promising source of useful enzymes. Their enzymes are structurally adapted to perform efficient catalysis under saline environment wherein n0n-halophilic enzymes often lose their structure and activity. Haloenzymes have been documented to be polyextremophilic and withstand high temperature, pH, organic solvents, and chaotropic agents. However, this stability is modulated by salt. Although vast amount of information have been generated on salt mediated protection and structure function relationship in halophilic proteins, their clear understanding and correct perspective still remain incoherent. Furthermore, understanding their protein architecture may give better clue for engineering stable enzymes which can withstand harsh industrial conditions. The article encompasses the current level of understanding about haloadaptations and analyzes structural basis of their enzyme stability against classical denaturants. PMID:24782853
Protein interactions and ligand binding: from protein subfamilies to functional specificity.
Rausell, Antonio; Juan, David; Pazos, Florencio; Valencia, Alfonso
2010-02-02
The divergence accumulated during the evolution of protein families translates into their internal organization as subfamilies, and it is directly reflected in the characteristic patterns of differentially conserved residues. These specifically conserved positions in protein subfamilies are known as "specificity determining positions" (SDPs). Previous studies have limited their analysis to the study of the relationship between these positions and ligand-binding specificity, demonstrating significant yet limited predictive capacity. We have systematically extended this observation to include the role of differential protein interactions in the segregation of protein subfamilies and explored in detail the structural distribution of SDPs at protein interfaces. Our results show the extensive influence of protein interactions in the evolution of protein families and the widespread association of SDPs with protein interfaces. The combined analysis of SDPs in interfaces and ligand-binding sites provides a more complete picture of the organization of protein families, constituting the necessary framework for a large scale analysis of the evolution of protein function.
Torsion Profiling of Proteins Using Magnetic Particles
van Reenen, A.; Gutiérrez-Mejía, F.; van IJzendoorn, L.J.; Prins, M.W.J.
2013-01-01
We report a method to profile the torsional spring properties of proteins as a function of the angle of rotation. The torque is applied by superparamagnetic particles and has been calibrated while taking account of the magnetization dynamics of the particles. We record and compare the torsional profiles of single Protein G-Immunoglobulin G (IgG) and IgG-IgG complexes, sandwiched between a substrate and a superparamagnetic particle, for torques in the range between 0.5 × 103 and 5 × 103 pN·nm. Both molecular systems show torsional stiffening for increasing rotation angle, but the elastic and inelastic torsion stiffnesses are remarkably different. We interpret the results in terms of the structural properties of the molecules. The torsion profiling technique opens new dimensions for research on biomolecular characterization and for research on bio-nanomechanical structure-function relationships. PMID:23473490
Gao, Yujuan; Wang, Sheng; Deng, Minghua; Xu, Jinbo
2018-05-08
Protein dihedral angles provide a detailed description of protein local conformation. Predicted dihedral angles can be used to narrow down the conformational space of the whole polypeptide chain significantly, thus aiding protein tertiary structure prediction. However, direct angle prediction from sequence alone is challenging. In this article, we present a novel method (named RaptorX-Angle) to predict real-valued angles by combining clustering and deep learning. Tested on a subset of PDB25 and the targets in the latest two Critical Assessment of protein Structure Prediction (CASP), our method outperforms the existing state-of-art method SPIDER2 in terms of Pearson Correlation Coefficient (PCC) and Mean Absolute Error (MAE). Our result also shows approximately linear relationship between the real prediction errors and our estimated bounds. That is, the real prediction error can be well approximated by our estimated bounds. Our study provides an alternative and more accurate prediction of dihedral angles, which may facilitate protein structure prediction and functional study.
Theil, Elizabeth C; Turano, Paola; Ghini, Veronica; Allegrozzi, Marco; Bernacchioni, Caterina
2014-06-01
Integrated ferritin protein cage function is the reversible synthesis of protein-caged, solid Fe2O3·H2O minerals from Fe(2+) for metabolic iron concentrates and oxidant protection; biomineral order differs in different ferritin proteins. The conserved 432 geometric symmetry of ferritin protein cages parallels the subunit dimer, trimer, and tetramer interfaces, and coincides with function at several cage axes. Multiple subdomains distributed in the self-assembling ferritin nanocages have functional relationships to cage symmetry such as Fe(2+) transport though ion channels (threefold symmetry), biomineral nucleation/order (fourfold symmetry), and mineral dissolution (threefold symmetry) studied in ferritin variants. On the basis of the effects of natural or synthetic subunit dimer cross-links, cage subunit dimers (twofold symmetry) influence iron oxidation and mineral dissolution. 2Fe(2+)/O2 catalysis in ferritin occurs in single subunits, but with cooperativity (n = 3) that is possibly related to the structure/function of the ion channels, which are constructed from segments of three subunits. Here, we study 2Fe(2+) + O2 protein catalysis (diferric peroxo formation) and dissolution of ferritin Fe2O3·H2O biominerals in variants with altered subunit interfaces for trimers (ion channels), E130I, and external dimer surfaces (E88A) as controls, and altered tetramer subunit interfaces (L165I and H169F). The results extend observations on the functional importance of structure at ferritin protein twofold and threefold cage axes to show function at ferritin fourfold cage axes. Here, conserved amino acids facilitate dissolution of ferritin-protein-caged iron biominerals. Biological and nanotechnological uses of ferritin protein cage fourfold symmetry and solid-state mineral properties remain largely unexplored.
Theil, Elizabeth C.; Turano, Paola; Ghini, Veronica; Allegrozzi, Marco; Bernacchioni, Caterina
2014-01-01
Integrated ferritin protein cage function is the reversible synthesis of protein-caged, solid Fe2O3•H2O minerals from Fe2+, for metabolic iron concentrates and oxidant protection; biomineral order varies in different ferritin proteins. The conserved 4, 3, 2 geometric symmetry of ferritin protein cages, parallels subunit dimer, trimer and tetramer interfaces, and coincides with function at several cage axes. Multiple subdomains distributed in the self- assembling ferritin nanocages have functional relationships to cage symmetry such as Fe2+ transport though ion channels (3-fold symmetry), biomineral nucleation/order (4-fold symmetry) and mineral dissolution (3-fold symmetry) studied in ferritin variants. Cage subunit dimers (2-fold symmetry) influence iron oxidation and mineral dissolution, based on effects of natural or synthetic subunit dimer crosslinks. 2Fe2+/O2 catalysis in ferritin occurs in single subunits, but with cooperativity (n=3) that is possibly related to the structure/function of the ion channels, which are constructed from segments of 3 subunits. Here, we study 2Fe2+ + O2 protein catalysis (diferric peroxo formation) and dissolution of ferritin Fe2O3•H2O biominerals in variants with altered subunit interfaces for trimers (ion channels), E130I, and external dimer surfaces (E88A) as controls, and altered tetramer subunit interfaces (L165I and H169F). The results extend observations on the functional importance of structure at ferritin protein 2-fold and 3-fold cage axes to show function at ferritin 4-fold cage axes. Here, conserved amino acids facilitate dissolution of ferritin protein-caged iron biominerals. Biological and nanotechnological uses of ferritin protein cage 4-fold symmetry and solid state mineral properties remain largely unexplored. PMID:24504941
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification.
Sinclair, Robert M; Ravantti, Janne J; Bamford, Dennis H
2017-04-15
Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids. Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. Copyright © 2017 Sinclair et al.
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification
Sinclair, Robert M.; Ravantti, Janne J.
2017-01-01
ABSTRACT Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids. Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. PMID:28122979
NASA Astrophysics Data System (ADS)
Stănciuc, Nicoleta; Aprodu, Iuliana; Ioniță, Elena; Bahrim, Gabriela; Râpeanu, Gabriela
2015-08-01
Given the importance of peroxidase as an indicator for the preservation of vegetables by heat treatment, the present study is focused on enzyme behavior under different pH and temperature conditions, in terms of process-structure-function relationships. Thus, the process-structure-function relationship of peroxidase was investigated by combining fluorescence spectroscopy, in silico prediction methods and inactivation kinetic studies. The fluorescence spectra indicated that at optimum pH value, the Trp117 residue is not located in the hydrophobic core of the protein. Significant blue- and red-shifts were obtained at different pH values, whereas the heat-treatment did not cause significant changes in Trp and Tyr environment. The ANS and quenching experiments demonstrated a more flexible conformation at lower pH and respectively at higher temperature. On the other hand molecular dynamics simulations at different temperatures highlighted that the secondary structure appeared better preserved against temperature, whereas the tertiary structure around the heme was more affected. Temperature dependent changes in the hydrogen bonding and ion paring involving amino acids from the heme-binding region (His170 and Asp247) might trigger miss-coordination of the heme iron atom by His170 residue and further enzyme activity loss.
Wilson, Corey J
2015-01-01
Proteins are the most functionally diverse macromolecules observed in nature, participating in a broad array of catalytic, biosensing, transport, scaffolding, and regulatory functions. Fittingly, proteins have become one of the most promising nanobiotechnological tools to date, and through the use of recombinant DNA and other laboratory methods we have produced a vast number of biological therapeutics derived from human genes. Our emerging ability to rationally design proteins (e.g., via computational methods) holds the promise of significantly expanding the number and diversity of protein therapies and has opened the gateway to realizing true and uncompromised personalized medicine. In the last decade computational protein design has been transformed from a set of fundamental strategies to stringently test our understanding of the protein structure-function relationship, to practical tools for developing useful biological processes, nano-devices, and novel therapeutics. As protein design strategies improve (i.e., in terms of accuracy and efficiency) clinicians will be able to leverage individual genetic data and biological metrics to develop and deliver personalized protein therapeutics with minimal delay. © 2014 Wiley Periodicals, Inc.
Balsera, Monica; Buey, Ruben M.; Li, Xiao-Dan
2011-01-01
The oxaloacetate decarboxylase primary Na+ pump (OAD) is an essential membrane protein complex that functions in the citrate fermentation pathway of some pathogenic bacteria under anaerobic conditions. OAD contains three different subunits: Oad-α, a biotinylated extrinsic protein that catalyzes the α-ketodecarboxylation of oxaloacetate; Oad-γ, a structural bitopic membrane protein whose cytosolic tail (named as Oad-γ′) binds tightly to Oad-α; and Oad-β, a multispan transmembrane α-helical protein that constitutes the Na+ channel. How OAD is organized structurally at the membrane and what the molecular determinants are that lead to an efficient energy coupling mechanism remain elusive. In the present work, we elucidate the stoichiometry of the native complex as well as the low resolution structure of the peripheral components of OAD (Oad-α and Oad-γ′) by small angle x-ray scattering. Our results point to a quaternary assembly similar to the pyruvate carboxylase complex organization. Herein, we propose a model in which the association in pairs of Oad-α dimers, mediated by Oad-γ, results in the acquisition of a functional oligomeric state at the bacterial membrane. New structural insights for the conformational rearrangements associated with the carboxylbiotin transfer reaction within OAD are provided. PMID:21209096
Structural classification of small, disulfide-rich protein domains.
Cheek, Sara; Krishna, S Sri; Grishin, Nick V
2006-05-26
Disulfide-rich domains are small protein domains whose global folds are stabilized primarily by the formation of disulfide bonds and, to a much lesser extent, by secondary structure and hydrophobic interactions. Disulfide-rich domains perform a wide variety of roles functioning as growth factors, toxins, enzyme inhibitors, hormones, pheromones, allergens, etc. These domains are commonly found both as independent (single-domain) proteins and as domains within larger polypeptides. Here, we present a comprehensive structural classification of approximately 3000 small, disulfide-rich protein domains. We find that these domains can be arranged into 41 fold groups on the basis of structural similarity. Our fold groups, which describe broader structural relationships than existing groupings of these domains, bring together representatives with previously unacknowledged similarities; 18 of the 41 fold groups include domains from several SCOP folds. Within the fold groups, the domains are assembled into families of homologs. We define 98 families of disulfide-rich domains, some of which include newly detected homologs, particularly among knottin-like domains. On the basis of this classification, we have examined cases of convergent and divergent evolution of functions performed by disulfide-rich proteins. Disulfide bonding patterns in these domains are also evaluated. Reducible disulfide bonding patterns are much less frequent, while symmetric disulfide bonding patterns are more common than expected from random considerations. Examples of variations in disulfide bonding patterns found within families and fold groups are discussed.
Le, Xuan T; Rioux, Laurie-Eve; Turgeon, Sylvie L
2017-01-01
Protein and polysaccharide mixed systems have been actively studied for at least 50years as they can be assembled into functional particles or gels. This article reviews the properties of electrostatic gels, a recently discovered particular case of associative protein-polysaccharide mixtures formed through associative electrostatic interaction under appropriate solution conditions (coupled gel). This review highlights the factors influencing gel formation such as protein-polysaccharide ratio, biopolymer structural characteristics, final pH, ionic strength and total solid concentration. For the first time, the functional properties of protein-polysaccharide coupled gels are presented and discussed in relationship to individual protein and polysaccharide hydrogels. One of their outstanding characteristics is their gel water retention. Up to 600g of water per g of biopolymer may be retained in the electrostatic gel network compared to a protein gel (3-9g of water per g of protein). Potential applications of the gels are proposed to enable the food and non-food industries to develop new functional products with desirable attributes or new interesting materials to incorporate bioactive molecules. Copyright © 2016 Elsevier B.V. All rights reserved.
Controllable molecular motors engineered from myosin and RNA
NASA Astrophysics Data System (ADS)
Omabegho, Tosan; Gurel, Pinar S.; Cheng, Clarence Y.; Kim, Laura Y.; Ruijgrok, Paul V.; Das, Rhiju; Alushin, Gregory M.; Bryant, Zev
2018-01-01
Engineering biomolecular motors can provide direct tests of structure-function relationships and customized components for controlling molecular transport in artificial systems1 or in living cells2. Previously, synthetic nucleic acid motors3-5 and modified natural protein motors6-10 have been developed in separate complementary strategies to achieve tunable and controllable motor function. Integrating protein and nucleic-acid components to form engineered nucleoprotein motors may enable additional sophisticated functionalities. However, this potential has only begun to be explored in pioneering work harnessing DNA scaffolds to dictate the spacing, number and composition of tethered protein motors11-15. Here, we describe myosin motors that incorporate RNA lever arms, forming hybrid assemblies in which conformational changes in the protein motor domain are amplified and redirected by nucleic acid structures. The RNA lever arm geometry determines the speed and direction of motor transport and can be dynamically controlled using programmed transitions in the lever arm structure7,9. We have characterized the hybrid motors using in vitro motility assays, single-molecule tracking, cryo-electron microscopy and structural probing16. Our designs include nucleoprotein motors that reversibly change direction in response to oligonucleotides that drive strand-displacement17 reactions. In multimeric assemblies, the controllable motors walk processively along actin filaments at speeds of 10-20 nm s-1. Finally, to illustrate the potential for multiplexed addressable control, we demonstrate sequence-specific responses of RNA variants to oligonucleotide signals.
Rehan, Shahid; Jaakola, Veli-Pekka
2015-10-01
Human equilibrative nucleoside transporter-1 (hENT1) is the major plasma membrane transporter involved in transportation of natural nucleosides as well as nucleoside analog drugs, used in anti-cancer and anti-viral therapies. Despite extensive biochemical and pharmacological studies, little is known about the structure-function relationship of this protein. The major obstacles to purification include a low endogenous expression level, the lack of an efficient expression and purification protocol, and the hydrophobic nature of the protein. Here, we report protein expression, purification and functional characterization of hENT1 from Sf9 insect cells. hENT1 expressed by Sf9 cells is functionally active as demonstrated by saturation binding with a Kd of 1.2±0.2nM and Bmax of 110±5pmol/mg for [(3)H]nitrobenzylmercaptopurine ribonucleoside ([(3)H]NBMPR). We also demonstrate purification of hENT1 using FLAG antibody affinity resin in lauryl maltose neopentyl glycol detergent with a Kd of 4.3±0.7nM. The yield of hENT1 from Sf9 cells was ∼0.5mg active transporter per liter of culture. The purified protein is functionally active, stable, homogenous and appropriate for further biophysical and structural studies. Copyright © 2015 Elsevier Inc. All rights reserved.
Atkinson, Joshua T; Campbell, Ian; Bennett, George N; Silberg, Jonathan J
2016-12-27
The ferredoxin (Fd) protein family is a structurally diverse group of iron-sulfur proteins that function as electron carriers, linking biochemical pathways important for energy transduction, nutrient assimilation, and primary metabolism. While considerable biochemical information about individual Fd protein electron carriers and their reactions has been acquired, we cannot yet anticipate the proportion of electrons shuttled between different Fd-partner proteins within cells using biochemical parameters that govern electron flow, such as holo-Fd concentration, midpoint potential (driving force), molecular interactions (affinity and kinetics), conformational changes (allostery), and off-pathway electron leakage (chemical oxidation). Herein, we describe functional and structural gaps in our Fd knowledge within the context of a sequence similarity network and phylogenetic tree, and we propose a strategy for improving our understanding of Fd sequence-function relationships. We suggest comparing the functions of divergent Fds within cells whose growth, or other measurable output, requires electron transfer between defined electron donor and acceptor proteins. By comparing Fd-mediated electron transfer with biochemical parameters that govern electron flow, we posit that models that anticipate energy flow across Fd interactomes can be built. This approach is expected to transform our ability to anticipate Fd control over electron flow in cellular settings, an obstacle to the construction of synthetic electron transfer pathways and rational optimization of existing energy-conserving pathways.
Dygut, Jacek; Kalinowska, Barbara; Banach, Mateusz; Piwowar, Monika; Konieczny, Leszek; Roterman, Irena
2016-10-18
The presented analysis concerns the inter-domain and inter-protein interface in protein complexes. We propose extending the traditional understanding of the protein domain as a function of local compactness with an additional criterion which refers to the presence of a well-defined hydrophobic core. Interface areas in selected homodimers vary with respect to their contribution to share as well as individual (domain-specific) hydrophobic cores. The basic definition of a protein domain, i.e., a structural unit characterized by tighter packing than its immediate environment, is extended in order to acknowledge the role of a structured hydrophobic core, which includes the interface area. The hydrophobic properties of interfaces vary depending on the status of interacting domains-In this context we can distinguish: (1) Shared hydrophobic cores (spanning the whole dimer); (2) Individual hydrophobic cores present in each monomer irrespective of whether the dimer contains a shared core. Analysis of interfaces in dystrophin and utrophin indicates the presence of an additional quasi-domain with a prominent hydrophobic core, consisting of fragments contributed by both monomers. In addition, we have also attempted to determine the relationship between the type of interface (as categorized above) and the biological function of each complex. This analysis is entirely based on the fuzzy oil drop model.
Geisbrecht, Brian V; Hamaoka, Brent Y; Perman, Benjamin; Zemla, Adam; Leahy, Daniel J
2005-04-29
The Eap (extracellular adherence protein) of Staphylococcus aureus functions as a secreted virulence factor by mediating interactions between the bacterial cell surface and several extracellular host proteins. Eap proteins from different Staphylococcal strains consist of four to six tandem repeats of a structurally uncharacterized domain (EAP domain). We have determined the three-dimensional structures of three different EAP domains to 1.8, 2.2, and 1.35 A resolution, respectively. These structures reveal a core fold that is comprised of an alpha-helix lying diagonally across a five-stranded, mixed beta-sheet. Comparison of EAP domains with known structures reveals an unexpected homology with the C-terminal domain of bacterial superantigens. Examination of the structure of the superantigen SEC2 bound to the beta-chain of a T-cell receptor suggests a possible ligand-binding site within the EAP domain (Fields, B. A., Malchiodi, E. L., Li, H., Ysern, X., Stauffacher, C. V., Schlievert, P. M., Karjalainen, K., and Mariuzza, R. (1996) Nature 384, 188-192). These results provide the first structural characterization of EAP domains, relate EAP domains to a large class of bacterial toxins, and will guide the design of future experiments to analyze EAP domain structure/function relationships.
Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights
Kumar, Amitha Sampath; Sowpati, Divya Tej; Mishra, Rakesh K.
2016-01-01
Microsatellites or simple sequence repeats (SSR) are abundant, highly diverse stretches of short DNA repeats present in all genomes. Tandem mono/tri/hexanucleotide repeats in the coding regions contribute to single amino acids repeats (SAARs) in the proteome. While SSRs in the coding region always result in amino acid repeats, a majority of SAARs arise due to a combination of various codons representing the same amino acid and not as a consequence of SSR events. Certain amino acids are abundant in repeat regions indicating a positive selection pressure behind the accumulation of SAARs. By analysing 22 proteomes including the human proteome, we explored the functional and structural relationship of amino acid repeats in an evolutionary context. Only ~15% of repeats are present in any known functional domain, while ~74% of repeats are present in the disordered regions, suggesting that SAARs add to the functionality of proteins by providing flexibility, stability and act as linker elements between domains. Comparison of SAAR containing proteins across species reveals that while shorter repeats are conserved among orthologs, proteins with longer repeats, >15 amino acids, are unique to the respective organism. Lysine repeats are well conserved among orthologs with respect to their length and number of occurrences in a protein. Other amino acids such as glutamic acid, proline, serine and alanine repeats are generally conserved among the orthologs with varying repeat lengths. These findings suggest that SAARs have accumulated in the proteome under positive selection pressure and that they provide flexibility for optimal folding of functional/structural domains of proteins. The insights gained from our observations can help in effective designing and engineering of proteins with novel features. PMID:27893794
The recombinant expression and activity detection of MAF-1 fusion protein.
Fu, Ping; Wu, Jianwei; Gao, Song; Guo, Guo; Zhang, Yong; Liu, Jian
2015-10-01
This study establishes the recombinant expression system of MAF-1 (Musca domestica antifungal peptide-1) and demonstrates the antifungal activity of the expression product and shows the relationship between biological activity and structure. The gene segments on mature peptide part of MAF-1 were cloned, based on the primers designed according to the cDNA sequence of MAF-1. We constructed the recombinant prokaryotic expression plasmid using prokaryotic expression vector (pET-28a(+)) and converted it to the competent cell of BL21(DE3) to gain recombinant MAF-1 fusion protein with His tag sequence through purifying affinity chromatographic column of Ni-NTA. To conduct the Western Blotting test, recombinant MAF-1 fusion protein was used to produce the polyclonal antibody of rat. The antifungal activity of the expression product was detected using Candida albicans (ATCC10231) as the indicator. The MAF-1 recombinant fusion protein was purified to exhibit obvious antifungal activity, which lays the foundation for the further study of MAF-1 biological activity, the relationship between structure and function, as well as control of gene expression.
HDAPD: a web tool for searching the disease-associated protein structures
2010-01-01
Background The protein structures of the disease-associated proteins are important for proceeding with the structure-based drug design to against a particular disease. Up until now, proteins structures are usually searched through a PDB id or some sequence information. However, in the HDAPD database presented here the protein structure of a disease-associated protein can be directly searched through the associated disease name keyed in. Description The search in HDAPD can be easily initiated by keying some key words of a disease, protein name, protein type, or PDB id. The protein sequence can be presented in FASTA format and directly copied for a BLAST search. HDAPD is also interfaced with Jmol so that users can observe and operate a protein structure with Jmol. The gene ontological data such as cellular components, molecular functions, and biological processes are provided once a hyperlink to Gene Ontology (GO) is clicked. Further, HDAPD provides a link to the KEGG map such that where the protein is placed and its relationship with other proteins in a metabolic pathway can be found from the map. The latest literatures namely titles, journals, authors, and abstracts searched from PubMed for the protein are also presented as a length controllable list. Conclusions Since the HDAPD data content can be routinely updated through a PHP-MySQL web page built, the new database presented is useful for searching the structures for some disease-associated proteins that may play important roles in the disease developing process for performing the structure-based drug design to against the diseases. PMID:20158919
Installing hydrolytic activity into a completely de novo protein framework
NASA Astrophysics Data System (ADS)
Burton, Antony J.; Thomson, Andrew R.; Dawson, William M.; Brady, R. Leo; Woolfson, Derek N.
2016-09-01
The design of enzyme-like catalysts tests our understanding of sequence-to-structure/function relationships in proteins. Here we install hydrolytic activity predictably into a completely de novo and thermostable α-helical barrel, which comprises seven helices arranged around an accessible channel. We show that the lumen of the barrel accepts 21 mutations to functional polar residues. The resulting variant, which has cysteine-histidine-glutamic acid triads on each helix, hydrolyses p-nitrophenyl acetate with catalytic efficiencies that match the most-efficient redesigned hydrolases based on natural protein scaffolds. This is the first report of a functional catalytic triad engineered into a de novo protein framework. The flexibility of our system also allows the facile incorporation of unnatural side chains to improve activity and probe the catalytic mechanism. Such a predictable and robust construction of truly de novo biocatalysts holds promise for applications in chemical and biochemical synthesis.
Blocker, Kory M; Britton, Zachary T; Naranjo, Andrea N; McNeely, Patrick M; Young, Carissa L; Robinson, Anne S
2015-01-01
G protein-coupled receptors (GPCRs) are membrane proteins that mediate signaling across the cellular membrane and facilitate cellular responses to external stimuli. Due to the critical role that GPCRs play in signal transduction, therapeutics have been developed to influence GPCR function without an extensive understanding of the receptors themselves. Closing this knowledge gap is of paramount importance to improving therapeutic efficacy and specificity, where efforts to achieve this end have focused chiefly on improving our knowledge of the structure-function relationship. The purpose of this chapter is to review methods for the heterologous expression of GPCRs in Saccharomyces cerevisiae, including whole-cell assays that enable quantitation of expression, localization, and function in vivo. In addition, we describe methods for the micellular solubilization of the human adenosine A2a receptor and for reconstitution of the receptor in liposomes that have enabled its biophysical characterization. © 2015 Elsevier Inc. All rights reserved.
Abriata, Luciano A; Bovigny, Christophe; Dal Peraro, Matteo
2016-06-17
Protein variability can now be studied by measuring high-resolution tolerance-to-substitution maps and fitness landscapes in saturated mutational libraries. But these rich and expensive datasets are typically interpreted coarsely, restricting detailed analyses to positions of extremely high or low variability or dubbed important beforehand based on existing knowledge about active sites, interaction surfaces, (de)stabilizing mutations, etc. Our new webserver PsychoProt (freely available without registration at http://psychoprot.epfl.ch or at http://lucianoabriata.altervista.org/psychoprot/index.html ) helps to detect, quantify, and sequence/structure map the biophysical and biochemical traits that shape amino acid preferences throughout a protein as determined by deep-sequencing of saturated mutational libraries or from large alignments of naturally occurring variants. We exemplify how PsychoProt helps to (i) unveil protein structure-function relationships from experiments and from alignments that are consistent with structures according to coevolution analysis, (ii) recall global information about structural and functional features and identify hitherto unknown constraints to variation in alignments, and (iii) point at different sources of variation among related experimental datasets or between experimental and alignment-based data. Remarkably, metabolic costs of the amino acids pose strong constraints to variability at protein surfaces in nature but not in the laboratory. This and other differences call for caution when extrapolating results from in vitro experiments to natural scenarios in, for example, studies of protein evolution. We show through examples how PsychoProt can be a useful tool for the broad communities of structural biology and molecular evolution, particularly for studies about protein modeling, evolution and design.
Arpino, James A J; Reddington, Samuel C; Halliwell, Lisa M; Rizkallah, Pierre J; Jones, D Dafydd
2014-06-10
Altering a protein's backbone through amino acid deletion is a common evolutionary mutational mechanism, but is generally ignored during protein engineering primarily because its effect on the folding-structure-function relationship is difficult to predict. Using directed evolution, enhanced green fluorescent protein (EGFP) was observed to tolerate residue deletion across the breadth of the protein, particularly within short and long loops, helical elements, and at the termini of strands. A variant with G4 removed from a helix (EGFP(G4Δ)) conferred significantly higher cellular fluorescence. Folding analysis revealed that EGFP(G4Δ) retained more structure upon unfolding and refolded with almost 100% efficiency but at the expense of thermodynamic stability. The EGFP(G4Δ) structure revealed that G4 deletion caused a beneficial helical registry shift resulting in a new polar interaction network, which potentially stabilizes a cis proline peptide bond and links secondary structure elements. Thus, deletion mutations and registry shifts can enhance proteins through structural rearrangements not possible by substitution mutations alone. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Humpula, James F.; Ostrom, Peggy H.; Gandhi, Hasand; Strahler, John R.; Walker, Angela K.; Stafford, Thomas W.; Smith, James J.; Voorhies, Michael R.; George Corner, R.; Andrews, Phillip C.
2007-12-01
Ancient DNA sequences offer an extraordinary opportunity to unravel the evolutionary history of ancient organisms. Protein sequences offer another reservoir of genetic information that has recently become tractable through the application of mass spectrometric techniques. The extent to which ancient protein sequences resolve phylogenetic relationships, however, has not been explored. We determined the osteocalcin amino acid sequence from the bone of an extinct Camelid (21 ka, Camelops hesternus) excavated from Isleta Cave, New Mexico and three bones of extant camelids: bactrian camel ( Camelus bactrianus); dromedary camel ( Camelus dromedarius) and guanaco ( Llama guanacoe) for a diagenetic and phylogenetic assessment. There was no difference in sequence among the four taxa. Structural attributes observed in both modern and ancient osteocalcin include a post-translation modification, Hyp 9, deamidation of Gln 35 and Gln 39, and oxidation of Met 36. Carbamylation of the N-terminus in ancient osteocalcin may result in blockage and explain previous difficulties in sequencing ancient proteins via Edman degradation. A phylogenetic analysis using osteocalcin sequences of 25 vertebrate taxa was conducted to explore osteocalcin protein evolution and the utility of osteocalcin sequences for delineating phylogenetic relationships. The maximum likelihood tree closely reflected generally recognized taxonomic relationships. For example, maximum likelihood analysis recovered rodents, birds and, within hominins, the Homo-Pan-Gorilla trichotomy. Within Artiodactyla, character state analysis showed that a substitution of Pro 4 for His 4 defines the Capra-Ovis clade within Artiodactyla. Homoplasy in our analysis indicated that osteocalcin evolution is not a perfect indicator of species evolution. Limited sequence availability prevented assigning functional significance to sequence changes. Our preliminary analysis of osteocalcin evolution represents an initial step towards a complete character analysis aimed at determining the evolutionary history of this functionally significant protein. We emphasize that ancient protein sequencing and phylogenetic analyses using amino acid sequences must pay close attention to post-translational modifications, amino acid substitutions due to diagenetic alteration and the impacts of isobaric amino acids on mass shifts and sequence alignments.
Arighi, Cecilia; Shamovsky, Veronica; Masci, Anna Maria; Ruttenberg, Alan; Smith, Barry; Natale, Darren A; Wu, Cathy; D'Eustachio, Peter
2015-01-01
The Protein Ontology (PRO) provides terms for and supports annotation of species-specific protein complexes in an ontology framework that relates them both to their components and to species-independent families of complexes. Comprehensive curation of experimentally known forms and annotations thereof is expected to expose discrepancies, differences, and gaps in our knowledge. We have annotated the early events of innate immune signaling mediated by Toll-Like Receptor 3 and 4 complexes in human, mouse, and chicken. The resulting ontology and annotation data set has allowed us to identify species-specific gaps in experimental data and possible functional differences between species, and to employ inferred structural and functional relationships to suggest plausible resolutions of these discrepancies and gaps.
Functional assignment to JEV proteins using SVM.
Sahoo, Ganesh Chandra; Dikhit, Manas Ranjan; Das, Pradeep
2008-01-01
Identification of different protein functions facilitates a mechanistic understanding of Japanese encephalitis virus (JEV) infection and opens novel means for drug development. Support vector machines (SVM), useful for predicting the functional class of distantly related proteins, is employed to ascribe a possible functional class to Japanese encephalitis virus protein. Our study from SVMProt and available JE virus sequences suggests that structural and nonstructural proteins of JEV genome possibly belong to diverse protein functions, are expected to occur in the life cycle of JE virus. Protein functions common to both structural and non-structural proteins are iron-binding, metal-binding, lipid-binding, copper-binding, transmembrane, outer membrane, channels/Pores - Pore-forming toxins (proteins and peptides) group of proteins. Non-structural proteins perform functions like actin binding, zinc-binding, calcium-binding, hydrolases, Carbon-Oxygen Lyases, P-type ATPase, proteins belonging to major facilitator family (MFS), secreting main terminal branch (MTB) family, phosphotransfer-driven group translocators and ATP-binding cassette (ABC) family group of proteins. Whereas structural proteins besides belonging to same structural group of proteins (capsid, structural, envelope), they also perform functions like nuclear receptor, antibiotic resistance, RNA-binding, DNA-binding, magnesium-binding, isomerase (intra-molecular), oxidoreductase and participate in type II (general) secretory pathway (IISP).
Functional assignment to JEV proteins using SVM
Sahoo, Ganesh Chandra; Dikhit, Manas Ranjan; Das, Pradeep
2008-01-01
Identification of different protein functions facilitates a mechanistic understanding of Japanese encephalitis virus (JEV) infection and opens novel means for drug development. Support vector machines (SVM), useful for predicting the functional class of distantly related proteins, is employed to ascribe a possible functional class to Japanese encephalitis virus protein. Our study from SVMProt and available JE virus sequences suggests that structural and nonstructural proteins of JEV genome possibly belong to diverse protein functions, are expected to occur in the life cycle of JE virus. Protein functions common to both structural and non-structural proteins are iron-binding, metal-binding, lipid-binding, copper-binding, transmembrane, outer membrane, channels/Pores - Pore-forming toxins (proteins and peptides) group of proteins. Non-structural proteins perform functions like actin binding, zinc-binding, calcium-binding, hydrolases, Carbon-Oxygen Lyases, P-type ATPase, proteins belonging to major facilitator family (MFS), secreting main terminal branch (MTB) family, phosphotransfer-driven group translocators and ATP-binding cassette (ABC) family group of proteins. Whereas structural proteins besides belonging to same structural group of proteins (capsid, structural, envelope), they also perform functions like nuclear receptor, antibiotic resistance, RNA-binding, DNA-binding, magnesium-binding, isomerase (intra-molecular), oxidoreductase and participate in type II (general) secretory pathway (IISP). PMID:19052658
DOE Office of Scientific and Technical Information (OSTI.GOV)
Baig, M.; Brown, A.; Eswaramoorthy, S.
Klebsiella pneumoniae, a gram-negative enteric bacterium, is found in nosocomial infections which are acquired during hospital stays for about 10% of hospital patients in the United States. The crystal structure of a putative oxidoreductase from K. pneumoniae has been determined. The structural information of this K. pneumoniae protein was used to understand its function. Crystals of the putative oxidoreductase enzyme were obtained by the sitting drop vapor diffusion method using Polyethylene glycol (PEG) 3350, Bis-Tris buffer, pH 5.5 as precipitant. These crystals were used to collect X-ray data at beam line X12C of the National Synchrotron Light Source (NSLS) atmore » Brookhaven National Laboratory (BNL). The crystal structure was determined using the SHELX program and refi ned with CNS 1.1. This protein, which is involved in the catalysis of an oxidation-reduction (redox) reaction, has an alpha/beta structure. It utilizes nicotinamide adenine dinucleotide phosphate (NADP) or nicotine adenine dinucleotide (NAD) to perform its function. This structure could be used to determine the active and co-factor binding sites of the protein, information that could help pharmaceutical companies in drug design and in determining the protein’s relationship to disease treatment such as that for pneumonia and other related pathologies.« less
SANSparallel: interactive homology search against Uniprot
Somervuo, Panu; Holm, Liisa
2015-01-01
Proteins evolve by mutations and natural selection. The network of sequence similarities is a rich source for mining homologous relationships that inform on protein structure and function. There are many servers available to browse the network of homology relationships but one has to wait up to a minute for results. The SANSparallel webserver provides protein sequence database searches with immediate response and professional alignment visualization by third-party software. The output is a list, pairwise alignment or stacked alignment of sequence-similar proteins from Uniprot, UniRef90/50, Swissprot or Protein Data Bank. The stacked alignments are viewed in Jalview or as sequence logos. The database search uses the suffix array neighborhood search (SANS) method, which has been re-implemented as a client-server, improved and parallelized. The method is extremely fast and as sensitive as BLAST above 50% sequence identity. Benchmarks show that the method is highly competitive compared to previously published fast database search programs: UBLAST, DIAMOND, LAST, LAMBDA, RAPSEARCH2 and BLAT. The web server can be accessed interactively or programmatically at http://ekhidna2.biocenter.helsinki.fi/cgi-bin/sans/sans.cgi. It can be used to make protein functional annotation pipelines more efficient, and it is useful in interactive exploration of the detailed evidence supporting the annotation of particular proteins of interest. PMID:25855811
NASA Astrophysics Data System (ADS)
Paulino, M.; Esteves, A.; Vega, M.; Tabares, G.; Ehrlich, R.; Tapia, O.
1998-07-01
EgDf1 is a developmentally regulated protein from the parasite Echinococcus granulosus related to a family of hydrophobic ligand binding proteins. This protein could play a crucial role during the parasite life cycle development since this organism is unable to synthetize most of their own lipids de novo. Furthermore, it has been shown that two related protein from other parasitic platyhelminths (Fh15 from Fasciola hepatica and Sm14 from Schistosoma mansoni) are able to confer protective inmunity against experimental infection in animal models. A three-dimensional structure would help establishing structure/function relationships on a knowledge based manner. 3D structures for EgDf1 protein were modelled by using myelin P2 (mP2) and intestine fatty acid binding protein (I-FABP) as templates. Molecular dynamics techniques were used to validate the models. Template mP2 yielded the best 3D structure for EgDf1. Palmitic and oleic acids were docked inside EgDf1. The present theoretical results suggest definite location in the secondary structure of the epitopic regions, consensus phosphorylation motifs and oleic acid as a good ligand candidate to EgDf1. This protein might well be involved in the process of supplying hydrophobic metabolites for membrane biosynthesis and for signaling pathways.
S-layers: principles and applications
Sleytr, Uwe B; Schuster, Bernhard; Egelseer, Eva-Maria; Pum, Dietmar
2014-01-01
Monomolecular arrays of protein or glycoprotein subunits forming surface layers (S-layers) are one of the most commonly observed prokaryotic cell envelope components. S-layers are generally the most abundantly expressed proteins, have been observed in species of nearly every taxonomical group of walled bacteria, and represent an almost universal feature of archaeal envelopes. The isoporous lattices completely covering the cell surface provide organisms with various selection advantages including functioning as protective coats, molecular sieves and ion traps, as structures involved in surface recognition and cell adhesion, and as antifouling layers. S-layers are also identified to contribute to virulence when present as a structural component of pathogens. In Archaea, most of which possess S-layers as exclusive wall component, they are involved in determining cell shape and cell division. Studies on structure, chemistry, genetics, assembly, function, and evolutionary relationship of S-layers revealed considerable application potential in (nano)biotechnology, biomimetics, biomedicine, and synthetic biology. PMID:24483139
Molecular targeting of growth factor receptor-bound 2 (Grb2) as an anti-cancer strategy.
Dharmawardana, Pathirage G; Peruzzi, Benedetta; Giubellino, Alessio; Burke, Terrence R; Bottaro, Donald P
2006-01-01
Growth factor receptor-bound 2 (Grb2) is a ubiquitously expressed adapter protein that provides a critical link between cell surface growth factor receptors and the Ras signaling pathway. As such, it has been implicated in the oncogenesis of several important human malignancies. In addition to this function, research over the last decade has revealed other fundamental roles for Grb2 in cell motility and angiogenesis--processes that also contribute to tumor growth, invasiveness and metastasis. This functional profile makes Grb2 a high priority target for anti-cancer drug development. Knowledge of Grb2 protein structure, its component Src homology domains and their respective structure-function relationships has facilitated the rapid development of sophisticated drug candidates that can penetrate cells, bind Grb2 with high affinity and potently antagonize Grb2 signaling. These novel compounds offer considerable promise in our growing arsenal of rationally designed anti-cancer therapeutics.
Toshima, Kazunobu
2013-05-01
Proteins and carbohydrates play crucial roles in a wide range of biological processes, including serious diseases. The development of novel and innovative methods for selective control of specific proteins and carbohydrates functions has attracted much attention in the field of chemical biology. In this account article, the development of novel chemical tools, which can degrade target proteins and carbohydrates by irradiation with a specific wavelength of light under mild conditions without any additives, is introduced. This novel class of photochemical agents promise bright prospects for finding not only molecular-targeted bioprobes for understanding of the structure-activity relationships of proteins and carbohydrates but also novel therapeutic drugs targeting proteins and carbohydrates.
Uniform structure of eukaryotic plasma membrane: lateral domains in plants.
Malínská, Kateŕina; Zažímalová, Eva
2011-03-01
Current models of the plasma membrane (PM) organization focus on the lateral heterogeneity of the membrane and its relation to the cell function. Increasing evidence in mammals and yeast supports the direct relationship between PM lateral microdomains and specific cell processes and functions (nutrient transport, signaling, protein and lipid sorting, endocytosis, pathogen entry etc.). However, for the present the functional significance of an enrichment of specific proteins and possibly lipids in plant PM domains as well as the underlying molecular mechanism driving the lateral PM segregation remain unaddressed. Here we summarize recent findings on the plant PM organization and its role in signaling pathways, with the special emphasis on auxin transport.
Expression, purification and crystallization of a plant polyketide cyclase from Cannabis sativa.
Yang, Xinmei; Matsui, Takashi; Mori, Takahiro; Taura, Futoshi; Noguchi, Hiroshi; Abe, Ikuro; Morita, Hiroyuki
2015-12-01
Plant polyketides are a structurally diverse family of natural products. In the biosynthesis of plant polyketides, the construction of the carbocyclic scaffold is a key step in diversifying the polyketide structure. Olivetolic acid cyclase (OAC) from Cannabis sativa L. is the only known plant polyketide cyclase that catalyzes the C2-C7 intramolecular aldol cyclization of linear pentyl tetra-β-ketide-CoA to generate olivetolic acid in the biosynthesis of cannabinoids. The enzyme is also thought to belong to the dimeric α+β barrel (DABB) protein family. However, because of a lack of functional analysis of other plant DABB proteins and low sequence identity with the functionally distinct bacterial DABB proteins, the catalytic mechanism of OAC has remained unclear. To clarify the intimate catalytic mechanism of OAC, the enzyme was overexpressed in Escherichia coli and crystallized using the vapour-diffusion method. The crystals diffracted X-rays to 1.40 Å resolution and belonged to space group P3121 or P3221, with unit-cell parameters a = b = 47.3, c = 176.0 Å. Further crystallographic analysis will provide valuable insights into the structure-function relationship and catalytic mechanism of OAC.
Epistasis in protein evolution
Starr, Tyler N.
2016-01-01
Abstract The structure, function, and evolution of proteins depend on physical and genetic interactions among amino acids. Recent studies have used new strategies to explore the prevalence, biochemical mechanisms, and evolutionary implications of these interactions—called epistasis—within proteins. Here we describe an emerging picture of pervasive epistasis in which the physical and biological effects of mutations change over the course of evolution in a lineage‐specific fashion. Epistasis can restrict the trajectories available to an evolving protein or open new paths to sequences and functions that would otherwise have been inaccessible. We describe two broad classes of epistatic interactions, which arise from different physical mechanisms and have different effects on evolutionary processes. Specific epistasis—in which one mutation influences the phenotypic effect of few other mutations—is caused by direct and indirect physical interactions between mutations, which nonadditively change the protein's physical properties, such as conformation, stability, or affinity for ligands. In contrast, nonspecific epistasis describes mutations that modify the effect of many others; these typically behave additively with respect to the physical properties of a protein but exhibit epistasis because of a nonlinear relationship between the physical properties and their biological effects, such as function or fitness. Both types of interaction are rampant, but specific epistasis has stronger effects on the rate and outcomes of evolution, because it imposes stricter constraints and modulates evolutionary potential more dramatically; it therefore makes evolution more contingent on low‐probability historical events and leaves stronger marks on the sequences, structures, and functions of protein families. PMID:26833806
Sivakolundu, Sivashankar G; Mabrouk, Patricia Ann
2003-05-01
The complete solution structure of ferrocytochrome c in 30% acetonitrile/70% water has been determined using high-field 1D and 2D (1)H NMR methods and deposited in the Protein Data Bank with codes 1LC1 and 1LC2. This is the first time a complete solution protein structure has been determined for a protein in nonaqueous media. Ferrocyt c retains a native protein secondary structure (five alpha-helices and two omega loops) in 30% acetonitrile. H18 and M80 residues are the axial heme ligands, as in aqueous solution. Residues believed to be axial heme ligands in the alkaline-like conformers of ferricyt c, specifically H33 and K72, are positioned close to the heme iron. The orientations of both heme propionates are markedly different in 30% acetonitrile/70% water. Comparative structural analysis of reduced cyt c in 30% acetonitrile/70% water solution with cyt c in different environments has given new insight into the cyt c folding mechanism, the electron transfer pathway, and cell apoptosis.
Modeling Protein Excited-state Structures from "Over-length" Chemical Cross-links.
Ding, Yue-He; Gong, Zhou; Dong, Xu; Liu, Kan; Liu, Zhu; Liu, Chao; He, Si-Min; Dong, Meng-Qiu; Tang, Chun
2017-01-27
Chemical cross-linking coupled with mass spectroscopy (CXMS) provides proximity information for the cross-linked residues and is used increasingly for modeling protein structures. However, experimentally identified cross-links are sometimes incompatible with the known structure of a protein, as the distance calculated between the cross-linked residues far exceeds the maximum length of the cross-linker. The discrepancies may persist even after eliminating potentially false cross-links and excluding intermolecular ones. Thus the "over-length" cross-links may arise from alternative excited-state conformation of the protein. Here we present a method and associated software DynaXL for visualizing the ensemble structures of multidomain proteins based on intramolecular cross-links identified by mass spectrometry with high confidence. Representing the cross-linkers and cross-linking reactions explicitly, we show that the protein excited-state structure can be modeled with as few as two over-length cross-links. We demonstrate the generality of our method with three systems: calmodulin, enzyme I, and glutamine-binding protein, and we show that these proteins alternate between different conformations for interacting with other proteins and ligands. Taken together, the over-length chemical cross-links contain valuable information about protein dynamics, and our findings here illustrate the relationship between dynamic domain movement and protein function. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Redesign of LAOBP to bind novel l-amino acid ligands.
Banda-Vázquez, Jesús; Shanmugaratnam, Sooruban; Rodríguez-Sotres, Rogelio; Torres-Larios, Alfredo; Höcker, Birte; Sosa-Peinado, Alejandro
2018-05-01
Computational protein design is still a challenge for advancing structure-function relationships. While recent advances in this field are promising, more information for genuine predictions is needed. Here, we discuss different approaches applied to install novel glutamine (Gln) binding into the Lysine/Arginine/Ornithine binding protein (LAOBP) from Salmonella typhimurium. We studied the ligand binding behavior of two mutants: a binding pocket grafting design based on a structural superposition of LAOBP to the Gln binding protein QBP from Escherichia coli and a design based on statistical coupled positions. The latter showed the ability to bind Gln even though the protein was not very stable. Comparison of both approaches highlighted a nonconservative shared point mutation between LAOBP_graft and LAOBP_sca. This context dependent L117K mutation in LAOBP turned out to be sufficient for introducing Gln binding, as confirmed by different experimental techniques. Moreover, the crystal structure of LAOBP_L117K in complex with its ligand is reported. © 2018 The Protein Society.
Function and structure in glycine receptors and some of their relatives.
Colquhoun, David; Sivilotti, Lucia G
2004-06-01
In the field of ligand-gated ion channels, recent developments, both in the knowledge of structure and in the measurement of function at the single-channel level, have allowed a sensible start to be made on understanding the relationship between structure and function in these proteins. In this review, the cases of glycine, nicotinic ACh and glutamate receptors are compared and contrasted, and problems such as how binding of agonist causes the channel to open, and why partial agonists are partial, are considered. Some observations, both structural and functional, suggest that more attention needs to be paid to conformational changes that occur before the channel opens. Such changes might account for the interaction found between subunits of the glycine receptor while it is still shut and, perhaps, the agonist-dependent structural changes seen in AMPA receptors. They might also complicate our understanding of the binding-gating problem.
Bouzat, Juan L; Hoostal, Matthew J
2013-05-01
Microorganisms have adapted intricate signal transduction mechanisms to coordinate tolerance to toxic levels of metals, including two-component regulatory systems (TCRS). In particular, both cop and czc operons are regulated by TCRS; the cop operon plays a key role in bacterial tolerance to copper, whereas the czc operon is involved in the efflux of cadmium, zinc, and cobalt from the cell. Although the molecular physiology of heavy metal tolerance genes has been extensively studied, their evolutionary relationships are not well-understood. Phylogenetic relationships among heavy-metal efflux proteins and their corresponding two-component regulatory proteins revealed orthologous and paralogous relationships from species divergences and ancient gene duplications. The presence of heavy metal tolerance genes on bacterial plasmids suggests these genes may be prone to spread through horizontal gene transfer. Phylogenetic inferences revealed nine potential examples of lateral gene transfer associated with metal efflux proteins and two examples for regulatory proteins. Notably, four of the examples suggest lateral transfer across major evolutionary domains. In most cases, differences in GC content in metal tolerance genes and their corresponding host genomes confirmed lateral gene transfer events. Three-dimensional protein structures predicted for the response regulators encoded by cop and czc operons showed a high degree of structural similarity with other known proteins involved in TCRS signal transduction, which suggests common evolutionary origins of functional phenotypes and similar mechanisms of action for these response regulators.
Efflux proteins at the blood-brain barrier: review and bioinformatics analysis.
Saidijam, Massoud; Karimi Dermani, Fatemeh; Sohrabi, Sareh; Patching, Simon G
2018-05-01
1. Efflux proteins at the blood-brain barrier provide a mechanism for export of waste products of normal metabolism from the brain and help to maintain brain homeostasis. They also prevent entry into the brain of a wide range of potentially harmful compounds such as drugs and xenobiotics. 2. Conversely, efflux proteins also hinder delivery of therapeutic drugs to the brain and central nervous system used to treat brain tumours and neurological disorders. For bypassing efflux proteins, a comprehensive understanding of their structures, functions and molecular mechanisms is necessary, along with new strategies and technologies for delivery of drugs across the blood-brain barrier. 3. We review efflux proteins at the blood-brain barrier, classified as either ATP-binding cassette (ABC) transporters (P-gp, BCRP, MRPs) or solute carrier (SLC) transporters (OATP1A2, OATP1A4, OATP1C1, OATP2B1, OAT3, EAATs, PMAT/hENT4 and MATE1). 4. This includes information about substrate and inhibitor specificity, structural organisation and mechanism, membrane localisation, regulation of expression and activity, effects of diseases and conditions and the principal technique used for in vivo analysis of efflux protein activity: positron emission tomography (PET). 5. We also performed analyses of evolutionary relationships, membrane topologies and amino acid compositions of the proteins, and linked these to structure and function.
Silk Materials Functionalized via Genetic Engineering for Biomedical Applications.
Deptuch, Tomasz; Dams-Kozlowska, Hanna
2017-12-12
The great mechanical properties, biocompatibility and biodegradability of silk-based materials make them applicable to the biomedical field. Genetic engineering enables the construction of synthetic equivalents of natural silks. Knowledge about the relationship between the structure and function of silk proteins enables the design of bioengineered silks that can serve as the foundation of new biomaterials. Furthermore, in order to better address the needs of modern biomedicine, genetic engineering can be used to obtain silk-based materials with new functionalities. Sequences encoding new peptides or domains can be added to the sequences encoding the silk proteins. The expression of one cDNA fragment indicates that each silk molecule is related to a functional fragment. This review summarizes the proposed genetic functionalization of silk-based materials that can be potentially useful for biomedical applications.
Developing protein documentaries and other multimedia presentations for molecular biology.
Quinn, G; Wang, H P; Martinez, D; Bourne, P E
1999-01-01
Computer-based multimedia technology for distance learning and research has come of age--the price point is acceptable, domain experts using off-the-shelf software can prepare compelling materials, and the material can be efficiently delivered via the Internet to a large audience. While not presenting any new scientific results, this paper outlines experiences with a variety of commercial and free software tools and the associated protocols we have used to prepare protein documentaries and other multimedia presentations relevant to molecular biology. A protein documentary is defined here as a description of the relationship between structure and function in a single protein or in a related family of proteins. A description using text and images which is further enhanced by the use of sound and interactive graphics. Examples of documentaries prepared to describe cAMP dependent protein kinase, the founding structural member of the protein kinase family for which there is now over 40 structures can be found at http://franklin.burnham-inst.org/rcsb. A variety of other prototype multimedia presentations for molecular biology described in this paper can be found at http://fraklin.burnham-inst.org.
Barone, Rita; Sturiale, Luisella; Fiumara, Agata; Palmigiano, Angelo; Bua, Rosaria O; Rizzo, Renata; Zappia, Mario; Garozzo, Domenico
2016-04-01
Protein N-glycosylation consists in the synthesis and processing of the oligosaccharide moiety (N-glycan) linked to a protein and it serves several functions for the proper central nervous system (CNS) development and function. Previous experimental and clinical studies have shown the importance of proper glycoprotein sialylation for the synaptic function and the occurrence of autism spectrum disorders (ASD) in the presence of sialylation deficiency in the CNS. Late-onset Tay Sachs disease (LOTSD) is a lysosomal disorder caused by mutations in the HEXA gene resulting in GM2-ganglioside storage in the CNS. It is characterized by progressive neurological impairment and high co-occurrence of psychiatric disturbances. We studied the N-glycome profile of the cerebrospinal fluid (CSF) in a 14 year-old patient with GM2-gangliosidosis (LOTSD). At the age of 4, the patient presented regressive autism fulfilling criteria for childhood disintegrative disorder (CDD). A CSF sample was obtained in the course of diagnostic work-up for the suspicion of an underlying neurodegenerative disorder. We found definite changes of CSF N-glycans due to a dramatic decrease of sialylated biantennary and triantennary structures and an increase of asialo-core fucosylated bisected N-glycans. No changes of total plasma N-glycans were found. Herein findings highlight possible relationships between the early onset psychiatric disturbance featuring CDD in the patient and defective protein sialylation in the CNS. In conclusion, the study first shows aberrant N-glycan structures of CSF proteins in LOTSD; unveils possible pathomechanisms of GM2-gangliosidosis; supports existing relationships between neuropsychiatric disorders and unproper protein glycosylation in the CNS. © 2015 International Society for Autism Research, Wiley Periodicals, Inc.
Evolutionary Descent of Prion Genes from the ZIP Family of Metal Ion Transporters
Schmitt-Ulms, Gerold; Ehsani, Sepehr; Watts, Joel C.; Westaway, David; Wille, Holger
2009-01-01
In the more than twenty years since its discovery, both the phylogenetic origin and cellular function of the prion protein (PrP) have remained enigmatic. Insights into a possible function of PrP may be obtained through the characterization of its molecular neighborhood in cells. Quantitative interactome data demonstrated the spatial proximity of two metal ion transporters of the ZIP family, ZIP6 and ZIP10, to mammalian prion proteins in vivo. A subsequent bioinformatic analysis revealed the unexpected presence of a PrP-like amino acid sequence within the N-terminal, extracellular domain of a distinct sub-branch of the ZIP protein family that includes ZIP5, ZIP6 and ZIP10. Additional structural threading and orthologous sequence alignment analyses argued that the prion gene family is phylogenetically derived from a ZIP-like ancestral molecule. The level of sequence homology and the presence of prion protein genes in most chordate species place the split from the ZIP-like ancestor gene at the base of the chordate lineage. This relationship explains structural and functional features found within mammalian prion proteins as elements of an ancient involvement in the transmembrane transport of divalent cations. The phylogenetic and spatial connection to ZIP proteins is expected to open new avenues of research to elucidate the biology of the prion protein in health and disease. PMID:19784368
ERIC Educational Resources Information Center
Inlow, Jennifer K.; Miller, Paige; Pittman, Bethany
2007-01-01
We describe two bioinformatics exercises intended for use in a computer laboratory setting in an upper-level undergraduate biochemistry course. To introduce students to bioinformatics, the exercises incorporate several commonly used bioinformatics tools, including BLAST, that are freely available online. The exercises build upon the students'…
DOE Office of Scientific and Technical Information (OSTI.GOV)
Srikannathasan, Velupillai; English, Grant; Bui, Nhat Khai
Crystal structures of type VI secretion system-associated immunity proteins, a peptidoglycan endopeptidase and a complex of the endopeptidase and its cognate immunity protein are reported together with assays of endopeptidase activity and functional assessment. Some Gram-negative bacteria target their competitors by exploiting the type VI secretion system to extrude toxic effector proteins. To prevent self-harm, these bacteria also produce highly specific immunity proteins that neutralize these antagonistic effectors. Here, the peptidoglycan endopeptidase specificity of two type VI secretion-system-associated effectors from Serratia marcescens is characterized. These small secreted proteins, Ssp1 and Ssp2, cleave between γ-d-glutamic acid and l-meso-diaminopimelic acid with differentmore » specificities. Ssp2 degrades the acceptor part of cross-linked tetratetrapeptides. Ssp1 displays greater promiscuity and cleaves monomeric tripeptides, tetrapeptides and pentapeptides and dimeric tetratetra and tetrapenta muropeptides on both the acceptor and donor strands. Functional assays confirm the identity of a catalytic cysteine in these endopeptidases and crystal structures provide information on the structure–activity relationships of Ssp1 and, by comparison, of related effectors. Functional assays also reveal that neutralization of these effectors by their cognate immunity proteins, which are called resistance-associated proteins (Raps), contributes an essential role to cell fitness. The structures of two immunity proteins, Rap1a and Rap2a, responsible for the neutralization of Ssp1 and Ssp2-like endopeptidases, respectively, revealed two distinct folds, with that of Rap1a not having previously been observed. The structure of the Ssp1–Rap1a complex revealed a tightly bound heteromeric assembly with two effector molecules flanking a Rap1a dimer. A highly effective steric block of the Ssp1 active site forms the basis of effector neutralization. Comparisons with Ssp2–Rap2a orthologues suggest that the specificity of these immunity proteins for neutralizing effectors is fold-dependent and that in cases where the fold is conserved sequence differences contribute to the specificity of effector–immunity protein interactions.« less
Modeling coding-sequence evolution within the context of residue solvent accessibility.
Scherrer, Michael P; Meyer, Austin G; Wilke, Claus O
2012-09-12
Protein structure mediates site-specific patterns of sequence divergence. In particular, residues in the core of a protein (solvent-inaccessible residues) tend to be more evolutionarily conserved than residues on the surface (solvent-accessible residues). Here, we present a model of sequence evolution that explicitly accounts for the relative solvent accessibility of each residue in a protein. Our model is a variant of the Goldman-Yang 1994 (GY94) model in which all model parameters can be functions of the relative solvent accessibility (RSA) of a residue. We apply this model to a data set comprised of nearly 600 yeast genes, and find that an evolutionary-rate ratio ω that varies linearly with RSA provides a better model fit than an RSA-independent ω or an ω that is estimated separately in individual RSA bins. We further show that the branch length t and the transition-transverion ratio κ also vary with RSA. The RSA-dependent GY94 model performs better than an RSA-dependent Muse-Gaut 1994 (MG94) model in which the synonymous and non-synonymous rates individually are linear functions of RSA. Finally, protein core size affects the slope of the linear relationship between ω and RSA, and gene expression level affects both the intercept and the slope. Structure-aware models of sequence evolution provide a significantly better fit than traditional models that neglect structure. The linear relationship between ω and RSA implies that genes are better characterized by their ω slope and intercept than by just their mean ω.
The unfoldomics decade: an update on intrinsically disordered proteins.
Dunker, A Keith; Oldfield, Christopher J; Meng, Jingwei; Romero, Pedro; Yang, Jack Y; Chen, Jessica Walton; Vacic, Vladimir; Obradovic, Zoran; Uversky, Vladimir N
2008-09-16
Our first predictor of protein disorder was published just over a decade ago in the Proceedings of the IEEE International Conference on Neural Networks (Romero P, Obradovic Z, Kissinger C, Villafranca JE, Dunker AK (1997) Identifying disordered regions in proteins from amino acid sequence. Proceedings of the IEEE International Conference on Neural Networks, 1: 90-95). By now more than twenty other laboratory groups have joined the efforts to improve the prediction of protein disorder. While the various prediction methodologies used for protein intrinsic disorder resemble those methodologies used for secondary structure prediction, the two types of structures are entirely different. For example, the two structural classes have very different dynamic properties, with the irregular secondary structure class being much less mobile than the disorder class. The prediction of secondary structure has been useful. On the other hand, the prediction of intrinsic disorder has been revolutionary, leading to major modifications of the more than 100 year-old views relating protein structure and function. Experimentalists have been providing evidence over many decades that some proteins lack fixed structure or are disordered (or unfolded) under physiological conditions. In addition, experimentalists are also showing that, for many proteins, their functions depend on the unstructured rather than structured state; such results are in marked contrast to the greater than hundred year old views such as the lock and key hypothesis. Despite extensive data on many important examples, including disease-associated proteins, the importance of disorder for protein function has been largely ignored. Indeed, to our knowledge, current biochemistry books don't present even one acknowledged example of a disorder-dependent function, even though some reports of disorder-dependent functions are more than 50 years old. The results from genome-wide predictions of intrinsic disorder and the results from other bioinformatics studies of intrinsic disorder are demanding attention for these proteins. Disorder prediction has been important for showing that the relatively few experimentally characterized examples are members of a very large collection of related disordered proteins that are wide-spread over all three domains of life. Many significant biological functions are now known to depend directly on, or are importantly associated with, the unfolded or partially folded state. Here our goal is to review the key discoveries and to weave these discoveries together to support novel approaches for understanding sequence-function relationships. Intrinsically disordered protein is common across the three domains of life, but especially common among the eukaryotic proteomes. Signaling sequences and sites of posttranslational modifications are frequently, or very likely most often, located within regions of intrinsic disorder. Disorder-to-order transitions are coupled with the adoption of different structures with different partners. Also, the flexibility of intrinsic disorder helps different disordered regions to bind to a common binding site on a common partner. Such capacity for binding diversity plays important roles in both protein-protein interaction networks and likely also in gene regulation networks. Such disorder-based signaling is further modulated in multicellular eukaryotes by alternative splicing, for which such splicing events map to regions of disorder much more often than to regions of structure. Associating alternative splicing with disorder rather than structure alleviates theoretical and experimentally observed problems associated with the folding of different length, isomeric amino acid sequences. The combination of disorder and alternative splicing is proposed to provide a mechanism for easily "trying out" different signaling pathways, thereby providing the mechanism for generating signaling diversity and enabling the evolution of cell differentiation and multicellularity. Finally, several recent small molecules of interest as potential drugs have been shown to act by blocking protein-protein interactions based on intrinsic disorder of one of the partners. Study of these examples has led to a new approach for drug discovery, and bioinformatics analysis of the human proteome suggests that various disease-associated proteins are very rich in such disorder-based drug discovery targets.
Nuclear speckles: molecular organization, biological function and role in disease
Galganski, Lukasz; Urbanek, Martyna O.
2017-01-01
Abstract The nucleoplasm is not homogenous; it consists of many types of nuclear bodies, also known as nuclear domains or nuclear subcompartments. These self-organizing structures gather machinery involved in various nuclear activities. Nuclear speckles (NSs) or splicing speckles, also called interchromatin granule clusters, were discovered as sites for splicing factor storage and modification. Further studies on transcription and mRNA maturation and export revealed a more general role for splicing speckles in RNA metabolism. Here, we discuss the functional implications of the localization of numerous proteins crucial for epigenetic regulation, chromatin organization, DNA repair and RNA modification to nuclear speckles. We highlight recent advances suggesting that NSs facilitate integrated regulation of gene expression. In addition, we consider the influence of abundant regulatory and signaling proteins, i.e. protein kinases and proteins involved in protein ubiquitination, phosphoinositide signaling and nucleoskeletal organization, on pre-mRNA synthesis and maturation. While many of these regulatory proteins act within NSs, direct evidence for mRNA metabolism events occurring in NSs is still lacking. NSs contribute to numerous human diseases, including cancers and viral infections. In addition, recent data have demonstrated close relationships between these structures and the development of neurological disorders. PMID:28977640
Dissecting the relationship between protein structure and sequence variation
NASA Astrophysics Data System (ADS)
Shahmoradi, Amir; Wilke, Claus; Wilke Lab Team
2015-03-01
Over the past decade several independent works have shown that some structural properties of proteins are capable of predicting protein evolution. The strength and significance of these structure-sequence relations, however, appear to vary widely among different proteins, with absolute correlation strengths ranging from 0 . 1 to 0 . 8 . Here we present the results from a comprehensive search for the potential biophysical and structural determinants of protein evolution by studying more than 200 structural and evolutionary properties in a dataset of 209 monomeric enzymes. We discuss the main protein characteristics responsible for the general patterns of protein evolution, and identify sequence divergence as the main determinant of the strengths of virtually all structure-evolution relationships, explaining ~ 10 - 30 % of observed variation in sequence-structure relations. In addition to sequence divergence, we identify several protein structural properties that are moderately but significantly coupled with the strength of sequence-structure relations. In particular, proteins with more homogeneous back-bone hydrogen bond energies, large fractions of helical secondary structures and low fraction of beta sheets tend to have the strongest sequence-structure relation. BEACON-NSF center for the study of evolution in action.
Scalise, Mariafrancesca; Pochini, Lorena; Console, Lara; Pappacoda, Gilda; Pingitore, Piero; Hedfalk, Kristina; Indiveri, Cesare
2018-01-01
The human plasma membrane transporter ASCT2 is responsible for mediating Na- dependent antiport of neutral amino acids. New insights into structure/function relationships were unveiled by a combined approach of recombinant over-expression, site-directed mutagenesis, transport assays in proteoliposomes and bioinformatics. WT and Cys mutants of hASCT2 were produced in P. pastoris and purified for functional assay. The reactivity towards SH reducing and oxidizing agents of WT protein was investigated and opposite effects were revealed; transport activity increased upon treatment with the Cys reducing agent DTE, i.e., when Cys residues were in thiol (reduced) state. Methyl-Hg, which binds to SH groups, was able to inhibit WT and seven out of eight Cys to Ala mutants. On the contrary, C467A loses the sensitivity to both DTE activation and Methyl-Hg inhibition. The C467A mutant showed a Km for Gln one order of magnitude higher than that of WT. Moreover, the C467 residue is localized in the substrate binding region of the protein, as suggested by bioinformatics on the basis of the EAAT1 structure comparison. Taken together, the experimental data allowed identifying C467 residue as crucial for substrate binding and for transport activity modulation of hASCT2. PMID:29495336
Tiwari, Sandhya P.; Reuter, Nathalie
2016-01-01
The conservation of the intrinsic dynamics of proteins emerges as we attempt to understand the relationship between sequence, structure and functional conservation. We characterise the conservation of such dynamics in a case where the structure is conserved but function differs greatly. The triosephosphate isomerase barrel fold (TBF), renowned for its 8 β-strand-α-helix repeats that close to form a barrel, is one of the most diverse and abundant folds found in known protein structures. Proteins with this fold have diverse enzymatic functions spanning five of six Enzyme Commission classes, and we have picked five different superfamily candidates for our analysis using elastic network models. We find that the overall shape is a large determinant in the similarity of the intrinsic dynamics, regardless of function. In particular, the β-barrel core is highly rigid, while the α-helices that flank the β-strands have greater relative mobility, allowing for the many possibilities for placement of catalytic residues. We find that these elements correlate with each other via the loops that link them, as opposed to being directly correlated. We are also able to analyse the types of motions encoded by the normal mode vectors of the α-helices. We suggest that the global conservation of the intrinsic dynamics in the TBF contributes greatly to its success as an enzymatic scaffold both through evolution and enzyme design. PMID:27015412
Hatton, Leslie; Warr, Gregory
2015-01-01
That the physicochemical properties of amino acids constrain the structure, function and evolution of proteins is not in doubt. However, principles derived from information theory may also set bounds on the structure (and thus also the evolution) of proteins. Here we analyze the global properties of the full set of proteins in release 13-11 of the SwissProt database, showing by experimental test of predictions from information theory that their collective structure exhibits properties that are consistent with their being guided by a conservation principle. This principle (Conservation of Information) defines the global properties of systems composed of discrete components each of which is in turn assembled from discrete smaller pieces. In the system of proteins, each protein is a component, and each protein is assembled from amino acids. Central to this principle is the inter-relationship of the unique amino acid count and total length of a protein and its implications for both average protein length and occurrence of proteins with specific unique amino acid counts. The unique amino acid count is simply the number of distinct amino acids (including those that are post-translationally modified) that occur in a protein, and is independent of the number of times that the particular amino acid occurs in the sequence. Conservation of Information does not operate at the local level (it is independent of the physicochemical properties of the amino acids) where the influences of natural selection are manifest in the variety of protein structure and function that is well understood. Rather, this analysis implies that Conservation of Information would define the global bounds within which the whole system of proteins is constrained; thus it appears to be acting to constrain evolution at a level different from natural selection, a conclusion that appears counter-intuitive but is supported by the studies described herein.
2017-01-01
Normal aging is associated with a decline in episodic memory and also with aggregation of the β-amyloid (Aβ) and tau proteins and atrophy of medial temporal lobe (MTL) structures crucial to memory formation. Although some evidence suggests that Aβ is associated with aberrant neural activity, the relationships among these two aggregated proteins, neural function, and brain structure are poorly understood. Using in vivo human Aβ and tau imaging, we demonstrate that increased Aβ and tau are both associated with aberrant fMRI activity in the MTL during memory encoding in cognitively normal older adults. This pathological neural activity was in turn associated with worse memory performance and atrophy within the MTL. A mediation analysis revealed that the relationship with regional atrophy was explained by MTL tau. These findings broaden the concept of cognitive aging to include evidence of Alzheimer's disease-related protein aggregation as an underlying mechanism of age-related memory impairment. SIGNIFICANCE STATEMENT Alterations in episodic memory and the accumulation of Alzheimer's pathology are common in cognitively normal older adults. However, evidence of pathological effects on episodic memory has largely been limited to β-amyloid (Aβ). Because Aβ and tau often cooccur in older adults, previous research offers an incomplete understanding of the relationship between pathology and episodic memory. With the recent development of in vivo tau PET radiotracers, we show that Aβ and tau are associated with different aspects of memory encoding, leading to aberrant neural activity that is behaviorally detrimental. In addition, our results provide evidence linking Aβ- and tau-associated neural dysfunction to brain atrophy. PMID:28213439
Accurate protein structure modeling using sparse NMR data and homologous structure information.
Thompson, James M; Sgourakis, Nikolaos G; Liu, Gaohua; Rossi, Paolo; Tang, Yuefeng; Mills, Jeffrey L; Szyperski, Thomas; Montelione, Gaetano T; Baker, David
2012-06-19
While information from homologous structures plays a central role in X-ray structure determination by molecular replacement, such information is rarely used in NMR structure determination because it can be incorrect, both locally and globally, when evolutionary relationships are inferred incorrectly or there has been considerable evolutionary structural divergence. Here we describe a method that allows robust modeling of protein structures of up to 225 residues by combining (1)H(N), (13)C, and (15)N backbone and (13)Cβ chemical shift data, distance restraints derived from homologous structures, and a physically realistic all-atom energy function. Accurate models are distinguished from inaccurate models generated using incorrect sequence alignments by requiring that (i) the all-atom energies of models generated using the restraints are lower than models generated in unrestrained calculations and (ii) the low-energy structures converge to within 2.0 Å backbone rmsd over 75% of the protein. Benchmark calculations on known structures and blind targets show that the method can accurately model protein structures, even with very remote homology information, to a backbone rmsd of 1.2-1.9 Å relative to the conventional determined NMR ensembles and of 0.9-1.6 Å relative to X-ray structures for well-defined regions of the protein structures. This approach facilitates the accurate modeling of protein structures using backbone chemical shift data without need for side-chain resonance assignments and extensive analysis of NOESY cross-peak assignments.
The Lysozyme from Insect (Manduca sexta) is a Cold-Adapted Enzyme
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sotelo-Mundo,R.; Lopez-Zavala, A.; Garcia-Orozco, K.
Enzymatic activity is dependent on temperature, although some proteins have evolved to retain activity at low temperatures at the expense of stability. Cold adapted enzymes are present in a variety of organisms and there is ample interest in their structure-function relationships. Lysozyme (E.C. 3.2.1.17) is one of the most studied enzymes due to its antibacterial activity against Gram positive bacteria and is also a cold adapted protein. In this work the characterization of lysozyme from the insect Manduca sexta and its activity at low temperatures is presented. Both M. sexta lysozymes natural and recombinant showed a higher content of {alpha}-helixmore » secondary structure compared to that of hen egg white lysozyme and a higher specific enzymatic activity in the range of 5-30 {sup o}C. These results together with measured thermodynamic activation parameters support the designation of M. sexta lysozyme as a cold adapted enzyme. Therefore, the insect recombinant lysozyme is feasible as a model for structure-function studies for cold-adapted proteins.« less
Merckel, Michael C; Huiskonen, Juha T; Bamford, Dennis H; Goldman, Adrian; Tuma, Roman
2005-04-15
Comparisons of bacteriophage PRD1 and adenovirus protein structures and virion architectures have been instrumental in unraveling an evolutionary relationship and have led to a proposal of a phylogeny-based virus classification. The structure of the PRD1 spike protein P5 provides further insight into the evolution of viral proteins. The crystallized P5 fragment comprises two structural domains: a globular knob and a fibrous shaft. The head folds into a ten-stranded jelly roll beta barrel, which is structurally related to the tumor necrosis factor (TNF) and the PRD1 coat protein domains. The shaft domain is a structural counterpart to the adenovirus spike shaft. The structural relationships between PRD1, TNF, and adenovirus proteins suggest that the vertex proteins may have originated from an ancestral TNF-like jelly roll coat protein via a combination of gene duplication and deletion.
Patel, Trushar R; Chojnowski, Grzegorz; Astha; Koul, Amit; McKenna, Sean A; Bujnicki, Janusz M
2017-04-15
The diverse functional cellular roles played by ribonucleic acids (RNA) have emphasized the need to develop rapid and accurate methodologies to elucidate the relationship between the structure and function of RNA. Structural biology tools such as X-ray crystallography and Nuclear Magnetic Resonance are highly useful methods to obtain atomic-level resolution models of macromolecules. However, both methods have sample, time, and technical limitations that prevent their application to a number of macromolecules of interest. An emerging alternative to high-resolution structural techniques is to employ a hybrid approach that combines low-resolution shape information about macromolecules and their complexes from experimental hydrodynamic (e.g. analytical ultracentrifugation) and solution scattering measurements (e.g., solution X-ray or neutron scattering), with computational modeling to obtain atomic-level models. While promising, scattering methods rely on aggregation-free, monodispersed preparations and therefore the careful development of a quality control pipeline is fundamental to an unbiased and reliable structural determination. This review article describes hydrodynamic techniques that are highly valuable for homogeneity studies, scattering techniques useful to study the low-resolution shape, and strategies for computational modeling to obtain high-resolution 3D structural models of RNAs, proteins, and RNA-protein complexes. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.
Systematic detection of internal symmetry in proteins using CE-Symm.
Myers-Turnbull, Douglas; Bliven, Spencer E; Rose, Peter W; Aziz, Zaid K; Youkharibache, Philippe; Bourne, Philip E; Prlić, Andreas
2014-05-29
Symmetry is an important feature of protein tertiary and quaternary structures that has been associated with protein folding, function, evolution, and stability. Its emergence and ensuing prevalence has been attributed to gene duplications, fusion events, and subsequent evolutionary drift in sequence. This process maintains structural similarity and is further supported by this study. To further investigate the question of how internal symmetry evolved, how symmetry and function are related, and the overall frequency of internal symmetry, we developed an algorithm, CE-Symm, to detect pseudo-symmetry within the tertiary structure of protein chains. Using a large manually curated benchmark of 1007 protein domains, we show that CE-Symm performs significantly better than previous approaches. We use CE-Symm to build a census of symmetry among domain superfamilies in SCOP and note that 18% of all superfamilies are pseudo-symmetric. Our results indicate that more domains are pseudo-symmetric than previously estimated. We establish a number of recurring types of symmetry-function relationships and describe several characteristic cases in detail. With the use of the Enzyme Commission classification, symmetry was found to be enriched in some enzyme classes but depleted in others. CE-Symm thus provides a methodology for a more complete and detailed study of the role of symmetry in tertiary protein structure [availability: CE-Symm can be run from the Web at http://source.rcsb.org/jfatcatserver/symmetry.jsp. Source code and software binaries are also available under the GNU Lesser General Public License (version 2.1) at https://github.com/rcsb/symmetry. An interactive census of domains identified as symmetric by CE-Symm is available from http://source.rcsb.org/jfatcatserver/scopResults.jsp]. Copyright © 2014. Published by Elsevier Ltd.
Schwer, Beate; Kruchten, Joshua; Shuman, Stewart
2016-09-01
A seven-subunit Sm protein ring forms a core scaffold of the U1, U2, U4, and U5 snRNPs that direct pre-mRNA splicing. Using human snRNP structures to guide mutagenesis in Saccharomyces cerevisiae, we gained new insights into structure-function relationships of the SmG, SmE, and SmF subunits. An alanine scan of 19 conserved amino acids of these three proteins, comprising the Sm RNA binding sites or inter-subunit interfaces, revealed that, with the exception of Arg74 in SmF, none are essential for yeast growth. Yet, for SmG, SmE, and SmF, as for many components of the yeast spliceosome, the effects of perturbing protein-RNA and protein-protein interactions are masked by built-in functional redundancies of the splicing machine. For example, tests for genetic interactions with non-Sm splicing factors showed that many benign mutations of SmG, SmE, and SmF (and of SmB and SmD3) were synthetically lethal with null alleles of U2 snRNP subunits Lea1 and Msl1. Tests of pairwise combinations of SmG, SmE, SmF, SmB, and SmD3 alleles highlighted the inherent redundancies within the Sm ring, whereby simultaneous mutations of the RNA binding sites of any two of the Sm subunits are lethal. Our results suggest that six intact RNA binding sites in the Sm ring suffice for function but five sites may not. © 2016 Schwer et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Phylogenetic profiles reveal structural/functional determinants of TRPC3 signal-sensing antennae
Ko, Kyung Dae; Bhardwaj, Gaurav; Hong, Yoojin; Chang, Gue Su; Kiselyov, Kirill
2009-01-01
Biochemical assessment of channel structure/function is incredibly challenging. Developing computational tools that provide these data would enable translational research, accelerating mechanistic experimentation for the bench scientist studying ion channels. Starting with the premise that protein sequence encodes information about structure, function and evolution (SF&E), we developed a unified framework for inferring SF&E from sequence information using a knowledge-based approach. The Gestalt Domain Detection Algorithm-Basic Local Alignment Tool (GDDA-BLAST) provides phylogenetic profiles that can model, ab initio, SF&E relationships of biological sequences at the whole protein, single domain and single-amino acid level.1,2 In our recent paper,4 we have applied GDDA-BLAST analysis to study canonical TRP (TRPC) channels1 and empirically validated predicted lipid-binding and trafficking activities contained within the TRPC3 TRP_2 domain of unknown function. Overall, our in silico, in vitro, and in vivo experiments support a model in which TRPC3 has signal-sensing antennae which are adorned with lipid-binding, trafficking and calmodulin regulatory domains. In this Addendum, we correlate our functional domain analysis with the cryo-EM structure of TRPC3.3 In addition, we synthesize recent studies with our new findings to provide a refined model on the mechanism(s) of TRPC3 activation/deactivation. PMID:19704910
Logeman, Brandon L; Wood, L Kent; Lee, Jaekwon; Thiele, Dennis J
2017-07-07
Copper is an essential element for proper organismal development and is involved in a range of processes, including oxidative phosphorylation, neuropeptide biogenesis, and connective tissue maturation. The copper transporter (Ctr) family of integral membrane proteins is ubiquitously found in eukaryotes and mediates the high-affinity transport of Cu + across both the plasma membrane and endomembranes. Although mammalian Ctr1 functions as a Cu + transporter for Cu acquisition and is essential for embryonic development, a homologous protein, Ctr2, has been proposed to function as a low-affinity Cu transporter, a lysosomal Cu exporter, or a regulator of Ctr1 activity, but its functional and evolutionary relationship to Ctr1 is unclear. Here we report a biochemical, genetic, and phylogenetic comparison of metazoan Ctr1 and Ctr2, suggesting that Ctr2 arose over 550 million years ago as a result of a gene duplication event followed by loss of Cu + transport activity. Using a random mutagenesis and growth selection approach, we identified amino acid substitutions in human and mouse Ctr2 proteins that support copper-dependent growth in yeast and enhance copper accumulation in Ctr1 -/- mouse embryonic fibroblasts. These mutations revert Ctr2 to a more ancestral Ctr1-like state while maintaining endogenous functions, such as stimulating Ctr1 cleavage. We suggest key structural aspects of metazoan Ctr1 and Ctr2 that discriminate between their biological roles, providing mechanistic insights into the evolutionary, biochemical, and functional relationships between these two related proteins. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Determinants of Mammalian Nucleolar Architecture
Farley, Katherine I.; Surovtseva, Yulia; Merkel, Janie; Baserga, Susan J.
2015-01-01
The nucleolus is responsible for the production of ribosomes, essential machines which synthesize all proteins needed by the cell. The structure of human nucleoli is highly dynamic and is directly related to its functions in ribosome biogenesis. Despite the importance of this organelle, the intricate relationship between nucleolar structure and function remains largely unexplored. How do cells control nucleolar formation and function? What are the minimal requirements for making a functional nucleolus? Here we review what is currently known regarding mammalian nucleolar formation at nucleolar organizer regions (NORs), which can be studied by observing the dissolution and reformation of the nucleolus during each cell division. Additionally, the nucleolus can be examined by analyzing how alterations in nucleolar function manifest in differences in nucleolar architecture. Furthermore, changes in nucleolar structure and function are correlated with cancer, highlighting the importance of studying the determinants of nucleolar formation. PMID:25670395
Sequence Determinants of Compaction in Intrinsically Disordered Proteins
Marsh, Joseph A.; Forman-Kay, Julie D.
2010-01-01
Abstract Intrinsically disordered proteins (IDPs), which lack folded structure and are disordered under nondenaturing conditions, have been shown to perform important functions in a large number of cellular processes. These proteins have interesting structural properties that deviate from the random-coil-like behavior exhibited by chemically denatured proteins. In particular, IDPs are often observed to exhibit significant compaction. In this study, we have analyzed the hydrodynamic radii of a number of IDPs to investigate the sequence determinants of this compaction. Net charge and proline content are observed to be strongly correlated with increased hydrodynamic radii, suggesting that these are the dominant contributors to compaction. Hydrophobicity and secondary structure, on the other hand, appear to have negligible effects on compaction, which implies that the determinants of structure in folded and intrinsically disordered proteins are profoundly different. Finally, we observe that polyhistidine tags seem to increase IDP compaction, which suggests that these tags have significant perturbing effects and thus should be removed before any structural characterizations of IDPs. Using the relationships observed in this analysis, we have developed a sequence-based predictor of hydrodynamic radius for IDPs that shows substantial improvement over a simple model based upon chain length alone. PMID:20483348
The Diverse Roles of Arrestin Scaffolds in G Protein-Coupled Receptor Signaling.
Peterson, Yuri K; Luttrell, Louis M
2017-07-01
The visual/ β -arrestins, a small family of proteins originally described for their role in the desensitization and intracellular trafficking of G protein-coupled receptors (GPCRs), have emerged as key regulators of multiple signaling pathways. Evolutionarily related to a larger group of regulatory scaffolds that share a common arrestin fold, the visual/ β -arrestins acquired the capacity to detect and bind activated GPCRs on the plasma membrane, which enables them to control GPCR desensitization, internalization, and intracellular trafficking. By acting as scaffolds that bind key pathway intermediates, visual/ β -arrestins both influence the tonic level of pathway activity in cells and, in some cases, serve as ligand-regulated scaffolds for GPCR-mediated signaling. Growing evidence supports the physiologic and pathophysiologic roles of arrestins and underscores their potential as therapeutic targets. Circumventing arrestin-dependent GPCR desensitization may alleviate the problem of tachyphylaxis to drugs that target GPCRs, and find application in the management of chronic pain, asthma, and psychiatric illness. As signaling scaffolds, arrestins are also central regulators of pathways controlling cell growth, migration, and survival, suggesting that manipulating their scaffolding functions may be beneficial in inflammatory diseases, fibrosis, and cancer. In this review we examine the structure-function relationships that enable arrestins to perform their diverse roles, addressing arrestin structure at the molecular level, the relationship between arrestin conformation and function, and sites of interaction between arrestins, GPCRs, and nonreceptor-binding partners. We conclude with a discussion of arrestins as therapeutic targets and the settings in which manipulating arrestin function might be of clinical benefit. Copyright © 2017 by The American Society for Pharmacology and Experimental Therapeutics.
Sampling Enrichment toward Target Structures Using Hybrid Molecular Dynamics-Monte Carlo Simulations
Yang, Kecheng; Różycki, Bartosz; Cui, Fengchao; Shi, Ce; Chen, Wenduo; Li, Yunqi
2016-01-01
Sampling enrichment toward a target state, an analogue of the improvement of sampling efficiency (SE), is critical in both the refinement of protein structures and the generation of near-native structure ensembles for the exploration of structure-function relationships. We developed a hybrid molecular dynamics (MD)-Monte Carlo (MC) approach to enrich the sampling toward the target structures. In this approach, the higher SE is achieved by perturbing the conventional MD simulations with a MC structure-acceptance judgment, which is based on the coincidence degree of small angle x-ray scattering (SAXS) intensity profiles between the simulation structures and the target structure. We found that the hybrid simulations could significantly improve SE by making the top-ranked models much closer to the target structures both in the secondary and tertiary structures. Specifically, for the 20 mono-residue peptides, when the initial structures had the root-mean-squared deviation (RMSD) from the target structure smaller than 7 Å, the hybrid MD-MC simulations afforded, on average, 0.83 Å and 1.73 Å in RMSD closer to the target than the parallel MD simulations at 310K and 370K, respectively. Meanwhile, the average SE values are also increased by 13.2% and 15.7%. The enrichment of sampling becomes more significant when the target states are gradually detectable in the MD-MC simulations in comparison with the parallel MD simulations, and provide >200% improvement in SE. We also performed a test of the hybrid MD-MC approach in the real protein system, the results showed that the SE for 3 out of 5 real proteins are improved. Overall, this work presents an efficient way of utilizing solution SAXS to improve protein structure prediction and refinement, as well as the generation of near native structures for function annotation. PMID:27227775
Yang, Kecheng; Różycki, Bartosz; Cui, Fengchao; Shi, Ce; Chen, Wenduo; Li, Yunqi
2016-01-01
Sampling enrichment toward a target state, an analogue of the improvement of sampling efficiency (SE), is critical in both the refinement of protein structures and the generation of near-native structure ensembles for the exploration of structure-function relationships. We developed a hybrid molecular dynamics (MD)-Monte Carlo (MC) approach to enrich the sampling toward the target structures. In this approach, the higher SE is achieved by perturbing the conventional MD simulations with a MC structure-acceptance judgment, which is based on the coincidence degree of small angle x-ray scattering (SAXS) intensity profiles between the simulation structures and the target structure. We found that the hybrid simulations could significantly improve SE by making the top-ranked models much closer to the target structures both in the secondary and tertiary structures. Specifically, for the 20 mono-residue peptides, when the initial structures had the root-mean-squared deviation (RMSD) from the target structure smaller than 7 Å, the hybrid MD-MC simulations afforded, on average, 0.83 Å and 1.73 Å in RMSD closer to the target than the parallel MD simulations at 310K and 370K, respectively. Meanwhile, the average SE values are also increased by 13.2% and 15.7%. The enrichment of sampling becomes more significant when the target states are gradually detectable in the MD-MC simulations in comparison with the parallel MD simulations, and provide >200% improvement in SE. We also performed a test of the hybrid MD-MC approach in the real protein system, the results showed that the SE for 3 out of 5 real proteins are improved. Overall, this work presents an efficient way of utilizing solution SAXS to improve protein structure prediction and refinement, as well as the generation of near native structures for function annotation.
Wilczynski, Andrzej; Wilson, Krista R; Scott, Joseph W; Edison, Arthur S; Haskell-Luevano, Carrie
2005-04-21
The melanocortin receptor system consists of endogenous agonists, antagonists, G-protein coupled receptors, and auxiliary proteins that are involved in the regulation of complex physiological functions such as energy and weight homeostasis, feeding behavior, inflammation, sexual function, pigmentation, and exocrine gland function. Herein, we report the structure-activity relationship (SAR) of a new chimeric hAGRP-melanocortin agonist peptide template Tyr-c[beta-Asp-His-DPhe-Arg-Trp-Asn-Ala-Phe-Dpr]-Tyr-NH(2) that was characterized using amino acids previously reported in other melanocortin agonist templates. Twenty peptides were examined in this study, and six peptides were selected for (1)H NMR and computer-assisted molecular modeling structural analysis. The most notable results include the identification that modification of the chimeric template at the His position with Pro and Phe resulted in ligands that were nM mouse melanocortin-3 receptor (mMC3R) antagonists and nM mouse melanocortin-4 receptor (mMC4R) agonists. The peptides Tyr-c[beta-Asp-His-DPhe-Ala-Trp-Asn-Ala-Phe-Dpr]-Tyr-NH(2) and Tyr-c[beta-Asp-His-DNal(1')-Arg-Trp-Asn-Ala-Phe-Dpr]-Tyr-NH(2) resulted in 730- and 560-fold, respectively, mMC4R versus mMC3R selective agonists that also possessed nM agonist potency at the mMC1R and mMC5R. Structural studies identified a reverse turn occurring in the His-DPhe-Arg-Trp domain, with subtle differences observed that may account for the differences in melanocortin receptor pharmacology. Specifically, a gamma-turn secondary structure involving the DPhe(4) in the central position of the Tyr-c[beta-Asp-Phe-DPhe-Arg-Trp-Asn-Ala-Phe-Dpr]-Tyr-NH(2) peptide may differentiate the mixed mMC3R antagonist and mMC4R agonist pharmacology.
Structure and function of matrix proteins and peptides in the biomineral formation in crustaceans.
Nagasawa, Hiromichi
2011-01-01
Crustaceans have hard cuticle with layered structure, which is composed mainly of chitin, proteins, and calcium carbonate. Crustaceans grow by shedding the old cuticle and replacing it with a new one. Decalcification in the cuticle during the pre-molt stage and concomitant calcification in the stomach to form gastroliths observed in some crustacean species are triggered by the molting hormone. Various proteins and peptides have been identified from calcified cuticle and gastroliths, and their functions have been examined in terms of calcification and interaction with chitin. Acidic nature of matrix proteins is important for recruitment of calcium ions and interaction with calcium carbonate. Examination of the relationship between amino acid sequence containing acidic amino acid residues and calcification inhibitory activity revealed that the potency did not depend on the sequence but on the number of acidic amino acid residues. Calcium carbonate in the calcified tissues of crustaceans is amorphous in many cases. Crustaceans take a strategy to induce and maintain amorphous calcium carbonate by using low-molecular-weight phosphorus compounds.
Strong underwater adhesives made by self-assembling multi-protein nanofibres.
Zhong, Chao; Gurry, Thomas; Cheng, Allen A; Downey, Jordan; Deng, Zhengtao; Stultz, Collin M; Lu, Timothy K
2014-10-01
Many natural underwater adhesives harness hierarchically assembled amyloid nanostructures to achieve strong and robust interfacial adhesion under dynamic and turbulent environments. Despite recent advances, our understanding of the molecular design, self-assembly and structure-function relationships of these natural amyloid fibres remains limited. Thus, designing biomimetic amyloid-based adhesives remains challenging. Here, we report strong and multi-functional underwater adhesives obtained from fusing mussel foot proteins (Mfps) of Mytilus galloprovincialis with CsgA proteins, the major subunit of Escherichia coli amyloid curli fibres. These hybrid molecular materials hierarchically self-assemble into higher-order structures, in which, according to molecular dynamics simulations, disordered adhesive Mfp domains are exposed on the exterior of amyloid cores formed by CsgA. Our fibres have an underwater adhesion energy approaching 20.9 mJ m(-2), which is 1.5 times greater than the maximum of bio-inspired and bio-derived protein-based underwater adhesives reported thus far. Moreover, they outperform Mfps or curli fibres taken on their own and exhibit better tolerance to auto-oxidation than Mfps at pH ≥ 7.0.
Liu, Jin; Chen, Yu; Li, Jing-Ya; Luo, Cheng; Li, Jia; Chen, Kai-Xian; Li, Xu-Wen; Guo, Yue-Wei
2018-03-20
Phidianidines A and B are two novel marine indole alkaloids bearing an uncommon 1,2,4-oxadiazole ring and exhibiting various biological activities. Our previous research showed that the synthesized phidianidine analogs had the potential to inhibit the activity of protein tyrosine phosphatase 1B (PTP1B), a validated target for Type II diabetes, which indicates that these analogs are worth further structural modification. Therefore, in this paper, a series of phidianidine derivatives were designed and rapidly synthesized with a function-oriented synthesis (FOS) strategy. Their inhibitory effects on PTP1B and T-cell protein tyrosine phosphatase (TCPTP) were evaluated, and several compounds displayed significant inhibitory potency and specific selectivity over PTP1B. The structure-activity relationship (SAR) and molecular docking analyses are also described.
Seki, Takakazu; So, Christopher R; Page, Tamon R; Starkebaum, David; Hayamizu, Yuhei; Sarikaya, Mehmet
2018-02-06
The nanoscale self-organization of biomolecules, such as proteins and peptides, on solid surfaces under controlled conditions is an important issue in establishing functional bio/solid soft interfaces for bioassays, biosensors, and biofuel cells. Electrostatic interaction between proteins and surfaces is one of the most essential parameters in the adsorption and self-assembly of proteins on solid surfaces. Although the adsorption of proteins has been studied with respect to the electrochemical surface potential, the self-assembly of proteins or peptides forming well-organized nanostructures templated by lattice structure of the solid surfaces has not been studied in the relation to the surface potential. In this work, we utilize graphite-binding peptides (GrBPs) selected by the phage display method to investigate the relationship between the electrochemical potential of the highly ordered pyrolytic graphite (HOPG) and peptide self-organization forming long-range-ordered structures. Under modulated electrical bias, graphite-binding peptides form various ordered structures, such as well-ordered nanowires, dendritic structures, wavy wires, amorphous (disordered) structures, and islands. A systematic investigation of the correlation between peptide sequence and self-organizational characteristics reveals that the presence of the bias-sensitive amino acid modules in the peptide sequence has a significant effect on not only surface coverage but also on the morphological features of self-assembled structures. Our results show a new method to control peptide self-assembly by means of applied electrochemical bias as well as peptide design-rules for the construction of functional soft bio/solid interfaces that could be integrated in a wide range of practical implementations.
ERIC Educational Resources Information Center
Treacy, Daniel J.; Sankaran, Saumya M.; Gordon-Messer, Susannah; Saly, Danielle; Miller, Rebecca; Isaac, R. Stefan; Kosinski-Collins, Melissa S.
2011-01-01
In introductory laboratory courses, many universities are turning from traditional laboratories with predictable outcomes to inquiry-inspired, project-based laboratory curricula. In these labs, students are allowed to design at least some portion of their own experiment and interpret new, undiscovered data. We have redesigned the introductory…
Hrle, Ajla; Maier, Lisa-Katharina; Sharma, Kundan; Ebert, Judith; Basquin, Claire; Urlaub, Henning; Marchfelder, Anita; Conti, Elena
2014-01-01
Upon pathogen invasion, bacteria and archaea activate an RNA-interference-like mechanism termed CRISPR (clustered regularly interspaced short palindromic repeats). A large family of Cas (CRISPR-associated) proteins mediates the different stages of this sophisticated immune response. Bioinformatic studies have classified the Cas proteins into families, according to their sequences and respective functions. These range from the insertion of the foreign genetic elements into the host genome to the activation of the interference machinery as well as target degradation upon attack. Cas7 family proteins are central to the type I and type III interference machineries as they constitute the backbone of the large interference complexes. Here we report the crystal structure of Thermofilum pendens Csc2, a Cas7 family protein of type I-D. We found that Csc2 forms a core RRM-like domain, flanked by three peripheral insertion domains: a lid domain, a Zinc-binding domain and a helical domain. Comparison with other Cas7 family proteins reveals a set of similar structural features both in the core and in the peripheral domains, despite the absence of significant sequence similarity. T. pendens Csc2 binds single-stranded RNA in vitro in a sequence-independent manner. Using a crosslinking - mass-spectrometry approach, we mapped the RNA-binding surface to a positively charged surface patch on T. pendens Csc2. Thus our analysis of the key structural and functional features of T. pendens Csc2 highlights recurring themes and evolutionary relationships in type I and type III Cas proteins.
On the Origin of Protein Superfamilies and Superfolds
NASA Astrophysics Data System (ADS)
Magner, Abram; Szpankowski, Wojciech; Kihara, Daisuke
2015-02-01
Distributions of protein families and folds in genomes are highly skewed, having a small number of prevalent superfamiles/superfolds and a large number of families/folds of a small size. Why are the distributions of protein families and folds skewed? Why are there only a limited number of protein families? Here, we employ an information theoretic approach to investigate the protein sequence-structure relationship that leads to the skewed distributions. We consider that protein sequences and folds constitute an information theoretic channel and computed the most efficient distribution of sequences that code all protein folds. The identified distributions of sequences and folds are found to follow a power law, consistent with those observed for proteins in nature. Importantly, the skewed distributions of sequences and folds are suggested to have different origins: the skewed distribution of sequences is due to evolutionary pressure to achieve efficient coding of necessary folds, whereas that of folds is based on the thermodynamic stability of folds. The current study provides a new information theoretic framework for proteins that could be widely applied for understanding protein sequences, structures, functions, and interactions.
Dixit, Anshuman; Verkhivker, Gennady M.
2012-01-01
Deciphering functional mechanisms of the Hsp90 chaperone machinery is an important objective in cancer biology aiming to facilitate discovery of targeted anti-cancer therapies. Despite significant advances in understanding structure and function of molecular chaperones, organizing molecular principles that control the relationship between conformational diversity and functional mechanisms of the Hsp90 activity lack a sufficient quantitative characterization. We combined molecular dynamics simulations, principal component analysis, the energy landscape model and structure-functional analysis of Hsp90 regulatory interactions to systematically investigate functional dynamics of the molecular chaperone. This approach has identified a network of conserved regions common to the Hsp90 chaperones that could play a universal role in coordinating functional dynamics, principal collective motions and allosteric signaling of Hsp90. We have found that these functional motifs may be utilized by the molecular chaperone machinery to act collectively as central regulators of Hsp90 dynamics and activity, including the inter-domain communications, control of ATP hydrolysis, and protein client binding. These findings have provided support to a long-standing assertion that allosteric regulation and catalysis may have emerged via common evolutionary routes. The interaction networks regulating functional motions of Hsp90 may be determined by the inherent structural architecture of the molecular chaperone. At the same time, the thermodynamics-based “conformational selection” of functional states is likely to be activated based on the nature of the binding partner. This mechanistic model of Hsp90 dynamics and function is consistent with the notion that allosteric networks orchestrating cooperative protein motions can be formed by evolutionary conserved and sparsely connected residue clusters. Hence, allosteric signaling through a small network of distantly connected residue clusters may be a rather general functional requirement encoded across molecular chaperones. The obtained insights may be useful in guiding discovery of allosteric Hsp90 inhibitors targeting protein interfaces with co-chaperones and protein binding clients. PMID:22624053
Silk Materials Functionalized via Genetic Engineering for Biomedical Applications
Deptuch, Tomasz
2017-01-01
The great mechanical properties, biocompatibility and biodegradability of silk-based materials make them applicable to the biomedical field. Genetic engineering enables the construction of synthetic equivalents of natural silks. Knowledge about the relationship between the structure and function of silk proteins enables the design of bioengineered silks that can serve as the foundation of new biomaterials. Furthermore, in order to better address the needs of modern biomedicine, genetic engineering can be used to obtain silk-based materials with new functionalities. Sequences encoding new peptides or domains can be added to the sequences encoding the silk proteins. The expression of one cDNA fragment indicates that each silk molecule is related to a functional fragment. This review summarizes the proposed genetic functionalization of silk-based materials that can be potentially useful for biomedical applications. PMID:29231863
SANSparallel: interactive homology search against Uniprot.
Somervuo, Panu; Holm, Liisa
2015-07-01
Proteins evolve by mutations and natural selection. The network of sequence similarities is a rich source for mining homologous relationships that inform on protein structure and function. There are many servers available to browse the network of homology relationships but one has to wait up to a minute for results. The SANSparallel webserver provides protein sequence database searches with immediate response and professional alignment visualization by third-party software. The output is a list, pairwise alignment or stacked alignment of sequence-similar proteins from Uniprot, UniRef90/50, Swissprot or Protein Data Bank. The stacked alignments are viewed in Jalview or as sequence logos. The database search uses the suffix array neighborhood search (SANS) method, which has been re-implemented as a client-server, improved and parallelized. The method is extremely fast and as sensitive as BLAST above 50% sequence identity. Benchmarks show that the method is highly competitive compared to previously published fast database search programs: UBLAST, DIAMOND, LAST, LAMBDA, RAPSEARCH2 and BLAT. The web server can be accessed interactively or programmatically at http://ekhidna2.biocenter.helsinki.fi/cgi-bin/sans/sans.cgi. It can be used to make protein functional annotation pipelines more efficient, and it is useful in interactive exploration of the detailed evidence supporting the annotation of particular proteins of interest. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Kristensen, Tatjana P; Maria Cherian, Reeja; Gray, Fiona C; MacNeill, Stuart A
2014-01-01
The hexameric MCM complex is the catalytic core of the replicative helicase in eukaryotic and archaeal cells. Here we describe the first in vivo analysis of archaeal MCM protein structure and function relationships using the genetically tractable haloarchaeon Haloferax volcanii as a model system. Hfx. volcanii encodes a single MCM protein that is part of the previously identified core group of haloarchaeal MCM proteins. Three structural features of the N-terminal domain of the Hfx. volcanii MCM protein were targeted for mutagenesis: the β7-β8 and β9-β10 β-hairpin loops and putative zinc binding domain. Five strains carrying single point mutations in the β7-β8 β-hairpin loop were constructed, none of which displayed impaired cell growth under normal conditions or when treated with the DNA damaging agent mitomycin C. However, short sequence deletions within the β7-β8 β-hairpin were not tolerated and neither was replacement of the highly conserved residue glutamate 187 with alanine. Six strains carrying paired alanine substitutions within the β9-β10 β-hairpin loop were constructed, leading to the conclusion that no individual amino acid within that hairpin loop is absolutely required for MCM function, although one of the mutant strains displays greatly enhanced sensitivity to mitomycin C. Deletions of two or four amino acids from the β9-β10 β-hairpin were tolerated but mutants carrying larger deletions were inviable. Similarly, it was not possible to construct mutants in which any of the conserved zinc binding cysteines was replaced with alanine, underlining the likely importance of zinc binding for MCM function. The results of these studies demonstrate the feasibility of using Hfx. volcanii as a model system for reverse genetic analysis of archaeal MCM protein function and provide important confirmation of the in vivo importance of conserved structural features identified by previous bioinformatic, biochemical and structural studies.
NASA Astrophysics Data System (ADS)
Howorka, Stefan
2017-07-01
Membrane nanopores--hollow nanoscale barrels that puncture biological or synthetic membranes--have become powerful tools in chemical- and biosensing, and have achieved notable success in portable DNA sequencing. The pores can be self-assembled from a variety of materials, including proteins, peptides, synthetic organic compounds and, more recently, DNA. But which building material is best for which application, and what is the relationship between pore structure and function? In this Review, I critically compare the characteristics of the different building materials, and explore the influence of the building material on pore structure, dynamics and function. I also discuss the future challenges of developing nanopore technology, and consider what the next-generation of nanopore structures could be and where further practical applications might emerge.
Ponting, C P; Mott, R; Bork, P; Copley, R R
2001-12-01
Sequence database searching methods such as BLAST, are invaluable for predicting molecular function on the basis of sequence similarities among single regions of proteins. Searches of whole databases however, are not optimized to detect multiple homologous regions within a single polypeptide. Here we have used the prospero algorithm to perform self-comparisons of all predicted Drosophila melanogaster gene products. Predicted repeats, and their homologs from all species, were analyzed further to detect hitherto unappreciated evolutionary relationships. Results included the identification of novel tandem repeats in the human X-linked retinitis pigmentosa type-2 gene product, repeated segments in cystinosin, associated with a defect in cystine transport, and 'nested' homologous domains in dysferlin, whose gene is mutated in limb girdle muscular dystrophy. Novel signaling domain families were found that may regulate the microtubule-based cytoskeleton and ubiquitin-mediated proteolysis, respectively. Two families of glycosyl hydrolases were shown to contain internal repetitions that hint at their evolution via a piecemeal, modular approach. In addition, three examples of fruit fly genes were detected with tandem exons that appear to have arisen via internal duplication. These findings demonstrate how completely sequenced genomes can be exploited to further understand the relationships between molecular structure, function, and evolution.
2012-01-01
Background The NCBI Conserved Domain Database (CDD) consists of a collection of multiple sequence alignments of protein domains that are at various stages of being manually curated into evolutionary hierarchies based on conserved and divergent sequence and structural features. These domain models are annotated to provide insights into the relationships between sequence, structure and function via web-based BLAST searches. Results Here we automate the generation of conserved domain (CD) hierarchies using a combination of heuristic and Markov chain Monte Carlo (MCMC) sampling procedures and starting from a (typically very large) multiple sequence alignment. This procedure relies on statistical criteria to define each hierarchy based on the conserved and divergent sequence patterns associated with protein functional-specialization. At the same time this facilitates the sequence and structural annotation of residues that are functionally important. These statistical criteria also provide a means to objectively assess the quality of CD hierarchies, a non-trivial task considering that the protein subgroups are often very distantly related—a situation in which standard phylogenetic methods can be unreliable. Our aim here is to automatically generate (typically sub-optimal) hierarchies that, based on statistical criteria and visual comparisons, are comparable to manually curated hierarchies; this serves as the first step toward the ultimate goal of obtaining optimal hierarchical classifications. A plot of runtimes for the most time-intensive (non-parallelizable) part of the algorithm indicates a nearly linear time complexity so that, even for the extremely large Rossmann fold protein class, results were obtained in about a day. Conclusions This approach automates the rapid creation of protein domain hierarchies and thus will eliminate one of the most time consuming aspects of conserved domain database curation. At the same time, it also facilitates protein domain annotation by identifying those pattern residues that most distinguish each protein domain subgroup from other related subgroups. PMID:22726767
Bioinformatics analysis of disordered proteins in prokaryotes.
Pavlović-Lažetić, Gordana M; Mitić, Nenad S; Kovačević, Jovana J; Obradović, Zoran; Malkov, Saša N; Beljanski, Miloš V
2011-03-02
A significant number of proteins have been shown to be intrinsically disordered, meaning that they lack a fixed 3 D structure or contain regions that do not posses a well defined 3 D structure. It has also been proven that a protein's disorder content is related to its function. We have performed an exhaustive analysis and comparison of the disorder content of proteins from prokaryotic organisms (i.e., superkingdoms Archaea and Bacteria) with respect to functional categories they belong to, i.e., Clusters of Orthologous Groups of proteins (COGs) and groups of COGs-Cellular processes (Cp), Information storage and processing (Isp), Metabolism (Me) and Poorly characterized (Pc). We also analyzed the disorder content of proteins with respect to various genomic, metabolic and ecological characteristics of the organism they belong to. We used correlations and association rule mining in order to identify the most confident associations between specific modalities of the characteristics considered and disorder content. Bacteria are shown to have a somewhat higher level of protein disorder than archaea, except for proteins in the Me functional group. It is demonstrated that the Isp and Cp functional groups in particular (L-repair function and N-cell motility and secretion COGs of proteins in specific) possess the highest disorder content, while Me proteins, in general, posses the lowest. Disorder fractions have been confirmed to have the lowest level for the so-called order-promoting amino acids and the highest level for the so-called disorder promoters. For each pair of organism characteristics, specific modalities are identified with the maximum disorder proteins in the corresponding organisms, e.g., high genome size-high GC content organisms, facultative anaerobic-low GC content organisms, aerobic-high genome size organisms, etc. Maximum disorder in archaea is observed for high GC content-low genome size organisms, high GC content-facultative anaerobic or aquatic or mesophilic organisms, etc. Maximum disorder in bacteria is observed for high GC content-high genome size organisms, high genome size-aerobic organisms, etc. Some of the most reliable association rules mined establish relationships between high GC content and high protein disorder, medium GC content and both medium and low protein disorder, anaerobic organisms and medium protein disorder, Gammaproteobacteria and low protein disorder, etc. A web site Prokaryote Disorder Database has been designed and implemented at the address http://bioinfo.matf.bg.ac.rs/disorder, which contains complete results of the analysis of protein disorder performed for 296 prokaryotic completely sequenced genomes. Exhaustive disorder analysis has been performed by functional classes of proteins, for a larger dataset of prokaryotic organisms than previously done. Results obtained are well correlated to those previously published, with some extension in the range of disorder level and clear distinction between functional classes of proteins. Wide correlation and association analysis between protein disorder and genomic and ecological characteristics has been performed for the first time. The results obtained give insight into multi-relationships among the characteristics and protein disorder. Such analysis provides for better understanding of the evolutionary process and may be useful for taxon determination. The main drawback of the approach is the fact that the disorder considered has been predicted and not experimentally established.
Kobayashi, Ayaho; Kanaba, Teppei; Satoh, Ryosuke; Ito, Yutaka; Sugiura, Reiko; Mishima, Masaki
2017-10-01
Negative regulator differentiation 1 (Nrd1), a fission yeast RNA binding protein, modulates cytokinesis and sexual development and contributes to stress granule formation in response to environmental stresses. Nrd1 comprises four RRM domains and binds and stabilizes Cdc4 mRNA that encodes the myosin II light chain. Nrd1 binds the Cpc2 fission-yeast RACK1 homolog, and the interaction promotes Nrd1 localization to stress granules. Interestingly, Pmk1 mitogen-activated protein kinase phosphorylates Thr40 in the unstructured N-terminal region and Thr126 in the first RRM domain of Nrd1. Phosphorylation significantly reduces RNA-binding activity and likely modulates Nrd1 function. To reveal the relationship between the structure and function of Nrd1 and how phosphorylation affects structure, we used heteronuclear NMR techniques to investigate the three-dimensional structure of Nrd1. Here we report the 1 H, 13 C, and 15 N resonance assignments of RRM1-RRM2 (residues 108-284) comprising the first and second RRMs obtained using heteronuclear NMR techniques. Secondary structures derived from the chemical shifts are reported. These data should contribute to the understanding of the three-dimensional structure of the RRM1-RRM2 region of Nrd1 and the perturbation caused by phosphorylation.
Zhu, Shi-Yong; Li, Xue-Nan; Sun, Xiao-Chen; Lin, Jia; Li, Wei; Zhang, Cong; Li, Jin-Long
2017-02-22
Knowledge about mammalian selenoproteins is increasing. However, the selenoproteome of birds remains considerably less understood, especially concerning its biochemical characterization, structure-function relationships and the interactions of binding molecules. In this work, the SECIS elements, subcellular localization, protein domains and interactions of binding molecules of the selenoproteome in Gallus gallus were analyzed using bioinformatics tools. We carried out comprehensive analyses of the structure-function relationships and interactions of the binding molecules of selenoproteins, to provide biochemical characterization of the selenoproteome in Gallus gallus. Our data provided a wealth of information on the biochemical functions of bird selenoproteins. Members of the selenoproteome were found to be involved in various biological processes in chickens, such as in antioxidants, maintenance of the redox balance, Se transport, and interactions with metals. Six membrane-bound selenoproteins (SelI, SelK, SelS, SelT, DIO1 and DIO3) played important roles in maintaining the membrane integrity. Chicken selenoproteins were classified according to their ligand binding sites as zinc-containing matrix metalloselenoproteins (Sep15, MsrB1, SelW and SelM), POP-containing selenoproteins (GPx1-4), FAD-interacting selenoproteins (TrxR1-3), secretory transport selenoproteins (GPx3 and SelPa) and other selenoproteins. The results of our study provided new evidence for the unknown biological functions of the selenoproteome in birds. Future research is required to confirm the novel biochemical functions of bird selenoproteins.
Jacquin, Hugo; Gilson, Amy; Shakhnovich, Eugene; Cocco, Simona; Monasson, Rémi
2016-05-01
Inverse statistical approaches to determine protein structure and function from Multiple Sequence Alignments (MSA) are emerging as powerful tools in computational biology. However the underlying assumptions of the relationship between the inferred effective Potts Hamiltonian and real protein structure and energetics remain untested so far. Here we use lattice protein model (LP) to benchmark those inverse statistical approaches. We build MSA of highly stable sequences in target LP structures, and infer the effective pairwise Potts Hamiltonians from those MSA. We find that inferred Potts Hamiltonians reproduce many important aspects of 'true' LP structures and energetics. Careful analysis reveals that effective pairwise couplings in inferred Potts Hamiltonians depend not only on the energetics of the native structure but also on competing folds; in particular, the coupling values reflect both positive design (stabilization of native conformation) and negative design (destabilization of competing folds). In addition to providing detailed structural information, the inferred Potts models used as protein Hamiltonian for design of new sequences are able to generate with high probability completely new sequences with the desired folds, which is not possible using independent-site models. Those are remarkable results as the effective LP Hamiltonians used to generate MSA are not simple pairwise models due to the competition between the folds. Our findings elucidate the reasons for the success of inverse approaches to the modelling of proteins from sequence data, and their limitations.
Rutsdottir, Gudrun; Härmark, Johan; Weide, Yoran; Hebert, Hans; Rasmussen, Morten I; Wernersson, Sven; Respondek, Michal; Akke, Mikael; Højrup, Peter; Koeck, Philip J B; Söderberg, Christopher A G; Emanuelsson, Cecilia
2017-05-12
Small heat-shock proteins (sHsps) prevent aggregation of thermosensitive client proteins in a first line of defense against cellular stress. The mechanisms by which they perform this function have been hard to define due to limited structural information; currently, there is only one high-resolution structure of a plant sHsp published, that of the cytosolic Hsp16.9. We took interest in Hsp21, a chloroplast-localized sHsp crucial for plant stress resistance, which has even longer N-terminal arms than Hsp16.9, with a functionally important and conserved methionine-rich motif. To provide a framework for investigating structure-function relationships of Hsp21 and understanding these sequence variations, we developed a structural model of Hsp21 based on homology modeling, cryo-EM, cross-linking mass spectrometry, NMR, and small-angle X-ray scattering. Our data suggest a dodecameric arrangement of two trimer-of-dimer discs stabilized by the C-terminal tails, possibly through tail-to-tail interactions between the discs, mediated through extended I X V X I motifs. Our model further suggests that six N-terminal arms are located on the outside of the dodecamer, accessible for interaction with client proteins, and distinct from previous undefined or inwardly facing arms. To test the importance of the I X V X I motif, we created the point mutant V181A, which, as expected, disrupts the Hsp21 dodecamer and decreases chaperone activity. Finally, our data emphasize that sHsp chaperone efficiency depends on oligomerization and that client interactions can occur both with and without oligomer dissociation. These results provide a generalizable workflow to explore sHsps, expand our understanding of sHsp structural motifs, and provide a testable Hsp21 structure model to inform future investigations. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Proteoglycomics: Recent Progress and Future Challenges
Ly, Mellisa; Laremore, Tatiana N.
2010-01-01
Abstract Proteoglycomics is a systematic study of structure, expression, and function of proteoglycans, a posttranslationally modified subset of a proteome. Although relying on the established technologies of proteomics and glycomics, proteoglycomics research requires unique approaches for elucidating structure–function relationships of both proteoglycan components, glycosaminoglycan chain, and core protein. This review discusses our current understanding of structure and function of proteoglycans, major players in the development, normal physiology, and disease. A brief outline of the proteoglycomic sample preparation and analysis is provided along with examples of several recent proteoglycomic studies. Unique challenges in the characterization of glycosaminoglycan component of proteoglycans are discussed, with emphasis on the many analytical tools used and the types of information they provide. PMID:20450439
Structure and function of nucleotide sugar transporters: Current progress.
Hadley, Barbara; Maggioni, Andrea; Ashikov, Angel; Day, Christopher J; Haselhorst, Thomas; Tiralongo, Joe
2014-06-01
The proteomes of eukaryotes, bacteria and archaea are highly diverse due, in part, to the complex post-translational modification of protein glycosylation. The diversity of glycosylation in eukaryotes is reliant on nucleotide sugar transporters to translocate specific nucleotide sugars that are synthesised in the cytosol and nucleus, into the endoplasmic reticulum and Golgi apparatus where glycosylation reactions occur. Thirty years of research utilising multidisciplinary approaches has contributed to our current understanding of NST function and structure. In this review, the structure and function, with reference to various disease states, of several NSTs including the UDP-galactose, UDP-N-acetylglucosamine, UDP-N-acetylgalactosamine, GDP-fucose, UDP-N-acetylglucosamine/UDP-glucose/GDP-mannose and CMP-sialic acid transporters will be described. Little is known regarding the exact structure of NSTs due to difficulties associated with crystallising membrane proteins. To date, no three-dimensional structure of any NST has been elucidated. What is known is based on computer predictions, mutagenesis experiments, epitope-tagging studies, in-vitro assays and phylogenetic analysis. In this regard the best-characterised NST to date is the CMP-sialic acid transporter (CST). Therefore in this review we will provide the current state-of-play with respect to the structure-function relationship of the (CST). In particular we have summarised work performed by a number groups detailing the affect of various mutations on CST transport activity, efficiency, and substrate specificity.
In Silico Analysis for the Study of Botulinum Toxin Structure
NASA Astrophysics Data System (ADS)
Suzuki, Tomonori; Miyazaki, Satoru
2010-01-01
Protein-protein interactions play many important roles in biological function. Knowledge of protein-protein complex structure is required for understanding the function. The determination of protein-protein complex structure by experimental studies remains difficult, therefore computational prediction of protein structures by structure modeling and docking studies is valuable method. In addition, MD simulation is also one of the most popular methods for protein structure modeling and characteristics. Here, we attempt to predict protein-protein complex structure and property using some of bioinformatic methods, and we focus botulinum toxin complex as target structure.
Conformational and functional studies of a cytosolic 90 kDa heat shock protein Hsp90 from sugarcane.
da Silva, Viviane C H; Cagliari, Thiago C; Lima, Tatiani B; Gozzo, Fábio C; Ramos, Carlos H I
2013-07-01
Hsp90s are involved in several cellular processes, such as signaling, proteostasis, epigenetics, differentiation and stress defense. Although Hsp90s from different organisms are highly similar, they usually have small variations in conformation and function. Thus, the characterization of different Hsp90s is important to gain insight into the structure-function relationship that makes these chaperones key regulators in protein homeostasis. This work describes the characterization of a cytosolic Hsp90 from sugarcane and its comparison with Hsp90s from other plants. Previous expressed sequence tag (EST) studies in Saccharum spp. (sugarcane) predicted the presence of an mRNA coding for a cytosolic Hsp90. The corresponding cDNA was cloned, and the recombinant protein was purified and its conformation and function characterized. The structural conformation of Hsp90 was assessed by chemical cross-linking and hydrogen/deuterium exchange using mass spectrometry and hydrodynamic assays, which revealed regions accessible to solvent and that Hsp90 is an elongated dimer in solution. The in vivo expression of Hsp90 in sugarcane leaves was confirmed by western blot, and in vitro functional characterization indicated that sugarcane Hsp90 has strong chaperone activity. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
Liu, Jie; Su, Minyi; Liu, Zhihai; Li, Jie; Li, Yan; Wang, Renxiao
2017-07-18
In structure-based drug design, binding affinity prediction remains as a challenging goal for current scoring functions. Development of target-biased scoring functions provides a new possibility for tackling this problem, but this approach is also associated with certain technical difficulties. We previously reported the Knowledge-Guided Scoring (KGS) method as an alternative approach (BMC Bioinformatics, 2010, 11, 193-208). The key idea is to compute the binding affinity of a given protein-ligand complex based on the known binding data of an appropriate reference complex, so the error in binding affinity prediction can be reduced effectively. In this study, we have developed an upgraded version, i.e. KGS2, by employing 3D protein-ligand interaction fingerprints in reference selection. KGS2 was evaluated in combination with four scoring functions (X-Score, ChemPLP, ASP, and GoldScore) on five drug targets (HIV-1 protease, carbonic anhydrase 2, beta-secretase 1, beta-trypsin, and checkpoint kinase 1). In the in situ scoring test, considerable improvements were observed in most cases after application of KGS2. Besides, the performance of KGS2 was always better than KGS in all cases. In the more challenging molecular docking test, application of KGS2 also led to improved structure-activity relationship in some cases. KGS2 can be applied as a convenient "add-on" to current scoring functions without the need to re-engineer them, and its application is not limited to certain target proteins as customized scoring functions. As an interpolation method, its accuracy in principle can be improved further with the increasing knowledge of protein-ligand complex structures and binding affinity data. We expect that KGS2 will become a practical tool for enhancing the performance of current scoring functions in binding affinity prediction. The KGS2 software is available upon contacting the authors.
Mori, Mirko; Kateb, Fatiha; Bodenhausen, Geoffrey; Piccioli, Mario; Abergel, Daniel
2010-03-17
Multiple quantum relaxation in proteins reveals unexpected relationships between correlated or anti-correlated conformational backbone dynamics in alpha-helices or beta-sheets. The contributions of conformational exchange to the relaxation rates of C'N coherences (i.e., double- and zero-quantum coherences involving backbone carbonyl (13)C' and neighboring amide (15)N nuclei) depend on the kinetics of slow exchange processes, as well as on the populations of the conformations and chemical shift differences of (13)C' and (15)N nuclei. The relaxation rates of C'N coherences, which reflect concerted fluctuations due to slow chemical shift modulations (CSMs), were determined by direct (13)C detection in diamagnetic and paramagnetic proteins. In well-folded proteins such as lanthanide-substituted calbindin (CaLnCb), copper,zinc superoxide dismutase (Cu,Zn SOD), and matrix metalloproteinase (MMP12), slow conformational exchange occurs along the entire backbone. Our observations demonstrate that relaxation rates of C'N coherences arising from slow backbone dynamics have positive signs (characteristic of correlated fluctuations) in beta-sheets and negative signs (characteristic of anti-correlated fluctuations) in alpha-helices. This extends the prospects of structure-dynamics relationships to slow time scales that are relevant for protein function and enzymatic activity.
Rincon, Sergio A; Paoletti, Anne
2016-01-01
Unveiling the function of a novel protein is a challenging task that requires careful experimental design. Yeast cytokinesis is a conserved process that involves modular structural and regulatory proteins. For such proteins, an important step is to identify their domains and structural organization. Here we briefly discuss a collection of methods commonly used for sequence alignment and prediction of protein structure that represent powerful tools for the identification homologous domains and design of structure-function approaches to test experimentally the function of multi-domain proteins such as those implicated in yeast cytokinesis.
Structural elements and organization of the ancestral translational machinery
NASA Technical Reports Server (NTRS)
Rein, R.; Srinivasan, S.; Mcdonald, J.; Raghunathan, G.; Shibata, M.
1987-01-01
The molecular mechanisms of the primitive translational apparatus are discussed in the framework of present-day protein biosynthesis. The structural necessities of an early adaptor and the multipoint recognition properties of such an adaptor are investigated on the basis of structure/function relationships found in a contemporary system and a molecular model of the contemporary transpeptidation complex. A model of the tRNA(Tyr)-tyrosyl tRNA synthetase complex including the positioning of the disordered region is proposed; the model is used to illustrate the required recognition properties of the ancestor aminoacyl synthetase.
Blacklock, Kristin; Verkhivker, Gennady M.
2014-01-01
A fundamental role of the Hsp90 chaperone in regulating functional activity of diverse protein clients is essential for the integrity of signaling networks. In this work we have combined biophysical simulations of the Hsp90 crystal structures with the protein structure network analysis to characterize the statistical ensemble of allosteric interaction networks and communication pathways in the Hsp90 chaperones. We have found that principal structurally stable communities could be preserved during dynamic changes in the conformational ensemble. The dominant contribution of the inter-domain rigidity to the interaction networks has emerged as a common factor responsible for the thermodynamic stability of the active chaperone form during the ATPase cycle. Structural stability analysis using force constant profiling of the inter-residue fluctuation distances has identified a network of conserved structurally rigid residues that could serve as global mediating sites of allosteric communication. Mapping of the conformational landscape with the network centrality parameters has demonstrated that stable communities and mediating residues may act concertedly with the shifts in the conformational equilibrium and could describe the majority of functionally significant chaperone residues. The network analysis has revealed a relationship between structural stability, global centrality and functional significance of hotspot residues involved in chaperone regulation. We have found that allosteric interactions in the Hsp90 chaperone may be mediated by modules of structurally stable residues that display high betweenness in the global interaction network. The results of this study have suggested that allosteric interactions in the Hsp90 chaperone may operate via a mechanism that combines rapid and efficient communication by a single optimal pathway of structurally rigid residues and more robust signal transmission using an ensemble of suboptimal multiple communication routes. This may be a universal requirement encoded in protein structures to balance the inherent tension between resilience and efficiency of the residue interaction networks. PMID:24922508
Blacklock, Kristin; Verkhivker, Gennady M
2014-06-01
A fundamental role of the Hsp90 chaperone in regulating functional activity of diverse protein clients is essential for the integrity of signaling networks. In this work we have combined biophysical simulations of the Hsp90 crystal structures with the protein structure network analysis to characterize the statistical ensemble of allosteric interaction networks and communication pathways in the Hsp90 chaperones. We have found that principal structurally stable communities could be preserved during dynamic changes in the conformational ensemble. The dominant contribution of the inter-domain rigidity to the interaction networks has emerged as a common factor responsible for the thermodynamic stability of the active chaperone form during the ATPase cycle. Structural stability analysis using force constant profiling of the inter-residue fluctuation distances has identified a network of conserved structurally rigid residues that could serve as global mediating sites of allosteric communication. Mapping of the conformational landscape with the network centrality parameters has demonstrated that stable communities and mediating residues may act concertedly with the shifts in the conformational equilibrium and could describe the majority of functionally significant chaperone residues. The network analysis has revealed a relationship between structural stability, global centrality and functional significance of hotspot residues involved in chaperone regulation. We have found that allosteric interactions in the Hsp90 chaperone may be mediated by modules of structurally stable residues that display high betweenness in the global interaction network. The results of this study have suggested that allosteric interactions in the Hsp90 chaperone may operate via a mechanism that combines rapid and efficient communication by a single optimal pathway of structurally rigid residues and more robust signal transmission using an ensemble of suboptimal multiple communication routes. This may be a universal requirement encoded in protein structures to balance the inherent tension between resilience and efficiency of the residue interaction networks.
Protein Structure and Function Prediction Using I-TASSER
Yang, Jianyi; Zhang, Yang
2016-01-01
I-TASSER is a hierarchical protocol for automated protein structure prediction and structure-based function annotation. Starting from the amino acid sequence of target proteins, I-TASSER first generates full-length atomic structural models from multiple threading alignments and iterative structural assembly simulations followed by atomic-level structure refinement. The biological functions of the protein, including ligand-binding sites, enzyme commission number, and gene ontology terms, are then inferred from known protein function databases based on sequence and structure profile comparisons. I-TASSER is freely available as both an on-line server and a stand-alone package. This unit describes how to use the I-TASSER protocol to generate structure and function prediction and how to interpret the prediction results, as well as alternative approaches for further improving the I-TASSER modeling quality for distant-homologous and multi-domain protein targets. PMID:26678386
Structure and function of seed storage proteins in faba bean (Vicia faba L.).
Liu, Yujiao; Wu, Xuexia; Hou, Wanwei; Li, Ping; Sha, Weichao; Tian, Yingying
2017-05-01
The protein subunit is the most important basic unit of protein, and its study can unravel the structure and function of seed storage proteins in faba bean. In this study, we identified six specific protein subunits in Faba bean (cv. Qinghai 13) combining liquid chromatography (LC), liquid chromatography-electronic spray ionization mass (LC-ESI-MS/MS) and bio-information technology. The results suggested a diversity of seed storage proteins in faba bean, and a total of 16 proteins (four GroEL molecular chaperones and 12 plant-specific proteins) were identified from 97-, 96-, 64-, 47-, 42-, and 38-kD-specific protein subunits in faba bean based on the peptide sequence. We also analyzed the composition and abundance of the amino acids, the physicochemical characteristics, secondary structure, three-dimensional structure, transmembrane domain, and possible subcellular localization of these identified proteins in faba bean seed, and finally predicted function and structure. The three-dimensional structures were generated based on homologous modeling, and the protein function was analyzed based on the annotation from the non-redundant protein database (NR database, NCBI) and function analysis of optimal modeling. The objective of this study was to identify the seed storage proteins in faba bean and confirm the structure and function of these proteins. Our results can be useful for the study of protein nutrition and achieve breeding goals for optimal protein quality in faba bean.
Yang, Aimin; Pantoom, Supansa; Wu, Yao-Wen
2017-01-01
Autophagy is a conserved cellular process involved in the elimination of proteins and organelles. It is also used to combat infection with pathogenic microbes. The intracellular pathogen Legionella pneumophila manipulates autophagy by delivering the effector protein RavZ to deconjugate Atg8/LC3 proteins coupled to phosphatidylethanolamine (PE) on autophagosomal membranes. To understand how RavZ recognizes and deconjugates LC3-PE, we prepared semisynthetic LC3 proteins and elucidated the structures of the RavZ:LC3 interaction. Semisynthetic LC3 proteins allowed the analysis of structure-function relationships. RavZ extracts LC3-PE from the membrane before deconjugation. RavZ initially recognizes the LC3 molecule on membranes via its N-terminal LC3-interacting region (LIR) motif. The RavZ α3 helix is involved in extraction of the PE moiety and docking of the acyl chains into the lipid-binding site of RavZ that is related in structure to that of the phospholipid transfer protein Sec14. Thus, Legionella has evolved a novel mechanism to specifically evade host autophagy. DOI: http://dx.doi.org/10.7554/eLife.23905.001 PMID:28395732
THE ROLES OF METAL IONS IN REGULATION BY RIBOSWITCHES
2012-01-01
Metal ions are required by all organisms in order to execute an array of essential molecular functions. They play a critical role in many catalytic mechanisms and structural properties. Proper homeostasis of ions is critical; levels that are aberrantly low or high are deleterious to cellular physiology. To maintain stable intracellular pools, metal ion-sensing regulatory (metalloregulatory) proteins couple metal ion concentration fluctuations with expression of genes encoding for cation transport or sequestration. However, these transcriptional-based regulatory strategies are not the only mechanisms by which organisms coordinate metal ions with gene expression. Intriguingly, a few classes of signal-responsive RNA elements have also been discovered to function as metalloregulatory agents. This suggests that RNA-based regulatory strategies can be precisely tuned to intracellular metal ion pools, functionally akin to metalloregulatory proteins. In addition to these metal-sensing regulatory RNAs, there is a yet broader role for metal ions in directly assisting the structural integrity of other signal-responsive regulatory RNA elements. In this chapter, we discuss how the intimate physicochemical relationship between metal ions and nucleic acids is important for the structure and function of metal ion- and metabolite-sensing regulatory RNAs. PMID:22010271
Segmented molecular design of self-healing proteinaceous materials
NASA Astrophysics Data System (ADS)
Sariola, Veikko; Pena-Francesch, Abdon; Jung, Huihun; Çetinkaya, Murat; Pacheco, Carlos; Sitti, Metin; Demirel, Melik C.
2015-09-01
Hierarchical assembly of self-healing adhesive proteins creates strong and robust structural and interfacial materials, but understanding of the molecular design and structure-property relationships of structural proteins remains unclear. Elucidating this relationship would allow rational design of next generation genetically engineered self-healing structural proteins. Here we report a general self-healing and -assembly strategy based on a multiphase recombinant protein based material. Segmented structure of the protein shows soft glycine- and tyrosine-rich segments with self-healing capability and hard beta-sheet segments. The soft segments are strongly plasticized by water, lowering the self-healing temperature close to body temperature. The hard segments self-assemble into nanoconfined domains to reinforce the material. The healing strength scales sublinearly with contact time, which associates with diffusion and wetting of autohesion. The finding suggests that recombinant structural proteins from heterologous expression have potential as strong and repairable engineering materials.
Segmented molecular design of self-healing proteinaceous materials.
Sariola, Veikko; Pena-Francesch, Abdon; Jung, Huihun; Çetinkaya, Murat; Pacheco, Carlos; Sitti, Metin; Demirel, Melik C
2015-09-01
Hierarchical assembly of self-healing adhesive proteins creates strong and robust structural and interfacial materials, but understanding of the molecular design and structure-property relationships of structural proteins remains unclear. Elucidating this relationship would allow rational design of next generation genetically engineered self-healing structural proteins. Here we report a general self-healing and -assembly strategy based on a multiphase recombinant protein based material. Segmented structure of the protein shows soft glycine- and tyrosine-rich segments with self-healing capability and hard beta-sheet segments. The soft segments are strongly plasticized by water, lowering the self-healing temperature close to body temperature. The hard segments self-assemble into nanoconfined domains to reinforce the material. The healing strength scales sublinearly with contact time, which associates with diffusion and wetting of autohesion. The finding suggests that recombinant structural proteins from heterologous expression have potential as strong and repairable engineering materials.
A New Method for Determining Structure Ensemble: Application to a RNA Binding Di-Domain Protein.
Liu, Wei; Zhang, Jingfeng; Fan, Jing-Song; Tria, Giancarlo; Grüber, Gerhard; Yang, Daiwen
2016-05-10
Structure ensemble determination is the basis of understanding the structure-function relationship of a multidomain protein with weak domain-domain interactions. Paramagnetic relaxation enhancement has been proven a powerful tool in the study of structure ensembles, but there exist a number of challenges such as spin-label flexibility, domain dynamics, and overfitting. Here we propose a new (to our knowledge) method to describe structure ensembles using a minimal number of conformers. In this method, individual domains are considered rigid; the position of each spin-label conformer and the structure of each protein conformer are defined by three and six orthogonal parameters, respectively. First, the spin-label ensemble is determined by optimizing the positions and populations of spin-label conformers against intradomain paramagnetic relaxation enhancements with a genetic algorithm. Subsequently, the protein structure ensemble is optimized using a more efficient genetic algorithm-based approach and an overfitting indicator, both of which were established in this work. The method was validated using a reference ensemble with a set of conformers whose populations and structures are known. This method was also applied to study the structure ensemble of the tandem di-domain of a poly (U) binding protein. The determined ensemble was supported by small-angle x-ray scattering and nuclear magnetic resonance relaxation data. The ensemble obtained suggests an induced fit mechanism for recognition of target RNA by the protein. Copyright © 2016 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Mohammadkhah, Melika; Simms, Ciaran K; Murphy, Paula
2017-02-01
Detection and visualisation of Collagen structure are important to understand the relationship between mechanical behaviour and microstructure in skeletal muscle since Collagen is the main structural protein in animal connective tissues, and is primarily responsible for their passive load-bearing properties. In the current study, the direct detection and visualization of Collagen using fluorescently tagged CNA35 binding protein (fused to EGFP or tdTomato) is reported for the first time on fixed skeletal muscle tissue. This Technical Note also establishes a working protocol by examining tissue preparation, dilution factor, exposure time etc. for sensitivity and specificity. Penetration of the binding protein into intact mature skeletal muscle was found to be very limited, but detection works well on tissue sections with higher sensitivity on wax embedded sections compared to frozen sections. CNA35 fused to tdTomato has a higher sensitivity than CNA35 fused to EGFP but both show specific detection. Best results were obtained with 15μm wax embedded sections, with blocking of non-specific binding in 1% BSA and antigen retrieval in Sodium Citrate. There was a play-off between dilution of the binding protein and time of incubation but both CNA35-tdTomato and CNA35-EGFP worked well with approximately 100μg/ml of purified protein with overnight incubation, while CNA35-tdTomato could be utilized at 5 fold less concentration. This approach can be applied to study the relationship between skeletal muscle micro-structure and to observe mechanical response to applied deformation. It can be used more broadly to detect Collagen in a variety of fixed tissues, useful for structure-functions studies, constitutive modelling, tissue engineering and assessment of muscle tissue pathologies. Copyright © 2016 Elsevier Ltd. All rights reserved.
Tuncbag, Nurcan; Gursoy, Attila; Nussinov, Ruth; Keskin, Ozlem
2011-08-11
Prediction of protein-protein interactions at the structural level on the proteome scale is important because it allows prediction of protein function, helps drug discovery and takes steps toward genome-wide structural systems biology. We provide a protocol (termed PRISM, protein interactions by structural matching) for large-scale prediction of protein-protein interactions and assembly of protein complex structures. The method consists of two components: rigid-body structural comparisons of target proteins to known template protein-protein interfaces and flexible refinement using a docking energy function. The PRISM rationale follows our observation that globally different protein structures can interact via similar architectural motifs. PRISM predicts binding residues by using structural similarity and evolutionary conservation of putative binding residue 'hot spots'. Ultimately, PRISM could help to construct cellular pathways and functional, proteome-scale annotation. PRISM is implemented in Python and runs in a UNIX environment. The program accepts Protein Data Bank-formatted protein structures and is available at http://prism.ccbb.ku.edu.tr/prism_protocol/.
Ma, Pikyee; Patching, Simon G.; Ivanova, Ekaterina; Baldwin, Jocelyn M.; Sharples, David; Baldwin, Stephen A.
2016-01-01
This work reports the evolutionary relationships, amplified expression, functional characterization and purification of the putative allantoin transport protein, PucI, from Bacillus subtilis. Sequence alignments and phylogenetic analysis confirmed close evolutionary relationships between PucI and membrane proteins of the nucleobase-cation-symport-1 family of secondary active transporters. These include the sodium-coupled hydantoin transport protein, Mhp1, from Microbacterium liquefaciens, and related proteins from bacteria, fungi and plants. Membrane topology predictions for PucI were consistent with 12 putative transmembrane-spanning α-helices with both N- and C-terminal ends at the cytoplasmic side of the membrane. The pucI gene was cloned into the IPTG-inducible plasmid pTTQ18 upstream from an in-frame hexahistidine tag and conditions determined for optimal amplified expression of the PucI(His6) protein in Escherichia coli to a level of about 5 % in inner membranes. Initial rates of inducible PucI-mediated uptake of 14C-allantoin into energized E. coli whole cells conformed to Michaelis–Menten kinetics with an apparent affinity (K mapp) of 24 ± 3 μM, therefore confirming that PucI is a medium-affinity transporter of allantoin. Dependence of allantoin transport on sodium was not apparent. Competitive uptake experiments showed that PucI recognizes some additional hydantoin compounds, including hydantoin itself, and to a lesser extent a range of nucleobases and nucleosides. PucI(His6) was solubilized from inner membranes using n-dodecyl-β-d-maltoside and purified. The isolated protein contained a substantial proportion of α-helix secondary structure, consistent with the predictions, and a 3D model was therefore constructed on a template of the Mhp1 structure, which aided localization of the potential ligand binding site in PucI. PMID:26967546
Characterizing protein domain associations by Small-molecule ligand binding
Li, Qingliang; Cheng, Tiejun; Wang, Yanli; Bryant, Stephen H.
2012-01-01
Background Protein domains are evolutionarily conserved building blocks for protein structure and function, which are conventionally identified based on protein sequence or structure similarity. Small molecule binding domains are of great importance for the recognition of small molecules in biological systems and drug development. Many small molecules, including drugs, have been increasingly identified to bind to multiple targets, leading to promiscuous interactions with protein domains. Thus, a large scale characterization of the protein domains and their associations with respect to small-molecule binding is of particular interest to system biology research, drug target identification, as well as drug repurposing. Methods We compiled a collection of 13,822 physical interactions of small molecules and protein domains derived from the Protein Data Bank (PDB) structures. Based on the chemical similarity of these small molecules, we characterized pairwise associations of the protein domains and further investigated their global associations from a network point of view. Results We found that protein domains, despite lack of similarity in sequence and structure, were comprehensively associated through binding the same or similar small-molecule ligands. Moreover, we identified modules in the domain network that consisted of closely related protein domains by sharing similar biochemical mechanisms, being involved in relevant biological pathways, or being regulated by the same cognate cofactors. Conclusions A novel protein domain relationship was identified in the context of small-molecule binding, which is complementary to those identified by traditional sequence-based or structure-based approaches. The protein domain network constructed in the present study provides a novel perspective for chemogenomic study and network pharmacology, as well as target identification for drug repurposing. PMID:23745168
Cook, W B; Walker, J C
1992-01-01
A cDNA encoding a nuclear-encoded chloroplast nucleic acid-binding protein (NBP) has been isolated from maize. Identified as an in vitro DNA-binding activity, NBP belongs to a family of nuclear-encoded chloroplast proteins which share a common domain structure and are thought to be involved in posttranscriptional regulation of chloroplast gene expression. NBP contains an N-terminal chloroplast transit peptide, a highly acidic domain and a pair of ribonucleoprotein consensus sequence domains. NBP is expressed in a light-dependent, organ-specific manner which is consistent with its involvement in chloroplast biogenesis. The relationship of NBP to the other members of this protein family and their possible regulatory functions are discussed. Images PMID:1346929
Sweeney, Shawn M.; Orgel, Joseph P.; Fertala, Andrzej; McAuliffe, Jon D.; Turner, Kevin R.; Di Lullo, Gloria A.; Chen, Steven; Antipova, Olga; Perumal, Shiamalee; Ala-Kokko, Leena; Forlino, Antonella; Cabral, Wayne A.; Barnes, Aileen M.; Marini, Joan C.; Antonio, James D. San
2008-01-01
Type I collagen, the predominant protein of vertebrates, polymerizes with type III and V collagens and non-collagenous molecules into large cable-like fibrils, yet how the fibril interacts with cells and other binding partners remains poorly understood. To help reveal insights into the collagen structure-function relationship, a data base was assembled including hundreds of type I collagen ligand binding sites and mutations on a two-dimensional model of the fibril. Visual examination of the distribution of functional sites, and statistical analysis of mutation distributions on the fibril suggest it is organized into two domains. The “cell interaction domain” is proposed to regulate dynamic aspects of collagen biology, including integrin-mediated cell interactions and fibril remodeling. The “matrix interaction domain” may assume a structural role, mediating collagen cross-linking, proteoglycan interactions, and tissue mineralization. Molecular modeling was used to superimpose the positions of functional sites and mutations from the two-dimensional fibril map onto a three-dimensional x-ray diffraction structure of the collagen microfibril in situ, indicating the existence of domains in the native fibril. Sequence searches revealed that major fibril domain elements are conserved in type I collagens through evolution and in the type II/XI collagen fibril predominant in cartilage. Moreover, the fibril domain model provides potential insights into the genotype-phenotype relationship for several classes of human connective tissue diseases, mechanisms of integrin clustering by fibrils, the polarity of fibril assembly, heterotypic fibril function, and connective tissue pathology in diabetes and aging. PMID:18487200
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sweeney, Shawn M.; Orgel, Joseph P.; Fertala, Andrzej
Type I collagen, the predominant protein of vertebrates, polymerizes with type III and V collagens and non-collagenous molecules into large cable-like fibrils, yet how the fibril interacts with cells and other binding partners remains poorly understood. To help reveal insights into the collagen structure-function relationship, a data base was assembled including hundreds of type I collagen ligand binding sites and mutations on a two-dimensional model of the fibril. Visual examination of the distribution of functional sites, and statistical analysis of mutation distributions on the fibril suggest it is organized into two domains. The 'cell interaction domain' is proposed to regulatemore » dynamic aspects of collagen biology, including integrin-mediated cell interactions and fibril remodeling. The 'matrix interaction domain' may assume a structural role, mediating collagen cross-linking, proteoglycan interactions, and tissue mineralization. Molecular modeling was used to superimpose the positions of functional sites and mutations from the two-dimensional fibril map onto a three-dimensional x-ray diffraction structure of the collagen microfibril in situ, indicating the existence of domains in the native fibril. Sequence searches revealed that major fibril domain elements are conserved in type I collagens through evolution and in the type II/XI collagen fibril predominant in cartilage. Moreover, the fibril domain model provides potential insights into the genotype-phenotype relationship for several classes of human connective tissue diseases, mechanisms of integrin clustering by fibrils, the polarity of fibril assembly, heterotypic fibril function, and connective tissue pathology in diabetes and aging.« less
NASA Astrophysics Data System (ADS)
Houde, Damian J.; Bou-Assaf, George M.; Berkowitz, Steven A.
2017-05-01
Introduction of a chemical change to one or more amino acids in a protein's polypeptide chain can result in various effects on its higher-order structure (HOS) and biophysical behavior (or properties). These effects range from no detectable change to significant structural or conformational alteration that can greatly affect the protein's biophysical properties and its resulting biological function. The ability to reliably detect the absence or presence of such changes is essential to understanding the structure-function relationship in a protein and in the successful commercial development of protein-based drugs (biopharmaceuticals). In this paper, we focus our attention on the latter by specifically elucidating the impact of oxidation on the HOS, structural dynamics, and biophysical properties of interferon beta-1a (IFNβ-1a). Oxidation is a common biochemical modification that occurs in many biopharmaceuticals, specifically in two naturally-occurring sulfur-containing amino acids, methionine and cysteine. To carry out this work, we used combinations of hydrogen peroxide and pH to differentially oxidize IFNβ-1a (to focus on only methionine oxidation versus methionine and cysteine oxidation). We then employed several analytical and biophysical techniques to acquire information about the differential impact of these two oxidation scenarios on IFNβ-1a. In particular, the use of MS-based techniques, especially HDX-MS, play a dominant role in revealing the differential effects.
Livingston, B T; Killian, C E; Wilt, F; Cameron, A; Landrum, M J; Ermolaeva, O; Sapojnikov, V; Maglott, D R; Buchanan, A M; Ettensohn, C A
2006-12-01
Biomineralization, the biologically controlled formation of mineral deposits, is of widespread importance in biology, medicine, and engineering. Mineralized structures are found in most metazoan phyla and often have supportive, protective, or feeding functions. Among deuterostomes, only echinoderms and vertebrates produce extensive biomineralized structures. Although skeletons appeared independently in these two groups, ancestors of the vertebrates and echinoderms may have utilized similar components of a shared genetic "toolkit" to carry out biomineralization. The present study had two goals. First, we sought to expand our understanding of the proteins involved in biomineralization in the sea urchin, a powerful model system for analyzing the basic cellular and molecular mechanisms that underlie this process. Second, we sought to shed light on the possible evolutionary relationships between biomineralization in echinoderms and vertebrates. We used several computational methods to survey the genome of the purple sea urchin Strongylocentrotus purpuratus for gene products involved in biomineralization. Our analysis has greatly expanded the collection of biomineralization-related proteins. We have found that these proteins are often members of small families encoded by genes that are clustered in the genome. Most of the proteins are sea urchin-specific; that is, they have no apparent homologues in other invertebrate deuterostomes or vertebrates. Similarly, many of the vertebrate proteins that mediate mineral deposition do not have counterparts in the S. purpuratus genome. Our findings therefore reveal substantial differences in the primary sequences of proteins that mediate biomineral formation in echinoderms and vertebrates, possibly reflecting loose constraints on the primary structures of the proteins involved. On the other hand, certain cellular and molecular processes associated with earlier events in skeletogenesis appear similar in echinoderms and vertebrates, leaving open the possibility of deeper evolutionary relationships.
Proteomic characterization of the nucleolar linker histone H1 interaction network
Szerlong, Heather J.; Herman, Jacob A.; Krause, Christine M.; DeLuca, Jennifer G.; Skoultchi, Arthur; Winger, Quinton A.; Prenni, Jessica E.; Hansen, Jeffrey C.
2015-01-01
To investigate the relationship between linker histone H1 and protein-protein interactions in the nucleolus, biochemical and proteomics approaches were used to characterize nucleoli purified from cultured human and mouse cells. Mass spectrometry identified 175 proteins in human T-cell nucleolar extracts that bound to sepharose-immobilized H1 in vitro. Gene ontology analysis found significant enrichment for H1 binding proteins with functions related to nucleolar chromatin structure and RNA polymerase I transcription regulation, rRNA processing, and mRNA splicing. Consistent with the affinity binding results, H1 existed in large (400 to >650 kDa) macromolecular complexes in human T cell nucleolar extracts. To complement the biochemical experiments, the effects of in vivo H1 depletion on protein content and structural integrity of the nucleolus were investigated using the H1 triple isoform knock out (H1ΔTKO) mouse embryonic stem cell (mESC) model system. Proteomic profiling of purified wild type mESC nucleoli identified a total of 613 proteins, only ~60% of which were detected in the H1 mutant nucleoli. Within the affected group, spectral counting analysis quantitated 135 specific nucleolar proteins whose levels were significantly altered in H1ΔTKO mESC. Importantly, the functions of the affected proteins in mESC closely overlapped with those of the human T cell nucleolar H1 binding proteins. Immunofluorescence microscopy of intact H1ΔTKO mESC demonstrated both a loss of nucleolar RNA content and altered nucleolar morphology resulting from in vivo H1 depletion. We conclude that H1 organizes and maintains an extensive protein-protein interaction network in the nucleolus required for nucleolar structure and integrity. PMID:25584861
Aguado-Llera, David; Martínez-Gómez, Ana Isabel; Prieto, Jesús; Marenchino, Marco; Traverso, José Angel; Gómez, Javier; Chueca, Ana; Neira, José L.
2011-01-01
Thioredoxins (TRXs) are ubiquitous proteins involved in redox processes. About forty genes encode TRX or TRX-related proteins in plants, grouped in different families according to their subcellular localization. For instance, the h-type TRXs are located in cytoplasm or mitochondria, whereas f-type TRXs have a plastidial origin, although both types of proteins have an eukaryotic origin as opposed to other TRXs. Herein, we study the conformational and the biophysical features of TRXh1, TRXh2 and TRXf from Pisum sativum. The modelled structures of the three proteins show the well-known TRX fold. While sharing similar pH-denaturations features, the chemical and thermal stabilities are different, being PsTRXh1 (Pisum sativum thioredoxin h1) the most stable isoform; moreover, the three proteins follow a three-state denaturation model, during the chemical-denaturations. These differences in the thermal- and chemical-denaturations result from changes, in a broad sense, of the several ASAs (accessible surface areas) of the proteins. Thus, although a strong relationship can be found between the primary amino acid sequence and the structure among TRXs, that between the residue sequence and the conformational stability and biophysical properties is not. We discuss how these differences in the biophysical properties of TRXs determine their unique functions in pea, and we show how residues involved in the biophysical features described (pH-titrations, dimerizations and chemical-denaturations) belong to regions involved in interaction with other proteins. Our results suggest that the sequence demands of protein-protein function are relatively rigid, with different protein-binding pockets (some in common) for each of the three proteins, but the demands of structure and conformational stability per se (as long as there is a maintained core), are less so. PMID:21364950
Derewenda, Zygmunt S; Godzik, Adam
2017-01-01
Crystallization of macromolecules has long been perceived as a stochastic process, which cannot be predicted or controlled. This is consistent with another popular notion that the interactions of molecules within the crystal, i.e., crystal contacts, are essentially random and devoid of specific physicochemical features. In contrast, functionally relevant surfaces, such as oligomerization interfaces and specific protein-protein interaction sites, are under evolutionary pressures so their amino acid composition, structure, and topology are distinct. However, current theoretical and experimental studies are significantly changing our understanding of the nature of crystallization. The increasingly popular "sticky patch" model, derived from soft matter physics, describes crystallization as a process driven by interactions between select, specific surface patches, with properties thermodynamically favorable for cohesive interactions. Independent support for this model comes from various sources including structural studies and bioinformatics. Proteins that are recalcitrant to crystallization can be modified for enhanced crystallizability through chemical or mutational modification of their surface to effectively engineer "sticky patches" which would drive crystallization. Here, we discuss the current state of knowledge of the relationship between the microscopic properties of the target macromolecule and its crystallizability, focusing on the "sticky patch" model. We discuss state-of-the-art in silico methods that evaluate the propensity of a given target protein to form crystals based on these relationships, with the objective to design variants with modified molecular surface properties and enhanced crystallization propensity. We illustrate this discussion with specific cases where these approaches allowed to generate crystals suitable for structural analysis.
Ote, Manabu; Yamamoto, Daisuke
2018-04-27
The toxic manipulator of oogenesis (TomO) protein has been identified in the wMel strain of Wolbachia that symbioses with the vinegar fly Drosophila melanogaster, as a protein that affects host reproduction. TomO protects germ stem cells (GSCs) from degeneration, which otherwise occurs in ovaries of host females that are mutant for the gene Sex-lethal (Sxl). We isolated the TomO homologs from wPip, a Wolbachia strain from the mosquito Culex quinquefasciatus. One of the homologs, TomO w Pip 1, exerted the GSC rescue activity in fly Sxl mutants when lacking its hydrophobic stretches. The GSC-rescuing action of the TomO w Pip 1 variant was ascribable to its abilities to associate with Nanos (nos) mRNA and to enhance Nos protein expression. The analysis of structure-activity relationships with TomO homologs and TomO deletion variants revealed distinct modules in the protein that are each dedicated to different functions, i.e., subcellular localization, nos mRNA binding or Nos expression enhancement. We propose that modular reshuffling is the basis for structural and functional diversification of TomO protein members. © 2018 Wiley Periodicals, Inc.
Xie, Hongbo; Vucetic, Slobodan; Iakoucheva, Lilia M; Oldfield, Christopher J; Dunker, A Keith; Obradovic, Zoran; Uversky, Vladimir N
2007-05-01
Currently, the understanding of the relationships between function, amino acid sequence, and protein structure continues to represent one of the major challenges of the modern protein science. As many as 50% of eukaryotic proteins are likely to contain functionally important long disordered regions. Many proteins are wholly disordered but still possess numerous biologically important functions. However, the number of experimentally confirmed disordered proteins with known biological functions is substantially smaller than their actual number in nature. Therefore, there is a crucial need for novel bionformatics approaches that allow projection of the current knowledge from a few experimentally verified examples to much larger groups of known and potential proteins. The elaboration of a bioinformatics tool for the analysis of functional diversity of intrinsically disordered proteins and application of this data mining tool to >200 000 proteins from the Swiss-Prot database, each annotated with at least one of the 875 functional keywords, was described in the first paper of this series (Xie, H.; Vucetic, S.; Iakoucheva, L. M.; Oldfield, C. J.; Dunker, A. K.; Obradovic, Z.; Uversky, V.N. Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. J. Proteome Res. 2007, 5, 1882-1898). Using this tool, we have found that out of the 710 Swiss-Prot functional keywords associated with at least 20 proteins, 262 were strongly positively correlated with long intrinsically disordered regions, and 302 were strongly negatively correlated. Illustrative examples of functional disorder or order were found for the vast majority of keywords showing strongest positive or negative correlation with intrinsic disorder, respectively. Some 80 Swiss-Prot keywords associated with disorder- and order-driven biological processes and protein functions were described in the first paper (see above). The second paper of the series was devoted to the presentation of 87 Swiss-Prot keywords attributed to the cellular components, domains, technical terms, developmental processes, and coding sequence diversities possessing strong positive and negative correlation with long disordered regions (Vucetic, S.; Xie, H.; Iakoucheva, L. M.; Oldfield, C. J.; Dunker, A. K.; Obradovic, Z.; Uversky, V. N. Functional anthology of intrinsic disorder. 2. Cellular components, domains, technical terms, developmental processes, and coding sequence diversities correlated with long disordered regions. J. Proteome Res. 2007, 5, 1899-1916). Protein structure and functionality can be modulated by various post-translational modifications or/and as a result of binding of specific ligands. Numerous human diseases are associated with protein misfolding/misassembly/misfunctioning. This work concludes the series of papers dedicated to the functional anthology of intrinsic disorder and describes approximately 80 Swiss-Prot functional keywords that are related to ligands, post-translational modifications, and diseases possessing strong positive or negative correlation with the predicted long disordered regions in proteins.
NASA Astrophysics Data System (ADS)
Rundgren, Carl-Johan; Hirsch, Richard; Chang Rundgren, Shu-Nu; Tibell, Lena A. E.
2012-10-01
This study examines how students explain their conceptual understanding of protein function using visualizations. Thirteen upper secondary students, four tertiary students (studying chemical biology), and two experts were interviewed in semi-structured interviews. The interviews were structured around 2D illustrations of proteins and an animated representation of water transport through a channel in the cell membrane. In the analysis of the transcripts, a score, based on the SOLO-taxonomy, was given to each student to indicate the conceptual depth achieved in their explanations. The use of scientific terms and non-conventionalized expressions in the students' explanations were investigated based upon a semiotic approach. The results indicated that there was a positive relationship between use of scientific terms and level of education. However, there was no correlation between students' use of scientific terms and conceptual depth. In the interviews, we found that non-conventionalized expressions were used by several participants to express conceptual understanding and played a role in making sense of the visualizations of protein function. Interestingly, also the experts made use of non-conventionalized expressions. The results of our study imply that more attention should be drawn to students' use of scientific and non-conventionalized terms in relation to their conceptual understanding.
Bevans, Carville G.; Krettler, Christoph; Reinhart, Christoph; Watzka, Matthias; Oldenburg, Johannes
2015-01-01
In humans and other vertebrate animals, vitamin K 2,3-epoxide reductase (VKOR) family enzymes are the gatekeepers between nutritionally acquired K vitamins and the vitamin K cycle responsible for posttranslational modifications that confer biological activity upon vitamin K-dependent proteins with crucial roles in hemostasis, bone development and homeostasis, hormonal carbohydrate regulation and fertility. We report a phylogenetic analysis of the VKOR family that identifies five major clades. Combined phylogenetic and site-specific conservation analyses point to clade-specific similarities and differences in structure and function. We discovered a single-site determinant uniquely identifying VKOR homologs belonging to human pathogenic, obligate intracellular prokaryotes and protists. Building on previous work by Sevier et al. (Protein Science 14:1630), we analyzed structural data from both VKOR and prokaryotic disulfide bond formation protein B (DsbB) families and hypothesize an ancient evolutionary relationship between the two families where one family arose from the other through a gene duplication/deletion event. This has resulted in circular permutation of primary sequence threading through the four-helical bundle protein folds of both families. This is the first report of circular permutation relating distant α-helical membrane protein sequences and folds. In conclusion, we suggest a chronology for the evolution of the five extant VKOR clades. PMID:26230708
Bevans, Carville G; Krettler, Christoph; Reinhart, Christoph; Watzka, Matthias; Oldenburg, Johannes
2015-07-29
In humans and other vertebrate animals, vitamin K 2,3-epoxide reductase (VKOR) family enzymes are the gatekeepers between nutritionally acquired K vitamins and the vitamin K cycle responsible for posttranslational modifications that confer biological activity upon vitamin K-dependent proteins with crucial roles in hemostasis, bone development and homeostasis, hormonal carbohydrate regulation and fertility. We report a phylogenetic analysis of the VKOR family that identifies five major clades. Combined phylogenetic and site-specific conservation analyses point to clade-specific similarities and differences in structure and function. We discovered a single-site determinant uniquely identifying VKOR homologs belonging to human pathogenic, obligate intracellular prokaryotes and protists. Building on previous work by Sevier et al. (Protein Science 14:1630), we analyzed structural data from both VKOR and prokaryotic disulfide bond formation protein B (DsbB) families and hypothesize an ancient evolutionary relationship between the two families where one family arose from the other through a gene duplication/deletion event. This has resulted in circular permutation of primary sequence threading through the four-helical bundle protein folds of both families. This is the first report of circular permutation relating distant a-helical membrane protein sequences and folds. In conclusion, we suggest a chronology for the evolution of the five extant VKOR clades.
Improved data visualization techniques for analyzing macromolecule structural changes.
Kim, Jae Hyun; Iyer, Vidyashankara; Joshi, Sangeeta B; Volkin, David B; Middaugh, C Russell
2012-10-01
The empirical phase diagram (EPD) is a colored representation of overall structural integrity and conformational stability of macromolecules in response to various environmental perturbations. Numerous proteins and macromolecular complexes have been analyzed by EPDs to summarize results from large data sets from multiple biophysical techniques. The current EPD method suffers from a number of deficiencies including lack of a meaningful relationship between color and actual molecular features, difficulties in identifying contributions from individual techniques, and a limited ability to be interpreted by color-blind individuals. In this work, three improved data visualization approaches are proposed as techniques complementary to the EPD. The secondary, tertiary, and quaternary structural changes of multiple proteins as a function of environmental stress were first measured using circular dichroism, intrinsic fluorescence spectroscopy, and static light scattering, respectively. Data sets were then visualized as (1) RGB colors using three-index EPDs, (2) equiangular polygons using radar charts, and (3) human facial features using Chernoff face diagrams. Data as a function of temperature and pH for bovine serum albumin, aldolase, and chymotrypsin as well as candidate protein vaccine antigens including a serine threonine kinase protein (SP1732) and surface antigen A (SP1650) from S. pneumoniae and hemagglutinin from an H1N1 influenza virus are used to illustrate the advantages and disadvantages of each type of data visualization technique. Copyright © 2012 The Protein Society.
Ye, Chaohui; Ilghari, Dariush; Niu, Jianlou; Xie, Yaoyao; Wang, Yan; Wang, Chao; Li, Xiaokun; Liu, Bailin; Huang, Zhifeng
2012-08-31
An in-depth understanding of molecular basis by which smart polymers assist protein refolding can lead us to develop a more effective polymer for protein refolding. In this report, to investigate structure-function relationship of pH-sensitive smart polymers, a series of poly(methylacrylic acid (MAc)-acrylic acid (AA))s with different MAc/AA ratios and molecular weights were synthesized and then their abilities in refolding of denatured lysozyme were compared by measuring the lytic activity of the refolded lysozyme. Based on our analysis, there were optimal MAc/AA ratio (44% MAc), M(w) (1700 Da), and copolymer concentration (0.1%, w/v) at which the highest yield of protein refolding was achieved. Fluorescence, circular dichroism, and RP-HPLC analysis reported in this study demonstrated that the presence of P(MAc-AA)s in the refolding buffer significantly improved the refolding yield of denatured lysozyme without affecting the overall structure of the enzyme. Importantly, our bioseparation analysis, together with the analysis of zeta potential and particle size of the copolymer in refolding buffers with different copolymer concentrations, suggested that the polymer provided a negatively charged surface for an electrostatic interaction with the denatured lysozyme molecules and thereby minimized the hydrophobic-prone aggregation of unfolded proteins during the process of refolding. Copyright © 2012 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Burrington, J.D.; Clark, D.S.
1989-01-01
The proceedings are divided into three parts: Bioscience and biotechnology; Structure-function relationships; and Biomimetics. Topics include: the chemistry of biotechnology, biomimetics, and biocatalysts; crystallography and mutagenesis; computerized simulation of biocatalysis and biomimetic processes; enzymatic reactions in micellar systems; hydroxylation of hydrocarbons; oxidation of lignin; zeolite catalysts as enzyme mimics; and immobilization of proteins and enzymes. Some papers have been processed separately for inclusion on the data base.
Molecular evolution of cyclin proteins in animals and fungi
2011-01-01
Background The passage through the cell cycle is controlled by complexes of cyclins, the regulatory units, with cyclin-dependent kinases, the catalytic units. It is also known that cyclins form several families, which differ considerably in primary structure from one eukaryotic organism to another. Despite these lines of evidence, the relationship between the evolution of cyclins and their function is an open issue. Here we present the results of our study on the molecular evolution of A-, B-, D-, E-type cyclin proteins in animals and fungi. Results We constructed phylogenetic trees for these proteins, their ancestral sequences and analyzed patterns of amino acid replacements. The analysis of infrequently fixed atypical amino acid replacements in cyclins evidenced that accelerated evolution proceeded predominantly during paralog duplication or after it in animals and fungi and that it was related to aromorphic changes in animals. It was shown also that evolutionary flexibility of cyclin function may be provided by consequential reorganization of regions on protein surface remote from CDK binding sites in animal and fungal cyclins and by functional differentiation of paralogous cyclins formed in animal evolution. Conclusions The results suggested that changes in the number and/or nature of cyclin-binding proteins may underlie the evolutionary role of the alterations in the molecular structure of cyclins and their involvement in diverse molecular-genetic events. PMID:21798004
Bandyopadhyay, Deepak; Huan, Jun; Prins, Jan; Snoeyink, Jack; Wang, Wei; Tropsha, Alexander
2009-11-01
Protein function prediction is one of the central problems in computational biology. We present a novel automated protein structure-based function prediction method using libraries of local residue packing patterns that are common to most proteins in a known functional family. Critical to this approach is the representation of a protein structure as a graph where residue vertices (residue name used as a vertex label) are connected by geometrical proximity edges. The approach employs two steps. First, it uses a fast subgraph mining algorithm to find all occurrences of family-specific labeled subgraphs for all well characterized protein structural and functional families. Second, it queries a new structure for occurrences of a set of motifs characteristic of a known family, using a graph index to speed up Ullman's subgraph isomorphism algorithm. The confidence of function inference from structure depends on the number of family-specific motifs found in the query structure compared with their distribution in a large non-redundant database of proteins. This method can assign a new structure to a specific functional family in cases where sequence alignments, sequence patterns, structural superposition and active site templates fail to provide accurate annotation.
Peptoid architectures: elaboration, actuation, and application.
Yoo, Barney; Kirshenbaum, Kent
2008-12-01
Peptoids are peptidomimetic oligomers composed of N-substituted glycine units. Their convenient synthesis enables strict control over the sequence of highly diverse monomers and is capable of generating extensive compound libraries. Recent studies are beginning to explore the relationship between peptoid sequence, structure and function. We describe new approaches to direct the conformation of the peptoid backbone, leading to secondary structures such as helices, loops, and turns. These advances are enabling the discovery of bioactive peptoids and will establish modules for the design and assembly of protein mimetics.
Rebelling for a Reason: Protein Structural “Outliers”
Arumugam, Gandhimathi; Nair, Anu G.; Hariharaputran, Sridhar; Ramanathan, Sowdhamini
2013-01-01
Analysis of structural variation in domain superfamilies can reveal constraints in protein evolution which aids protein structure prediction and classification. Structure-based sequence alignment of distantly related proteins, organized in PASS2 database, provides clues about structurally conserved regions among different functional families. Some superfamily members show large structural differences which are functionally relevant. This paper analyses the impact of structural divergence on function for multi-member superfamilies, selected from the PASS2 superfamily alignment database. Functional annotations within superfamilies, with structural outliers or ‘rebels’, are discussed in the context of structural variations. Overall, these data reinforce the idea that functional similarities cannot be extrapolated from mere structural conservation. The implication for fold-function prediction is that the functional annotations can only be inherited with very careful consideration, especially at low sequence identities. PMID:24073209
Relationships among msx gene structure and function in zebrafish and other vertebrates.
Ekker, M; Akimenko, M A; Allende, M L; Smith, R; Drouin, G; Langille, R M; Weinberg, E S; Westerfield, M
1997-10-01
The zebrafish genome contains at least five msx homeobox genes, msxA, msxB, msxC, msxD, and the newly isolated msxE. Although these genes share structural features common to all Msx genes, phylogenetic analyses of protein sequences indicate that the msx genes from zebrafish are not orthologous to the Msx1 and Msx2 genes of mammals, birds, and amphibians. The zebrafish msxB and msxC are more closely related to each other and to the mouse Msx3. Similarly, although the combinatorial expression of the zebrafish msx genes in the embryonic dorsal neuroectoderm, visceral arches, fins, and sensory organs suggests functional similarities with the Msx genes of other vertebrates, differences in the expression patterns preclude precise assignment of orthological relationships. Distinct duplication events may have given rise to the msx genes of modern fish and other vertebrate lineages whereas many aspects of msx gene functions during embryonic development have been preserved.
The spatial architecture of protein function and adaptation
McLaughlin, Richard N.; Poelwijk, Frank J.; Raman, Arjun; Gosal, Walraj S.; Ranganathan, Rama
2014-01-01
Statistical analysis of protein evolution suggests a design for natural proteins in which sparse networks of coevolving amino acids (termed sectors) comprise the essence of three-dimensional structure and function1, 2, 3, 4, 5. However, proteins are also subject to pressures deriving from the dynamics of the evolutionary process itself—the ability to tolerate mutation and to be adaptive to changing selection pressures6, 7, 8, 9, 10. To understand the relationship of the sector architecture to these properties, we developed a high-throughput quantitative method for a comprehensive single-mutation study in which every position is substituted individually to every other amino acid. Using a PDZ domain (PSD95pdz3) model system, we show that sector positions are functionally sensitive to mutation, whereas non-sector positions are more tolerant to substitution. In addition, we find that adaptation to a new binding specificity initiates exclusively through variation within sector residues. A combination of just two sector mutations located near and away from the ligand-binding site suffices to switch the binding specificity of PSD95pdz3 quantitatively towards a class-switching ligand. The localization of functional constraint and adaptive variation within the sector has important implications for understanding and engineering proteins. PMID:23041932
Orthogonal use of a human tRNA synthetase active site to achieve multi-functionality
Zhou, Quansheng; Kapoor, Mili; Guo, Min; Belani, Rajesh; Xu, Xiaoling; Kiosses, William B.; Hanan, Melanie; Park, Chulho; Armour, Eva; Do, Minh-Ha; Nangle, Leslie A.; Schimmel, Paul; Yang, Xiang-Lei
2011-01-01
Protein multi-functionality is an emerging explanation for the complexity of higher organisms. In this regard, while aminoacyl tRNA synthetases catalyze amino acid activation for protein synthesis, some also act in pathways for inflammation, angiogenesis, and apoptosis. How multiple functions evolved and their relationship to the active site is not clear. Here structural modeling analysis, mutagenesis, and cell-based functional studies show that the potent angiostatic, natural fragment of human TrpRS associates via Trp side chains that protrude from the cognate cellular receptor VE-cadherin. Modeling indicates that (I prefer the way it was because the conclusion was reached not only by modeling, but more so by experimental studies.)VE-cadherin Trp side chains fit into the Trp-specific active site of the synthetase. Thus, specific side chains of the receptor mimic (?) amino acid substrates and expand the functionality of the active site of the synthetase. We propose that orthogonal use of the same active site may be a general way to develop multi-functionality of human tRNA synthetases and other proteins. PMID:20010843
BioLayout(Java): versatile network visualisation of structural and functional relationships.
Goldovsky, Leon; Cases, Ildefonso; Enright, Anton J; Ouzounis, Christos A
2005-01-01
Visualisation of biological networks is becoming a common task for the analysis of high-throughput data. These networks correspond to a wide variety of biological relationships, such as sequence similarity, metabolic pathways, gene regulatory cascades and protein interactions. We present a general approach for the representation and analysis of networks of variable type, size and complexity. The application is based on the original BioLayout program (C-language implementation of the Fruchterman-Rheingold layout algorithm), entirely re-written in Java to guarantee portability across platforms. BioLayout(Java) provides broader functionality, various analysis techniques, extensions for better visualisation and a new user interface. Examples of analysis of biological networks using BioLayout(Java) are presented.
Clearing the skies over modular polyketide synthases.
Sherman, David H; Smith, Janet L
2006-09-19
Modular polyketide synthases (PKSs) are large multifunctional proteins that synthesize complex polyketide metabolites in microbial cells. A series of recent studies confirm the close protein structural relationship between catalytic domains in the type I mammalian fatty acid synthase (FAS) and the basic synthase unit of the modular PKS. They also establish a remarkable similarity in the overall organization of the type I FAS and the PKS module. This information provides important new conclusions about catalytic domain architecture, function, and molecular recognition that are essential for future efforts to engineer useful polyketide metabolites with valuable biological activities.
Membrane nanodomains in plants: capturing form, function, and movement.
Tapken, Wiebke; Murphy, Angus S
2015-03-01
The plasma membrane is the interface between the cell and the external environment. Plasma membrane lipids provide scaffolds for proteins and protein complexes that are involved in cell to cell communication, signal transduction, immune responses, and transport of small molecules. In animals, fungi, and plants, a substantial subset of these plasma membrane proteins function within ordered sterol- and sphingolipid-rich nanodomains. High-resolution microscopy, lipid dyes, pharmacological inhibitors of lipid biosynthesis, and lipid biosynthetic mutants have been employed to examine the relationship between the lipid environment and protein activity in plants. They have also been used to identify proteins associated with nanodomains and the pathways by which nanodomain-associated proteins are trafficked to their plasma membrane destinations. These studies suggest that plant membrane nanodomains function in a context-specific manner, analogous to similar structures in animals and fungi. In addition to the highly conserved flotillin and remorin markers, some members of the B and G subclasses of ATP binding cassette transporters have emerged as functional markers for plant nanodomains. Further, the glycophosphatidylinositol-anchored fasciclin-like arabinogalactan proteins, that are often associated with detergent-resistant membranes, appear also to have a functional role in membrane nanodomains. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Auer, Markus; Gruber, Clemens; Bellei, Marzia; Pirker, Katharina F.; Zamocky, Marcel; Kroiss, Daniela; Teufer, Stefan A.; Hofbauer, Stefan; Soudi, Monika; Battistuzzi, Gianantonio; Furtmüller, Paul G.; Obinger, Christian
2013-01-01
Reconstructing the phylogenetic relationships of the main evolutionary lines of the mammalian peroxidases lactoperoxidase and myeloperoxidase revealed the presence of novel bacterial heme peroxidase subfamilies. Here, for the first time, an ancestral bacterial heme peroxidase is shown to possess a very high bromide oxidation activity (besides conventional peroxidase activity). The recombinant protein allowed monitoring of the autocatalytic peroxide-driven formation of covalent heme to protein bonds. Thereby, the high spin ferric rhombic heme spectrum became similar to lactoperoxidase, the standard reduction potential of the Fe(III)/Fe(II) couple shifted to more positive values (−145 ± 10 mV at pH 7), and the conformational and thermal stability of the protein increased significantly. We discuss structure-function relationships of this new peroxidase in relation to its mammalian counterparts and ask for its putative physiological role. PMID:23918925
HHsvm: fast and accurate classification of profile–profile matches identified by HHsearch
Dlakić, Mensur
2009-01-01
Motivation: Recently developed profile–profile methods rival structural comparisons in their ability to detect homology between distantly related proteins. Despite this tremendous progress, many genuine relationships between protein families cannot be recognized as comparisons of their profiles result in scores that are statistically insignificant. Results: Using known evolutionary relationships among protein superfamilies in SCOP database, support vector machines were trained on four sets of discriminatory features derived from the output of HHsearch. Upon validation, it was shown that the automatic classification of all profile–profile matches was superior to fixed threshold-based annotation in terms of sensitivity and specificity. The effectiveness of this approach was demonstrated by annotating several domains of unknown function from the Pfam database. Availability: Programs and scripts implementing the methods described in this manuscript are freely available from http://hhsvm.dlakiclab.org/. Contact: mdlakic@montana.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:19773335
Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A
2017-04-01
Functional sites define the diversity of protein functions and are the central object of research of the structural and functional organization of proteins. The mechanisms underlying protein functional sites emergence and their variability during evolution are distinguished by duplication, shuffling, insertion and deletion of the exons in genes. The study of the correlation between a site structure and exon structure serves as the basis for the in-depth understanding of sites organization. In this regard, the development of programming resources that allow the realization of the mutual projection of exon structure of genes and primary and tertiary structures of encoded proteins is still the actual problem. Previously, we developed the SitEx system that provides information about protein and gene sequences with mapped exon borders and protein functional sites amino acid positions. The database included information on proteins with known 3D structure. However, data with respect to orthologs was not available. Therefore, we added the projection of sites positions to the exon structures of orthologs in SitEx 2.0. We implemented a search through database using site conservation variability and site discontinuity through exon structure. Inclusion of the information on orthologs allowed to expand the possibilities of SitEx usage for solving problems regarding the analysis of the structural and functional organization of proteins. Database URL: http://www-bionet.sscc.ru/sitex/ .
Achievements and Challenges in Computational Protein Design.
Samish, Ilan
2017-01-01
Computational protein design (CPD), a yet evolving field, includes computer-aided engineering for partial or full de novo designs of proteins of interest. Designs are defined by a requested structure, function, or working environment. This chapter describes the birth and maturation of the field by presenting 101 CPD examples in a chronological order emphasizing achievements and pending challenges. Integrating these aspects presents the plethora of CPD approaches with the hope of providing a "CPD 101". These reflect on the broader structural bioinformatics and computational biophysics field and include: (1) integration of knowledge-based and energy-based methods, (2) hierarchical designated approach towards local, regional, and global motifs and the integration of high- and low-resolution design schemes that fit each such region, (3) systematic differential approaches towards different protein regions, (4) identification of key hot-spot residues and the relative effect of remote regions, (5) assessment of shape-complementarity, electrostatics and solvation effects, (6) integration of thermal plasticity and functional dynamics, (7) negative design, (8) systematic integration of experimental approaches, (9) objective cross-assessment of methods, and (10) successful ranking of potential designs. Future challenges also include dissemination of CPD software to the general use of life-sciences researchers and the emphasis of success within an in vivo milieu. CPD increases our understanding of protein structure and function and the relationships between the two along with the application of such know-how for the benefit of mankind. Applied aspects range from biological drugs, via healthier and tastier food products to nanotechnology and environmentally friendly enzymes replacing toxic chemicals utilized in the industry.
Aminoacyl-tRNA synthetases: versatile players in the changing theater of translation.
Francklyn, Christopher; Perona, John J; Puetz, Joern; Hou, Ya-Ming
2002-01-01
Aminoacyl-tRNA synthetases attach amino acids to the 3' termini of cognate tRNAs to establish the specificity of protein synthesis. A recent Asilomar conference (California, January 13-18, 2002) discussed new research into the structure-function relationship of these crucial enzymes, as well as a multitude of novel functions, including participation in amino acid biosynthesis, cell cycle control, RNA splicing, and export of tRNAs from nucleus to cytoplasm in eukaryotic cells. Together with the discovery of their role in the cellular synthesis of proteins to incorporate selenocysteine and pyrrolysine, these diverse functions of aminoacyl-tRNA synthetases underscore the flexibility and adaptability of these ancient enzymes and stimulate the development of new concepts and methods for expanding the genetic code. PMID:12458790
G-LoSA for Prediction of Protein-Ligand Binding Sites and Structures.
Lee, Hui Sun; Im, Wonpil
2017-01-01
Recent advances in high-throughput structure determination and computational protein structure prediction have significantly enriched the universe of protein structure. However, there is still a large gap between the number of available protein structures and that of proteins with annotated function in high accuracy. Computational structure-based protein function prediction has emerged to reduce this knowledge gap. The identification of a ligand binding site and its structure is critical to the determination of a protein's molecular function. We present a computational methodology for predicting small molecule ligand binding site and ligand structure using G-LoSA, our protein local structure alignment and similarity measurement tool. All the computational procedures described here can be easily implemented using G-LoSA Toolkit, a package of standalone software programs and preprocessed PDB structure libraries. G-LoSA and G-LoSA Toolkit are freely available to academic users at http://compbio.lehigh.edu/GLoSA . We also illustrate a case study to show the potential of our template-based approach harnessing G-LoSA for protein function prediction.
Density functional study of molecular interactions in secondary structures of proteins.
Takano, Yu; Kusaka, Ayumi; Nakamura, Haruki
2016-01-01
Proteins play diverse and vital roles in biology, which are dominated by their three-dimensional structures. The three-dimensional structure of a protein determines its functions and chemical properties. Protein secondary structures, including α-helices and β-sheets, are key components of the protein architecture. Molecular interactions, in particular hydrogen bonds, play significant roles in the formation of protein secondary structures. Precise and quantitative estimations of these interactions are required to understand the principles underlying the formation of three-dimensional protein structures. In the present study, we have investigated the molecular interactions in α-helices and β-sheets, using ab initio wave function-based methods, the Hartree-Fock method (HF) and the second-order Møller-Plesset perturbation theory (MP2), density functional theory, and molecular mechanics. The characteristic interactions essential for forming the secondary structures are discussed quantitatively.
Bhatia, Chitra; Oerum, Stephanie; Bray, James; Kavanagh, Kathryn L; Shafqat, Naeem; Yue, Wyatt; Oppermann, Udo
2015-06-05
Short-chain dehydrogenases/reductases (SDRs) constitute a large, functionally diverse branch of enzymes within the class of NAD(P)(H) dependent oxidoreductases. In humans, over 80 genes have been identified with distinct metabolic roles in carbohydrate, amino acid, lipid, retinoid and steroid hormone metabolism, frequently associated with inherited genetic defects. Besides metabolic functions, a subset of atypical SDR proteins appears to play critical roles in adapting to redox status or RNA processing, and thereby controlling metabolic pathways. Here we present an update on the human SDR superfamily and a ligand identification strategy using differential scanning fluorimetry (DSF) with a focused library of oxidoreductase and metabolic ligands to identify substrate classes and inhibitor chemotypes. This method is applicable to investigate structure-activity relationships of oxidoreductases and ultimately to better understand their physiological roles. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Detection of functionally important regions in "hypothetical proteins" of known structure.
Nimrod, Guy; Schushan, Maya; Steinberg, David M; Ben-Tal, Nir
2008-12-10
Structural genomics initiatives provide ample structures of "hypothetical proteins" (i.e., proteins of unknown function) at an ever increasing rate. However, without function annotation, this structural goldmine is of little use to biologists who are interested in particular molecular systems. To this end, we used (an improved version of) the PatchFinder algorithm for the detection of functional regions on the protein surface, which could mediate its interactions with, e.g., substrates, ligands, and other proteins. Examination, using a data set of annotated proteins, showed that PatchFinder outperforms similar methods. We collected 757 structures of hypothetical proteins and their predicted functional regions in the N-Func database. Inspection of several of these regions demonstrated that they are useful for function prediction. For example, we suggested an interprotein interface and a putative nucleotide-binding site. A web-server implementation of PatchFinder and the N-Func database are available at http://patchfinder.tau.ac.il/.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sachleben, Joseph R.; Adhikari, Aashish N.; Gawlak, Grzegorz
2016-11-10
We determined the NMR structure of a highly aromatic (13%) protein of unknown function, Aq1974 from Aquifex aeolicus (PDB ID: 5SYQ). The unusual sequence of this protein has a tryptophan content five times the normal (six tryptophan residues of 114 or 5.2% while the average tryptophan content is 1.0%) with the tryptophans occurring in a WXW motif. It has no detectable sequence homology with known protein structures. Although its NMR spectrum suggested that the protein was rich in β-sheet, upon resonance assignment and solution structure determination, the protein was found to be primarily α-helical with a small two-stranded β-sheet withmore » a novel fold that we have termed an Aromatic Claw. As this fold was previously unknown and the sequence unique, we submitted the sequence to CASP10 as a target for blind structural prediction. At the end of the competition, the sequence was classified a hard template based model; the structural relationship between the template and the experimental structure was small and the predictions all failed to predict the structure. CSRosetta was found to predict the secondary structure and its packing; however, it was found that there was little correlation between CSRosetta score and the RMSD between the CSRosetta structure and the NMR determined one. This work demonstrates that even in relatively small proteins, we do not yet have the capacity to accurately predict the fold for all primary sequences. The experimental discovery of new folds helps guide the improvement of structural prediction methods.« less
Chandonia, John-Marc; Fox, Naomi K; Brenner, Steven E
2017-02-03
SCOPe (Structural Classification of Proteins-extended, http://scop.berkeley.edu) is a database of relationships between protein structures that extends the Structural Classification of Proteins (SCOP) database. SCOP is an expert-curated ordering of domains from the majority of proteins of known structure in a hierarchy according to structural and evolutionary relationships. SCOPe classifies the majority of protein structures released since SCOP development concluded in 2009, using a combination of manual curation and highly precise automated tools, aiming to have the same accuracy as fully hand-curated SCOP releases. SCOPe also incorporates and updates the ASTRAL compendium, which provides several databases and tools to aid in the analysis of the sequences and structures of proteins classified in SCOPe. SCOPe continues high-quality manual classification of new superfamilies, a key feature of SCOP. Artifacts such as expression tags are now separated into their own class, in order to distinguish them from the homology-based annotations in the remainder of the SCOPe hierarchy. SCOPe 2.06 contains 77,439 Protein Data Bank entries, double the 38,221 structures classified in SCOP. Copyright © 2016 The Author(s). Published by Elsevier Ltd.. All rights reserved.
NASA Astrophysics Data System (ADS)
Wang, Bo
We are living an era wherein nanoparticles (NPs) have been widely applied in our lives. Dendrimers are special polymeric NPs with unique physiochemical properties, which have been intensely explored for a variety of applications. Current studies on dendrimers are bottlenecked by insufficient understandings of their structure and dynamic behaviors from a molecular level. With primarily computational approaches supplemented by many other experimental technics, this dissertation aims to establish structure-function relationships of dendrimers in environmental and biomedical applications. More specifically, it thoroughly investigates the interactions between dendrimers and different biomolecules including carbon-based NPs, metal-based NPs, and proteins/peptides. Those results not only provide profound knowledge for evaluating the impacts of dendrimers on environmental and biological systems but also facilitate designing next-generation functional polymeric nanomaterials. The dissertation is organized as following. Chapter 1 provides an overview of current progresses on dendrimer studies, where methodology of Discrete Molecular Dynamics (DMD), my major research tool, is also introduced. Two directions of utilizing dendrimers will be discussed in following chapters. Chapter 2 will focus on environmental applications of dendrimers, where two back-to-back studies are presented. I will start from describing some interesting observations from experiments i.e. dendrimers dispersed model oil molecules. Then, I will reveal why surface chemistries of dendrimers lead to different remediation efficiencies by computational modelings. Finally, I will demonstrate different scenarios of dendrimer-small molecules association. Chapter 3 is centered on dendrimers in the biomedical applications including two subtopics. In the first topic, we will discuss dendrimers as surfactants that modulating the interactions between proteins and NPs. Some fundamental concepts regarding to NPs-Protein interactions such as NP-protein corona are also explained. In the following topic, I will look into amyloid protein aggregation mediated by dendrimers, which is of high expectations for combating amyloidogenic-related diseases. Chapter 4 concludes the whole dissertation. It also briefly introduces my ongoing projects and future research directions about dendrimers. This dissertation has presented a systematic study of dendrimers in environmental and biomedical applications which might provide valuable information for future dendrimer design thus benefit the nanobiotechnology.
Functional classification of protein structures by local structure matching in graph representation.
Mills, Caitlyn L; Garg, Rohan; Lee, Joslynn S; Tian, Liang; Suciu, Alexandru; Cooperman, Gene; Beuning, Penny J; Ondrechen, Mary Jo
2018-03-31
As a result of high-throughput protein structure initiatives, over 14,400 protein structures have been solved by structural genomics (SG) centers and participating research groups. While the totality of SG data represents a tremendous contribution to genomics and structural biology, reliable functional information for these proteins is generally lacking. Better functional predictions for SG proteins will add substantial value to the structural information already obtained. Our method described herein, Graph Representation of Active Sites for Prediction of Function (GRASP-Func), predicts quickly and accurately the biochemical function of proteins by representing residues at the predicted local active site as graphs rather than in Cartesian coordinates. We compare the GRASP-Func method to our previously reported method, structurally aligned local sites of activity (SALSA), using the ribulose phosphate binding barrel (RPBB), 6-hairpin glycosidase (6-HG), and Concanavalin A-like Lectins/Glucanase (CAL/G) superfamilies as test cases. In each of the superfamilies, SALSA and the much faster method GRASP-Func yield similar correct classification of previously characterized proteins, providing a validated benchmark for the new method. In addition, we analyzed SG proteins using our SALSA and GRASP-Func methods to predict function. Forty-one SG proteins in the RPBB superfamily, nine SG proteins in the 6-HG superfamily, and one SG protein in the CAL/G superfamily were successfully classified into one of the functional families in their respective superfamily by both methods. This improved, faster, validated computational method can yield more reliable predictions of function that can be used for a wide variety of applications by the community. © 2018 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.
Scior, Thomas; Paiz-Candia, Bertin; Islas, Ángel A; Sánchez-Solano, Alfredo; Millan-Perez Peña, Lourdes; Mancilla-Simbro, Claudia; Salinas-Stefanon, Eduardo M
2015-01-01
The molecular structure modeling of the β1 subunit of the skeletal muscle voltage-gated sodium channel (Nav1.4) was carried out in the twilight zone of very low homology. Structural significance can per se be confounded with random sequence similarities. Hence, we combined (i) not automated computational modeling of weakly homologous 3D templates, some with interfaces to analogous structures to the pore-bearing Nav1.4 α subunit with (ii) site-directed mutagenesis (SDM), as well as (iii) electrophysiological experiments to study the structure and function of the β1 subunit. Despite the distant phylogenic relationships, we found a 3D-template to identify two adjacent amino acids leading to the long-awaited loss of function (inactivation) of Nav1.4 channels. This mutant type (T109A, N110A, herein called TANA) was expressed and tested on cells of hamster ovary (CHO). The present electrophysiological results showed that the double alanine substitution TANA disrupted channel inactivation as if the β1 subunit would not be in complex with the α subunit. Exhaustive and unbiased sampling of "all β proteins" (Ig-like, Ig) resulted in a plethora of 3D templates which were compared to the target secondary structure prediction. The location of TANA was made possible thanks to another "all β protein" structure in complex with an irreversible bound protein as well as a reversible protein-protein interface (our "Rosetta Stone" effect). This finding coincides with our electrophysiological data (disrupted β1-like voltage dependence) and it is safe to utter that the Nav1.4 α/β1 interface is likely to be of reversible nature.
Expression, purification and crystallization of a plant polyketide cyclase from Cannabis sativa
Yang, Xinmei; Matsui, Takashi; Mori, Takahiro; Taura, Futoshi; Noguchi, Hiroshi; Abe, Ikuro; Morita, Hiroyuki
2015-01-01
Plant polyketides are a structurally diverse family of natural products. In the biosynthesis of plant polyketides, the construction of the carbocyclic scaffold is a key step in diversifying the polyketide structure. Olivetolic acid cyclase (OAC) from Cannabis sativa L. is the only known plant polyketide cyclase that catalyzes the C2–C7 intramolecular aldol cyclization of linear pentyl tetra-β-ketide-CoA to generate olivetolic acid in the biosynthesis of cannabinoids. The enzyme is also thought to belong to the dimeric α+β barrel (DABB) protein family. However, because of a lack of functional analysis of other plant DABB proteins and low sequence identity with the functionally distinct bacterial DABB proteins, the catalytic mechanism of OAC has remained unclear. To clarify the intimate catalytic mechanism of OAC, the enzyme was overexpressed in Escherichia coli and crystallized using the vapour-diffusion method. The crystals diffracted X-rays to 1.40 Å resolution and belonged to space group P3121 or P3221, with unit-cell parameters a = b = 47.3, c = 176.0 Å. Further crystallographic analysis will provide valuable insights into the structure–function relationship and catalytic mechanism of OAC. PMID:26625288
Segmented molecular design of self-healing proteinaceous materials
Sariola, Veikko; Pena-Francesch, Abdon; Jung, Huihun; Çetinkaya, Murat; Pacheco, Carlos; Sitti, Metin; Demirel, Melik C.
2015-01-01
Hierarchical assembly of self-healing adhesive proteins creates strong and robust structural and interfacial materials, but understanding of the molecular design and structure–property relationships of structural proteins remains unclear. Elucidating this relationship would allow rational design of next generation genetically engineered self-healing structural proteins. Here we report a general self-healing and -assembly strategy based on a multiphase recombinant protein based material. Segmented structure of the protein shows soft glycine- and tyrosine-rich segments with self-healing capability and hard beta-sheet segments. The soft segments are strongly plasticized by water, lowering the self-healing temperature close to body temperature. The hard segments self-assemble into nanoconfined domains to reinforce the material. The healing strength scales sublinearly with contact time, which associates with diffusion and wetting of autohesion. The finding suggests that recombinant structural proteins from heterologous expression have potential as strong and repairable engineering materials. PMID:26323335
Carraro, Nicola; Tisdale-Orr, Tracy Eizabeth; Clouse, Ronald Matthew; Knöller, Anne Sophie; Spicer, Rachel
2012-01-01
Intercellular transport of the plant hormone auxin is mediated by three families of membrane-bound protein carriers, with the PIN and ABCB families coding primarily for efflux proteins and the AUX/LAX family coding for influx proteins. In the last decade our understanding of gene and protein function for these transporters in Arabidopsis has expanded rapidly but very little is known about their role in woody plant development. Here we present a comprehensive account of all three families in the model woody species Populus, including chromosome distribution, protein structure, quantitative gene expression, and evolutionary relationships. The PIN and AUX/LAX gene families in Populus comprise 16 and 8 members respectively and show evidence for the retention of paralogs following a relatively recent whole genome duplication. There is also differential expression across tissues within many gene pairs. The ABCB family is previously undescribed in Populus and includes 20 members, showing a much deeper evolutionary history, including both tandem and whole genome duplication as well as probable gene loss. A striking number of these transporters are expressed in developing Populus stems and we suggest that evolutionary and structural relationships with known auxin transporters in Arabidopsis can point toward candidate genes for further study in Populus. This is especially important for the ABCBs, which is a large family and includes members in Arabidopsis that are able to transport other substrates in addition to auxin. Protein modeling, sequence alignment and expression data all point to ABCB1.1 as a likely auxin transport protein in Populus. Given that basipetal auxin flow through the cambial zone shapes the development of woody stems, it is important that we identify the full complement of genes involved in this process. This work should lay the foundation for studies targeting specific proteins for functional characterization and in situ localization. PMID:22645571
The present study explores the merit of utilizing available pharmaceutical data to construct a quantitative structure-activity relationship (QSAR) for prediction of the fraction of a chemical unbound to plasma protein (Fub) in environmentally relevant compounds. Independent model...
Catana, Cornel; Stouten, Pieter F W
2007-01-01
The ability to accurately predict biological affinity on the basis of in silico docking to a protein target remains a challenging goal in the CADD arena. Typically, "standard" scoring functions have been employed that use the calculated docking result and a set of empirical parameters to calculate a predicted binding affinity. To improve on this, we are exploring novel strategies for rapidly developing and tuning "customized" scoring functions tailored to a specific need. In the present work, three such customized scoring functions were developed using a set of 129 high-resolution protein-ligand crystal structures with measured Ki values. The functions were parametrized using N-PLS (N-way partial least squares), a multivariate technique well-known in the 3D quantitative structure-activity relationship field. A modest correlation between observed and calculated pKi values using a standard scoring function (r2 = 0.5) could be improved to 0.8 when a customized scoring function was applied. To mimic a more realistic scenario, a second scoring function was developed, not based on crystal structures but exclusively on several binding poses generated with the Flo+ docking program. Finally, a validation study was conducted by generating a third scoring function with 99 randomly selected complexes from the 129 as a training set and predicting pKi values for a test set that comprised the remaining 30 complexes. Training and test set r2 values were 0.77 and 0.78, respectively. These results indicate that, even without direct structural information, predictive customized scoring functions can be developed using N-PLS, and this approach holds significant potential as a general procedure for predicting binding affinity on the basis of in silico docking.
Komaromy, Andras Z; Kulsing, Chadin; Boysen, Reinhard I; Hearn, Milton T W
2015-03-01
Key requirements of protein purification by hydrophobic interaction chromatography (HIC) are preservation of the tertiary/quaternary structure, maintenance of biological function, and separation of the correctly folded protein from its unfolded forms or aggregates. This study examines the relationship between the HIC retention behavior of hen egg white lysozyme (HEWL) in high concentrations of several kosmotropic salts and its conformation, assessed by circular dichroism (CD) spectroscopy. Further, the physicochemical properties of HEWL in the presence of high concentrations of ammonium sulfate, sodium chloride and magnesium chloride were investigated by small angle X-ray scattering (SAXS) at different temperatures. Radii of gyration were extrapolated from Guinier approximations and the indirect transform program GNOM with protein-protein interaction and contrast variation taken into account. A bead model simulation provided information on protein structural changes using ab initio reconstruction with GASBOR. These results correlated to the secondary structure content obtained from CD spectroscopy of HEWL. These changes in SAXS and CD data were consistent with heat capacity ΔCp -values obtained from van't Hoff plot analyses of the retention data. Collectively, these insights enable informed decisions to be made on the choice of chromatographic conditions, leading to improved separation selectivity and opportunities for innovative column-assisted protein refolding methods. Copyright © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Membrane Transporters: Structure, Function and Targets for Drug Design
NASA Astrophysics Data System (ADS)
Ravna, Aina W.; Sager, Georg; Dahl, Svein G.; Sylte, Ingebrigt
Current therapeutic drugs act on four main types of molecular targets: enzymes, receptors, ion channels and transporters, among which a major part (60-70%) are membrane proteins. This review discusses the molecular structures and potential impact of membrane transporter proteins on new drug discovery. The three-dimensional (3D) molecular structure of a protein contains information about the active site and possible ligand binding, and about evolutionary relationships within the protein family. Transporters have a recognition site for a particular substrate, which may be used as a target for drugs inhibiting the transporter or acting as a false substrate. Three groups of transporters have particular interest as drug targets: the major facilitator superfamily, which includes almost 4000 different proteins transporting sugars, polyols, drugs, neurotransmitters, metabolites, amino acids, peptides, organic and inorganic anions and many other substrates; the ATP-binding cassette superfamily, which plays an important role in multidrug resistance in cancer chemotherapy; and the neurotransmitter:sodium symporter family, which includes the molecular targets for some of the most widely used psychotropic drugs. Recent technical advances have increased the number of known 3D structures of membrane transporters, and demonstrated that they form a divergent group of proteins with large conformational flexibility which facilitates transport of the substrate.
LeBlanc, Sharonda; Wilkins, Hunter; Li, Zimeng; Kaur, Parminder; Wang, Hong; Erie, Dorothy A
2017-01-01
Atomic force microscopy (AFM) is a scanning probe technique that allows visualization of single biomolecules and complexes deposited on a surface with nanometer resolution. AFM is a powerful tool for characterizing protein-protein and protein-DNA interactions. It can be used to capture snapshots of protein-DNA solution dynamics, which in turn, enables the characterization of the conformational properties of transient protein-protein and protein-DNA interactions. With AFM, it is possible to determine the stoichiometries and binding affinities of protein-protein and protein-DNA associations, the specificity of proteins binding to specific sites on DNA, and the conformations of the complexes. We describe methods to prepare and deposit samples, including surface treatments for optimal depositions, and how to quantitatively analyze images. We also discuss a new electrostatic force imaging technique called DREEM, which allows the visualization of the path of DNA within proteins in protein-DNA complexes. Collectively, these methods facilitate the development of comprehensive models of DNA repair and provide a broader understanding of all protein-protein and protein-nucleic acid interactions. The structural details gleaned from analysis of AFM images coupled with biochemistry provide vital information toward establishing the structure-function relationships that govern DNA repair processes. © 2017 Elsevier Inc. All rights reserved.
Predicting nucleic acid binding interfaces from structural models of proteins
Dror, Iris; Shazman, Shula; Mukherjee, Srayanta; Zhang, Yang; Glaser, Fabian; Mandel-Gutfreund, Yael
2011-01-01
The function of DNA- and RNA-binding proteins can be inferred from the characterization and accurate prediction of their binding interfaces. However the main pitfall of various structure-based methods for predicting nucleic acid binding function is that they are all limited to a relatively small number of proteins for which high-resolution three dimensional structures are available. In this study, we developed a pipeline for extracting functional electrostatic patches from surfaces of protein structural models, obtained using the I-TASSER protein structure predictor. The largest positive patches are extracted from the protein surface using the patchfinder algorithm. We show that functional electrostatic patches extracted from an ensemble of structural models highly overlap the patches extracted from high-resolution structures. Furthermore, by testing our pipeline on a set of 55 known nucleic acid binding proteins for which I-TASSER produces high-quality models, we show that the method accurately identifies the nucleic acids binding interface on structural models of proteins. Employing a combined patch approach we show that patches extracted from an ensemble of models better predicts the real nucleic acid binding interfaces compared to patches extracted from independent models. Overall, these results suggest that combining information from a collection of low-resolution structural models could be a valuable approach for functional annotation. We suggest that our method will be further applicable for predicting other functional surfaces of proteins with unknown structure. PMID:22086767
Adam, Benoit; Charloteaux, Benoit; Beaufays, Jerome; Vanhamme, Luc; Godfroid, Edmond; Brasseur, Robert; Lins, Laurence
2008-01-01
Background Lipocalins are widely distributed in nature and are found in bacteria, plants, arthropoda and vertebra. In hematophagous arthropods, they are implicated in the successful accomplishment of the blood meal, interfering with platelet aggregation, blood coagulation and inflammation and in the transmission of disease parasites such as Trypanosoma cruzi and Borrelia burgdorferi. The pairwise sequence identity is low among this family, often below 30%, despite a well conserved tertiary structure. Under the 30% identity threshold, alignment methods do not correctly assign and align proteins. The only safe way to assign a sequence to that family is by experimental determination. However, these procedures are long and costly and cannot always be applied. A way to circumvent the experimental approach is sequence and structure analyze. To further help in that task, the residues implicated in the stabilisation of the lipocalin fold were determined. This was done by analyzing the conserved interactions for ten lipocalins having a maximum pairwise identity of 28% and various functions. Results It was determined that two hydrophobic clusters of residues are conserved by analysing the ten lipocalin structures and sequences. One cluster is internal to the barrel, involving all strands and the 310 helix. The other is external, involving four strands and the helix lying parallel to the barrel surface. These clusters are also present in RaHBP2, a unusual "outlier" lipocalin from tick Rhipicephalus appendiculatus. This information was used to assess assignment of LIR2 a protein from Ixodes ricinus and to build a 3D model that helps to predict function. FTIR data support the lipocalin fold for this protein. Conclusion By sequence and structural analyzes, two conserved clusters of hydrophobic residues in interactions have been identified in lipocalins. Since the residues implicated are not conserved for function, they should provide the minimal subset necessary to confer the lipocalin fold. This information has been used to assign LIR2 to lipocalins and to investigate its structure/function relationship. This study could be applied to other protein families with low pairwise similarity, such as the structurally related fatty acid binding proteins or avidins. PMID:18190694
Wang, Hsin-Wei; Hsu, Yen-Chu; Hwang, Jenn-Kang; Lyu, Ping-Chiang; Pai, Tun-Wen; Tang, Chuan Yi
2010-01-01
This work presents a novel detection method for three-dimensional domain swapping (DS), a mechanism for forming protein quaternary structures that can be visualized as if monomers had “opened” their “closed” structures and exchanged the opened portion to form intertwined oligomers. Since the first report of DS in the mid 1990s, an increasing number of identified cases has led to the postulation that DS might occur in a protein with an unconstrained terminus under appropriate conditions. DS may play important roles in the molecular evolution and functional regulation of proteins and the formation of depositions in Alzheimer's and prion diseases. Moreover, it is promising for designing auto-assembling biomaterials. Despite the increasing interest in DS, related bioinformatics methods are rarely available. Owing to a dramatic conformational difference between the monomeric/closed and oligomeric/open forms, conventional structural comparison methods are inadequate for detecting DS. Hence, there is also a lack of comprehensive datasets for studying DS. Based on angle-distance (A-D) image transformations of secondary structural elements (SSEs), specific patterns within A-D images can be recognized and classified for structural similarities. In this work, a matching algorithm to extract corresponding SSE pairs from A-D images and a novel DS score have been designed and demonstrated to be applicable to the detection of DS relationships. The Matthews correlation coefficient (MCC) and sensitivity of the proposed DS-detecting method were higher than 0.81 even when the sequence identities of the proteins examined were lower than 10%. On average, the alignment percentage and root-mean-square distance (RMSD) computed by the proposed method were 90% and 1.8Å for a set of 1,211 DS-related pairs of proteins. The performances of structural alignments remain high and stable for DS-related homologs with less than 10% sequence identities. In addition, the quality of its hinge loop determination is comparable to that of manual inspection. This method has been implemented as a web-based tool, which requires two protein structures as the input and then the type and/or existence of DS relationships between the input structures are determined according to the A-D image-based structural alignments and the DS score. The proposed method is expected to trigger large-scale studies of this interesting structural phenomenon and facilitate related applications. PMID:20976204
Clustering and Network Analysis of Reverse Phase Protein Array Data.
Byron, Adam
2017-01-01
Molecular profiling of proteins and phosphoproteins using a reverse phase protein array (RPPA) platform, with a panel of target-specific antibodies, enables the parallel, quantitative proteomic analysis of many biological samples in a microarray format. Hence, RPPA analysis can generate a high volume of multidimensional data that must be effectively interrogated and interpreted. A range of computational techniques for data mining can be applied to detect and explore data structure and to form functional predictions from large datasets. Here, two approaches for the computational analysis of RPPA data are detailed: the identification of similar patterns of protein expression by hierarchical cluster analysis and the modeling of protein interactions and signaling relationships by network analysis. The protocols use freely available, cross-platform software, are easy to implement, and do not require any programming expertise. Serving as data-driven starting points for further in-depth analysis, validation, and biological experimentation, these and related bioinformatic approaches can accelerate the functional interpretation of RPPA data.
Molecular Modeling of Lipid Aggregates: Theory and Application
NASA Astrophysics Data System (ADS)
Fenner, Joel Stewart
The ability of cell membranes to perform a wide variety of biological functions stems from the organization and composition of its molecular constituents. There are many engineering applications, such as liposome drug delivery carriers, whose functionality takes advantage of the structure to function relationship of lipid membranes. The fundamental understanding of the relationship between the thermodynamic behavior and structure of lipid membranes and the molecular properties of their lipid constituents is crucial to the successful design of lipid related applications. However, information about how the local microscopic composition of lipid membranes responds to the presence of proteins and nanomaterials is challenging given the intrinsic experimental and theoretical difficulties of studying such small-scale systems. The present work generalizes a self consistent mean field theory for the study of the thermodynamic and structural behavior of lipid bilayers as a function of its molecular composition and physicochemical environments. This novel molecular theory provides with the ability of performing systematic thermodynamic calculations at relatively low computational costs while considering a detailed molecular description of the system under study. The competition of all relevant molecular interactions, such as electrostatics, vdW and chemical equilibria, in the membrane system is described. The developed molecular theory is applied to study how the protonation state of pH-sensitive amphiphiles in a membrane system affects the membrane's morphology. The molecular theory results demonstrate that the protonation state of ionizable groups within amphiphilic membranes shows a highly complex non-monotonic dependence on bulk salt concentration and pH strength. This result suggests that information about the pKa of the molecules is not sufficient to predict the protonation state of the ionizable groups in the membrane system. The molecular theory is also applied to study how the presence of proteins or functionalized nanoparticles near a multicomponent membrane surface leads to changes in its local membrane composition. The results support an electrostatic dependent recruitment mechanism of oncogenic RhoA proteins to the cell membrane. Finally, the molecular theory results describe how nanoparticle functionality and/or membrane molecular composition can be tuned to enhance or suppress nanoparticle adsorption on to phospholipid membranes.
An overview of the structures of protein-DNA complexes
Luscombe, Nicholas M; Austin, Susan E; Berman , Helen M; Thornton, Janet M
2000-01-01
On the basis of a structural analysis of 240 protein-DNA complexes contained in the Protein Data Bank (PDB), we have classified the DNA-binding proteins involved into eight different structural/functional groups, which are further classified into 54 structural families. Here we present this classification and review the functions, structures and binding interactions of these protein-DNA complexes. PMID:11104519
Xie, Hongbo; Vucetic, Slobodan; Iakoucheva, Lilia M.; Oldfield, Christopher J.; Dunker, A. Keith; Obradovic, Zoran; Uversky, Vladimir N.
2008-01-01
Currently, the understanding of the relationships between function, amino acid sequence and protein structure continues to represent one of the major challenges of the modern protein science. As much as 50% of eukaryotic proteins are likely to contain functionally important long disordered regions. Many proteins are wholly disordered but still possess numerous biologically important functions. However, the number of experimentally confirmed disordered proteins with known biological functions is substantially smaller than their actual number in nature. Therefore, there is a crucial need for novel bioinformatics approaches that allow projection of the current knowledge from a few experimentally verified examples to much larger groups of known and potential proteins. The elaboration of a bioinformatics tool for the analysis of functional diversity of intrinsically disordered proteins and application of this data mining tool to >200,000 proteins from Swiss-Prot database, each annotated with at least one of the 875 functional keywords was described in the first paper of this series (Xie H., Vucetic S., Iakoucheva L.M., Oldfield C.J., Dunker A.K., Obradovic Z., Uversky V.N. (2006) Functional anthology of intrinsic disorder. I. Biological processes and functions of proteins with long disordered regions. J. Proteome Res.). Using this tool, we have found that out of the 711 Swiss-Prot functional keywords associated with at least 20 proteins, 262 were strongly positively correlated with long intrinsically disordered regions, and 302 were strongly negatively correlated. Illustrative examples of functional disorder or order were found for the vast majority of keywords showing strongest positive or negative correlation with intrinsic disorder, respectively. Some 80 Swiss-Prot keywords associated with disorder- and order-driven biological processes and protein functions were described in the first paper (Xie H., Vucetic S., Iakoucheva L.M., Oldfield C.J., Dunker A.K., Obradovic Z., Uversky V.N. (2006) Functional anthology of intrinsic disorder. I. Biological processes and functions of proteins with long disordered regions. J. Proteome Res.). The second paper of the series was devoted to the presentation of 87 Swiss-Prot keywords attributed to the cellular components, domains, technical terms, developmental processes and coding sequence diversities possessing strong positive and negative correlation with long disordered regions (Vucetic S., Xie H., Iakoucheva L.M., Oldfield C.J., Dunker A.K., Obradovic Z., Uversky V.N. (2006) Functional anthology of intrinsic disorder. II. Cellular components, domains, technical terms, developmental processes and coding sequence diversities correlated with long disordered regions. J. Proteome Res.). Protein structure and functionality can be modulated by various posttranslational modifications or/and as a result of binding of specific ligands. Numerous human diseases are associated with protein misfolding/misassembly/ misfunctioning. This work concludes the series of papers dedicated to the functional anthology of intrinsic disorder and describes ~80 Swiss-Prot functional keywords that are related to ligands, posttranslational modifications and diseases possessing strong positive or negative correlation with the predicted long disordered regions in proteins. PMID:17391016
Du, Yushen; Wu, Nicholas C; Jiang, Lin; Zhang, Tianhao; Gong, Danyang; Shu, Sara; Wu, Ting-Ting; Sun, Ren
2016-11-01
Identification and annotation of functional residues are fundamental questions in protein sequence analysis. Sequence and structure conservation provides valuable information to tackle these questions. It is, however, limited by the incomplete sampling of sequence space in natural evolution. Moreover, proteins often have multiple functions, with overlapping sequences that present challenges to accurate annotation of the exact functions of individual residues by conservation-based methods. Using the influenza A virus PB1 protein as an example, we developed a method to systematically identify and annotate functional residues. We used saturation mutagenesis and high-throughput sequencing to measure the replication capacity of single nucleotide mutations across the entire PB1 protein. After predicting protein stability upon mutations, we identified functional PB1 residues that are essential for viral replication. To further annotate the functional residues important to the canonical or noncanonical functions of viral RNA-dependent RNA polymerase (vRdRp), we performed a homologous-structure analysis with 16 different vRdRp structures. We achieved high sensitivity in annotating the known canonical polymerase functional residues. Moreover, we identified a cluster of noncanonical functional residues located in the loop region of the PB1 β-ribbon. We further demonstrated that these residues were important for PB1 protein nuclear import through the interaction with Ran-binding protein 5. In summary, we developed a systematic and sensitive method to identify and annotate functional residues that are not restrained by sequence conservation. Importantly, this method is generally applicable to other proteins about which homologous-structure information is available. To fully comprehend the diverse functions of a protein, it is essential to understand the functionality of individual residues. Current methods are highly dependent on evolutionary sequence conservation, which is usually limited by sampling size. Sequence conservation-based methods are further confounded by structural constraints and multifunctionality of proteins. Here we present a method that can systematically identify and annotate functional residues of a given protein. We used a high-throughput functional profiling platform to identify essential residues. Coupling it with homologous-structure comparison, we were able to annotate multiple functions of proteins. We demonstrated the method with the PB1 protein of influenza A virus and identified novel functional residues in addition to its canonical function as an RNA-dependent RNA polymerase. Not limited to virology, this method is generally applicable to other proteins that can be functionally selected and about which homologous-structure information is available. Copyright © 2016 Du et al.
Towards enamel biomimetics: Structure, mechanical properties and biomineralization of dental enamel
NASA Astrophysics Data System (ADS)
Fong, Hanson Kwok
Dental enamel is the most mineralized tissue in the human body. This bioceramic, composed largely of hydroxyapatite (HAp), is also one of the most durable tissues despite a lifetime of masticatory loading and bacterial attack. The biosynthesis of enamel, which occurs in physiological conditions is a complex orchestration of protein assembly and mineral formation. The resulting product is the hardest tissue in the vertebrate body with the longest and most organized arrangement of hydroxyapatite crystals known to biomineralizing systems. Detail understanding of the structure of enamel in relationship to its mechanical function and the biomineralization process will provide a framework for enamel regeneration as well as potential lessons in the design of engineering materials. The objective of this study, therefore, is twofold: (1) establish the structure-function relationship of enamel as well as the dentine-enamel junction (DEJ) and (2) determine the effect of proteins on the enamel biomineralization process. A hierarchy in the enamel structure was established by means of various microscopy techniques (e.g. SEM, TEM, AFM). Mechanical properties (hardness and elastic modulus) associated with the microstructural features were also determined by nanoindentation. Furthermore, the DEJ was found to have a width in the range of micrometers to 10s of micrometers with continuous change in structure and mechanical properties. Indentation tests and contact fatigue tests using a spherical indenter have revealed that the structural features in the enamel and the DEJ played important roles in containing crack propagation emanating from the enamel tissue. To further understand the effect of this protein on the biominerailzation process, we have studied genetically engineered animals that express altered amelogenin which lack the known self-assembly properties. This in vivo study has revealed that, without the proper self-assembly of the amelogenin protein as demonstrated by the altered amelogenin, the crystal organization of the apatite phase was severely disrupted at the nucleation stage resulting in lower mineral density at the mature stage. Consequently measurably inferior mechanical properties were found in the mature enamel grown with altered amelogenin when compared to the age matched wild-type.
Evolution and functional divergence of the anoctamin family of membrane proteins
2010-01-01
Background The anoctamin family of transmembrane proteins are found in all eukaryotes and consists of 10 members in vertebrates. Ano1 and ano2 were observed to have Ca2+ activated Cl- channel activity. Recent findings however have revealed that ano6, and ano7 can also produce chloride currents, although with different properties. In contrast, ano9 and ano10 suppress baseline Cl- conductance when co-expressed with ano1 thus suggesting that different anoctamins can interfere with each other. In order to elucidate intrinsic functional diversity, and underlying evolutionary mechanism among anoctamins, we performed comprehensive bioinformatics analysis of anoctamin gene family. Results Our results show that anoctamin protein paralogs evolved from several gene duplication events followed by functional divergence of vertebrate anoctamins. Most of the amino acid replacements responsible for the functional divergence were fixed by adaptive evolution and this seem to be a common pattern in anoctamin gene family evolution. Strong purifying selection and the loss of many gene duplication products indicate rigid structure-function relationships among anoctamins. Conclusions Our study suggests that anoctamins have evolved by series of duplication events, and that they are constrained by purifying selection. In addition we identified a number of protein domains, and amino acid residues which contribute to predicted functional divergence. Hopefully, this work will facilitate future functional characterization of the anoctamin membrane protein family. PMID:20964844
Evolutionary and Functional Relationships in the Truncated Hemoglobin Family.
Bustamante, Juan P; Radusky, Leandro; Boechi, Leonardo; Estrin, Darío A; Ten Have, Arjen; Martí, Marcelo A
2016-01-01
Predicting function from sequence is an important goal in current biological research, and although, broad functional assignment is possible when a protein is assigned to a family, predicting functional specificity with accuracy is not straightforward. If function is provided by key structural properties and the relevant properties can be computed using the sequence as the starting point, it should in principle be possible to predict function in detail. The truncated hemoglobin family presents an interesting benchmark study due to their ubiquity, sequence diversity in the context of a conserved fold and the number of characterized members. Their functions are tightly related to O2 affinity and reactivity, as determined by the association and dissociation rate constants, both of which can be predicted and analyzed using in-silico based tools. In the present work we have applied a strategy, which combines homology modeling with molecular based energy calculations, to predict and analyze function of all known truncated hemoglobins in an evolutionary context. Our results show that truncated hemoglobins present conserved family features, but that its structure is flexible enough to allow the switch from high to low affinity in a few evolutionary steps. Most proteins display moderate to high oxygen affinities and multiple ligand migration paths, which, besides some minor trends, show heterogeneous distributions throughout the phylogenetic tree, again suggesting fast functional adaptation. Our data not only deepens our comprehension of the structural basis governing ligand affinity, but they also highlight some interesting functional evolutionary trends.
Evolutionary and Functional Relationships in the Truncated Hemoglobin Family
Bustamante, Juan P.; Radusky, Leandro; Boechi, Leonardo; Estrin, Darío A.; ten Have, Arjen; Martí, Marcelo A.
2016-01-01
Predicting function from sequence is an important goal in current biological research, and although, broad functional assignment is possible when a protein is assigned to a family, predicting functional specificity with accuracy is not straightforward. If function is provided by key structural properties and the relevant properties can be computed using the sequence as the starting point, it should in principle be possible to predict function in detail. The truncated hemoglobin family presents an interesting benchmark study due to their ubiquity, sequence diversity in the context of a conserved fold and the number of characterized members. Their functions are tightly related to O2 affinity and reactivity, as determined by the association and dissociation rate constants, both of which can be predicted and analyzed using in-silico based tools. In the present work we have applied a strategy, which combines homology modeling with molecular based energy calculations, to predict and analyze function of all known truncated hemoglobins in an evolutionary context. Our results show that truncated hemoglobins present conserved family features, but that its structure is flexible enough to allow the switch from high to low affinity in a few evolutionary steps. Most proteins display moderate to high oxygen affinities and multiple ligand migration paths, which, besides some minor trends, show heterogeneous distributions throughout the phylogenetic tree, again suggesting fast functional adaptation. Our data not only deepens our comprehension of the structural basis governing ligand affinity, but they also highlight some interesting functional evolutionary trends. PMID:26788940
Bioinformatics analysis of disordered proteins in prokaryotes
2011-01-01
Background A significant number of proteins have been shown to be intrinsically disordered, meaning that they lack a fixed 3 D structure or contain regions that do not posses a well defined 3 D structure. It has also been proven that a protein's disorder content is related to its function. We have performed an exhaustive analysis and comparison of the disorder content of proteins from prokaryotic organisms (i.e., superkingdoms Archaea and Bacteria) with respect to functional categories they belong to, i.e., Clusters of Orthologous Groups of proteins (COGs) and groups of COGs-Cellular processes (Cp), Information storage and processing (Isp), Metabolism (Me) and Poorly characterized (Pc). We also analyzed the disorder content of proteins with respect to various genomic, metabolic and ecological characteristics of the organism they belong to. We used correlations and association rule mining in order to identify the most confident associations between specific modalities of the characteristics considered and disorder content. Results Bacteria are shown to have a somewhat higher level of protein disorder than archaea, except for proteins in the Me functional group. It is demonstrated that the Isp and Cp functional groups in particular (L-repair function and N-cell motility and secretion COGs of proteins in specific) possess the highest disorder content, while Me proteins, in general, posses the lowest. Disorder fractions have been confirmed to have the lowest level for the so-called order-promoting amino acids and the highest level for the so-called disorder promoters. For each pair of organism characteristics, specific modalities are identified with the maximum disorder proteins in the corresponding organisms, e.g., high genome size-high GC content organisms, facultative anaerobic-low GC content organisms, aerobic-high genome size organisms, etc. Maximum disorder in archaea is observed for high GC content-low genome size organisms, high GC content-facultative anaerobic or aquatic or mesophilic organisms, etc. Maximum disorder in bacteria is observed for high GC content-high genome size organisms, high genome size-aerobic organisms, etc. Some of the most reliable association rules mined establish relationships between high GC content and high protein disorder, medium GC content and both medium and low protein disorder, anaerobic organisms and medium protein disorder, Gammaproteobacteria and low protein disorder, etc. A web site Prokaryote Disorder Database has been designed and implemented at the address http://bioinfo.matf.bg.ac.rs/disorder, which contains complete results of the analysis of protein disorder performed for 296 prokaryotic completely sequenced genomes. Conclusions Exhaustive disorder analysis has been performed by functional classes of proteins, for a larger dataset of prokaryotic organisms than previously done. Results obtained are well correlated to those previously published, with some extension in the range of disorder level and clear distinction between functional classes of proteins. Wide correlation and association analysis between protein disorder and genomic and ecological characteristics has been performed for the first time. The results obtained give insight into multi-relationships among the characteristics and protein disorder. Such analysis provides for better understanding of the evolutionary process and may be useful for taxon determination. The main drawback of the approach is the fact that the disorder considered has been predicted and not experimentally established. PMID:21366926
Wetzel, Margaret E.; Olsen, Gary J.; Chakravartty, Vandana; ...
2015-11-19
The large repABC plasmids of the order Rhizobiales with Class I quorum-regulated conjugative transfer systems often define the nature of the bacterium that harbors them. These otherwise diverse plasmids contain a core of highly conserved genes for replication and conjugation raising the question of their evolutionary relationships. In an analysis of 18 such plasmids these elements fall into two organizational classes, Group I and Group II, based on the sites at which cargo DNA is located. Cladograms constructed from proteins of the transfer and quorum-sensing components indicated that those of the Group I plasmids, while coevolving, have diverged from thosemore » coevolving proteins of the Group II plasmids. Moreover, within these groups the phylogenies of the proteins usually occupy similar, if not identical, tree topologies. Remarkably, such relationships were not seen among proteins of the replication system; although RepA and RepB coevolve, RepC does not. Nor do the replication proteins coevolve with the proteins of the transfer and quorum-sensing systems. Functional analysis was mostly consistent with phylogenies. TraR activated promoters from plasmids within its group, but not between groups and dimerized with TraR proteins from within but not between groups. However, oriT sequences, which are highly conserved, were processed by the transfer system of plasmids regardless of group. Here, we conclude that these plasmids diverged into two classes based on the locations at which cargo DNA is inserted, that the quorum-sensing and transfer functions are coevolving within but not between the two groups, and that this divergent evolution extends to function.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wetzel, Margaret E.; Olsen, Gary J.; Chakravartty, Vandana
The large repABC plasmids of the order Rhizobiales with Class I quorum-regulated conjugative transfer systems often define the nature of the bacterium that harbors them. These otherwise diverse plasmids contain a core of highly conserved genes for replication and conjugation raising the question of their evolutionary relationships. In an analysis of 18 such plasmids these elements fall into two organizational classes, Group I and Group II, based on the sites at which cargo DNA is located. Cladograms constructed from proteins of the transfer and quorum-sensing components indicated that those of the Group I plasmids, while coevolving, have diverged from thosemore » coevolving proteins of the Group II plasmids. Moreover, within these groups the phylogenies of the proteins usually occupy similar, if not identical, tree topologies. Remarkably, such relationships were not seen among proteins of the replication system; although RepA and RepB coevolve, RepC does not. Nor do the replication proteins coevolve with the proteins of the transfer and quorum-sensing systems. Functional analysis was mostly consistent with phylogenies. TraR activated promoters from plasmids within its group, but not between groups and dimerized with TraR proteins from within but not between groups. However, oriT sequences, which are highly conserved, were processed by the transfer system of plasmids regardless of group. Here, we conclude that these plasmids diverged into two classes based on the locations at which cargo DNA is inserted, that the quorum-sensing and transfer functions are coevolving within but not between the two groups, and that this divergent evolution extends to function.« less
New Measurement for Correlation of Co-evolution Relationship of Subsequences in Protein.
Gao, Hongyun; Yu, Xiaoqing; Dou, Yongchao; Wang, Jun
2015-12-01
Many computational tools have been developed to measure the protein residues co-evolution. Most of them only focus on co-evolution for pairwise residues in a protein sequence. However, number of residues participate in co-evolution might be multiple. And some co-evolved residues are clustered in several distinct regions in primary structure. Therefore, the co-evolution among the adjacent residues and the correlation between the distinct regions offer insights into function and evolution of the protein and residues. Subsequence is used to represent the adjacent multiple residues in one distinct region. In the paper, co-evolution relationship in each subsequence is represented by mutual information matrix (MIM). Then, Pearson's correlation coefficient: R value is developed to measure the similarity correlation of two MIMs. MSAs from Catalytic Data Base (Catalytic Site Atlas, CSA) are used for testing. R value characterizes a specific class of residues. In contrast to individual pairwise co-evolved residues, adjacent residues without high individual MI values are found since the co-evolved relationship among them is similar to that among another set of adjacent residues. These subsequences possess some flexibility in the composition of side chains, such as the catalyzed environment.
Rapid search for tertiary fragments reveals protein sequence–structure relationships
Zhou, Jianfu; Grigoryan, Gevorg
2015-01-01
Finding backbone substructures from the Protein Data Bank that match an arbitrary query structural motif, composed of multiple disjoint segments, is a problem of growing relevance in structure prediction and protein design. Although numerous protein structure search approaches have been proposed, methods that address this specific task without additional restrictions and on practical time scales are generally lacking. Here, we propose a solution, dubbed MASTER, that is both rapid, enabling searches over the Protein Data Bank in a matter of seconds, and provably correct, finding all matches below a user-specified root-mean-square deviation cutoff. We show that despite the potentially exponential time complexity of the problem, running times in practice are modest even for queries with many segments. The ability to explore naturally plausible structural and sequence variations around a given motif has the potential to synthesize its design principles in an automated manner; so we go on to illustrate the utility of MASTER to protein structural biology. We demonstrate its capacity to rapidly establish structure–sequence relationships, uncover the native designability landscapes of tertiary structural motifs, identify structural signatures of binding, and automatically rewire protein topologies. Given the broad utility of protein tertiary fragment searches, we hope that providing MASTER in an open-source format will enable novel advances in understanding, predicting, and designing protein structure. PMID:25420575
Lua, Rhonald C; Wilson, Stephen J; Konecki, Daniel M; Wilkins, Angela D; Venner, Eric; Morgan, Daniel H; Lichtarge, Olivier
2016-01-04
The structure and function of proteins underlie most aspects of biology and their mutational perturbations often cause disease. To identify the molecular determinants of function as well as targets for drugs, it is central to characterize the important residues and how they cluster to form functional sites. The Evolutionary Trace (ET) achieves this by ranking the functional and structural importance of the protein sequence positions. ET uses evolutionary distances to estimate functional distances and correlates genotype variations with those in the fitness phenotype. Thus, ET ranks are worse for sequence positions that vary among evolutionarily closer homologs but better for positions that vary mostly among distant homologs. This approach identifies functional determinants, predicts function, guides the mutational redesign of functional and allosteric specificity, and interprets the action of coding sequence variations in proteins, people and populations. Now, the UET database offers pre-computed ET analyses for the protein structure databank, and on-the-fly analysis of any protein sequence. A web interface retrieves ET rankings of sequence positions and maps results to a structure to identify functionally important regions. This UET database integrates several ways of viewing the results on the protein sequence or structure and can be found at http://mammoth.bcm.tmc.edu/uet/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Lam, Winnie W M; Chan, Keith C C
2012-04-01
Protein molecules interact with each other in protein complexes to perform many vital functions, and different computational techniques have been developed to identify protein complexes in protein-protein interaction (PPI) networks. These techniques are developed to search for subgraphs of high connectivity in PPI networks under the assumption that the proteins in a protein complex are highly interconnected. While these techniques have been shown to be quite effective, it is also possible that the matching rate between the protein complexes they discover and those that are previously determined experimentally be relatively low and the "false-alarm" rate can be relatively high. This is especially the case when the assumption of proteins in protein complexes being more highly interconnected be relatively invalid. To increase the matching rate and reduce the false-alarm rate, we have developed a technique that can work effectively without having to make this assumption. The name of the technique called protein complex identification by discovering functional interdependence (PCIFI) searches for protein complexes in PPI networks by taking into consideration both the functional interdependence relationship between protein molecules and the network topology of the network. The PCIFI works in several steps. The first step is to construct a multiple-function protein network graph by labeling each vertex with one or more of the molecular functions it performs. The second step is to filter out protein interactions between protein pairs that are not functionally interdependent of each other in the statistical sense. The third step is to make use of an information-theoretic measure to determine the strength of the functional interdependence between all remaining interacting protein pairs. Finally, the last step is to try to form protein complexes based on the measure of the strength of functional interdependence and the connectivity between proteins. For performance evaluation, PCIFI was used to identify protein complexes in real PPI network data and the protein complexes it found were matched against those that were previously known in MIPS. The results show that PCIFI can be an effective technique for the identification of protein complexes. The protein complexes it found can match more known protein complexes with a smaller false-alarm rate and can provide useful insights into the understanding of the functional interdependence relationships between proteins in protein complexes.
Maurer-Stroh, Sebastian; Gao, He; Han, Hao; Baeten, Lies; Schymkowitz, Joost; Rousseau, Frederic; Zhang, Louxin; Eisenhaber, Frank
2013-02-01
Data mining in protein databases, derivatives from more fundamental protein 3D structure and sequence databases, has considerable unearthed potential for the discovery of sequence motif--structural motif--function relationships as the finding of the U-shape (Huf-Zinc) motif, originally a small student's project, exemplifies. The metal ion zinc is critically involved in universal biological processes, ranging from protein-DNA complexes and transcription regulation to enzymatic catalysis and metabolic pathways. Proteins have evolved a series of motifs to specifically recognize and bind zinc ions. Many of these, so called zinc fingers, are structurally independent globular domains with discontinuous binding motifs made up of residues mostly far apart in sequence. Through a systematic approach starting from the BRIX structure fragment database, we discovered that there exists another predictable subset of zinc-binding motifs that not only have a conserved continuous sequence pattern but also share a characteristic local conformation, despite being included in totally different overall folds. While this does not allow general prediction of all Zn binding motifs, a HMM-based web server, Huf-Zinc, is available for prediction of these novel, as well as conventional, zinc finger motifs in protein sequences. The Huf-Zinc webserver can be freely accessed through this URL (http://mendel.bii.a-star.edu.sg/METHODS/hufzinc/).
Comparison of molecular dynamics and superfamily spaces of protein domain deformation.
Velázquez-Muriel, Javier A; Rueda, Manuel; Cuesta, Isabel; Pascual-Montano, Alberto; Orozco, Modesto; Carazo, José-María
2009-02-17
It is well known the strong relationship between protein structure and flexibility, on one hand, and biological protein function, on the other hand. Technically, protein flexibility exploration is an essential task in many applications, such as protein structure prediction and modeling. In this contribution we have compared two different approaches to explore the flexibility space of protein domains: i) molecular dynamics (MD-space), and ii) the study of the structural changes within superfamily (SF-space). Our analysis indicates that the MD-space and the SF-space display a significant overlap, but are still different enough to be considered as complementary. The SF-space space is wider but less complex than the MD-space, irrespective of the number of members in the superfamily. Also, the SF-space does not sample all possibilities offered by the MD-space, but often introduces very large changes along just a few deformation modes, whose number tend to a plateau as the number of related folds in the superfamily increases. Theoretically, we obtained two conclusions. First, that function restricts the access to some flexibility patterns to evolution, as we observe that when a superfamily member changes to become another, the path does not completely overlap with the physical deformability. Second, that conformational changes from variation in a superfamily are larger and much simpler than those allowed by physical deformability. Methodologically, the conclusion is that both spaces studied are complementary, and have different size and complexity. We expect this fact to have application in fields as 3D-EM/X-ray hybrid models or ab initio protein folding.
Comparison of molecular dynamics and superfamily spaces of protein domain deformation
Velázquez-Muriel, Javier A; Rueda, Manuel; Cuesta, Isabel; Pascual-Montano, Alberto; Orozco, Modesto; Carazo, José-María
2009-01-01
Background It is well known the strong relationship between protein structure and flexibility, on one hand, and biological protein function, on the other hand. Technically, protein flexibility exploration is an essential task in many applications, such as protein structure prediction and modeling. In this contribution we have compared two different approaches to explore the flexibility space of protein domains: i) molecular dynamics (MD-space), and ii) the study of the structural changes within superfamily (SF-space). Results Our analysis indicates that the MD-space and the SF-space display a significant overlap, but are still different enough to be considered as complementary. The SF-space space is wider but less complex than the MD-space, irrespective of the number of members in the superfamily. Also, the SF-space does not sample all possibilities offered by the MD-space, but often introduces very large changes along just a few deformation modes, whose number tend to a plateau as the number of related folds in the superfamily increases. Conclusion Theoretically, we obtained two conclusions. First, that function restricts the access to some flexibility patterns to evolution, as we observe that when a superfamily member changes to become another, the path does not completely overlap with the physical deformability. Second, that conformational changes from variation in a superfamily are larger and much simpler than those allowed by physical deformability. Methodologically, the conclusion is that both spaces studied are complementary, and have different size and complexity. We expect this fact to have application in fields as 3D-EM/X-ray hybrid models or ab initio protein folding. PMID:19220918
LenVarDB: database of length-variant protein domains.
Mutt, Eshita; Mathew, Oommen K; Sowdhamini, Ramanathan
2014-01-01
Protein domains are functionally and structurally independent modules, which add to the functional variety of proteins. This array of functional diversity has been enabled by evolutionary changes, such as amino acid substitutions or insertions or deletions, occurring in these protein domains. Length variations (indels) can introduce changes at structural, functional and interaction levels. LenVarDB (freely available at http://caps.ncbs.res.in/lenvardb/) traces these length variations, starting from structure-based sequence alignments in our Protein Alignments organized as Structural Superfamilies (PASS2) database, across 731 structural classification of proteins (SCOP)-based protein domain superfamilies connected to 2 730 625 sequence homologues. Alignment of sequence homologues corresponding to a structural domain is available, starting from a structure-based sequence alignment of the superfamily. Orientation of the length-variant (indel) regions in protein domains can be visualized by mapping them on the structure and on the alignment. Knowledge about location of length variations within protein domains and their visual representation will be useful in predicting changes within structurally or functionally relevant sites, which may ultimately regulate protein function. Non-technical summary: Evolutionary changes bring about natural changes to proteins that may be found in many organisms. Such changes could be reflected as amino acid substitutions or insertions-deletions (indels) in protein sequences. LenVarDB is a database that provides an early overview of observed length variations that were set among 731 protein families and after examining >2 million sequences. Indels are followed up to observe if they are close to the active site such that they can affect the activity of proteins. Inclusion of such information can aid the design of bioengineering experiments.
Matharu, Zimple; Daggumati, Pallavi; Wang, Ling; Dorofeeva, Tatiana S; Li, Zidong; Seker, Erkin
2017-04-19
Nanoporous gold (np-Au) electrode coatings significantly enhance the performance of electrochemical nucleic acid biosensors because of their three-dimensional nanoscale network, high electrical conductivity, facile surface functionalization, and biocompatibility. Contrary to planar electrodes, the np-Au electrodes also exhibit sensitive detection in the presence of common biofouling media due to their porous structure. However, the pore size of the nanomatrix plays a critical role in dictating the extent of biomolecular capture and transport. Small pores perform better in the case of target detection in complex samples by filtering out the large nonspecific proteins. On the other hand, larger pores increase the accessibility of target nucleic acids in the nanoporous structure, enhancing the detection limits of the sensor at the expense of more interference from biofouling molecules. Here, we report a microfabricated np-Au multiple electrode array that displays a range of electrode morphologies on the same chip for identifying feature sizes that reduce the nonspecific adsorption of proteins but facilitate the permeation of target DNA molecules into the pores. We demonstrate the utility of the electrode morphology library in studying DNA functionalization and target detection in complex biological media with a special emphasis on revealing ranges of electrode morphologies that mutually enhance the limit of detection and biofouling resilience. We expect this technique to assist in the development of high-performance biosensors for point-of-care diagnostics and facilitate studies on the electrode structure-property relationships in potential applications ranging from neural electrodes to catalysts.
Ghosh, Manik C.; Ray, Arun K.
2013-01-01
Cytochrome P450 is a superfamily of membrane-bound hemoprotein that gets involved with the degradation of xenobiotics and internal metabolites. Accumulated body of evidence indicates that phospholipids play a crucial role in determining the enzymatic activity of cytochrome P450 in the microenvironment by modulating its structure during detoxification; however, the structure-function relationship of cytochrome P4501A, a family of enzymes responsible for degrading lipophilic aromatic hydrocarbons, is still not well defined. Inducibility of cytochrome P4501A in cultured catfish hepatocytes in response to carbofuran, a widely used pesticide around the world, was studied earlier in our laboratory. In this present investigation, we observed that treating catfish with carbofuran augmented total phospholipid in the liver. We examined the role of phospholipid on the of cytochrome P4501A-marker enzyme which is known as ethoxyresorufin-O-deethylase (EROD) in the context of structure and function. We purified the carbofuran-induced cytochrome P4501A protein from catfish liver. Subsequently, we examined the enzymatic activity of purified P4501A protein in the presence of phospholipid, and studied how the structure of purified protein was influenced in the phospholipid environment. Membrane phospholipid appeared to accelerate the enzymatic activity of EROD by changing its structural conformation and thus controlling the detoxification of xenobiotics. Our study revealed the missing link of how the cytochrome P450 restores its enzymatic activity by changing its structural conformation in the phospholipid microenvironment. PMID:23469105
Ghosh, Manik C; Ray, Arun K
2013-01-01
Cytochrome P450 is a superfamily of membrane-bound hemoprotein that gets involved with the degradation of xenobiotics and internal metabolites. Accumulated body of evidence indicates that phospholipids play a crucial role in determining the enzymatic activity of cytochrome P450 in the microenvironment by modulating its structure during detoxification; however, the structure-function relationship of cytochrome P4501A, a family of enzymes responsible for degrading lipophilic aromatic hydrocarbons, is still not well defined. Inducibility of cytochrome P4501A in cultured catfish hepatocytes in response to carbofuran, a widely used pesticide around the world, was studied earlier in our laboratory. In this present investigation, we observed that treating catfish with carbofuran augmented total phospholipid in the liver. We examined the role of phospholipid on the of cytochrome P4501A-marker enzyme which is known as ethoxyresorufin-O-deethylase (EROD) in the context of structure and function. We purified the carbofuran-induced cytochrome P4501A protein from catfish liver. Subsequently, we examined the enzymatic activity of purified P4501A protein in the presence of phospholipid, and studied how the structure of purified protein was influenced in the phospholipid environment. Membrane phospholipid appeared to accelerate the enzymatic activity of EROD by changing its structural conformation and thus controlling the detoxification of xenobiotics. Our study revealed the missing link of how the cytochrome P450 restores its enzymatic activity by changing its structural conformation in the phospholipid microenvironment.
Modeling G Protein-Coupled Receptors: a Concrete Possibility
Costanzi, Stefano
2010-01-01
G protein-coupled receptors (GPCRs) are a large superfamily of membrane bound signaling proteins that are involved in the regulation of a wide range of physiological functions and constitute the most common target for therapeutic intervention. Due to the paucity of crystal structures, homology modeling has become a widespread technique for the construction of GPCR models, which have been applied to the study of their structure-function relationships and to the identification of lead ligands through virtual screening. Rhodopsin has been for years the only available template. However, recent breakthroughs in GPCR crystallography have led to the solution of the structures of a few additional receptors. In light of these newly elucidated crystal structures, we have been able to produce a substantial amount of data to demonstrate that accurate models of GPCRs in complex with their ligands can be constructed through homology modeling followed by fully flexible molecular docking. These results have been confirmed by our success in the first blind assessment of GPCR modeling and docking, organized in coordination with the solution of the X-ray structure of the adenosine A2A receptor. Taken together, these data indicate that: a) the transmembrane helical bundle can be modeled with considerable accuracy; b) predicting the binding mode of a ligand, although doable, is challenging; c) modeling of the extracellular and intracellular loops is still problematic. PMID:21253444
Brandariz-Nuñez, Alberto; Otero-Romero, Iria; Benavente, Javier; Martinez-Costas, Jose M
2011-09-20
We have recently developed a versatile tagging system (IC-tagging) that causes relocation of the tagged proteins to ARV muNS-derived intracellular globular inclusions. In the present study we demonstrate (i) that the IC-tag can be successfully fused either to the amino or carboxyl terminus of the protein to be tagged and (ii) that IC-tagged proteins are able to interact between them and perform complex reactions that require such interactions while integrated into muNS inclusions, increasing the versatility of the IC-tagging system. Also, our studies with the DsRed protein add some light on the structure/function relationship of the evolution of DsRed chromophore. Copyright © 2011 Elsevier B.V. All rights reserved.
The anatomy of mammalian sweet taste receptors.
Chéron, Jean-Baptiste; Golebiowski, Jérôme; Antonczak, Serge; Fiorucci, Sébastien
2017-02-01
All sweet-tasting compounds are detected by a single G-protein coupled receptor (GPCR), the heterodimer T1R2-T1R3, for which no experimental structure is available. The sweet taste receptor is a class C GPCR, and the recently published crystallographic structures of metabotropic glutamate receptor (mGluR) 1 and 5 provide a significant step forward for understanding structure-function relationships within this family. In this article, we recapitulate more than 600 single point site-directed mutations and available structural data to obtain a critical alignment of the sweet taste receptor sequences with respect to other class C GPCRs. Using this alignment, a homology 3D-model of the human sweet taste receptor is built and analyzed to dissect out the role of key residues involved in ligand binding and those responsible for receptor activation. Proteins 2017; 85:332-341. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
2016-01-01
Biologically active but floppy proteins represent a new reality of modern protein science. These intrinsically disordered proteins (IDPs) and hybrid proteins containing ordered and intrinsically disordered protein regions (IDPRs) constitute a noticeable part of any given proteome. Functionally, they complement ordered proteins, and their conformational flexibility and structural plasticity allow them to perform impossible tricks and be engaged in biological activities that are inaccessible to well folded proteins with their unique structures. The major goals of this minireview are to show that, despite their simplified amino acid sequences, IDPs/IDPRs are complex entities often resembling chaotic systems, are structurally and functionally heterogeneous, and can be considered an important part of the structure-function continuum. Furthermore, IDPs/IDPRs are everywhere, and are ubiquitously engaged in various interactions characterized by a wide spectrum of binding scenarios and an even wider spectrum of structural and functional outputs. PMID:26851286
Yang, Jing; Mei, Ying; Hook, Andrew L.; Taylor, Michael; Urquhart, Andrew J.; Bogatyrev, Said R.; Langer, Robert; Anderson, Daniel G.; Davies, Martyn C.; Alexander, Morgan R.
2010-01-01
High throughput materials discovery using combinatorial polymer microarrays to screen for new biomaterials with new and improved function is established as a powerful strategy. Here we combine this screening approach with high throughput surface characterisation (HT-SC) to identify surface structure-function relationships. We explore how this combination can help to identify surface chemical moieties that control protein adsorption and subsequent cellular response. The adhesion of human embryoid body (hEB) cells to a large number (496) of different acrylate polymers synthesized in a microarray format is screened using a high throughput procedure. To determine the role of the polymer surface properties on hEB cell adhesion, detailed HT-SC of these acrylate polymers is carried out using time of flight secondary ion mass spectrometry (ToF SIMS), x-ray photoelectron spectroscopy (XPS), pico litre drop sessile water contact angle (WCA) measurement and atomic force microscopy (AFM). A structure-function relationship is identified between the ToF SIMS analysis of the surface chemistry after a fibronectin (Fn) pre-conditioning step and the cell adhesion to each spot using the multivariate analysis technique partial least squares (PLS) regression. Secondary ions indicative of the adsorbed Fn correlate with increased cell adhesion whereas glycol and other functionalities from the polymers are identified that reduce cell adhesion. Furthermore, a strong relationship between the ToF SIMS spectra of bare polymers and the cell adhesion to each spot is identified using PLS regression. This identifies a role for both the surface chemistry of the bare polymer and the pre-adsorbed Fn, as-represented in the ToF SIMS spectra, in controlling cellular adhesion. In contrast, no relationship is found between cell adhesion and wettability, surface roughness, elemental or functional surface composition. The correlation between ToF SIMS data of the surfaces and the cell adhesion demonstrates the ability of identifying surface moieties that control protein adsorption and subsequent cell adhesion using ToF SIMS and multivariate analysis. PMID:20832108
NASA Astrophysics Data System (ADS)
Riest, Jonas; Nägele, Gerhard; Liu, Yun; Wagner, Norman J.; Godfrin, P. Douglas
2018-02-01
Recently, atypical static features of microstructural ordering in low-salinity lysozyme protein solutions have been extensively explored experimentally and explained theoretically based on a short-range attractive plus long-range repulsive (SALR) interaction potential. However, the protein dynamics and the relationship to the atypical SALR structure remain to be demonstrated. Here, the applicability of semi-analytic theoretical methods predicting diffusion properties and viscosity in isotropic particle suspensions to low-salinity lysozyme protein solutions is tested. Using the interaction potential parameters previously obtained from static structure factor measurements, our results of Monte Carlo simulations representing seven experimental lysoyzme samples indicate that they exist either in dispersed fluid or random percolated states. The self-consistent Zerah-Hansen scheme is used to describe the static structure factor, S(q), which is the input to our calculation schemes for the short-time hydrodynamic function, H(q), and the zero-frequency viscosity η. The schemes account for hydrodynamic interactions included on an approximate level. Theoretical predictions for H(q) as a function of the wavenumber q quantitatively agree with experimental results at small protein concentrations obtained using neutron spin echo measurements. At higher concentrations, qualitative agreement is preserved although the calculated hydrodynamic functions are overestimated. We attribute the differences for higher concentrations and lower temperatures to translational-rotational diffusion coupling induced by the shape and interaction anisotropy of particles and clusters, patchiness of the lysozyme particle surfaces, and the intra-cluster dynamics, features not included in our simple globular particle model. The theoretical results for the solution viscosity, η, are in qualitative agreement with our experimental data even at higher concentrations. We demonstrate that semi-quantitative predictions of diffusion properties and viscosity of solutions of globular proteins are possible given only the equilibrium structure factor of proteins. Furthermore, we explore the effects of changing the attraction strength on H(q) and η.
Lee, Yu Qi; Collins, Clare E.; Gordon, Adrienne; Rae, Kym M.; Pringle, Kirsty G.
2018-01-01
The intrauterine environment is critical for fetal growth and organ development. Evidence from animal models indicates that the developing kidney is vulnerable to suboptimal maternal nutrition and changes in health status. However, evidence from human studies are yet to be synthesised. Therefore, the aim of the current study was to systematically review current research on the relationship between maternal nutrition during pregnancy and offspring kidney structure and function in humans. A search of five databases identified 9501 articles, of which three experimental and seven observational studies met the inclusion criteria. Nutrients reviewed to date included vitamin A (n = 3), folate and vitamin B12 (n = 2), iron (n = 1), vitamin D (n = 1), total energy (n = 2) and protein (n = 1). Seven studies were assessed as being of “positive” and three of “neutral” quality. A variety of populations were studied, with limited studies investigating maternal nutrition during pregnancy, while measurements of offspring kidney outcomes were diverse across studies. There was a lack of consistency in the timing of follow-up for offspring kidney structure and/or function assessments, thus limiting comparability between studies. Deficiencies in maternal folate, vitamin A, and total energy during pregnancy were associated with detrimental impacts on kidney structure and function, measured by kidney volume, proteinuria, eGFRcystC and mean creatinine clearance in the offspring. Additional experimental and longitudinal prospective studies are warranted to confirm this relationship, especially in Indigenous populations where the risk of renal disease is greater. PMID:29466283
Connecting mitochondrial dynamics and life-or-death events via Bcl-2 family proteins.
Aouacheria, Abdel; Baghdiguian, Stephen; Lamb, Heather M; Huska, Jason D; Pineda, Fernando J; Hardwick, J Marie
2017-10-01
The morphology of a population of mitochondria is the result of several interacting dynamical phenomena, including fission, fusion, movement, elimination and biogenesis. Each of these phenomena is controlled by underlying molecular machinery, and when defective can cause disease. New understanding of the relationships between form and function of mitochondria in health and disease is beginning to be unraveled on several fronts. Studies in mammals and model organisms have revealed that mitochondrial morphology, dynamics and function appear to be subject to regulation by the same proteins that regulate apoptotic cell death. One protein family that influences mitochondrial dynamics in both healthy and dying cells is the Bcl-2 protein family. Connecting mitochondrial dynamics with life-death pathway forks may arise from the intersection of Bcl-2 family proteins with the proteins and lipids that determine mitochondrial shape and function. Bcl-2 family proteins also have multifaceted influences on cells and mitochondria, including calcium handling, autophagy and energetics, as well as the subcellular localization of mitochondrial organelles to neuronal synapses. The remarkable range of physical or functional interactions by Bcl-2 family proteins is challenging to assimilate into a cohesive understanding. Most of their effects may be distinct from their direct roles in apoptotic cell death and are particularly apparent in the nervous system. Dual roles in mitochondrial dynamics and cell death extend beyond BCL-2 family proteins. In this review, we discuss many processes that govern mitochondrial structure and function in health and disease, and how Bcl-2 family proteins integrate into some of these processes. Copyright © 2017 Elsevier Ltd. All rights reserved.