protein structure analyses: Topics by Science.gov

Sample records for protein structure analyses

PDBsum: Structural summaries of PDB entries.

PubMed

Laskowski, Roman A; Jabłońska, Jagoda; Pravda, Lukáš; Vařeková, Radka Svobodová; Thornton, Janet M

2018-01-01

PDBsum is a web server providing structural information on the entries in the Protein Data Bank (PDB). The analyses are primarily image-based and include protein secondary structure, protein-ligand and protein-DNA interactions, PROCHECK analyses of structural quality, and many others. The 3D structures can be viewed interactively in RasMol, PyMOL, and a JavaScript viewer called 3Dmol.js. Users can upload their own PDB files and obtain a set of password-protected PDBsum analyses for each. The server is freely accessible to all at: http://www.ebi.ac.uk/pdbsum. © 2017 The Protein Society.
A Circular Dichroism Reference Database for Membrane Proteins

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wallace,B.; Wien, F.; Stone, T.

2006-01-01

Membrane proteins are a major product of most genomes and the target of a large number of current pharmaceuticals, yet little information exists on their structures because of the difficulty of crystallising them; hence for the most part they have been excluded from structural genomics programme targets. Furthermore, even methods such as circular dichroism (CD) spectroscopy which seek to define secondary structure have not been fully exploited because of technical limitations to their interpretation for membrane embedded proteins. Empirical analyses of circular dichroism (CD) spectra are valuable for providing information on secondary structures of proteins. However, the accuracy of themore » results depends on the appropriateness of the reference databases used in the analyses. Membrane proteins have different spectral characteristics than do soluble proteins as a result of the low dielectric constants of membrane bilayers relative to those of aqueous solutions (Chen & Wallace (1997) Biophys. Chem. 65:65-74). To date, no CD reference database exists exclusively for the analysis of membrane proteins, and hence empirical analyses based on current reference databases derived from soluble proteins are not adequate for accurate analyses of membrane protein secondary structures (Wallace et al (2003) Prot. Sci. 12:875-884). We have therefore created a new reference database of CD spectra of integral membrane proteins whose crystal structures have been determined. To date it contains more than 20 proteins, and spans the range of secondary structures from mostly helical to mostly sheet proteins. This reference database should enable more accurate secondary structure determinations of membrane embedded proteins and will become one of the reference database options in the CD calculation server DICHROWEB (Whitmore & Wallace (2004) NAR 32:W668-673).« less
MD simulations of papillomavirus DNA-E2 protein complexes hints at a protein structural code for DNA deformation.

PubMed

Falconi, M; Oteri, F; Eliseo, T; Cicero, D O; Desideri, A

2008-08-01

The structural dynamics of the DNA binding domains of the human papillomavirus strain 16 and the bovine papillomavirus strain 1, complexed with their DNA targets, has been investigated by modeling, molecular dynamics simulations, and nuclear magnetic resonance analysis. The simulations underline different dynamical features of the protein scaffolds and a different mechanical interaction of the two proteins with DNA. The two protein structures, although very similar, show differences in the relative mobility of secondary structure elements. Protein structural analyses, principal component analysis, and geometrical and energetic DNA analyses indicate that the two transcription factors utilize a different strategy in DNA recognition and deformation. Results show that the protein indirect DNA readout is not only addressable to the DNA molecule flexibility but it is finely tuned by the mechanical and dynamical properties of the protein scaffold involved in the interaction.
Lessons on RNA Silencing Mechanisms in Plants from Eukaryotic Argonaute Structures[W

PubMed Central

Poulsen, Christian; Vaucheret, Hervé; Brodersen, Peter

2013-01-01

RNA silencing refers to a collection of gene regulatory mechanisms that use small RNAs for sequence specific repression. These mechanisms rely on ARGONAUTE (AGO) proteins that directly bind small RNAs and thereby constitute the central component of the RNA-induced silencing complex (RISC). AGO protein function has been probed extensively by mutational analyses, particularly in plants where large allelic series of several AGO proteins have been isolated. Structures of entire human and yeast AGO proteins have only very recently been obtained, and they allow more precise analyses of functional consequences of mutations obtained by forward genetics. To a large extent, these analyses support current models of regions of particular functional importance of AGO proteins. Interestingly, they also identify previously unrecognized parts of AGO proteins with profound structural and functional importance and provide the first hints at structural elements that have important functions specific to individual AGO family members. A particularly important outcome of the analysis concerns the evidence for existence of Gly-Trp (GW) repeat interactors of AGO proteins acting in the plant microRNA pathway. The parallel analysis of AGO structures and plant AGO mutations also suggests that such interactions with GW proteins may be a determinant of whether an endonucleolytically competent RISC is formed. PMID:23303917
Lessons on RNA silencing mechanisms in plants from eukaryotic argonaute structures.

PubMed

Poulsen, Christian; Vaucheret, Hervé; Brodersen, Peter

2013-01-01

RNA silencing refers to a collection of gene regulatory mechanisms that use small RNAs for sequence specific repression. These mechanisms rely on ARGONAUTE (AGO) proteins that directly bind small RNAs and thereby constitute the central component of the RNA-induced silencing complex (RISC). AGO protein function has been probed extensively by mutational analyses, particularly in plants where large allelic series of several AGO proteins have been isolated. Structures of entire human and yeast AGO proteins have only very recently been obtained, and they allow more precise analyses of functional consequences of mutations obtained by forward genetics. To a large extent, these analyses support current models of regions of particular functional importance of AGO proteins. Interestingly, they also identify previously unrecognized parts of AGO proteins with profound structural and functional importance and provide the first hints at structural elements that have important functions specific to individual AGO family members. A particularly important outcome of the analysis concerns the evidence for existence of Gly-Trp (GW) repeat interactors of AGO proteins acting in the plant microRNA pathway. The parallel analysis of AGO structures and plant AGO mutations also suggests that such interactions with GW proteins may be a determinant of whether an endonucleolytically competent RISC is formed.
Implication of the cause of differences in 3D structures of proteins with high sequence identity based on analyses of amino acid sequences and 3D structures.

PubMed

Matsuoka, Masanari; Sugita, Masatake; Kikuchi, Takeshi

2014-09-18

Proteins that share a high sequence homology while exhibiting drastically different 3D structures are investigated in this study. Recently, artificial proteins related to the sequences of the GA and IgG binding GB domains of human serum albumin have been designed. These artificial proteins, referred to as GA and GB, share 98% amino acid sequence identity but exhibit different 3D structures, namely, a 3α bundle versus a 4β + α structure. Discriminating between their 3D structures based on their amino acid sequences is a very difficult problem. In the present work, in addition to using bioinformatics techniques, an analysis based on inter-residue average distance statistics is used to address this problem. It was hard to distinguish which structure a given sequence would take only with the results of ordinary analyses like BLAST and conservation analyses. However, in addition to these analyses, with the analysis based on the inter-residue average distance statistics and our sequence tendency analysis, we could infer which part would play an important role in its structural formation. The results suggest possible determinants of the different 3D structures for sequences with high sequence identity. The possibility of discriminating between the 3D structures based on the given sequences is also discussed.
Effects of autoclaving and high pressure on allergenicity of hazelnut proteins

PubMed Central

2012-01-01

Background Hazelnut is reported as a causative agent of allergic reactions. However it is also an edible nut with health benefits. The allergenic characteristics of hazelnut-samples after autoclaving (AC) and high-pressure (HHP) processing have been studied and are also presented here. Previous studies demonstrated that AC treatments were responsible for structural transformation of protein structure motifs. Thus, structural analyses of allergen proteins from hazelnut were carried out to observe what is occurring in relation to the specific-IgE recognition of the related allergenic proteins. The aims of this work are to evaluate the effect of AC and HHP processing on hazelnut in vitro allergenicity using human-sera and to analyse the complexity of hazelnut allergen-protein structures. Methods Hazelnut-samples were subjected to AC and HHP processing. The specific IgE- reactivity was studied in 15 allergic clinic-patients via western blotting analyses. A series of homology-based-bioinformatics 3D-models (Cora 1, Cora 8, Cora 9 and Cora 11) were generated for the antigens included in the study to analyse the co mplexity of their protein structure. This study is supported by the Declaration of Helsinki and subsequent ethical guidelines. Results A severe reduction in vitro in allergenicity to hazelnut after AC processing was observed in the allergic clinic-patients studied. The specific-IgE binding of some of the described immunoreactive hazelnut protein-bands: Cora 1 ~18KDa, Cora 8 ~9KDa, Cora 9 ~35-40KDa and Cora 11 ~47-48 KDa decreases. Furthermore a relevant glycosylation was assigned and visualized via structural analysis of proteins (3D-modelling) for the first time in the protein-allergen Cora 11 showing a new role which could open a new door for allergenicity-unravellings. Conclusion Hazelnut allergenicity-studies in vivo via Prick-Prick and other means using AC processing are crucial to verify the data we observed via in vitro analyses. Glycosylation studies provided us with clues to elucidate, in the near future, mechanisms of the structures that contribute to hazelnut allergenicity, which thus, in turn, help alleviate food allergens. PMID:22616776
XLinkDB 2.0: integrated, large-scale structural analysis of protein crosslinking data

PubMed Central

Schweppe, Devin K.; Zheng, Chunxiang; Chavez, Juan D.; Navare, Arti T.; Wu, Xia; Eng, Jimmy K.; Bruce, James E.

2016-01-01

Motivation: Large-scale chemical cross-linking with mass spectrometry (XL-MS) analyses are quickly becoming a powerful means for high-throughput determination of protein structural information and protein–protein interactions. Recent studies have garnered thousands of cross-linked interactions, yet the field lacks an effective tool to compile experimental data or access the network and structural knowledge for these large scale analyses. We present XLinkDB 2.0 which integrates tools for network analysis, Protein Databank queries, modeling of predicted protein structures and modeling of docked protein structures. The novel, integrated approach of XLinkDB 2.0 enables the holistic analysis of XL-MS protein interaction data without limitation to the cross-linker or analytical system used for the analysis. Availability and Implementation: XLinkDB 2.0 can be found here, including documentation and help: http://xlinkdb.gs.washington.edu/. Contact: jimbruce@uw.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153666
Mathematical methods for protein science

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hart, W.; Istrail, S.; Atkins, J.

1997-12-31

Understanding the structure and function of proteins is a fundamental endeavor in molecular biology. Currently, over 100,000 protein sequences have been determined by experimental methods. The three dimensional structure of the protein determines its function, but there are currently less than 4,000 structures known to atomic resolution. Accordingly, techniques to predict protein structure from sequence have an important role in aiding the understanding of the Genome and the effects of mutations in genetic disease. The authors describe current efforts at Sandia to better understand the structure of proteins through rigorous mathematical analyses of simple lattice models. The efforts have focusedmore » on two aspects of protein science: mathematical structure prediction, and inverse protein folding.« less
@TOME-2: a new pipeline for comparative modeling of protein-ligand complexes.

PubMed

Pons, Jean-Luc; Labesse, Gilles

2009-07-01

@TOME 2.0 is new web pipeline dedicated to protein structure modeling and small ligand docking based on comparative analyses. @TOME 2.0 allows fold recognition, template selection, structural alignment editing, structure comparisons, 3D-model building and evaluation. These tasks are routinely used in sequence analyses for structure prediction. In our pipeline the necessary software is efficiently interconnected in an original manner to accelerate all the processes. Furthermore, we have also connected comparative docking of small ligands that is performed using protein-protein superposition. The input is a simple protein sequence in one-letter code with no comment. The resulting 3D model, protein-ligand complexes and structural alignments can be visualized through dedicated Web interfaces or can be downloaded for further studies. These original features will aid in the functional annotation of proteins and the selection of templates for molecular modeling and virtual screening. Several examples are described to highlight some of the new functionalities provided by this pipeline. The server and its documentation are freely available at http://abcis.cbs.cnrs.fr/AT2/
SBION: A Program for Analyses of Salt-Bridges from Multiple Structure Files.

PubMed

Gupta, Parth Sarthi Sen; Mondal, Sudipta; Mondal, Buddhadev; Islam, Rifat Nawaz Ul; Banerjee, Shyamashree; Bandyopadhyay, Amal K

2014-01-01

Salt-bridge and network salt-bridge are specific electrostatic interactions that contribute to the overall stability of proteins. In hierarchical protein folding model, these interactions play crucial role in nucleation process. The advent and growth of protein structure database and its availability in public domain made an urgent need for context dependent rapid analysis of salt-bridges. While these analyses on single protein is cumbersome and time-consuming, batch analyses need efficient software for rapid topological scan of a large number of protein for extracting details on (i) fraction of salt-bridge residues (acidic and basic). (ii) Chain specific intra-molecular salt-bridges, (iii) inter-molecular salt-bridges (protein-protein interactions) in all possible binary combinations (iv) network salt-bridges and (v) secondary structure distribution of salt-bridge residues. To the best of our knowledge, such efficient software is not available in public domain. At this juncture, we have developed a program i.e. SBION which can perform all the above mentioned computations for any number of protein with any number of chain at any given distance of ion-pair. It is highly efficient, fast, error-free and user friendly. Finally we would say that our SBION indeed possesses potential for applications in the field of structural and comparative bioinformatics studies. SBION is freely available for non-commercial/academic institutions on formal request to the corresponding author (akbanerjee@biotech.buruniv.ac.in).
The dual role of fragments in fragment-assembly methods for de novo protein structure prediction

PubMed Central

Handl, Julia; Knowles, Joshua; Vernon, Robert; Baker, David; Lovell, Simon C.

2013-01-01

In fragment-assembly techniques for protein structure prediction, models of protein structure are assembled from fragments of known protein structures. This process is typically guided by a knowledge-based energy function and uses a heuristic optimization method. The fragments play two important roles in this process: they define the set of structural parameters available, and they also assume the role of the main variation operators that are used by the optimiser. Previous analysis has typically focused on the first of these roles. In particular, the relationship between local amino acid sequence and local protein structure has been studied by a range of authors. The correlation between the two has been shown to vary with the window length considered, and the results of these analyses have informed directly the choice of fragment length in state-of-the-art prediction techniques. Here, we focus on the second role of fragments and aim to determine the effect of fragment length from an optimization perspective. We use theoretical analyses to reveal how the size and structure of the search space changes as a function of insertion length. Furthermore, empirical analyses are used to explore additional ways in which the size of the fragment insertion influences the search both in a simulation model and for the fragment-assembly technique, Rosetta. PMID:22095594
Mass spectrometry: Raw protein from the top down

NASA Astrophysics Data System (ADS)

Breuker, Kathrin

2018-02-01

Mass spectrometry is a powerful technique for analysing proteins, yet linking higher-order protein structure to amino acid sequence and post-translational modifications is far from simple. Now, a native top-down method has been developed that can provide information on higher-order protein structure and different proteoforms at the same time.
Classification of proteins: available structural space for molecular modeling.

PubMed

Andreeva, Antonina

2012-01-01

The wealth of available protein structural data provides unprecedented opportunity to study and better understand the underlying principles of protein folding and protein structure evolution. A key to achieving this lies in the ability to analyse these data and to organize them in a coherent classification scheme. Over the past years several protein classifications have been developed that aim to group proteins based on their structural relationships. Some of these classification schemes explore the concept of structural neighbourhood (structural continuum), whereas other utilize the notion of protein evolution and thus provide a discrete rather than continuum view of protein structure space. This chapter presents a strategy for classification of proteins with known three-dimensional structure. Steps in the classification process along with basic definitions are introduced. Examples illustrating some fundamental concepts of protein folding and evolution with a special focus on the exceptions to them are presented.
Deciphering the shape and deformation of secondary structures through local conformation analysis

PubMed Central

2011-01-01

Background Protein deformation has been extensively analysed through global methods based on RMSD, torsion angles and Principal Components Analysis calculations. Here we use a local approach, able to distinguish among the different backbone conformations within loops, α-helices and β-strands, to address the question of secondary structures' shape variation within proteins and deformation at interface upon complexation. Results Using a structural alphabet, we translated the 3 D structures of large sets of protein-protein complexes into sequences of structural letters. The shape of the secondary structures can be assessed by the structural letters that modeled them in the structural sequences. The distribution analysis of the structural letters in the three protein compartments (surface, core and interface) reveals that secondary structures tend to adopt preferential conformations that differ among the compartments. The local description of secondary structures highlights that curved conformations are preferred on the surface while straight ones are preferred in the core. Interfaces display a mixture of local conformations either preferred in core or surface. The analysis of the structural letters transition occurring between protein-bound and unbound conformations shows that the deformation of secondary structure is tightly linked to the compartment preference of the local conformations. Conclusion The conformation of secondary structures can be further analysed and detailed thanks to a structural alphabet which allows a better description of protein surface, core and interface in terms of secondary structures' shape and deformation. Induced-fit modification tendencies described here should be valuable information to identify and characterize regions under strong structural constraints for functional reasons. PMID:21284872
Deciphering the shape and deformation of secondary structures through local conformation analysis.

PubMed

Baussand, Julie; Camproux, Anne-Claude

2011-02-01

Protein deformation has been extensively analysed through global methods based on RMSD, torsion angles and Principal Components Analysis calculations. Here we use a local approach, able to distinguish among the different backbone conformations within loops, α-helices and β-strands, to address the question of secondary structures' shape variation within proteins and deformation at interface upon complexation. Using a structural alphabet, we translated the 3 D structures of large sets of protein-protein complexes into sequences of structural letters. The shape of the secondary structures can be assessed by the structural letters that modeled them in the structural sequences. The distribution analysis of the structural letters in the three protein compartments (surface, core and interface) reveals that secondary structures tend to adopt preferential conformations that differ among the compartments. The local description of secondary structures highlights that curved conformations are preferred on the surface while straight ones are preferred in the core. Interfaces display a mixture of local conformations either preferred in core or surface. The analysis of the structural letters transition occurring between protein-bound and unbound conformations shows that the deformation of secondary structure is tightly linked to the compartment preference of the local conformations. The conformation of secondary structures can be further analysed and detailed thanks to a structural alphabet which allows a better description of protein surface, core and interface in terms of secondary structures' shape and deformation. Induced-fit modification tendencies described here should be valuable information to identify and characterize regions under strong structural constraints for functional reasons.
Investigating Molecular Structures of Bio-Fuel and Bio-Oil Seeds as Predictors To Estimate Protein Bioavailability for Ruminants by Advanced Nondestructive Vibrational Molecular Spectroscopy.

PubMed

Ban, Yajing; L Prates, Luciana; Yu, Peiqiang

2017-10-18

This study was conducted to (1) determine protein and carbohydrate molecular structure profiles and (2) quantify the relationship between structural features and protein bioavailability of newly developed carinata and canola seeds for dairy cows by using Fourier transform infrared molecular spectroscopy. Results showed similarity in protein structural makeup within the entire protein structural region between carinata and canola seeds. The highest area ratios related to structural CHO, total CHO, and cellulosic compounds were obtained for carinata seeds. Carinata and canola seeds showed similar carbohydrate and protein molecular structures by multivariate analyses. Carbohydrate molecular structure profiles were highly correlated to protein rumen degradation and intestinal digestion characteristics. In conclusion, the molecular spectroscopy can detect inherent structural characteristics in carinata and canola seeds in which carbohydrate-relative structural features are related to protein metabolism and utilization. Protein and carbohydrate spectral profiles could be used as predictors of rumen protein bioavailability in cows.
CavityPlus: a web server for protein cavity detection with pharmacophore modelling, allosteric site identification and covalent ligand binding ability prediction.

PubMed

Xu, Youjun; Wang, Shiwei; Hu, Qiwan; Gao, Shuaishi; Ma, Xiaomin; Zhang, Weilin; Shen, Yihang; Chen, Fangjin; Lai, Luhua; Pei, Jianfeng

2018-05-10

CavityPlus is a web server that offers protein cavity detection and various functional analyses. Using protein three-dimensional structural information as the input, CavityPlus applies CAVITY to detect potential binding sites on the surface of a given protein structure and rank them based on ligandability and druggability scores. These potential binding sites can be further analysed using three submodules, CavPharmer, CorrSite, and CovCys. CavPharmer uses a receptor-based pharmacophore modelling program, Pocket, to automatically extract pharmacophore features within cavities. CorrSite identifies potential allosteric ligand-binding sites based on motion correlation analyses between cavities. CovCys automatically detects druggable cysteine residues, which is especially useful to identify novel binding sites for designing covalent allosteric ligands. Overall, CavityPlus provides an integrated platform for analysing comprehensive properties of protein binding cavities. Such analyses are useful for many aspects of drug design and discovery, including target selection and identification, virtual screening, de novo drug design, and allosteric and covalent-binding drug design. The CavityPlus web server is freely available at http://repharma.pku.edu.cn/cavityplus or http://www.pkumdl.cn/cavityplus.
Ser/Thr Motifs in Transmembrane Proteins: Conservation Patterns and Effects on Local Protein Structure and Dynamics

PubMed Central

del Val, Coral; White, Stephen H.

2014-01-01

We combined systematic bioinformatics analyses and molecular dynamics simulations to assess the conservation patterns of Ser and Thr motifs in membrane proteins, and the effect of such motifs on the structure and dynamics of α-helical transmembrane (TM) segments. We find that Ser/Thr motifs are often present in β-barrel TM proteins. At least one Ser/Thr motif is present in almost half of the sequences of α-helical proteins analyzed here. The extensive bioinformatics analyses and inspection of protein structures led to the identification of molecular transporters with noticeable numbers of Ser/Thr motifs within the TM region. Given the energetic penalty for burying multiple Ser/Thr groups in the membrane hydrophobic core, the observation of transporters with multiple membrane-embedded Ser/Thr is intriguing and raises the question of how the presence of multiple Ser/Thr affects protein local structure and dynamics. Molecular dynamics simulations of four different Ser-containing model TM peptides indicate that backbone hydrogen bonding of membrane-buried Ser/Thr hydroxyl groups can significantly change the local structure and dynamics of the helix. Ser groups located close to the membrane interface can hydrogen bond to solvent water instead of protein backbone, leading to an enhanced local solvation of the peptide. PMID:22836667
PDBsum new things.

PubMed

Laskowski, Roman A

2009-01-01

PDBsum (http://www.ebi.ac.uk/pdbsum) provides summary information about each experimentally determined structural model in the Protein Data Bank (PDB). Here we describe some of its most recent features, including figures from the structure's key reference, citation data, Pfam domain diagrams, topology diagrams and protein-protein interactions. Furthermore, it now accepts users' own PDB format files and generates a private set of analyses for each uploaded structure.

Applying graph theory to protein structures: an atlas of coiled coils.

PubMed

Heal, Jack W; Bartlett, Gail J; Wood, Christopher W; Thomson, Andrew R; Woolfson, Derek N

2018-05-02

To understand protein structure, folding and function fully and to design proteins de novo reliably, we must learn from natural protein structures that have been characterised experimentally. The number of protein structures available is large and growing exponentially, which makes this task challenging. Indeed, computational resources are becoming increasingly important for classifying and analysing this resource. Here, we use tools from graph theory to define an atlas classification scheme for automatically categorising certain protein substructures. Focusing on the α-helical coiled coils, which are ubiquitous protein-structure and protein-protein interaction motifs, we present a suite of computational resources designed for analysing these assemblies. iSOCKET enables interactive analysis of side-chain packing within proteins to identify coiled coils automatically and with considerable user control. Applying a graph theory-based atlas classification scheme to structures identified by iSOCKET gives the Atlas of Coiled Coils, a fully automated, updated overview of extant coiled coils. The utility of this approach is illustrated with the first formal classification of an emerging subclass of coiled coils called α-helical barrels. Furthermore, in the Atlas, the known coiled-coil universe is presented alongside a partial enumeration of the 'dark matter' of coiled-coil structures; i.e., those coiled-coil architectures that are theoretically possible but have not been observed to date, and thus present defined targets for protein design. iSOCKET is available as part of the open-source GitHub repository associated with this work (https://github.com/woolfson-group/isocket). This repository also contains all the data generated when classifying the protein graphs. The Atlas of Coiled Coils is available at: http://coiledcoils.chm.bris.ac.uk/atlas/app.
Local Structural Differences in Homologous Proteins: Specificities in Different SCOP Classes

PubMed Central

Joseph, Agnel Praveen; Valadié, Hélène; Srinivasan, Narayanaswamy; de Brevern, Alexandre G.

2012-01-01

The constant increase in the number of solved protein structures is of great help in understanding the basic principles behind protein folding and evolution. 3-D structural knowledge is valuable in designing and developing methods for comparison, modelling and prediction of protein structures. These approaches for structure analysis can be directly implicated in studying protein function and for drug design. The backbone of a protein structure favours certain local conformations which include α-helices, β-strands and turns. Libraries of limited number of local conformations (Structural Alphabets) were developed in the past to obtain a useful categorization of backbone conformation. Protein Block (PB) is one such Structural Alphabet that gave a reasonable structure approximation of 0.42 Å. In this study, we use PB description of local structures to analyse conformations that are preferred sites for structural variations and insertions, among group of related folds. This knowledge can be utilized in improving tools for structure comparison that work by analysing local structure similarities. Conformational differences between homologous proteins are known to occur often in the regions comprising turns and loops. Interestingly, these differences are found to have specific preferences depending upon the structural classes of proteins. Such class-specific preferences are mainly seen in the all-β class with changes involving short helical conformations and hairpin turns. A test carried out on a benchmark dataset also indicates that the use of knowledge on the class specific variations can improve the performance of a PB based structure comparison approach. The preference for the indel sites also seem to be confined to a few backbone conformations involving β-turns and helix C-caps. These are mainly associated with short loops joining the regular secondary structures that mediate a reversal in the chain direction. Rare β-turns of type I’ and II’ are also identified as preferred sites for insertions. PMID:22745680
DOE Office of Scientific and Technical Information (OSTI.GOV)

Chang, C.; Coggill, P.; Bateman, A.

Many Gram-positive lactic acid bacteria (LAB) produce anti-bacterial peptides and small proteins called bacteriocins, which enable them to compete against other bacteria in the environment. These peptides fall structurally into three different classes, I, II, III, with class IIa being pediocin-like single entities and class IIb being two-peptide bacteriocins. Self-protective cognate immunity proteins are usually co-transcribed with these toxins. Several examples of cognates for IIa have already been solved structurally. Streptococcus pyogenes, closely related to LAB, is one of the most common human pathogens, so knowledge of how it competes against other LAB species is likely to prove invaluable. Wemore » have solved the crystal structure of the gene-product of locus Spy-2152 from S. pyogenes, (PDB: 2fu2), and found it to comprise an anti-parallel four-helix bundle that is structurally similar to other bacteriocin immunity proteins. Sequence analyses indicate this protein to be a possible immunity protein protective against class IIa or IIb bacteriocins. However, given that S. pyogenes appears to lack any IIa pediocin-like proteins but does possess class IIb bacteriocins, we suggest this protein confers immunity to IIb-like peptides. Combined structural, genomic and proteomic analyses have allowed the identification and in silico characterization of a new putative immunity protein from S. pyogenes, possibly the first structure of an immunity protein protective against potential class IIb two-peptide bacteriocins. We have named the two pairs of putative bacteriocins found in S. pyogenes pyogenecin 1, 2, 3 and 4.« less
WEBnm@ v2.0: Web server and services for comparing protein flexibility.

PubMed

Tiwari, Sandhya P; Fuglebakk, Edvin; Hollup, Siv M; Skjærven, Lars; Cragnolini, Tristan; Grindhaug, Svenn H; Tekle, Kidane M; Reuter, Nathalie

2014-12-30

Normal mode analysis (NMA) using elastic network models is a reliable and cost-effective computational method to characterise protein flexibility and by extension, their dynamics. Further insight into the dynamics-function relationship can be gained by comparing protein motions between protein homologs and functional classifications. This can be achieved by comparing normal modes obtained from sets of evolutionary related proteins. We have developed an automated tool for comparative NMA of a set of pre-aligned protein structures. The user can submit a sequence alignment in the FASTA format and the corresponding coordinate files in the Protein Data Bank (PDB) format. The computed normalised squared atomic fluctuations and atomic deformation energies of the submitted structures can be easily compared on graphs provided by the web user interface. The web server provides pairwise comparison of the dynamics of all proteins included in the submitted set using two measures: the Root Mean Squared Inner Product and the Bhattacharyya Coefficient. The Comparative Analysis has been implemented on our web server for NMA, WEBnm@, which also provides recently upgraded functionality for NMA of single protein structures. This includes new visualisations of protein motion, visualisation of inter-residue correlations and the analysis of conformational change using the overlap analysis. In addition, programmatic access to WEBnm@ is now available through a SOAP-based web service. Webnm@ is available at http://apps.cbu.uib.no/webnma . WEBnm@ v2.0 is an online tool offering unique capability for comparative NMA on multiple protein structures. Along with a convenient web interface, powerful computing resources, and several methods for mode analyses, WEBnm@ facilitates the assessment of protein flexibility within protein families and superfamilies. These analyses can give a good view of how the structures move and how the flexibility is conserved over the different structures.
TP Atlas: integration and dissemination of advances in Targeted Proteins Research Program (TPRP)-structural biology project phase II in Japan.

PubMed

Iwayanagi, Takao; Miyamoto, Sei; Konno, Takeshi; Mizutani, Hisashi; Hirai, Tomohiro; Shigemoto, Yasumasa; Gojobori, Takashi; Sugawara, Hideaki

2012-09-01

The Targeted Proteins Research Program (TPRP) promoted by the Ministry of Education, Culture, Sports, Science and Technology (MEXT) of Japan is the phase II of structural biology project (2007-2011) following the Protein 3000 Project (2002-2006) in Japan. While the phase I Protein 3000 Project put partial emphasis on the construction and maintenance of pipelines for structural analyses, the TPRP is dedicated to revealing the structures and functions of the targeted proteins that have great importance in both basic research and industrial applications. To pursue this objective, 35 Targeted Proteins (TP) Projects selected in the three areas of fundamental biology, medicine and pharmacology, and food and environment are tightly collaborated with 10 Advanced Technology (AT) Projects in the four fields of protein production, structural analyses, chemical library and screening, and information platform. Here, the outlines and achievements of the 35 TP Projects are summarized in the system named TP Atlas. Progress in the diversified areas is described in the modules of Graphical Summary, General Summary, Tabular Summary, and Structure Gallery of the TP Atlas in the standard and unified format. Advances in TP Projects owing to novel technologies stemmed from AT Projects and collaborative research among TP Projects are illustrated as a hallmark of the Program. The TP Atlas can be accessed at http://net.genes.nig.ac.jp/tpatlas/index_e.html .
Quantifying side-chain conformational variations in protein structure

PubMed Central

Miao, Zhichao; Cao, Yang

2016-01-01

Protein side-chain conformation is closely related to their biological functions. The side-chain prediction is a key step in protein design, protein docking and structure optimization. However, side-chain polymorphism comprehensively exists in protein as various types and has been long overlooked by side-chain prediction. But such conformational variations have not been quantitatively studied and the correlations between these variations and residue features are vague. Here, we performed statistical analyses on large scale data sets and found that the side-chain conformational flexibility is closely related to the exposure to solvent, degree of freedom and hydrophilicity. These analyses allowed us to quantify different types of side-chain variabilities in PDB. The results underscore that protein side-chain conformation prediction is not a single-answer problem, leading us to reconsider the assessment approaches of side-chain prediction programs. PMID:27845406
Quantifying side-chain conformational variations in protein structure

NASA Astrophysics Data System (ADS)

Miao, Zhichao; Cao, Yang

2016-11-01

Protein side-chain conformation is closely related to their biological functions. The side-chain prediction is a key step in protein design, protein docking and structure optimization. However, side-chain polymorphism comprehensively exists in protein as various types and has been long overlooked by side-chain prediction. But such conformational variations have not been quantitatively studied and the correlations between these variations and residue features are vague. Here, we performed statistical analyses on large scale data sets and found that the side-chain conformational flexibility is closely related to the exposure to solvent, degree of freedom and hydrophilicity. These analyses allowed us to quantify different types of side-chain variabilities in PDB. The results underscore that protein side-chain conformation prediction is not a single-answer problem, leading us to reconsider the assessment approaches of side-chain prediction programs.
Quantifying side-chain conformational variations in protein structure.

PubMed

Miao, Zhichao; Cao, Yang

2016-11-15

Protein side-chain conformation is closely related to their biological functions. The side-chain prediction is a key step in protein design, protein docking and structure optimization. However, side-chain polymorphism comprehensively exists in protein as various types and has been long overlooked by side-chain prediction. But such conformational variations have not been quantitatively studied and the correlations between these variations and residue features are vague. Here, we performed statistical analyses on large scale data sets and found that the side-chain conformational flexibility is closely related to the exposure to solvent, degree of freedom and hydrophilicity. These analyses allowed us to quantify different types of side-chain variabilities in PDB. The results underscore that protein side-chain conformation prediction is not a single-answer problem, leading us to reconsider the assessment approaches of side-chain prediction programs.
Amino acid pair- and triplet-wise groupings in the interior of α-helical segments in proteins.

PubMed

de Sousa, Miguel M; Munteanu, Cristian R; Pazos, Alejandro; Fonseca, Nuno A; Camacho, Rui; Magalhães, A L

2011-02-21

A statistical approach has been applied to analyse primary structure patterns at inner positions of α-helices in proteins. A systematic survey was carried out in a recent sample of non-redundant proteins selected from the Protein Data Bank, which were used to analyse α-helix structures for amino acid pairing patterns. Only residues more than three positions apart from both termini of the α-helix were considered as inner. Amino acid pairings i, i+k (k=1, 2, 3, 4, 5), were analysed and the corresponding 20×20 matrices of relative global propensities were constructed. An analysis of (i, i+4, i+8) and (i, i+3, i+4) triplet patterns was also performed. These analysis yielded information on a series of amino acid patterns (pairings and triplets) showing either high or low preference for α-helical motifs and suggested a novel approach to protein alphabet reduction. In addition, it has been shown that the individual amino acid propensities are not enough to define the statistical distribution of these patterns. Global pair propensities also depend on the type of pattern, its composition and orientation in the protein sequence. The data presented should prove useful to obtain and refine useful predictive rules which can further the development and fine-tuning of protein structure prediction algorithms and tools. Copyright Â© 2010 Elsevier Ltd. All rights reserved.
Protein flexibility in the light of structural alphabets

PubMed Central

Craveur, Pierrick; Joseph, Agnel P.; Esque, Jeremy; Narwani, Tarun J.; Noël, Floriane; Shinada, Nicolas; Goguet, Matthieu; Leonard, Sylvain; Poulain, Pierre; Bertrand, Olivier; Faure, Guilhem; Rebehmed, Joseph; Ghozlane, Amine; Swapna, Lakshmipuram S.; Bhaskara, Ramachandra M.; Barnoud, Jonathan; Téletchéa, Stéphane; Jallu, Vincent; Cerny, Jiri; Schneider, Bohdan; Etchebest, Catherine; Srinivasan, Narayanaswamy; Gelly, Jean-Christophe; de Brevern, Alexandre G.

2015-01-01

Protein structures are valuable tools to understand protein function. Nonetheless, proteins are often considered as rigid macromolecules while their structures exhibit specific flexibility, which is essential to complete their functions. Analyses of protein structures and dynamics are often performed with a simplified three-state description, i.e., the classical secondary structures. More precise and complete description of protein backbone conformation can be obtained using libraries of small protein fragments that are able to approximate every part of protein structures. These libraries, called structural alphabets (SAs), have been widely used in structure analysis field, from definition of ligand binding sites to superimposition of protein structures. SAs are also well suited to analyze the dynamics of protein structures. Here, we review innovative approaches that investigate protein flexibility based on SAs description. Coupled to various sources of experimental data (e.g., B-factor) and computational methodology (e.g., Molecular Dynamic simulation), SAs turn out to be powerful tools to analyze protein dynamics, e.g., to examine allosteric mechanisms in large set of structures in complexes, to identify order/disorder transition. SAs were also shown to be quite efficient to predict protein flexibility from amino-acid sequence. Finally, in this review, we exemplify the interest of SAs for studying flexibility with different cases of proteins implicated in pathologies and diseases. PMID:26075209
Structures of membrane proteins

PubMed Central

Vinothkumar, Kutti R.; Henderson, Richard

2010-01-01

In reviewing the structures of membrane proteins determined up to the end of 2009, we present in words and pictures the most informative examples from each family. We group the structures together according to their function and architecture to provide an overview of the major principles and variations on the most common themes. The first structures, determined 20 years ago, were those of naturally abundant proteins with limited conformational variability, and each membrane protein structure determined was a major landmark. With the advent of complete genome sequences and efficient expression systems, there has been an explosion in the rate of membrane protein structure determination, with many classes represented. New structures are published every month and more than 150 unique membrane protein structures have been determined. This review analyses the reasons for this success, discusses the challenges that still lie ahead, and presents a concise summary of the key achievements with illustrated examples selected from each class. PMID:20667175
ESBRI: a web server for evaluating salt bridges in proteins.

PubMed

Costantini, Susan; Colonna, Giovanni; Facchiano, Angelo M

2008-01-01

Salt bridges can play important roles in protein structure and function and have stabilizing and destabilizing effects in protein folding. ESBRI is a software available as web tool which analyses the salt bridges in a protein structure, starting from the atomic coordinates. In the case of protein complexes, the salt bridges between protein chains can be evaluated, as well as those among specific charged amino acids and the different protein subunits, in order to obtain useful information regard the protein-protein interaction. The service is available at the URL: http://bioinformatica.isa.cnr.it/ESBRI/
On the Role of Aggregation Prone Regions in Protein Evolution, Stability, and Enzymatic Catalysis: Insights from Diverse Analyses

PubMed Central

Buck, Patrick M.; Kumar, Sandeep; Singh, Satish K.

2013-01-01

The various roles that aggregation prone regions (APRs) are capable of playing in proteins are investigated here via comprehensive analyses of multiple non-redundant datasets containing randomly generated amino acid sequences, monomeric proteins, intrinsically disordered proteins (IDPs) and catalytic residues. Results from this study indicate that the aggregation propensities of monomeric protein sequences have been minimized compared to random sequences with uniform and natural amino acid compositions, as observed by a lower average aggregation propensity and fewer APRs that are shorter in length and more often punctuated by gate-keeper residues. However, evidence for evolutionary selective pressure to disrupt these sequence regions among homologous proteins is inconsistent. APRs are less conserved than average sequence identity among closely related homologues (≥80% sequence identity with a parent) but APRs are more conserved than average sequence identity among homologues that have at least 50% sequence identity with a parent. Structural analyses of APRs indicate that APRs are three times more likely to contain ordered versus disordered residues and that APRs frequently contribute more towards stabilizing proteins than equal length segments from the same protein. Catalytic residues and APRs were also found to be in structural contact significantly more often than expected by random chance. Our findings suggest that proteins have evolved by optimizing their risk of aggregation for cellular environments by both minimizing aggregation prone regions and by conserving those that are important for folding and function. In many cases, these sequence optimizations are insufficient to develop recombinant proteins into commercial products. Rational design strategies aimed at improving protein solubility for biotechnological purposes should carefully evaluate the contributions made by candidate APRs, targeted for disruption, towards protein structure and activity. PMID:24146608
@TOME-2: a new pipeline for comparative modeling of protein–ligand complexes

PubMed Central

Pons, Jean-Luc; Labesse, Gilles

2009-01-01

@TOME 2.0 is new web pipeline dedicated to protein structure modeling and small ligand docking based on comparative analyses. @TOME 2.0 allows fold recognition, template selection, structural alignment editing, structure comparisons, 3D-model building and evaluation. These tasks are routinely used in sequence analyses for structure prediction. In our pipeline the necessary software is efficiently interconnected in an original manner to accelerate all the processes. Furthermore, we have also connected comparative docking of small ligands that is performed using protein–protein superposition. The input is a simple protein sequence in one-letter code with no comment. The resulting 3D model, protein–ligand complexes and structural alignments can be visualized through dedicated Web interfaces or can be downloaded for further studies. These original features will aid in the functional annotation of proteins and the selection of templates for molecular modeling and virtual screening. Several examples are described to highlight some of the new functionalities provided by this pipeline. The server and its documentation are freely available at http://abcis.cbs.cnrs.fr/AT2/ PMID:19443448
Molecular modeling of the human sperm associated antigen 11 B (SPAG11B) proteins.

PubMed

Narmadha, Ganapathy; Yenugu, Suresh

2015-04-01

Antimicrobial proteins and peptides are ubiquitous in nature with diverse structural and biological properties. Among them, the human beta-defensins are known to contribute to the innate immune response. Besides the defensins, a number of defensin-like proteins and peptides are expressed in many organ systems including the male reproductive system. Some of the protein isoforms encoded by the sperm associated antigen 11B (SPAG11) gene in humans are beta-defensin-like and exhibit structure dependent and salt tolerant antimicrobial activity, besides contributing to sperm maturation. Though some of the functional roles of these proteins are reported, the structural and molecular features that contribute to their antimicrobial activity is not yet reported. In this study, using in silico tools, we report the three dimensional structure of the human SPAG11B proteins and their C-terminal peptides. web-based hydropathy, amphipathicity, and topology (WHAT) analyses and grand average of hydropathy (GRAVY) indices show that these proteins and peptides are amphipathic and highly hydrophilic. Self-optimized prediction method with alignment (SOPMA) analyses and circular dichroism data suggest that the secondary structure of these proteins and peptides primarily contain beta-sheet and random coil structure and alpha-helix to a lesser extent. Ramachandran plots show that majority of the amino acids in these proteins and peptides fall in the permissible regions, thus indicating stable structures. The secondary structure of SPAG11B isoforms and their peptides were not perturbed with increasing NaCl concentration (0-300 mM) and at different pH (3, 7, and 10), thus reinforcing our previously reported observation that their antimicrobial activity is salt tolerant. To the best of our knowledge, for the first time, results of our study provide vital information on the structural features of SPAG11B protein isoforms and their contribution to antimicrobial activity.
Dynamic regulation of GDP binding to G proteins revealed by magnetic field-dependent NMR relaxation analyses

PubMed Central

Toyama, Yuki; Kano, Hanaho; Mase, Yoko; Yokogawa, Mariko; Osawa, Masanori; Shimada, Ichio

2017-01-01

Heterotrimeric guanine-nucleotide-binding proteins (G proteins) serve as molecular switches in signalling pathways, by coupling the activation of cell surface receptors to intracellular responses. Mutations in the G protein α-subunit (Gα) that accelerate guanosine diphosphate (GDP) dissociation cause hyperactivation of the downstream effector proteins, leading to oncogenesis. However, the structural mechanism of the accelerated GDP dissociation has remained unclear. Here, we use magnetic field-dependent nuclear magnetic resonance relaxation analyses to investigate the structural and dynamic properties of GDP bound Gα on a microsecond timescale. We show that Gα rapidly exchanges between a ground-state conformation, which tightly binds to GDP and an excited conformation with reduced GDP affinity. The oncogenic D150N mutation accelerates GDP dissociation by shifting the equilibrium towards the excited conformation. PMID:28223697
Dynamic regulation of GDP binding to G proteins revealed by magnetic field-dependent NMR relaxation analyses.

PubMed

Toyama, Yuki; Kano, Hanaho; Mase, Yoko; Yokogawa, Mariko; Osawa, Masanori; Shimada, Ichio

2017-02-22

Heterotrimeric guanine-nucleotide-binding proteins (G proteins) serve as molecular switches in signalling pathways, by coupling the activation of cell surface receptors to intracellular responses. Mutations in the G protein α-subunit (Gα) that accelerate guanosine diphosphate (GDP) dissociation cause hyperactivation of the downstream effector proteins, leading to oncogenesis. However, the structural mechanism of the accelerated GDP dissociation has remained unclear. Here, we use magnetic field-dependent nuclear magnetic resonance relaxation analyses to investigate the structural and dynamic properties of GDP bound Gα on a microsecond timescale. We show that Gα rapidly exchanges between a ground-state conformation, which tightly binds to GDP and an excited conformation with reduced GDP affinity. The oncogenic D150N mutation accelerates GDP dissociation by shifting the equilibrium towards the excited conformation.
The Phyre2 web portal for protein modelling, prediction and analysis

PubMed Central

Kelley, Lawrence A; Mezulis, Stefans; Yates, Christopher M; Wass, Mark N; Sternberg, Michael JE

2017-01-01

Summary Phyre2 is a suite of tools available on the web to predict and analyse protein structure, function and mutations. The focus of Phyre2 is to provide biologists with a simple and intuitive interface to state-of-the-art protein bioinformatics tools. Phyre2 replaces Phyre, the original version of the server for which we previously published a protocol. In this updated protocol, we describe Phyre2, which uses advanced remote homology detection methods to build 3D models, predict ligand binding sites, and analyse the effect of amino-acid variants (e.g. nsSNPs) for a user’s protein sequence. Users are guided through results by a simple interface at a level of detail determined by them. This protocol will guide a user from submitting a protein sequence to interpreting the secondary and tertiary structure of their models, their domain composition and model quality. A range of additional available tools is described to find a protein structure in a genome, to submit large number of sequences at once and to automatically run weekly searches for proteins difficult to model. The server is available at http://www.sbg.bio.ic.ac.uk/phyre2. A typical structure prediction will be returned between 30mins and 2 hours after submission. PMID:25950237
Automation of NMR structure determination of proteins.

PubMed

Altieri, Amanda S; Byrd, R Andrew

2004-10-01

The automation of protein structure determination using NMR is coming of age. The tedious processes of resonance assignment, followed by assignment of NOE (nuclear Overhauser enhancement) interactions (now intertwined with structure calculation), assembly of input files for structure calculation, intermediate analyses of incorrect assignments and bad input data, and finally structure validation are all being automated with sophisticated software tools. The robustness of the different approaches continues to deal with problems of completeness and uniqueness; nevertheless, the future is very bright for automation of NMR structure generation to approach the levels found in X-ray crystallography. Currently, near completely automated structure determination is possible for small proteins, and the prospect for medium-sized and large proteins is good. Copyright 2004 Elsevier Ltd.
Integrating Structure to Protein-Protein Interaction Networks That Drive Metastasis to Brain and Lung in Breast Cancer

PubMed Central

Engin, H. Billur; Guney, Emre; Keskin, Ozlem; Oliva, Baldo; Gursoy, Attila

2013-01-01

Blocking specific protein interactions can lead to human diseases. Accordingly, protein interactions and the structural knowledge on interacting surfaces of proteins (interfaces) have an important role in predicting the genotype-phenotype relationship. We have built the phenotype specific sub-networks of protein-protein interactions (PPIs) involving the relevant genes responsible for lung and brain metastasis from primary tumor in breast cancer. First, we selected the PPIs most relevant to metastasis causing genes (seed genes), by using the “guilt-by-association” principle. Then, we modeled structures of the interactions whose complex forms are not available in Protein Databank (PDB). Finally, we mapped mutations to interface structures (real and modeled), in order to spot the interactions that might be manipulated by these mutations. Functional analyses performed on these sub-networks revealed the potential relationship between immune system-infectious diseases and lung metastasis progression, but this connection was not observed significantly in the brain metastasis. Besides, structural analyses showed that some PPI interfaces in both metastasis sub-networks are originating from microbial proteins, which in turn were mostly related with cell adhesion. Cell adhesion is a key mechanism in metastasis, therefore these PPIs may be involved in similar molecular pathways that are shared by infectious disease and metastasis. Finally, by mapping the mutations and amino acid variations on the interface regions of the proteins in the metastasis sub-networks we found evidence for some mutations to be involved in the mechanisms differentiating the type of the metastasis. PMID:24278371

Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification.

PubMed

Sinclair, Robert M; Ravantti, Janne J; Bamford, Dennis H

2017-04-15

Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids. Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. Copyright © 2017 Sinclair et al.
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification

PubMed Central

Sinclair, Robert M.; Ravantti, Janne J.

2017-01-01

ABSTRACT Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids. Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. PMID:28122979
Adaptive Covariation between the Coat and Movement Proteins of Prunus Necrotic Ringspot Virus

PubMed Central

Codoñer, Francisco M.; Fares, Mario A.; Elena, Santiago F.

2006-01-01

The relative functional and/or structural importance of different amino acid sites in a protein can be assessed by evaluating the selective constraints to which they have been subjected during the course of evolution. Here we explore such constraints at the linear and three-dimensional levels for the movement protein (MP) and coat protein (CP) encoded by RNA 3 of prunus necrotic ringspot ilarvirus (PNRSV). By a maximum-parsimony approach, the nucleotide sequences from 46 isolates of PNRSV varying in symptomatology, host tree, and geographic origin have been analyzed and sites under different selective pressures have been identified in both proteins. We have also performed covariation analyses to explore whether changes in certain amino acid sites condition subsequent variation in other sites of the same protein or the other protein. These covariation analyses shed light on which particular amino acids should be involved in the physical and functional interaction between MP and CP. Finally, we discuss these findings in the light of what is already known about the implication of certain sites and domains in structure and protein-protein and RNA-protein interactions. PMID:16731922
Adaptive covariation between the coat and movement proteins of prunus necrotic ringspot virus.

PubMed

Codoñer, Francisco M; Fares, Mario A; Elena, Santiago F

2006-06-01

The relative functional and/or structural importance of different amino acid sites in a protein can be assessed by evaluating the selective constraints to which they have been subjected during the course of evolution. Here we explore such constraints at the linear and three-dimensional levels for the movement protein (MP) and coat protein (CP) encoded by RNA 3 of prunus necrotic ringspot ilarvirus (PNRSV). By a maximum-parsimony approach, the nucleotide sequences from 46 isolates of PNRSV varying in symptomatology, host tree, and geographic origin have been analyzed and sites under different selective pressures have been identified in both proteins. We have also performed covariation analyses to explore whether changes in certain amino acid sites condition subsequent variation in other sites of the same protein or the other protein. These covariation analyses shed light on which particular amino acids should be involved in the physical and functional interaction between MP and CP. Finally, we discuss these findings in the light of what is already known about the implication of certain sites and domains in structure and protein-protein and RNA-protein interactions.
Robust, high-throughput solution structural analyses by small angle X-ray scattering (SAXS)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hura, Greg L.; Menon, Angeli L.; Hammel, Michal

2009-07-20

We present an efficient pipeline enabling high-throughput analysis of protein structure in solution with small angle X-ray scattering (SAXS). Our SAXS pipeline combines automated sample handling of microliter volumes, temperature and anaerobic control, rapid data collection and data analysis, and couples structural analysis with automated archiving. We subjected 50 representative proteins, mostly from Pyrococcus furiosus, to this pipeline and found that 30 were multimeric structures in solution. SAXS analysis allowed us to distinguish aggregated and unfolded proteins, define global structural parameters and oligomeric states for most samples, identify shapes and similar structures for 25 unknown structures, and determine envelopes formore » 41 proteins. We believe that high-throughput SAXS is an enabling technology that may change the way that structural genomics research is done.« less
A new carbamidemethyl-linked lanthanoid chelating tag for PCS NMR spectroscopy of proteins in living HeLa cells.

PubMed

Hikone, Yuya; Hirai, Go; Mishima, Masaki; Inomata, Kohsuke; Ikeya, Teppei; Arai, Souichiro; Shirakawa, Masahiro; Sodeoka, Mikiko; Ito, Yutaka

2016-10-01

Structural analyses of proteins under macromolecular crowding inside human cultured cells by in-cell NMR spectroscopy are crucial not only for explicit understanding of their cellular functions but also for applications in medical and pharmaceutical sciences. In-cell NMR experiments using human cultured cells however suffer from low sensitivity, thus pseudocontact shifts from protein-tagged paramagnetic lanthanoid ions, analysed using sensitive heteronuclear two-dimensional correlation NMR spectra, offer huge potential advantage in obtaining structural information over conventional NOE-based approaches. We synthesised a new lanthanoid-chelating tag (M8-CAM-I), in which the eight-fold, stereospecifically methylated DOTA (M8) scaffold was retained, while a stable carbamidemethyl (CAM) group was introduced as the functional group connecting to proteins. M8-CAM-I successfully fulfilled the requirements for in-cell NMR: high-affinity to lanthanoid, low cytotoxicity and the stability under reducing condition inside cells. Large PCSs for backbone N-H resonances observed for M8-CAM-tagged human ubiquitin mutant proteins, which were introduced into HeLa cells by electroporation, demonstrated that this approach readily provides the useful information enabling the determination of protein structures, relative orientations of domains and protein complexes within human cultured cells.
Complete genome sequence of 285P, a novel T7-like polyvalent E. coli bacteriophage.

PubMed

Xu, Bin; Ma, Xiangyu; Xiong, Hongyan; Li, Yafei

2014-06-01

Bacteriophages are considered potential biological agents for the control of infectious diseases and environmental disinfection. Here, we describe a novel T7-like polyvalent Escherichia coli bacteriophage, designated "285P," which can lyse several strains of E. coli. The genome, which consists of 39,270 base pairs with a G+C content of 48.73 %, was sequenced and annotated. Forty-three potential open reading frames were identified using bioinformatics tools. Based on whole-genome sequence comparison, phage 285P was identified as a novel strain of subgroup T7. It showed strongest sequence similarity to Kluyvera phage Kvp1. The phylogenetic analyses of both non-structural proteins (endonuclease gp3, amidase gp3.5, DNA primase/helicase gp4, DNA polymerase gp5, and exonuclease gp6) and structural protein (tail fiber protein gp17) led to the identification of 285P as T7-like phage. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis and matrix-assisted laser desorption/ionization time-of-flight mass spectrometric analyses verified the annotation of the structural proteins (major capsid protein gp10a, tail protein gp12, and tail fiber protein gp17).
Rebelling for a Reason: Protein Structural “Outliers”

PubMed Central

Arumugam, Gandhimathi; Nair, Anu G.; Hariharaputran, Sridhar; Ramanathan, Sowdhamini

2013-01-01

Analysis of structural variation in domain superfamilies can reveal constraints in protein evolution which aids protein structure prediction and classification. Structure-based sequence alignment of distantly related proteins, organized in PASS2 database, provides clues about structurally conserved regions among different functional families. Some superfamily members show large structural differences which are functionally relevant. This paper analyses the impact of structural divergence on function for multi-member superfamilies, selected from the PASS2 superfamily alignment database. Functional annotations within superfamilies, with structural outliers or ‘rebels’, are discussed in the context of structural variations. Overall, these data reinforce the idea that functional similarities cannot be extrapolated from mere structural conservation. The implication for fold-function prediction is that the functional annotations can only be inherited with very careful consideration, especially at low sequence identities. PMID:24073209
Amaranth, quinoa and chia protein isolates: Physicochemical and structural properties.

PubMed

López, Débora N; Galante, Micaela; Robson, María; Boeris, Valeria; Spelzini, Darío

2018-04-01

An increasing use of vegetable protein is required to support the production of protein-rich foods which can replace animal proteins in the human diet. Amaranth, chia and quinoa seeds contain proteins which have biological and functional properties that provide nutritional benefits due to their reasonably well-balanced aminoacid content. This review analyses these vegetable proteins and focuses on recent research on protein classification and isolation as well as structural characterization by means of fluorescence spectroscopy, surface hydrophobicity and differential scanning calorimetry. Isolation procedures have a profound influence on the structural properties of the proteins and, therefore, on their in vitro digestibility. The present article provides a comprehensive overview of the properties and characterization of these proteins. Copyright © 2017 Elsevier B.V. All rights reserved.
Tools to evaluate the conformation of protein products.

PubMed

Manta, Bruno; Obal, Gonzalo; Ricciardi, Alejandro; Pritsch, Otto; Denicola, Ana

2011-06-01

Production of recombinant proteins is a process intensively used in the research laboratory. In addition, the main biotechnology market products are recombinant proteins and monoclonal antibodies. The biological (and clinical) properties of the protein product strongly depend on the conformation of the polypeptide. Therefore, assessment of the correct conformation of the produced protein is crucial. There is no single method to assess every aspect of protein structure or function. Depending on the protein, the methods of choice vary. There are general methods to evaluate not only mass and primary sequence of the protein, but also higher-order structure. This review outlines the principal techniques for determining the conformation of a protein from structural (biophysical methods) to functional (in vitro binding assays) analyses. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Characterization of protein and carbohydrate mid-IR spectral features in crop residues

NASA Astrophysics Data System (ADS)

Xin, Hangshu; Zhang, Yonggen; Wang, Mingjun; Li, Zhongyu; Wang, Zhibo; Yu, Peiqiang

2014-08-01

To the best of our knowledge, a few studies have been conducted on inherent structure spectral traits related to biopolymers of crop residues. The objective of this study was to characterize protein and carbohydrate structure spectral features of three field crop residues (rice straw, wheat straw and millet straw) in comparison with two crop vines (peanut vine and pea vine) by using Fourier transform infrared spectroscopy (FTIR) technique with attenuated total reflectance (ATR). Also, multivariate analyses were performed on spectral data sets within the regions mainly related to protein and carbohydrate in this study. The results showed that spectral differences existed in mid-IR peak intensities that are mainly related to protein and carbohydrate among these crop residue samples. With regard to protein spectral profile, peanut vine showed the greatest mid-IR band intensities that are related to protein amide and protein secondary structures, followed by pea vine and the rest three field crop straws. The crop vines had 48-134% higher spectral band intensity than the grain straws in spectral features associated with protein. Similar trends were also found in the bands that are mainly related to structural carbohydrates (such as cellulosic compounds). However, the field crop residues had higher peak intensity in total carbohydrates region than the crop vines. Furthermore, spectral ratios varied among the residue samples, indicating that these five crop residues had different internal structural conformation. However, multivariate spectral analyses showed that structural similarities still exhibited among crop residues in the regions associated with protein biopolymers and carbohydrate. Further study is needed to find out whether there is any relationship between spectroscopic information and nutrition supply in various kinds of crop residue when fed to animals.
Characterization of protein and carbohydrate mid-IR spectral features in crop residues.

PubMed

Xin, Hangshu; Zhang, Yonggen; Wang, Mingjun; Li, Zhongyu; Wang, Zhibo; Yu, Peiqiang

2014-08-14

To the best of our knowledge, a few studies have been conducted on inherent structure spectral traits related to biopolymers of crop residues. The objective of this study was to characterize protein and carbohydrate structure spectral features of three field crop residues (rice straw, wheat straw and millet straw) in comparison with two crop vines (peanut vine and pea vine) by using Fourier transform infrared spectroscopy (FTIR) technique with attenuated total reflectance (ATR). Also, multivariate analyses were performed on spectral data sets within the regions mainly related to protein and carbohydrate in this study. The results showed that spectral differences existed in mid-IR peak intensities that are mainly related to protein and carbohydrate among these crop residue samples. With regard to protein spectral profile, peanut vine showed the greatest mid-IR band intensities that are related to protein amide and protein secondary structures, followed by pea vine and the rest three field crop straws. The crop vines had 48-134% higher spectral band intensity than the grain straws in spectral features associated with protein. Similar trends were also found in the bands that are mainly related to structural carbohydrates (such as cellulosic compounds). However, the field crop residues had higher peak intensity in total carbohydrates region than the crop vines. Furthermore, spectral ratios varied among the residue samples, indicating that these five crop residues had different internal structural conformation. However, multivariate spectral analyses showed that structural similarities still exhibited among crop residues in the regions associated with protein biopolymers and carbohydrate. Further study is needed to find out whether there is any relationship between spectroscopic information and nutrition supply in various kinds of crop residue when fed to animals. Copyright © 2014 Elsevier B.V. All rights reserved.
The determinants of bond angle variability in protein/peptide backbones: A comprehensive statistical/quantum mechanics analysis.

PubMed

Improta, Roberto; Vitagliano, Luigi; Esposito, Luciana

2015-11-01

The elucidation of the mutual influence between peptide bond geometry and local conformation has important implications for protein structure refinement, validation, and prediction. To gain insights into the structural determinants and the energetic contributions associated with protein/peptide backbone plasticity, we here report an extensive analysis of the variability of the peptide bond angles by combining statistical analyses of protein structures and quantum mechanics calculations on small model peptide systems. Our analyses demonstrate that all the backbone bond angles strongly depend on the peptide conformation and unveil the existence of regular trends as function of ψ and/or φ. The excellent agreement of the quantum mechanics calculations with the statistical surveys of protein structures validates the computational scheme here employed and demonstrates that the valence geometry of protein/peptide backbone is primarily dictated by local interactions. Notably, for the first time we show that the position of the H(α) hydrogen atom, which is an important parameter in NMR structural studies, is also dependent on the local conformation. Most of the trends observed may be satisfactorily explained by invoking steric repulsive interactions; in some specific cases the valence bond variability is also influenced by hydrogen-bond like interactions. Moreover, we can provide a reliable estimate of the energies involved in the interplay between geometry and conformations. © 2015 Wiley Periodicals, Inc.
Unfolding stabilities of two structurally similar proteins as probed by temperature-induced and force-induced molecular dynamics simulations.

PubMed

Gorai, Biswajit; Prabhavadhni, Arasu; Sivaraman, Thirunavukkarasu

2015-09-01

Unfolding stabilities of two homologous proteins, cardiotoxin III and short-neurotoxin (SNTX) belonging to three-finger toxin (TFT) superfamily, have been probed by means of molecular dynamics (MD) simulations. Combined analysis of data obtained from steered MD and all-atom MD simulations at various temperatures in near physiological conditions on the proteins suggested that overall structural stabilities of the two proteins were different from each other and the MD results are consistent with experimental data of the proteins reported in the literature. Rationalization for the differential structural stabilities of the structurally similar proteins has been chiefly attributed to the differences in the structural contacts between C- and N-termini regions in their three-dimensional structures, and the findings endorse the 'CN network' hypothesis proposed to qualitatively analyse the thermodynamic stabilities of proteins belonging to TFT superfamily of snake venoms. Moreover, the 'CN network' hypothesis has been revisited and the present study suggested that 'CN network' should be accounted in terms of 'structural contacts' and 'structural strengths' in order to precisely describe order of structural stabilities of TFTs.
Deciphering Cryptic Binding Sites on Proteins by Mixed-Solvent Molecular Dynamics.

PubMed

Kimura, S Roy; Hu, Hai Peng; Ruvinsky, Anatoly M; Sherman, Woody; Favia, Angelo D

2017-06-26

In recent years, molecular dynamics simulations of proteins in explicit mixed solvents have been applied to various problems in protein biophysics and drug discovery, including protein folding, protein surface characterization, fragment screening, allostery, and druggability assessment. In this study, we perform a systematic study on how mixtures of organic solvent probes in water can reveal cryptic ligand binding pockets that are not evident in crystal structures of apo proteins. We examine a diverse set of eight PDB proteins that show pocket opening induced by ligand binding and investigate whether solvent MD simulations on the apo structures can induce the binding site observed in the holo structures. The cosolvent simulations were found to induce conformational changes on the protein surface, which were characterized and compared with the holo structures. Analyses of the biological systems, choice of probes and concentrations, druggability of the resulting induced pockets, and application to drug discovery are discussed here.
MolTalk – a programming library for protein structures and structure analysis

PubMed Central

Diemand, Alexander V; Scheib, Holger

2004-01-01

Background Two of the mostly unsolved but increasingly urgent problems for modern biologists are a) to quickly and easily analyse protein structures and b) to comprehensively mine the wealth of information, which is distributed along with the 3D co-ordinates by the Protein Data Bank (PDB). Tools which address this issue need to be highly flexible and powerful but at the same time must be freely available and easy to learn. Results We present MolTalk, an elaborate programming language, which consists of the programming library libmoltalk implemented in Objective-C and the Smalltalk-based interpreter MolTalk. MolTalk combines the advantages of an easy to learn and programmable procedural scripting with the flexibility and power of a full programming language. An overview of currently available applications of MolTalk is given and with PDBChainSaw one such application is described in more detail. PDBChainSaw is a MolTalk-based parser and information extraction utility of PDB files. Weekly updates of the PDB are synchronised with PDBChainSaw and are available for free download from the MolTalk project page following the link to PDBChainSaw. For each chain in a protein structure, PDBChainSaw extracts the sequence from its co-ordinates and provides additional information from the PDB-file header section, such as scientific organism, compound name, and EC code. Conclusion MolTalk provides a rich set of methods to analyse and even modify experimentally determined or modelled protein structures. These methods vary in complexity and are thus suitable for beginners and advanced programmers alike. We envision MolTalk to be most valuable in the following applications: 1) To analyse protein structures repetitively in large-scale, i.e. to benchmark protein structure prediction methods or to evaluate structural models. The quality of the resulting 3D-models can be assessed by e.g. calculating a Ramachandran-Sasisekharan plot. 2) To quickly retrieve information for (a limited number of) macro-molecular structures, i.e. H-bonds, salt bridges, contacts between amino acids and ligands or at the interface between two chains. 3) To programme more complex structural bioinformatics software and to implement demanding algorithms through its portability to Objective-C, e.g. iMolTalk. 4) To be used as a front end to databases, e.g. PDBChainSaw. PMID:15096277
MolTalk--a programming library for protein structures and structure analysis.

PubMed

Diemand, Alexander V; Scheib, Holger

2004-04-19

Two of the mostly unsolved but increasingly urgent problems for modern biologists are a) to quickly and easily analyse protein structures and b) to comprehensively mine the wealth of information, which is distributed along with the 3D co-ordinates by the Protein Data Bank (PDB). Tools which address this issue need to be highly flexible and powerful but at the same time must be freely available and easy to learn. We present MolTalk, an elaborate programming language, which consists of the programming library libmoltalk implemented in Objective-C and the Smalltalk-based interpreter MolTalk. MolTalk combines the advantages of an easy to learn and programmable procedural scripting with the flexibility and power of a full programming language. An overview of currently available applications of MolTalk is given and with PDBChainSaw one such application is described in more detail. PDBChainSaw is a MolTalk-based parser and information extraction utility of PDB files. Weekly updates of the PDB are synchronised with PDBChainSaw and are available for free download from the MolTalk project page http://www.moltalk.org following the link to PDBChainSaw. For each chain in a protein structure, PDBChainSaw extracts the sequence from its co-ordinates and provides additional information from the PDB-file header section, such as scientific organism, compound name, and EC code. MolTalk provides a rich set of methods to analyse and even modify experimentally determined or modelled protein structures. These methods vary in complexity and are thus suitable for beginners and advanced programmers alike. We envision MolTalk to be most valuable in the following applications:1) To analyse protein structures repetitively in large-scale, i.e. to benchmark protein structure prediction methods or to evaluate structural models. The quality of the resulting 3D-models can be assessed by e.g. calculating a Ramachandran-Sasisekharan plot.2) To quickly retrieve information for (a limited number of) macro-molecular structures, i.e. H-bonds, salt bridges, contacts between amino acids and ligands or at the interface between two chains.3) To programme more complex structural bioinformatics software and to implement demanding algorithms through its portability to Objective-C, e.g. iMolTalk.4) To be used as a front end to databases, e.g. PDBChainSaw.
Comparative analyses of putative toxin gene homologs from an Old World viper, Daboia russelii

PubMed Central

Krishnan, Neeraja M.

2017-01-01

Availability of snake genome sequences has opened up exciting areas of research on comparative genomics and gene diversity. One of the challenges in studying snake genomes is the acquisition of biological material from live animals, especially from the venomous ones, making the process cumbersome and time-consuming. Here, we report comparative sequence analyses of putative toxin gene homologs from Russell’s viper (Daboia russelii) using whole-genome sequencing data obtained from shed skin. When compared with the major venom proteins in Russell’s viper studied previously, we found 45–100% sequence similarity between the venom proteins and their putative homologs in the skin. Additionally, comparative analyses of 20 putative toxin gene family homologs provided evidence of unique sequence motifs in nerve growth factor (NGF), platelet derived growth factor (PDGF), Kunitz/Bovine pancreatic trypsin inhibitor (Kunitz BPTI), cysteine-rich secretory proteins, antigen 5, andpathogenesis-related1 proteins (CAP) and cysteine-rich secretory protein (CRISP). In those derived proteins, we identified V11 and T35 in the NGF domain; F23 and A29 in the PDGF domain; N69, K2 and A5 in the CAP domain; and Q17 in the CRISP domain to be responsible for differences in the largest pockets across the protein domain structures in crotalines, viperines and elapids from the in silico structure-based analysis. Similarly, residues F10, Y11 and E20 appear to play an important role in the protein structures across the kunitz protein domain of viperids and elapids. Our study highlights the usefulness of shed skin in obtaining good quality high-molecular weight DNA for comparative genomic studies, and provides evidence towards the unique features and evolution of putative venom gene homologs in vipers. PMID:29230357
Inferences from structural comparison: flexibility, secondary structure wobble and sequence alignment optimization.

PubMed

Zhang, Gaihua; Su, Zhen

2012-01-01

Work on protein structure prediction is very useful in biological research. To evaluate their accuracy, experimental protein structures or their derived data are used as the 'gold standard'. However, as proteins are dynamic molecular machines with structural flexibility such a standard may be unreliable. To investigate the influence of the structure flexibility, we analysed 3,652 protein structures of 137 unique sequences from 24 protein families. The results showed that (1) the three-dimensional (3D) protein structures were not rigid: the root-mean-square deviation (RMSD) of the backbone Cα of structures with identical sequences was relatively large, with the average of the maximum RMSD from each of the 137 sequences being 1.06 Å; (2) the derived data of the 3D structure was not constant, e.g. the highest ratio of the secondary structure wobble site was 60.69%, with the sequence alignments from structural comparisons of two proteins in the same family sometimes being completely different. Proteins may have several stable conformations and the data derived from resolved structures as a 'gold standard' should be optimized before being utilized as criteria to evaluate the prediction methods, e.g. sequence alignment from structural comparison. Helix/β-sheet transition exists in normal free proteins. The coil ratio of the 3D structure could affect its resolution as determined by X-ray crystallography.
Three reasons protein disorder analysis makes more sense in the light of collagen

PubMed Central

Oates, Matt E.; Tompa, Peter; Gough, Julian

2016-01-01

Abstract We have identified that the collagen helix has the potential to be disruptive to analyses of intrinsically disordered proteins. The collagen helix is an extended fibrous structure that is both promiscuous and repetitive. Whilst its sequence is predicted to be disordered, this type of protein structure is not typically considered as intrinsic disorder. Here, we show that collagen‐encoding proteins skew the distribution of exon lengths in genes. We find that previous results, demonstrating that exons encoding disordered regions are more likely to be symmetric, are due to the abundance of the collagen helix. Other related results, showing increased levels of alternative splicing in disorder‐encoding exons, still hold after considering collagen‐containing proteins. Aside from analyses of exons, we find that the set of proteins that contain collagen significantly alters the amino acid composition of regions predicted as disordered. We conclude that research in this area should be conducted in the light of the collagen helix. PMID:26941008

In Search of Functional Advantages of Knots in Proteins.

PubMed

Dabrowski-Tumanski, Pawel; Stasiak, Andrzej; Sulkowska, Joanna I

2016-01-01

We analysed the structure of deeply knotted proteins representing three unrelated families of knotted proteins. We looked at the correlation between positions of knotted cores in these proteins and such local structural characteristics as the number of intra-chain contacts, structural stability and solvent accessibility. We observed that the knotted cores and especially their borders showed strong enrichment in the number of contacts. These regions showed also increased thermal stability, whereas their solvent accessibility was decreased. Interestingly, the active sites within these knotted proteins preferentially located in the regions with increased number of contacts that also have increased thermal stability and decreased solvent accessibility. Our results suggest that knotting of polypeptide chains provides a favourable environment for the active sites observed in knotted proteins. Some knotted proteins have homologues without a knot. Interestingly, these unknotted homologues form local entanglements that retain structural characteristics of the knotted cores.
A protein relational database and protein family knowledge bases to facilitate structure-based design analyses.

PubMed

Mobilio, Dominick; Walker, Gary; Brooijmans, Natasja; Nilakantan, Ramaswamy; Denny, R Aldrin; Dejoannis, Jason; Feyfant, Eric; Kowticwar, Rupesh K; Mankala, Jyoti; Palli, Satish; Punyamantula, Sairam; Tatipally, Maneesh; John, Reji K; Humblet, Christine

2010-08-01

The Protein Data Bank is the most comprehensive source of experimental macromolecular structures. It can, however, be difficult at times to locate relevant structures with the Protein Data Bank search interface. This is particularly true when searching for complexes containing specific interactions between protein and ligand atoms. Moreover, searching within a family of proteins can be tedious. For example, one cannot search for some conserved residue as residue numbers vary across structures. We describe herein three databases, Protein Relational Database, Kinase Knowledge Base, and Matrix Metalloproteinase Knowledge Base, containing protein structures from the Protein Data Bank. In Protein Relational Database, atom-atom distances between protein and ligand have been precalculated allowing for millisecond retrieval based on atom identity and distance constraints. Ring centroids, centroid-centroid and centroid-atom distances and angles have also been included permitting queries for pi-stacking interactions and other structural motifs involving rings. Other geometric features can be searched through the inclusion of residue pair and triplet distances. In Kinase Knowledge Base and Matrix Metalloproteinase Knowledge Base, the catalytic domains have been aligned into common residue numbering schemes. Thus, by searching across Protein Relational Database and Kinase Knowledge Base, one can easily retrieve structures wherein, for example, a ligand of interest is making contact with the gatekeeper residue.
Structure of the higher plant light harvesting complex I: in vivo characterization and structural interdependence of the Lhca proteins.

PubMed

Klimmek, Frank; Ganeteg, Ulrika; Ihalainen, Janne A; van Roon, Henny; Jensen, Poul E; Scheller, Henrik V; Dekker, Jan P; Jansson, Stefan

2005-03-01

We have investigated the structure of the higher plant light harvesting complex of photosystem I (LHCI) by analyzing PSI-LHCI particles isolated from a set of Arabidopsis plant lines, each lacking a specific Lhca (Lhca1-4) polypeptide. Functional antenna size measurements support the recent finding that there are four Lhca proteins per PSI in the crystal structure [Ben-Shem, A., Frolow, F., and Nelson, N. (2003) Nature 426, 630-635]. According to HPLC analyses the number of pigment molecules bound within the LHCI is higher than expected from reconstitution studies or analyses of isolated native LHCI. Comparison of the spectra of the particles from the different lines reveals chlorophyll absorption bands peaking at 696, 688, 665, and 655 nm that are not present in isolated PSI or LHCI. These bands presumably originate from "gap" or "linker" pigments that are cooperatively coordinated by the Lhca and/or PSI proteins, which we have tentatively localized in the PSI-LHCI complex.
Identification of a Herbal Powder by Deoxyribonucleic Acid Barcoding and Structural Analyses.

PubMed

Sheth, Bhavisha P; Thaker, Vrinda S

2015-10-01

Authentic identification of plants is essential for exploiting their medicinal properties as well as to stop the adulteration and malpractices with the trade of the same. To identify a herbal powder obtained from a herbalist in the local vicinity of Rajkot, Gujarat, using deoxyribonucleic acid (DNA) barcoding and molecular tools. The DNA was extracted from a herbal powder and selected Cassia species, followed by the polymerase chain reaction (PCR) and sequencing of the rbcL barcode locus. Thereafter the sequences were subjected to National Center for Biotechnology Information (NCBI) basic local alignment search tool (BLAST) analysis, followed by the protein three-dimension structure determination of the rbcL protein from the herbal powder and Cassia species namely Cassia fistula, Cassia tora and Cassia javanica (sequences obtained in the present study), Cassia Roxburghii, and Cassia abbreviata (sequences retrieved from Genbank). Further, the multiple and pairwise structural alignment were carried out in order to identify the herbal powder. The nucleotide sequences obtained from the selected species of Cassia were submitted to Genbank (Accession No. JX141397, JX141405, JX141420). The NCBI BLAST analysis of the rbcL protein from the herbal powder showed an equal sequence similarity (with reference to different parameters like E value, maximum identity, total score, query coverage) to C. javanica and C. roxburghii. In order to solve the ambiguities of the BLAST result, a protein structural approach was implemented. The protein homology models obtained in the present study were submitted to the protein model database (PM0079748-PM0079753). The pairwise structural alignment of the herbal powder (as template) and C. javanica and C. roxburghii (as targets individually) revealed a close similarity of the herbal powder with C. javanica. A strategy as used here, incorporating the integrated use of DNA barcoding and protein structural analyses could be adopted, as a novel rapid and economic procedure, especially in cases when protein coding loci are considered. Authentic identification of plants is essential for exploiting their medicinal properties as well as to stop the adulteration and malpractices with the trade of the same. A herbal powder was obtained from a herbalist in the local vicinity of Rajkot, Gujarat. An integrated approach using DNA barcoding and structural analyses was carried out to identify the herbal powder. The herbal powder was identified as Cassia javanica L.
High-resolution X-ray crystal structure of bovine H-protein using the high-pressure cryocooling method.

PubMed

Higashiura, Akifumi; Ohta, Kazunori; Masaki, Mika; Sato, Masaru; Inaka, Koji; Tanaka, Hiroaki; Nakagawa, Atsushi

2013-11-01

Recently, many technical improvements in macromolecular X-ray crystallography have increased the number of structures deposited in the Protein Data Bank and improved the resolution limit of protein structures. Almost all high-resolution structures have been determined using a synchrotron radiation source in conjunction with cryocooling techniques, which are required in order to minimize radiation damage. However, optimization of cryoprotectant conditions is a time-consuming and difficult step. To overcome this problem, the high-pressure cryocooling method was developed (Kim et al., 2005) and successfully applied to many protein-structure analyses. In this report, using the high-pressure cryocooling method, the X-ray crystal structure of bovine H-protein was determined at 0.86 Å resolution. Structural comparisons between high- and ambient-pressure cryocooled crystals at ultra-high resolution illustrate the versatility of this technique. This is the first ultra-high-resolution X-ray structure obtained using the high-pressure cryocooling method.
Development of the field of structural physiology

PubMed Central

FUJIYOSHI, Yoshinori

2015-01-01

Electron crystallography is especially useful for studying the structure and function of membrane proteins — key molecules with important functions in neural and other cells. Electron crystallography is now an established technique for analyzing the structures of membrane proteins in lipid bilayers that closely simulate their natural biological environment. Utilizing cryo-electron microscopes with helium-cooled specimen stages that were developed through a personal motivation to understand the functions of neural systems from a structural point of view, the structures of membrane proteins can be analyzed at a higher than 3 Å resolution. This review covers four objectives. First, I introduce the new research field of structural physiology. Second, I recount some of the struggles involved in developing cryo-electron microscopes. Third, I review the structural and functional analyses of membrane proteins mainly by electron crystallography using cryo-electron microscopes. Finally, I discuss multifunctional channels named “adhennels” based on structures analyzed using electron and X-ray crystallography. PMID:26560835
The Identification and Functional Characterization of WxL Proteins from Enterococcus faecium Reveal Surface Proteins Involved in Extracellular Matrix Interactions

PubMed Central

Galloway-Peña, Jessica R.; Liang, Xiaowen; Singh, Kavindra V.; Yadav, Puja; Chang, Chungyu; La Rosa, Sabina Leanti; Shelburne, Samuel; Ton-That, Hung; Höök, Magnus

2014-01-01

The WxL domain recently has been identified as a novel cell wall binding domain found in numerous predicted proteins within multiple Gram-positive bacterial species. However, little is known about the function of proteins containing this novel domain. Here, we identify and characterize 6 Enterococcus faecium proteins containing the WxL domain which, by reverse transcription-PCR (RT-PCR) and genomic analyses, are located in three similarly organized operons, deemed WxL loci A, B, and C. Western blotting, electron microscopy, and enzyme-linked immunosorbent assays (ELISAs) determined that genes of WxL loci A and C encode antigenic, cell surface proteins exposed at higher levels in clinical isolates than in commensal isolates. Secondary structural analyses of locus A recombinant WxL domain-containing proteins found they are rich in β-sheet structure and disordered segments. Using Biacore analyses, we discovered that recombinant WxL proteins from locus A bind human extracellular matrix proteins, specifically type I collagen and fibronectin. Proteins encoded by locus A also were found to bind to each other, suggesting a novel cell surface complex. Furthermore, bile salt survival assays and animal models using a mutant from which all three WxL loci were deleted revealed the involvement of WxL operons in bile salt stress and endocarditis pathogenesis. In summary, these studies extend our understanding of proteins containing the WxL domain and their potential impact on colonization and virulence in E. faecium and possibly other Gram-positive bacterial species. PMID:25512313
Structural basis for amino acid export by DMT superfamily transporter YddG.

PubMed

Tsuchiya, Hirotoshi; Doki, Shintaro; Takemoto, Mizuki; Ikuta, Tatsuya; Higuchi, Takashi; Fukui, Keita; Usuda, Yoshihiro; Tabuchi, Eri; Nagatoishi, Satoru; Tsumoto, Kouhei; Nishizawa, Tomohiro; Ito, Koichi; Dohmae, Naoshi; Ishitani, Ryuichiro; Nureki, Osamu

2016-06-16

The drug/metabolite transporter (DMT) superfamily is a large group of membrane transporters ubiquitously found in eukaryotes, bacteria and archaea, and includes exporters for a remarkably wide range of substrates, such as toxic compounds and metabolites. YddG is a bacterial DMT protein that expels aromatic amino acids and exogenous toxic compounds, thereby contributing to cellular homeostasis. Here we present structural and functional analyses of YddG. Using liposome-based analyses, we show that Escherichia coli and Starkeya novella YddG export various amino acids. The crystal structure of S. novella YddG at 2.4 Å resolution reveals a new membrane transporter topology, with ten transmembrane segments in an outward-facing state. The overall structure is basket-shaped, with a large substrate-binding cavity at the centre of the molecule, and is composed of inverted structural repeats related by two-fold pseudo-symmetry. On the basis of this intramolecular symmetry, we propose a structural model for the inward-facing state and a mechanism of the conformational change for substrate transport, which we confirmed by biochemical analyses. These findings provide a structural basis for the mechanism of transport of DMT superfamily proteins.
Structure Prediction and Analysis of DNA Transposon and LINE Retrotransposon Proteins*

PubMed Central

Abrusán, György; Zhang, Yang; Szilágyi, András

2013-01-01

Despite the considerable amount of research on transposable elements, no large-scale structural analyses of the TE proteome have been performed so far. We predicted the structures of hundreds of proteins from a representative set of DNA and LINE transposable elements and used the obtained structural data to provide the first general structural characterization of TE proteins and to estimate the frequency of TE domestication and horizontal transfer events. We show that 1) ORF1 and Gag proteins of retrotransposons contain high amounts of structural disorder; thus, despite their very low conservation, the presence of disordered regions and probably their chaperone function is conserved. 2) The distribution of SCOP classes in DNA transposons and LINEs indicates that the proteins of DNA transposons are more ancient, containing folds that already existed when the first cellular organisms appeared. 3) DNA transposon proteins have lower contact order than randomly selected reference proteins, indicating rapid folding, most likely to avoid protein aggregation. 4) Structure-based searches for TE homologs indicate that the overall frequency of TE domestication events is low, whereas we found a relatively high number of cases where horizontal transfer, frequently involving parasites, is the most likely explanation for the observed homology. PMID:23530042
PDB@: an offline toolkit for exploration and analysis of PDB files.

PubMed

Mani, Udayakumar; Ravisankar, Sadhana; Ramakrishnan, Sai Mukund

2013-12-01

Protein Data Bank (PDB) is a freely accessible archive of the 3-D structural data of biological molecules. Structure based studies offers a unique vantage point in inferring the properties of a protein molecule from structural data. This is too big a task to be done manually. Moreover, there is no single tool, software or server that comprehensively analyses all structure-based properties. The objective of the present work is to develop an offline computational toolkit, PDB@ containing in-built algorithms that help categorizing the structural properties of a protein molecule. The user has the facility to view and edit the PDB file to his need. Some features of the present work are unique in itself and others are an improvement over existing tools. Also, the representation of protein properties in both graphical and textual formats helps in predicting all the necessary details of a protein molecule on a single platform.
SARS-unique fold in the Rousettus bat coronavirus HKU9.

PubMed

Hammond, Robert G; Tan, Xuan; Johnson, Margaret A

2017-09-01

The coronavirus nonstructural protein 3 (nsp3) is a multifunctional protein that comprises multiple structural domains. This protein assists viral polyprotein cleavage, host immune interference, and may play other roles in genome replication or transcription. Here, we report the solution NMR structure of a protein from the "SARS-unique region" of the bat coronavirus HKU9. The protein contains a frataxin fold or double-wing motif, which is an α + β fold that is associated with protein/protein interactions, DNA binding, and metal ion binding. High structural similarity to the human severe acute respiratory syndrome (SARS) coronavirus nsp3 is present. A possible functional site that is conserved among some betacoronaviruses has been identified using bioinformatics and biochemical analyses. This structure provides strong experimental support for the recent proposal advanced by us and others that the "SARS-unique" region is not unique to the human SARS virus, but is conserved among several different phylogenetic groups of coronaviruses and provides essential functions. © 2017 The Protein Society.
Structure-based Reassessment of the Caveolin Signaling Model: Do Caveolae Regulate Signaling Through Caveolin-Protein Interactions?

PubMed Central

Collins, Brett M.; Davis, Melissa J.; Hancock, John F.; Parton, Robert G.

2012-01-01

Summary Caveolin proteins drive formation of caveolae, specialized cell-surface microdomains that influence cell signaling. Signaling proteins are proposed to use conserved caveolin-binding motifs (CBMs) to associate with caveolae via the caveolin scaffolding domain (CSD). However, structural and bioinformatic analyses argue against such direct physical interactions: In the majority of signaling proteins, the CBM is buried and inaccessible. Putative CBMs do not form a common structure for caveolin recognition, are not enriched amongst caveolin-binding proteins, and are even more common in yeast, which lack caveolae. We propose that CBM/CSD-dependent interactions are unlikely to mediate caveolar signaling, and the basis for signaling effects should therefore be reassessed. PMID:22814599
MannDB: A microbial annotation database for protein characterization

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhou, C; Lam, M; Smith, J

2006-05-19

MannDB was created to meet a need for rapid, comprehensive automated protein sequence analyses to support selection of proteins suitable as targets for driving the development of reagents for pathogen or protein toxin detection. Because a large number of open-source tools were needed, it was necessary to produce a software system to scale the computations for whole-proteome analysis. Thus, we built a fully automated system for executing software tools and for storage, integration, and display of automated protein sequence analysis and annotation data. MannDB is a relational database that organizes data resulting from fully automated, high-throughput protein-sequence analyses using open-sourcemore » tools. Types of analyses provided include predictions of cleavage, chemical properties, classification, features, functional assignment, post-translational modifications, motifs, antigenicity, and secondary structure. Proteomes (lists of hypothetical and known proteins) are downloaded and parsed from Genbank and then inserted into MannDB, and annotations from SwissProt are downloaded when identifiers are found in the Genbank entry or when identical sequences are identified. Currently 36 open-source tools are run against MannDB protein sequences either on local systems or by means of batch submission to external servers. In addition, BLAST against protein entries in MvirDB, our database of microbial virulence factors, is performed. A web client browser enables viewing of computational results and downloaded annotations, and a query tool enables structured and free-text search capabilities. When available, links to external databases, including MvirDB, are provided. MannDB contains whole-proteome analyses for at least one representative organism from each category of biological threat organism listed by APHIS, CDC, HHS, NIAID, USDA, USFDA, and WHO. MannDB comprises a large number of genomes and comprehensive protein sequence analyses representing organisms listed as high-priority agents on the websites of several governmental organizations concerned with bio-terrorism. MannDB provides the user with a BLAST interface for comparison of native and non-native sequences and a query tool for conveniently selecting proteins of interest. In addition, the user has access to a web-based browser that compiles comprehensive and extensive reports.« less
Towards fully automated structure-based function prediction in structural genomics: a case study.

PubMed

Watson, James D; Sanderson, Steve; Ezersky, Alexandra; Savchenko, Alexei; Edwards, Aled; Orengo, Christine; Joachimiak, Andrzej; Laskowski, Roman A; Thornton, Janet M

2007-04-13

As the global Structural Genomics projects have picked up pace, the number of structures annotated in the Protein Data Bank as hypothetical protein or unknown function has grown significantly. A major challenge now involves the development of computational methods to assign functions to these proteins accurately and automatically. As part of the Midwest Center for Structural Genomics (MCSG) we have developed a fully automated functional analysis server, ProFunc, which performs a battery of analyses on a submitted structure. The analyses combine a number of sequence-based and structure-based methods to identify functional clues. After the first stage of the Protein Structure Initiative (PSI), we review the success of the pipeline and the importance of structure-based function prediction. As a dataset, we have chosen all structures solved by the MCSG during the 5 years of the first PSI. Our analysis suggests that two of the structure-based methods are particularly successful and provide examples of local similarity that is difficult to identify using current sequence-based methods. No one method is successful in all cases, so, through the use of a number of complementary sequence and structural approaches, the ProFunc server increases the chances that at least one method will find a significant hit that can help elucidate function. Manual assessment of the results is a time-consuming process and subject to individual interpretation and human error. We present a method based on the Gene Ontology (GO) schema using GO-slims that can allow the automated assessment of hits with a success rate approaching that of expert manual assessment.
Three-dimensional (3D) structure prediction and function analysis of the chitin-binding domain 3 protein HD73_3189 from Bacillus thuringiensis HD73.

PubMed

Zhan, Yiling; Guo, Shuyuan

2015-01-01

Bacillus thuringiensis (Bt) is capable of producing a chitin-binding protein believed to be functionally important to bacteria during the stationary phase of its growth cycle. In this paper, the chitin-binding domain 3 protein HD73_3189 from B. thuringiensis has been analyzed by computer technology. Primary and secondary structural analyses demonstrated that HD73_3189 is negatively charged and contains several α-helices, aperiodical coils and β-strands. Domain and motif analyses revealed that HD73_3189 contains a signal peptide, an N-terminal chitin binding 3 domains, two copies of a fibronectin-like domain 3 and a C-terminal carbohydrate binding domain classified as CBM_5_12. Moreover, analysis predicted the protein's associated localization site to be the cell wall. Ligand site prediction determined that amino acid residues GLU-312, TRP-334, ILE-341 and VAL-382 exposed on the surface of the target protein exhibit polar interactions with the substrate.
Similarity Measures for Protein Ensembles

PubMed Central

Lindorff-Larsen, Kresten; Ferkinghoff-Borg, Jesper

2009-01-01

Analyses of similarities and changes in protein conformation can provide important information regarding protein function and evolution. Many scores, including the commonly used root mean square deviation, have therefore been developed to quantify the similarities of different protein conformations. However, instead of examining individual conformations it is in many cases more relevant to analyse ensembles of conformations that have been obtained either through experiments or from methods such as molecular dynamics simulations. We here present three approaches that can be used to compare conformational ensembles in the same way as the root mean square deviation is used to compare individual pairs of structures. The methods are based on the estimation of the probability distributions underlying the ensembles and subsequent comparison of these distributions. We first validate the methods using a synthetic example from molecular dynamics simulations. We then apply the algorithms to revisit the problem of ensemble averaging during structure determination of proteins, and find that an ensemble refinement method is able to recover the correct distribution of conformations better than standard single-molecule refinement. PMID:19145244
Genetic analyses of bone morphogenetic protein 2, 4 and 7 in congenital combined pituitary hormone deficiency.

PubMed

Breitfeld, Jana; Martens, Susanne; Klammt, Jürgen; Schlicke, Marina; Pfäffle, Roland; Krause, Kerstin; Weidle, Kerstin; Schleinitz, Dorit; Stumvoll, Michael; Führer, Dagmar; Kovacs, Peter; Tönjes, Anke

2013-12-01

The complex process of development of the pituitary gland is regulated by a number of signalling molecules and transcription factors. Mutations in these factors have been identified in rare cases of congenital hypopituitarism but for most subjects with combined pituitary hormone deficiency (CPHD) genetic causes are unknown. Bone morphogenetic proteins (BMPs) affect induction and growth of the pituitary primordium and thus represent plausible candidates for mutational screening of patients with CPHD. We sequenced BMP2, 4 and 7 in 19 subjects with CPHD. For validation purposes, novel genetic variants were genotyped in 1046 healthy subjects. Additionally, potential functional relevance for most promising variants has been assessed by phylogenetic analyses and prediction of effects on protein structure. Sequencing revealed two novel variants and confirmed 30 previously known polymorphisms and mutations in BMP2, 4 and 7. Although phylogenetic analyses indicated that these variants map within strongly conserved gene regions, there was no direct support for their impact on protein structure when applying predictive bioinformatics tools. A mutation in the BMP4 coding region resulting in an amino acid exchange (p.Arg300Pro) appeared most interesting among the identified variants. Further functional analyses are required to ultimately map the relevance of these novel variants in CPHD.
Genetic analyses of bone morphogenetic protein 2, 4 and 7 in congenital combined pituitary hormone deficiency

PubMed Central

2013-01-01

Background The complex process of development of the pituitary gland is regulated by a number of signalling molecules and transcription factors. Mutations in these factors have been identified in rare cases of congenital hypopituitarism but for most subjects with combined pituitary hormone deficiency (CPHD) genetic causes are unknown. Bone morphogenetic proteins (BMPs) affect induction and growth of the pituitary primordium and thus represent plausible candidates for mutational screening of patients with CPHD. Methods We sequenced BMP2, 4 and 7 in 19 subjects with CPHD. For validation purposes, novel genetic variants were genotyped in 1046 healthy subjects. Additionally, potential functional relevance for most promising variants has been assessed by phylogenetic analyses and prediction of effects on protein structure. Results Sequencing revealed two novel variants and confirmed 30 previously known polymorphisms and mutations in BMP2, 4 and 7. Although phylogenetic analyses indicated that these variants map within strongly conserved gene regions, there was no direct support for their impact on protein structure when applying predictive bioinformatics tools. Conclusions A mutation in the BMP4 coding region resulting in an amino acid exchange (p.Arg300Pro) appeared most interesting among the identified variants. Further functional analyses are required to ultimately map the relevance of these novel variants in CPHD. PMID:24289245
Bioinformatics and variability in drug response: a protein structural perspective

PubMed Central

Lahti, Jennifer L.; Tang, Grace W.; Capriotti, Emidio; Liu, Tianyun; Altman, Russ B.

2012-01-01

Marketed drugs frequently perform worse in clinical practice than in the clinical trials on which their approval is based. Many therapeutic compounds are ineffective for a large subpopulation of patients to whom they are prescribed; worse, a significant fraction of patients experience adverse effects more severe than anticipated. The unacceptable risk–benefit profile for many drugs mandates a paradigm shift towards personalized medicine. However, prior to adoption of patient-specific approaches, it is useful to understand the molecular details underlying variable drug response among diverse patient populations. Over the past decade, progress in structural genomics led to an explosion of available three-dimensional structures of drug target proteins while efforts in pharmacogenetics offered insights into polymorphisms correlated with differential therapeutic outcomes. Together these advances provide the opportunity to examine how altered protein structures arising from genetic differences affect protein–drug interactions and, ultimately, drug response. In this review, we first summarize structural characteristics of protein targets and common mechanisms of drug interactions. Next, we describe the impact of coding mutations on protein structures and drug response. Finally, we highlight tools for analysing protein structures and protein–drug interactions and discuss their application for understanding altered drug responses associated with protein structural variants. PMID:22552919
Variability and genetic structure of the population of watermelon mosaic virus infecting melon in Spain.

PubMed

Moreno, I M; Malpica, J M; Díaz-Pendón, J A; Moriones, E; Fraile, A; García-Arenal, F

2004-01-05

The genetic structure of the population of Watermelon mosaic virus (WMV) in Spain was analysed by the biological and molecular characterisation of isolates sampled from its main host plant, melon. The population was a highly homogeneous one, built of a single pathotype, and comprising isolates closely related genetically. There was indication of temporal replacement of genotypes, but not of spatial structure of the population. Analyses of nucleotide sequences in three genomic regions, that is, in the cistrons for the P1, cylindrical inclusion (CI) and capsid (CP) proteins, showed lower similar values of nucleotide diversity for the P1 than for the CI or CP cistrons. The CI protein and the CP were under tighter evolutionary constraints than the P1 protein. Also, for the CI and CP cistrons, but not for the P1 cistron, two groups of sequences, defining two genetic strains, were apparent. Thus, different genomic regions of WMV show different evolutionary dynamics. Interestingly, for the CI and CP cistrons, sequences were clustered into two regions of the sequence space, defining the two strains above, and no intermediary sequences were identified. Recombinant isolates were found, accounting for at least 7% of the population. These recombinants presented two interesting features: (i) crossover points were detected between the analysed regions in the CI and CP cistrons, but not between those in the P1 and CI cistrons, (ii) crossover points were not observed within the analysed coding regions for the P1, CI or CP proteins. This indicates strong selection against isolates with recombinant proteins, even when originated from closely related strains. Hence, data indicate that genotypes of WMV, generated by mutation or recombination, outside of acceptable, discrete, regions in the evolutionary space, are eliminated from the virus population by negative selection.

An Uncharacterized Member of the Ribokinase Family in Thermococcus kodakarensis Exhibits myo-Inositol Kinase Activity*

PubMed Central

Sato, Takaaki; Fujihashi, Masahiro; Miyamoto, Yukika; Kuwata, Keiko; Kusaka, Eriko; Fujita, Haruo; Miki, Kunio; Atomi, Haruyuki

2013-01-01

Here we performed structural and biochemical analyses on the TK2285 gene product, an uncharacterized protein annotated as a member of the ribokinase family, from the hyperthermophilic archaeon Thermococcus kodakarensis. The three-dimensional structure of the TK2285 protein resembled those of previously characterized members of the ribokinase family including ribokinase, adenosine kinase, and phosphofructokinase. Conserved residues characteristic of this protein family were located in a cleft of the TK2285 protein as in other members whose structures have been determined. We thus examined the kinase activity of the TK2285 protein toward various sugars recognized by well characterized ribokinase family members. Although activity with sugar phosphates and nucleosides was not detected, kinase activity was observed toward d-allose, d-lyxose, d-tagatose, d-talose, d-xylose, and d-xylulose. Kinetic analyses with the six sugar substrates revealed high Km values, suggesting that they were not the true physiological substrates. By examining activity toward amino sugars, sugar alcohols, and disaccharides, we found that the TK2285 protein exhibited prominent kinase activity toward myo-inositol. Kinetic analyses with myo-inositol revealed a greater kcat and much lower Km value than those obtained with the monosaccharides, resulting in over a 2,000-fold increase in kcat/Km values. TK2285 homologs are distributed among members of Thermococcales, and in most species, the gene is positioned close to a myo-inositol monophosphate synthase gene. Our results suggest the presence of a novel subfamily of the ribokinase family whose members are present in Archaea and recognize myo-inositol as a substrate. PMID:23737529
Crystal structure of Gib2, a signal-transducing protein scaffold associated with ribosomes in Cryptococcus neoformans

NASA Astrophysics Data System (ADS)

Ero, Rya; Dimitrova, Valya Tenusheva; Chen, Yun; Bu, Wenting; Feng, Shu; Liu, Tongbao; Wang, Ping; Xue, Chaoyang; Tan, Suet Mien; Gao, Yong-Gui

2015-03-01

The atypical Gβ-like/RACK1 Gib2 protein promotes cAMP signalling that plays a central role in regulating the virulence of Cryptococcus neoformans. Gib2 contains a seven-bladed β transducin structure and is emerging as a scaffold protein interconnecting signalling pathways through interactions with various protein partners. Here, we present the crystal structure of Gib2 at a 2.2-Å resolution. The structure allows us to analyse the association between Gib2 and the ribosome, as well as to identify the Gib2 amino acid residues involved in ribosome binding. Our studies not only suggest that Gib2 has a role in protein translation but also present Gib2 as a physical link at the crossroads of various regulatory pathways important for the growth and virulence of C. neoformans.
Use of conserved key amino acid positions to morph protein folds.

PubMed

Reddy, Boojala V B; Li, Wilfred W; Bourne, Philip E

2002-07-15

By using three-dimensional (3D) structure alignments and a previously published method to determine Conserved Key Amino Acid Positions (CKAAPs) we propose a theoretical method to design mutations that can be used to morph the protein folds. The original Paracelsus challenge, met by several groups, called for the engineering of a stable but different structure by modifying less than 50% of the amino acid residues. We have used the sequences from the Protein Data Bank (PDB) identifiers 1ROP, and 2CRO, which were previously used in the Paracelsus challenge by those groups, and suggest mutation to CKAAPs to morph the protein fold. The total number of mutations suggested is less than 40% of the starting sequence theoretically improving the challenge results. From secondary structure prediction experiments of the proposed mutant sequence structures, we observe that each of the suggested mutant protein sequences likely folds to a different, non-native potentially stable target structure. These results are an early indicator that analyses using structure alignments leading to CKAAPs of a given structure are of value in protein engineering experiments. Copyright 2002 Wiley Periodicals, Inc.
Global Dynamics of Proteins: Bridging Between Structure and Function

PubMed Central

Bahar, Ivet; Lezon, Timothy R.; Yang, Lee-Wei; Eyal, Eran

2010-01-01

Biomolecular systems possess unique, structure-encoded dynamic properties that underlie their biological functions. Recent studies indicate that these dynamic properties are determined to a large extent by the topology of native contacts. In recent years, elastic network models used in conjunction with normal mode analyses have proven to be useful for elucidating the collective dynamics intrinsically accessible under native state conditions, including in particular the global modes of motions that are robustly defined by the overall architecture. With increasing availability of structural data for well-studied proteins in different forms (liganded, complexed, or free), there is increasing evidence in support of the correspondence between functional changes in structures observed in experiments and the global motions predicted by these coarse-grained analyses. These observed correlations suggest that computational methods may be advantageously employed for assessing functional changes in structure and allosteric mechanisms intrinsically favored by the native fold. PMID:20192781
Global dynamics of proteins: bridging between structure and function.

PubMed

Bahar, Ivet; Lezon, Timothy R; Yang, Lee-Wei; Eyal, Eran

2010-01-01

Biomolecular systems possess unique, structure-encoded dynamic properties that underlie their biological functions. Recent studies indicate that these dynamic properties are determined to a large extent by the topology of native contacts. In recent years, elastic network models used in conjunction with normal mode analyses have proven to be useful for elucidating the collective dynamics intrinsically accessible under native state conditions, including in particular the global modes of motions that are robustly defined by the overall architecture. With increasing availability of structural data for well-studied proteins in different forms (liganded, complexed, or free), there is increasing evidence in support of the correspondence between functional changes in structures observed in experiments and the global motions predicted by these coarse-grained analyses. These observed correlations suggest that computational methods may be advantageously employed for assessing functional changes in structure and allosteric mechanisms intrinsically favored by the native fold.
Design and structure of an equilibrium protein folding intermediate: a hint into dynamical regions of proteins.

PubMed

Ayuso-Tejedor, Sara; Angarica, Vladimir Espinosa; Bueno, Marta; Campos, Luis A; Abián, Olga; Bernadó, Pau; Sancho, Javier; Jiménez, M Angeles

2010-07-23

Partly unfolded protein conformations close to the native state may play important roles in protein function and in protein misfolding. Structural analyses of such conformations which are essential for their fully physicochemical understanding are complicated by their characteristic low populations at equilibrium. We stabilize here with a single mutation the equilibrium intermediate of apoflavodoxin thermal unfolding and determine its solution structure by NMR. It consists of a large native region identical with that observed in the X-ray structure of the wild-type protein plus an unfolded region. Small-angle X-ray scattering analysis indicates that the calculated ensemble of structures is consistent with the actual degree of expansion of the intermediate. The unfolded region encompasses discontinuous sequence segments that cluster in the 3D structure of the native protein forming the FMN cofactor binding loops and the binding site of a variety of partner proteins. Analysis of the apoflavodoxin inner interfaces reveals that those becoming destabilized in the intermediate are more polar than other inner interfaces of the protein. Natively folded proteins contain hydrophobic cores formed by the packing of hydrophobic surfaces, while natively unfolded proteins are rich in polar residues. The structure of the apoflavodoxin thermal intermediate suggests that the regions of natively folded proteins that are easily responsive to thermal activation may contain cores of intermediate hydrophobicity. Copyright (c) 2010 Elsevier Ltd. All rights reserved.
Differential protein folding and chemical changes in lung tissues exposed to asbestos or particulates

PubMed Central

Pascolo, Lorella; Borelli, Violetta; Canzonieri, Vincenzo; Gianoncelli, Alessandra; Birarda, Giovanni; Bedolla, Diana E.; Salomé, Murielle; Vaccari, Lisa; Calligaro, Carla; Cotte, Marine; Hesse, Bernhard; Luisi, Fernando; Zabucchi, Giuliano; Melato, Mauro; Rizzardi, Clara

2015-01-01

Environmental and occupational inhalants may induce a large number of pulmonary diseases, with asbestos exposure being the most risky. The mechanisms are clearly related to chemical composition and physical and surface properties of materials. A combination of X-ray fluorescence (μXRF) and Fourier Transform InfraRed (μFTIR) microscopy was used to chemically characterize and compare asbestos bodies versus environmental particulates (anthracosis) in lung tissues from asbestos exposed and control patients. μXRF analyses revealed heterogeneously aggregated particles in the anthracotic structures, containing mainly Si, K, Al and Fe. Both asbestos and particulates alter lung iron homeostasis, with a more marked effect in asbestos exposure. μFTIR analyses revealed abundant proteins on asbestos bodies but not on anthracotic particles. Most importantly, the analyses demonstrated that the asbestos coating proteins contain high levels of β-sheet structures. The occurrence of conformational changes in the proteic component of the asbestos coating provides new insights into long-term asbestos effects. PMID:26159651
Assessment of biofilm changes and concentration-depth profiles during arsenopyrite oxidation by Acidithiobacillus thiooxidans.

PubMed

Ramírez-Aldaba, Hugo; Vazquez-Arenas, Jorge; Sosa-Rodríguez, Fabiola S; Valdez-Pérez, Donato; Ruiz-Baca, Estela; García-Meza, Jessica Viridiana; Trejo-Córdova, Gabriel; Lara, René H

2017-08-01

Biofilm formation and evolution are key factors to consider to better understand the kinetics of arsenopyrite biooxidation. Chemical and surface analyses were carried out using Raman spectroscopy, scanning electron microscopy (SEM), confocal laser scanning microscopy (CLSM), glow discharge spectroscopy (GDS), and protein analysis (i.e., quantification) in order to evaluate the formation of intermediate secondary compounds and any significant changes arising in the biofilm structure of Acidithiobacillus thiooxidans during a 120-h period of biooxidation. Results show that the biofilm first evolves from a low cell density structure (1 to 12 h) into a formation of microcolonies (24 to 120 h) and then finally becomes enclosed by a secondary compound matrix that includes pyrite (FeS 2 )-like, S n 2- /S 0 , and As 2 S 3 compounds, as shown by Raman and SEM-EDS. GDS analyses (concentration-depth profiles, i.e., 12 h) indicate significant differences for depth speciation between abiotic control and biooxidized surfaces, thus providing a quantitative assessment of surface-bulk changes across samples (i.e. reactivity and /or structure-activity relationship). Respectively, quantitative protein analyses and CLSM analyses suggest variations in the type of extracellular protein expressed and changes in the biofilm structure from hydrophilic (i.e., exopolysaccharides) to hydrophobic (i.e., lipids) due to arsenopyrite and cell interactions during the 120-h period of biooxidation. We suggest feasible environmental and industrial implications for arsenopyrite biooxidation based on the findings of this study.
Structure and transcriptional regulation of the major intrinsic protein gene family in grapevine.

PubMed

Wong, Darren Chern Jan; Zhang, Li; Merlin, Isabelle; Castellarin, Simone D; Gambetta, Gregory A

2018-04-11

The major intrinsic protein (MIP) family is a family of proteins, including aquaporins, which facilitate water and small molecule transport across plasma membranes. In plants, MIPs function in a huge variety of processes including water transport, growth, stress response, and fruit development. In this study, we characterize the structure and transcriptional regulation of the MIP family in grapevine, describing the putative genome duplication events leading to the family structure and characterizing the family's tissue and developmental specific expression patterns across numerous preexisting microarray and RNAseq datasets. Gene co-expression network (GCN) analyses were carried out across these datasets and the promoters of each family member were analyzed for cis-regulatory element structure in order to provide insight into their transcriptional regulation. A total of 29 Vitis vinifera MIP family members (excluding putative pseudogenes) were identified of which all but two were mapped onto Vitis vinifera chromosomes. In this study, segmental duplication events were identified for five plasma membrane intrinsic protein (PIP) and four tonoplast intrinsic protein (TIP) genes, contributing to the expansion of PIPs and TIPs in grapevine. Grapevine MIP family members have distinct tissue and developmental expression patterns and hierarchical clustering revealed two primary groups regardless of the datasets analyzed. Composite microarray and RNA-seq gene co-expression networks (GCNs) highlighted the relationships between MIP genes and functional categories involved in cell wall modification and transport, as well as with other MIPs revealing a strong co-regulation within the family itself. Some duplicated MIP family members have undergone sub-functionalization and exhibit distinct expression patterns and GCNs. Cis-regulatory element (CRE) analyses of the MIP promoters and their associated GCN members revealed enrichment for numerous CREs including AP2/ERFs and NACs. Combining phylogenetic analyses, gene expression profiling, gene co-expression network analyses, and cis-regulatory element enrichment, this study provides a comprehensive overview of the structure and transcriptional regulation of the grapevine MIP family. The study highlights the duplication and sub-functionalization of the family, its strong coordinated expression with genes involved in growth and transport, and the putative classes of TFs responsible for its regulation.
Understanding the Role of Intrinsic Disorder of Viral Proteins in the Oncogenicity of Different Types of HPV.

PubMed

Tamarozzi, Elvira Regina; Giuliatti, Silvana

2018-01-09

Intrinsic disorder is very important in the biological function of several proteins, and is directly linked to their foldability during interaction with their targets. There is a close relationship between the intrinsically disordered proteins and the process of carcinogenesis involving viral pathogens. Among these pathogens, we have highlighted the human papillomavirus (HPV) in this study. HPV is currently among the most common sexually transmitted infections, besides being the cause of several types of cancer. HPVs are divided into two groups, called high- and low-risk, based on their oncogenic potential. The high-risk HPV E6 protein has been the target of much research, in seeking treatments against HPV, due to its direct involvement in the process of cell cycle control. To understand the role of intrinsic disorder of the viral proteins in the oncogenic potential of different HPV types, the structural characteristics of intrinsically disordered regions of high and low-risk HPV E6 proteins were analyzed. In silico analyses of primary sequences, prediction of tertiary structures, and analyses of molecular dynamics allowed the observation of the behavior of such disordered regions in these proteins, thereby proving a direct relationship of structural variation with the degree of oncogenicity of HPVs. The results obtained may contribute to the development of new therapies, targeting the E6 oncoprotein, for the treatment of HPV-associated diseases.
Dimerization of a flocculent protein from Moringa oleifera: experimental evidence and in silico interpretation.

PubMed

Pavankumar, Asalapuram R; Kayathri, Rajarathinam; Murugan, Natarajan A; Zhang, Qiong; Srivastava, Vaibhav; Okoli, Chuka; Bulone, Vincent; Rajarao, Gunaratna K; Ågren, Hans

2014-01-01

Many proteins exist in dimeric and other oligomeric forms to gain stability and functional advantages. In this study, the dimerization property of a coagulant protein (MO2.1) from Moringa oleifera seeds was addressed through laboratory experiments, protein-protein docking studies and binding free energy calculations. The structure of MO2.1 was predicted by homology modelling, while binding free energy and residues-distance profile analyses provided insight into the energetics and structural factors for dimer formation. Since the coagulation activities of the monomeric and dimeric forms of MO2.1 were comparable, it was concluded that oligomerization does not affect the biological activity of the protein.
Mixture models for protein structure ensembles.

PubMed

Hirsch, Michael; Habeck, Michael

2008-10-01

Protein structure ensembles provide important insight into the dynamics and function of a protein and contain information that is not captured with a single static structure. However, it is not clear a priori to what extent the variability within an ensemble is caused by internal structural changes. Additional variability results from overall translations and rotations of the molecule. And most experimental data do not provide information to relate the structures to a common reference frame. To report meaningful values of intrinsic dynamics, structural precision, conformational entropy, etc., it is therefore important to disentangle local from global conformational heterogeneity. We consider the task of disentangling local from global heterogeneity as an inference problem. We use probabilistic methods to infer from the protein ensemble missing information on reference frames and stable conformational sub-states. To this end, we model a protein ensemble as a mixture of Gaussian probability distributions of either entire conformations or structural segments. We learn these models from a protein ensemble using the expectation-maximization algorithm. Our first model can be used to find multiple conformers in a structure ensemble. The second model partitions the protein chain into locally stable structural segments or core elements and less structured regions typically found in loops. Both models are simple to implement and contain only a single free parameter: the number of conformers or structural segments. Our models can be used to analyse experimental ensembles, molecular dynamics trajectories and conformational change in proteins. The Python source code for protein ensemble analysis is available from the authors upon request.
Proteomic identification and purification of seed proteins from native Amazonian species displaying antifungal activity.

PubMed

Ramos, Márcio V; Brito, Daniel; Freitas, Cléverson D T; Gonçalves, José Francisco C; Porfirio, Camila T M N; Lobo, Marina D P; Monteiro-Moreira, Ana Cristina O; Souza, Luiz A C; Fernandes, Andreia V

2018-04-19

Seeds of native species from the rain forest (Amazon) are source of chitinases and their protein extracts exhibited strong and broad antifungal activity. Numerous plant species native to the Amazon have not yet been chemically studied. Studies of seeds are scarcer, since adversities in accessing study areas and seasonality pose constant hurdles to systematic research. In this study, proteins were extracted from seeds belonging to endemic Amazon species and were investigated for the first time. Proteolytic activity, peptidase inhibitors, and chitinases were identified, but chitinolytic activity predominated. Four proteins were purified through chromatography and identified as lectin and chitinases by MS/MS analyses. The proteins were examined for inhibition of a phytopathogen (Fusarium oxysporum). Analyses by fluorescence microscopy suggested binding of propidium iodide to DNA of fungal spores, revealing that spore integrity was lost when accessed by the proteins. Further structural and functional analyses of defensive proteins belonging to species facing highly complex ecosystems such as Amazonia should be conducted, since these could provide new insights into specificity and synergism involving defense proteins of plants submitted to a very complex ecosystem.
Extensive structural change of the envelope protein of dengue virus induced by a tuned ionic strength: conformational and energetic analyses

NASA Astrophysics Data System (ADS)

Degrève, Léo; Fuzo, Carlos A.; Caliri, Antonio

2012-12-01

The Dengue has become a global public health threat, with over 100 million infections annually; to date there is no specific vaccine or any antiviral drug. The structures of the envelope (E) proteins of the four known serotype of the dengue virus (DENV) are already known, but there are insufficient molecular details of their structural behavior in solution in the distinct environmental conditions in which the DENVs are submitted, from the digestive tract of the mosquito up to its replication inside the host cell. Such detailed knowledge becomes important because of the multifunctional character of the E protein: it mediates the early events in cell entry, via receptor endocytosis and, as a class II protein, participates determinately in the process of membrane fusion. The proposed infection mechanism asserts that once in the endosome, at low pH, the E homodimers dissociate and insert into the endosomal lipid membrane, after an extensive conformational change, mainly on the relative arrangement of its three domains. In this work we employ all-atom explicit solvent Molecular Dynamics simulations to specify the thermodynamic conditions in that the E proteins are induced to experience extensive structural changes, such as during the process of reducing pH. We study the structural behavior of the E protein monomer at acid pH solution of distinct ionic strength. Extensive simulations are carried out with all the histidine residues in its full protonated form at four distinct ionic strengths. The results are analyzed in detail from structural and energetic perspectives, and the virtual protein movements are described by means of the principal component analyses. As the main result, we found that at acid pH and physiological ionic strength, the E protein suffers a major structural change; for lower or higher ionic strengths, the crystal structure is essentially maintained along of all extensive simulations. On the other hand, at basic pH, when all histidine residues are in the unprotonated form, the protein structure is very stable for ionic strengths ranging from 0 to 225 mM. Therefore, our findings support the hypothesis that the histidines constitute the hot points that induce configurational changes of E protein in acid pH, and give extra motivation to the development of new ideas for antivirus compound design.
Arc is a flexible modular protein capable of reversible self-oligomerization

PubMed Central

Myrum, Craig; Baumann, Anne; Bustad, Helene J.; Flydal, Marte Innselset; Mariaule, Vincent; Alvira, Sara; Cuéllar, Jorge; Haavik, Jan; Soulé, Jonathan; Valpuesta, José Maria; Márquez, José Antonio; Martinez, Aurora; Bramham, Clive R.

2015-01-01

The immediate early gene product Arc (activity-regulated cytoskeleton-associated protein) is posited as a master regulator of long-term synaptic plasticity and memory. However, the physicochemical and structural properties of Arc have not been elucidated. In the present study, we expressed and purified recombinant human Arc (hArc) and performed the first biochemical and biophysical analysis of hArc's structure and stability. Limited proteolysis assays and MS analysis indicate that hArc has two major domains on either side of a central more disordered linker region, consistent with in silico structure predictions. hArc's secondary structure was estimated using CD, and stability was analysed by CD-monitored thermal denaturation and differential scanning fluorimetry (DSF). Oligomerization states under different conditions were studied by dynamic light scattering (DLS) and visualized by AFM and EM. Biophysical analyses show that hArc is a modular protein with defined secondary structure and loose tertiary structure. hArc appears to be pyramid-shaped as a monomer and is capable of reversible self-association, forming large soluble oligomers. The N-terminal domain of hArc is highly basic, which may promote interaction with cytoskeletal structures or other polyanionic surfaces, whereas the C-terminal domain is acidic and stabilized by ionic conditions that promote oligomerization. Upon binding of presenilin-1 (PS1) peptide, hArc undergoes a large structural change. A non-synonymous genetic variant of hArc (V231G) showed properties similar to the wild-type (WT) protein. We conclude that hArc is a flexible multi-domain protein that exists in monomeric and oligomeric forms, compatible with a diverse, hub-like role in plasticity-related processes. PMID:25748042
General Characteristics of the Changes in the Thermal Stability of Proteins and Enzymes After the Chemical Modification of Their Functional Groups

NASA Astrophysics Data System (ADS)

Kutuzova, G. D.; Ugarova, N. N.; Berezin, Ilya V.

1984-11-01

The principal structural and physicochemical factors determining the stability of protein macromolecules in solution and the characteristics of the structure of the proteins from thermophilic microorganisms are examined. The mechanism of the changes in the thermal stability of proteins and enzymes after the chemical modification of their functional side groups and the experimental data concerning the influence of chemical modification on the thermal stability of proteins are analysed. The dependence of the stabilisation effect and of the changes in the structure of protein macromolecules on the degree of modification and on the nature of the modified groups and the groups introduced into proteins in the course of modification (their charge and hydrophobic properties) is demonstrated. The great practical value of the method of chemical modification for the preparation of stabilised forms of biocatalysts is shown in relation to specific examples. The bibliography includes 178 references.
CCProf: exploring conformational change profile of proteins

PubMed Central

Chang, Che-Wei; Chou, Chai-Wei; Chang, Darby Tien-Hao

2016-01-01

In many biological processes, proteins have important interactions with various molecules such as proteins, ions or ligands. Many proteins undergo conformational changes upon these interactions, where regions with large conformational changes are critical to the interactions. This work presents the CCProf platform, which provides conformational changes of entire proteins, named conformational change profile (CCP) in the context. CCProf aims to be a platform where users can study potential causes of novel conformational changes. It provides 10 biological features, including conformational change, potential binding target site, secondary structure, conservation, disorder propensity, hydropathy propensity, sequence domain, structural domain, phosphorylation site and catalytic site. All these information are integrated into a well-aligned view, so that researchers can capture important relevance between different biological features visually. The CCProf contains 986 187 protein structure pairs for 3123 proteins. In addition, CCProf provides a 3D view in which users can see the protein structures before and after conformational changes as well as binding targets that induce conformational changes. All information (e.g. CCP, binding targets and protein structures) shown in CCProf, including intermediate data are available for download to expedite further analyses. Database URL: http://zoro.ee.ncku.edu.tw/ccprof/ PMID:27016699
The proteins of the grape (Vitis vinifera L.) seed endosperm: fractionation and identification of the major components.

PubMed

Gazzola, Diana; Vincenzi, Simone; Gastaldon, Luca; Tolin, Serena; Pasini, Gabriella; Curioni, Andrea

2014-07-15

In the present study, grape (Vitis vinifera L.) seed endosperm proteins were characterized after sequential fractionation, according to a modified Osborne procedure. The salt-soluble fraction (albumins and globulins) comprised the majority (58.4%) of the total extracted protein. The protein fractions analysed by SDS-PAGE showed similar bands, indicating different solubility of the same protein components. SDS-PAGE in non-reducing and reducing conditions revealed the polypeptide composition of the protein bands. The main polypeptides, which were similar in all the grape varieties analysed, were identified by LC-MS/MS as homologous to the 11S globulin-like seed storage proteins of other plant species, while a monomeric 43 kDa protein presented high homology with the 7S globulins of legume seeds. The results provide new insights about the identity, structure and polypeptide composition of the grape seed storage proteins. Copyright © 2014 Elsevier Ltd. All rights reserved.
Integration of Structural Dynamics and Molecular Evolution via Protein Interaction Networks: A New Era in Genomic Medicine

PubMed Central

Kumar, Avishek; Butler, Brandon M.; Kumar, Sudhir; Ozkan, S. Banu

2016-01-01

Summary Sequencing technologies are revealing many new non-synonymous single nucleotide variants (nsSNVs) in each personal exome. To assess their functional impacts, comparative genomics is frequently employed to predict if they are benign or not. However, evolutionary analysis alone is insufficient, because it misdiagnoses many disease-associated nsSNVs, such as those at positions involved in protein interfaces, and because evolutionary predictions do not provide mechanistic insights into functional change or loss. Structural analyses can aid in overcoming both of these problems by incorporating conformational dynamics and allostery in nSNV diagnosis. Finally, protein-protein interaction networks using systems-level methodologies shed light onto disease etiology and pathogenesis. Bridging these network approaches with structurally resolved protein interactions and dynamics will advance genomic medicine. PMID:26684487
Structural and Genetic Analyses of the Mycobacterium tuberculosis Protein Kinase B Sensor Domain Identify a Potential Ligand-binding Site.

PubMed

Prigozhin, Daniil M; Papavinasasundaram, Kadamba G; Baer, Christina E; Murphy, Kenan C; Moskaleva, Alisa; Chen, Tony Y; Alber, Tom; Sassetti, Christopher M

2016-10-28

Monitoring the environment with serine/threonine protein kinases is critical for growth and survival of Mycobacterium tuberculosis, a devastating human pathogen. Protein kinase B (PknB) is a transmembrane serine/threonine protein kinase that acts as an essential regulator of mycobacterial growth and division. The PknB extracellular domain (ECD) consists of four repeats homologous to penicillin-binding protein and serine/threonine kinase associated (PASTA) domains, and binds fragments of peptidoglycan. These properties suggest that PknB activity is modulated by ECD binding to peptidoglycan substructures, however, the molecular mechanisms underpinning PknB regulation remain unclear. In this study, we report structural and genetic characterization of the PknB ECD. We determined the crystal structures of overlapping ECD fragments at near atomic resolution, built a model of the full ECD, and discovered a region on the C-terminal PASTA domain that has the properties of a ligand-binding site. Hydrophobic interaction between this surface and a bound molecule of citrate was observed in a crystal structure. Our genetic analyses in M. tuberculosis showed that nonfunctional alleles were produced either by deletion of any of single PASTA domain or by mutation of individual conserved residues lining the putative ligand-binding surface of the C-terminal PASTA repeat. These results define two distinct structural features necessary for PknB signal transduction, a fully extended ECD and a conserved, membrane-distal putative ligand-binding site. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

Structure and molecular dynamics simulation of archaeal prefoldin: the molecular mechanism for binding and recognition of nonnative substrate proteins.

PubMed

Ohtaki, Akashi; Kida, Hiroshi; Miyata, Yusuke; Ide, Naoki; Yonezawa, Akihiro; Arakawa, Takatoshi; Iizuka, Ryo; Noguchi, Keiichi; Kita, Akiko; Odaka, Masafumi; Miki, Kunio; Yohda, Masafumi

2008-02-29

Prefoldin (PFD) is a heterohexameric molecular chaperone complex in the eukaryotic cytosol and archaea with a jellyfish-like structure containing six long coiled-coil tentacles. PFDs capture protein folding intermediates or unfolded polypeptides and transfer them to group II chaperonins for facilitated folding. Although detailed studies on the mechanisms for interaction with unfolded proteins or cooperation with chaperonins of archaeal PFD have been performed, it is still unclear how PFD captures the unfolded protein. In this study, we determined the X-ray structure of Pyrococcus horikoshii OT3 PFD (PhPFD) at 3.0 A resolution and examined the molecular mechanism for binding and recognition of nonnative substrate proteins by molecular dynamics (MD) simulation and mutation analyses. PhPFD has a jellyfish-like structure with six long coiled-coil tentacles and a large central cavity. Each subunit has a hydrophobic groove at the distal region where an unfolded substrate protein is bound. During MD simulation at 330 K, each coiled coil was highly flexible, enabling it to widen its central cavity and capture various nonnative proteins. Docking MD simulation of PhPFD with unfolded insulin showed that the beta subunit is essentially involved in substrate binding and that the alpha subunit modulates the shape and width of the central cavity. Analyses of mutant PhPFDs with amino acid replacement of the hydrophobic residues of the beta subunit in the hydrophobic groove have shown that beta Ile107 has a critical role in forming the hydrophobic groove.
Positive Darwinian Selection in the Piston That Powers Proton Pumps in Complex I of the Mitochondria of Pacific Salmon

PubMed Central

Garvin, Michael R.; Bielawski, Joseph P.; Gharrett, Anthony J.

2011-01-01

The mechanism of oxidative phosphorylation is well understood, but evolution of the proteins involved is not. We combined phylogenetic, genomic, and structural biology analyses to examine the evolution of twelve mitochondrial encoded proteins of closely related, yet phenotypically diverse, Pacific salmon. Two separate analyses identified the same seven positively selected sites in ND5. A strong signal was also detected at three sites of ND2. An energetic coupling analysis revealed several structures in the ND5 protein that may have co-evolved with the selected sites. These data implicate Complex I, specifically the piston arm of ND5 where it connects the proton pumps, as important in the evolution of Pacific salmon. Lastly, the lineage to Chinook experienced rapid evolution at the piston arm. PMID:21969854
Positive Darwinian selection in the piston that powers proton pumps in complex I of the mitochondria of Pacific salmon.

PubMed

Garvin, Michael R; Bielawski, Joseph P; Gharrett, Anthony J

2011-01-01

The mechanism of oxidative phosphorylation is well understood, but evolution of the proteins involved is not. We combined phylogenetic, genomic, and structural biology analyses to examine the evolution of twelve mitochondrial encoded proteins of closely related, yet phenotypically diverse, Pacific salmon. Two separate analyses identified the same seven positively selected sites in ND5. A strong signal was also detected at three sites of ND2. An energetic coupling analysis revealed several structures in the ND5 protein that may have co-evolved with the selected sites. These data implicate Complex I, specifically the piston arm of ND5 where it connects the proton pumps, as important in the evolution of Pacific salmon. Lastly, the lineage to Chinook experienced rapid evolution at the piston arm.
Understanding the differences in molecular conformation of carbohydrate and protein in endosperm tissues of grains with different biodegradation kinetics using advanced synchrotron technology

NASA Astrophysics Data System (ADS)

Yu, P.; Block, H. C.; Doiron, K.

2009-01-01

Conventional "wet" chemical analyses rely heavily on the use of harsh chemicals and derivatization, thereby altering native seed structures leaving them unable to detect any original inherent structures within an intact tissue sample. A synchrotron is a giant particle accelerator that turns electrons into light (million times brighter than sunlight) which can be used to study the structure of materials at the molecular level. Synchrotron radiation-based Fourier transform IR microspectroscopy (SR-FTIRM) has been developed as a rapid, direct, non-destructive and bioanalytical technique. This technique, taking advantage of the brightness of synchrotron light and a small effective source size, is capable of exploring the molecular chemistry within the microstructures of a biological tissue without the destruction of inherent structures at ultraspatial resolutions within cellular dimensions. This is in contrast to traditional 'wet' chemical methods, which, during processing for analysis, often result in the destruction of the intrinsic structures of feeds. To date there has been very little application of this technique to the study of plant seed tissue in relation to nutrient utilization. The objective of this study was to use novel synchrotron radiation-based technology (SR-FTIRM) to identify the differences in the molecular chemistry and conformation of carbohydrate and protein in various plant seed endosperms within intact tissues at cellular and subcellular level from grains with different biodegradation kinetics. Barley grain (cv. Harrington) with a high rate (31.3%/h) and extent (78%), corn grain (cv. Pioneer) with a low rate (9.6%/h) and extent of (57%), and wheat grain (cv. AC Barrie) with an intermediate rate (23%/h) and extent (72%) of ruminal DM degradation were selected for evaluation. SR-FTIRM evaluations were performed at the National Synchrotron Light Source at the Brookhaven National Laboratory (Brookhaven, NY). The molecular structure spectral analysis involved the fingerprint regions of ca. 1720-1485 cm -1 (attributed to protein amide I C dbnd O and C sbnd N stretching; amide II N sbnd H bending and C sbnd N stretching), ca. 1650-950 cm -1 (non-structural CHO starch in endosperms), and ca. 1185-800 cm -1 (attributed to total CHO C sbnd O stretching vibrations) together with agglomerative hierarchical cluster and principal component analyses. Analyses involving the protein amide I features consistently identified differences between all three grains. Other analyses involving carbohydrate features were able to differentiate between wheat and barley but failed however to differentiate between wheat and corn. These results suggest that SR-FTIRM plus the multivariate analyses can be used to identify spectral features associated with the molecular structure of endosperm from grains with different biodegradation kinetics, especially in relation to protein structure. The Novel synchrotron radiation-based bioanalytical technique provides a new approach for plant seed structural molecular studies at ultraspatial resolution and within intact tissue in relation to nutrient availability.
An affinity-structure database of helix-turn-helix: DNA complexes with a universal coordinate system.

PubMed

AlQuraishi, Mohammed; Tang, Shengdong; Xia, Xide

2015-11-19

Molecular interactions between proteins and DNA molecules underlie many cellular processes, including transcriptional regulation, chromosome replication, and nucleosome positioning. Computational analyses of protein-DNA interactions rely on experimental data characterizing known protein-DNA interactions structurally and biochemically. While many databases exist that contain either structural or biochemical data, few integrate these two data sources in a unified fashion. Such integration is becoming increasingly critical with the rapid growth of structural and biochemical data, and the emergence of algorithms that rely on the synthesis of multiple data types to derive computational models of molecular interactions. We have developed an integrated affinity-structure database in which the experimental and quantitative DNA binding affinities of helix-turn-helix proteins are mapped onto the crystal structures of the corresponding protein-DNA complexes. This database provides access to: (i) protein-DNA structures, (ii) quantitative summaries of protein-DNA binding affinities using position weight matrices, and (iii) raw experimental data of protein-DNA binding instances. Critically, this database establishes a correspondence between experimental structural data and quantitative binding affinity data at the single basepair level. Furthermore, we present a novel alignment algorithm that structurally aligns the protein-DNA complexes in the database and creates a unified residue-level coordinate system for comparing the physico-chemical environments at the interface between complexes. Using this unified coordinate system, we compute the statistics of atomic interactions at the protein-DNA interface of helix-turn-helix proteins. We provide an interactive website for visualization, querying, and analyzing this database, and a downloadable version to facilitate programmatic analysis. This database will facilitate the analysis of protein-DNA interactions and the development of programmatic computational methods that capitalize on integration of structural and biochemical datasets. The database can be accessed at http://ProteinDNA.hms.harvard.edu.
Systems biology of the structural proteome.

PubMed

Brunk, Elizabeth; Mih, Nathan; Monk, Jonathan; Zhang, Zhen; O'Brien, Edward J; Bliven, Spencer E; Chen, Ke; Chang, Roger L; Bourne, Philip E; Palsson, Bernhard O

2016-03-11

The success of genome-scale models (GEMs) can be attributed to the high-quality, bottom-up reconstructions of metabolic, protein synthesis, and transcriptional regulatory networks on an organism-specific basis. Such reconstructions are biochemically, genetically, and genomically structured knowledge bases that can be converted into a mathematical format to enable a myriad of computational biological studies. In recent years, genome-scale reconstructions have been extended to include protein structural information, which has opened up new vistas in systems biology research and empowered applications in structural systems biology and systems pharmacology. Here, we present the generation, application, and dissemination of genome-scale models with protein structures (GEM-PRO) for Escherichia coli and Thermotoga maritima. We show the utility of integrating molecular scale analyses with systems biology approaches by discussing several comparative analyses on the temperature dependence of growth, the distribution of protein fold families, substrate specificity, and characteristic features of whole cell proteomes. Finally, to aid in the grand challenge of big data to knowledge, we provide several explicit tutorials of how protein-related information can be linked to genome-scale models in a public GitHub repository ( https://github.com/SBRG/GEMPro/tree/master/GEMPro_recon/). Translating genome-scale, protein-related information to structured data in the format of a GEM provides a direct mapping of gene to gene-product to protein structure to biochemical reaction to network states to phenotypic function. Integration of molecular-level details of individual proteins, such as their physical, chemical, and structural properties, further expands the description of biochemical network-level properties, and can ultimately influence how to model and predict whole cell phenotypes as well as perform comparative systems biology approaches to study differences between organisms. GEM-PRO offers insight into the physical embodiment of an organism's genotype, and its use in this comparative framework enables exploration of adaptive strategies for these organisms, opening the door to many new lines of research. With these provided tools, tutorials, and background, the reader will be in a position to run GEM-PRO for their own purposes.
Investigation of Inhibition Mechanism of Chemokine Receptor CCR5 by Micro-second Molecular Dynamics Simulations.

PubMed

Salmas, Ramin Ekhteiari; Yurtsever, Mine; Durdagi, Serdar

2015-08-24

Chemokine receptor 5 (CCR5) belongs to G protein coupled receptors (GPCRs) and plays an important role in treatment of human immunodeficiency virus (HIV) infection since HIV uses CCR5 protein as a co-receptor. Recently, the crystal structure of CCR5-bound complex with an approved anti-retroviral drug (maroviroc) was resolved. During the crystallization procedure, amino acid residues (i.e., Cys224, Arg225, Asn226 and Glu227) at the third intra-cellular loop were replaced by the rubredoxin for stability reasons. In the current study, we aimed to understand the impact of the incorporated rubredoxin on the conformations of TM domains of the target protein. For this reason, rubredoxin was deleted from the crystal structure and the missing amino acids were engineered. The resultant structure was subjected to long (μs) molecular dynamics (MD) simulations to shed light into the inhibitory mechanism. The derived model structure displayed a significant deviation in the cytoplasmic domain of TM5 and IC3 in the absence of rubredoxin. The principal component analyses (PCA) and MD trajectory analyses revealed important structural and dynamical differences at apo and holo forms of the CCR5.
Structural and Functional Studies of H. seropedicae RecA Protein - Insights into the Polymerization of RecA Protein as Nucleoprotein Filament.

PubMed

Leite, Wellington C; Galvão, Carolina W; Saab, Sérgio C; Iulek, Jorge; Etto, Rafael M; Steffens, Maria B R; Chitteni-Pattu, Sindhu; Stanage, Tyler; Keck, James L; Cox, Michael M

2016-01-01

The bacterial RecA protein plays a role in the complex system of DNA damage repair. Here, we report the functional and structural characterization of the Herbaspirillum seropedicae RecA protein (HsRecA). HsRecA protein is more efficient at displacing SSB protein from ssDNA than Escherichia coli RecA protein. HsRecA also promotes DNA strand exchange more efficiently. The three dimensional structure of HsRecA-ADP/ATP complex has been solved to 1.7 Å resolution. HsRecA protein contains a small N-terminal domain, a central core ATPase domain and a large C-terminal domain, that are similar to homologous bacterial RecA proteins. Comparative structural analysis showed that the N-terminal polymerization motif of archaeal and eukaryotic RecA family proteins are also present in bacterial RecAs. Reconstruction of electrostatic potential from the hexameric structure of HsRecA-ADP/ATP revealed a high positive charge along the inner side, where ssDNA is bound inside the filament. The properties of this surface may explain the greater capacity of HsRecA protein to bind ssDNA, forming a contiguous nucleoprotein filament, displace SSB and promote DNA exchange relative to EcRecA. Our functional and structural analyses provide insight into the molecular mechanisms of polymerization of bacterial RecA as a helical nucleoprotein filament.
Computational analysis of human and mouse CREB3L4 Protein

PubMed Central

Velpula, Kiran Kumar; Rehman, Azeem Abdul; Chigurupati, Soumya; Sanam, Ramadevi; Inampudi, Krishna Kishore; Akila, Chandra Sekhar

2012-01-01

CREB3L4 is a member of the CREB/ATF transcription factor family, characterized by their regulation of gene expression through the cAMP-responsive element. Previous studies identified this protein in mice and humans. Whereas CREB3L4 in mice (referred to as Tisp40) is found in the testes and functions in spermatogenesis, human CREB3L4 is primarily detected in the prostate and has been implicated in cancer. We conducted computational analyses to compare the structural homology between murine Tisp40α human CREB3L4. Our results reveal that the primary and secondary structures of the two proteins contain high similarity. Additionally, predicted helical transmembrane structure reveals that the proteins likely have similar structure and function. This study offers preliminary findings that support the translation of mouse Tisp40α findings into human models, based on structural homology. PMID:22829733
Predicting protein crystallization propensity from protein sequence

PubMed Central

2011-01-01

The high-throughput structure determination pipelines developed by structural genomics programs offer a unique opportunity for data mining. One important question is how protein properties derived from a primary sequence correlate with the protein’s propensity to yield X-ray quality crystals (crystallizability) and 3D X-ray structures. A set of protein properties were computed for over 1,300 proteins that expressed well but were insoluble, and for ~720 unique proteins that resulted in X-ray structures. The correlation of the protein’s iso-electric point and grand average hydropathy (GRAVY) with crystallizability was analyzed for full length and domain constructs of protein targets. In a second step, several additional properties that can be calculated from the protein sequence were added and evaluated. Using statistical analyses we have identified a set of the attributes correlating with a protein’s propensity to crystallize and implemented a Support Vector Machine (SVM) classifier based on these. We have created applications to analyze and provide optimal boundary information for query sequences and to visualize the data. These tools are available via the web site http://bioinformatics.anl.gov/cgi-bin/tools/pdpredictor. PMID:20177794
Discriminative structural approaches for enzyme active-site prediction.

PubMed

Kato, Tsuyoshi; Nagano, Nozomi

2011-02-15

Predicting enzyme active-sites in proteins is an important issue not only for protein sciences but also for a variety of practical applications such as drug design. Because enzyme reaction mechanisms are based on the local structures of enzyme active-sites, various template-based methods that compare local structures in proteins have been developed to date. In comparing such local sites, a simple measurement, RMSD, has been used so far. This paper introduces new machine learning algorithms that refine the similarity/deviation for comparison of local structures. The similarity/deviation is applied to two types of applications, single template analysis and multiple template analysis. In the single template analysis, a single template is used as a query to search proteins for active sites, whereas a protein structure is examined as a query to discover the possible active-sites using a set of templates in the multiple template analysis. This paper experimentally illustrates that the machine learning algorithms effectively improve the similarity/deviation measurements for both the analyses.
Random close packing in protein cores

NASA Astrophysics Data System (ADS)

Gaines, Jennifer C.; Smith, W. Wendell; Regan, Lynne; O'Hern, Corey S.

2016-03-01

Shortly after the determination of the first protein x-ray crystal structures, researchers analyzed their cores and reported packing fractions ϕ ≈0.75 , a value that is similar to close packing of equal-sized spheres. A limitation of these analyses was the use of extended atom models, rather than the more physically accurate explicit hydrogen model. The validity of the explicit hydrogen model was proved in our previous studies by its ability to predict the side chain dihedral angle distributions observed in proteins. In contrast, the extended atom model is not able to recapitulate the side chain dihedral angle distributions, and gives rise to large atomic clashes at side chain dihedral angle combinations that are highly probable in protein crystal structures. Here, we employ the explicit hydrogen model to calculate the packing fraction of the cores of over 200 high-resolution protein structures. We find that these protein cores have ϕ ≈0.56 , which is similar to results obtained from simulations of random packings of individual amino acids. This result provides a deeper understanding of the physical basis of protein structure that will enable predictions of the effects of amino acid mutations to protein cores and interfaces of known structure.
Random close packing in protein cores.

PubMed

Gaines, Jennifer C; Smith, W Wendell; Regan, Lynne; O'Hern, Corey S

2016-03-01

Shortly after the determination of the first protein x-ray crystal structures, researchers analyzed their cores and reported packing fractions ϕ ≈ 0.75, a value that is similar to close packing of equal-sized spheres. A limitation of these analyses was the use of extended atom models, rather than the more physically accurate explicit hydrogen model. The validity of the explicit hydrogen model was proved in our previous studies by its ability to predict the side chain dihedral angle distributions observed in proteins. In contrast, the extended atom model is not able to recapitulate the side chain dihedral angle distributions, and gives rise to large atomic clashes at side chain dihedral angle combinations that are highly probable in protein crystal structures. Here, we employ the explicit hydrogen model to calculate the packing fraction of the cores of over 200 high-resolution protein structures. We find that these protein cores have ϕ ≈ 0.56, which is similar to results obtained from simulations of random packings of individual amino acids. This result provides a deeper understanding of the physical basis of protein structure that will enable predictions of the effects of amino acid mutations to protein cores and interfaces of known structure.
Structural changes of malt proteins during boiling.

PubMed

Jin, Bei; Li, Lin; Liu, Guo-Qin; Li, Bing; Zhu, Yu-Kui; Liao, Liao-Ning

2009-03-09

Changes in the physicochemical properties and structure of proteins derived from two malt varieties (Baudin and Guangmai) during wort boiling were investigated by differential scanning calorimetry, SDS-PAGE, two-dimensional electrophoresis, gel filtration chromatography and circular dichroism spectroscopy. The results showed that both protein content and amino acid composition changed only slightly during boiling, and that boiling might cause a gradual unfolding of protein structures, as indicated by the decrease in surface hydrophobicity and free sulfhydryl content and enthalpy value, as well as reduced alpha-helix contents and markedly increased random coil contents. It was also found that major component of both worts was a boiling-resistant protein with a molecular mass of 40 kDa, and that according to the two-dimensional electrophoresis and SE-HPLC analyses, a small amount of soluble aggregates might be formed via hydrophobic interactions. It was thus concluded that changes of protein structure caused by boiling that might influence beer quality are largely independent of malt variety.
Surface Proteins of Gram-Positive Pathogens: Using Crystallography to Uncover Novel Features in Drug and Vaccine Candidates

NASA Astrophysics Data System (ADS)

Baker, Edward N.; Proft, Thomas; Kang, Haejoo

Proteins displayed on the cell surfaces of pathogenic organisms are the front-line troops of bacterial attack, playing critical roles in colonization, infection and virulence. Although such proteins can often be recognized from genome sequence data, through characteristic sequence motifs, their functions are often unknown. One such group of surface proteins is attached to the cell surface of Gram-positive pathogens through the action of sortase enzymes. Some of these proteins are now known to form pili: long filamentous structures that mediate attachment to human cells. Crystallographic analyses of these and other cell surface proteins have uncovered novel features in their structure, assembly and stability, including the presence of inter- and intramolecular isopeptide crosslinks. This improved understanding of structures on the bacterial cell surface offers opportunities for the development of some new drug targets and for novel approaches to vaccine design.
Antibody Epitope Analysis to Investigate Folded Structure, Allosteric Conformation, and Evolutionary Lineage of Proteins.

PubMed

Wong, Sienna; Jin, J-P

2017-01-01

Study of folded structure of proteins provides insights into their biological functions, conformational dynamics and molecular evolution. Current methods of elucidating folded structure of proteins are laborious, low-throughput, and constrained by various limitations. Arising from these methods is the need for a sensitive, quantitative, rapid and high-throughput method not only analysing the folded structure of proteins, but also to monitor dynamic changes under physiological or experimental conditions. In this focused review, we outline the foundation and limitations of current protein structure-determination methods prior to discussing the advantages of an emerging antibody epitope analysis for applications in structural, conformational and evolutionary studies of proteins. We discuss the application of this method using representative examples in monitoring allosteric conformation of regulatory proteins and the determination of the evolutionary lineage of related proteins and protein isoforms. The versatility of the method described herein is validated by the ability to modulate a variety of assay parameters to meet the needs of the user in order to monitor protein conformation. Furthermore, the assay has been used to clarify the lineage of troponin isoforms beyond what has been depicted by sequence homology alone, demonstrating the nonlinear evolutionary relationship between primary structure and tertiary structure of proteins. The antibody epitope analysis method is a highly adaptable technique of protein conformation elucidation, which can be easily applied without the need for specialized equipment or technical expertise. When applied in a systematic and strategic manner, this method has the potential to reveal novel and biomedically meaningful information for structure-function relationship and evolutionary lineage of proteins. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Recent Progress and Development of Crystal Structure Analysis of Enzymes and Other Proteins

NASA Astrophysics Data System (ADS)

Tanokura, Masaru; Nagata, Koji; Miyazono, Ken-Ichi; Miyakawa, Takuya; Okai, Masahiko

Structural biology has made tremendous progress in this decade. Here we briefly introduce the Target Proteins Research Program, a national project promoted by the Ministry of Education, Culture, Sports, Science and Technology (MEXT) of Japan. The program aims to reveal the structure and function of proteins that are of great importance in both academic research and industrial application. We also summarize the results of structure-function analyses of (i) transcriptional regulatory proteins useful for the breading of drought and heat stress tolerant crops, (ii) useful enzymes for the production of chiral compounds, and (iii) useful enzymes for the degradation of environmental pollution substances. These results can be utilized in various areas of industries, to enhance food production, to improve the efficiency of pharmaceutical compound production, and to promote the bioremediation of contaminated soil and water.
A stoichiometry driven universal spatial organization of backbones of folded proteins: are there Chargaff's rules for protein folding?

PubMed

Mittal, A; Jayaram, B; Shenoy, Sandhya; Bawa, Tejdeep Singh

2010-10-01

Protein folding is at least a six decade old problem, since the times of Pauling and Anfinsen. However, rules of protein folding remain elusive till date. In this work, rigorous analyses of several thousand crystal structures of folded proteins reveal a surprisingly simple unifying principle of backbone organization in protein folding. We find that protein folding is a direct consequence of a narrow band of stoichiometric occurrences of amino-acids in primary sequences, regardless of the size and the fold of a protein. We observe that "preferential interactions" between amino-acids do not drive protein folding, contrary to all prevalent views. We dedicate our discovery to the seminal contribution of Chargaff which was one of the major keys to elucidation of the stoichiometry-driven spatially organized double helical structure of DNA.
Destabilization of psychrotrophic RNase HI in a localized fashion as revealed by mutational and X-ray crystallographic analyses.

PubMed

Rohman, Muhammad S; Tadokoro, Takashi; Angkawidjaja, Clement; Abe, Yumi; Matsumura, Hiroyoshi; Koga, Yuichi; Takano, Kazufumi; Kanaya, Shigenori

2009-01-01

The Arg97 --> Gly and Asp136 --> His mutations stabilized So-RNase HI from the psychrotrophic bacterium Shewanella oneidensis MR-1 by 5.4 and 9.7 degrees C, respectively, in T(m), and 3.5 and 6.1 kJ x mol(-1), respectively, in DeltaG(H2O). These mutations also stabilized the So-RNase HI derivative (4x-RNase HI) with quadruple thermostabilizing mutations in an additive manner. As a result, the resultant sextuple mutant protein (6x-RNase HI) was more stable than the wild-type protein by 28.8 degrees C in T(m) and 27.0 kJ x mol(-1) in DeltaG(H2O). To analyse the effects of the mutations on the protein structure, the crystal structure of the 6x-RNase HI protein was determined at 2.5 A resolution. The main chain fold and interactions of the side-chains of the 6x-RNase HI protein were basically identical to those of the wild-type protein, except for the mutation sites. These results indicate that all six mutations independently affect the protein structure, and are consistent with the fact that the thermostabilizing effects of the mutations are roughly additive. The introduction of favourable interactions and the elimination of unfavourable interactions by the mutations contribute to the stabilization of the 6x-RNase HI protein. We propose that So-RNase HI is destabilized when compared with its mesophilic and thermophilic counterparts in a localized fashion by increasing the number of amino acid residues unfavourable for protein stability.
The NBS-LRR architectures of plant R-proteins and metazoan NLRs evolved in independent events

PubMed Central

Urbach, Jonathan M.; Ausubel, Frederick M.

2017-01-01

There are intriguing parallels between plants and animals, with respect to the structures of their innate immune receptors, that suggest universal principles of innate immunity. The cytosolic nucleotide binding site–leucine rich repeat (NBS-LRR) resistance proteins of plants (R-proteins) and the so-called NOD-like receptors of animals (NLRs) share a domain architecture that includes a STAND (signal transduction ATPases with numerous domains) family NTPase followed by a series of LRRs, suggesting inheritance from a common ancestor with that architecture. Focusing on the STAND NTPases of plant R-proteins, animal NLRs, and their homologs that represent the NB-ARC (nucleotide-binding adaptor shared by APAF-1, certain R gene products and CED-4) and NACHT (named for NAIP, CIIA, HET-E, and TEP1) subfamilies of the STAND NTPases, we analyzed the phylogenetic distribution of the NBS-LRR domain architecture, used maximum-likelihood methods to infer a phylogeny of the NTPase domains of R-proteins, and reconstructed the domain structure of the protein containing the common ancestor of the STAND NTPase domain of R-proteins and NLRs. Our analyses reject monophyly of plant R-proteins and NLRs and suggest that the protein containing the last common ancestor of the STAND NTPases of plant R-proteins and animal NLRs (and, by extension, all NB-ARC and NACHT domains) possessed a domain structure that included a STAND NTPase paired with a series of tetratricopeptide repeats. These analyses reject the hypothesis that the domain architecture of R-proteins and NLRs was inherited from a common ancestor and instead suggest the domain architecture evolved at least twice. It remains unclear whether the NBS-LRR architectures were innovations of plants and animals themselves or were acquired by one or both lineages through horizontal gene transfer. PMID:28096345

Application potential of ATR-FT/IR molecular spectroscopy in animal nutrition: revelation of protein molecular structures of canola meal and presscake, as affected by heat-processing methods, in relationship with their protein digestive behavior and utilization for dairy cattle.

PubMed

Theodoridou, Katerina; Yu, Peiqiang

2013-06-12

Protein quality relies not only on total protein but also on protein inherent structures. The most commonly occurring protein secondary structures (α-helix and β-sheet) may influence protein quality, nutrient utilization, and digestive behavior. The objectives of this study were to reveal the protein molecular structures of canola meal (yellow and brown) and presscake as affected by the heat-processing methods and to investigate the relationship between structure changes and protein rumen degradations kinetics, estimated protein intestinal digestibility, degraded protein balance, and metabolizable protein. Heat-processing conditions resulted in a higher value for α-helix and β-sheet for brown canola presscake compared to brown canola meal. The multivariate molecular spectral analyses (PCA, CLA) showed that there were significant molecular structural differences in the protein amide I and II fingerprint region (ca. 1700-1480 cm(-1)) between the brown canola meal and presscake. The in situ degradation parameters, amide I and II, and α-helix to β-sheet ratio (R_a_β) were positively correlated with the degradable fraction and the degradation rate. Modeling results showed that α-helix was positively correlated with the truly absorbed rumen synthesized microbial protein in the small intestine when using both the Dutch DVE/OEB system and the NRC-2001 model. Concerning the protein profiles, R_a_β was a better predictor for crude protein (79%) and for neutral detergent insoluble crude protein (68%). In conclusion, ATR-FT/IR molecular spectroscopy may be used to rapidly characterize feed structures at the molecular level and also as a potential predictor of feed functionality, digestive behavior, and nutrient utilization of canola feed.
Ab Initio Structural Modeling of and Experimental Validation for Chlamydia trachomatis Protein CT296 Reveal Structural Similarity to Fe(II) 2-Oxoglutarate-Dependent Enzymes▿

PubMed Central

Kemege, Kyle E.; Hickey, John M.; Lovell, Scott; Battaile, Kevin P.; Zhang, Yang; Hefty, P. Scott

2011-01-01

Chlamydia trachomatis is a medically important pathogen that encodes a relatively high percentage of proteins with unknown function. The three-dimensional structure of a protein can be very informative regarding the protein's functional characteristics; however, determining protein structures experimentally can be very challenging. Computational methods that model protein structures with sufficient accuracy to facilitate functional studies have had notable successes. To evaluate the accuracy and potential impact of computational protein structure modeling of hypothetical proteins encoded by Chlamydia, a successful computational method termed I-TASSER was utilized to model the three-dimensional structure of a hypothetical protein encoded by open reading frame (ORF) CT296. CT296 has been reported to exhibit functional properties of a divalent cation transcription repressor (DcrA), with similarity to the Escherichia coli iron-responsive transcriptional repressor, Fur. Unexpectedly, the I-TASSER model of CT296 exhibited no structural similarity to any DNA-interacting proteins or motifs. To validate the I-TASSER-generated model, the structure of CT296 was solved experimentally using X-ray crystallography. Impressively, the ab initio I-TASSER-generated model closely matched (2.72-Å Cα root mean square deviation [RMSD]) the high-resolution (1.8-Å) crystal structure of CT296. Modeled and experimentally determined structures of CT296 share structural characteristics of non-heme Fe(II) 2-oxoglutarate-dependent enzymes, although key enzymatic residues are not conserved, suggesting a unique biochemical process is likely associated with CT296 function. Additionally, functional analyses did not support prior reports that CT296 has properties shared with divalent cation repressors such as Fur. PMID:21965559
Ab initio structural modeling of and experimental validation for Chlamydia trachomatis protein CT296 reveal structural similarity to Fe(II) 2-oxoglutarate-dependent enzymes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kemege, Kyle E.; Hickey, John M.; Lovell, Scott

2012-02-13

Chlamydia trachomatis is a medically important pathogen that encodes a relatively high percentage of proteins with unknown function. The three-dimensional structure of a protein can be very informative regarding the protein's functional characteristics; however, determining protein structures experimentally can be very challenging. Computational methods that model protein structures with sufficient accuracy to facilitate functional studies have had notable successes. To evaluate the accuracy and potential impact of computational protein structure modeling of hypothetical proteins encoded by Chlamydia, a successful computational method termed I-TASSER was utilized to model the three-dimensional structure of a hypothetical protein encoded by open reading frame (ORF)more » CT296. CT296 has been reported to exhibit functional properties of a divalent cation transcription repressor (DcrA), with similarity to the Escherichia coli iron-responsive transcriptional repressor, Fur. Unexpectedly, the I-TASSER model of CT296 exhibited no structural similarity to any DNA-interacting proteins or motifs. To validate the I-TASSER-generated model, the structure of CT296 was solved experimentally using X-ray crystallography. Impressively, the ab initio I-TASSER-generated model closely matched (2.72-{angstrom} C{alpha} root mean square deviation [RMSD]) the high-resolution (1.8-{angstrom}) crystal structure of CT296. Modeled and experimentally determined structures of CT296 share structural characteristics of non-heme Fe(II) 2-oxoglutarate-dependent enzymes, although key enzymatic residues are not conserved, suggesting a unique biochemical process is likely associated with CT296 function. Additionally, functional analyses did not support prior reports that CT296 has properties shared with divalent cation repressors such as Fur.« less
Integration of structural dynamics and molecular evolution via protein interaction networks: a new era in genomic medicine.

PubMed

Kumar, Avishek; Butler, Brandon M; Kumar, Sudhir; Ozkan, S Banu

2015-12-01

Sequencing technologies are revealing many new non-synonymous single nucleotide variants (nsSNVs) in each personal exome. To assess their functional impacts, comparative genomics is frequently employed to predict if they are benign or not. However, evolutionary analysis alone is insufficient, because it misdiagnoses many disease-associated nsSNVs, such as those at positions involved in protein interfaces, and because evolutionary predictions do not provide mechanistic insights into functional change or loss. Structural analyses can aid in overcoming both of these problems by incorporating conformational dynamics and allostery in nSNV diagnosis. Finally, protein-protein interaction networks using systems-level methodologies shed light onto disease etiology and pathogenesis. Bridging these network approaches with structurally resolved protein interactions and dynamics will advance genomic medicine. Copyright © 2015 Elsevier Ltd. All rights reserved.
SOLEIL shining on the solution-state structure of biomacromolecules by synchrotron X-ray footprinting at the Metrology beamline.

PubMed

Baud, A; Aymé, L; Gonnet, F; Salard, I; Gohon, Y; Jolivet, P; Brodolin, K; Da Silva, P; Giuliani, A; Sclavi, B; Chardot, T; Mercère, P; Roblin, P; Daniel, R

2017-05-01

Synchrotron X-ray footprinting complements the techniques commonly used to define the structure of molecules such as crystallography, small-angle X-ray scattering and nuclear magnetic resonance. It is remarkably useful in probing the structure and interactions of proteins with lipids, nucleic acids or with other proteins in solution, often better reflecting the in vivo state dynamics. To date, most X-ray footprinting studies have been carried out at the National Synchrotron Light Source, USA, and at the European Synchrotron Radiation Facility in Grenoble, France. This work presents X-ray footprinting of biomolecules performed for the first time at the X-ray Metrology beamline at the SOLEIL synchrotron radiation source. The installation at this beamline of a stopped-flow apparatus for sample delivery, an irradiation capillary and an automatic sample collector enabled the X-ray footprinting study of the structure of the soluble protein factor H (FH) from the human complement system as well as of the lipid-associated hydrophobic protein S3 oleosin from plant seed. Mass spectrometry analysis showed that the structural integrity of both proteins was not affected by the short exposition to the oxygen radicals produced during the irradiation. Irradiated molecules were subsequently analysed using high-resolution mass spectrometry to identify and locate oxidized amino acids. Moreover, the analyses of FH in its free state and in complex with complement C3b protein have allowed us to create a map of reactive solvent-exposed residues on the surface of FH and to observe the changes in oxidation of FH residues upon C3b binding. Studies of the solvent accessibility of the S3 oleosin show that X-ray footprinting offers also a unique approach to studying the structure of proteins embedded within membranes or lipid bodies. All the biomolecular applications reported herein demonstrate that the Metrology beamline at SOLEIL can be successfully used for synchrotron X-ray footprinting of biomolecules.
Dissecting the Calcium-Induced Differentiation of Human Primary Keratinocytes Stem Cells by Integrative and Structural Network Analyses

PubMed Central

Toufighi, Kiana; Yang, Jae-Seong; Luis, Nuno Miguel; Aznar Benitah, Salvador; Lehner, Ben; Serrano, Luis; Kiel, Christina

2015-01-01

The molecular details underlying the time-dependent assembly of protein complexes in cellular networks, such as those that occur during differentiation, are largely unexplored. Focusing on the calcium-induced differentiation of primary human keratinocytes as a model system for a major cellular reorganization process, we look at the expression of genes whose products are involved in manually-annotated protein complexes. Clustering analyses revealed only moderate co-expression of functionally related proteins during differentiation. However, when we looked at protein complexes, we found that the majority (55%) are composed of non-dynamic and dynamic gene products (‘di-chromatic’), 19% are non-dynamic, and 26% only dynamic. Considering three-dimensional protein structures to predict steric interactions, we found that proteins encoded by dynamic genes frequently interact with a common non-dynamic protein in a mutually exclusive fashion. This suggests that during differentiation, complex assemblies may also change through variation in the abundance of proteins that compete for binding to common proteins as found in some cases for paralogous proteins. Considering the example of the TNF-α/NFκB signaling complex, we suggest that the same core complex can guide signals into diverse context-specific outputs by addition of time specific expressed subunits, while keeping other cellular functions constant. Thus, our analysis provides evidence that complex assembly with stable core components and competition could contribute to cell differentiation. PMID:25946651
3D Complex: A Structural Classification of Protein Complexes

PubMed Central

Levy, Emmanuel D; Pereira-Leal, Jose B; Chothia, Cyrus; Teichmann, Sarah A

2006-01-01

Most of the proteins in a cell assemble into complexes to carry out their function. It is therefore crucial to understand the physicochemical properties as well as the evolution of interactions between proteins. The Protein Data Bank represents an important source of information for such studies, because more than half of the structures are homo- or heteromeric protein complexes. Here we propose the first hierarchical classification of whole protein complexes of known 3-D structure, based on representing their fundamental structural features as a graph. This classification provides the first overview of all the complexes in the Protein Data Bank and allows nonredundant sets to be derived at different levels of detail. This reveals that between one-half and two-thirds of known structures are multimeric, depending on the level of redundancy accepted. We also analyse the structures in terms of the topological arrangement of their subunits and find that they form a small number of arrangements compared with all theoretically possible ones. This is because most complexes contain four subunits or less, and the large majority are homomeric. In addition, there is a strong tendency for symmetry in complexes, even for heteromeric complexes. Finally, through comparison of Biological Units in the Protein Data Bank with the Protein Quaternary Structure database, we identified many possible errors in quaternary structure assignments. Our classification, available as a database and Web server at http://www.3Dcomplex.org, will be a starting point for future work aimed at understanding the structure and evolution of protein complexes. PMID:17112313
Knowledge-based prediction of protein backbone conformation using a structural alphabet.

PubMed

Vetrivel, Iyanar; Mahajan, Swapnil; Tyagi, Manoj; Hoffmann, Lionel; Sanejouand, Yves-Henri; Srinivasan, Narayanaswamy; de Brevern, Alexandre G; Cadet, Frédéric; Offmann, Bernard

2017-01-01

Libraries of structural prototypes that abstract protein local structures are known as structural alphabets and have proven to be very useful in various aspects of protein structure analyses and predictions. One such library, Protein Blocks, is composed of 16 standard 5-residues long structural prototypes. This form of analyzing proteins involves drafting its structure as a string of Protein Blocks. Predicting the local structure of a protein in terms of protein blocks is the general objective of this work. A new approach, PB-kPRED is proposed towards this aim. It involves (i) organizing the structural knowledge in the form of a database of pentapeptide fragments extracted from all protein structures in the PDB and (ii) applying a knowledge-based algorithm that does not rely on any secondary structure predictions and/or sequence alignment profiles, to scan this database and predict most probable backbone conformations for the protein local structures. Though PB-kPRED uses the structural information from homologues in preference, if available. The predictions were evaluated rigorously on 15,544 query proteins representing a non-redundant subset of the PDB filtered at 30% sequence identity cut-off. We have shown that the kPRED method was able to achieve mean accuracies ranging from 40.8% to 66.3% depending on the availability of homologues. The impact of the different strategies for scanning the database on the prediction was evaluated and is discussed. Our results highlight the usefulness of the method in the context of proteins without any known structural homologues. A scoring function that gives a good estimate of the accuracy of prediction was further developed. This score estimates very well the accuracy of the algorithm (R2 of 0.82). An online version of the tool is provided freely for non-commercial usage at http://www.bo-protscience.fr/kpred/.
Synchrotron IR microspectroscopy for protein structure analysis: Potential and questions

DOE PAGES

Yu, Peiqiang

2006-01-01

Synchrotron radiation-based Fourier transform infrared microspectroscopy (S-FTIR) has been developed as a rapid, direct, non-destructive, bioanalytical technique. This technique takes advantage of synchrotron light brightness and small effective source size and is capable of exploring the molecular chemical make-up within microstructures of a biological tissue without destruction of inherent structures at ultra-spatial resolutions within cellular dimension. To date there has been very little application of this advanced technique to the study of pure protein inherent structure at a cellular level in biological tissues. In this review, a novel approach was introduced to show the potential of the newly developed, advancedmore » synchrotron-based analytical technology, which can be used to localize relatively “pure“ protein in the plant tissues and relatively reveal protein inherent structure and protein molecular chemical make-up within intact tissue at cellular and subcellular levels. Several complex protein IR spectra data analytical techniques (Gaussian and Lorentzian multi-component peak modeling, univariate and multivariate analysis, principal component analysis (PCA), and hierarchical cluster analysis (CLA) are employed to relatively reveal features of protein inherent structure and distinguish protein inherent structure differences between varieties/species and treatments in plant tissues. By using a multi-peak modeling procedure, RELATIVE estimates (but not EXACT determinations) for protein secondary structure analysis can be made for comparison purpose. The issues of pro- and anti-multi-peaking modeling/fitting procedure for relative estimation of protein structure were discussed. By using the PCA and CLA analyses, the plant molecular structure can be qualitatively separate one group from another, statistically, even though the spectral assignments are not known. The synchrotron-based technology provides a new approach for protein structure research in biological tissues at ultraspatial resolutions.« less
Structure and mechanism of maximum stability of isolated alpha-helical protein domains at a critical length scale.

PubMed

Qin, Zhao; Fabre, Andrea; Buehler, Markus J

2013-05-01

The stability of alpha helices is important in protein folding, bioinspired materials design, and controls many biological properties under physiological and disease conditions. Here we show that a naturally favored alpha helix length of 9 to 17 amino acids exists at which the propensity towards the formation of this secondary structure is maximized. We use a combination of thermodynamical analysis, well-tempered metadynamics molecular simulation and statistical analyses of experimental alpha helix length distributions and find that the favored alpha helix length is caused by a competition between alpha helix folding, unfolding into a random coil and formation of higher-order tertiary structures. The theoretical result is suggested to be used to explain the statistical distribution of the length of alpha helices observed in natural protein structures. Our study provides mechanistic insight into fundamental controlling parameters in alpha helix structure formation and potentially other biopolymers or synthetic materials. The result advances our fundamental understanding of size effects in the stability of protein structures and may enable the design of de novo alpha-helical protein materials.
Bioinformatic Analysis of Strawberry GSTF12 Gene

NASA Astrophysics Data System (ADS)

Wang, Xiran; Jiang, Leiyu; Tang, Haoru

2018-01-01

GSTF12 has always been known as a key factor of proanthocyanins accumulate in plant testa. Through bioinformatics analysis of the nucleotide and encoded protein sequence of GSTF12, it is more advantageous to the study of genes related to anthocyanin biosynthesis accumulation pathway. Therefore, we chosen GSTF12 gene of 11 kinds species, downloaded their nucleotide and protein sequence from NCBI as the research object, found strawberry GSTF12 gene via bioinformation analyse, constructed phylogenetic tree. At the same time, we analysed the strawberry GSTF12 gene of physical and chemical properties and its protein structure and so on. The phylogenetic tree showed that Strawberry and petunia were closest relative. By the protein prediction, we found that the protein owed one proper signal peptide without obvious transmembrane regions.
The Vip3Ag4 Insecticidal Protoxin from Bacillus thuringiensis Adopts A Tetrameric Configuration That Is Maintained on Proteolysis

PubMed Central

Palma, Leopoldo; Scott, David J.; Harris, Gemma; Din, Salah-Ud; Williams, Thomas L.; Roberts, Oliver J.; Young, Mark T.; Caballero, Primitivo; Berry, Colin

2017-01-01

The Vip3 proteins produced during vegetative growth by strains of the bacterium Bacillus thuringiensis show insecticidal activity against lepidopteran insects with a mechanism of action that may involve pore formation and apoptosis. These proteins are promising supplements to our arsenal of insecticidal proteins, but the molecular details of their activity are not understood. As a first step in the structural characterisation of these proteins, we have analysed their secondary structure and resolved the surface topology of a tetrameric complex of the Vip3Ag4 protein by transmission electron microscopy. Sites sensitive to proteolysis by trypsin are identified and the trypsin-cleaved protein appears to retain a similar structure as an octomeric complex comprising four copies each of the ~65 kDa and ~21 kDa products of proteolysis. This processed form of the toxin may represent the active toxin. The quality and monodispersity of the protein produced in this study make Vip3Ag4 a candidate for more detailed structural analysis using cryo-electron microscopy. PMID:28505109
p3d--Python module for structural bioinformatics.

PubMed

Fufezan, Christian; Specht, Michael

2009-08-21

High-throughput bioinformatic analysis tools are needed to mine the large amount of structural data via knowledge based approaches. The development of such tools requires a robust interface to access the structural data in an easy way. For this the Python scripting language is the optimal choice since its philosophy is to write an understandable source code. p3d is an object oriented Python module that adds a simple yet powerful interface to the Python interpreter to process and analyse three dimensional protein structure files (PDB files). p3d's strength arises from the combination of a) very fast spatial access to the structural data due to the implementation of a binary space partitioning (BSP) tree, b) set theory and c) functions that allow to combine a and b and that use human readable language in the search queries rather than complex computer language. All these factors combined facilitate the rapid development of bioinformatic tools that can perform quick and complex analyses of protein structures. p3d is the perfect tool to quickly develop tools for structural bioinformatics using the Python scripting language.
Structural and evolutionary analysis of Leishmania Alba proteins.

PubMed

da Costa, Kauê Santana; Galúcio, João Marcos Pereira; Leonardo, Elvis Santos; Cardoso, Guelber; Leal, Élcio; Conde, Guilherme; Lameira, Jerônimo

2017-10-01

The Alba superfamily proteins share a common RNA-binding domain. These proteins participate in a variety of regulatory pathways by controlling developmental gene expression. They also interact with ribosomal subunits, translation factors, and other RNA-binding proteins. The Leishmania infantum genome encodes two Alba-domain proteins, LiAlba1 and LiAlba3. In this work, we used homology modeling, protein-protein docking, and molecular dynamics (MD) simulations to explore the details of the Alba1-Alba3-RNA complex from Leishmania infantum at the molecular level. In addition, we compared the structure of LiAlba3 with the human ribonuclease P component, Rpp20. We also mapped the ligand-binding residues on the Alba3 surface to analyze its druggability and performed mutational analyses in Alba3 using alanine scanning to identify residues involved in its function and structural stability. These results suggest that the RGG-box motif of LiAlba1 is important for protein function and stability. Finally, we discuss the function of Alba proteins in the context of pathogen adaptation to host cells. The data provided herein will facilitate further translational research regarding Alba structure and function. Copyright © 2017 Elsevier B.V. All rights reserved.
The High-Resolution Structure of Activated Opsin Reveals a Conserved Solvent Network in the Transmembrane Region Essential for Activation.

PubMed

Blankenship, Elise; Vahedi-Faridi, Ardeschir; Lodowski, David T

2015-12-01

Rhodopsin, a light-activated G protein coupled receptor (GPCR), has been the subject of numerous biochemical and structural investigations, serving as a model receptor for GPCRs and their activation. We present the 2.3-Å resolution structure of native source rhodopsin stabilized in a conformation competent for G protein binding. An extensive water-mediated hydrogen bond network linking the chromophore binding site to the site of G protein binding is observed, providing connections to conserved motifs essential for GPCR activation. Comparison of this extensive solvent-mediated hydrogen-bonding network with the positions of ordered solvent in earlier crystallographic structures of rhodopsin photointermediates reveals both static structural and dynamic functional water-protein interactions present during the activation process. When considered along with observations that solvent occupies similar positions in the structures of other GPCRs, these analyses strongly support an integral role for this dynamic ordered water network in both rhodopsin and GPCR activation. Copyright © 2015 Elsevier Ltd. All rights reserved.
Structural pierce into molecular mechanism underlying Clostridium perfringens Epsilon toxin function.

PubMed

Khalili, Saeed; Jahangiri, Abolfazl; Hashemi, Zahra Sadat; Khalesi, Bahman; Mard-Soltani, Maysam; Amani, Jafar

2017-03-01

Epsilon toxin of the Clostridium perfringens garnered a lot of attention due to its potential for toxicity in humans, extreme potency for cytotoxicity in mice and lack of any approved therapeutics prescribed for human. However, the intricacies of the Epsilon toxin action mechanism are yet to be understood. In this regard, various in silico tools have been exploited to model and refine the 3D structure of the toxin and its two receptors. The receptor proteins were embedded into designed lipid membranes within an aqueous and ionized environment. Thereafter, the modeled structures subjected to series of consecutive molecular dynamics runs to achieve the most natural like coordination for each model. Ultimately, protein-protein interaction analyses were performed to understand the probable action mechanism. The obtained results successfully confirmed the accuracy of employed methods to achieve high quality models for the toxin and its receptors within their lipid bilayers. Molecular dynamics analyses lead the structures to a more native like coordination. Moreover, the results of previous empirical studies were confirmed, while new insights for action mechanisms including the detailed roles of Hepatitis A virus cellular receptor 1 (HAVCR1) and Myelin and lymphocyte protein (MAL) proteins were achieved. In light of previous and our observations, we suggested novel models which elucidated the existing interplay between potential players of Epsilon toxin action mechanism with detailed structural evidences. These models would pave the way to have more robust understanding of the Epsilon toxin biology, more precise vaccine construction and more successful drug (inhibitor) design. Copyright © 2017 Elsevier Ltd. All rights reserved.
Investigating the Structural Compaction of Biomolecules Upon Transition to the Gas-Phase Using ESI-TWIMS-MS.

PubMed

Devine, Paul W A; Fisher, Henry C; Calabrese, Antonio N; Whelan, Fiona; Higazi, Daniel R; Potts, Jennifer R; Lowe, David C; Radford, Sheena E; Ashcroft, Alison E

2017-09-01

Collision cross-section (CCS) measurements obtained from ion mobility spectrometry-mass spectrometry (IMS-MS) analyses often provide useful information concerning a protein's size and shape and can be complemented by modeling procedures. However, there have been some concerns about the extent to which certain proteins maintain a native-like conformation during the gas-phase analysis, especially proteins with dynamic or extended regions. Here we have measured the CCSs of a range of biomolecules including non-globular proteins and RNAs of different sequence, size, and stability. Using traveling wave IMS-MS, we show that for the proteins studied, the measured CCS deviates significantly from predicted CCS values based upon currently available structures. The results presented indicate that these proteins collapse to different extents varying on their elongated structures upon transition into the gas-phase. Comparing two RNAs of similar mass but different solution structures, we show that these biomolecules may also be susceptible to gas-phase compaction. Together, the results suggest that caution is needed when predicting structural models based on CCS data for RNAs as well as proteins with non-globular folds. Graphical Abstract ᅟ.
Aromatic Cluster Sensor of Protein Folding: Near-UV Electronic Circular Dichroism Bands Assigned to Fold Compactness.

PubMed

Farkas, Viktor; Jákli, Imre; Tóth, Gábor K; Perczel, András

2016-09-19

Both far- and near-UV electronic circular dichroism (ECD) spectra have bands sensitive to thermal unfolding of Trp and Tyr residues containing proteins. Beside spectral changes at 222 nm reporting secondary structural variations (far-UV range), L b bands (near-UV range) are applicable as 3D-fold sensors of protein's core structure. In this study we show that both L b (Tyr) and L b (Trp) ECD bands could be used as sensors of fold compactness. ECD is a relative method and thus requires NMR referencing and cross-validation, also provided here. The ensemble of 204 ECD spectra of Trp-cage miniproteins is analysed as a training set for "calibrating" Trp↔Tyr folded systems of known NMR structure. While in the far-UV ECD spectra changes are linear as a function of the temperature, near-UV ECD data indicate a non-linear and thus, cooperative unfolding mechanism of these proteins. Ensemble of ECD spectra deconvoluted gives both conformational weights and insight to a protein folding↔unfolding mechanism. We found that the L b 293 band is reporting on the 3D-structure compactness. In addition, the pure near-UV ECD spectrum of the unfolded state is described here for the first time. Thus, ECD folding information now validated can be applied with confidence in a large thermal window (5≤T≤85 °C) compared to NMR for studying the unfolding of Trp↔Tyr residue pairs. In conclusion, folding propensities of important proteins (RNA polymerase II, ubiquitin protein ligase, tryptase-inhibitor etc.) can now be analysed with higher confidence. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Preservation of protein clefts in comparative models.

PubMed

Piedra, David; Lois, Sergi; de la Cruz, Xavier

2008-01-16

Comparative, or homology, modelling of protein structures is the most widely used prediction method when the target protein has homologues of known structure. Given that the quality of a model may vary greatly, several studies have been devoted to identifying the factors that influence modelling results. These studies usually consider the protein as a whole, and only a few provide a separate discussion of the behaviour of biologically relevant features of the protein. Given the value of the latter for many applications, here we extended previous work by analysing the preservation of native protein clefts in homology models. We chose to examine clefts because of their role in protein function/structure, as they are usually the locus of protein-protein interactions, host the enzymes' active site, or, in the case of protein domains, can also be the locus of domain-domain interactions that lead to the structure of the whole protein. We studied how the largest cleft of a protein varies in comparative models. To this end, we analysed a set of 53507 homology models that cover the whole sequence identity range, with a special emphasis on medium and low similarities. More precisely we examined how cleft quality - measured using six complementary parameters related to both global shape and local atomic environment, depends on the sequence identity between target and template proteins. In addition to this general analysis, we also explored the impact of a number of factors on cleft quality, and found that the relationship between quality and sequence identity varies depending on cleft rank amongst the set of protein clefts (when ordered according to size), and number of aligned residues. We have examined cleft quality in homology models at a range of seq.id. levels. Our results provide a detailed view of how quality is affected by distinct parameters and thus may help the user of comparative modelling to determine the final quality and applicability of his/her cleft models. In addition, the large variability in model quality that we observed within each sequence bin, with good models present even at low sequence identities (between 20% and 30%), indicates that properly developed identification methods could be used to recover good cleft models in this sequence range.
Molecular and functional analyses of a maize autoactive NB-LRR protein identify precise structural requirements for activity.

PubMed

Wang, Guan-Feng; Ji, Jiabing; El-Kasmi, Farid; Dangl, Jeffery L; Johal, Guri; Balint-Kurti, Peter J

2015-02-01

Plant disease resistance is often mediated by nucleotide binding-leucine rich repeat (NLR) proteins which remain auto-inhibited until recognition of specific pathogen-derived molecules causes their activation, triggering a rapid, localized cell death called a hypersensitive response (HR). Three domains are recognized in one of the major classes of NLR proteins: a coiled-coil (CC), a nucleotide binding (NB-ARC) and a leucine rich repeat (LRR) domains. The maize NLR gene Rp1-D21 derives from an intergenic recombination event between two NLR genes, Rp1-D and Rp1-dp2 and confers an autoactive HR. We report systematic structural and functional analyses of Rp1 proteins in maize and N. benthamiana to characterize the molecular mechanism of NLR activation/auto-inhibition. We derive a model comprising the following three main features: Rp1 proteins appear to self-associate to become competent for activity. The CC domain is signaling-competent and is sufficient to induce HR. This can be suppressed by the NB-ARC domain through direct interaction. In autoactive proteins, the interaction of the LRR domain with the NB-ARC domain causes de-repression and thus disrupts the inhibition of HR. Further, we identify specific amino acids and combinations thereof that are important for the auto-inhibition/activity of Rp1 proteins. We also provide evidence for the function of MHD2, a previously uncharacterized, though widely conserved NLR motif. This work reports several novel insights into the precise structural requirement for NLR function and informs efforts towards utilizing these proteins for engineering disease resistance.

Structural insights of the MLF1/14-3-3 interaction.

PubMed

Molzan, Manuela; Weyand, Michael; Rose, Rolf; Ottmann, Christian

2012-02-01

Myeloid leukaemia factor 1 (MLF1) binds to 14-3-3 adapter proteins by a sequence surrounding Ser34 with the functional consequences of this interaction largely unknown. We present here the high-resolution crystal structure of this binding motif [MLF1(29-42)pSer34] in complex with 14-3-3ε and analyse the interaction with isothermal titration calorimetry. Fragment-based ligand discovery employing crystals of the binary 14-3-3ε/MLF1(29-42)pSer34 complex was used to identify a molecule that binds to the interface rim of the two proteins, potentially representing the starting point for the development of a small molecule that stabilizes the MLF1/14-3-3 protein-protein interaction. Such a compound might be used as a chemical biology tool to further analyse the 14-3-3/MLF1 interaction without the use of genetic methods. Database Structural data are available in the Protein Data Bank under the accession number(s) 3UAL [14-3-3ε/MLF1(29-42)pSer34 complex] and 3UBW [14-3-3ε/MLF1(29-42)pSer34/3-pyrrolidinol complex] Structured digital abstract • 14-3-3 epsilon and MLF1 bind by x-ray crystallography (View interaction) • 14-3-3 epsilon and MLF1 bind by isothermal titration calorimetry (View Interaction: 1, 2). © 2011 The Authors Journal compilation © 2011 FEBS.
Structural and Functional Studies of H. seropedicae RecA Protein – Insights into the Polymerization of RecA Protein as Nucleoprotein Filament

DOE Office of Scientific and Technical Information (OSTI.GOV)

Leite, Wellington C.; Galvão, Carolina W.; Saab, Sérgio C.

The bacterial RecA protein plays a role in the complex system of DNA damage repair. Here, we report the functional and structural characterization of the Herbaspirillum seropedicae RecA protein (HsRecA). HsRecA protein is more efficient at displacing SSB protein from ssDNA than Escherichia coli RecA protein. HsRecA also promotes DNA strand exchange more efficiently. The three dimensional structure of HsRecA-ADP/ATP complex has been solved to 1.7 Å resolution. HsRecA protein contains a small N-terminal domain, a central core ATPase domain and a large C-terminal domain, that are similar to homologous bacterial RecA proteins. Comparative structural analysis showed that the N-terminalmore » polymerization motif of archaeal and eukaryotic RecA family proteins are also present in bacterial RecAs. Reconstruction of electrostatic potential from the hexameric structure of HsRecA-ADP/ATP revealed a high positive charge along the inner side, where ssDNA is bound inside the filament. The properties of this surface may explain the greater capacity of HsRecA protein to bind ssDNA, forming a contiguous nucleoprotein filament, displace SSB and promote DNA exchange relative to EcRecA. In conclusion, our functional and structural analyses provide insight into the molecular mechanisms of polymerization of bacterial RecA as a helical nucleoprotein filament.« less
Structural and Functional Studies of H. seropedicae RecA Protein – Insights into the Polymerization of RecA Protein as Nucleoprotein Filament

PubMed Central

Galvão, Carolina W.; Saab, Sérgio C.; Iulek, Jorge; Etto, Rafael M.; Steffens, Maria B. R.; Chitteni-Pattu, Sindhu; Stanage, Tyler; Keck, James L.; Cox, Michael M.

2016-01-01

The bacterial RecA protein plays a role in the complex system of DNA damage repair. Here, we report the functional and structural characterization of the Herbaspirillum seropedicae RecA protein (HsRecA). HsRecA protein is more efficient at displacing SSB protein from ssDNA than Escherichia coli RecA protein. HsRecA also promotes DNA strand exchange more efficiently. The three dimensional structure of HsRecA-ADP/ATP complex has been solved to 1.7 Å resolution. HsRecA protein contains a small N-terminal domain, a central core ATPase domain and a large C-terminal domain, that are similar to homologous bacterial RecA proteins. Comparative structural analysis showed that the N-terminal polymerization motif of archaeal and eukaryotic RecA family proteins are also present in bacterial RecAs. Reconstruction of electrostatic potential from the hexameric structure of HsRecA-ADP/ATP revealed a high positive charge along the inner side, where ssDNA is bound inside the filament. The properties of this surface may explain the greater capacity of HsRecA protein to bind ssDNA, forming a contiguous nucleoprotein filament, displace SSB and promote DNA exchange relative to EcRecA. Our functional and structural analyses provide insight into the molecular mechanisms of polymerization of bacterial RecA as a helical nucleoprotein filament. PMID:27447485
Effects of storage temperature on airway exosome integrity for diagnostic and functional analyses

PubMed Central

Maroto, Rosario; Zhao, Yingxin; Jamaluddin, Mohammad; Popov, Vsevolod L.; Wang, Hongwang; Kalubowilage, Madumali; Zhang, Yueqing; Luisi, Jonathan; Sun, Hong; Culbertson, Christopher T.; Bossmann, Stefan H.; Motamedi, Massoud; Brasier, Allan R.

2017-01-01

ABSTRACT Background: Extracellular vesicles contain biological molecules specified by cell-type of origin and modified by microenvironmental changes. To conduct reproducible studies on exosome content and function, storage conditions need to have minimal impact on airway exosome integrity. Aim: We compared surface properties and protein content of airway exosomes that had been freshly isolated vs. those that had been treated with cold storage or freezing. Methods: Mouse bronchoalveolar lavage fluid (BALF) exosomes purified by differential ultracentrifugation were analysed immediately or stored at +4°C or −80°C. Exosomal structure was assessed by dynamic light scattering (DLS), transmission electron microscopy (TEM) and charge density (zeta potential, ζ). Exosomal protein content, including leaking/dissociating proteins, were identified by label-free LC-MS/MS. Results: Freshly isolated BALF exosomes exhibited a mean diameter of 95 nm and characteristic morphology. Storage had significant impact on BALF exosome size and content. Compared to fresh, exosomes stored at +4°C had a 10% increase in diameter, redistribution to polydisperse aggregates and reduced ζ. Storage at −80°C produced an even greater effect, resulting in a 25% increase in diameter, significantly reducing the ζ, resulting in multilamellar structure formation. In fresh exosomes, we identified 1140 high-confidence proteins enriched in 19 genome ontology biological processes. After storage at room temperature, 848 proteins were identified. In preparations stored at +4°C, 224 proteins appeared in the supernatant fraction compared to the wash fractions from freshly prepared exosomes; these proteins represent exosome leakage or dissociation of loosely bound “peri-exosomal” proteins. In preparations stored at −80°C, 194 proteins appeared in the supernatant fraction, suggesting that distinct protein groups leak from exosomes at different storage temperatures. Conclusions: Storage destabilizes the surface characteristics, morphological features and protein content of BALF exosomes. For preservation of the exosome protein content and representative functional analysis, airway exosomes should be analysed immediately after isolation. PMID:28819550
Effects of storage temperature on airway exosome integrity for diagnostic and functional analyses.

PubMed

Maroto, Rosario; Zhao, Yingxin; Jamaluddin, Mohammad; Popov, Vsevolod L; Wang, Hongwang; Kalubowilage, Madumali; Zhang, Yueqing; Luisi, Jonathan; Sun, Hong; Culbertson, Christopher T; Bossmann, Stefan H; Motamedi, Massoud; Brasier, Allan R

2017-01-01

Background : Extracellular vesicles contain biological molecules specified by cell-type of origin and modified by microenvironmental changes. To conduct reproducible studies on exosome content and function, storage conditions need to have minimal impact on airway exosome integrity. Aim : We compared surface properties and protein content of airway exosomes that had been freshly isolated vs. those that had been treated with cold storage or freezing. Methods : Mouse bronchoalveolar lavage fluid (BALF) exosomes purified by differential ultracentrifugation were analysed immediately or stored at +4°C or -80°C. Exosomal structure was assessed by dynamic light scattering (DLS), transmission electron microscopy (TEM) and charge density (zeta potential, ζ). Exosomal protein content, including leaking/dissociating proteins, were identified by label-free LC-MS/MS. Results : Freshly isolated BALF exosomes exhibited a mean diameter of 95 nm and characteristic morphology. Storage had significant impact on BALF exosome size and content. Compared to fresh, exosomes stored at +4°C had a 10% increase in diameter, redistribution to polydisperse aggregates and reduced ζ. Storage at -80°C produced an even greater effect, resulting in a 25% increase in diameter, significantly reducing the ζ, resulting in multilamellar structure formation. In fresh exosomes, we identified 1140 high-confidence proteins enriched in 19 genome ontology biological processes. After storage at room temperature, 848 proteins were identified. In preparations stored at +4°C, 224 proteins appeared in the supernatant fraction compared to the wash fractions from freshly prepared exosomes; these proteins represent exosome leakage or dissociation of loosely bound "peri-exosomal" proteins. In preparations stored at -80°C, 194 proteins appeared in the supernatant fraction, suggesting that distinct protein groups leak from exosomes at different storage temperatures. Conclusions : Storage destabilizes the surface characteristics, morphological features and protein content of BALF exosomes. For preservation of the exosome protein content and representative functional analysis, airway exosomes should be analysed immediately after isolation.
Extraction, integration and analysis of alternative splicing and protein structure distributed information

PubMed Central

D'Antonio, Matteo; Masseroli, Marco

2009-01-01

Background Alternative splicing has been demonstrated to affect most of human genes; different isoforms from the same gene encode for proteins which differ for a limited number of residues, thus yielding similar structures. This suggests possible correlations between alternative splicing and protein structure. In order to support the investigation of such relationships, we have developed the Alternative Splicing and Protein Structure Scrutinizer (PASS), a Web application to automatically extract, integrate and analyze human alternative splicing and protein structure data sparsely available in the Alternative Splicing Database, Ensembl databank and Protein Data Bank. Primary data from these databases have been integrated and analyzed using the Protein Identifier Cross-Reference, BLAST, CLUSTALW and FeatureMap3D software tools. Results A database has been developed to store the considered primary data and the results from their analysis; a system of Perl scripts has been implemented to automatically create and update the database and analyze the integrated data; a Web interface has been implemented to make the analyses easily accessible; a database has been created to manage user accesses to the PASS Web application and store user's data and searches. Conclusion PASS automatically integrates data from the Alternative Splicing Database with protein structure data from the Protein Data Bank. Additionally, it comprehensively analyzes the integrated data with publicly available well-known bioinformatics tools in order to generate structural information of isoform pairs. Further analysis of such valuable information might reveal interesting relationships between alternative splicing and protein structure differences, which may be significantly associated with different functions. PMID:19828075
PLI: a web-based tool for the comparison of protein-ligand interactions observed on PDB structures.

PubMed

Gallina, Anna Maria; Bisignano, Paola; Bergamino, Maurizio; Bordo, Domenico

2013-02-01

A large fraction of the entries contained in the Protein Data Bank describe proteins in complex with low molecular weight molecules such as physiological compounds or synthetic drugs. In many cases, the same molecule is found in distinct protein-ligand complexes. There is an increasing interest in Medicinal Chemistry in comparing protein binding sites to get insight on interactions that modulate the binding specificity, as this structural information can be correlated with other experimental data of biochemical or physiological nature and may help in rational drug design. The web service protein-ligand interaction presented here provides a tool to analyse and compare the binding pockets of homologous proteins in complex with a selected ligand. The information is deduced from protein-ligand complexes present in the Protein Data Bank and stored in the underlying database. Freely accessible at http://bioinformatics.istge.it/pli/.
Transcriptomic analysis of Arabidopsis developing stems: a close-up on cell wall genes

PubMed Central

Minic, Zoran; Jamet, Elisabeth; San-Clemente, Hélène; Pelletier, Sandra; Renou, Jean-Pierre; Rihouey, Christophe; Okinyo, Denis PO; Proux, Caroline; Lerouge, Patrice; Jouanin, Lise

2009-01-01

Background Different strategies (genetics, biochemistry, and proteomics) can be used to study proteins involved in cell biogenesis. The availability of the complete sequences of several plant genomes allowed the development of transcriptomic studies. Although the expression patterns of some Arabidopsis thaliana genes involved in cell wall biogenesis were identified at different physiological stages, detailed microarray analysis of plant cell wall genes has not been performed on any plant tissues. Using transcriptomic and bioinformatic tools, we studied the regulation of cell wall genes in Arabidopsis stems, i.e. genes encoding proteins involved in cell wall biogenesis and genes encoding secreted proteins. Results Transcriptomic analyses of stems were performed at three different developmental stages, i.e., young stems, intermediate stage, and mature stems. Many genes involved in the synthesis of cell wall components such as polysaccharides and monolignols were identified. A total of 345 genes encoding predicted secreted proteins with moderate or high level of transcripts were analyzed in details. The encoded proteins were distributed into 8 classes, based on the presence of predicted functional domains. Proteins acting on carbohydrates and proteins of unknown function constituted the two most abundant classes. Other proteins were proteases, oxido-reductases, proteins with interacting domains, proteins involved in signalling, and structural proteins. Particularly high levels of expression were established for genes encoding pectin methylesterases, germin-like proteins, arabinogalactan proteins, fasciclin-like arabinogalactan proteins, and structural proteins. Finally, the results of this transcriptomic analyses were compared with those obtained through a cell wall proteomic analysis from the same material. Only a small proportion of genes identified by previous proteomic analyses were identified by transcriptomics. Conversely, only a few proteins encoded by genes having moderate or high level of transcripts were identified by proteomics. Conclusion Analysis of the genes predicted to encode cell wall proteins revealed that about 345 genes had moderate or high levels of transcripts. Among them, we identified many new genes possibly involved in cell wall biogenesis. The discrepancies observed between results of this transcriptomic study and a previous proteomic study on the same material revealed post-transcriptional mechanisms of regulation of expression of genes encoding cell wall proteins. PMID:19149885
Can enzyme engineering benefit from the modulation of protein motions? Lessons learned from NMR relaxation dispersion experiments.

PubMed

Doucet, Nicolas

2011-04-01

Despite impressive progress in protein engineering and design, our ability to create new and efficient enzyme activities remains a laborious and time-consuming endeavor. In the past few years, intricate combinations of rational mutagenesis, directed evolution and computational methods have paved the way to exciting engineering examples and are now offering a new perspective on the structural requirements of enzyme activity. However, these structure-function analyses are usually guided by the time-averaged static models offered by enzyme crystal structures, which often fail to describe the functionally relevant 'invisible states' adopted by proteins in space and time. To alleviate such limitations, NMR relaxation dispersion experiments coupled to mutagenesis studies have recently been applied to the study of enzyme catalysis, effectively complementing 'structure-function' analyses with 'flexibility-function' investigations. In addition to offering quantitative, site-specific information to help characterize residue motion, these NMR methods are now being applied to enzyme engineering purposes, providing a powerful tool to help characterize the effects of controlling long-range networks of flexible residues affecting enzyme function. Recent advancements in this emerging field are presented here, with particular attention to mutagenesis reports highlighting the relevance of NMR relaxation dispersion tools in enzyme engineering.
Functional Insights from Structural Genomics

DOE Office of Scientific and Technical Information (OSTI.GOV)

Forouhar,F.; Kuzin, A.; Seetharaman, J.

2007-01-01

Structural genomics efforts have produced structural information, either directly or by modeling, for thousands of proteins over the past few years. While many of these proteins have known functions, a large percentage of them have not been characterized at the functional level. The structural information has provided valuable functional insights on some of these proteins, through careful structural analyses, serendipity, and structure-guided functional screening. Some of the success stories based on structures solved at the Northeast Structural Genomics Consortium (NESG) are reported here. These include a novel methyl salicylate esterase with important role in plant innate immunity, a novel RNAmore » methyltransferase (H. influenzae yggJ (HI0303)), a novel spermidine/spermine N-acetyltransferase (B. subtilis PaiA), a novel methyltransferase or AdoMet binding protein (A. fulgidus AF{_}0241), an ATP:cob(I)alamin adenosyltransferase (B. subtilis YvqK), a novel carboxysome pore (E. coli EutN), a proline racemase homolog with a disrupted active site (B. melitensis BME11586), an FMN-dependent enzyme (S. pneumoniae SP{_}1951), and a 12-stranded {beta}-barrel with a novel fold (V. parahaemolyticus VPA1032).« less
Super: a web server to rapidly screen superposable oligopeptide fragments from the protein data bank.

PubMed

Collier, James H; Lesk, Arthur M; Garcia de la Banda, Maria; Konagurthu, Arun S

2012-07-01

Searching for well-fitting 3D oligopeptide fragments within a large collection of protein structures is an important task central to many analyses involving protein structures. This article reports a new web server, Super, dedicated to the task of rapidly screening the protein data bank (PDB) to identify all fragments that superpose with a query under a prespecified threshold of root-mean-square deviation (RMSD). Super relies on efficiently computing a mathematical bound on the commonly used structural similarity measure, RMSD of superposition. This allows the server to filter out a large proportion of fragments that are unrelated to the query; >99% of the total number of fragments in some cases. For a typical query, Super scans the current PDB containing over 80,500 structures (with ∼40 million potential oligopeptide fragments to match) in under a minute. Super web server is freely accessible from: http://lcb.infotech.monash.edu.au/super.
Characterization and crystal structure of lysine insensitive Corynebacterium glutamicum dihydrodipicolinate synthase (cDHDPS) protein

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rice, E.A.; Bannon, G.A.; Glenn, K.C.

2008-11-21

The lysine insensitive Corynebacterium glutamicum dihydrodipicolinate synthase enzyme (cDHDPS) was recently successfully introduced into maize plants to enhance the level of lysine in the grain. To better understand lysine insensitivity of the cDHDPS, we expressed, purified, kinetically characterized the protein, and solved its X-ray crystal structure. The cDHDPS enzyme has a fold and overall structure that is highly similar to other DHDPS proteins. A noteworthy feature of the active site is the evidence that the catalytic lysine residue forms a Schiff base adduct with pyruvate. Analyses of the cDHDPS structure in the vicinity of the putative binding site for S-lysinemore » revealed that the allosteric binding site in the Escherichia coli DHDPS protein does not exist in cDHDPS due to three non-conservative amino acids substitutions, and this is likely why cDHDPS is not feedback inhibited by lysine.« less
Models of S/π interactions in protein structures: Comparison of the H2S–benzene complex with PDB data

PubMed Central

Ringer, Ashley L.; Senenko, Anastasia; Sherrill, C. David

2007-01-01

S/π interactions are prevalent in biochemistry and play an important role in protein folding and stabilization. Geometries of cysteine/aromatic interactions found in crystal structures from the Brookhaven Protein Data Bank (PDB) are analyzed and compared with the equilibrium configurations predicted by high-level quantum mechanical results for the H2S–benzene complex. A correlation is observed between the energetically favorable configurations on the quantum mechanical potential energy surface of the H2S–benzene model and the cysteine/aromatic configurations most frequently found in crystal structures of the PDB. In contrast to some previous PDB analyses, configurations with the sulfur over the aromatic ring are found to be the most important. Our results suggest that accurate quantum computations on models of noncovalent interactions may be helpful in understanding the structures of proteins and other complex systems. PMID:17766371
Structural basis for activity of highly efficient RNA mimics of green fluorescent protein

PubMed Central

Warner, Katherine Deigan; Chen, Michael C.; Song, Wenjiao; Strack, Rita L.; Thorn, Andrea; Jaffrey, Samie R.; Ferré-D’Amaré, Adrian R.

2014-01-01

Green fluorescent protein (GFP) and its derivatives revolutionized the study of proteins. Spinach is a recently reported in vitro evolved RNA mimic of GFP, which as genetically encoded fusions, makes possible live-cell, real-time imaging of biological RNAs, without resorting to large RNA-binding protein-GFP fusions. To elucidate the molecular basis of Spinach fluorescence, we have solved its co-crystal structure bound to its cognate exogenous chromophore, revealing that Spinach activates the small molecule by immobilizing it between a base triple, a G-quadruplex, and an unpaired guanine. Mutational and NMR analyses indicate that the G-quadruplex is essential for Spinach fluorescence, is also present in other fluorogenic RNAs, and may represent a general strategy for RNAs to induce fluorescence of chromophores. The structure has guided the design of a miniaturized 'Baby Spinach', and provides the foundation for structure-driven design and tuning of fluorescent RNAs. PMID:25026079
[Non-invasive analysis of proteins in living cells using NMR spectroscopy].

PubMed

Tochio, Hidehito; Murayama, Shuhei; Inomata, Kohsuke; Morimoto, Daichi; Ohno, Ayako; Shirakawa, Masahiro

2015-01-01

NMR spectroscopy enables structural analyses of proteins and has been widely used in the structural biology field in recent decades. NMR spectroscopy can be applied to proteins inside living cells, allowing characterization of their structures and dynamics in intracellular environments. The simplest "in-cell NMR" approach employs bacterial cells; in this approach, live Escherichia coli cells overexpressing a specific protein are subjected to NMR. The cells are grown in an NMR active isotope-enriched medium to ensure that the overexpressed proteins are labeled with the stable isotopes. Thus the obtained NMR spectra, which are derived from labeled proteins, contain atomic-level information about the structure and dynamics of the proteins. Recent progress enables us to work with higher eukaryotic cells such as HeLa and HEK293 cells, for which a number of techniques have been developed to achieve isotope labeling of the specific target protein. In this review, we describe successful use of electroporation for in-cell NMR. In addition, (19)F-NMR to characterize protein-ligand interactions in cells is presented. Because (19)F nuclei rarely exist in natural cells, when (19)F-labeled proteins are delivered into cells and (19)F-NMR signals are observed, one can safely ascertain that these signals originate from the delivered proteins and not other molecules.
Structural basis for spectrin recognition by ankyrin.

PubMed

Ipsaro, Jonathan J; Mondragón, Alfonso

2010-05-20

Maintenance of membrane integrity and organization in the metazoan cell is accomplished through intracellular tethering of membrane proteins to an extensive, flexible protein network. Spectrin, the principal component of this network, is anchored to membrane proteins through the adaptor protein ankyrin. To elucidate the atomic basis for this interaction, we determined a crystal structure of human betaI-spectrin repeats 13 to 15 in complex with the ZU5-ANK domain of human ankyrin R. The structure reveals the role of repeats 14 to 15 in binding, the electrostatic and hydrophobic contributions along the interface, and the necessity for a particular orientation of the spectrin repeats. Using structural and biochemical data as a guide, we characterized the individual proteins and their interactions by binding and thermal stability analyses. In addition to validating the structural model, these data provide insight into the nature of some mutations associated with cell morphology defects, including those found in human diseases such as hereditary spherocytosis and elliptocytosis. Finally, analysis of the ZU5 domain suggests it is a versatile protein-protein interaction module with distinct interaction surfaces. The structure represents not only the first of a spectrin fragment in complex with its binding partner, but also that of an intermolecular complex involving a ZU5 domain.
Low-resolution structure of Drosophila translin

PubMed Central

Kumar, Vinay; Gupta, Gagan D.

2012-01-01

Crystals of native Drosophila melanogaster translin diffracted to 7 Å resolution. Reductive methylation of the protein improved crystal quality. The native and methylated proteins showed similar profiles in size-exclusion chromatography analyses but the methylated protein displayed reduced DNA-binding activity. Crystals of the methylated protein diffracted to 4.2 Å resolution at BM14 of the ESRF synchrotron. Crystals with 49% solvent content belonged to monoclinic space group P21 with eight protomers in the asymmetric unit. Only 2% of low-resolution structures with similar low percentage solvent content were found in the PDB. The crystal structure, solved by molecular replacement method, refined to Rwork (Rfree) of 0.24 (0.29) with excellent stereochemistry. The crystal structure clearly shows that drosophila protein exists as an octamer, and not as a decamer as expected from gel-filtration elution profiles. The similar octameric quaternary fold in translin orthologs and in translin–TRAX complexes suggests an up-down dimer as the basic structural subunit of translin-like proteins. The drosophila oligomer displays asymmetric assembly and increased radius of gyration that accounts for the observed differences between the elution profiles of human and drosophila proteins on gel-filtration columns. This study demonstrates clearly that low-resolution X-ray structure can be useful in understanding complex biological oligomers. PMID:23650579
Programming molecular self-assembly of intrinsically disordered proteins containing sequences of low complexity

NASA Astrophysics Data System (ADS)

Simon, Joseph R.; Carroll, Nick J.; Rubinstein, Michael; Chilkoti, Ashutosh; López, Gabriel P.

2017-06-01

Dynamic protein-rich intracellular structures that contain phase-separated intrinsically disordered proteins (IDPs) composed of sequences of low complexity (SLC) have been shown to serve a variety of important cellular functions, which include signalling, compartmentalization and stabilization. However, our understanding of these structures and our ability to synthesize models of them have been limited. We present design rules for IDPs possessing SLCs that phase separate into diverse assemblies within droplet microenvironments. Using theoretical analyses, we interpret the phase behaviour of archetypal IDP sequences and demonstrate the rational design of a vast library of multicomponent protein-rich structures that ranges from uniform nano-, meso- and microscale puncta (distinct protein droplets) to multilayered orthogonally phase-separated granular structures. The ability to predict and program IDP-rich assemblies in this fashion offers new insights into (1) genetic-to-molecular-to-macroscale relationships that encode hierarchical IDP assemblies, (2) design rules of such assemblies in cell biology and (3) molecular-level engineering of self-assembled recombinant IDP-rich materials.
Evolution of strigolactone receptors by gradual neo-functionalization of KAI2 paralogues.

PubMed

Bythell-Douglas, Rohan; Rothfels, Carl J; Stevenson, Dennis W D; Graham, Sean W; Wong, Gane Ka-Shu; Nelson, David C; Bennett, Tom

2017-06-29

Strigolactones (SLs) are a class of plant hormones that control many aspects of plant growth. The SL signalling mechanism is homologous to that of karrikins (KARs), smoke-derived compounds that stimulate seed germination. In angiosperms, the SL receptor is an α/β-hydrolase known as DWARF14 (D14); its close homologue, KARRIKIN INSENSITIVE2 (KAI2), functions as a KAR receptor and likely recognizes an uncharacterized, endogenous signal ('KL'). Previous phylogenetic analyses have suggested that the KAI2 lineage is ancestral in land plants, and that canonical D14-type SL receptors only arose in seed plants; this is paradoxical, however, as non-vascular plants synthesize and respond to SLs. We have used a combination of phylogenetic and structural approaches to re-assess the evolution of the D14/KAI2 family in land plants. We analysed 339 members of the D14/KAI2 family from land plants and charophyte algae. Our phylogenetic analyses show that the divergence between the eu-KAI2 lineage and the DDK (D14/DLK2/KAI2) lineage that includes D14 occurred very early in land plant evolution. We show that eu-KAI2 proteins are highly conserved, and have unique features not found in DDK proteins. Conversely, we show that DDK proteins show considerable sequence and structural variation to each other, and lack clearly definable characteristics. We use homology modelling to show that the earliest members of the DDK lineage structurally resemble KAI2 and that SL receptors in non-seed plants likely do not have D14-like structure. We also show that certain groups of DDK proteins lack the otherwise conserved MORE AXILLARY GROWTH2 (MAX2) interface, and may thus function independently of MAX2, which we show is highly conserved throughout land plant evolution. Our results suggest that D14-like structure is not required for SL perception, and that SL perception has relatively relaxed structural requirements compared to KAI2-mediated signalling. We suggest that SL perception gradually evolved by neo-functionalization within the DDK lineage, and that the transition from KAI2-like to D14-like protein may have been driven by interactions with protein partners, rather than being required for SL perception per se.
In silico analysis of fragile histidine triad involved in regression of carcinoma.

PubMed

Rasheed, Muhammad Asif; Tariq, Fatima; Afzal, Sara; Mannanv, Shazia

2017-04-01

Hepatocellular carcinoma (HCCa) is a primary malignancy of the liver. Many different proteins are involved in HCCa including insulin growth factor (IGF) II , signal transducers and activators of transcription (STAT) 3, STAT4, mothers against decapentaplegic homolog 4 (SMAD 4), fragile histidine triad (FHIT) and selective internal radiation therapy (SIRT) etc. The present study is based on the bioinformatics analysis of FHIT protein in order to understand the proteomics aspect and improvement of the diagnosis of the disease based on the protein. Different information related to protein were gathered from different databases, including National Centre for Biotechnology Information (NCBI) Gene, Protein and Online Mendelian Inheritance in Man (OMIM) databases, Uniprot database, String database and Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Moreover, the structure of the protein and evaluation of the quality of the structure were included from Easy modeler programme. Hence, this analysis not only helped to gather information related to the protein at one place, but also analysed the structure and quality of the protein to conclude that the protein has a role in carcinoma.

Functional dynamics of cell surface membrane proteins

NASA Astrophysics Data System (ADS)

Nishida, Noritaka; Osawa, Masanori; Takeuchi, Koh; Imai, Shunsuke; Stampoulis, Pavlos; Kofuku, Yutaka; Ueda, Takumi; Shimada, Ichio

2014-04-01

Cell surface receptors are integral membrane proteins that receive external stimuli, and transmit signals across plasma membranes. In the conventional view of receptor activation, ligand binding to the extracellular side of the receptor induces conformational changes, which convert the structure of the receptor into an active conformation. However, recent NMR studies of cell surface membrane proteins have revealed that their structures are more dynamic than previously envisioned, and they fluctuate between multiple conformations in an equilibrium on various timescales. In addition, NMR analyses, along with biochemical and cell biological experiments indicated that such dynamical properties are critical for the proper functions of the receptors. In this review, we will describe several NMR studies that revealed direct linkage between the structural dynamics and the functions of the cell surface membrane proteins, such as G-protein coupled receptors (GPCRs), ion channels, membrane transporters, and cell adhesion molecules.
Functional dynamics of cell surface membrane proteins.

PubMed

Nishida, Noritaka; Osawa, Masanori; Takeuchi, Koh; Imai, Shunsuke; Stampoulis, Pavlos; Kofuku, Yutaka; Ueda, Takumi; Shimada, Ichio

2014-04-01

Cell surface receptors are integral membrane proteins that receive external stimuli, and transmit signals across plasma membranes. In the conventional view of receptor activation, ligand binding to the extracellular side of the receptor induces conformational changes, which convert the structure of the receptor into an active conformation. However, recent NMR studies of cell surface membrane proteins have revealed that their structures are more dynamic than previously envisioned, and they fluctuate between multiple conformations in an equilibrium on various timescales. In addition, NMR analyses, along with biochemical and cell biological experiments indicated that such dynamical properties are critical for the proper functions of the receptors. In this review, we will describe several NMR studies that revealed direct linkage between the structural dynamics and the functions of the cell surface membrane proteins, such as G-protein coupled receptors (GPCRs), ion channels, membrane transporters, and cell adhesion molecules. Copyright © 2013 Elsevier Inc. All rights reserved.
Structural dissection of a complex Bacteroides ovatus gene locus conferring xyloglucan metabolism in the human gut.

PubMed

Hemsworth, Glyn R; Thompson, Andrew J; Stepper, Judith; Sobala, Łukasz F; Coyle, Travis; Larsbrink, Johan; Spadiut, Oliver; Goddard-Borger, Ethan D; Stubbs, Keith A; Brumer, Harry; Davies, Gideon J

2016-07-01

The human gastrointestinal tract harbours myriad bacterial species, collectively termed the microbiota, that strongly influence human health. Symbiotic members of our microbiota play a pivotal role in the digestion of complex carbohydrates that are otherwise recalcitrant to assimilation. Indeed, the intrinsic human polysaccharide-degrading enzyme repertoire is limited to various starch-based substrates; more complex polysaccharides demand microbial degradation. Select Bacteroidetes are responsible for the degradation of the ubiquitous vegetable xyloglucans (XyGs), through the concerted action of cohorts of enzymes and glycan-binding proteins encoded by specific xyloglucan utilization loci (XyGULs). Extending recent (meta)genomic, transcriptomic and biochemical analyses, significant questions remain regarding the structural biology of the molecular machinery required for XyG saccharification. Here, we reveal the three-dimensional structures of an α-xylosidase, a β-glucosidase, and two α-l-arabinofuranosidases from the Bacteroides ovatus XyGUL. Aided by bespoke ligand synthesis, our analyses highlight key adaptations in these enzymes that confer individual specificity for xyloglucan side chains and dictate concerted, stepwise disassembly of xyloglucan oligosaccharides. In harness with our recent structural characterization of the vanguard endo-xyloglucanse and cell-surface glycan-binding proteins, the present analysis provides a near-complete structural view of xyloglucan recognition and catalysis by XyGUL proteins. © 2016 The Authors.
The rearrangement of motif F in the flavivirus RNA-directed RNA polymerase.

PubMed

Potapova, Ulyana; Feranchuk, Sergey; Leonova, Galina; Belikov, Sergei

2018-03-01

In the flavivirus genus, the non-structural protein NS5 plays a central role in RNA viral replication and constitutes a major target for drug discovery. One of the prime challenges in the study of NS5 protein is to investigate the interplay between the two protein domains, namely, the RNA-dependent RNA polymerase (RdRp) domain and the methyltransferase (MTase) domain. These investigations could clarify the multiple roles of NS5 protein in the virus life cycle. Here we present the results of sequence analyses and structural bioinformatics studies of NS5 protein, which suggest that the conserved motif F in the NS5 protein could act as a lock which controls the rearrangement of the domains and as a switch in the protein enzymatic activity. Copyright © 2017 Elsevier B.V. All rights reserved.
One-step nanoimprinted hybrid micro-/nano-structure for in situ protein detection of isolated cell array via localized surface plasmon resonance

NASA Astrophysics Data System (ADS)

Ali, Riyaz Ahmad Mohamed; Villariza Espulgar, Wilfred; Aoki, Wataru; Jiang, Shu; Saito, Masato; Ueda, Mitsuyoshi; Tamiya, Eiichi

2018-03-01

Nanoplasmonic biosensors show high potentials as label-free devices for continuous monitoring in biomolecular analyses. However, most current sensors comprise multiple-dedicated layers with complicated fabrication procedures, which increases production time and manufacturing costs. In this work, we report the synergistic integration of cell-trapping microwell structures with plasmonic sensing nanopillar structures in a single-layered substrate by one-step thermal nanoimprinting. Here, microwell arrays are used for isolating cells, wherein gold-capped nanostructures sense changes in local refractive index via localized surface plasmon resonance (LSPR). Hence, proteins secreted from trapped cells can be label-freely detected as peak shifts in absorbance spectra. The fabricated device showed a detection limit of 10 ng/µL anti-IgA. In Pichia pastoris cells trial analysis, a red shift of 6.9 nm was observed over 12 h, which is likely due to the protein secretion from the cells. This approach provides an inexpensive, rapid, and reproducible alternative for mass production of biosensors for continuous biomolecular analyses.
Functional and genomic analyses of alpha-solenoid proteins.

PubMed

Fournier, David; Palidwor, Gareth A; Shcherbinin, Sergey; Szengel, Angelika; Schaefer, Martin H; Perez-Iratxeta, Carol; Andrade-Navarro, Miguel A

2013-01-01

Alpha-solenoids are flexible protein structural domains formed by ensembles of alpha-helical repeats (Armadillo and HEAT repeats among others). While homology can be used to detect many of these repeats, some alpha-solenoids have very little sequence homology to proteins of known structure and we expect that many remain undetected. We previously developed a method for detection of alpha-helical repeats based on a neural network trained on a dataset of protein structures. Here we improved the detection algorithm and updated the training dataset using recently solved structures of alpha-solenoids. Unexpectedly, we identified occurrences of alpha-solenoids in solved protein structures that escaped attention, for example within the core of the catalytic subunit of PI3KC. Our results expand the current set of known alpha-solenoids. Application of our tool to the protein universe allowed us to detect their significant enrichment in proteins interacting with many proteins, confirming that alpha-solenoids are generally involved in protein-protein interactions. We then studied the taxonomic distribution of alpha-solenoids to discuss an evolutionary scenario for the emergence of this type of domain, speculating that alpha-solenoids have emerged in multiple taxa in independent events by convergent evolution. We observe a higher rate of alpha-solenoids in eukaryotic genomes and in some prokaryotic families, such as Cyanobacteria and Planctomycetes, which could be associated to increased cellular complexity. The method is available at http://cbdm.mdc-berlin.de/~ard2/.
Coevolution analysis of Hepatitis C virus genome to identify the structural and functional dependency network of viral proteins

NASA Astrophysics Data System (ADS)

Champeimont, Raphaël; Laine, Elodie; Hu, Shuang-Wei; Penin, Francois; Carbone, Alessandra

2016-05-01

A novel computational approach of coevolution analysis allowed us to reconstruct the protein-protein interaction network of the Hepatitis C Virus (HCV) at the residue resolution. For the first time, coevolution analysis of an entire viral genome was realized, based on a limited set of protein sequences with high sequence identity within genotypes. The identified coevolving residues constitute highly relevant predictions of protein-protein interactions for further experimental identification of HCV protein complexes. The method can be used to analyse other viral genomes and to predict the associated protein interaction networks.
High Temperature Unfolding and Low Temperature Refolding Pathway of Chymotrypsin Inhibitor 2 Using Molecular Dynamics Simulation

NASA Astrophysics Data System (ADS)

Malau, N. D.; Sumaryada, T.

2016-01-01

The mechanism that explains the unfolding/refolding process of the protein is still a major problem that has not been fully understood. In this paper we present our study on the unfolding and refolding pathway of Chymotrypsin Inhibitor 2 (CI2) protein through a molecular dynamics simulation technique. The high temperature unfolding simulation were performed at 500 K for 35 ns. While the low temperature refolding simulation performed at 200 K for 35 ns. The unfolding and refolding pathway of protein were analysed by looking at the dynamics of root mean squared deviation (RMSD) and secondary structure profiles. The signatures of unfolding were observed from significant increase of RMSD within the time span of 10 ns to 35 ns. For the refolding process, the initial structure was prepared from the structure of unfolding protein at t=15 ns and T=500 K. Analysis have shown that some of the secondary structures of CI2 protein that have been damaged at high temperature can be refolded back to its initial structure at low temperature simulation. Our results suggest that most of α-helix structure of CI2 protein can be refolded back to its initial state, while only half beta-sheet structure can be reformed.
Cryo-EM structures of two bovine adenovirus type 3 intermediates

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cheng, Lingpeng; Huang, Xiaoxing; Li, Xiaomin

2014-02-15

Adenoviruses (Ads) infect hosts from all vertebrate species and have been investigated as vaccine vectors. We report here near-atomic structures of two bovine Ad type 3 (BAd3) intermediates obtained by cryo-electron microscopy. A comparison between the two intermediate structures reveals that the differences are localized in the fivefold vertex region, while their facet structures are identical. The overall facet structure of BAd3 exhibits a similar structure to human Ads; however, BAd3 protein IX has a unique conformation. Mass spectrometry and cryo-electron tomography analyses indicate that one intermediate structure represents the stage during DNA encapsidation, whilst the other intermediate structure representsmore » a later stage. These results also suggest that cleavage of precursor protein VI occurs during, rather than after, the DNA encapsidation process. Overall, our results provide insights into the mechanism of Ad assembly, and allow the first structural comparison between human and nonhuman Ads at backbone level. - Highlights: • First structure of bovine adenovirus type 3. • Some channels are located at the vertex of intermediate during DNA encapsidation. • Protein IX exhibits a unique conformation of trimeric coiled–coiled structure. • Cleavage of precursor protein VI occurs during the DNA encapsidation process.« less
An affinity-structure database of helix-turn-helix: DNA complexes with a universal coordinate system

DOE Office of Scientific and Technical Information (OSTI.GOV)

AlQuraishi, Mohammed; Tang, Shengdong; Xia, Xide

Molecular interactions between proteins and DNA molecules underlie many cellular processes, including transcriptional regulation, chromosome replication, and nucleosome positioning. Computational analyses of protein-DNA interactions rely on experimental data characterizing known protein-DNA interactions structurally and biochemically. While many databases exist that contain either structural or biochemical data, few integrate these two data sources in a unified fashion. Such integration is becoming increasingly critical with the rapid growth of structural and biochemical data, and the emergence of algorithms that rely on the synthesis of multiple data types to derive computational models of molecular interactions. We have developed an integrated affinity-structure database inmore » which the experimental and quantitative DNA binding affinities of helix-turn-helix proteins are mapped onto the crystal structures of the corresponding protein-DNA complexes. This database provides access to: (i) protein-DNA structures, (ii) quantitative summaries of protein-DNA binding affinities using position weight matrices, and (iii) raw experimental data of protein-DNA binding instances. Critically, this database establishes a correspondence between experimental structural data and quantitative binding affinity data at the single basepair level. Furthermore, we present a novel alignment algorithm that structurally aligns the protein-DNA complexes in the database and creates a unified residue-level coordinate system for comparing the physico-chemical environments at the interface between complexes. Using this unified coordinate system, we compute the statistics of atomic interactions at the protein-DNA interface of helix-turn-helix proteins. We provide an interactive website for visualization, querying, and analyzing this database, and a downloadable version to facilitate programmatic analysis. Lastly, this database will facilitate the analysis of protein-DNA interactions and the development of programmatic computational methods that capitalize on integration of structural and biochemical datasets. The database can be accessed at http://ProteinDNA.hms.harvard.edu.« less
An affinity-structure database of helix-turn-helix: DNA complexes with a universal coordinate system

DOE PAGES

AlQuraishi, Mohammed; Tang, Shengdong; Xia, Xide

2015-11-19

Molecular interactions between proteins and DNA molecules underlie many cellular processes, including transcriptional regulation, chromosome replication, and nucleosome positioning. Computational analyses of protein-DNA interactions rely on experimental data characterizing known protein-DNA interactions structurally and biochemically. While many databases exist that contain either structural or biochemical data, few integrate these two data sources in a unified fashion. Such integration is becoming increasingly critical with the rapid growth of structural and biochemical data, and the emergence of algorithms that rely on the synthesis of multiple data types to derive computational models of molecular interactions. We have developed an integrated affinity-structure database inmore » which the experimental and quantitative DNA binding affinities of helix-turn-helix proteins are mapped onto the crystal structures of the corresponding protein-DNA complexes. This database provides access to: (i) protein-DNA structures, (ii) quantitative summaries of protein-DNA binding affinities using position weight matrices, and (iii) raw experimental data of protein-DNA binding instances. Critically, this database establishes a correspondence between experimental structural data and quantitative binding affinity data at the single basepair level. Furthermore, we present a novel alignment algorithm that structurally aligns the protein-DNA complexes in the database and creates a unified residue-level coordinate system for comparing the physico-chemical environments at the interface between complexes. Using this unified coordinate system, we compute the statistics of atomic interactions at the protein-DNA interface of helix-turn-helix proteins. We provide an interactive website for visualization, querying, and analyzing this database, and a downloadable version to facilitate programmatic analysis. Lastly, this database will facilitate the analysis of protein-DNA interactions and the development of programmatic computational methods that capitalize on integration of structural and biochemical datasets. The database can be accessed at http://ProteinDNA.hms.harvard.edu.« less
Nucleoplasmin-like domain of FKBP39 from Drosophila melanogaster forms a tetramer with partly disordered tentacle-like C-terminal segments

PubMed Central

Kozłowska, Małgorzata; Tarczewska, Aneta; Jakób, Michał; Bystranowska, Dominika; Taube, Michał; Kozak, Maciej; Czarnocki-Cieciura, Mariusz; Dziembowski, Andrzej; Orłowski, Marek; Tkocz, Katarzyna; Ożyhar, Andrzej

2017-01-01

Nucleoplasmins are a nuclear chaperone family defined by the presence of a highly conserved N-terminal core domain. X-ray crystallographic studies of isolated nucleoplasmin core domains revealed a β-propeller structure consisting of a set of five monomers that together form a stable pentamer. Recent studies on isolated N-terminal domains from Drosophila 39-kDa FK506-binding protein (FKBP39) and from other chromatin-associated proteins showed analogous, nucleoplasmin-like (NPL) pentameric structures. Here, we report that the NPL domain of the full-length FKBP39 does not form pentameric complexes. Multi-angle light scattering (MALS) and sedimentation equilibrium ultracentrifugation (SE AUC) analyses of the molecular mass of the full-length protein indicated that FKBP39 forms homotetrameric complexes. Molecular models reconstructed from small-angle X-ray scattering (SAXS) revealed that the NPL domain forms a stable, tetrameric core and that FK506-binding domains are linked to it by intrinsically disordered, flexible chains that form tentacle-like segments. Analyses of full-length FKBP39 and its isolated NPL domain suggested that the distal regions of the polypeptide chain influence and determine the quaternary conformation of the nucleoplasmin-like protein. These results provide new insights regarding the conserved structure of nucleoplasmin core domains and provide a potential explanation for the importance of the tetrameric structural organization of full-length nucleoplasmins. PMID:28074868
Computational prediction of hinge axes in proteins

PubMed Central

2014-01-01

Background A protein's function is determined by the wide range of motions exhibited by its 3D structure. However, current experimental techniques are not able to reliably provide the level of detail required for elucidating the exact mechanisms of protein motion essential for effective drug screening and design. Computational tools are instrumental in the study of the underlying structure-function relationship. We focus on a special type of proteins called "hinge proteins" which exhibit a motion that can be interpreted as a rotation of one domain relative to another. Results This work proposes a computational approach that uses the geometric structure of a single conformation to predict the feasible motions of the protein and is founded in recent work from rigidity theory, an area of mathematics that studies flexibility properties of general structures. Given a single conformational state, our analysis predicts a relative axis of motion between two specified domains. We analyze a dataset of 19 structures known to exhibit this hinge-like behavior. For 15, the predicted axis is consistent with a motion to a second, known conformation. We present a detailed case study for three proteins whose dynamics have been well-studied in the literature: calmodulin, the LAO binding protein and the Bence-Jones protein. Conclusions Our results show that incorporating rigidity-theoretic analyses can lead to effective computational methods for understanding hinge motions in macromolecules. This initial investigation is the first step towards a new tool for probing the structure-dynamics relationship in proteins. PMID:25080829
Computational design of an endo-1,4-[beta]-xylanase ligand binding site

DOE Office of Scientific and Technical Information (OSTI.GOV)

Morin, Andrew; Kaufmann, Kristian W.; Fortenberry, Carie

2012-09-05

The field of computational protein design has experienced important recent success. However, the de novo computational design of high-affinity protein-ligand interfaces is still largely an open challenge. Using the Rosetta program, we attempted the in silico design of a high-affinity protein interface to a small peptide ligand. We chose the thermophilic endo-1,4-{beta}-xylanase from Nonomuraea flexuosa as the protein scaffold on which to perform our designs. Over the course of the study, 12 proteins derived from this scaffold were produced and assayed for binding to the target ligand. Unfortunately, none of the designed proteins displayed evidence of high-affinity binding. Structural characterizationmore » of four designed proteins revealed that although the predicted structure of the protein model was highly accurate, this structural accuracy did not translate into accurate prediction of binding affinity. Crystallographic analyses indicate that the lack of binding affinity is possibly due to unaccounted for protein dynamics in the 'thumb' region of our design scaffold intrinsic to the family 11 {beta}-xylanase fold. Further computational analysis revealed two specific, single amino acid substitutions responsible for an observed change in backbone conformation, and decreased dynamic stability of the catalytic cleft. These findings offer new insight into the dynamic and structural determinants of the {beta}-xylanase proteins.« less
UET: a database of evolutionarily-predicted functional determinants of protein sequences that cluster as functional sites in protein structures.

PubMed

Lua, Rhonald C; Wilson, Stephen J; Konecki, Daniel M; Wilkins, Angela D; Venner, Eric; Morgan, Daniel H; Lichtarge, Olivier

2016-01-04

The structure and function of proteins underlie most aspects of biology and their mutational perturbations often cause disease. To identify the molecular determinants of function as well as targets for drugs, it is central to characterize the important residues and how they cluster to form functional sites. The Evolutionary Trace (ET) achieves this by ranking the functional and structural importance of the protein sequence positions. ET uses evolutionary distances to estimate functional distances and correlates genotype variations with those in the fitness phenotype. Thus, ET ranks are worse for sequence positions that vary among evolutionarily closer homologs but better for positions that vary mostly among distant homologs. This approach identifies functional determinants, predicts function, guides the mutational redesign of functional and allosteric specificity, and interprets the action of coding sequence variations in proteins, people and populations. Now, the UET database offers pre-computed ET analyses for the protein structure databank, and on-the-fly analysis of any protein sequence. A web interface retrieves ET rankings of sequence positions and maps results to a structure to identify functionally important regions. This UET database integrates several ways of viewing the results on the protein sequence or structure and can be found at http://mammoth.bcm.tmc.edu/uet/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Transmembrane helix prediction: a comparative evaluation and analysis.

PubMed

Cuthbertson, Jonathan M; Doyle, Declan A; Sansom, Mark S P

2005-06-01

The prediction of transmembrane (TM) helices plays an important role in the study of membrane proteins, given the relatively small number (approximately 0.5% of the PDB) of high-resolution structures for such proteins. We used two datasets (one redundant and one non-redundant) of high-resolution structures of membrane proteins to evaluate and analyse TM helix prediction. The redundant (non-redundant) dataset contains structure of 434 (268) TM helices, from 112 (73) polypeptide chains. Of the 434 helices in the dataset, 20 may be classified as 'half-TM' as they are too short to span a lipid bilayer. We compared 13 TM helix prediction methods, evaluating each method using per segment, per residue and termini scores. Four methods consistently performed well: SPLIT4, TMHMM2, HMMTOP2 and TMAP. However, even the best methods were in error by, on average, about two turns of helix at the TM helix termini. The best and worst case predictions for individual proteins were analysed. In particular, the performance of the various methods and of a consensus prediction method, were compared for a number of proteins (e.g. SecY, ClC, KvAP) containing half-TM helices. The difficulties of predicting half-TM helices suggests that current prediction methods successfully embody the two-state model of membrane protein folding, but do not accommodate a third stage in which, e.g., short helices and re-entrant loops fold within a bundle of stable TM helices.
Molecular mechanics and dynamics characterization of an in silico mutated protein: a stand-alone lab module or support activity for in vivo and in vitro analyses of targeted proteins.

PubMed

Chiang, Harry; Robinson, Lucy C; Brame, Cynthia J; Messina, Troy C

2013-01-01

Over the past 20 years, the biological sciences have increasingly incorporated chemistry, physics, computer science, and mathematics to aid in the development and use of mathematical models. Such combined approaches have been used to address problems from protein structure-function relationships to the workings of complex biological systems. Computer simulations of molecular events can now be accomplished quickly and with standard computer technology. Also, simulation software is freely available for most computing platforms, and online support for the novice user is ample. We have therefore created a molecular dynamics laboratory module to enhance undergraduate student understanding of molecular events underlying organismal phenotype. This module builds on a previously described project in which students use site-directed mutagenesis to investigate functions of conserved sequence features in members of a eukaryotic protein kinase family. In this report, we detail the laboratory activities of a MD module that provide a complement to phenotypic outcomes by providing a hypothesis-driven and quantifiable measure of predicted structural changes caused by targeted mutations. We also present examples of analyses students may perform. These laboratory activities can be integrated with genetics or biochemistry experiments as described, but could also be used independently in any course that would benefit from a quantitative approach to protein structure-function relationships. Copyright © 2013 Wiley Periodicals, Inc.
Structural and biochemical analyses of YvgN and YtbE from Bacillus subtilis

PubMed Central

Lei, Jian; Zhou, Yan-Feng; Li, Lan-Fen; Su, Xiao-Dong

2009-01-01

Bacillus subtilis is one of the most studied gram-positive bacteria. In this work, YvgN and YtbE from B. subtilis, assigned as AKR5G1 and AKR5G2 of aldo-keto reductase (AKR) superfamily. AKR catalyzes the NADPH-dependent reduction of aldehyde or aldose substrates to alcohols. YvgN and YtbE were studied by crystallographic and enzymatic analyses. The apo structures of these proteins were determined by molecular replacement, and the structure of holoenzyme YvgN with NADPH was also solved, revealing the conformational changes upon cofactor binding. Our biochemical data suggest both YvgN and YtbE have preferential specificity for derivatives of benzaldehyde, such as nitryl or halogen group substitution at the 2 or 4 positions. These proteins also showed broad catalytic activity on many standard substrates of AKR, such as glyoxal, dihydroxyacetone, and DL-glyceraldehyde, suggesting a possible role in bacterial detoxification. PMID:19585557
Predicting highly-connected hubs in protein interaction networks by QSAR and biological data descriptors

PubMed Central

Hsing, Michael; Byler, Kendall; Cherkasov, Artem

2009-01-01

Hub proteins (those engaged in most physical interactions in a protein interaction network (PIN) have recently gained much research interest due to their essential role in mediating cellular processes and their potential therapeutic value. It is straightforward to identify hubs if the underlying PIN is experimentally determined; however, theoretical hub prediction remains a very challenging task, as physicochemical properties that differentiate hubs from less connected proteins remain mostly uncharacterized. To adequately distinguish hubs from non-hub proteins we have utilized over 1300 protein descriptors, some of which represent QSAR (quantitative structure-activity relationship) parameters, and some reflect sequence-derived characteristics of proteins including domain composition and functional annotations. Those protein descriptors, together with available protein interaction data have been processed by a machine learning method (boosting trees) and resulted in the development of hub classifiers that are capable of predicting highly interacting proteins for four model organisms: Escherichia coli, Saccharomyces cerevisiae, Drosophila melanogaster and Homo sapiens. More importantly, through the analyses of the most relevant protein descriptors, we are able to demonstrate that hub proteins not only share certain common physicochemical and structural characteristics that make them different from non-hub counterparts, but they also exhibit species-specific characteristics that should be taken into account when analyzing different PINs. The developed prediction models can be used for determining highly interacting proteins in the four studied species to assist future proteomics experiments and PIN analyses. Availability The source code and executable program of the hub classifier are available for download at: http://www.cnbi2.ca/hub-analysis/ PMID:20198194
An extracellular disulfide bond forming protein (DsbF) from Mycobacterium tuberculosis: Structural, biochemical and gene expression analysis

PubMed Central

Chim, Nicholas; Riley, Robert; The, Juliana; Im, Soyeon; Segelke, Brent; Lekin, Tim; Yu, Minmin; Hung, Li Wei; Terwilliger, Tom; Whitelegge, Julian P.; Goulding, Celia W.

2010-01-01

Disulfide bond forming (Dsb) proteins ensure correct folding and disulfide bond formation of secreted proteins. Previously, we showed that Mycobacterium tuberculosis DsbE (Mtb DsbE, Rv2878c) aids in vitro oxidative folding of proteins. Here we present structural, biochemical and gene expression analyses of another putative Mtb secreted disulfide bond isomerase protein homologous to Mtb DsbE, Mtb DsbF (Rv1677). The X-ray crystal structure of Mtb DsbF reveals a conserved thioredoxin fold although the active-site cysteines may be modeled in both oxidized and reduced forms, in contrast to the solely reduced form in Mtb DsbE. Furthermore, the shorter loop region in Mtb DsbF results in a more solvent-exposed active site. Biochemical analyses show that, similar to Mtb DsbE, Mtb DsbF can oxidatively refold reduced, unfolded hirudin and has a comparable pKa for the active-site solvent-exposed cysteine. However, contrary to Mtb DsbE, the Mtb DsbF redox potential is more oxidizing and its reduced state is more stable. From computational genomics analysis of the M. tuberculosis genome, we identified a potential Mtb DsbF interaction partner, Rv1676, a predicted peroxiredoxin. Complex formation is supported by protein co-expression studies and inferred by gene expression profiles, whereby Mtb DsbF and Rv1676 are upregulated under similar environments. Additionally, comparison of Mtb DsbF and Mtb DsbE gene expression data indicate anticorrelated gene expression patterns, suggesting that these two proteins and their functionally linked partners constitute analogous pathways that may function under different conditions. PMID:20060836

Domain wise docking analyses of the modular chitin binding protein CBP50 from Bacillus thuringiensis serovar konkukian S4.

PubMed

Sehar, Ujala; Mehmood, Muhammad Aamer; Hussain, Khadim; Nawaz, Salman; Nadeem, Shahid; Siddique, Muhammad Hussnain; Nadeem, Habibullah; Gull, Munazza; Ahmad, Niaz; Sohail, Iqra; Gill, Saba Shahid; Majeed, Summera

2013-01-01

This paper presents an in silico characterization of the chitin binding protein CBP50 from B. thuringiensis serovar konkukian S4 through homology modeling and molecular docking. The CBP50 has shown a modular structure containing an N-terminal CBM33 domain, two consecutive fibronectin-III (Fn-III) like domains and a C-terminal CBM5 domain. The protein presented a unique modular structure which could not be modeled using ordinary procedures. So, domain wise modeling using MODELLER and docking analyses using Autodock Vina were performed. The best conformation for each domain was selected using standard procedure. It was revealed that four amino acid residues Glu-71, Ser-74, Glu-76 and Gln-90 from N-terminal domain are involved in protein-substrate interaction. Similarly, amino acid residues Trp-20, Asn-21, Ser-23 and Val-30 of Fn-III like domains and Glu-15, Ala-17, Ser-18 and Leu-35 of C-terminal domain were involved in substrate binding. Site-directed mutagenesis of these proposed amino acid residues in future will elucidate the key amino acids involved in chitin binding activity of CBP50 protein.
Engineering A-kinase Anchoring Protein (AKAP)-selective Regulatory Subunits of Protein Kinase A (PKA) through Structure-based Phage Selection*

PubMed Central

Gold, Matthew G.; Fowler, Douglas M.; Means, Christopher K.; Pawson, Catherine T.; Stephany, Jason J.; Langeberg, Lorene K.; Fields, Stanley; Scott, John D.

2013-01-01

PKA is retained within distinct subcellular environments by the association of its regulatory type II (RII) subunits with A-kinase anchoring proteins (AKAPs). Conventional reagents that universally disrupt PKA anchoring are patterned after a conserved AKAP motif. We introduce a phage selection procedure that exploits high-resolution structural information to engineer RII mutants that are selective for a particular AKAP. Selective RII (RSelect) sequences were obtained for eight AKAPs following competitive selection screening. Biochemical and cell-based experiments validated the efficacy of RSelect proteins for AKAP2 and AKAP18. These engineered proteins represent a new class of reagents that can be used to dissect the contributions of different AKAP-targeted pools of PKA. Molecular modeling and high-throughput sequencing analyses revealed the molecular basis of AKAP-selective interactions and shed new light on native RII-AKAP interactions. We propose that this structure-directed evolution strategy might be generally applicable for the investigation of other protein interaction surfaces. PMID:23625929
Comparative analyses of quaternary arrangements in homo-oligomeric proteins in superfamilies: Functional implications.

PubMed

Sudha, Govindarajan; Srinivasan, Narayanaswamy

2016-09-01

A comprehensive analysis of the quaternary features of distantly related homo-oligomeric proteins is the focus of the current study. This study has been performed at the levels of quaternary state, symmetry, and quaternary structure. Quaternary state and quaternary structure refers to the number of subunits and spatial arrangements of subunits, respectively. Using a large dataset of available 3D structures of biologically relevant assemblies, we show that only 53% of the distantly related homo-oligomeric proteins have the same quaternary state. Considering these homologous homo-oligomers with the same quaternary state, conservation of quaternary structures is observed only in 38% of the pairs. In 36% of the pairs of distantly related homo-oligomers with different quaternary states the larger assembly in a pair shows high structural similarity with the entire quaternary structure of the related protein with lower quaternary state and it is referred as "Russian doll effect." The differences in quaternary state and structure have been suggested to contribute to the functional diversity. Detailed investigations show that even though the gross functions of many distantly related homo-oligomers are the same, finer level differences in molecular functions are manifested by differences in quaternary states and structures. Comparison of structures of biological assemblies in distantly and closely related homo-oligomeric proteins throughout the study differentiates the effects of sequence divergence on the quaternary structures and function. Knowledge inferred from this study can provide insights for improved protein structure classification and function prediction of homo-oligomers. Proteins 2016; 84:1190-1202. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Intrinsically disordered proteins--relation to general model expressing the active role of the water environment.

PubMed

Kalinowska, Barbara; Banach, Mateusz; Konieczny, Leszek; Marchewka, Damian; Roterman, Irena

2014-01-01

This work discusses the role of unstructured polypeptide chain fragments in shaping the protein's hydrophobic core. Based on the "fuzzy oil drop" model, which assumes an idealized distribution of hydrophobicity density described by the 3D Gaussian, we can determine which fragments make up the core and pinpoint residues whose location conflicts with theoretical predictions. We show that the structural influence of the water environment determines the positions of disordered fragments, leading to the formation of a hydrophobic core overlaid by a hydrophilic mantle. This phenomenon is further described by studying selected proteins which are known to be unstable and contain intrinsically disordered fragments. Their properties are established quantitatively, explaining the causative relation between the protein's structure and function and facilitating further comparative analyses of various structural models. © 2014 Elsevier Inc. All rights reserved.
Structural study of the Fox-1 RRM protein hydration reveals a role for key water molecules in RRM-RNA recognition

PubMed Central

Blatter, Markus; Cléry, Antoine; Damberger, Fred F.

2017-01-01

Abstract The Fox-1 RNA recognition motif (RRM) domain is an important member of the RRM protein family. We report a 1.8 Å X-ray structure of the free Fox-1 containing six distinct monomers. We use this and the nuclear magnetic resonance (NMR) structure of the Fox-1 protein/RNA complex for molecular dynamics (MD) analyses of the structured hydration. The individual monomers of the X-ray structure show diverse hydration patterns, however, MD excellently reproduces the most occupied hydration sites. Simulations of the protein/RNA complex show hydration consistent with the isolated protein complemented by hydration sites specific to the protein/RNA interface. MD predicts intricate hydration sites with water-binding times extending up to hundreds of nanoseconds. We characterize two of them using NMR spectroscopy, RNA binding with switchSENSE and free-energy calculations of mutant proteins. Both hydration sites are experimentally confirmed and their abolishment reduces the binding free-energy. A quantitative agreement between theory and experiment is achieved for the S155A substitution but not for the S122A mutant. The S155 hydration site is evolutionarily conserved within the RRM domains. In conclusion, MD is an effective tool for predicting and interpreting the hydration patterns of protein/RNA complexes. Hydration is not easily detectable in NMR experiments but can affect stability of protein/RNA complexes. PMID:28505313
Constraint Network Analysis (CNA): a Python software package for efficiently linking biomacromolecular structure, flexibility, (thermo-)stability, and function.

PubMed

Pfleger, Christopher; Rathi, Prakash Chandra; Klein, Doris L; Radestock, Sebastian; Gohlke, Holger

2013-04-22

For deriving maximal advantage from information on biomacromolecular flexibility and rigidity, results from rigidity analyses must be linked to biologically relevant characteristics of a structure. Here, we describe the Python-based software package Constraint Network Analysis (CNA) developed for this task. CNA functions as a front- and backend to the graph-based rigidity analysis software FIRST. CNA goes beyond the mere identification of flexible and rigid regions in a biomacromolecule in that it (I) provides a refined modeling of thermal unfolding simulations that also considers the temperature-dependence of hydrophobic tethers, (II) allows performing rigidity analyses on ensembles of network topologies, either generated from structural ensembles or by using the concept of fuzzy noncovalent constraints, and (III) computes a set of global and local indices for quantifying biomacromolecular stability. This leads to more robust results from rigidity analyses and extends the application domain of rigidity analyses in that phase transition points ("melting points") and unfolding nuclei ("structural weak spots") are determined automatically. Furthermore, CNA robustly handles small-molecule ligands in general. Such advancements are important for applying rigidity analysis to data-driven protein engineering and for estimating the influence of ligand molecules on biomacromolecular stability. CNA maintains the efficiency of FIRST such that the analysis of a single protein structure takes a few seconds for systems of several hundred residues on a single core. These features make CNA an interesting tool for linking biomacromolecular structure, flexibility, (thermo-)stability, and function. CNA is available from http://cpclab.uni-duesseldorf.de/software for nonprofit organizations.
Molecular Mechanics and Dynamics Characterization of an "in silico" Mutated Protein: A Stand-Alone Lab Module or Support Activity for "in vivo" and "in vitro" Analyses of Targeted Proteins

ERIC Educational Resources Information Center

Chiang, Harry; Robinson, Lucy C.; Brame, Cynthia J.; Messina, Troy C.

2013-01-01

Over the past 20 years, the biological sciences have increasingly incorporated chemistry, physics, computer science, and mathematics to aid in the development and use of mathematical models. Such combined approaches have been used to address problems from protein structure-function relationships to the workings of complex biological systems.…
FlaF is a β-sandwich protein that anchors the archaellum in the archaeal cell envelope by binding the S-layer protein

DOE PAGES

Banerjee, Ankan; Tsai, Chi -Lin; Chaudhury, Paushali; ...

2015-05-01

Archaea employ the archaellum, a type IV pilus-like nanomachine, for swimming motility. In the crenarchaeon Sulfolobus acidocaldarius, the archaellum consists of seven proteins: FlaB/X/G/F/H/I/J. FlaF is conserved and essential for archaellum assembly but no FlaF structures exist. Here, we truncated the FlaF N terminus and solved 1.5-Å and 1.65-Å resolution crystal structures of this monotopic membrane protein. Structures revealed an N-terminal α-helix and an eight-strand β-sandwich, immunoglobulin-like fold with striking similarity to S-layer proteins. Crystal structures, X-ray scattering, and mutational analyses suggest dimer assembly is needed for in vivo function. The sole cell envelope component of S. acidocaldarius is amore » paracrystalline S-layer, and FlaF specifically bound to S-layer protein, suggesting that its interaction domain is located in the pseudoperiplasm with its N-terminal helix in the membrane. From these data, FlaF may act as the previously unknown archaellum stator protein that anchors the rotating archaellum to the archaeal cell envelope.« less
Discrete and structurally unique proteins (tāpirins) mediate attachment of extremely thermophilic Caldicellulosiruptor species to cellulose.

PubMed

Blumer-Schuette, Sara E; Alahuhta, Markus; Conway, Jonathan M; Lee, Laura L; Zurawski, Jeffrey V; Giannone, Richard J; Hettich, Robert L; Lunin, Vladimir V; Himmel, Michael E; Kelly, Robert M

2015-04-24

A variety of catalytic and noncatalytic protein domains are deployed by select microorganisms to deconstruct lignocellulose. These extracellular proteins are used to attach to, modify, and hydrolyze the complex polysaccharides present in plant cell walls. Cellulolytic enzymes, often containing carbohydrate-binding modules, are key to this process; however, these enzymes are not solely responsible for attachment. Few mechanisms of attachment have been discovered among bacteria that do not form large polypeptide structures, called cellulosomes, to deconstruct biomass. In this study, bioinformatics and proteomics analyses identified unique, discrete, hypothetical proteins ("tāpirins," origin from Māori: to join), not directly associated with cellulases, that mediate attachment to cellulose by species in the noncellulosomal, extremely thermophilic bacterial genus Caldicellulosiruptor. Two tāpirin genes are located directly downstream of a type IV pilus operon in strongly cellulolytic members of the genus, whereas homologs are absent from the weakly cellulolytic Caldicellulosiruptor species. Based on their amino acid sequence, tāpirins are specific to these extreme thermophiles. Tāpirins are also unusual in that they share no detectable protein domain signatures with known polysaccharide-binding proteins. Adsorption isotherm and trans vivo analyses demonstrated the carbohydrate-binding module-like affinity of the tāpirins for cellulose. Crystallization of a cellulose-binding truncation from one tāpirin indicated that these proteins form a long β-helix core with a shielded hydrophobic face. Furthermore, they are structurally unique and define a new class of polysaccharide adhesins. Strongly cellulolytic Caldicellulosiruptor species employ tāpirins to complement substrate-binding proteins from the ATP-binding cassette transporters and multidomain extracellular and S-layer-associated glycoside hydrolases to process the carbohydrate content of lignocellulose. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Discrete and Structurally Unique Proteins (Tāpirins) Mediate Attachment of Extremely Thermophilic Caldicellulosiruptor Species to Cellulose*

PubMed Central

Blumer-Schuette, Sara E.; Alahuhta, Markus; Conway, Jonathan M.; Lee, Laura L.; Zurawski, Jeffrey V.; Giannone, Richard J.; Hettich, Robert L.; Lunin, Vladimir V.; Himmel, Michael E.; Kelly, Robert M.

2015-01-01

A variety of catalytic and noncatalytic protein domains are deployed by select microorganisms to deconstruct lignocellulose. These extracellular proteins are used to attach to, modify, and hydrolyze the complex polysaccharides present in plant cell walls. Cellulolytic enzymes, often containing carbohydrate-binding modules, are key to this process; however, these enzymes are not solely responsible for attachment. Few mechanisms of attachment have been discovered among bacteria that do not form large polypeptide structures, called cellulosomes, to deconstruct biomass. In this study, bioinformatics and proteomics analyses identified unique, discrete, hypothetical proteins (“tāpirins,” origin from Māori: to join), not directly associated with cellulases, that mediate attachment to cellulose by species in the noncellulosomal, extremely thermophilic bacterial genus Caldicellulosiruptor. Two tāpirin genes are located directly downstream of a type IV pilus operon in strongly cellulolytic members of the genus, whereas homologs are absent from the weakly cellulolytic Caldicellulosiruptor species. Based on their amino acid sequence, tāpirins are specific to these extreme thermophiles. Tāpirins are also unusual in that they share no detectable protein domain signatures with known polysaccharide-binding proteins. Adsorption isotherm and trans vivo analyses demonstrated the carbohydrate-binding module-like affinity of the tāpirins for cellulose. Crystallization of a cellulose-binding truncation from one tāpirin indicated that these proteins form a long β-helix core with a shielded hydrophobic face. Furthermore, they are structurally unique and define a new class of polysaccharide adhesins. Strongly cellulolytic Caldicellulosiruptor species employ tāpirins to complement substrate-binding proteins from the ATP-binding cassette transporters and multidomain extracellular and S-layer-associated glycoside hydrolases to process the carbohydrate content of lignocellulose. PMID:25720489
Structural bioinformatics of the human spliceosomal proteome

PubMed Central

Korneta, Iga; Magnus, Marcin; Bujnicki, Janusz M.

2012-01-01

In this work, we describe the results of a comprehensive structural bioinformatics analysis of the spliceosomal proteome. We used fold recognition analysis to complement prior data on the ordered domains of 252 human splicing proteins. Examples of newly identified domains include a PWI domain in the U5 snRNP protein 200K (hBrr2, residues 258–338), while examples of previously known domains with a newly determined fold include the DUF1115 domain of the U4/U6 di-snRNP protein 90K (hPrp3, residues 540–683). We also established a non-redundant set of experimental models of spliceosomal proteins, as well as constructed in silico models for regions without an experimental structure. The combined set of structural models is available for download. Altogether, over 90% of the ordered regions of the spliceosomal proteome can be represented structurally with a high degree of confidence. We analyzed the reduced spliceosomal proteome of the intron-poor organism Giardia lamblia, and as a result, we proposed a candidate set of ordered structural regions necessary for a functional spliceosome. The results of this work will aid experimental and structural analyses of the spliceosomal proteins and complexes, and can serve as a starting point for multiscale modeling of the structure of the entire spliceosome. PMID:22573172
Supramolecular Structures with Blood Plasma Proteins, Sugars and Nanosilica

NASA Astrophysics Data System (ADS)

Turov, V. V.; Gun'ko, V. M.; Galagan, N. P.; Rugal, A. A.; Barvinchenko, V. M.; Gorbyk, P. P.

Supramolecular structures with blood plasma proteins (albumin, immunoglobulin and fibrinogen (HPF)), protein/water/silica and protein/water/ silica/sugar (glucose, fructose and saccharose) were studied by NMR, adsorption, IR and UV spectroscopy methods. Hydration parameters, amounts of weakly and strongly bound waters and interfacial energy (γ S) were determined over a wide range of component concentrations. The γ S(C protein,C silica) graphs were used to estimate the energy of protein-protein, protein-surface and particle-particle interactions. It was shown that interfacial energy of self-association (γ as) of protein molecules depends on a type of proteins. A large fraction of water bound to proteins can be displaced by sugars, and the effect of disaccharide (saccharose) was greater than that of monosugars. Changes in the structural parameters of cavities in HPF molecules and complexes with HPF/silica nanoparticles filled by bound water were analysed using NMR-cryoporometry showing that interaction of proteins with silica leads to a significant decrease in the amounts of water bound to both protein and silica surfaces. Bionanocomposites with BSA/nanosilica/sugar can be used to influence states of living cells and tissues after cryopreservation or other treatments. It was shown that interaction of proteins with silica leads to strong decrease in the volume of all types of internal cavities filled by water.
Conservation and divergence of C-terminal domain structure in the retinoblastoma protein family

DOE Office of Scientific and Technical Information (OSTI.GOV)

Liban, Tyler J.; Medina, Edgar M.; Tripathi, Sarvind

The retinoblastoma protein (Rb) and the homologous pocket proteins p107 and p130 negatively regulate cell proliferation by binding and inhibiting members of the E2F transcription factor family. The structural features that distinguish Rb from other pocket proteins have been unclear but are critical for understanding their functional diversity and determining why Rb has unique tumor suppressor activities. We describe here important differences in how the Rb and p107 C-terminal domains (CTDs) associate with the coiled-coil and marked-box domains (CMs) of E2Fs. We find that although CTD–CM binding is conserved across protein families, Rb and p107 CTDs show clear preferences formore » different E2Fs. A crystal structure of the p107 CTD bound to E2F5 and its dimer partner DP1 reveals the molecular basis for pocket protein–E2F binding specificity and how cyclin-dependent kinases differentially regulate pocket proteins through CTD phosphorylation. Our structural and biochemical data together with phylogenetic analyses of Rb and E2F proteins support the conclusion that Rb evolved specific structural motifs that confer its unique capacity to bind with high affinity those E2Fs that are the most potent activators of the cell cycle.« less
Computational and Statistical Analyses of Amino Acid Usage and Physico-Chemical Properties of the Twelve Late Embryogenesis Abundant Protein Classes

PubMed Central

Jaspard, Emmanuel; Macherel, David; Hunault, Gilles

2012-01-01

Late Embryogenesis Abundant Proteins (LEAPs) are ubiquitous proteins expected to play major roles in desiccation tolerance. Little is known about their structure - function relationships because of the scarcity of 3-D structures for LEAPs. The previous building of LEAPdb, a database dedicated to LEAPs from plants and other organisms, led to the classification of 710 LEAPs into 12 non-overlapping classes with distinct properties. Using this resource, numerous physico-chemical properties of LEAPs and amino acid usage by LEAPs have been computed and statistically analyzed, revealing distinctive features for each class. This unprecedented analysis allowed a rigorous characterization of the 12 LEAP classes, which differed also in multiple structural and physico-chemical features. Although most LEAPs can be predicted as intrinsically disordered proteins, the analysis indicates that LEAP class 7 (PF03168) and probably LEAP class 11 (PF04927) are natively folded proteins. This study thus provides a detailed description of the structural properties of this protein family opening the path toward further LEAP structure - function analysis. Finally, since each LEAP class can be clearly characterized by a unique set of physico-chemical properties, this will allow development of software to predict proteins as LEAPs. PMID:22615859
The presence of OMP inclusion bodies in a Escherichia coli K-12 mutated strain is not related to lipopolysaccharide structure.

PubMed

Corsaro, M Michela; Parrilli, Ermenegilda; Lanzetta, Rosa; Naldi, Teresa; Pieretti, Giuseppina; Lindner, Buko; Carpentieri, Andrea; Parrilli, Michelangelo; Tutino, M Luisa

2009-08-01

The role of lipopolysaccharides (LPSs) in the biogenesis of outer membrane proteins have been investigated in several studies. Some of these analyses showed that LPS is required for correct and efficient folding of outer membrane proteins; other studies support the idea of independence of outer membrane proteins biogenesis from LPS structure. In this article, we investigated the involvement of LPS structure in the anomalous aggregation of outer membrane proteins in a E. coli mutant strain (S17-1(lambdapir)). To achieve this aim, the LPS structure of the mutant strain was carefully determined and compared with the E. coli K-12 one. It turned out that LPS of these two strains differs in the inner core for the absence of a heptose residue (HepIII). We demonstrated that this difference is due to a mutation in waaQ, a gene encoding the transferase for the branch heptose HepIII residue. The mutation was complemented to find out if the restoration of LPS structure influenced the observed outer membrane proteins aggregation. Data reported in this work demonstrated that, in E. coli S17-1(lambdapir) there is no influence of LPS structure on the outer membrane proteins inclusion bodies formation.
Evidence for a large expansion and subfunctionalisation of globin genes in sea anemones.

PubMed

Smith, Hayden L; Pavasovic, Ana; Surm, Joachim M; Phillips, Matthew J; Prentis, Peter J

2018-06-27

The globin gene superfamily has been well-characterised in vertebrates, however, there has been limited research in early-diverging lineages, such as phylum Cnidaria. This study aimed to identify globin genes in multiple cnidarian lineages, and use bioinformatic approaches to characterise the evolution, structure and expression of these genes. Phylogenetic analyses and in silico protein predictions showed that all cnidarians have undergone an expansion of globin genes, which likely have a hexacoordinate protein structure. Our protein modelling has also revealed the possibility of a single pentacoordinate globin lineage in anthozoan species. Some cnidarian globin genes displayed tissue and development specific expression with very few orthologous genes similarly expressed across species. Our phylogenetic analyses also revealed that eumetazoan globin genes form a polyphyletic relationship with vertebrate globin genes. Overall, our analyses suggest that a Ngb-like and GbX-like gene were most likely present in the globin gene repertoire for the last common ancestor of eumetazoans. The identification of a large-scale expansion and subfunctionalisation of globin genes in actiniarians provides an excellent starting point to further our understanding of the evolution and function of the globin gene superfamily in early-diverging lineages.
Enhanced Bio-hydrogen Production from Protein Wastewater by Altering Protein Structure and Amino Acids Acidification Type

PubMed Central

Xiao, Naidong; Chen, Yinguang; Chen, Aihui; Feng, Leiyu

2014-01-01

Enhanced bio-hydrogen production from protein wastewater by altering protein structure and amino acids acidification type via pH control was investigated. The hydrogen production reached 205.2 mL/g-protein when protein wastewater was pretreated at pH 12 and then fermented at pH 10. The mechanism studies showed that pH 12 pretreatment significantly enhanced protein bio-hydrolysis during the subsequent fermentation stage as it caused the unfolding of protein, damaged the protein hydrogen bonding networks, and destroyed the disulfide bridges, which increased the susceptibility of protein to protease. Moreover, pH 10 fermentation produced more acetic but less propionic acid during the anaerobic fermentation of amino acids, which was consistent with the theory of fermentation type affecting hydrogen production. Further analyses of the critical enzymes, genes, and microorganisms indicated that the activity and abundance of hydrogen producing bacteria in the pH 10 fermentation reactor were greater than those in the control. PMID:24495932
Enhanced bio-hydrogen production from protein wastewater by altering protein structure and amino acids acidification type.

PubMed

Xiao, Naidong; Chen, Yinguang; Chen, Aihui; Feng, Leiyu

2014-02-05

Enhanced bio-hydrogen production from protein wastewater by altering protein structure and amino acids acidification type via pH control was investigated. The hydrogen production reached 205.2 mL/g-protein when protein wastewater was pretreated at pH 12 and then fermented at pH 10. The mechanism studies showed that pH 12 pretreatment significantly enhanced protein bio-hydrolysis during the subsequent fermentation stage as it caused the unfolding of protein, damaged the protein hydrogen bonding networks, and destroyed the disulfide bridges, which increased the susceptibility of protein to protease. Moreover, pH 10 fermentation produced more acetic but less propionic acid during the anaerobic fermentation of amino acids, which was consistent with the theory of fermentation type affecting hydrogen production. Further analyses of the critical enzymes, genes, and microorganisms indicated that the activity and abundance of hydrogen producing bacteria in the pH 10 fermentation reactor were greater than those in the control.
Enhanced Bio-hydrogen Production from Protein Wastewater by Altering Protein Structure and Amino Acids Acidification Type

NASA Astrophysics Data System (ADS)

Xiao, Naidong; Chen, Yinguang; Chen, Aihui; Feng, Leiyu

2014-02-01

Enhanced bio-hydrogen production from protein wastewater by altering protein structure and amino acids acidification type via pH control was investigated. The hydrogen production reached 205.2 mL/g-protein when protein wastewater was pretreated at pH 12 and then fermented at pH 10. The mechanism studies showed that pH 12 pretreatment significantly enhanced protein bio-hydrolysis during the subsequent fermentation stage as it caused the unfolding of protein, damaged the protein hydrogen bonding networks, and destroyed the disulfide bridges, which increased the susceptibility of protein to protease. Moreover, pH 10 fermentation produced more acetic but less propionic acid during the anaerobic fermentation of amino acids, which was consistent with the theory of fermentation type affecting hydrogen production. Further analyses of the critical enzymes, genes, and microorganisms indicated that the activity and abundance of hydrogen producing bacteria in the pH 10 fermentation reactor were greater than those in the control.
From Genomes to Protein Models and Back

NASA Astrophysics Data System (ADS)

Tramontano, Anna; Giorgetti, Alejandro; Orsini, Massimiliano; Raimondo, Domenico

2007-12-01

The alternative splicing mechanism allows genes to generate more than one product. When the splicing events occur within protein coding regions they can modify the biological function of the protein. Alternative splicing has been suggested as one way for explaining the discrepancy between the number of human genes and functional complexity. We analysed the putative structure of the alternatively spliced gene products annotated in the ENCODE pilot project and discovered that many of the potential alternative gene products will be unlikely to produce stable functional proteins.

Structure-Functional Basis of Ion Transport in Sodium–Calcium Exchanger (NCX) Proteins

PubMed Central

Giladi, Moshe; Shor, Reut; Lisnyansky, Michal; Khananshvili, Daniel

2016-01-01

The membrane-bound sodium–calcium exchanger (NCX) proteins shape Ca2+ homeostasis in many cell types, thus participating in a wide range of physiological and pathological processes. Determination of the crystal structure of an archaeal NCX (NCX_Mj) paved the way for a thorough and systematic investigation of ion transport mechanisms in NCX proteins. Here, we review the data gathered from the X-ray crystallography, molecular dynamics simulations, hydrogen–deuterium exchange mass-spectrometry (HDX-MS), and ion-flux analyses of mutants. Strikingly, the apo NCX_Mj protein exhibits characteristic patterns in the local backbone dynamics at particular helix segments, thereby possessing characteristic HDX profiles, suggesting structure-dynamic preorganization (geometric arrangements of catalytic residues before the transition state) of conserved α1 and α2 repeats at ion-coordinating residues involved in transport activities. Moreover, dynamic preorganization of local structural entities in the apo protein predefines the status of ion-occlusion and transition states, even though Na+ or Ca2+ binding modifies the preceding backbone dynamics nearby functionally important residues. Future challenges include resolving the structural-dynamic determinants governing the ion selectivity, functional asymmetry and ion-induced alternating access. Taking into account the structural similarities of NCX_Mj with the other proteins belonging to the Ca2+/cation exchanger superfamily, the recent findings can significantly improve our understanding of ion transport mechanisms in NCX and similar proteins. PMID:27879668
Structure-Functional Basis of Ion Transport in Sodium-Calcium Exchanger (NCX) Proteins.

PubMed

Giladi, Moshe; Shor, Reut; Lisnyansky, Michal; Khananshvili, Daniel

2016-11-22

The membrane-bound sodium-calcium exchanger (NCX) proteins shape Ca 2+ homeostasis in many cell types, thus participating in a wide range of physiological and pathological processes. Determination of the crystal structure of an archaeal NCX (NCX_Mj) paved the way for a thorough and systematic investigation of ion transport mechanisms in NCX proteins. Here, we review the data gathered from the X-ray crystallography, molecular dynamics simulations, hydrogen-deuterium exchange mass-spectrometry (HDX-MS), and ion-flux analyses of mutants. Strikingly, the apo NCX_Mj protein exhibits characteristic patterns in the local backbone dynamics at particular helix segments, thereby possessing characteristic HDX profiles, suggesting structure-dynamic preorganization (geometric arrangements of catalytic residues before the transition state) of conserved α₁ and α₂ repeats at ion-coordinating residues involved in transport activities. Moreover, dynamic preorganization of local structural entities in the apo protein predefines the status of ion-occlusion and transition states, even though Na⁺ or Ca 2+ binding modifies the preceding backbone dynamics nearby functionally important residues. Future challenges include resolving the structural-dynamic determinants governing the ion selectivity, functional asymmetry and ion-induced alternating access. Taking into account the structural similarities of NCX_Mj with the other proteins belonging to the Ca 2+ /cation exchanger superfamily, the recent findings can significantly improve our understanding of ion transport mechanisms in NCX and similar proteins.
MALDI Top-Down sequencing: calling N- and C-terminal protein sequences with high confidence and speed.

PubMed

Suckau, Detlev; Resemann, Anja

2009-12-01

The ability to match Top-Down protein sequencing (TDS) results by MALDI-TOF to protein sequences by classical protein database searching was evaluated in this work. Resulting from these analyses were the protein identity, the simultaneous assignment of the N- and C-termini and protein sequences of up to 70 residues from either terminus. In combination with de novo sequencing using the MALDI-TDS data, even fusion proteins were assigned and the detailed sequence around the fusion site was elucidated. MALDI-TDS allowed to efficiently match protein sequences quickly and to validate recombinant protein structures-in particular, protein termini-on the level of undigested proteins.
Objective identification of residue ranges for the superposition of protein structures

PubMed Central

2011-01-01

Background The automation of objectively selecting amino acid residue ranges for structure superpositions is important for meaningful and consistent protein structure analyses. So far there is no widely-used standard for choosing these residue ranges for experimentally determined protein structures, where the manual selection of residue ranges or the use of suboptimal criteria remain commonplace. Results We present an automated and objective method for finding amino acid residue ranges for the superposition and analysis of protein structures, in particular for structure bundles resulting from NMR structure calculations. The method is implemented in an algorithm, CYRANGE, that yields, without protein-specific parameter adjustment, appropriate residue ranges in most commonly occurring situations, including low-precision structure bundles, multi-domain proteins, symmetric multimers, and protein complexes. Residue ranges are chosen to comprise as many residues of a protein domain that increasing their number would lead to a steep rise in the RMSD value. Residue ranges are determined by first clustering residues into domains based on the distance variance matrix, and then refining for each domain the initial choice of residues by excluding residues one by one until the relative decrease of the RMSD value becomes insignificant. A penalty for the opening of gaps favours contiguous residue ranges in order to obtain a result that is as simple as possible, but not simpler. Results are given for a set of 37 proteins and compared with those of commonly used protein structure validation packages. We also provide residue ranges for 6351 NMR structures in the Protein Data Bank. Conclusions The CYRANGE method is capable of automatically determining residue ranges for the superposition of protein structure bundles for a large variety of protein structures. The method correctly identifies ordered regions. Global structure superpositions based on the CYRANGE residue ranges allow a clear presentation of the structure, and unnecessary small gaps within the selected ranges are absent. In the majority of cases, the residue ranges from CYRANGE contain fewer gaps and cover considerably larger parts of the sequence than those from other methods without significantly increasing the RMSD values. CYRANGE thus provides an objective and automatic method for standardizing the choice of residue ranges for the superposition of protein structures. PMID:21592348
StralSV: assessment of sequence variability within similar 3D structures and application to polio RNA-dependent RNA polymerase.

PubMed

Zemla, Adam T; Lang, Dorothy M; Kostova, Tanya; Andino, Raul; Ecale Zhou, Carol L

2011-06-02

Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory--still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could help overcome these difficulties by facilitating the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV (structure-alignment sequence variability), a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus, and we demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique, or that share structural similarity with proteins that would be considered distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local structural alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein. StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected residues at a given sequence position. StralSV is provided as a web service at http://proteinmodel.org/AS2TS/STRALSV/.
Evaluation of water displacement energetics in protein binding sites with grid cell theory.

PubMed

Gerogiokas, G; Southey, M W Y; Mazanetz, M P; Heifetz, A; Hefeitz, A; Bodkin, M; Law, R J; Michel, J

2015-04-07

Excess free energies, enthalpies and entropies of water in protein binding sites were computed via classical simulations and Grid Cell Theory (GCT) analyses for three pairs of congeneric ligands in complex with the proteins scytalone dehydratase, p38α MAP kinase and EGFR kinase respectively. Comparative analysis is of interest since the binding modes for each ligand pair differ in the displacement of one binding site water molecule, but significant variations in relative binding affinities are observed. Protocols that vary in their use of restraints on protein and ligand atoms were compared to determine the influence of protein-ligand flexibility on computed water structure and energetics, and to assess protocols for routine analyses of protein-ligand complexes. The GCT-derived binding affinities correctly reproduce experimental trends, but the magnitude of the predicted changes in binding affinities is exaggerated with respect to results from a previous Monte Carlo Free Energy Perturbation study. Breakdown of the GCT water free energies into enthalpic and entropic components indicates that enthalpy changes dominate the observed variations in energetics. In EGFR kinase GCT analyses revealed that replacement of a pyrimidine by a cyanopyridine perturbs water energetics up three hydration shells away from the ligand.
Functional and Genomic Analyses of Alpha-Solenoid Proteins

PubMed Central

Fournier, David; Palidwor, Gareth A.; Shcherbinin, Sergey; Szengel, Angelika; Schaefer, Martin H.; Perez-Iratxeta, Carol; Andrade-Navarro, Miguel A.

2013-01-01

Alpha-solenoids are flexible protein structural domains formed by ensembles of alpha-helical repeats (Armadillo and HEAT repeats among others). While homology can be used to detect many of these repeats, some alpha-solenoids have very little sequence homology to proteins of known structure and we expect that many remain undetected. We previously developed a method for detection of alpha-helical repeats based on a neural network trained on a dataset of protein structures. Here we improved the detection algorithm and updated the training dataset using recently solved structures of alpha-solenoids. Unexpectedly, we identified occurrences of alpha-solenoids in solved protein structures that escaped attention, for example within the core of the catalytic subunit of PI3KC. Our results expand the current set of known alpha-solenoids. Application of our tool to the protein universe allowed us to detect their significant enrichment in proteins interacting with many proteins, confirming that alpha-solenoids are generally involved in protein-protein interactions. We then studied the taxonomic distribution of alpha-solenoids to discuss an evolutionary scenario for the emergence of this type of domain, speculating that alpha-solenoids have emerged in multiple taxa in independent events by convergent evolution. We observe a higher rate of alpha-solenoids in eukaryotic genomes and in some prokaryotic families, such as Cyanobacteria and Planctomycetes, which could be associated to increased cellular complexity. The method is available at http://cbdm.mdc-berlin.de/~ard2/. PMID:24278209
Random close packing in protein cores

NASA Astrophysics Data System (ADS)

Ohern, Corey

Shortly after the determination of the first protein x-ray crystal structures, researchers analyzed their cores and reported packing fractions ϕ ~ 0 . 75 , a value that is similar to close packing equal-sized spheres. A limitation of these analyses was the use of `extended atom' models, rather than the more physically accurate `explicit hydrogen' model. The validity of using the explicit hydrogen model is proved by its ability to predict the side chain dihedral angle distributions observed in proteins. We employ the explicit hydrogen model to calculate the packing fraction of the cores of over 200 high resolution protein structures. We find that these protein cores have ϕ ~ 0 . 55 , which is comparable to random close-packing of non-spherical particles. This result provides a deeper understanding of the physical basis of protein structure that will enable predictions of the effects of amino acid mutations and design of new functional proteins. We gratefully acknowledge the support of the Raymond and Beverly Sackler Institute for Biological, Physical, and Engineering Sciences, National Library of Medicine training grant T15LM00705628 (J.C.G.), and National Science Foundation DMR-1307712 (L.R.).
Functional, structural and phylogenetic analysis of domains underlying the Al sensitivity of the aluminum-activated malate/anion transporter, TaALMT1.

PubMed

Ligaba, Ayalew; Dreyer, Ingo; Margaryan, Armine; Schneider, David J; Kochian, Leon; Piñeros, Miguel

2013-12-01

Triticum aestivum aluminum-activated malate transporter (TaALMT1) is the founding member of a unique gene family of anion transporters (ALMTs) that mediate the efflux of organic acids. A small sub-group of root-localized ALMTs, including TaALMT1, is physiologically associated with in planta aluminum (Al) resistance. TaALMT1 exhibits significant enhancement of transport activity in response to extracellular Al. In this study, we integrated structure-function analyses of structurally altered TaALMT1 proteins expressed in Xenopus oocytes with phylogenic analyses of the ALMT family. Our aim is to re-examine the role of protein domains in terms of their potential involvement in the Al-dependent enhancement (i.e. Al-responsiveness) of TaALMT1 transport activity, as well as the roles of all its 43 negatively charged amino acid residues. Our results indicate that the N-domain, which is predicted to form the conductive pathway, mediates ion transport even in the absence of the C-domain. However, segments in both domains are involved in Al(3+) sensing. We identified two regions, one at the N-terminus and a hydrophobic region at the C-terminus, that jointly contribute to the Al-response phenotype. Interestingly, the characteristic motif at the N-terminus appears to be specific for Al-responsive ALMTs. Our study highlights the need to include a comprehensive phylogenetic analysis when drawing inferences from structure-function analyses, as a significant proportion of the functional changes observed for TaALMT1 are most likely the result of alterations in the overall structural integrity of ALMT family proteins rather than modifications of specific sites involved in Al(3+) sensing. © 2013 The Authors The Plant Journal © 2013 John Wiley & Sons Ltd.
Structural Bioinformatics of the Interactome

PubMed Central

Petrey, Donald; Honig, Barry

2014-01-01

The last decade has seen a dramatic expansion in the number and range of techniques available to obtain genome-wide information, and to analyze this information so as to infer both the function of individual molecules and how they interact to modulate the behavior of biological systems. Here we review these techniques, focusing on the construction of physical protein-protein interaction networks, and highlighting approaches that incorporate protein structure which is becoming an increasingly important component of systems-level computational techniques. We also discuss how network analyses are being applied to enhance the basic understanding of biological systems and their disregulation, and how they are being applied in drug development. PMID:24895853
Modelling protein functional domains in signal transduction using Maude

NASA Technical Reports Server (NTRS)

Sriram, M. G.

2003-01-01

Modelling of protein-protein interactions in signal transduction is receiving increased attention in computational biology. This paper describes recent research in the application of Maude, a symbolic language founded on rewriting logic, to the modelling of functional domains within signalling proteins. Protein functional domains (PFDs) are a critical focus of modern signal transduction research. In general, Maude models can simulate biological signalling networks and produce specific testable hypotheses at various levels of abstraction. Developing symbolic models of signalling proteins containing functional domains is important because of the potential to generate analyses of complex signalling networks based on structure-function relationships.
Identifying intrinsically disordered protein regions likely to undergo binding-induced helical transitions.

PubMed

Glover, Karen; Mei, Yang; Sinha, Sangita C

2016-10-01

Many proteins contain intrinsically disordered regions (IDRs) lacking stable secondary and ordered tertiary structure. IDRs are often implicated in macromolecular interactions, and may undergo structural transitions upon binding to interaction partners. However, as binding partners of many protein IDRs are unknown, these structural transitions are difficult to verify and often are poorly understood. In this study we describe a method to identify IDRs that are likely to undergo helical transitions upon binding. This method combines bioinformatics analyses followed by circular dichroism spectroscopy to monitor 2,2,2-trifluoroethanol (TFE)-induced changes in secondary structure content of these IDRs. Our results demonstrate that there is no significant change in the helicity of IDRs that are not predicted to fold upon binding. IDRs that are predicted to fold fall into two groups: one group does not become helical in the presence of TFE and includes examples of IDRs that form β-strands upon binding, while the other group becomes more helical and includes examples that are known to fold into helices upon binding. Therefore, we propose that bioinformatics analyses combined with experimental evaluation using TFE may provide a general method to identify IDRs that undergo binding-induced disorder-to-helix transitions. Copyright © 2016 Elsevier B.V. All rights reserved.
Effect of Mutations on HP Lattice Proteins

NASA Astrophysics Data System (ADS)

Shi, Guangjie; Vogel, Thomas; Landau, David; Li, Ying; Wüst, Thomas

2013-03-01

Using Wang-Landau sampling with approriate trial moves[2], we investigate the effect of different types of mutations on lattice proteins in the HP model. While exact studies have been carried out for short HP proteins[3], the systems we investigate are of much larger size and hence not accessible for exact enumerations. Based on the estimated density of states, we systematically analyse the changes in structure and degeneracy of ground states of particular proteins and measure thermodynamic quantities like the stability of ground states and the specific heat, for example. Both, neutral mutations, which do not change the structure and stability of ground states, as well as critical mutations, which do change the thermodynamic behavior qualitatively, have been observed. Research supported by NSF
Molecular and functional analyses of a maize autoactive NB-LRR protein identify precise structural requirements for activity

USDA-ARS?s Scientific Manuscript database

Plant disease resistance is often mediated by nucleotide binding-leucine rich repeat (NB-LRR or NLR) proteins, which trigger a hypersensitive response (HR), a rapid, localized cell death upon recognition of specific pathogens. The maize NLR-encoding Rp1-D21 gene is the result of an intergenic recomb...
Low-temperature protein dynamics: a simulation analysis of interprotein vibrations and the boson peak at 150 k.

PubMed

Kurkal-Siebert, Vandana; Smith, Jeremy C

2006-02-22

An understanding of low-frequency, collective protein dynamics at low temperatures can furnish valuable information on functional protein energy landscapes, on the origins of the protein glass transition and on protein-protein interactions. Here, molecular dynamics (MD) simulations and normal-mode analyses are performed on various models of crystalline myoglobin in order to characterize intra- and interprotein vibrations at 150 K. Principal component analysis of the MD trajectories indicates that the Boson peak, a broad peak in the dynamic structure factor centered at about approximately 2-2.5 meV, originates from approximately 10(2) collective, harmonic vibrations. An accurate description of the environment is found to be essential in reproducing the experimental Boson peak form and position. At lower energies other strong peaks are found in the calculated dynamic structure factor. Characterization of these peaks shows that they arise from harmonic vibrations of proteins relative to each other. These vibrations are likely to furnish valuable information on the physical nature of protein-protein interactions.
Structural bioinformatics: methods, concepts and applications to blood coagulation proteins.

PubMed

Villoutreix, Bruno O

2002-06-01

Structural and theoretical analyses of proteins are central to the understanding of complex molecular mechanisms and are fundamental to the drug discovery process. Computational techniques yield useful insights into an ever-wider range of biomolecular systems. Protein three-dimensional structures and molecular functions can be predicted in some circumstances, while experimental structures can be analyzed in depth via such computational approaches. Non-covalent binding of biomolecules can be understood by considering structural, thermodynamic and kinetic issues, and theoretical simulations of such events can be attempted. The central role of electrostatic interactions with regard to protein function, structure and stability has been investigated and some electrostatic properties can be modeled theoretically. Computer methods thus help to prioritize, design, analyze and rationalize biochemical experiments. Cardiovascular diseases and associated blood coagulation disorders are leading causes of death worldwide. Blood coagulation involves more than 30 proteins that interact specifically with various degrees of affinity. Many of these molecules can also bind transiently to phospholipid surfaces. Numerous point mutations in the genes of coagulation proteins and regulators have been identified. Understanding the coagulation cascade, its regulation and the impact of mutations is required for the development of new therapies and diagnostic tools. In this review, we describe concepts and methods pertaining to the field of structural bioinformatics. We provide examples of applications of these approaches to blood coagulation proteins and show that such studies can give insights about molecular mechanisms contributing to cardiovascular disease susceptibility.
Identification of a Unique Fe-S Cluster Binding Site in a Glycyl-Radical Type Microcompartment Shell Protein

PubMed Central

Thompson, Michael C.; Wheatley, Nicole M.; Jorda, Julien; Sawaya, Michael R.; Gidaniyan, Soheil D.; Ahmed, Hoda; Yang, Zhongyu; McCarty, Krystal N.; Whitelegge, Julian P.; Yeates, Todd O.

2014-01-01

Recently, progress has been made toward understanding the functional diversity of bacterial microcompartment (MCP) systems, which serve as protein-based metabolic organelles in diverse microbes. New types of MCPs have been identified, including the glycyl-radical propanediol (Grp) MCP. Within these elaborate protein complexes, BMC-domain shell proteins assemble to form a polyhedral barrier that encapsulates the enzymatic contents of the MCP. Interestingly, the Grp MCP contains a number of shell proteins with unusual sequence features. GrpU is one such shell protein, whose amino acid sequence is particularly divergent from other members of the BMC-domain superfamily of proteins that effectively defines all MCPs. Expression, purification, and subsequent characterization of the protein showed, unexpectedly, that it binds an iron-sulfur cluster. We determined X-ray crystal structures of two GrpU orthologs, providing the first structural insight into the homohexameric BMC-domain shell proteins of the Grp system. The X-ray structures of GrpU, both obtained in the apo form, combined with spectroscopic analyses and computational modeling, show that the metal cluster resides in the central pore of the BMC shell protein at a position of broken 6-fold symmetry. The result is a structurally polymorphic iron-sulfur cluster binding site that appears to be unique among metalloproteins studied to date. PMID:25102080
Coevolution at protein complex interfaces can be detected by the complementarity trace with important impact for predictive docking

PubMed Central

Madaoui, Hocine; Guerois, Raphaël

2008-01-01

Protein surfaces are under significant selection pressure to maintain interactions with their partners throughout evolution. Capturing how selection pressure acts at the interfaces of protein–protein complexes is a fundamental issue with high interest for the structural prediction of macromolecular assemblies. We tackled this issue under the assumption that, throughout evolution, mutations should minimally disrupt the physicochemical compatibility between specific clusters of interacting residues. This constraint drove the development of the so-called Surface COmplementarity Trace in Complex History score (SCOTCH), which was found to discriminate with high efficiency the structure of biological complexes. SCOTCH performances were assessed not only with respect to other evolution-based approaches, such as conservation and coevolution analyses, but also with respect to statistically based scoring methods. Validated on a set of 129 complexes of known structure exhibiting both permanent and transient intermolecular interactions, SCOTCH appears as a robust strategy to guide the prediction of protein–protein complex structures. Of particular interest, it also provides a basic framework to efficiently track how protein surfaces could evolve while keeping their partners in contact. PMID:18511568
An integrated native mass spectrometry and top-down proteomics method that connects sequence to structure and function of macromolecular complexes

NASA Astrophysics Data System (ADS)

Li, Huilin; Nguyen, Hong Hanh; Ogorzalek Loo, Rachel R.; Campuzano, Iain D. G.; Loo, Joseph A.

2018-02-01

Mass spectrometry (MS) has become a crucial technique for the analysis of protein complexes. Native MS has traditionally examined protein subunit arrangements, while proteomics MS has focused on sequence identification. These two techniques are usually performed separately without taking advantage of the synergies between them. Here we describe the development of an integrated native MS and top-down proteomics method using Fourier-transform ion cyclotron resonance (FTICR) to analyse macromolecular protein complexes in a single experiment. We address previous concerns of employing FTICR MS to measure large macromolecular complexes by demonstrating the detection of complexes up to 1.8 MDa, and we demonstrate the efficacy of this technique for direct acquirement of sequence to higher-order structural information with several large complexes. We then summarize the unique functionalities of different activation/dissociation techniques. The platform expands the ability of MS to integrate proteomics and structural biology to provide insights into protein structure, function and regulation.
Super: a web server to rapidly screen superposable oligopeptide fragments from the protein data bank

PubMed Central

Collier, James H.; Lesk, Arthur M.; Garcia de la Banda, Maria; Konagurthu, Arun S.

2012-01-01

Searching for well-fitting 3D oligopeptide fragments within a large collection of protein structures is an important task central to many analyses involving protein structures. This article reports a new web server, Super, dedicated to the task of rapidly screening the protein data bank (PDB) to identify all fragments that superpose with a query under a prespecified threshold of root-mean-square deviation (RMSD). Super relies on efficiently computing a mathematical bound on the commonly used structural similarity measure, RMSD of superposition. This allows the server to filter out a large proportion of fragments that are unrelated to the query; >99% of the total number of fragments in some cases. For a typical query, Super scans the current PDB containing over 80 500 structures (with ∼40 million potential oligopeptide fragments to match) in under a minute. Super web server is freely accessible from: http://lcb.infotech.monash.edu.au/super. PMID:22638586

Imparting albumin-binding affinity to a human protein by mimicking the contact surface of a bacterial binding protein.

PubMed

Oshiro, Satoshi; Honda, Shinya

2014-04-18

Attachment of a bacterial albumin-binding protein module is an attractive strategy for extending the plasma residence time of protein therapeutics. However, a protein fused with such a bacterial module could induce unfavorable immune reactions. To address this, we designed an alternative binding protein by imparting albumin-binding affinity to a human protein using molecular surface grafting. The result was a series of human-derived 6 helix-bundle proteins, one of which specifically binds to human serum albumin (HSA) with adequate affinity (KD = 100 nM). The proteins were designed by transferring key binding residues of a bacterial albumin-binding module, Finegoldia magna protein G-related albumin-binding domain (GA) module, onto the human protein scaffold. Despite 13-15 mutations, the designed proteins maintain the original secondary structure by virtue of careful grafting based on structural informatics. Competitive binding assays and thermodynamic analyses of the best binders show that the binding mode resembles that of the GA module, suggesting that the contacting surface of the GA module is mimicked well on the designed protein. These results indicate that the designed protein may act as an alternative low-risk binding module to HSA. Furthermore, molecular surface grafting in combination with structural informatics is an effective approach for avoiding deleterious mutations on a target protein and for imparting the binding function of one protein onto another.
i3Drefine software for protein 3D structure refinement and its assessment in CASP10.

PubMed

Bhattacharya, Debswapna; Cheng, Jianlin

2013-01-01

Protein structure refinement refers to the process of improving the qualities of protein structures during structure modeling processes to bring them closer to their native states. Structure refinement has been drawing increasing attention in the community-wide Critical Assessment of techniques for Protein Structure prediction (CASP) experiments since its addition in 8(th) CASP experiment. During the 9(th) and recently concluded 10(th) CASP experiments, a consistent growth in number of refinement targets and participating groups has been witnessed. Yet, protein structure refinement still remains a largely unsolved problem with majority of participating groups in CASP refinement category failed to consistently improve the quality of structures issued for refinement. In order to alleviate this need, we developed a completely automated and computationally efficient protein 3D structure refinement method, i3Drefine, based on an iterative and highly convergent energy minimization algorithm with a powerful all-atom composite physics and knowledge-based force fields and hydrogen bonding (HB) network optimization technique. In the recent community-wide blind experiment, CASP10, i3Drefine (as 'MULTICOM-CONSTRUCT') was ranked as the best method in the server section as per the official assessment of CASP10 experiment. Here we provide the community with free access to i3Drefine software and systematically analyse the performance of i3Drefine in strict blind mode on the refinement targets issued in CASP10 refinement category and compare with other state-of-the-art refinement methods participating in CASP10. Our analysis demonstrates that i3Drefine is only fully-automated server participating in CASP10 exhibiting consistent improvement over the initial structures in both global and local structural quality metrics. Executable version of i3Drefine is freely available at http://protein.rnet.missouri.edu/i3drefine/.
Structural, Bioinformatic, and In Vivo Analyses of Two Treponema pallidum Lipoproteins Reveal a Unique TRAP Transporter

DOE Office of Scientific and Technical Information (OSTI.GOV)

Deka, Ranjit K.; Brautigam, Chad A.; Goldberg, Martin

2012-05-25

Treponema pallidum, the bacterial agent of syphilis, is predicted to encode one tripartite ATP-independent periplasmic transporter (TRAP-T). TRAP-Ts typically employ a periplasmic substrate-binding protein (SBP) to deliver the cognate ligand to the transmembrane symporter. Herein, we demonstrate that the genes encoding the putative TRAP-T components from T. pallidum, tp0957 (the SBP), and tp0958 (the symporter), are in an operon with an uncharacterized third gene, tp0956. We determined the crystal structure of recombinant Tp0956; the protein is trimeric and perforated by a pore. Part of Tp0956 forms an assembly similar to those of 'tetratricopeptide repeat' (TPR) motifs. The crystal structure ofmore » recombinant Tp0957 was also determined; like the SBPs of other TRAP-Ts, there are two lobes separated by a cleft. In these other SBPs, the cleft binds a negatively charged ligand. However, the cleft of Tp0957 has a strikingly hydrophobic chemical composition, indicating that its ligand may be substantially different and likely hydrophobic. Analytical ultracentrifugation of the recombinant versions of Tp0956 and Tp0957 established that these proteins associate avidly. This unprecedented interaction was confirmed for the native molecules using in vivo cross-linking experiments. Finally, bioinformatic analyses suggested that this transporter exemplifies a new subfamily of TPATs (TPR-protein-associated TRAP-Ts) that require the action of a TPR-containing accessory protein for the periplasmic transport of a potentially hydrophobic ligand(s).« less
Structure and function of small heat shock/alpha-crystallin proteins: established concepts and emerging ideas.

PubMed

MacRae, T H

2000-06-01

Small heat shock/alpha-crystallin proteins are defined by conserved sequence of approximately 90 amino acid residues, termed the alpha-crystallin domain, which is bounded by variable amino- and carboxy-terminal extensions. These proteins form oligomers, most of uncertain quaternary structure, and oligomerization is prerequisite to their function as molecular chaperones. Sequence modelling and physical analyses show that the secondary structure of small heat shock/alpha-crystallin proteins is predominately beta-pleated sheet. Crystallography, site-directed spin-labelling and yeast two-hybrid selection demonstrate regions of secondary structure within the alpha-crystallin domain that interact during oligomer assembly, a process also dependent on the amino terminus. Oligomers are dynamic, exhibiting subunit exchange and organizational plasticity, perhaps leading to functional diversity. Exposure of hydrophobic residues by structural modification facilitates chaperoning where denaturing proteins in the molten globule state associate with oligomers. The flexible carboxy-terminal extension contributes to chaperone activity by enhancing the solubility of small heat shock/alpha-crystallin proteins. Site-directed mutagenesis has yielded proteins where the effect of the change on structure and function depends upon the residue modified, the organism under study and the analytical techniques used. Most revealing, substitution of a conserved arginine residue within the alpha-crystallin domain has a major impact on quaternary structure and chaperone action probably through realignment of beta-sheets. These mutations are linked to inherited diseases. Oligomer size is regulated by a stress-responsive cascade including MAPKAP kinase 2/3 and p38. Phosphorylation of small heat shock/alpha-crystallin proteins has important consequences within stressed cells, especially for microfilaments.
Structure of a pentameric virion-associated fiber with a potential role in Orsay virus entry to host cells

PubMed Central

Yuan, Wang; Zhou, Ying; Wang, Tao; Demeler, Borries; Zhong, Weiwei; Tao, Yizhi J.

2017-01-01

Despite the wide use of Caenorhabditis elegans as a model organism, the first virus naturally infecting this organism was not discovered until six years ago. The Orsay virus and its related nematode viruses have a positive-sense RNA genome, encoding three proteins: CP, RdRP, and a novel δ protein that shares no homology with any other proteins. δ can be expressed either as a free δ or a CP-δ fusion protein by ribosomal frameshift, but the structure and function of both δ and CP-δ remain unknown. Using a combination of electron microscopy, X-ray crystallography, computational and biophysical analyses, here we show that the Orsay δ protein forms a ~420-Å long, pentameric fiber with an N-terminal α-helical bundle, a β-stranded filament in the middle, and a C-terminal head domain. The pentameric nature of the δ fiber has been independently confirmed by both mass spectrometry and analytical ultracentrifugation. Recombinant Orsay capsid containing CP-δ shows protruding long fibers with globular heads at the distal end. Mutant viruses with disrupted CP-δ fibers were generated by organism-based reverse genetics. These viruses were found to be either non-viable or with poor infectivity according to phenotypic and qRT-PCR analyses. Furthermore, addition of purified δ proteins to worm culture greatly reduced Orsay infectivity in a sequence-specific manner. Based on the structure resemblance between the Orsay CP-δ fiber and the fibers from reovirus and adenovirus, we propose that CP-δ functions as a cell attachment protein to mediate Orsay entry into worm intestine cells. PMID:28241071
Association of Membrane Rafts and Postsynaptic Density: Proteomics, Biochemical, and Ultrastructural Analyses

PubMed Central

Suzuki, Tatsuo; Zhang, Jingping; Miyazawa, Shoko; Liu, Qian; Farzan, Michael R.; Yao, Wei-Dong

2011-01-01

Postsynaptic membrane rafts are believed to play important roles in synaptic signaling, plasticity, and maintenance. However, their molecular identities remain elusive. Further, how they interact with the well-established signaling specialization, the postsynaptic density (PSD), is poorly understood. We previously detected a number of conventional PSD proteins in detergent-resistant membranes (DRMs). Here, we have performed LC-MS/MS (liquid chromatography coupled with tandem mass spectrometry) analyses on postsynaptic membrane rafts and PSDs. Our comparative analysis identified an extensive overlap of protein components in the two structures. This overlapping could be explained, at least partly, by a physical association of the two structures. Meanwhile, a significant number of proteins displayed biased distributions to either rafts or PSDs, suggesting distinct roles for the two postsynaptic specializations. Using biochemical and electron microscopic methods, we directly detected membrane raft-PSD complexes. In vitro reconstitution experiments indicated that the formation of raft-PSD complexes was not due to the artificial reconstruction of once-solubilized membrane components and PSD structures, supporting that these complexes occurred in vivo. Taking together, our results provide evidence that postsynaptic membrane rafts and PSDs may be physically associated. Such association could be important in postsynaptic signal integration, synaptic function, and maintenance. PMID:21797867
Structural Basis for Interactions Between Contactin Family Members and Protein-tyrosine Phosphatase Receptor Type G in Neural Tissues

DOE PAGES

Nikolaienko, Roman M.; Hammel, Michal; Dubreuil, Véronique; ...

2016-08-18

Protein-tyrosine phosphatase receptor type G (RPTPγ/PTPRG) interacts in vitro with contactin-3-6 (CNTN3-6), a group of glycophosphatidylinositol-anchored cell adhesion molecules involved in the wiring of the nervous system. In addition to PTPRG, CNTNs associate with multiple transmembrane proteins and signal inside the cell via cis-binding partners to alleviate the absence of an intracellular region. Here, we use comprehensive biochemical and structural analyses to demonstrate that PTPRG·CNTN3-6 complexes share similar binding affinities and a conserved arrangement. Furthermore, as a first step to identifying PTPRG·CNTN complexes in vivo, we found that PTPRG and CNTN3 associate in the outer segments of mouse rod photoreceptormore » cells. In particular, PTPRG and CNTN3 form cis-complexes at the surface of photoreceptors yet interact in trans when expressed on the surfaces of apposing cells. Further structural analyses suggest that all CNTN ectodomains adopt a bent conformation and might lie parallel to the cell surface to accommodate these cis and trans binding modes. Taken together, these studies identify a PTPRG·CNTN complex in vivo and provide novel insights into PTPRG- and CNTN-mediated signaling.« less
Bacterial flagellar capping proteins adopt diverse oligomeric states

DOE Office of Scientific and Technical Information (OSTI.GOV)

Postel, Sandra; Deredge, Daniel; Bonsor, Daniel A.

2016-09-24

Flagella are crucial for bacterial motility and pathogenesis. The flagellar capping protein (FliD) regulates filament assembly by chaperoning and sorting flagellin (FliC) proteins after they traverse the hollow filament and exit the growing flagellum tip. In the absence of FliD, flagella are not formed, resulting in impaired motility and infectivity. Here, we report the 2.2 Å resolution X-ray crystal structure of FliD fromPseudomonas aeruginosa, the first high-resolution structure of any FliD protein from any bacterium. Using this evidence in combination with a multitude of biophysical and functional analyses, we find thatPseudomonasFliD exhibits unexpected structural similarity to other flagellar proteins atmore » the domain level, adopts a unique hexameric oligomeric state, and depends on flexible determinants for oligomerization. Considering that the flagellin filaments on which FliD oligomers are affixed vary in protofilament number between bacteria, our results suggest that FliD oligomer stoichiometries vary across bacteria to complement their filament assemblies.« less
Lead discovery and in silico 3D structure modeling of tumorigenic FAM72A (p17).

PubMed

Pramanik, Subrata; Kutzner, Arne; Heese, Klaus

2015-01-01

FAM72A (p17) is a novel neuronal protein that has been linked to tumorigenic effects in non-neuronal tissue. Using state of the art in silico physicochemical analyses (e.g., I-TASSER, RaptorX, and Modeller), we determined the three-dimensional (3D) protein structure of FAM72A and further identified potential ligand-protein interactions. Our data indicate a Zn(2+)/Fe(3+)-containing 3D protein structure, based on a 3GA3_A model template, which potentially interacts with the organic molecule RSM ((2s)-2-(acetylamino)-N-methyl-4-[(R)-methylsulfinyl] butanamide). The discovery of RSM may serve as potential lead for further anti-FAM72A drug screening tests in the pharmaceutical industry because interference with FAM72A's activities via RSM-related molecules might be a novel option to influence the tumor suppressor protein p53 signaling pathways for the treatment of various types of cancers.
The mechanism of folding robustness revealed by the crystal structure of extra-superfolder GFP.

PubMed

Choi, Jae Young; Jang, Tae-Ho; Park, Hyun Ho

2017-01-01

Stability of green fluorescent protein (GFP) is sometimes important for a proper practical application of this protein. Random mutagenesis and targeted mutagenesis have been used to create better-folded variants of GFP, including recently reported extra-superfolder GFP. Our aim was to determine the crystal structure of extra-superfolder GFP, which is more robustly folded and stable than GFP and superfolder GFP. The structural and structure-based mutagenesis analyses revealed that some of the mutations that created extra-superfolder GFP (F46L, E126K, N149K, and S208L) contribute to folding robustness by stabilizing extra-superfolder GFP with various noncovalent bonds. © 2016 Federation of European Biochemical Societies.
Matching multiple rigid domain decompositions of proteins

PubMed Central

Flynn, Emily; Streinu, Ileana

2017-01-01

We describe efficient methods for consistently coloring and visualizing collections of rigid cluster decompositions obtained from variations of a protein structure, and lay the foundation for more complex setups that may involve different computational and experimental methods. The focus here is on three biological applications: the conceptually simpler problems of visualizing results of dilution and mutation analyses, and the more complex task of matching decompositions of multiple NMR models of the same protein. Implemented into the KINARI web server application, the improved visualization techniques give useful information about protein folding cores, help examining the effect of mutations on protein flexibility and function, and provide insights into the structural motions of PDB proteins solved with solution NMR. These tools have been developed with the goal of improving and validating rigidity analysis as a credible coarse-grained model capturing essential information about a protein’s slow motions near the native state. PMID:28141528
Illuminating structural proteins in viral "dark matter" with metaproteomics

DOE PAGES

Brum, Jennifer R.; Ignacio-Espinoza, J. Cesar; Kim, Eun -Hae; ...

2016-02-16

Viruses are ecologically important, yet environmental virology is limited by dominance of unannotated genomic sequences representing taxonomic and functional "viral dark matter." Although recent analytical advances are rapidly improving taxonomic annotations, identifying functional darkmatter remains problematic. Here, we apply paired metaproteomics and dsDNA-targeted metagenomics to identify 1,875 virion-associated proteins from the ocean. Over one-half of these proteins were newly functionally annotated and represent abundant and widespread viral metagenome-derived protein clusters (PCs). One primarily unannotated PC dominated the dataset, but structural modeling and genomic context identified this PC as a previously unidentified capsid protein from multiple uncultivated tailed virus families. Furthermore,more » four of the five most abundant PCs in the metaproteome represent capsid proteins containing the HK97-like protein fold previously found in many viruses that infect all three domains of life. The dominance of these proteins within our dataset, as well as their global distribution throughout the world's oceans and seas, supports prior hypotheses that this HK97-like protein fold is the most abundant biological structure on Earth. Altogether, these culture-independent analyses improve virion-associated protein annotations, facilitate the investigation of proteins within natural viral communities, and offer a high-throughput means of illuminating functional viral dark matter.« less
Illuminating structural proteins in viral "dark matter" with metaproteomics.

PubMed

Brum, Jennifer R; Ignacio-Espinoza, J Cesar; Kim, Eun-Hae; Trubl, Gareth; Jones, Robert M; Roux, Simon; VerBerkmoes, Nathan C; Rich, Virginia I; Sullivan, Matthew B

2016-03-01

Viruses are ecologically important, yet environmental virology is limited by dominance of unannotated genomic sequences representing taxonomic and functional "viral dark matter." Although recent analytical advances are rapidly improving taxonomic annotations, identifying functional dark matter remains problematic. Here, we apply paired metaproteomics and dsDNA-targeted metagenomics to identify 1,875 virion-associated proteins from the ocean. Over one-half of these proteins were newly functionally annotated and represent abundant and widespread viral metagenome-derived protein clusters (PCs). One primarily unannotated PC dominated the dataset, but structural modeling and genomic context identified this PC as a previously unidentified capsid protein from multiple uncultivated tailed virus families. Furthermore, four of the five most abundant PCs in the metaproteome represent capsid proteins containing the HK97-like protein fold previously found in many viruses that infect all three domains of life. The dominance of these proteins within our dataset, as well as their global distribution throughout the world's oceans and seas, supports prior hypotheses that this HK97-like protein fold is the most abundant biological structure on Earth. Together, these culture-independent analyses improve virion-associated protein annotations, facilitate the investigation of proteins within natural viral communities, and offer a high-throughput means of illuminating functional viral dark matter.
Illuminating structural proteins in viral “dark matter” with metaproteomics

PubMed Central

Brum, Jennifer R.; Ignacio-Espinoza, J. Cesar; Kim, Eun-Hae; Trubl, Gareth; Jones, Robert M.; Roux, Simon; VerBerkmoes, Nathan C.; Rich, Virginia I.; Sullivan, Matthew B.

2016-01-01

Viruses are ecologically important, yet environmental virology is limited by dominance of unannotated genomic sequences representing taxonomic and functional “viral dark matter.” Although recent analytical advances are rapidly improving taxonomic annotations, identifying functional dark matter remains problematic. Here, we apply paired metaproteomics and dsDNA-targeted metagenomics to identify 1,875 virion-associated proteins from the ocean. Over one-half of these proteins were newly functionally annotated and represent abundant and widespread viral metagenome-derived protein clusters (PCs). One primarily unannotated PC dominated the dataset, but structural modeling and genomic context identified this PC as a previously unidentified capsid protein from multiple uncultivated tailed virus families. Furthermore, four of the five most abundant PCs in the metaproteome represent capsid proteins containing the HK97-like protein fold previously found in many viruses that infect all three domains of life. The dominance of these proteins within our dataset, as well as their global distribution throughout the world’s oceans and seas, supports prior hypotheses that this HK97-like protein fold is the most abundant biological structure on Earth. Together, these culture-independent analyses improve virion-associated protein annotations, facilitate the investigation of proteins within natural viral communities, and offer a high-throughput means of illuminating functional viral dark matter. PMID:26884177
Characterization of a 65 kDa NIF in the nuclear matrix of the monocot Allium cepa that interacts with nuclear spectrin-like proteins.

PubMed

Pérez-Munive, Clara; Blumenthal, Sonal S D; de la Espina, Susana Moreno Díaz

2012-01-01

Plant cells have a well organized nucleus and nuclear matrix, but lack orthologues of the main structural components of the metazoan nuclear matrix. Although data is limited, most plant nuclear structural proteins are coiled-coil proteins, such as the NIFs (nuclear intermediate filaments) in Pisum sativum that cross-react with anti-intermediate filament and anti-lamin antibodies, form filaments 6-12 nm in diameter in vitro, and may play the role of lamins. We have investigated the conservation and features of NIFs in a monocot species, Allium cepa, and compared them with onion lamin-like proteins. Polyclonal antisera against the pea 65 kDa NIF were used in 1D and 2D Western blots, ICM (imunofluorescence confocal microscopy) and IEM (immunoelectron microscopy). Their presence in the nuclear matrix was analysed by differential extraction of nuclei, and their association with structural spectrin-like proteins by co-immunoprecipitation and co-localization in ICM. NIF is a conserved structural component of the nucleus and its matrix in monocots with Mr and pI values similar to those of pea 65 kDa NIF, which localized to the nuclear envelope, perichromatin domains and foci, and to the nuclear matrix, interacting directly with structural nuclear spectrin-like proteins. Its similarities with some of the proteins described as onion lamin-like proteins suggest that they are highly related or perhaps the same proteins.
Determining crystal structures through crowdsourcing and coursework

NASA Astrophysics Data System (ADS)

Horowitz, Scott; Koepnick, Brian; Martin, Raoul; Tymieniecki, Agnes; Winburn, Amanda A.; Cooper, Seth; Flatten, Jeff; Rogawski, David S.; Koropatkin, Nicole M.; Hailu, Tsinatkeab T.; Jain, Neha; Koldewey, Philipp; Ahlstrom, Logan S.; Chapman, Matthew R.; Sikkema, Andrew P.; Skiba, Meredith A.; Maloney, Finn P.; Beinlich, Felix R. M.; Caglar, Ahmet; Coral, Alan; Jensen, Alice Elizabeth; Lubow, Allen; Boitano, Amanda; Lisle, Amy Elizabeth; Maxwell, Andrew T.; Failer, Barb; Kaszubowski, Bartosz; Hrytsiv, Bohdan; Vincenzo, Brancaccio; de Melo Cruz, Breno Renan; McManus, Brian Joseph; Kestemont, Bruno; Vardeman, Carl; Comisky, Casey; Neilson, Catherine; Landers, Catherine R.; Ince, Christopher; Buske, Daniel Jon; Totonjian, Daniel; Copeland, David Marshall; Murray, David; Jagieła, Dawid; Janz, Dietmar; Wheeler, Douglas C.; Cali, Elie; Croze, Emmanuel; Rezae, Farah; Martin, Floyd Orville; Beecher, Gil; de Jong, Guido Alexander; Ykman, Guy; Feldmann, Harald; Chan, Hugo Paul Perez; Kovanecz, Istvan; Vasilchenko, Ivan; Connellan, James C.; Borman, Jami Lynne; Norrgard, Jane; Kanfer, Jebbie; Canfield, Jeffrey M.; Slone, Jesse David; Oh, Jimmy; Mitchell, Joanne; Bishop, John; Kroeger, John Douglas; Schinkler, Jonas; McLaughlin, Joseph; Brownlee, June M.; Bell, Justin; Fellbaum, Karl Willem; Harper, Kathleen; Abbey, Kirk J.; Isaksson, Lennart E.; Wei, Linda; Cummins, Lisa N.; Miller, Lori Anne; Bain, Lyn; Carpenter, Lynn; Desnouck, Maarten; Sharma, Manasa G.; Belcastro, Marcus; Szew, Martin; Szew, Martin; Britton, Matthew; Gaebel, Matthias; Power, Max; Cassidy, Michael; Pfützenreuter, Michael; Minett, Michele; Wesselingh, Michiel; Yi, Minjune; Cameron, Neil Haydn Tormey; Bolibruch, Nicholas I.; Benevides, Noah; Kathleen Kerr, Norah; Barlow, Nova; Crevits, Nykole Krystyne; Dunn, Paul; Silveira Belo Nascimento Roque, Paulo Sergio; Riber, Peter; Pikkanen, Petri; Shehzad, Raafay; Viosca, Randy; James Fraser, Robert; Leduc, Robert; Madala, Roman; Shnider, Scott; de Boisblanc, Sharon; Butkovich, Slava; Bliven, Spencer; Hettler, Stephen; Telehany, Stephen; Schwegmann, Steven A.; Parkes, Steven; Kleinfelter, Susan C.; Michael Holst, Sven; van der Laan, T. J. A.; Bausewein, Thomas; Simon, Vera; Pulley, Warwick; Hull, William; Kim, Annes Yukyung; Lawton, Alexis; Ruesch, Amanda; Sundar, Anjali; Lawrence, Anna-Lisa; Afrin, Antara; Maheshwer, Bhargavi; Turfe, Bilal; Huebner, Christian; Killeen, Courtney Elizabeth; Antebi-Lerrman, Dalia; Luan, Danny; Wolfe, Derek; Pham, Duc; Michewicz, Elaina; Hull, Elizabeth; Pardington, Emily; Galal, Galal Osama; Sun, Grace; Chen, Grace; Anderson, Halie E.; Chang, Jane; Hewlett, Jeffrey Thomas; Sterbenz, Jennifer; Lim, Jiho; Morof, Joshua; Lee, Junho; Inn, Juyoung Samuel; Hahm, Kaitlin; Roth, Kaitlin; Nair, Karun; Markin, Katherine; Schramm, Katie; Toni Eid, Kevin; Gam, Kristina; Murphy, Lisha; Yuan, Lucy; Kana, Lulia; Daboul, Lynn; Shammas, Mario Karam; Chason, Max; Sinan, Moaz; Andrew Tooley, Nicholas; Korakavi, Nisha; Comer, Patrick; Magur, Pragya; Savliwala, Quresh; Davison, Reid Michael; Sankaran, Roshun Rajiv; Lewe, Sam; Tamkus, Saule; Chen, Shirley; Harvey, Sho; Hwang, Sin Ye; Vatsia, Sohrab; Withrow, Stefan; Luther, Tahra K.; Manett, Taylor; Johnson, Thomas James; Ryan Brash, Timothy; Kuhlman, Wyatt; Park, Yeonjung; Popović, Zoran; Baker, David; Khatib, Firas; Bardwell, James C. A.

2016-09-01

We show here that computer game players can build high-quality crystal structures. Introduction of a new feature into the computer game Foldit allows players to build and real-space refine structures into electron density maps. To assess the usefulness of this feature, we held a crystallographic model-building competition between trained crystallographers, undergraduate students, Foldit players and automatic model-building algorithms. After removal of disordered residues, a team of Foldit players achieved the most accurate structure. Analysing the target protein of the competition, YPL067C, uncovered a new family of histidine triad proteins apparently involved in the prevention of amyloid toxicity. From this study, we conclude that crystallographers can utilize crowdsourcing to interpret electron density information and to produce structure solutions of the highest quality.
Determining crystal structures through crowdsourcing and coursework.

PubMed

Horowitz, Scott; Koepnick, Brian; Martin, Raoul; Tymieniecki, Agnes; Winburn, Amanda A; Cooper, Seth; Flatten, Jeff; Rogawski, David S; Koropatkin, Nicole M; Hailu, Tsinatkeab T; Jain, Neha; Koldewey, Philipp; Ahlstrom, Logan S; Chapman, Matthew R; Sikkema, Andrew P; Skiba, Meredith A; Maloney, Finn P; Beinlich, Felix R M; Popović, Zoran; Baker, David; Khatib, Firas; Bardwell, James C A

2016-09-16

We show here that computer game players can build high-quality crystal structures. Introduction of a new feature into the computer game Foldit allows players to build and real-space refine structures into electron density maps. To assess the usefulness of this feature, we held a crystallographic model-building competition between trained crystallographers, undergraduate students, Foldit players and automatic model-building algorithms. After removal of disordered residues, a team of Foldit players achieved the most accurate structure. Analysing the target protein of the competition, YPL067C, uncovered a new family of histidine triad proteins apparently involved in the prevention of amyloid toxicity. From this study, we conclude that crystallographers can utilize crowdsourcing to interpret electron density information and to produce structure solutions of the highest quality.
Toward a blueprint for UDP-glucose pyrophosphorylase structure/function properties: homology-modeling analyses.

PubMed

Geisler, Matt; Wilczynska, Malgorzata; Karpinski, Stanislaw; Kleczkowski, Leszek A

2004-11-01

UDP-glucose pyrophosphorylase (UGPase) is an important enzyme of synthesis of sucrose, cellulose, and several other polysaccharides in all plants. The protein is evolutionarily conserved among eukaryotes, but has little relation, aside from its catalytic reaction, to UGPases of prokaryotic origin. Using protein homology modeling strategy, 3D structures for barley, poplar, and Arabidopsis UGPases have been derived, based on recently published crystal structure of human UDP-N-acetylglucosamine pyrophosphorylase. The derived 3D structures correspond to a bowl-shaped protein with the active site at a central groove, and a C-terminal domain that includes a loop (I-loop) possibly involved in dimerization. Data on a plethora of earlier described UGPase mutants from a variety of eukaryotic organisms have been revisited, and we have, in most cases, verified the role of each mutation in enzyme catalysis/regulation/structural integrity. We have also found that one of two alternatively spliced forms of poplar UGPase has a very short I-loop, suggesting differences in oligomerization ability of the two isozymes. The derivation of the structural model for plant UGPase should serve as a useful blueprint for further function/structure studies on this protein.
Structural studies of the Sputnik virophage.

PubMed

Sun, Siyang; La Scola, Bernard; Bowman, Valorie D; Ryan, Christopher M; Whitelegge, Julian P; Raoult, Didier; Rossmann, Michael G

2010-01-01

The virophage Sputnik is a satellite virus of the giant mimivirus and is the only satellite virus reported to date whose propagation adversely affects its host virus' production. Genome sequence analysis showed that Sputnik has genes related to viruses infecting all three domains of life. Here, we report structural studies of Sputnik, which show that it is about 740 A in diameter, has a T=27 icosahedral capsid, and has a lipid membrane inside the protein shell. Structural analyses suggest that the major capsid protein of Sputnik is likely to have a double jelly-roll fold, although sequence alignments do not show any detectable similarity with other viral double jelly-roll capsid proteins. Hence, the origin of Sputnik's capsid might have been derived from other viruses prior to its association with mimivirus.
Structural Studies of the Sputnik Virophage▿

PubMed Central

Sun, Siyang; La Scola, Bernard; Bowman, Valorie D.; Ryan, Christopher M.; Whitelegge, Julian P.; Raoult, Didier; Rossmann, Michael G.

2010-01-01

The virophage Sputnik is a satellite virus of the giant mimivirus and is the only satellite virus reported to date whose propagation adversely affects its host virus' production. Genome sequence analysis showed that Sputnik has genes related to viruses infecting all three domains of life. Here, we report structural studies of Sputnik, which show that it is about 740 Å in diameter, has a T=27 icosahedral capsid, and has a lipid membrane inside the protein shell. Structural analyses suggest that the major capsid protein of Sputnik is likely to have a double jelly-roll fold, although sequence alignments do not show any detectable similarity with other viral double jelly-roll capsid proteins. Hence, the origin of Sputnik's capsid might have been derived from other viruses prior to its association with mimivirus. PMID:19889775

Structure and dimerization of the catalytic domain of the protein phosphatase Cdc14p, a key regulator of mitotic exit in Saccharomyces cerevisiae.

PubMed

Kobayashi, Junya; Matsuura, Yoshiyuki

2017-10-01

In the budding yeast Saccharomyces cerevisiae, the protein phosphatase Cdc14p orchestrates various events essential for mitotic exit. We have determined the X-ray crystal structures at 1.85 Å resolution of the catalytic domain of Cdc14p in both the apo state, and as a complex with S160-phosphorylated Swi6p peptide. Each asymmetric unit contains two Cdc14p chains arranged in an intimately associated homodimer, consistent with its oligomeric state in solution. The dimerization interface is located on the backside of the substrate-binding cleft. Structure-based mutational analyses indicate that the dimerization of Cdc14p is required for normal growth of yeast cells. © 2017 The Protein Society.
The JCSG high-throughput structural biology pipeline.

PubMed

Elsliger, Marc André; Deacon, Ashley M; Godzik, Adam; Lesley, Scott A; Wooley, John; Wüthrich, Kurt; Wilson, Ian A

2010-10-01

The Joint Center for Structural Genomics high-throughput structural biology pipeline has delivered more than 1000 structures to the community over the past ten years. The JCSG has made a significant contribution to the overall goal of the NIH Protein Structure Initiative (PSI) of expanding structural coverage of the protein universe, as well as making substantial inroads into structural coverage of an entire organism. Targets are processed through an extensive combination of bioinformatics and biophysical analyses to efficiently characterize and optimize each target prior to selection for structure determination. The pipeline uses parallel processing methods at almost every step in the process and can adapt to a wide range of protein targets from bacterial to human. The construction, expansion and optimization of the JCSG gene-to-structure pipeline over the years have resulted in many technological and methodological advances and developments. The vast number of targets and the enormous amounts of associated data processed through the multiple stages of the experimental pipeline required the development of variety of valuable resources that, wherever feasible, have been converted to free-access web-based tools and applications.
Crystal structure of the toxin Msmeg_6760, the structural homolog of Mycobacterium tuberculosis Rv2035, a novel type II toxin involved in the hypoxic response

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bajaj, R. Alexandra; Arbing, Mark A.; Shin, Annie

The structure of Msmeg_6760, a protein of unknown function, has been determined. Biochemical and bioinformatics analyses determined that Msmeg_6760 interacts with a protein encoded in the same operon, Msmeg_6762, and predicted that the operon is a toxin–antitoxin (TA) system. Structural comparison of Msmeg_6760 with proteins of known function suggests that Msmeg_6760 binds a hydrophobic ligand in a buried cavity lined by large hydrophobic residues. Access to this cavity could be controlled by a gate–latch mechanism. The function of the Msmeg_6760 toxin is unknown, but structure-based predictions revealed that Msmeg_6760 and Msmeg_6762 are homologous to Rv2034 and Rv2035, a predicted novelmore » TA system involved inMycobacterium tuberculosislatency during macrophage infection. The Msmeg_6760 toxin fold has not been previously described for bacterial toxins and its unique structural features suggest that toxin activation is likely to be mediated by a novel mechanism.« less
Exploring Fold Space Preferences of New-born and Ancient Protein Superfamilies

PubMed Central

Edwards, Hannah; Abeln, Sanne; Deane, Charlotte M.

2013-01-01

The evolution of proteins is one of the fundamental processes that has delivered the diversity and complexity of life we see around ourselves today. While we tend to define protein evolution in terms of sequence level mutations, insertions and deletions, it is hard to translate these processes to a more complete picture incorporating a polypeptide's structure and function. By considering how protein structures change over time we can gain an entirely new appreciation of their long-term evolutionary dynamics. In this work we seek to identify how populations of proteins at different stages of evolution explore their possible structure space. We use an annotation of superfamily age to this space and explore the relationship between these ages and a diverse set of properties pertaining to a superfamily's sequence, structure and function. We note several marked differences between the populations of newly evolved and ancient structures, such as in their length distributions, secondary structure content and tertiary packing arrangements. In particular, many of these differences suggest a less elaborate structure for newly evolved superfamilies when compared with their ancient counterparts. We show that the structural preferences we report are not a residual effect of a more fundamental relationship with function. Furthermore, we demonstrate the robustness of our results, using significant variation in the algorithm used to estimate the ages. We present these age estimates as a useful tool to analyse protein populations. In particularly, we apply this in a comparison of domains containing greek key or jelly roll motifs. PMID:24244135
Genetics of PCOS: A systematic bioinformatics approach to unveil the proteins responsible for PCOS.

PubMed

Panda, Pritam Kumar; Rane, Riya; Ravichandran, Rahul; Singh, Shrinkhla; Panchal, Hetalkumar

2016-06-01

Polycystic ovary syndrome (PCOS) is a hormonal imbalance in women, which causes problems during menstrual cycle and in pregnancy that sometimes results in fatality. Though the genetics of PCOS is not fully understood, early diagnosis and treatment can prevent long-term effects. In this study, we have studied the proteins involved in PCOS and the structural aspects of the proteins that are taken into consideration using computational tools. The proteins involved are modeled using Modeller 9v14 and Ab-initio programs. All the 43 proteins responsible for PCOS were subjected to phylogenetic analysis to identify the relatedness of the proteins. Further, microarray data analysis of PCOS datasets was analyzed that was downloaded from GEO datasets to find the significant protein-coding genes responsible for PCOS, which is an addition to the reported protein-coding genes. Various statistical analyses were done using R programming to get an insight into the structural aspects of PCOS that can be used as drug targets to treat PCOS and other related reproductive diseases.
The effect of high pressure on the functional properties of pork myofibrillar proteins.

PubMed

Grossi, Alberto; Olsen, Karsten; Bolumar, Tomas; Rinnan, Åsmund; Øgendal, Lars H; Orlien, Vibeke

2016-04-01

Complementary methodologies were used to analyse the pressure-induced modification and functionality of myofibrillar proteins from pork meat pressurised at 200, 400, 600, or 800 MPa (10 min, 5 or 20 °C). Pressure at 400 MPa was found to be the threshold for loss of solubility, and the structural proteins, myosin and actin, lost their native solubility due to aggregation. The results from the extraction of proteins with different reagents targeting the disruption of specific molecular interactions suggested that pressure-induced aggregation was caused mainly by hydrogen bonding during pressurisation and not hydrophobic interactions nor disulphide cross-links. Furthermore, the soluble proteins were exposed to remarkable structural changes already at 200 MPa and lost their native functionality. The modification of the proteins in pressurised meat affected the water binding sites of the myofibrillar proteins and, thereby, the interactions between proteins and water molecules, and distribution between myofibrillar and extra-myofibrillar compartments. Copyright © 2015 Elsevier Ltd. All rights reserved.
PDB to AMPL Conversion

DOE Office of Scientific and Technical Information (OSTI.GOV)

Anna Johnston, SNL 9215

2002-09-01

PDB to AMPL Conversion was written to convert protein data base files to AMPL files. The protein data bases on the internet contain a wealth of information about the structue and makeup of proteins. Each file contains information derived by one or more experiments and contains information on how the experiment waw performed, the amino acid building blocks of each chain, and often the three-dimensional structure of the protein extracted from the experiments. The way a protein folds determines much about its function. Thus, studying the three-dimensional structure of the protein is of great interest. Analysing the contact maps ismore » one way to examine the structure. A contact map is a graph which has a linear back bone of amino acids for nodes (i.e., adjacent amino acids are always connected) and vertices between non-adjacent nodes if they are close enough to be considered in contact. If the graphs are similar then the folds of the protein and their function should also be similar. This software extracts the contact maps from a protein data base file and puts in into AMPL data format. This format is designed for use in AMPL, a programming language for simplifying linear programming formulations.« less
JET2 Viewer: a database of predicted multiple, possibly overlapping, protein-protein interaction sites for PDB structures.

PubMed

Ripoche, Hugues; Laine, Elodie; Ceres, Nicoletta; Carbone, Alessandra

2017-01-04

The database JET2 Viewer, openly accessible at http://www.jet2viewer.upmc.fr/, reports putative protein binding sites for all three-dimensional (3D) structures available in the Protein Data Bank (PDB). This knowledge base was generated by applying the computational method JET 2 at large-scale on more than 20 000 chains. JET 2 strategy yields very precise predictions of interacting surfaces and unravels their evolutionary process and complexity. JET2 Viewer provides an online intelligent display, including interactive 3D visualization of the binding sites mapped onto PDB structures and suitable files recording JET 2 analyses. Predictions were evaluated on more than 15 000 experimentally characterized protein interfaces. This is, to our knowledge, the largest evaluation of a protein binding site prediction method. The overall performance of JET 2 on all interfaces are: Sen = 52.52, PPV = 51.24, Spe = 80.05, Acc = 75.89. The data can be used to foster new strategies for protein-protein interactions modulation and interaction surface redesign. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Sequence and structural analyses of nuclear export signals in the NESdb database

PubMed Central

Xu, Darui; Farmer, Alicia; Collett, Garen; Grishin, Nick V.; Chook, Yuh Min

2012-01-01

We compiled >200 nuclear export signal (NES)–containing CRM1 cargoes in a database named NESdb. We analyzed the sequences and three-dimensional structures of natural, experimentally identified NESs and of false-positive NESs that were generated from the database in order to identify properties that might distinguish the two groups of sequences. Analyses of amino acid frequencies, sequence logos, and agreement with existing NES consensus sequences revealed strong preferences for the Φ1-X3-Φ2-X2-Φ3-X-Φ4 pattern and for negatively charged amino acids in the nonhydrophobic positions of experimentally identified NESs but not of false positives. Strong preferences against certain hydrophobic amino acids in the hydrophobic positions were also revealed. These findings led to a new and more precise NES consensus. More important, three-dimensional structures are now available for 68 NESs within 56 different cargo proteins. Analyses of these structures showed that experimentally identified NESs are more likely than the false positives to adopt α-helical conformations that transition to loops at their C-termini and more likely to be surface accessible within their protein domains or be present in disordered or unobserved parts of the structures. Such distinguishing features for real NESs might be useful in future NES prediction efforts. Finally, we also tested CRM1-binding of 40 NESs that were found in the 56 structures. We found that 16 of the NES peptides did not bind CRM1, hence illustrating how NESs are easily misidentified. PMID:22833565
Synchrotron Radiation Circular Dichroism (SRCD) Spectroscopy - An Enhanced Method for Examining Protein Conformations and Protein Interactions

DOE Office of Scientific and Technical Information (OSTI.GOV)

B Wallace; R Janes

CD (circular dichroism) spectroscopy is a well-established technique in structural biology. SRCD (synchrotron radiation circular dichroism) spectroscopy extends the utility and applications of conventional CD spectroscopy (using laboratory-based instruments) because the high flux of a synchrotron enables collection of data at lower wavelengths (resulting in higher information content), detection of spectra with higher signal-to-noise levels and measurements in the presence of absorbing components (buffers, salts, lipids and detergents). SRCD spectroscopy can provide important static and dynamic structural information on proteins in solution, including secondary structures of intact proteins and their domains, protein stability, the differences between wild-type and mutant proteins,more » the identification of natively disordered regions in proteins, and the dynamic processes of protein folding and membrane insertion and the kinetics of enzyme reactions. It has also been used to effectively study protein interactions, including protein-protein complex formation involving either induced-fit or rigid-body mechanisms, and protein-lipid complexes. A new web-based bioinformatics resource, the Protein Circular Dichroism Data Bank (PCDDB), has been created which enables archiving, access and analyses of CD and SRCD spectra and supporting metadata, now making this information publicly available. To summarize, the developing method of SRCD spectroscopy has the potential for playing an important role in new types of studies of protein conformations and their complexes.« less
Enriching the annotation of Mycobacterium tuberculosis H37Rv proteome using remote homology detection approaches: insights into structure and function.

PubMed

Ramakrishnan, Gayatri; Ochoa-Montaño, Bernardo; Raghavender, Upadhyayula S; Mudgal, Richa; Joshi, Adwait G; Chandra, Nagasuma R; Sowdhamini, Ramanathan; Blundell, Tom L; Srinivasan, Narayanaswamy

2015-01-01

The availability of the genome sequence of Mycobacterium tuberculosis H37Rv has encouraged determination of large numbers of protein structures and detailed definition of the biological information encoded therein; yet, the functions of many proteins in M. tuberculosis remain unknown. The emergence of multidrug resistant strains makes it a priority to exploit recent advances in homology recognition and structure prediction to re-analyse its gene products. Here we report the structural and functional characterization of gene products encoded in the M. tuberculosis genome, with the help of sensitive profile-based remote homology search and fold recognition algorithms resulting in an enhanced annotation of the proteome where 95% of the M. tuberculosis proteins were identified wholly or partly with information on structure or function. New information includes association of 244 proteins with 205 domain families and a separate set of new association of folds to 64 proteins. Extending structural information across uncharacterized protein families represented in the M. tuberculosis proteome, by determining superfamily relationships between families of known and unknown structures, has contributed to an enhancement in the knowledge of structural content. In retrospect, such superfamily relationships have facilitated recognition of probable structure and/or function for several uncharacterized protein families, eventually aiding recognition of probable functions for homologous proteins corresponding to such families. Gene products unique to mycobacteria for which no functions could be identified are 183. Of these 18 were determined to be M. tuberculosis specific. Such pathogen-specific proteins are speculated to harbour virulence factors required for pathogenesis. A re-annotated proteome of M. tuberculosis, with greater completeness of annotated proteins and domain assigned regions, provides a valuable basis for experimental endeavours designed to obtain a better understanding of pathogenesis and to accelerate the process of drug target discovery. Copyright © 2014 Elsevier Ltd. All rights reserved.
Genome Pool Strategy for Structural Coverage of Protein Families

PubMed Central

Jaroszewski, Lukasz; Slabinski, Lukasz; Wooley, John; Deacon, Ashley M.; Lesley, Scott A.; Wilson, Ian. A.; Godzik, Adam

2010-01-01

As noticed by generations of structural biologists, closely homologous proteins may have substantially different crystallization properties and propensities. These observations can be used to systematically introduce additional dimensionality into crystallization trials by targeting homologous proteins from multiple genomes in a “genome pool” strategy. Through extensive use of our recently introduced “crystallization feasibility score” (Slabinski et al., 2007a), we can explain that the genome pool strategy works well because the crystallization feasibility scores are surprisingly broad within families of homologous proteins, with most families containing a range of optimal to very difficult targets. We also show that some families can be regarded as relatively “easy”, where a significant number of proteins are predicted to have optimal crystallization features, and others are “very difficult”, where almost none are predicted to result in a crystal structure. Thus, the outcome of such variable distributions of such crystallizability' preferences leads to uneven structural coverage of known families, with “easier” or “optimal” families having several times more solved structures than “very difficult” ones. Nevertheless, this latter category can be successfully targeted by increasing the number of genomes that are used to select targets from a given family. On average, adding 10 new genomes to the “genome pool” provides more promising targets for 7 “very difficult” families. In contrast, our crystallization feasibility score does not indicate that any specific microbial genomes can be readily classified as “easier” or “very difficult” with respect to providing suitable candidates for crystallization and structure determination. Finally, our analyses show that specific physicochemical properties of the protein sequence favor successful outcomes for structure determination and, hence, the group of proteins with known 3D structures is systematically different from the general pool of known proteins. We, therefore, assess the structural consequences of these differences in protein sequence and protein biophysical properties. PMID:19000818
Elastin: a representative ideal protein elastomer.

PubMed Central

Urry, D W; Hugel, T; Seitz, M; Gaub, H E; Sheiba, L; Dea, J; Xu, J; Parker, T

2002-01-01

During the last half century, identification of an ideal (predominantly entropic) protein elastomer was generally thought to require that the ideal protein elastomer be a random chain network. Here, we report two new sets of data and review previous data. The first set of new data utilizes atomic force microscopy to report single-chain force-extension curves for (GVGVP)(251) and (GVGIP)(260), and provides evidence for single-chain ideal elasticity. The second class of new data provides a direct contrast between low-frequency sound absorption (0.1-10 kHz) exhibited by random-chain network elastomers and by elastin protein-based polymers. Earlier composition, dielectric relaxation (1-1000 MHz), thermoelasticity, molecular mechanics and dynamics calculations and thermodynamic and statistical mechanical analyses are presented, that combine with the new data to contrast with random-chain network rubbers and to detail the presence of regular non-random structural elements of the elastin-based systems that lose entropic elastomeric force upon thermal denaturation. The data and analyses affirm an earlier contrary argument that components of elastin, the elastic protein of the mammalian elastic fibre, and purified elastin fibre itself contain dynamic, non-random, regularly repeating structures that exhibit dominantly entropic elasticity by means of a damping of internal chain dynamics on extension. PMID:11911774
Query3d: a new method for high-throughput analysis of functional residues in protein structures.

PubMed

Ausiello, Gabriele; Via, Allegra; Helmer-Citterich, Manuela

2005-12-01

The identification of local similarities between two protein structures can provide clues of a common function. Many different methods exist for searching for similar subsets of residues in proteins of known structure. However, the lack of functional and structural information on single residues, together with the low level of integration of this information in comparison methods, is a limitation that prevents these methods from being fully exploited in high-throughput analyses. Here we describe Query3d, a program that is both a structural DBMS (Database Management System) and a local comparison method. The method conserves a copy of all the residues of the Protein Data Bank annotated with a variety of functional and structural information. New annotations can be easily added from a variety of methods and known databases. The algorithm makes it possible to create complex queries based on the residues' function and then to compare only subsets of the selected residues. Functional information is also essential to speed up the comparison and the analysis of the results. With Query3d, users can easily obtain statistics on how many and which residues share certain properties in all proteins of known structure. At the same time, the method also finds their structural neighbours in the whole PDB. Programs and data can be accessed through the PdbFun web interface.
Query3d: a new method for high-throughput analysis of functional residues in protein structures

PubMed Central

Ausiello, Gabriele; Via, Allegra; Helmer-Citterich, Manuela

2005-01-01

Background The identification of local similarities between two protein structures can provide clues of a common function. Many different methods exist for searching for similar subsets of residues in proteins of known structure. However, the lack of functional and structural information on single residues, together with the low level of integration of this information in comparison methods, is a limitation that prevents these methods from being fully exploited in high-throughput analyses. Results Here we describe Query3d, a program that is both a structural DBMS (Database Management System) and a local comparison method. The method conserves a copy of all the residues of the Protein Data Bank annotated with a variety of functional and structural information. New annotations can be easily added from a variety of methods and known databases. The algorithm makes it possible to create complex queries based on the residues' function and then to compare only subsets of the selected residues. Functional information is also essential to speed up the comparison and the analysis of the results. Conclusion With Query3d, users can easily obtain statistics on how many and which residues share certain properties in all proteins of known structure. At the same time, the method also finds their structural neighbours in the whole PDB. Programs and data can be accessed through the PdbFun web interface. PMID:16351754
Data on publications, structural analyses, and queries used to build and utilize the AlloRep database.

PubMed

Sousa, Filipa L; Parente, Daniel J; Hessman, Jacob A; Chazelle, Allen; Teichmann, Sarah A; Swint-Kruse, Liskin

2016-09-01

The AlloRep database (www.AlloRep.org) (Sousa et al., 2016) [1] compiles extensive sequence, mutagenesis, and structural information for the LacI/GalR family of transcription regulators. Sequence alignments are presented for >3000 proteins in 45 paralog subfamilies and as a subsampled alignment of the whole family. Phenotypic and biochemical data on almost 6000 mutants have been compiled from an exhaustive search of the literature; citations for these data are included herein. These data include information about oligomerization state, stability, DNA binding and allosteric regulation. Protein structural data for 65 proteins are presented as easily-accessible, residue-contact networks. Finally, this article includes example queries to enable the use of the AlloRep database. See the related article, "AlloRep: a repository of sequence, structural and mutagenesis data for the LacI/GalR transcription regulators" (Sousa et al., 2016) [1].
The interactome of CCT complex - A computational analysis.

PubMed

Narayanan, Aswathy; Pullepu, Dileep; Kabir, M Anaul

2016-10-01

The eukaryotic chaperonin, CCT (Chaperonin Containing TCP1 or TriC-TCP-1 Ring Complex) has been subjected to physical and genetic analyses in S. cerevisiae which can be extrapolated to human CCT (hCCT), owing to its structural and functional similarities with yeast CCT (yCCT). Studies on hCCT and its interactome acquire an additional dimension, as it has been implicated in several disease conditions like neurodegeneration and cancer. We attempt to study its stress response role in general, which will be reflected in the aspects of human diseases and yeast physiology, through computational analysis of the interactome. Towards consolidating and analysing the interactome data, we prepared and compared the unique CCT-interacting protein lists for S. cerevisiae and H. sapiens, performed GO term classification and enrichment studies which provide information on the diversity in CCT interactome, in terms of protein classes in the data set. Enrichment with disease-associated proteins and pathways highlight the medical importance of CCT. Different analyses converge, suggesting the significance of WD-repeat proteins, protein kinases and cytoskeletal proteins in the interactome. The prevalence of proteasomal subunits and ribosomal proteins suggest a possible cross-talk between protein-synthesis, folding and degradation machinery. A network of chaperones and chaperonins that function in combination can also be envisaged from the CCT interactome-Hsp70 interactome analysis. Copyright © 2016 Elsevier Ltd. All rights reserved.
Decoding Structural Properties of a Partially Unfolded Protein Substrate: En Route to Chaperone Binding.

PubMed

Nagpal, Suhani; Tiwari, Satyam; Mapa, Koyeli; Thukral, Lipi

2015-01-01

Many proteins comprising of complex topologies require molecular chaperones to achieve their unique three-dimensional folded structure. The E.coli chaperone, GroEL binds with a large number of unfolded and partially folded proteins, to facilitate proper folding and prevent misfolding and aggregation. Although the major structural components of GroEL are well defined, scaffolds of the non-native substrates that determine chaperone-mediated folding have been difficult to recognize. Here we performed all-atomistic and replica-exchange molecular dynamics simulations to dissect non-native ensemble of an obligate GroEL folder, DapA. Thermodynamics analyses of unfolding simulations revealed populated intermediates with distinct structural characteristics. We found that surface exposed hydrophobic patches are significantly increased, primarily contributed from native and non-native β-sheet elements. We validate the structural properties of these conformers using experimental data, including circular dichroism (CD), 1-anilinonaphthalene-8-sulfonic acid (ANS) binding measurements and previously reported hydrogen-deutrium exchange coupled to mass spectrometry (HDX-MS). Further, we constructed network graphs to elucidate long-range intra-protein connectivity of native and intermediate topologies, demonstrating regions that serve as central "hubs". Overall, our results implicate that genomic variations (or mutations) in the distinct regions of protein structures might disrupt these topological signatures disabling chaperone-mediated folding, leading to formation of aggregates.
Deciphering RNA-Recognition Patterns of Intrinsically Disordered Proteins.

PubMed

Srivastava, Ambuj; Ahmad, Shandar; Gromiha, M Michael

2018-05-29

Intrinsically disordered regions (IDRs) and protein (IDPs) are highly flexible owing to their lack of well-defined structures. A subset of such proteins interacts with various substrates; including RNA; frequently adopting regular structures in the final complex. In this work; we have analysed a dataset of protein⁻RNA complexes undergoing disorder-to-order transition (DOT) upon binding. We found that DOT regions are generally small in size (less than 3 residues) for RNA binding proteins. Like structured proteins; positively charged residues are found to interact with RNA molecules; indicating the dominance of electrostatic and cation-π interactions. However, a comparison of binding frequency shows that interface hydrophobic and aromatic residues have more interactions in only DOT regions than in a protein. Further; DOT regions have significantly higher exposure to water than their structured counterparts. Interactions of DOT regions with RNA increase the sheet formation with minor changes in helix forming residues. We have computed the interaction energy for amino acids⁻nucleotide pairs; which showed the preference of His⁻G; Asn⁻U and Ser⁻U at for the interface of DOT regions. This study provides insights to understand protein⁻RNA interactions and the results could also be used for developing a tool for identifying DOT regions in RNA binding proteins.
Evolution and Structural Analyses of Glossina morsitans (Diptera; Glossinidae) Tetraspanins

PubMed Central

Murungi, Edwin K.; Kariithi, Henry M.; Adunga, Vincent; Obonyo, Meshack; Christoffels, Alan

2014-01-01

Tetraspanins are important conserved integral membrane proteins expressed in many organisms. Although there is limited knowledge about the full repertoire, evolution and structural characteristics of individual members in various organisms, data obtained so far show that tetraspanins play major roles in membrane biology, visual processing, memory, olfactory signal processing, and mechanosensory antennal inputs. Thus, these proteins are potential targets for control of insect pests. Here, we report that the genome of the tsetse fly, Glossina morsitans (Diptera: Glossinidae) encodes at least seventeen tetraspanins (GmTsps), all containing the signature features found in the tetraspanin superfamily members. Whereas six of the GmTsps have been previously reported, eleven could be classified as novel because their amino acid sequences do not map to characterized tetraspanins in the available protein data bases. We present a model of the GmTsps by using GmTsp42Ed, whose presence and expression has been recently detected by transcriptomics and proteomics analyses of G. morsitans. Phylogenetically, the identified GmTsps segregate into three major clusters. Structurally, the GmTsps are largely similar to vertebrate tetraspanins. In view of the exploitation of tetraspanins by organisms for survival, these proteins could be targeted using specific antibodies, recombinant large extracellular loop (LEL) domains, small-molecule mimetics and siRNAs as potential novel and efficacious putative targets to combat African trypanosomiasis by killing the tsetse fly vector. PMID:26462947

Structure and mechanism of the phage T4 recombination mediator protein UvsY

DOE PAGES

Gajewski, Stefan; Waddell, Michael Brett; Vaithiyalingam, Sivaraja; ...

2016-03-07

The UvsY recombination mediator protein is critical for efficient homologous recombination in bacteriophage T4 and is the functional analog of the eukaryotic Rad52 protein. During T4 homologous recombination, the UvsX recombinase has to compete with the prebound gp32 single-stranded binding protein for DNA-binding sites and UvsY stimulates this filament nucleation event. We report here the crystal structure of UvsY in four similar open-barrel heptameric assemblies and provide structural and biophysical insights into its function. The UvsY heptamer was confirmed in solution by centrifugation and light scattering, and thermodynamic analyses revealed that the UvsY–ssDNA interaction occurs within the assembly via twomore » distinct binding modes. Using surface plasmon resonance, we also examined the binding of UvsY to both ssDNA and the ssDNA–gp32 complex. These analyses confirmed that ssDNA can bind UvsY and gp32 independently and also as a ternary complex. They also showed that residues located on the rim of the heptamer are required for optimal binding to ssDNA, thus identifying the putative ssDNA-binding surface. We propose a model in which UvsY promotes a helical ssDNA conformation that disfavors the binding of gp32 and initiates the assembly of the ssDNA–UvsX filament.« less
StralSV: assessment of sequence variability within similar 3D structures and application to polio RNA-dependent RNA polymerase

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zemla, A; Lang, D; Kostova, T

2010-11-29

Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory - still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could overcome these difficulties and facilitatemore » the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV, a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus and demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique or that shared structural similarity with structures that are distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein. StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected residues at a given sequence position.« less
MEICPS: substitution mutations to engineer intracellular protein stability.

PubMed

Reddy, B V; Ramesh, P; Tiwari, S

1998-01-01

In MEICPS, results from earlier analyses are utilized to suggest possible substitution point mutations to engineer intracellular stability using a given sequence or structure of the protein. From bvbreddy@ccmb.ap.nic.in. This program needs data from other software, PSA and SSTRUC, available from sali@tamika.rockefeller.edu and tom@cryst.bioc.cam.ac.uk, respectively. bvbreddy@ccmb.ap.nic.in
A traditional evolutionary history of foot-and-mouth disease viruses in Southeast Asia challenged by analyses of non-structural protein coding sequences

USDA-ARS?s Scientific Manuscript database

Molecular epidemiology and evolution of foot-and-mouth disease virus (FMDV) are widely studied using genomic sequences encoding VP1, the capsid protein containing the most relevant antigenic domains. Although sequencing of the full viral genome is not used as a routine diagnostic or surveillance too...
Sequence analyses reveal that a TPR-DP module, surrounded by recombinable flanking introns, could be at the origin of eukaryotic Hop and Hip TPR-DP domains and prokaryotic GerD proteins.

PubMed

Hernández Torres, Jorge; Papandreou, Nikolaos; Chomilier, Jacques

2009-05-01

The co-chaperone Hop [heat shock protein (HSP) organising protein] is known to bind both Hsp70 and Hsp90. Hop comprises three repeats of a tetratricopeptide repeat (TPR) domain, each consisting of three TPR motifs. The first and last TPR domains are followed by a domain containing several dipeptide (DP) repeats called the DP domain. These analyses suggest that the hop genes result from successive recombination events of an ancestral TPR-DP module. From a hydrophobic cluster analysis of homologous Hop protein sequences derived from gene families, we can postulate that shifts in the open reading frames are at the origin of the present sequences. Moreover, these shifts can be related to the presence or absence of biological function. We propose to extend the family of Hop co-chaperons into the kingdom of bacteria, as several structurally related genes have been identified by hydrophobic cluster analysis. We also provide evidence of common structural characteristics between hop and hip genes, suggesting a shared precursor of ancestral TPR-DP domains.
Structural genomics analysis of uncharacterized protein families overrepresented in human gut bacteria identifies a novel glycoside hydrolase

PubMed Central

2014-01-01

Background Bacteroides spp. form a significant part of our gut microbiome and are well known for optimized metabolism of diverse polysaccharides. Initial analysis of the archetypal Bacteroides thetaiotaomicron genome identified 172 glycosyl hydrolases and a large number of uncharacterized proteins associated with polysaccharide metabolism. Results BT_1012 from Bacteroides thetaiotaomicron VPI-5482 is a protein of unknown function and a member of a large protein family consisting entirely of uncharacterized proteins. Initial sequence analysis predicted that this protein has two domains, one on the N- and one on the C-terminal. A PSI-BLAST search found over 150 full length and over 90 half size homologs consisting only of the N-terminal domain. The experimentally determined three-dimensional structure of the BT_1012 protein confirms its two-domain architecture and structural analysis of both domains suggests their specific functions. The N-terminal domain is a putative catalytic domain with significant similarity to known glycoside hydrolases, the C-terminal domain has a beta-sandwich fold typically found in C-terminal domains of other glycosyl hydrolases, however these domains are typically involved in substrate binding. We describe the structure of the BT_1012 protein and discuss its sequence-structure relationship and their possible functional implications. Conclusions Structural and sequence analyses of the BT_1012 protein identifies it as a glycosyl hydrolase, expanding an already impressive catalog of enzymes involved in polysaccharide metabolism in Bacteroides spp. Based on this we have renamed the Pfam families representing the two domains found in the BT_1012 protein, PF13204 and PF12904, as putative glycoside hydrolase and glycoside hydrolase-associated C-terminal domain respectively. PMID:24742328
Spatial structure peculiarities of influenza A virus matrix M1 protein in an acidic solution that simulates the internal lysosomal medium.

PubMed

Shishkov, Alexander; Bogacheva, Elena; Fedorova, Natalia; Ksenofontov, Alexander; Badun, Gennadii; Radyukhin, Victor; Lukashina, Elena; Serebryakova, Marina; Dolgov, Alexey; Chulichkov, Alexey; Dobrov, Evgeny; Baratova, Lyudmila

2011-12-01

The structure of the C-terminal domain of the influenza virus A matrix M1 protein, for which X-ray diffraction data were still missing, was studied in acidic solution. Matrix M1 protein was bombarded with thermally-activated tritium atoms, and the resulting intramolecular distribution of the tritium label was analyzed to assess the steric accessibility of the amino acid residues in this protein. This technique revealed that interdomain loops and the C-terminal domain of the protein are the most accessible to labeling with tritium atoms. A model of the spatial arrangement of the C-terminal domain of matrix M1 protein was generated using rosetta software adjusted to the data obtained by tritium planigraphy experiments. This model suggests that the C-terminal domain is an almost flat layer with a three-α-helical structure. To explain the high level of tritium label incorporation into the C-terminal domain of the M1 protein in an acidic solution, we also used independent experimental approaches (CD spectroscopy, limited proteolysis and MALDI-TOF MS analysis of the proteolysis products, dynamic light scattering and analytical ultracentrifugation), as well as multiple computational algorithms, to analyse the intrinsic protein disorder. Taken together, the results obtained in the present study indicate that the C-terminal domain is weakly structured. We hypothesize that the specific 3D structural peculiarities of the M1 protein revealed in acidic pH solution allow the protein greater structural flexibility and enable it to interact effectively with the components of the host cell. © 2011 The Authors Journal compilation © 2011 FEBS.
Geomfinder: a multi-feature identifier of similar three-dimensional protein patterns: a ligand-independent approach.

PubMed

Núñez-Vivanco, Gabriel; Valdés-Jiménez, Alejandro; Besoaín, Felipe; Reyes-Parada, Miguel

2016-01-01

Since the structure of proteins is more conserved than the sequence, the identification of conserved three-dimensional (3D) patterns among a set of proteins, can be important for protein function prediction, protein clustering, drug discovery and the establishment of evolutionary relationships. Thus, several computational applications to identify, describe and compare 3D patterns (or motifs) have been developed. Often, these tools consider a 3D pattern as that described by the residues surrounding co-crystallized/docked ligands available from X-ray crystal structures or homology models. Nevertheless, many of the protein structures stored in public databases do not provide information about the location and characteristics of ligand binding sites and/or other important 3D patterns such as allosteric sites, enzyme-cofactor interaction motifs, etc. This makes necessary the development of new ligand-independent methods to search and compare 3D patterns in all available protein structures. Here we introduce Geomfinder, an intuitive, flexible, alignment-free and ligand-independent web server for detailed estimation of similarities between all pairs of 3D patterns detected in any two given protein structures. We used around 1100 protein structures to form pairs of proteins which were assessed with Geomfinder. In these analyses each protein was considered in only one pair (e.g. in a subset of 100 different proteins, 50 pairs of proteins can be defined). Thus: (a) Geomfinder detected identical pairs of 3D patterns in a series of monoamine oxidase-B structures, which corresponded to the effectively similar ligand binding sites at these proteins; (b) we identified structural similarities among pairs of protein structures which are targets of compounds such as acarbose, benzamidine, adenosine triphosphate and pyridoxal phosphate; these similar 3D patterns are not detected using sequence-based methods; (c) the detailed evaluation of three specific cases showed the versatility of Geomfinder, which was able to discriminate between similar and different 3D patterns related to binding sites of common substrates in a range of diverse proteins. Geomfinder allows detecting similar 3D patterns between any two pair of protein structures, regardless of the divergency among their amino acids sequences. Although the software is not intended for simultaneous multiple comparisons in a large number of proteins, it can be particularly useful in cases such as the structure-based design of multitarget drugs, where a detailed analysis of 3D patterns similarities between a few selected protein targets is essential.
Channel Formation by CarO, the Carbapenem Resistance-Associated Outer Membrane Protein of Acinetobacter baumannii

PubMed Central

Siroy, Axel; Molle, Virginie; Lemaître-Guillier, Christelle; Vallenet, David; Pestel-Caron, Martine; Cozzone, Alain J.; Jouenne, Thierry; Dé, Emmanuelle

2005-01-01

It has been recently shown that resistance to both imipenem and meropenem in multidrug-resistant clinical strains of Acinetobacter baumannii is associated with the loss of a heat-modifiable 25/29-kDa outer membrane protein, called CarO. This study aimed to investigate the channel-forming properties of CarO. Mass spectrometry analyses of this protein band detected another 25-kDa protein (called Omp25), together with CarO. Both proteins presented similar physicochemical parameters (Mw and pI). We overproduced and purified the two polypeptides as His-tagged recombinant proteins. Circular dichroism analyses demonstrated that the secondary structure of these proteins was mainly a β-strand conformation with spectra typical of porins. We studied the channel-forming properties of proteins by reconstitution into artificial lipid bilayers. In these conditions, CarO induced ion channels with a conductance value of 110 pS in 1 M KCl, whereas the Omp25 protein did not form any channels, despite its suggested porin function. The pores formed by CarO showed a slight cationic selectivity and no voltage closure. No specific imipenem binding site was found in CarO, and this protein would rather form unspecific monomeric channels. PMID:16304148
Kinetic analysis of pre-ribosome structure in vivo

PubMed Central

Swiatkowska, Agata; Wlotzka, Wiebke; Tuck, Alex; Barrass, J. David; Beggs, Jean D.; Tollervey, David

2012-01-01

Pre-ribosomal particles undergo numerous structural changes during maturation, but their high complexity and short lifetimes make these changes very difficult to follow in vivo. In consequence, pre-ribosome structure and composition have largely been inferred from purified particles and analyzed in vitro. Here we describe techniques for kinetic analyses of the changes in pre-ribosome structure in living cells of Saccharomyces cerevisiae. To allow this, in vivo structure probing by DMS modification was combined with affinity purification of newly synthesized 20S pre-rRNA over a time course of metabolic labeling with 4-thiouracil. To demonstrate that this approach is generally applicable, we initially analyzed the accessibility of the region surrounding cleavage site D site at the 3′ end of the mature 18S rRNA region of the pre-rRNA. This revealed a remarkably flexible structure throughout 40S subunit biogenesis, with little stable RNA–protein interaction apparent. Analysis of folding in the region of the 18S central pseudoknot was consistent with previous data showing U3 snoRNA–18S rRNA interactions. Dynamic changes in the structure of the hinge between helix 28 (H28) and H44 of pre-18S rRNA were consistent with recently reported interactions with the 3′ guide region of U3 snoRNA. Finally, analysis of the H18 region indicates that the RNA structure matures early, but additional protection appears subsequently, presumably reflecting protein binding. The structural analyses described here were performed on total, affinity-purified, newly synthesized RNA, so many classes of RNA and RNA–protein complex are potentially amenable to this approach. PMID:23093724
Astronaut Scott Parazynski works with PCG experiment on middeck

NASA Image and Video Library

1994-11-14

STS066-13-029 (3-14 Nov 1994) --- On the Space Shuttle Atlantis' mid-deck, astronaut Scott E. Parazynski, mission specialist, works at one of two areas onboard the Shuttle which support the Protein Crystal Growth (PCG) experiment. This particular section is called the Vapor Diffusion Apparatus (VDA), housed in a Single Locker Thermal Enclosure (STES). Together with the Crystal Observation System, housed in the Thermal Enclosure System (COS/TES) the VDA represents the continuing research into the structures of proteins and other macromolecules such as viruses. In addition to using the microgravity of space to grow high-quality protein crystals for structural analyses, the experiments are expected to help develop technologies and methods to improve the protein crystallization process on Earth as well as in space.
Developing advanced X-ray scattering methods combined with crystallography and computation.

PubMed

Perry, J Jefferson P; Tainer, John A

2013-03-01

The extensive use of small angle X-ray scattering (SAXS) over the last few years is rapidly providing new insights into protein interactions, complex formation and conformational states in solution. This SAXS methodology allows for detailed biophysical quantification of samples of interest. Initial analyses provide a judgment of sample quality, revealing the potential presence of aggregation, the overall extent of folding or disorder, the radius of gyration, maximum particle dimensions and oligomerization state. Structural characterizations include ab initio approaches from SAXS data alone, and when combined with previously determined crystal/NMR, atomistic modeling can further enhance structural solutions and assess validity. This combination can provide definitions of architectures, spatial organizations of protein domains within a complex, including those not determined by crystallography or NMR, as well as defining key conformational states of a protein interaction. SAXS is not generally constrained by macromolecule size, and the rapid collection of data in a 96-well plate format provides methods to screen sample conditions. This includes screening for co-factors, substrates, differing protein or nucleotide partners or small molecule inhibitors, to more fully characterize the variations within assembly states and key conformational changes. Such analyses may be useful for screening constructs and conditions to determine those most likely to promote crystal growth of a complex under study. Moreover, these high throughput structural determinations can be leveraged to define how polymorphisms affect assembly formations and activities. This is in addition to potentially providing architectural characterizations of complexes and interactions for systems biology-based research, and distinctions in assemblies and interactions in comparative genomics. Thus, SAXS combined with crystallography/NMR and computation provides a unique set of tools that should be considered as being part of one's repertoire of biophysical analyses, when conducting characterizations of protein and other macromolecular interactions. Copyright © 2013 Elsevier Inc. All rights reserved.
Integrating protein structural dynamics and evolutionary analysis with Bio3D.

PubMed

Skjærven, Lars; Yao, Xin-Qiu; Scarabelli, Guido; Grant, Barry J

2014-12-10

Popular bioinformatics approaches for studying protein functional dynamics include comparisons of crystallographic structures, molecular dynamics simulations and normal mode analysis. However, determining how observed displacements and predicted motions from these traditionally separate analyses relate to each other, as well as to the evolution of sequence, structure and function within large protein families, remains a considerable challenge. This is in part due to the general lack of tools that integrate information of molecular structure, dynamics and evolution. Here, we describe the integration of new methodologies for evolutionary sequence, structure and simulation analysis into the Bio3D package. This major update includes unique high-throughput normal mode analysis for examining and contrasting the dynamics of related proteins with non-identical sequences and structures, as well as new methods for quantifying dynamical couplings and their residue-wise dissection from correlation network analysis. These new methodologies are integrated with major biomolecular databases as well as established methods for evolutionary sequence and comparative structural analysis. New functionality for directly comparing results derived from normal modes, molecular dynamics and principal component analysis of heterogeneous experimental structure distributions is also included. We demonstrate these integrated capabilities with example applications to dihydrofolate reductase and heterotrimeric G-protein families along with a discussion of the mechanistic insight provided in each case. The integration of structural dynamics and evolutionary analysis in Bio3D enables researchers to go beyond a prediction of single protein dynamics to investigate dynamical features across large protein families. The Bio3D package is distributed with full source code and extensive documentation as a platform independent R package under a GPL2 license from http://thegrantlab.org/bio3d/ .
From protein interaction profile to functional assignment: the human protein Ki-1/57 is associated with pre-mRNA splicing events.

PubMed

Bressan, Gustavo Costa; Kobarg, Jörg

2010-01-01

The mapping of protein-protein interactions of a determined organism is considered fundamental to assign protein function in the post-genomic era. As part of this effort, screenings for pairwise interactions by yeast two-hybrid system have been used popularly to reveal protein interaction networks in different biological systems. Through the identification of protein interaction partners we have successfully obtained interesting functional clues for Ki-1/57, a human protein with no previous functional annotation, in the context of RNA metabolism. We briefly discuss the way we approached protein-protein interaction data to conduct and interpret further molecular biological and cellular studies as well as structural analyses on this protein. Our data suggest that Ki-1/57 belongs to the family of intrinsically unstructured proteins and that the structural flexibility may be crucial for its capacity to interact with many different proteins. A large fraction of these proteins are involved in pre-mRNA splicing control. Finally, Ki-1/57 is localized to several subnuclear domains, all of which have been described to splicing and other RNA processing events.
Development of an automated large-scale protein-crystallization and monitoring system for high-throughput protein-structure analyses.

PubMed

Hiraki, Masahiko; Kato, Ryuichi; Nagai, Minoru; Satoh, Tadashi; Hirano, Satoshi; Ihara, Kentaro; Kudo, Norio; Nagae, Masamichi; Kobayashi, Masanori; Inoue, Michio; Uejima, Tamami; Oda, Shunichiro; Chavas, Leonard M G; Akutsu, Masato; Yamada, Yusuke; Kawasaki, Masato; Matsugaki, Naohiro; Igarashi, Noriyuki; Suzuki, Mamoru; Wakatsuki, Soichi

2006-09-01

Protein crystallization remains one of the bottlenecks in crystallographic analysis of macromolecules. An automated large-scale protein-crystallization system named PXS has been developed consisting of the following subsystems, which proceed in parallel under unified control software: dispensing precipitants and protein solutions, sealing crystallization plates, carrying robot, incubators, observation system and image-storage server. A sitting-drop crystallization plate specialized for PXS has also been designed and developed. PXS can set up 7680 drops for vapour diffusion per hour, which includes time for replenishing supplies such as disposable tips and crystallization plates. Images of the crystallization drops are automatically recorded according to a preprogrammed schedule and can be viewed by users remotely using web-based browser software. A number of protein crystals were successfully produced and several protein structures could be determined directly from crystals grown by PXS. In other cases, X-ray quality crystals were obtained by further optimization by manual screening based on the conditions found by PXS.
Salts employed in hydrophobic interaction chromatography can change protein structure - insights from protein-ligand interaction thermodynamics, circular dichroism spectroscopy and small angle X-ray scattering.

PubMed

Komaromy, Andras Z; Kulsing, Chadin; Boysen, Reinhard I; Hearn, Milton T W

2015-03-01

Key requirements of protein purification by hydrophobic interaction chromatography (HIC) are preservation of the tertiary/quaternary structure, maintenance of biological function, and separation of the correctly folded protein from its unfolded forms or aggregates. This study examines the relationship between the HIC retention behavior of hen egg white lysozyme (HEWL) in high concentrations of several kosmotropic salts and its conformation, assessed by circular dichroism (CD) spectroscopy. Further, the physicochemical properties of HEWL in the presence of high concentrations of ammonium sulfate, sodium chloride and magnesium chloride were investigated by small angle X-ray scattering (SAXS) at different temperatures. Radii of gyration were extrapolated from Guinier approximations and the indirect transform program GNOM with protein-protein interaction and contrast variation taken into account. A bead model simulation provided information on protein structural changes using ab initio reconstruction with GASBOR. These results correlated to the secondary structure content obtained from CD spectroscopy of HEWL. These changes in SAXS and CD data were consistent with heat capacity ΔCp -values obtained from van't Hoff plot analyses of the retention data. Collectively, these insights enable informed decisions to be made on the choice of chromatographic conditions, leading to improved separation selectivity and opportunities for innovative column-assisted protein refolding methods. Copyright © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Terahertz mechanical vibrations in lysozyme: Raman spectroscopy vs modal analysis

NASA Astrophysics Data System (ADS)

Carpinteri, Alberto; Lacidogna, Giuseppe; Piana, Gianfranco; Bassani, Andrea

2017-07-01

The mechanical behaviour of proteins is receiving an increasing attention from the scientific community. Recently it has been suggested that mechanical vibrations play a crucial role in controlling structural configuration changes (folding) which govern proteins biological function. The mechanism behind protein folding is still not completely understood, and many efforts are being made to investigate this phenomenon. Complex molecular dynamics simulations and sophisticated experimental measurements are conducted to investigate protein dynamics and to perform protein structure predictions; however, these are two related, although quite distinct, approaches. Here we investigate mechanical vibrations of lysozyme by Raman spectroscopy and linear normal mode calculations (modal analysis). The input mechanical parameters to the numerical computations are taken from the literature. We first give an estimate of the order of magnitude of protein vibration frequencies by considering both classical wave mechanics and structural dynamics formulas. Afterwards, we perform modal analyses of some relevant chemical groups and of the full lysozyme protein. The numerical results are compared to experimental data, obtained from both in-house and literature Raman measurements. In particular, the attention is focused on a large peak at 0.84 THz (29.3 cm-1) in the Raman spectrum obtained analyzing a lyophilized powder sample.
Artificial proteins as allosteric modulators of PDZ3 and SH3 in two-domain constructs: A computational characterization of novel chimeric proteins.

PubMed

Kirubakaran, Palani; Pfeiferová, Lucie; Boušová, Kristýna; Bednarova, Lucie; Obšilová, Veronika; Vondrášek, Jiří

2016-10-01

Artificial multidomain proteins with enhanced structural and functional properties can be utilized in a broad spectrum of applications. The design of chimeric fusion proteins utilizing protein domains or one-domain miniproteins as building blocks is an important advancement for the creation of new biomolecules for biotechnology and medical applications. However, computational studies to describe in detail the dynamics and geometry properties of two-domain constructs made from structurally and functionally different proteins are lacking. Here, we tested an in silico design strategy using all-atom explicit solvent molecular dynamics simulations. The well-characterized PDZ3 and SH3 domains of human zonula occludens (ZO-1) (3TSZ), along with 5 artificial domains and 2 types of molecular linkers, were selected to construct chimeric two-domain molecules. The influence of the artificial domains on the structure and dynamics of the PDZ3 and SH3 domains was determined using a range of analyses. We conclude that the artificial domains can function as allosteric modulators of the PDZ3 and SH3 domains. Proteins 2016; 84:1358-1374. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Structural Basis for the Interaction of the Golgi-Associated Retrograde Protein (GARP) Complex with the t-SNARE Syntaxin 6

PubMed Central

Abascal-Palacios, Guillermo; Schindler, Christina; Rojas, Adriana L; Bonifacino, Juan S.; Hierro, Aitor

2016-01-01

Summary The Golgi-Associated Retrograde Protein (GARP) is a tethering complex involved in the fusion of endosome-derived transport vesicles to the trans-Golgi network through interaction with components of the Syntaxin 6/Syntaxin 16/Vti1a/VAMP4 SNARE complex. The mechanisms by which GARP and other tethering factors engage the SNARE fusion machinery are poorly understood. Herein we report the structural basis for the interaction of the human Ang2 subunit of GARP with Syntaxin 6 and the closely related Syntaxin 10. The crystal structure of Syntaxin 6 Habc domain in complex with a peptide from the N terminus of Ang2 shows a novel binding mode in which a di-tyrosine motif of Ang2 interacts with a highly conserved groove in Syntaxin 6. Structure-based mutational analyses validate the crystal structure and support the phylogenetic conservation of this interaction. The same binding determinants are found in other tethering proteins and syntaxins, suggesting a general interaction mechanism. PMID:23932592
Worldwide Protein Data Bank validation information: usage and trends.

PubMed

Smart, Oliver S; Horský, Vladimír; Gore, Swanand; Svobodová Vařeková, Radka; Bendová, Veronika; Kleywegt, Gerard J; Velankar, Sameer

2018-03-01

Realising the importance of assessing the quality of the biomolecular structures deposited in the Protein Data Bank (PDB), the Worldwide Protein Data Bank (wwPDB) partners established Validation Task Forces to obtain advice on the methods and standards to be used to validate structures determined by X-ray crystallography, nuclear magnetic resonance spectroscopy and three-dimensional electron cryo-microscopy. The resulting wwPDB validation pipeline is an integral part of the wwPDB OneDep deposition, biocuration and validation system. The wwPDB Validation Service webserver (https://validate.wwpdb.org) can be used to perform checks prior to deposition. Here, it is shown how validation metrics can be combined to produce an overall score that allows the ranking of macromolecular structures and domains in search results. The ValTrends DB database provides users with a convenient way to access and analyse validation information and other properties of X-ray crystal structures in the PDB, including investigating trends in and correlations between different structure properties and validation metrics.

Worldwide Protein Data Bank validation information: usage and trends

PubMed Central

Horský, Vladimír; Gore, Swanand; Svobodová Vařeková, Radka; Bendová, Veronika

2018-01-01

Realising the importance of assessing the quality of the biomolecular structures deposited in the Protein Data Bank (PDB), the Worldwide Protein Data Bank (wwPDB) partners established Validation Task Forces to obtain advice on the methods and standards to be used to validate structures determined by X-ray crystallography, nuclear magnetic resonance spectroscopy and three-dimensional electron cryo-microscopy. The resulting wwPDB validation pipeline is an integral part of the wwPDB OneDep deposition, biocuration and validation system. The wwPDB Validation Service webserver (https://validate.wwpdb.org) can be used to perform checks prior to deposition. Here, it is shown how validation metrics can be combined to produce an overall score that allows the ranking of macromolecular structures and domains in search results. The ValTrendsDB database provides users with a convenient way to access and analyse validation information and other properties of X-ray crystal structures in the PDB, including investigating trends in and correlations between different structure properties and validation metrics. PMID:29533231
Annotation of Alternatively Spliced Proteins and Transcripts with Protein-Folding Algorithms and Isoform-Level Functional Networks.

PubMed

Li, Hongdong; Zhang, Yang; Guan, Yuanfang; Menon, Rajasree; Omenn, Gilbert S

2017-01-01

Tens of thousands of splice isoforms of proteins have been catalogued as predicted sequences from transcripts in humans and other species. Relatively few have been characterized biochemically or structurally. With the extensive development of protein bioinformatics, the characterization and modeling of isoform features, isoform functions, and isoform-level networks have advanced notably. Here we present applications of the I-TASSER family of algorithms for folding and functional predictions and the IsoFunc, MIsoMine, and Hisonet data resources for isoform-level analyses of network and pathway-based functional predictions and protein-protein interactions. Hopefully, predictions and insights from protein bioinformatics will stimulate many experimental validation studies.
Expression and Purification of Rat Glucose Transporter 1 in Pichia pastoris.

PubMed

Venskutonytė, Raminta; Elbing, Karin; Lindkvist-Petersson, Karin

2018-01-01

Large amounts of pure and homogenous protein are a prerequisite for several biochemical and biophysical analyses, and in particular if aiming at resolving the three-dimensional protein structure. Here we describe the production of the rat glucose transporter 1 (GLUT1), a membrane protein facilitating the transport of glucose in cells. The protein is recombinantly expressed in the yeast Pichia pastoris. It is easily maintained and large-scale protein production in shaker flasks, as commonly performed in academic research laboratories, results in relatively high yields of membrane protein. The purification protocol describes all steps needed to obtain a pure and homogenous GLUT1 protein solution, including cell growth, membrane isolation, and chromatographic purification methods.
Determining crystal structures through crowdsourcing and coursework

PubMed Central

Horowitz, Scott; Koepnick, Brian; Martin, Raoul; Tymieniecki, Agnes; Winburn, Amanda A.; Cooper, Seth; Flatten, Jeff; Rogawski, David S.; Koropatkin, Nicole M.; Hailu, Tsinatkeab T.; Jain, Neha; Koldewey, Philipp; Ahlstrom, Logan S.; Chapman, Matthew R.; Sikkema, Andrew P.; Skiba, Meredith A.; Maloney, Finn P.; Beinlich, Felix R. M.; Caglar, Ahmet; Coral, Alan; Jensen, Alice Elizabeth; Lubow, Allen; Boitano, Amanda; Lisle, Amy Elizabeth; Maxwell, Andrew T.; Failer, Barb; Kaszubowski, Bartosz; Hrytsiv, Bohdan; Vincenzo, Brancaccio; de Melo Cruz, Breno Renan; McManus, Brian Joseph; Kestemont, Bruno; Vardeman, Carl; Comisky, Casey; Neilson, Catherine; Landers, Catherine R.; Ince, Christopher; Buske, Daniel Jon; Totonjian, Daniel; Copeland, David Marshall; Murray, David; Jagieła, Dawid; Janz, Dietmar; Wheeler, Douglas C.; Cali, Elie; Croze, Emmanuel; Rezae, Farah; Martin, Floyd Orville; Beecher, Gil; de Jong, Guido Alexander; Ykman, Guy; Feldmann, Harald; Chan, Hugo Paul Perez; Kovanecz, Istvan; Vasilchenko, Ivan; Connellan, James C.; Borman, Jami Lynne; Norrgard, Jane; Kanfer, Jebbie; Canfield, Jeffrey M.; Slone, Jesse David; Oh, Jimmy; Mitchell, Joanne; Bishop, John; Kroeger, John Douglas; Schinkler, Jonas; McLaughlin, Joseph; Brownlee, June M.; Bell, Justin; Fellbaum, Karl Willem; Harper, Kathleen; Abbey, Kirk J.; Isaksson, Lennart E.; Wei, Linda; Cummins, Lisa N.; Miller, Lori Anne; Bain, Lyn; Carpenter, Lynn; Desnouck, Maarten; Sharma, Manasa G.; Belcastro, Marcus; Szew, Martin; Szew, Martin; Britton, Matthew; Gaebel, Matthias; Power, Max; Cassidy, Michael; Pfützenreuter, Michael; Minett, Michele; Wesselingh, Michiel; Yi, Minjune; Cameron, Neil Haydn Tormey; Bolibruch, Nicholas I.; Benevides, Noah; Kathleen Kerr, Norah; Barlow, Nova; Crevits, Nykole Krystyne; Dunn, Paul; Roque, Paulo Sergio Silveira Belo Nascimento; Riber, Peter; Pikkanen, Petri; Shehzad, Raafay; Viosca, Randy; James Fraser, Robert; Leduc, Robert; Madala, Roman; Shnider, Scott; de Boisblanc, Sharon; Butkovich, Slava; Bliven, Spencer; Hettler, Stephen; Telehany, Stephen; Schwegmann, Steven A.; Parkes, Steven; Kleinfelter, Susan C.; Michael Holst, Sven; van der Laan, T. J. A.; Bausewein, Thomas; Simon, Vera; Pulley, Warwick; Hull, William; Kim, Annes Yukyung; Lawton, Alexis; Ruesch, Amanda; Sundar, Anjali; Lawrence, Anna-Lisa; Afrin, Antara; Maheshwer, Bhargavi; Turfe, Bilal; Huebner, Christian; Killeen, Courtney Elizabeth; Antebi-Lerrman, Dalia; Luan, Danny; Wolfe, Derek; Pham, Duc; Michewicz, Elaina; Hull, Elizabeth; Pardington, Emily; Galal, Galal Osama; Sun, Grace; Chen, Grace; Anderson, Halie E.; Chang, Jane; Hewlett, Jeffrey Thomas; Sterbenz, Jennifer; Lim, Jiho; Morof, Joshua; Lee, Junho; Inn, Juyoung Samuel; Hahm, Kaitlin; Roth, Kaitlin; Nair, Karun; Markin, Katherine; Schramm, Katie; Toni Eid, Kevin; Gam, Kristina; Murphy, Lisha; Yuan, Lucy; Kana, Lulia; Daboul, Lynn; Shammas, Mario Karam; Chason, Max; Sinan, Moaz; Andrew Tooley, Nicholas; Korakavi, Nisha; Comer, Patrick; Magur, Pragya; Savliwala, Quresh; Davison, Reid Michael; Sankaran, Roshun Rajiv; Lewe, Sam; Tamkus, Saule; Chen, Shirley; Harvey, Sho; Hwang, Sin Ye; Vatsia, Sohrab; Withrow, Stefan; Luther, Tahra K; Manett, Taylor; Johnson, Thomas James; Ryan Brash, Timothy; Kuhlman, Wyatt; Park, Yeonjung; Popović, Zoran; Baker, David; Khatib, Firas; Bardwell, James C. A.

2016-01-01

We show here that computer game players can build high-quality crystal structures. Introduction of a new feature into the computer game Foldit allows players to build and real-space refine structures into electron density maps. To assess the usefulness of this feature, we held a crystallographic model-building competition between trained crystallographers, undergraduate students, Foldit players and automatic model-building algorithms. After removal of disordered residues, a team of Foldit players achieved the most accurate structure. Analysing the target protein of the competition, YPL067C, uncovered a new family of histidine triad proteins apparently involved in the prevention of amyloid toxicity. From this study, we conclude that crystallographers can utilize crowdsourcing to interpret electron density information and to produce structure solutions of the highest quality. PMID:27633552
Investigating the effect of key mutations on the conformational dynamics of toll-like receptor dimers through molecular dynamics simulations and protein structure networks.

PubMed

Mahita, Jarjapu; Sowdhamini, Ramanathan

2018-04-01

The Toll-like receptors (TLRs) are critical components of the innate immune system due to their ability to detect conserved pathogen-associated molecular patterns, present in bacteria, viruses, and other microorganisms. Ligand detection by TLRs leads to a signaling cascade, mediated by interactions among TIR domains present in the receptors, the bridging adaptors and sorting adaptors. The BB loop is a highly conserved region present in the TIR domain and is crucial for mediating interactions among TIR domain-containing proteins. Mutations in the BB loop of the Toll-like receptors, such as the A795P mutation in TLR3 and the P712H mutation (Lps d mutation) in TLR4, have been reported to disrupt or alter downstream signaling. While the phenotypic effect of these mutations is known, the underlying effect of these mutations on the structure, dynamics and interactions with other TIR domain-containing proteins is not well understood. Here, we have attempted to investigate the effect of the BB loop mutations on the dimer form of TLRs, using TLR2 and TLR3 as case studies. Our results based on molecular dynamics simulations, protein-protein interaction analyses and protein structure network analyses highlight significant differences between the dimer interfaces of the wild-type and mutant forms and provide a logical reasoning for the effect of these mutations on adaptor binding to TLRs. Furthermore, it also leads us to propose a hypothesis for the differential requirement of signaling and bridging adaptors by TLRs. This could aid in further understanding of the mechanisms governing such signaling pathways. © 2018 Wiley Periodicals, Inc.
A periodic table of coiled-coil protein structures.

PubMed

Moutevelis, Efrosini; Woolfson, Derek N

2009-01-23

Coiled coils are protein structure domains with two or more alpha-helices packed together via interlacing of side chains known as knob-into-hole packing. We analysed and classified a large set of coiled-coil structures using a combination of automated and manual methods. This led to a systematic classification that we termed a "periodic table of coiled coils," which we have made available at http://coiledcoils.chm.bris.ac.uk/ccplus/search/periodic_table. In this table, coiled-coil assemblies are arranged in columns with increasing numbers of alpha-helices and in rows of increased complexity. The table provides a framework for understanding possibilities in and limits on coiled-coil structures and a basis for future prediction, engineering and design studies.
Predicting the tolerated sequences for proteins and protein interfaces using RosettaBackrub flexible backbone design.

PubMed

Smith, Colin A; Kortemme, Tanja

2011-01-01

Predicting the set of sequences that are tolerated by a protein or protein interface, while maintaining a desired function, is useful for characterizing protein interaction specificity and for computationally designing sequence libraries to engineer proteins with new functions. Here we provide a general method, a detailed set of protocols, and several benchmarks and analyses for estimating tolerated sequences using flexible backbone protein design implemented in the Rosetta molecular modeling software suite. The input to the method is at least one experimentally determined three-dimensional protein structure or high-quality model. The starting structure(s) are expanded or refined into a conformational ensemble using Monte Carlo simulations consisting of backrub backbone and side chain moves in Rosetta. The method then uses a combination of simulated annealing and genetic algorithm optimization methods to enrich for low-energy sequences for the individual members of the ensemble. To emphasize certain functional requirements (e.g. forming a binding interface), interactions between and within parts of the structure (e.g. domains) can be reweighted in the scoring function. Results from each backbone structure are merged together to create a single estimate for the tolerated sequence space. We provide an extensive description of the protocol and its parameters, all source code, example analysis scripts and three tests applying this method to finding sequences predicted to stabilize proteins or protein interfaces. The generality of this method makes many other applications possible, for example stabilizing interactions with small molecules, DNA, or RNA. Through the use of within-domain reweighting and/or multistate design, it may also be possible to use this method to find sequences that stabilize particular protein conformations or binding interactions over others.
Bioinformatics and functional analyses of coronavirus nonstructural proteins involved in the formation of replicative organelles.

PubMed

Neuman, Benjamin W

2016-11-01

Replication of eukaryotic positive-stranded RNA viruses is usually linked to the presence of membrane-associated replicative organelles. The purpose of this review is to discuss the function of proteins responsible for formation of the coronavirus replicative organelle. This will be done by identifying domains that are conserved across the order Nidovirales, and by summarizing what is known about function and structure at the level of protein domains. Copyright © 2016 Elsevier B.V. All rights reserved.
Structural flexibility and protein adaptation to temperature: Molecular dynamics analysis of malate dehydrogenases of marine molluscs.

PubMed

Dong, Yun-Wei; Liao, Ming-Ling; Meng, Xian-Liang; Somero, George N

2018-02-06

Orthologous proteins of species adapted to different temperatures exhibit differences in stability and function that are interpreted to reflect adaptive variation in structural "flexibility." However, quantifying flexibility and comparing flexibility across proteins has remained a challenge. To address this issue, we examined temperature effects on cytosolic malate dehydrogenase (cMDH) orthologs from differently thermally adapted congeners of five genera of marine molluscs whose field body temperatures span a range of ∼60 °C. We describe consistent patterns of convergent evolution in adaptation of function [temperature effects on K M of cofactor (NADH)] and structural stability (rate of heat denaturation of activity). To determine how these differences depend on flexibilities of overall structure and of regions known to be important in binding and catalysis, we performed molecular dynamics simulation (MDS) analyses. MDS analyses revealed a significant negative correlation between adaptation temperature and heat-induced increase of backbone atom movements [root mean square deviation (rmsd) of main-chain atoms]. Root mean square fluctuations (RMSFs) of movement by individual amino acid residues varied across the sequence in a qualitatively similar pattern among orthologs. Regions of sequence involved in ligand binding and catalysis-termed mobile regions 1 and 2 (MR1 and MR2), respectively-showed the largest values for RMSF. Heat-induced changes in RMSF values across the sequence and, importantly, in MR1 and MR2 were greatest in cold-adapted species. MDS methods are shown to provide powerful tools for examining adaptation of enzymes by providing a quantitative index of protein flexibility and identifying sequence regions where adaptive change in flexibility occurs.
Single-column purification of the tag-free, recombinant form of the neuronal calcium sensor protein, hippocalcin expressed in Escherichia coli.

PubMed

Krishnan, Anuradha; Viviano, Jeffrey; Morozov, Yaroslav; Venkataraman, Venkat

2016-07-01

Hippocalcin is a 193 aa protein that is a member of the neuronal calcium sensor protein family, whose functions are regulated by calcium. Mice that lack the function of this protein are compromised in the long term potentiation aspect of memory generation. Recently, mutations in the gene have been linked with dystonia in human. The protein has no intrinsic enzyme activity but is known to bind to variety of target proteins. Very little information is available on how the protein executes its critical role in signaling pathways, except that it is regulated by binding of calcium. Further delineation of its function requires large amounts of pure protein. In this report, we present a single-step purification procedure that yields high quantities of the bacterially expressed, recombinant protein. The procedure may be adapted to purify the protein from inclusion bodies or cytosol in its myristoylated or non-myristoylated forms. MALDI-MS (in source decay) analyses demonstrates that the myristoylation occurs at the glycine residue. The protein is also biologically active as measured through tryptophan fluorescence, mobility shift and guanylate cyclase activity assays. Thus, further analyses of hippocalcin, both structural and functional, need no longer be limited by protein availability. Copyright © 2016 Elsevier Inc. All rights reserved.
Multiple protein–protein interactions converging on the Prp38 protein during activation of the human spliceosome

PubMed Central

Schütze, Tonio; Ulrich, Alexander K.C.; Apelt, Luise; Will, Cindy L.; Bartlick, Natascha; Seeger, Martin; Weber, Gert; Lührmann, Reinhard; Stelzl, Ulrich; Wahl, Markus C.

2016-01-01

Spliceosomal Prp38 proteins contain a conserved amino-terminal domain, but only higher eukaryotic orthologs also harbor a carboxy-terminal RS domain, a hallmark of splicing regulatory SR proteins. We show by crystal structure analysis that the amino-terminal domain of human Prp38 is organized around three pairs of antiparallel α-helices and lacks similarities to RNA-binding domains found in canonical SR proteins. Instead, yeast two-hybrid analyses suggest that the amino-terminal domain is a versatile protein–protein interaction hub that possibly binds 12 other spliceosomal proteins, most of which are recruited at the same stage as Prp38. By quantitative, alanine surface-scanning two-hybrid screens and biochemical analyses we delineated four distinct interfaces on the Prp38 amino-terminal domain. In vitro interaction assays using recombinant proteins showed that Prp38 can bind at least two proteins simultaneously via two different interfaces. Addition of excess Prp38 amino-terminal domain to in vitro splicing assays, but not of an interaction-deficient mutant, stalled splicing at a precatalytic stage. Our results show that human Prp38 is an unusual SR protein, whose amino-terminal domain is a multi-interface protein–protein interaction platform that might organize the relative positioning of other proteins during splicing. PMID:26673105
Molecular and ultrastructural analysis of forisome subunits reveals the principles of forisome assembly

PubMed Central

Müller, Boje; Groscurth, Sira; Menzel, Matthias; Rüping, Boris A.; Twyman, Richard M.; Prüfer, Dirk; Noll, Gundula A.

2014-01-01

Background and Aims Forisomes are specialized structural phloem proteins that mediate sieve element occlusion after wounding exclusively in papilionoid legumes, but most studies of forisome structure and function have focused on the Old World clade rather than the early lineages. A comprehensive phylogenetic, molecular, structural and functional analysis of forisomes from species covering a broad spectrum of the papilionoid legumes was therefore carried out, including the first analysis of Dipteryx panamensis forisomes, representing the earliest branch of the Papilionoideae lineage. The aim was to study the molecular, structural and functional conservation among forisomes from different tribes and to establish the roles of individual forisome subunits. Methods Sequence analysis and bioinformatics were combined with structural and functional analysis of native forisomes and artificial forisome-like protein bodies, the latter produced by expressing forisome genes from different legumes in a heterologous background. The structure of these bodies was analysed using a combination of confocal laser scanning microscopy (CLSM), scanning electron microscopy (SEM) and transmission electron microscopy (TEM), and the function of individual subunits was examined by combinatorial expression, micromanipulation and light microscopy. Key Results Dipteryx panamensis native forisomes and homomeric protein bodies assembled from the single sieve element occlusion by forisome (SEO-F) subunit identified in this species were structurally and functionally similar to forisomes from the Old World clade. In contrast, homomeric protein bodies assembled from individual SEO-F subunits from Old World species yielded artificial forisomes differing in proportion to their native counterparts, suggesting that multiple SEO-F proteins are required for forisome assembly in these plants. Structural differences between Medicago truncatula native forisomes, homomeric protein bodies and heteromeric bodies containing all possible subunit combinations suggested that combinations of SEO-F proteins may fine-tune the geometric proportions and reactivity of forisomes. Conclusions It is concluded that forisome structure and function have been strongly conserved during evolution and that species-dependent subsets of SEO-F proteins may have evolved to fine-tune the structure of native forisomes. PMID:24694827
Phylogenetic and Evolutionary Patterns in Microbial Carotenoid Biosynthesis Are Revealed by Comparative Genomics

PubMed Central

Klassen, Jonathan L.

2010-01-01

Background Carotenoids are multifunctional, taxonomically widespread and biotechnologically important pigments. Their biosynthesis serves as a model system for understanding the evolution of secondary metabolism. Microbial carotenoid diversity and evolution has hitherto been analyzed primarily from structural and biosynthetic perspectives, with the few phylogenetic analyses of microbial carotenoid biosynthetic proteins using either used limited datasets or lacking methodological rigor. Given the recent accumulation of microbial genome sequences, a reappraisal of microbial carotenoid biosynthetic diversity and evolution from the perspective of comparative genomics is warranted to validate and complement models of microbial carotenoid diversity and evolution based upon structural and biosynthetic data. Methodology/Principal Findings Comparative genomics were used to identify and analyze in silico microbial carotenoid biosynthetic pathways. Four major phylogenetic lineages of carotenoid biosynthesis are suggested composed of: (i) Proteobacteria; (ii) Firmicutes; (iii) Chlorobi, Cyanobacteria and photosynthetic eukaryotes; and (iv) Archaea, Bacteroidetes and two separate sub-lineages of Actinobacteria. Using this phylogenetic framework, specific evolutionary mechanisms are proposed for carotenoid desaturase CrtI-family enzymes and carotenoid cyclases. Several phylogenetic lineage-specific evolutionary mechanisms are also suggested, including: (i) horizontal gene transfer; (ii) gene acquisition followed by differential gene loss; (iii) co-evolution with other biochemical structures such as proteorhodopsins; and (iv) positive selection. Conclusions/Significance Comparative genomics analyses of microbial carotenoid biosynthetic proteins indicate a much greater taxonomic diversity then that identified based on structural and biosynthetic data, and divides microbial carotenoid biosynthesis into several, well-supported phylogenetic lineages not evident previously. This phylogenetic framework is applicable to understanding the evolution of specific carotenoid biosynthetic proteins or the unique characteristics of carotenoid biosynthetic evolution in a specific phylogenetic lineage. Together, these analyses suggest a “bramble” model for microbial carotenoid biosynthesis whereby later biosynthetic steps exhibit greater evolutionary plasticity and reticulation compared to those closer to the biosynthetic “root”. Structural diversification may be constrained (“trimmed”) where selection is strong, but less so where selection is weaker. These analyses also highlight likely productive avenues for future research and bioprospecting by identifying both gaps in current knowledge and taxa which may particularly facilitate carotenoid diversification. PMID:20582313
Discrete and Structurally Unique Proteins (T$$\\bar{a}$$pirins) Mediate Attachment of Extremely Thermophilic Caldicellulosiruptor Species to Cellulose

DOE PAGES

Blumer-Schuette, S. E.; Alahuhta, M.; Conway, J. M.; ...

2015-04-24

A variety of catalytic and noncatalytic protein domains are deployed by select microorganisms to deconstruct lignocellulose. These extracellular proteins are used to attach to, modify, and hydrolyze the complex polysaccharides present in plant cell walls. Cellulolytic enzymes, often containing carbohydrate-binding modules, are key to this process; however, these enzymes are not solely responsible for attachment. Few mechanisms of attachment have been discovered among bacteria that do not form large polypeptide structures, called cellulosomes, to deconstruct biomass. In this study, bioinformatics and proteomics analyses identified unique, discrete, hypothetical proteins (“tmore » $$\\bar{a}$$pirins,” origin from M$$\\bar{a}$$ori: to join), not directly associated with cellulases, that mediate attachment to cellulose by species in the noncellulosomal, extremely thermophilic bacterial genus Caldicellulosiruptor. Two t$$\\bar{a}$$pirin genes are located directly downstream of a type IV pilus operon in strongly cellulolytic members of the genus, whereas homologs are absent from the weakly cellulolytic Caldicellulosiruptor species. Based on their amino acid sequence, t$$\\bar{a}$$pirins are specific to these extreme thermophiles. T$$\\bar{a}$$pirins are also unusual in that they share no detectable protein domain signatures with known polysaccharide-binding proteins. Adsorption isotherm and trans vivo analyses demonstrated the carbohydrate-binding module-like affinity of the t$$\\bar{a}$$pirins for cellulose. Crystallization of a cellulose-binding truncation from one t$$\\bar{a}$$pirin indicated that these proteins form a long β-helix core with a shielded hydrophobic face. In addition, they are structurally unique and define a new class of polysaccharide adhesins. Strongly cellulolytic Caldicellulosiruptor species employ t$$\\bar{a}$$pirins to complement substrate-binding proteins from the ATP-binding cassette transporters and multidomain extracellular and S-layer-associated glycoside hydrolases to process the carbohydrate content of lignocellulose.« less
Discrete and Structurally Unique Proteins (T$$\\bar{a}$$pirins) Mediate Attachment of Extremely Thermophilic Caldicellulosiruptor Species to Cellulose

DOE Office of Scientific and Technical Information (OSTI.GOV)

Blumer-Schuette, S. E.; Alahuhta, M.; Conway, J. M.

A variety of catalytic and noncatalytic protein domains are deployed by select microorganisms to deconstruct lignocellulose. These extracellular proteins are used to attach to, modify, and hydrolyze the complex polysaccharides present in plant cell walls. Cellulolytic enzymes, often containing carbohydrate-binding modules, are key to this process; however, these enzymes are not solely responsible for attachment. Few mechanisms of attachment have been discovered among bacteria that do not form large polypeptide structures, called cellulosomes, to deconstruct biomass. In this study, bioinformatics and proteomics analyses identified unique, discrete, hypothetical proteins (“tmore » $$\\bar{a}$$pirins,” origin from M$$\\bar{a}$$ori: to join), not directly associated with cellulases, that mediate attachment to cellulose by species in the noncellulosomal, extremely thermophilic bacterial genus Caldicellulosiruptor. Two t$$\\bar{a}$$pirin genes are located directly downstream of a type IV pilus operon in strongly cellulolytic members of the genus, whereas homologs are absent from the weakly cellulolytic Caldicellulosiruptor species. Based on their amino acid sequence, t$$\\bar{a}$$pirins are specific to these extreme thermophiles. T$$\\bar{a}$$pirins are also unusual in that they share no detectable protein domain signatures with known polysaccharide-binding proteins. Adsorption isotherm and trans vivo analyses demonstrated the carbohydrate-binding module-like affinity of the t$$\\bar{a}$$pirins for cellulose. Crystallization of a cellulose-binding truncation from one t$$\\bar{a}$$pirin indicated that these proteins form a long β-helix core with a shielded hydrophobic face. In addition, they are structurally unique and define a new class of polysaccharide adhesins. Strongly cellulolytic Caldicellulosiruptor species employ t$$\\bar{a}$$pirins to complement substrate-binding proteins from the ATP-binding cassette transporters and multidomain extracellular and S-layer-associated glycoside hydrolases to process the carbohydrate content of lignocellulose.« less
Glycan array data management at Consortium for Functional Glycomics.

PubMed

Venkataraman, Maha; Sasisekharan, Ram; Raman, Rahul

2015-01-01

Glycomics or the study of structure-function relationships of complex glycans has reshaped post-genomics biology. Glycans mediate fundamental biological functions via their specific interactions with a variety of proteins. Recognizing the importance of glycomics, large-scale research initiatives such as the Consortium for Functional Glycomics (CFG) were established to address these challenges. Over the past decade, the Consortium for Functional Glycomics (CFG) has generated novel reagents and technologies for glycomics analyses, which in turn have led to generation of diverse datasets. These datasets have contributed to understanding glycan diversity and structure-function relationships at molecular (glycan-protein interactions), cellular (gene expression and glycan analysis), and whole organism (mouse phenotyping) levels. Among these analyses and datasets, screening of glycan-protein interactions on glycan array platforms has gained much prominence and has contributed to cross-disciplinary realization of the importance of glycomics in areas such as immunology, infectious diseases, cancer biomarkers, etc. This manuscript outlines methodologies for capturing data from glycan array experiments and online tools to access and visualize glycan array data implemented at the CFG.
Large-scale modelling of the divergent spectrin repeats in nesprins: giant modular proteins.

PubMed

Autore, Flavia; Pfuhl, Mark; Quan, Xueping; Williams, Aisling; Roberts, Roland G; Shanahan, Catherine M; Fraternali, Franca

2013-01-01

Nesprin-1 and nesprin-2 are nuclear envelope (NE) proteins characterized by a common structure of an SR (spectrin repeat) rod domain and a C-terminal transmembrane KASH [Klarsicht-ANC-Syne-homology] domain and display N-terminal actin-binding CH (calponin homology) domains. Mutations in these proteins have been described in Emery-Dreifuss muscular dystrophy and attributed to disruptions of interactions at the NE with nesprins binding partners, lamin A/C and emerin. Evolutionary analysis of the rod domains of the nesprins has shown that they are almost entirely composed of unbroken SR-like structures. We present a bioinformatical approach to accurate definition of the boundaries of each SR by comparison with canonical SR structures, allowing for a large-scale homology modelling of the 74 nesprin-1 and 56 nesprin-2 SRs. The exposed and evolutionary conserved residues identify important pbs for protein-protein interactions that can guide tailored binding experiments. Most importantly, the bioinformatics analyses and the 3D models have been central to the design of selected constructs for protein expression. 1D NMR and CD spectra have been performed of the expressed SRs, showing a folded, stable, high content α-helical structure, typical of SRs. Molecular Dynamics simulations have been performed to study the structural and elastic properties of consecutive SRs, revealing insights in the mechanical properties adopted by these modules in the cell.
Structural characterization of the α-mating factor prepro-peptide for secretion of recombinant proteins in Pichia pastoris.

PubMed

Chahal, Sabreen; Wei, Peter; Moua, Pachai; Park, Sung Pil James; Kwon, Janet; Patel, Arth; Vu, Anthony T; Catolico, Jason A; Tsai, Yu Fang Tina; Shaheen, Nadia; Chu, Tiffany T; Tam, Vivian; Khan, Zill-E-Huma; Joo, Hyun Henry; Xue, Liang; Lin-Cereghino, Joan; Tsai, Jerry W; Lin-Cereghino, Geoff P

2017-01-20

The methylotrophic yeast Pichia pastoris has been used extensively for expressing recombinant proteins because it combines the ease of genetic manipulation, the ability to provide complex posttranslational modifications and the capacity for efficient protein secretion. The most successful and commonly used secretion signal leader in Pichia pastoris has been the alpha mating factor (MATα) prepro secretion signal. However, limitations exist as some proteins cannot be secreted efficiently, leading to strategies to enhance secretion efficiency by modifying the secretion signal leader. Based on a Jpred secondary structure prediction and knob-socket modeling of tertiary structure, numerous deletions and duplications of the MATα prepro leader were engineered to evaluate the correlation between predicted secondary structure and the secretion level of the reporters horseradish peroxidase (HRP) and Candida antarctica lipase B. In addition, circular dichroism analyses were completed for the wild type and several mutant pro-peptides to evaluate actual differences in secondary structure. The results lead to a new model of MATα pro-peptide signal leader, which suggests that the N and C-termini of MATα pro-peptide need to be presented in a specific orientation for proper interaction with the cellular secretion machinery and for efficient protein secretion. Copyright Â© 2016 Elsevier B.V. All rights reserved.
A structural analysis of the AAA+ domains in Saccharomyces cerevisiae cytoplasmic dynein

PubMed Central

Gleave, Emma S.; Schmidt, Helgo; Carter, Andrew P.

2014-01-01

Dyneins are large protein complexes that act as microtubule based molecular motors. The dynein heavy chain contains a motor domain which is a member of the AAA+ protein family (ATPases Associated with diverse cellular Activities). Proteins of the AAA+ family show a diverse range of functionalities, but share a related core AAA+ domain, which often assembles into hexameric rings. Dynein is unusual because it has all six AAA+ domains linked together, in one long polypeptide. The dynein motor domain generates movement by coupling ATP driven conformational changes in the AAA+ ring to the swing of a motile element called the linker. Dynein binds to its microtubule track via a long antiparallel coiled-coil stalk that emanates from the AAA+ ring. Recently the first high resolution structures of the dynein motor domain were published. Here we provide a detailed structural analysis of the six AAA+ domains using our Saccharomycescerevisiae crystal structure. We describe how structural similarities in the dynein AAA+ domains suggest they share a common evolutionary origin. We analyse how the different AAA+ domains have diverged from each other. We discuss how this is related to the function of dynein as a motor protein and how the AAA+ domains of dynein compare to those of other AAA+ proteins. PMID:24680784
Order within disorder: Aggrecan chondroitin sulphate-attachment region provides new structural insights into protein sequences classified as disordered

PubMed Central

Jowitt, Thomas A; Murdoch, Alan D; Baldock, Clair; Berry, Richard; Day, Joanna M; Hardingham, Timothy E

2010-01-01

Structural investigation of proteins containing large stretches of sequences without predicted secondary structure is the focus of much increased attention. Here, we have produced an unglycosylated 30 kDa peptide from the chondroitin sulphate (CS)-attachment region of human aggrecan (CS-peptide), which was predicted to be intrinsically disordered and compared its structure with the adjacent aggrecan G3 domain. Biophysical analyses, including analytical ultracentrifugation, light scattering, and circular dichroism showed that the CS-peptide had an elongated and stiffened conformation in contrast to the globular G3 domain. The results suggested that it contained significant secondary structure, which was sensitive to urea, and we propose that the CS-peptide forms an elongated wormlike molecule based on a dynamic range of energetically equivalent secondary structures stabilized by hydrogen bonds. The dimensions of the structure predicted from small-angle X-ray scattering analysis were compatible with EM images of fully glycosylated aggrecan and a partly glycosylated aggrecan CS2-G3 construct. The semiordered structure identified in CS-peptide was not predicted by common structural algorithms and identified a potentially distinct class of semiordered structure within sequences currently identified as disordered. Sequence comparisons suggested some evidence for comparable structures in proteins encoded by other genes (PRG4, MUC5B, and CBP). The function of these semiordered sequences may serve to spatially position attached folded modules and/or to present polypeptides for modification, such as glycosylation, and to provide templates for the multiple pleiotropic interactions proposed for disordered proteins. Proteins 2010. © 2010 Wiley-Liss, Inc. PMID:20806220

A Novel Protein Interaction between Nucleotide Binding Domain of Hsp70 and p53 Motif

PubMed Central

Elengoe, Asita; Naser, Mohammed Abu; Hamdan, Salehhuddin

2015-01-01

Currently, protein interaction of Homo sapiens nucleotide binding domain (NBD) of heat shock 70 kDa protein (PDB: 1HJO) with p53 motif remains to be elucidated. The NBD-p53 motif complex enhances the p53 stabilization, thereby increasing the tumor suppression activity in cancer treatment. Therefore, we identified the interaction between NBD and p53 using STRING version 9.1 program. Then, we modeled the three-dimensional structure of p53 motif through homology modeling and determined the binding affinity and stability of NBD-p53 motif complex structure via molecular docking and dynamics (MD) simulation. Human DNA binding domain of p53 motif (SCMGGMNR) retrieved from UniProt (UniProtKB: P04637) was docked with the NBD protein, using the Autodock version 4.2 program. The binding energy and intermolecular energy for the NBD-p53 motif complex were −0.44 Kcal/mol and −9.90 Kcal/mol, respectively. Moreover, RMSD, RMSF, hydrogen bonds, salt bridge, and secondary structure analyses revealed that the NBD protein had a strong bond with p53 motif and the protein-ligand complex was stable. Thus, the current data would be highly encouraging for designing Hsp70 structure based drug in cancer therapy. PMID:26098630
A Novel Protein Interaction between Nucleotide Binding Domain of Hsp70 and p53 Motif.

PubMed

Elengoe, Asita; Naser, Mohammed Abu; Hamdan, Salehhuddin

2015-01-01

Currently, protein interaction of Homo sapiens nucleotide binding domain (NBD) of heat shock 70 kDa protein (PDB: 1HJO) with p53 motif remains to be elucidated. The NBD-p53 motif complex enhances the p53 stabilization, thereby increasing the tumor suppression activity in cancer treatment. Therefore, we identified the interaction between NBD and p53 using STRING version 9.1 program. Then, we modeled the three-dimensional structure of p53 motif through homology modeling and determined the binding affinity and stability of NBD-p53 motif complex structure via molecular docking and dynamics (MD) simulation. Human DNA binding domain of p53 motif (SCMGGMNR) retrieved from UniProt (UniProtKB: P04637) was docked with the NBD protein, using the Autodock version 4.2 program. The binding energy and intermolecular energy for the NBD-p53 motif complex were -0.44 Kcal/mol and -9.90 Kcal/mol, respectively. Moreover, RMSD, RMSF, hydrogen bonds, salt bridge, and secondary structure analyses revealed that the NBD protein had a strong bond with p53 motif and the protein-ligand complex was stable. Thus, the current data would be highly encouraging for designing Hsp70 structure based drug in cancer therapy.
Same but not alike: Structure, flexibility and energetics of domains in multi-domain proteins are influenced by the presence of other domains

PubMed Central

Vishwanath, Sneha

2018-01-01

The majority of the proteins encoded in the genomes of eukaryotes contain more than one domain. Reasons for high prevalence of multi-domain proteins in various organisms have been attributed to higher stability and functional and folding advantages over single-domain proteins. Despite these advantages, many proteins are composed of only one domain while their homologous domains are part of multi-domain proteins. In the study presented here, differences in the properties of protein domains in single-domain and multi-domain systems and their influence on functions are discussed. We studied 20 pairs of identical protein domains, which were crystallized in two forms (a) tethered to other proteins domains and (b) tethered to fewer protein domains than (a) or not tethered to any protein domain. Results suggest that tethering of domains in multi-domain proteins influences the structural, dynamic and energetic properties of the constituent protein domains. 50% of the protein domain pairs show significant structural deviations while 90% of the protein domain pairs show differences in dynamics and 12% of the residues show differences in the energetics. To gain further insights on the influence of tethering on the function of the domains, 4 pairs of homologous protein domains, where one of them is a full-length single-domain protein and the other protein domain is a part of a multi-domain protein, were studied. Analyses showed that identical and structurally equivalent functional residues show differential dynamics in homologous protein domains; though comparable dynamics between in-silico generated chimera protein and multi-domain proteins were observed. From these observations, the differences observed in the functions of homologous proteins could be attributed to the presence of tethered domain. Overall, we conclude that tethered domains in multi-domain proteins not only provide stability or folding advantages but also influence pathways resulting in differences in function or regulatory properties. PMID:29432415
Same but not alike: Structure, flexibility and energetics of domains in multi-domain proteins are influenced by the presence of other domains.

PubMed

Vishwanath, Sneha; de Brevern, Alexandre G; Srinivasan, Narayanaswamy

2018-02-01

The majority of the proteins encoded in the genomes of eukaryotes contain more than one domain. Reasons for high prevalence of multi-domain proteins in various organisms have been attributed to higher stability and functional and folding advantages over single-domain proteins. Despite these advantages, many proteins are composed of only one domain while their homologous domains are part of multi-domain proteins. In the study presented here, differences in the properties of protein domains in single-domain and multi-domain systems and their influence on functions are discussed. We studied 20 pairs of identical protein domains, which were crystallized in two forms (a) tethered to other proteins domains and (b) tethered to fewer protein domains than (a) or not tethered to any protein domain. Results suggest that tethering of domains in multi-domain proteins influences the structural, dynamic and energetic properties of the constituent protein domains. 50% of the protein domain pairs show significant structural deviations while 90% of the protein domain pairs show differences in dynamics and 12% of the residues show differences in the energetics. To gain further insights on the influence of tethering on the function of the domains, 4 pairs of homologous protein domains, where one of them is a full-length single-domain protein and the other protein domain is a part of a multi-domain protein, were studied. Analyses showed that identical and structurally equivalent functional residues show differential dynamics in homologous protein domains; though comparable dynamics between in-silico generated chimera protein and multi-domain proteins were observed. From these observations, the differences observed in the functions of homologous proteins could be attributed to the presence of tethered domain. Overall, we conclude that tethered domains in multi-domain proteins not only provide stability or folding advantages but also influence pathways resulting in differences in function or regulatory properties.
Protein homology model refinement by large-scale energy optimization.

PubMed

Park, Hahnbeom; Ovchinnikov, Sergey; Kim, David E; DiMaio, Frank; Baker, David

2018-03-20

Proteins fold to their lowest free-energy structures, and hence the most straightforward way to increase the accuracy of a partially incorrect protein structure model is to search for the lowest-energy nearby structure. This direct approach has met with little success for two reasons: first, energy function inaccuracies can lead to false energy minima, resulting in model degradation rather than improvement; and second, even with an accurate energy function, the search problem is formidable because the energy only drops considerably in the immediate vicinity of the global minimum, and there are a very large number of degrees of freedom. Here we describe a large-scale energy optimization-based refinement method that incorporates advances in both search and energy function accuracy that can substantially improve the accuracy of low-resolution homology models. The method refined low-resolution homology models into correct folds for 50 of 84 diverse protein families and generated improved models in recent blind structure prediction experiments. Analyses of the basis for these improvements reveal contributions from both the improvements in conformational sampling techniques and the energy function.
Classification of the treble clef zinc finger: noteworthy lessons for structure and function evolution.

PubMed

Kaur, Gurmeet; Subramanian, Srikrishna

2016-08-26

Treble clef (TC) zinc fingers constitute a large fold-group of structural zinc-binding protein domains that mediate numerous cellular functions. We have analysed the sequence, structure, and function relationships among all TCs in the Protein Data Bank. This led to the identification of novel TCs, such as lsr2, YggX and TFIIIC τ 60 kDa subunit, and prediction of a nuclease-like function for the DUF1364 family. The structural malleability of TCs is evident from the many examples with variations to the core structural elements of the fold. We observe domains wherein the structural core of the TC fold is circularly permuted, and also some examples where the overall fold resembles both the TC motif and another unrelated fold. All extant TC families do not share a monophyletic origin, as several TC proteins are known to have been present in the last universal common ancestor and the last eukaryotic common ancestor. We identify several TCs where the zinc-chelating site and residues are not merely responsible for structure stabilization but also perform other functions, such as being redox active in C1B domain of protein kinase C, a nucleophilic acceptor in Ada and catalytic in organomercurial lyase, MerB.
Classification of the treble clef zinc finger: noteworthy lessons for structure and function evolution

NASA Astrophysics Data System (ADS)

Kaur, Gurmeet; Subramanian, Srikrishna

2016-08-01

Treble clef (TC) zinc fingers constitute a large fold-group of structural zinc-binding protein domains that mediate numerous cellular functions. We have analysed the sequence, structure, and function relationships among all TCs in the Protein Data Bank. This led to the identification of novel TCs, such as lsr2, YggX and TFIIIC τ 60 kDa subunit, and prediction of a nuclease-like function for the DUF1364 family. The structural malleability of TCs is evident from the many examples with variations to the core structural elements of the fold. We observe domains wherein the structural core of the TC fold is circularly permuted, and also some examples where the overall fold resembles both the TC motif and another unrelated fold. All extant TC families do not share a monophyletic origin, as several TC proteins are known to have been present in the last universal common ancestor and the last eukaryotic common ancestor. We identify several TCs where the zinc-chelating site and residues are not merely responsible for structure stabilization but also perform other functions, such as being redox active in C1B domain of protein kinase C, a nucleophilic acceptor in Ada and catalytic in organomercurial lyase, MerB.
Chemical Ligation of Folded Recombinant Proteins: Segmental Isotopic Labeling of Domains for NMR Studies

NASA Astrophysics Data System (ADS)

Xu, Rong; Ayers, Brenda; Cowburn, David; Muir, Tom W.

1999-01-01

A convenient in vitro chemical ligation strategy has been developed that allows folded recombinant proteins to be joined together. This strategy permits segmental, selective isotopic labeling of the product. The src homology type 3 and 2 domains (SH3 and SH2) of Abelson protein tyrosine kinase, which constitute the regulatory apparatus of the protein, were individually prepared in reactive forms that can be ligated together under normal protein-folding conditions to form a normal peptide bond at the ligation junction. This strategy was used to prepare NMR sample quantities of the Abelson protein tyrosine kinase-SH(32) domain pair, in which only one of the domains was labeled with 15N Mass spectrometry and NMR analyses were used to confirm the structure of the ligated protein, which was also shown to have appropriate ligand-binding properties. The ability to prepare recombinant proteins with selectively labeled segments having a single-site mutation, by using a combination of expression of fusion proteins and chemical ligation in vitro, will increase the size limits for protein structural determination in solution with NMR methods. In vitro chemical ligation of expressed protein domains will also provide a combinatorial approach to the synthesis of linked protein domains.
ProteomeVis: a web app for exploration of protein properties from structure to sequence evolution across organisms' proteomes.

PubMed

Razban, Rostam M; Gilson, Amy I; Durfee, Niamh; Strobelt, Hendrik; Dinkla, Kasper; Choi, Jeong-Mo; Pfister, Hanspeter; Shakhnovich, Eugene I

2018-05-08

Protein evolution spans time scales and its effects span the length of an organism. A web app named ProteomeVis is developed to provide a comprehensive view of protein evolution in the S. cerevisiae and E. coli proteomes. ProteomeVis interactively creates protein chain graphs, where edges between nodes represent structure and sequence similarities within user-defined ranges, to study the long time scale effects of protein structure evolution. The short time scale effects of protein sequence evolution are studied by sequence evolutionary rate (ER) correlation analyses with protein properties that span from the molecular to the organismal level. We demonstrate the utility and versatility of ProteomeVis by investigating the distribution of edges per node in organismal protein chain universe graphs (oPCUGs) and putative ER determinants. S. cerevisiae and E. coli oPCUGs are scale-free with scaling constants of 1.79 and 1.56, respectively. Both scaling constants can be explained by a previously reported theoretical model describing protein structure evolution (Dokholyan et al., 2002). Protein abundance most strongly correlates with ER among properties in ProteomeVis, with Spearman correlations of -0.49 (p-value<10-10) and -0.46 (p-value<10-10) for S. cerevisiae and E. coli, respectively. This result is consistent with previous reports that found protein expression to be the most important ER determinant (Zhang and Yang, 2015). ProteomeVis is freely accessible at http://proteomevis.chem.harvard.edu. Supplementary data are available at Bioinformatics. shakhnovich@chemistry.harvard.edu.
NMR studies of a channel protein without membranes: structure and dynamics of water-solubilized KcsA.

PubMed

Ma, Dejian; Tillman, Tommy S; Tang, Pei; Meirovitch, Eva; Eckenhoff, Roderic; Carnini, Anna; Xu, Yan

2008-10-28

Structural studies of polytopic membrane proteins are often hampered by the vagaries of these proteins in membrane mimetic environments and by the difficulties in handling them with conventional techniques. Designing and creating water-soluble analogues with preserved native structures offer an attractive alternative. We report here solution NMR studies of WSK3, a water-soluble analogue of the potassium channel KcsA. The WSK3 NMR structure (PDB ID code 2K1E) resembles the KcsA crystal structures, validating the approach. By more stringent comparison criteria, however, the introduction of several charged residues aimed at improving water solubility seems to have led to the possible formations of a few salt bridges and hydrogen bonds not present in the native structure, resulting in slight differences in the structure of WSK3 relative to KcsA. NMR dynamics measurements show that WSK3 is highly flexible in the absence of a lipid environment. Reduced spectral density mapping and model-free analyses reveal dynamic characteristics consistent with an isotropically tumbling tetramer experiencing slow (nanosecond) motions with unusually low local ordering. An altered hydrogen-bond network near the selectivity filter and the pore helix, and the intrinsically dynamic nature of the selectivity filter, support the notion that this region is crucial for slow inactivation. Our results have implications not only for the design of water-soluble analogues of membrane proteins but also for our understanding of the basic determinants of intrinsic protein structure and dynamics.
Structural Basis for Interactions Between Contactin Family Members and Protein-tyrosine Phosphatase Receptor Type G in Neural Tissues.

PubMed

Nikolaienko, Roman M; Hammel, Michal; Dubreuil, Véronique; Zalmai, Rana; Hall, David R; Mehzabeen, Nurjahan; Karuppan, Sebastian J; Harroch, Sheila; Stella, Salvatore L; Bouyain, Samuel

2016-10-07

Protein-tyrosine phosphatase receptor type G (RPTPγ/PTPRG) interacts in vitro with contactin-3-6 (CNTN3-6), a group of glycophosphatidylinositol-anchored cell adhesion molecules involved in the wiring of the nervous system. In addition to PTPRG, CNTNs associate with multiple transmembrane proteins and signal inside the cell via cis-binding partners to alleviate the absence of an intracellular region. Here, we use comprehensive biochemical and structural analyses to demonstrate that PTPRG·CNTN3-6 complexes share similar binding affinities and a conserved arrangement. Furthermore, as a first step to identifying PTPRG·CNTN complexes in vivo, we found that PTPRG and CNTN3 associate in the outer segments of mouse rod photoreceptor cells. In particular, PTPRG and CNTN3 form cis-complexes at the surface of photoreceptors yet interact in trans when expressed on the surfaces of apposing cells. Further structural analyses suggest that all CNTN ectodomains adopt a bent conformation and might lie parallel to the cell surface to accommodate these cis and trans binding modes. Taken together, these studies identify a PTPRG·CNTN complex in vivo and provide novel insights into PTPRG- and CNTN-mediated signaling. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
The Effect of Differentially Designed Fusion Proteins to Elicit Efficient Anti-human Thyroid Stimulating Hormone Immune Responses.

PubMed

Mard-Soltani, Maysam; Rasaee, Mohamad Javad; Khalili, Saeed; Sheikhi, Abdol-Karim; Hedayati, Mehdi; Ghaderi-Zefrehi, Hossein; Alasvand, Milad

2018-04-01

The production of human thyroid stimulating hormone (hTSH) immunoassays requires specific antibodies against hTSH which is a cumbersome process. Therefore, producing specific polyclonal antibodies against engineered recombinant fusion hTSH antigens would be of great significance. The best immunogenic region of the hTSH was selected based on in silico analyses and equipped with two different fusions. Standard methods were used for protein expression, purification, verification, structural evaluation, and immunizations of the white New Zealand rabbits. Ultimately, immunized serums were used for antibody titration, purification and characterization (specificity, sensitivity and cross reactivity). The desired antigens were successfully designed, sub-cloned, expressed, confirmed and used for in vivo immunization. Structural analyses indicated that only the bigger antigen has showed changed 2 dimensional (2D) and 3D structural properties in comparison to the smaller antigen. The raised polyclonal antibodies were capable of specific and sensitive hTSH detection, while the cross reactivity with the other members of the glycoprotein hormone family was minimum and negligible. The fusion which was solely composed of the tetanus toxin epitopes led to better protein folding and was capable of immunizing the host animals resulting into high titer antibody. Therefore, the minimal fusion sequences seem to be more effective in eliciting specific antibody responses.
Multiple protein-protein interactions converging on the Prp38 protein during activation of the human spliceosome.

PubMed

Schütze, Tonio; Ulrich, Alexander K C; Apelt, Luise; Will, Cindy L; Bartlick, Natascha; Seeger, Martin; Weber, Gert; Lührmann, Reinhard; Stelzl, Ulrich; Wahl, Markus C

2016-02-01

Spliceosomal Prp38 proteins contain a conserved amino-terminal domain, but only higher eukaryotic orthologs also harbor a carboxy-terminal RS domain, a hallmark of splicing regulatory SR proteins. We show by crystal structure analysis that the amino-terminal domain of human Prp38 is organized around three pairs of antiparallel α-helices and lacks similarities to RNA-binding domains found in canonical SR proteins. Instead, yeast two-hybrid analyses suggest that the amino-terminal domain is a versatile protein-protein interaction hub that possibly binds 12 other spliceosomal proteins, most of which are recruited at the same stage as Prp38. By quantitative, alanine surface-scanning two-hybrid screens and biochemical analyses we delineated four distinct interfaces on the Prp38 amino-terminal domain. In vitro interaction assays using recombinant proteins showed that Prp38 can bind at least two proteins simultaneously via two different interfaces. Addition of excess Prp38 amino-terminal domain to in vitro splicing assays, but not of an interaction-deficient mutant, stalled splicing at a precatalytic stage. Our results show that human Prp38 is an unusual SR protein, whose amino-terminal domain is a multi-interface protein-protein interaction platform that might organize the relative positioning of other proteins during splicing. © 2016 Schütze et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Effect of iron oxide loading on magnetoferritin structure in solution as revealed by SAXS and SANS.

PubMed

Melníková, L; Petrenko, V I; Avdeev, M V; Garamus, V M; Almásy, L; Ivankov, O I; Bulavin, L A; Mitróová, Z; Kopčanský, P

2014-11-01

Synthetic biological macromolecule of magnetoferritin containing an iron oxide core inside a protein shell (apoferritin) is prepared with different content of iron. Its structure in aqueous solution is analysed by small-angle synchrotron X-ray (SAXS) and neutron (SANS) scattering. The loading factor (LF) defined as the average number of iron atoms per protein is varied up to LF=800. With an increase of the LF, the scattering curves exhibit a relative increase in the total scattered intensity, a partial smearing and a shift of the match point in the SANS contrast variation data. The analysis shows an increase in the polydispersity of the proteins and a corresponding effective increase in the relative content of magnetic material against the protein moiety of the shell with the LF growth. At LFs above ∼150, the apoferritin shell undergoes structural changes, which is strongly indicative of the fact that the shell stability is affected by iron oxide presence. Copyright © 2014 Elsevier B.V. All rights reserved.
Structure and functional dynamics of the mitochondrial Fe/S cluster synthesis complex.

PubMed

Boniecki, Michal T; Freibert, Sven A; Mühlenhoff, Ulrich; Lill, Roland; Cygler, Miroslaw

2017-11-03

Iron-sulfur (Fe/S) clusters are essential protein cofactors crucial for many cellular functions including DNA maintenance, protein translation, and energy conversion. De novo Fe/S cluster synthesis occurs on the mitochondrial scaffold protein ISCU and requires cysteine desulfurase NFS1, ferredoxin, frataxin, and the small factors ISD11 and ACP (acyl carrier protein). Both the mechanism of Fe/S cluster synthesis and function of ISD11-ACP are poorly understood. Here, we present crystal structures of three different NFS1-ISD11-ACP complexes with and without ISCU, and we use SAXS analyses to define the 3D architecture of the complete mitochondrial Fe/S cluster biosynthetic complex. Our structural and biochemical studies provide mechanistic insights into Fe/S cluster synthesis at the catalytic center defined by the active-site Cys of NFS1 and conserved Cys, Asp, and His residues of ISCU. We assign specific regulatory rather than catalytic roles to ISD11-ACP that link Fe/S cluster synthesis with mitochondrial lipid synthesis and cellular energy status.
Structural and biophysical properties of h-FANCI ARM repeat protein.

PubMed

Siddiqui, Mohd Quadir; Choudhary, Rajan Kumar; Thapa, Pankaj; Kulkarni, Neha; Rajpurohit, Yogendra S; Misra, Hari S; Gadewal, Nikhil; Kumar, Satish; Hasan, Syed K; Varma, Ashok K

2017-11-01

Fanconi anemia complementation groups - I (FANCI) protein facilitates DNA ICL (Inter-Cross-link) repair and plays a crucial role in genomic integrity. FANCI is a 1328 amino acids protein which contains armadillo (ARM) repeats and EDGE motif at the C-terminus. ARM repeats are functionally diverse and evolutionarily conserved domain that plays a pivotal role in protein-protein and protein-DNA interactions. Considering the importance of ARM repeats, we have explored comprehensive in silico and in vitro approach to examine folding pattern. Size exclusion chromatography, dynamic light scattering (DLS) and glutaraldehyde crosslinking studies suggest that FANCI ARM repeat exist as monomer as well as in oligomeric forms. Circular dichroism (CD) and fluorescence spectroscopy results demonstrate that protein has predominantly α- helices and well-folded tertiary structure. DNA binding was analysed using electrophoretic mobility shift assay by autoradiography. Temperature-dependent CD, Fluorescence spectroscopy and DLS studies concluded that protein unfolds and start forming oligomer from 30°C. The existence of stable portion within FANCI ARM repeat was examined using limited proteolysis and mass spectrometry. The normal mode analysis, molecular dynamics and principal component analysis demonstrated that helix-turn-helix (HTH) motif present in ARM repeat is highly dynamic and has anti-correlated motion. Furthermore, FANCI ARM repeat has HTH structural motif which binds to double-stranded DNA.
S-layers at second glance? Altiarchaeal grappling hooks (hami) resemble archaeal S-layer proteins in structure and sequence

PubMed Central

Perras, Alexandra K.; Daum, Bertram; Ziegler, Christine; Takahashi, Lynelle K.; Ahmed, Musahid; Wanner, Gerhard; Klingl, Andreas; Leitinger, Gerd; Kolb-Lenz, Dagmar; Gribaldo, Simonetta; Auerbach, Anna; Mora, Maximilian; Probst, Alexander J.; Bellack, Annett; Moissl-Eichinger, Christine

2015-01-01

The uncultivated “Candidatus Altiarchaeum hamiconexum” (formerly known as SM1 Euryarchaeon) carries highly specialized nano-grappling hooks (“hami”) on its cell surface. Until now little is known about the major protein forming these structured fibrous cell surface appendages, the genes involved or membrane anchoring of these filaments. These aspects were analyzed in depth in this study using environmental transcriptomics combined with imaging methods. Since a laboratory culture of this archaeon is not yet available, natural biofilm samples with high Ca. A. hamiconexum abundance were used for the entire analyses. The filamentous surface appendages spanned both membranes of the cell, which are composed of glycosyl-archaeol. The hami consisted of multiple copies of the same protein, the corresponding gene of which was identified via metagenome-mapped transcriptome analysis. The hamus subunit proteins, which are likely to self-assemble due to their predicted beta sheet topology, revealed no similiarity to known microbial flagella-, archaella-, fimbriae- or pili-proteins, but a high similarity to known S-layer proteins of the archaeal domain at their N-terminal region (44–47% identity). Our results provide new insights into the structure of the unique hami and their major protein and indicate their divergent evolution with S-layer proteins. PMID:26106369
Assembly of the β-Barrel Outer Membrane Proteins in Gram-Negative Bacteria, Mitochondria, and Chloroplasts

PubMed Central

Misra, Rajeev

2012-01-01

In the last decade, there has been an explosion of publications on the assembly of β-barrel outer membrane proteins (OMPs), which carry out diverse cellular functions, including solute transport, protein secretion, and assembly of protein and lipid components of the outer membrane. Of the three outer membrane model systems—Gram-negative bacteria, mitochondria and chloroplasts—research on bacterial and mitochondrial systems has so far led the way in dissecting the β-barrel OMP assembly pathways. Many exciting discoveries have been made, including the identification of β-barrel OMP assembly machineries in bacteria and mitochondria, and potentially the core assembly component in chloroplasts. The atomic structures of all five components of the bacterial β-barrel assembly machinery (BAM) complex, except the β-barrel domain of the core BamA protein, have been solved. Structures reveal that these proteins contain domains/motifs known to facilitate protein-protein interactions, which are at the heart of the assembly pathways. While structural information has been valuable, most of our current understanding of the β-barrel OMP assembly pathways has come from genetic, molecular biology, and biochemical analyses. This paper provides a comparative account of the β-barrel OMP assembly pathways in Gram-negative bacteria, mitochondria, and chloroplasts. PMID:27335668
Applications of NMR and computational methodologies to study protein dynamics.

PubMed

Narayanan, Chitra; Bafna, Khushboo; Roux, Louise D; Agarwal, Pratul K; Doucet, Nicolas

2017-08-15

Overwhelming evidence now illustrates the defining role of atomic-scale protein flexibility in biological events such as allostery, cell signaling, and enzyme catalysis. Over the years, spin relaxation nuclear magnetic resonance (NMR) has provided significant insights on the structural motions occurring on multiple time frames over the course of a protein life span. The present review article aims to illustrate to the broader community how this technique continues to shape many areas of protein science and engineering, in addition to being an indispensable tool for studying atomic-scale motions and functional characterization. Continuing developments in underlying NMR technology alongside software and hardware developments for complementary computational approaches now enable methodologies to routinely provide spatial directionality and structural representations traditionally harder to achieve solely using NMR spectroscopy. In addition to its well-established role in structural elucidation, we present recent examples that illustrate the combined power of selective isotope labeling, relaxation dispersion experiments, chemical shift analyses, and computational approaches for the characterization of conformational sub-states in proteins and enzymes. Copyright © 2017 Elsevier Inc. All rights reserved.
Origin and diversification of leucine-rich repeat receptor-like protein kinase (LRR-RLK) genes in plants.

PubMed

Liu, Ping-Li; Du, Liang; Huang, Yuan; Gao, Shu-Min; Yu, Meng

2017-02-07

Leucine-rich repeat receptor-like protein kinases (LRR-RLKs) are the largest group of receptor-like kinases in plants and play crucial roles in development and stress responses. The evolutionary relationships among LRR-RLK genes have been investigated in flowering plants; however, no comprehensive studies have been performed for these genes in more ancestral groups. The subfamily classification of LRR-RLK genes in plants, the evolutionary history and driving force for the evolution of each LRR-RLK subfamily remain to be understood. We identified 119 LRR-RLK genes in the Physcomitrella patens moss genome, 67 LRR-RLK genes in the Selaginella moellendorffii lycophyte genome, and no LRR-RLK genes in five green algae genomes. Furthermore, these LRR-RLK sequences, along with previously reported LRR-RLK sequences from Arabidopsis thaliana and Oryza sativa, were subjected to evolutionary analyses. Phylogenetic analyses revealed that plant LRR-RLKs belong to 19 subfamilies, eighteen of which were established in early land plants, and one of which evolved in flowering plants. More importantly, we found that the basic structures of LRR-RLK genes for most subfamilies are established in early land plants and conserved within subfamilies and across different plant lineages, but divergent among subfamilies. In addition, most members of the same subfamily had common protein motif compositions, whereas members of different subfamilies showed variations in protein motif compositions. The unique gene structure and protein motif compositions of each subfamily differentiate the subfamily classifications and, more importantly, provide evidence for functional divergence among LRR-RLK subfamilies. Maximum likelihood analyses showed that some sites within four subfamilies were under positive selection. Much of the diversity of plant LRR-RLK genes was established in early land plants. Positive selection contributed to the evolution of a few LRR-RLK subfamilies.

C-terminal, endoplasmic reticulum-lumenal domain of prosurfactant protein C - structural features and membrane interactions.

PubMed

Casals, Cristina; Johansson, Hanna; Saenz, Alejandra; Gustafsson, Magnus; Alfonso, Carlos; Nordling, Kerstin; Johansson, Jan

2008-02-01

Surfactant protein C (SP-C) constitutes the transmembrane part of prosurfactant protein C (proSP-C) and is alpha-helical in its native state. The C-terminal part of proSP-C (CTC) is localized in the endoplasmic reticulum lumen and binds to misfolded (beta-strand) SP-C, thereby preventing its aggregation and amyloid fibril formation. In this study, we investigated the structure of recombinant human CTC and the effects of CTC-membrane interaction on protein structure. CTC forms noncovalent trimers and supratrimeric oligomers. It contains two intrachain disulfide bridges, and its secondary structure is significantly affected by urea or heat only after disulfide reduction. The postulated Brichos domain of CTC, with homologs found in proteins associated with amyloid and proliferative disease, is up to 1000-fold more protected from limited proteolysis than the rest of CTC. The protein exposes hydrophobic surfaces, as determined by CTC binding to the environment-sensitive fluorescent probe 1,1'-bis(4-anilino-5,5'-naphthalenesulfonate). Fluorescence energy transfer experiments further reveal close proximity between bound 1,1'-bis(4-anilino-5,5'-naphthalenesulfonate) and tyrosine residues in CTC, some of which are conserved in all Brichos domains. CTC binds to unilamellar phospholipid vesicles with low micromolar dissociation constants, and differential scanning calorimetry and CD analyses indicate that membrane-bound CTC is less structurally ordered than the unbound protein. The exposed hydrophobic surfaces and the structural disordering that result from interactions with phospholipid membranes suggest a mechanism whereby CTC binds to misfolded SP-C in the endoplasmic reticulum membrane.
Controlled release of functional proteins through designer self-assembling peptide nanofiber hydrogel scaffold

PubMed Central

Koutsopoulos, Sotirios; Unsworth, Larry D.; Nagai, Yusuke; Zhang, Shuguang

2009-01-01

The release kinetics for a variety of proteins of a wide range of molecular mass, hydrodynamic radii, and isoelectric points through a nanofiber hydrogel scaffold consisting of designer self-assembling peptides were studied by using single-molecule fluorescence correlation spectroscopy (FCS). In contrast to classical diffusion experiments, the single-molecule approach allowed for the direct determination of diffusion coefficients for lysozyme, trypsin inhibitor, BSA, and IgG both inside the hydrogel and after being released into the solution. The results of the FCS analyses and the calculated pristine in-gel diffusion coefficients were compared with the values obtained from the Stokes–Einstein equation, Fickian diffusion models, and the literature. The release kinetics suggested that protein diffusion through nanofiber hydrogels depended primarily on the size of the protein. Protein diffusivities decreased, with increasing hydrogel nanofiber density providing a means of controlling the release kinetics. Secondary and tertiary structure analyses and biological assays of the released proteins showed that encapsulation and release did not affect the protein conformation and functionality. Our results show that this biocompatible and injectable designer self-assembling peptide hydrogel system may be useful as a carrier for therapeutic proteins for sustained release applications. PMID:19273853
Building a knowledge-based statistical potential by capturing high-order inter-residue interactions and its applications in protein secondary structure assessment.

PubMed

Li, Yaohang; Liu, Hui; Rata, Ionel; Jakobsson, Eric

2013-02-25

The rapidly increasing number of protein crystal structures available in the Protein Data Bank (PDB) has naturally made statistical analyses feasible in studying complex high-order inter-residue correlations. In this paper, we report a context-based secondary structure potential (CSSP) for assessing the quality of predicted protein secondary structures generated by various prediction servers. CSSP is a sequence-position-specific knowledge-based potential generated based on the potentials of mean force approach, where high-order inter-residue interactions are taken into consideration. The CSSP potential is effective in identifying secondary structure predictions with good quality. In 56% of the targets in the CB513 benchmark, the optimal CSSP potential is able to recognize the native secondary structure or a prediction with Q3 accuracy higher than 90% as best scored in the predicted secondary structures generated by 10 popularly used secondary structure prediction servers. In more than 80% of the CB513 targets, the predicted secondary structures with the lowest CSSP potential values yield higher than 80% Q3 accuracy. Similar performance of CSSP is found on the CASP9 targets as well. Moreover, our computational results also show that the CSSP potential using triplets outperforms the CSSP potential using doublets and is currently better than the CSSP potential using quartets.
Decoding Structural Properties of a Partially Unfolded Protein Substrate: En Route to Chaperone Binding

PubMed Central

Nagpal, Suhani; Tiwari, Satyam; Mapa, Koyeli; Thukral, Lipi

2015-01-01

Many proteins comprising of complex topologies require molecular chaperones to achieve their unique three-dimensional folded structure. The E.coli chaperone, GroEL binds with a large number of unfolded and partially folded proteins, to facilitate proper folding and prevent misfolding and aggregation. Although the major structural components of GroEL are well defined, scaffolds of the non-native substrates that determine chaperone-mediated folding have been difficult to recognize. Here we performed all-atomistic and replica-exchange molecular dynamics simulations to dissect non-native ensemble of an obligate GroEL folder, DapA. Thermodynamics analyses of unfolding simulations revealed populated intermediates with distinct structural characteristics. We found that surface exposed hydrophobic patches are significantly increased, primarily contributed from native and non-native β-sheet elements. We validate the structural properties of these conformers using experimental data, including circular dichroism (CD), 1-anilinonaphthalene-8-sulfonic acid (ANS) binding measurements and previously reported hydrogen-deutrium exchange coupled to mass spectrometry (HDX-MS). Further, we constructed network graphs to elucidate long-range intra-protein connectivity of native and intermediate topologies, demonstrating regions that serve as central “hubs”. Overall, our results implicate that genomic variations (or mutations) in the distinct regions of protein structures might disrupt these topological signatures disabling chaperone-mediated folding, leading to formation of aggregates. PMID:26394388
Assessing the Chemical Accuracy of Protein Structures via Peptide Acidity

PubMed Central

Anderson, Janet S.; Hernández, Griselda; LeMaster, David M.

2012-01-01

Although the protein native state is a Boltzmann conformational ensemble, practical applications often require a representative model from the most populated region of that distribution. The acidity of the backbone amides, as reflected in hydrogen exchange rates, is exquisitely sensitive to the surrounding charge and dielectric volume distribution. For each of four proteins, three independently determined X-ray structures of differing crystallographic resolution were used to predict exchange for the static solvent-exposed amide hydrogens. The average correlation coefficients range from 0.74 for ubiquitin to 0.93 for Pyrococcus furiosus rubredoxin, reflecting the larger range of experimental exchange rates exhibited by the latter protein. The exchange prediction errors modestly correlate with the crystallographic resolution. MODELLER 9v6-derived homology models at ~60% sequence identity (36% identity for chymotrypsin inhibitor CI2) yielded correlation coefficients that are ~0.1 smaller than for the cognate X-ray structures. The most recently deposited NOE-based ubiquitin structure and the original NMR structure of CI2 fail to provide statistically significant predictions of hydrogen exchange. However, the more recent RECOORD refinement study of CI2 yielded predictions comparable to the X-ray and homology model-based analyses. PMID:23182463
A comparative study of cold- and warm-adapted Endonucleases A using sequence analyses and molecular dynamics simulations.

PubMed

Michetti, Davide; Brandsdal, Bjørn Olav; Bon, Davide; Isaksen, Geir Villy; Tiberti, Matteo; Papaleo, Elena

2017-01-01

The psychrophilic and mesophilic endonucleases A (EndA) from Aliivibrio salmonicida (VsEndA) and Vibrio cholera (VcEndA) have been studied experimentally in terms of the biophysical properties related to thermal adaptation. The analyses of their static X-ray structures was no sufficient to rationalize the determinants of their adaptive traits at the molecular level. Thus, we used Molecular Dynamics (MD) simulations to compare the two proteins and unveil their structural and dynamical differences. Our simulations did not show a substantial increase in flexibility in the cold-adapted variant on the nanosecond time scale. The only exception is a more rigid C-terminal region in VcEndA, which is ascribable to a cluster of electrostatic interactions and hydrogen bonds, as also supported by MD simulations of the VsEndA mutant variant where the cluster of interactions was introduced. Moreover, we identified three additional amino acidic substitutions through multiple sequence alignment and the analyses of MD-based protein structure networks. In particular, T120V occurs in the proximity of the catalytic residue H80 and alters the interaction with the residue Y43, which belongs to the second coordination sphere of the Mg2+ ion. This makes T120V an amenable candidate for future experimental mutagenesis.
i3Drefine Software for Protein 3D Structure Refinement and Its Assessment in CASP10

PubMed Central

Bhattacharya, Debswapna; Cheng, Jianlin

2013-01-01

Protein structure refinement refers to the process of improving the qualities of protein structures during structure modeling processes to bring them closer to their native states. Structure refinement has been drawing increasing attention in the community-wide Critical Assessment of techniques for Protein Structure prediction (CASP) experiments since its addition in 8th CASP experiment. During the 9th and recently concluded 10th CASP experiments, a consistent growth in number of refinement targets and participating groups has been witnessed. Yet, protein structure refinement still remains a largely unsolved problem with majority of participating groups in CASP refinement category failed to consistently improve the quality of structures issued for refinement. In order to alleviate this need, we developed a completely automated and computationally efficient protein 3D structure refinement method, i3Drefine, based on an iterative and highly convergent energy minimization algorithm with a powerful all-atom composite physics and knowledge-based force fields and hydrogen bonding (HB) network optimization technique. In the recent community-wide blind experiment, CASP10, i3Drefine (as ‘MULTICOM-CONSTRUCT’) was ranked as the best method in the server section as per the official assessment of CASP10 experiment. Here we provide the community with free access to i3Drefine software and systematically analyse the performance of i3Drefine in strict blind mode on the refinement targets issued in CASP10 refinement category and compare with other state-of-the-art refinement methods participating in CASP10. Our analysis demonstrates that i3Drefine is only fully-automated server participating in CASP10 exhibiting consistent improvement over the initial structures in both global and local structural quality metrics. Executable version of i3Drefine is freely available at http://protein.rnet.missouri.edu/i3drefine/. PMID:23894517
Detection and sequence/structure mapping of biophysical constraints to protein variation in saturated mutational libraries and protein sequence alignments with a dedicated server.

PubMed

Abriata, Luciano A; Bovigny, Christophe; Dal Peraro, Matteo

2016-06-17

Protein variability can now be studied by measuring high-resolution tolerance-to-substitution maps and fitness landscapes in saturated mutational libraries. But these rich and expensive datasets are typically interpreted coarsely, restricting detailed analyses to positions of extremely high or low variability or dubbed important beforehand based on existing knowledge about active sites, interaction surfaces, (de)stabilizing mutations, etc. Our new webserver PsychoProt (freely available without registration at http://psychoprot.epfl.ch or at http://lucianoabriata.altervista.org/psychoprot/index.html ) helps to detect, quantify, and sequence/structure map the biophysical and biochemical traits that shape amino acid preferences throughout a protein as determined by deep-sequencing of saturated mutational libraries or from large alignments of naturally occurring variants. We exemplify how PsychoProt helps to (i) unveil protein structure-function relationships from experiments and from alignments that are consistent with structures according to coevolution analysis, (ii) recall global information about structural and functional features and identify hitherto unknown constraints to variation in alignments, and (iii) point at different sources of variation among related experimental datasets or between experimental and alignment-based data. Remarkably, metabolic costs of the amino acids pose strong constraints to variability at protein surfaces in nature but not in the laboratory. This and other differences call for caution when extrapolating results from in vitro experiments to natural scenarios in, for example, studies of protein evolution. We show through examples how PsychoProt can be a useful tool for the broad communities of structural biology and molecular evolution, particularly for studies about protein modeling, evolution and design.
The evolution of function within the Nudix homology clan

PubMed Central

Srouji, John R.; Xu, Anting; Park, Annsea; Kirsch, Jack F.

2017-01-01

ABSTRACT The Nudix homology clan encompasses over 80,000 protein domains from all three domains of life, defined by homology to each other. Proteins with a domain from this clan fall into four general functional classes: pyrophosphohydrolases, isopentenyl diphosphate isomerases (IDIs), adenine/guanine mismatch‐specific adenine glycosylases (A/G‐specific adenine glycosylases), and nonenzymatic activities such as protein/protein interaction and transcriptional regulation. The largest group, pyrophosphohydrolases, encompasses more than 100 distinct hydrolase specificities. To understand the evolution of this vast number of activities, we assembled and analyzed experimental and structural data for 205 Nudix proteins collected from the literature. We corrected erroneous functions or provided more appropriate descriptions for 53 annotations described in the Gene Ontology Annotation database in this family, and propose 275 new experimentally‐based annotations. We manually constructed a structure‐guided sequence alignment of 78 Nudix proteins. Using the structural alignment as a seed, we then made an alignment of 347 “select” Nudix homology domains, curated from structurally determined, functionally characterized, or phylogenetically important Nudix domains. Based on our review of Nudix pyrophosphohydrolase structures and specificities, we further analyzed a loop region downstream of the Nudix hydrolase motif previously shown to contact the substrate molecule and possess known functional motifs. This loop region provides a potential structural basis for the functional radiation and evolution of substrate specificity within the hydrolase family. Finally, phylogenetic analyses of the 347 select protein domains and of the complete Nudix homology clan revealed general monophyly with regard to function and a few instances of probable homoplasy. Proteins 2017; 85:775–811. © 2016 Wiley Periodicals, Inc. PMID:27936487
Engineering Aromatic-Aromatic Interactions To Nucleate Folding in Intrinsically Disordered Regions of Proteins.

PubMed

Balakrishnan, Swati; Sarma, Siddhartha P

2017-08-22

Aromatic interactions are an important force in protein folding as they combine the stability of a hydrophobic interaction with the selectivity of a hydrogen bond. Much of our understanding of aromatic interactions comes from "bioinformatics" based analyses of protein structures and from the contribution of these interactions to stabilizing secondary structure motifs in model peptides. In this study, the structural consequences of aromatic interactions on protein folding have been explored in engineered mutants of the molten globule protein apo-cytochrome b 5 . Structural changes from disorder to order due to aromatic interactions in two variants of the protein, viz., WF-cytb5 and FF-cytb5, result in significant long-range secondary and tertiary structure. The results show that 54 and 52% of the residues in WF-cytb5 and FF-cytb5, respectively, occupy ordered regions versus 26% in apo-cytochrome b 5 . The interactions between the aromatic groups are offset-stacked and edge-to-face for the Trp-Phe and Phe-Phe mutants, respectively. Urea denaturation studies indicate that both mutants have a C m higher than that of apo-cytochrome b 5 and are more stable to chaotropic agents than apo-cytochrome b 5 . The introduction of these aromatic residues also results in "trimer" interactions with existing aromatic groups, reaffirming the selectivity of the aromatic interactions. These studies provide insights into the aromatic interactions that drive disorder-to-order transitions in intrinsically disordered regions of proteins and will aid in de novo protein design beyond small peptide scaffolds.
Computing the origin and evolution of the ribosome from its structure — Uncovering processes of macromolecular accretion benefiting synthetic biology

PubMed Central

Caetano-Anollés, Gustavo; Caetano-Anollés, Derek

2015-01-01

Accretion occurs pervasively in nature at widely different timeframes. The process also manifests in the evolution of macromolecules. Here we review recent computational and structural biology studies of evolutionary accretion that make use of the ideographic (historical, retrodictive) and nomothetic (universal, predictive) scientific frameworks. Computational studies uncover explicit timelines of accretion of structural parts in molecular repertoires and molecules. Phylogenetic trees of protein structural domains and proteomes and their molecular functions were built from a genomic census of millions of encoded proteins and associated terminal Gene Ontology terms. Trees reveal a ‘metabolic-first’ origin of proteins, the late development of translation, and a patchwork distribution of proteins in biological networks mediated by molecular recruitment. Similarly, the natural history of ancient RNA molecules inferred from trees of molecular substructures built from a census of molecular features shows patchwork-like accretion patterns. Ideographic analyses of ribosomal history uncover the early appearance of structures supporting mRNA decoding and tRNA translocation, the coevolution of ribosomal proteins and RNA, and a first evolutionary transition that brings ribosomal subunits together into a processive protein biosynthetic complex. Nomothetic structural biology studies of tertiary interactions and ancient insertions in rRNA complement these findings, once concentric layering assumptions are removed. Patterns of coaxial helical stacking reveal a frustrated dynamics of outward and inward ribosomal growth possibly mediated by structural grafting. The early rise of the ribosomal ‘turnstile’ suggests an evolutionary transition in natural biological computation. Results make explicit the need to understand processes of molecular growth and information transfer of macromolecules. PMID:27096056
Phylogeny-Based Systematization of Arabidopsis Proteins with Histone H1 Globular Domain1[OPEN

PubMed Central

Knizewski, Lukasz; Schmidt, Anja; Ginalski, Krzysztof

2017-01-01

H1 (or linker) histones are basic nuclear proteins that possess an evolutionarily conserved nucleosome-binding globular domain, GH1. They perform critical functions in determining the accessibility of chromatin DNA to trans-acting factors. In most metazoan species studied so far, linker histones are highly heterogenous, with numerous nonallelic variants cooccurring in the same cells. The phylogenetic relationships among these variants as well as their structural and functional properties have been relatively well established. This contrasts markedly with the rather limited knowledge concerning the phylogeny and structural and functional roles of an unusually diverse group of GH1-containing proteins in plants. The dearth of information and the lack of a coherent phylogeny-based nomenclature of these proteins can lead to misunderstandings regarding their identity and possible relationships, thereby hampering plant chromatin research. Based on published data and our in silico and high-throughput analyses, we propose a systematization and coherent nomenclature of GH1-containing proteins of Arabidopsis (Arabidopsis thaliana [L.] Heynh) that will be useful for both the identification and structural and functional characterization of homologous proteins from other plant species. PMID:28298478
Phylogeny-Based Systematization of Arabidopsis Proteins with Histone H1 Globular Domain.

PubMed

Kotliński, Maciej; Knizewski, Lukasz; Muszewska, Anna; Rutowicz, Kinga; Lirski, Maciej; Schmidt, Anja; Baroux, Célia; Ginalski, Krzysztof; Jerzmanowski, Andrzej

2017-05-01

H1 (or linker) histones are basic nuclear proteins that possess an evolutionarily conserved nucleosome-binding globular domain, GH1. They perform critical functions in determining the accessibility of chromatin DNA to trans-acting factors. In most metazoan species studied so far, linker histones are highly heterogenous, with numerous nonallelic variants cooccurring in the same cells. The phylogenetic relationships among these variants as well as their structural and functional properties have been relatively well established. This contrasts markedly with the rather limited knowledge concerning the phylogeny and structural and functional roles of an unusually diverse group of GH1-containing proteins in plants. The dearth of information and the lack of a coherent phylogeny-based nomenclature of these proteins can lead to misunderstandings regarding their identity and possible relationships, thereby hampering plant chromatin research. Based on published data and our in silico and high-throughput analyses, we propose a systematization and coherent nomenclature of GH1-containing proteins of Arabidopsis ( Arabidopsis thaliana [L.] Heynh) that will be useful for both the identification and structural and functional characterization of homologous proteins from other plant species. © 2017 American Society of Plant Biologists. All Rights Reserved.
Structures of Human CCL18, CCL3, and CCL4 Reveal Molecular Determinants for Quaternary Structures and Sensitivity to Insulin-Degrading Enzyme

DOE PAGES

Liang, Wenguang G.; Ren, Min; Zhao, Fan; ...

2015-01-27

CC chemokine ligands (CCL) are 8-14 kDa signaling proteins involved in diverse immune functions. While CCLs share similar tertiary structures, oligomerization produces highly diverse quaternary structures that protect chemokines from proteolytic degradation and modulate their functions. CCL18 is closely related to CCL3 and CCL4 with respect to both protein sequence and genomic location, yet CCL18 has distinct biochemical and biophysical properties. Here in this paper, we report a crystal structure of human CCL18 and its oligomerization states in solution based on crystallographic and small angle X-ray scattering (SAXS) analyses. Our data shows that CCL18 adopts an α-helical conformation at itsmore » N-terminus that weakens its dimerization, explaining CCL18’s preference for the monomeric state. Multiple contacts between monomers allow CCL18 to reversibly form a unique open-ended oligomer different from those of CCL3, CCL4, and CCL5. Furthermore, these differences hinge on proline 8, which is conserved in CCL3 and CCL4, but is replaced by lysine in human CCL18. Our structural analyses suggest that a proline 8 to alanine mutation stabilizes a type I β-turn at the N-terminus of CCL4 to prevent dimerization but prevents dimers from making key contacts with each other in CCL3. Thus, the P8A mutation induces depolymerization of CCL3 and CCL4 by distinct mechanisms. Finally, we used structural, biochemical, and functional analyses to unravel why insulin-degrading enzyme (IDE) degrades CCL3 and CCL4 but not CCL18. Lastly, our results elucidate the molecular basis for the oligomerization of three closely related CC chemokines and suggest how oligomerization shapes CCL chemokine function.« less
Structures of human CCL18, CCL3, and CCL4 reveal molecular determinants for quaternary structures and sensitivity to insulin-degrading enzyme.

PubMed

Liang, Wenguang G; Ren, Min; Zhao, Fan; Tang, Wei-Jen

2015-03-27

CC chemokine ligands (CCLs) are 8- to 14-kDa signaling proteins involved in diverse immune functions. While CCLs share similar tertiary structures, oligomerization produces highly diverse quaternary structures that protect chemokines from proteolytic degradation and modulate their functions. CCL18 is closely related to CCL3 and CCL4 with respect to both protein sequence and genomic location, yet CCL18 has distinct biochemical and biophysical properties. Here, we report a crystal structure of human CCL18 and its oligomerization states in solution based on crystallographic and small-angle X-ray scattering analyses. Our data show that CCL18 adopts an α-helical conformation at its N-terminus that weakens its dimerization, explaining CCL18's preference for the monomeric state. Multiple contacts between monomers allow CCL18 to reversibly form a unique open-ended oligomer different from those of CCL3, CCL4, and CCL5. Furthermore, these differences hinge on proline 8, which is conserved in CCL3 and CCL4 but is replaced by lysine in human CCL18. Our structural analyses suggest that a mutation of proline 8 to alanine stabilizes a type 1 β-turn at the N-terminus of CCL4 to prevent dimerization but prevents dimers from making key contacts with each other in CCL3. Thus, the P8A mutation induces depolymerization of CCL3 and CCL4 by distinct mechanisms. Finally, we used structural, biochemical, and functional analyses to unravel why insulin-degrading enzyme degrades CCL3 and CCL4 but not CCL18. Our results elucidate the molecular basis for the oligomerization of three closely related CC chemokines and suggest how oligomerization shapes CCL chemokine function. Copyright © 2015 Elsevier Ltd. All rights reserved.
Computational modeling of RNA 3D structures, with the aid of experimental restraints

PubMed Central

Magnus, Marcin; Matelska, Dorota; Łach, Grzegorz; Chojnowski, Grzegorz; Boniecki, Michal J; Purta, Elzbieta; Dawson, Wayne; Dunin-Horkawicz, Stanislaw; Bujnicki, Janusz M

2014-01-01

In addition to mRNAs whose primary function is transmission of genetic information from DNA to proteins, numerous other classes of RNA molecules exist, which are involved in a variety of functions, such as catalyzing biochemical reactions or performing regulatory roles. In analogy to proteins, the function of RNAs depends on their structure and dynamics, which are largely determined by the ribonucleotide sequence. Experimental determination of high-resolution RNA structures is both laborious and difficult, and therefore, the majority of known RNAs remain structurally uncharacterized. To address this problem, computational structure prediction methods were developed that simulate either the physical process of RNA structure formation (“Greek science” approach) or utilize information derived from known structures of other RNA molecules (“Babylonian science” approach). All computational methods suffer from various limitations that make them generally unreliable for structure prediction of long RNA sequences. However, in many cases, the limitations of computational and experimental methods can be overcome by combining these two complementary approaches with each other. In this work, we review computational approaches for RNA structure prediction, with emphasis on implementations (particular programs) that can utilize restraints derived from experimental analyses. We also list experimental approaches, whose results can be relatively easily used by computational methods. Finally, we describe case studies where computational and experimental analyses were successfully combined to determine RNA structures that would remain out of reach for each of these approaches applied separately. PMID:24785264
SAD phasing of a structure based on cocrystallized iodides using an in-house Cu Kalpha X-ray source: effects of data redundancy and completeness on structure solution.

PubMed

Yogavel, Manickam; Gill, Jasmita; Mishra, Prakash Chandra; Sharma, Amit

2007-08-01

Superoxide dismutase (SOD) from Potentilla atrosanguinea (Wall. ex. Lehm.) was crystallized using 20% PEG 3350 and 0.2 M ammonium iodide and diffraction data were collected to 2.36 A resolution using an in-house Cu Kalpha X-ray source. Analyses show that data with a redundancy of 3.2 were sufficient to determine the structure by the SAD technique using the iodine anomalous signal. This redundancy is lower than that in previous cases in which protein structures were determined using iodines for phasing and in-house copper X-ray sources. Cocrystallization of proteins with halide salts such as ammonium iodide in combination with copper-anode X-ray radiation can therefore serve as a powerful and easy avenue for structure solution.
Metaproteomics to investigate the impact of sampling-site biogeochemistry on structure and functionality of leaf-litter degrading microbial communities

NASA Astrophysics Data System (ADS)

Schneider, Thomas; Keiblinger, Katharina; Gerrits, Bertran; Schmid, Emanuel; Eberl, Leo; Zechmeister-Boltenstern, Sophie; Riedel, Kathrin

2010-05-01

The composition of organic matter in natural ecosystems is strongly influenced by the microorganisms present. Conversely, bacteria and fungi are limited by the amount and type of organic matter available in a given environment, most of which is ultimately derived from plants. Changes in the stoichiometry and biochemical constituents of plant litter may therefore alter species composition and elicit changes in the activities of microbial communities and their component parts. The identification of the microbial proteins of a given habitat together with the analysis of their phylogenetic origin and their spatial and temporal distribution are expected to provide fundamentally new insights into the role of microbial diversity in biogeochemical processes. To relate structure and functionality of microbial communities involved in leaf-litter decomposition we determined biogeochemistry, community structure by phospholipid fatty acid (PLFA)-analyses, enzymatic activities, and analysed the protein complement of different litter types, which were collected in winter and spring at various Austrian sampling sites, in a semi-quantitative proteomics approach by one dimensional polyacrylamide gel electrophoresis (1-D-SDS-PAGE) combined with liquid chromatography/tandem mass-spectrometry (LC-MS/MS). Protein abundances were determined by counting the number of MS/MS spectra assigned to each protein. In samples with high manganese and phosphor content a significant increase of fungal proteins from February to May was observed, which was in good agreement with the PLFA-analyses showing similar trends towards an increase of the fungal community. In contrast, the PLFA analysis revealed no temporal changes in the community at Achenkirch and even a decrease in the fungal/bacterial ratio at Klausen-Leopoldsdorf, two sampling sites low in P and Mn; similar trends are reflected in our spectral counts. In conclusion, semi-quantitative proteome- and PLFA-analyses suggest that fungal and bacterial abundance positively correlates with the total amount of P and Mn within the different litter types. Spectral counts of extracellular enzymes demonstrated a significant increase of these enzymes in the May, which was also mirrored by measurements of total enzymatic activities. The finding that almost all hydrolytic enzymes identified from litter were of fungal origin suggests a prominent role of fungi during aerobic litter decomposition.
Proton channel models

PubMed Central

Pupo, Amaury; Baez-Nieto, David; Martínez, Agustín; Latorre, Ramón; González, Carlos

2014-01-01

Voltage-gated proton channels are integral membrane proteins with the capacity to permeate elementary particles in a voltage and pH dependent manner. These proteins have been found in several species and are involved in various physiological processes. Although their primary topology is known, lack of details regarding their structures in the open conformation has limited analyses toward a deeper understanding of the molecular determinants of their function and regulation. Consequently, the function-structure relationships have been inferred based on homology models. In the present work, we review the existing proton channel models, their assumptions, predictions and the experimental facts that support them. Modeling proton channels is not a trivial task due to the lack of a close homolog template. Hence, there are important differences between published models. This work attempts to critically review existing proton channel models toward the aim of contributing to a better understanding of the structural features of these proteins. PMID:24755912
Human Fanconi Anemia Complementation Group A Protein Stimulates the 5’ Flap Endonuclease Activity of FEN1

PubMed Central

Qian, Liangyue; Yuan, Fenghua; Rodriguez-Tello, Paola; Padgaonkar, Suyog; Zhang, Yanbin

2013-01-01

In eukaryotic cells, Flap endonuclease 1 (FEN1) is a major structure-specific endonuclease that processes 5’ flapped structures during maturation of lagging strand DNA synthesis, long patch base excision repair, and rescue of stalled replication forks. Here we report that fanconi anemia complementation group A protein (FANCA), a protein that recognizes 5’ flap structures and is involved in DNA repair and maintenance of replication forks, constantly stimulates FEN1-mediated incision of both DNA and RNA flaps. Kinetic analyses indicate that FANCA stimulates FEN1 by increasing the turnover rate of FEN1 and altering its substrate affinity. More importantly, six pathogenic FANCA mutants are significantly less efficient than the wild-type at stimulating FEN1 endonuclease activity, implicating that regulation of FEN1 by FANCA contributes to the maintenance of genomic stability. PMID:24349332

Sequence analyses reveal that a TPR–DP module, surrounded by recombinable flanking introns, could be at the origin of eukaryotic Hop and Hip TPR–DP domains and prokaryotic GerD proteins

PubMed Central

Papandreou, Nikolaos; Chomilier, Jacques

2008-01-01

The co-chaperone Hop [heat shock protein (HSP) organising protein] is known to bind both Hsp70 and Hsp90. Hop comprises three repeats of a tetratricopeptide repeat (TPR) domain, each consisting of three TPR motifs. The first and last TPR domains are followed by a domain containing several dipeptide (DP) repeats called the DP domain. These analyses suggest that the hop genes result from successive recombination events of an ancestral TPR–DP module. From a hydrophobic cluster analysis of homologous Hop protein sequences derived from gene families, we can postulate that shifts in the open reading frames are at the origin of the present sequences. Moreover, these shifts can be related to the presence or absence of biological function. We propose to extend the family of Hop co-chaperons into the kingdom of bacteria, as several structurally related genes have been identified by hydrophobic cluster analysis. We also provide evidence of common structural characteristics between hop and hip genes, suggesting a shared precursor of ancestral TPR–DP domains. Electronic supplementary material The online version of this article (doi:10.1007/s12192-008-0083-8) contains supplementary material, which is available to authorized users. PMID:18987995
Detect the sensitivity and response of protein molecular structure of whole canola seed (yellow and brown) to different heat processing methods and relation to protein utilization and availability using ATR-FT/IR molecular spectroscopy with chemometrics.

PubMed

Samadi; Theodoridou, Katerina; Yu, Peiqiang

2013-03-15

The objectives of this experiment were to detect the sensitivity and response of protein molecular structure of whole canola seed to different heat processing [moisture (autoclaving) vs. dry (roasting) heating] and quantify heat-induced protein molecular structure changes in relation to protein utilization and availability. In this study, whole canola seeds were autoclaved (moisture heating) and dry (roasting) heated at 120 °C for 1h, respectively. The parameters assessed included changes in (1) chemical composition profile, (2) CNCPS protein subfractions (PA, PB1, PB2, PB3, PC), (3) intestinal absorbed true protein supply, (4) energy values, and (5) protein molecular structures (amide I, amide II, ratio of amide I to II, α-helix, β-sheet, ratio of α-helix to β-sheet). The results showed that autoclave heating significantly decreased (P<0.05) but dry heating increased (P<0.05) the ratio of protein α-helix to β-sheet (with the ratios of 1.07, 0.95, 1.10 for the control (raw), autoclave heating and dry heating, respectively). The multivariate molecular spectral analyses (PCA, CLA) showed that there were significantly molecular structural differences in the protein amide I and II fingerprint region (ca. 1714-1480 cm(-1)) among the control, autoclave and dry heating. These differences were indicated by the form of separate class (PCA) and group of separate ellipse (CLA) between the treatments. The correlation analysis with spearman method showed that there were significantly and highly positive correlation (P<0.05) between heat-induced protein molecular structure changes in terms of α-helix to β-sheet ratios and in situ protein degradation and significantly negative correlation between the protein α-helix to β-sheet ratios and intestinal digestibility of undegraded protein. The results indicated that heat-induced changes of protein molecular structure revealed by vibration molecular spectroscopy could be used as a potential predictor to protein degradation and intestinal protein digestion of whole canola seed. Future study is needed to study response and impact of heat processing to each inherent layer of canola seed from outside to inside tissues and between yellow canola and brown canola. Copyright © 2012 Elsevier B.V. All rights reserved.
Identification of a structural constituent and one possible site of postembryonic formation of a teleost otolithic membrane

PubMed Central

Davis, James G.; Burns, Frank R.; Navaratnam, Dasakumar; Lee, A. Masaji; Ichimiya, Shingo; Oberholtzer, J. Carl; Greene, Mark I.

1997-01-01

A gelatinous otolithic membrane (OM) couples a single calcified otolith to the sensory epithelium in the bluegill sunfish (Lepomis macrochirus) saccule, one of the otolithic organs in the inner ear. Though the OM is an integral part of the anatomic network of endorgan structures that result in vestibular function in the inner ear, the identity of the proteins that make up this sensory accessory membrane in teleosts, or in any vertebrate, is not fully known. Previously, we identified a cDNA from the sunfish saccular otolithic organ that encoded a new member of the collagen family of structural proteins. In this study, we examined biochemical features and the localization of the saccular collagen (SC) protein in vivo using polyclonal antisera that recognize the noncollagenous domains of the SC protein. The SC protein, in vivo, was identified as a 95-kDa glycoprotein in sunfish whole-saccule lysate and in homogenates of microdissected saccular OMs. Immunohistochemical analyses demonstrated that the SC protein was localized within one of the two distinct layers of the sunfish saccular OM. The SC protein was also detected within the cytoplasm of supporting cells at the edges of the saccular sensory epithelium, indicating that these cells are a primary site for the synthesis of this structural protein. Further studies of the organization of this matrix molecule in the OM may help clarify the role of this sensory accessory membrane in vestibular sensory function. PMID:9012849
Evolutionary distance from human homologs reflects allergenicity of animal food proteins.

PubMed

Jenkins, John A; Breiteneder, Heimo; Mills, E N Clare

2007-12-01

In silico analysis of allergens can identify putative relationships among protein sequence, structure, and allergenic properties. Such systematic analysis reveals that most plant food allergens belong to a restricted number of protein superfamilies, with pollen allergens behaving similarly. We have investigated the structural relationships of animal food allergens and their evolutionary relatedness to human homologs to define how closely a protein must resemble a human counterpart to lose its allergenic potential. Profile-based sequence homology methods were used to classify animal food allergens into Pfam families, and in silico analyses of their evolutionary and structural relationships were performed. Animal food allergens could be classified into 3 main families--tropomyosins, EF-hand proteins, and caseins--along with 14 minor families each composed of 1 to 3 allergens. The evolutionary relationships of each of these allergen superfamilies showed that in general, proteins with a sequence identity to a human homolog above approximately 62% were rarely allergenic. Single substitutions in otherwise highly conserved regions containing IgE epitopes in EF-hand parvalbumins may modulate allergenicity. These data support the premise that certain protein structures are more allergenic than others. Contrasting with plant food allergens, animal allergens, such as the highly conserved tropomyosins, challenge the capability of the human immune system to discriminate between foreign and self-proteins. Such immune responses run close to becoming autoimmune responses. Exploiting the closeness between animal allergens and their human homologs in the development of recombinant allergens for immunotherapy will need to consider the potential for developing unanticipated autoimmune responses.
Unravelling the effects of mechanical physiological conditioning on cardiac adipose tissue-derived progenitor cells in vitro and in silico.

PubMed

Llucià-Valldeperas, Aida; Bragós, Ramon; Soler-Botija, Carolina; Roura, Santiago; Gálvez-Montón, Carolina; Prat-Vidal, Cristina; Perea-Gil, Isaac; Bayes-Genis, Antoni

2018-01-11

Mechanical conditioning is incompletely characterized for stimulating therapeutic cells within the physiological range. We sought to unravel the mechanism of action underlying mechanical conditioning of adipose tissue-derived progenitor cells (ATDPCs), both in vitro and in silico. Cardiac ATDPCs, grown on 3 different patterned surfaces, were mechanically stretched for 7 days at 1 Hz. A custom-designed, magnet-based, mechanical stimulator device was developed to apply ~10% mechanical stretching to monolayer cell cultures. Gene and protein analyses were performed for each cell type and condition. Cell supernatants were also collected to analyze secreted proteins and construct an artificial neural network. Gene and protein modulations were different for each surface pattern. After mechanostimulation, cardiac ATDPCs increased the expression of structural genes and there was a rising trend on cardiac transcription factors. Finally, secretome analyses revealed upregulation of proteins associated with both myocardial infarction and cardiac regeneration, such as regulators of the immune response, angiogenesis or cell adhesion. To conclude, mechanical conditioning of cardiac ATDPCs enhanced the expression of early and late cardiac genes in vitro. Additionally, in silico analyses of secreted proteins showed that mechanical stimulation of cardiac ATDPCs was highly associated with myocardial infarction and repair.
Thermally induced disintegration of the oligomeric structure of alphaB-crystallin mutant F28S is associated with diminished chaperone activity.

PubMed

Kelley, Patrick B; Abraham, Edathara C

2003-10-01

alphaB-crystallin, a member of the small heat-shock protein (hsp) family of proteins, is able to function as a molecular chaperone by protecting other proteins from stress-induced aggregation by recognizing and binding to partially unfolded species of damaged proteins. The present work has investigated the role of phenylalanine-28 (F28) of the 22RLFDQFF28 region of alphaB-crystallin in maintaining chaperone function and oligomeric structure under physiological condition and under thermal stress. Bovine alphaB-crystallin was cloned for the first time and the cDNA sequence revealed greater than 90% homology to that of human, rat and mouse alphaB-crystallins. F28 was mutated to a serine followed by expression of the mutant F28S and the wild-type alphaB (alphaB-wt) in E. coli and subsequent purification of the protein by size-exclusion chromatography. Secondary and tertiary structure analyses showed some structural changes in the mutant. Chaperone activity and oligomeric size of the mutant was unchanged at 37 degrees C whereas at 58 degrees C the chaperone activity was significantly decreased and the oligomeric size ranged from low molecular weight to high molecular weight showing disintegration of the oligomeric structure. The data support the idea that the participation of large oligomeric structure rather than smaller units is required to have optimal chaperone activity and the hydrophobic F28 residue is needed for maintaining the native oligomeric structure under thermal stress.
Stn1-Ten1 is an Rpa2-Rpa3-like complex at telomeres

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sun, Jia; Yu, Eun Young; Yang, Yuting

2010-09-02

In budding yeast, Cdc13, Stn1, and Ten1 form a heterotrimeric complex (CST) that is essential for telomere protection and maintenance. Previous bioinformatics analysis revealed a putative oligonucleotide/oligosaccharide-binding (OB) fold at the N terminus of Stn1 (Stn1N) that shows limited sequence similarity to the OB fold of Rpa2, a subunit of the eukaryotic ssDNA-binding protein complex replication protein A (RPA). Here we present functional and structural analyses of Stn1 and Ten1 from multiple budding and fission yeast. The crystal structure of the Candida tropicalis Stn1N complexed with Ten1 demonstrates an Rpa2N-Rpa3-like complex. In both structures, the OB folds of the twomore » components pack against each other through interactions between two C-terminal helices. The structure of the C-terminal domain of Saccharomyces cerevisiae Stn1 (Stn1C) was found to comprise two related winged helix-turn-helix (WH) motifs, one of which is most similar to the WH motif at the C terminus of Rpa2, again supporting the notion that Stn1 resembles Rpa2. The crystal structure of the fission yeast Schizosaccharomyces pombe Stn1N-Ten1 complex exhibits a virtually identical architecture as the C. tropicalis Stn1N-Ten1. Functional analyses of the Candida albicans Stn1 and Ten1 proteins revealed critical roles for these proteins in suppressing aberrant telomerase and recombination activities at telomeres. Mutations that disrupt the Stn1-Ten1 interaction induce telomere uncapping and abolish the telomere localization of Ten1. Collectively, our structural and functional studies illustrate that, instead of being confined to budding yeast telomeres, the CST complex may represent an evolutionarily conserved RPA-like telomeric complex at the 3' overhangs that works in parallel with or instead of the well-characterized POT1-TPP1/TEBP{alpha}-{beta} complex.« less
Structural analyses of EBER1 and EBER2 ribonucleoprotein particles present in Epstein-Barr virus-infected cells.

PubMed Central

Glickman, J N; Howe, J G; Steitz, J A

1988-01-01

The ribonucleoprotein (RNP) particles containing the Epstein-Barr virus-associated small RNAs EBER1 and EBER2 were analyzed to determine their RNA secondary structures and sites of RNA-protein interaction. The secondary structures were probed with nucleases and by chemical modification with single-strand-specific reagents, and the sites of modification or cleavage were mapped by primer extension. These data were used to develop secondary structures for the two RNAs, and likely sites of close RNA-protein contact were identified by comparing modification patterns for naked RNA and RNA in RNP particles. In addition, sites of interaction between each Epstein-Barr virus-encoded RNA (EBER) and the La antigen were identified by analyzing RNA fragments resistant to digestion by RNase A or T1 after immunoprecipitation by an anti-La serum sample from a lupus patient. Our results confirm earlier findings that the La protein binds to the 3' terminus of each molecule. Possible functions for the EBER RNPs are discussed. Images PMID:2828685
Crystal structure of SP-PTP, a low molecular weight protein tyrosine phosphatase from Streptococcus pyogenes.

PubMed

Ku, Bonsu; Keum, Chae Won; Lee, Hye Seon; Yun, Hye-Yeoung; Shin, Ho-Chul; Kim, Bo Yeon; Kim, Seung Jun

2016-09-23

Streptococcus pyogenes, or Group A Streptococcus (GAS), is a pathogenic bacterium that causes a variety of infectious diseases. The GAS genome encodes one protein tyrosine phosphatase, SP-PTP, which plays an essential role in the replication and virulence maintenance of GAS. Herein, we present the crystal structure of SP-PTP at 1.9 Å resolution. Although SP-PTP has been reported to have dual phosphatase specificity for both phosphorylated tyrosine and serine/threonine, three-dimensional structural analysis showed that SP-PTP shares high similarity with typical low molecular weight protein tyrosine phosphatases (LMWPTPs), which are specific for phosphotyrosine, but not with dual-specificity phosphatases, in overall folding and active site composition. In the dephosphorylation activity test, SP-PTP consistently acted on phosphotyrosine substrates, but not or only minimally on phosphoserine/phosphothreonine substrates. Collectively, our structural and biochemical analyses verified SP-PTP as a canonical tyrosine-specific LMWPTP. Copyright © 2016 Elsevier Inc. All rights reserved.
Similarity of a 16.5kDa tegumental protein of the human liver fluke Opisthorchis viverrini to nematode cytoplasmic motility protein.

PubMed

Labbunruang, Nipawan; Phadungsil, Wansika; Tesana, Smarn; Smooker, Peter M; Grams, Rudi

2016-05-01

Opisthorchis viverrini is the causative agent of human opisthorchiasis in Thailand and long lasting infection with the parasite has been correlated with the development of cholangiocarcinoma. In this work we have molecularly characterized the first member of a protein family carrying two DM9 repeats in this parasite (OvDM9-1). InterPro and other protein family databases describe the DM9 repeat as a protein domain of unknown function that has been first noted in Drosophila melanogaster. Two paralogous proteins have been partially characterized in the genus Fasciola, Fasciola hepatica TP16.5, a novel tegumental antigen in human fascioliasis and, recently F. gigantica DM9-1, a parenchymal protein with structural similarity to nematode cytoplasmic motility protein (MFP2). In this study, we show further evidence that this family of trematode proteins is related to MFP2 in sequence and structure. Soluble recombinant OvDM9-1 was used for structural analyses and for production of specific antisera. The native protein was detected in soluble and insoluble crude worm extracts and in seemingly various oligomeric forms in the latter. The potential for oligomerization was supported by cross-linking experiments of recombinant OvDM9-1. Structure prediction suggested a β-rich secondary structure of the protein and this was supported by a circular dichroism analysis. Molecular modeling in Phyre2 identified both MFP2 domains as distant homologs of OvDM9-1. The protein was located in tegumental type tissue and the cecal epithelium in the mature parasite. Recombinant OvDM9-1 was used as target in indirect ELISA but sera from infected hamsters showed only marginal reactivity towards it. It is proposed that OvDM9-1 and other members of this protein family have a role in cellular transport through functions on the cytoskeleton. Copyright © 2016 Elsevier B.V. All rights reserved.
Single Honeybee Silk Protein Mimics Properties of Multi-Protein Silk

PubMed Central

Sutherland, Tara D.; Church, Jeffrey S.; Hu, Xiao; Huson, Mickey G.; Kaplan, David L.; Weisman, Sarah

2011-01-01

Honeybee silk is composed of four fibrous proteins that, unlike other silks, are readily synthesized at full-length and high yield. The four silk genes have been conserved for over 150 million years in all investigated bee, ant and hornet species, implying a distinct functional role for each protein. However, the amino acid composition and molecular architecture of the proteins are similar, suggesting functional redundancy. In this study we compare materials generated from a single honeybee silk protein to materials containing all four recombinant proteins or to natural honeybee silk. We analyse solution conformation by dynamic light scattering and circular dichroism, solid state structure by Fourier Transform Infrared spectroscopy and Raman spectroscopy, and fiber tensile properties by stress-strain analysis. The results demonstrate that fibers artificially generated from a single recombinant silk protein can reproduce the structural and mechanical properties of the natural silk. The importance of the four protein complex found in natural silk may lie in biological silk storage or hierarchical self-assembly. The finding that the functional properties of the mature material can be achieved with a single protein greatly simplifies the route to production for artificial honeybee silk. PMID:21311767
The TIM Barrel Architecture Facilitated the Early Evolution of Protein-Mediated Metabolism.

PubMed

Goldman, Aaron David; Beatty, Joshua T; Landweber, Laura F

2016-01-01

The triosephosphate isomerase (TIM) barrel protein fold is a structurally repetitive architecture that is present in approximately 10% of all enzymes. It is generally assumed that this ubiquity in modern proteomes reflects an essential historical role in early protein-mediated metabolism. Here, we provide quantitative and comparative analyses to support several hypotheses about the early importance of the TIM barrel architecture. An information theoretical analysis of protein structures supports the hypothesis that the TIM barrel architecture could arise more easily by duplication and recombination compared to other mixed α/β structures. We show that TIM barrel enzymes corresponding to the most taxonomically broad superfamilies also have the broadest range of functions, often aided by metal and nucleotide-derived cofactors that are thought to reflect an earlier stage of metabolic evolution. By comparison to other putatively ancient protein architectures, we find that the functional diversity of TIM barrel proteins cannot be explained simply by their antiquity. Instead, the breadth of TIM barrel functions can be explained, in part, by the incorporation of a broad range of cofactors, a trend that does not appear to be shared by proteins in general. These results support the hypothesis that the simple and functionally general TIM barrel architecture may have arisen early in the evolution of protein biosynthesis and provided an ideal scaffold to facilitate the metabolic transition from ribozymes, peptides, and geochemical catalysts to modern protein enzymes.
Mass spectrometric analyses of organophosphate insecticide oxon protein adducts.

PubMed

Thompson, Charles M; Prins, John M; George, Kathleen M

2010-01-01

Organophosphate (OP) insecticides continue to be used to control insect pests. Acute and chronic exposures to OP insecticides have been documented to cause adverse health effects, but few OP-adducted proteins have been correlated with these illnesses at the molecular level. Our aim was to review the literature covering the current state of the art in mass spectrometry (MS) used to identify OP protein biomarkers. We identified general and specific research reports related to OP insecticides, OP toxicity, OP structure, and protein MS by searching PubMed and Chemical Abstracts for articles published before December 2008. A number of OP-based insecticides share common structural elements that result in predictable OP-protein adducts. The resultant OP-protein adducts show an increase in molecular mass that can be identified by MS and correlated with the OP agent. Customized OP-containing probes have also been used to tag and identify protein targets that can be identified by MS. MS is a useful and emerging tool for the identification of proteins that are modified by activated organophosphate insecticides. MS can characterize the structure of the OP adduct and also the specific amino acid residue that forms the key bond with the OP. Each protein that is modified in a unique way by an OP represents a unique molecular biomarker that with further research can lead to new correlations with exposure.
Mass Spectrometric Analyses of Organophosphate Insecticide Oxon Protein Adducts

PubMed Central

Thompson, Charles M.; Prins, John M.; George, Kathleen M.

2010-01-01

Objective Organophosphate (OP) insecticides continue to be used to control insect pests. Acute and chronic exposures to OP insecticides have been documented to cause adverse health effects, but few OP-adducted proteins have been correlated with these illnesses at the molecular level. Our aim was to review the literature covering the current state of the art in mass spectrometry (MS) used to identify OP protein biomarkers. Data sources and extraction We identified general and specific research reports related to OP insecticides, OP toxicity, OP structure, and protein MS by searching PubMed and Chemical Abstracts for articles published before December 2008. Data synthesis A number of OP-based insecticides share common structural elements that result in predictable OP–protein adducts. The resultant OP–protein adducts show an increase in molecular mass that can be identified by MS and correlated with the OP agent. Customized OP-containing probes have also been used to tag and identify protein targets that can be identified by MS. Conclusions MS is a useful and emerging tool for the identification of proteins that are modified by activated organophosphate insecticides. MS can characterize the structure of the OP adduct and also the specific amino acid residue that forms the key bond with the OP. Each protein that is modified in a unique way by an OP represents a unique molecular biomarker that with further research can lead to new correlations with exposure. PMID:20056576
Evolution of the Translocation and Assembly Module (TAM)

PubMed Central

Heinz, Eva; Selkrig, Joel; Belousoff, Matthew J.; Lithgow, Trevor

2015-01-01

Bacterial outer membrane proteins require the beta-barrel assembly machinery (BAM) for their correct folding and function. The central component of this machinery is BamA, an Omp85 protein that is essential and found in all Gram-negative bacteria. An additional feature of the BAM is the translocation and assembly module (TAM), comprised TamA (an Omp85 family protein) and TamB. We report that TamA and a closely related protein TamL are confined almost exclusively to Proteobacteria and Bacteroidetes/Chlorobi respectively, whereas TamB is widely distributed across the majority of Gram-negative bacterial lineages. A comprehensive phylogenetic and secondary structure analysis of the TamB protein family revealed that TamB was present very early in the evolution of bacteria. Several sequence characteristics were discovered to define the TamB protein family: A signal-anchor linkage to the inner membrane, beta-helical structure, conserved domain architecture and a C-terminal region that mimics outer membrane protein beta-strands. Taken together, the structural and phylogenetic analyses suggest that the TAM likely evolved from an original combination of BamA and TamB, with a later gene duplication event of BamA, giving rise to an additional Omp85 sequence that evolved to be TamA in Proteobacteria and TamL in Bacteroidetes/Chlorobi. PMID:25994932
Cloning, characterisation, and comparative quantitative expression analyses of receptor for advanced glycation end products (RAGE) transcript forms.

PubMed

Sterenczak, Katharina A; Willenbrock, Saskia; Barann, Matthias; Klemke, Markus; Soller, Jan T; Eberle, Nina; Nolte, Ingo; Bullerdiek, Jörn; Murua Escobar, Hugo

2009-04-01

RAGE is a member of the immunoglobulin superfamily of cell surface molecules playing key roles in pathophysiological processes, e.g. immune/inflammatory disorders, Alzheimer's disease, diabetic arteriosclerosis and tumourigenesis. In humans 19 naturally occurring RAGE splicing variants resulting in either N-terminally or C-terminally truncated proteins were identified and are lately discussed as mechanisms for receptor regulation. Accordingly, deregulation of sRAGE levels has been associated with several diseases e.g. Alzheimer's disease, Type 1 diabetes, and rheumatoid arthritis. Administration of recombinant sRAGE to animal models of cancer blocked tumour growth successfully. In spite of its obvious relationship to cancer and metastasis data focusing sRAGE deregulation and tumours is rare. In this study we screened a set of tumours, healthy tissues and various cancer cell lines for RAGE splicing variants and analysed their structure. Additionally, we analysed the ratio of the mainly found transcript variants using quantitative Real-Time PCR. In total we characterised 24 previously not described canine and 4 human RAGE splicing variants, analysed their structure, classified their characteristics, and derived their respective protein forms. Interestingly, the healthy and the neoplastic tissue samples showed in majority RAGE transcripts coding for the complete receptor and transcripts showing insertions of intron 1.
Structural Basis of Rap Phosphatase Inhibition by Phr Peptides

PubMed Central

Gallego del Sol, Francisca; Marina, Alberto

2013-01-01

Two-component systems, composed of a sensor histidine kinase and an effector response regulator (RR), are the main signal transduction devices in bacteria. In Bacillus, the Rap protein family modulates complex signaling processes mediated by two-component systems, such as competence, sporulation, or biofilm formation, by inhibiting the RR components involved in these pathways. Despite the high degree of sequence homology, Rap proteins exert their activity by two completely different mechanisms of action: inducing RR dephosphorylation or blocking RR binding to its target promoter. However the regulatory mechanism involving Rap proteins is even more complex since Rap activity is antagonized by specific signaling peptides (Phr) through a mechanism that remains unknown at the molecular level. Using X-ray analyses, we determined the structure of RapF, the anti-activator of competence RR ComA, alone and in complex with its regulatory peptide PhrF. The structural and functional data presented herein reveal that peptide PhrF blocks the RapF-ComA interaction through an allosteric mechanism. PhrF accommodates in the C-terminal tetratricopeptide repeat domain of RapF by inducing its constriction, a conformational change propagated by a pronounced rotation to the N-terminal ComA-binding domain. This movement partially disrupts the ComA binding site by triggering the ComA disassociation, whose interaction with RapF is also sterically impaired in the PhrF-induced conformation of RapF. Sequence analyses of the Rap proteins, guided by the RapF-PhrF structure, unveil the molecular basis of Phr recognition and discrimination, allowing us to relax the Phr specificity of RapF by a single residue change. PMID:23526880
Adaptive expansion of the maize maternally expressed gene (Meg) family involves changes in expression patterns and protein secondary structures of its members

PubMed Central

2014-01-01

Background The Maternally expressed gene (Meg) family is a locally-duplicated gene family of maize which encodes cysteine-rich proteins (CRPs). The founding member of the family, Meg1, is required for normal development of the basal endosperm transfer cell layer (BETL) and is involved in the allocation of maternal nutrients to growing seeds. Despite the important roles of Meg1 in maize seed development, the evolutionary history of the Meg cluster and the activities of the duplicate genes are not understood. Results In maize, the Meg gene cluster resides in a 2.3 Mb-long genomic region that exhibits many features of non-centromeric heterochromatin. Using phylogenetic reconstruction and syntenic alignments, we identified the pedigree of the Meg family, in which 11 of its 13 members arose in maize after allotetraploidization ~4.8 mya. Phylogenetic and population-genetic analyses identified possible signatures suggesting recent positive selection in Meg homologs. Structural analyses of the Meg proteins indicated potentially adaptive changes in secondary structure from α-helix to β-strand during the expansion. Transcriptomic analysis of the maize endosperm indicated that 6 Meg genes are selectively activated in the BETL, and younger Meg genes are more active than older ones. In endosperms from B73 by Mo17 reciprocal crosses, most Meg genes did not display parent-specific expression patterns. Conclusions Recently-duplicated Meg genes have different protein secondary structures, and their expressions in the BETL dominate over those of older members. Together with the signs of positive selections in the young Meg genes, these results suggest that the expansion of the Meg family involves potentially adaptive transitions in which new members with novel functions prevailed over older members. PMID:25084677
Essential multimeric enzymes in kinetoplastid parasites: A host of potentially druggable protein-protein interactions.

PubMed

Wachsmuth, Leah M; Johnson, Meredith G; Gavenonis, Jason

2017-06-01

Parasitic diseases caused by kinetoplastid parasites of the genera Trypanosoma and Leishmania are an urgent public health crisis in the developing world. These closely related species possess a number of multimeric enzymes in highly conserved pathways involved in vital functions, such as redox homeostasis and nucleotide synthesis. Computational alanine scanning of these protein-protein interfaces has revealed a host of potentially ligandable sites on several established and emerging anti-parasitic drug targets. Analysis of interfaces with multiple clustered hotspots has suggested several potentially inhibitable protein-protein interactions that may have been overlooked by previous large-scale analyses focusing solely on secondary structure. These protein-protein interactions provide a promising lead for the development of new peptide and macrocycle inhibitors of these enzymes.
Dissortativity and duplications in oral cancer

NASA Astrophysics Data System (ADS)

Shinde, Pramod; Yadav, Alok; Rai, Aparna; Jalan, Sarika

2015-08-01

More than 300 000 new cases worldwide are being diagnosed with oral cancer annually. Complexity of oral cancer renders designing drug targets very difficult. We analyse protein-protein interaction network for the normal and oral cancer tissue and detect crucial changes in the structural properties of the networks in terms of the interactions of the hub proteins and the degree-degree correlations. Further analysis of the spectra of both the networks, while exhibiting universal statistical behaviour, manifest distinction in terms of the zero degeneracy, providing insight to the complexity of the underlying system.

Solid-State NMR Studies Reveal Native-like β-Sheet Structures in Transthyretin Amyloid

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lim, Kwang Hun; Dasari, Anvesh K. R.; Hung, Ivan

Structural characterization of amyloid rich in cross-β structures is crucial for unraveling the molecular basis of protein misfolding and amyloid formation associated with a wide range of human disorders. Elucidation of the β-sheet structure in noncrystalline amyloid has, however, remained an enormous challenge. Here we report structural analyses of the β-sheet structure in a full-length transthyretin amyloid using solid-state NMR spectroscopy. Magic-angle-spinning (MAS) solid-state NMR was employed to investigate native-like β-sheet structures in the amyloid state using selective labeling schemes for more efficient solid-state NMR studies. Analyses of extensive long-range 13 C- 13 C correlation MAS spectra obtained with selectivelymore » 13 CO- and 13 Cα-labeled TTR reveal that the two main β-structures in the native state, the CBEF and DAGH β-sheets, remain intact after amyloid formation. The tertiary structural information would be of great use for examining the quaternary structure of TTR amyloid.« less
Solid-State NMR Studies Reveal Native-like β-Sheet Structures in Transthyretin Amyloid

DOE PAGES

Lim, Kwang Hun; Dasari, Anvesh K. R.; Hung, Ivan; ...

2016-09-02

Structural characterization of amyloid rich in cross-β structures is crucial for unraveling the molecular basis of protein misfolding and amyloid formation associated with a wide range of human disorders. Elucidation of the β-sheet structure in noncrystalline amyloid has, however, remained an enormous challenge. Here we report structural analyses of the β-sheet structure in a full-length transthyretin amyloid using solid-state NMR spectroscopy. Magic-angle-spinning (MAS) solid-state NMR was employed to investigate native-like β-sheet structures in the amyloid state using selective labeling schemes for more efficient solid-state NMR studies. Analyses of extensive long-range 13 C- 13 C correlation MAS spectra obtained with selectivelymore » 13 CO- and 13 Cα-labeled TTR reveal that the two main β-structures in the native state, the CBEF and DAGH β-sheets, remain intact after amyloid formation. The tertiary structural information would be of great use for examining the quaternary structure of TTR amyloid.« less
Expression and purification of mouse peptide ESP4 in Escherichia coli.

PubMed

Hirakane, Makoto; Taniguchi, Masahiro; Yoshinaga, Sosuke; Misumi, Shogo; Terasawa, Hiroaki

2014-04-01

Pheromones are species-specific chemical signals that regulate a wide range of social and sexual behaviors in many animals. In mice, the male-specific peptide ESP1 (exocrine gland-secreting peptide 1) is secreted into tear fluids and enhances female sexual receptive behavior. ESP1 belongs to the ESP family, a multigene family with 38 genes in mice. ESP1 shares the highest homology with ESP4. ESP1 is expressed in the extraorbital lacrimal gland, whereas ESP4 is expressed in some exocrine glands. Thus, ESP4 is expected to have a function that has not been elucidated yet. Large amounts of the purified ESP4 protein are required for structural and biochemical studies. Here we present an expression and purification scheme for the recombinant ESP4 protein. The N-terminally histidine-tagged ESP4 fusion protein was expressed in Escherichia coli as inclusion bodies, which were solubilized and purified by nickel affinity chromatography. The histidine tag was cleaved with thrombin and removed by a second nickel affinity chromatography step. The ESP4 protein was isolated with high purity by reversed-phase chromatography. For NMR analyses, we prepared a stable isotope-labeled ESP4 protein. Three repeated freeze-drying steps after the reversed-phase chromatography were required, to remove a volatile contaminating compound and to obtain an NMR spectrum with a homogeneous line shape. AMS-modification and far-UV CD spectroscopic analyses suggested that ESP4 has an intramolecular disulfide bridge and a helical structure, respectively. The present study provides a powerful tool for structural and biochemical studies of ESP4, leading toward the elucidation of the roles of the ESP family members. Copyright © 2014 Elsevier Inc. All rights reserved.
Analysis of ABCB phosphoglycoproteins (PGPs) and their contribution to monocot biomass, structural stability, and productivity

DOE Office of Scientific and Technical Information (OSTI.GOV)

Murphy, Angus Stuart

2014-09-23

Efforts to manipulate production of plant secondary cell walls to improve the quality of biofuel feedstocks are currently limited by an inability to regulate the transport of small molecule components out of the cell. Plant ABCB p-glycoproteins are a small family of plasma membrane organic molecule transporters that have become primary targets for this effort, as they can potentially be harnessed to control the export of aromatic compounds and organic acids. However, unlike promiscuous mammalian ABCBs that function in multidrug resistance, all plant ABCB proteins characterized to date exhibit relatively narrow substrate specificity. Although ABCBs exhibit a highly conserved architecture,more » efforts to modify ABCB activity have been hampered by a lack of structural information largely because an eukaryotic ABCB protein crystal structure has yet to be obtained. Structure/ function analyses have been further impeded by the lack of a common heterologous expression system that can be used to characterize recombinant ABCB proteins, as many cannot be functionally expressed in S. cereviseae or other systems where proteins with analogous function can be readily knocked out. Using experimentally-determined plant ABCB substrate affinities and the crystal structure of the bacterial Sav1866 “half” ABC transporter, we have developed sequence/structure models for ABCBs that provide a testable context for mutational analysis of plant ABCB transporters. We have also developed a flexible heterologous expression system in Schizosaccharomyces pombe in which all endogenous ABC transporters have been knocked out. The effectiveness of this system for transport studies has been demonstrated by the successful functional expression all of the known PIN, AUX/LAX and ABCB auxin transporters. Our central hypothesis is that the domains of the ABCB proteins that we have identified as substrate docking sites and regulators of transport directionality can be altered or swapped to alter the transport characteristics of the proteins. We propose to combine computer modelling, mutational analyses, and complementation of well characterized Arabidopsis abcb4,14,and 19 mutants to elucidate ABCB function. The long term objective of this project is to enhance ABCB transport of cell wall components, to manipulate the direction of ABCB substrate transport, and, ultimately, to produce “designer” ABC transporters that can be used to modify plant feedstock quality.« less
Hydrophobic cluster analysis of G protein-coupled receptors: a powerful tool to derive structural and functional information from 2D-representation of protein sequences.

PubMed

Lentes, K U; Mathieu, E; Bischoff, R; Rasmussen, U B; Pavirani, A

1993-01-01

Current methods for comparative analyses of protein sequences are 1D-alignments of amino acid sequences based on the maximization of amino acid identity (homology) and the prediction of secondary structure elements. This method has a major drawback once the amino acid identity drops below 20-25%, since maximization of a homology score does not take into account any structural information. A new technique called Hydrophobic Cluster Analysis (HCA) has been developed by Lemesle-Varloot et al. (Biochimie 72, 555-574), 1990). This consists of comparing several sequences simultaneously and combining homology detection with secondary structure analysis. HCA is primarily based on the detection and comparison of structural segments constituting the hydrophobic core of globular protein domains, with or without transmembrane domains. We have applied HCA to the analysis of different families of G-protein coupled receptors, such as catecholamine receptors as well as peptide hormone receptors. Utilizing HCA the thrombin receptor, a new and as yet unique member of the family of G-protein coupled receptors, can be clearly classified as being closely related to the family of neuropeptide receptors rather than to the catecholamine receptors for which the shape of the hydrophobic clusters and the length of their third cytoplasmic loop are very different. Furthermore, the potential of HCA to predict relationships between new putative and already characterized members of this family of receptors will be presented.
Structural Analysis of a Putative Aminoglycoside N-Acetyltransferase from Bacillus anthracis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Klimecka, Maria M.; Chruszcz, Maksymilian; Font, Jose

2012-02-15

For the last decade, worldwide efforts for the treatment of anthrax infection have focused on developing effective vaccines. Patients that are already infected are still treated traditionally using different types of standard antimicrobial agents. The most popular are antibiotics such as tetracyclines and fluoroquinolones. While aminoglycosides appear to be less effective antimicrobial agents than other antibiotics, synthetic aminoglycosides have been shown to act as potent inhibitors of anthrax lethal factor and may have potential application as antitoxins. Here, we present a structural analysis of the BA2930 protein, a putative aminoglycoside acetyltransferase, which may be a component of the bacterium's aminoglycosidemore » resistance mechanism. The determined structures revealed details of a fold characteristic only for one other protein structure in the Protein Data Bank, namely, YokD from Bacillus subtilis. Both BA2930 and YokD are members of the Antibiotic-NAT superfamily (PF02522). Sequential and structural analyses showed that residues conserved throughout the Antibiotic-NAT superfamily are responsible for the binding of the cofactor acetyl coenzyme A. The interaction of BA2930 with cofactors was characterized by both crystallographic and binding studies.« less
Experimentally observed conformation-dependent geometry and hidden strain in proteins.

PubMed Central

Karplus, P. A.

1996-01-01

A database has been compiled documenting the peptide conformations and geometries from 70 diverse proteins refined at 1.75 A or better. Analysis of the well-ordered residues within the database shows phi, psi-distributions that have more fine structure than is generally observed. Also, clear evidence is presented that the peptide covalent geometry depends on conformation, with the interpeptide N-C alpha-C bond angle varying by nearly +/-5 degrees from its standard value. The observed deviations from standard peptide geometry are greatest near the edges of well-populated regions, consistent with strain occurring in these conformations. Minimization of such hidden strain could be an important factor in thermostability of proteins. These empirical data describing how equilibrium peptide geometry varies as a function of conformation confirm and extend quantum mechanics calculations, and have predictive value that will aid both theoretical and experimental analyses of protein structure. PMID:8819173
Mutual adaptation of a membrane protein and its lipid bilayer during conformational changes.

PubMed

Sonntag, Yonathan; Musgaard, Maria; Olesen, Claus; Schiøtt, Birgit; Møller, Jesper Vuust; Nissen, Poul; Thøgersen, Lea

2011-01-01

The structural elucidation of membrane proteins continues to gather pace, but we know little about their molecular interactions with the lipid environment or how they interact with the surrounding bilayer. Here, with the aid of low-resolution X-ray crystallography, we present direct structural information on membrane interfaces as delineated by lipid phosphate groups surrounding the sarco(endo)plasmic reticulum Ca(2+)-ATPase (SERCA) in its phosphorylated and dephosphorylated Ca(2+)-free forms. The protein-lipid interactions are further analysed using molecular dynamics simulations. We find that SERCA adapts to membranes of different hydrophobic thicknesses by inducing local deformations in the lipid bilayers and by undergoing small rearrangements of the amino-acid side chains and helix tilts. These mutually adaptive interactions allow smooth transitions through large conformational changes associated with the transport cycle of SERCA, a strategy that may be of general nature for many membrane proteins.
Buried chloride stereochemistry in the Protein Data Bank

PubMed Central

2014-01-01

Background Despite the chloride anion is involved in fundamental biological processes, its interactions with proteins are little known. In particular, we lack a systematic survey of its coordination spheres. Results The analysis of a non-redundant set (pairwise sequence identity?
Buried chloride stereochemistry in the Protein Data Bank.

PubMed

Carugo, Oliviero

2014-09-23

Despite the chloride anion is involved in fundamental biological processes, its interactions with proteins are little known. In particular, we lack a systematic survey of its coordination spheres. The analysis of a non-redundant set (pairwise sequence identity < 30%) of 1739 high resolution (<2 Å) crystal structures that contain at least one chloride anion shows that the first coordination spheres of the chlorides are essentially constituted by hydrogen bond donors. Amongst the side-chains positively charged, arginine interacts with chlorides much more frequently than lysine. Although the most common coordination number is 4, the coordination stereochemistry is closer to the expected geometry when the coordination number is 5, suggesting that this is the coordination number towards which the chlorides tend when they interact with proteins. The results of these analyses are useful in interpreting, describing, and validating new protein crystal structures that contain chloride anions.
Structure-based energetics of protein interfaces guide Foot-and-Mouth Disease virus vaccine design

PubMed Central

Scott, Katherine; Burman, Alison; Loureiro, Silvia; Ren, Jingshan; Porta, Claudine; Ginn, Helen M.; Jackson, Terry; Perez-Martin, Eva; Siebert, C. Alistair; Paul, Guntram; Huiskonen, Juha T.; Jones, Ian M.; Esnouf, Robert M.; Fry, Elizabeth E.; Maree, Francois F.; Charleston, Bryan; Stuart, David I.

2018-01-01

Summary Virus capsids are primed for disassembly yet capsid integrity is key to generating a protective immune response. Here we devise a computational method to assess relative stability of protein-protein interfaces and use it to design improved candidate vaccines for two of the least stable, but globally important, serotypes of Foot-and-Mouth Disease virus (FMDV), O and SAT2. FMDV capsids comprise identical pentameric protein subunits held together by tenuous non-covalent interactions, and are often unstable. Chemically inactivated or recombinant empty capsids, which could form the basis of future vaccines, are even less stable than live virus. We use a novel restrained molecular dynamics strategy, to rank mutations predicted to strengthen the pentamer interfaces to produce stabilized capsids. Structural analyses and stability assays confirmed the predictions, and vaccinated animals generated improved neutralising antibody responses to stabilised particles over parental viruses and wild-type capsids. PMID:26389739
Rising dough and baking bread at the Australian synchrotron

NASA Astrophysics Data System (ADS)

Mayo, S. C.; McCann, T.; Day, L.; Favaro, J.; Tuhumury, H.; Thompson, D.; Maksimenko, A.

2016-01-01

Wheat protein quality and the amount of common salt added in dough formulation can have a significant effect on the microstructure and loaf volume of bread. High-speed synchrotron micro-CT provides an ideal tool for observing the three dimensional structure of bread dough in situ during proving (rising) and baking. In this work, the synchrotron micro-CT technique was used to observe the structure and time evolution of doughs made from high and low protein flour and three different salt additives. These experiments showed that, as expected, high protein flour produces a higher volume loaf compared to low protein flour regardless of salt additives. Furthermore the results show that KCl in particular has a very negative effect on dough properties resulting in much reduced porosity. The hundreds of datasets produced and analysed during this experiment also provided a valuable test case for handling large quantities of data using tools on the Australian Synchrotron's MASSIVE cluster.
A novel Usher protein network at the periciliary reloading point between molecular transport machineries in vertebrate photoreceptor cells.

PubMed

Maerker, Tina; van Wijk, Erwin; Overlack, Nora; Kersten, Ferry F J; McGee, Joann; Goldmann, Tobias; Sehn, Elisabeth; Roepman, Ronald; Walsh, Edward J; Kremer, Hannie; Wolfrum, Uwe

2008-01-01

The human Usher syndrome (USH) is the most frequent cause of combined deaf-blindness. USH is genetically heterogeneous with at least 12 chromosomal loci assigned to three clinical types, USH1-3. Although these USH types exhibit similar phenotypes in human, the corresponding gene products belong to very different protein classes and families. The scaffold protein harmonin (USH1C) was shown to integrate all identified USH1 and USH2 molecules into protein networks. Here, we analyzed a protein network organized in the absence of harmonin by the scaffold proteins SANS (USH1G) and whirlin (USH2D). Immunoelectron microscopic analyses disclosed the colocalization of all network components in the apical inner segment collar and the ciliary apparatus of mammalian photoreceptor cells. In this complex, whirlin and SANS directly interact. Furthermore, SANS provides a linkage to the microtubule transport machinery, whereas whirlin may anchor USH2A isoform b and VLGR1b (very large G-protein coupled receptor 1b) via binding to their cytodomains at specific membrane domains. The long ectodomains of both transmembrane proteins extend into the gap between the adjacent membranes of the connecting cilium and the apical inner segment. Analyses of Vlgr1/del7TM mice revealed the ectodomain of VLGR1b as a component of fibrous links present in this gap. Comparative analyses of mouse and Xenopus photoreceptors demonstrated that this USH protein network is also part of the periciliary ridge complex in Xenopus. Since this structural specialization in amphibian photoreceptor cells defines a specialized membrane domain for docking and fusion of transport vesicles, we suggest a prominent role of the USH proteins in cargo shipment.
A structural analysis of the AAA+ domains in Saccharomyces cerevisiae cytoplasmic dynein.

PubMed

Gleave, Emma S; Schmidt, Helgo; Carter, Andrew P

2014-06-01

Dyneins are large protein complexes that act as microtubule based molecular motors. The dynein heavy chain contains a motor domain which is a member of the AAA+ protein family (ATPases Associated with diverse cellular Activities). Proteins of the AAA+ family show a diverse range of functionalities, but share a related core AAA+ domain, which often assembles into hexameric rings. Dynein is unusual because it has all six AAA+ domains linked together, in one long polypeptide. The dynein motor domain generates movement by coupling ATP driven conformational changes in the AAA+ ring to the swing of a motile element called the linker. Dynein binds to its microtubule track via a long antiparallel coiled-coil stalk that emanates from the AAA+ ring. Recently the first high resolution structures of the dynein motor domain were published. Here we provide a detailed structural analysis of the six AAA+ domains using our Saccharomycescerevisiae crystal structure. We describe how structural similarities in the dynein AAA+ domains suggest they share a common evolutionary origin. We analyse how the different AAA+ domains have diverged from each other. We discuss how this is related to the function of dynein as a motor protein and how the AAA+ domains of dynein compare to those of other AAA+ proteins. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Introduction to bioinformatics.

PubMed

Can, Tolga

2014-01-01

Bioinformatics is an interdisciplinary field mainly involving molecular biology and genetics, computer science, mathematics, and statistics. Data intensive, large-scale biological problems are addressed from a computational point of view. The most common problems are modeling biological processes at the molecular level and making inferences from collected data. A bioinformatics solution usually involves the following steps: Collect statistics from biological data. Build a computational model. Solve a computational modeling problem. Test and evaluate a computational algorithm. This chapter gives a brief introduction to bioinformatics by first providing an introduction to biological terminology and then discussing some classical bioinformatics problems organized by the types of data sources. Sequence analysis is the analysis of DNA and protein sequences for clues regarding function and includes subproblems such as identification of homologs, multiple sequence alignment, searching sequence patterns, and evolutionary analyses. Protein structures are three-dimensional data and the associated problems are structure prediction (secondary and tertiary), analysis of protein structures for clues regarding function, and structural alignment. Gene expression data is usually represented as matrices and analysis of microarray data mostly involves statistics analysis, classification, and clustering approaches. Biological networks such as gene regulatory networks, metabolic pathways, and protein-protein interaction networks are usually modeled as graphs and graph theoretic approaches are used to solve associated problems such as construction and analysis of large-scale networks.
Populations of the Minor α-Conformation in AcGXGNH2 and the α-Helical Nucleation Propensities

NASA Astrophysics Data System (ADS)

Zhou, Yanjun; He, Liu; Zhang, Wenwen; Hu, Jingjing; Shi, Zhengshuang

2016-06-01

Intrinsic backbone conformational preferences of different amino acids are important for understanding the local structure of unfolded protein chains. Recent evidence suggests α-structure is relatively minor among three major backbone conformations for unfolded proteins. The α-helices are the dominant structures in many proteins. For these proteins, how could the α-structures occur from the least in unfolded to the most in folded states? Populations of the minor α-conformation in model peptides provide vital information. Reliable determination of populations of the α-conformers in these peptides that exist in multiple equilibriums of different conformations remains a challenge. Combined analyses on data from AcGXPNH2 and AcGXGNH2 peptides allow us to derive the populations of PII, β and α in AcGXGNH2. Our results show that on average residue X in AcGXGNH2 adopt PII, β, and α 44.7%, 44.5% and 10.8% of time, respectively. The contents of α-conformations for different amino acids define an α-helix nucleation propensity scale. With derived PII, β and α-contents, we can construct a free energy-conformation diagram on each AcGXGNH2 in aqueous solution for the three major backbone conformations. Our results would have broad implications on early-stage events of protein folding.
Structural elucidation of estrus urinary lipocalin protein (EULP) and evaluating binding affinity with pheromones using molecular docking and fluorescence study

PubMed Central

Rajesh, Durairaj; Muthukumar, Subramanian; Saibaba, Ganesan; Siva, Durairaj; Akbarsha, Mohammad Abdulkader; Gulyás, Balázs; Padmanabhan, Parasuraman; Archunan, Govindaraju

2016-01-01

Transportation of pheromones bound with carrier proteins belonging to lipocalin superfamily is known to prolong chemo-signal communication between individuals belonging to the same species. Members of lipocalin family (MLF) proteins have three structurally conserved motifs for delivery of hydrophobic molecules to the specific recognizer. However, computational analyses are critically required to validate and emphasize the sequence and structural annotation of MLF. This study focused to elucidate the evolution, structural documentation, stability and binding efficiency of estrus urinary lipocalin protein (EULP) with endogenous pheromones adopting in-silico and fluorescence study. The results revealed that: (i) EULP perhaps originated from fatty acid binding protein (FABP) revealed in evolutionary analysis; (ii) Dynamic simulation study shows that EULP is highly stable at below 0.45 Å of root mean square deviation (RMSD); (iii) Docking evaluation shows that EULP has higher binding energy with farnesol and 2-iso-butyl-3-methoxypyrazine (IBMP) than 2-naphthol; and (iv) Competitive binding and quenching assay revealed that purified EULP has good binding interaction with farnesol. Both, In-silico and experimental studies showed that EULP is an efficient binding partner to pheromones. The present study provides impetus to create a point mutation for increasing longevity of EULP to develop pheromone trap for rodent pest management. PMID:27782155
Structural elucidation of estrus urinary lipocalin protein (EULP) and evaluating binding affinity with pheromones using molecular docking and fluorescence study.

PubMed

Rajesh, Durairaj; Muthukumar, Subramanian; Saibaba, Ganesan; Siva, Durairaj; Akbarsha, Mohammad Abdulkader; Gulyás, Balázs; Padmanabhan, Parasuraman; Archunan, Govindaraju

2016-10-26

Transportation of pheromones bound with carrier proteins belonging to lipocalin superfamily is known to prolong chemo-signal communication between individuals belonging to the same species. Members of lipocalin family (MLF) proteins have three structurally conserved motifs for delivery of hydrophobic molecules to the specific recognizer. However, computational analyses are critically required to validate and emphasize the sequence and structural annotation of MLF. This study focused to elucidate the evolution, structural documentation, stability and binding efficiency of estrus urinary lipocalin protein (EULP) with endogenous pheromones adopting in-silico and fluorescence study. The results revealed that: (i) EULP perhaps originated from fatty acid binding protein (FABP) revealed in evolutionary analysis; (ii) Dynamic simulation study shows that EULP is highly stable at below 0.45 Å of root mean square deviation (RMSD); (iii) Docking evaluation shows that EULP has higher binding energy with farnesol and 2-iso-butyl-3-methoxypyrazine (IBMP) than 2-naphthol; and (iv) Competitive binding and quenching assay revealed that purified EULP has good binding interaction with farnesol. Both, In-silico and experimental studies showed that EULP is an efficient binding partner to pheromones. The present study provides impetus to create a point mutation for increasing longevity of EULP to develop pheromone trap for rodent pest management.
A Systems Biology Approach for Identifying Hepatotoxicant Groups Based on Similarity in Mechanisms of Action and Chemical Structure.

PubMed

Hebels, Dennie G A J; Rasche, Axel; Herwig, Ralf; van Westen, Gerard J P; Jennen, Danyel G J; Kleinjans, Jos C S

2016-01-01

When evaluating compound similarity, addressing multiple sources of information to reach conclusions about common pharmaceutical and/or toxicological mechanisms of action is a crucial strategy. In this chapter, we describe a systems biology approach that incorporates analyses of hepatotoxicant data for 33 compounds from three different sources: a chemical structure similarity analysis based on the 3D Tanimoto coefficient, a chemical structure-based protein target prediction analysis, and a cross-study/cross-platform meta-analysis of in vitro and in vivo human and rat transcriptomics data derived from public resources (i.e., the diXa data warehouse). Hierarchical clustering of the outcome scores of the separate analyses did not result in a satisfactory grouping of compounds considering their known toxic mechanism as described in literature. However, a combined analysis of multiple data types may hypothetically compensate for missing or unreliable information in any of the single data types. We therefore performed an integrated clustering analysis of all three data sets using the R-based tool iClusterPlus. This indeed improved the grouping results. The compound clusters that were formed by means of iClusterPlus represent groups that show similar gene expression while simultaneously integrating a similarity in structure and protein targets, which corresponds much better with the known mechanism of action of these toxicants. Using an integrative systems biology approach may thus overcome the limitations of the separate analyses when grouping liver toxicants sharing a similar mechanism of toxicity.
Platyhelminth Venom Allergen-Like (VAL) proteins: revealing structural diversity, class-specific features and biological associations across the phylum

PubMed Central

CHALMERS, IAIN W.; HOFFMANN, KARL F.

2012-01-01

SUMMARY During platyhelminth infection, a cocktail of proteins is released by the parasite to aid invasion, initiate feeding, facilitate adaptation and mediate modulation of the host immune response. Included amongst these proteins is the Venom Allergen-Like (VAL) family, part of the larger sperm coating protein/Tpx-1/Ag5/PR-1/Sc7 (SCP/TAPS) superfamily. To explore the significance of this protein family during Platyhelminthes development and host interactions, we systematically summarize all published proteomic, genomic and immunological investigations of the VAL protein family to date. By conducting new genomic and transcriptomic interrogations to identify over 200 VAL proteins (228) from species in all 4 traditional taxonomic classes (Trematoda, Cestoda, Monogenea and Turbellaria), we further expand our knowledge related to platyhelminth VAL diversity across the phylum. Subsequent phylogenetic and tertiary structural analyses reveal several class-specific VAL features, which likely indicate a range of roles mediated by this protein family. Our comprehensive analysis of platyhelminth VALs represents a unifying synopsis for understanding diversity within this protein family and a firm context in which to initiate future functional characterization of these enigmatic members. PMID:22717097

From Structure-Function Analyses to Protein Engineering for Practical Applications of DNA Ligase

PubMed Central

Tanabe, Maiko; Nishida, Hirokazu

2015-01-01

DNA ligases are indispensable in all living cells and ubiquitous in all organs. DNA ligases are broadly utilized in molecular biology research fields, such as genetic engineering and DNA sequencing technologies. Here we review the utilization of DNA ligases in a variety of in vitro gene manipulations, developed over the past several decades. During this period, fewer protein engineering attempts for DNA ligases have been made, as compared to those for DNA polymerases. We summarize the recent progress in the elucidation of the DNA ligation mechanisms obtained from the tertiary structures solved thus far, in each step of the ligation reaction scheme. We also present some examples of engineered DNA ligases, developed from the viewpoint of their three-dimensional structures. PMID:26508902
Chemical taxonomy of the hinge-ligament proteins of bivalves according to their amino acid compositions.

PubMed Central

Kikuchi, Y; Tamiya, N

1987-01-01

The proteins in the hinge ligaments of molluscan bivalves were subjected to chemotaxonomic studies according to their amino acid compositions. The hinge-ligament protein is a new class of structure proteins, and this is the first attempt to introduce chemical taxonomy into the systematics of bivalves. The hinge-ligament proteins from morphologically close species, namely mactra (superfamily Mactracea) or scallop (family Pectinidae) species, showed high intraspecific homology in their compositions. On the other hand, inconsistent results were obtained with two types of ligament proteins in pearl oyster species (genus Pinctada). The results of our chemotaxonomic analyses were sometimes in good agreement with the morphological classifications and sometimes inconsistent, implying a complicated phylogenetic relationship among the species. PMID:3593265
UQlust: combining profile hashing with linear-time ranking for efficient clustering and analysis of big macromolecular data.

PubMed

Adamczak, Rafal; Meller, Jarek

2016-12-28

Advances in computing have enabled current protein and RNA structure prediction and molecular simulation methods to dramatically increase their sampling of conformational spaces. The quickly growing number of experimentally resolved structures, and databases such as the Protein Data Bank, also implies large scale structural similarity analyses to retrieve and classify macromolecular data. Consequently, the computational cost of structure comparison and clustering for large sets of macromolecular structures has become a bottleneck that necessitates further algorithmic improvements and development of efficient software solutions. uQlust is a versatile and easy-to-use tool for ultrafast ranking and clustering of macromolecular structures. uQlust makes use of structural profiles of proteins and nucleic acids, while combining a linear-time algorithm for implicit comparison of all pairs of models with profile hashing to enable efficient clustering of large data sets with a low memory footprint. In addition to ranking and clustering of large sets of models of the same protein or RNA molecule, uQlust can also be used in conjunction with fragment-based profiles in order to cluster structures of arbitrary length. For example, hierarchical clustering of the entire PDB using profile hashing can be performed on a typical laptop, thus opening an avenue for structural explorations previously limited to dedicated resources. The uQlust package is freely available under the GNU General Public License at https://github.com/uQlust . uQlust represents a drastic reduction in the computational complexity and memory requirements with respect to existing clustering and model quality assessment methods for macromolecular structure analysis, while yielding results on par with traditional approaches for both proteins and RNAs.
Structural and functional analyses of genes encoding VQ proteins in apple.

PubMed

Dong, Qinglong; Zhao, Shuang; Duan, Dingyue; Tian, Yi; Wang, Yanpeng; Mao, Ke; Zhou, Zongshan; Ma, Fengwang

2018-07-01

Recent studies with Arabidopsis and soybean have shown that a class of valine-glutamine (VQ) motif-containing proteins interacts with some WRKY transcription factors. However, little is known about the evolution, structures, and functions of those proteins in apple. Here, we examined their features and identified 49 apple VQ genes. Our evolutional analysis revealed that the proteins could be clustered into nine groups together with their homologues in 33 species. Historically, the main characteristics of proteins in Groups I, V, VI, VII, IX, and X were thought to have been generated before the monocot-dicot split, whereas those in Groups II, III + IV, and VIII were generated after that split. In the structural analysis, apple MdVQ proteins appeared to bind only with Group I and IIc MdWRKY proteins. Meanwhile, MdVQ1, MdVQ10, MdVQ15, and MdVQ36 interacted with multiple MdVQ proteins to form heterodimers but MdVQ15 formed a homodimer. The functional analysis indicated that overexpression of some apple MdVQs in Arabidopsis and tobacco plants effected their vegetative and reproductive growth. These results provide important information about the characteristics of apple MdVQ genes and can serve as a solid foundation for further studies about the role of WRKY-VQ interactions in regulating apple developmental and defense mechanisms. Copyright © 2018 Elsevier B.V. All rights reserved.
Network biology discovers pathogen contact points in host protein-protein interactomes.

PubMed

Ahmed, Hadia; Howton, T C; Sun, Yali; Weinberger, Natascha; Belkhadir, Youssef; Mukhtar, M Shahid

2018-06-13

In all organisms, major biological processes are controlled by complex protein-protein interactions networks (interactomes), yet their structural complexity presents major analytical challenges. Here, we integrate a compendium of over 4300 phenotypes with Arabidopsis interactome (AI-1 MAIN ). We show that nodes with high connectivity and betweenness are enriched and depleted in conditional and essential phenotypes, respectively. Such nodes are located in the innermost layers of AI-1 MAIN and are preferential targets of pathogen effectors. We extend these network-centric analyses to Cell Surface Interactome (CSI LRR ) and predict its 35 most influential nodes. To determine their biological relevance, we show that these proteins physically interact with pathogen effectors and modulate plant immunity. Overall, our findings contrast with centrality-lethality rule, discover fast information spreading nodes, and highlight the structural properties of pathogen targets in two different interactomes. Finally, this theoretical framework could possibly be applicable to other inter-species interactomes to reveal pathogen contact points.
Structural insight into GRIP1-PDZ6 in Alzheimer's disease: study from protein expression data to molecular dynamics simulations.

PubMed

Chatterjee, Paulami; Roy, Debjani

2017-08-01

Protein-protein interaction domain, PDZ, plays a critical role in efficient synaptic transmission in brain. Dysfunction of synaptic transmission is thought to be the underlying basis of many neuropsychiatric and neurodegenerative disorders including Alzheimer's disease (AD). In this study, Glutamate Receptor Interacting Protein1 (GRIP1) was identified as one of the most important differentially expressed, topologically significant proteins in the protein-protein interaction network. To date, very few studies have analyzed the detailed structural basis of PDZ-mediated protein interaction of GRIP1. In order to gain better understanding of structural and dynamic basis of these interactions, we employed molecular dynamics (MD) simulations of GRIP1-PDZ6 dimer bound with Liprin-alpha and GRIP1-PDZ6 dimer alone each with 100 ns simulations. The analyses of MD simulations of Liprin-alpha bound GRIP1-PDZ6 dimer show considerable conformational differences than that of peptide-free dimer in terms of SASA, hydrogen bonding patterns, and along principal component 1 (PC1). Our study also furnishes insight into the structural attunement of the PDZ6 domains of Liprin-alpha bound GRIP1 that is attributed by significant shift of the Liprin-alpha recognition helix in the simulated peptide-bound dimer compared to the crystal structure and simulated peptide-free dimer. It is evident that PDZ6 domains of peptide-bound dimer show differential movements along PC1 than that of peptide-free dimers. Thus, Liprin-alpha also serves an important role in conferring conformational changes along the dimeric interface of the peptide-bound dimer. Results reported here provide information that may lead to novel therapeutic approaches in AD.
Metals in proteins: correlation between the metal-ion type, coordination number and the amino-acid residues involved in the coordination.

PubMed

Dokmanić, Ivan; Sikić, Mile; Tomić, Sanja

2008-03-01

Metal ions are constituents of many metalloproteins, in which they have either catalytic (metalloenzymes) or structural functions. In this work, the characteristics of various metals were studied (Cu, Zn, Mg, Mn, Fe, Co, Ni, Cd and Ca in proteins with known crystal structure) as well as the specificity of their environments. The analysis was performed on two data sets: the set of protein structures in the Protein Data Bank (PDB) determined with resolution <1.5 A and the set of nonredundant protein structures from the PDB. The former was used to determine the distances between each metal ion and its electron donors and the latter was used to assess the preferred coordination numbers and common combinations of amino-acid residues in the neighbourhood of each metal. Although the metal ions considered predominantly had a valence of two, their preferred coordination number and the type of amino-acid residues that participate in the coordination differed significantly from one metal ion to the next. This study concentrates on finding the specificities of a metal-ion environment, namely the distribution of coordination numbers and the amino-acid residue types that frequently take part in coordination. Furthermore, the correlation between the coordination number and the occurrence of certain amino-acid residues (quartets and triplets) in a metal-ion coordination sphere was analysed. The results obtained are of particular value for the identification and modelling of metal-binding sites in protein structures derived by homology modelling. Knowledge of the geometry and characteristics of the metal-binding sites in metalloproteins of known function can help to more closely determine the biological activity of proteins of unknown function and to aid in design of proteins with specific affinity for certain metals.
Structural and functional features of lysine acetylation of plant and animal tubulins.

PubMed

Rayevsky, Alexey V; Sharifi, Mohsen; Samofalova, Dariya A; Karpov, Pavel A; Blume, Yaroslav B

2017-10-10

The study of the genome and the proteome of different species and representatives of distinct kingdoms, especially detection of proteome via wide-scaled analyses has various challenges and pitfalls. Attempts to combine all available information together and isolate some common features for determination of the pathway and their mechanism of action generally have a highly complicated nature. However, microtubule (MT) monomers are highly conserved protein structures, and microtubules are structurally conserved from Homo sapiens to Arabidopsis thaliana. The interaction of MT elements with microtubule-associated proteins and post-translational modifiers is fully dependent on protein interfaces, and almost all MT modifications are well described except acetylation. Crystallography and interactome data using different approaches were combined to identify conserved proteins important in acetylation of microtubules. Application of computational methods and comparative analysis of binding modes generated a robust predictive model of acetylation of the ϵ-amino group of Lys40 in α-tubulins. In turn, the model discarded some probable mechanisms of interaction between elements of interest. Reconstruction of unresolved protein structures was carried out with modeling by homology to the existing crystal structure (PDBID: 1Z2B) from B. taurus using Swiss-model server, followed by a molecular dynamics simulation. Docking of the human tubulin fragment with Lys40 into the active site of α-tubulin acetyltransferase, reproduces the binding mode of peptidomimetic from X-ray structure (PDBID: 4PK3). © 2017 International Federation for Cell Biology.
Chlamydia trachomatis protein CT009 is a structural and functional homolog to the key morphogenesis component RodZ and interacts with division septal plane localized MreB

DOE PAGES

Kemege, Kyle E.; Hickey, John M.; Barta, Michael L.; ...

2014-11-10

Cell division in Chlamydiae is poorly understood as apparent homologs to most conserved bacterial cell division proteins are lacking and presence of elongation (rod shape) associated proteins indicate non-canonical mechanisms may be employed. The rod-shape determining protein MreB has been proposed as playing a unique role in chlamydial cell division. In other organisms, MreB is part of an elongation complex that requires RodZ for proper function. A recent study reported that the protein encoded by ORF CT009 interacts with MreB despite low sequence similarity to RodZ. The studies in this paper expand on those observations through protein structure, mutagenesis andmore » cellular localization analyses. Structural analysis indicated that CT009 shares high level of structural similarity to RodZ, revealing the conserved orientation of two residues critical for MreB interaction. Substitutions eliminated MreB protein interaction and partial complementation provided by CT009 in RodZ deficient Escherichia coli. Cellular localization analysis of CT009 showed uniform membrane staining in Chlamydia. This was in contrast to the localization of MreB, which was restricted to predicted septal planes. Finally, MreB localization to septal planes provides direct experimental observation for the role of MreB in cell division and supports the hypothesis that it serves as a functional replacement for FtsZ in Chlamydia.« less
Chlamydia trachomatis protein CT009 is a structural and functional homolog to the key morphogenesis component RodZ and interacts with division septal plane localized MreB

PubMed Central

Kemege, Kyle E.; Hickey, John M.; Barta, Michael L.; Wickstrum, Jason; Balwalli, Namita; Lovell, Scott; Battaile, Kevin P.; Hefty, P. Scott

2015-01-01

Summary Cell division in Chlamydiae is poorly understood as apparent homologs to most conserved bacterial cell division proteins are lacking and presence of elongation (rod shape) associated proteins indicate non-canonical mechanisms may be employed. The rod-shape determining protein MreB has been proposed as playing a unique role in chlamydial cell division. In other organisms, MreB is part of an elongation complex that requires RodZ for proper function. A recent study reported that the protein encoded by ORF CT009 interacts with MreB despite low sequence similarity to RodZ. The studies herein expand on those observations through protein structure, mutagenesis, and cellular localization analyses. Structural analysis indicated that CT009 shares high level of structural similarity to RodZ, revealing the conserved orientation of two residues critical for MreB interaction. Substitutions eliminated MreB protein interaction and partial complementation provided by CT009 in RodZ deficient E. coli. Cellular localization analysis of CT009 showed uniform membrane staining in Chlamydia. This was in contrast to the localization of MreB, which was restricted to predicted septal planes. MreB localization to septal planes provides direct experimental observation for the role of MreB in cell division and supports the hypothesis that it serves as a functional replacement for FtsZ in Chlamydia. PMID:25382739
Chlamydia trachomatis protein CT009 is a structural and functional homolog to the key morphogenesis component RodZ and interacts with division septal plane localized MreB.

PubMed

Kemege, Kyle E; Hickey, John M; Barta, Michael L; Wickstrum, Jason; Balwalli, Namita; Lovell, Scott; Battaile, Kevin P; Hefty, P Scott

2015-02-01

Cell division in Chlamydiae is poorly understood as apparent homologs to most conserved bacterial cell division proteins are lacking and presence of elongation (rod shape) associated proteins indicate non-canonical mechanisms may be employed. The rod-shape determining protein MreB has been proposed as playing a unique role in chlamydial cell division. In other organisms, MreB is part of an elongation complex that requires RodZ for proper function. A recent study reported that the protein encoded by ORF CT009 interacts with MreB despite low sequence similarity to RodZ. The studies herein expand on those observations through protein structure, mutagenesis and cellular localization analyses. Structural analysis indicated that CT009 shares high level of structural similarity to RodZ, revealing the conserved orientation of two residues critical for MreB interaction. Substitutions eliminated MreB protein interaction and partial complementation provided by CT009 in RodZ deficient Escherichia coli. Cellular localization analysis of CT009 showed uniform membrane staining in Chlamydia. This was in contrast to the localization of MreB, which was restricted to predicted septal planes. MreB localization to septal planes provides direct experimental observation for the role of MreB in cell division and supports the hypothesis that it serves as a functional replacement for FtsZ in Chlamydia. © 2014 John Wiley & Sons Ltd.
Crystallographic and Computational Analyses of AUUCU Repeating RNA That Causes Spinocerebellar Ataxia Type 10 (SCA10).

PubMed

Park, HaJeung; González, Àlex L; Yildirim, Ilyas; Tran, Tuan; Lohman, Jeremy R; Fang, Pengfei; Guo, Min; Disney, Matthew D

2015-06-23

Spinocerebellar ataxia type 10 (SCA10) is caused by a pentanucleotide repeat expansion of r(AUUCU) within intron 9 of the ATXN10 pre-mRNA. The RNA causes disease by a gain-of-function mechanism in which it inactivates proteins involved in RNA biogenesis. Spectroscopic studies showed that r(AUUCU) repeats form a hairpin structure; however, there were no high-resolution structural models prior to this work. Herein, we report the first crystal structure of model r(AUUCU) repeats refined to 2.8 Å and analysis of the structure via molecular dynamics simulations. The r(AUUCU) tracts adopt an overall A-form geometry in which 3 × 3 nucleotide (5')UCU(3')/(3')UCU(5') internal loops are closed by AU pairs. Helical parameters of the refined structure as well as the corresponding electron density map on the crystallographic model reflect dynamic features of the internal loop. The computational analyses captured dynamic motion of the loop closing pairs, which can form single-stranded conformations with relatively low energies. Overall, the results presented here suggest the possibility for r(AUUCU) repeats to form metastable A-from structures, which can rearrange into single-stranded conformations and attract proteins such as heterogeneous nuclear ribonucleoprotein K (hnRNP K). The information presented here may aid in the rational design of therapeutics targeting this RNA.
Crystallographic and Computational Analyses of AUUCU Repeating RNA That Causes Spinocerebellar Ataxia Type 10 (SCA10)

PubMed Central

Park, HaJeung; González, Àlex L.; Yildirim, Ilyas; Tran, Tuan; Lohman, Jeremy R.; Fang, Pengfei; Guo, Min; Disney, Matthew D.

2016-01-01

Spinocerebellar ataxia type 10 (SCA10) is caused by a pentanucleotide repeat expansion of r(AUUCU) within intron 9 of the ATXN10 pre-mRNA. The RNA causes disease by a gain-of-function mechanism in which it inactivates proteins involved in RNA biogenesis. Spectroscopic studies showed that r(AUUCU) repeats form a hairpin structure; however, there were no high-resolution structural models prior to this work. Herein, we report the first crystal structure of model r(AUUCU) repeats refined to 2.8 Å and analysis of the structure via molecular dynamics simulations. The r(AUUCU) tracts adopt an overall A-form geometry in which 3 × 3 nucleotide 5′UCU3′/3′UCU5′ internal loops are closed by AU pairs. Helical parameters of the refined structure as well as the corresponding electron density map on the crystallographic model reflect dynamic features of the internal loop. The computational analyses captured dynamic motion of the loop closing pairs, which can form single-stranded conformations with relatively low energies. Overall, the results presented here suggest the possibility for r(AUUCU) repeats to form metastable A-from structures, which can rearrange into single-stranded conformations and attract proteins such as heterogeneous nuclear ribonucleoprotein K (hnRNP K). The information presented here may aid in the rational design of therapeutics targeting this RNA. PMID:26039897
Structure of a Novel O-Linked N-Acetyl-d-glucosamine (O-GlcNAc) Transferase, GtfA, Reveals Insights into the Glycosylation of Pneumococcal Serine-rich Repeat Adhesins*

PubMed Central

Shi, Wei-Wei; Jiang, Yong-Liang; Zhu, Fan; Yang, Yi-Hu; Shao, Qiu-Yan; Yang, Hong-Bo; Ren, Yan-Min; Wu, Hui; Chen, Yuxing; Zhou, Cong-Zhao

2014-01-01

Protein glycosylation catalyzed by the O-GlcNAc transferase (OGT) plays a critical role in various biological processes. In Streptococcus pneumoniae, the core enzyme GtfA and co-activator GtfB form an OGT complex to glycosylate the serine-rich repeat (SRR) of adhesin PsrP (pneumococcal serine-rich repeat protein), which is involved in the infection and pathogenesis. Here we report the 2.0 Å crystal structure of GtfA, revealing a β-meander add-on domain beyond the catalytic domain. It represents a novel add-on domain, which is distinct from the all-α-tetratricopeptide repeats in the only two structure-known OGTs. Structural analyses combined with binding assays indicate that this add-on domain contributes to forming an active GtfA-GtfB complex and recognizing the acceptor protein. In addition, the in vitro glycosylation system enables us to map the O-linkages to the serine residues within the first SRR of PsrP. These findings suggest that fusion with an add-on domain might be a universal mechanism for diverse OGTs that recognize varying acceptor proteins/peptides. PMID:24936067
Integrated proteomic and transcriptomic analysis of the Aedes aegypti eggshell

PubMed Central

2014-01-01

Background Mosquito eggshells show remarkable diversity in physical properties and structure consistent with adaptations to the wide variety of environments exploited by these insects. We applied proteomic, transcriptomic, and hybridization in situ techniques to identify gene products and pathways that participate in the assembly of the Aedes aegypti eggshell. Aedes aegypti population density is low during cold and dry seasons and increases immediately after rainfall. The survival of embryos through unfavorable periods is a key factor in the persistence of their populations. The work described here supports integrated vector control approaches that target eggshell formation and result in Ae. aegypti drought-intolerant phenotypes for public health initiatives directed to reduce mosquito-borne diseases. Results A total of 130 proteins were identified from the combined mass spectrometric analyses of eggshell preparations. Conclusions Classification of proteins according to their known and putative functions revealed the complexity of the eggshell structure. Three novel Ae. aegypti vitelline membrane proteins were discovered. Odorant-binding and cysteine-rich proteins that may be structural components of the eggshell were identified. Enzymes with peroxidase, laccase and phenoloxidase activities also were identified, and their likely involvements in cross-linking reactions that stabilize the eggshell structure are discussed. PMID:24707823
Deconstructing thermodynamic parameters of a coupled system from site-specific observables.

PubMed

Chowdhury, Sandipan; Chanda, Baron

2010-11-02

Cooperative interactions mediate information transfer between structural domains of a protein molecule and are major determinants of protein function and modulation. The prevalent theories to understand the thermodynamic origins of cooperativity have been developed to reproduce the complex behavior of a global thermodynamic observable such as ligand binding or enzyme activity. However, in most cases the measurement of a single global observable cannot uniquely define all the terms that fully describe the energetics of the system. Here we establish a theoretical groundwork for analyzing protein thermodynamics using site-specific information. Our treatment involves extracting a site-specific parameter (defined as χ value) associated with a structural unit. We demonstrate that, under limiting conditions, the χ value is related to the direct interaction terms associated with the structural unit under observation and its intrinsic activation energy. We also introduce a site-specific interaction energy term (χ(diff)) that is a function of the direct interaction energy of that site with every other site in the system. When combined with site-directed mutagenesis and other molecular level perturbations, analyses of χ values of site-specific observables may provide valuable insights into protein thermodynamics and structure.
Structural Basis for Antifreeze Activity of Ice-binding Protein from Arctic Yeast*

PubMed Central

Lee, Jun Hyuck; Park, Ae Kyung; Do, Hackwon; Park, Kyoung Sun; Moh, Sang Hyun; Chi, Young Min; Kim, Hak Jun

2012-01-01

Arctic yeast Leucosporidium sp. produces a glycosylated ice-binding protein (LeIBP) with a molecular mass of ∼25 kDa, which can lower the freezing point below the melting point once it binds to ice. LeIBP is a member of a large class of ice-binding proteins, the structures of which are unknown. Here, we report the crystal structures of non-glycosylated LeIBP and glycosylated LeIBP at 1.57- and 2.43-Å resolution, respectively. Structural analysis of the LeIBPs revealed a dimeric right-handed β-helix fold, which is composed of three parts: a large coiled structural domain, a long helix region (residues 96–115 form a long α-helix that packs along one face of the β-helix), and a C-terminal hydrophobic loop region (243PFVPAPEVV251). Unexpectedly, the C-terminal hydrophobic loop region has an extended conformation pointing away from the body of the coiled structural domain and forms intertwined dimer interactions. In addition, structural analysis of glycosylated LeIBP with sugar moieties attached to Asn185 provides a basis for interpreting previous biochemical analyses as well as the increased stability and secretion of glycosylated LeIBP. We also determined that the aligned Thr/Ser/Ala residues are critical for ice binding within the B face of LeIBP using site-directed mutagenesis. Although LeIBP has a common β-helical fold similar to that of canonical hyperactive antifreeze proteins, the ice-binding site is more complex and does not have a simple ice-binding motif. In conclusion, we could identify the ice-binding site of LeIBP and discuss differences in the ice-binding modes compared with other known antifreeze proteins and ice-binding proteins. PMID:22303017
Subnanometre-resolution structure of the doublet microtubule reveals new classes of microtubule-associated proteins

PubMed Central

Ichikawa, Muneyoshi; Liu, Dinan; Kastritis, Panagiotis L.; Basu, Kaustuv; Hsu, Tzu Chin; Yang, Shunkai; Bui, Khanh Huy

2017-01-01

Cilia are ubiquitous, hair-like appendages found in eukaryotic cells that carry out functions of cell motility and sensory reception. Cilia contain an intriguing cytoskeletal structure, termed the axoneme that consists of nine doublet microtubules radially interlinked and longitudinally organized in multiple specific repeat units. Little is known, however, about how the axoneme allows cilia to be both actively bendable and sturdy or how it is assembled. To answer these questions, we used cryo-electron microscopy to structurally analyse several of the repeating units of the doublet at sub-nanometre resolution. This structural detail enables us to unambiguously assign α- and β-tubulins in the doublet microtubule lattice. Our study demonstrates the existence of an inner sheath composed of different kinds of microtubule inner proteins inside the doublet that likely stabilizes the structure and facilitates the specific building of the B-tubule. PMID:28462916
Energetic frustrations in protein folding at residue resolution: a homologous simulation study of Im9 proteins.

PubMed

Sun, Yunxiang; Ming, Dengming

2014-01-01

Energetic frustration is becoming an important topic for understanding the mechanisms of protein folding, which is a long-standing big biological problem usually investigated by the free energy landscape theory. Despite the significant advances in probing the effects of folding frustrations on the overall features of protein folding pathways and folding intermediates, detailed characterizations of folding frustrations at an atomic or residue level are still lacking. In addition, how and to what extent folding frustrations interact with protein topology in determining folding mechanisms remains unclear. In this paper, we tried to understand energetic frustrations in the context of protein topology structures or native-contact networks by comparing the energetic frustrations of five homologous Im9 alpha-helix proteins that share very similar topology structures but have a single hydrophilic-to-hydrophobic mutual mutation. The folding simulations were performed using a coarse-grained Gō-like model, while non-native hydrophobic interactions were introduced as energetic frustrations using a Lennard-Jones potential function. Energetic frustrations were then examined at residue level based on φ-value analyses of the transition state ensemble structures and mapped back to native-contact networks. Our calculations show that energetic frustrations have highly heterogeneous influences on the folding of the four helices of the examined structures depending on the local environment of the frustration centers. Also, the closer the introduced frustration is to the center of the native-contact network, the larger the changes in the protein folding. Our findings add a new dimension to the understanding of protein folding the topology determination in that energetic frustrations works closely with native-contact networks to affect the protein folding.
Calponin-Like Chd64 Is Partly Disordered

PubMed Central

Jakób, Michał; Szpotkowski, Kamil; Wojtas, Magdalena; Rymarczyk, Grzegorz; Ożyhar, Andrzej

2014-01-01

20-hydroxyecdysone (20E) and juvenile hormone (JH) signaling pathways interact to regulate insect development. Recently, two proteins, a calponin-like Chd64 and immunophilin FKBP39 have been found to play a pivotal role in the cross-talk between 20E and JH, although the molecular basis of interaction remains unknown. The aim of this work was to identify the structural features that would provide understanding of the role of Chd64 in multiple and dynamic complex that cross-links the signaling pathways. Here, we demonstrate the results of in silico and in vitro analyses of the structural organization of Chd64 from Drosophila melanogaster and its homologue from Tribolium castaneum. Computational analysis predicted the existence of disordered regions on the termini of both proteins, while the central region appeared to be globular, probably corresponding to the calponin homology (CH) domain. In vitro analyses of the hydrodynamic properties of the proteins from analytical size-exclusion chromatography and analytical ultracentrifugation revealed that DmChd64 and TcChd64 had an asymmetrical, elongated shape, which was further confirmed by small angle X-ray scattering (SAXS). The Kratky plot indicated disorderness in both Chd64 proteins, which could possibly be on the protein termini and which would give rise to specific hydrodynamic properties. Disordered tails are often involved in diverse interactions. Therefore, it is highly possible that there are intrinsically disordered regions (IDRs) on both termini of the Chd64 proteins that serve as platforms for multiple interaction with various partners and constitute the foundation for their regulatory function. PMID:24805353

Multifunctionality and diversity of GDSL esterase/lipase gene family in rice (Oryza sativa L. japonica) genome: new insights from bioinformatics analysis

PubMed Central

2012-01-01

Background GDSL esterases/lipases are a newly discovered subclass of lipolytic enzymes that are very important and attractive research subjects because of their multifunctional properties, such as broad substrate specificity and regiospecificity. Compared with the current knowledge regarding these enzymes in bacteria, our understanding of the plant GDSL enzymes is very limited, although the GDSL gene family in plant species include numerous members in many fully sequenced plant genomes. Only two genes from a large rice GDSL esterase/lipase gene family were previously characterised, and the majority of the members remain unknown. In the present study, we describe the rice OsGELP (Oryza sativa GDSL esterase/lipase protein) gene family at the genomic and proteomic levels, and use this knowledge to provide insights into the multifunctionality of the rice OsGELP enzymes. Results In this study, an extensive bioinformatics analysis identified 114 genes in the rice OsGELP gene family. A complete overview of this family in rice is presented, including the chromosome locations, gene structures, phylogeny, and protein motifs. Among the OsGELPs and the plant GDSL esterase/lipase proteins of known functions, 41 motifs were found that represent the core secondary structure elements or appear specifically in different phylogenetic subclades. The specification and distribution of identified putative conserved clade-common and -specific peptide motifs, and their location on the predicted protein three dimensional structure may possibly signify their functional roles. Potentially important regions for substrate specificity are highlighted, in accordance with protein three-dimensional model and location of the phylogenetic specific conserved motifs. The differential expression of some representative genes were confirmed by quantitative real-time PCR. The phylogenetic analysis, together with protein motif architectures, and the expression profiling were analysed to predict the possible biological functions of the rice OsGELP genes. Conclusions Our current genomic analysis, for the first time, presents fundamental information on the organization of the rice OsGELP gene family. With combination of the genomic, phylogenetic, microarray expression, protein motif distribution, and protein structure analyses, we were able to create supported basis for the functional prediction of many members in the rice GDSL esterase/lipase family. The present study provides a platform for the selection of candidate genes for further detailed functional study. PMID:22793791
A recruiting protein of geranylgeranyl diphosphate synthase controls metabolic flux toward chlorophyll biosynthesis in rice.

PubMed

Zhou, Fei; Wang, Cheng-Yuan; Gutensohn, Michael; Jiang, Ling; Zhang, Peng; Zhang, Dabing; Dudareva, Natalia; Lu, Shan

2017-06-27

In plants, geranylgeranyl diphosphate (GGPP) is produced by plastidic GGPP synthase (GGPPS) and serves as a precursor for vital metabolic branches, including chlorophyll, carotenoid, and gibberellin biosynthesis. However, molecular mechanisms regulating GGPP allocation among these biosynthetic pathways localized in the same subcellular compartment are largely unknown. We found that rice contains only one functionally active GGPPS, OsGGPPS1, in chloroplasts. A functionally active homodimeric enzyme composed of two OsGGPPS1 subunits is located in the stroma. In thylakoid membranes, however, the GGPPS activity resides in a heterodimeric enzyme composed of one OsGGPPS1 subunit and GGPPS recruiting protein (OsGRP). OsGRP is structurally most similar to members of the geranyl diphosphate synthase small subunit type II subfamily. In contrast to members of this subfamily, OsGRP enhances OsGGPPS1 catalytic efficiency and specificity of GGPP production on interaction with OsGGPPS1. Structural biology and protein interaction analyses demonstrate that affinity between OsGRP and OsGGPPS1 is stronger than between two OsGGPPS1 molecules in homodimers. OsGRP determines OsGGPPS1 suborganellar localization and directs it to a large protein complex in thylakoid membranes, consisting of geranylgeranyl reductase (OsGGR), light-harvesting-like protein 3 (OsLIL3), protochlorophyllide oxidoreductase (OsPORB), and chlorophyll synthase (OsCHLG). Taken together, genetic and biochemical analyses suggest OsGRP functions in recruiting OsGGPPS1 from the stroma toward thylakoid membranes, thus providing a mechanism to control GGPP flux toward chlorophyll biosynthesis.
Tracking the Fragile X Mental Retardation Protein in a Highly Ordered Neuronal RiboNucleoParticles Population: A Link between Stalled Polyribosomes and RNA Granules.

PubMed

El Fatimy, Rachid; Davidovic, Laetitia; Tremblay, Sandra; Jaglin, Xavier; Dury, Alain; Robert, Claude; De Koninck, Paul; Khandjian, Edouard W

2016-07-01

Local translation at the synapse plays key roles in neuron development and activity-dependent synaptic plasticity. mRNAs are translocated from the neuronal soma to the distant synapses as compacted ribonucleoparticles referred to as RNA granules. These contain many RNA-binding proteins, including the Fragile X Mental Retardation Protein (FMRP), the absence of which results in Fragile X Syndrome, the most common inherited form of intellectual disability and the leading genetic cause of autism. Using FMRP as a tracer, we purified a specific population of RNA granules from mouse brain homogenates. Protein composition analyses revealed a strong relationship between polyribosomes and RNA granules. However, the latter have distinct architectural and structural properties, since they are detected as close compact structures as observed by electron microscopy, and converging evidence point to the possibility that these structures emerge from stalled polyribosomes. Time-lapse video microscopy indicated that single granules merge to form cargoes that are transported from the soma to distal locations. Transcriptomic analyses showed that a subset of mRNAs involved in cytoskeleton remodelling and neural development is selectively enriched in RNA granules. One third of the putative mRNA targets described for FMRP appear to be transported in granules and FMRP is more abundant in granules than in polyribosomes. This observation supports a primary role for FMRP in granules biology. Our findings open new avenues for the study of RNA granule dysfunctions in animal models of nervous system disorders, such as Fragile X syndrome.
Structural prerequisites for G-protein activation by the neurotensin receptor

DOE PAGES

Krumm, Brian E.; White, Jim F.; Shah, Priyanka; ...

2015-07-24

We previously determined the structure of neurotensin receptor NTSR1 in an active-like conformation with six thermostabilizing mutations bound to the peptide agonist neurotensin. This receptor was unable to activate G proteins, indicating that the mutations restricted NTSR1 to relate agonist binding to G-protein activation. Here we analyse the effect of three of those mutations (E166A 3.49, L310A 6.37, F358A 7.42) and present two structures of NTSR1 able to catalyse nucleotide exchange at Gα. The presence of F358 7.42 causes the conserved W321 6.48 to adopt a side chain orientation parallel to the lipid bilayer sealing the collapsed Na+ ion pocketmore » and linking the agonist with residues in the lower receptor part implicated in GPCR activation. In the intracellular receptor half, the bulkier L310 6.37 side chain dictates the position of R167 3.50 of the highly conserved D/ERY motif. These residues, together with the presence of E166 3.49 provide determinants for G-protein activation by NTSR1.« less
A comparative study of the N-linked oligosaccharide structures of human IgG subclass proteins.

PubMed Central

Jefferis, R; Lund, J; Mizutani, H; Nakagawa, H; Kawazoe, Y; Arata, Y; Takahashi, N

1990-01-01

Quantitative oligosaccharide profiles were determined for each of 18 human IgG paraproteins representing the four subclasses. Each paraprotein exhibits a unique profile that may be substantially different from that observed for polyclonal IgG. The IgG2 and some IgG3 proteins analysed exhibit a predominance of oligosaccharide moieties having galactose on the Man(alpha 1----3) arm rather than the Man(alpha 1----6) arm; it was previously held that galactosylation of the Man(alpha 1----6) arm is preferred, as observed for IgG1, IgG4 and polyclonal IgG. An IgG4 protein is reported that has galactosylated Man(alpha 1----3) and Man(alpha 1----6) arms on both Fc-localized carbohydrate moieties; previous findings suggested that such fully glycosylated structures could not be accommodated within the internal space of the C gamma 2 domains. Unusual monoantennary oligosaccharides present in IgG2 and IgG3 proteins were isolated and their structures determined. Images Fig. 1. PMID:2363690
Structural prerequisites for G-protein activation by the neurotensin receptor

PubMed Central

Krumm, Brian E.; White, Jim F.; Shah, Priyanka; Grisshammer, Reinhard

2015-01-01

We previously determined the structure of neurotensin receptor NTSR1 in an active-like conformation with six thermostabilizing mutations bound to the peptide agonist neurotensin. This receptor was unable to activate G proteins, indicating that the mutations restricted NTSR1 to relate agonist binding to G-protein activation. Here we analyse the effect of three of those mutations (E166A3.49, L310A6.37, F358A7.42) and present two structures of NTSR1 able to catalyse nucleotide exchange at Gα. The presence of F3587.42 causes the conserved W3216.48 to adopt a side chain orientation parallel to the lipid bilayer sealing the collapsed Na+ ion pocket and linking the agonist with residues in the lower receptor part implicated in GPCR activation. In the intracellular receptor half, the bulkier L3106.37 side chain dictates the position of R1673.50 of the highly conserved D/ERY motif. These residues, together with the presence of E1663.49 provide determinants for G-protein activation by NTSR1. PMID:26205105
Structural prerequisites for G-protein activation by the neurotensin receptor

DOE Office of Scientific and Technical Information (OSTI.GOV)

Krumm, Brian E.; White, Jim F.; Shah, Priyanka

We previously determined the structure of neurotensin receptor NTSR1 in an active-like conformation with six thermostabilizing mutations bound to the peptide agonist neurotensin. This receptor was unable to activate G proteins, indicating that the mutations restricted NTSR1 to relate agonist binding to G-protein activation. Here we analyse the effect of three of those mutations (E166A 3.49, L310A 6.37, F358A 7.42) and present two structures of NTSR1 able to catalyse nucleotide exchange at Gα. The presence of F358 7.42 causes the conserved W321 6.48 to adopt a side chain orientation parallel to the lipid bilayer sealing the collapsed Na+ ion pocketmore » and linking the agonist with residues in the lower receptor part implicated in GPCR activation. In the intracellular receptor half, the bulkier L310 6.37 side chain dictates the position of R167 3.50 of the highly conserved D/ERY motif. These residues, together with the presence of E166 3.49 provide determinants for G-protein activation by NTSR1.« less
Zebavidin - An Avidin-Like Protein from Zebrafish

PubMed Central

Taskinen, Barbara; Zmurko, Joanna; Ojanen, Markus; Kukkurainen, Sampo; Parthiban, Marimuthu; Määttä, Juha A. E.; Leppiniemi, Jenni; Jänis, Janne; Parikka, Mataleena; Turpeinen, Hannu; Rämet, Mika; Pesu, Marko; Johnson, Mark S.; Kulomaa, Markku S.; Airenne, Tomi T.; Hytönen, Vesa P.

2013-01-01

The avidin protein family members are well known for their high affinity towards D-biotin and high structural stability. These properties make avidins valuable tools for a wide range of biotechnology applications. We have identified a new member of the avidin family in the zebrafish (Danio rerio) genome, hereafter called zebavidin. The protein is highly expressed in the gonads of both male and female zebrafish and in the gills of male fish, but our data suggest that zebavidin is not crucial for the developing embryo. Biophysical and structural characterisation of zebavidin revealed distinct properties not found in any previously characterised avidins. Gel filtration chromatography and native mass spectrometry suggest that the protein forms dimers in the absence of biotin at low ionic strength, but assembles into tetramers upon binding biotin. Ligand binding was analysed using radioactive and fluorescently labelled biotin and isothermal titration calorimetry. Moreover, the crystal structure of zebavidin in complex with biotin was solved at 2.4 Å resolution and unveiled unique ligand binding and subunit interface architectures; the atomic-level details support our physicochemical observations. PMID:24204770
PG1058 Is a Novel Multidomain Protein Component of the Bacterial Type IX Secretion System

PubMed Central

Veith, Paul D.; Butler, Catherine A.; Nor Muhammad, Nor A.; Chen, Yu-Yen; Slakeski, Nada; Peng, Benjamin; Zhang, Lianyi; Dashper, Stuart G.; Cross, Keith J.; Cleal, Steven M.; Moore, Caroline; Reynolds, Eric C.

2016-01-01

Porphyromonas gingivalis utilises the Bacteroidetes-specific type IX secretion system (T9SS) to export proteins across the outer membrane (OM), including virulence factors such as the gingipains. The secreted proteins have a conserved carboxy-terminal domain essential for type IX secretion that is cleaved upon export. In P. gingivalis the T9SS substrates undergo glycosylation with anionic lipopolysaccharide (A-LPS) and are attached to the OM. In this study, comparative analyses of 24 Bacteroidetes genomes identified ten putative novel components of the T9SS in P. gingivalis, one of which was PG1058. Computer modelling of the PG1058 structure predicted a novel N- to C-terminal architecture comprising a tetratricopeptide repeat (TPR) domain, a β-propeller domain, a carboxypeptidase regulatory domain-like fold (CRD) and an OmpA_C-like putative peptidoglycan binding domain. Inactivation of pg1058 in P. gingivalis resulted in loss of both colonial pigmentation and surface-associated proteolytic activity; a phenotype common to T9SS mutants. Immunoblot and LC-MS/MS analyses of subcellular fractions revealed T9SS substrates accumulated within the pg1058 mutant periplasm whilst whole-cell ELISA showed the Kgp gingipain was absent from the cell surface, confirming perturbed T9SS function. Immunoblot, TEM and whole-cell ELISA analyses indicated A-LPS was produced and present on the pg1058 mutant cell surface although it was not linked to T9SS substrate proteins. This indicated that PG1058 is crucial for export of T9SS substrates but not for the translocation of A-LPS. PG1058 is a predicted lipoprotein and was localised to the periplasmic side of the OM using whole-cell ELISA, immunoblot and LC-MS/MS analyses of subcellular fractions. The structural prediction and localisation of PG1058 suggests that it may have a role as an essential scaffold linking the periplasmic and OM components of the T9SS. PMID:27711252
Hemoglobin redux: combining neutron and X-ray diffraction with mass spectrometry to analyse the quaternary state of oxidized hemoglobins

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mueser, Timothy C., E-mail: timothy.mueser@utoledo.edu; Griffith, Wendell P.; Kovalevsky, Andrey Y.

2010-11-01

X-ray and neutron diffraction studies of cyanomethemoglobin are being used to evaluate the structural waters within the dimer–dimer interface involved in quaternary-state transitions. Improvements in neutron diffraction instrumentation are affording the opportunity to re-examine the structures of vertebrate hemoglobins and to interrogate proton and solvent position changes between the different quaternary states of the protein. For hemoglobins of unknown primary sequence, structural studies of cyanomethemoglobin (CNmetHb) are being used to help to resolve sequence ambiguity in the mass spectra. These studies have also provided additional structural evidence for the involvement of oxidized hemoglobin in the process of erythrocyte senescence. X-raymore » crystal studies of Tibetan snow leopard CNmetHb have shown that this protein crystallizes in the B state, a structure with a more open dyad, which possibly has relevance to RBC band 3 protein binding and erythrocyte senescence. R-state equine CNmetHb crystal studies elaborate the solvent differences in the switch and hinge region compared with a human deoxyhemoglobin T-state neutron structure. Lastly, comparison of histidine protonation between the T and R state should enumerate the Bohr-effect protons.« less
Chemical crosslinking and mass spectrometry to elucidate the topology of integral membrane proteins

PubMed Central

Debelyy, Mykhaylo O.; Waridel, Patrice; Quadroni, Manfredo; Conzelmann, Andreas

2017-01-01

Here we made an attempt to obtain partial structural information on the topology of multispan integral membrane proteins of yeast by isolating organellar membranes, removing peripheral membrane proteins at pH 11.5 and introducing chemical crosslinks between vicinal amino acids either using homo- or hetero-bifunctional crosslinkers. Proteins were digested with specific proteases and the products analysed by mass spectrometry. Dedicated software tools were used together with filtering steps optimized to remove false positive crosslinks. In proteins of known structure, crosslinks were found only between loops residing on the same side of the membrane. As may be expected, crosslinks were mainly found in very abundant proteins. Our approach seems to hold to promise to yield low resolution topological information for naturally very abundant or strongly overexpressed proteins with relatively little effort. Here, we report novel XL-MS-based topology data for 17 integral membrane proteins (Akr1p, Fks1p, Gas1p, Ggc1p, Gpt2p, Ifa38p, Ist2p, Lag1p, Pet9p, Pma1p, Por1p, Sct1p, Sec61p, Slc1p, Spf1p, Vph1p, Ybt1p). PMID:29073188
The costa of trichomonads: A complex macromolecular cytoskeleton structure made of uncommon proteins.

PubMed

de Andrade Rosa, Ivone; Caruso, Marjolly Brigido; de Oliveira Santos, Eidy; Gonzaga, Luiz; Zingali, Russolina Benedeta; de Vasconcelos, Ana Tereza R; de Souza, Wanderley; Benchimol, Marlene

2017-06-01

The costa is a prominent striated fibre that is found in protozoa of the Trichomonadidae family that present an undulating membrane. It is composed primarily of proteins that have not yet been explored. In this study, we used cell fractionation to obtain a highly enriched costa fraction whose structure and composition was further analysed by electron microscopy and mass spectrometry. Electron microscopy of negatively stained samples revealed that the costa, which is a periodic structure with alternating electron-dense and electron-lucent bands, displays three distinct regions, named the head, neck and body. Fourier transform analysis showed that the electron-lucent bands present sub-bands with a regular pattern. An analysis of the costa fraction via one- and two-dimensional electrophoresis and liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) allowed the identification of 54 hypothetical proteins. Fourteen of those proteins were considered to be major components of the fraction. The costa of T. foetus is a complex and organised cytoskeleton structure made of a large number of proteins which is assembled into filamentous structures. Some of these proteins exhibit uncharacterised domains and no function related according to gene ontology, suggesting that the costa structure may be formed by a new class of proteins that differ from those previously described in other organisms. Seven of these proteins contain prefoldin domains displaying coiled-coil regions. This propriety is shared with proteins of the striated fibres of other protozoan as well as in intermediate filaments. Our observations suggest the presence of a new class of the cytoskeleton filaments in T. foetus. We believe that our data could auxiliate in determining the specific locations of these proteins in the distinct regions that compose the costa, as well as to define the functional roles of each component. Therefore, our study will help in the better understanding of the organisation and function of this structure in unicellular organisms. © 2017 Société Française des Microscopies and Société de Biologie Cellulaire de France. Published by John Wiley & Sons Ltd.
In Silico Screening and Molecular Dynamics Simulation of Disease-Associated nsSNP in TYRP1 Gene and Its Structural Consequences in OCA3

PubMed Central

Kamaraj, Balu

2013-01-01

Oculocutaneous albinism type III (OCA3), caused by mutations of TYRP1 gene, is an autosomal recessive disorder characterized by reduced biosynthesis of melanin pigment in the hair, skin, and eyes. The TYRP1 gene encodes a protein called tyrosinase-related protein-1 (Tyrp1). Tyrp1 is involved in maintaining the stability of tyrosinase protein and modulating its catalytic activity in eumelanin synthesis. Tyrp1 is also involved in maintenance of melanosome structure and affects melanocyte proliferation and cell death. In this work we implemented computational analysis to filter the most probable mutation that might be associated with OCA3. We found R326H and R356Q as most deleterious and disease associated by using PolyPhen 2.0, SIFT, PANTHER, I-mutant 3.0, PhD-SNP, SNP&GO, Pmut, and Mutpred tools. To understand the atomic arrangement in 3D space, the native and mutant (R326H and R356Q) structures were modelled. Finally the structural analyses of native and mutant Tyrp1 proteins were investigated using molecular dynamics simulation (MDS) approach. MDS results showed more flexibility in native Tyrp1 structure. Due to mutation in Tyrp1 protein, it became more rigid and might disturb the structural conformation and catalytic function of the structure and might also play a significant role in inducing OCA3. The results obtained from this study would facilitate wet-lab researches to develop a potent drug therapies against OCA3. PMID:23862152
Amyloid fibers provide structural integrity to Bacillus subtilis biofilms.

PubMed

Romero, Diego; Aguilar, Claudio; Losick, Richard; Kolter, Roberto

2010-02-02

Bacillus subtilis forms biofilms whose constituent cells are held together by an extracellular matrix. Previous studies have shown that the protein TasA and an exopolysaccharide are the main components of the matrix. Given the importance of TasA in biofilm formation, we characterized the physicochemical properties of this protein. We report that purified TasA forms fibers of variable length and 10-15 nm in width. Biochemical analyses, in combination with the use of specific dyes and microscopic analyses, indicate that TasA forms amyloid fibers. Consistent with this hypothesis, TasA fibers required harsh treatments (e.g., formic acid) to be depolymerized. When added to a culture of a tasA mutant, purified TasA restored wild-type biofilm morphology, indicating that the purified protein retained biological activity. We propose that TasA forms amyloid fibers that bind cells together in the biofilm.
Memprot: a program to model the detergent corona around a membrane protein based on SEC–SAXS data

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pérez, Javier, E-mail: javier.perez@synchrotron-soleil.fr; Koutsioubas, Alexandros; Synchrotron SOLEIL, L’Orme des Merisiers, BP 48, Saint-Aubin, 91192 Gif-sur-Yvette

Systematic SAXS simulations have been analysed over a wide range of parameters in order to better understand the detergent corona around a membrane protein. The application of small-angle X-ray scattering (SAXS) to structural investigations of transmembrane proteins in detergent solution has been hampered by two main inherent hurdles. On the one hand, the formation of a detergent corona around the hydrophobic region of the protein strongly modifies the scattering curve of the protein. On the other hand, free micelles of detergent without a precisely known concentration coexist with the protein–detergent complex in solution, therefore adding an uncontrolled signal. To gainmore » robust structural information on such systems from SAXS data, in previous work, advantage was taken of the online combination of size-exclusion chromatography (SEC) and SAXS, and the detergent corona around aquaporin-0, a membrane protein of known structure, could be modelled. A precise geometrical model of the corona, shaped as an elliptical torus, was determined. Here, in order to better understand the correlations between the corona model parameters and to discuss the uniqueness of the model, this work was revisited by analyzing systematic SAXS simulations over a wide range of parameters of the torus.« less
NMSim web server: integrated approach for normal mode-based geometric simulations of biologically relevant conformational transitions in proteins.

PubMed

Krüger, Dennis M; Ahmed, Aqeel; Gohlke, Holger

2012-07-01

The NMSim web server implements a three-step approach for multiscale modeling of protein conformational changes. First, the protein structure is coarse-grained using the FIRST software. Second, a rigid cluster normal-mode analysis provides low-frequency normal modes. Third, these modes are used to extend the recently introduced idea of constrained geometric simulations by biasing backbone motions of the protein, whereas side chain motions are biased toward favorable rotamer states (NMSim). The generated structures are iteratively corrected regarding steric clashes and stereochemical constraint violations. The approach allows performing three simulation types: unbiased exploration of conformational space; pathway generation by a targeted simulation; and radius of gyration-guided simulation. On a data set of proteins with experimentally observed conformational changes, the NMSim approach has been shown to be a computationally efficient alternative to molecular dynamics simulations for conformational sampling of proteins. The generated conformations and pathways of conformational transitions can serve as input to docking approaches or more sophisticated sampling techniques. The web server output is a trajectory of generated conformations, Jmol representations of the coarse-graining and a subset of the trajectory and data plots of structural analyses. The NMSim webserver, accessible at http://www.nmsim.de, is free and open to all users with no login requirement.
MAPA distinguishes genotype-specific variability of highly similar regulatory protein isoforms in potato tuber.

PubMed

Hoehenwarter, Wolfgang; Larhlimi, Abdelhalim; Hummel, Jan; Egelhofer, Volker; Selbig, Joachim; van Dongen, Joost T; Wienkoop, Stefanie; Weckwerth, Wolfram

2011-07-01

Mass Accuracy Precursor Alignment is a fast and flexible method for comparative proteome analysis that allows the comparison of unprecedented numbers of shotgun proteomics analyses on a personal computer in a matter of hours. We compared 183 LC-MS analyses and more than 2 million MS/MS spectra and could define and separate the proteomic phenotypes of field grown tubers of 12 tetraploid cultivars of the crop plant Solanum tuberosum. Protein isoforms of patatin as well as other major gene families such as lipoxygenase and cysteine protease inhibitor that regulate tuber development were found to be the primary source of variability between the cultivars. This suggests that differentially expressed protein isoforms modulate genotype specific tuber development and the plant phenotype. We properly assigned the measured abundance of tryptic peptides to different protein isoforms that share extensive stretches of primary structure and thus inferred their abundance. Peptides unique to different protein isoforms were used to classify the remaining peptides assigned to the entire subset of isoforms based on a common abundance profile using multivariate statistical procedures. We identified nearly 4000 proteins which we used for quantitative functional annotation making this the most extensive study of the tuber proteome to date.
Structural Insights into the PorK and PorN Components of the Porphyromonas gingivalis Type IX Secretion System.

PubMed

Gorasia, Dhana G; Veith, Paul D; Hanssen, Eric G; Glew, Michelle D; Sato, Keiko; Yukitake, Hideharu; Nakayama, Koji; Reynolds, Eric C

2016-08-01

The type IX secretion system (T9SS) has been recently discovered and is specific to Bacteroidetes species. Porphyromonas gingivalis, a keystone pathogen for periodontitis, utilizes the T9SS to transport many proteins including the gingipain virulence factors across the outer membrane and attach them to the cell surface via a sortase-like mechanism. At least 11 proteins have been identified as components of the T9SS including PorK, PorL, PorM, PorN and PorP, however the precise roles of most of these proteins have not been elucidated and the structural organization of these components is unknown. In this study, we purified PorK and PorN complexes from P. gingivalis and using electron microscopy we have shown that PorN and the PorK lipoprotein interact to form a 50 nm diameter ring-shaped structure containing approximately 32-36 subunits of each protein. The formation of these rings was dependent on both PorK and PorN, but was independent of PorL, PorM and PorP. PorL and PorM were found to form a separate stable complex. PorK and PorN were protected from proteinase K cleavage when present in undisrupted cells, but were rapidly degraded when the cells were lysed, which together with bioinformatic analyses suggests that these proteins are exposed in the periplasm and anchored to the outer membrane via the PorK lipid. Chemical cross-linking and mass spectrometry analyses confirmed the interaction between PorK and PorN and further revealed that they interact with the PG0189 outer membrane protein. Furthermore, we established that PorN was required for the stable expression of PorK, PorL and PorM. Collectively, these results suggest that the ring-shaped PorK/N complex may form part of the secretion channel of the T9SS. This is the first report showing the structural organization of any T9SS component.
Structural Insights into the PorK and PorN Components of the Porphyromonas gingivalis Type IX Secretion System

PubMed Central

Gorasia, Dhana G.; Veith, Paul D.; Hanssen, Eric G.; Glew, Michelle D.; Sato, Keiko; Yukitake, Hideharu; Nakayama, Koji; Reynolds, Eric C.

2016-01-01

The type IX secretion system (T9SS) has been recently discovered and is specific to Bacteroidetes species. Porphyromonas gingivalis, a keystone pathogen for periodontitis, utilizes the T9SS to transport many proteins including the gingipain virulence factors across the outer membrane and attach them to the cell surface via a sortase-like mechanism. At least 11 proteins have been identified as components of the T9SS including PorK, PorL, PorM, PorN and PorP, however the precise roles of most of these proteins have not been elucidated and the structural organization of these components is unknown. In this study, we purified PorK and PorN complexes from P. gingivalis and using electron microscopy we have shown that PorN and the PorK lipoprotein interact to form a 50 nm diameter ring-shaped structure containing approximately 32–36 subunits of each protein. The formation of these rings was dependent on both PorK and PorN, but was independent of PorL, PorM and PorP. PorL and PorM were found to form a separate stable complex. PorK and PorN were protected from proteinase K cleavage when present in undisrupted cells, but were rapidly degraded when the cells were lysed, which together with bioinformatic analyses suggests that these proteins are exposed in the periplasm and anchored to the outer membrane via the PorK lipid. Chemical cross-linking and mass spectrometry analyses confirmed the interaction between PorK and PorN and further revealed that they interact with the PG0189 outer membrane protein. Furthermore, we established that PorN was required for the stable expression of PorK, PorL and PorM. Collectively, these results suggest that the ring-shaped PorK/N complex may form part of the secretion channel of the T9SS. This is the first report showing the structural organization of any T9SS component. PMID:27509186
Data set for the proteomic inventory and quantitative analysis of chicken eggshell matrix proteins during the primary events of eggshell mineralization and the active growth phase of calcification.

PubMed

Marie, Pauline; Labas, Valérie; Brionne, Aurélien; Harichaux, Grégoire; Hennequet-Antier, Christelle; Rodriguez-Navarro, Alejandro B; Nys, Yves; Gautron, Joël

2015-09-01

Chicken eggshell is a biomineral composed of 95% calcite calcium carbonate mineral and of 3.5% organic matrix proteins. The assembly of mineral and its structural organization is controlled by its organic matrix. In a recent study [1], we have used quantitative proteomic, bioinformatic and functional analyses to explore the distribution of 216 eggshell matrix proteins at four key stages of shell mineralization defined as: (1) widespread deposition of amorphous calcium carbonate (ACC), (2) ACC transformation into crystalline calcite aggregates, (3) formation of larger calcite crystal units and (4) rapid growth of calcite as columnar structure with preferential crystal orientation. The current article detailed the quantitative analysis performed at the four stages of shell mineralization to determine the proteins which are the most abundant. Additionally, we reported the enriched GO terms and described the presence of 35 antimicrobial proteins equally distributed at all stages to keep the egg free of bacteria and of 81 proteins, the function of which could not be ascribed.

Data set for the proteomic inventory and quantitative analysis of chicken eggshell matrix proteins during the primary events of eggshell mineralization and the active growth phase of calcification

PubMed Central

Marie, Pauline; Labas, Valérie; Brionne, Aurélien; Harichaux, Grégoire; Hennequet-Antier, Christelle; Rodriguez-Navarro, Alejandro B.; Nys, Yves; Gautron, Joël

2015-01-01

Chicken eggshell is a biomineral composed of 95% calcite calcium carbonate mineral and of 3.5% organic matrix proteins. The assembly of mineral and its structural organization is controlled by its organic matrix. In a recent study [1], we have used quantitative proteomic, bioinformatic and functional analyses to explore the distribution of 216 eggshell matrix proteins at four key stages of shell mineralization defined as: (1) widespread deposition of amorphous calcium carbonate (ACC), (2) ACC transformation into crystalline calcite aggregates, (3) formation of larger calcite crystal units and (4) rapid growth of calcite as columnar structure with preferential crystal orientation. The current article detailed the quantitative analysis performed at the four stages of shell mineralization to determine the proteins which are the most abundant. Additionally, we reported the enriched GO terms and described the presence of 35 antimicrobial proteins equally distributed at all stages to keep the egg free of bacteria and of 81 proteins, the function of which could not be ascribed. PMID:26306314
A quasi-atomic model of human adenovirus type 5 capsid

PubMed Central

Fabry, Céline M S; Rosa-Calatrava, Manuel; Conway, James F; Zubieta, Chloé; Cusack, Stephen; Ruigrok, Rob W H; Schoehn, Guy

2005-01-01

Adenoviruses infect a wide range of vertebrates including humans. Their icosahedral capsids are composed of three major proteins: the trimeric hexon forms the facets and the penton, a noncovalent complex of the pentameric penton base and trimeric fibre proteins, is located at the 12 capsid vertices. Several proteins (IIIa, VI, VIII and IX) stabilise the capsid. We have obtained a 10 Å resolution map of the human adenovirus 5 by image analysis from cryo-electron micrographs (cryoEMs). This map, in combination with the X-ray structures of the penton base and hexon, was used to build a quasi-atomic model of the arrangement of the two major capsid components and to analyse the hexon–hexon and hexon–penton interactions. The secondary proteins, notably VIII, were located by comparing cryoEM maps of native and pIX deletion mutant virions. Minor proteins IX and IIIa are located on the outside of the capsid, whereas protein VIII is organised with a T=2 lattice on the inner face of the capsid. The capsid organisation is compared with the known X-ray structure of bacteriophage PRD1. PMID:15861131
D19S Mutation of the Cationic, Cysteine-Rich Protein PAF: Novel Insights into Its Structural Dynamics, Thermal Unfolding and Antifungal Function

PubMed Central

Burtscher, Laura; Hajdu, Dorottya; Muñoz, Alberto; Gáspári, Zoltán; Read, Nick D.; Batta, Gyula; Marx, Florentine

2017-01-01

The cysteine-rich, cationic, antifungal protein PAF is abundantly secreted into the culture supernatant of the filamentous Ascomycete Penicillium chrysogenum. The five β-strands of PAF form a compact β-barrel that is stabilized by three disulphide bonds. The folding of PAF allows the formation of four surface-exposed loops and distinct charged motifs on the protein surface that might regulate the interaction of PAF with the sensitive target fungus. The growth inhibitory activity of this highly stable protein against opportunistic fungal pathogens provides great potential in antifungal drug research. To understand its mode of action, we started to investigate the surface-exposed loops of PAF and replaced one aspartic acid at position 19 in loop 2 that is potentially involved in PAF active or binding site, with a serine (Asp19 to Ser19). We analysed the overall effects, such as unfolding, electrostatic changes, sporadic conformers and antifungal activity when substituting this specific amino acid to the fairly indifferent amino acid serine. Structural analyses revealed that the overall 3D solution structure is virtually identical with that of PAF. However, PAFD19S showed slightly increased dynamics and significant differences in the surface charge distribution. Thermal unfolding identified PAFD19S to be rather a two-state folder in contrast to the three-state folder PAF. Functional comparison of PAFD19S and PAF revealed that the exchange at residue 19 caused a dramatic loss of antifungal activity: the binding and internalization of PAFD19S by target cells was reduced and the protein failed to trigger an intracellular Ca2+ response, all of which are closely linked to the antifungal toxicity of PAF. We conclude that the negatively charged residue Asp19 in loop 2 is essential for full function of the cationic protein PAF. PMID:28072824
Structural characterization and evaluation of the antioxidant activities of polysaccharides extracted from Qingzhuan brick tea.

PubMed

Yang, Xinhe; Huang, Mingjun; Qin, Caiqin; Lv, Bangyu; Mao, Qingli; Liu, Zhonghua

2017-08-01

The crude tea polysaccharides (CTPS) from Qingzhuan brick tea(QZBT) were extracted and fractionated to afford two fractions, namely TPS-1 and TPS-2. Analyses were conducted concerning the structural characterization and antioxidant activities of these samples. Component analysis revealed that the carbohydrate, uronic acid, protein and polyphenol contents of these samples differed significantly. Fourier transform infrared analysis showed that these samples showed similar characteristic absorption peaks for polysaccharides. Ultraviolet-visible spectroscopy, circular dichroism, scanning electron microscopy and thermogravimetric analyses indicated that there were considerable differences in the presence of protein, surface features, conformational characteristics and thermodynamic behaviors. For antioxidant activities in vitro, CTPS, TPS-1 and TPS-2 exhibited concentration-dependent antioxidant activities, with TPS-2 showing significantly higher antioxidant activity than CTPS and TPS-1. These results provide a scientific and strong foundation for the use of tea polysaccharides(TPS) from QZBT and further research towards the relationships between the characteristics and antioxidant activities of TPS. Copyright © 2017 Elsevier B.V. All rights reserved.
Computational analysis of histidine mutations on the structural stability of human tyrosinases leading to albinism insurgence.

PubMed

Hassan, Mubashir; Abbas, Qamar; Raza, Hussain; Moustafa, Ahmed A; Seo, Sung-Yum

2017-07-25

Misfolding and structural alteration in proteins lead to serious malfunctions and cause various diseases in humans. Mutations at the active binding site in tyrosinase impair structural stability and cause lethal albinism by abolishing copper binding. To evaluate the histidine mutational effect, all mutated structures were built using homology modelling. The protein sequence was retrieved from the UniProt database, and 3D models of original and mutated human tyrosinase sequences were predicted by changing the residual positions within the target sequence separately. Structural and mutational analyses were performed to interpret the significance of mutated residues (N 180 , R 202 , Q 202 , R 211 , Y 363 , R 367 , Y 367 and D 390 ) at the active binding site of tyrosinases. CSpritz analysis depicted that 23.25% residues actively participate in the instability of tyrosinase. The accuracy of predicted models was confirmed through online servers ProSA-web, ERRAT score and VERIFY 3D values. The theoretical pI and GRAVY generated results also showed the accuracy of the predicted models. The CCA negative correlation results depicted that the replacement of mutated residues at His within the active binding site disturbs the structural stability of tyrosinases. The predicted CCA scores of Tyr 367 (-0.079) and Q/R 202 (0.032) revealed that both mutations have more potential to disturb the structural stability. MD simulation analyses of all predicted models justified that Gln 202 , Arg 202 , Tyr 367 and D 390 replacement made the protein structures more susceptible to destabilization. Mutational results showed that the replacement of His with Q/R 202 and Y/R 363 has a lethal effect and may cause melanin associated diseases such as OCA1. Taken together, our computational analysis depicts that the mutated residues such as Q/R 202 and Y/R 363 actively participate in instability and misfolding of tyrosinases, which may govern OCA1 through disturbing the melanin biosynthetic pathway.
Structural conversion of the transformer protein RfaH: new insights derived from protein structure prediction and molecular dynamics simulations.

PubMed

Balasco, Nicole; Barone, Daniela; Vitagliano, Luigi

2015-01-01

Recent structural investigations have shown that the C-terminal domain (CTD) of the transcription factor RfaH undergoes unique structural modifications that have a profound impact into its functional properties. These modifications cause a complete change in RfaH(CTD) topology that converts from an α-hairpin to a β-barrel fold. To gain insights into the determinants of this major structural conversion, we here performed computational studies (protein structure prediction and molecular dynamics simulations) on RfaH(CTD). Although these analyses, in line with literature data, suggest that the isolated RfaH(CTD) has a strong preference for the β-barrel fold, they also highlight that a specific region of the protein is endowed with a chameleon conformational behavior. In particular, the Leu-rich region (residues 141-145) has a good propensity to adopt both α-helical and β-structured states. Intriguingly, in the RfaH homolog NusG, whose CTD uniquely adopts the β-barrel fold, the corresponding region is rich in residues as Val or Ile that present a strong preference for the β-structure. On this basis, we suggest that the presence of this Leu-rich element in RfaH(CTD) may be responsible for the peculiar structural behavior of the domain. The analysis of the sequences of RfaH family (PfamA code PF02357) unraveled that other members potentially share the structural properties of RfaH(CTD). These observations suggest that the unusual conformational behavior of RfaH(CTD) may be rare but not unique.
Efflux proteins at the blood-brain barrier: review and bioinformatics analysis.

PubMed

Saidijam, Massoud; Karimi Dermani, Fatemeh; Sohrabi, Sareh; Patching, Simon G

2018-05-01

1. Efflux proteins at the blood-brain barrier provide a mechanism for export of waste products of normal metabolism from the brain and help to maintain brain homeostasis. They also prevent entry into the brain of a wide range of potentially harmful compounds such as drugs and xenobiotics. 2. Conversely, efflux proteins also hinder delivery of therapeutic drugs to the brain and central nervous system used to treat brain tumours and neurological disorders. For bypassing efflux proteins, a comprehensive understanding of their structures, functions and molecular mechanisms is necessary, along with new strategies and technologies for delivery of drugs across the blood-brain barrier. 3. We review efflux proteins at the blood-brain barrier, classified as either ATP-binding cassette (ABC) transporters (P-gp, BCRP, MRPs) or solute carrier (SLC) transporters (OATP1A2, OATP1A4, OATP1C1, OATP2B1, OAT3, EAATs, PMAT/hENT4 and MATE1). 4. This includes information about substrate and inhibitor specificity, structural organisation and mechanism, membrane localisation, regulation of expression and activity, effects of diseases and conditions and the principal technique used for in vivo analysis of efflux protein activity: positron emission tomography (PET). 5. We also performed analyses of evolutionary relationships, membrane topologies and amino acid compositions of the proteins, and linked these to structure and function.
Discovery of a novel protein modification: alpha-glycerophosphate is a substituent of meningococcal pilin.

PubMed Central

Stimson, E; Virji, M; Barker, S; Panico, M; Blench, I; Saunders, J; Payne, G; Moxon, E R; Dell, A; Morris, H R

1996-01-01

Pili, which are filamentous protein structures on the surface of the meningitis-causing organism Neisseria meningitidis, are known to be post-translationally modified with substituents that affect their mobility in SDS/PAGE and which might play a crucial role in adherence and bloodstream invasion. Tryptic digests of pili were analysed by fast atom bombardment and electrospray MS to identify putative modifications. Serine-93 was found to carry a novel modification of alpha-glycerophosphate. This is the first time that alpha-glycerophosphate has been observed as a substituent of a prokaryotic or eukaryotic protein. PMID:8645220
Structural and biochemical analyses of a Clostridium perfringens sortase D transpeptidase

DOE Office of Scientific and Technical Information (OSTI.GOV)

Suryadinata, Randy, E-mail: randy.suryadinata@csiro.au; Seabrook, Shane A.; Adams, Timothy E.

The structure of C. perfringens sortase D was determined at 1.99 Å resolution. Comparative biochemical and structural analyses revealed that this transpeptidase may represent a new subclass of the sortase D family. The assembly and anchorage of various pathogenic proteins on the surface of Gram-positive bacteria is mediated by the sortase family of enzymes. These cysteine transpeptidases catalyze a unique sorting signal motif located at the C-terminus of their target substrate and promote the covalent attachment of these proteins onto an amino nucleophile located on another protein or on the bacterial cell wall. Each of the six distinct classes ofmore » sortases displays a unique biological role, with sequential activation of multiple sortases often observed in many Gram-positive bacteria to decorate their peptidoglycans. Less is known about the members of the class D family of sortases (SrtD), but they have a suggested role in spore formation in an oxygen-limiting environment. Here, the crystal structure of the SrtD enzyme from Clostridium perfringens was determined at 1.99 Å resolution. Comparative analysis of the C. perfringens SrtD structure reveals the typical eight-stranded β-barrel fold observed in all other known sortases, along with the conserved catalytic triad consisting of cysteine, histidine and arginine residues. Biochemical approaches further reveal the specifics of the SrtD catalytic activity in vitro, with a significant preference for the LPQTGS sorting motif. Additionally, the catalytic activity of SrtD is most efficient at 316 K and can be further improved in the presence of magnesium cations. Since C. perfringens spores are heat-resistant and lead to foodborne illnesses, characterization of the spore-promoting sortase SrtD may lead to the development of new antimicrobial agents.« less
Structure and sequence analyses of Bacteroides proteins BVU_4064 and BF1687 reveal presence of two novel predominantly-beta domains, predicted to be involved in lipid and cell surface interactions

DOE PAGES

Natarajan, Padmaja; Punta, Marco; Kumar, Abhinav; ...

2015-01-16

N-terminal domains of BVU_4064 and BF1687 proteins from Bacteroides vulgatus and Bacteroides fragilis respectively are members of the Pfam family PF12985 (DUF3869). Proteins containing a domain from this family can be found in most Bacteroides species and, in large numbers, in all human gut microbiome samples. Both BVU_4064 and BF1687 proteins have a consensus lipobox motif implying they are anchored to the membrane, but their functions are otherwise unknown. The C-terminal half of BVU_4064 is assigned to protein family PF12986 (DUF3870); the equivalent part of BF1687 was unclassified.
Protein Data Bank depositions from synchrotron sources.

PubMed

Jiang, Jiansheng; Sweet, Robert M

2004-07-01

A survey and analysis of Protein Data Bank (PDB) depositions from international synchrotron radiation facilities, based on the latest released PDB entries, are reported. The results (http://asdp.bnl.gov/asda/Libraries/) show that worldwide, every year since 1999, more than 50% of the deposited X-ray structures have used synchrotron facilities, reaching 75% by 2003. In this web-based database, all PDB entries among individual synchrotron beamlines are archived, synchronized with the weekly PDB release. Statistics regarding the quality of experimental data and the refined model for all structures are presented, and these are analysed to reflect the impact of synchrotron sources. The results confirm the common impression that synchrotron sources extend the size of structures that can be solved with equivalent or better quality than home sources.
Conservation of dark recovery kinetic parameters and structural features in the pseudomonadaceae "short" light, oxygen, voltage (LOV) protein family: implications for the design of LOV-based optogenetic tools.

PubMed

Rani, Raj; Jentzsch, Katrin; Lecher, Justin; Hartmann, Rudolf; Willbold, Dieter; Jaeger, Karl-Erich; Krauss, Ulrich

2013-07-02

In bacteria and fungi, various light, oxygen, voltage (LOV) sensory systems that lack a fused effector domain but instead contain only short N- and C-terminal extensions flanking the LOV core exist. In the prokaryotic kingdom, this so-called "short" LOV protein family represents the third largest LOV photoreceptor family. This observation prompted us to study their distribution and phylogeny as well as their photochemical and structural properties in more detail. We recently described the slow and fast reverting "short" LOV proteins PpSB1-LOV and PpSB2-LOV from Pseudomonas putida KT2440 whose adduct state lifetimes varied by 3 orders of magnitude [Jentzsch, K., Wirtz, A., Circolone, F., Drepper, T., Losi, A., Gärtner, W., Jaeger, K. E., and Krauss, U. (2009) Biochemistry 48, 10321-10333]. We now present evidence of the conservation of similar fast and slow-reverting "short" LOV proteins in different Pseudomonas species. Truncation studies conducted with PpSB1-LOV and PpSB2-LOV suggested that the short N- and C-terminal extensions outside of the LOV core domain are essential for the structural integrity and folding of the two proteins. While circular dichroism and solution nuclear magnetic resonance experiments verify that the two short C-terminal extensions of PpSB1-LOV and PpSB2-LOV form independently folding helical structures in solution, bioinformatic analyses imply the formation of coiled coils of the respective structural elements in the context of the dimeric full-length proteins. Given their prototypic architecture, conserved in most more complex LOV photoreceptor systems, "short" LOV proteins could represent ideally suited building blocks for the design of genetically encoded photoswitches (i.e., LOV-based optogenetic tools).
Structural mapping of the coiled-coil domain of a bacterial condensin and comparative analyses across all domains of life suggest conserved features of SMC proteins.

PubMed

Waldman, Vincent M; Stanage, Tyler H; Mims, Alexandra; Norden, Ian S; Oakley, Martha G

2015-06-01

The structural maintenance of chromosomes (SMC) proteins form the cores of multisubunit complexes that are required for the segregation and global organization of chromosomes in all domains of life. These proteins share a common domain structure in which N- and C- terminal regions pack against one another to form a globular ATPase domain. This "head" domain is connected to a central, globular, "hinge" or dimerization domain by a long, antiparallel coiled coil. To date, most efforts for structural characterization of SMC proteins have focused on the globular domains. Recently, however, we developed a method to map interstrand interactions in the 50-nm coiled-coil domain of MukB, the divergent SMC protein found in γ-proteobacteria. Here, we apply that technique to map the structure of the Bacillus subtilis SMC (BsSMC) coiled-coil domain. We find that, in contrast to the relatively complicated coiled-coil domain of MukB, the BsSMC domain is nearly continuous, with only two detectable coiled-coil interruptions. Near the middle of the domain is a break in coiled-coil structure in which there are three more residues on the C-terminal strand than on the N-terminal strand. Close to the head domain, there is a second break with a significantly longer insertion on the same strand. These results provide an experience base that allows an informed interpretation of the output of coiled-coil prediction algorithms for this family of proteins. A comparison of such predictions suggests that these coiled-coil deviations are highly conserved across SMC types in a wide variety of organisms, including humans. © 2015 Wiley Periodicals, Inc.
Supra-domains: evolutionary units larger than single protein domains.

PubMed

Vogel, Christine; Berzuini, Carlo; Bashton, Matthew; Gough, Julian; Teichmann, Sarah A

2004-02-20

Domains are the evolutionary units that comprise proteins, and most proteins are built from more than one domain. Domains can be shuffled by recombination to create proteins with new arrangements of domains. Using structural domain assignments, we examined the combinations of domains in the proteins of 131 completely sequenced organisms. We found two-domain and three-domain combinations that recur in different protein contexts with different partner domains. The domains within these combinations have a particular functional and spatial relationship. These units are larger than individual domains and we term them "supra-domains". Amongst the supra-domains, we identified some 1400 (1203 two-domain and 166 three-domain) combinations that are statistically significantly over-represented relative to the occurrence and versatility of the individual component domains. Over one-third of all structurally assigned multi-domain proteins contain these over-represented supra-domains. This means that investigation of the structural and functional relationships of the domains forming these popular combinations would be particularly useful for an understanding of multi-domain protein function and evolution as well as for genome annotation. These and other supra-domains were analysed for their versatility, duplication, their distribution across the three kingdoms of life and their functional classes. By examining the three-dimensional structures of several examples of supra-domains in different biological processes, we identify two basic types of spatial relationships between the component domains: the combined function of the two domains is such that either the geometry of the two domains is crucial and there is a tight constraint on the interface, or the precise orientation of the domains is less important and they are spatially separate. Frequently, the role of the supra-domain becomes clear only once the three-dimensional structure is known. Since this is the case for only a quarter of the supra-domains, we provide a list of the most important unknown supra-domains as potential targets for structural genomics projects.
The visualCMAT: A web-server to select and interpret correlated mutations/co-evolving residues in protein families.

PubMed

Suplatov, Dmitry; Sharapova, Yana; Timonina, Daria; Kopylov, Kirill; Švedas, Vytas

2018-04-01

The visualCMAT web-server was designed to assist experimental research in the fields of protein/enzyme biochemistry, protein engineering, and drug discovery by providing an intuitive and easy-to-use interface to the analysis of correlated mutations/co-evolving residues. Sequence and structural information describing homologous proteins are used to predict correlated substitutions by the Mutual information-based CMAT approach, classify them into spatially close co-evolving pairs, which either form a direct physical contact or interact with the same ligand (e.g. a substrate or a crystallographic water molecule), and long-range correlations, annotate and rank binding sites on the protein surface by the presence of statistically significant co-evolving positions. The results of the visualCMAT are organized for a convenient visual analysis and can be downloaded to a local computer as a content-rich all-in-one PyMol session file with multiple layers of annotation corresponding to bioinformatic, statistical and structural analyses of the predicted co-evolution, or further studied online using the built-in interactive analysis tools. The online interactivity is implemented in HTML5 and therefore neither plugins nor Java are required. The visualCMAT web-server is integrated with the Mustguseal web-server capable of constructing large structure-guided sequence alignments of protein families and superfamilies using all available information about their structures and sequences in public databases. The visualCMAT web-server can be used to understand the relationship between structure and function in proteins, implemented at selecting hotspots and compensatory mutations for rational design and directed evolution experiments to produce novel enzymes with improved properties, and employed at studying the mechanism of selective ligand's binding and allosteric communication between topologically independent sites in protein structures. The web-server is freely available at https://biokinet.belozersky.msu.ru/visualcmat and there are no login requirements.
Group II chaperonins: new TRiC(k)s and turns of a protein folding machine.

PubMed

Gutsche, I; Essen, L O; Baumeister, W

1999-10-22

In the past decade, the eubacterial group I chaperonin GroEL became the paradigm of a protein folding machine. More recently, electron microscopy and X-ray crystallography offered insights into the structure of the thermosome, the archetype of the group II chaperonins which also comprise the chaperonin from the eukaryotic cytosol TRiC. Some structural differences from GroEL were revealed, namely the existence of a built-in lid provided by the helical protrusions of the apical domains instead of a GroES-like co-chaperonin. These structural studies provide a framework for understanding the differences in the mode of action between the group II and the group I chaperonins. In vitro analyses of the folding of non-native substrates coupled to ATP binding and hydrolysis are progressing towards establishing a functional cycle for group II chaperonins. A protein complex called GimC/prefoldin has recently been found to cooperate with TRiC in vivo, and its characterization is under way. Copyright 1999 Academic Press.
[Three-dimensional genome organization: a lesson from the Polycomb-Group proteins].

PubMed

Bantignies, Frédéric

2013-01-01

As more and more genomes are being explored and annotated, important features of three-dimensional (3D) genome organization are just being uncovered. In the light of what we know about Polycomb group (PcG) proteins, we will present the latest findings on this topic. The PcG proteins are well-conserved chromatin factors that repress transcription of numerous target genes. They bind the genome at specific sites, forming chromatin domains of associated histone modifications as well as higher-order chromatin structures. These 3D chromatin structures involve the interactions between PcG-bound regulatory regions at short- and long-range distances, and may significantly contribute to PcG function. Recent high throughput "Chromosome Conformation Capture" (3C) analyses have revealed many other higher order structures along the chromatin fiber, partitioning the genomes into well demarcated topological domains. This revealed an unprecedented link between linear epigenetic domains and chromosome architecture, which might be intimately connected to genome function. © Société de Biologie, 2013.
A novel carbohydrate-binding surface layer protein from the hyperthermophilic archaeon Pyrococcus horikoshii.

PubMed

Goda, Shuichiro; Koga, Tomoyuki; Yamashita, Kenichiro; Kuriura, Ryo; Ueda, Toshifumi

2018-04-08

In Archaea and Bacteria, surface layer (S-layer) proteins form the cell envelope and are involved in cell protection. In the present study, a putative S-layer protein was purified from the crude extract of Pyrococcus horikoshii using affinity chromatography. The S-layer gene was cloned and expressed in Escherichia coli. Isothermal titration calorimetry analyses showed that the S-layer protein bound N-acetylglucosamine and induced agglutination of the gram-positive bacterium Micrococcus lysodeikticus. The protein comprised a 21-mer structure, with a molecular mass of 1,340 kDa, as determined using small-angle X-ray scattering. This protein showed high thermal stability, with a midpoint of thermal denaturation of 79 °C in dynamic light scattering experiments. This is the first description of the carbohydrate-binding archaeal S-layer protein and its characteristics.
An Investigation into the Protein Composition of the Teneral Glossina morsitans morsitans Peritrophic Matrix

PubMed Central

Rose, Clair; Belmonte, Rodrigo; Armstrong, Stuart D.; Molyneux, Gemma; Haines, Lee R.; Lehane, Michael J.; Wastling, Jonathan; Acosta-Serrano, Alvaro

2014-01-01

Background Tsetse flies serve as biological vectors for several species of African trypanosomes. In order to survive, proliferate and establish a midgut infection, trypanosomes must cross the tsetse fly peritrophic matrix (PM), which is an acellular gut lining surrounding the blood meal. Crossing of this multi-layered structure occurs at least twice during parasite migration and development, but the mechanism of how trypanosomes do so is not understood. In order to better comprehend the molecular events surrounding trypanosome penetration of the tsetse PM, a mass spectrometry-based approach was applied to investigate the PM protein composition using Glossina morsitans morsitans as a model organism. Methods PMs from male teneral (young, unfed) flies were dissected, solubilised in urea/SDS buffer and the proteins precipitated with cold acetone/TCA. The PM proteins were either subjected to an in-solution tryptic digestion or fractionated on 1D SDS-PAGE, and the resulting bands digested using trypsin. The tryptic fragments from both preparations were purified and analysed by LC-MS/MS. Results Overall, nearly 300 proteins were identified from both analyses, several of those containing signature Chitin Binding Domains (CBD), including novel peritrophins and peritrophin-like glycoproteins, which are essential in maintaining PM architecture and may act as trypanosome adhesins. Furthermore, 27 proteins from the tsetse secondary endosymbiont, Sodalis glossinidius, were also identified, suggesting this bacterium is probably in close association with the tsetse PM. Conclusion To our knowledge this is the first report on the protein composition of teneral G. m. morsitans, an important vector of African trypanosomes. Further functional analyses of these proteins will lead to a better understanding of the tsetse physiology and may help identify potential molecular targets to block trypanosome development within the tsetse. PMID:24763256
The Use of Gene Modification and Advanced Molecular Structure Analyses towards Improving Alfalfa Forage.

PubMed

Lei, Yaogeng; Hannoufa, Abdelali; Yu, Peiqiang

2017-01-29

Alfalfa is one of the most important legume forage crops in the world. In spite of its agronomic and nutritive advantages, alfalfa has some limitations in the usage of pasture forage and hay supplement. High rapid degradation of protein in alfalfa poses a risk of rumen bloat to ruminants which could cause huge economic losses for farmers. Coupled with the relatively high lignin content, which impedes the degradation of carbohydrate in rumen, alfalfa has unbalanced and asynchronous degradation ratio of nitrogen to carbohydrate (N/CHO) in rumen. Genetic engineering approaches have been used to manipulate the expression of genes involved in important metabolic pathways for the purpose of improving the nutritive value, forage yield, and the ability to resist abiotic stress. Such gene modification could bring molecular structural changes in alfalfa that are detectable by advanced structural analytical techniques. These structural analyses have been employed in assessing alfalfa forage characteristics, allowing for rapid, convenient and cost-effective analysis of alfalfa forage quality. In this article, we review two major obstacles facing alfalfa utilization, namely poor protein utilization and relatively high lignin content, and highlight genetic studies that were performed to overcome these drawbacks, as well as to introduce other improvements to alfalfa quality. We also review the use of advanced molecular structural analysis in the assessment of alfalfa forage for its potential usage in quality selection in alfalfa breeding.

The Use of Gene Modification and Advanced Molecular Structure Analyses towards Improving Alfalfa Forage

PubMed Central

Lei, Yaogeng; Hannoufa, Abdelali; Yu, Peiqiang

2017-01-01

Alfalfa is one of the most important legume forage crops in the world. In spite of its agronomic and nutritive advantages, alfalfa has some limitations in the usage of pasture forage and hay supplement. High rapid degradation of protein in alfalfa poses a risk of rumen bloat to ruminants which could cause huge economic losses for farmers. Coupled with the relatively high lignin content, which impedes the degradation of carbohydrate in rumen, alfalfa has unbalanced and asynchronous degradation ratio of nitrogen to carbohydrate (N/CHO) in rumen. Genetic engineering approaches have been used to manipulate the expression of genes involved in important metabolic pathways for the purpose of improving the nutritive value, forage yield, and the ability to resist abiotic stress. Such gene modification could bring molecular structural changes in alfalfa that are detectable by advanced structural analytical techniques. These structural analyses have been employed in assessing alfalfa forage characteristics, allowing for rapid, convenient and cost-effective analysis of alfalfa forage quality. In this article, we review two major obstacles facing alfalfa utilization, namely poor protein utilization and relatively high lignin content, and highlight genetic studies that were performed to overcome these drawbacks, as well as to introduce other improvements to alfalfa quality. We also review the use of advanced molecular structural analysis in the assessment of alfalfa forage for its potential usage in quality selection in alfalfa breeding. PMID:28146083
MetalPDB in 2018: a database of metal sites in biological macromolecular structures.

PubMed

Putignano, Valeria; Rosato, Antonio; Banci, Lucia; Andreini, Claudia

2018-01-04

MetalPDB (http://metalweb.cerm.unifi.it/) is a database providing information on metal-binding sites detected in the three-dimensional (3D) structures of biological macromolecules. MetalPDB represents such sites as 3D templates, called Minimal Functional Sites (MFSs), which describe the local environment around the metal(s) independently of the larger context of the macromolecular structure. The 2018 update of MetalPDB includes new contents and tools. A major extension is the inclusion of proteins whose structures do not contain metal ions although their sequences potentially contain a known MFS. In addition, MetalPDB now provides extensive statistical analyses addressing several aspects of general metal usage within the PDB, across protein families and in catalysis. Users can also query MetalPDB to extract statistical information on structural aspects associated with individual metals, such as preferred coordination geometries or aminoacidic environment. A further major improvement is the functional annotation of MFSs; the annotation is manually performed via a password-protected annotator interface. At present, ∼50% of all MFSs have such a functional annotation. Other noteworthy improvements are bulk query functionality, through the upload of a list of PDB identifiers, and ftp access to MetalPDB contents, allowing users to carry out in-depth analyses on their own computational infrastructure. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Expression and in vitro functional analyses of recombinant Gam1 protein

PubMed Central

Avila, Gustavo A.; Ramirez, Daniel H.; Hildenbrand, Zacariah L.; Jacquez, Pedro; Chiocca, Susanna; Sun, Jianjun; Rosas-Acosta, German; Xiao, Chuan

2014-01-01

Gam1, an early gene product of an avian adenovirus, is essential for viral replication. Gam1 is the first viral protein found to globally inhibit cellular SUMOylation, a critical posttranslational modification that alters the function and cellular localization of proteins. The interaction details at the interface between Gam1 and its cellular targets remain unclear due to the lack of structural information. Although Gam1 has been previously characterized, the purity of the protein was not suitable for structural investigations. In the present study, the gene of Gam1 was cloned and expressed in various bacterial expression systems to obtain pure and soluble recombinant Gam1 protein for in vitro functional and structural studies. While Gam1 was insoluble in most expression systems tested, it became soluble when it was expressed as a fusion protein with trigger factor (TF), a ribosome associated bacterial chaperone, under the control of a cold shock promoter. Careful optimization indicates that both low temperature induction and the chaperone function of TF play critical roles in increasing Gam1 solubility. Soluble Gam1 was purified to homogeneity through sequential chromatography techniques. Monomeric Gam1 was obtained via size exclusion chromatography and analyzed by dynamic light scattering. The SUMOylation inhibitory function of the purified Gam1 was confirmed in an in vitro assay. These results have built the foundation for further structural investigations that will broaden our understanding of Gam1’s roles in viral replication. PMID:25450237
Expression and in vitro functional analyses of recombinant Gam1 protein.

PubMed

Avila, Gustavo A; Ramirez, Daniel H; Hildenbrand, Zacariah L; Jacquez, Pedro; Chiocca, Susanna; Sun, Jianjun; Rosas-Acosta, German; Xiao, Chuan

2015-01-01

Gam1, an early gene product of an avian adenovirus, is essential for viral replication. Gam1 is the first viral protein found to globally inhibit cellular SUMOylation, a critical posttranslational modification that alters the function and cellular localization of proteins. The interaction details at the interface between Gam1 and its cellular targets remain unclear due to the lack of structural information. Although Gam1 has been previously characterized, the purity of the protein was not suitable for structural investigations. In the present study, the gene of Gam1 was cloned and expressed in various bacterial expression systems to obtain pure and soluble recombinant Gam1 protein for in vitro functional and structural studies. While Gam1 was insoluble in most expression systems tested, it became soluble when it was expressed as a fusion protein with trigger factor (TF), a ribosome associated bacterial chaperone, under the control of a cold shock promoter. Careful optimization indicates that both low temperature induction and the chaperone function of TF play critical roles in increasing Gam1 solubility. Soluble Gam1 was purified to homogeneity through sequential chromatography techniques. Monomeric Gam1 was obtained via size exclusion chromatography and analyzed by dynamic light scattering. The SUMOylation inhibitory function of the purified Gam1 was confirmed in an in vitro assay. These results have built the foundation for further structural investigations that will broaden our understanding of Gam1's roles in viral replication. Copyright © 2014 Elsevier Inc. All rights reserved.
Analysis of the interface variability in NMR structure ensembles of protein-protein complexes.

PubMed

Calvanese, Luisa; D'Auria, Gabriella; Vangone, Anna; Falcigno, Lucia; Oliva, Romina

2016-06-01

NMR structures consist in ensembles of conformers, all satisfying the experimental restraints, which exhibit a certain degree of structural variability. We analyzed here the interface in NMR ensembles of protein-protein heterodimeric complexes and found it to span a wide range of different conservations. The different exhibited conservations do not simply correlate with the size of the systems/interfaces, and are most probably the result of an interplay between different factors, including the quality of experimental data and the intrinsic complex flexibility. In any case, this information is not to be missed when NMR structures of protein-protein complexes are analyzed; especially considering that, as we also show here, the first NMR conformer is usually not the one which best reflects the overall interface. To quantify the interface conservation and to analyze it, we used an approach originally conceived for the analysis and ranking of ensembles of docking models, which has now been extended to directly deal with NMR ensembles. We propose this approach, based on the conservation of the inter-residue contacts at the interface, both for the analysis of the interface in whole ensembles of NMR complexes and for the possible selection of a single conformer as the best representative of the overall interface. In order to make the analyses automatic and fast, we made the protocol available as a web tool at: https://www.molnac.unisa.it/BioTools/consrank/consrank-nmr.html. Copyright © 2016 Elsevier Inc. All rights reserved.
Mechanism underlying selective regulation of G protein-gated inwardly rectifying potassium channels by the psychostimulant-sensitive sorting nexin 27

PubMed Central

Balana, Bartosz; Maslennikov, Innokentiy; Kwiatkowski, Witek; Stern, Kalyn M.; Bahima, Laia; Choe, Senyon; Slesinger, Paul A.

2011-01-01

G protein-gated inwardly rectifying potassium (GIRK) channels are important gatekeepers of neuronal excitability. The surface expression of neuronal GIRK channels is regulated by the psychostimulant-sensitive sorting nexin 27 (SNX27) protein through a class I (-X-Ser/Thr-X-Φ, where X is any residue and Φ is a hydrophobic amino acid) PDZ-binding interaction. The G protein-insensitive inward rectifier channel (IRK1) contains the same class I PDZ-binding motif but associates with a different synaptic PDZ protein, postsynaptic density protein 95 (PSD95). The mechanism by which SNX27 and PSD95 discriminate these channels was previously unclear. Using high-resolution structures coupled with biochemical and functional analyses, we identified key amino acids upstream of the channel's canonical PDZ-binding motif that associate electrostatically with a unique structural pocket in the SNX27-PDZ domain. Changing specific charged residues in the channel's carboxyl terminus or in the PDZ domain converts the selective association and functional regulation by SNX27. Elucidation of this unique interaction site between ion channels and PDZ-containing proteins could provide a therapeutic target for treating brain diseases. PMID:21422294
SLDP: a novel protein related to caleosin is associated with the endosymbiotic Symbiodinium lipid droplets from Euphyllia glabrescens.

PubMed

Pasaribu, Buntora; Lin, I-Ping; Tzen, Jason T C; Jauh, Guang-Yuh; Fan, Tung-Yung; Ju, Yu-Min; Cheng, Jing-O; Chen, Chii-Shiarng; Jiang, Pei-Luen

2014-10-01

Intracellular lipid droplets (LDs) have been proposed to play a key role in the mutualistic endosymbiosis between reef-building corals and the dinoflagellate endosymbiont Symbiodinium spp. This study investigates and identifies LD proteins in Symbiodinium from Euphyllia glabrescens. Discontinuous Percoll gradient centrifugation was used to separate Symbiodinium cells from E. glabrescens tentacles. Furthermore, staining with a fluorescent probe, Nile red, indicated that lipids accumulated in that freshly isolated Symbiodinium cells and lipid analyses further showed polyunsaturated fatty acids (PUFA) was abundant. The stable LDs were purified from endosymbiotic Symbiodinium cells. The structural integrity of the Symbiodinium LDs was maintained via electronegative repulsion and steric hindrance possibly provided by their surface proteins. Protein extracts from the purified LDs revealed a major protein band with a molecular weight of 20 kDa, which was termed Symbiodinium lipid droplet protein (SLDP). Interestingly, immunological cross-recognition analysis revealed that SLDP was detected strongly by the anti-sesame and anti-cycad caleosin antibodies. It was suggested that the stable Symbiodinium LDs were sheltered by this unique structural protein and was suggested that SLDP might be homologous to caleosin to a certain extent.
Horizontal gene transfer contributed to the evolution of extracellular surface structures: the freshwater polyp Hydra is covered by a complex fibrous cuticle containing glycosaminoglycans and proteins of the PPOD and SWT (sweet tooth) families.

PubMed

Böttger, Angelika; Doxey, Andrew C; Hess, Michael W; Pfaller, Kristian; Salvenmoser, Willi; Deutzmann, Rainer; Geissner, Andreas; Pauly, Barbara; Altstätter, Johannes; Münder, Sandra; Heim, Astrid; Gabius, Hans-Joachim; McConkey, Brendan J; David, Charles N

2012-01-01

The single-cell layered ectoderm of the fresh water polyp Hydra fulfills the function of an epidermis by protecting the animals from the surrounding medium. Its outer surface is covered by a fibrous structure termed the cuticle layer, with similarity to the extracellular surface coats of mammalian epithelia. In this paper we have identified molecular components of the cuticle. We show that its outermost layer contains glycoproteins and glycosaminoglycans and we have identified chondroitin and chondroitin-6-sulfate chains. In a search for proteins that could be involved in organising this structure we found PPOD proteins and several members of a protein family containing only SWT (sweet tooth) domains. Structural analyses indicate that PPODs consist of two tandem β-trefoil domains with similarity to carbohydrate-binding sites found in lectins. Experimental evidence confirmed that PPODs can bind sulfated glycans and are secreted into the cuticle layer from granules localized under the apical surface of the ectodermal epithelial cells. PPODs are taxon-specific proteins which appear to have entered the Hydra genome by horizontal gene transfer from bacteria. Their acquisition at the time Hydra evolved from a marine ancestor may have been critical for the transition to the freshwater environment.
Horizontal Gene Transfer Contributed to the Evolution of Extracellular Surface Structures: The Freshwater Polyp Hydra Is Covered by a Complex Fibrous Cuticle Containing Glycosaminoglycans and Proteins of the PPOD and SWT (Sweet Tooth) Families

PubMed Central

Böttger, Angelika; Doxey, Andrew C.; Hess, Michael W.; Pfaller, Kristian; Salvenmoser, Willi; Deutzmann, Rainer; Geissner, Andreas; Pauly, Barbara; Altstätter, Johannes; Münder, Sandra; Heim, Astrid; Gabius, Hans-Joachim; McConkey, Brendan J.; David, Charles N.

2012-01-01

The single-cell layered ectoderm of the fresh water polyp Hydra fulfills the function of an epidermis by protecting the animals from the surrounding medium. Its outer surface is covered by a fibrous structure termed the cuticle layer, with similarity to the extracellular surface coats of mammalian epithelia. In this paper we have identified molecular components of the cuticle. We show that its outermost layer contains glycoproteins and glycosaminoglycans and we have identified chondroitin and chondroitin-6-sulfate chains. In a search for proteins that could be involved in organising this structure we found PPOD proteins and several members of a protein family containing only SWT (sweet tooth) domains. Structural analyses indicate that PPODs consist of two tandem β-trefoil domains with similarity to carbohydrate-binding sites found in lectins. Experimental evidence confirmed that PPODs can bind sulfated glycans and are secreted into the cuticle layer from granules localized under the apical surface of the ectodermal epithelial cells. PPODs are taxon-specific proteins which appear to have entered the Hydra genome by horizontal gene transfer from bacteria. Their acquisition at the time Hydra evolved from a marine ancestor may have been critical for the transition to the freshwater environment. PMID:23300632
Convergent evolution of plant and animal embryo defences by hyperstable non-digestible storage proteins.

PubMed

Pasquevich, María Yanina; Dreon, Marcos Sebastián; Qiu, Jian-Wen; Mu, Huawei; Heras, Horacio

2017-11-20

Plants have evolved sophisticated embryo defences by kinetically-stable non-digestible storage proteins that lower the nutritional value of seeds, a strategy that have not been reported in animals. To further understand antinutritive defences in animals, we analysed PmPV1, massively accumulated in the eggs of the gastropod Pomacea maculata, focusing on how its structure and structural stability features affected its capacity to withstand passage through predator guts. The native protein withstands >50 min boiling and resists the denaturing detergent sodium dodecyl sulphate (SDS), indicating an unusually high structural stability (i.e., kinetic stability). PmPV1 is highly resistant to in vitro proteinase digestion and displays structural stability between pH 2.0-12.0 and 25-85 °C. Furthermore, PmPV1 withstands in vitro and mice digestion and is recovered unchanged in faeces, supporting an antinutritive defensive function. Subunit sequence similarities suggest a common origin and tolerance to mutations. This is the first known animal genus that, like plant seeds, lowers the nutritional value of eggs by kinetically-stable non-digestible storage proteins that survive the gut of predators unaffected. The selective pressure of the harsh gastrointestinal environment would have favoured their appearance, extending by convergent evolution the presence of plant-like hyperstable antinutritive proteins to unattended reproductive stages in animals.
Comparative structural studies of psychrophilic and mesophilic protein homologues by molecular dynamics simulation.

PubMed

Kundu, Sangeeta; Roy, Debjani

2009-01-01

Comparative molecular dynamics simulations of psychrophilic type III antifreeze protein from the North-Atlantic ocean-pout Macrozoarces americanus and its corresponding mesophilic counterpart, the antifreeze-like domain of human sialic acid synthase, have been performed for 10 ns each at five different temperatures. Analyses of trajectories in terms of secondary structure content, solvent accessibility, intramolecular hydrogen bonds and protein-solvent interactions indicate distinct differences in these two proteins. The two proteins also follow dissimilar unfolding pathways. The overall flexibility calculated by the trace of the diagonalized covariance matrix displays similar flexibility of both the proteins near their growth temperatures. However at higher temperatures psychrophilic protein shows increased overall flexibility than its mesophilic counterpart. Principal component analysis also indicates that the essential subspaces explored by the simulations of two proteins at different temperatures are non-overlapping and they show significantly different directions of motion. However, there are significant overlaps within the trajectories and similar directions of motion of each protein especially at 298 K, 310 K and 373 K. Overall, the psychrophilic protein leads to increased conformational sampling of the phase space than its mesophilic counterpart. Our study may help in elucidating the molecular basis of thermostability of homologous proteins from two organisms living at different temperature conditions. Such an understanding is required for designing efficient proteins with characteristics for a particular application at desired working temperatures.
Expansion of divergent SEA domains in cell surface proteins and nucleoporin 54.

PubMed

Pei, Jimin; Grishin, Nick V

2017-03-01

SEA (sea urchin sperm protein, enterokinase, agrin) domains, many of which possess autoproteolysis activity, have been found in a number of cell surface and secreted proteins. Despite high sequence divergence, SEA domains were also proposed to be present in dystroglycan based on a conserved autoproteolysis motif and receptor-type protein phosphatase IA-2 based on structural similarity. The presence of a SEA domain adjacent to the transmembrane segment appears to be a recurring theme in quite a number of type I transmembrane proteins on the cell surface, such as MUC1, dystroglycan, IA-2, and Notch receptors. By comparative sequence and structural analyses, we identified dystroglycan-like proteins with SEA domains in Capsaspora owczarzaki of the Filasterea group, one of the closest single-cell relatives of metazoans. We also detected novel and divergent SEA domains in a variety of cell surface proteins such as EpCAM, α/ε-sarcoglycan, PTPRR, collectrin/Tmem27, amnionless, CD34, KIAA0319, fibrocystin-like protein, and a number of cadherins. While these proteins are mostly from metazoans or their single cell relatives such as choanoflagellates and Filasterea, fibrocystin-like proteins with SEA domains were found in several other eukaryotic lineages including green algae, Alveolata, Euglenozoa, and Haptophyta, suggesting an ancient evolutionary origin. In addition, the intracellular protein Nucleoporin 54 (Nup54) acquired a divergent SEA domain in choanoflagellates and metazoans. © 2016 The Protein Society.
Physiological enzymology: The next frontier in understanding protein structure and function at the cellular level.

PubMed

Lee, Irene; Berdis, Anthony J

2016-01-01

Historically, the study of proteins has relied heavily on characterizing the activity of a single purified protein isolated from other cellular components. This classic approach allowed scientists to unambiguously define the intrinsic kinetic and chemical properties of that protein. The ultimate hope was to extrapolate this information toward understanding how the enzyme or receptor behaves within its native cellular context. These types of detailed in vitro analyses were necessary to reduce the innate complexities of measuring the singular activity and biochemical properties of a specific enzyme without interference from other enzymes and potential competing substrates. However, recent developments in fields encompassing cell biology, molecular imaging, and chemical biology now provide the unique chemical tools and instrumentation to study protein structure, function, and regulation in their native cellular environment. These advancements provide the foundation for a new field, coined physiological enzymology, which quantifies the function and regulation of enzymes and proteins at the cellular level. In this Special Edition, we explore the area of Physiological Enzymology and Protein Function through a series of review articles that focus on the tools and techniques used to measure the cellular activity of proteins inside living cells. This article is part of a Special Issue entitled: Physiological Enzymology and Protein Functions. Copyright © 2015 Elsevier B.V. All rights reserved.
Identification of proteins interacting with lactate dehydrogenase in claw muscle of the porcelain crab Petrolisthes cinctipes

PubMed Central

Cayenne, Andrea P.; Gabert, Beverly; Stillman, Jonathon H.

2011-01-01

Biochemical adaptation of enzymes involves conservation of activity, stability and affinity across a wide range of intracellular and environmental conditions. Enzyme adaptation by alteration of primary structure is well known, but the roles of protein-protein interactions in enzyme adaptation are less well understood. Interspecific differences in thermal stability of lactate dehydrogenase (LDH) in porcelain crabs (genus Petrolisthes) are related to intrinsic differences among LDH molecules and by interactions with other stabilizing proteins. Here, we identified proteins that interact with LDH in porcelain crab claw muscle tissue using co-immunoprecipitation, and showed LDH exists in high molecular weight complexes using size exclusion chromatography and Western blot analyses. Co-immunoprecipitated proteins were separated using 2D SDS PAGE and analyzed by LC/ESI using peptide MS/MS. Peptide MS/MS ions were compared to an EST database for Petrolisthes cinctipes to identify proteins. Identified proteins included cytoskeletal elements, glycolytic enzymes, a phosphagen kinase, and the respiratory protein hemocyanin. Our results support the hypothesis that LDH interacts with glycolytic enzymes in a metabolon structured by cytoskeletal elements that may also include the enzyme for transfer of the adenylate charge in glycolytically produced ATP. Those interactions may play specific roles in biochemical adaptation of glycolytic enzymes. PMID:21968246
Comparative Analysis of Type IV Pilin in Desulfuromonadales

PubMed Central

Shu, Chuanjun; Xiao, Ke; Yan, Qin; Sun, Xiao

2016-01-01

During anaerobic respiration, the bacteria Geobacter sulfurreducens can transfer electrons to extracellular electron accepters through its pilus. G. sulfurreducens pili have been reported to have metallic-like conductivity that is similar to doped organic semiconductors. To study the characteristics and origin of conductive pilin proteins found in the pilus structure, their genetic, structural, and phylogenetic properties were analyzed. The genetic relationships, and conserved structures and sequences that were obtained were used to predict the evolution of the pilins. Homologous genes that encode conductive pilin were found using PilFind and Cluster. Sequence characteristics and protein tertiary structures were analyzed with MAFFT and QUARK, respectively. The origin of conductive pilins was explored by building a phylogenetic tree. Truncation is a characteristic of conductive pilin. The structures of truncated pilins and their accompanying proteins were found to be similar to the N-terminal and C-terminal ends of full-length pilins respectively. The emergence of the truncated pilins can probably be ascribed to the evolutionary pressure of their extracellular electron transporting function. Genes encoding truncated pilins and proteins similar to the C-terminal of full-length pilins, which contain a group of consecutive anti-parallel beta-sheets, are adjacent in bacterial genomes. According to the genetic, structure, and phylogenetic analyses performed in this study, we inferred that the truncated pilins and their accompanying proteins probably evolved from full-length pilins by gene fission through duplication, degeneration, and separation. These findings provide new insights about the molecular mechanisms involved in long-range electron transport along the conductive pili of Geobacter species. PMID:28066394
Structural analyses of the CRISPR protein Csc2 reveal the RNA-binding interface of the type I-D Cas7 family.

PubMed

Hrle, Ajla; Maier, Lisa-Katharina; Sharma, Kundan; Ebert, Judith; Basquin, Claire; Urlaub, Henning; Marchfelder, Anita; Conti, Elena

2014-01-01

Upon pathogen invasion, bacteria and archaea activate an RNA-interference-like mechanism termed CRISPR (clustered regularly interspaced short palindromic repeats). A large family of Cas (CRISPR-associated) proteins mediates the different stages of this sophisticated immune response. Bioinformatic studies have classified the Cas proteins into families, according to their sequences and respective functions. These range from the insertion of the foreign genetic elements into the host genome to the activation of the interference machinery as well as target degradation upon attack. Cas7 family proteins are central to the type I and type III interference machineries as they constitute the backbone of the large interference complexes. Here we report the crystal structure of Thermofilum pendens Csc2, a Cas7 family protein of type I-D. We found that Csc2 forms a core RRM-like domain, flanked by three peripheral insertion domains: a lid domain, a Zinc-binding domain and a helical domain. Comparison with other Cas7 family proteins reveals a set of similar structural features both in the core and in the peripheral domains, despite the absence of significant sequence similarity. T. pendens Csc2 binds single-stranded RNA in vitro in a sequence-independent manner. Using a crosslinking - mass-spectrometry approach, we mapped the RNA-binding surface to a positively charged surface patch on T. pendens Csc2. Thus our analysis of the key structural and functional features of T. pendens Csc2 highlights recurring themes and evolutionary relationships in type I and type III Cas proteins.
Protein disorder in the human diseasome: unfoldomics of human genetic diseases

PubMed Central

Midic, Uros; Oldfield, Christopher J; Dunker, A Keith; Obradovic, Zoran; Uversky, Vladimir N

2009-01-01

Background Intrinsically disordered proteins lack stable structure under physiological conditions, yet carry out many crucial biological functions, especially functions associated with regulation, recognition, signaling and control. Recently, human genetic diseases and related genes were organized into a bipartite graph (Goh KI, Cusick ME, Valle D, Childs B, Vidal M, et al. (2007) The human disease network. Proc Natl Acad Sci U S A 104: 8685–8690). This diseasome network revealed several significant features such as the common genetic origin of many diseases. Methods and findings We analyzed the abundance of intrinsic disorder in these diseasome network proteins by means of several prediction algorithms, and we analyzed the functional repertoires of these proteins based on prior studies relating disorder to function. Our analyses revealed that (i) Intrinsic disorder is common in proteins associated with many human genetic diseases; (ii) Different disease classes vary in the IDP contents of their associated proteins; (iii) Molecular recognition features, which are relatively short loosely structured protein regions within mostly disordered sequences and which gain structure upon binding to partners, are common in the diseasome, and their abundance correlates with the intrinsic disorder level; (iv) Some disease classes have a significant fraction of genes affected by alternative splicing, and the alternatively spliced regions in the corresponding proteins are predicted to be highly disordered; and (v) Correlations were found among the various diseasome graph-related properties and intrinsic disorder. Conclusion These observations provide the basis for the construction of the human-genetic-disease-associated unfoldome. PMID:19594871
Expression and crystallization of the plant alternative oxidase.

PubMed

May, Benjamin; Elliott, Catherine; Iwata, Momi; Young, Luke; Shearman, Julia; Albury, Mary S; Moore, Anthony L

2015-01-01

The alternative oxidase (AOX) is an integral monotopic membrane protein located on the inner surface of the inner mitochondrial membrane. Branching from the traditional respiratory chain at the quinone pool, AOX is responsible for cyanide-resistant respiration in plants and fungi, heat generation in thermogenic plants, and survival of parasites, such as Trypanosoma brucei, in the human host. A recently solved AOX structure provides insight into its active site, thereby facilitating rational phytopathogenic and antiparasitic drug design. Here, we describe expression of recombinant AOX using two different expression systems. Purification protocols for the production of highly pure and stable AOX protein in sufficient quantities to facilitate further kinetic, biophysical, and structural analyses are also described.
Three-dimensional structure of the human immunodeficiency virus type 1 matrix protein.

PubMed

Massiah, M A; Starich, M R; Paschall, C; Summers, M F; Christensen, A M; Sundquist, W I

1994-11-25

The HIV-1 matrix protein forms an icosahedral shell associated with the inner membrane of the mature virus. Genetic analyses have indicated that the protein performs important functions throughout the viral life-cycle, including anchoring the transmembrane envelope protein on the surface of the virus, assisting in viral penetration, transporting the proviral integration complex across the nuclear envelope, and localizing the assembling virion to the cell membrane. We now report the three-dimensional structure of recombinant HIV-1 matrix protein, determined at high resolution by nuclear magnetic resonance (NMR) methods. The HIV-1 matrix protein is the first retroviral matrix protein to be characterized structurally and only the fourth HIV-1 protein of known structure. NMR signal assignments required recently developed triple-resonance (1H, 13C, 15N) NMR methodologies because signals for 91% of 132 assigned H alpha protons and 74% of the 129 assignable backbone amide protons resonate within chemical shift ranges of 0.8 p.p.m. and 1 p.p.m., respectively. A total of 636 nuclear Overhauser effect-derived distance restraints were employed for distance geometry-based structure calculations, affording an average of 13.0 NMR-derived distance restraints per residue for the experimentally constrained amino acids. An ensemble of 25 refined distance geometry structures with penalties (sum of the squares of the distance violations) of 0.32 A2 or less and individual distance violations under 0.06 A was generated; best-fit superposition of ordered backbone heavy atoms relative to mean atom positions afforded root-mean-square deviations of 0.50 (+/- 0.08) A. The folded HIV-1 matrix protein structure is composed of five alpha-helices, a short 3(10) helical stretch, and a three-strand mixed beta-sheet. Helices I to III and the 3(10) helix pack about a central helix (IV) to form a compact globular domain that is capped by the beta-sheet. The C-terminal helix (helix V) projects away from the beta-sheet to expose carboxyl-terminal residues essential for early steps in the HIV-1 infectious cycle. Basic residues implicated in membrane binding and nuclear localization functions cluster about an extruded cationic loop that connects beta-strands 1 and 2. The structure suggests that both membrane binding and nuclear localization may be mediated by complex tertiary structures rather than simple linear determinants.
Val-->Ala mutations selectively alter helix-helix packing in the transmembrane segment of phage M13 coat protein.

PubMed Central

Deber, C M; Khan, A R; Li, Z; Joensson, C; Glibowicka, M; Wang, J

1993-01-01

Val-->Ala mutations within the effective transmembrane segment of a model single-spanning membrane protein, the 50-residue major coat (gene VIII) protein of bacteriophage M13, are shown to have sequence-dependent impacts on stabilization of membrane-embedded helical dimeric structures. Randomized mutagenesis performed on the coat protein hydrophobic segment 21-39 (YIGYAWAMV-VVIVGATIGI) produced a library of viable mutants which included those in which each of the four valine residues was replaced by an alanine residue. Significant variations found among these Val-->Ala mutants in the relative populations and thermal stabilities of monomeric and dimeric helical species observed on SDS/PAGE, and in the range of their alpha-helix-->beta-sheet transition temperatures confirmed that intramembranous valine residues are not simply universal contributors to membrane anchoring. Additional analyses of (i) nonmutatable sites in the mutant protein library, (ii) the properties of the double mutant V29A-V31A obtained by recycling mutant V31A DNA through mutagenesis procedures, and (iii) energy-minimized helical dimer structures of wild-type and mutant V31A transmembrane regions indicated that the transmembrane hydrophobic core helix of the M13 coat protein can be partitioned into alternating pairs of potential protein-interactive residues (V30, V31; G34, A35; G38, I39) and membrane-interactive residues (M28, V29; I32, V33; T36, I37). The overall results consitute an experimental approach to categorizing the distinctive contributions to structure of the residues comprising a protein-protein packing interface vs. those facing lipid and confirm the sequence-dependent capacity of specific residues within the transmembrane domain to modulate protein-protein interactions which underlie regulatory events in membrane proteins. Images Fig. 2 Fig. 4 PMID:8265602

Val-->Ala mutations selectively alter helix-helix packing in the transmembrane segment of phage M13 coat protein.

PubMed

Deber, C M; Khan, A R; Li, Z; Joensson, C; Glibowicka, M; Wang, J

1993-12-15

Val-->Ala mutations within the effective transmembrane segment of a model single-spanning membrane protein, the 50-residue major coat (gene VIII) protein of bacteriophage M13, are shown to have sequence-dependent impacts on stabilization of membrane-embedded helical dimeric structures. Randomized mutagenesis performed on the coat protein hydrophobic segment 21-39 (YIGYAWAMV-VVIVGATIGI) produced a library of viable mutants which included those in which each of the four valine residues was replaced by an alanine residue. Significant variations found among these Val-->Ala mutants in the relative populations and thermal stabilities of monomeric and dimeric helical species observed on SDS/PAGE, and in the range of their alpha-helix-->beta-sheet transition temperatures confirmed that intramembranous valine residues are not simply universal contributors to membrane anchoring. Additional analyses of (i) nonmutatable sites in the mutant protein library, (ii) the properties of the double mutant V29A-V31A obtained by recycling mutant V31A DNA through mutagenesis procedures, and (iii) energy-minimized helical dimer structures of wild-type and mutant V31A transmembrane regions indicated that the transmembrane hydrophobic core helix of the M13 coat protein can be partitioned into alternating pairs of potential protein-interactive residues (V30, V31; G34, A35; G38, I39) and membrane-interactive residues (M28, V29; I32, V33; T36, I37). The overall results consitute an experimental approach to categorizing the distinctive contributions to structure of the residues comprising a protein-protein packing interface vs. those facing lipid and confirm the sequence-dependent capacity of specific residues within the transmembrane domain to modulate protein-protein interactions which underlie regulatory events in membrane proteins.
Enhanced vulnerability of human proteins towards disease-associated inactivation through divergent evolution.

PubMed

Medina-Carmona, Encarnación; Fuchs, Julian E; Gavira, Jose A; Mesa-Torres, Noel; Neira, Jose L; Salido, Eduardo; Palomino-Morales, Rogelio; Burgos, Miguel; Timson, David J; Pey, Angel L

2017-09-15

Human proteins are vulnerable towards disease-associated single amino acid replacements affecting protein stability and function. Interestingly, a few studies have shown that consensus amino acids from mammals or vertebrates can enhance protein stability when incorporated into human proteins. Here, we investigate yet unexplored relationships between the high vulnerability of human proteins towards disease-associated inactivation and recent evolutionary site-specific divergence of stabilizing amino acids. Using phylogenetic, structural and experimental analyses, we show that divergence from the consensus amino acids at several sites during mammalian evolution has caused local protein destabilization in two human proteins linked to disease: cancer-associated NQO1 and alanine:glyoxylate aminotransferase, mutated in primary hyperoxaluria type I. We demonstrate that a single consensus mutation (H80R) acts as a disease suppressor on the most common cancer-associated polymorphism in NQO1 (P187S). The H80R mutation reactivates P187S by enhancing FAD binding affinity through local and dynamic stabilization of its binding site. Furthermore, we show how a second suppressor mutation (E247Q) cooperates with H80R in protecting the P187S polymorphism towards inactivation through long-range allosteric communication within the structural ensemble of the protein. Our results support that recent divergence of consensus amino acids may have occurred with neutral effects on many functional and regulatory traits of wild-type human proteins. However, divergence at certain sites may have increased the propensity of some human proteins towards inactivation due to disease-associated mutations and polymorphisms. Consensus mutations also emerge as a potential strategy to identify structural hot-spots in proteins as targets for pharmacological rescue in loss-of-function genetic diseases. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
High-Resolution Mapping of Chromatin Conformation in Cardiac Myocytes Reveals Structural Remodeling of the Epigenome in Heart Failure

PubMed Central

Rosa-Garrido, Manuel; Chapski, Douglas J.; Schmitt, Anthony D.; Kimball, Todd H.; Karbassi, Elaheh; Monte, Emma; Balderas, Enrique; Pellegrini, Matteo; Shih, Tsai-Ting; Soehalim, Elizabeth; Liem, David; Ping, Peipei; Galjart, Niels J.; Ren, Shuxun; Wang, Yibin; Ren, Bing

2017-01-01

Background: Cardiovascular disease is associated with epigenomic changes in the heart; however, the endogenous structure of cardiac myocyte chromatin has never been determined. Methods: To investigate the mechanisms of epigenomic function in the heart, genome-wide chromatin conformation capture (Hi-C) and DNA sequencing were performed in adult cardiac myocytes following development of pressure overload–induced hypertrophy. Mice with cardiac-specific deletion of CTCF (a ubiquitous chromatin structural protein) were generated to explore the role of this protein in chromatin structure and cardiac phenotype. Transcriptome analyses by RNA-seq were conducted as a functional readout of the epigenomic structural changes. Results: Depletion of CTCF was sufficient to induce heart failure in mice, and human patients with heart failure receiving mechanical unloading via left ventricular assist devices show increased CTCF abundance. Chromatin structural analyses revealed interactions within the cardiac myocyte genome at 5-kb resolution, enabling examination of intra- and interchromosomal events, and providing a resource for future cardiac epigenomic investigations. Pressure overload or CTCF depletion selectively altered boundary strength between topologically associating domains and A/B compartmentalization, measurements of genome accessibility. Heart failure involved decreased stability of chromatin interactions around disease-causing genes. In addition, pressure overload or CTCF depletion remodeled long-range interactions of cardiac enhancers, resulting in a significant decrease in local chromatin interactions around these functional elements. Conclusions: These findings provide a high-resolution chromatin architecture resource for cardiac epigenomic investigations and demonstrate that global structural remodeling of chromatin underpins heart failure. The newly identified principles of endogenous chromatin structure have key implications for epigenetic therapy. PMID:28802249
High-Resolution Mapping of Chromatin Conformation in Cardiac Myocytes Reveals Structural Remodeling of the Epigenome in Heart Failure.

PubMed

Rosa-Garrido, Manuel; Chapski, Douglas J; Schmitt, Anthony D; Kimball, Todd H; Karbassi, Elaheh; Monte, Emma; Balderas, Enrique; Pellegrini, Matteo; Shih, Tsai-Ting; Soehalim, Elizabeth; Liem, David; Ping, Peipei; Galjart, Niels J; Ren, Shuxun; Wang, Yibin; Ren, Bing; Vondriska, Thomas M

2017-10-24

Cardiovascular disease is associated with epigenomic changes in the heart; however, the endogenous structure of cardiac myocyte chromatin has never been determined. To investigate the mechanisms of epigenomic function in the heart, genome-wide chromatin conformation capture (Hi-C) and DNA sequencing were performed in adult cardiac myocytes following development of pressure overload-induced hypertrophy. Mice with cardiac-specific deletion of CTCF (a ubiquitous chromatin structural protein) were generated to explore the role of this protein in chromatin structure and cardiac phenotype. Transcriptome analyses by RNA-seq were conducted as a functional readout of the epigenomic structural changes. Depletion of CTCF was sufficient to induce heart failure in mice, and human patients with heart failure receiving mechanical unloading via left ventricular assist devices show increased CTCF abundance. Chromatin structural analyses revealed interactions within the cardiac myocyte genome at 5-kb resolution, enabling examination of intra- and interchromosomal events, and providing a resource for future cardiac epigenomic investigations. Pressure overload or CTCF depletion selectively altered boundary strength between topologically associating domains and A/B compartmentalization, measurements of genome accessibility. Heart failure involved decreased stability of chromatin interactions around disease-causing genes. In addition, pressure overload or CTCF depletion remodeled long-range interactions of cardiac enhancers, resulting in a significant decrease in local chromatin interactions around these functional elements. These findings provide a high-resolution chromatin architecture resource for cardiac epigenomic investigations and demonstrate that global structural remodeling of chromatin underpins heart failure. The newly identified principles of endogenous chromatin structure have key implications for epigenetic therapy. © 2017 The Authors.
Expression, Purification and Characterization of GMZ2'.10C, a Complex Disulphide-Bonded Fusion Protein Vaccine Candidate against the Asexual and Sexual Life-Stages of the Malaria-Causing Plasmodium falciparum Parasite.

PubMed

Mistarz, Ulrik H; Singh, Susheel K; Nguyen, Tam T T N; Roeffen, Will; Yang, Fen; Lissau, Casper; Madsen, Søren M; Vrang, Astrid; Tiendrebeogo, Régis W; Kana, Ikhlaq H; Sauerwein, Robert W; Theisen, Michael; Rand, Kasper D

2017-09-01

Production and characterization of a chimeric fusion protein (GMZ2'.10C) which combines epitopes of key malaria parasite antigens: glutamate-rich protein (GLURP), merozoite surface protein 3 (MSP3), and the highly disulphide bonded Pfs48/45 (10C). GMZ2'.10C is a potential candidate for a multi-stage malaria vaccine that targets both transmission and asexual life-cycle stages of the parasite. GMZ2'.10C was produced in Lactococcus lactis and purified using either an immunoaffinity purification (IP) or a conventional purification (CP) method. Protein purity and stability was analysed by RP-HPLC, SEC-HPLC, 2-site ELISA, gel-electrophoresis and Western blotting. Structural characterization (mass analysis, peptide mapping and cysteine connectivity mapping) was performed by LC-MS/MS. CP-GMZ2'.10C resulted in similar purity, yield, structure and stability as compared to IP-GMZ2'.10C. CP-GMZ2'.10C and IP-GMZ2'.10C both elicited a high titer of transmission blocking (TB) antibodies in rodents. The intricate disulphide-bond connectivity of C-terminus Pfs48/45 was analysed by tandem mass spectrometry and was established for GMZ2'.10C and two reference fusion proteins encompassing similar parts of Pfs48/45. GMZ2'.10C, combining GMZ2' and correctly-folded Pfs48/45 can be produced by the Lactoccus lactis P170 based expression system in purity and quality for pharmaceutical development and elicit high level of TB antibodies. The cysteine connectivity for the 10C region of Pfs48/45 was revealed experimentally, providing an important guideline for employing the Pfs48/45 antigen in vaccine design.
Aggregating Data for Computational Toxicology Applications: The U.S. Environmental Protection Agency (EPA) Aggregated Computational Toxicology Resource (ACToR) System

EPA Science Inventory

Computational toxicology combines data from high-throughput test methods, chemical structure analyses and other biological domains (e.g., genes, proteins, cells, tissues) with the goals of predicting and understanding the underlying mechanistic causes of chemical toxicity and for...
Conformational dynamics of proanthocyanidins: physical and computational approaches

Treesearch

Fred L. Tobiason; Richard W. Hemingway; T. Hatano

1998-01-01

The interaction of plant polyphenols with proteins accounts for a good part of their commercial (e.g., leather manufacture) and biological (e.g., antimicrobial activity) significance. The interplay between observations of physical data such as crystal structure, NMR analyses, and time-resolved fluorescence with results of computational chemistry approaches has been...
Solution structure of the c-terminal dimerization domain of SARS coronavirus nucleocapsid protein solved by the SAIL-NMR method.

PubMed

Takeda, Mitsuhiro; Chang, Chung-ke; Ikeya, Teppei; Güntert, Peter; Chang, Yuan-hsiang; Hsu, Yen-lan; Huang, Tai-huang; Kainosho, Masatsune

2008-07-18

The C-terminal domain (CTD) of the severe acute respiratory syndrome coronavirus (SARS-CoV) nucleocapsid protein (NP) contains a potential RNA-binding region in its N-terminal portion and also serves as a dimerization domain by forming a homodimer with a molecular mass of 28 kDa. So far, the structure determination of the SARS-CoV NP CTD in solution has been impeded by the poor quality of NMR spectra, especially for aromatic resonances. We have recently developed the stereo-array isotope labeling (SAIL) method to overcome the size problem of NMR structure determination by utilizing a protein exclusively composed of stereo- and regio-specifically isotope-labeled amino acids. Here, we employed the SAIL method to determine the high-quality solution structure of the SARS-CoV NP CTD by NMR. The SAIL protein yielded less crowded and better resolved spectra than uniform (13)C and (15)N labeling, and enabled the homodimeric solution structure of this protein to be determined. The NMR structure is almost identical with the previously solved crystal structure, except for a disordered putative RNA-binding domain at the N-terminus. Studies of the chemical shift perturbations caused by the binding of single-stranded DNA and mutational analyses have identified the disordered region at the N-termini as the prime site for nucleic acid binding. In addition, residues in the beta-sheet region also showed significant perturbations. Mapping of the locations of these residues onto the helical model observed in the crystal revealed that these two regions are parts of the interior lining of the positively charged helical groove, supporting the hypothesis that the helical oligomer may form in solution.
Evolutionary implications of phylogenetic analyses of the gene transfer agent (GTA) of Rhodobacter capsulatus.

PubMed

Lang, Andrew S; Taylor, Terumi A; Beatty, J Thomas

2002-11-01

The gene transfer agent (GTA) of the a-proteobacterium Rhodobacter capsulatus is a cell-controlled genetic exchange vector. Genes that encode the GTA structure are clustered in a 15-kb region of the R. capsulatus chromosome, and some of these genes show sequence similarity to known bacteriophage head and tail genes. However, the production of GTA is controlled at the level of transcription by a cellular two-component signal transduction system. This paper describes homologues of both the GTA structural gene cluster and the GTA regulatory genes in the a-proteobacteria Rhodopseudomonas palustris, Rhodobacter sphaeroides, Caulobacter crescentus, Agrobacterium tumefaciens and Brucella melitensis. These sequences were used in a phylogenetic tree approach to examine the evolutionary relationships of selected GTA proteins to these homologues and (pro)phage proteins, which was compared to a 16S rRNA tree. The data indicate that a GTA-like element was present in a single progenitor of the extant species that contain both GTA structural cluster and regulatory gene homologues. The evolutionary relationships of GTA structural proteins to (pro)phage proteins indicated by the phylogenetic tree patterns suggest a predominantly vertical descent of GTA-like sequences in the a-proteobacteria and little past gene exchange with (pro)phages.
FunTree: advances in a resource for exploring and contextualising protein function evolution.

PubMed

Sillitoe, Ian; Furnham, Nicholas

2016-01-04

FunTree is a resource that brings together protein sequence, structure and functional information, including overall chemical reaction and mechanistic data, for structurally defined domain superfamilies. Developed in tandem with the CATH database, the original FunTree contained just 276 superfamilies focused on enzymes. Here, we present an update of FunTree that has expanded to include 2340 superfamilies including both enzymes and proteins with non-enzymatic functions annotated by Gene Ontology (GO) terms. This allows the investigation of how novel functions have evolved within a structurally defined superfamily and provides a means to analyse trends across many superfamilies. This is done not only within the context of a protein's sequence and structure but also the relationships of their functions. New measures of functional similarity have been integrated, including for enzymes comparisons of overall reactions based on overall bond changes, reaction centres (the local environment atoms involved in the reaction) and the sub-structure similarities of the metabolites involved in the reaction and for non-enzymes semantic similarities based on the GO. To identify and highlight changes in function through evolution, ancestral character estimations are made and presented. All this is accessible through a new re-designed web interface that can be found at http://www.funtree.info. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Crystal Structure of the Ubiquitin-associated (UBA) Domain of p62 and Its Interaction with Ubiquitin*

PubMed Central

Isogai, Shin; Morimoto, Daichi; Arita, Kyohei; Unzai, Satoru; Tenno, Takeshi; Hasegawa, Jun; Sou, Yu-shin; Komatsu, Masaaki; Tanaka, Keiji; Shirakawa, Masahiro; Tochio, Hidehito

2011-01-01

p62/SQSTM1/A170 is a multimodular protein that is found in ubiquitin-positive inclusions associated with neurodegenerative diseases. Recent findings indicate that p62 mediates the interaction between ubiquitinated proteins and autophagosomes, leading these proteins to be degraded via the autophagy-lysosomal pathway. This ubiquitin-mediated selective autophagy is thought to begin with recognition of the ubiquitinated proteins by the C-terminal ubiquitin-associated (UBA) domain of p62. We present here the crystal structure of the UBA domain of mouse p62 and the solution structure of its ubiquitin-bound form. The p62 UBA domain adopts a novel dimeric structure in crystals, which is distinctive from those of other UBA domains. NMR analyses reveal that in solution the domain exists in equilibrium between the dimer and monomer forms, and binding ubiquitin shifts the equilibrium toward the monomer to form a 1:1 complex between the UBA domain and ubiquitin. The dimer-to-monomer transition is associated with a structural change of the very C-terminal end of the p62 UBA domain, although the UBA fold itself is essentially maintained. Our data illustrate that dimerization and ubiquitin binding of the p62 UBA domain are incompatible with each other. These observations reveal an autoinhibitory mechanism in the p62 UBA domain and suggest that autoinhibition plays a role in the function of p62. PMID:21715324
CNA web server: rigidity theory-based thermal unfolding simulations of proteins for linking structure, (thermo-)stability, and function.

PubMed

Krüger, Dennis M; Rathi, Prakash Chandra; Pfleger, Christopher; Gohlke, Holger

2013-07-01

The Constraint Network Analysis (CNA) web server provides a user-friendly interface to the CNA approach developed in our laboratory for linking results from rigidity analyses to biologically relevant characteristics of a biomolecular structure. The CNA web server provides a refined modeling of thermal unfolding simulations that considers the temperature dependence of hydrophobic tethers and computes a set of global and local indices for quantifying biomacromolecular stability. From the global indices, phase transition points are identified where the structure switches from a rigid to a floppy state; these phase transition points can be related to a protein's (thermo-)stability. Structural weak spots (unfolding nuclei) are automatically identified, too; this knowledge can be exploited in data-driven protein engineering. The local indices are useful in linking flexibility and function and to understand the impact of ligand binding on protein flexibility. The CNA web server robustly handles small-molecule ligands in general. To overcome issues of sensitivity with respect to the input structure, the CNA web server allows performing two ensemble-based variants of thermal unfolding simulations. The web server output is provided as raw data, plots and/or Jmol representations. The CNA web server, accessible at http://cpclab.uni-duesseldorf.de/cna or http://www.cnanalysis.de, is free and open to all users with no login requirement.
Rational Design of Protein Stability: Effect of (2S,4R)-4-Fluoroproline on the Stability and Folding Pathway of Ubiquitin

PubMed Central

Crespo, Maria D.; Rubini, Marina

2011-01-01

Background Many strategies have been employed to increase the conformational stability of proteins. The use of 4-substituted proline analogs capable to induce pre-organization in target proteins is an attractive tool to deliver an additional conformational stability without perturbing the overall protein structure. Both, peptides and proteins containing 4-fluorinated proline derivatives can be stabilized by forcing the pyrrolidine ring in its favored puckering conformation. The fluorinated pyrrolidine rings of proline can preferably stabilize either a Cγ-exo or a Cγ-endo ring pucker in dependence of proline chirality (4R/4S) in a complex protein structure. To examine whether this rational strategy can be generally used for protein stabilization, we have chosen human ubiquitin as a model protein which contains three proline residues displaying Cγ-exo puckering. Methodology/Principal Findings While (2S,4R)-4-fluoroproline ((4R)-FPro) containing ubiquitinin can be expressed in related auxotrophic Escherichia coli strain, all attempts to incorporate (2S,4S)-4-fluoroproline ((4S)-FPro) failed. Our results indicate that (4R)-FPro is favoring the Cγ-exo conformation present in the wild type structure and stabilizes the protein structure due to a pre-organization effect. This was confirmed by thermal and guanidinium chloride-induced denaturation profile analyses, where we observed an increase in stability of −4.71 kJ·mol−1 in the case of (4R)-FPro containing ubiquitin ((4R)-FPro-ub) compared to wild type ubiquitin (wt-ub). Expectedly, activity assays revealed that (4R)-FPro-ub retained the full biological activity compared to wt-ub. Conclusions/Significance The results fully confirm the general applicability of incorporating fluoroproline derivatives for improving protein stability. In general, a rational design strategy that enforces the natural occurring proline puckering conformation can be used to stabilize the desired target protein. PMID:21625626
Determinants of RNA binding and translational repression by the Bicaudal-C regulatory protein.

PubMed

Zhang, Yan; Park, Sookhee; Blaser, Susanne; Sheets, Michael D

2014-03-14

Bicaudal-C (Bic-C) RNA binding proteins function as important translational repressors in multiple biological contexts within metazoans. However, their RNA binding sites are unknown. We recently demonstrated that Bic-C functions in spatially regulated translational repression of the xCR1 mRNA during Xenopus development. This repression contributes to normal development by confining the xCR1 protein, a regulator of key signaling pathways, to specific cells of the embryo. In this report, we combined biochemical approaches with in vivo mRNA reporter assays to define the minimal Bic-C target site within the xCR1 mRNA. This 32-nucleotide Bic-C target site is predicted to fold into a stem-loop secondary structure. Mutational analyses provided evidence that this stem-loop structure is important for Bic-C binding. The Bic-C target site was sufficient for Bic-C mediated repression in vivo. Thus, we describe the first RNA binding site for a Bic-C protein. This identification provides an important step toward understanding the mechanisms by which evolutionarily conserved Bic-C proteins control cellular function in metazoans.
Construction of two Lactococcus lactis expression vectors combining the Gateway and the NIsin Controlled Expression systems.

PubMed

Douillard, François P; Mahony, Jennifer; Campanacci, Valérie; Cambillau, Christian; van Sinderen, Douwe

2011-09-01

Over the last 10 years, the NIsin Controlled Expression (NICE) system has been extensively used in the food-grade bacterium Lactococcus lactis subsp. cremoris to produce homologous and heterologous proteins for academic and biotechnological purposes. Although various L. lactis molecular tools have been developed, no expression vectors harboring the popular Gateway recombination system are currently available for this widely used cloning host. In this study, we constructed two expression vectors that combine the NICE and the Gateway recombination systems and we tested their applicability by recombining and over-expressing genes encoding structural proteins of lactococcal phages Tuc2009 and TP901-1. Over-expressed phage proteins were analyzed by immunoblotting and purified by His-tag affinity chromatography with protein productions yielding 2.8-3.7 mg/l of culture. This therefore is the first description of L. lactis NICE expression vectors which integrate the Gateway cloning technology and which are suitable for the production of sufficient amounts of proteins to facilitate subsequent structural and functional analyses. Copyright © 2011 Elsevier Inc. All rights reserved.
Sulphur Atoms from Methionines Interacting with Aromatic Residues Are Less Prone to Oxidation

PubMed Central

Aledo, Juan C.; Cantón, Francisco R.; Veredas, Francisco J.

2015-01-01

Methionine residues exhibit different degrees of susceptibility to oxidation. Although solvent accessibility is a relevant factor, oxidation at particular sites cannot be unequivocally explained by accessibility alone. To explore other possible structural determinants, we assembled different sets of oxidation-sensitive and oxidation-resistant methionines contained in human proteins. Comparisons of the proteins containing oxidized methionines with all proteins in the human proteome led to the conclusion that the former exhibit a significantly higher mean value of methionine content than the latter. Within a given protein, an examination of the sequence surrounding the non-oxidized methionine revealed a preference for neighbouring tyrosine and tryptophan residues, but not for phenylalanine residues. However, because the interaction between sulphur atoms and aromatic residues has been reported to be important for the stabilization of protein structure, we carried out an analysis of the spatial interatomic distances between methionines and aromatic residues, including phenylalanine. The results of these analyses uncovered a new determinant for methionine oxidation: the S-aromatic motif, which decreases the reactivity of the involved sulphur towards oxidants. PMID:26597773
The T4 Phage DNA Mimic Protein Arn Inhibits the DNA Binding Activity of the Bacterial Histone-like Protein H-NS*

PubMed Central

Ho, Chun-Han; Wang, Hao-Ching; Ko, Tzu-Ping; Chang, Yuan-Chih; Wang, Andrew H.-J.

2014-01-01

The T4 phage protein Arn (Anti restriction nuclease) was identified as an inhibitor of the restriction enzyme McrBC. However, until now its molecular mechanism remained unclear. In the present study we used structural approaches to investigate biological properties of Arn. A structural analysis of Arn revealed that its shape and negative charge distribution are similar to dsDNA, suggesting that this protein could act as a DNA mimic. In a subsequent proteomic analysis, we found that the bacterial histone-like protein H-NS interacts with Arn, implying a new function. An electrophoretic mobility shift assay showed that Arn prevents H-NS from binding to the Escherichia coli hns and T4 p8.1 promoters. In vitro gene expression and electron microscopy analyses also indicated that Arn counteracts the gene-silencing effect of H-NS on a reporter gene. Because McrBC and H-NS both participate in the host defense system, our findings suggest that T4 Arn might knock down these mechanisms using its DNA mimicking properties. PMID:25118281
Regulation of Glycan Structures in Animal Tissues

PubMed Central

Nairn, Alison V.; York, William S.; Harris, Kyle; Hall, Erica M.; Pierce, J. Michael; Moremen, Kelley W.

2008-01-01

Glycan structures covalently attached to proteins and lipids play numerous roles in mammalian cells, including protein folding, targeting, recognition, and adhesion at the molecular or cellular level. Regulating the abundance of glycan structures on cellular glycoproteins and glycolipids is a complex process that depends on numerous factors. Most models for glycan regulation hypothesize that transcriptional control of the enzymes involved in glycan synthesis, modification, and catabolism determines glycan abundance and diversity. However, few broad-based studies have examined correlations between glycan structures and transcripts encoding the relevant biosynthetic and catabolic enzymes. Low transcript abundance for many glycan-related genes has hampered broad-based transcript profiling for comparison with glycan structural data. In an effort to facilitate comparison with glycan structural data and to identify the molecular basis of alterations in glycan structures, we have developed a medium-throughput quantitative real time reverse transcriptase-PCR platform for the analysis of transcripts encoding glycan-related enzymes and proteins in mouse tissues and cells. The method employs a comprehensive list of >700 genes, including enzymes involved in sugar-nucleotide biosynthesis, transporters, glycan extension, modification, recognition, catabolism, and numerous glycosylated core proteins. Comparison with parallel microarray analyses indicates a significantly greater sensitivity and dynamic range for our quantitative real time reverse transcriptase-PCR approach, particularly for the numerous low abundance glycan-related enzymes. Mapping of the genes and transcript levels to their respective biosynthetic pathway steps allowed a comparison with glycan structural data and provides support for a model where many, but not all, changes in glycan abundance result from alterations in transcript expression of corresponding biosynthetic enzymes. PMID:18411279
The RING 2.0 web server for high quality residue interaction networks.

PubMed

Piovesan, Damiano; Minervini, Giovanni; Tosatto, Silvio C E

2016-07-08

Residue interaction networks (RINs) are an alternative way of representing protein structures where nodes are residues and arcs physico-chemical interactions. RINs have been extensively and successfully used for analysing mutation effects, protein folding, domain-domain communication and catalytic activity. Here we present RING 2.0, a new version of the RING software for the identification of covalent and non-covalent bonds in protein structures, including π-π stacking and π-cation interactions. RING 2.0 is extremely fast and generates both intra and inter-chain interactions including solvent and ligand atoms. The generated networks are very accurate and reliable thanks to a complex empirical re-parameterization of distance thresholds performed on the entire Protein Data Bank. By default, RING output is generated with optimal parameters but the web server provides an exhaustive interface to customize the calculation. The network can be visualized directly in the browser or in Cytoscape. Alternatively, the RING-Viz script for Pymol allows visualizing the interactions at atomic level in the structure. The web server and RING-Viz, together with an extensive help and tutorial, are available from URL: http://protein.bio.unipd.it/ring. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
DIBS: a repository of disordered binding sites mediating interactions with ordered proteins.

PubMed

Schad, Eva; Fichó, Erzsébet; Pancsa, Rita; Simon, István; Dosztányi, Zsuzsanna; Mészáros, Bálint

2018-02-01

Intrinsically Disordered Proteins (IDPs) mediate crucial protein-protein interactions, most notably in signaling and regulation. As their importance is increasingly recognized, the detailed analyses of specific IDP interactions opened up new opportunities for therapeutic targeting. Yet, large scale information about IDP-mediated interactions in structural and functional details are lacking, hindering the understanding of the mechanisms underlying this distinct binding mode. Here, we present DIBS, the first comprehensive, curated collection of complexes between IDPs and ordered proteins. DIBS not only describes by far the highest number of cases, it also provides the dissociation constants of their interactions, as well as the description of potential post-translational modifications modulating the binding strength and linear motifs involved in the binding. Together with the wide range of structural and functional annotations, DIBS will provide the cornerstone for structural and functional studies of IDP complexes. DIBS is freely accessible at http://dibs.enzim.ttk.mta.hu/. The DIBS application is hosted by Apache web server and was implemented in PHP. To enrich querying features and to enhance backend performance a MySQL database was also created. dosztanyi@caesar.elte.hu or bmeszaros@caesar.elte.hu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.

Genome-Wide Protein Interaction Screens Reveal Functional Networks Involving Sm-Like Proteins

PubMed Central

Fromont-Racine, Micheline; Mayes, Andrew E.; Brunet-Simon, Adeline; Rain, Jean-Christophe; Colley, Alan; Dix, Ian; Decourty, Laurence; Joly, Nicolas; Ricard, Florence; Beggs, Jean D.

2000-01-01

A set of seven structurally related Sm proteins forms the core of the snRNP particles containing the spliceosomal U1, U2, U4 and U5 snRNAs. A search of the genomic sequence of Saccharomyces cerevisiae has identified a number of open reading frames that potentially encode structurally similar proteins termed Lsm (Like Sm) proteins. With the aim of analysing all possible interactions between the Lsm proteins and any protein encoded in the yeast genome, we performed exhaustive and iterative genomic two-hybrid screens, starting with the Lsm proteins as baits. Indeed, extensive interactions amongst eight Lsm proteins were found that suggest the existence of a Lsm complex or complexes. These Lsm interactions apparently involve the conserved Sm domain that also mediates interactions between the Sm proteins. The screens also reveal functionally significant interactions with splicing factors, in particular with Prp4 and Prp24, compatible with genetic studies and with the reported association of Lsm proteins with spliceosomal U6 and U4/U6 particles. In addition, interactions with proteins involved in mRNA turnover, such as Mrt1, Dcp1, Dcp2 and Xrn1, point to roles for Lsm complexes in distinct RNA metabolic processes, that are confirmed in independent functional studies. These results provide compelling evidence that two-hybrid screens yield functionally meaningful information about protein–protein interactions and can suggest functions for uncharacterized proteins, especially when they are performed on a genome-wide scale. PMID:10900456
Structural characterization of the P1+ intermediate state of the P-cluster of nitrogenase.

PubMed

Keable, Stephen M; Zadvornyy, Oleg A; Johnson, Lewis E; Ginovska, Bojana; Rasmussen, Andrew J; Danyal, Karamatullah; Eilers, Brian J; Prussia, Gregory A; LeVan, Axl X; Raugei, Simone; Seefeldt, Lance C; Peters, John W

2018-05-02

Nitrogenase is the enzyme that reduces atmospheric dinitrogen (N 2 ) to ammonia (NH 3 ) in biological systems. It catalyzes a series of single-electron transfers from the donor iron protein (Fe protein) to the molybdenum-iron protein (MoFe protein) that contains the iron-molybdenum cofactor (FeMo-co) sites where N 2 is reduced to NH 3 The [8Fe-7S] P-cluster in the MoFe protein functions in nitrogenase catalysis as an intermediate electron carrier between the external electron donor, the Fe protein, and the FeMo-co sites of the MoFe protein. Previous work has revealed that the P-cluster undergoes redox dependent structural changes and that the transition from the all-ferrous resting (P N ) state to the two electron oxidized P 2+ state is accompanied by protein serince hydroxyl and backbone amide ligation to Fe. In this work, the MoFe protein was poised at defined potentials with redox mediators in an electrochemical cell, and the three distinct structural states of the P-cluster (P 2+ , P 1+ , and P N ) were characterized by X-ray crystallography and confirmed by computational analysis. These analyses revealed that the three oxidation states differ in coordination implicating that the P 1+ state retains the serine hydroxyl coordination but lacks the backbone amide coordination observed in the P 2+ states. These results provide a complete picture of the redox-dependent ligand rearrangements of the three P-cluster redox states. Published under license by The American Society for Biochemistry and Molecular Biology, Inc.
Differential histone modification and protein expression associated with cell wall removal and regeneration in rice (Oryza sativa).

PubMed

Tan, Feng; Zhang, Kangling; Mujahid, Hana; Verma, Desh Pal S; Peng, Zhaohua

2011-02-04

The cell wall is a critical extracellular structure that provides protection and structural support in plant cells. To study the biological function of the cell wall and the regulation of cell wall resynthesis, we examined cellular responses to enzymatic removal of the cell wall in rice (Oryza sativa) suspension cells using proteomic approaches. We find that removal of cell wall stimulates cell wall synthesis from multiple sites in protoplasts instead of from a single site as in cytokinesis. Nucleus DAPI stain and MNase digestion further show that removal of the cell wall is concomitant with substantial chromatin reorganization. Histone post-translational modification studies using both Western blots and isotope labeling assisted quantitative mass spectrometry analyses reveal that substantial histone modification changes, particularly H3K18(AC) and H3K23(AC), are associated with the removal and regeneration of the cell wall. Label-free quantitative proteome analyses further reveal that chromatin associated proteins undergo dramatic changes upon removal of the cell wall, along with cytoskeleton, cell wall metabolism, and stress-response proteins. This study demonstrates that cell wall removal is associated with substantial chromatin change and may lead to stimulation of cell wall synthesis using a novel mechanism.
APID interactomes: providing proteome-based interactomes with controlled quality for multiple species and derived networks

PubMed Central

Alonso-López, Diego; Gutiérrez, Miguel A.; Lopes, Katia P.; Prieto, Carlos; Santamaría, Rodrigo; De Las Rivas, Javier

2016-01-01

APID (Agile Protein Interactomes DataServer) is an interactive web server that provides unified generation and delivery of protein interactomes mapped to their respective proteomes. This resource is a new, fully redesigned server that includes a comprehensive collection of protein interactomes for more than 400 organisms (25 of which include more than 500 interactions) produced by the integration of only experimentally validated protein–protein physical interactions. For each protein–protein interaction (PPI) the server includes currently reported information about its experimental validation to allow selection and filtering at different quality levels. As a whole, it provides easy access to the interactomes from specific species and includes a global uniform compendium of 90,379 distinct proteins and 678,441 singular interactions. APID integrates and unifies PPIs from major primary databases of molecular interactions, from other specific repositories and also from experimentally resolved 3D structures of protein complexes where more than two proteins were identified. For this purpose, a collection of 8,388 structures were analyzed to identify specific PPIs. APID also includes a new graph tool (based on Cytoscape.js) for visualization and interactive analyses of PPI networks. The server does not require registration and it is freely available for use at http://apid.dep.usal.es. PMID:27131791
Effects of the TAT peptide orientation and relative location on the protein transduction efficiency.

PubMed

Guo, Qingguo; Zhao, Guojie; Hao, Fengjin; Guan, Yifu

2012-05-01

To understand the protein transduction domain (PTD)-mediated protein transduction behavior and to explore its potential in delivering biopharmaceutic drugs, we prepared four TAT-EGFP conjugates: TAT(+)-EGFP, TAT(-)-EGFP, EGFP-TAT(+) and EGFP-TAT(-), where TAT(+) and TAT(-) represent the original and the reversed TAT sequence, respectively. These four TAT-EGFP conjugates were incubated with HeLa and PC12 cells for in vitro study as well as injected intraperitoneally to mice for in vivo study. Flow cytometric results showed that four TAT-EGFP conjugates were able to traverse HeLa and PC12 cells with almost equal transduction efficiency. The in vivo study showed that the TAT-EGFP conjugates could be delivered into different organs of mice with different transduction capabilities. Bioinformatic analyses and CD spectroscopic data revealed that the TAT peptide has no defined secondary structure, and conjugating the TAT peptide to the EGFP cargo protein would not alter the native structure and the function of the EGFP protein. These results conclude that the sequence orientation, the spatial structure, and the relative location of the TAT peptide have much less effect on the TAT-mediated protein transduction. Thus, the TAT-fused conjugates could be constructed in more convenient and flexible formats for a wide range of biopharmaceutical applications. © 2011 John Wiley & Sons A/S.
HotSpot Wizard 3.0: web server for automated design of mutations and smart libraries based on sequence input information.

PubMed

Sumbalova, Lenka; Stourac, Jan; Martinek, Tomas; Bednar, David; Damborsky, Jiri

2018-05-23

HotSpot Wizard is a web server used for the automated identification of hotspots in semi-rational protein design to give improved protein stability, catalytic activity, substrate specificity and enantioselectivity. Since there are three orders of magnitude fewer protein structures than sequences in bioinformatic databases, the major limitation to the usability of previous versions was the requirement for the protein structure to be a compulsory input for the calculation. HotSpot Wizard 3.0 now accepts the protein sequence as input data. The protein structure for the query sequence is obtained either from eight repositories of homology models or is modeled using Modeller and I-Tasser. The quality of the models is then evaluated using three quality assessment tools-WHAT_CHECK, PROCHECK and MolProbity. During follow-up analyses, the system automatically warns the users whenever they attempt to redesign poorly predicted parts of their homology models. The second main limitation of HotSpot Wizard's predictions is that it identifies suitable positions for mutagenesis, but does not provide any reliable advice on particular substitutions. A new module for the estimation of thermodynamic stabilities using the Rosetta and FoldX suites has been introduced which prevents destabilizing mutations among pre-selected variants entering experimental testing. HotSpot Wizard is freely available at http://loschmidt.chemi.muni.cz/hotspotwizard.
Evaluation of Structure, Chaperone-Like Activity and Allergenicity of Reduced Glycated Adduct of Bovine β-casein.

PubMed

Yousefi, Reza; Ferdowsi, Leila; Tavaf, Zohreh; Sadeghian, Tanaz; Tamaddon, Ali M; Moghtaderi, Mozhgan; Pourpak, Zahra

2017-01-01

Milk has a potent reducing environment with an important quantity of sugar levels. In the current study β-casein was glycated in the presence of D-glucose and sodium cyanoborohydride as a reducing agent. Then, the reduced glucitol adduct of β-casein was used for the structural and functional analyses using different spectroscopic techniques. The results of fluorescence and far ultraviolet circular dichroism assessments suggest important structural alteration upon non-enzymatic glycation of β-casein. In addition, the chaperone activity, micellization properties and antioxidant activity of this protein were altered upon glucose modification. Also, as a result of reduced glycation, the allergenicity profile of this protein remained largely unchanged. Additional to its energetic and nutritional values, β-casein has important functional properties. The native structure of this protein is important to perform accurately its biological functions. Non-enzymatic glycation under reducing state was capable to alter both structural and functional aspects of β-casein. Due to effective reducing environment and significant quantity of reducing sugar of human milk, similar structural and functional alterations are most likely to occur upon reducing glycation of β-casein in vivo. Also, these changes might be even intensified during chronic hyperglycemia in diabetic mothers. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
The sequence and structure of snake gourd (Trichosanthes anguina) seed lectin, a three-chain nontoxic homologue of type II RIPs.

PubMed

Sharma, Alok; Pohlentz, Gottfried; Bobbili, Kishore Babu; Jeyaprakash, A Arockia; Chandran, Thyageshwar; Mormann, Michael; Swamy, Musti J; Vijayan, M

2013-08-01

The sequence and structure of snake gourd seed lectin (SGSL), a nontoxic homologue of type II ribosome-inactivating proteins (RIPs), have been determined by mass spectrometry and X-ray crystallography, respectively. As in type II RIPs, the molecule consists of a lectin chain made up of two β-trefoil domains. The catalytic chain, which is connected through a disulfide bridge to the lectin chain in type II RIPs, is cleaved into two in SGSL. However, the integrity of the three-dimensional structure of the catalytic component of the molecule is preserved. This is the first time that a three-chain RIP or RIP homologue has been observed. A thorough examination of the sequence and structure of the protein and of its interactions with the bound methyl-α-galactose indicate that the nontoxicity of SGSL results from a combination of changes in the catalytic and the carbohydrate-binding sites. Detailed analyses of the sequences of type II RIPs of known structure and their homologues with unknown structure provide valuable insights into the evolution of this class of proteins. They also indicate some variability in carbohydrate-binding sites, which appears to contribute to the different levels of toxicity exhibited by lectins from various sources.
STRUM: structure-based prediction of protein stability changes upon single-point mutation.

PubMed

Quan, Lijun; Lv, Qiang; Zhang, Yang

2016-10-01

Mutations in human genome are mainly through single nucleotide polymorphism, some of which can affect stability and function of proteins, causing human diseases. Several methods have been proposed to predict the effect of mutations on protein stability; but most require features from experimental structure. Given the fast progress in protein structure prediction, this work explores the possibility to improve the mutation-induced stability change prediction using low-resolution structure modeling. We developed a new method (STRUM) for predicting stability change caused by single-point mutations. Starting from wild-type sequences, 3D models are constructed by the iterative threading assembly refinement (I-TASSER) simulations, where physics- and knowledge-based energy functions are derived on the I-TASSER models and used to train STRUM models through gradient boosting regression. STRUM was assessed by 5-fold cross validation on 3421 experimentally determined mutations from 150 proteins. The Pearson correlation coefficient (PCC) between predicted and measured changes of Gibbs free-energy gap, ΔΔG, upon mutation reaches 0.79 with a root-mean-square error 1.2 kcal/mol in the mutation-based cross-validations. The PCC reduces if separating training and test mutations from non-homologous proteins, which reflects inherent correlations in the current mutation sample. Nevertheless, the results significantly outperform other state-of-the-art methods, including those built on experimental protein structures. Detailed analyses show that the most sensitive features in STRUM are the physics-based energy terms on I-TASSER models and the conservation scores from multiple-threading template alignments. However, the ΔΔG prediction accuracy has only a marginal dependence on the accuracy of protein structure models as long as the global fold is correct. These data demonstrate the feasibility to use low-resolution structure modeling for high-accuracy stability change prediction upon point mutations. http://zhanglab.ccmb.med.umich.edu/STRUM/ CONTACT: qiang@suda.edu.cn and zhng@umich.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
STRUM: structure-based prediction of protein stability changes upon single-point mutation

PubMed Central

Quan, Lijun; Lv, Qiang; Zhang, Yang

2016-01-01

Motivation: Mutations in human genome are mainly through single nucleotide polymorphism, some of which can affect stability and function of proteins, causing human diseases. Several methods have been proposed to predict the effect of mutations on protein stability; but most require features from experimental structure. Given the fast progress in protein structure prediction, this work explores the possibility to improve the mutation-induced stability change prediction using low-resolution structure modeling. Results: We developed a new method (STRUM) for predicting stability change caused by single-point mutations. Starting from wild-type sequences, 3D models are constructed by the iterative threading assembly refinement (I-TASSER) simulations, where physics- and knowledge-based energy functions are derived on the I-TASSER models and used to train STRUM models through gradient boosting regression. STRUM was assessed by 5-fold cross validation on 3421 experimentally determined mutations from 150 proteins. The Pearson correlation coefficient (PCC) between predicted and measured changes of Gibbs free-energy gap, ΔΔG, upon mutation reaches 0.79 with a root-mean-square error 1.2 kcal/mol in the mutation-based cross-validations. The PCC reduces if separating training and test mutations from non-homologous proteins, which reflects inherent correlations in the current mutation sample. Nevertheless, the results significantly outperform other state-of-the-art methods, including those built on experimental protein structures. Detailed analyses show that the most sensitive features in STRUM are the physics-based energy terms on I-TASSER models and the conservation scores from multiple-threading template alignments. However, the ΔΔG prediction accuracy has only a marginal dependence on the accuracy of protein structure models as long as the global fold is correct. These data demonstrate the feasibility to use low-resolution structure modeling for high-accuracy stability change prediction upon point mutations. Availability and Implementation: http://zhanglab.ccmb.med.umich.edu/STRUM/ Contact: qiang@suda.edu.cn and zhng@umich.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27318206
Coherent microscopic picture for urea-induced denaturation of proteins.

PubMed

Yang, Zaixing; Xiu, Peng; Shi, Biyun; Hua, Lan; Zhou, Ruhong

2012-08-02

In a previous study, we explored the mechanism of urea-induced denaturation of proteins by performing molecular dynamics (MD) simulations of hen lysozyme in 8 M urea and supported the "direct interaction mechanism" whereby urea denatures protein via dispersion interaction (Hua, L.; Zhou, R. H.; Thirumalai, D.; Berne, B. J. Proc. Natl. Acad. Sci. U.S.A. 2008, 105, 16928). Here we perform large scale MD simulations of five representative protein/peptide systems in aqueous urea to investigate if the above mechanism is common to other proteins. In all cases, accumulations of urea around proteins/peptide are observed, suggesting that urea denatures proteins by directly attacking protein backbones and side chains rather than indirectly disrupting water structure as a "water breaker". Consistent with our previous case study of lysozyme, the current energetic analyses with five protein/peptide systems reveal that urea's preferential binding to proteins mainly comes from urea's stronger dispersion interactions with proteins than with bulk solution, whereas the electrostatic (hydrogen-bonded) interactions only play a relatively minor (even negative) role during this denaturation process. Furthermore, the simulations of the peptide system at different urea concentrations (8 and 4.5 M), and with different force fields (CHARMM and OPLSAA) suggest that the above mechanism is robust, independent of the urea concentration and force field used. Last, we emphasize the importance of periodic boundary conditions in pairwise energetic analyses. This article provides a comprehensive study on the physical mechanism of urea-induced protein denaturation and suggests that the "dispersion-interaction-driven" mechanism should be general.
MAISTAS: a tool for automatic structural evaluation of alternative splicing products.

PubMed

Floris, Matteo; Raimondo, Domenico; Leoni, Guido; Orsini, Massimiliano; Marcatili, Paolo; Tramontano, Anna

2011-06-15

Analysis of the human genome revealed that the amount of transcribed sequence is an order of magnitude greater than the number of predicted and well-characterized genes. A sizeable fraction of these transcripts is related to alternatively spliced forms of known protein coding genes. Inspection of the alternatively spliced transcripts identified in the pilot phase of the ENCODE project has clearly shown that often their structure might substantially differ from that of other isoforms of the same gene, and therefore that they might perform unrelated functions, or that they might even not correspond to a functional protein. Identifying these cases is obviously relevant for the functional assignment of gene products and for the interpretation of the effect of variations in the corresponding proteins. Here we describe a publicly available tool that, given a gene or a protein, retrieves and analyses all its annotated isoforms, provides users with three-dimensional models of the isoform(s) of his/her interest whenever possible and automatically assesses whether homology derived structural models correspond to plausible structures. This information is clearly relevant. When the homology model of some isoforms of a gene does not seem structurally plausible, the implications are that either they assume a structure unrelated to that of the other isoforms of the same gene with presumably significant functional differences, or do not correspond to functional products. We provide indications that the second hypothesis is likely to be true for a substantial fraction of the cases. http://maistas.bioinformatica.crs4.it/.
A multipurpose fusion tag derived from an unstructured and hyperacidic region of the amyloid precursor protein

PubMed Central

Sangawa, Takeshi; Tabata, Sanae; Suzuki, Kei; Saheki, Yasushi; Tanaka, Keiji; Takagi, Junichi

2013-01-01

Expression and purification of aggregation-prone and disulfide-containing proteins in Escherichia coli remains as a major hurdle for structural and functional analyses of high-value target proteins. Here, we present a novel gene-fusion strategy that greatly simplifies purification and refolding procedure at very low cost using a unique hyperacidic module derived from the human amyloid precursor protein. Fusion with this polypeptide (dubbed FATT for Flag-Acidic-Target Tag) results in near-complete soluble expression of variety of extracellular proteins, which can be directly refolded in the crude bacterial lysate and purified in one-step by anion exchange chromatography. Application of this system enabled preparation of functionally active extracellular enzymes and antibody fragments without the need for condition optimization. PMID:23526492
Combining NMR and Molecular Dynamics Studies for Insights into the Allostery of Small GTPase–Protein Interactions

PubMed Central

Zhang, Liqun; Bouguet-Bonnet, Sabine; Buck, Matthias

2014-01-01

Combinations of experimentally derived data from nuclear magnetic resonance spectroscopy and analyses of molecular dynamics trajectories increasingly allow us to obtain a detailed description of the molecular mechanisms by which proteins function in signal transduction. This chapter provides an introduction into these two methodologies, illustrated by example of a small GTPase–effector interaction. It is increasingly becoming clear that new insights are provided by the combination of experimental and computational methods. Understanding the structural and protein dynamical contributions to allostery will be useful for the engineering of new binding interfaces and protein functions, as well as for the design/in silico screening of chemical agents that can manipulate the function of small GTPase–protein interactions in diseases such as cancer. PMID:22052494
Conformational and functional analysis of molecular dynamics trajectories by Self-Organising Maps

PubMed Central

2011-01-01

Background Molecular dynamics (MD) simulations are powerful tools to investigate the conformational dynamics of proteins that is often a critical element of their function. Identification of functionally relevant conformations is generally done clustering the large ensemble of structures that are generated. Recently, Self-Organising Maps (SOMs) were reported performing more accurately and providing more consistent results than traditional clustering algorithms in various data mining problems. We present a novel strategy to analyse and compare conformational ensembles of protein domains using a two-level approach that combines SOMs and hierarchical clustering. Results The conformational dynamics of the α-spectrin SH3 protein domain and six single mutants were analysed by MD simulations. The Cα's Cartesian coordinates of conformations sampled in the essential space were used as input data vectors for SOM training, then complete linkage clustering was performed on the SOM prototype vectors. A specific protocol to optimize a SOM for structural ensembles was proposed: the optimal SOM was selected by means of a Taguchi experimental design plan applied to different data sets, and the optimal sampling rate of the MD trajectory was selected. The proposed two-level approach was applied to single trajectories of the SH3 domain independently as well as to groups of them at the same time. The results demonstrated the potential of this approach in the analysis of large ensembles of molecular structures: the possibility of producing a topological mapping of the conformational space in a simple 2D visualisation, as well as of effectively highlighting differences in the conformational dynamics directly related to biological functions. Conclusions The use of a two-level approach combining SOMs and hierarchical clustering for conformational analysis of structural ensembles of proteins was proposed. It can easily be extended to other study cases and to conformational ensembles from other sources. PMID:21569575
Oligomerisation status and evolutionary conservation of interfaces of protein structural domain superfamilies.

PubMed

Sukhwal, Anshul; Sowdhamini, Ramanathan

2013-07-01

Protein-protein interactions are important in carrying out many biological processes and functions. These interactions may be either permanent or of temporary nature. Several studies have employed tools like solvent accessibility and graph theory to identify these interactions, but still more studies need to be performed to quantify and validate them. Although we now have many databases available with predicted and experimental results on protein-protein interactions, we still do not have many databases which focus on providing structural details of the interacting complexes, their oligomerisation state and homologues. In this work, protein-protein interactions have been thoroughly investigated within the structural regime and quantified for their strength using calculated pseudoenergies. The PPCheck server, an in-house webserver, has been used for calculating the pseudoenergies like van der Waals, hydrogen bonds and electrostatic energy based on distances between atoms of amino acids from two interacting proteins. PPCheck can be visited at . Based on statistical data, as obtained by studying established protein-protein interacting complexes from earlier studies, we came to a conclusion that an average protein-protein interface consisted of about 51 to 150 amino acid residues and the generalized energy per residue ranged from -2 kJ mol(-1) to -6 kJ mol(-1). We found that some of the proteins have an exceptionally higher number of amino acids at the interface and it was purely because of their elaborate interface or extended topology i.e. some of their secondary structure regions or loops were either inter-mixing or running parallel to one another or they were taking part in domain swapping. Residue networks were prepared for all the amino acids of the interacting proteins involved in different types of interactions (like van der Waals, hydrogen-bonding, electrostatic or intramolecular interactions) and were analysed between the query domain-interacting partner pair and its remote homologue-interacting partner pair. We found that, in exceptional cases, homologous proteins belonging to the same superfamily, but with remote sequence similarity, can share similar interfaces.
Comparative Analysis of the 15.5kD Box C/D snoRNP Core Protein in the Primitive Eukaryote Giardia lamblia Reveals Unique Structural and Functional Features

DOE Office of Scientific and Technical Information (OSTI.GOV)

Biswas, Shyamasri; Buhrman, Greg; Gagnon, Keith

2012-07-11

Box C/D ribonucleoproteins (RNP) guide the 2'-O-methylation of targeted nucleotides in archaeal and eukaryotic rRNAs. The archaeal L7Ae and eukaryotic 15.5kD box C/D RNP core protein homologues initiate RNP assembly by recognizing kink-turn (K-turn) motifs. The crystal structure of the 15.5kD core protein from the primitive eukaryote Giardia lamblia is described here to a resolution of 1.8 {angstrom}. The Giardia 15.5kD protein exhibits the typical {alpha}-{beta}-{alpha} sandwich fold exhibited by both archaeal L7Ae and eukaryotic 15.5kD proteins. Characteristic of eukaryotic homologues, the Giardia 15.5kD protein binds the K-turn motif but not the variant K-loop motif. The highly conserved residues ofmore » loop 9, critical for RNA binding, also exhibit conformations similar to those of the human 15.5kD protein when bound to the K-turn motif. However, comparative sequence analysis indicated a distinct evolutionary position between Archaea and Eukarya. Indeed, assessment of the Giardia 15.5kD protein in denaturing experiments demonstrated an intermediate stability in protein structure when compared with that of the eukaryotic mouse 15.5kD and archaeal Methanocaldococcus jannaschii L7Ae proteins. Most notable was the ability of the Giardia 15.5kD protein to assemble in vitro a catalytically active chimeric box C/D RNP utilizing the archaeal M. jannaschii Nop56/58 and fibrillarin core proteins. In contrast, a catalytically competent chimeric RNP could not be assembled using the mouse 15.5kD protein. Collectively, these analyses suggest that the G. lamblia 15.5kD protein occupies a unique position in the evolution of this box C/D RNP core protein retaining structural and functional features characteristic of both archaeal L7Ae and higher eukaryotic 15.5kD homologues.« less
A network biology approach to understanding the importance of chameleon proteins in human physiology and pathology.

PubMed

Bahramali, Golnaz; Goliaei, Bahram; Minuchehr, Zarrin; Marashi, Sayed-Amir

2017-02-01

Chameleon proteins are proteins which include sequences that can adopt α-helix-β-strand (HE-chameleon) or α-helix-coil (HC-chameleon) or β-strand-coil (CE-chameleon) structures to operate their crucial biological functions. In this study, using a network-based approach, we examined the chameleon proteins to give a better knowledge on these proteins. We focused on proteins with identical chameleon sequences with more than or equal to seven residues long in different PDB entries, which adopt HE-chameleon, HC-chameleon, and CE-chameleon structures in the same protein. One hundred and ninety-one human chameleon proteins were identified via our in-house program. Then, protein-protein interaction (PPI) networks, Gene ontology (GO) enrichment, disease network, and pathway enrichment analyses were performed for our derived data set. We discovered that there are chameleon sequences which reside in protein-protein interaction regions between two proteins critical for their dual function. Analysis of the PPI networks for chameleon proteins introduced five hub proteins, namely TP53, EGFR, HSP90AA1, PPARA, and HIF1A, which were presented in four PPI clusters. The outcomes demonstrate that the chameleon regions are in critical domains of these proteins and are important in the development and treatment of human cancers. The present report is the first network-based functional study of chameleon proteins using computational approaches and might provide a new perspective for understanding the mechanisms of diseases helping us in developing new medical therapies along with discovering new proteins with chameleon properties which are highly important in cancer.
[RNA polymerase II and pre-mRNA splicing factors in diplotene oocyte nuclei of the giant African gastropod Achatina fulica].

PubMed

Stepanova, I S; Bogoliubov, D S

2003-01-01

The nuclear distribution of pre-mRNA splicing factors (snRNPs and SR-protein SC35) and unphosphorylated from of RNA polymerase II (Pol II) was studied using fluorescent and immunoelectron cytochemistry in diplotene oocytes of the gastropod Achatina fulica. Association of Pol II and splicing factors with oocyte nuclear structures was analysed. The antibodies against splicing factors and Pol II were shown to label perichromatin fibrils at the periphery of condensed chromatin blocks as well as those in interchromatin regions of nucleoplasm. The revealed character of distribution of snRNPs, SC35 protein, and Pol II, together with the decondensed chromatin and absence of karyosphere, enable us to suggest that oocyte chromosomes maintain their transcriptional activity at the diplotene stage of oogenesis. In A. fulica oocytes, sparse nuclear bodies (NBs) of a complex morphological structure were revealed. These NBs contain snRNPs rather than SC35 protein. NBs are associated with a fibrogranular material (FGM), which contains SC35 protein. No snRNPs were revealed in this material. Homology of A. fulica oocyte nuclear structures to Cajal bodies and interchromatin granule clusters is discussed.
Stability of spermine oxidase to thermal and chemical denaturation: comparison with bovine serum amine oxidase.

PubMed

Cervelli, Manuela; Leonetti, Alessia; Cervoni, Laura; Ohkubo, Shinji; Xhani, Marla; Stano, Pasquale; Federico, Rodolfo; Polticelli, Fabio; Mariottini, Paolo; Agostinelli, Enzo

2016-10-01

Spermine oxidase (SMOX) is a flavin-containing enzyme that specifically oxidizes spermine to produce spermidine, 3-aminopropanaldehyde and hydrogen peroxide. While no crystal structure is available for any mammalian SMOX, X-ray crystallography showed that the yeast Fms1 polyamine oxidase has a dimeric structure. Based on this scenario, we have investigated the quaternary structure of the SMOX protein by native gel electrophoresis, which revealed a composite gel band pattern, suggesting the formation of protein complexes. All high-order protein complexes are sensitive to reducing conditions, showing that disulfide bonds were responsible for protein complexes formation. The major gel band other than the SMOX monomer is the covalent SMOX homodimer, which was disassembled by increasing the reducing conditions, while being resistant to other denaturing conditions. Homodimeric and monomeric SMOXs are catalytically active, as revealed after gel staining for enzymatic activity. An engineered SMOX mutant deprived of all but two cysteine residues was prepared and characterized experimentally, resulting in a monomeric species. High-sensitivity differential scanning calorimetry of SMOX was compared with that of bovine serum amine oxidase, to analyse their thermal stability. Furthermore, enzymatic activity assays and fluorescence spectroscopy were used to gain insight into the unfolding process.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.