Chéron, Jean-Baptiste; Triki, Dhoha; Senac, Caroline; Flatters, Delphine; Camproux, Anne-Claude
2017-01-01
Protein flexibility is often implied in binding with different partners and is essential for protein function. The growing number of macromolecular structures in the Protein Data Bank entries and their redundancy has become a major source of structural knowledge of the protein universe. The analysis of structural variability through available redundant structures of a target, called multiple target conformations (MTC), obtained using experimental or modeling methods and under different biological conditions or different sources is one way to explore protein flexibility. This analysis is essential to improve the understanding of various mechanisms associated with protein target function and flexibility. In this study, we explored structural variability of three biological targets by analyzing different MTC sets associated with these targets. To facilitate the study of these MTC sets, we have developed an efficient tool, SA-conf, dedicated to capturing and linking the amino acid and local structure variability and analyzing the target structural variability space. The advantage of SA-conf is that it could be applied to divers sets composed of MTCs available in the PDB obtained using NMR and crystallography or homology models. This tool could also be applied to analyze MTC sets obtained by dynamics approaches. Our results showed that SA-conf tool is effective to quantify the structural variability of a MTC set and to localize the structural variable positions and regions of the target. By selecting adapted MTC subsets and comparing their variability detected by SA-conf, we highlighted different sources of target flexibility such as induced by binding partner, by mutation and intrinsic flexibility. Our results support the interest to mine available structures associated with a target using to offer valuable insight into target flexibility and interaction mechanisms. The SA-conf executable script, with a set of pre-compiled binaries are available at http://www.mti.univ-paris-diderot.fr/recherche/plateformes/logiciels. PMID:28817602
Regad, Leslie; Chéron, Jean-Baptiste; Triki, Dhoha; Senac, Caroline; Flatters, Delphine; Camproux, Anne-Claude
2017-01-01
Protein flexibility is often implied in binding with different partners and is essential for protein function. The growing number of macromolecular structures in the Protein Data Bank entries and their redundancy has become a major source of structural knowledge of the protein universe. The analysis of structural variability through available redundant structures of a target, called multiple target conformations (MTC), obtained using experimental or modeling methods and under different biological conditions or different sources is one way to explore protein flexibility. This analysis is essential to improve the understanding of various mechanisms associated with protein target function and flexibility. In this study, we explored structural variability of three biological targets by analyzing different MTC sets associated with these targets. To facilitate the study of these MTC sets, we have developed an efficient tool, SA-conf, dedicated to capturing and linking the amino acid and local structure variability and analyzing the target structural variability space. The advantage of SA-conf is that it could be applied to divers sets composed of MTCs available in the PDB obtained using NMR and crystallography or homology models. This tool could also be applied to analyze MTC sets obtained by dynamics approaches. Our results showed that SA-conf tool is effective to quantify the structural variability of a MTC set and to localize the structural variable positions and regions of the target. By selecting adapted MTC subsets and comparing their variability detected by SA-conf, we highlighted different sources of target flexibility such as induced by binding partner, by mutation and intrinsic flexibility. Our results support the interest to mine available structures associated with a target using to offer valuable insight into target flexibility and interaction mechanisms. The SA-conf executable script, with a set of pre-compiled binaries are available at http://www.mti.univ-paris-diderot.fr/recherche/plateformes/logiciels.
Method for protein structure alignment
Blankenbecler, Richard; Ohlsson, Mattias; Peterson, Carsten; Ringner, Markus
2005-02-22
This invention provides a method for protein structure alignment. More particularly, the present invention provides a method for identification, classification and prediction of protein structures. The present invention involves two key ingredients. First, an energy or cost function formulation of the problem simultaneously in terms of binary (Potts) assignment variables and real-valued atomic coordinates. Second, a minimization of the energy or cost function by an iterative method, where in each iteration (1) a mean field method is employed for the assignment variables and (2) exact rotation and/or translation of atomic coordinates is performed, weighted with the corresponding assignment variables.
Evaluation of variability in high-resolution protein structures by global distance scoring.
Anzai, Risa; Asami, Yoshiki; Inoue, Waka; Ueno, Hina; Yamada, Koya; Okada, Tetsuji
2018-01-01
Systematic analysis of the statistical and dynamical properties of proteins is critical to understanding cellular events. Extraction of biologically relevant information from a set of high-resolution structures is important because it can provide mechanistic details behind the functional properties of protein families, enabling rational comparison between families. Most of the current structural comparisons are pairwise-based, which hampers the global analysis of increasing contents in the Protein Data Bank. Additionally, pairing of protein structures introduces uncertainty with respect to reproducibility because it frequently accompanies other settings for superimposition. This study introduces intramolecular distance scoring for the global analysis of proteins, for each of which at least several high-resolution structures are available. As a pilot study, we have tested 300 human proteins and showed that the method is comprehensively used to overview advances in each protein and protein family at the atomic level. This method, together with the interpretation of the model calculations, provide new criteria for understanding specific structural variation in a protein, enabling global comparison of the variability in proteins from different species.
Feinauer, Christoph; Procaccini, Andrea; Zecchina, Riccardo; Weigt, Martin; Pagnani, Andrea
2014-01-01
In the course of evolution, proteins show a remarkable conservation of their three-dimensional structure and their biological function, leading to strong evolutionary constraints on the sequence variability between homologous proteins. Our method aims at extracting such constraints from rapidly accumulating sequence data, and thereby at inferring protein structure and function from sequence information alone. Recently, global statistical inference methods (e.g. direct-coupling analysis, sparse inverse covariance estimation) have achieved a breakthrough towards this aim, and their predictions have been successfully implemented into tertiary and quaternary protein structure prediction methods. However, due to the discrete nature of the underlying variable (amino-acids), exact inference requires exponential time in the protein length, and efficient approximations are needed for practical applicability. Here we propose a very efficient multivariate Gaussian modeling approach as a variant of direct-coupling analysis: the discrete amino-acid variables are replaced by continuous Gaussian random variables. The resulting statistical inference problem is efficiently and exactly solvable. We show that the quality of inference is comparable or superior to the one achieved by mean-field approximations to inference with discrete variables, as done by direct-coupling analysis. This is true for (i) the prediction of residue-residue contacts in proteins, and (ii) the identification of protein-protein interaction partner in bacterial signal transduction. An implementation of our multivariate Gaussian approach is available at the website http://areeweb.polito.it/ricerca/cmp/code. PMID:24663061
Cao, Hu; Lu, Yonggang
2017-01-01
With the rapid growth of known protein 3D structures in number, how to efficiently compare protein structures becomes an essential and challenging problem in computational structural biology. At present, many protein structure alignment methods have been developed. Among all these methods, flexible structure alignment methods are shown to be superior to rigid structure alignment methods in identifying structure similarities between proteins, which have gone through conformational changes. It is also found that the methods based on aligned fragment pairs (AFPs) have a special advantage over other approaches in balancing global structure similarities and local structure similarities. Accordingly, we propose a new flexible protein structure alignment method based on variable-length AFPs. Compared with other methods, the proposed method possesses three main advantages. First, it is based on variable-length AFPs. The length of each AFP is separately determined to maximally represent a local similar structure fragment, which reduces the number of AFPs. Second, it uses local coordinate systems, which simplify the computation at each step of the expansion of AFPs during the AFP identification. Third, it decreases the number of twists by rewarding the situation where nonconsecutive AFPs share the same transformation in the alignment, which is realized by dynamic programming with an improved transition function. The experimental data show that compared with FlexProt, FATCAT, and FlexSnap, the proposed method can achieve comparable results by introducing fewer twists. Meanwhile, it can generate results similar to those of the FATCAT method in much less running time due to the reduced number of AFPs.
Bioinformatics and variability in drug response: a protein structural perspective
Lahti, Jennifer L.; Tang, Grace W.; Capriotti, Emidio; Liu, Tianyun; Altman, Russ B.
2012-01-01
Marketed drugs frequently perform worse in clinical practice than in the clinical trials on which their approval is based. Many therapeutic compounds are ineffective for a large subpopulation of patients to whom they are prescribed; worse, a significant fraction of patients experience adverse effects more severe than anticipated. The unacceptable risk–benefit profile for many drugs mandates a paradigm shift towards personalized medicine. However, prior to adoption of patient-specific approaches, it is useful to understand the molecular details underlying variable drug response among diverse patient populations. Over the past decade, progress in structural genomics led to an explosion of available three-dimensional structures of drug target proteins while efforts in pharmacogenetics offered insights into polymorphisms correlated with differential therapeutic outcomes. Together these advances provide the opportunity to examine how altered protein structures arising from genetic differences affect protein–drug interactions and, ultimately, drug response. In this review, we first summarize structural characteristics of protein targets and common mechanisms of drug interactions. Next, we describe the impact of coding mutations on protein structures and drug response. Finally, we highlight tools for analysing protein structures and protein–drug interactions and discuss their application for understanding altered drug responses associated with protein structural variants. PMID:22552919
Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A
2017-04-01
Functional sites define the diversity of protein functions and are the central object of research of the structural and functional organization of proteins. The mechanisms underlying protein functional sites emergence and their variability during evolution are distinguished by duplication, shuffling, insertion and deletion of the exons in genes. The study of the correlation between a site structure and exon structure serves as the basis for the in-depth understanding of sites organization. In this regard, the development of programming resources that allow the realization of the mutual projection of exon structure of genes and primary and tertiary structures of encoded proteins is still the actual problem. Previously, we developed the SitEx system that provides information about protein and gene sequences with mapped exon borders and protein functional sites amino acid positions. The database included information on proteins with known 3D structure. However, data with respect to orthologs was not available. Therefore, we added the projection of sites positions to the exon structures of orthologs in SitEx 2.0. We implemented a search through database using site conservation variability and site discontinuity through exon structure. Inclusion of the information on orthologs allowed to expand the possibilities of SitEx usage for solving problems regarding the analysis of the structural and functional organization of proteins. Database URL: http://www-bionet.sscc.ru/sitex/ .
Mixture models for protein structure ensembles.
Hirsch, Michael; Habeck, Michael
2008-10-01
Protein structure ensembles provide important insight into the dynamics and function of a protein and contain information that is not captured with a single static structure. However, it is not clear a priori to what extent the variability within an ensemble is caused by internal structural changes. Additional variability results from overall translations and rotations of the molecule. And most experimental data do not provide information to relate the structures to a common reference frame. To report meaningful values of intrinsic dynamics, structural precision, conformational entropy, etc., it is therefore important to disentangle local from global conformational heterogeneity. We consider the task of disentangling local from global heterogeneity as an inference problem. We use probabilistic methods to infer from the protein ensemble missing information on reference frames and stable conformational sub-states. To this end, we model a protein ensemble as a mixture of Gaussian probability distributions of either entire conformations or structural segments. We learn these models from a protein ensemble using the expectation-maximization algorithm. Our first model can be used to find multiple conformers in a structure ensemble. The second model partitions the protein chain into locally stable structural segments or core elements and less structured regions typically found in loops. Both models are simple to implement and contain only a single free parameter: the number of conformers or structural segments. Our models can be used to analyse experimental ensembles, molecular dynamics trajectories and conformational change in proteins. The Python source code for protein ensemble analysis is available from the authors upon request.
Improta, Roberto; Vitagliano, Luigi; Esposito, Luciana
2015-11-01
The elucidation of the mutual influence between peptide bond geometry and local conformation has important implications for protein structure refinement, validation, and prediction. To gain insights into the structural determinants and the energetic contributions associated with protein/peptide backbone plasticity, we here report an extensive analysis of the variability of the peptide bond angles by combining statistical analyses of protein structures and quantum mechanics calculations on small model peptide systems. Our analyses demonstrate that all the backbone bond angles strongly depend on the peptide conformation and unveil the existence of regular trends as function of ψ and/or φ. The excellent agreement of the quantum mechanics calculations with the statistical surveys of protein structures validates the computational scheme here employed and demonstrates that the valence geometry of protein/peptide backbone is primarily dictated by local interactions. Notably, for the first time we show that the position of the H(α) hydrogen atom, which is an important parameter in NMR structural studies, is also dependent on the local conformation. Most of the trends observed may be satisfactorily explained by invoking steric repulsive interactions; in some specific cases the valence bond variability is also influenced by hydrogen-bond like interactions. Moreover, we can provide a reliable estimate of the energies involved in the interplay between geometry and conformations. © 2015 Wiley Periodicals, Inc.
Abriata, Luciano A; Bovigny, Christophe; Dal Peraro, Matteo
2016-06-17
Protein variability can now be studied by measuring high-resolution tolerance-to-substitution maps and fitness landscapes in saturated mutational libraries. But these rich and expensive datasets are typically interpreted coarsely, restricting detailed analyses to positions of extremely high or low variability or dubbed important beforehand based on existing knowledge about active sites, interaction surfaces, (de)stabilizing mutations, etc. Our new webserver PsychoProt (freely available without registration at http://psychoprot.epfl.ch or at http://lucianoabriata.altervista.org/psychoprot/index.html ) helps to detect, quantify, and sequence/structure map the biophysical and biochemical traits that shape amino acid preferences throughout a protein as determined by deep-sequencing of saturated mutational libraries or from large alignments of naturally occurring variants. We exemplify how PsychoProt helps to (i) unveil protein structure-function relationships from experiments and from alignments that are consistent with structures according to coevolution analysis, (ii) recall global information about structural and functional features and identify hitherto unknown constraints to variation in alignments, and (iii) point at different sources of variation among related experimental datasets or between experimental and alignment-based data. Remarkably, metabolic costs of the amino acids pose strong constraints to variability at protein surfaces in nature but not in the laboratory. This and other differences call for caution when extrapolating results from in vitro experiments to natural scenarios in, for example, studies of protein evolution. We show through examples how PsychoProt can be a useful tool for the broad communities of structural biology and molecular evolution, particularly for studies about protein modeling, evolution and design.
X-ray crystal structures of native HIV-1 capsid protein reveal conformational variability
Gres, Anna T.; Kirby, Karen A.; KewalRamani, Vineet N.; ...
2015-06-04
The detailed molecular interactions between native HIV-1 capsid protein (CA) hexamers that shield the viral genome and proteins have been elusive. In this paper, we report crystal structures describing interactions between CA monomers related by sixfold symmetry within hexamers (intrahexamer) and threefold and twofold symmetry between neighboring hexamers (interhexamer). The structures describe how CA builds hexagonal lattices, the foundation of mature capsids. Lattice structure depends on an adaptable hydration layer modulating interactions among CA molecules. Disruption of this layer alters interhexamer interfaces, highlighting an inherent structural variability. A CA-targeting antiviral affects capsid stability by binding across CA molecules and subtlymore » altering interhexamer interfaces remote to the ligand-binding site. Finally, inherent structural plasticity, hydration layer rearrangement, and effector binding affect capsid stability and have functional implications for the retroviral life cycle.« less
A Method for WD40 Repeat Detection and Secondary Structure Prediction
Wang, Yang; Jiang, Fan; Zhuo, Zhu; Wu, Xian-Hui; Wu, Yun-Dong
2013-01-01
WD40-repeat proteins (WD40s), as one of the largest protein families in eukaryotes, play vital roles in assembling protein-protein/DNA/RNA complexes. WD40s fold into similar β-propeller structures despite diversified sequences. A program WDSP (WD40 repeat protein Structure Predictor) has been developed to accurately identify WD40 repeats and predict their secondary structures. The method is designed specifically for WD40 proteins by incorporating both local residue information and non-local family-specific structural features. It overcomes the problem of highly diversified protein sequences and variable loops. In addition, WDSP achieves a better prediction in identifying multiple WD40-domain proteins by taking the global combination of repeats into consideration. In secondary structure prediction, the average Q3 accuracy of WDSP in jack-knife test reaches 93.7%. A disease related protein LRRK2 was used as a representive example to demonstrate the structure prediction. PMID:23776530
Material Structure of a Graded Refractive Index Lens in Decapod Squid
NASA Astrophysics Data System (ADS)
Cai, Jing; Heiney, Paul; Sweeney, Alison
2013-03-01
Underwater vision with a camera-type eye that is simultaneously acute and sensitive requires a spherical lens with a graded distribution of refractive index. Squids have this type of lens, and our previous work has shown that its optical properties are likely achieved with radially variable densities of a single protein with multiple isoforms. Here we measure the spatial organization of this novel protein material in concentric layers of the lens and use these data to suggest possible mechanisms of self-assembly of the proteins into a graded refractive index structure. First, we performed small angle x-ray scattering (SAXS) to study how the protein is spatially organized. Then, molecular dynamic simulation allowed us to correlate structure to the possible dynamics of the system in different regions of the lens. The combination of simulation and SAXS data in this system revealed the likely protein-protein interactions, resulting material structure and its relationship to the observed and variable optical properties of this graded index system. We believe insights into the material properties of the squid lens system will inform the invention of self-assembling graded index devices.
Squeglia, Flavia; Bachert, Beth; Romano, Maria; Lukomski, Slawomir; Berisio, Rita
2013-09-01
Streptococcal collagen-like proteins (Scls) are widely expressed by the well recognized human pathogen Streptococcus pyogenes. These surface proteins contain a signature central collagen-like region and an amino-terminal globular domain, termed the variable domain, which is protruded away from the cell surface by the collagen-like domain. Despite their recognized importance in bacterial pathogenicity, no structural information is presently available on proteins of the Scl class. The variable domain of Scl2 from invasive M3-type S. pyogenes has successfully been crystallized using vapour-diffusion methods. The crystals diffracted to 1.5 Å resolution and belonged to space group H32, with unit-cell parameters a = 44.23, b = 44.23, c = 227.83 Å. The crystal structure was solved by single-wavelength anomalous dispersion using anomalous signal from a europium chloride derivative.|
Andersen, Ole Juul; Grouleff, Julie; Needham, Perri; Walker, Ross C; Jensen, Frank
2015-11-19
Current enhanced sampling molecular dynamics methods for studying large conformational changes in proteins suffer from certain limitations. These include, among others, the need for user defined collective variables, the prerequisite of both start and end point structures of the conformational change, and the need for a priori knowledge of the amount by which to boost specific parts of the potential. In this paper, a framework is proposed for a molecular dynamics method for studying ligand-induced conformational changes, in which the nonbonded interactions between the ligand and the protein are used to calculate a biasing force. The method requires only a single input structure, and does not entail the use of collective variables. We provide a proof-of-concept for accelerating conformational changes in three simple test molecules, as well as promising results for two proteins known to undergo domain closure upon ligand binding. For the ribose-binding protein, backbone root-mean-square deviations as low as 0.75 Å compared to the crystal structure of the closed conformation are obtained within 50 ns simulations, whereas no domain closures are observed in unbiased simulations. A skewed closed structure is obtained for the glutamine-binding protein at high bias values, indicating that specific protein-ligand interactions might suppress important protein-protein interactions.
Mining protein loops using a structural alphabet and statistical exceptionality
2010-01-01
Background Protein loops encompass 50% of protein residues in available three-dimensional structures. These regions are often involved in protein functions, e.g. binding site, catalytic pocket... However, the description of protein loops with conventional tools is an uneasy task. Regular secondary structures, helices and strands, have been widely studied whereas loops, because they are highly variable in terms of sequence and structure, are difficult to analyze. Due to data sparsity, long loops have rarely been systematically studied. Results We developed a simple and accurate method that allows the description and analysis of the structures of short and long loops using structural motifs without restriction on loop length. This method is based on the structural alphabet HMM-SA. HMM-SA allows the simplification of a three-dimensional protein structure into a one-dimensional string of states, where each state is a four-residue prototype fragment, called structural letter. The difficult task of the structural grouping of huge data sets is thus easily accomplished by handling structural letter strings as in conventional protein sequence analysis. We systematically extracted all seven-residue fragments in a bank of 93000 protein loops and grouped them according to the structural-letter sequence, named structural word. This approach permits a systematic analysis of loops of all sizes since we consider the structural motifs of seven residues rather than complete loops. We focused the analysis on highly recurrent words of loops (observed more than 30 times). Our study reveals that 73% of loop-lengths are covered by only 3310 highly recurrent structural words out of 28274 observed words). These structural words have low structural variability (mean RMSd of 0.85 Å). As expected, half of these motifs display a flanking-region preference but interestingly, two thirds are shared by short (less than 12 residues) and long loops. Moreover, half of recurrent motifs exhibit a significant level of amino-acid conservation with at least four significant positions and 87% of long loops contain at least one such word. We complement our analysis with the detection of statistically over-represented patterns of structural letters as in conventional DNA sequence analysis. About 30% (930) of structural words are over-represented, and cover about 40% of loop lengths. Interestingly, these words exhibit lower structural variability and higher sequential specificity, suggesting structural or functional constraints. Conclusions We developed a method to systematically decompose and study protein loops using recurrent structural motifs. This method is based on the structural alphabet HMM-SA and not on structural alignment and geometrical parameters. We extracted meaningful structural motifs that are found in both short and long loops. To our knowledge, it is the first time that pattern mining helps to increase the signal-to-noise ratio in protein loops. This finding helps to better describe protein loops and might permit to decrease the complexity of long-loop analysis. Detailed results are available at http://www.mti.univ-paris-diderot.fr/publication/supplementary/2009/ACCLoop/. PMID:20132552
Mining protein loops using a structural alphabet and statistical exceptionality.
Regad, Leslie; Martin, Juliette; Nuel, Gregory; Camproux, Anne-Claude
2010-02-04
Protein loops encompass 50% of protein residues in available three-dimensional structures. These regions are often involved in protein functions, e.g. binding site, catalytic pocket... However, the description of protein loops with conventional tools is an uneasy task. Regular secondary structures, helices and strands, have been widely studied whereas loops, because they are highly variable in terms of sequence and structure, are difficult to analyze. Due to data sparsity, long loops have rarely been systematically studied. We developed a simple and accurate method that allows the description and analysis of the structures of short and long loops using structural motifs without restriction on loop length. This method is based on the structural alphabet HMM-SA. HMM-SA allows the simplification of a three-dimensional protein structure into a one-dimensional string of states, where each state is a four-residue prototype fragment, called structural letter. The difficult task of the structural grouping of huge data sets is thus easily accomplished by handling structural letter strings as in conventional protein sequence analysis. We systematically extracted all seven-residue fragments in a bank of 93000 protein loops and grouped them according to the structural-letter sequence, named structural word. This approach permits a systematic analysis of loops of all sizes since we consider the structural motifs of seven residues rather than complete loops. We focused the analysis on highly recurrent words of loops (observed more than 30 times). Our study reveals that 73% of loop-lengths are covered by only 3310 highly recurrent structural words out of 28274 observed words). These structural words have low structural variability (mean RMSd of 0.85 A). As expected, half of these motifs display a flanking-region preference but interestingly, two thirds are shared by short (less than 12 residues) and long loops. Moreover, half of recurrent motifs exhibit a significant level of amino-acid conservation with at least four significant positions and 87% of long loops contain at least one such word. We complement our analysis with the detection of statistically over-represented patterns of structural letters as in conventional DNA sequence analysis. About 30% (930) of structural words are over-represented, and cover about 40% of loop lengths. Interestingly, these words exhibit lower structural variability and higher sequential specificity, suggesting structural or functional constraints. We developed a method to systematically decompose and study protein loops using recurrent structural motifs. This method is based on the structural alphabet HMM-SA and not on structural alignment and geometrical parameters. We extracted meaningful structural motifs that are found in both short and long loops. To our knowledge, it is the first time that pattern mining helps to increase the signal-to-noise ratio in protein loops. This finding helps to better describe protein loops and might permit to decrease the complexity of long-loop analysis. Detailed results are available at http://www.mti.univ-paris-diderot.fr/publication/supplementary/2009/ACCLoop/.
PDBFlex: exploring flexibility in protein structures
Hrabe, Thomas; Li, Zhanwen; Sedova, Mayya; Rotkiewicz, Piotr; Jaroszewski, Lukasz; Godzik, Adam
2016-01-01
The PDBFlex database, available freely and with no login requirements at http://pdbflex.org, provides information on flexibility of protein structures as revealed by the analysis of variations between depositions of different structural models of the same protein in the Protein Data Bank (PDB). PDBFlex collects information on all instances of such depositions, identifying them by a 95% sequence identity threshold, performs analysis of their structural differences and clusters them according to their structural similarities for easy analysis. The PDBFlex contains tools and viewers enabling in-depth examination of structural variability including: 2D-scaling visualization of RMSD distances between structures of the same protein, graphs of average local RMSD in the aligned structures of protein chains, graphical presentation of differences in secondary structure and observed structural disorder (unresolved residues), difference distance maps between all sets of coordinates and 3D views of individual structures and simulated transitions between different conformations, the latter displayed using JSMol visualization software. PMID:26615193
Zemla, Adam T; Lang, Dorothy M; Kostova, Tanya; Andino, Raul; Ecale Zhou, Carol L
2011-06-02
Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory--still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could help overcome these difficulties by facilitating the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV (structure-alignment sequence variability), a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus, and we demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique, or that share structural similarity with proteins that would be considered distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local structural alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein. StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected residues at a given sequence position. StralSV is provided as a web service at http://proteinmodel.org/AS2TS/STRALSV/.
Structures of membrane proteins
Vinothkumar, Kutti R.; Henderson, Richard
2010-01-01
In reviewing the structures of membrane proteins determined up to the end of 2009, we present in words and pictures the most informative examples from each family. We group the structures together according to their function and architecture to provide an overview of the major principles and variations on the most common themes. The first structures, determined 20 years ago, were those of naturally abundant proteins with limited conformational variability, and each membrane protein structure determined was a major landmark. With the advent of complete genome sequences and efficient expression systems, there has been an explosion in the rate of membrane protein structure determination, with many classes represented. New structures are published every month and more than 150 unique membrane protein structures have been determined. This review analyses the reasons for this success, discusses the challenges that still lie ahead, and presents a concise summary of the key achievements with illustrated examples selected from each class. PMID:20667175
A TALE-inspired computational screen for proteins that contain approximate tandem repeats.
Perycz, Malgorzata; Krwawicz, Joanna; Bochtler, Matthias
2017-01-01
TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen.
A TALE-inspired computational screen for proteins that contain approximate tandem repeats
Krwawicz, Joanna
2017-01-01
TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen. PMID:28617832
Ashford, Paul; Moss, David S; Alex, Alexander; Yeap, Siew K; Povia, Alice; Nobeli, Irene; Williams, Mark A
2012-03-14
Protein structures provide a valuable resource for rational drug design. For a protein with no known ligand, computational tools can predict surface pockets that are of suitable size and shape to accommodate a complementary small-molecule drug. However, pocket prediction against single static structures may miss features of pockets that arise from proteins' dynamic behaviour. In particular, ligand-binding conformations can be observed as transiently populated states of the apo protein, so it is possible to gain insight into ligand-bound forms by considering conformational variation in apo proteins. This variation can be explored by considering sets of related structures: computationally generated conformers, solution NMR ensembles, multiple crystal structures, homologues or homology models. It is non-trivial to compare pockets, either from different programs or across sets of structures. For a single structure, difficulties arise in defining particular pocket's boundaries. For a set of conformationally distinct structures the challenge is how to make reasonable comparisons between them given that a perfect structural alignment is not possible. We have developed a computational method, Provar, that provides a consistent representation of predicted binding pockets across sets of related protein structures. The outputs are probabilities that each atom or residue of the protein borders a predicted pocket. These probabilities can be readily visualised on a protein using existing molecular graphics software. We show how Provar simplifies comparison of the outputs of different pocket prediction algorithms, of pockets across multiple simulated conformations and between homologous structures. We demonstrate the benefits of use of multiple structures for protein-ligand and protein-protein interface analysis on a set of complexes and consider three case studies in detail: i) analysis of a kinase superfamily highlights the conserved occurrence of surface pockets at the active and regulatory sites; ii) a simulated ensemble of unliganded Bcl2 structures reveals extensions of a known ligand-binding pocket not apparent in the apo crystal structure; iii) visualisations of interleukin-2 and its homologues highlight conserved pockets at the known receptor interfaces and regions whose conformation is known to change on inhibitor binding. Through post-processing of the output of a variety of pocket prediction software, Provar provides a flexible approach to the analysis and visualization of the persistence or variability of pockets in sets of related protein structures.
New paradigm in ankyrin repeats: Beyond protein-protein interaction module.
Islam, Zeyaul; Nagampalli, Raghavendra Sashi Krishna; Fatima, Munazza Tamkeen; Ashraf, Ghulam Md
2018-04-01
Classically, ankyrin repeat (ANK) proteins are built from tandems of two or more repeats and form curved solenoid structures that are associated with protein-protein interactions. These are short, widespread structural motif of around 33 amino acids repeats in tandem, having a canonical helix-loop-helix fold, found individually or in combination with other domains. The multiplicity of structural pattern enables it to form assemblies of diverse sizes, required for their abilities to confer multiple binding and structural roles of proteins. Three-dimensional structures of these repeats determined to date reveal a degree of structural variability that translates into the considerable functional versatility of this protein superfamily. Recent work on the ANK has proposed novel structural information, especially protein-lipid, protein-sugar and protein-protein interaction. Self-assembly of these repeats was also shown to prevent the associated protein in forming filaments. In this review, we summarize the latest findings and how the new structural information has increased our understanding of the structural determinants of ANK proteins. We discussed latest findings on how these proteins participate in various interactions to diversify the ANK roles in numerous biological processes, and explored the emerging and evolving field of designer ankyrins and its framework for protein engineering emphasizing on biotechnological applications. Copyright © 2017 Elsevier B.V. All rights reserved.
Identifying DNA-binding proteins using structural motifs and the electrostatic potential
Shanahan, Hugh P.; Garcia, Mario A.; Jones, Susan; Thornton, Janet M.
2004-01-01
Robust methods to detect DNA-binding proteins from structures of unknown function are important for structural biology. This paper describes a method for identifying such proteins that (i) have a solvent accessible structural motif necessary for DNA-binding and (ii) a positive electrostatic potential in the region of the binding region. We focus on three structural motifs: helix–turn-helix (HTH), helix–hairpin–helix (HhH) and helix–loop–helix (HLH). We find that the combination of these variables detect 78% of proteins with an HTH motif, which is a substantial improvement over previous work based purely on structural templates and is comparable to more complex methods of identifying DNA-binding proteins. Similar true positive fractions are achieved for the HhH and HLH motifs. We see evidence of wide evolutionary diversity for DNA-binding proteins with an HTH motif, and much smaller diversity for those with an HhH or HLH motif. PMID:15356290
Stereophysicochemical variability plots highlight conserved antigenic areas in Flaviviruses
Schein, Catherine H; Zhou, Bin; Braun, Werner
2005-01-01
Background Flaviviruses, which include Dengue (DV) and West Nile (WN), mutate in response to immune system pressure. Identifying escape mutants, variant progeny that replicate in the presence of neutralizing antibodies, is a common way to identify functionally important residues of viral proteins. However, the mutations typically occur at variable positions on the viral surface that are not essential for viral replication. Methods are needed to determine the true targets of the neutralizing antibodies. Results Stereophysicochemical variability plots (SVPs), 3-D images of protein structures colored according to variability, as determined by our PCPMer program, were used to visualize residues conserved in their physical chemical properties (PCPs) near escape mutant positions. The analysis showed 1) that escape mutations in the flavivirus envelope protein are variable residues by our criteria and 2) two escape mutants found at the same position in many flaviviruses sit above clusters of conserved residues from different regions of the linear sequence. Conservation patterns in T-cell epitopes in the NS3- protease suggest a similar mechanism of immune system evasion. Conclusion The SVPs add another dimension to structurally defining the binding sites of neutralizing antibodies. They provide a useful aid for determining antigenically important regions and designing vaccines. PMID:15845145
Lu, Hui-Meng; Yin, Da-Chuan; Ye, Ya-Jing; Luo, Hui-Min; Geng, Li-Qiang; Li, Hai-Sheng; Guo, Wei-Hong; Shang, Peng
2009-01-01
As the most widely utilized technique to determine the 3-dimensional structure of protein molecules, X-ray crystallography can provide structure of the highest resolution among the developed techniques. The resolution obtained via X-ray crystallography is known to be influenced by many factors, such as the crystal quality, diffraction techniques, and X-ray sources, etc. In this paper, the authors found that the protein sequence could also be one of the factors. We extracted information of the resolution and the sequence of proteins from the Protein Data Bank (PDB), classified the proteins into different clusters according to the sequence similarity, and statistically analyzed the relationship between the sequence similarity and the best resolution obtained. The results showed that there was a pronounced correlation between the sequence similarity and the obtained resolution. These results indicate that protein structure itself is one variable that may affect resolution when X-ray crystallography is used.
The ribosomal subunit assembly line
Dlakić, Mensur
2005-01-01
Recent proteomic studies in Saccharomyces cerevisiae have identified nearly 200 proteins, other than the structural ribosomal proteins, that participate in the assembly of ribosomal subunits and their transport from the nucleus. In a separate line of research, proteomic studies of mature plant ribosomes have revealed considerable variability in the protein composition of individual ribosomes. PMID:16207363
Analysis of the interface variability in NMR structure ensembles of protein-protein complexes.
Calvanese, Luisa; D'Auria, Gabriella; Vangone, Anna; Falcigno, Lucia; Oliva, Romina
2016-06-01
NMR structures consist in ensembles of conformers, all satisfying the experimental restraints, which exhibit a certain degree of structural variability. We analyzed here the interface in NMR ensembles of protein-protein heterodimeric complexes and found it to span a wide range of different conservations. The different exhibited conservations do not simply correlate with the size of the systems/interfaces, and are most probably the result of an interplay between different factors, including the quality of experimental data and the intrinsic complex flexibility. In any case, this information is not to be missed when NMR structures of protein-protein complexes are analyzed; especially considering that, as we also show here, the first NMR conformer is usually not the one which best reflects the overall interface. To quantify the interface conservation and to analyze it, we used an approach originally conceived for the analysis and ranking of ensembles of docking models, which has now been extended to directly deal with NMR ensembles. We propose this approach, based on the conservation of the inter-residue contacts at the interface, both for the analysis of the interface in whole ensembles of NMR complexes and for the possible selection of a single conformer as the best representative of the overall interface. In order to make the analyses automatic and fast, we made the protocol available as a web tool at: https://www.molnac.unisa.it/BioTools/consrank/consrank-nmr.html. Copyright © 2016 Elsevier Inc. All rights reserved.
An Unusual Hydrophobic Core Confers Extreme Flexibility to HEAT Repeat Proteins
Kappel, Christian; Zachariae, Ulrich; Dölker, Nicole; Grubmüller, Helmut
2010-01-01
Alpha-solenoid proteins are suggested to constitute highly flexible macromolecules, whose structural variability and large surface area is instrumental in many important protein-protein binding processes. By equilibrium and nonequilibrium molecular dynamics simulations, we show that importin-β, an archetypical α-solenoid, displays unprecedentedly large and fully reversible elasticity. Our stretching molecular dynamics simulations reveal full elasticity over up to twofold end-to-end extensions compared to its bound state. Despite the absence of any long-range intramolecular contacts, the protein can return to its equilibrium structure to within 3 Å backbone RMSD after the release of mechanical stress. We find that this extreme degree of flexibility is based on an unusually flexible hydrophobic core that differs substantially from that of structurally similar but more rigid globular proteins. In that respect, the core of importin-β resembles molten globules. The elastic behavior is dominated by nonpolar interactions between HEAT repeats, combined with conformational entropic effects. Our results suggest that α-solenoid structures such as importin-β may bridge the molecular gap between completely structured and intrinsically disordered proteins. PMID:20816072
Native State Volume Fluctuations in Proteins as a Mechanism for Dynamic Allostery.
Law, Anthony B; Sapienza, Paul J; Zhang, Jun; Zuo, Xiaobing; Petit, Chad M
2017-03-15
Allostery enables tight regulation of protein function in the cellular environment. Although existing models of allostery are firmly rooted in the current structure-function paradigm, the mechanistic basis for allostery in the absence of structural change remains unclear. In this study, we show that a typical globular protein is able to undergo significant changes in volume under native conditions while exhibiting no additional changes in protein structure. These native state volume fluctuations were found to correlate with changes in internal motions that were previously recognized as a source of allosteric entropy. This finding offers a novel mechanistic basis for allostery in the absence of canonical structural change. The unexpected observation that function can be derived from expanded, low density protein states has broad implications for our understanding of allostery and suggests that the general concept of the native state be expanded to allow for more variable physical dimensions with looser packing.
A hidden markov model derived structural alphabet for proteins.
Camproux, A C; Gautier, R; Tufféry, P
2004-06-04
Understanding and predicting protein structures depends on the complexity and the accuracy of the models used to represent them. We have set up a hidden Markov model that discretizes protein backbone conformation as series of overlapping fragments (states) of four residues length. This approach learns simultaneously the geometry of the states and their connections. We obtain, using a statistical criterion, an optimal systematic decomposition of the conformational variability of the protein peptidic chain in 27 states with strong connection logic. This result is stable over different protein sets. Our model fits well the previous knowledge related to protein architecture organisation and seems able to grab some subtle details of protein organisation, such as helix sub-level organisation schemes. Taking into account the dependence between the states results in a description of local protein structure of low complexity. On an average, the model makes use of only 8.3 states among 27 to describe each position of a protein structure. Although we use short fragments, the learning process on entire protein conformations captures the logic of the assembly on a larger scale. Using such a model, the structure of proteins can be reconstructed with an average accuracy close to 1.1A root-mean-square deviation and for a complexity of only 3. Finally, we also observe that sequence specificity increases with the number of states of the structural alphabet. Such models can constitute a very relevant approach to the analysis of protein architecture in particular for protein structure prediction.
Bidargaddi, Niranjan P; Chetty, Madhu; Kamruzzaman, Joarder
2008-06-01
Profile hidden Markov models (HMMs) based on classical HMMs have been widely applied for protein sequence identification. The formulation of the forward and backward variables in profile HMMs is made under statistical independence assumption of the probability theory. We propose a fuzzy profile HMM to overcome the limitations of that assumption and to achieve an improved alignment for protein sequences belonging to a given family. The proposed model fuzzifies the forward and backward variables by incorporating Sugeno fuzzy measures and Choquet integrals, thus further extends the generalized HMM. Based on the fuzzified forward and backward variables, we propose a fuzzy Baum-Welch parameter estimation algorithm for profiles. The strong correlations and the sequence preference involved in the protein structures make this fuzzy architecture based model as a suitable candidate for building profiles of a given family, since the fuzzy set can handle uncertainties better than classical methods.
Ladunga, I
1992-04-01
The markedly nonuniform, even systematic distribution of sequences in the protein "universe" has been analyzed by methods of protein taxonomy. Mapping of the natural hierarchical system of proteins has revealed some dense cores, i.e., well-defined clusterings of proteins that seem to be natural structural groupings, possibly seeds for a future protein taxonomy. The aim was not to force proteins into more or less man-made categories by discriminant analysis, but to find structurally similar groups, possibly of common evolutionary origin. Single-valued distance measures between pairs of superfamilies from the Protein Identification Resource were defined by two chi 2-like methods on tripeptide frequencies and the variable-length subsequence identity method derived from dot-matrix comparisons. Distance matrices were processed by several methods of cluster analysis to detect phylogenetic continuum between highly divergent proteins. Only well-defined clusters characterized by relatively unique structural, intracellular environmental, organismal, and functional attribute states were selected as major protein groups, including subsets of viral and Escherichia coli proteins, hormones, inhibitors, plant, ribosomal, serum and structural proteins, amino acid synthases, and clusters dominated by certain oxidoreductases and apolar and DNA-associated enzymes. The limited repertoire of functional patterns due to small genome size, the high rate of recombination, specific features of the bacterial membranes, or of the virus cycle canalize certain proteins of viruses and Gram-negative bacteria, respectively, to organismal groups.
Noguera, Martín E.; Vazquez, Diego S.; Ferrer-Sueta, Gerardo; Agudelo, William A.; Howard, Eduardo; Rasia, Rodolfo M.; Manta, Bruno; Cousido-Siah, Alexandra; Mitschler, André; Podjarny, Alberto; Santos, Javier
2017-01-01
Thioredoxin is a ubiquitous small protein that catalyzes redox reactions of protein thiols. Additionally, thioredoxin from E. coli (EcTRX) is a widely-used model for structure-function studies. In a previous paper, we characterized several single-point mutants of the C-terminal helix (CTH) that alter global stability of EcTRX. However, spectroscopic signatures and enzymatic activity for some of these mutants were found essentially unaffected. A comprehensive structural characterization at the atomic level of these near-invariant mutants can provide detailed information about structural variability of EcTRX. We address this point through the determination of the crystal structures of four point-mutants, whose mutations occurs within or near the CTH, namely L94A, E101G, N106A and L107A. These structures are mostly unaffected compared with the wild-type variant. Notably, the E101G mutant presents a large region with two alternative traces for the backbone of the same chain. It represents a significant shift in backbone positions. Enzymatic activity measurements and conformational dynamics studies monitored by NMR and molecular dynamic simulations show that E101G mutation results in a small effect in the structural features of the protein. We hypothesize that these alternative conformations represent samples of the native-state ensemble of EcTRX, specifically the magnitude and location of conformational heterogeneity. PMID:28181556
NASA Astrophysics Data System (ADS)
Noguera, Martín E.; Vazquez, Diego S.; Ferrer-Sueta, Gerardo; Agudelo, William A.; Howard, Eduardo; Rasia, Rodolfo M.; Manta, Bruno; Cousido-Siah, Alexandra; Mitschler, André; Podjarny, Alberto; Santos, Javier
2017-02-01
Thioredoxin is a ubiquitous small protein that catalyzes redox reactions of protein thiols. Additionally, thioredoxin from E. coli (EcTRX) is a widely-used model for structure-function studies. In a previous paper, we characterized several single-point mutants of the C-terminal helix (CTH) that alter global stability of EcTRX. However, spectroscopic signatures and enzymatic activity for some of these mutants were found essentially unaffected. A comprehensive structural characterization at the atomic level of these near-invariant mutants can provide detailed information about structural variability of EcTRX. We address this point through the determination of the crystal structures of four point-mutants, whose mutations occurs within or near the CTH, namely L94A, E101G, N106A and L107A. These structures are mostly unaffected compared with the wild-type variant. Notably, the E101G mutant presents a large region with two alternative traces for the backbone of the same chain. It represents a significant shift in backbone positions. Enzymatic activity measurements and conformational dynamics studies monitored by NMR and molecular dynamic simulations show that E101G mutation results in a small effect in the structural features of the protein. We hypothesize that these alternative conformations represent samples of the native-state ensemble of EcTRX, specifically the magnitude and location of conformational heterogeneity.
Unbiased, scalable sampling of protein loop conformations from probabilistic priors.
Zhang, Yajia; Hauser, Kris
2013-01-01
Protein loops are flexible structures that are intimately tied to function, but understanding loop motion and generating loop conformation ensembles remain significant computational challenges. Discrete search techniques scale poorly to large loops, optimization and molecular dynamics techniques are prone to local minima, and inverse kinematics techniques can only incorporate structural preferences in adhoc fashion. This paper presents Sub-Loop Inverse Kinematics Monte Carlo (SLIKMC), a new Markov chain Monte Carlo algorithm for generating conformations of closed loops according to experimentally available, heterogeneous structural preferences. Our simulation experiments demonstrate that the method computes high-scoring conformations of large loops (>10 residues) orders of magnitude faster than standard Monte Carlo and discrete search techniques. Two new developments contribute to the scalability of the new method. First, structural preferences are specified via a probabilistic graphical model (PGM) that links conformation variables, spatial variables (e.g., atom positions), constraints and prior information in a unified framework. The method uses a sparse PGM that exploits locality of interactions between atoms and residues. Second, a novel method for sampling sub-loops is developed to generate statistically unbiased samples of probability densities restricted by loop-closure constraints. Numerical experiments confirm that SLIKMC generates conformation ensembles that are statistically consistent with specified structural preferences. Protein conformations with 100+ residues are sampled on standard PC hardware in seconds. Application to proteins involved in ion-binding demonstrate its potential as a tool for loop ensemble generation and missing structure completion.
Unbiased, scalable sampling of protein loop conformations from probabilistic priors
2013-01-01
Background Protein loops are flexible structures that are intimately tied to function, but understanding loop motion and generating loop conformation ensembles remain significant computational challenges. Discrete search techniques scale poorly to large loops, optimization and molecular dynamics techniques are prone to local minima, and inverse kinematics techniques can only incorporate structural preferences in adhoc fashion. This paper presents Sub-Loop Inverse Kinematics Monte Carlo (SLIKMC), a new Markov chain Monte Carlo algorithm for generating conformations of closed loops according to experimentally available, heterogeneous structural preferences. Results Our simulation experiments demonstrate that the method computes high-scoring conformations of large loops (>10 residues) orders of magnitude faster than standard Monte Carlo and discrete search techniques. Two new developments contribute to the scalability of the new method. First, structural preferences are specified via a probabilistic graphical model (PGM) that links conformation variables, spatial variables (e.g., atom positions), constraints and prior information in a unified framework. The method uses a sparse PGM that exploits locality of interactions between atoms and residues. Second, a novel method for sampling sub-loops is developed to generate statistically unbiased samples of probability densities restricted by loop-closure constraints. Conclusion Numerical experiments confirm that SLIKMC generates conformation ensembles that are statistically consistent with specified structural preferences. Protein conformations with 100+ residues are sampled on standard PC hardware in seconds. Application to proteins involved in ion-binding demonstrate its potential as a tool for loop ensemble generation and missing structure completion. PMID:24565175
Direct Calculation of Protein Fitness Landscapes through Computational Protein Design
Au, Loretta; Green, David F.
2016-01-01
Naturally selected amino-acid sequences or experimentally derived ones are often the basis for understanding how protein three-dimensional conformation and function are determined by primary structure. Such sequences for a protein family comprise only a small fraction of all possible variants, however, representing the fitness landscape with limited scope. Explicitly sampling and characterizing alternative, unexplored protein sequences would directly identify fundamental reasons for sequence robustness (or variability), and we demonstrate that computational methods offer an efficient mechanism toward this end, on a large scale. The dead-end elimination and A∗ search algorithms were used here to find all low-energy single mutant variants, and corresponding structures of a G-protein heterotrimer, to measure changes in structural stability and binding interactions to define a protein fitness landscape. We established consistency between these algorithms with known biophysical and evolutionary trends for amino-acid substitutions, and could thus recapitulate known protein side-chain interactions and predict novel ones. PMID:26745411
NASA Astrophysics Data System (ADS)
Weigt, Martin
Over the last years, biological research has been revolutionized by experimental high-throughput techniques, in particular by next-generation sequencing technology. Unprecedented amounts of data are accumulating, and there is a growing request for computational methods unveiling the information hidden in raw data, thereby increasing our understanding of complex biological systems. Statistical-physics models based on the maximum-entropy principle have, in the last few years, played an important role in this context. To give a specific example, proteins and many non-coding RNA show a remarkable degree of structural and functional conservation in the course of evolution, despite a large variability in amino acid sequences. We have developed a statistical-mechanics inspired inference approach - called Direct-Coupling Analysis - to link this sequence variability (easy to observe in sequence alignments, which are available in public sequence databases) to bio-molecular structure and function. In my presentation I will show, how this methodology can be used (i) to infer contacts between residues and thus to guide tertiary and quaternary protein structure prediction and RNA structure prediction, (ii) to discriminate interacting from non-interacting protein families, and thus to infer conserved protein-protein interaction networks, and (iii) to reconstruct mutational landscapes and thus to predict the phenotypic effect of mutations. References [1] M. Figliuzzi, H. Jacquier, A. Schug, O. Tenaillon and M. Weigt ''Coevolutionary landscape inference and the context-dependence of mutations in beta-lactamase TEM-1'', Mol. Biol. Evol. (2015), doi: 10.1093/molbev/msv211 [2] E. De Leonardis, B. Lutz, S. Ratz, S. Cocco, R. Monasson, A. Schug, M. Weigt ''Direct-Coupling Analysis of nucleotide coevolution facilitates RNA secondary and tertiary structure prediction'', Nucleic Acids Research (2015), doi: 10.1093/nar/gkv932 [3] F. Morcos, A. Pagnani, B. Lunt, A. Bertolino, D. Marks, C. Sander, R. Zecchina, J.N. Onuchic, T. Hwa, M. Weigt, ''Direct-coupling analysis of residue co-evolution captures native contacts across many protein families'', Proc. Natl. Acad. Sci. 108, E1293-E1301 (2011).
Structural Influence on the Dominance of Virus-Specific CD4 T Cell Epitopes in Zika Virus Infection.
Koblischke, Maximilian; Stiasny, Karin; Aberle, Stephan W; Malafa, Stefan; Tschouchnikas, Georgios; Schwaiger, Julia; Kundi, Michael; Heinz, Franz X; Aberle, Judith H
2018-01-01
Zika virus (ZIKV) has recently caused explosive outbreaks in Pacific islands, South- and Central America. Like with other flaviviruses, protective immunity is strongly dependent on potently neutralizing antibodies (Abs) directed against the viral envelope protein E. Such Ab formation is promoted by CD4 T cells through direct interaction with B cells that present epitopes derived from E or other structural proteins of the virus. Here, we examined the extent and epitope dominance of CD4 T cell responses to capsid (C) and envelope proteins in Zika patients. All patients developed ZIKV-specific CD4 T cell responses, with substantial contributions of C and E. In both proteins, immunodominant epitopes clustered at sites that are structurally conserved among flaviviruses but have highly variable sequences, suggesting a strong impact of protein structural features on immunodominant CD4 T cell responses. Our data are particularly relevant for designing flavivirus vaccines and their evaluation in T cell assays and provide insights into the importance of viral protein structure for epitope selection and antigenicity.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jackson, P.J.; Walthers, E.A.; Richmond, K.L.
1997-04-01
PCR analysis of 198 Bacillus anthracis isolates revealed a variable region of DNA sequence differing in length among the isolates. Five Polymorphisms differed by the presence Of two to six copies of the 12-bp tandem repeat 5{prime}-CAATATCAACAA-3{prime}. This variable-number tandem repeat (VNTR) region is located within a larger sequence containing one complete open reading frame that encodes a putative 30-kDa protein. Length variation did not change the reading frame of the encoded protein and only changed the copy number of a 4-amino-acid sequence (QYQQ) from 2 to 6. The structure of the VNTR region suggests that these multiple repeats aremore » generated by recombination or polymerase slippage. Protein structures predicted from the reverse-translated DNA sequence suggest that any structural changes in the encoded protein are confined to the region encoded by the VNTR sequence. Copy number differences in the VNTR region were used to define five different B. anthracis alleles. Characterization of 198 isolates revealed allele frequencies of 6.1, 17.7, 59.6, 5.6, and 11.1% sequentially from shorter to longer alleles. The high degree of polymorphism in the VNTR region provides a criterion for assigning isolates to five allelic categories. There is a correlation between categories and geographic distribution. Such molecular markers can be used to monitor the epidemiology of anthrax outbreaks in domestic and native herbivore populations. 22 refs., 4 figs., 3 tabs.« less
Iyer, Lavanya K.; Sacha, Gregory A.; Moorthy, Balakrishnan S.; Nail, Steven L.; Topp, Elizabeth M.
2016-01-01
Myoglobin (Mb) was lyophilized in the absence (Mb-A) and presence (Mb-B) of sucrose in a pilot-scale lyophilizer with or without controlled ice nucleation. Cake morphology was characterized using scanning electron microscopy (SEM) and changes in protein structure were monitored using solid-state Fourier-transform infrared spectroscopy (ssFTIR), solid-state hydrogen-deuterium exchange-mass spectrometry (ssHDX-MS) and solid-state photolytic labeling-mass spectrometry (ssPL-MS). The results showed greater variability in nucleation temperature and irregular cake structure for formulations lyophilized without controlled nucleation. Controlled nucleation resulted in nucleation at ~ −5 °C and uniform cake structure. Formulations containing sucrose showed better retention of protein structure by all measures than formulations without sucrose. Samples lyophilized with and without controlled nucleation were similar by most measures of protein structure. However, ssPL-MS showed the greatest pLeu incorporation and more labeled regions for Mb-B lyophilized with controlled nucleation. The data support the use of ssHDX-MS and ssPL-MS to study formulation and process-induced conformational changes in lyophilized proteins. PMID:27044943
Modeling Protein Expression and Protein Signaling Pathways
Telesca, Donatello; Müller, Peter; Kornblau, Steven M.; Suchard, Marc A.; Ji, Yuan
2015-01-01
High-throughput functional proteomic technologies provide a way to quantify the expression of proteins of interest. Statistical inference centers on identifying the activation state of proteins and their patterns of molecular interaction formalized as dependence structure. Inference on dependence structure is particularly important when proteins are selected because they are part of a common molecular pathway. In that case, inference on dependence structure reveals properties of the underlying pathway. We propose a probability model that represents molecular interactions at the level of hidden binary latent variables that can be interpreted as indicators for active versus inactive states of the proteins. The proposed approach exploits available expert knowledge about the target pathway to define an informative prior on the hidden conditional dependence structure. An important feature of this prior is that it provides an instrument to explicitly anchor the model space to a set of interactions of interest, favoring a local search approach to model determination. We apply our model to reverse-phase protein array data from a study on acute myeloid leukemia. Our inference identifies relevant subpathways in relation to the unfolding of the biological process under study. PMID:26246646
Perez, Romel B.; Tischer, Alexander; Auton, Matthew; Whitten, Steven T.
2014-01-01
Molecular transduction of biological signals is understood primarily in terms of the cooperative structural transitions of protein macromolecules, providing a mechanism through which discrete local structure perturbations affect global macromolecular properties. The recognition that proteins lacking tertiary stability, commonly referred to as intrinsically disordered proteins, mediate key signaling pathways suggests that protein structures without cooperative intramolecular interactions may also have the ability to couple local and global structure changes. Presented here are results from experiments that measured and tested the ability of disordered proteins to couple local changes in structure to global changes in structure. Using the intrinsically disordered N-terminal region of the p53 protein as an experimental model, a set of proline and alanine to glycine substitution variants were designed to modulate backbone conformational propensities without introducing non-native intramolecular interactions. The hydrodynamic radius (Rh) was used to monitor changes in global structure. Circular dichroism spectroscopy showed that the glycine substitutions decreased polyproline II (PPII) propensities relative to the wild type, as expected, and fluorescence methods indicated that substitution-induced changes in Rh were not associated with folding. The experiments showed that changes in local PPII structure cause changes in Rh that are variable and that depend on the intrinsic chain propensities of proline and alanine residues, demonstrating a mechanism for coupling local and global structure changes. Molecular simulations that model our results were used to extend the analysis to other proteins and illustrate the generality of the observed proline and alanine effects on the structures of intrinsically disordered proteins. PMID:25244701
DOE Office of Scientific and Technical Information (OSTI.GOV)
Le Coq, Johanne; Ghosh, Partho
2012-06-19
Anticipatory ligand binding through massive protein sequence variation is rare in biological systems, having been observed only in the vertebrate adaptive immune response and in a phage diversity-generating retroelement (DGR). Earlier work has demonstrated that the prototypical DGR variable protein, major tropism determinant (Mtd), meets the demands of anticipatory ligand binding by novel means through the C-type lectin (CLec) fold. However, because of the low sequence identity among DGR variable proteins, it has remained unclear whether the CLec fold is a general solution for DGRs. We have addressed this problem by determining the structure of a second DGR variable protein,more » TvpA, from the pathogenic oral spirochete Treponema denticola. Despite its weak sequence identity to Mtd ({approx}16%), TvpA was found to also have a CLec fold, with predicted variable residues exposed in a ligand-binding site. However, this site in TvpA was markedly more variable than the one in Mtd, reflecting the unprecedented approximate 10{sup 20} potential variability of TvpA. In addition, similarity between TvpA and Mtd with formylglycine-generating enzymes was detected. These results provide strong evidence for the conservation of the formylglycine-generating enzyme-type CLec fold among DGRs as a means of accommodating massive sequence variation.« less
Le Coq, Johanne; Ghosh, Partho
2011-01-01
Anticipatory ligand binding through massive protein sequence variation is rare in biological systems, having been observed only in the vertebrate adaptive immune response and in a phage diversity-generating retroelement (DGR). Earlier work has demonstrated that the prototypical DGR variable protein, major tropism determinant (Mtd), meets the demands of anticipatory ligand binding by novel means through the C-type lectin (CLec) fold. However, because of the low sequence identity among DGR variable proteins, it has remained unclear whether the CLec fold is a general solution for DGRs. We have addressed this problem by determining the structure of a second DGR variable protein, TvpA, from the pathogenic oral spirochete Treponema denticola. Despite its weak sequence identity to Mtd (∼16%), TvpA was found to also have a CLec fold, with predicted variable residues exposed in a ligand-binding site. However, this site in TvpA was markedly more variable than the one in Mtd, reflecting the unprecedented approximate 1020 potential variability of TvpA. In addition, similarity between TvpA and Mtd with formylglycine-generating enzymes was detected. These results provide strong evidence for the conservation of the formylglycine-generating enzyme-type CLec fold among DGRs as a means of accommodating massive sequence variation. PMID:21873231
Le Coq, Johanne; Ghosh, Partho
2011-08-30
Anticipatory ligand binding through massive protein sequence variation is rare in biological systems, having been observed only in the vertebrate adaptive immune response and in a phage diversity-generating retroelement (DGR). Earlier work has demonstrated that the prototypical DGR variable protein, major tropism determinant (Mtd), meets the demands of anticipatory ligand binding by novel means through the C-type lectin (CLec) fold. However, because of the low sequence identity among DGR variable proteins, it has remained unclear whether the CLec fold is a general solution for DGRs. We have addressed this problem by determining the structure of a second DGR variable protein, TvpA, from the pathogenic oral spirochete Treponema denticola. Despite its weak sequence identity to Mtd (∼16%), TvpA was found to also have a CLec fold, with predicted variable residues exposed in a ligand-binding site. However, this site in TvpA was markedly more variable than the one in Mtd, reflecting the unprecedented approximate 10(20) potential variability of TvpA. In addition, similarity between TvpA and Mtd with formylglycine-generating enzymes was detected. These results provide strong evidence for the conservation of the formylglycine-generating enzyme-type CLec fold among DGRs as a means of accommodating massive sequence variation.
Bayesian Peak Picking for NMR Spectra
Cheng, Yichen; Gao, Xin; Liang, Faming
2013-01-01
Protein structure determination is a very important topic in structural genomics, which helps people to understand varieties of biological functions such as protein-protein interactions, protein–DNA interactions and so on. Nowadays, nuclear magnetic resonance (NMR) has often been used to determine the three-dimensional structures of protein in vivo. This study aims to automate the peak picking step, the most important and tricky step in NMR structure determination. We propose to model the NMR spectrum by a mixture of bivariate Gaussian densities and use the stochastic approximation Monte Carlo algorithm as the computational tool to solve the problem. Under the Bayesian framework, the peak picking problem is casted as a variable selection problem. The proposed method can automatically distinguish true peaks from false ones without preprocessing the data. To the best of our knowledge, this is the first effort in the literature that tackles the peak picking problem for NMR spectrum data using Bayesian method. PMID:24184964
DOE Office of Scientific and Technical Information (OSTI.GOV)
Law, Anthony B.; Sapienza, Paul J.; Zhang, Jun
Allostery enables tight regulation of protein function in the cellular environment. While existing models of allostery are firmly rooted in the current structure-function paradigm, the mechanistic basis for allostery in the absence of structural change remains unclear. In this study, we show that a typical globular protein is able to undergo significant changes in volume under native conditions while exhibiting no additional changes in protein structure. These native state volume fluctuations were found to correlate with changes in internal motions that were previously recognized as a source of allosteric entropy. This finding offers a novel mechanistic basis for allostery inmore » the absence of canonical structural change. As a result, the unexpected observation that function can be derived from expanded, low density protein states has broad implications for our understanding of allostery and suggests that the general concept of the native state be expanded to allow for more variable physical dimensions with looser packing.« less
Bayesian peak picking for NMR spectra.
Cheng, Yichen; Gao, Xin; Liang, Faming
2014-02-01
Protein structure determination is a very important topic in structural genomics, which helps people to understand varieties of biological functions such as protein-protein interactions, protein-DNA interactions and so on. Nowadays, nuclear magnetic resonance (NMR) has often been used to determine the three-dimensional structures of protein in vivo. This study aims to automate the peak picking step, the most important and tricky step in NMR structure determination. We propose to model the NMR spectrum by a mixture of bivariate Gaussian densities and use the stochastic approximation Monte Carlo algorithm as the computational tool to solve the problem. Under the Bayesian framework, the peak picking problem is casted as a variable selection problem. The proposed method can automatically distinguish true peaks from false ones without preprocessing the data. To the best of our knowledge, this is the first effort in the literature that tackles the peak picking problem for NMR spectrum data using Bayesian method. Copyright © 2013. Production and hosting by Elsevier Ltd.
Native state volume fluctuations in proteins as a mechanism for dynamic allostery
Law, Anthony B.; Sapienza, Paul J.; Zhang, Jun; ...
2017-01-17
Allostery enables tight regulation of protein function in the cellular environment. While existing models of allostery are firmly rooted in the current structure-function paradigm, the mechanistic basis for allostery in the absence of structural change remains unclear. In this study, we show that a typical globular protein is able to undergo significant changes in volume under native conditions while exhibiting no additional changes in protein structure. These native state volume fluctuations were found to correlate with changes in internal motions that were previously recognized as a source of allosteric entropy. This finding offers a novel mechanistic basis for allostery inmore » the absence of canonical structural change. As a result, the unexpected observation that function can be derived from expanded, low density protein states has broad implications for our understanding of allostery and suggests that the general concept of the native state be expanded to allow for more variable physical dimensions with looser packing.« less
Crystal Structure of Neurotropism-Associated Variable Surface Protein 1 (VSP1) of Borrelia Turicatae
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lawson,C.; Yung, B.; Barbour, A.
2006-01-01
Vsp surface lipoproteins are serotype-defining antigens of relapsing fever spirochetes that undergo multiphasic antigenic variation to allow bacterial persistence in spite of an immune response. Two isogenic serotypes of Borrelia turicatae strain Oz1 differ in their Vsp sequences and in disease manifestations in infected mice: Vsp1 is associated with the selection of a neurological niche, while Vsp2 is associated with blood and skin infection. We report here crystal structures of the Vsp1 dimer at 2.7 and 2.2 Angstroms. The structures confirm that relapsing fever Vsp proteins share a common helical fold with OspCs of Lyme disease-causing Borrelia. The fold featuresmore » an inner stem formed by highly conserved N and C termini and an outer 'dome' formed by the variable central residues. Both Vsp1 and OspC structures possess small water-filled cavities, or pockets, that are lined largely by variable residues and are thus highly variable in shape. These features appear to signify tolerance of the Vsp-OspC fold for imperfect packing of residues at its antigenic surface. Structural comparison of Vsp1 with a homology model for Vsp2 suggests that observed differences in disease manifestation may arise in part from distinct differences in electrostatic surface properties; additional predicted positively charged surface patches on Vsp2 compared to Vsp1 may be sufficient to explain the relative propensity of Vsp2 to bind to acidic glycosaminoglycans.« less
On Relevance of Codon Usage to Expression of Synthetic and Natural Genes in Escherichia coli
Supek, Fran; Šmuc, Tomislav
2010-01-01
A recent investigation concluded that codon bias did not affect expression of green fluorescent protein (GFP) variants in Escherichia coli, while stability of an mRNA secondary structure near the 5′ end played a dominant role. We demonstrate that combining the two variables using regression trees or support vector regression yields a biologically plausible model with better support in the GFP data set and in other experimental data: codon usage is relevant for protein levels if the 5′ mRNA structures are not strong. Natural E. coli genes had weaker 5′ mRNA structures than the examined set of GFP variants and did not exhibit a correlation between the folding free energy of 5′ mRNA structures and protein expression. PMID:20421604
Young, Paul G; Moreland, Nicole J; Loh, Jacelyn M; Bell, Anita; Atatoa Carr, Polly; Proft, Thomas; Baker, Edward N
2014-07-01
Group A streptococcus (GAS; Streptococcus pyogenes) is a Gram-positive human pathogen that causes a broad range of diseases ranging from acute pharyngitis to the poststreptococcal sequelae of acute rheumatic fever. GAS pili are highly diverse, long protein polymers that extend from the cell surface. They have multiple roles in infection and are promising candidates for vaccine development. This study describes the structure of the T6 backbone pilin (BP; Lancefield T-antigen) from the important M6 serotype. The structure reveals a modular arrangement of three tandem immunoglobulin-like domains, two with internal isopeptide bonds. The T6 pilin lysine, essential for polymerization, is located in a novel VAKS motif that is structurally homologous to the canonical YPKN pilin lysine in other three- and four-domain Gram-positive pilins. The T6 structure also highlights a conserved pilin core whose surface is decorated with highly variable loops and extensions. Comparison to other Gram-positive BPs shows that many of the largest variable extensions are found in conserved locations. Studies with sera from patients diagnosed with GAS-associated acute rheumatic fever showed that each of the three T6 domains, and the largest of the variable extensions (V8), are targeted by IgG during infection in vivo. Although the GAS BP show large variations in size and sequence, the modular nature of the pilus proteins revealed by the T6 structure may aid the future design of a pilus-based vaccine. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Perez, Romel B; Tischer, Alexander; Auton, Matthew; Whitten, Steven T
2014-12-01
Molecular transduction of biological signals is understood primarily in terms of the cooperative structural transitions of protein macromolecules, providing a mechanism through which discrete local structure perturbations affect global macromolecular properties. The recognition that proteins lacking tertiary stability, commonly referred to as intrinsically disordered proteins (IDPs), mediate key signaling pathways suggests that protein structures without cooperative intramolecular interactions may also have the ability to couple local and global structure changes. Presented here are results from experiments that measured and tested the ability of disordered proteins to couple local changes in structure to global changes in structure. Using the intrinsically disordered N-terminal region of the p53 protein as an experimental model, a set of proline (PRO) and alanine (ALA) to glycine (GLY) substitution variants were designed to modulate backbone conformational propensities without introducing non-native intramolecular interactions. The hydrodynamic radius (R(h)) was used to monitor changes in global structure. Circular dichroism spectroscopy showed that the GLY substitutions decreased polyproline II (PP(II)) propensities relative to the wild type, as expected, and fluorescence methods indicated that substitution-induced changes in R(h) were not associated with folding. The experiments showed that changes in local PP(II) structure cause changes in R(h) that are variable and that depend on the intrinsic chain propensities of PRO and ALA residues, demonstrating a mechanism for coupling local and global structure changes. Molecular simulations that model our results were used to extend the analysis to other proteins and illustrate the generality of the observed PRO and alanine effects on the structures of IDPs. © 2014 Wiley Periodicals, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Booe, Jason M.; Walker, Christopher S.; Barwell, James
Association of receptor activity-modifying proteins (RAMP1-3) with the G protein-coupled receptor (GPCR) calcitonin receptor-like receptor (CLR) enables selective recognition of the peptides calcitonin gene-related peptide (CGRP) and adrenomedullin (AM) that have diverse functions in the cardiovascular and lymphatic systems. How peptides selectively bind GPCR:RAMP complexes is unknown. We report crystal structures of CGRP analog-bound CLR:RAMP1 and AM-bound CLR:RAMP2 extracellular domain heterodimers at 2.5 and 1.8 Å resolutions, respectively. The peptides similarly occupy a shared binding site on CLR with conformations characterized by a β-turn structure near their C termini rather than the α-helical structure common to peptides that bind relatedmore » GPCRs. The RAMPs augment the binding site with distinct contacts to the variable C-terminal peptide residues and elicit subtly different CLR conformations. Lastly, the structures and accompanying pharmacology data reveal how a class of accessory membrane proteins modulate ligand binding of a GPCR and may inform drug development targeting CLR:RAMP complexes.« less
Booe, Jason M.; Walker, Christopher S.; Barwell, James; ...
2015-05-14
Association of receptor activity-modifying proteins (RAMP1-3) with the G protein-coupled receptor (GPCR) calcitonin receptor-like receptor (CLR) enables selective recognition of the peptides calcitonin gene-related peptide (CGRP) and adrenomedullin (AM) that have diverse functions in the cardiovascular and lymphatic systems. How peptides selectively bind GPCR:RAMP complexes is unknown. We report crystal structures of CGRP analog-bound CLR:RAMP1 and AM-bound CLR:RAMP2 extracellular domain heterodimers at 2.5 and 1.8 Å resolutions, respectively. The peptides similarly occupy a shared binding site on CLR with conformations characterized by a β-turn structure near their C termini rather than the α-helical structure common to peptides that bind relatedmore » GPCRs. The RAMPs augment the binding site with distinct contacts to the variable C-terminal peptide residues and elicit subtly different CLR conformations. Lastly, the structures and accompanying pharmacology data reveal how a class of accessory membrane proteins modulate ligand binding of a GPCR and may inform drug development targeting CLR:RAMP complexes.« less
Lagercrantz, Ulf; Ryman, Nils
1990-02-01
This study describes the population structure of Norway spruce (Picea abies) as revealed by protein polymorphisms and morphological variation. Electrophoretically detectable genetic variability was examined at 22 protein loci in 70 populations from the natural range of the species in Europe. Like other conifers, Norway spruce exhibits a relatively large amount of genetic variability and little differentiation among populations. Sixteen polymorphic loci (73%) segregate for a total of 51 alleles, and average heterozygosity per population is 0.115. Approximately 5% of the total genetic diversity is explained by differences between populations (G ST = 0.052), and Nei's standard genetic distance is less than 0.04 in all cases. We suggest that the population structure largely reflects relatively recent historical events related to the last glaciation and that Norway spruce is still in a process of adaptation and differentiation. There is a clear geographic pattern in the variation of allele frequencies. A major part of the allelefrequency variation can be accounted for by a few synthetic variables (principal components), and 80% of the variation of the first principal component is "explained" by latitude and longitude. The central European populations are consistently depauperate of genetic variability, most likely as an effect of severe restrictions of population size during the last glaciation. The pattern of differentiation at protein loci is very similar to that observed for seven morphological traits examined. This similarity suggests that the same evolutionary forces have acted upon both sets of characters. © 1990 The Society for the Study of Evolution.
Quantifying side-chain conformational variations in protein structure
Miao, Zhichao; Cao, Yang
2016-01-01
Protein side-chain conformation is closely related to their biological functions. The side-chain prediction is a key step in protein design, protein docking and structure optimization. However, side-chain polymorphism comprehensively exists in protein as various types and has been long overlooked by side-chain prediction. But such conformational variations have not been quantitatively studied and the correlations between these variations and residue features are vague. Here, we performed statistical analyses on large scale data sets and found that the side-chain conformational flexibility is closely related to the exposure to solvent, degree of freedom and hydrophilicity. These analyses allowed us to quantify different types of side-chain variabilities in PDB. The results underscore that protein side-chain conformation prediction is not a single-answer problem, leading us to reconsider the assessment approaches of side-chain prediction programs. PMID:27845406
Quantifying side-chain conformational variations in protein structure
NASA Astrophysics Data System (ADS)
Miao, Zhichao; Cao, Yang
2016-11-01
Protein side-chain conformation is closely related to their biological functions. The side-chain prediction is a key step in protein design, protein docking and structure optimization. However, side-chain polymorphism comprehensively exists in protein as various types and has been long overlooked by side-chain prediction. But such conformational variations have not been quantitatively studied and the correlations between these variations and residue features are vague. Here, we performed statistical analyses on large scale data sets and found that the side-chain conformational flexibility is closely related to the exposure to solvent, degree of freedom and hydrophilicity. These analyses allowed us to quantify different types of side-chain variabilities in PDB. The results underscore that protein side-chain conformation prediction is not a single-answer problem, leading us to reconsider the assessment approaches of side-chain prediction programs.
Quantifying side-chain conformational variations in protein structure.
Miao, Zhichao; Cao, Yang
2016-11-15
Protein side-chain conformation is closely related to their biological functions. The side-chain prediction is a key step in protein design, protein docking and structure optimization. However, side-chain polymorphism comprehensively exists in protein as various types and has been long overlooked by side-chain prediction. But such conformational variations have not been quantitatively studied and the correlations between these variations and residue features are vague. Here, we performed statistical analyses on large scale data sets and found that the side-chain conformational flexibility is closely related to the exposure to solvent, degree of freedom and hydrophilicity. These analyses allowed us to quantify different types of side-chain variabilities in PDB. The results underscore that protein side-chain conformation prediction is not a single-answer problem, leading us to reconsider the assessment approaches of side-chain prediction programs.
A study of the thermal denaturation of the S-layer protein from Lactobacillus salivarius
NASA Astrophysics Data System (ADS)
Lighezan, Liliana; Georgieva, Ralitsa; Neagu, Adrian
2012-09-01
Surface layer (S-layer) proteins display an intrinsic self-assembly property, forming monomolecular crystalline arrays, identified in outermost structures of the cell envelope in many organisms, such as bacteria and archaea. Isolated S-layer proteins also possess the ability to recrystallize into regular lattices, being used in biotechnological applications, such as controlling the architecture of biomimetic surfaces. To this end, the stability of the S-layer proteins under high-temperature conditions is very important. In this study, the S-layer protein has been isolated from Lactobacillus salivarius 16 strain of human origin, and purified by cation-exchange chromatography. Using circular dichroism (CD) spectroscopy, we have investigated the thermal denaturation of the S-layer protein. The far- and near-UV CD spectra have been collected, and the temperature dependence of the CD signal in these spectral domains has been analyzed. The variable temperature results show that the secondary and tertiary structures of the S-layer protein change irreversibly due to the heating of the sample. After the cooling of the heated protein, the secondary and tertiary structures are partially recovered. The denaturation curves show that the protein unfolding depends on the sample concentration and on the heating rate. The secondary and tertiary structures of the protein suffer changes in the same temperature range. We have also detected an intermediate state in the protein denaturation pathway. Our results on the thermal behavior of the S-layer protein may be important for the use of S-layer proteins in biotechnological applications, as well as for a better understanding of the structure and function of S-layer proteins.
HippDB: a database of readily targeted helical protein-protein interactions.
Bergey, Christina M; Watkins, Andrew M; Arora, Paramjit S
2013-11-01
HippDB catalogs every protein-protein interaction whose structure is available in the Protein Data Bank and which exhibits one or more helices at the interface. The Web site accepts queries on variables such as helix length and sequence, and it provides computational alanine scanning and change in solvent-accessible surface area values for every interfacial residue. HippDB is intended to serve as a starting point for structure-based small molecule and peptidomimetic drug development. HippDB is freely available on the web at http://www.nyu.edu/projects/arora/hippdb. The Web site is implemented in PHP, MySQL and Apache. Source code freely available for download at http://code.google.com/p/helidb, implemented in Perl and supported on Linux. arora@nyu.edu.
Structure-related statistical singularities along protein sequences: a correlation study.
Colafranceschi, Mauro; Colosimo, Alfredo; Zbilut, Joseph P; Uversky, Vladimir N; Giuliani, Alessandro
2005-01-01
A data set composed of 1141 proteins representative of all eukaryotic protein sequences in the Swiss-Prot Protein Knowledge base was coded by seven physicochemical properties of amino acid residues. The resulting numerical profiles were submitted to correlation analysis after the application of a linear (simple mean) and a nonlinear (Recurrence Quantification Analysis, RQA) filter. The main RQA variables, Recurrence and Determinism, were subsequently analyzed by Principal Component Analysis. The RQA descriptors showed that (i) within protein sequences is embedded specific information neither present in the codes nor in the amino acid composition and (ii) the most sensitive code for detecting ordered recurrent (deterministic) patterns of residues in protein sequences is the Miyazawa-Jernigan hydrophobicity scale. The most deterministic proteins in terms of autocorrelation properties of primary structures were found (i) to be involved in protein-protein and protein-DNA interactions and (ii) to display a significantly higher proportion of structural disorder with respect to the average data set. A study of the scaling behavior of the average determinism with the setting parameters of RQA (embedding dimension and radius) allows for the identification of patterns of minimal length (six residues) as possible markers of zones specifically prone to inter- and intramolecular interactions.
Heindl, Philipp; García, Avelina Fernández; Butz, Peter; Pfaff, Eberhard; Tauscher, Bernhard
2006-03-01
Application of high pressure can be used for gentle pasteurizing of food, minimizing undesirable alterations such as vitamin losses and changes in taste and color. In addition, pressure has become a useful tool for investigating structural changes in proteins. Treatments of proteins with high pressure can reveal conformations that are not obtainable by other physical variables like temperature, since pressure favors structural transitions accompanied with smaller volumes. Here, we discuss both the potential use of high pressure to inactivate infectious TSE material and the application of this thermodynamic parameter for the investigation of prion folding. This review summarizes our findings on the effects of pressure on the structure of native infectious scrapie prions in hamster brain homogenates and on the structure of infectious prion rods isolated from diseased hamsters brains. Native prions were found to be pressure sensitive, whereas isolated prions revealed an extreme pressure-resistant structure. The discussion will be focused on the different pressure behavior of these prion isoforms, which points out differences in the protein structure that have not been taken into consideration before.
Cohen, Mark; Appleby, Brian; Safar, Jiri G
2016-01-01
Vast evidence on human prions demonstrates that variable disease phenotypes, rates of propagation, and targeting of distinct brain structures are determined by unique conformers (strains) of pathogenic prion protein (PrP(Sc)). Recent progress in the development of advanced biophysical tools that inventory structural characteristics of amyloid beta (Aβ) in the brain cortex of phenotypically diverse Alzheimer's disease (AD) patients, revealed unique spectrum of oligomeric particles in the cortex of rapidly progressive cases, implicating these structures in variable rates of propagation in the brain, and in distict disease manifestation. Since only ∼30% of phenotypic diversity of AD can be explained by polymorphisms in risk genes, these and transgenic bioassay data argue that structurally distinct Aβ particles play a major role in the diverse pathogenesis of AD, and may behave as distinct prion-like strains encoding diverse phenotypes. From these observations and our growing understanding of prions, there is a critical need for new strain-specific diagnostic strategies for misfolded proteins causing these elusive disorders. Since targeted drug therapy can induce mutation and evolution of prions into new strains, effective treatments of AD will require drugs that enhance clearance of pathogenic conformers, reduce the precursor protein, or inhibit the conversion of precursors into prion-like states.
Vigne, Emmanuelle; Marmonier, Aurélie; Komar, Véronique; Lemaire, Olivier; Fuchs, Marc
2009-09-01
Recombination was assessed in a vineyard site in which grapevines cross-protected with mild strains GHu of Grapevine fanleaf virus (GFLV) or Ta of Arabis mosaic virus (ArMV) were superinfected with GFLV field isolates following transmission by the nematode vector Xiphinema index. The genetic structure and variability within RNA2 of isolates from grapevines co-infected with GFLV field isolates and either GFLV-GHu or ArMV-Ta were characterized to identify intra- and interspecies recombinants. Sequence analysis and phylogenetic relationships inferred intraspecies recombination among GFLV field isolates but not between field isolates and GFLV-GHu. SISCAN analysis confirmed a mosaic structure for two GFLV field isolates for which recombination sites were located in the movement protein and coat protein genes. One of the recombinants was found in eight grapevines that were in close spatial proximity within the vineyard site, suggesting its transmission by X. index. No interspecies recombination was detected between GFLV field isolates and ArMV-Ta. Altogether, our findings suggest that mild protective strains GFLV-GHu and ArMV-Ta did not assist the emergence of viable recombinants to detectable level during a 12-year cross-protection trial. To our knowledge, this is the first extensive characterization of the genetic structure and variability of virus isolates in cross-protected plants.
Ko, Junsu; Park, Hahnbeom; Seok, Chaok
2012-08-10
Protein structures can be reliably predicted by template-based modeling (TBM) when experimental structures of homologous proteins are available. However, it is challenging to obtain structures more accurate than the single best templates by either combining information from multiple templates or by modeling regions that vary among templates or are not covered by any templates. We introduce GalaxyTBM, a new TBM method in which the more reliable core region is modeled first from multiple templates and less reliable, variable local regions, such as loops or termini, are then detected and re-modeled by an ab initio method. This TBM method is based on "Seok-server," which was tested in CASP9 and assessed to be amongst the top TBM servers. The accuracy of the initial core modeling is enhanced by focusing on more conserved regions in the multiple-template selection and multiple sequence alignment stages. Additional improvement is achieved by ab initio modeling of up to 3 unreliable local regions in the fixed framework of the core structure. Overall, GalaxyTBM reproduced the performance of Seok-server, with GalaxyTBM and Seok-server resulting in average GDT-TS of 68.1 and 68.4, respectively, when tested on 68 single-domain CASP9 TBM targets. For application to multi-domain proteins, GalaxyTBM must be combined with domain-splitting methods. Application of GalaxyTBM to CASP9 targets demonstrates that accurate protein structure prediction is possible by use of a multiple-template-based approach, and ab initio modeling of variable regions can further enhance the model quality.
3dRPC: a web server for 3D RNA-protein structure prediction.
Huang, Yangyu; Li, Haotian; Xiao, Yi
2018-04-01
RNA-protein interactions occur in many biological processes. To understand the mechanism of these interactions one needs to know three-dimensional (3D) structures of RNA-protein complexes. 3dRPC is an algorithm for prediction of 3D RNA-protein complex structures and consists of a docking algorithm RPDOCK and a scoring function 3dRPC-Score. RPDOCK is used to sample possible complex conformations of an RNA and a protein by calculating the geometric and electrostatic complementarities and stacking interactions at the RNA-protein interface according to the features of atom packing of the interface. 3dRPC-Score is a knowledge-based potential that uses the conformations of nucleotide-amino-acid pairs as statistical variables and that is used to choose the near-native complex-conformations obtained from the docking method above. Recently, we built a web server for 3dRPC. The users can easily use 3dRPC without installing it locally. RNA and protein structures in PDB (Protein Data Bank) format are the only needed input files. It can also incorporate the information of interface residues or residue-pairs obtained from experiments or theoretical predictions to improve the prediction. The address of 3dRPC web server is http://biophy.hust.edu.cn/3dRPC. yxiao@hust.edu.cn.
Automated design evolution of stereochemically randomized protein foldamers
NASA Astrophysics Data System (ADS)
Ranbhor, Ranjit; Kumar, Anil; Patel, Kirti; Ramakrishnan, Vibin; Durani, Susheel
2018-05-01
Diversification of chain stereochemistry opens up the possibilities of an ‘in principle’ increase in the design space of proteins. This huge increase in the sequence and consequent structural variation is aimed at the generation of smart materials. To diversify protein structure stereochemically, we introduced L- and D-α-amino acids as the design alphabet. With a sequence design algorithm, we explored the usage of specific variables such as chirality and the sequence of this alphabet in independent steps. With molecular dynamics, we folded stereochemically diverse homopolypeptides and evaluated their ‘fitness’ for possible design as protein-like foldamers. We propose a fitness function to prune the most optimal fold among 1000 structures simulated with an automated repetitive simulated annealing molecular dynamics (AR-SAMD) approach. The highly scored poly-leucine fold with sequence lengths of 24 and 30 amino acids were later sequence-optimized using a Dead End Elimination cum Monte Carlo based optimization tool. This paper demonstrates a novel approach for the de novo design of protein-like foldamers.
Smirnov, Ivan; Carletti, Eugénie; Kurkova, Inna; Nachon, Florian; Nicolet, Yvain; Mitkevich, Vladimir A.; Débat, Hélène; Avalle, Bérangère; Belogurov, Alexey A.; Kuznetsov, Nikita; Reshetnyak, Andrey; Masson, Patrick; Tonevitsky, Alexander G.; Ponomarenko, Natalia; Makarov, Alexander A.; Friboulet, Alain; Tramontano, Alfonso; Gabibov, Alexander
2011-01-01
Igs offer a versatile template for combinatorial and rational design approaches to the de novo creation of catalytically active proteins. We have used a covalent capture selection strategy to identify biocatalysts from within a human semisynthetic antibody variable fragment library that uses a nucleophilic mechanism. Specific phosphonylation at a single tyrosine within the variable light-chain framework was confirmed in a recombinant IgG construct. High-resolution crystallographic structures of unmodified and phosphonylated Fabs display a 15-Å-deep two-chamber cavity at the interface of variable light (VL) and variable heavy (VH) fragments having a nucleophilic tyrosine at the base of the site. The depth and structure of the pocket are atypical of antibodies in general but can be compared qualitatively with the catalytic site of cholinesterases. A structurally disordered heavy chain complementary determining region 3 loop, constituting a wall of the cleft, is stabilized after covalent modification by hydrogen bonding to the phosphonate tropinol moiety. These features and presteady state kinetics analysis indicate that an induced fit mechanism operates in this reaction. Mutations of residues located in this stabilized loop do not interfere with direct contacts to the organophosphate ligand but can interrogate second shell interactions, because the H3 loop has a conformation adjusted for binding. Kinetic and thermodynamic parameters along with computational docking support the active site model, including plasticity and simple catalytic components. Although relatively uncomplicated, this catalytic machinery displays both stereo- and chemical selectivity. The organophosphate pesticide paraoxon is hydrolyzed by covalent catalysis with rate-limiting dephosphorylation. This reactibody is, therefore, a kinetically selected protein template that has enzyme-like catalytic attributes. PMID:21896761
Banach, Mateusz; Konieczny, Leszek; Roterman, Irena
2014-10-21
In this paper we show that the fuzzy oil drop model represents a general framework for describing the generation of hydrophobic cores in proteins and thus provides insight into the influence of the water environment upon protein structure and stability. The model has been successfully applied in the study of a wide range of proteins, however this paper focuses specifically on domains representing immunoglobulin-like folds. Here we provide evidence that immunoglobulin-like domains, despite being structurally similar, differ with respect to their participation in the generation of hydrophobic core. It is shown that β-structural fragments in β-barrels participate in hydrophobic core formation in a highly differentiated manner. Quantitatively measured participation in core formation helps explain the variable stability of proteins and is shown to be related to their biological properties. This also includes the known tendency of immunoglobulin domains to form amyloids, as shown using transthyretin to reveal the clear relation between amyloidogenic properties and structural characteristics based on the fuzzy oil drop model. Copyright © 2014 The Authors. Published by Elsevier Ltd.. All rights reserved.
Simulation of urea-induced protein unfolding: a lesson from bovine β-lactoglobulin.
Eberini, Ivano; Emerson, Andrew; Sensi, Cristina; Ragona, Laura; Ricchiuto, Piero; Pedretti, Alessandro; Gianazza, Elisabetta; Tramontano, Anna
2011-09-01
To investigate the molecular mechanisms involved in the very initial stages of protein unfolding, we carried out one long (1 μs) simulation of bovine β-lactoglobulin (BLG) together with three (500 ns) supporting MD runs, in which the unfolding conditions were produced by adding the osmolyte urea to the simulated systems and/or by increasing the thermal energy raising the temperature from 300 to 350 K. BLG was chosen, since it is a well-characterized model protein, for which structural and folding properties have been widely investigated by X-ray and NMR. MD trajectories were analyzed not only in terms of standard progress variables, such as backbone H-bonds, gyration radius width, secondary structure elements, but also through the scrutiny of interactions and dynamical behavior of specific key residues previously pointed out and investigated by NMR and belonging to a well known hydrophobic cluster. MD trajectories simulated in different unfolding conditions suggest that urea destabilizes BLG structure weakening protein::protein hydrophobic interactions and the hydrogen bond network. The early unfolding events, better observed at higher temperature, affect both secondary and tertiary structure of the protein. Copyright © 2011 Elsevier Inc. All rights reserved.
Tertiary alphabet for the observable protein structural universe.
Mackenzie, Craig O; Zhou, Jianfu; Grigoryan, Gevorg
2016-11-22
Here, we systematically decompose the known protein structural universe into its basic elements, which we dub tertiary structural motifs (TERMs). A TERM is a compact backbone fragment that captures the secondary, tertiary, and quaternary environments around a given residue, comprising one or more disjoint segments (three on average). We seek the set of universal TERMs that capture all structure in the Protein Data Bank (PDB), finding remarkable degeneracy. Only ∼600 TERMs are sufficient to describe 50% of the PDB at sub-Angstrom resolution. However, more rare geometries also exist, and the overall structural coverage grows logarithmically with the number of TERMs. We go on to show that universal TERMs provide an effective mapping between sequence and structure. We demonstrate that TERM-based statistics alone are sufficient to recapitulate close-to-native sequences given either NMR or X-ray backbones. Furthermore, sequence variability predicted from TERM data agrees closely with evolutionary variation. Finally, locations of TERMs in protein chains can be predicted from sequence alone based on sequence signatures emergent from TERM instances in the PDB. For multisegment motifs, this method identifies spatially adjacent fragments that are not contiguous in sequence-a major bottleneck in structure prediction. Although all TERMs recur in diverse proteins, some appear specialized for certain functions, such as interface formation, metal coordination, or even water binding. Structural biology has benefited greatly from previously observed degeneracies in structure. The decomposition of the known structural universe into a finite set of compact TERMs offers exciting opportunities toward better understanding, design, and prediction of protein structure.
Tertiary alphabet for the observable protein structural universe
Mackenzie, Craig O.; Zhou, Jianfu; Grigoryan, Gevorg
2016-01-01
Here, we systematically decompose the known protein structural universe into its basic elements, which we dub tertiary structural motifs (TERMs). A TERM is a compact backbone fragment that captures the secondary, tertiary, and quaternary environments around a given residue, comprising one or more disjoint segments (three on average). We seek the set of universal TERMs that capture all structure in the Protein Data Bank (PDB), finding remarkable degeneracy. Only ∼600 TERMs are sufficient to describe 50% of the PDB at sub-Angstrom resolution. However, more rare geometries also exist, and the overall structural coverage grows logarithmically with the number of TERMs. We go on to show that universal TERMs provide an effective mapping between sequence and structure. We demonstrate that TERM-based statistics alone are sufficient to recapitulate close-to-native sequences given either NMR or X-ray backbones. Furthermore, sequence variability predicted from TERM data agrees closely with evolutionary variation. Finally, locations of TERMs in protein chains can be predicted from sequence alone based on sequence signatures emergent from TERM instances in the PDB. For multisegment motifs, this method identifies spatially adjacent fragments that are not contiguous in sequence—a major bottleneck in structure prediction. Although all TERMs recur in diverse proteins, some appear specialized for certain functions, such as interface formation, metal coordination, or even water binding. Structural biology has benefited greatly from previously observed degeneracies in structure. The decomposition of the known structural universe into a finite set of compact TERMs offers exciting opportunities toward better understanding, design, and prediction of protein structure. PMID:27810958
MacRae, T H
2000-06-01
Small heat shock/alpha-crystallin proteins are defined by conserved sequence of approximately 90 amino acid residues, termed the alpha-crystallin domain, which is bounded by variable amino- and carboxy-terminal extensions. These proteins form oligomers, most of uncertain quaternary structure, and oligomerization is prerequisite to their function as molecular chaperones. Sequence modelling and physical analyses show that the secondary structure of small heat shock/alpha-crystallin proteins is predominately beta-pleated sheet. Crystallography, site-directed spin-labelling and yeast two-hybrid selection demonstrate regions of secondary structure within the alpha-crystallin domain that interact during oligomer assembly, a process also dependent on the amino terminus. Oligomers are dynamic, exhibiting subunit exchange and organizational plasticity, perhaps leading to functional diversity. Exposure of hydrophobic residues by structural modification facilitates chaperoning where denaturing proteins in the molten globule state associate with oligomers. The flexible carboxy-terminal extension contributes to chaperone activity by enhancing the solubility of small heat shock/alpha-crystallin proteins. Site-directed mutagenesis has yielded proteins where the effect of the change on structure and function depends upon the residue modified, the organism under study and the analytical techniques used. Most revealing, substitution of a conserved arginine residue within the alpha-crystallin domain has a major impact on quaternary structure and chaperone action probably through realignment of beta-sheets. These mutations are linked to inherited diseases. Oligomer size is regulated by a stress-responsive cascade including MAPKAP kinase 2/3 and p38. Phosphorylation of small heat shock/alpha-crystallin proteins has important consequences within stressed cells, especially for microfilaments.
Raut, Ashlesha S; Kalonia, Devendra S
2015-09-08
Dual variable domain immunoglobulin proteins (DVD-Ig proteins) are large molecules (MW ∼ 200 kDa) with increased asymmetry because of their extended Y-like shape, which results in increased formulation challenges. Liquid-liquid phase separation (LLPS) of protein solutions into protein-rich and protein-poor phases reduces solution stability at intermediate concentrations and lower temperatures, and is a serious concern in formulation development as therapeutic proteins are generally stored at refrigerated conditions. In the current work, LLPS was studied for a DVD-Ig protein molecule as a function of solution conditions by measuring solution opalescence. LLPS of the protein was confirmed by equilibrium studies and by visually observing under microscope. The protein does not undergo any structural change after phase separation. Protein-protein interactions were measured by light scattering (kD) and Tcloud (temperature that marks the onset of phase separation). There is a good agreement between kD measured in dilute solution with Tcloud measured in the critical concentration range. Results indicate that the increased complexity of the molecule (with respect to size, shape, and charge distribution on the molecule) increases contribution of specific and nonspecific interactions in solution, which are affected by formulation factors, resulting in LLPS for DVD-Ig protein.
Knowles, D P; Cheevers, W P; McGuire, T C; Brassfield, A L; Harwood, W G; Stem, T A
1991-11-01
To define the structure of the caprine arthritis-encephalitis virus (CAEV) env gene and characterize genetic changes which occur during antigenic variation, we sequenced the env genes of CAEV-63 and CAEV-Co, two antigenic variants of CAEV defined by serum neutralization. The deduced primary translation product of the CAEV env gene consists of a 60- to 80-amino-acid signal peptide followed by an amino-terminal surface protein (SU) and a carboxy-terminal transmembrane protein (TM) separated by an Arg-Lys-Lys-Arg cleavage site. The signal peptide cleavage site was verified by amino-terminal amino acid sequencing of native CAEV-63 SU. In addition, immunoprecipitation of [35S]methionine-labeled CAEV-63 proteins by sera from goats immunized with recombinant vaccinia virus expressing the CAEV-63 env gene confirmed that antibodies induced by env-encoded recombinant proteins react specifically with native virion SU and TM. The env genes of CAEV-63 and CAEV-Co encode 28 conserved cysteines and 25 conserved potential N-linked glycosylation sites. Nucleotide sequence variability results in 62 amino acid changes and one deletion within the SU and 34 amino acid changes within the TM.
Knowles, D P; Cheevers, W P; McGuire, T C; Brassfield, A L; Harwood, W G; Stem, T A
1991-01-01
To define the structure of the caprine arthritis-encephalitis virus (CAEV) env gene and characterize genetic changes which occur during antigenic variation, we sequenced the env genes of CAEV-63 and CAEV-Co, two antigenic variants of CAEV defined by serum neutralization. The deduced primary translation product of the CAEV env gene consists of a 60- to 80-amino-acid signal peptide followed by an amino-terminal surface protein (SU) and a carboxy-terminal transmembrane protein (TM) separated by an Arg-Lys-Lys-Arg cleavage site. The signal peptide cleavage site was verified by amino-terminal amino acid sequencing of native CAEV-63 SU. In addition, immunoprecipitation of [35S]methionine-labeled CAEV-63 proteins by sera from goats immunized with recombinant vaccinia virus expressing the CAEV-63 env gene confirmed that antibodies induced by env-encoded recombinant proteins react specifically with native virion SU and TM. The env genes of CAEV-63 and CAEV-Co encode 28 conserved cysteines and 25 conserved potential N-linked glycosylation sites. Nucleotide sequence variability results in 62 amino acid changes and one deletion within the SU and 34 amino acid changes within the TM. Images PMID:1656067
Avelange-Macherel, Marie-Hélène; Payet, Nicole; Lalanne, David; Neveu, Martine; Tolleter, Dimitri; Burstin, Judith; Macherel, David
2015-07-01
LEAM, a late embryogenesis abundant protein, and HSP22, a small heat shock protein, were shown to accumulate in the mitochondria during pea (Pisum sativum L.) seed development, where they are expected to contribute to desiccation tolerance. Here, their expression was examined in seeds of 89 pea genotypes by Western blot analysis. All genotypes expressed LEAM and HSP22 in similar amounts. In contrast with HSP22, LEAM displayed different isoforms according to apparent molecular mass. Each of the 89 genotypes harboured a single LEAM isoform. Genomic and RT-PCR analysis revealed four LEAM genes differing by a small variable indel in the coding region. These variations were consistent with the apparent molecular mass of each isoform. Indels, which occurred in repeated domains, did not alter the main properties of LEAM. Structural modelling indicated that the class A α-helix structure, which allows interactions with the mitochondrial inner membrane in the dry state, was preserved in all isoforms, suggesting functionality is maintained. The overall results point out the essential character of LEAM and HSP22 in pea seeds. LEAM variability is discussed in terms of pea breeding history as well as LEA gene evolution mechanisms. © 2014 John Wiley & Sons Ltd.
Collagens--structure, function, and biosynthesis.
Gelse, K; Pöschl, E; Aigner, T
2003-11-28
The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified so far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. This review focuses on the distribution and function of various collagen types in different tissues. It introduces their basic structural subunits and points out major steps in the biosynthesis and supramolecular processing of fibrillar collagens as prototypical members of this protein family. A final outlook indicates the importance of different collagen types not only for the understanding of collagen-related diseases, but also as a basis for the therapeutical use of members of this protein family discussed in other chapters of this issue.
Chen, Tong; Ji, Dongchao; Tian, Shiping
2018-03-14
The assembly of protein complexes and compositional lipid patterning act together to endow cells with the plasticity required to maintain compositional heterogeneity with respect to individual proteins. Hence, the applications for imaging protein localization and dynamics require high accuracy, particularly at high spatio-temporal level. We provided experimental data for the applications of Variable-Angle Epifluorescence Microscopy (VAEM) in dissecting protein dynamics in plant cells. The VAEM-based co-localization analysis took penetration depth and incident angle into consideration. Besides direct overlap of dual-color fluorescence signals, the co-localization analysis was carried out quantitatively in combination with the methodology for calculating puncta distance and protein proximity index. Besides, simultaneous VAEM tracking of cytoskeletal dynamics provided more insights into coordinated responses of actin filaments and microtubules. Moreover, lateral motility of membrane proteins was analyzed by calculating diffusion coefficients and kymograph analysis, which represented an alternative method for examining protein motility. The present study presented experimental evidence on illustrating the use of VAEM in tracking and dissecting protein dynamics, dissecting endosomal dynamics, cell structure assembly along with membrane microdomain and protein motility in intact plant cells.
Genome Pool Strategy for Structural Coverage of Protein Families
Jaroszewski, Lukasz; Slabinski, Lukasz; Wooley, John; Deacon, Ashley M.; Lesley, Scott A.; Wilson, Ian. A.; Godzik, Adam
2010-01-01
As noticed by generations of structural biologists, closely homologous proteins may have substantially different crystallization properties and propensities. These observations can be used to systematically introduce additional dimensionality into crystallization trials by targeting homologous proteins from multiple genomes in a “genome pool” strategy. Through extensive use of our recently introduced “crystallization feasibility score” (Slabinski et al., 2007a), we can explain that the genome pool strategy works well because the crystallization feasibility scores are surprisingly broad within families of homologous proteins, with most families containing a range of optimal to very difficult targets. We also show that some families can be regarded as relatively “easy”, where a significant number of proteins are predicted to have optimal crystallization features, and others are “very difficult”, where almost none are predicted to result in a crystal structure. Thus, the outcome of such variable distributions of such crystallizability' preferences leads to uneven structural coverage of known families, with “easier” or “optimal” families having several times more solved structures than “very difficult” ones. Nevertheless, this latter category can be successfully targeted by increasing the number of genomes that are used to select targets from a given family. On average, adding 10 new genomes to the “genome pool” provides more promising targets for 7 “very difficult” families. In contrast, our crystallization feasibility score does not indicate that any specific microbial genomes can be readily classified as “easier” or “very difficult” with respect to providing suitable candidates for crystallization and structure determination. Finally, our analyses show that specific physicochemical properties of the protein sequence favor successful outcomes for structure determination and, hence, the group of proteins with known 3D structures is systematically different from the general pool of known proteins. We, therefore, assess the structural consequences of these differences in protein sequence and protein biophysical properties. PMID:19000818
2010-01-01
Background Puf proteins have important roles in controlling gene expression at the post-transcriptional level by promoting RNA decay and repressing translation. The Pumilio homology domain (PUM-HD) is a conserved region within Puf proteins that binds to RNA with sequence specificity. Although Puf proteins have been well characterized in animal and fungal systems, little is known about the structural and functional characteristics of Puf-like proteins in plants. Results The Arabidopsis and rice genomes code for 26 and 19 Puf-like proteins, respectively, each possessing eight or fewer Puf repeats in their PUM-HD. Key amino acids in the PUM-HD of several of these proteins are conserved with those of animal and fungal homologs, whereas other plant Puf proteins demonstrate extensive variability in these amino acids. Three-dimensional modeling revealed that the predicted structure of this domain in plant Puf proteins provides a suitable surface for binding RNA. Electrophoretic gel mobility shift experiments showed that the Arabidopsis AtPum2 PUM-HD binds with high affinity to BoxB of the Drosophila Nanos Response Element I (NRE1) RNA, whereas a point mutation in the core of the NRE1 resulted in a significant reduction in binding affinity. Transient expression of several of the Arabidopsis Puf proteins as fluorescent protein fusions revealed a dynamic, punctate cytoplasmic pattern of localization for most of these proteins. The presence of predicted nuclear export signals and accumulation of AtPuf proteins in the nucleus after treatment of cells with leptomycin B demonstrated that shuttling of these proteins between the cytosol and nucleus is common among these proteins. In addition to the cytoplasmically enriched AtPum proteins, two AtPum proteins showed nuclear targeting with enrichment in the nucleolus. Conclusions The Puf family of RNA-binding proteins in plants consists of a greater number of members than any other model species studied to date. This, along with the amino acid variability observed within their PUM-HDs, suggests that these proteins may be involved in a wide range of post-transcriptional regulatory events that are important in providing plants with the ability to respond rapidly to changes in environmental conditions and throughout development. PMID:20214804
Urvoas, Agathe; Guellouz, Asma; Valerio-Lepiniec, Marie; Graille, Marc; Durand, Dominique; Desravines, Danielle C; van Tilbeurgh, Herman; Desmadril, Michel; Minard, Philippe
2010-11-26
Repeat proteins have a modular organization and a regular architecture that make them attractive models for design and directed evolution experiments. HEAT repeat proteins, although very common, have not been used as a scaffold for artificial proteins, probably because they are made of long and irregular repeats. Here, we present and validate a consensus sequence for artificial HEAT repeat proteins. The sequence was defined from the structure-based sequence analysis of a thermostable HEAT-like repeat protein. Appropriate sequences were identified for the N- and C-caps. A library of genes coding for artificial proteins based on this sequence design, named αRep, was assembled using new and versatile methodology based on circular amplification. Proteins picked randomly from this library are expressed as soluble proteins. The biophysical properties of proteins with different numbers of repeats and different combinations of side chains in hypervariable positions were characterized. Circular dichroism and differential scanning calorimetry experiments showed that all these proteins are folded cooperatively and are very stable (T(m) >70 °C). Stability of these proteins increases with the number of repeats. Detailed gel filtration and small-angle X-ray scattering studies showed that the purified proteins form either monomers or dimers. The X-ray structure of a stable dimeric variant structure was solved. The protein is folded with a highly regular topology and the repeat structure is organized, as expected, as pairs of alpha helices. In this protein variant, the dimerization interface results directly from the variable surface enriched in aromatic residues located in the randomized positions of the repeats. The dimer was crystallized both in an apo and in a PEG-bound form, revealing a very well defined binding crevice and some structure flexibility at the interface. This fortuitous binding site could later prove to be a useful binding site for other low molecular mass partners. Copyright © 2010 Elsevier Ltd. All rights reserved.
Systematic Comparison of Crystal and NMR Protein Structures Deposited in the Protein Data Bank
Sikic, Kresimir; Tomic, Sanja; Carugo, Oliviero
2010-01-01
Nearly all the macromolecular three-dimensional structures deposited in Protein Data Bank were determined by either crystallographic (X-ray) or Nuclear Magnetic Resonance (NMR) spectroscopic methods. This paper reports a systematic comparison of the crystallographic and NMR results deposited in the files of the Protein Data Bank, in order to find out to which extent these information can be aggregated in bioinformatics. A non-redundant data set containing 109 NMR – X-ray structure pairs of nearly identical proteins was derived from the Protein Data Bank. A series of comparisons were performed by focusing the attention towards both global features and local details. It was observed that: (1) the RMDS values between NMR and crystal structures range from about 1.5 Å to about 2.5 Å; (2) the correlation between conformational deviations and residue type reveals that hydrophobic amino acids are more similar in crystal and NMR structures than hydrophilic amino acids; (3) the correlation between solvent accessibility of the residues and their conformational variability in solid state and in solution is relatively modest (correlation coefficient = 0.462); (4) beta strands on average match better between NMR and crystal structures than helices and loops; (5) conformational differences between loops are independent of crystal packing interactions in the solid state; (6) very seldom, side chains buried in the protein interior are observed to adopt different orientations in the solid state and in solution. PMID:21293729
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zemla, A; Lang, D; Kostova, T
2010-11-29
Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory - still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could overcome these difficulties and facilitatemore » the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV, a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus and demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique or that shared structural similarity with structures that are distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein. StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected residues at a given sequence position.« less
Papaneophytou, Christos P; Kontopidis, George
2014-02-01
The supply of many valuable proteins that have potential clinical or industrial use is often limited by their low natural availability. With the modern advances in genomics, proteomics and bioinformatics, the number of proteins being produced using recombinant techniques is exponentially increasing and seems to guarantee an unlimited supply of recombinant proteins. The demand of recombinant proteins has increased as more applications in several fields become a commercial reality. Escherichia coli (E. coli) is the most widely used expression system for the production of recombinant proteins for structural and functional studies. However, producing soluble proteins in E. coli is still a major bottleneck for structural biology projects. One of the most challenging steps in any structural biology project is predicting which protein or protein fragment will express solubly and purify for crystallographic studies. The production of soluble and active proteins is influenced by several factors including expression host, fusion tag, induction temperature and time. Statistical designed experiments are gaining success in the production of recombinant protein because they provide information on variable interactions that escape the "one-factor-at-a-time" method. Here, we review the most important factors affecting the production of recombinant proteins in a soluble form. Moreover, we provide information about how the statistical design experiments can increase protein yield and purity as well as find conditions for crystal growth. Copyright © 2013 Elsevier Inc. All rights reserved.
Nuttall, S D; Krishnan, U V; Hattarki, M; De Gori, R; Irving, R A; Hudson, P J
2001-08-01
The new antigen receptor (NAR) from nurse sharks consists of an immunoglobulin variable domain attached to five constant domains, and is hypothesised to function as an antigen-binding antibody-like molecule. To determine whether the NAR is present in other species we have isolated a number of new antigen receptor variable domains from the spotted wobbegong shark (Orectolobus maculatus) and compared their structure to that of the nurse shark protein. To determine whether these wNARs can function as antigen-binding proteins, we have used them as scaffolds for the construction of protein libraries in which the CDR3 loop was randomised, and displayed the resulting recombinant domains on the surface of fd bacteriophages. On selection against several protein antigens, the highest affinity wNAR proteins were generated against the Gingipain K protease from Porphyromonas gingivalis. One wNAR protein bound Gingipain K specifically by ELISA and BIAcore analysis and, when expressed in E. coli and purified by affinity chromatography, eluted from an FPLC column as a single peak consistent with folding into a monomeric protein. Naturally occurring nurse shark and wobbegong NAR variable domains exhibit conserved cysteine residues within the CDR1 and CDR3 loops which potentially form disulphide linkages and enhance protein stability; proteins isolated from the in vitro NAR wobbegong library showed similar selection for such paired cysteine residues. Thus, the New Antigen Receptor represents a protein scaffold with possible stability advantages over conventional antibodies when used in in vitro molecular libraries.
Guellouz, Asma; Valerio-Lepiniec, Marie; Urvoas, Agathe; Chevrel, Anne; Graille, Marc; Fourati-Kammoun, Zaineb; Desmadril, Michel; van Tilbeurgh, Herman; Minard, Philippe
2013-01-01
We previously designed a new family of artificial proteins named αRep based on a subgroup of thermostable helicoidal HEAT-like repeats. We have now assembled a large optimized αRep library. In this library, the side chains at each variable position are not fully randomized but instead encoded by a distribution of codons based on the natural frequency of side chains of the natural repeats family. The library construction is based on a polymerization of micro-genes and therefore results in a distribution of proteins with a variable number of repeats. We improved the library construction process using a "filtration" procedure to retain only fully coding modules that were recombined to recreate sequence diversity. The final library named Lib2.1 contains 1.7×10(9) independent clones. Here, we used phage display to select, from the previously described library or from the new library, new specific αRep proteins binding to four different non-related predefined protein targets. Specific binders were selected in each case. The results show that binders with various sizes are selected including relatively long sequences, with up to 7 repeats. ITC-measured affinities vary with Kd values ranging from micromolar to nanomolar ranges. The formation of complexes is associated with a significant thermal stabilization of the bound target protein. The crystal structures of two complexes between αRep and their cognate targets were solved and show that the new interfaces are established by the variable surfaces of the repeated modules, as well by the variable N-cap residues. These results suggest that αRep library is a new and versatile source of tight and specific binding proteins with favorable biophysical properties.
Chevrel, Anne; Graille, Marc; Fourati-Kammoun, Zaineb; Desmadril, Michel; van Tilbeurgh, Herman; Minard, Philippe
2013-01-01
We previously designed a new family of artificial proteins named αRep based on a subgroup of thermostable helicoidal HEAT-like repeats. We have now assembled a large optimized αRep library. In this library, the side chains at each variable position are not fully randomized but instead encoded by a distribution of codons based on the natural frequency of side chains of the natural repeats family. The library construction is based on a polymerization of micro-genes and therefore results in a distribution of proteins with a variable number of repeats. We improved the library construction process using a “filtration” procedure to retain only fully coding modules that were recombined to recreate sequence diversity. The final library named Lib2.1 contains 1.7×109 independent clones. Here, we used phage display to select, from the previously described library or from the new library, new specific αRep proteins binding to four different non-related predefined protein targets. Specific binders were selected in each case. The results show that binders with various sizes are selected including relatively long sequences, with up to 7 repeats. ITC-measured affinities vary with Kd values ranging from micromolar to nanomolar ranges. The formation of complexes is associated with a significant thermal stabilization of the bound target protein. The crystal structures of two complexes between αRep and their cognate targets were solved and show that the new interfaces are established by the variable surfaces of the repeated modules, as well by the variable N-cap residues. These results suggest that αRep library is a new and versatile source of tight and specific binding proteins with favorable biophysical properties. PMID:24014183
CWDPRNP: A tool for cervid prion sequence analysis in program R
Miller, William L.; Walter, W. David
2017-01-01
Chronic wasting disease is a fatal, neurological disease caused by an infectious prion protein, which affects economically and ecologically important members of the family Cervidae. Single nucleotide polymorphisms within the prion protein gene have been linked to differential susceptibility to the disease in many species. Wildlife managers are seeking to determine the frequencies of disease-associated alleles and genotypes and delineate spatial genetic patterns. The CWDPRNP package, implemented in program R, provides a unified framework for analyzing prion protein gene variability and spatial structure.
Machado, Agnes Thiane Pereira; Fonseca, Emanuella Maria Barreto; Reis, Marcelo Augusto Dos; Saraiva, Antonio Marcos; Santos, Clelton Aparecido Dos; de Toledo, Marcelo Augusto Szymanski; Polikarpov, Igor; de Souza, Anete Pereira; Aparicio, Ricardo; Iulek, Jorge
2017-10-01
Xylella fastidiosa is a xylem-limited bacterium that infects a wide variety of plants. Stationary phase survival protein E is classified as a nucleotidase, which is expressed when bacterial cells are in the stationary growth phase and subjected to environmental stresses. Here, we report four refined X-ray structures of this protein from X. fastidiosa in four different crystal forms in the presence and/or absence of the substrate 3'-AMP. In all chains, the conserved loop verified in family members assumes a closed conformation in either condition. Therefore, the enzymatic mechanism for the target protein might be different of its homologs. Two crystal forms exhibit two monomers whereas the other two show four monomers in the asymmetric unit. While the biological unit has been characterized as a tetramer, differences of their sizes and symmetry are remarkable. Four conformers identified by Small-Angle X-ray Scattering (SAXS) in a ligand-free solution are related to the low frequency normal modes of the crystallographic structures associated with rigid body-like protomer arrangements responsible for the longitudinal and symmetric adjustments between tetramers. When the substrate is present in solution, only two conformers are selected. The most prominent conformer for each case is associated to a normal mode able to elongate the protein by moving apart two dimers. To our knowledge, this work was the first investigation based on the normal modes that analyzed the quaternary structure variability for an enzyme of the SurE family followed by crystallography and SAXS validation. The combined results raise new directions to study allosteric features of XfSurE protein. © 2017 Wiley Periodicals, Inc.
Kinetics of the LOV domain of ZEITLUPE determine its circadian function in Arabidopsis
Pudasaini, Ashutosh; Shim, Jae Sung; Song, Young Hun; Shi, Hua; Kiba, Takatoshi; Somers, David E; Imaizumi, Takato; Zoltowski, Brian D
2017-01-01
A LOV (Light, Oxygen, or Voltage) domain containing blue-light photoreceptor ZEITLUPE (ZTL) directs circadian timing by degrading clock proteins in plants. Functions hinge upon allosteric differences coupled to the ZTL photocycle; however, structural and kinetic information was unavailable. Herein, we tune the ZTL photocycle over two orders of magnitude. These variants reveal that ZTL complexes with targets independent of light, but dictates enhanced protein degradation in the dark. In vivo experiments definitively show photocycle kinetics dictate the rate of clock component degradation, thereby impacting circadian period. Structural studies demonstrate that photocycle dependent activation of ZTL depends on an unusual dark-state conformation of ZTL. Crystal structures of ZTL LOV domain confirm delineation of structural and kinetic mechanisms and identify an evolutionarily selected allosteric hinge differentiating modes of PAS/LOV signal transduction. The combined biochemical, genetic and structural studies provide new mechanisms indicating how PAS/LOV proteins integrate environmental variables in complex networks. DOI: http://dx.doi.org/10.7554/eLife.21646.001 PMID:28244872
Structure of the Minor Pseudopilin EpsH From the Type 2 Secretion System of Vibrio Cholerae
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yanez, M.E.; Korotkov, K.V.; Abendroth, J.
2009-05-28
Many Gram-negative bacteria use the multi-protein type II secretion system (T2SS) to selectively translocate virulence factors from the periplasmic space into the extracellular environment. In Vibrio cholerae the T2SS is called the extracellular protein secretion (Eps) system, which translocates cholera toxin and several enzymes in their folded state across the outer membrane. Five proteins of the T2SS, the pseudopilins, are thought to assemble into a pseudopilus, which may control the outer membrane pore EpsD, and participate in the active export of proteins in a 'piston-like' manner. We report here the 2.0 {angstrom} resolution crystal structure of an N-terminally truncated variantmore » of EpsH, a minor pseudopilin from Vibrio cholerae. While EpsH maintains an N-terminal {alpha}-helix and C-terminal {beta}-sheet consistent with the type 4a pilin fold, structural comparisons reveal major differences between the minor pseudopilin EpsH and the major pseudopilin GspG from Klebsiella oxytoca: EpsH contains a large {beta}-sheet in the variable domain, where GspG contains an {alpha}-helix. Most importantly, EpsH contains at its surface a hydrophobic crevice between its variable and conserved {beta}-sheets, wherein a majority of the conserved residues within the EpsH family are clustered. In a tentative model of a T2SS pseudopilus with EpsH at its tip, the conserved crevice faces away from the helix axis. This conserved surface region may be critical for interacting with other proteins from the T2SS machinery.« less
Amino acid and structural variability of Yersinia pestis LcrV protein
DOE Office of Scientific and Technical Information (OSTI.GOV)
Anisimov, A P; Dentovskaya, S V; Panfertsev, E A
2009-11-09
The LcrV protein is a multifunctional virulence factor and protective antigen of the plague bacterium which is generally conserved between the epidemic strains of Yersinia pestis. They investigated the diversity in the LcrV sequences among non-epidemic Y. pestis strains which have a limited virulence in selected animal models and for humans. Sequencing of lcrV genes from ten Y. pestis strains belonging to different phylogenetic groups (subspecies) showed that the LcrV proteins possess four major variable hotspots at positions 18, 72, 273, and 324-326. These major variations, together with other minor substitutions in amino acid sequences, allowed them to classify themore » LcrV alleles into five sequence types (A-E). They observed that the strains of different Y. pestis subspecies can have the same typ of LcrV, and different types of LcrV can exist within the same natural plague focus. The LcrV polymorphisms were structurally analyzed by comparing the modeled structures of LcrV from all available strains. All changes except one occurred either in flexible regions or on the surface of the protein, but local chemical properties (i.e. those of a hydrophobic, hydrophilic, amphipathic, or charged nature) were conserved across all of the strains. Polymorphisms in flexible and surface regions are likely subject to less selective pressure, and have a limited impact on the structure. In contrast, the substitution of tryptophan at position 113 with either glutamic acid or glycine likely has a serious influence on the regional structure of the protein, and these mutations might have an effect on the function of LcrV. The polymorphisms at positions 18, 72 and 273 were accountable for differences in oligomerization of LcrV. The importance of the latter property in emergence of epidemic strains of Y. pestis during evolution of this pathogen will need to be further investigated.« less
Apoferritin fibers: a new template for 1D fluorescent hybrid nanostructures
NASA Astrophysics Data System (ADS)
Jurado, Rocío; Castello, Fabio; Bondia, Patricia; Casado, Santiago; Flors, Cristina; Cuesta, Rafael; Domínguez-Vera, José M.; Orte, Angel; Gálvez, Natividad
2016-05-01
Recently, research in the field of protein amyloid fibers has gained great attention due to the use of these materials as nanoscale templates for the construction of functional hybrid materials. The formation of apoferritin amyloid-like protein fibers is demonstrated herein for the first time. The morphology, size and stiffness of these one-dimensional structures are comparable to the fibers formed by β-lactoglobulin, a protein frequently used as a model in the study of amyloid-like fibrillar proteins. Nanometer-sized globular apoferritin is capable of self-assembling to form 1D micrometer-sized structures after being subjected to a heating process. Depending on the experimental conditions, fibers with different morphologies and sizes are obtained. The wire-like protein structure is rich in functional groups and allows chemical functionalization with diverse quantum dots (QD), as well as with different Alexa Fluor (AF) dyes, leading to hybrid fluorescent fibers with variable emission wavelengths, from green to near infrared, depending on the QD and AFs coupled. For fibers containing the pair AF488 and AF647, efficient fluorescence energy transfer from the covalently coupled donor (AF488) to acceptor tags (AF647) takes place. Apoferritin fibers are proposed here as a new promising template for obtaining hybrid functional materials.Recently, research in the field of protein amyloid fibers has gained great attention due to the use of these materials as nanoscale templates for the construction of functional hybrid materials. The formation of apoferritin amyloid-like protein fibers is demonstrated herein for the first time. The morphology, size and stiffness of these one-dimensional structures are comparable to the fibers formed by β-lactoglobulin, a protein frequently used as a model in the study of amyloid-like fibrillar proteins. Nanometer-sized globular apoferritin is capable of self-assembling to form 1D micrometer-sized structures after being subjected to a heating process. Depending on the experimental conditions, fibers with different morphologies and sizes are obtained. The wire-like protein structure is rich in functional groups and allows chemical functionalization with diverse quantum dots (QD), as well as with different Alexa Fluor (AF) dyes, leading to hybrid fluorescent fibers with variable emission wavelengths, from green to near infrared, depending on the QD and AFs coupled. For fibers containing the pair AF488 and AF647, efficient fluorescence energy transfer from the covalently coupled donor (AF488) to acceptor tags (AF647) takes place. Apoferritin fibers are proposed here as a new promising template for obtaining hybrid functional materials. Electronic supplementary information (ESI) available: TEM images of ferritin protein fiber formation, and apoferritin after 18 days of heat treatment; FLIM-PIE technique details; fluorescence emission spectra of apoferritin and β-lactoglobulin fibers functionalized with different QDs. See DOI: 10.1039/c6nr01044j
Deng, Dong; Yan, Chuangye; Wu, Jianping; Pan, Xiaojing; Yan, Nieng
2014-04-01
Transcription activator-like (TAL) effectors specifically bind to double stranded (ds) DNA through a central domain of tandem repeats. Each TAL effector (TALE) repeat comprises 33-35 amino acids and recognizes one specific DNA base through a highly variable residue at a fixed position in the repeat. Structural studies have revealed the molecular basis of DNA recognition by TALE repeats. Examination of the overall structure reveals that the basic building block of TALE protein, namely a helical hairpin, is one-helix shifted from the previously defined TALE motif. Here we wish to suggest a structure-based re-demarcation of the TALE repeat which starts with the residues that bind to the DNA backbone phosphate and concludes with the base-recognition hyper-variable residue. This new numbering system is consistent with the α-solenoid superfamily to which TALE belongs, and reflects the structural integrity of TAL effectors. In addition, it confers integral number of TALE repeats that matches the number of bound DNA bases. We then present fifteen crystal structures of engineered dHax3 variants in complex with target DNA molecules, which elucidate the structural basis for the recognition of bases adenine (A) and guanine (G) by reported or uncharacterized TALE codes. Finally, we analyzed the sequence-structure correlation of the amino acid residues within a TALE repeat. The structural analyses reported here may advance the mechanistic understanding of TALE proteins and facilitate the design of TALEN with improved affinity and specificity.
[Regression analysis to select native-like structures from decoys of antigen-antibody docking].
Chen, Zhengshan; Chi, Xiangyang; Fan, Pengfei; Zhang, Guanying; Wang, Meirong; Yu, Changming; Chen, Wei
2018-06-25
Given the increasing exploitation of antibodies in different contexts such as molecular diagnostics and therapeutics, it would be beneficial to unravel properties of antigen-antibody interaction with modeling of computational protein-protein docking, especially, in the absence of a cocrystal structure. However, obtaining a native-like antigen-antibody structure remains challenging due in part to failing to reliably discriminate accurate from inaccurate structures among tens of thousands of decoys after computational docking with existing scoring function. We hypothesized that some important physicochemical and energetic features could be used to describe antigen-antibody interfaces and identify native-like antigen-antibody structure. We prepared a dataset, a subset of Protein-Protein Docking Benchmark Version 4.0, comprising 37 nonredundant 3D structures of antigen-antibody complexes, and used it to train and test multivariate logistic regression equation which took several important physicochemical and energetic features of decoys as dependent variables. Our results indicate that the ability to identify native-like structures of our method is superior to ZRANK and ZDOCK score for the subset of antigen-antibody complexes. And then, we use our method in workflow of predicting epitope of anti-Ebola glycoprotein monoclonal antibody-4G7 and identify three accurate residues in its epitope.
Patterns of amino acid conservation in human and animal immunodeficiency viruses.
Voitenko, Olga S; Dhroso, Andi; Feldmann, Anna; Korkin, Dmitry; Kalinina, Olga V
2016-09-01
Due to their high genomic variability, RNA viruses and retroviruses present a unique opportunity for detailed study of molecular evolution. Lentiviruses, with HIV being a notable example, are one of the best studied viral groups: hundreds of thousands of sequences are available together with experimentally resolved three-dimensional structures for most viral proteins. In this work, we use these data to study specific patterns of evolution of the viral proteins, and their relationship to protein interactions and immunogenicity. We propose a method for identification of two types of surface residues clusters with abnormal conservation: extremely conserved and extremely variable clusters. We identify them on the surface of proteins from HIV and other animal immunodeficiency viruses. Both types of clusters are overrepresented on the interaction interfaces of viral proteins with other proteins, nucleic acids or low molecular-weight ligands, both in the viral particle and between the virus and its host. In the immunodeficiency viruses, the interaction interfaces are not more conserved than the corresponding proteins on an average, and we show that extremely conserved clusters coincide with protein-protein interaction hotspots, predicted as the residues with the largest energetic contribution to the interaction. Extremely variable clusters have been identified here for the first time. In the HIV-1 envelope protein gp120, they overlap with known antigenic sites. These antigenic sites also contain many residues from extremely conserved clusters, hence representing a unique interacting interface enriched both in extremely conserved and in extremely variable clusters of residues. This observation may have important implication for antiretroviral vaccine development. A Python package is available at https://bioinf.mpi-inf.mpg.de/publications/viral-ppi-pred/ voitenko@mpi-inf.mpg.de or kalinina@mpi-inf.mpg.de Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Taube, Michał; Pieńkowska, Joanna R.; Jarmołowski, Artur; Kozak, Maciej
2014-01-01
SGT1 is an evolutionarily conserved eukaryotic protein involved in many important cellular processes. In plants, SGT1 is involved in resistance to disease. In a low ionic strength environment, the SGT1 protein tends to form dimers. The protein consists of three structurally independent domains (the tetratricopeptide repeats domain (TPR), the CHORD- and SGT1-containing domain (CS), and the SGT1-specific domain (SGS)), and two less conserved variable regions (VR1 and VR2). In the present study, we provide the low-resolution structure of the barley (Hordeum vulgare) SGT1 protein in solution and its dimer/monomer equilibrium using small-angle scattering of synchrotron radiation, ab-initio modeling and circular dichroism spectroscopy. The multivariate curve resolution least-square method (MCR-ALS) was applied to separate the scattering data of the monomeric and dimeric species from a complex mixture. The models of the barley SGT1 dimer and monomer were formulated using rigid body modeling with ab-initio structure prediction. Both oligomeric forms of barley SGT1 have elongated shapes with unfolded inter-domain regions. Circular dichroism spectroscopy confirmed that the barley SGT1 protein had a modular architecture, with an α-helical TPR domain, a β-sheet sandwich CS domain, and a disordered SGS domain separated by VR1 and VR2 regions. Using molecular docking and ab-initio protein structure prediction, a model of dimerization of the TPR domains was proposed. PMID:24714665
Predictive and comparative analysis of Ebolavirus proteins
Cong, Qian; Pei, Jimin; Grishin, Nick V
2015-01-01
Ebolavirus is the pathogen for Ebola Hemorrhagic Fever (EHF). This disease exhibits a high fatality rate and has recently reached a historically epidemic proportion in West Africa. Out of the 5 known Ebolavirus species, only Reston ebolavirus has lost human pathogenicity, while retaining the ability to cause EHF in long-tailed macaque. Significant efforts have been spent to determine the three-dimensional (3D) structures of Ebolavirus proteins, to study their interaction with host proteins, and to identify the functional motifs in these viral proteins. Here, in light of these experimental results, we apply computational analysis to predict the 3D structures and functional sites for Ebolavirus protein domains with unknown structure, including a zinc-finger domain of VP30, the RNA-dependent RNA polymerase catalytic domain and a methyltransferase domain of protein L. In addition, we compare sequences of proteins that interact with Ebolavirus proteins from RESTV-resistant primates with those from RESTV-susceptible monkeys. The host proteins that interact with GP and VP35 show an elevated level of sequence divergence between the RESTV-resistant and RESTV-susceptible species, suggesting that they may be responsible for host specificity. Meanwhile, we detect variable positions in protein sequences that are likely associated with the loss of human pathogenicity in RESTV, map them onto the 3D structures and compare their positions to known functional sites. VP35 and VP30 are significantly enriched in these potential pathogenicity determinants and the clustering of such positions on the surfaces of VP35 and GP suggests possible uncharacterized interaction sites with host proteins that contribute to the virulence of Ebolavirus. PMID:26158395
Predictive and comparative analysis of Ebolavirus proteins.
Cong, Qian; Pei, Jimin; Grishin, Nick V
2015-01-01
Ebolavirus is the pathogen for Ebola Hemorrhagic Fever (EHF). This disease exhibits a high fatality rate and has recently reached a historically epidemic proportion in West Africa. Out of the 5 known Ebolavirus species, only Reston ebolavirus has lost human pathogenicity, while retaining the ability to cause EHF in long-tailed macaque. Significant efforts have been spent to determine the three-dimensional (3D) structures of Ebolavirus proteins, to study their interaction with host proteins, and to identify the functional motifs in these viral proteins. Here, in light of these experimental results, we apply computational analysis to predict the 3D structures and functional sites for Ebolavirus protein domains with unknown structure, including a zinc-finger domain of VP30, the RNA-dependent RNA polymerase catalytic domain and a methyltransferase domain of protein L. In addition, we compare sequences of proteins that interact with Ebolavirus proteins from RESTV-resistant primates with those from RESTV-susceptible monkeys. The host proteins that interact with GP and VP35 show an elevated level of sequence divergence between the RESTV-resistant and RESTV-susceptible species, suggesting that they may be responsible for host specificity. Meanwhile, we detect variable positions in protein sequences that are likely associated with the loss of human pathogenicity in RESTV, map them onto the 3D structures and compare their positions to known functional sites. VP35 and VP30 are significantly enriched in these potential pathogenicity determinants and the clustering of such positions on the surfaces of VP35 and GP suggests possible uncharacterized interaction sites with host proteins that contribute to the virulence of Ebolavirus.
Characterization of tail sheath protein of giant bacteriophage phiKZ Pseudomonas aeruginosa
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kurochkina, Lidia P., E-mail: lpk@ibch.r; Aksyuk, Anastasia A.; Sachkova, Maria Yu.
2009-12-20
The tail sheath protein of giant bacteriophage phiKZ Pseudomonas aeruginosa encoded by gene 29 was identified and its expression system was developed. Localization of the protein on the virion was confirmed by immunoelectron microscopy. Properties of gene product (gp) 29 were studied by electron microscopy, immunoblotting and limited trypsinolysis. Recombinant gp29 assembles into the regular tubular structures (polysheaths) of variable length. Trypsin digestion of gp29 within polysheaths or extended sheath of virion results in specific cleavage of the peptide bond between Arg135 and Asp136. However, this cleavage does not affect polymeric structure of polysheaths, sheaths and viral infectivity. Digestion bymore » trypsin of the C-truncated gp29 mutant, lacking the ability to self-assemble, results in formation of a stable protease-resistant fragment. Although there is no sequence homology of phiKZ proteins to proteins of other bacteriophages, some characteristic biochemical properties of gp29 revealed similarities to the tail sheath protein of bacteriophage T4.« less
Protein change in plant evolution: tracing one thread connecting molecular and phenotypic diversity
Bartlett, Madelaine E.; Whipple, Clinton J.
2013-01-01
Proteins change over the course of evolutionary time. New protein-coding genes and gene families emerge and diversify, ultimately affecting an organism’s phenotype and interactions with its environment. Here we survey the range of structural protein change observed in plants and review the role these changes have had in the evolution of plant form and function. Verified examples tying evolutionary change in protein structure to phenotypic change remain scarce. We will review the existing examples, as well as draw from investigations into domestication, and quantitative trait locus (QTL) cloning studies searching for the molecular underpinnings of natural variation. The evolutionary significance of many cloned QTL has not been assessed, but all the examples identified so far have begun to reveal the extent of protein structural diversity tolerated in natural systems. This molecular (and phenotypic) diversity could come to represent part of natural selection’s source material in the adaptive evolution of novel traits. Protein structure and function can change in many distinct ways, but the changes we identified in studies of natural diversity and protein evolution were predicted to fall primarily into one of six categories: altered active and binding sites; altered protein–protein interactions; altered domain content; altered activity as an activator or repressor; altered protein stability; and hypomorphic and hypermorphic alleles. There was also variability in the evolutionary scale at which particular changes were observed. Some changes were detected at both micro- and macroevolutionary timescales, while others were observed primarily at deep or shallow phylogenetic levels. This variation might be used to determine the trajectory of future investigations in structural molecular evolution. PMID:24124420
On the mineral core of ferritin-like proteins: structural and magnetic characterization
NASA Astrophysics Data System (ADS)
García-Prieto, A.; Alonso, J.; Muñoz, D.; Marcano, L.; Abad Díaz de Cerio, A.; Fernández de Luis, R.; Orue, I.; Mathon, O.; Muela, A.; Fdez-Gubieda, M. L.
2015-12-01
It is generally accepted that the mineral core synthesized by ferritin-like proteins consists of a ferric oxy-hydroxide mineral similar to ferrihydrite in the case of horse spleen ferritin (HoSF) and an oxy-hydroxide-phosphate phase in plant and prokaryotic ferritins. The structure reflects a dynamic process of deposition and dissolution, influenced by different biological, chemical and physical variables. In this work we shed light on this matter by combining a structural (High Resolution Transmission Electron Microscopy (HRTEM) and Fe K-edge X-ray Absorption Spectroscopy (XAS)) and a magnetic study of the mineral core biomineralized by horse spleen ferritin (HoSF) and three prokaryotic ferritin-like proteins: bacterial ferritin (FtnA) and bacterioferritin (Bfr) from Escherichia coli and archaeal ferritin (PfFtn) from Pyrococcus furiosus. The prokaryotic ferritin-like proteins have been studied under native conditions and inside the cells for the sake of preserving their natural attributes. They share with HoSF a nanocrystalline structure rather than an amorphous one as has been frequently reported. However, the presence of phosphorus changes drastically the short-range order and magnetic response of the prokaryotic cores with respect to HoSF. The superparamagnetism observed in HoSF is absent in the prokaryotic proteins, which show a pure atomic-like paramagnetic behaviour attributed to phosphorus breaking the Fe-Fe exchange interaction.It is generally accepted that the mineral core synthesized by ferritin-like proteins consists of a ferric oxy-hydroxide mineral similar to ferrihydrite in the case of horse spleen ferritin (HoSF) and an oxy-hydroxide-phosphate phase in plant and prokaryotic ferritins. The structure reflects a dynamic process of deposition and dissolution, influenced by different biological, chemical and physical variables. In this work we shed light on this matter by combining a structural (High Resolution Transmission Electron Microscopy (HRTEM) and Fe K-edge X-ray Absorption Spectroscopy (XAS)) and a magnetic study of the mineral core biomineralized by horse spleen ferritin (HoSF) and three prokaryotic ferritin-like proteins: bacterial ferritin (FtnA) and bacterioferritin (Bfr) from Escherichia coli and archaeal ferritin (PfFtn) from Pyrococcus furiosus. The prokaryotic ferritin-like proteins have been studied under native conditions and inside the cells for the sake of preserving their natural attributes. They share with HoSF a nanocrystalline structure rather than an amorphous one as has been frequently reported. However, the presence of phosphorus changes drastically the short-range order and magnetic response of the prokaryotic cores with respect to HoSF. The superparamagnetism observed in HoSF is absent in the prokaryotic proteins, which show a pure atomic-like paramagnetic behaviour attributed to phosphorus breaking the Fe-Fe exchange interaction. Electronic supplementary information (ESI) available. See DOI: 10.1039/c5nr04446d
Kono, H; Saven, J G
2001-02-23
Combinatorial experiments provide new ways to probe the determinants of protein folding and to identify novel folding amino acid sequences. These types of experiments, however, are complicated both by enormous conformational complexity and by large numbers of possible sequences. Therefore, a quantitative computational theory would be helpful in designing and interpreting these types of experiment. Here, we present and apply a statistically based, computational approach for identifying the properties of sequences compatible with a given main-chain structure. Protein side-chain conformations are included in an atom-based fashion. Calculations are performed for a variety of similar backbone structures to identify sequence properties that are robust with respect to minor changes in main-chain structure. Rather than specific sequences, the method yields the likelihood of each of the amino acids at preselected positions in a given protein structure. The theory may be used to quantify the characteristics of sequence space for a chosen structure without explicitly tabulating sequences. To account for hydrophobic effects, we introduce an environmental energy that it is consistent with other simple hydrophobicity scales and show that it is effective for side-chain modeling. We apply the method to calculate the identity probabilities of selected positions of the immunoglobulin light chain-binding domain of protein L, for which many variant folding sequences are available. The calculations compare favorably with the experimentally observed identity probabilities.
Fumagalli, Marco; Scali, Stefano; Zuffi, Marco A L; Cagnone, Maddalena; Salvini, Roberta; Sacchi, Roberto
2017-01-01
Abstract Femoral glands of male lizards produce waxy secretions that are involved in inter- and intraspecific chemical communication. The main components of these secretions are proteins and lipids, the latter having been extensively studied and already associated to male quality. On the opposite, the composition and role of proteins are nearly unknown, the only available information coming from few studies on iguanids. These studies got the conclusion that proteins might have a communicative function, notably they could signal individual identity. A generalization of these findings requires the extension of protein analysis to other lizard families, and the primary detection of some patterns of individual variability. Using the common wall lizard Podarcis muralis as a model species, the protein fraction of the femoral pore secretions was investigated to provide the first characterization of this component in a lacertid lizard and to explore its source of variability, as a first step to support the hypothesized communicative role. Samples of proteins from femoral secretions were collected from 6 Italian populations and subjected to 1-dimensional electrophoresis. The binary vector of the band presence/absence was used to define the individual profiles. Protein fraction is found to have a structured pattern, with both an individual and a population component. Although the former supports the potential communicative role of proteins, the latter offers a double interpretation, phylogenetic or environmental, even though the phylogenetic effect seems more likely given the climatic resemblance of the considered sites. Further studies are necessary to shed light on both these issues. PMID:29492027
Nomura, Yayoi; Sato, Yumi; Suno, Ryoji; Horita, Shoichiro
2016-01-01
Abstract Fv antibody fragments have been used as co‐crystallization partners in structural biology, particularly in membrane protein crystallography. However, there are inherent technical issues associated with the large‐scale production of soluble, functional Fv fragments through conventional methods in various expression systems. To circumvent these problems, we developed a new method, in which a single synthetic polyprotein consisting of a variable light (VL) domain, an intervening removable affinity tag (iRAT), and a variable heavy (VH) domain is expressed by a Gram‐positive bacterial secretion system. This method ensures stoichiometric expression of VL and VH from the monocistronic construct followed by proper folding and assembly of the two variable domains. The iRAT segment can be removed by a site‐specific protease during the purification process to yield tag‐free Fv fragments suitable for crystallization trials. In vitro refolding step is not required to obtain correctly folded Fv fragments. As a proof of concept, we tested the iRAT‐based production of multiple Fv fragments, including a crystallization chaperone for a mammalian membrane protein as well as FDA‐approved therapeutic antibodies. The resulting Fv fragments were functionally active and crystallized in complex with the target proteins. The iRAT system is a reliable, rapid and broadly applicable means of producing milligram quantities of Fv fragments for structural and biochemical studies. PMID:27595817
Seasonal carcass composition and energy balance of female black ducks in Maine
Reinecke, K.J.; Stone, T.L.; Owen, R.B.
1982-01-01
Female Black Ducks (Anas rubripes) collected in Maine during the summer, fall, and winter of 1974-1976 showed significant seasonal variation in body weight, nonfat dry weight, gizzard and pectoral muscle weight, and fat, moisture, and protein content. Variation of body weight within and among seasons was correlated more strongly with carcass protein content, and with fat content during seasons of heavy lipid deposition, than with three structural size variables (culmen, tarsus, and sternum). Regression equations including fat and protein as independent variables accounted for 80-90% of the annual and seasonal variation in body weight; structural size variables alone accounted for less than 30%. Immature females averaged 54 and 99 g lighter, and carried 54 and 59 g less fat than adults during the fall and winter. Ducks of both age classes lost weight in December and January. Adult and immature females metabolized 59 and 64 g of fat and 17 and 25 g of protein in winter compared with 46 g of fat during the nesting season. Nutrient reserves are thus equally as important for the winter survival of these birds as for successfurl eproduction. Seasonal changes in carcass composition suggest that (1) fat deposited in late fall provides an energy reserve during winter, (2) a reduction in lean weight during winter may lower daily energy requirements and increase the effective amount of energy reserves, and (3) declining body weights during late winter may be an endogenous rhythm that reflects a shift in the expected benefits of an energy reserve compared to the costs of carrying additional weight,
Dru, P.; Bras, F.; Dezelee, S.; Gay, P.; Petitjean, A. M.; Pierre-Deneubourg, A.; Teninges, D.; Contamine, D.
1993-01-01
The ref(2)P gene of Drosophila melanogaster was identified by the discovery of two alleles, P(o) and P(p), respectively, permissive and restrictive for sigma rhabdovirus multiplication. A surprising variability of this gene was first noticed by the observation of size differences between the transcripts of permissive and restrictive alleles. In this paper, another restrictive allele, P(n), clearly distinct from P(p), is described: it exhibits a weaker antiviral effect than P(p) and differs from P(p) by its molecular structure. Five types of alleles were distinguished on the basis of their molecular structure, as revealed by S1 nuclease analysis of 17 D. melanogaster strains; three alleles were permissive and two restrictive. Comparison of the sequences of four haplotypes revealed numerous point mutations, two deletions (21 and 24 bp) and a complex event involving a 3-bp deletion, all affected the coding region. The unusual variability of the ref(2)P locus was confirmed by the high ratio of amino acid replacements to synonymous mutations (7:1), as compared to that of other genes, such as the Adh (2:42). Nevertheless, nucleotide sequence comparison with the Drosophila erecta ref(2)P gene shows that selective pressures are exerted to maintain the existence of a functional protein. The effects of this high variability on the ref(2)P protein are discussed in relation to its specific antiviral properties and to its function in D. melanogaster, where it is required for male fertility. PMID:8462852
Venken, Tom; Daelemans, Dirk; De Maeyer, Marc; Voet, Arnout
2012-06-01
The HIV Rev protein mediates the nuclear export of viral mRNA, and is thereby essential for the production of late viral proteins in the replication cycle. Rev forms a large organized multimeric protein-protein complex for proper functioning. Recently, the three-dimensional structures of a Rev dimer and tetramer have been resolved and provide the basis for a thorough structural analysis of the binding interaction. Here, molecular dynamics (MD) and binding free energy calculations were performed to elucidate the forces thriving dimerization and higher order multimerization of the Rev protein. It is found that despite the structural differences between each crystal structure, both display a similar behavior according to our calculations. Our analysis based on a molecular mechanics-generalized Born surface area (MM/GBSA) and a configurational entropy approach demonstrates that the higher order multimerization site is much weaker than the dimerization site. In addition, a quantitative hot spot analysis combined with a mutational analysis reveals the most contributing amino acid residues for protein interactions in agreement with experimental results. Additional residues were found in each interface, which are important for the protein interaction. The investigation of the thermodynamics of the Rev multimerization interactions performed here could be a further step in the development of novel antiretrovirals using structure based drug design. Moreover, the variability of the angle between each Rev monomer as measured during the MD simulations suggests a role of the Rev protein in allowing flexibility of the arginine rich domain (ARM) to accommodate RNA binding. Copyright © 2012 Wiley Periodicals, Inc.
Bryksa, Brian C; Grahame, Douglas A; Yada, Rickey Y
2017-05-01
The present study characterized the aspartic protease saposin-like domains of four plant species, Solanum tuberosum (potato), Hordeum vulgare L. (barley), Cynara cardunculus L. (cardoon; artichoke thistle) and Arabidopsis thaliana, in terms of bilayer disruption and fusion, and structure pH-dependence. Comparison of the recombinant saposin-like domains revealed that each induced leakage of bilayer vesicles composed of a simple phospholipid mixture with relative rates Arabidopsis>barley>cardoon>potato. When compared for leakage of bilayer composed of a vacuole-like phospholipid mixture, leakage was approximately five times higher for potato saposin-like domain compared to the others. In terms of fusogenic activity, distinctions between particle size profiles were noted among the four proteins, particularly for potato saposin-like domain. Bilayer fusion assays in reducing conditions resulted in altered fusion profiles except in the case of cardoon saposin-like domain which was virtually unchanged. Secondary structure profiles were similar across all four proteins under different pH conditions, although cardoon saposin-like domain appeared to have higher overall helix structure. Furthermore, increases in Trp emission upon protein-bilayer interactions suggested that protein structure rearrangements equilibrated with half-times ranging from 52 to 120s, with cardoon saposin-like domain significantly slower than the other three species. Overall, the present findings serve as a foundation for future studies seeking to delineate protein structural features and motifs in protein-bilayer interactions based upon variability in plant aspartic protease saposin-like domain structures. Copyright © 2017 Elsevier B.V. All rights reserved.
The Diversity of Yellow-Related Proteins in Sand Flies (Diptera: Psychodidae)
Sima, Michal; Novotny, Marian; Pravda, Lukas; Sumova, Petra; Rohousova, Iva; Volf, Petr
2016-01-01
Yellow-related proteins (YRPs) present in sand fly saliva act as affinity binders of bioamines, and help the fly to complete a bloodmeal by scavenging the physiological signals of damaged cells. They are also the main antigens in sand fly saliva and their recombinant form is used as a marker of host exposure to sand flies. Moreover, several salivary proteins and plasmids coding these proteins induce strong immune response in hosts bitten by sand flies and are being used to design protecting vaccines against Leishmania parasites. In this study, thirty two 3D models of different yellow-related proteins from thirteen sand fly species of two genera were constructed based on the known protein structure from Lutzomyia longipalpis. We also studied evolutionary relationships among species based on protein sequences as well as sequence and structural variability of their ligand-binding site. All of these 33 sand fly YRPs shared a similar structure, including a unique tunnel that connects the ligand-binding site with the solvent by two independent paths. However, intraspecific modifications found among these proteins affects the charges of the entrances to the tunnel, the length of the tunnel and its hydrophobicity. We suggest that these structural and sequential differences influence the ligand-binding abilities of these proteins and provide sand flies with a greater number of YRP paralogs with more nuanced answers to bioamines. All these characteristics allow us to better evaluate these proteins with respect to their potential use as part of anti-Leishmania vaccines or as an antigen to measure host exposure to sand flies. PMID:27812196
Structure of the Human Mitochondrial Ribosome Studied In Situ by Cryoelectron Tomography.
Englmeier, Robert; Pfeffer, Stefan; Förster, Friedrich
2017-10-03
Mitochondria maintain their own genome and its corresponding protein synthesis machine, the mitochondrial ribosome (mitoribosome). Mitoribosomes primarily synthesize highly hydrophobic proteins of the inner mitochondrial membrane. Recent studies revealed the complete structure of the isolated mammalian mitoribosome, but its mode of membrane association remained hypothetical. In this study, we used cryoelectron tomography to visualize human mitoribosomes in isolated mitochondria. The subtomogram average of the membrane-associated human mitoribosome reveals a single major contact site with the inner membrane, mediated by the mitochondria-specific protein mL45. A second rRNA-mediated contact site that is present in yeast is absent in humans, resulting in a more variable association of the human mitoribosome with the inner membrane. Despite extensive structural differences of mammalian and fungal mitoribosomal structure, the principal organization of peptide exit tunnel and the mL45 homolog remains invariant, presumably to align the mitoribosome with the membrane-embedded insertion machinery. Copyright © 2017 Elsevier Ltd. All rights reserved.
Kuan, Lisa; Schaffer, Jessica N.; Zouzias, Christos D.
2014-01-01
Proteus mirabilis is a Gram-negative enteric bacterium that causes complicated urinary tract infections, particularly in patients with indwelling catheters. Sequencing of clinical isolate P. mirabilis HI4320 revealed the presence of 17 predicted chaperone-usher fimbrial operons. We classified these fimbriae into three groups by their genetic relationship to other chaperone-usher fimbriae. Sixteen of these fimbriae are encoded by all seven currently sequenced P. mirabilis genomes. The predicted protein sequence of the major structural subunit for 14 of these fimbriae was highly conserved (≥95 % identity), whereas three other structural subunits (Fim3A, UcaA and Fim6A) were variable. Further examination of 58 clinical isolates showed that 14 of the 17 predicted major structural subunit genes of the fimbriae were present in most strains (>85 %). Transcription of the predicted major structural subunit genes for all 17 fimbriae was measured under different culture conditions designed to mimic conditions in the urinary tract. The majority of the fimbrial genes were induced during stationary phase, static culture or colony growth when compared to exponential-phase aerated culture. Major structural subunit proteins for six of these fimbriae were detected using MS of proteins sheared from the surface of broth-cultured P. mirabilis, demonstrating that this organism may produce multiple fimbriae within a single culture. The high degree of conservation of P. mirabilis fimbriae stands in contrast to uropathogenic Escherichia coli and Salmonella enterica, which exhibit greater variability in their fimbrial repertoires. These findings suggest there may be evolutionary pressure for P. mirabilis to maintain a large fimbrial arsenal. PMID:24809384
Quality assessment of protein model-structures using evolutionary conservation.
Kalman, Matan; Ben-Tal, Nir
2010-05-15
Programs that evaluate the quality of a protein structural model are important both for validating the structure determination procedure and for guiding the model-building process. Such programs are based on properties of native structures that are generally not expected for faulty models. One such property, which is rarely used for automatic structure quality assessment, is the tendency for conserved residues to be located at the structural core and for variable residues to be located at the surface. We present ConQuass, a novel quality assessment program based on the consistency between the model structure and the protein's conservation pattern. We show that it can identify problematic structural models, and that the scores it assigns to the server models in CASP8 correlate with the similarity of the models to the native structure. We also show that when the conservation information is reliable, the method's performance is comparable and complementary to that of the other single-structure quality assessment methods that participated in CASP8 and that do not use additional structural information from homologs. A perl implementation of the method, as well as the various perl and R scripts used for the analysis are available at http://bental.tau.ac.il/ConQuass/. nirb@tauex.tau.ac.il Supplementary data are available at Bioinformatics online.
Goedhals, Dominique; Bester, Phillip A; Paweska, Janusz T; Swanepoel, Robert; Burt, Felicity J
2015-05-01
Crimean-Congo haemorrhagic fever virus (CCHFV) is a member of the Bunyaviridae family with a tripartite, negative sense RNA genome. This study used predictive software to analyse the L (large), M (medium), and S (small) segments of 14 southern African CCHFV isolates. The OTU-like cysteine protease domain and the RdRp domain of the L segment are highly conserved among southern African CCHFV isolates. The M segment encodes the structural glycoproteins, GN and GC, and the non-structural glycoproteins which are post-translationally cleaved at highly conserved furin and subtilase SKI-1 cleavage sites. All of the sites previously identified were shown to be conserved among southern African CCHFV isolates. The heavily O-glycosylated N-terminal variable mucin-like domain of the M segment shows the highest sequence variability of the CCHFV proteins. Five transmembrane domains are predicted in the M segment polyprotein resulting in three regions internal to and three regions external to the membrane across the G(N), NS(M) and G(C) glycoproteins. The corroboration of conserved genome domains and sequence identity among geographically diverse isolates may assist in the identification of protein function and pathogenic mechanisms, as well as the identification of potential targets for antiviral therapy and vaccine design. As detailed functional studies are lacking for many of the CCHFV proteins, identification of functional domains by prediction of protein structure, and identification of amino acid level similarity to functionally characterised proteins of related viruses or viruses with similar pathogenic mechanisms are a necessary step for selection of areas for further study. © 2015 Wiley Periodicals, Inc.
Interplay between Peptide Bond Geometrical Parameters in Nonglobular Structural Contexts
Esposito, Luciana; De Simone, Alfonso; Vitagliano, Luigi
2013-01-01
Several investigations performed in the last two decades have unveiled that geometrical parameters of protein backbone show a remarkable variability. Although these studies have provided interesting insights into one of the basic aspects of protein structure, they have been conducted on globular and water-soluble proteins. We report here a detailed analysis of backbone geometrical parameters in nonglobular proteins/peptides. We considered membrane proteins and two distinct fibrous systems (amyloid-forming and collagen-like peptides). Present data show that in these systems the local conformation plays a major role in dictating the amplitude of the bond angle N-Cα-C and the propensity of the peptide bond to adopt planar/nonplanar states. Since the trends detected here are in line with the concept of the mutual influence of local geometry and conformation previously established for globular and water-soluble proteins, our analysis demonstrates that the interplay of backbone geometrical parameters is an intrinsic and general property of protein/peptide structures that is preserved also in nonglobular contexts. For amyloid-forming peptides significant distortions of the N-Cα-C bond angle, indicative of sterical hidden strain, may occur in correspondence with side chain interdigitation. The correlation between the dihedral angles Δω/ψ in collagen-like models may have interesting implications for triple helix stability. PMID:24455689
Interplay between peptide bond geometrical parameters in nonglobular structural contexts.
Esposito, Luciana; Balasco, Nicole; De Simone, Alfonso; Berisio, Rita; Vitagliano, Luigi
2013-01-01
Several investigations performed in the last two decades have unveiled that geometrical parameters of protein backbone show a remarkable variability. Although these studies have provided interesting insights into one of the basic aspects of protein structure, they have been conducted on globular and water-soluble proteins. We report here a detailed analysis of backbone geometrical parameters in nonglobular proteins/peptides. We considered membrane proteins and two distinct fibrous systems (amyloid-forming and collagen-like peptides). Present data show that in these systems the local conformation plays a major role in dictating the amplitude of the bond angle N-C(α)-C and the propensity of the peptide bond to adopt planar/nonplanar states. Since the trends detected here are in line with the concept of the mutual influence of local geometry and conformation previously established for globular and water-soluble proteins, our analysis demonstrates that the interplay of backbone geometrical parameters is an intrinsic and general property of protein/peptide structures that is preserved also in nonglobular contexts. For amyloid-forming peptides significant distortions of the N-C(α)-C bond angle, indicative of sterical hidden strain, may occur in correspondence with side chain interdigitation. The correlation between the dihedral angles Δω/ψ in collagen-like models may have interesting implications for triple helix stability.
Molecular basis for the wide range of affinity found in Csr/Rsm protein-RNA recognition.
Duss, Olivier; Michel, Erich; Diarra dit Konté, Nana; Schubert, Mario; Allain, Frédéric H-T
2014-04-01
The carbon storage regulator/regulator of secondary metabolism (Csr/Rsm) type of small non-coding RNAs (sRNAs) is widespread throughout bacteria and acts by sequestering the global translation repressor protein CsrA/RsmE from the ribosome binding site of a subset of mRNAs. Although we have previously described the molecular basis of a high affinity RNA target bound to RsmE, it remains unknown how other lower affinity targets are recognized by the same protein. Here, we have determined the nuclear magnetic resonance solution structures of five separate GGA binding motifs of the sRNA RsmZ of Pseudomonas fluorescens in complex with RsmE. The structures explain how the variation of sequence and structural context of the GGA binding motifs modulate the binding affinity for RsmE by five orders of magnitude (∼10 nM to ∼3 mM, Kd). Furthermore, we see that conformational adaptation of protein side-chains and RNA enable recognition of different RNA sequences by the same protein contributing to binding affinity without conferring specificity. Overall, our findings illustrate how the variability in the Csr/Rsm protein-RNA recognition allows a fine-tuning of the competition between mRNAs and sRNAs for the CsrA/RsmE protein.
Analysis of the Free-Energy Surface of Proteins from Reversible Folding Simulations
Allen, Lucy R.; Krivov, Sergei V.; Paci, Emanuele
2009-01-01
Computer generated trajectories can, in principle, reveal the folding pathways of a protein at atomic resolution and possibly suggest general and simple rules for predicting the folded structure of a given sequence. While such reversible folding trajectories can only be determined ab initio using all-atom transferable force-fields for a few small proteins, they can be determined for a large number of proteins using coarse-grained and structure-based force-fields, in which a known folded structure is by construction the absolute energy and free-energy minimum. Here we use a model of the fast folding helical λ-repressor protein to generate trajectories in which native and non-native states are in equilibrium and transitions are accurately sampled. Yet, representation of the free-energy surface, which underlies the thermodynamic and dynamic properties of the protein model, from such a trajectory remains a challenge. Projections over one or a small number of arbitrarily chosen progress variables often hide the most important features of such surfaces. The results unequivocally show that an unprojected representation of the free-energy surface provides important and unbiased information and allows a simple and meaningful description of many-dimensional, heterogeneous trajectories, providing new insight into the possible mechanisms of fast-folding proteins. PMID:19593364
Analysis of the free-energy surface of proteins from reversible folding simulations.
Allen, Lucy R; Krivov, Sergei V; Paci, Emanuele
2009-07-01
Computer generated trajectories can, in principle, reveal the folding pathways of a protein at atomic resolution and possibly suggest general and simple rules for predicting the folded structure of a given sequence. While such reversible folding trajectories can only be determined ab initio using all-atom transferable force-fields for a few small proteins, they can be determined for a large number of proteins using coarse-grained and structure-based force-fields, in which a known folded structure is by construction the absolute energy and free-energy minimum. Here we use a model of the fast folding helical lambda-repressor protein to generate trajectories in which native and non-native states are in equilibrium and transitions are accurately sampled. Yet, representation of the free-energy surface, which underlies the thermodynamic and dynamic properties of the protein model, from such a trajectory remains a challenge. Projections over one or a small number of arbitrarily chosen progress variables often hide the most important features of such surfaces. The results unequivocally show that an unprojected representation of the free-energy surface provides important and unbiased information and allows a simple and meaningful description of many-dimensional, heterogeneous trajectories, providing new insight into the possible mechanisms of fast-folding proteins.
Shivange, Amol V; Hoeffken, Hans Wolfgang; Haefner, Stefan; Schwaneberg, Ulrich
2016-12-01
Protein consensus-based surface engineering (ProCoS) is a simple and efficient method for directed protein evolution combining computational analysis and molecular biology tools to engineer protein surfaces. ProCoS is based on the hypothesis that conserved residues originated from a common ancestor and that these residues are crucial for the function of a protein, whereas highly variable regions (situated on the surface of a protein) can be targeted for surface engineering to maximize performance. ProCoS comprises four main steps: ( i ) identification of conserved and highly variable regions; ( ii ) protein sequence design by substituting residues in the highly variable regions, and gene synthesis; ( iii ) in vitro DNA recombination of synthetic genes; and ( iv ) screening for active variants. ProCoS is a simple method for surface mutagenesis in which multiple sequence alignment is used for selection of surface residues based on a structural model. To demonstrate the technique's utility for directed evolution, the surface of a phytase enzyme from Yersinia mollaretii (Ymphytase) was subjected to ProCoS. Screening just 1050 clones from ProCoS engineering-guided mutant libraries yielded an enzyme with 34 amino acid substitutions. The surface-engineered Ymphytase exhibited 3.8-fold higher pH stability (at pH 2.8 for 3 h) and retained 40% of the enzyme's specific activity (400 U/mg) compared with the wild-type Ymphytase. The pH stability might be attributed to a significantly increased (20 percentage points; from 9% to 29%) number of negatively charged amino acids on the surface of the engineered phytase.
Prosser, Christine E; Waters, Lorna C; Muskett, Frederick W; Veverka, Vaclav; Addis, Philip W; Griffin, Laura M; Baker, Terry S; Lawson, Alastair D G; Wernery, Ulrich; Kinne, Jorg; Henry, Alistair J; Taylor, Richard J; Carr, Mark D
2014-04-01
Heavy chain antibodies differ in structure to conventional antibodies lacking both the light chain and the first heavy chain constant domain (CH1). Characteristics of the antigen-binding variable heavy domain of the heavy chain antibody (VHH) including the smaller size, high solubility and stability make them an attractive alternative to more traditional antibody fragments for detailed NMR-based structural analysis. Here we report essentially complete backbone and side chain (15)N, (13)C and (1)H assignments for a free VHH. Analysis of the backbone chemical shift data obtained indicates that the VHH is comprised predominantly of β-sheets corresponding to nearly 60% of the protein backbone.
Survey of large protein complexes D. vulgaris reveals great structural diversity
DOE Office of Scientific and Technical Information (OSTI.GOV)
Han, B.-G.; Dong, M.; Liu, H.
2009-08-15
An unbiased survey has been made of the stable, most abundant multi-protein complexes in Desulfovibrio vulgaris Hildenborough (DvH) that are larger than Mr {approx} 400 k. The quaternary structures for 8 of the 16 complexes purified during this work were determined by single-particle reconstruction of negatively stained specimens, a success rate {approx}10 times greater than that of previous 'proteomic' screens. In addition, the subunit compositions and stoichiometries of the remaining complexes were determined by biochemical methods. Our data show that the structures of only two of these large complexes, out of the 13 in this set that have recognizable functions,more » can be modeled with confidence based on the structures of known homologs. These results indicate that there is significantly greater variability in the way that homologous prokaryotic macromolecular complexes are assembled than has generally been appreciated. As a consequence, we suggest that relying solely on previously determined quaternary structures for homologous proteins may not be sufficient to properly understand their role in another cell of interest.« less
Direct Observation of Parallel Folding Pathways Revealed Using a Symmetric Repeat Protein System
Aksel, Tural; Barrick, Doug
2014-01-01
Although progress has been made to determine the native fold of a polypeptide from its primary structure, the diversity of pathways that connect the unfolded and folded states has not been adequately explored. Theoretical and computational studies predict that proteins fold through parallel pathways on funneled energy landscapes, although experimental detection of pathway diversity has been challenging. Here, we exploit the high translational symmetry and the direct length variation afforded by linear repeat proteins to directly detect folding through parallel pathways. By comparing folding rates of consensus ankyrin repeat proteins (CARPs), we find a clear increase in folding rates with increasing size and repeat number, although the size of the transition states (estimated from denaturant sensitivity) remains unchanged. The increase in folding rate with chain length, as opposed to a decrease expected from typical models for globular proteins, is a clear demonstration of parallel pathways. This conclusion is not dependent on extensive curve-fitting or structural perturbation of protein structure. By globally fitting a simple parallel-Ising pathway model, we have directly measured nucleation and propagation rates in protein folding, and have quantified the fluxes along each path, providing a detailed energy landscape for folding. This finding of parallel pathways differs from results from kinetic studies of repeat-proteins composed of sequence-variable repeats, where modest repeat-to-repeat energy variation coalesces folding into a single, dominant channel. Thus, for globular proteins, which have much higher variation in local structure and topology, parallel pathways are expected to be the exception rather than the rule. PMID:24988356
Novel calcium recognition constructions in proteins: Calcium blade and EF-hand zone
DOE Office of Scientific and Technical Information (OSTI.GOV)
Denesyuk, Alexander I., E-mail: adenesyu@abo.fi; Institute for Biological Instrumentation of the Russian Academy of Sciences, Pushchino 142290; Permyakov, Sergei E.
Metal ions can regulate various cell processes being first, second or third messengers, and some of them, especially transition metal ions, take part in catalysis in many enzymes. As an intracellular ion, Ca{sup 2+} is involved in many cellular functions from fertilization and contraction, cell differentiation and proliferation, to apoptosis and cancer. Here, we have identified and described two novel calcium recognition environments in proteins: the calcium blade zone and the EF-hand zone, common to 12 and 8 different protein families, respectively. Each of the two environments contains three distinct structural elements: (a) the well-known characteristic Dx[DN]xDG motif; (b) anmore » adjacent structurally identical segment, which binds metal ion in the same way between the calcium blade zone and the EF-hand zone; and (c) the following structurally variable segment, which distinguishes the calcium blade zone from the EF-hand zone. Both zones have sequence insertions between the last residue of the zone and calcium-binding residues in positions V or VI. The long insertion often connects the active and the calcium-binding sites in proteins. Using the structurally identical segments as an anchor, we were able to construct the classical calmodulin type EF-hand calcium-binding site out of two different calcium-binding motifs from two unrelated proteins.« less
Trezza, Alfonso; Bernini, Andrea; Langella, Andrea; Ascher, David B; Pires, Douglas E V; Sodi, Andrea; Passerini, Ilaria; Pelo, Elisabetta; Rizzo, Stanislao; Niccolai, Neri; Spiga, Ottavia
2017-10-01
The aim of this article is to report the investigation of the structural features of ABCA4, a protein associated with a genetic retinal disease. A new database collecting knowledge of ABCA4 structure may facilitate predictions about the possible functional consequences of gene mutations observed in clinical practice. In order to correlate structural and functional effects of the observed mutations, the structure of mouse P-glycoprotein was used as a template for homology modeling. The obtained structural information and genetic data are the basis of our relational database (ABCA4Database). Sequence variability among all ABCA4-deposited entries was calculated and reported as Shannon entropy score at the residue level. The three-dimensional model of ABCA4 structure was used to locate the spatial distribution of the observed variable regions. Our predictions from structural in silico tools were able to accurately link the functional effects of mutations to phenotype. The development of the ABCA4Database gathers all the available genetic and structural information, yielding a global view of the molecular basis of some retinal diseases. ABCA4 modeled structure provides a molecular basis on which to analyze protein sequence mutations related to genetic retinal disease in order to predict the risk of retinal disease across all possible ABCA4 mutations. Additionally, our ABCA4 predicted structure is a good starting point for the creation of a new data analysis model, appropriate for precision medicine, in order to develop a deeper knowledge network of the disease and to improve the management of patients.
The impact of p53 protein core domain structural alteration on ovarian cancer survival.
Rose, Stephen L; Robertson, Andrew D; Goodheart, Michael J; Smith, Brian J; DeYoung, Barry R; Buller, Richard E
2003-09-15
Although survival with a p53 missense mutation is highly variable, p53-null mutation is an independent adverse prognostic factor for advanced stage ovarian cancer. By evaluating ovarian cancer survival based upon a structure function analysis of the p53 protein, we tested the hypothesis that not all missense mutations are equivalent. The p53 gene was sequenced from 267 consecutive ovarian cancers. The effect of individual missense mutations on p53 structure was analyzed using the International Agency for Research on Cancer p53 Mutational Database, which specifies the effects of p53 mutations on p53 core domain structure. Mutations in the p53 core domain were classified as either explained or not explained in structural or functional terms by their predicted effects on protein folding, protein-DNA contacts, or mutation in highly conserved residues. Null mutations were classified by their mechanism of origin. Mutations were sequenced from 125 tumors. Effects of 62 of the 82 missense mutations (76%) could be explained by alterations in the p53 protein. Twenty-three (28%) of the explained mutations occurred in highly conserved regions of the p53 core protein. Twenty-two nonsense point mutations and 21 frameshift null mutations were sequenced. Survival was independent of missense mutation type and mechanism of null mutation. The hypothesis that not all missense mutations are equivalent is, therefore, rejected. Furthermore, p53 core domain structural alteration secondary to missense point mutation is not functionally equivalent to a p53-null mutation. The poor prognosis associated with p53-null mutation is independent of the mutation mechanism.
Zhao, Yong Mei; Golden, Aaron; Mar, Jessica C.; Einstein, Francine H.; Greally, John M.
2014-01-01
The mechanism and significance of epigenetic variability in the same cell type between healthy individuals are not clear. Here, we purify human CD34+ hematopoietic stem and progenitor cells (HSPCs) from different individuals and find that there is increased variability of DNA methylation at loci with properties of promoters and enhancers. The variability is especially enriched at candidate enhancers near genes transitioning between silent and expressed states, and encoding proteins with leukocyte differentiation properties. Our findings of increased variability at loci with intermediate DNA methylation values, at candidate “poised” enhancers, and at genes involved in HSPC lineage commitment suggest that CD34+ cell subtype heterogeneity between individuals is a major mechanism for the variability observed. Epigenomic studies performed on cell populations, even when purified, are testing collections of epigenomes, or meta-epigenomes. Our findings show that meta-epigenomic approaches to data analysis can provide insights into cell subpopulation structure. PMID:25327398
Structural determinants of nuclear export signal orientation in binding to exportin CRM1
Fung, Ho Yee Joyce; Fu, Szu -Chin; Brautigam, Chad A.; ...
2015-09-08
The Chromosome Region of Maintenance 1 (CRM1) protein mediates nuclear export of hundreds of proteins through recognition of their nuclear export signals (NESs), which are highly variable in sequence and structure. The plasticity of the CRM1-NES interaction is not well understood, as there are many NES sequences that seem incompatible with structures of the NES-bound CRM1 groove. Crystal structures of CRM1 bound to two different NESs with unusual sequences showed the NES peptides binding the CRM1 groove in the opposite orientation (minus) to that of previously studied NESs (plus). A comparison of minus and plus NESs identified structural and sequencemore » determinants for NES orientation. The binding of NESs to CRM1 in both orientations results in a large expansion in NES consensus patterns and therefore a corresponding expansion of potential NESs in the proteome.« less
Johnson, Jennifer L; Entzminger, Kevin C; Hyun, Jeongmin; Kalyoncu, Sibel; Heaner, David P; Morales, Ivan A; Sheppard, Aly; Gumbart, James C; Maynard, Jennifer A; Lieberman, Raquel L
2015-04-01
Crystallization chaperones are attracting increasing interest as a route to crystal growth and structure elucidation of difficult targets such as membrane proteins. While strategies to date have typically employed protein-specific chaperones, a peptide-specific chaperone to crystallize multiple cognate peptide epitope-containing client proteins is envisioned. This would eliminate the target-specific chaperone-production step and streamline the co-crystallization process. Previously, protein engineering and directed evolution were used to generate a single-chain variable (scFv) antibody fragment with affinity for the peptide sequence EYMPME (scFv/EE). This report details the conversion of scFv/EE to an anti-EE Fab format (Fab/EE) followed by its biophysical characterization. The addition of constant chains increased the overall stability and had a negligible impact on the antigen affinity. The 2.0 Å resolution crystal structure of Fab/EE reveals contacts with larger surface areas than those of scFv/EE. Surface plasmon resonance, an enzyme-linked immunosorbent assay, and size-exclusion chromatography were used to assess Fab/EE binding to EE-tagged soluble and membrane test proteins: namely, the β-barrel outer membrane protein intimin and α-helical A2a G protein-coupled receptor (A2aR). Molecular-dynamics simulation of the intimin constructs with and without Fab/EE provides insight into the energetic complexities of the co-crystallization approach.
Constraints on the Genetic and Antigenic Variability of Measles Virus.
Beaty, Shannon M; Lee, Benhur
2016-04-21
Antigenic drift and genetic variation are significantly constrained in measles virus (MeV). Genetic stability of MeV is exceptionally high, both in the lab and in the field, and few regions of the genome allow for rapid genetic change. The regions of the genome that are more tolerant of mutations (i.e., the untranslated regions and certain domains within the N, C, V, P, and M proteins) indicate genetic plasticity or structural flexibility in the encoded proteins. Our analysis reveals that strong constraints in the envelope proteins (F and H) allow for a single serotype despite known antigenic differences among its 24 genotypes. This review describes some of the many variables that limit the evolutionary rate of MeV. The high genomic stability of MeV appears to be a shared property of the Paramyxovirinae, suggesting a common mechanism that biologically restricts the rate of mutation.
Constraints on the Genetic and Antigenic Variability of Measles Virus
Beaty, Shannon M.; Lee, Benhur
2016-01-01
Antigenic drift and genetic variation are significantly constrained in measles virus (MeV). Genetic stability of MeV is exceptionally high, both in the lab and in the field, and few regions of the genome allow for rapid genetic change. The regions of the genome that are more tolerant of mutations (i.e., the untranslated regions and certain domains within the N, C, V, P, and M proteins) indicate genetic plasticity or structural flexibility in the encoded proteins. Our analysis reveals that strong constraints in the envelope proteins (F and H) allow for a single serotype despite known antigenic differences among its 24 genotypes. This review describes some of the many variables that limit the evolutionary rate of MeV. The high genomic stability of MeV appears to be a shared property of the Paramyxovirinae, suggesting a common mechanism that biologically restricts the rate of mutation. PMID:27110809
Hoehenwarter, Wolfgang; Larhlimi, Abdelhalim; Hummel, Jan; Egelhofer, Volker; Selbig, Joachim; van Dongen, Joost T; Wienkoop, Stefanie; Weckwerth, Wolfram
2011-07-01
Mass Accuracy Precursor Alignment is a fast and flexible method for comparative proteome analysis that allows the comparison of unprecedented numbers of shotgun proteomics analyses on a personal computer in a matter of hours. We compared 183 LC-MS analyses and more than 2 million MS/MS spectra and could define and separate the proteomic phenotypes of field grown tubers of 12 tetraploid cultivars of the crop plant Solanum tuberosum. Protein isoforms of patatin as well as other major gene families such as lipoxygenase and cysteine protease inhibitor that regulate tuber development were found to be the primary source of variability between the cultivars. This suggests that differentially expressed protein isoforms modulate genotype specific tuber development and the plant phenotype. We properly assigned the measured abundance of tryptic peptides to different protein isoforms that share extensive stretches of primary structure and thus inferred their abundance. Peptides unique to different protein isoforms were used to classify the remaining peptides assigned to the entire subset of isoforms based on a common abundance profile using multivariate statistical procedures. We identified nearly 4000 proteins which we used for quantitative functional annotation making this the most extensive study of the tuber proteome to date.
Dong, Zhaoming; Zhang, Weiwei; Zhang, Yan; Zhang, Xiaolu; Zhao, Ping; Xia, Qingyou
2016-05-06
Cuticle is mainly made of chitin filaments embedded in a matrix of cuticular proteins (CPs). Cuticular chitins have minor differences, whereas CPs are widely variable with respect to their sequences and structures. To understand the molecular basis underlying the mechanical properties of cuticle, it is necessary to know which CPs interact with chitin and how they are assembled into the cuticle structure. In the present study, a chitin-binding assay was performed followed by liquid chromatography-tandem mass spectrometry to identify the extracted proteins from the larval cuticle of silkworm, Bombyx mori. There were 463 proteins identified from the silkworm larval cuticle, 200 of which were recovered in the chitin-binding fraction. A total of 103 proteins were annotated as CPs, which were classified into 11 CP families based on their conserved motifs, including CPR, CPAP, CPT, CPF and CPFL, CPCFC, chitin_bind 3, BmCPH2 homologues, BmCPH9 homologues, BmCPG1 homologues, BmCPG20 homologues, and BmCPG21 homologues. A total of five CP families were newly identified in the chitin-binding fraction, thereby providing new information and insight into the composition, structure, and function of the silkworm larval cuticle.
2010-01-01
Background Protein over-production in Escherichia coli often results in formation of inclusion bodies (IBs). Some recent reports have shown that the aggregation into IBs does not necessarily mean that the target protein is inactivated and that IBs may contain a high proportion of correctly folded protein. This proportion is variable depending on the protein itself, the genetic background of the producing cells and the expression temperature. In this paper we have evaluated the influence of other production process parameters on the quality of an inclusion bodies protein. Results The present paper describes the recombinant production in Escherichia coli of the flavohemoglobin from the Antarctic bacterium Pseudoalteromonas haloplanktis TAC125. Flavohemoglobins are multidomain proteins requiring FAD and heme cofactors. The production was carried out in several different experimental setups differing in bioreactor geometry, oxygen supply and the presence of a nitrosating compound. In all production processes, the recombinant protein accumulates in IBs, from which it was solubilized in non-denaturing conditions. Comparing structural properties of the solubilized flavohemoglobins, i.e. deriving from the different process designs, our data demonstrated that the protein preparations differ significantly in the presence of cofactors (heme and FAD) and as far as their secondary and tertiary structure content is concerned. Conclusions Data reported in this paper demonstrate that other production process parameters, besides growth temperature, can influence the structure of a recombinant product that accumulates in IBs. To the best of our knowledge, this is the first reported example in which the structural properties of a protein solubilized from inclusion bodies have been correlated to the production process design. PMID:20334669
Structural analysis of the Quaking homodimerization interface
Beuck, Christine; Qu, Song; Fagg, W. Samuel; Ares, Manuel; Williamson, James R.
2012-01-01
Quaking is a prototypical member of the STAR protein family, which plays key roles in posttranscriptional gene regulation by controlling mRNA translation, stability and splicing. QkI-5 has been shown to regulate mRNA expression in the central nervous system, but little is known about its roles in other tissues. STAR proteins function as dimers and bind to bipartite RNA sequences, however, the structural and functional roles of homo- and hetero-dimerization are still unclear. Here, we present the crystal structure of the QkI dimerization domain, which adopts a similar stacked helix-turn-helix arrangement as its homologs GLD-1 and Sam68, but differs by an additional helix inserted in the dimer interface. Variability of the dimer interface residues likely ensures selective homodimerization by preventing association with non-cognate STAR family proteins in the cell. Mutations that inhibit dimerization also significantly impair RNA binding in vitro, alter QkI-5 protein levels, and impair QkI function in a splicing assay in vivo. Together our results indicate that a functional Qua1 homodimerization domain is required for QkI-5 function in mammalian cells. PMID:22982292
Islam, Nazrul; Woo, Sun-Hee; Tsujimoto, Hisashi; Kawasaki, Hiroshi; Hirano, Hisashi
2002-09-01
Changes in protein composition of wheat endosperm proteome were investigated in 39 ditelocentric chromosome lines of common wheat (Triticum aestivum L.) cv. Chinese Spring. Two-dimensional gel electrophoresis followed by Coomassie Brilliant Blue staining has resolved a total of 105 protein spots in a gel. Quantitative image analysis of protein spots was performed by PDQuest. Variations in protein spots between the euploid and the 39 ditelocentric lines were evaluated by spot number, appearance, disappearance and intensity. A specific spot present in all gels was taken as an internal standard, and the intensity of all other spots was calculated as the ratio of the internal standard. Out of the 1755 major spots detected in 39 ditelocentric lines, 1372 (78%) spots were found variable in different spot parameters: 147 (11%) disappeared, 978 (71%) up-regulated and 247 (18%) down-regulated. Correlation studies in changes in protein intensities among 24 protein spots across the ditelocentric lines were performed. High correlations in changes of protein intensities were observed among the proteins encoded by genes located in the homoeologous arms. Locations of structural genes controlling 26 spots were identified in 10 chromosomal arms. Multiple regulators of the same protein located at various chromosomal arms were also noticed. Identification of structural genes for most of the proteins was found difficult due to multiple regulators encoding the same protein. Two novel subunits (1B(Z,) 1BDz), the structure of which are very similar to the high molecular weight glutenin subunit 12, were identified, and the chromosome arm locations of these subunits were assigned.
Sudha, Govindarajan; Singh, Prashant; Swapna, Lakshmipuram S; Srinivasan, Narayanaswamy
2015-01-01
Residue types at the interface of protein–protein complexes (PPCs) are known to be reasonably well conserved. However, we show, using a dataset of known 3-D structures of homologous transient PPCs, that the 3-D location of interfacial residues and their interaction patterns are only moderately and poorly conserved, respectively. Another surprising observation is that a residue at the interface that is conserved is not necessarily in the interface in the homolog. Such differences in homologous complexes are manifested by substitution of the residues that are spatially proximal to the conserved residue and structural differences at the interfaces as well as differences in spatial orientations of the interacting proteins. Conservation of interface location and the interaction pattern at the core of the interfaces is higher than at the periphery of the interface patch. Extents of variability of various structural features reported here for homologous transient PPCs are higher than the variation in homologous permanent homomers. Our findings suggest that straightforward extrapolation of interfacial nature and inter-residue interaction patterns from template to target could lead to serious errors in the modeled complex structure. Understanding the evolution of interfaces provides insights to improve comparative modeling of PPC structures. PMID:26311309
Holman, Benjamin W B; Coombs, Cassius E O; Morris, Stephen; Kerr, Matthew J; Hopkins, David L
2018-05-01
Different chilled (~0.1 °C for up to 5 weeks) then frozen storage (up to 12 months) combinations and two frozen storage holding temperatures (-12 °C and -18 °C) effects on beef M. longissimus lumborum (LL) protein structure degradation and a marker of protein oxidation were tested. Particle size (PS) analysis and protein solubility results found storage combination effects on protein degradation to be significant (P < 0.05), although the influence of frozen holding temperatures was negligible. LL carbonyl, and nitrate and nitrite content responses were variable and yet broadly reflected an increased incidence of protein oxidation across increasing chilled storage and ensuing frozen storage periods - this aspect meriting future exploration. Total myoglobin content and the estimated myoglobin redox fractions (metmyoglobin, deoxymyoglobin, and oxymyoglobin) were also subject to storage treatment. These findings demonstrate the capacity for beef storage selection (chilled-then-frozen) to manage compositional protein changes and its implications on sensory quality traits across comparative 'long term' durations. Crown Copyright © 2018. Published by Elsevier Ltd. All rights reserved.
Tramontano, A; Bianchi, E; Venturini, S; Martin, F; Pessi, A; Sollazzo, M
1994-03-01
Conformationally constraining selectable peptides onto a suitable scaffold that enables their conformation to be predicted or readily determined by experimental techniques would considerably boost the drug discovery process by reducing the gap between the discovery of a peptide lead and the design of a peptidomimetic with a more desirable pharmacological profile. With this in mind, we designed the minibody, a 61-residue beta-protein aimed at retaining some desirable features of immunoglobulin variable domains, such as tolerance to sequence variability in selected regions of the protein and predictability of the main chain conformation of the same regions, based on the 'canonical structures' model. To test the ability of the minibody scaffold to support functional sites we also designed a metal binding version of the protein by suitably choosing the sequences of its loops. The minibody was produced both by chemical synthesis and expression in E. coli and characterized by size exclusion chromatography, UV CD (circular dichroism) spectroscopy and metal binding activity. All our data supported the model, but a more detailed structural characterization of the molecule was impaired by its low solubility. We were able to overcome this problem both by further mutagenesis of the framework and by addition of a solubilizing motif. The minibody is being used to select constrained human IL-6 peptidic ligands from a library displayed on the surface of the f1 bacteriophage.
Wang, Liwen; Qin, Yali; Ilchenko, Serguei; Bohon, Jen; Shi, Wuxian; Cho, Michael W.; Takamoto, Keiji; Chance, Mark R.
2010-01-01
Structural characterization of the HIV envelope protein gp120 is very important to provide an understanding of the protein's immunogenicity and it's binding to cell receptors. So far, crystallographic structure determination of gp120 with an intact V3 loop (in the absence of CD4 co-receptor or antibody) has not been achieved. The third variable region (V3) of the gp120 is immunodominant and contains glycosylation signatures that are essential for co-receptor binding and viral entry to T-cells. In this study, we characterized the structure of the outer domain of gp120 with an intact V3 loop (gp120-OD8) purified from Drosophila S2 cells utilizing mass spectrometry-based approaches. We mapped the glycosylation sites and calculated glycosylation occupancy of gp120-OD8; eleven sites from fifteen glycosylation motifs were determined as having high mannose or hybrid glycosylation structures. The specific glycan moieties of nine glycosylation sites from eight unique glycopeptides were determined by a combination of ECD and CID MS approaches. Hydroxyl radical-mediated protein footprinting coupled with mass spectrometry analysis was employed to provide detailed information on protein structure of gp120-OD8 by directly identifying accessible and hydroxyl radical-reactive side chain residues. Comparison of gp120-OD8 experimental footprinting data with a homology model derived from the ligated CD4/ gp120-OD8 crystal structure revealed a flexible V3 loop structure where the V3 tip may provide contacts with the rest of the protein while residues in the V3 base remain solvent accessible. In addition, the data illustrate interactions between specific sugar moieties and amino acid side chains potentially important to the gp120-OD8 structure. PMID:20825246
Extractable Bacterial Surface Proteins in Probiotic–Host Interaction
do Carmo, Fillipe L. R.; Rabah, Houem; De Oliveira Carvalho, Rodrigo D.; Gaucher, Floriane; Cordeiro, Barbara F.; da Silva, Sara H.; Le Loir, Yves; Azevedo, Vasco; Jan, Gwénaël
2018-01-01
Some Gram-positive bacteria, including probiotic ones, are covered with an external proteinaceous layer called a surface-layer. Described as a paracrystalline layer and formed by the self-assembly of a surface-layer-protein (Slp), this optional structure is peculiar. The surface layer per se is conserved and encountered in many prokaryotes. However, the sequence of the corresponding Slp protein is highly variable among bacterial species, or even among strains of the same species. Other proteins, including surface layer associated proteins (SLAPs), and other non-covalently surface-bound proteins may also be extracted with this surface structure. They can be involved a various functions. In probiotic Gram-positives, they were shown by different authors and experimental approaches to play a role in key interactions with the host. Depending on the species, and sometime on the strain, they can be involved in stress tolerance, in survival within the host digestive tract, in adhesion to host cells or mucus, or in the modulation of intestinal inflammation. Future trends include the valorization of their properties in the formation of nanoparticles, coating and encapsulation, and in the development of new vaccines. PMID:29670603
Byrne, Dominic P.; Vonderach, Matthias; Ferries, Samantha; Brownridge, Philip J.; Eyers, Claire E.; Eyers, Patrick A.
2016-01-01
cAMP-dependent protein kinase (PKA) is an archetypal biological signaling module and a model for understanding the regulation of protein kinases. In the present study, we combine biochemistry with differential scanning fluorimetry (DSF) and ion mobility–mass spectrometry (IM–MS) to evaluate effects of phosphorylation and structure on the ligand binding, dynamics and stability of components of heteromeric PKA protein complexes in vitro. We uncover dynamic, conformationally distinct populations of the PKA catalytic subunit with distinct structural stability and susceptibility to the physiological protein inhibitor PKI. Native MS of reconstituted PKA R2C2 holoenzymes reveals variable subunit stoichiometry and holoenzyme ablation by PKI binding. Finally, we find that although a ‘kinase-dead’ PKA catalytic domain cannot bind to ATP in solution, it interacts with several prominent chemical kinase inhibitors. These data demonstrate the combined power of IM–MS and DSF to probe PKA dynamics and regulation, techniques that can be employed to evaluate other protein-ligand complexes, with broad implications for cellular signaling. PMID:27444646
Liu, Liang; Chen, Jiyun; Yang, Bo; Wang, Yonghua
2015-03-06
Small heat shock proteins (sHSPs) are ubiquitous chaperones that play a vital role in protein homeostasis. sHSPs are characterized by oligomeric architectures and dynamic exchange of subunits. The flexible oligomeric assembling associating with function remains poorly understood. Based on the structural data, it is certainly agreed that two dimerization models depend on the presence or absence of a β6 strand to differentiate nonmetazoan sHSPs from metazoan sHSPs. Here, we report the Sulfolobus solfataricus Hsp20.1 ACD dimer structure, which shows a distinct dimeric interface. We observed that, in the absence of β6, Hsp20.1 dimer does not depend on β7 strand for forming dimer interface as metazoan sHSPs, nor dissociates to monomers. This is in contrast to other published sHSPs. Our structure reveals a variable, highly polar dimer interface that has advantages for rapid subunits exchange and substrate binding. Remarkably, we find that the C-terminal truncation variant has chaperone activity comparable to that of wild-type despite lack of the oligomer structure. Our further study indicates that the N-terminal region is essential for the oligomer and dimer binding to the target protein. Together, the structure and function of Hsp20.1 give more insight into the thermal protection mechanism of sHSPs. Copyright © 2015 Elsevier Inc. All rights reserved.
Krebs, Werner G.; Gerstein, Mark
2000-01-01
The number of solved structures of macromolecules that have the same fold and thus exhibit some degree of conformational variability is rapidly increasing. It is consequently advantageous to develop a standardized terminology for describing this variability and automated systems for processing protein structures in different conformations. We have developed such a system as a ‘front-end’ server to our database of macromolecular motions. Our system attempts to describe a protein motion as a rigid-body rotation of a small ‘core’ relative to a larger one, using a set of hinges. The motion is placed in a standardized coordinate system so that all statistics between any two motions are directly comparable. We find that while this model can accommodate most protein motions, it cannot accommodate all; the degree to which a motion can be accommodated provides an aid in classifying it. Furthermore, we perform an adiabatic mapping (a restrained interpolation) between every two conformations. This gives some indication of the extent of the energetic barriers that need to be surmounted in the motion, and as a by-product results in a ‘morph movie’. We make these movies available over the Web to aid in visualization. Many instances of conformational variability occur between proteins with somewhat different sequences. We can accommodate these differences in a rough fashion, generating an ‘evolutionary morph’. Users have already submitted hundreds of examples of protein motions to our server, producing a comprehensive set of statistics. So far the statistics show that the median submitted motion has a rotation of ~10° and a maximum Cα displacement of 17 Å. Almost all involve at least one large torsion angle change of >140°. The server is accessible at http://bioinfo.mbb.yale.edu/MolMovDB PMID:10734184
A first line of stress defense: small heat shock proteins and their function in protein homeostasis.
Haslbeck, Martin; Vierling, Elizabeth
2015-04-10
Small heat shock proteins (sHsps) are virtually ubiquitous molecular chaperones that can prevent the irreversible aggregation of denaturing proteins. sHsps complex with a variety of non-native proteins in an ATP-independent manner and, in the context of the stress response, form a first line of defense against protein aggregation in order to maintain protein homeostasis. In vertebrates, they act to maintain the clarity of the eye lens, and in humans, sHsp mutations are linked to myopathies and neuropathies. Although found in all domains of life, sHsps are quite diverse and have evolved independently in metazoans, plants and fungi. sHsp monomers range in size from approximately 12 to 42kDa and are defined by a conserved β-sandwich α-crystallin domain, flanked by variable N- and C-terminal sequences. Most sHsps form large oligomeric ensembles with a broad distribution of different, sphere- or barrel-like oligomers, with the size and structure of the oligomers dictated by features of the N- and C-termini. The activity of sHsps is regulated by mechanisms that change the equilibrium distribution in tertiary features and/or quaternary structure of the sHsp ensembles. Cooperation and/or co-assembly between different sHsps in the same cellular compartment add an underexplored level of complexity to sHsp structure and function. Copyright © 2015 Elsevier Ltd. All rights reserved.
Structural stability of proteins in aqueous and nonpolar environments
NASA Astrophysics Data System (ADS)
Yasuda, Satoshi; Oshima, Hiraku; Kinoshita, Masahiro
2012-10-01
A protein folds into its native structure with the α-helix and/or β-sheet in aqueous solution under the physiological condition. The relative content of these secondary structures largely varies from protein to protein. However, such structural variability is not exhibited in nonaqueous environment. For example, there is a strong trend that alcohol induces a protein to form α-helices, and many of the membrane proteins within the lipid bilayer consists of α-helices. Here we investigate the structural stability of proteins in aqueous and nonpolar environments using our recently developed free-energy function F = (Λ - TS)/(kBT0) = Λ/(kBT0) - S/kB (T0 = 298 K and the absolute temperature T is set at T0) which is based on statistical thermodynamics. Λ/(kBT0) and S/kB are the energetic and entropic components, respectively, and kB is Boltzmann's constant. A smaller value of the positive quantity, -S, represents higher efficiency of the backbone and side-chain packing promoted by the entropic effect arising from the translational displacement of solvent molecules or the CH2, CH3, and CH groups which constitute nonpolar chains of lipid molecules. As for Λ, in aqueous solution, a transition to a more compact structure of a protein accompanies the break of protein-solvent hydrogen bonds: As the number of donors and acceptors buried without protein intramolecular hydrogen bonding increases, Λ becomes higher. In nonpolar solvent, lower Λ simply implies more intramolecular hydrogen bonds formed. We find the following. The α-helix and β-sheet are advantageous with respect to -S as well as Λ and to be formed as much as possible. In aqueous solution, the solvent-entropy effect on the structural stability is so strong that the close packing of side chains is dominantly important, and the α-helix and β-sheet contents are judiciously adjusted to accomplish it. In nonpolar solvent, the solvent-entropy effect is substantially weaker than in aqueous solution. Λ is crucial and the α-helix is more stable than the β-sheet in terms of Λ, which develops a tendency that α-helices are exclusively chosen. For a membrane protein, α-helices are stabilized as fundamental structural units for the same reason, but their arrangement is performed through the entropic effect mentioned above.
The Structure of a Thermophilic Kinase Shapes Fitness upon Random Circular Permutation.
Jones, Alicia M; Mehta, Manan M; Thomas, Emily E; Atkinson, Joshua T; Segall-Shapiro, Thomas H; Liu, Shirley; Silberg, Jonathan J
2016-05-20
Proteins can be engineered for synthetic biology through circular permutation, a sequence rearrangement in which native protein termini become linked and new termini are created elsewhere through backbone fission. However, it remains challenging to anticipate a protein's functional tolerance to circular permutation. Here, we describe new transposons for creating libraries of randomly circularly permuted proteins that minimize peptide additions at their termini, and we use transposase mutagenesis to study the tolerance of a thermophilic adenylate kinase (AK) to circular permutation. We find that libraries expressing permuted AKs with either short or long peptides amended to their N-terminus yield distinct sets of active variants and present evidence that this trend arises because permuted protein expression varies across libraries. Mapping all sites that tolerate backbone cleavage onto AK structure reveals that the largest contiguous regions of sequence that lack cleavage sites are proximal to the phosphotransfer site. A comparison of our results with a range of structure-derived parameters further showed that retention of function correlates to the strongest extent with the distance to the phosphotransfer site, amino acid variability in an AK family sequence alignment, and residue-level deviations in superimposed AK structures. Our work illustrates how permuted protein libraries can be created with minimal peptide additions using transposase mutagenesis, and it reveals a challenge of maintaining consistent expression across permuted variants in a library that minimizes peptide additions. Furthermore, these findings provide a basis for interpreting responses of thermophilic phosphotransferases to circular permutation by calibrating how different structure-derived parameters relate to retention of function in a cellular selection.
Baculovirus-mediated expression of GPCRs in insect cells.
Saarenpää, Tuulia; Jaakola, Veli-Pekka; Goldman, Adrian
2015-01-01
G-protein-coupled receptors (GPCRs) are a large family of seven transmembrane proteins that influence a considerable number of cellular events. For this reason, they are one of the most studied receptor types for their pharmacological and structural properties. Solving the structure of several GPCR receptor types has been possible using almost all expression systems, including Escherichia coli, yeast, mammalian, and insect cells. So far, however, most of the GPCR structures solved have been done using the baculovirus insect cell expression system. The reason for this is mainly due to cost-effectiveness, posttranslational modification efficiency, and overall effortless maintenance. The system has evolved so much that variables starting from vector type, purification tags, cell line, and growth conditions can be varied and optimized countless ways to suit the needs of new constructs. Here, we present the array of techniques that enable the rapid and efficient optimization of expression steps for maximal protein quality and quantity, including our emendations. © 2015 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wilbur, Jeremy D., E-mail: jwilbur@msg.ucsf.edu; Hwang, Peter K.; Brodsky, Frances M.
2010-03-01
Variable packing interaction related to the conformational flexibility within the huntingtin-interacting protein 1 coiled coil domain. Huntingtin-interacting protein 1 (HIP1) is an important link between the actin cytoskeleton and clathrin-mediated endocytosis machinery. HIP1 has also been implicated in the pathogenesis of Huntington’s disease. The binding of HIP1 to actin is regulated through an interaction with clathrin light chain. Clathrin light chain binds to a flexible coiled-coil domain in HIP1 and induces a compact state that is refractory to actin binding. To understand the mechanism of this conformational regulation, a high-resolution crystal structure of a stable fragment from the HIP1 coiled-coilmore » domain was determined. The flexibility of the HIP1 coiled-coil region was evident from its variation from a previously determined structure of a similar region. A hydrogen-bond network and changes in coiled-coil monomer interaction suggest that the HIP1 coiled-coil domain is uniquely suited to allow conformational flexibility.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
An, Bo; Jenkins, Janelle E; Sampath, Sujatha
Dragline silk from orb-weaving spiders is a copolymer of two large proteins, major ampullate spidroin 1 (MaSp1) and 2 (MaSp2). The ratio of these proteins is known to have a large variation across different species of orb-weaving spiders. NMR results from gland material of two different species of spiders, N. clavipes and A. aurantia, indicates that MaSp1 proteins are more easily formed into β-sheet nanostructures, while MaSp2 proteins form random coil and helical structures. To test if this behavior of natural silk proteins could be reproduced by recombinantly produced spider silk mimic protein, recombinant MaSp1/MaSp2 mixed fibers as well asmore » chimeric silk fibers from MaSp1 and MaSp2 sequences in a single protein were produced based on the variable ratio and conserved motifs of MaSp1 and MaSp2 in native silk fiber. Mechanical properties, solid-state NMR, and XRD results of tested synthetic fibers indicate the differing roles of MaSp1 and MaSp2 in the fiber and verify the importance of postspin stretching treatment in helping the fiber to form the proper spatial structure.« less
Cellular targeting and host-specific recognition of cyst nematode CLE proteins
USDA-ARS?s Scientific Manuscript database
Cyst nematodes produce secreted peptide mimics of plant CLAVATA3/ESR (CLE) peptides likely involved in redirecting CLE signaling pathways active in roots to form unique and essential feeding cells. The hallmark structure of plant CLEs, which includes an N-terminal signal peptide, a highly variable d...
Formin homology 2 domains occur in multiple contexts in angiosperms
Cvrčková, Fatima; Novotný, Marian; Pícková, Denisa; Žárský, Viktor
2004-01-01
Background Involvement of conservative molecular modules and cellular mechanisms in the widely diversified processes of eukaryotic cell morphogenesis leads to the intriguing question: how do similar proteins contribute to dissimilar morphogenetic outputs. Formins (FH2 proteins) play a central part in the control of actin organization and dynamics, providing a good example of evolutionarily versatile use of a conserved protein domain in the context of a variety of lineage-specific structural and signalling interactions. Results In order to identify possible plant-specific sequence features within the FH2 protein family, we performed a detailed analysis of angiosperm formin-related sequences available in public databases, with particular focus on the complete Arabidopsis genome and the nearly finished rice genome sequence. This has led to revision of the current annotation of half of the 22 Arabidopsis formin-related genes. Comparative analysis of the two plant genomes revealed a good conservation of the previously described two subfamilies of plant formins (Class I and Class II), as well as several subfamilies within them that appear to predate the separation of monocot and dicot plants. Moreover, a number of plant Class II formins share an additional conserved domain, related to the protein phosphatase/tensin/auxilin fold. However, considerable inter-species variability sets limits to generalization of any functional conclusions reached on a single species such as Arabidopsis. Conclusions The plant-specific domain context of the conserved FH2 domain, as well as plant-specific features of the domain itself, may reflect distinct functional requirements in plant cells. The variability of formin structures found in plants far exceeds that known from both fungi and metazoans, suggesting a possible contribution of FH2 proteins in the evolution of the plant type of multicellularity. PMID:15256004
Komives, Elizabeth A.; Wolynes, Peter G.
2008-01-01
Repeat-proteins are made up of near repetitions of 20– to 40–amino acid stretches. These polypeptides usually fold up into non-globular, elongated architectures that are stabilized by the interactions within each repeat and those between adjacent repeats, but that lack contacts between residues distant in sequence. The inherent symmetries both in primary sequence and three-dimensional structure are reflected in a folding landscape that may be analyzed as a quasi–one-dimensional problem. We present a general description of repeat-protein energy landscapes based on a formal Ising-like treatment of the elementary interaction energetics in and between foldons, whose collective ensemble are treated as spin variables. The overall folding properties of a complete “domain” (the stability and cooperativity of the repeating array) can be derived from this microscopic description. The one-dimensional nature of the model implies there are simple relations for the experimental observables: folding free-energy (ΔGwater) and the cooperativity of denaturation (m-value), which do not ordinarily apply for globular proteins. We show how the parameters for the “coarse-grained” description in terms of foldon spin variables can be extracted from more detailed folding simulations on perfectly funneled landscapes. To illustrate the ideas, we present a case-study of a family of tetratricopeptide (TPR) repeat proteins and quantitatively relate the results to the experimentally observed folding transitions. Based on the dramatic effect that single point mutations exert on the experimentally observed folding behavior, we speculate that natural repeat proteins are “poised” at particular ratios of inter- and intra-element interaction energetics that allow them to readily undergo structural transitions in physiologically relevant conditions, which may be intrinsically related to their biological functions. PMID:18483553
Muto, Memi; Kamitani, Wataru; Sakai, Mizuki; Hirano, Minato; Kobayashi, Shintaro; Kariwa, Hiroaki; Yoshii, Kentaro
2018-04-02
Tick-borne encephalitis virus (TBEV) causes severe neurological disease, but the pathogenetic mechanism is unclear. The conformational structure of the 3'-untranslated region (UTR) of TBEV is associated with its virulence. We tried to identify host proteins interacting with the 3'-UTR of TBEV. Cellular proteins of HEK293T cells were co-precipitated with biotinylated RNAs of the 3'-UTR of low- and high-virulence TBEV strains and subjected to mass spectrometry analysis. Fifteen host proteins were found to bind to the 3'-UTR of TBEV, four of which-cold shock domain containing-E1 (CSDE1), spermatid perinuclear RNA binding protein (STRBP), fragile X mental retardation protein (FMRP), and interleukin enhancer binding factor 3 (ILF3)-bound specifically to that of the low-virulence strain. An RNA immunoprecipitation and pull-down assay confirmed the interactions of the complete 3'-UTRs of TBEV genomic RNA with CSDE1, FMRP, and ILF3. Partial deletion of the stem loop (SL) 3 to SL 5 structure of the variable region of the 3'-UTR did not affect interactions with the host proteins, but the interactions were markedly suppressed by deletion of the complete SL 3, 4, and 5 structures, as in the high-virulence TBEV strain. Further analysis of the roles of host proteins in the neurologic pathogenicity of TBEV is warranted. Copyright © 2018 Elsevier B.V. All rights reserved.
Population entropies estimates of proteins
NASA Astrophysics Data System (ADS)
Low, Wai Yee
2017-05-01
The Shannon entropy equation provides a way to estimate variability of amino acids sequences in a multiple sequence alignment of proteins. Knowledge of protein variability is useful in many areas such as vaccine design, identification of antibody binding sites, and exploration of protein 3D structural properties. In cases where the population entropies of a protein are of interest but only a small sample size can be obtained, a method based on linear regression and random subsampling can be used to estimate the population entropy. This method is useful for comparisons of entropies where the actual sequence counts differ and thus, correction for alignment size bias is needed. In the current work, an R based package named EntropyCorrect that enables estimation of population entropy is presented and an empirical study on how well this new algorithm performs on simulated dataset of various combinations of population and sample sizes is discussed. The package is available at https://github.com/lloydlow/EntropyCorrect. This article, which was originally published online on 12 May 2017, contained an error in Eq. (1), where the summation sign was missing. The corrected equation appears in the Corrigendum attached to the pdf.
Characterization of the free-energy landscapes of proteins by NMR-guided metadynamics
Granata, Daniele; Camilloni, Carlo; Vendruscolo, Michele; Laio, Alessandro
2013-01-01
The use of free-energy landscapes rationalizes a wide range of aspects of protein behavior by providing a clear illustration of the different states accessible to these molecules, as well as of their populations and pathways of interconversion. The determination of the free-energy landscapes of proteins by computational methods is, however, very challenging as it requires an extensive sampling of their conformational spaces. We describe here a technique to achieve this goal with relatively limited computational resources by incorporating nuclear magnetic resonance (NMR) chemical shifts as collective variables in metadynamics simulations. As in this approach the chemical shifts are not used as structural restraints, the resulting free-energy landscapes correspond to the force fields used in the simulations. We illustrate this approach in the case of the third Ig-binding domain of protein G from streptococcal bacteria (GB3). Our calculations reveal the existence of a folding intermediate of GB3 with nonnative structural elements. Furthermore, the availability of the free-energy landscape enables the folding mechanism of GB3 to be elucidated by analyzing the conformational ensembles corresponding to the native, intermediate, and unfolded states, as well as the transition states between them. Taken together, these results show that, by incorporating experimental data as collective variables in metadynamics simulations, it is possible to enhance the sampling efficiency by two or more orders of magnitude with respect to standard molecular dynamics simulations, and thus to estimate free-energy differences among the different states of a protein with a kBT accuracy by generating trajectories of just a few microseconds. PMID:23572592
2010-01-01
Background Plasmodium vivax malaria is a major public health challenge in Latin America, Asia and Oceania, with 130-435 million clinical cases per year worldwide. Invasion of host blood cells by P. vivax mainly depends on a type I membrane protein called Duffy binding protein (PvDBP). The erythrocyte-binding motif of PvDBP is a 170 amino-acid stretch located in its cysteine-rich region II (PvDBPII), which is the most variable segment of the protein. Methods To test whether diversifying natural selection has shaped the nucleotide diversity of PvDBPII in Brazilian populations, this region was sequenced in 122 isolates from six different geographic areas. A Bayesian method was applied to test for the action of natural selection under a population genetic model that incorporates recombination. The analysis was integrated with a structural model of PvDBPII, and T- and B-cell epitopes were localized on the 3-D structure. Results The results suggest that: (i) recombination plays an important role in determining the haplotype structure of PvDBPII, and (ii) PvDBPII appears to contain neutrally evolving codons as well as codons evolving under natural selection. Diversifying selection preferentially acts on sites identified as epitopes, particularly on amino acid residues 417, 419, and 424, which show strong linkage disequilibrium. Conclusions This study shows that some polymorphisms of PvDBPII are present near the erythrocyte-binding domain and might serve to elude antibodies that inhibit cell invasion. Therefore, these polymorphisms should be taken into account when designing vaccines aimed at eliciting antibodies to inhibit erythrocyte invasion. PMID:21092207
2014-01-01
Background Protein sites evolve at different rates due to functional and biophysical constraints. It is usually considered that the main structural determinant of a site’s rate of evolution is its Relative Solvent Accessibility (RSA). However, a recent comparative study has shown that the main structural determinant is the site’s Local Packing Density (LPD). LPD is related with dynamical flexibility, which has also been shown to correlate with sequence variability. Our purpose is to investigate the mechanism that connects a site’s LPD with its rate of evolution. Results We consider two models: an empirical Flexibility Model and a mechanistic Stress Model. The Flexibility Model postulates a linear increase of site-specific rate of evolution with dynamical flexibility. The Stress Model, introduced here, models mutations as random perturbations of the protein’s potential energy landscape, for which we use simple Elastic Network Models (ENMs). To account for natural selection we assume a single active conformation and use basic statistical physics to derive a linear relationship between site-specific evolutionary rates and the local stress of the mutant’s active conformation. We compare both models on a large and diverse dataset of enzymes. In a protein-by-protein study we found that the Stress Model outperforms the Flexibility Model for most proteins. Pooling all proteins together we show that the Stress Model is strongly supported by the total weight of evidence. Moreover, it accounts for the observed nonlinear dependence of sequence variability on flexibility. Finally, when mutational stress is controlled for, there is very little remaining correlation between sequence variability and dynamical flexibility. Conclusions We developed a mechanistic Stress Model of evolution according to which the rate of evolution of a site is predicted to depend linearly on the local mutational stress of the active conformation. Such local stress is proportional to LPD, so that this model explains the relationship between LPD and evolutionary rate. Moreover, the model also accounts for the nonlinear dependence between evolutionary rate and dynamical flexibility. PMID:24716445
Groves, M R; Hanlon, N; Turowski, P; Hemmings, B A; Barford, D
1999-01-08
The PR65/A subunit of protein phosphatase 2A serves as a scaffolding molecule to coordinate the assembly of the catalytic subunit and a variable regulatory B subunit, generating functionally diverse heterotrimers. Mutations of the beta isoform of PR65 are associated with lung and colon tumors. The crystal structure of the PR65/Aalpha subunit, at 2.3 A resolution, reveals the conformation of its 15 tandemly repeated HEAT sequences, degenerate motifs of approximately 39 amino acids present in a variety of proteins, including huntingtin and importin beta. Individual motifs are composed of a pair of antiparallel alpha helices that assemble in a mainly linear, repetitive fashion to form an elongated molecule characterized by a double layer of alpha helices. Left-handed rotations at three interrepeat interfaces generate a novel left-hand superhelical conformation. The protein interaction interface is formed from the intrarepeat turns that are aligned to form a continuous ridge.
2012-01-01
Chlorite dismutases (Clds) are heme b-containing oxidoreductases that convert chlorite to chloride and dioxygen. In this work, the thermodynamics of the one-electron reduction of the ferric high-spin forms and of the six-coordinate low-spin cyanide adducts of the enzymes from Nitrobacter winogradskyi (NwCld) and Candidatus “Nitrospira defluvii” (NdCld) were determined through spectroelectrochemical experiments. These proteins belong to two phylogenetically separated lineages that differ in subunit (21.5 and 26 kDa, respectively) and oligomeric (dimeric and pentameric, respectively) structure but exhibit similar chlorite degradation activity. The E°′ values for free and cyanide-bound proteins were determined to be −119 and −397 mV for NwCld and −113 and −404 mV for NdCld, respectively (pH 7.0, 25 °C). Variable-temperature spectroelectrochemical experiments revealed that the oxidized state of both proteins is enthalpically stabilized. Molecular dynamics simulations suggest that changes in the protein structure are negligible, whereas solvent reorganization is mainly responsible for the increase in entropy during the redox reaction. Obtained data are discussed with respect to the known structures of the two Clds and the proposed reaction mechanism. PMID:23126649
Neanderthal and Denisova tooth protein variants in present-day humans
Zanolli, Clément; Hourset, Mathilde; Esclassan, Rémi
2017-01-01
Environment parameters, diet and genetic factors interact to shape tooth morphostructure. In the human lineage, archaic and modern hominins show differences in dental traits, including enamel thickness, but variability also exists among living populations. Several polymorphisms, in particular in the non-collagenous extracellular matrix proteins of the tooth hard tissues, like enamelin, are involved in dental structure variation and defects and may be associated with dental disorders or susceptibility to caries. To gain insights into the relationships between tooth protein polymorphisms and dental structural morphology and defects, we searched for non-synonymous polymorphisms in tooth proteins from Neanderthal and Denisova hominins. The objective was to identify archaic-specific missense variants that may explain the dental morphostructural variability between extinct and modern humans, and to explore their putative impact on present-day dental phenotypes. Thirteen non-collagenous extracellular matrix proteins specific to hard dental tissues have been selected, searched in the publicly available sequence databases of Neanderthal and Denisova individuals and compared with modern human genome data. A total of 16 non-synonymous polymorphisms were identified in 6 proteins (ameloblastin, amelotin, cementum protein 1, dentin matrix acidic phosphoprotein 1, enamelin and matrix Gla protein). Most of them are encoded by dentin and enamel genes located on chromosome 4, previously reported to show signs of archaic introgression within Africa. Among the variants shared with modern humans, two are ancestral (common with apes) and one is the derived enamelin major variant, T648I (rs7671281), associated with a thinner enamel and specific to the Homo lineage. All the others are specific to Neanderthals and Denisova, and are found at a very low frequency in modern Africans or East and South Asians, suggesting that they may be related to particular dental traits or disease susceptibility in these populations. This modern regional distribution of archaic dental polymorphisms may reflect persistence of archaic variants in some populations and may contribute in part to the geographic dental variations described in modern humans. PMID:28902892
Bukin, Yu S; Dzhioev, Yu P; Tkachev, S E; Kozlova, I V; Paramonov, A I; Ruzek, D; Qu, Z; Zlobin, V I
2017-06-15
This work is dedicated to the study of the variability of the main antigenic envelope protein E among different strains of tick-borne encephalitis virus at the level of physical and chemical properties of the amino acid residues. E protein variants were extracted from then NCBI database. Four amino acid residues properties in the polypeptide sequences were investigated: the average volume of the amino acid residue in the protein tertiary structure, the number of amino acid residue hydrogen bond donors, the charge of amino acid residue lateral radical and the dipole moment of the amino acid residue. These physico-chemical properties are involved in antigen-antibody interactions. As a result, 103 different variants of the antigenic determinants of the tick-borne encephalitis virus E protein were found, significantly different by physical and chemical properties of the amino acid residues in their structure. This means that some strains among the natural variants of tick-borne encephalitis virus can potentially escape the immune response induced by the standard vaccine. Copyright © 2017 Elsevier B.V. All rights reserved.
Zhang, Wenping; Yue, Bisong; Wang, Xiaofang; Zhang, Xiuyue; Xie, Zhong; Liu, Nonglin; Fu, Wenyuan; Yuan, Yaohua; Chen, Daqing; Fu, Danghua; Zhao, Bo; Yin, Yuzhong; Yan, Xiahui; Wang, Xinjing; Zhang, Rongying; Liu, Jie; Li, Maoping; Tang, Yao; Hou, Rong; Zhang, Zhihe
2011-10-01
In order to investigate the mitochondrial genome of Panthera tigris amoyensis, two South China tigers (P25 and P27) were analyzed following 15 cymt-specific primer sets. The entire mtDNA sequence was found to be 16,957 bp and 17,001 bp long for P25 and P27 respectively, and this difference in length between P25 and P27 occurred in the number of tandem repeats in the RS-3 segment of the control region. The structural characteristics of complete P. t. amoyensis mitochondrial genomes were also highly similar to those of P. uncia. Additionally, the rate of point mutation was only 0.3% and a total of 59 variable sites between P25 and P27 were found. Out of the 59 variable sites, 6 were located in 6 different tRNA genes, 6 in the 2 rRNA genes, 7 in non-coding regions (one located between tRNA-Asn and tRNA-Tyr and six in the D-loop), and 40 in 10 protein-coding genes. COI held the largest amount of variable sites (9 sites) and Cytb contained the highest variable rate (0.7%) in the complete sequences. Moreover, out of the 40 variable sites located in 10 protein-coding genes, 12 sites were nonsynonymous.
3D Protein structure prediction with genetic tabu search algorithm
2010-01-01
Background Protein structure prediction (PSP) has important applications in different fields, such as drug design, disease prediction, and so on. In protein structure prediction, there are two important issues. The first one is the design of the structure model and the second one is the design of the optimization technology. Because of the complexity of the realistic protein structure, the structure model adopted in this paper is a simplified model, which is called off-lattice AB model. After the structure model is assumed, optimization technology is needed for searching the best conformation of a protein sequence based on the assumed structure model. However, PSP is an NP-hard problem even if the simplest model is assumed. Thus, many algorithms have been developed to solve the global optimization problem. In this paper, a hybrid algorithm, which combines genetic algorithm (GA) and tabu search (TS) algorithm, is developed to complete this task. Results In order to develop an efficient optimization algorithm, several improved strategies are developed for the proposed genetic tabu search algorithm. The combined use of these strategies can improve the efficiency of the algorithm. In these strategies, tabu search introduced into the crossover and mutation operators can improve the local search capability, the adoption of variable population size strategy can maintain the diversity of the population, and the ranking selection strategy can improve the possibility of an individual with low energy value entering into next generation. Experiments are performed with Fibonacci sequences and real protein sequences. Experimental results show that the lowest energy obtained by the proposed GATS algorithm is lower than that obtained by previous methods. Conclusions The hybrid algorithm has the advantages from both genetic algorithm and tabu search algorithm. It makes use of the advantage of multiple search points in genetic algorithm, and can overcome poor hill-climbing capability in the conventional genetic algorithm by using the flexible memory functions of TS. Compared with some previous algorithms, GATS algorithm has better performance in global optimization and can predict 3D protein structure more effectively. PMID:20522256
Børud, Bente; Bårnes, Guro K; Brynildsrud, Ola Brønstad; Fritzsønn, Elisabeth; Caugant, Dominique A
2018-03-19
Species within the genus Neisseria display significant glycan diversity associated with the O -linked protein glycosylation ( pgl ) systems due to phase variation, polymorphic genes and gene content. The aim of this study was to examine in detail the pgl genotype and glycosylation phenotype in meningococcal isolates and the changes occurring during short-term asymptomatic carriage. Paired meningococcal isolates derived from 50 asymptomatic meningococcal carriers, taken about two months apart, were analyzed with whole genome sequencing. The O -linked protein glycosylation genes were characterized in detail using the Genome Comparator tool at the PubMLST.org database. Immunoblotting with glycan specific antibodies were used to investigate the protein glycosylation phenotype. All major pgl locus polymorphisms identified in N. meningitidis to date were present in our isolate collection, with the variable presence of pglG-pglH, both in combination with either pglB or pglB2. We identified significant changes and diversity in the pgl genotype and/or glycan phenotype in 96% of the paired isolates. There was also a high degree of glycan microheterogeneity, in which different variants of glycan structures were found at a given glycoprotein. The main mechanism responsible for the observed differences was phase variable expression of the involved glycosyltransferases and the O-acetyltransferase. To our knowledge, this is the first characterization of the pgl genotype and glycosylation phenotype in a larger strain collection. This study thus provides important insight into glycan diversity in N. meningitidis and phase variability changes that influence the expressed glycoform repertoire during meningococcal carriage. Importance Bacterial meningitis is a serious global health problem and one of the major causative organisms is Neisseria meningitidis , which is also a common commensal in the upper respiratory tract of healthy humans. In bacteria, numerous loci involved in biosynthesis of surface exposed antigenic structures that are involved in the interaction between bacteria and host, are frequently subjected to homologous recombination and phase variation. These mechanisms are well described in Neisseria, and phase variation provides the ability to change these structures reversibly in response to the environment. Protein glycosylation systems are becoming widely identified in bacteria, yet little is known about the mechanisms and evolutionary forces influencing glycan composition during carriage and disease. Copyright © 2018 American Society for Microbiology.
Coiled-Coil Irregularities and Instabilities in Group A Streptococcus M1 Are Required for Virulence
DOE Office of Scientific and Technical Information (OSTI.GOV)
McNamara, Case; Zinkernagel, Annelies S.; Macheboeuf, Pauline
2008-07-21
Antigenically variable M proteins are major virulence factors and immunogens of the human pathogen group A Streptococcus (GAS). Here, we report the -3 angstrom resolution structure of a GAS M1 fragment containing the regions responsible for eliciting type-specific, protective immunity and for binding fibrinogen, which promotes M1 proinflammatory and antiphagocytic functions. The structure revealed substantial irregularities and instabilities throughout the coiled coil of the M1 fragment. Similar structural irregularities occur in myosin and tropomyosin, explaining the patterns of cross-reactivity seen in autoimmune sequelae of GAS infection. Sequence idealization of a large segment of the M1 coiled coil enhanced stability butmore » diminished fibrinogen binding, proinflammatory effects, and antibody cross-reactivity, whereas it left protective immunogenicity undiminished. Idealized M proteins appear to have promise as vaccine immunogens.« less
Turoverov, Konstantin K.; Kuznetsova, Irina M.; Uversky, Vladimir N.
2010-01-01
The native state of a protein is usually associated with a compact globular conformation possessing a rigid and highly ordered structure. At the turn of the last century certain studies arose which concluded that many proteins cannot, in principle, form a rigid globular structure in an aqueous environment, but they are still able to fulfill their specific functions — i.e., they are native. The existence of the disordered regions allows these proteins to interact with their numerous binding partners. Such interactions are often accompanied by the formation of complexes that possess a more ordered structure than the original components. The functional diversity of these proteins, combined with the variability of signals related to the various intra-and intercellular processes handled by these proteins and their capability to produce multi-variant and multi-directional responses allow them to form a unique regulatory net in a cell. The abundance of disordered proteins inside the cell is precisely controlled at the synthesis and clearance levels as well as via interaction with specific binding partners and posttranslational modifications. Another recently recognized biologically active state of proteins is the functional amyloid. The formation of such functional amyloids is tightly controlled and therefore differs from the uncontrolled formation of pathogenic amyloids which are associated with the pathogenesis of several conformational diseases, the development of which is likely to be determined by the failures of the cellular regulatory systems rather than by the formation of the proteinaceous deposits and/or by the protofibril toxicity. PMID:20097220
Water promotes the sealing of nanoscale packing defects in folding proteins.
Fernández, Ariel
2014-05-21
A net dipole moment is shown to arise from a non-Debye component of water polarization created by nanoscale packing defects on the protein surface. Accordingly, the protein electrostatic field exerts a torque on the induced dipole, locally impeding the nucleation of ice at the protein-water interface. We evaluate the solvent orientation steering (SOS) as the reversible work needed to align the induced dipoles with the Debye electrostatic field and computed the SOS for the variable interface of a folding protein. The minimization of the SOS is shown to drive protein folding as evidenced by the entrainment of the total free energy by the SOS energy along trajectories that approach a Debye limit state where no torque arises. This result suggests that the minimization of anomalous water polarization at the interface promotes the sealing of packing defects, thereby maintaining structural integrity and committing the protein chain to fold.
Sharma, Alok; Pohlentz, Gottfried; Bobbili, Kishore Babu; Jeyaprakash, A Arockia; Chandran, Thyageshwar; Mormann, Michael; Swamy, Musti J; Vijayan, M
2013-08-01
The sequence and structure of snake gourd seed lectin (SGSL), a nontoxic homologue of type II ribosome-inactivating proteins (RIPs), have been determined by mass spectrometry and X-ray crystallography, respectively. As in type II RIPs, the molecule consists of a lectin chain made up of two β-trefoil domains. The catalytic chain, which is connected through a disulfide bridge to the lectin chain in type II RIPs, is cleaved into two in SGSL. However, the integrity of the three-dimensional structure of the catalytic component of the molecule is preserved. This is the first time that a three-chain RIP or RIP homologue has been observed. A thorough examination of the sequence and structure of the protein and of its interactions with the bound methyl-α-galactose indicate that the nontoxicity of SGSL results from a combination of changes in the catalytic and the carbohydrate-binding sites. Detailed analyses of the sequences of type II RIPs of known structure and their homologues with unknown structure provide valuable insights into the evolution of this class of proteins. They also indicate some variability in carbohydrate-binding sites, which appears to contribute to the different levels of toxicity exhibited by lectins from various sources.
Hotspot-Centric De Novo Design of Protein Binders
Fleishman, Sarel J.; Corn, Jacob E.; Strauch, Eva-Maria; Whitehead, Timothy A.; Karanicolas, John; Baker, David
2014-01-01
Protein–protein interactions play critical roles in biology, and computational design of interactions could be useful in a range of applications. We describe in detail a general approach to de novo design of protein interactions based on computed, energetically optimized interaction hotspots, which was recently used to produce high-affinity binders of influenza hemagglutinin. We present several alternative approaches to identify and build the key hotspot interactions within both core secondary structural elements and variable loop regions and evaluate the method's performance in natural-interface recapitulation. We show that the method generates binding surfaces that are more conformationally restricted than previous design methods, reducing opportunities for off-target interactions. PMID:21945116
Resolution of ab initio shapes determined from small-angle scattering.
Tuukkanen, Anne T; Kleywegt, Gerard J; Svergun, Dmitri I
2016-11-01
Spatial resolution is an important characteristic of structural models, and the authors of structures determined by X-ray crystallography or electron cryo-microscopy always provide the resolution upon publication and deposition. Small-angle scattering of X-rays or neutrons (SAS) has recently become a mainstream structural method providing the overall three-dimensional structures of proteins, nucleic acids and complexes in solution. However, no quantitative resolution measure is available for SAS-derived models, which significantly hampers their validation and further use. Here, a method is derived for resolution assessment for ab initio shape reconstruction from scattering data. The inherent variability of the ab initio shapes is utilized and it is demonstrated how their average Fourier shell correlation function is related to the model resolution. The method is validated against simulated data for proteins with known high-resolution structures and its efficiency is demonstrated in applications to experimental data. It is proposed that henceforth the resolution be reported in publications and depositions of ab initio SAS models.
Resolution of ab initio shapes determined from small-angle scattering
Tuukkanen, Anne T.; Kleywegt, Gerard J.; Svergun, Dmitri I.
2016-01-01
Spatial resolution is an important characteristic of structural models, and the authors of structures determined by X-ray crystallography or electron cryo-microscopy always provide the resolution upon publication and deposition. Small-angle scattering of X-rays or neutrons (SAS) has recently become a mainstream structural method providing the overall three-dimensional structures of proteins, nucleic acids and complexes in solution. However, no quantitative resolution measure is available for SAS-derived models, which significantly hampers their validation and further use. Here, a method is derived for resolution assessment for ab initio shape reconstruction from scattering data. The inherent variability of the ab initio shapes is utilized and it is demonstrated how their average Fourier shell correlation function is related to the model resolution. The method is validated against simulated data for proteins with known high-resolution structures and its efficiency is demonstrated in applications to experimental data. It is proposed that henceforth the resolution be reported in publications and depositions of ab initio SAS models. PMID:27840683
Ratcliff, Kathleen; Marqusee, Susan
2010-01-01
Ribonucleases H from organisms that grow at different temperatures demonstrate a variable change in heat capacity upon unfolding (ΔC°P) [Ratcliff, K., et al. (2009) Biochemistry 48, 5890–5898]. This ΔC°P has been shown to correlate with a tolerance to higher temperatures and residual structure in the unfolded state of the thermophilic proteins. In the RNase H from Thermus thermophilus, the low ΔC°P has been shown to arise from the same region as the folding core of the protein, and mutagenic studies have shown that loss of a hydrophobic residue in this region can disrupt this residual unfolded state structure and result in a return to a more mesophile-like ΔC°P [Robic, S., et al. (2002) Protein Sci. 11, 381–389; Robic, S., et al. (2003) Proc. Natl. Acad. Sci. U.S.A. 100, 11345–11349]. To understand further how residual structure in the unfolded state is encoded in the sequences of these thermophilic proteins, we subjected the RNase H from Chlorobium tepidum to similar studies. Analysis of new chimeric proteins reveals that like T. thermophilus RNase H, the folding core of C. tepidum RNaseH plays an important role in the unfolded state of this protein. Mutagenesis studies, based on both a computational investigation of the hydrophobic networks in the core region and comparisons with similar studies on T. thermophilus RNase H, identify new residues involved in this residual structure and suggest that the residual structure in the unfolded state of C. tepidum RNase H is more restricted than that of T. thermophilus. We conclude that while the folding core region determines the thermophilic-like behavior of this family of proteins, the residue-specific details vary. PMID:20491485
Muccifora, Simonetta; Guerranti, Roberto; Muzzi, Chiara; Hope-Onyekwere, Nnadozies S; Pagani, Roberto; Leoncini, Roberto; Bellani, Lorenza M
2010-03-01
The mobilization of storage reserves, with particular emphasis on storage proteins of Mucuna pruriens (L.) DC., cotyledons, and embryo was investigated from the ultrastructural and biochemical points of view. Proteins and starch were the two main storage substances in cotyledons, and proteins and lipids were the main ones in the embryo. Embryo protein bodies were smaller and fewer in number than those of cotyledons. Structural and ultrastructural data determined between 24 and 48 h after imbibition and between 48 and 72 h after imbibition, the end of significant embryo and cotyledon protein mobilization, respectively, indicating more precocious storage protein mobilization in the axis than cotyledons. Moreover, storage protein mobilization in embryo and cotyledons occurred before the end of germination. Water soluble proteins were separated by sodium dodecyl sulfate polyacrylamide gel electrophoresis, producing 29 bands with molecular weights from 14 to 90 KDa. Embryo extract contained more proteins than cotyledon extract, contained seven characteristic bands, and showed a higher variability of the optical density trend than cotyledon.
Krishnan, Neeraja M; Seligmann, Hervé; Rao, Basuthkar J
2008-01-28
Synonymous sites are freer to vary because of redundancy in genetic code. Messenger RNA secondary structure restricts this freedom, as revealed by previous findings in mitochondrial genes that mutations at third codon position nucleotides in helices are more selected against than those in loops. This motivated us to explore the constraints imposed by mRNA secondary structure on evolutionary variability at all codon positions in general, in chloroplast systems. We found that the evolutionary variability and intrinsic secondary structure stability of these sequences share an inverse relationship. Simulations of most likely single nucleotide evolution in Psilotum nudum and Nephroselmis olivacea mRNAs, indicate that helix-forming propensities of mutated mRNAs are greater than those of the natural mRNAs for short sequences and vice-versa for long sequences. Moreover, helix-forming propensity estimated by the percentage of total mRNA in helices increases gradually with mRNA length, saturating beyond 1000 nucleotides. Protection levels of functionally important sites vary across plants and proteins: r-strategists minimize mutation costs in large genes; K-strategists do the opposite. Mrna length presumably predisposes shorter mRNAs to evolve under different constraints than longer mRNAs. The positive correlation between secondary structure protection and functional importance of sites suggests that some sites might be conserved due to packing-protection constraints at the nucleic acid level in addition to protein level constraints. Consequently, nucleic acid secondary structure a priori biases mutations. The converse (exposure of conserved sites) apparently occurs in a smaller number of cases, indicating a different evolutionary adaptive strategy in these plants. The differences between the protection levels of functionally important sites for r- and K-strategists reflect their respective molecular adaptive strategies. These converge with increasing domestication levels of K-strategists, perhaps because domestication increases reproductive output.
Expanded explorations into the optimization of an energy function for protein design
Huang, Yao-ming; Bystroff, Christopher
2014-01-01
Nature possesses a secret formula for the energy as a function of the structure of a protein. In protein design, approximations are made to both the structural representation of the molecule and to the form of the energy equation, such that the existence of a general energy function for proteins is by no means guaranteed. Here we present new insights towards the application of machine learning to the problem of finding a general energy function for protein design. Machine learning requires the definition of an objective function, which carries with it the implied definition of success in protein design. We explored four functions, consisting of two functional forms, each with two criteria for success. Optimization was carried out by a Monte Carlo search through the space of all variable parameters. Cross-validation of the optimized energy function against a test set gave significantly different results depending on the choice of objective function, pointing to relative correctness of the built-in assumptions. Novel energy cross-terms correct for the observed non-additivity of energy terms and an imbalance in the distribution of predicted amino acids. This paper expands on the work presented at ACM-BCB, Orlando FL , October 2012. PMID:24384706
Thomas, Karluss; Herouet-Guicheney, Corinne; Ladics, Gregory; McClain, Scott; MacIntosh, Susan; Privalle, Laura; Woolhiser, Mike
2008-09-01
The International Life Science Institute's Health and Environmental Sciences Institute's Protein Allergenicity Technical Committee hosted an international workshop October 23-25, 2007, in Nice, France, to review and discuss existing and emerging methods and techniques for improving the current weight-of-evidence approach for evaluating the potential allergenicity of novel proteins. The workshop included over 40 international experts from government, industry, and academia. Their expertise represented a range of disciplines including immunology, chemistry, molecular biology, bioinformatics, and toxicology. Among participants, there was consensus that (1) current bioinformatic approaches are highly conservative; (2) advances in bioinformatics using structural comparisons of proteins may be helpful as the availability of structural data increases; (3) proteomics may prove useful for monitoring the natural variability in a plant's proteome and assessing the impact of biotechnology transformations on endogenous levels of allergens, but only when analytical techniques have been standardized and additional data are available on the natural variation of protein expression in non-transgenic bred plants; (4) basophil response assays are promising techniques, but need additional evaluation around specificity, sensitivity, and reproducibility; (5) additional research is required to develop and validate an animal model for the purpose of predicting protein allergenicity.
Wolf, Maxim Y; Wolf, Yuri I; Koonin, Eugene V
2008-01-01
Background Proteins show a broad range of evolutionary rates. Understanding the factors that are responsible for the characteristic rate of evolution of a given protein arguably is one of the major goals of evolutionary biology. A long-standing general assumption used to be that the evolution rate is, primarily, determined by the specific functional constraints that affect the given protein. These constrains were traditionally thought to depend both on the specific features of the protein's structure and its biological role. The advent of systems biology brought about new types of data, such as expression level and protein-protein interactions, and unexpectedly, a variety of correlations between protein evolution rate and these variables have been observed. The strongest connections by far were repeatedly seen between protein sequence evolution rate and the expression level of the respective gene. It has been hypothesized that this link is due to the selection for the robustness of the protein structure to mistranslation-induced misfolding that is particularly important for highly expressed proteins and is the dominant determinant of the sequence evolution rate. Results This work is an attempt to assess the relative contributions of protein domain structure and function, on the one hand, and expression level on the other hand, to the rate of sequence evolution. To this end, we performed a genome-wide analysis of the effect of the fusion of a pair of domains in multidomain proteins on the difference in the domain-specific evolutionary rates. The mistranslation-induced misfolding hypothesis would predict that, within multidomain proteins, fused domains, on average, should evolve at substantially closer rates than the same domains in different proteins because, within a mutlidomain protein, all domains are translated at the same rate. We performed a comprehensive comparison of the evolutionary rates of mammalian and plant protein domains that are either joined in multidomain proteins or contained in distinct proteins. Substantial homogenization of evolutionary rates in multidomain proteins was, indeed, observed in both animals and plants, although highly significant differences between domain-specific rates remained. The contributions of the translation rate, as determined by the effect of the fusion of a pair of domains within a multidomain protein, and intrinsic, domain-specific structural-functional constraints appear to be comparable in magnitude. Conclusion Fusion of domains in a multidomain protein results in substantial homogenization of the domain-specific evolutionary rates but significant differences between domain-specific evolution rates remain. Thus, the rate of translation and intrinsic structural-functional constraints both exert sizable and comparable effects on sequence evolution. Reviewers This article was reviewed by Sergei Maslov, Dennis Vitkup, Claus Wilke (nominated by Orly Alter), and Allan Drummond (nominated by Joel Bader). For the full reviews, please go to the Reviewers' Reports section. PMID:18840284
Singh, Aditya Abha; Agrawal, S B; Shahi, J P; Agrawal, Madhoolika
2014-02-01
Rapid industrialization and economic developments have increased the tropospheric ozone (O3) budget since preindustrial times, and presently, it is supposed to be a major threat to crop productivity. Maize (Zea mays L.), a C4 plant is the third most important staple crop at global level with a great deal of economic importance. The present study was conducted to evaluate the performance of two maize cultivars [HQPM1: quality protein maize (QPM)] and [DHM117: nonquality protein maize (NQPM)] to variable O3 doses. Experimental setup included filtered chambers, nonfiltered chambers (NFC), and two elevated doses of O3 viz. NFC+15 ppb O3 (NFC+15) and NFC+30 ppb O3 (NFC+30). During initial growth period, both QPM and NQPM plants showed hormetic effect that is beneficial due to exposure of low doses of a toxicant (NFC and NFC+15 ppb O3), but at later stages, growth attributes were negatively affected by O3. Growth indices showed the variable pattern of photosynthate translocation under O3 stress. Foliar injury in the form of interveinal chlorosis and reddening of leaves due to increased production of anthocyanin pigments was observed at higher concentrations of O3. One-dimensional gel electrophoresis of leaves taken from NFC+30 showed reductions of major photosynthetic proteins, and differential response was observed between the two test cultivars. Decline in the number of male flowers at elevated O3 doses suggested damaging effect of O3 on reproductive structures which might be a cause of productivity losses. Variable carbon allocation pattern particularly to husk leaves, foliar injury, and damage of photosynthetic proteins led to significant reductions in economic yield at higher O3 doses. PCA showed that both the cultivars responded more or less similarly to O3 stress in their respective groupings of growth and yield parameters, but magnitude of their response was variable. It is further supported by difference in the significance of correlations between variables of yield and AOT40. Cultivar response reflects that QPM performed better than NQPM against elevated O3.
Richards, Mark W.; Law, Edward W. P.; Rennalls, La’Verne P.; Busacca, Sara; O’Regan, Laura; Fry, Andrew M.; Fennell, Dean A.; Bayliss, Richard
2014-01-01
Proteins of the echinoderm microtubule-associated protein (EMAP)-like (EML) family contribute to formation of the mitotic spindle and interphase microtubule network. They contain a unique hydrophobic EML protein (HELP) motif and a variable number of WD40 repeats. Recurrent gene rearrangements in nonsmall cell lung cancer fuse EML4 to anaplastic lymphoma kinase (ALK), causing expression of several fusion oncoprotein variants. We have determined a 2.6-Å crystal structure of the representative ∼70-kDa core of EML1, revealing an intimately associated pair of β-propellers, which we term a TAPE (tandem atypical propeller in EMLs) domain. One propeller is highly atypical, having a discontinuous subdomain unrelated to a WD40 motif in place of one of its blades. This unexpected feature shows how a propeller structure can be assembled from subdomains with distinct folds. The HELP motif is not an independent domain but forms part of the hydrophobic core that joins the two β-propellers. The TAPE domain binds α/β-tubulin via its conserved, concave surface, including part of the atypical blade. Mapping the characteristic breakpoints of each EML4-ALK variant onto our structure indicates that the EML4 TAPE domain is truncated in many variants in a manner likely to make the fusion protein structurally unstable. We found that the heat shock protein 90 (Hsp90) inhibitor ganetespib induced degradation of these variants whereas others lacking a partial TAPE domain were resistant in both overexpression models and patient-derived cell lines. The Hsp90-sensitive EML4-ALK variants are exceptions to the rule that oncogenic fusion proteins involve breakpoints in disordered regions of both partners. PMID:24706829
Richards, Mark W; Law, Edward W P; Rennalls, La'Verne P; Busacca, Sara; O'Regan, Laura; Fry, Andrew M; Fennell, Dean A; Bayliss, Richard
2014-04-08
Proteins of the echinoderm microtubule-associated protein (EMAP)-like (EML) family contribute to formation of the mitotic spindle and interphase microtubule network. They contain a unique hydrophobic EML protein (HELP) motif and a variable number of WD40 repeats. Recurrent gene rearrangements in nonsmall cell lung cancer fuse EML4 to anaplastic lymphoma kinase (ALK), causing expression of several fusion oncoprotein variants. We have determined a 2.6-Å crystal structure of the representative ∼70-kDa core of EML1, revealing an intimately associated pair of β-propellers, which we term a TAPE (tandem atypical propeller in EMLs) domain. One propeller is highly atypical, having a discontinuous subdomain unrelated to a WD40 motif in place of one of its blades. This unexpected feature shows how a propeller structure can be assembled from subdomains with distinct folds. The HELP motif is not an independent domain but forms part of the hydrophobic core that joins the two β-propellers. The TAPE domain binds α/β-tubulin via its conserved, concave surface, including part of the atypical blade. Mapping the characteristic breakpoints of each EML4-ALK variant onto our structure indicates that the EML4 TAPE domain is truncated in many variants in a manner likely to make the fusion protein structurally unstable. We found that the heat shock protein 90 (Hsp90) inhibitor ganetespib induced degradation of these variants whereas others lacking a partial TAPE domain were resistant in both overexpression models and patient-derived cell lines. The Hsp90-sensitive EML4-ALK variants are exceptions to the rule that oncogenic fusion proteins involve breakpoints in disordered regions of both partners.
Phillips, J J; Javadi, Y; Millership, C; Main, E R G
2012-01-01
Tetratricopeptide repeats (TPRs) are a class of all alpha-helical repeat proteins that are comprised of 34-aa helix-turn-helix motifs. These stack together to form nonglobular structures that are stabilized by short-range interactions from residues close in primary sequence. Unlike globular proteins, they have few, if any, long-range nonlocal stabilizing interactions. Several studies on designed TPR proteins have shown that this modular structure is reflected in their folding, that is, modular multistate folding is observed as opposed to two-state folding. Here we show that TPR multistate folding can be suppressed to approximate two-state folding through modulation of intrinsic stability or extrinsic environmental variables. This modulation was investigated by comparing the thermodynamic unfolding under differing buffer regimes of two distinct series of consensus-designed TPR proteins, which possess different intrinsic stabilities. A total of nine proteins of differing sizes and differing consensus TPR motifs were each thermally and chemically denatured and their unfolding monitored using differential scanning calorimetry (DSC) and CD/fluorescence, respectively. Analyses of both the DSC and chemical denaturation data show that reducing the total stability of each protein and repeat units leads to observable two-state unfolding. These data highlight the intimate link between global and intrinsic repeat stability that governs whether folding proceeds by an observably two-state mechanism, or whether partial unfolding yields stable intermediate structures which retain sufficient stability to be populated at equilibrium. PMID:22170589
Intra-molecular cross-linking of acidic residues for protein structure studies.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kruppa, Gary Hermann; Young, Malin M.; Novak, Petr
2005-03-01
Intra-molecular cross-linking has been suggested as a method of obtaining distance constraints that would be useful in developing structural models of proteins. Recent work published on intra-molecular cross-linking for protein structural studies has employed commercially available primary amine selective reagents that can cross-link lysine residues to other lysine residues or the amino terminus. Previous work using these cross-linkers has shown that for several proteins of known structure, the number of cross-links that can be obtained experimentally may be small compared to what would be expected from the known structure, due to the relative reactivity, distribution, and solvent accessibility of themore » lysines in the protein sequence. To overcome these limitations we have investigated the use of cross-linking reagents that can react with other reactive sidechains in proteins. We used 1-Ethyl-3-(3-dimethylaminopropyl) carbodiimide hydrochloride (EDC) to activate the carboxylic acid containing residues, aspartic acid (D), glutamic acid (E), and the carboxy terminus (O), for cross-linking reactions. Once activated, the DEO sidechains can react to form 'zero-length' cross-links with nearby primary amine containing resides, lysines (K) and the amino terminus (X), via the formation of a new amide bond. We also show that the EDC-activated DEO sidechains can be cross-linked to each other using dihydrazides, two hydrazide moieties connected by an alkyl cross-linker ann of variable length. Using these reagents, we have found three new 'zero-length' cross-links in ubiquitin consistent with its known structure (M1-E16, M1-E18, and K63-E64). Using the dihydrazide cross-linkers, we have identified 2 new cross-links (D21-D32 and E24-D32) unambiguously. Using a library of dihydrazide cross-linkers with varying arm length, we have shown that there is a minimum arm length required for the DEO-DEO cross-links of 5.8 angstroms. These results show that additional structural information can be obtained by exploiting new cross-linker chemistry, increasing the probability that the protein target of choice will yield sufficient distance constraints to develop a structural model.« less
Assenberg, René; Mastrangelo, Eloise; Walter, Thomas S; Verma, Anil; Milani, Mario; Owens, Raymond J; Stuart, David I; Grimes, Jonathan M; Mancini, Erika J
2009-12-01
The flavivirus genome comprises a single strand of positive-sense RNA, which is translated into a polyprotein and cleaved by a combination of viral and host proteases to yield functional proteins. One of these, nonstructural protein 3 (NS3), is an enzyme with both serine protease and NTPase/helicase activities. NS3 plays a central role in the flavivirus life cycle: the NS3 N-terminal serine protease together with its essential cofactor NS2B is involved in the processing of the polyprotein, whereas the NS3 C-terminal NTPase/helicase is responsible for ATP-dependent RNA strand separation during replication. An unresolved question remains regarding why NS3 appears to encode two apparently disconnected functionalities within one protein. Here we report the 2.75-A-resolution crystal structure of full-length Murray Valley encephalitis virus NS3 fused with the protease activation peptide of NS2B. The biochemical characterization of this construct suggests that the protease has little influence on the helicase activity and vice versa. This finding is in agreement with the structural data, revealing a single protein with two essentially segregated globular domains. Comparison of the structure with that of dengue virus type 4 NS2B-NS3 reveals a relative orientation of the two domains that is radically different between the two structures. Our analysis suggests that the relative domain-domain orientation in NS3 is highly variable and dictated by a flexible interdomain linker. The possible implications of this conformational flexibility for the function of NS3 are discussed.
The structure of a thermophilic kinase shapes fitness upon random circular permutation
Jones, Alicia M.; Mehta, Manan M.; Thomas, Emily E.; Atkinson, Joshua T.; Segall-Shapiro, Thomas H.; Liu, Shirley; Silberg, Jonathan J.
2016-01-01
Proteins can be engineered for synthetic biology through circular permutation, a sequence rearrangement where native protein termini become linked and new termini are created elsewhere through backbone fission. However, it remains challenging to anticipate a protein’s functional tolerance to circular permutation. Here, we describe new transposons for creating libraries of randomly circularly permuted proteins that minimize peptide additions at their termini, and we use transposase mutagenesis to study the tolerance of a thermophilic adenylate kinase (AK) to circular permutation. We find that libraries expressing permuted AK with either short or long peptides amended to their N-terminus yield distinct sets of active variants and present evidence that this trend arises because permuted protein expression varies across libraries. Mapping all sites that tolerate backbone cleavage onto AK structure reveals that the largest contiguous regions of sequence that lack cleavage sites are proximal to the phosphotransfer site. A comparison of our results with a range of structure-derived parameters further showed that retention of function correlates to the strongest extent with the distance to the phosphotransfer site, amino acid variability in an AK family sequence alignment, and residue-level deviations in superimposed AK structures. Our work illustrates how permuted protein libraries can be created with minimal peptide additions using transposase mutagenesis, and they reveal a challenge of maintaining consistent expression across permuted variants in a library that minimizes peptide additions. Furthermore, these findings provide a basis for interpreting responses of thermophilic phosphotransferases to circular permutation by calibrating how different structure-derived parameters relate to retention of function in a cellular selection. PMID:26976658
Preservation of protein clefts in comparative models.
Piedra, David; Lois, Sergi; de la Cruz, Xavier
2008-01-16
Comparative, or homology, modelling of protein structures is the most widely used prediction method when the target protein has homologues of known structure. Given that the quality of a model may vary greatly, several studies have been devoted to identifying the factors that influence modelling results. These studies usually consider the protein as a whole, and only a few provide a separate discussion of the behaviour of biologically relevant features of the protein. Given the value of the latter for many applications, here we extended previous work by analysing the preservation of native protein clefts in homology models. We chose to examine clefts because of their role in protein function/structure, as they are usually the locus of protein-protein interactions, host the enzymes' active site, or, in the case of protein domains, can also be the locus of domain-domain interactions that lead to the structure of the whole protein. We studied how the largest cleft of a protein varies in comparative models. To this end, we analysed a set of 53507 homology models that cover the whole sequence identity range, with a special emphasis on medium and low similarities. More precisely we examined how cleft quality - measured using six complementary parameters related to both global shape and local atomic environment, depends on the sequence identity between target and template proteins. In addition to this general analysis, we also explored the impact of a number of factors on cleft quality, and found that the relationship between quality and sequence identity varies depending on cleft rank amongst the set of protein clefts (when ordered according to size), and number of aligned residues. We have examined cleft quality in homology models at a range of seq.id. levels. Our results provide a detailed view of how quality is affected by distinct parameters and thus may help the user of comparative modelling to determine the final quality and applicability of his/her cleft models. In addition, the large variability in model quality that we observed within each sequence bin, with good models present even at low sequence identities (between 20% and 30%), indicates that properly developed identification methods could be used to recover good cleft models in this sequence range.
Devassy, Jessay G; Wojcik, Jennifer L; Ibrahim, Naser H M; Zahradka, Peter; Taylor, Carla G; Aukema, Harold M
2017-02-01
Questions remain regarding the potential negative effects of dietary high protein (HP) on kidney health, particularly in the context of obesity in which the risk for renal disease is already increased. To examine whether some of the variability in HP effects on kidney health may be due to source of protein, obese fa/fa Zucker rats were given HP (35% of energy from protein) diets containing either casein, soy protein, or a mixed source of animal and plant proteins for 12 weeks. Control lean and obese rats were given diets containing casein at normal protein (15% of energy from protein) levels. Body weight and blood pressure were measured, and markers of renal structural changes, damage, and function were assessed. Obesity alone resulted in mild renal changes, as evidenced by higher kidney weights, proteinuria, and glomerular volumes. In obese rats, increasing the protein level using the single, but not mixed, protein sources resulted in higher renal fibrosis compared with the lean rats. The mixed-protein HP group also had lower levels of serum monocyte chemoattractant protein-1, even though this diet further increased kidney and glomerular size. Soy and mixed-protein HP diets also resulted in a small number of damaged glomeruli, while soy compared with mixed-protein HP diet delayed the increase in blood pressure over time. Since obesity itself confers added risk of renal disease, an HP diet from mixed-protein sources that enables weight loss but has fewer risks to renal health may be advantageous.
RNA regulatory networks diversified through curvature of the PUF protein scaffold
Wilinski, Daniel; Qiu, Chen; Lapointe, Christopher P.; ...
2015-09-14
Proteins bind and control mRNAs, directing their localization, translation and stability. Members of the PUF family of RNA-binding proteins control multiple mRNAs in a single cell, and play key roles in development, stem cell maintenance and memory formation. Here we identified the mRNA targets of a S. cerevisiae PUF protein, Puf5p, by ultraviolet-crosslinking-affinity purification and high-throughput sequencing (HITS-CLIP). The binding sites recognized by Puf5p are diverse, with variable spacer lengths between two specific sequences. Each length of site correlates with a distinct biological function. Crystal structures of Puf5p–RNA complexes reveal that the protein scaffold presents an exceptionally flat and extendedmore » interaction surface relative to other PUF proteins. In complexes with RNAs of different lengths, the protein is unchanged. A single PUF protein repeat is sufficient to induce broadening of specificity. Changes in protein architecture, such as alterations in curvature, may lead to evolution of mRNA regulatory networks.« less
RNA regulatory networks diversified through curvature of the PUF protein scaffold
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wilinski, Daniel; Qiu, Chen; Lapointe, Christopher P.
Proteins bind and control mRNAs, directing their localization, translation and stability. Members of the PUF family of RNA-binding proteins control multiple mRNAs in a single cell, and play key roles in development, stem cell maintenance and memory formation. Here we identified the mRNA targets of a S. cerevisiae PUF protein, Puf5p, by ultraviolet-crosslinking-affinity purification and high-throughput sequencing (HITS-CLIP). The binding sites recognized by Puf5p are diverse, with variable spacer lengths between two specific sequences. Each length of site correlates with a distinct biological function. Crystal structures of Puf5p–RNA complexes reveal that the protein scaffold presents an exceptionally flat and extendedmore » interaction surface relative to other PUF proteins. In complexes with RNAs of different lengths, the protein is unchanged. A single PUF protein repeat is sufficient to induce broadening of specificity. Changes in protein architecture, such as alterations in curvature, may lead to evolution of mRNA regulatory networks.« less
Thapliyal, Charu; Jain, Neha; Chaudhuri, Pratima
2015-01-01
A protein, differing in origin, may exhibit variable physicochemical behaviour, difference in sequence homology, fold and function. Thus studying structure-function relationship of proteins from altered sources is meaningful in the sense that it may give rise to comparative aspects of their sequence-structure-function relationship. Dihydrofolate reductase is an enzyme involved in cell cycle regulation. It is a significant enzyme as.a target for developing anticancer drugs. Hence, detailed understanding of structure-function relationships of wide variants of the enzyme dihydrofolate reductase would be important for developing an inhibitor or an antagonist against the enzyme involved in the cellular developmental processes. In this communication, we have reported the comparative structure-function relationship between E. coli and human dihydrofolate reductase. The differences in the unfolding behaviour of these two proteins have been investigated to understand various properties of these two proteins like relative' stability differences and variation in conformational changes under identical denaturing conditions. The equilibrium unfolding mechanism of dihydrofolate reductase proteins using guanidine hydrochloride as a denaturant in the presence of various types of osmolytes has been monitored using loss in enzymatic activity, intrinsic tryptophan fluorescence and an extrinsic fluorophore 8-anilino-1-naphthalene-sulfonic acid as probes. It has been observed that osmolytes, such as 1M sucrose, and 30% glycerol, provided enhanced stability to both variants of dihydrofolate reductase. Their level of stabilisation has been observed to be dependent on intrinsic protein stability. It was observed that 100 mM proline does not show any 'significant stabilisation to either of dihydrofolate reductases. In the present study, it has been observed that the human protein is relatively less stable than the E.coli counterpart.
Mimicking a p53-MDM2 interaction based on a stable immunoglobulin-like domain scaffold.
Jimenez-Sandoval, Pedro; Madrigal-Carrillo, Ezequiel A; Santamaría-Suárez, Hugo A; Maturana, Daniel; Rentería-González, Itzel; Benitez-Cardoza, Claudia G; Torres-Larios, Alfredo; Brieba, Luis G
2018-04-26
Antibodies recognize protein targets with great affinity and specificity. However, posttranslational modifications and the presence of intrinsic disulfide-bonds pose difficulties for their industrial use. The immunoglobulin fold is one of the most ubiquitous folds in nature and it is found in many proteins besides antibodies. An example of a protein family with an immunoglobulin-like fold is the Cysteine Protease Inhibitors (ICP) family I42 of the MEROPs database for protease and protease inhibitors. Members of this protein family are thermostable and do not present internal disulfide bonds. Crystal structures of several ICPs indicate that they resemble the Ig-like domain of the human T cell co-receptor CD8α As ICPs present 2 flexible recognition loops that vary accordingly to their targeted protease, we hypothesize that members of this protein family would be ideal to design peptide aptamers that mimic protein-protein interactions. Herein, we use an ICP variant from Entamoeba histolytica (EhICP1) to mimic the interaction between p53 and MDM2. We found that a 13 amino-acid peptide derived from p53 can be introduced in 2 variable loops (DE, FG) but not the third (BC). Chimeric EhICP1-p53 form a stable complex with MDM2 at a micromolar range. Crystal structure of the EhICP1-p53(FG)-loop variant in complex with MDM2 reveals a swapping subdomain between 2 chimeric molecules, however, the p53 peptide interacts with MDM2 as in previous crystal structures. The structural details of the EhICP1-p53(FG) interaction with MDM2 resemble the interaction between an antibody and MDM2. © 2018 Wiley Periodicals, Inc.
Aono, S; Bentrop, D; Bertini, I; Cosenza, G; Luchinat, C
1998-12-01
The solution structure of the D13C variant of the thermostable Fe7S8 ferredoxin from Bacillus schlegelii has been determined by 1H-NMR spectroscopy in its oxidized form. In a variable-temperature NMR study the D13C variant was as thermostable (up to 90 degrees C) as the wild-type protein (WT). Seventy-five out of 77 amino acid residues and 81% of all theoretically expected proton resonances in the D13C Fe8S8 protein have been assigned. Its structure was determined through torsion angle dynamics calculations with the program DYANA, using 935 meaningful NOEs (from a total of 1251), hydrogen bond constraints, and NMR-derived dihedral angle constraints for the cluster-ligating cysteines. Afterwards, restrained energy minimization and restrained molecular dynamics were applied to each conformer of the family. The final family of 20 structures has RMSD values from the mean structure of 0.055 nm for the backbone atoms and of 0.099 nm for all heavy atoms. The overall folding of the WT is maintained in the mutant, except for the immediate vicinity of the new cysteine, which becomes much more similar to native Fe8S8 proteins. The two residues at positions 11 and 12, which constitute an insertion with respect to all known Fe8S8 proteins, assume a conformation that does not prevent the preceding and following residues from folding like in native Fe8S8 proteins. Clear evidence for the existence of two conformations involving almost half of the amino acid residues was found. The two conformations are structurally indistinguishable. Temperature-dependent NMR experiments show that one of them is thermodynamically more stable than the other.
Structural and Functional Characterization of Reston Ebola Virus VP35 Interferon Inhibitory Domain
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leung, Daisy W.; Shabman, Reed S.; Farahbakhsh, Mina
2010-09-21
Ebolaviruses are causative agents of lethal hemorrhagic fever in humans and nonhuman primates. Among the filoviruses characterized thus far, Reston Ebola virus (REBOV) is the only Ebola virus that is nonpathogenic to humans despite the fact that REBOV can cause lethal disease in nonhuman primates. Previous studies also suggest that REBOV is less effective at inhibiting host innate immune responses than Zaire Ebola virus (ZEBOV) or Marburg virus. Virally encoded VP35 protein is critical for immune suppression, but an understanding of the relative contributions of VP35 proteins from REBOV and other filoviruses is currently lacking. In order to address thismore » question, we characterized the REBOV VP35 interferon inhibitory domain (IID) using structural, biochemical, and virological studies. These studies reveal differences in double-stranded RNA binding and interferon inhibition between the two species. These observed differences are likely due to increased stability and loss of flexibility in REBOV VP35 IID, as demonstrated by thermal shift stability assays. Consistent with this finding, the 1.71-{angstrom} crystal structure of REBOV VP35 IID reveals that it is highly similar to that of ZEBOV VP35 IID, with an overall backbone r.m.s.d. of 0.64 {angstrom}, but contains an additional helical element at the linker between the two subdomains of VP35 IID. Mutations near the linker, including swapping sequences between REBOV and ZEBOV, reveal that the linker sequence has limited tolerance for variability. Together with the previously solved ligand-free and double-stranded-RNA-bound forms of ZEBOV VP35 IID structures, our current studies on REBOV VP35 IID reinforce the importance of VP35 in immune suppression. Functional differences observed between REBOV and ZEBOV VP35 proteins may contribute to observed differences in pathogenicity, but these are unlikely to be the major determinant. However, the high level of similarity in structure and the low tolerance for sequence variability, coupled with the multiple critical roles played by Ebola virus VP35 proteins, highlight the viability of VP35 as a potential target for therapeutic development.« less
Structural and functional characterization of Reston Ebola virus VP35 interferon inhibitory domain.
Leung, Daisy W; Shabman, Reed S; Farahbakhsh, Mina; Prins, Kathleen C; Borek, Dominika M; Wang, Tianjiao; Mühlberger, Elke; Basler, Christopher F; Amarasinghe, Gaya K
2010-06-11
Ebolaviruses are causative agents of lethal hemorrhagic fever in humans and nonhuman primates. Among the filoviruses characterized thus far, Reston Ebola virus (REBOV) is the only Ebola virus that is nonpathogenic to humans despite the fact that REBOV can cause lethal disease in nonhuman primates. Previous studies also suggest that REBOV is less effective at inhibiting host innate immune responses than Zaire Ebola virus (ZEBOV) or Marburg virus. Virally encoded VP35 protein is critical for immune suppression, but an understanding of the relative contributions of VP35 proteins from REBOV and other filoviruses is currently lacking. In order to address this question, we characterized the REBOV VP35 interferon inhibitory domain (IID) using structural, biochemical, and virological studies. These studies reveal differences in double-stranded RNA binding and interferon inhibition between the two species. These observed differences are likely due to increased stability and loss of flexibility in REBOV VP35 IID, as demonstrated by thermal shift stability assays. Consistent with this finding, the 1.71-A crystal structure of REBOV VP35 IID reveals that it is highly similar to that of ZEBOV VP35 IID, with an overall backbone r.m.s.d. of 0.64 A, but contains an additional helical element at the linker between the two subdomains of VP35 IID. Mutations near the linker, including swapping sequences between REBOV and ZEBOV, reveal that the linker sequence has limited tolerance for variability. Together with the previously solved ligand-free and double-stranded-RNA-bound forms of ZEBOV VP35 IID structures, our current studies on REBOV VP35 IID reinforce the importance of VP35 in immune suppression. Functional differences observed between REBOV and ZEBOV VP35 proteins may contribute to observed differences in pathogenicity, but these are unlikely to be the major determinant. However, the high level of similarity in structure and the low tolerance for sequence variability, coupled with the multiple critical roles played by Ebola virus VP35 proteins, highlight the viability of VP35 as a potential target for therapeutic development. Copyright 2010 Elsevier Ltd. All rights reserved.
Mirrors in the PDB: left-handed alpha-turns guide design with D-amino acids.
Annavarapu, Srinivas; Nanda, Vikas
2009-09-22
Incorporating variable amino acid stereochemistry in molecular design has the potential to improve existing protein stability and create new topologies inaccessible to homochiral molecules. The Protein Data Bank has been a reliable, rich source of information on molecular interactions and their role in protein stability and structure. D-amino acids rarely occur naturally, making it difficult to infer general rules for how they would be tolerated in proteins through an analysis of existing protein structures. However, protein elements containing short left-handed turns and helices turn out to contain useful information. Molecular mechanisms used in proteins to stabilize left-handed elements by L-amino acids are structurally enantiomeric to potential synthetic strategies for stabilizing right-handed elements with D-amino acids. Propensities for amino acids to occur in contiguous alpha(L) helices correlate with published thermodynamic scales for incorporation of D-amino acids into alpha(R) helices. Two backbone rules for terminating a left-handed helix are found: an alpha(R) conformation is disfavored at the amino terminus, and a beta(R) conformation is disfavored at the carboxy terminus. Helix capping sidechain-backbone interactions are found which are unique to alpha(L) helices including an elevated propensity for L-Asn, and L-Thr at the amino terminus and L-Gln, L-Thr and L-Ser at the carboxy terminus. By examining left-handed alpha-turns containing L-amino acids, new interaction motifs for incorporating D-amino acids into right-handed alpha-helices are identified. These will provide a basis for de novo design of novel heterochiral protein folds.
Mirrors in the PDB: left-handed α-turns guide design with D-amino acids
Annavarapu, Srinivas; Nanda, Vikas
2009-01-01
Background Incorporating variable amino acid stereochemistry in molecular design has the potential to improve existing protein stability and create new topologies inaccessible to homochiral molecules. The Protein Data Bank has been a reliable, rich source of information on molecular interactions and their role in protein stability and structure. D-amino acids rarely occur naturally, making it difficult to infer general rules for how they would be tolerated in proteins through an analysis of existing protein structures. However, protein elements containing short left-handed turns and helices turn out to contain useful information. Molecular mechanisms used in proteins to stabilize left-handed elements by L-amino acids are structurally enantiomeric to potential synthetic strategies for stabilizing right-handed elements with D-amino acids. Results Propensities for amino acids to occur in contiguous αL helices correlate with published thermodynamic scales for incorporation of D-amino acids into αR helices. Two backbone rules for terminating a left-handed helix are found: an αR conformation is disfavored at the amino terminus, and a βR conformation is disfavored at the carboxy terminus. Helix capping sidechain-backbone interactions are found which are unique to αL helices including an elevated propensity for L-Asn, and L-Thr at the amino terminus and L-Gln, L-Thr and L-Ser at the carboxy terminus. Conclusion By examining left-handed α-turns containing L-amino acids, new interaction motifs for incorporating D-amino acids into right-handed α-helices are identified. These will provide a basis for de novo design of novel heterochiral protein folds. PMID:19772623
Mouillon, Jean-Marie; Gustafsson, Petter; Harryson, Pia
2006-01-01
Dehydrins constitute a class of intrinsically disordered proteins that are expressed under conditions of water-related stress. Characteristic of the dehydrins are some highly conserved stretches of seven to 17 residues that are repetitively scattered in their sequences, the K-, S-, Y-, and Lys-rich segments. In this study, we investigate the putative role of these segments in promoting structure. The analysis is based on comparative analysis of four full-length dehydrins from Arabidopsis (Arabidopsis thaliana; Cor47, Lti29, Lti30, and Rab18) and isolated peptide mimics of the K-, Y-, and Lys-rich segments. In physiological buffer, the circular dichroism spectra of the full-length dehydrins reveal overall disordered structures with a variable content of poly-Pro helices, a type of elongated secondary structure relying on bridging water molecules. Similar disordered structures are observed for the isolated peptides of the conserved segments. Interestingly, neither the full-length dehydrins nor their conserved segments are able to adopt specific structure in response to altered temperature, one of the factors that regulate their expression in vivo. There is also no structural response to the addition of metal ions, increased protein concentration, or the protein-stabilizing salt Na2SO4. Taken together, these observations indicate that the dehydrins are not in equilibrium with high-energy folded structures. The result suggests that the dehydrins are highly evolved proteins, selected to maintain high configurational flexibility and to resist unspecific collapse and aggregation. The role of the conserved segments is thus not to promote tertiary structure, but to exert their biological function more locally upon interaction with specific biological targets, for example, by acting as beads on a string for specific recognition, interaction with membranes, or intermolecular scaffolding. In this perspective, it is notable that the Lys-rich segment in Cor47 and Lti29 shows sequence similarity with the animal chaperone HSP90. PMID:16565295
Niskanen, Einari A; Hytönen, Vesa P; Grapputo, Alessandro; Nordlund, Henri R; Kulomaa, Markku S; Laitinen, Olli H
2005-01-01
Background A chicken egg contains several biotin-binding proteins (BBPs), whose complete DNA and amino acid sequences are not known. In order to identify and characterise these genes and proteins we studied chicken cDNAs and genes available in the NCBI database and chicken genome database using the reported N-terminal amino acid sequences of chicken egg-yolk BBPs as search strings. Results Two separate hits showing significant homology for these N-terminal sequences were discovered. For one of these hits, the chromosomal location in the immediate proximity of the avidin gene family was found. Both of these hits encode proteins having high sequence similarity with avidin suggesting that chicken BBPs are paralogous to avidin family. In particular, almost all residues corresponding to biotin binding in avidin are conserved in these putative BBP proteins. One of the found DNA sequences, however, seems to encode a carboxy-terminal extension not present in avidin. Conclusion We describe here the predicted properties of the putative BBP genes and proteins. Our present observations link BBP genes together with avidin gene family and shed more light on the genetic arrangement and variability of this family. In addition, comparative modelling revealed the potential structural elements important for the functional and structural properties of the putative BBP proteins. PMID:15777476
Termination and read-through proteins encoded by genome segment 9 of Colorado tick fever virus.
Mohd Jaafar, Fauziah; Attoui, Houssam; De Micco, Philippe; De Lamballerie, Xavier
2004-08-01
Genome segment 9 (Seg-9) of Colorado tick fever virus (CTFV) is 1884 bp long and contains a large open reading frame (ORF; 1845 nt in length overall), although a single in-frame stop codon (at nt 1052-1054) reduces the ORF coding capacity by approximately 40 %. However, analyses of highly conserved RNA sequences in the vicinity of the stop codon indicate that it belongs to a class of 'leaky terminators'. The third nucleotide positions in codons situated both before and after the stop codon, shows the highest variability, suggesting that both regions are translated during virus replication. This also suggests that the stop signal is functionally leaky, allowing read-through translation to occur. Indeed, both the truncated 'termination' protein and the full-length 'read-through' protein (VP9 and VP9', respectively) were detected in CTFV-infected cells, in cells transfected with a plasmid expressing only Seg-9 protein products, and in the in vitro translation products from undenatured Seg-9 ssRNA. The ratios of full-length and truncated proteins generated suggest that read-through may be down-regulated by other viral proteins. Western blot analysis of infected cells and purified CTFV showed that VP9 is a structural component of the virion, while VP9' is a non-structural protein.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stevens, F. J.; Pokkuluri, P. R.; Schiffer, M.
2000-12-19
The antibody light chain variable domain (V{sub L}){sup 1} and myelin protein zero (MPZ) are representatives of the functionally diverse immunoglobulin superfamily. The V{sub L} is a subunit of the antigen-binding component of antibodies, while MPZ is the major membrane-linked constituent of the myelin sheaths that coat peripheral nerves. Despite limited amino acid sequence homology, the conformations of the core structures of the two proteins are largely superimposable. Amino acid variations in V{sub L} account for various conformational disease outcomes, including amyloidosis. However, the specific amino acid changes in V{sub L} that are responsible for disease have been obscured bymore » multiple concurrent primary structure alterations. Recently, certain demyelination disorders have been linked to point mutations and single amino acid polymorphisms in MPZ. We demonstrate here that some pathogenic variations in MPZ correspond to changes suspected of determining amyloidosis in V{sub L}. This unanticipated observation suggests that studies of the biophysical origin of conformational disease in one member of a superfamily of homologous proteins may have implications throughout the superfamily. In some cases, findings may account for overt disease; in other cases, due to the natural repertoire of inherited polymorphisms, variations in a representative protein may predict subclinical impairment of homologous proteins.« less
Oliva, Maria Luiza Vilela; Ferreira, Rodrigo da Silva; Ferreira, Joana Gasperazzo; de Paula, Cláudia Alessandra Andrade; Salas, Carlos E; Sampaio, Misako Uemura
2011-08-01
Seed proteins that inhibit proteinases are classified in families based on amino acid sequence similarity, nature of reactive site and mechanism of action, and are used as tools for investigating proteinases in physiological and pathological events. More recently, the plant Kunitz family of inhibitors with two disulphide bridges was enlarged with members containing variable number of cysteine residues, ranging from no cysteine at all to more than four residues. The characteristic of these proteins, as well the interactions with their target proteinases, are briefly discussed.
Frahm, Grant E; Smith, Daryl G S; Kane, Anita; Lorbetskie, Barry; Cyr, Terry D; Girard, Michel; Johnston, Michael J W
2014-01-01
The use of different expression systems to produce the same recombinant human protein can result in expression-dependent chemical modifications (CMs) leading to variability of structure, stability and immunogenicity. Of particular interest are recombinant human proteins expressed in plant-based systems, which have shown particularly high CM variability. In studies presented here, recombinant human serum albumins (rHSA) produced in Oryza sativa (Asian rice) (OsrHSA) from a number of suppliers have been extensively characterized and compared to plasma-derived HSA (pHSA) and rHSA expressed in yeast (Pichia pastoris and Saccharomyces cerevisiae). The heterogeneity of each sample was evaluated using size exclusion chromatography (SEC), reversed-phase high-performance liquid chromatography (RP-HPLC) and capillary electrophoresis (CE). Modifications of the samples were identified by liquid chromatography-mass spectrometry (LC-MS). The secondary and tertiary structure of the albumin samples were assessed with far U/V circular dichroism spectropolarimetry (far U/V CD) and fluorescence spectroscopy, respectively. Far U/V CD and fluorescence analyses were also used to assess thermal stability and drug binding. High molecular weight aggregates in OsrHSA samples were detected with SEC and supplier-to-supplier variability and, more critically, lot-to-lot variability in one manufactures supplied products were identified. LC-MS analysis identified a greater number of hexose-glycated arginine and lysine residues on OsrHSA compared to pHSA or rHSA expressed in yeast. This analysis also showed supplier-to-supplier and lot-to-lot variability in the degree of glycation at specific lysine and arginine residues for OsrHSA. Both the number of glycated residues and the degree of glycation correlated positively with the quantity of non-monomeric species and the chromatographic profiles of the samples. Tertiary structural changes were observed for most OsrHSA samples which correlated well with the degree of arginine/lysine glycation. The extensive glycation of OsrHSA from multiple suppliers may have further implications for the use of OsrHSA as a therapeutic product.
de Juan-Franco, Elena; Caruz, Antonio; Pedrajas, J R; Lechuga, Laura M
2013-04-07
We have implemented a novel strategy for the oriented immobilization of antibodies onto a gold surface based on the use of a fusion protein, the protein A-gold binding domain (PAG). PAG consists of a gold binding peptide (GBP) coupled to the immunoglobulin-binding domains of staphylococcal protein A. This fusion protein provides an easy and fast oriented immobilization of antibodies preserving its native structure, while leaving the antigen binding sites (Fab) freely exposed. Using this immobilization strategy, we have demonstrated the performance of the immunosensing of the human Growth Hormone by SPR. A limit of detection of 90 ng mL(-1) was obtained with an inter-chip variability lower than 7%. The comparison of this method with other strategies for the direct immobilization of antibodies over gold surfaces has showed the enhanced sensitivity provided by the PAG approach.
Hardies, Stephen C; Thomas, Julie A; Black, Lindsay; Weintraub, Susan T; Hwang, Chung Y; Cho, Byung C
2016-02-01
The virion proteins of Pseudoalteromonas phage φRIO-1 were identified and quantitated by mass spectrometry and gel densitometry. Bioinformatic methods customized to deal with extreme divergence defined a φRIO-1 tail structure homology group of phages, which was further related to T7 tail and internal virion proteins (IVPs). Similarly, homologs of tubular tail components and internal virion proteins were identified in essentially all completely sequenced podoviruses other than those in the subfamily Picovirinae. The podoviruses were subdivided into several tail structure homology groups, in addition to the RIO-1 and T7 groups. Molecular phylogeny indicated that these groups all arose about the same ancient time as the φRIO-1/T7 split. Hence, the T7-like infection mechanism involving the IVPs was an ancestral property of most podoviruses. The IVPs were found to variably host both tail lysozyme domains and domains destined for the cytoplasm, including the N4 virion RNA polymerase embedded within an IVP-D homolog. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Protein Modelling: What Happened to the “Protein Structure Gap”?
Schwede, Torsten
2013-01-01
Computational modeling and prediction of three-dimensional macromolecular structures and complexes from their sequence has been a long standing vision in structural biology as it holds the promise to bypass part of the laborious process of experimental structure solution. Over the last two decades, a paradigm shift has occurred: starting from a situation where the “structure knowledge gap” between the huge number of protein sequences and small number of known structures has hampered the widespread use of structure-based approaches in life science research, today some form of structural information – either experimental or computational – is available for the majority of amino acids encoded by common model organism genomes. Template based homology modeling techniques have matured to a point where they are now routinely used to complement experimental techniques. With the scientific focus of interest moving towards larger macromolecular complexes and dynamic networks of interactions, the integration of computational modeling methods with low-resolution experimental techniques allows studying large and complex molecular machines. Computational modeling and prediction techniques are still facing a number of challenges which hamper the more widespread use by the non-expert scientist. For example, it is often difficult to convey the underlying assumptions of a computational technique, as well as the expected accuracy and structural variability of a specific model. However, these aspects are crucial to understand the limitations of a model, and to decide which interpretations and conclusions can be supported. PMID:24010712
Use of the gram stain in microbiology.
Beveridge, T J
2001-05-01
The Gram stain differentiates bacteria into two fundamental varieties of cells. Bacteria that retain the initial crystal violet stain (purple) are said to be "gram-positive," whereas those that are decolorized and stain red with carbol fuchsin (or safranin) are said to be "gram-negative." This staining response is based on the chemical and structural makeup of the cell walls of both varieties of bacteria. Gram-positives have a thick, relatively impermeable wall that resists decolorization and is composed of peptidoglycan and secondary polymers. Gram-negatives have a thin peptidoglycan layer plus an overlying lipid-protein bilayer known as the outer membrane, which can be disrupted by decolorization. Some bacteria have walls of intermediate structure and, although they are officially classified as gram-positives because of their linage, they stain in a variable manner. One prokaryote domain, the Archaea, have such variability of wall structure that the Gram stain is not a useful differentiating tool.
Conserved and variable domains of RNase MRP RNA.
Dávila López, Marcela; Rosenblad, Magnus Alm; Samuelsson, Tore
2009-01-01
Ribonuclease MRP is a eukaryotic ribonucleoprotein complex consisting of one RNA molecule and 7-10 protein subunits. One important function of MRP is to catalyze an endonucleolytic cleavage during processing of rRNA precursors. RNase MRP is evolutionary related to RNase P which is critical for tRNA processing. A large number of MRP RNA sequences that now are available have been used to identify conserved primary and secondary structure features of the molecule. MRP RNA has structural features in common with P RNA such as a conserved catalytic core, but it also has unique features and is characterized by a domain highly variable between species. Information regarding primary and secondary structure features is of interest not only in basic studies of the function of MRP RNA, but also because mutations in the RNA give rise to human genetic diseases such as cartilage-hair hypoplasia.
Parametric models to compute tryptophan fluorescence wavelengths from classical protein simulations.
Lopez, Alvaro J; Martínez, Leandro
2018-02-26
Fluorescence spectroscopy is an important method to study protein conformational dynamics and solvation structures. Tryptophan (Trp) residues are the most important and practical intrinsic probes for protein fluorescence due to the variability of their fluorescence wavelengths: Trp residues emit in wavelengths ranging from 308 to 360 nm depending on the local molecular environment. Fluorescence involves electronic transitions, thus its computational modeling is a challenging task. We show that it is possible to predict the wavelength of emission of a Trp residue from classical molecular dynamics simulations by computing the solvent-accessible surface area or the electrostatic interaction between the indole group and the rest of the system. Linear parametric models are obtained to predict the maximum emission wavelengths with standard errors of the order 5 nm. In a set of 19 proteins with emission wavelengths ranging from 308 to 352 nm, the best model predicts the maximum wavelength of emission with a standard error of 4.89 nm and a quadratic Pearson correlation coefficient of 0.81. These models can be used for the interpretation of fluorescence spectra of proteins with multiple Trp residues, or for which local Trp environmental variability exists and can be probed by classical molecular dynamics simulations. © 2018 Wiley Periodicals, Inc. © 2018 Wiley Periodicals, Inc.
Subota, Ines; Julkowska, Daria; Vincensini, Laetitia; Reeg, Nele; Buisson, Johanna; Blisnick, Thierry; Huet, Diego; Perrot, Sylvie; Santi-Rocca, Julien; Duchateau, Magalie; Hourdel, Véronique; Rousselle, Jean-Claude; Cayet, Nadège; Namane, Abdelkader; Chamot-Rooke, Julia; Bastin, Philippe
2014-01-01
Cilia and flagella are complex organelles made of hundreds of proteins of highly variable structures and functions. Here we report the purification of intact flagella from the procyclic stage of Trypanosoma brucei using mechanical shearing. Structural preservation was confirmed by transmission electron microscopy that showed that flagella still contained typical elements such as the membrane, the axoneme, the paraflagellar rod, and the intraflagellar transport particles. It also revealed that flagella severed below the basal body, and were not contaminated by other cytoskeletal structures such as the flagellar pocket collar or the adhesion zone filament. Mass spectrometry analysis identified a total of 751 proteins with high confidence, including 88% of known flagellar components. Comparison with the cell debris fraction revealed that more than half of the flagellum markers were enriched in flagella and this enrichment criterion was taken into account to identify 212 proteins not previously reported to be associated to flagella. Nine of these were experimentally validated including a 14-3-3 protein not yet reported to be associated to flagella and eight novel proteins termed FLAM (FLAgellar Member). Remarkably, they localized to five different subdomains of the flagellum. For example, FLAM6 is restricted to the proximal half of the axoneme, no matter its length. In contrast, FLAM8 is progressively accumulating at the distal tip of growing flagella and half of it still needs to be added after cell division. A combination of RNA interference and Fluorescence Recovery After Photobleaching approaches demonstrated very different dynamics from one protein to the other, but also according to the stage of construction and the age of the flagellum. Structural proteins are added to the distal tip of the elongating flagellum and exhibit slow turnover whereas membrane proteins such as the arginine kinase show rapid turnover without a detectible polarity. PMID:24741115
A simple and fast heuristic for protein structure comparison.
Pelta, David A; González, Juan R; Moreno Vega, Marcos
2008-03-25
Protein structure comparison is a key problem in bioinformatics. There exist several methods for doing protein comparison, being the solution of the Maximum Contact Map Overlap problem (MAX-CMO) one of the alternatives available. Although this problem may be solved using exact algorithms, researchers require approximate algorithms that obtain good quality solutions using less computational resources than the formers. We propose a variable neighborhood search metaheuristic for solving MAX-CMO. We analyze this strategy in two aspects: 1) from an optimization point of view the strategy is tested on two different datasets, obtaining an error of 3.5%(over 2702 pairs) and 1.7% (over 161 pairs) with respect to optimal values; thus leading to high accurate solutions in a simpler and less expensive way than exact algorithms; 2) in terms of protein structure classification, we conduct experiments on three datasets and show that is feasible to detect structural similarities at SCOP's family and CATH's architecture levels using normalized overlap values. Some limitations and the role of normalization are outlined for doing classification at SCOP's fold level. We designed, implemented and tested.a new tool for solving MAX-CMO, based on a well-known metaheuristic technique. The good balance between solution's quality and computational effort makes it a valuable tool. Moreover, to the best of our knowledge, this is the first time the MAX-CMO measure is tested at SCOP's fold and CATH's architecture levels with encouraging results.
Identification of the protein folding transition state from molecular dynamics trajectories
NASA Astrophysics Data System (ADS)
Muff, S.; Caflisch, A.
2009-03-01
The rate of protein folding is governed by the transition state so that a detailed characterization of its structure is essential for understanding the folding process. In vitro experiments have provided a coarse-grained description of the folding transition state ensemble (TSE) of small proteins. Atomistic details could be obtained by molecular dynamics (MD) simulations but it is not straightforward to extract the TSE directly from the MD trajectories, even for small peptides. Here, the structures in the TSE are isolated by the cut-based free-energy profile (cFEP) using the network whose nodes and links are configurations sampled by MD and direct transitions among them, respectively. The cFEP is a barrier-preserving projection that does not require arbitrarily chosen progress variables. First, a simple two-dimensional free-energy surface is used to illustrate the successful determination of the TSE by the cFEP approach and to explain the difficulty in defining boundary conditions of the Markov state model for an entropically stabilized free-energy minimum. The cFEP is then used to extract the TSE of a β-sheet peptide with a complex free-energy surface containing multiple basins and an entropic region. In contrast, Markov state models with boundary conditions defined by projected variables and conventional histogram-based free-energy profiles are not able to identify the TSE of the β-sheet peptide.
Structural Basis of TLR5-Flagellin Recognition and Signaling
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yoon, Sung-il; Kurnasov, Oleg; Natarajan, Venkatesh
2012-03-01
Toll-like receptor 5 (TLR5) binding to bacterial flagellin activates signaling through the transcription factor NF-{kappa}B and triggers an innate immune response to the invading pathogen. To elucidate the structural basis and mechanistic implications of TLR5-flagellin recognition, we determined the crystal structure of zebrafish TLR5 (as a variable lymphocyte receptor hybrid protein) in complex with the D1/D2/D3 fragment of Salmonella flagellin, FliC, at 2.47 angstrom resolution. TLR5 interacts primarily with the three helices of the FliC D1 domain using its lateral side. Two TLR5-FliC 1:1 heterodimers assemble into a 2:2 tail-to-tail signaling complex that is stabilized by quaternary contacts of themore » FliC D1 domain with the convex surface of the opposing TLR5. The proposed signaling mechanism is supported by structure-guided mutagenesis and deletion analyses on CBLB502, a therapeutic protein derived from FliC.« less
Structure-based design of broadly protective group a streptococcal M protein-based vaccines.
Dale, James B; Smeesters, Pierre R; Courtney, Harry S; Penfound, Thomas A; Hohn, Claudia M; Smith, Jeremy C; Baudry, Jerome Y
2017-01-03
A major obstacle to the development of broadly protective M protein-based group A streptococcal (GAS) vaccines is the variability within the N-terminal epitopes that evoke potent bactericidal antibodies. The concept of M type-specific protective immune responses has recently been challenged based on the observation that multivalent M protein vaccines elicited cross-reactive bactericidal antibodies against a number of non-vaccine M types of GAS. Additionally, a new "cluster-based" typing system of 175M proteins identified a limited number of clusters containing closely related M proteins. In the current study, we used the emm cluster typing system, in combination with computational structure-based peptide modeling, as a novel approach to the design of potentially broadly protective M protein-based vaccines. M protein sequences (AA 16-50) from the E4 cluster containing 17 emm types of GAS were analyzed using de novo 3-D structure prediction tools and the resulting structures subjected to chemical diversity analysis to identify sequences that were the most representative of the 3-D physicochemical properties of the M peptides in the cluster. Five peptides that spanned the range of physicochemical attributes of all 17 peptides were used to formulate synthetic and recombinant vaccines. Rabbit antisera were assayed for antibodies that cross-reacted with E4 peptides and whole bacteria by ELISA and for bactericidal activity against all E4GAS. The synthetic vaccine rabbit antisera reacted with all 17 E4M peptides and demonstrated bactericidal activity against 15/17 E4GAS. A recombinant hybrid vaccine containing the same E4 peptides also elicited antibodies that cross-reacted with all E4M peptides. Comprehensive studies using structure-based design may result in a broadly protective M peptide vaccine that will elicit cluster-specific and emm type-specific antibody responses against the majority of clinically relevant emm types of GAS. Copyright © 2016 Elsevier Ltd. All rights reserved.
Immormino, Robert M; Silversmith, Ruth E; Bourret, Robert B
2016-10-04
Two-component regulatory systems, minimally composed of a sensor kinase and a response regulator protein, are common mediators of signal transduction in microorganisms. All response regulators contain a receiver domain with conserved active site residues that catalyze the signal activating and deactivating phosphorylation and dephosphorylation reactions. We explored the impact of variable active site position T+1 (one residue C-terminal to the conserved Thr/Ser) on reaction kinetics and signaling fidelity, using wild type and mutant Escherichia coli CheY, CheB, and NarL to represent the three major sequence classes observed across response regulators: Ala/Gly, Ser/Thr, and Val/Ile/Met, respectively, at T+1. Biochemical and structural data together suggested that different amino acids at T+1 impacted reaction kinetics by altering access to the active site while not perturbing overall protein structure. A given amino acid at position T+1 had similar effects on autodephosphorylation in each protein background tested, likely by modulating access of the attacking water molecule to the active site. Similarly, rate constants for CheY autophosphorylation with three different small molecule phosphodonors were consistent with the steric constraints on access to the phosphorylation site arising from combination of specific phosphodonors with particular amino acids at T+1. Because other variable active site residues also influence response regulator phosphorylation biochemistry, we began to explore how context (here, the amino acid at T+2) affected the influence of position T+1 on CheY autocatalytic reactions. Finally, position T+1 affected the fidelity and kinetics of phosphotransfer between sensor kinases and response regulators but was not a primary determinant of their interaction.
Alvarez-Cabrera, Ana L.; Delgado, Sandra; Gil-Carton, David; Mortuza, Gulnahar B.; Montoya, Guillermo; Sorzano, Carlos O. S.; Tang, Tang K.; Carazo, Jose M.
2017-01-01
Centrosomal P4.1-associated protein (CPAP) is a cell cycle regulated protein fundamental for centrosome assembly and centriole elongation. In humans, the region between residues 897–1338 of CPAP mediates interactions with other proteins and includes a homodimerization domain. CPAP mutations cause primary autosomal recessive microcephaly and Seckel syndrome. Despite of the biological/clinical relevance of CPAP, its mechanistic behavior remains unclear and its C-terminus (the G-box/TCP domain) is the only part whose structure has been solved. This situation is perhaps due in part to the challenges that represent obtaining the protein in a soluble, homogeneous state for structural studies. Our work constitutes a systematic structural analysis on multiple oligomers of HsCPAP897−1338, using single-particle electron microscopy (EM) of negatively stained (NS) samples. Based on image classification into clearly different regular 3D maps (putatively corresponding to dimers and tetramers) and direct observation of individual images representing other complexes of HsCPAP897−1338 (i.e., putative flexible monomers and higher-order multimers), we report a dynamic oligomeric behavior of this protein, where different homo-oligomers coexist in variable proportions. We propose that dimerization of the putative homodimer forms a putative tetramer which could be the structural unit for the scaffold that either tethers the pericentriolar material to centrioles or promotes procentriole elongation. A coarse fitting of atomic models into the NS 3D maps at resolutions around 20 Å is performed only to complement our experimental data, allowing us to hypothesize on the oligomeric composition of the different complexes. In this way, the current EM work represents an initial step toward the structural characterization of different oligomers of CPAP, suggesting further insights to understand how this protein works, contributing to the elucidation of control mechanisms for centriole biogenesis. PMID:28396859
Truan, Daphné; Bjelić, Saša; Li, Xiao-Dan; Winkler, Fritz K
2014-07-29
The trimeric PII signal transduction proteins regulate the function of a variety of target proteins predominantly involved in nitrogen metabolism. ATP, ADP and 2-oxoglutarate (2-OG) are key effector molecules influencing PII binding to targets. Studies of PII proteins have established that the 20-residue T-loop plays a central role in effector sensing and target binding. However, the specific effects of effector binding on T-loop conformation have remained poorly documented. We present eight crystal structures of the Azospirillum brasilense PII protein GlnZ, six of which are cocrystallized and liganded with ADP or ATP. We find that interaction with the diphosphate moiety of bound ADP constrains the N-terminal part of the T-loop in a characteristic way that is maintained in ADP-promoted complexes with target proteins. In contrast, the interactions with the triphosphate moiety in ATP complexes are much more variable and no single predominant interaction mode is apparent except for the ternary MgATP/2-OG complex. These conclusions can be extended to most investigated PII proteins of the GlnB/GlnK subfamily. Unlike reported for other PII proteins, microcalorimetry reveals no cooperativity between the three binding sites of GlnZ trimers for any of the three effectors under carefully controlled experimental conditions. Copyright © 2014 Elsevier Ltd. All rights reserved.
Structural diversity of domain superfamilies in the CATH database.
Reeves, Gabrielle A; Dallman, Timothy J; Redfern, Oliver C; Akpor, Adrian; Orengo, Christine A
2006-07-14
The CATH database of domain structures has been used to explore the structural variation of homologous domains in 294 well populated domain structure superfamilies, each containing at least three sequence diverse relatives. Our analyses confirm some previously detected trends relating sequence divergence to structural variation but for a much larger dataset and in some superfamilies the new data reveal exceptional structural variation. Use of a new algorithm (2DSEC) to analyse variability in secondary structure compositions across a superfamily sheds new light on how structures evolve. 2DSEC detects inserted secondary structures that embellish the core of conserved secondary structures found throughout the superfamily. Analysis showed that for 56% of highly populated superfamilies (>9 sequence diverse relatives), there are twofold or more increases in the numbers of secondary structures in some relatives. In some families fivefold increases occur, sometimes modifying the fold of the domain. Manual inspection of secondary structure insertions or embellishments in 48 particularly variable superfamilies revealed that although these insertions were usually discontiguous in the sequence they were often co-located in 3D resulting in a larger structural motif that often modified the geometry of the active site or the surface conformation promoting diverse domain partnerships and protein interactions. These observations, supported by automatic analysis of all well populated CATH families, suggest that accretion of small secondary structure insertions may provide a simple mechanism for evolving new functions in diverse relatives. Some layered domain architectures (e.g. mainly-beta and alpha-beta sandwiches) that recur highly in the genomes more frequently exploit these types of embellishments to modify function. In these architectures, aggregation occurs most often at the edges, top or bottom of the beta-sheets. Information on structural variability across domain superfamilies has been made available through the CATH Dictionary of Homologous Structures (DHS).
Weljie, Aalim M; Gagné, Stéphane M; Vogel, Hans J
2004-12-07
Ca(2+)-dependent protein kinases (CDPKs) are vital Ca(2+)-signaling proteins in plants and protists which have both a kinase domain and a self-contained calcium regulatory calmodulin-like domain (CLD). Despite being very similar to CaM (>40% identity) and sharing the same fold, recent biochemical and structural evidence suggests that the behavior of CLD is distinct from its namesake, calmodulin. In this study, NMR spectroscopy is employed to examine the structure and backbone dynamics of a 168 amino acid Ca(2+)-saturated construct of the CLD (NtH-CLD) in which almost the entire C-terminal domain is exchange broadened and not visible in the NMR spectra. Structural characterization of the N-terminal domain indicates that the first Ca(2+)-binding loop is significantly more open than in a recently reported structure of the CLD complexed with a putative intramolecular binding region (JD) in the CDPK. Backbone dynamics suggest that parts of the third helix exhibit unusually high mobility, and significant exchange, consistent with previous findings that this helix interacts with the C-terminal domain. Dynamics data also show that the "tether" region, consisting of the first 11 amino acids of CLD, is highly mobile and these residues exhibit distinctive beta-type secondary structure, which may help to position the JD and CLD. Finally, the unusual global dynamic behavior of the protein is rationalized on the basis of possible interdomain rearrangements and the highly variable environments of the C- and N-terminal domains.
STCRDab: the structural T-cell receptor database
de Oliveira, Saulo H P; Krawczyk, Konrad
2018-01-01
Abstract The Structural T–cell Receptor Database (STCRDab; http://opig.stats.ox.ac.uk/webapps/stcrdab) is an online resource that automatically collects and curates TCR structural data from the Protein Data Bank. For each entry, the database provides annotations, such as the α/β or γ/δ chain pairings, major histocompatibility complex details, and where available, antigen binding affinities. In addition, the orientation between the variable domains and the canonical forms of the complementarity-determining region loops are also provided. Users can select, view, and download individual or bulk sets of structures based on these criteria. Where available, STCRDab also finds antibody structures that are similar to TCRs, helping users explore the relationship between TCRs and antibodies. PMID:29087479
SAbPred: a structure-based antibody prediction server
Dunbar, James; Krawczyk, Konrad; Leem, Jinwoo; Marks, Claire; Nowak, Jaroslaw; Regep, Cristian; Georges, Guy; Kelm, Sebastian; Popovic, Bojana; Deane, Charlotte M.
2016-01-01
SAbPred is a server that makes predictions of the properties of antibodies focusing on their structures. Antibody informatics tools can help improve our understanding of immune responses to disease and aid in the design and engineering of therapeutic molecules. SAbPred is a single platform containing multiple applications which can: number and align sequences; automatically generate antibody variable fragment homology models; annotate such models with estimated accuracy alongside sequence and structural properties including potential developability issues; predict paratope residues; and predict epitope patches on protein antigens. The server is available at http://opig.stats.ox.ac.uk/webapps/sabpred. PMID:27131379
Geoghegan, James C.; Fleming, Ryan; Damschroder, Melissa; Bishop, Steven M.; Sathish, Hasige A.; Esfandiary, Reza
2016-01-01
ABSTRACT Undesired solution behaviors such as reversible self-association (RSA), high viscosity, and liquid-liquid phase separation can introduce substantial challenges during development of monoclonal antibody formulations. Although a global mechanistic understanding of RSA (i.e., native and reversible protein-protein interactions) is sufficient to develop robust formulation controls, its mitigation via protein engineering requires knowledge of the sites of protein-protein interactions. In the study reported here, we coupled our previous hydrogen-deuterium exchange mass spectrometry findings with structural modeling and in vitro screening to identify the residues responsible for RSA of a model IgG1 monoclonal antibody (mAb-C), and rationally engineered variants with improved solution properties (i.e., reduced RSA and viscosity). Our data show that mutation of either solvent-exposed aromatic residues within the heavy and light chain variable regions or buried residues within the heavy chain/light chain interface can significantly mitigate RSA and viscosity by reducing the IgG's surface hydrophobicity. The engineering strategy described here highlights the utility of integrating complementary experimental and in silico methods to identify mutations that can improve developability, in particular, high concentration solution properties, of candidate therapeutic antibodies. PMID:27050875
Mutations that Cause Human Disease: A Computational/Experimental Approach
DOE Office of Scientific and Technical Information (OSTI.GOV)
Beernink, P; Barsky, D; Pesavento, B
International genome sequencing projects have produced billions of nucleotides (letters) of DNA sequence data, including the complete genome sequences of 74 organisms. These genome sequences have created many new scientific opportunities, including the ability to identify sequence variations among individuals within a species. These genetic differences, which are known as single nucleotide polymorphisms (SNPs), are particularly important in understanding the genetic basis for disease susceptibility. Since the report of the complete human genome sequence, over two million human SNPs have been identified, including a large-scale comparison of an entire chromosome from twenty individuals. Of the protein coding SNPs (cSNPs), approximatelymore » half leads to a single amino acid change in the encoded protein (non-synonymous coding SNPs). Most of these changes are functionally silent, while the remainder negatively impact the protein and sometimes cause human disease. To date, over 550 SNPs have been found to cause single locus (monogenic) diseases and many others have been associated with polygenic diseases. SNPs have been linked to specific human diseases, including late-onset Parkinson disease, autism, rheumatoid arthritis and cancer. The ability to predict accurately the effects of these SNPs on protein function would represent a major advance toward understanding these diseases. To date several attempts have been made toward predicting the effects of such mutations. The most successful of these is a computational approach called ''Sorting Intolerant From Tolerant'' (SIFT). This method uses sequence conservation among many similar proteins to predict which residues in a protein are functionally important. However, this method suffers from several limitations. First, a query sequence must have a sufficient number of relatives to infer sequence conservation. Second, this method does not make use of or provide any information on protein structure, which can be used to understand how an amino acid change affects the protein. The experimental methods that provide the most detailed structural information on proteins are X-ray crystallography and NMR spectroscopy. However, these methods are labor intensive and currently cannot be carried out on a genomic scale. Nonetheless, Structural Genomics projects are being pursued by more than a dozen groups and consortia worldwide and as a result the number of experimentally determined structures is rising exponentially. Based on the expectation that protein structures will continue to be determined at an ever-increasing rate, reliable structure prediction schemes will become increasingly valuable, leading to information on protein function and disease for many different proteins. Given known genetic variability and experimentally determined protein structures, can we accurately predict the effects of single amino acid substitutions? An objective assessment of this question would involve comparing predicted and experimentally determined structures, which thus far has not been rigorously performed. The completed research leveraged existing expertise at LLNL in computational and structural biology, as well as significant computing resources, to address this question.« less
Stoltenburg, Regina; Krafčiková, Petra; Víglaský, Viktor; Strehlitz, Beate
2016-09-21
Aptamers for whole cell detection are selected mostly by the Cell-SELEX procedure. Alternatively, the use of specific cell surface epitopes as target during aptamer selections allows the development of aptamers with ability to bind whole cells. In this study, we integrated a formerly selected Protein A-binding aptamer PA#2/8 in an assay format called ELONA (Enzyme-Linked OligoNucleotide Assay) and evaluated the ability of the aptamer to recognise and bind to Staphylococcus aureus presenting Protein A on the cell surface. The full-length aptamer and one of its truncated variants could be demonstrated to specifically bind to Protein A-expressing intact cells of S. aureus, and thus have the potential to expand the portfolio of aptamers that can act as an analytical agent for the specific recognition and rapid detection of the bacterial pathogen. The functionality of the aptamer was found to be based on a very complex, but also highly variable structure. Two structural key elements were identified. The aptamer sequence contains several G-clusters allowing folding into a G-quadruplex structure with the potential of dimeric and multimeric assembly. An inverted repeat able to form an imperfect stem-loop at the 5'-end also contributes essentially to the aptameric function.
Stoltenburg, Regina; Krafčiková, Petra; Víglaský, Viktor; Strehlitz, Beate
2016-01-01
Aptamers for whole cell detection are selected mostly by the Cell-SELEX procedure. Alternatively, the use of specific cell surface epitopes as target during aptamer selections allows the development of aptamers with ability to bind whole cells. In this study, we integrated a formerly selected Protein A-binding aptamer PA#2/8 in an assay format called ELONA (Enzyme-Linked OligoNucleotide Assay) and evaluated the ability of the aptamer to recognise and bind to Staphylococcus aureus presenting Protein A on the cell surface. The full-length aptamer and one of its truncated variants could be demonstrated to specifically bind to Protein A-expressing intact cells of S. aureus, and thus have the potential to expand the portfolio of aptamers that can act as an analytical agent for the specific recognition and rapid detection of the bacterial pathogen. The functionality of the aptamer was found to be based on a very complex, but also highly variable structure. Two structural key elements were identified. The aptamer sequence contains several G-clusters allowing folding into a G-quadruplex structure with the potential of dimeric and multimeric assembly. An inverted repeat able to form an imperfect stem-loop at the 5′-end also contributes essentially to the aptameric function. PMID:27650576
Sharma, Alok K; Krieger, Tobias; Rigby, Alan C; Zelikovic, Israel; Alper, Seth L
2016-12-01
Mutations in the human SLC26A4/Pendrin polypeptide (hPDS) cause Pendred Syndrome /DFNB4, syndromic deafness with enlargement of the vestibular aqueduct and low-penetrance goiter. Here we present data on cloning, protein overexpression and purification, refolding, and biophysical characterization of the recombinant hPDS STAS domain lacking its intrinsic variable sequence (STAS-ΔIVS). We report a reproducible protein refolding protocol enabling milligram scale expression and purification of uniformly 15 N- and 13 C /15 N-enriched hPDS STAS-ΔIVS domain suitable for structural characterization by solution NMR. Circular dichroism, one-dimensional 1 H, two-dimensional 1 H- 15 N HSQC, and 1 H- 13 C HSQC NMR spectra confirmed the well-folded state of purified hPDS STAS-ΔIVS in solution. Heteronuclear NMR chemical shift perturbation of select STAS-ΔIVS residues by GDP was observed at fast-to-intermediate NMR time scales. Intrinsic tryptophan fluorescence quench experiments demonstrated GDP binding to hPDS STAS-ΔIVS with K d of 178 μM. These results are useful for structure/function characterization of hPDS STAS, the cytoplasmic subdomain of the congenital deafness protein, pendrin, as well as for studies of other mammalian STAS domains.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yanez, M.E.; Korotkov, K.V.; Abendroth, J.
2009-05-28
Type II secretion systems (T2SS) translocate virulence factors from the periplasmic space of many pathogenic bacteria into the extracellular environment. The T2SS of Vibrio cholerae and related species is called the extracellular protein secretion (Eps) system that consists of a core of multiple copies of 11 different proteins. The pseudopilins, EpsG, EpsH, EpsI, EpsJ and EpsK, are five T2SS proteins that are thought to assemble into a pseudopilus, which is assumed to interact with the outer membrane pore, and may actively participate in the export of proteins. We report here biochemical evidence that the minor pseudopilins EpsI and EpsJ frommore » Vibrio species interact directly with one another. Moreover, the 2.3 {angstrom} resolution crystal structure of a complex of EspI and EpsJ from Vibrio vulnificus represents the first atomic resolution structure of a complex of two different pseudopilin components from the T2SS. Both EpsI and EpsJ appear to be structural extremes within the family of type 4a pilin structures solved to date, with EpsI having the smallest, and EpsJ the largest, 'variable pilin segment' seen thus far. A high degree of sequence conservation in the EpsI:EpsJ interface indicates that this heterodimer occurs in the T2SS of a large number of bacteria. The arrangement of EpsI and EpsJ in the heterodimer would correspond to a right-handed helical character of proteins assembled into a pseudopilus.« less
Gretes, Michael; Lim, Daniel C; de Castro, Liza; Jensen, Susan E; Kang, Sung Gyun; Lee, Kye Joon; Strynadka, Natalie C J
2009-06-05
Beta-lactamase inhibitory protein (BLIP) binds a variety of beta-lactamase enzymes with wide-ranging specificity. Its binding mechanism and interface interactions are a well-established model system for the characterization of protein-protein interactions. Published studies have examined the binding of BLIP to diverse target beta-lactamases (e.g., TEM-1, SME-1, and SHV-1). However, apart from point mutations of amino acid residues, variability on the inhibitor side of this enzyme-inhibitor interface has remained unexplored. Thus, we present crystal structures of two likely BLIP relatives: (1) BLIP-I (solved alone and in complex with TEM-1), which has beta-lactamase inhibitory activity very similar to that of BLIP; and (2) beta-lactamase-inhibitory-protein-like protein (BLP) (in two apo forms, including an ultra-high-resolution structure), which is unable to inhibit any tested beta-lactamase. Despite categorical differences in species of origin and function, BLIP-I and BLP share nearly identical backbone conformations, even at loop regions differing in BLIP. We describe interacting residues and provide a comparative structural analysis of the interactions formed at the interface of BLIP-I.TEM-1 versus those formed at the interface of BLIP.TEM-1. Along with initial attempts to functionally characterize BLP, we examine its amino acid residues that structurally correspond to BLIP/BLIP-I binding hotspots to explain its inability to bind and inhibit TEM-1. We conclude that the BLIP family fold is a robust and flexible scaffold that permits the formation of high-affinity protein-protein interactions while remaining highly selective. Comparison of the two naturally occurring, distinct binding interfaces built upon this scaffold (BLIP and BLIP-I) shows that there is substantial variation possible in the subnanomolar binding interaction with TEM-1. The corresponding (non-TEM-1-binding) BLP surface shows that numerous favorable backbone-backbone/backbone-side-chain interactions with a protein partner can be negated by the presence of a few, strongly unfavorable interactions, especially electrostatic repulsions.
Gonadotropin-Releasing Hormone (GnRH) Receptor Structure and GnRH Binding
Flanagan, Colleen A.; Manilall, Ashmeetha
2017-01-01
Gonadotropin-releasing hormone (GnRH) regulates reproduction. The human GnRH receptor lacks a cytoplasmic carboxy-terminal tail but has amino acid sequence motifs characteristic of rhodopsin-like, class A, G protein-coupled receptors (GPCRs). This review will consider how recent descriptions of X-ray crystallographic structures of GPCRs in inactive and active conformations may contribute to understanding GnRH receptor structure, mechanism of activation and ligand binding. The structures confirmed that ligands bind to variable extracellular surfaces, whereas the seven membrane-spanning α-helices convey the activation signal to the cytoplasmic receptor surface, which binds and activates heterotrimeric G proteins. Forty non-covalent interactions that bridge topologically equivalent residues in different transmembrane (TM) helices are conserved in class A GPCR structures, regardless of activation state. Conformation-independent interhelical contacts account for a conserved receptor protein structure and their importance in the GnRH receptor structure is supported by decreased expression of receptors with mutations of residues in the network. Many of the GnRH receptor mutations associated with congenital hypogonadotropic hypogonadism, including the Glu2.53(90) Lys mutation, involve amino acids that constitute the conserved network. Half of the ~250 intramolecular interactions in GPCRs differ between inactive and active structures. Conformation-specific interhelical contacts depend on amino acids changing partners during activation. Conserved inactive conformation-specific contacts prevent receptor activation by stabilizing proximity of TM helices 3 and 6 and a closed G protein-binding site. Mutations of GnRH receptor residues involved in these interactions, such as Arg3.50(139) of the DRY/S motif or Tyr7.53(323) of the N/DPxxY motif, increase or decrease receptor expression and efficiency of receptor coupling to G protein signaling, consistent with the native residues stabilizing the inactive GnRH receptor structure. Active conformation-specific interhelical contacts stabilize an open G protein-binding site. Progress in defining the GnRH-binding site has recently slowed, with evidence that Tyr6.58(290) contacts Tyr5 of GnRH, whereas other residues affect recognition of Trp3 and Gly10NH2. The surprisingly consistent observations that GnRH receptor mutations that disrupt GnRH binding have less effect on “conformationally constrained” GnRH peptides may now be explained by crystal structures of agonist-bound peptide receptors. Analysis of GPCR structures provides insight into GnRH receptor function. PMID:29123501
Bonatto, Ana C; Souza, Emanuel M; Pedrosa, Fábio O; Yates, M Geoffrey; Benelli, Elaine M
2005-01-01
Proteins of the PII family are found in species of all kingdoms. Although these proteins usually share high identity, their functions are specific to the different organisms. Comparison of structural data from Escherichia coli GlnB and GlnK and Herbaspirillum seropedicae GlnB showed that the T-loop and C-terminus were variable regions. To evaluate the role of these regions in signal transduction by the H. seropedicae GlnB protein, four mutants were constructed: Y51F, G108A/P109a, G108W and Q3R/T5A. The activities of the native and mutated proteins were assayed in an E. coli background constitutively expressing the Klebsiella pneumoniae nifLA operon. The results suggested that the T-loop and C-terminus regions of H. seropedicae GlnB are involved in nitrogen signal transduction.
Topologically Associating Domains: An invariant framework or a dynamic scaffold?
Cubeñas-Potts, Caelin; Corces, Victor G
2015-01-01
Metazoan genomes are organized into regions of topologically associating domains (TADs). TADs are demarcated by border elements, which are enriched for active genes and high occupancy architectural protein binding sites. We recently demonstrated that 3D chromatin architecture is dynamic in response to heat shock, a physiological stress that downregulates transcription and causes a global redistribution of architectural proteins. We utilized a quantitative measure of border strength after heat shock, transcriptional inhibition, and architectural protein knockdown to demonstrate that changes in both transcription and architectural protein occupancy contribute to heat shock-induced TAD dynamics. Notably, architectural proteins appear to play a more important role in altering 3D chromatin architecture. Here, we discuss the implications of our findings on previous studies evaluating the dynamics of TAD structure during cellular differentiation. We propose that the subset of variable TADs observed after differentiation are representative of cell-type specific gene expression and are biologically significant.
Somarelli, J A; Mesa, A; Rodriguez, R; Avellan, R; Martinez, L; Zang, Y J; Greidinger, E L; Herrera, R J
2011-03-01
Systemic lupus erythematosus (SLE) and mixed connective tissue disease (MCTD) are autoimmune illnesses characterized by the presence of high titers of autoantibodies directed against a wide range of 'self ' antigens. Proteins of the U1 small nuclear ribonucleoprotein particle (U1 snRNP) are among the most immunogenic molecules in patients with SLE and MCTD. The recent release of a crystallized U1 snRNP provides a unique opportunity to evaluate the effects of tertiary and quaternary structures on autoantigenicity within the U1 snRNP. In the present study, an epitope map was created using the U1 snRNP crystal structure. A total of 15 peptides were tested in a cohort of 68 patients with SLE, 29 with MCTD and 26 healthy individuals and mapped onto the U1 snRNP structure. Antigenic sites were detected in a variety of structures and appear to include RNA binding domains, but mostly exclude regions necessary for protein-protein interactions. These data suggest that while some autoantibodies may target U1 snRNP proteins as monomers or apoptosis-induced, protease-digested fragments, others may recognize epitopes on assembled protein subcomplexes of the U1 snRNP. Although nearly all of the peptides are strong predictors of autoimmune illness, none were successful at distinguishing between SLE and MCTD. The antigenicity of some peptides significantly correlated with several clinical symptoms. This investigation implicitly highlights the complexities of autoimmune epitopes, and autoimmune illnesses in general, and demonstrates the variability of antigens in patient populations, all of which contribute to difficult clinical diagnoses.
NASA Astrophysics Data System (ADS)
Zhou, Mowei; Yan, Jing; Romano, Christine A.; Tebo, Bradley M.; Wysocki, Vicki H.; Paša-Tolić, Ljiljana
2018-01-01
Manganese oxidation is an important biogeochemical process that is largely regulated by bacteria through enzymatic reactions. However, the detailed mechanism is poorly understood due to challenges in isolating and characterizing these unknown enzymes. A manganese oxidase, Mnx, from Bacillus sp. PL-12 has been successfully overexpressed in active form as a protein complex with a molecular mass of 211 kDa. We have recently used surface induced dissociation (SID) and ion mobility-mass spectrometry (IM-MS) to release and detect folded subcomplexes for determining subunit connectivity and quaternary structure. The data from the native mass spectrometry experiments led to a plausible structural model of this multicopper oxidase, which has been difficult to study by conventional structural biology methods. It was also revealed that each Mnx subunit binds a variable number of copper ions. Becasue of the heterogeneity of the protein and limited mass resolution, ambiguities in assigning some of the observed peaks remained as a barrier to fully understanding the role of metals and potential unknown ligands in Mnx. In this study, we performed SID in a modified Fourier transform-ion cyclotron resonance (FTICR) mass spectrometer. The high mass accuracy and resolution offered by FTICR unveiled unexpected artificial modifications on the protein that had been previously thought to be iron bound species based on lower resolution spectra. Additionally, isotopically resolved spectra of the released subcomplexes revealed the metal binding stoichiometry at different structural levels. This method holds great potential for in-depth characterization of metalloproteins and protein-ligand complexes. [Figure not available: see fulltext.
Zhuravlev, Pavel I; Papoian, Garegin A
2010-08-01
Energy landscape theories have provided a common ground for understanding the protein folding problem, which once seemed to be overwhelmingly complicated. At the same time, the native state was found to be an ensemble of interconverting states with frustration playing a more important role compared to the folding problem. The landscape of the folded protein - the native landscape - is glassier than the folding landscape; hence, a general description analogous to the folding theories is difficult to achieve. On the other hand, the native basin phase volume is much smaller, allowing a protein to fully sample its native energy landscape on the biological timescales. Current computational resources may also be used to perform this sampling for smaller proteins, to build a 'topographical map' of the native landscape that can be used for subsequent analysis. Several major approaches to representing this topographical map are highlighted in this review, including the construction of kinetic networks, hierarchical trees and free energy surfaces with subsequent structural and kinetic analyses. In this review, we extensively discuss the important question of choosing proper collective coordinates characterizing functional motions. In many cases, the substates on the native energy landscape, which represent different functional states, can be used to obtain variables that are well suited for building free energy surfaces and analyzing the protein's functional dynamics. Normal mode analysis can provide such variables in cases where functional motions are dictated by the molecule's architecture. Principal component analysis is a more expensive way of inferring the essential variables from the protein's motions, one that requires a long molecular dynamics simulation. Finally, the two popular models for the allosteric switching mechanism, 'preexisting equilibrium' and 'induced fit', are interpreted within the energy landscape paradigm as extreme points of a continuum of transition mechanisms. Some experimental evidence illustrating each of these two models, as well as intermediate mechanisms, is presented and discussed.
Protein Tyrosine Nitration: Biochemical Mechanisms and Structural Basis of its Functional Effects
Radi, Rafael
2012-01-01
CONSPECTUS The nitration of protein tyrosine residues to 3-nitrotyrosine represents an oxidative postranslational modification that unveils the disruption of nitric oxide (•NO) signaling and metabolism towards pro-oxidant processes. Indeed, excess levels of reactive oxygen species in the presence of •NO or •NO-derived metabolites lead to the formation of nitrating species such as peroxynitrite. Thus, protein 3-nitrotyrosine has been established as a biomarker of cell, tissue and systemic “nitroxidative stress”. Moreover, tyrosine nitration modifies key properties of the amino acid (i.e. phenol group pKa, redox potential, hydrophobicity and volume). Thus, the incorporation of a nitro group (−NO2) to protein tyrosines can lead to profound structural and functional changes, some of which contribute to altered cell and tissue homeostasis. In this Account, I describe our current efforts to define 1) biologically-relevant mechanisms of protein tyrosine nitration and 2) how this modification can cause changes in protein structure and function at the molecular level. First, the relevance of protein tyrosine nitration via free radical-mediated reactions (in both peroxynitrite-dependent or independent pathways) involving the intermediacy of tyrosyl radical (Tyr•) will be underscored. This feature of the nitration process becomes critical as Tyr• can take variable fates, including the formation of 3-nitrotyrosine. Fast kinetic techniques, electron paramagnetic resonance (EPR) studies, bioanalytical methods and kinetic simulations have altogether assisted to characterize and fingerprint the reactions of tyrosine with peroxynitrite and one-electron oxidants and its further evolution to 3-nitrotyrosine. Recent findings show that nitration of tyrosines in proteins associated to biomembranes is linked to the lipid peroxidation process via a connecting reaction that involves the one-electron oxidation of tyrosine by lipid peroxyl radicals (LOO•). Second, immunochemical and proteomic-based studies indicate that protein tyrosine nitration is a selective process in vitro and in vivo, preferentially directed to a subset of proteins, and within those proteins, typically one or two tyrosine residues are site-specifically modified. The nature and site(s) of formation of the proximal oxidizing/nitrating species, the physico-chemical characteristics of the local microenvironment and also structural features of the protein account for part of this selectivity. Then, how this relatively subtle chemical modification in one tyrosine residue can sometimes cause dramatic changes in protein activity has remained elusive. Herein, I will analyze recent structural biology data of two pure and homogenously nitrated mitochondrial proteins (i.e. cytochrome c and MnSOD) to illustrate regio-selectivity and structural effects of tyrosine nitration, and subsequent impact in protein loss- or even gain-of-function. PMID:23157446
Protein lipoxidation: Detection strategies and challenges
Aldini, Giancarlo; Domingues, M. Rosário; Spickett, Corinne M.; Domingues, Pedro; Altomare, Alessandra; Sánchez-Gómez, Francisco J.; Oeste, Clara L.; Pérez-Sala, Dolores
2015-01-01
Enzymatic and non-enzymatic lipid metabolism can give rise to reactive species that may covalently modify cellular or plasma proteins through a process known as lipoxidation. Under basal conditions, protein lipoxidation can contribute to normal cell homeostasis and participate in signaling or adaptive mechanisms, as exemplified by lipoxidation of Ras proteins or of the cytoskeletal protein vimentin, both of which behave as sensors of electrophilic species. Nevertheless, increased lipoxidation under pathological conditions may lead to deleterious effects on protein structure or aggregation. This can result in impaired degradation and accumulation of abnormally folded proteins contributing to pathophysiology, as may occur in neurodegenerative diseases. Identification of the protein targets of lipoxidation and its functional consequences under pathophysiological situations can unveil the modification patterns associated with the various outcomes, as well as preventive strategies or potential therapeutic targets. Given the wide structural variability of lipid moieties involved in lipoxidation, highly sensitive and specific methods for its detection are required. Derivatization of reactive carbonyl species is instrumental in the detection of adducts retaining carbonyl groups. In addition, use of tagged derivatives of electrophilic lipids enables enrichment of lipoxidized proteins or peptides. Ultimate confirmation of lipoxidation requires high resolution mass spectrometry approaches to unequivocally identify the adduct and the targeted residue. Moreover, rigorous validation of the targets identified and assessment of the functional consequences of these modifications are essential. Here we present an update on methods to approach the complex field of lipoxidation along with validation strategies and functional assays illustrated with well-studied lipoxidation targets. PMID:26072467
Effects of konjac glucomannan on heat-induced changes of wheat gluten structure.
Wang, Yu; Chen, Yiheng; Zhou, Yun; Nirasawa, Satoru; Tatsumi, Eizo; Li, Xiuting; Cheng, Yongqiang
2017-08-15
Effects of konjac glucomannan on the structure of wheat gluten were investigated at variable temperatures in this study. Dynamic oscillatory rheology study showed that konjac glucomannan conferred stiffness on gluten with a higher tan δ data at 25°C and 55°C, but this parameter was lower at 75°C and 95°C. Konjac glucomannan decreased the content of thiol equivalent groups and increased the α-helix/β-sheet content ratio, respectively. The thicker layer of gluten protein with 5% konjac glucomannan was observed by scanning electron microscopy. This study revealed that konjac glucomannan could alter the conformations of gluten proteins upon heating via non-covalent interactions and physical entanglements. It is likely that konjac glucomannan promotes protein aggregation by strengthening hydrophobic interaction at 25°C and 55°C, and alleviates heat-induced denaturation by decreasing the flexibility of polypeptide chain at higher 75°C and 95°C. Copyright © 2017 Elsevier Ltd. All rights reserved.
Moudjou, Mohammed; Chapuis, Jérôme; Mekrouti, Mériem; Reine, Fabienne; Herzog, Laetitia; Sibille, Pierre; Laude, Hubert; Vilette, Didier; Andréoletti, Olivier; Rezaei, Human; Dron, Michel; Béringue, Vincent
2016-01-01
Prions are formed of misfolded assemblies (PrPSc) of the variably N-glycosylated cellular prion protein (PrPC). In infected species, prions replicate by seeding the conversion and polymerization of host PrPC. Distinct prion strains can be recognized, exhibiting defined PrPSc biochemical properties such as the glycotype and specific biological traits. While strain information is encoded within the conformation of PrPSc assemblies, the storage of the structural information and the molecular requirements for self-perpetuation remain uncertain. Here, we investigated the specific role of PrPC glycosylation status. First, we developed an efficient protein misfolding cyclic amplification method using cells expressing the PrPC species of interest as substrate. Applying the technique to PrPC glycosylation mutants expressing cells revealed that neither PrPC nor PrPSc glycoform stoichiometry was instrumental to PrPSc formation and strainness perpetuation. Our study supports the view that strain properties, including PrPSc glycotype are enciphered within PrPSc structural backbone, not in the attached glycans. PMID:27384922
Sagane, Yoshimasa; Hosp, Julia; Zech, Karin; Thompson, Eric M
2011-05-01
Oriented cellulose deposition is critical to plant patterning and models suggest microtubules constrain cellulose synthase movements through the plasma membrane. Though widespread in plants, urochordates are the only animals that synthesize cellulose. We characterized the distinctive cellulose microfibril scaffold of the larvacean house and its interaction with house structural proteins (oikosins). Targeted disruption of cytoskeletal elements, secretory pathways, and plasma membrane organization, suggested a working model for templating extracellular cellulose microfibrils from animal cells that shows both convergence and differences to plant models. Specialized cortical F-actin arrays template microfibril orientation and glycosylphosphatidylinositol-anchored proteins in lipid rafts may act as scaffolding proteins in microfibril elongation. Microtubules deliver and maintain cellulose synthase complexes to specific cell membrane sites rather than orienting their movement through the membrane. Oikosins are incorporated into house compartments directly above their corresponding cellular field of expression and interact with the cellulose scaffold to a variable extent.
Molecular Effects of Concentrated Solutes on Protein Hydration, Dynamics, and Electrostatics.
Abriata, Luciano A; Spiga, Enrico; Peraro, Matteo Dal
2016-08-23
Most studies of protein structure and function are performed in dilute conditions, but proteins typically experience high solute concentrations in their physiological scenarios and biotechnological applications. High solute concentrations have well-known effects on coarse protein traits like stability, diffusion, and shape, but likely also perturb other traits through finer effects pertinent at the residue and atomic levels. Here, NMR and molecular dynamics investigations on ubiquitin disclose variable interactions with concentrated solutes that lead to localized perturbations of the protein's surface, hydration, electrostatics, and dynamics, all dependent on solute size and chemical properties. Most strikingly, small polar uncharged molecules are sticky on the protein surface, whereas charged small molecules are not, but the latter still perturb the internal protein electrostatics as they diffuse nearby. Meanwhile, interactions with macromolecular crowders are favored mainly through hydrophobic, but not through polar, surface patches. All the tested small solutes strongly slow down water exchange at the protein surface, whereas macromolecular crowders do not exert such strong perturbation. Finally, molecular dynamics simulations predict that unspecific interactions slow down microsecond- to millisecond-timescale protein dynamics despite having only mild effects on pico- to nanosecond fluctuations as corroborated by NMR. We discuss our results in the light of recent advances in understanding proteins inside living cells, focusing on the physical chemistry of quinary structure and cellular organization, and we reinforce the idea that proteins should be studied in native-like media to achieve a faithful description of their function. Copyright © 2016 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Regvar, Marjana; Eichert, Diane; Kaulich, Burkhard; Gianoncelli, Alessandra; Pongrac, Paula; Vogel-Mikuš, Katarina; Kreft, Ivan
2011-01-01
Mature developed seeds are physiologically and biochemically committed to store nutrients, principally as starch, protein, oils, and minerals. The composition and distribution of elements inside the aleurone cell layer reflect their biogenesis, structural characteristics, and physiological functions. It is therefore of primary importance to understand the mechanisms underlying metal ion accumulation, distribution, storage, and bioavailability in aleurone subcellular organelles for seed fortification purposes. Synchrotron radiation soft X-ray full-field imaging mode (FFIM) and low-energy X-ray fluorescence (LEXRF) spectromicroscopy were applied to characterize major structural features and the subcellular distribution of physiologically important elements (Zn, Fe, Na, Mg, Al, Si, and P). These direct imaging methods reveal the accumulation patterns between the apoplast and symplast, and highlight the importance of globoids with phytic acid mineral salts and walls as preferential storage structures. C, N, and O chemical topographies are directly linked to the structural backbone of plant substructures. Zn, Fe, Na, Mg, Al, and P were linked to globoid structures within protein storage vacuoles with variable levels of co-localization. Si distribution was atypical, being contained in the aleurone apoplast and symplast, supporting a physiological role for Si in addition to its structural function. These results reveal that the immobilization of metals within the observed endomembrane structures presents a structural and functional barrier and affects bioavailability. The combination of high spatial and chemical X-ray microscopy techniques highlights how in situ analysis can yield new insights into the complexity of the wheat aleurone layer, whose precise biochemical composition, morphology, and structural characteristics are still not unequivocally resolved. PMID:21447756
Modeling the SHG activities of diverse protein crystals
DOE Office of Scientific and Technical Information (OSTI.GOV)
Haupert, Levi M.; DeWalt, Emma L.; Simpson, Garth J.
2012-10-18
A symmetry-additiveab initiomodel for second-harmonic generation (SHG) activity of protein crystals was applied to assess the likely protein-crystal coverage of SHG microscopy. Calculations were performed for 250 proteins in nine point-group symmetries: a total of 2250 crystals. The model suggests that the crystal symmetry and the limit of detection of the instrument are expected to be the strongest predictors of coverage of the factors considered, which also included secondary-structural content and protein size. Much of the diversity in SHG activity is expected to arise primarily from the variability in the intrinsic protein response as well as the orientation within themore » crystal lattice. Two or more orders-of-magnitude variation in intensity are expected even within protein crystals of the same symmetry. SHG measurements of tetragonal lysozyme crystals confirmed detection, from which a protein coverage of ~84% was estimated based on the proportion of proteins calculated to produce SHG responses greater than that of tetragonal lysozyme. Good agreement was observed between the measured and calculated ratios of the SHG intensity from lysozyme in tetragonal and monoclinic lattices.« less
Fusco, Diana; Barnum, Timothy J.; Bruno, Andrew E.; Luft, Joseph R.; Snell, Edward H.; Mukherjee, Sayan; Charbonneau, Patrick
2014-01-01
X-ray crystallography is the predominant method for obtaining atomic-scale information about biological macromolecules. Despite the success of the technique, obtaining well diffracting crystals still critically limits going from protein to structure. In practice, the crystallization process proceeds through knowledge-informed empiricism. Better physico-chemical understanding remains elusive because of the large number of variables involved, hence little guidance is available to systematically identify solution conditions that promote crystallization. To help determine relationships between macromolecular properties and their crystallization propensity, we have trained statistical models on samples for 182 proteins supplied by the Northeast Structural Genomics consortium. Gaussian processes, which capture trends beyond the reach of linear statistical models, distinguish between two main physico-chemical mechanisms driving crystallization. One is characterized by low levels of side chain entropy and has been extensively reported in the literature. The other identifies specific electrostatic interactions not previously described in the crystallization context. Because evidence for two distinct mechanisms can be gleaned both from crystal contacts and from solution conditions leading to successful crystallization, the model offers future avenues for optimizing crystallization screens based on partial structural information. The availability of crystallization data coupled with structural outcomes analyzed through state-of-the-art statistical models may thus guide macromolecular crystallization toward a more rational basis. PMID:24988076
Fusco, Diana; Barnum, Timothy J; Bruno, Andrew E; Luft, Joseph R; Snell, Edward H; Mukherjee, Sayan; Charbonneau, Patrick
2014-01-01
X-ray crystallography is the predominant method for obtaining atomic-scale information about biological macromolecules. Despite the success of the technique, obtaining well diffracting crystals still critically limits going from protein to structure. In practice, the crystallization process proceeds through knowledge-informed empiricism. Better physico-chemical understanding remains elusive because of the large number of variables involved, hence little guidance is available to systematically identify solution conditions that promote crystallization. To help determine relationships between macromolecular properties and their crystallization propensity, we have trained statistical models on samples for 182 proteins supplied by the Northeast Structural Genomics consortium. Gaussian processes, which capture trends beyond the reach of linear statistical models, distinguish between two main physico-chemical mechanisms driving crystallization. One is characterized by low levels of side chain entropy and has been extensively reported in the literature. The other identifies specific electrostatic interactions not previously described in the crystallization context. Because evidence for two distinct mechanisms can be gleaned both from crystal contacts and from solution conditions leading to successful crystallization, the model offers future avenues for optimizing crystallization screens based on partial structural information. The availability of crystallization data coupled with structural outcomes analyzed through state-of-the-art statistical models may thus guide macromolecular crystallization toward a more rational basis.
Conformational divergence in the HA-33/HA-17 trimer of serotype C and D botulinum toxin complex.
Sagane, Yoshimasa; Hayashi, Shintaro; Akiyama, Tomonori; Matsumoto, Takashi; Hasegawa, Kimiko; Yamano, Akihito; Suzuki, Tomonori; Niwa, Koichi; Watanabe, Toshihiro; Yajima, Shunsuke
2016-08-05
Clostridium botulinum produces a large toxin complex (L-TC) comprising botulinum neurotoxin associated with auxiliary nontoxic proteins. A complex of 33- and 17-kDa hemagglutinins (an HA-33/HA-17 trimer) enhances L-TC transport across the intestinal epithelial cell layer via binding HA-33 to a sugar on the cell surface. At least two subtypes of serotype C/D HA-33 exhibit differing preferences for the sugars sialic acid and galactose. Here, we compared the three-dimensional structures of the galactose-binding HA-33 and HA-33/HA-17 trimers produced by the C-Yoichi strain. Comparisons of serotype C/D HA-33 sequences reveal a variable region with relatively low sequence similarity across the C. botulinum strains; the variability of this region may influence the manner of sugar-recognition by HA-33. Crystal structures of sialic acid- and galactose-binding HA-33 are broadly similar in appearance. However, small-angle X-ray scattering revealed distinct solution structures for HA-33/HA-17 trimers. A structural change in the C-terminal variable region of HA-33 might cause a dramatic shift in the conformation and sugar-recognition mode of HA-33/HA-17 trimer. Copyright © 2016 Elsevier Inc. All rights reserved.
Distelhorst, Steven L; Jurkovic, Dominika A; Shi, Jian; Jensen, Grant J; Balish, Mitchell F
2017-06-15
Although mycoplasmas have small genomes, many of them, including the HIV-associated opportunist Mycoplasma penetrans , construct a polar attachment organelle (AO) that is used for both adherence to host cells and gliding motility. However, the irregular phylogenetic distribution of similar structures within the mycoplasmas, as well as compositional and ultrastructural differences among these AOs, suggests that AOs have arisen several times through convergent evolution. We investigated the ultrastructure and protein composition of the cytoskeleton-like material of the M. penetrans AO with several forms of microscopy and biochemical analysis, to determine whether the M. penetrans AO was constructed at the molecular level on principles similar to those of other mycoplasmas, such as Mycoplasma pneumoniae and Mycoplasma mobile We found that the M. penetrans AO interior was generally dissimilar from that of other mycoplasmas, in that it exhibited considerable heterogeneity in size and shape, suggesting a gel-like nature. In contrast, several of the 12 potential protein components identified by mass spectrometry of M. penetrans detergent-insoluble proteins shared certain distinctive biochemical characteristics with M. pneumoniae AO proteins, although not with M. mobile proteins. We conclude that convergence between M. penetrans and M. pneumoniae AOs extends to the molecular level, leading to the possibility that the less organized material in both M. pneumoniae and M. penetrans is the substance principally responsible for the organization and function of the AO. IMPORTANCE Mycoplasma penetrans is a bacterium that infects HIV-positive patients and may contribute to the progression of AIDS. It attaches to host cells through a structure called an AO, but it is not clear how it builds this structure. Our research is significant not only because it identifies the novel protein components that make up the material within the AO that give it its structure but also because we find that the M. penetrans AO is organized unlike AOs from other mycoplasmas, suggesting that similar structures have evolved multiple times. From this work, we derive some basic principles by which mycoplasmas, and potentially all organisms, build structures at the subcellular level. Copyright © 2017 American Society for Microbiology.
Day, Ryan; Joo, Hyun; Chavan, Archana; Lennox, Kristin P.; Chen, Ann; Dahl, David B.; Vannucci, Marina; Tsai, Jerry W.
2012-01-01
As an alternative to the common template based protein structure prediction methods based on main-chain position, a novel side-chain centric approach has been developed. Together with a Bayesian loop modeling procedure and a combination scoring function, the Stone Soup algorithm was applied to the CASP9 set of template based modeling targets. Although the method did not generate as large of perturbations to the template structures as necessary, the analysis of the results gives unique insights into the differences in packing between the target structures and their templates. Considerable variation in packing is found between target and template structures even when the structures are close, and this variation is found due to 2 and 3 body packing interactions. Outside the inherent restrictions in packing representation of the PDB, the first steps in correctly defining those regions of variable packing have been mapped primarily to local interactions, as the packing at the secondary and tertiary structure are largely conserved. Of the scoring functions used, a loop scoring function based on water structure exhibited some promise for discrimination. These results present a clear structural path for further development of a side-chain centered approach to template based modeling. PMID:23266765
Day, Ryan; Joo, Hyun; Chavan, Archana C; Lennox, Kristin P; Chen, Y Ann; Dahl, David B; Vannucci, Marina; Tsai, Jerry W
2013-02-01
As an alternative to the common template based protein structure prediction methods based on main-chain position, a novel side-chain centric approach has been developed. Together with a Bayesian loop modeling procedure and a combination scoring function, the Stone Soup algorithm was applied to the CASP9 set of template based modeling targets. Although the method did not generate as large of perturbations to the template structures as necessary, the analysis of the results gives unique insights into the differences in packing between the target structures and their templates. Considerable variation in packing is found between target and template structures even when the structures are close, and this variation is found due to 2 and 3 body packing interactions. Outside the inherent restrictions in packing representation of the PDB, the first steps in correctly defining those regions of variable packing have been mapped primarily to local interactions, as the packing at the secondary and tertiary structure are largely conserved. Of the scoring functions used, a loop scoring function based on water structure exhibited some promise for discrimination. These results present a clear structural path for further development of a side-chain centered approach to template based modeling. Copyright © 2012 Elsevier Ltd. All rights reserved.
SCOWLP classification: Structural comparison and analysis of protein binding regions
Teyra, Joan; Paszkowski-Rogacz, Maciej; Anders, Gerd; Pisabarro, M Teresa
2008-01-01
Background Detailed information about protein interactions is critical for our understanding of the principles governing protein recognition mechanisms. The structures of many proteins have been experimentally determined in complex with different ligands bound either in the same or different binding regions. Thus, the structural interactome requires the development of tools to classify protein binding regions. A proper classification may provide a general view of the regions that a protein uses to bind others and also facilitate a detailed comparative analysis of the interacting information for specific protein binding regions at atomic level. Such classification might be of potential use for deciphering protein interaction networks, understanding protein function, rational engineering and design. Description Protein binding regions (PBRs) might be ideally described as well-defined separated regions that share no interacting residues one another. However, PBRs are often irregular, discontinuous and can share a wide range of interacting residues among them. The criteria to define an individual binding region can be often arbitrary and may differ from other binding regions within a protein family. Therefore, the rational behind protein interface classification should aim to fulfil the requirements of the analysis to be performed. We extract detailed interaction information of protein domains, peptides and interfacial solvent from the SCOWLP database and we classify the PBRs of each domain family. For this purpose, we define a similarity index based on the overlapping of interacting residues mapped in pair-wise structural alignments. We perform our classification with agglomerative hierarchical clustering using the complete-linkage method. Our classification is calculated at different similarity cut-offs to allow flexibility in the analysis of PBRs, feature especially interesting for those protein families with conflictive binding regions. The hierarchical classification of PBRs is implemented into the SCOWLP database and extends the SCOP classification with three additional family sub-levels: Binding Region, Interface and Contacting Domains. SCOWLP contains 9,334 binding regions distributed within 2,561 families. In 65% of the cases we observe families containing more than one binding region. Besides, 22% of the regions are forming complex with more than one different protein family. Conclusion The current SCOWLP classification and its web application represent a framework for the study of protein interfaces and comparative analysis of protein family binding regions. This comparison can be performed at atomic level and allows the user to study interactome conservation and variability. The new SCOWLP classification may be of great utility for reconstruction of protein complexes, understanding protein networks and ligand design. SCOWLP will be updated with every SCOP release. The web application is available at . PMID:18182098
Bohannon, Kevin Patrick; Jun, Yonggun; Gross, Steven P.; Smith, Gregory Allan
2013-01-01
The herpesvirus virion is a multilayered structure consisting of a DNA-filled capsid, tegument, and envelope. Detailed reconstructions of the capsid are possible based on its icosahedral symmetry, but the surrounding tegument and envelope layers lack regular architecture. To circumvent limitations of symmetry-based ultrastructural reconstruction methods, a fluorescence approach was developed using single-particle imaging combined with displacement measurements at nanoscale resolution. An analysis of 11 tegument and envelope proteins defined the composition and plasticity of symmetric and asymmetric elements of the virion architecture. The resulting virion protein map ascribes molecular composition to density profiles previously acquired by traditional ultrastructural methods, and provides a way forward to examine the dynamics of the virion architecture during infection. PMID:23569236
Bohannon, Kevin Patrick; Jun, Yonggun; Gross, Steven P; Smith, Gregory Allan
2013-04-23
The herpesvirus virion is a multilayered structure consisting of a DNA-filled capsid, tegument, and envelope. Detailed reconstructions of the capsid are possible based on its icosahedral symmetry, but the surrounding tegument and envelope layers lack regular architecture. To circumvent limitations of symmetry-based ultrastructural reconstruction methods, a fluorescence approach was developed using single-particle imaging combined with displacement measurements at nanoscale resolution. An analysis of 11 tegument and envelope proteins defined the composition and plasticity of symmetric and asymmetric elements of the virion architecture. The resulting virion protein map ascribes molecular composition to density profiles previously acquired by traditional ultrastructural methods, and provides a way forward to examine the dynamics of the virion architecture during infection.
A simple and fast heuristic for protein structure comparison
Pelta, David A; González, Juan R; Moreno Vega, Marcos
2008-01-01
Background Protein structure comparison is a key problem in bioinformatics. There exist several methods for doing protein comparison, being the solution of the Maximum Contact Map Overlap problem (MAX-CMO) one of the alternatives available. Although this problem may be solved using exact algorithms, researchers require approximate algorithms that obtain good quality solutions using less computational resources than the formers. Results We propose a variable neighborhood search metaheuristic for solving MAX-CMO. We analyze this strategy in two aspects: 1) from an optimization point of view the strategy is tested on two different datasets, obtaining an error of 3.5%(over 2702 pairs) and 1.7% (over 161 pairs) with respect to optimal values; thus leading to high accurate solutions in a simpler and less expensive way than exact algorithms; 2) in terms of protein structure classification, we conduct experiments on three datasets and show that is feasible to detect structural similarities at SCOP's family and CATH's architecture levels using normalized overlap values. Some limitations and the role of normalization are outlined for doing classification at SCOP's fold level. Conclusion We designed, implemented and tested.a new tool for solving MAX-CMO, based on a well-known metaheuristic technique. The good balance between solution's quality and computational effort makes it a valuable tool. Moreover, to the best of our knowledge, this is the first time the MAX-CMO measure is tested at SCOP's fold and CATH's architecture levels with encouraging results. Software is available for download at . PMID:18366735
Heat Management Strategies for Solid-state NMR of Functional Proteins
Fowler, Daniel J.; Harris, Michael J.; Thompson, Lynmarie K.
2012-01-01
Modern solid-state NMR methods can acquire high-resolution protein spectra for structure determination. However, these methods use rapid sample spinning and intense decoupling fields that can heat and denature the protein being studied. Here we present a strategy to avoid destroying valuable samples. We advocate first creating a sacrificial sample, which contains unlabeled protein (or no protein) in buffer conditions similar to the intended sample. This sample is then doped with the chemical shift thermometer Sm2Sn2O7. We introduce a pulse scheme called TCUP (for Temperature Calibration Under Pulseload) that can characterize the heating of this sacrificial sample rapidly, under a variety of experimental conditions, and with high temporal resolution. Sample heating is discussed with respect to different instrumental variables such as spinning speed, decoupling strength and duration, and cooling gas flow rate. The effects of different sample preparation variables are also discussed, including ionic strength, the inclusion of cryoprotectants, and the physical state of the sample (i.e. liquid, solid, or slurry). Lastly, we discuss probe detuning as a measure of sample thawing that does not require retuning the probe or using chemical shift thermometer compounds. Use of detuning tests and chemical shift thermometers with representative sample conditions makes it possible to maximize the efficiency of the NMR experiment while retaining a functional sample. PMID:22868258
Thermal and chemical denaturation of the BRCT functional module of human 53BP1.
Thanassoulas, Angelos; Nomikos, Michail; Theodoridou, Maria; Stavros, Philemon; Mastellos, Dimitris; Nounesis, George
2011-10-01
BRCTs are protein-docking modules involved in eukaryotic DNA repair. They are characterized by low sequence homology with generally well-conserved structure organization. In a considerable number of proteins, a pair of BRCT structural repeats occurs, connected with inter-BRCT linkers, variable in length, sequence and structure. Linkers may separate and control the relative position of BRCT domains as well as protect and stabilize the hydrophobic inter-BRCT interface region. Their vital role in protein function has been demonstrated by recent findings associating missense mutations in the inter-repeat linker region of the BRCT domain of BRCA1 (BRCA1-BRCT) to hereditary breast/ovarian cancer. The interaction of 53BP1 with the core domain of the p53 tumor suppressor involves the C-terminal BRCT repeat as well as the inert-BRCT linker of the tandem BRCT domain of 53BP1 (53BP1-BRCT). High-accuracy differential scanning calorimetry (DSC) and circular dichroism (CD) have been employed to characterize the heat-induced unfolding of 53BP1-BRCT domain. The calorimetric results provide evidence for unfolding to an intermediate, only partly unfolded state, which, based on the CD results, retains the secondary structural characteristics of the native protein. A direct comparison with the corresponding thermal processes for BRAC1-BRCT and BARD1-BRCT provides evidence that the observed behavior is analogous to BRCA1-BRCT even though the two domains differ substantially in the linker structure. Moreover, chemical denaturation experiments of the untagged 53BP1-BRCT and comparison with BRCA1 and BARD1 BRCTs show that no clear association can be drawn between the structural organization of the inter-BRCT linkers and the overall stability of the BRCT domains. Copyright © 2011 Elsevier B.V. All rights reserved.
Zhao, Jun; Zhang, Baohong; Zhu, Jianwei; Nussinov, Ruth; Ma, Buyong
2018-06-01
Amyloid formation and deposition of immunoglobulin light-chain proteins in systemic amyloidosis (AL) cause major organ failures. While the κ light-chain is dominant (λ/κ=1:2) in healthy individuals, λ is highly overrepresented (λ/κ=3:1) in AL patients. The structural basis of the amyloid formation and the sequence preference are unknown. We examined the correlation between sequence and structural stability of dimeric variable domains of immunoglobulin light chains using molecular dynamics simulations of 24 representative dimer interfaces, followed by energy evaluation of conformational ensembles for 20 AL patients' light chain sequences. We identified a stable interface with displaced N-terminal residues, provides the structural basis for AL protein fibrils formation. Proline isomerization may cause the N-terminus to adopt amyloid-prone conformations. We found that λ light-chains prefer misfolded dimer conformation, while κ chain structures are stabilized by a natively folded dimer. Our study may facilitate structure-based small molecule and antibody design to inhibit AL. This article is part of a Special Issue entitled: Accelerating Precision Medicine through Genetic and Genomic Big Data Analysis edited by Yudong Cai & Tao Huang. Copyright © 2017 Elsevier B.V. All rights reserved.
Papaleo, Elena; Mereghetti, Paolo; Fantucci, Piercarlo; Grandori, Rita; De Gioia, Luca
2009-01-01
Several molecular dynamics (MD) simulations were used to sample conformations in the neighborhood of the native structure of holo-myoglobin (holo-Mb), collecting trajectories spanning 0.22 micros at 300 K. Principal component (PCA) and free-energy landscape (FEL) analyses, integrated by cluster analysis, which was performed considering the position and structures of the individual helices of the globin fold, were carried out. The coherence between the different structural clusters and the basins of the FEL, together with the convergence of parameters derived by PCA indicates that an accurate description of the Mb conformational space around the native state was achieved by multiple MD trajectories spanning at least 0.14 micros. The integration of FEL, PCA, and structural clustering was shown to be a very useful approach to gain an overall view of the conformational landscape accessible to a protein and to identify representative protein substates. This method could be also used to investigate the conformational and dynamical properties of Mb apo-, mutant, or delete versions, in which greater conformational variability is expected and, therefore identification of representative substates from the simulations is relevant to disclose structure-function relationship.
Preparation and Characterization of Biofunctionalized Inorganic Substrates.
Dugger, Jason W; Webb, Lauren J
2015-09-29
Integrating the function of biological molecules into traditional inorganic materials and substrates couples biologically relevant function to synthetic devices and generates new materials and capabilities by combining biological and inorganic functions. At this so-called "bio/abio interface," basic biological functions such as ligand binding and catalysis can be co-opted to detect analytes with exceptional sensitivity or to generate useful molecules with chiral specificity under entirely benign reaction conditions. Proteins function in dynamic, complex, and crowded environments (the living cell) and are therefore appropriate for integrating into multistep, multiscale, multimaterial devices such as integrated circuits and heterogeneous catalysts. However, the goal of reproducing the highly specific activities of biomolecules in the perturbed chemical and electrostatic environment at an inorganic interface while maintaining their native conformations is challenging to achieve. Moreover, characterizing protein structure and function at a surface is often difficult, particularly if one wishes to compare the activity of the protein to that of the dilute, aqueous solution phase. Our laboratory has developed a general strategy to address this challenge by taking advantage of the structural and chemical properties of alkanethiol self-assembled monolayers (SAMs) on gold surfaces that are functionalized with covalently tethered peptides. These surface-bound peptides then act as the chemical recognition element for a target protein, generating a biomimetic surface in which protein orientation, structure, density, and function are controlled and variable. Herein we discuss current research and future directions related to generating a chemically tunable biofunctionalization strategy that has potential to successfully incorporate the highly specialized functions of proteins onto inorganic substrates.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Munshi, Parthapratim; Myles, Dean A A; Robertson, Lee
2013-01-01
We report the first high resolution neutron protein structure of perdeuterated rubredoxin from Pyrococcus furiosus (PfRd) determined using the new IMAGINE macromolecular neutron crystallography instrument at the Oak Ridge National Laboratory. Neutron diffraction data extending to 1.65 resolution were collected from a relatively small 0.7 mm3 PfRd crystal using 2.5 days (60 h) of beam time. The refined structure contains 371 out of 391, or 95%, of the deuterium atoms of the protein, and 58 solvent molecules. The IMAGINE instrument is designed to provide neutron data at or near atomic resolutions (1.5 ) from crystals with volume < 1.0 mm3more » and with unit cell edges < 100 . Beam line features include elliptical focusing mirrors that deliver 3x107 n s-1 cm-2 into a 3.5 x 2.0 mm2 focal spot at the sample position, and variable short and long wavelength cutoff optics that provide automated exchange between multiple wavelength configurations ( min=2.0 , 2.8 , 3.3 - max =3.0 , 4.0 , 4.5 , ~20 ). Notably, the crystal used to collect this PfRd data is 5-10 times smaller than has been previously reported.« less
Li, Chuang; Peng, Qiongfang; Wan, Xiao; Sun, Haili; Tang, Jun
2017-10-15
Promyelocytic leukemia protein (PML) nuclear bodies (NBs), which are sub-nuclear protein structures, are involved in a variety of important cellular functions. PML-NBs are assembled by PML isoforms, and contact between small ubiquitin-like modifiers (SUMOs) with the SUMO interaction motif (SIM) are critically involved in this process. PML isoforms contain a common N-terminal region and a variable C-terminus. However, the contribution of the C-terminal regions to PML-NB formation remains poorly defined. Here, using high-resolution microscopy, we show that mutation of the SIM distinctively influences the structure of NBs formed by each individual PML isoform, with that of PML-III and PML-V minimally changed, and PML-I and PML-IV dramatically impaired. We further identify several C-terminal elements that are important in regulating NB structure and provide strong evidence to suggest that the 8b element in PML-IV possesses a strong ability to interact with SUMO-1 and SUMO-2, and critically participates in NB formation. Our findings highlight the importance of PML C-termini in NB assembly and function, and provide molecular insight into the PML-NB assembly of each distinctive isoform. © 2017. Published by The Company of Biologists Ltd.
Jackson, Philip J; Hitchcock, Andrew; Swainsbury, David J K; Qian, Pu; Martin, Elizabeth C; Farmer, David A; Dickman, Mark J; Canniffe, Daniel P; Hunter, C Neil
2018-02-01
The X-ray crystal structure of the Rhodopseudomonas (Rps.) palustris reaction center-light harvesting 1 (RC-LH1) core complex revealed the presence of a sixth protein component, variably referred to in the literature as helix W, subunit W or protein W. The position of this protein prevents closure of the LH1 ring, possibly to allow diffusion of ubiquinone/ubiquinol between the RC and the cytochrome bc 1 complex in analogous fashion to the well-studied PufX protein from Rhodobacter sphaeroides. The identity and function of helix W have remained unknown for over 13years; here we use a combination of biochemistry, mass spectrometry, molecular genetics and electron microscopy to identify this protein as RPA4402 in Rps. palustris CGA009. Protein W shares key conserved sequence features with PufX homologs, and although a deletion mutant was able to grow under photosynthetic conditions with no discernible phenotype, we show that a tagged version of protein W pulls down the RC-LH1 complex. Protein W is not encoded in the photosynthesis gene cluster and our data indicate that only approximately 10% of wild-type Rps. palustris core complexes contain this non-essential subunit; functional and evolutionary consequences of this observation are discussed. The ability to purify uniform RC-LH1 and RC-LH1-protein W preparations will also be beneficial for future structural studies of these bacterial core complexes. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.
2015-01-01
Density is an easily adjusted variable in molecular dynamics (MD) simulations. Thus, pressure-jump (P-jump)-induced protein refolding, if it could be made fast enough, would be ideally suited for comparison with MD. Although pressure denaturation perturbs secondary structure less than temperature denaturation, protein refolding after a fast P-jump is not necessarily faster than that after a temperature jump. Recent P-jump refolding experiments on the helix bundle λ-repressor have shown evidence of a <3 μs burst phase, but also of a ∼1.5 ms “slow” phase of refolding, attributed to non-native helical structure frustrating microsecond refolding. Here we show that a λ-repressor mutant is nonetheless capable of refolding in a single explicit solvent MD trajectory in about 19 μs, indicating that the burst phase observed in experiments on the same mutant could produce native protein. The simulation reveals that after about 18.5 μs of conformational sampling, the productive structural rearrangement to the native state does not occur in a single swift step but is spread out over a brief series of helix and loop rearrangements that take about 0.9 μs. Our results support the molecular time scale inferred for λ-repressor from near-downhill folding experiments, where transition-state population can be seen experimentally, and also agrees with the transition-state transit time observed in slower folding proteins by single-molecule spectroscopy. PMID:24437525
Koo, Yoon; Ka, Donghyun; Kim, Eun-Jin; Suh, Nayoung; Bae, Euiyoung
2013-10-23
Clustered regularly interspaced short palindromic repeats (CRISPRs) and CRISPR-associated (Cas) proteins form an RNA-mediated microbial immune system against invading foreign genetic elements. Cas5 proteins constitute one of the most prevalent Cas protein families in CRISPR-Cas systems and are predicted to have RNA recognition motif (RRM) domains. Cas5d is a subtype I-C-specific Cas5 protein that can be divided into two distinct subgroups, one of which has extra C-terminal residues while the other contains a longer insertion in the middle of its N-terminal RRM domain. Here, we report crystal structures of Cas5d from Streptococcus pyogenes and Xanthomonas oryzae, which respectively represent the two Cas5d subgroups. Despite a common domain architecture consisting of an N-terminal RRM domain and a C-terminal β-sheet domain, the structural differences between the two Cas5d proteins are highlighted by the presence of a unique extended helical region protruding from the N-terminal RRM domain of X. oryzae Cas5d. We also demonstrate that Cas5d proteins possess not only specific endoribonuclease activity for CRISPR RNAs but also nonspecific double-stranded DNA binding affinity. These findings suggest that Cas5d may play multiple roles in CRISPR-mediated immunity. Furthermore, the specific RNA processing was also observed between S. pyogenes Cas5d protein and X. oryzae CRISPR RNA and vice versa. This cross-species activity of Cas5d provides a special opportunity for elucidating conserved features of the CRISPR RNA processing event. Copyright © 2013 Elsevier Ltd. All rights reserved.
Madhurantakam, Chaithanya; Rajakumara, Eerappa; Mazumdar, Pooja Anjali; Saha, Baisakhee; Mitra, Devrani; Wiker, Harald G; Sankaranarayanan, Rajan; Das, Amit Kumar
2005-03-01
The low-molecular-weight protein tyrosine phosphatase (LMWPTPase) belongs to a distinctive class of phosphotyrosine phosphatases widely distributed among prokaryotes and eukaryotes. We report here the crystal structure of LMWPTPase of microbial origin, the first of its kind from Mycobacterium tuberculosis. The structure was determined to be two crystal forms at 1.9- and 2.5-A resolutions. These structural forms are compared with those of the LMWPTPases of eukaryotes. Though the overall structure resembles that of the eukaryotic LMWPTPases, there are significant changes around the active site and the protein tyrosine phosphatase (PTP) loop. The variable loop forming the wall of the crevice leading to the active site is conformationally unchanged from that of mammalian LMWPTPase; however, differences are observed in the residues involved, suggesting that they have a role in influencing different substrate specificities. The single amino acid substitution (Leu12Thr [underlined below]) in the consensus sequence of the PTP loop, CTGNICRS, has a major role in the stabilization of the PTP loop, unlike what occurs in mammalian LMWPTPases. A chloride ion and a glycerol molecule were modeled in the active site where the chloride ion interacts in a manner similar to that of phosphate with the main chain nitrogens of the PTP loop. This structural study, in addition to identifying specific mycobacterial features, may also form the basis for exploring the mechanism of the substrate specificities of bacterial LMWPTPases.
Mills, Jeffrey L; Liu, Gaohua; Skerra, Arne; Szyperski, Thomas
2009-08-11
The NMR structure of the 21 kDa lipocalin FluA, which was previously obtained by combinatorial design, elucidates a reshaped binding site specific for the dye fluorescein resulting from 21 side chain replacements with respect to the parental lipocalin, the naturally occurring bilin-binding protein (BBP). As expected, FluA exhibits the lipocalin fold of BBP, comprising eight antiparallel beta-strands forming a beta-barrel with an alpha-helix attached to its side. Comparison of the NMR structure of free FluA with the X-ray structures of BBP.biliverdin IX(gamma) and FluA.fluorescein complexes revealed significant conformational changes in the binding pocket, which is formed by four loops at the open end of the beta-barrel as well as adjoining beta-strand segments. An "induced fit" became apparent for the side chain conformations of Arg 88 and Phe 99, which contact the bound fluorescein in the complex and undergo concerted rearrangement upon ligand binding. Moreover, slower internal motional modes of the polypeptide backbone were identified by measuring transverse (15)N backbone spin relaxation times in the rotating frame for free FluA and also for the FluA.fluorescein complex. A reduction in the level of such motions was detected upon complex formation, indicating rigidification of the protein structure and loss of conformational entropy. This hypothesis was confirmed by isothermal titration calorimetry, showing that ligand binding is enthalpy-driven, thus overcompensating for the negative entropy associated with both ligand binding per se and rigidification of the protein. Our investigation of the solution structure and dynamics as well as thermodynamics of lipocalin-ligand interaction not only provides insight into the general mechanism of small molecule accommodation in the deep and narrow cavity of this abundant class of proteins but also supports the future design of corresponding binding proteins with novel specificities, so-called "anticalins".
Kumar, Vineet; Dixit, Nitin; Zhou, Liqiang Lisa; Fraunhofer, Wolfgang
2011-12-12
The purpose of this work was to determine the nature of long and short-range forces governing protein aggregation kinetics at low and high concentrations for a monoclonal antibody (IgG1) and a dual-variable-domain immunoglobulin (DVD-Ig). Protein-protein interactions (PPI) were studied under dilute conditions by utilizing the methods of static (B(22)) and dynamic light scattering (k(D)). PPI in solutions containing minimal ionic strengths were characterized to get detailed insights into the impact of ionic strength on aggregation. Microcalorimetry and susceptibility to denature at air-liquid interface were used to assess the tertiary structure and quiescent stability studies were conducted to study aggregation characteristics. Results for IgG1 showed that electrostatic interactions governed protein aggregation kinetics both under dilute and concentrated conditions (i.e., 5 mg/mL and 150 mg/mL). For DVD-Ig molecules, on the other hand, although electrostatic interactions governed protein aggregation under dilute conditions, hydrophobic forces clearly determined the kinetics at high concentrations. This manuscript shows for the first time that short-range hydrophobic interactions can outweigh electrostatic forces and play an important role in determining protein aggregation at high concentrations. Additionally, results show that although higher-order virial coefficients become significant under low ionic strength conditions, removal of added charges may be used to enhance the aggregation stability of dilute protein formulations. Copyright © 2011 Elsevier B.V. All rights reserved.
Fish Allergens at a Glance: Variable Allergenicity of Parvalbumins, the Major Fish Allergens
Kuehn, Annette; Swoboda, Ines; Arumugam, Karthik; Hilger, Christiane; Hentges, François
2014-01-01
Fish is a common trigger of severe, food-allergic reactions. Only a limited number of proteins induce specific IgE-mediated immune reactions. The major fish allergens are the parvalbumins. They are members of the calcium-binding EF-hand protein family characterized by a conserved protein structure. They represent highly cross-reactive allergens for patients with specific IgE to conserved epitopes. These patients might experience clinical reactions with various fish species. On the other hand, some individuals have IgE antibodies directed against unique, species-specific parvalbumin epitopes, and these patients show clinical symptoms only with certain fish species. Furthermore, different parvalbumin isoforms and isoallergens are present in the same fish and might display variable allergenicity. This was shown for salmon homologs, where only a single parvalbumin (beta-1) isoform was identified as allergen in specific patients. In addition to the parvalbumins, several other fish proteins, enolases, aldolases, and fish gelatin, seem to be important allergens. New clinical and molecular insights advanced the knowledge and understanding of fish allergy in the last years. These findings were useful for the advancement of the IgE-based diagnosis and also for the management of fish allergies consisting of advice and treatment of fish-allergic patients. PMID:24795722
How to tackle protein structural data from solution and solid state: An integrated approach.
Carlon, Azzurra; Ravera, Enrico; Andrałojć, Witold; Parigi, Giacomo; Murshudov, Garib N; Luchinat, Claudio
2016-02-01
Long-range NMR restraints, such as diamagnetic residual dipolar couplings and paramagnetic data, can be used to determine 3D structures of macromolecules. They are also used to monitor, and potentially to improve, the accuracy of a macromolecular structure in solution by validating or "correcting" a crystal model. Since crystal structures suffer from crystal packing forces they may not be accurate models for the macromolecular structures in solution. However, the presence of real differences should be tested for by simultaneous refinement of the structure using both crystal and solution NMR data. To achieve this, the program REFMAC5 from CCP4 was modified to allow the simultaneous use of X-ray crystallographic and paramagnetic NMR data and/or diamagnetic residual dipolar couplings. Inconsistencies between crystal structures and solution NMR data, if any, may be due either to structural rearrangements occurring on passing from the solution to solid state, or to a greater degree of conformational heterogeneity in solution with respect to the crystal. In the case of multidomain proteins, paramagnetic restraints can provide the correct mutual orientations and positions of domains in solution, as well as information on the conformational variability experienced by the macromolecule. Copyright © 2016 Elsevier B.V. All rights reserved.
Measuring kinetic drivers of pneumolysin pore structure.
Gilbert, Robert J C; Sonnen, Andreas F-P
2016-05-01
Most membrane attack complex-perforin/cholesterol-dependent cytolysin (MACPF/CDC) proteins are thought to form pores in target membranes by assembling into pre-pore oligomers before undergoing a pre-pore to pore transition. Assembly during pore formation is into both full rings of subunits and incomplete rings (arcs). The balance between arcs and full rings is determined by a mechanism dependent on protein concentration in which arc pores arise due to kinetic trapping of the pre-pore forms by the depletion of free protein subunits during oligomerization. Here we describe the use of a kinetic assay to study pore formation in red blood cells by the MACPF/CDC pneumolysin from Streptococcus pneumoniae. We show that cell lysis displays two kinds of dependence on protein concentration. At lower concentrations, it is dependent on the pre-pore to pore transition of arc oligomers, which we show to be a cooperative process. At higher concentrations, it is dependent on the amount of pneumolysin bound to the membrane and reflects the affinity of the protein for its receptor, cholesterol. A lag occurs before cell lysis begins; this is dependent on oligomerization of pneumolysin. Kinetic dissection of cell lysis by pneumolysin demonstrates the capacity of MACPF/CDCs to generate pore-forming oligomeric structures of variable size with, most likely, different functional roles in biology.
NASA Astrophysics Data System (ADS)
Bhakat, Soumendranath; Åberg, Emil; Söderhjelm, Pär
2018-01-01
Advanced molecular docking methods often aim at capturing the flexibility of the protein upon binding to the ligand. In this study, we investigate whether instead a simple rigid docking method can be applied, if combined with multiple target structures to model the backbone flexibility and molecular dynamics simulations to model the sidechain and ligand flexibility. The methods are tested for the binding of 35 ligands to FXR as part of the first stage of the Drug Design Data Resource (D3R) Grand Challenge 2 blind challenge. The results show that the multiple-target docking protocol performs surprisingly well, with correct poses found for 21 of the ligands. MD simulations started on the docked structures are remarkably stable, but show almost no tendency of refining the structure closer to the experimentally found binding pose. Reconnaissance metadynamics enhances the exploration of new binding poses, but additional collective variables involving the protein are needed to exploit the full potential of the method.
Bhakat, Soumendranath; Åberg, Emil; Söderhjelm, Pär
2018-01-01
Advanced molecular docking methods often aim at capturing the flexibility of the protein upon binding to the ligand. In this study, we investigate whether instead a simple rigid docking method can be applied, if combined with multiple target structures to model the backbone flexibility and molecular dynamics simulations to model the sidechain and ligand flexibility. The methods are tested for the binding of 35 ligands to FXR as part of the first stage of the Drug Design Data Resource (D3R) Grand Challenge 2 blind challenge. The results show that the multiple-target docking protocol performs surprisingly well, with correct poses found for 21 of the ligands. MD simulations started on the docked structures are remarkably stable, but show almost no tendency of refining the structure closer to the experimentally found binding pose. Reconnaissance metadynamics enhances the exploration of new binding poses, but additional collective variables involving the protein are needed to exploit the full potential of the method.
Structure of the protein phosphatase 2A holoenzyme.
Xu, Yanhui; Xing, Yongna; Chen, Yu; Chao, Yang; Lin, Zheng; Fan, Eugene; Yu, Jong W; Strack, Stefan; Jeffrey, Philip D; Shi, Yigong
2006-12-15
Protein Phosphatase 2A (PP2A) plays an essential role in many aspects of cellular physiology. The PP2A holoenzyme consists of a heterodimeric core enzyme, which comprises a scaffolding subunit and a catalytic subunit, and a variable regulatory subunit. Here we report the crystal structure of the heterotrimeric PP2A holoenzyme involving the regulatory subunit B'/B56/PR61. Surprisingly, the B'/PR61 subunit has a HEAT-like (huntingtin-elongation-A subunit-TOR-like) repeat structure, similar to that of the scaffolding subunit. The regulatory B'/B56/PR61 subunit simultaneously interacts with the catalytic subunit as well as the conserved ridge of the scaffolding subunit. The carboxyterminus of the catalytic subunit recognizes a surface groove at the interface between the B'/B56/PR61 subunit and the scaffolding subunit. Compared to the scaffolding subunit in the PP2A core enzyme, formation of the holoenzyme forces the scaffolding subunit to undergo pronounced conformational rearrangements. This structure reveals significant ramifications for understanding the function and regulation of PP2A.
Kawamoto, Akihiro; Matsuo, Lisa; Kato, Takayuki; Yamamoto, Hiroki
2016-01-01
ABSTRACT Mycoplasma pneumoniae, a pathogenic bacterium, glides on host surfaces using a unique mechanism. It forms an attachment organelle at a cell pole as a protrusion comprised of knoblike surface structures and an internal core. Here, we analyzed the three-dimensional structure of the organelle in detail by electron cryotomography. On the surface, knoblike particles formed a two-dimensional array, albeit with limited regularity. Analyses using a nonbinding mutant and an antibody showed that the knoblike particles correspond to a naplike structure that has been observed by negative-staining electron microscopy and is likely to be formed as a complex of P1 adhesin, the key protein for binding and gliding. The paired thin and thick plates feature a rigid hexagonal lattice and striations with highly variable repeat distances, respectively. The combination of variable and invariant structures in the internal core and the P1 adhesin array on the surface suggest a model in which axial extension and compression of the thick plate along a rigid thin plate is coupled with attachment to and detachment from the substrate during gliding. PMID:27073090
Variability of Protein Structure Models from Electron Microscopy.
Monroe, Lyman; Terashi, Genki; Kihara, Daisuke
2017-04-04
An increasing number of biomolecular structures are solved by electron microscopy (EM). However, the quality of structure models determined from EM maps vary substantially. To understand to what extent structure models are supported by information embedded in EM maps, we used two computational structure refinement methods to examine how much structures can be refined using a dataset of 49 maps with accompanying structure models. The extent of structure modification as well as the disagreement between refinement models produced by the two computational methods scaled inversely with the global and the local map resolutions. A general quantitative estimation of deviations of structures for particular map resolutions are provided. Our results indicate that the observed discrepancy between the deposited map and the refined models is due to the lack of structural information present in EM maps and thus these annotations must be used with caution for further applications. Copyright © 2017 Elsevier Ltd. All rights reserved.
Pomel, Sébastien; Diogon, Marie; Bouchard, Philippe; Pradel, Lydie; Ravet, Viviane; Coffe, Gérard; Viguès, Bernard
2006-02-01
Previous attempts to identify the membrane skeleton of Paramecium cells have revealed a protein pattern that is both complex and specific. The most prominent structural elements, epiplasmic scales, are centered around ciliary units and are closely apposed to the cytoplasmic side of the inner alveolar membrane. We sought to characterize epiplasmic scale proteins (epiplasmins) at the molecular level. PCR approaches enabled the cloning and sequencing of two closely related genes by amplifications of sequences from a macronuclear genomic library. Using these two genes (EPI-1 and EPI-2), we have contributed to the annotation of the Paramecium tetraurelia macronuclear genome and identified 39 additional (paralogous) sequences. Two orthologous sequences were found in the Tetrahymena thermophila genome. Structural analysis of the 43 sequences indicates that the hallmark of this new multigenic family is a 79 aa domain flanked by two Q-, P- and V-rich stretches of sequence that are much more variable in amino-acid composition. Such features clearly distinguish members of the multigenic family from epiplasmic proteins previously sequenced in other ciliates. The expression of Green Fluorescent Protein (GFP)-tagged epiplasmin showed significant labeling of epiplasmic scales as well as oral structures. We expect that the GFP construct described herein will prove to be a useful tool for comparative subcellular localization of different putative epiplasmins in Paramecium.
Crystal Structure of the 25 kDa Subunit of Human Cleavage Factor I{m}
DOE Office of Scientific and Technical Information (OSTI.GOV)
Coseno,M.; Martin, G.; Berger, C.
Cleavage factor Im is an essential component of the pre-messenger RNA 3'-end processing machinery in higher eukaryotes, participating in both the polyadenylation and cleavage steps. Cleavage factor Im is an oligomer composed of a small 25 kDa subunit (CF Im25) and a variable larger subunit of either 59, 68 or 72 kDa. The small subunit also interacts with RNA, poly(A) polymerase, and the nuclear poly(A)-binding protein. These protein-protein interactions are thought to be facilitated by the Nudix domain of CF Im25, a hydrolase motif with a characteristic {alpha}/{beta}/{alpha} fold and a conserved catalytic sequence or Nudix box. We present heremore » the crystal structures of human CF Im25 in its free and diadenosine tetraphosphate (Ap4A) bound forms at 1.85 and 1.80 Angstroms, respectively. CF Im25 crystallizes as a dimer and presents the classical Nudix fold. Results from crystallographic and biochemical experiments suggest that CF Im25 makes use of its Nudix fold to bind but not hydrolyze ATP and Ap4A. The complex and apo protein structures provide insight into the active oligomeric state of CF Im and suggest a possible role of nucleotide binding in either the polyadenylation and/or cleavage steps of pre-messenger RNA 3'-end processing.« less
Matsushima, Hiroyuki; Mukai, Kouichiro; Obara, Yoshitaka; Yoshida, Shinichiro; Clark, John I
2004-01-01
To analyze selected lens cytoskeletal proteins in posterior capsule opacification (PCO) 2 weeks after intraocular lens (IOL) implantation in rabbits. Department of Ophthalmology, Dokkyo University School of Medicine, Tochigi, Japan. Eight 10-week-old albino rabbits were prepared and anesthetized for phacoemulsification and aspiration of the crystalline lens and implantation of an acrylic or a hydrogel IOL. Two weeks postoperatively, the rabbits were killed and the IOLs removed for immunohistochemistry. Deparaffinized tissue sections were processed with antibodies against alpha-smooth muscle actin (alpha-SMA) and beta-crystallin to observe the types of PCO with the 2 IOL types. The proteins in the PCO tissue and the normal lens were homogenized, centrifuged, and analyzed using SDS-polyacrylamide gel electrophoresis (SDS-PAGE) densitometric analysis and Western immunoblotting for actin and vimentin. Immunohistochemistry demonstrated a fibroblastic cell type expressing alpha-SMA and partial regeneration of epithelial cells, resulting in a lenticular structure that stained irregularly for beta-crystallin. The immunoreactivity of fibroblast-like cells to beta-crystallin appeared weaker than that of the regenerated lenticular structure. SDS-PAGE showed variability in the content of cytoskeletal proteins in the insoluble fractions of the PCO. Degradation of the cytoskeletal components was greater with the acrylic IOL than with the hydrogel IOL. Cytoskeletal proteins expressed during the formation of PCO and IOL implantation may have potential as therapeutic target proteins to improve the biocompatibility of IOLs.
Understanding the mechanisms of protein-DNA interactions
NASA Astrophysics Data System (ADS)
Lavery, Richard
2004-03-01
Structural, biochemical and thermodynamic data on protein-DNA interactions show that specific recognition cannot be reduced to a simple set of binary interactions between the partners (such as hydrogen bonds, ion pairs or steric contacts). The mechanical properties of the partners also play a role and, in the case of DNA, variations in both conformation and flexibility as a function of base sequence can be a significant factor in guiding a protein to the correct binding site. All-atom molecular modeling offers a means of analyzing the role of different binding mechanisms within protein-DNA complexes of known structure. This however requires estimating the binding strengths for the full range of sequences with which a given protein can interact. Since this number grows exponentially with the length of the binding site it is necessary to find a method to accelerate the calculations. We have achieved this by using a multi-copy approach (ADAPT) which allows us to build a DNA fragment with a variable base sequence. The results obtained with this method correlate well with experimental consensus binding sequences. They enable us to show that indirect recognition mechanisms involving the sequence dependent properties of DNA play a significant role in many complexes. This approach also offers a means of predicting protein binding sites on the basis of binding energies, which is complementary to conventional lexical techniques.
Emerging facets of prokaryotic glycosylation
Schäffer, Christina; Messner, Paul
2017-01-01
Glycosylation of proteins is one of the most prevalent post-translational modifications occurring in nature, with a wide repertoire of biological implications. Pathways for the main types of this modification, the N- and O-glycosylation, can be found in all three domains of life—the Eukarya, Bacteria and Archaea—thereby following common principles, which are valid also for lipopolysaccharides, lipooligosaccharides and glycopolymers. Thus, studies on any glycoconjugate can unravel novel facets of the still incompletely understood fundamentals of protein N- and O-glycosylation. While it is estimated that more than two-thirds of all eukaryotic proteins would be glycosylated, no such estimate is available for prokaryotic glycoproteins, whose understanding is lagging behind, mainly due to the enormous variability of their glycan structures and variations in the underlying glycosylation processes. Combining glycan structural information with bioinformatic, genetic, biochemical and enzymatic data has opened up an avenue for in-depth analyses of glycosylation processes as a basis for glycoengineering endeavours. Here, the common themes of glycosylation are conceptualised for the major classes of prokaryotic (i.e. bacterial and archaeal) glycoconjugates, with a special focus on glycosylated cell-surface proteins. We describe the current knowledge of biosynthesis and importance of these glycoconjugates in selected pathogenic and beneficial microbes. PMID:27566466
Convergent mechanisms favor fast amyloid formation in two lambda 6a Ig light chain mutants.
Valdés-García, Gilberto; Millán-Pacheco, César; Pastor, Nina
2017-08-01
Extracellular deposition as amyloids of immunoglobulin light chains causes light chain amyloidosis. Among the light chain families, lambda 6a is one of the most frequent in light chain amyloidosis patients. Its germline protein, 6aJL2, and point mutants, R24G and P7S, are good models to study fibrillogenesis, because their stability and fibril formation characteristics have been described. Both mutations make the germline protein unstable and speed up its ability to aggregate. To date, there is no molecular mechanism that explains how these differences in amyloidogenesis can arise from a single mutation. To look into the structural and dynamical differences in the native state of these proteins, we carried out molecular dynamics simulations at room temperature. Despite the structural similarity of the germline protein and the mutants, we found differences in their dynamical signatures that explain the mutants' increased tendency to form amyloids. The contact network alterations caused by the mutations, though different, converge in affecting two anti-aggregation motifs present in light chain variable domains, suggesting a different starting point for aggregation in lambda chains compared to kappa chains. © 2017 Wiley Periodicals, Inc.
Naturally Occurring Structural Isomers in Serum IgA1 O-Glycosylation
Takahashi, Kazuo; Smith, Archer D.; Poulsen, Knud; Kilian, Mogens; Julian, Bruce A.; Mestecky, Jiri; Novak, Jan; Renfrow, Matthew B.
2013-01-01
IgA is the most abundantly produced antibody and plays an important role in the mucosal immune system. Human IgA is represented by two isotypes, IgA1 and IgA2. The major structural difference between these two subclasses is the presence of nine potential sites of O-glycosylation in the hinge region between the first and second constant region domains of the heavy chain. Thr225, Thr228, Ser230, Ser232 and Thr236 have been identified as the predominant sites of O-glycan attachment. The range and distribution of O-glycan chains at each site within the context of adjacent sites in this clustered region create a complex heterogeneity of surface epitopes that is incompletely defined. We previously described the analysis of IgA1 O-glycan heterogeneity by use of high resolution LC/MS and electron capture dissociation tandem MS to unambiguously localize all amino acid attachment sites in IgA1 (Ale) myeloma protein. Here, we report the identification and elucidation of IgA1 O-glycopeptide structural isomers that occur based on amino acid position of the attached glycans (positional isomers) and the structure of the O-glycan chains at individual sites (glycan isomers). These isomers are present in a model IgA1 (Mce1) myeloma protein and occur naturally in normal human serum IgA1. Variable O-glycan chains attached to Ser230, Thr233 or Thr236 produce the predominant positional isomers, including O-glycans composed of a single GalNAc residue. These findings represent the first definitive identification of structural isomeric IgA1 O-glycoforms, define the single-site heterogeneity for all O-glycan sites in a single sample, and have implications for defining epitopes based on clustered O-glycan variability. PMID:22067045
Cryo-Electron Tomography of Marburg Virus Particles and Their Morphogenesis within Infected Cells
Kolesnikova, Larissa; Welsch, Sonja; Krähling, Verena; Davey, Norman; Parsy, Marie-Laure; Becker, Stephan; Briggs, John A. G.
2011-01-01
Several major human pathogens, including the filoviruses, paramyxoviruses, and rhabdoviruses, package their single-stranded RNA genomes within helical nucleocapsids, which bud through the plasma membrane of the infected cell to release enveloped virions. The virions are often heterogeneous in shape, which makes it difficult to study their structure and assembly mechanisms. We have applied cryo-electron tomography and sub-tomogram averaging methods to derive structures of Marburg virus, a highly pathogenic filovirus, both after release and during assembly within infected cells. The data demonstrate the potential of cryo-electron tomography methods to derive detailed structural information for intermediate steps in biological pathways within intact cells. We describe the location and arrangement of the viral proteins within the virion. We show that the N-terminal domain of the nucleoprotein contains the minimal assembly determinants for a helical nucleocapsid with variable number of proteins per turn. Lobes protruding from alternate interfaces between each nucleoprotein are formed by the C-terminal domain of the nucleoprotein, together with viral proteins VP24 and VP35. Each nucleoprotein packages six RNA bases. The nucleocapsid interacts in an unusual, flexible “Velcro-like” manner with the viral matrix protein VP40. Determination of the structures of assembly intermediates showed that the nucleocapsid has a defined orientation during transport and budding. Together the data show striking architectural homology between the nucleocapsid helix of rhabdoviruses and filoviruses, but unexpected, fundamental differences in the mechanisms by which the nucleocapsids are then assembled together with matrix proteins and initiate membrane envelopment to release infectious virions, suggesting that the viruses have evolved different solutions to these conserved assembly steps. PMID:22110401
Biosurfactants and surfactants interacting with membranes and proteins: Same but different?
Otzen, Daniel E
2017-04-01
Biosurfactants (BS) are surface-active molecules produced by microorganisms. For several decades they have attracted interest as promising alternatives to current petroleum-based surfactants. Aside from their green profile, they have remarkably low critical micelle concentrations, reduce the air/water surface tension to very low levels and are excellent emulsifiers, all of which make them comparable or superior to their synthetic counterparts. These remarkable physical properties derive from their more complex chemical structures in which hydrophilic and hydrophobic regions are not as clearly separated as chemical surfactants but have a more mosaic distribution of polarity as well as branched or circular structures. This allows the lipopeptide surfactin to adopt spherical structures to facilitate dense packing at interfaces. They are also more complex. Glycolipid BS, e.g. rhamnolipids (RL) and sophorolipids, are produced biologically as mixtures which vary in the size and saturation of the hydrophobic region as well as modifications in the hydrophilic headgroup, such as the number of sugar groups and different levels of acetylation, leading to variable surface-active properties. Their amphiphilicity allows RL to insert easily into membranes at sub-cmc concentrations to modulate membrane structure and extract lipopolysaccharides, leading to extensive biofilm remodeling in vivo, sometimes in collaboration with hydrophobic RL precursors. Thanks to their mosaicity, even anionic BS like RL only bind weakly to proteins and show much lower denaturing potency, even supporting membrane protein refolding. Nevertheless, they can promote protein degradation by proteases e.g. by neutralizing positive charges, which together with their biofilm-combating properties makes them very promising detergent surfactants. This article is part of a Special Issue entitled: Lipid order/lipid defects and lipid-control of protein activity edited by Dirk Schneider. Copyright © 2016 Elsevier B.V. All rights reserved.
Zhang, Yue-Ling; Peng, Bo; Li, Hui; Yan, Fang; Wu, Hong-Kai; Zhao, Xian-Liang; Lin, Xiang-Min; Min, Shao-Ying; Gao, Yuan-Yuan; Wang, San-Ying; Li, Yuan-You; Peng, Xuan-Xian
2017-01-01
Invertebrates rely heavily on immune-like molecules with highly diversified variability so as to counteract infections. However, the mechanisms and the relationship between this variability and functionalities are not well understood. Here, we showed that the C-terminal domain of hemocyanin (HMC) from shrimp Litopenaeus vannamei contained an evolutionary conserved domain with highly variable genetic sequence, which is structurally homologous to immunoglobulin (Ig). This domain is responsible for recognizing and binding to bacteria or red blood cells, initiating agglutination and hemolysis. Furthermore, when HMC is separated into three fractions using anti-human IgM, IgG, or IgA, the subpopulation, which reacted with anti-human IgM (HMC-M), showed the most significant antimicrobial activity. The high potency of HMC-M is a consequence of glycosylation, as it contains high abundance of α-d-mannose relative to α-d-glucose and N-acetyl-d-galactosamine. Thus, the removal of these glycans abolished the antimicrobial activity of HMC-M. Our results present a comprehensive investigation of the role of HMC in fighting against infections through genetic variability and epigenetic modification. PMID:28659912
Salt-induced enhancement of antifreeze protein activity: a salting-out effect.
Kristiansen, Erlend; Pedersen, Sindre Andre; Zachariassen, Karl Erik
2008-10-01
Antifreeze proteins are a structurally diverse group of proteins characterized by their unique ability to cause a separation of the melting- and growth-temperatures of ice. These proteins have evolved independently in different kinds of cold-adapted ectothermic animals, including insects and fish, where they protect against lethal freezing of the body fluids. There is a great variability in the capacity of different kinds of antifreeze proteins to evoke the antifreeze effect, but the basis of these differences is not well understood. This study reports on salt-induced enhancement of the antifreeze activity of an antifreeze protein from the longhorn beetle Rhagium inquisitor (L.). The results imply that antifreeze activity is predetermined by a steady-state distribution of the antifreeze protein between the solution and the ice surface region. The observed salt-induced enhancement of the antifreeze activity compares qualitatively and quantitatively with salt-induced lowering of protein solubility. Thus, salts apparently enhance antifreeze activity by evoking a solubility-induced shift in the distribution pattern of the antifreeze proteins in favour of the ice. These results indicate that the solubility of antifreeze proteins in the solution surrounding the ice crystal is a fundamental physiochemical property in relation to their antifreeze potency.
Structural Protein 4.1 in the Nucleus of Human Cells: Dynamic Rearrangements during Cell Division
Krauss, Sharon Wald; Larabell, Carolyn A.; Lockett, Stephen; Gascard, Philippe; Penman, Sheldon; Mohandas, Narla; Chasis, Joel Anne
1997-01-01
Structural protein 4.1, first identified as a crucial 80-kD protein in the mature red cell membrane skeleton, is now known to be a diverse family of protein isoforms generated by complex alternative mRNA splicing, variable usage of translation initiation sites, and posttranslational modification. Protein 4.1 epitopes are detected at multiple intracellular sites in nucleated mammalian cells. We report here investigations of protein 4.1 in the nucleus. Reconstructions of optical sections of human diploid fibroblast nuclei using antibodies specific for 80-kD red cell 4.1 and for 4.1 peptides showed 4.1 immunofluorescent signals were intranuclear and distributed throughout the volume of the nucleus. After sequential extractions of cells in situ, 4.1 epitopes were detected in nuclear matrix both by immunofluorescence light microscopy and resinless section immunoelectron microscopy. Western blot analysis of fibroblast nuclear matrix protein fractions, isolated under identical extraction conditions as those for microscopy, revealed several polypeptide bands reactive to multiple 4.1 antibodies against different domains. Epitope-tagged protein 4.1 was detected in fibroblast nuclei after transient transfections using a construct encoding red cell 80-kD 4.1 fused to an epitope tag. Endogenous protein 4.1 epitopes were detected throughout the cell cycle but underwent dynamic spatial rearrangements during cell division. Protein 4.1 was observed in nucleoplasm and centrosomes at interphase, in the mitotic spindle during mitosis, in perichromatin during telophase, as well as in the midbody during cytokinesis. These results suggest that multiple protein 4.1 isoforms may contribute significantly to nuclear architecture and ultimately to nuclear function. PMID:9128242
Protein-mediated loops in supercoiled DNA create large topological domains
Yan, Yan; Ding, Yue; Leng, Fenfei; Dunlap, David; Finzi, Laura
2018-01-01
Abstract Supercoiling can alter the form and base pairing of the double helix and directly impact protein binding. More indirectly, changes in protein binding and the stress of supercoiling also influence the thermodynamic stability of regulatory, protein-mediated loops and shift the equilibria of fundamental DNA/chromatin transactions. For example, supercoiling affects the hierarchical organization and function of chromatin in topologically associating domains (TADs) in both eukaryotes and bacteria. On the other hand, a protein-mediated loop in DNA can constrain supercoiling within a plectonemic structure. To characterize the extent of constrained supercoiling, 400 bp, lac repressor-secured loops were formed in extensively over- or under-wound DNA under gentle tension in a magnetic tweezer. The protein-mediated loops constrained variable amounts of supercoiling that often exceeded the maximum writhe expected for a 400 bp plectoneme. Loops with such high levels of supercoiling appear to be entangled with flanking domains. Thus, loop-mediating proteins operating on supercoiled substrates can establish topological domains that may coordinate gene regulation and other DNA transactions across spans in the genome that are larger than the separation between the binding sites. PMID:29538766
Evolutionary and biophysical relationships among the papillomavirus E2 proteins.
Blakaj, Dukagjin M; Fernandez-Fuentes, Narcis; Chen, Zigui; Hegde, Rashmi; Fiser, Andras; Burk, Robert D; Brenowitz, Michael
2009-01-01
Infection by human papillomavirus (HPV) may result in clinical conditions ranging from benign warts to invasive cancer. The HPV E2 protein represses oncoprotein transcription and is required for viral replication. HPV E2 binds to palindromic DNA sequences of highly conserved four base pair sequences flanking an identical length variable 'spacer'. E2 proteins directly contact the conserved but not the spacer DNA. Variation in naturally occurring spacer sequences results in differential protein affinity that is dependent on their sensitivity to the spacer DNA's unique conformational and/or dynamic properties. This article explores the biophysical character of this core viral protein with the goal of identifying characteristics that associated with risk of virally caused malignancy. The amino acid sequence, 3d structure and electrostatic features of the E2 protein DNA binding domain are highly conserved; specific interactions with DNA binding sites have also been conserved. In contrast, the E2 protein's transactivation domain does not have extensive surfaces of highly conserved residues. Rather, regions of high conservation are localized to small surface patches. Implications to cancer biology are discussed.
Fluctuation-driven mechanotransduction regulates mitochondrial-network structure and function
NASA Astrophysics Data System (ADS)
Bartolák-Suki, Erzsébet; Imsirovic, Jasmin; Parameswaran, Harikrishnan; Wellman, Tyler J.; Martinez, Nuria; Allen, Philip G.; Frey, Urs; Suki, Béla
2015-10-01
Cells can be exposed to irregular mechanical fluctuations, such as those arising from changes in blood pressure. Here, we report that ATP production, assessed through changes in mitochondrial membrane potential, is downregulated in vascular smooth muscle cells in culture exposed to monotonous stretch cycles when compared with cells exposed to a variable cyclic stretch that incorporates physiological levels of cycle-by-cycle variability in stretch amplitude. Variable stretch enhances ATP production by increasing the expression of ATP synthase’s catalytic domain, cytochrome c oxidase and its tyrosine phosphorylation, mitofusins and PGC-1α. Such a fluctuation-driven mechanotransduction mechanism is mediated by motor proteins and by the enhancement of microtubule-, actin- and mitochondrial-network complexity. We also show that, in aorta rings isolated from rats, monotonous stretch downregulates--whereas variable stretch maintains--physiological vessel-wall contractility through mitochondrial ATP production. Our results have implications for ATP-dependent and mechanosensitive intracellular processes.
The Structure of Neurexin 1[alpha] Reveals Features Promoting a Role as Synaptic Organizer
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chen, Fang; Venugopal, Vandavasi; Murray, Beverly
{alpha}-Neurexins are essential synaptic adhesion molecules implicated in autism spectrum disorder and schizophrenia. The {alpha}-neurexin extracellular domain consists of six LNS domains interspersed by three EGF-like repeats and interacts with many different proteins in the synaptic cleft. To understand how {alpha}-neurexins might function as synaptic organizers, we solved the structure of the neurexin 1{alpha} extracellular domain (n1{alpha}) to 2.65 {angstrom}. The L-shaped molecule can be divided into a flexible repeat I (LNS1-EGF-A-LNS2), a rigid horseshoe-shaped repeat II (LNS3-EGF-B-LNS4) with structural similarity to so-called reelin repeats, and an extended repeat III (LNS5-EGF-B-LNS6) with controlled flexibility. A 2.95 {angstrom} structure of n1{alpha}more » carrying splice insert SS3 in LNS4 reveals that SS3 protrudes as a loop and does not alter the rigid arrangement of repeat II. The global architecture imposed by conserved structural features enables {alpha}-neurexins to recruit and organize proteins in distinct and variable ways, influenced by splicing, thereby promoting synaptic function.« less
A chirality-based metrics for free-energy calculations in biomolecular systems.
Pietropaolo, Adriana; Branduardi, Davide; Bonomi, Massimiliano; Parrinello, Michele
2011-09-01
In this work, we exploit the chirality index introduced in (Pietropaolo et al., Proteins 2008, 70, 667) as an effective descriptor of the secondary structure of proteins to explore their complex free-energy landscape. We use the chirality index as an alternative metrics in the path collective variables (PCVs) framework and we show in the prototypical case of the C-terminal domain of immunoglobulin binding protein GB1 that relevant configurations can be efficiently sampled in combination with well-tempered metadynamics. While the projections of the configurations found onto a variety of different descriptors are fully consistent with previously reported calculations, this approach provides a unifying perspective of the folding mechanism which was not possible using metadynamics with the previous formulation of PCVs. Copyright © 2011 Wiley Periodicals, Inc.
Salt-dependent properties of proteins from extremely halophilic bacteria
NASA Technical Reports Server (NTRS)
Lanyi, J. K.
1974-01-01
Based on information concerning the interaction of salts and macromolecules the literature of the enzymes of halophilic bacteria and their constituents is examined. Although in halophilic systems the salt requirement of enzyme activity is variable the enzymes investigated show a time-dependent inactivation at lower salt concentrations especially in the absence of salt. The studies described show that in some halophilic systems the effect of salt may be restricted to a small region on the protein molecule. The concept of the hydrophobic bond to consider certain solvent-dependent phenomena is introduced. It is shown that some halophilic enzymes are unable to maintain their structure without the involvement of hydrophobic interactions that are usually not supported by water. A table lists indices of hydrophobicity and polarity for various halophilic and nonhalophilic proteins.
NASA Astrophysics Data System (ADS)
Yin, Deshun; Qu, Pengfei
2018-02-01
Protein lateral diffusion is considered anomalous in the plasma membrane. And this diffusion is related to membrane microstructure. In order to better describe the property of protein lateral diffusion and find out the inner relationship between protein lateral diffusion and membrane microstructure, this article applies variable-order fractional mean square displacement (f-MSD) function for characterizing the anomalous diffusion. It is found that the variable order can reflect the evolution of diffusion ability. The results of numerical simulation demonstrate variable-order f-MSD function can predict the tendency of anomalous diffusion during the process of confined diffusion. It is also noted that protein lateral diffusion ability during the processes of confined and hop diffusion can be split into three parts. In addition, the comparative analyses reveal that the variable order is related to the confinement-domain size and microstructure of compartment boundary too.
Engineering of routes to heparin and related polysaccharides.
Bhaskar, Ujjwal; Sterner, Eric; Hickey, Anne Marie; Onishi, Akihiro; Zhang, Fuming; Dordick, Jonathan S; Linhardt, Robert J
2012-01-01
Anticoagulant heparin has been shown to possess important biological functions that vary according to its fine structure. Variability within heparin's structure occurs owing to its biosynthesis and animal tissue-based recovery and adds another dimension to its complex polymeric structure. The structural variations in chain length and sulfation patterns mediate its interaction with many heparin-binding proteins, thereby eliciting complex biological responses. The advent of novel chemical and enzymatic approaches for polysaccharide synthesis coupled with high throughput combinatorial approaches for drug discovery have facilitated an increased effort to understand heparin's structure-activity relationships. An improved understanding would offer potential for new therapeutic development through the engineering of polysaccharides. Such a bioengineering approach requires the amalgamation of several different disciplines, including carbohydrate synthesis, applied enzymology, metabolic engineering, and process biochemistry.
Mapping the Geometric Evolution of Protein Folding Motor.
Jerath, Gaurav; Hazam, Prakash Kishore; Shekhar, Shashi; Ramakrishnan, Vibin
2016-01-01
Polypeptide chain has an invariant main-chain and a variant side-chain sequence. How the side-chain sequence determines fold in terms of its chemical constitution has been scrutinized extensively and verified periodically. However, a focussed investigation on the directive effect of side-chain geometry may provide important insights supplementing existing algorithms in mapping the geometrical evolution of protein chains and its structural preferences. Geometrically, folding of protein structure may be envisaged as the evolution of its geometric variables: ϕ, and ψ dihedral angles of polypeptide main-chain directed by χ1, and χ2 of side chain. In this work, protein molecule is metaphorically modelled as a machine with 4 rotors ϕ, ψ, χ1 and χ2, with its evolution to the functional fold is directed by combinations of its rotor directions. We observe that differential rotor motions lead to different secondary structure formations and the combinatorial pattern is unique and consistent for particular secondary structure type. Further, we found that combination of rotor geometries of each amino acid is unique which partly explains how different amino acid sequence combinations have unique structural evolution and functional adaptation. Quantification of these amino acid rotor preferences, resulted in the generation of 3 substitution matrices, which later on plugged in the BLAST tool, for evaluating their efficiency in aligning sequences. We have employed BLOSUM62 and PAM30 as standard for primary evaluation. Generation of substitution matrices is a logical extension of the conceptual framework we attempted to build during the development of this work. Optimization of matrices following the conventional routines and possible application with biologically relevant data sets are beyond the scope of this manuscript, though it is a part of the larger project design.
Cyclic coordinate descent: A robotics algorithm for protein loop closure.
Canutescu, Adrian A; Dunbrack, Roland L
2003-05-01
In protein structure prediction, it is often the case that a protein segment must be adjusted to connect two fixed segments. This occurs during loop structure prediction in homology modeling as well as in ab initio structure prediction. Several algorithms for this purpose are based on the inverse Jacobian of the distance constraints with respect to dihedral angle degrees of freedom. These algorithms are sometimes unstable and fail to converge. We present an algorithm developed originally for inverse kinematics applications in robotics. In robotics, an end effector in the form of a robot hand must reach for an object in space by altering adjustable joint angles and arm lengths. In loop prediction, dihedral angles must be adjusted to move the C-terminal residue of a segment to superimpose on a fixed anchor residue in the protein structure. The algorithm, referred to as cyclic coordinate descent or CCD, involves adjusting one dihedral angle at a time to minimize the sum of the squared distances between three backbone atoms of the moving C-terminal anchor and the corresponding atoms in the fixed C-terminal anchor. The result is an equation in one variable for the proposed change in each dihedral. The algorithm proceeds iteratively through all of the adjustable dihedral angles from the N-terminal to the C-terminal end of the loop. CCD is suitable as a component of loop prediction methods that generate large numbers of trial structures. It succeeds in closing loops in a large test set 99.79% of the time, and fails occasionally only for short, highly extended loops. It is very fast, closing loops of length 8 in 0.037 sec on average.
Free energy and hidden barriers of the β-sheet structure of prion protein.
Paz, S Alexis; Abrams, Cameron F
2015-10-13
On-the-fly free-energy parametrization is a new collective variable biasing approach akin to metadynamics with one important distinction: rather than acquiring an accelerated distribution via a history-dependent bias potential, sampling on this distribution is achieved from the beginning of the simulation using temperature-accelerated molecular dynamics. In the present work, we compare the performance of both approaches to compute the free-energy profile along a scalar collective variable measuring the H-bond registry of the β-sheet structure of the mouse Prion protein. Both methods agree on the location of the free-energy minimum, but free-energy profiles from well-tempered metadynamics are subject to a much higher degree of statistical noise due to hidden barriers. The sensitivity of metadynamics to hidden barriers is shown to be a consequence of the history dependence of the bias potential, and we detail the nature of these barriers for the prion β-sheet. In contrast, on-the-fly parametrization is much less sensitive to these barriers and thus displays improved convergence behavior relative to that of metadynamics. While hidden barriers are a frequent and central issue in free-energy methods, on-the-fly free-energy parametrization appears to be a robust and preferable method to confront this issue.
The movement of proteins across the insect and tick digestive system.
Jeffers, Laura A; Michael Roe, R
2008-02-01
The movement of intact proteins across the digestive system was shown in a number of different blood-feeding and non-blood-feeding insects in the orders Blattaria, Coleoptera, Diptera, Hemiptera, Lepidoptera, Orthoptera, Neuroptera and Siphonaptera, as well as in two tick families Ixodidae and Argasidae. Protein movement was observed for both normal dietary and xenobiotic proteins, which suggest that the mechanism for transfer is not substrate specific. The number of studies on the mechanism of movement is limited. The research so far suggests that movement can occur by either a transcellular or an intercellular pathway in the ventriculus with most of the research describing the former. Transfer is by continuous diffusion with no evidence of pinocytosis or vesicular transport common in mammalian systems. Proteins can move across the digestive system without modification of their primary or multimeric structure and with retention of their functional characteristics. Accumulation in the hemolymph is the result of the protein degradation rate in the gut and hemolymph and transfer rate across the digestive system and can be highly variable depending on species. Research on the development of delivery systems to enhance protein movement across the insect digestive system is in its infancy. The approaches so far considered with some success include the use of lipophilic-polyethylene glycol (PEG) polymers, the development of fusion proteins with lectins, reduced gut protease activity and the development of amphiphilic peptidic analogs. Additional research on understanding the basic mechanisms of protein delivery across the insect digestive system, the importance of structure activity in this transfer and the development of technology to improve movement across the gut could be highly significant to the future of protein and nucleic acid-based insecticide development as well as traditional chemical insecticidal technologies.
Mutational Analysis of Drosophila Basigin Function in the Visual System
Munro, Michelle; Akkam, Yazan; Curtin, Kathryn D.
2009-01-01
Drosophila basigin is a cell-surface glycoprotein of the Ig superfamily and a member of a protein family that includes mammalian EMMPRIN/CD147/basigin, neuroplastin, and embigin. Our previous work on Drosophila basigin has shown that it is required for normal photoreceptor cell structure and normal neuron-glia interaction in the fly visual system. Specifically, the photoreceptor neurons of mosaic animals that are mutant in the eye for basigin show altered cell structure with nuclei, mitochondria and rER misplaced and variable axon diameter compared to wild-type. In addition, glia cells in the optic lamina that contact photoreceptor axons are misplaced and show altered structure. All these defects are rescued by expression of either transgenic fly basigin or transgenic mouse basigin in the photoreceptors demonstrating that mouse basigin can functionally replace fly basigin. To determine what regions of the basigin protein are required for each of these functions, we have created mutant basigin transgenes coding for proteins that are altered in conserved residues, introduced these into the fly genome, and tested them for their ability to rescue both photoreceptor cell structure defects and neuron-glia interaction defects of basigin. The results suggest that the highly conserved transmembrane domain and the extracellular domains are crucial for basigin function in the visual system while the short intracellular tail may not play a role in these functions. PMID:19782733
Rapid X-ray Photoreduction of Dimetal-Oxygen Cofactors in Ribonucleotide Reductase
Sigfridsson, Kajsa G. V.; Chernev, Petko; Leidel, Nils; Popović-Bijelić, Ana; Gräslund, Astrid; Haumann, Michael
2013-01-01
Prototypic dinuclear metal cofactors with varying metallation constitute a class of O2-activating catalysts in numerous enzymes such as ribonucleotide reductase. Reliable structures are required to unravel the reaction mechanisms. However, protein crystallography data may be compromised by x-ray photoreduction (XRP). We studied XPR of Fe(III)Fe(III) and Mn(III)Fe(III) sites in the R2 subunit of Chlamydia trachomatis ribonucleotide reductase using x-ray absorption spectroscopy. Rapid and biphasic x-ray photoreduction kinetics at 20 and 80 K for both cofactor types suggested sequential formation of (III,II) and (II,II) species and similar redox potentials of iron and manganese sites. Comparing with typical x-ray doses in crystallography implies that (II,II) states are reached in <1 s in such studies. First-sphere metal coordination and metal-metal distances differed after chemical reduction at room temperature and after XPR at cryogenic temperatures, as corroborated by model structures from density functional theory calculations. The inter-metal distances in the XPR-induced (II,II) states, however, are similar to R2 crystal structures. Therefore, crystal data of initially oxidized R2-type proteins mostly contain photoreduced (II,II) cofactors, which deviate from the native structures functional in O2 activation, explaining observed variable metal ligation motifs. This situation may be remedied by novel femtosecond free electron-laser protein crystallography techniques. PMID:23400774
Rapid X-ray photoreduction of dimetal-oxygen cofactors in ribonucleotide reductase.
Sigfridsson, Kajsa G V; Chernev, Petko; Leidel, Nils; Popovic-Bijelic, Ana; Gräslund, Astrid; Haumann, Michael
2013-04-05
Prototypic dinuclear metal cofactors with varying metallation constitute a class of O2-activating catalysts in numerous enzymes such as ribonucleotide reductase. Reliable structures are required to unravel the reaction mechanisms. However, protein crystallography data may be compromised by x-ray photoreduction (XRP). We studied XPR of Fe(III)Fe(III) and Mn(III)Fe(III) sites in the R2 subunit of Chlamydia trachomatis ribonucleotide reductase using x-ray absorption spectroscopy. Rapid and biphasic x-ray photoreduction kinetics at 20 and 80 K for both cofactor types suggested sequential formation of (III,II) and (II,II) species and similar redox potentials of iron and manganese sites. Comparing with typical x-ray doses in crystallography implies that (II,II) states are reached in <1 s in such studies. First-sphere metal coordination and metal-metal distances differed after chemical reduction at room temperature and after XPR at cryogenic temperatures, as corroborated by model structures from density functional theory calculations. The inter-metal distances in the XPR-induced (II,II) states, however, are similar to R2 crystal structures. Therefore, crystal data of initially oxidized R2-type proteins mostly contain photoreduced (II,II) cofactors, which deviate from the native structures functional in O2 activation, explaining observed variable metal ligation motifs. This situation may be remedied by novel femtosecond free electron-laser protein crystallography techniques.
Berjón-Otero, Mónica; Lechuga, Ana; Mehla, Jitender; Uetz, Peter; Salas, Margarita; Redrejo-Rodríguez, Modesto
2017-07-26
Tectiviridae comprises a group of tail-less, icosahedral, membrane-containing bacteriophages that can be divided into two groups by their hosts, either Gram-negative or Gram-positive bacteria. While the first group is composed of PRD1 and nearly identical well characterized lytic viruses, the second one includes more variable temperate phages, like GIL16 or Bam35, whose hosts are Bacillus cereus and related Gram-positive bacteria.In the genome of Bam35, nearly half of the 32 annotated open reading frames (ORFs) have no homologs in databases (ORFans), being putative proteins of unknown function, which hinders the understanding of their biology. With the aim of increasing the knowledge of the viral proteome, we carried out a comprehensive yeast two-hybrid analysis among all the putative proteins encoded by the Bam35 genome. The resulting protein interactome comprises 76 unique interactions among 24 proteins, of which 12 have an unknown function. These results suggested that the P17 protein is the minor capsid protein of Bam35 and P24 is the penton protein, being the latter also supported by iterative threading protein modeling. Moreover, the inner membrane transglycosylase protein P26 could have an additional structural role. We also detected interactions involving non-structural proteins, such as the DNA binding protein P1 and the genome terminal protein (P4), which was confirmed by co-immunoprecipitation of recombinant proteins. Altogether, our results provide a functional view of the Bam35 viral proteome, with a focus on the composition and organization of the viral particle. IMPORTANCE Tail-less viruses of the family Tectiviridae can infect commensal and pathogenic Gram-positive and Gram-negative bacteria. Moreover, they have been proposed to be at the evolutionary origin of several groups of large eukaryotic DNA viruses and self-replicating plasmids. However, due to their ancient origin and complex diversity, many tectiviral proteins are ORFans of unknown function.Comprehensive protein-protein interaction (PPI) analysis among viral proteins can eventually disclose biological mechanisms and thus provide new insights into protein function unattainable by studying proteins one by one. Here we comprehensively describe intraviral PPIs among tectivirus Bam35 proteins using multi-vector yeast two-hybrid screening that was further supported by co-immunoprecipitation assays and protein structural models. This approach allowed us to propose new functions for known proteins and hypothesize on the biological role localization within the viral particle of some viral ORFan proteins that will be helpful for understanding the biology of Gram-positive tectivirus. Copyright © 2017 American Society for Microbiology.
Evolution and Structural Organization of the C Proteins of Paramyxovirinae
Karlin, David G.
2014-01-01
The phosphoprotein (P) gene of most Paramyxovirinae encodes several proteins in overlapping frames: P and V, which share a common N-terminus (PNT), and C, which overlaps PNT. Overlapping genes are of particular interest because they encode proteins originated de novo, some of which have unknown structural folds, challenging the notion that nature utilizes only a limited, well-mapped area of fold space. The C proteins cluster in three groups, comprising measles, Nipah, and Sendai virus. We predicted that all C proteins have a similar organization: a variable, disordered N-terminus and a conserved, α-helical C-terminus. We confirmed this predicted organization by biophysically characterizing recombinant C proteins from Tupaia paramyxovirus (measles group) and human parainfluenza virus 1 (Sendai group). We also found that the C of the measles and Nipah groups have statistically significant sequence similarity, indicating a common origin. Although the C of the Sendai group lack sequence similarity with them, we speculate that they also have a common origin, given their similar genomic location and structural organization. Since C is dispensable for viral replication, unlike PNT, we hypothesize that C may have originated de novo by overprinting PNT in the ancestor of Paramyxovirinae. Intriguingly, in measles virus and Nipah virus, PNT encodes STAT1-binding sites that overlap different regions of the C-terminus of C, indicating they have probably originated independently. This arrangement, in which the same genetic region encodes simultaneously a crucial functional motif (a STAT1-binding site) and a highly constrained region (the C-terminus of C), seems paradoxical, since it should severely reduce the ability of the virus to adapt. The fact that it originated twice suggests that it must be balanced by an evolutionary advantage, perhaps from reducing the size of the genetic region vulnerable to mutations. PMID:24587180
2017-01-01
The effect of S-substitution on the O6 guanine site of a 13-mer DNA duplex containing a G:T mismatch is studied using molecular dynamics. The structure, dynamic evolution and hydration of the S-substituted duplex are compared with those of a normal duplex, a duplex with S-substitution on guanine, but no mismatch and a duplex with just a G:T mismatch. The S-substituted mismatch leads to cell death rather than repair. One suggestion is that the G:T mismatch recognition protein recognises the S-substituted mismatch (GS:T) as G:T. This leads to a cycle of futile repair ending in DNA breakage and cell death. We find that some structural features of the helix are similar for the duplex with the G:T mismatch and that with the S-substituted mismatch, but differ from the normal duplex, notably the helical twist. These differences arise from the change in the hydrogen-bonding pattern of the base pair. However a marked feature of the S-substituted G:T mismatch duplex is a very large opening. This showed considerable variability. It is suggested that this enlarged opening would lend support to an alternative model of cell death in which the mismatch protein attaches to thioguanine and activates downstream damage-response pathways. Attack on the sulphur by reactive oxygen species, also leading to cell death, would also be aided by the large, variable opening. PMID:28910418
Structural polymorphism at LCR and its role in beta-globin gene regulation.
Kukreti, Shrikant; Kaur, Harpreet; Kaushik, Mahima; Bansal, Aparna; Saxena, Sarika; Kaushik, Shikha; Kukreti, Ritushree
2010-09-01
Information on the secondary structures and conformational manifestations of eukaryotic DNA and their biological significance with reference to gene regulation and expression is limited. The human beta-globin gene Locus Control Region (LCR), a dominant regulator of globin gene expression, is a contiguous piece of DNA with five tissue-specific DNase I-hypersensitive sites (HSs). Since these HSs have a high density of transcription factor binding sites, structural interdependencies between HSs and different promoters may directly or indirectly regulate LCR functions. Mutations and SNPs may stabilize or destabilize the local secondary structures, affecting the gene expression by changes in the protein-DNA recognition patterns. Various palindromic or quasi-palindromic segments within LCR, could cause structural polymorphism and geometrical switching of DNA. This emphasizes the importance of understanding of the sequence-dependent variations of the DNA structure. Such structural motifs might act as regulatory elements. The local conformational variability of a DNA segment or action of a DNA specific protein is key to create and maintain active chromatin domains and affect transcription of various tissue specific beta-globin genes. We, summarize here the current status of beta-globin LCR structure and function. Further structural studies at molecular level and functional genomics might solve the regulatory puzzles that control the beta-globin gene locus. Copyright (c) 2010 Elsevier Masson SAS. All rights reserved.
A novel and rapid method for obtaining high titre intact prion strains from mammalian brain.
Wenborn, Adam; Terry, Cassandra; Gros, Nathalie; Joiner, Susan; D'Castro, Laura; Panico, Silvia; Sells, Jessica; Cronier, Sabrina; Linehan, Jacqueline M; Brandner, Sebastian; Saibil, Helen R; Collinge, John; Wadsworth, Jonathan D F
2015-05-07
Mammalian prions exist as multiple strains which produce characteristic and highly reproducible phenotypes in defined hosts. How this strain diversity is encoded by a protein-only agent remains one of the most interesting and challenging questions in biology with wide relevance to understanding other diseases involving the aggregation or polymerisation of misfolded host proteins. Progress in understanding mammalian prion strains has however been severely limited by the complexity and variability of the methods used for their isolation from infected tissue and no high resolution structures have yet been reported. Using high-throughput cell-based prion bioassay to re-examine prion purification from first principles we now report the isolation of prion strains to exceptional levels of purity from small quantities of infected brain and demonstrate faithful retention of biological and biochemical strain properties. The method's effectiveness and simplicity should facilitate its wide application and expedite structural studies of prions.
A novel and rapid method for obtaining high titre intact prion strains from mammalian brain
Wenborn, Adam; Terry, Cassandra; Gros, Nathalie; Joiner, Susan; D’Castro, Laura; Panico, Silvia; Sells, Jessica; Cronier, Sabrina; Linehan, Jacqueline M.; Brandner, Sebastian; Saibil, Helen R.; Collinge, John; Wadsworth, Jonathan D. F.
2015-01-01
Mammalian prions exist as multiple strains which produce characteristic and highly reproducible phenotypes in defined hosts. How this strain diversity is encoded by a protein-only agent remains one of the most interesting and challenging questions in biology with wide relevance to understanding other diseases involving the aggregation or polymerisation of misfolded host proteins. Progress in understanding mammalian prion strains has however been severely limited by the complexity and variability of the methods used for their isolation from infected tissue and no high resolution structures have yet been reported. Using high-throughput cell-based prion bioassay to re-examine prion purification from first principles we now report the isolation of prion strains to exceptional levels of purity from small quantities of infected brain and demonstrate faithful retention of biological and biochemical strain properties. The method’s effectiveness and simplicity should facilitate its wide application and expedite structural studies of prions. PMID:25950908
Multiple polymer architectures of human Polyhomeotic homolog 3 (PHC3) SAM
Nanyes, David R.; Junco, Sarah E.; Taylor, Alexander B.; Robinson, Angela K.; Patterson, Nicolle L.; Shivarajpur, Ambika; Halloran, Jonathan; Hale, Seth M.; Kaur, Yogeet; Hart, P. John; Kim, Chongwoo A.
2014-01-01
The self-association of sterile alpha motifs (SAMs) into a helical polymer architecture is a critical functional component of many different and diverse array of proteins. For the Drosophila Polycomb group (PcG) protein Polyhomeotic (Ph), its SAM polymerization serves as the structural foundation to cluster multiple PcG complexes, helping to maintain a silenced chromatin state. Ph SAM shares 64% sequence identity with its human ortholog, PHC3 SAM, and both SAMs polymerize. However, in the context of their larger protein regions, PHC3 SAM forms longer polymers compared to Ph SAM. Motivated to establish the precise structural basis for the differences, if any, between Ph and PHC3 SAM, we determined the crystal structure of the PHC3 SAM polymer. PHC3 SAM utilizes the same SAM-SAM interaction as the Ph SAM six-fold repeat polymer. Yet, PHC3 SAM polymerizes utilizing just five SAMs per turn of the helical polymer rather than the typical six per turn observed for all SAM polymers reported to date. Structural analysis suggested that malleability of the PHC3 SAM would allow formation of not just the five-fold repeat structure but possibly others. Indeed, a second PHC3 SAM polymer in a different crystal form forms a six-fold repeat polymer. These results suggest that the polymers formed by PHC3 SAM, and likely others, are quite dynamic. The functional consequence of the variable PHC3 SAM polymers may be to create different chromatin architectures. PMID:25044168
Structural dissection of Ebola virus and its assembly determinants using cryo-electron tomography.
Bharat, Tanmay A M; Noda, Takeshi; Riches, James D; Kraehling, Verena; Kolesnikova, Larissa; Becker, Stephan; Kawaoka, Yoshihiro; Briggs, John A G
2012-03-13
Ebola virus is a highly pathogenic filovirus causing severe hemorrhagic fever with high mortality rates. It assembles heterogenous, filamentous, enveloped virus particles containing a negative-sense, single-stranded RNA genome packaged within a helical nucleocapsid (NC). We have used cryo-electron microscopy and tomography to visualize Ebola virus particles, as well as Ebola virus-like particles, in three dimensions in a near-native state. The NC within the virion forms a left-handed helix with an inner nucleoprotein layer decorated with protruding arms composed of VP24 and VP35. A comparison with the closely related Marburg virus shows that the N-terminal region of nucleoprotein defines the inner diameter of the Ebola virus NC, whereas the RNA genome defines its length. Binding of the nucleoprotein to RNA can assemble a loosely coiled NC-like structure; the loose coil can be condensed by binding of the viral matrix protein VP40 to the C terminus of the nucleoprotein, and rigidified by binding of VP24 and VP35 to alternate copies of the nucleoprotein. Four proteins (NP, VP24, VP35, and VP40) are necessary and sufficient to mediate assembly of an NC with structure, symmetry, variability, and flexibility indistinguishable from that in Ebola virus particles released from infected cells. Together these data provide a structural and architectural description of Ebola virus and define the roles of viral proteins in its structure and assembly.
Moudjou, Mohammed; Chapuis, Jérôme; Mekrouti, Mériem; Reine, Fabienne; Herzog, Laetitia; Sibille, Pierre; Laude, Hubert; Vilette, Didier; Andréoletti, Olivier; Rezaei, Human; Dron, Michel; Béringue, Vincent
2016-07-07
Prions are formed of misfolded assemblies (PrP(Sc)) of the variably N-glycosylated cellular prion protein (PrP(C)). In infected species, prions replicate by seeding the conversion and polymerization of host PrP(C). Distinct prion strains can be recognized, exhibiting defined PrP(Sc) biochemical properties such as the glycotype and specific biological traits. While strain information is encoded within the conformation of PrP(Sc) assemblies, the storage of the structural information and the molecular requirements for self-perpetuation remain uncertain. Here, we investigated the specific role of PrP(C) glycosylation status. First, we developed an efficient protein misfolding cyclic amplification method using cells expressing the PrP(C) species of interest as substrate. Applying the technique to PrP(C) glycosylation mutants expressing cells revealed that neither PrP(C) nor PrP(Sc) glycoform stoichiometry was instrumental to PrP(Sc) formation and strainness perpetuation. Our study supports the view that strain properties, including PrP(Sc) glycotype are enciphered within PrP(Sc) structural backbone, not in the attached glycans.
Genetic structure and regulation of isoprene synthase in Poplar (Populus spp.).
Vickers, Claudia E; Possell, Malcolm; Nicholas Hewitt, C; Mullineaux, Philip M
2010-07-01
Isoprene is a volatile 5-carbon hydrocarbon derived from the chloroplastic methylerythritol 2-C-methyl-D: -erythritol 4-phosphate isoprenoid pathway. In plants, isoprene emission is controlled by the enzyme isoprene synthase; however, there is still relatively little known about the genetics and regulation of this enzyme. Isoprene synthase gene structure was analysed in three poplar species. It was found that genes encoding stromal isoprene synthase exist as a small gene family, the members of which encode virtually identical proteins and are differentially regulated. Accumulation of isoprene synthase protein is developmentally regulated, but does not differ between sun and shade leaves and does not increase when heat stress is applied. Our data suggest that, in mature leaves, isoprene emission rates are primarily determined by substrate (dimethylallyl diphosphate, DMADP) availability. In immature leaves, where isoprene synthase levels are variable, emission levels are also influenced by the amount of isoprene synthase protein. No thylakoid isoforms could be identified in Populus alba or in Salix babylonica. Together, these data show that control of isoprene emission at the genetic level is far more complicated than previously assumed.
Stanger, Frédéric V; de Beer, Tjaart A P; Dranow, David M; Schirmer, Tilman; Phan, Isabelle; Dehio, Christoph
2017-01-03
The BID (Bep intracellular delivery) domain functions as secretion signal in a subfamily of protein substrates of bacterial type IV secretion (T4S) systems. It mediates transfer of (1) relaxases and the attached DNA during bacterial conjugation, and (2) numerous Bartonella effector proteins (Beps) during protein transfer into host cells infected by pathogenic Bartonella species. Furthermore, BID domains of Beps have often evolved secondary effector functions within host cells. Here, we provide crystal structures for three representative BID domains and describe a novel conserved fold characterized by a compact, antiparallel four-helix bundle topped with a hook. The conserved hydrophobic core provides a rigid scaffold to a surface that, despite a few conserved exposed residues and similarities in charge distribution, displays significant variability. We propose that the genuine function of BID domains as T4S signal may primarily depend on their rigid structure, while the plasticity of their surface may facilitate adaptation to secondary effector functions. Copyright © 2016 Elsevier Ltd. All rights reserved.
Mesa, Martha C; Rodríguez, Luz-Stella; Franco, Manuel A; Angel, Juana
2007-09-15
We studied the interaction of RV with human peripheral blood mononuclear cells (PBMC) from adult volunteers. After exposure of PBMC to rhesus RV (RRV), T and B lymphocytes, NK cells, monocytes, and myeloid and plasmacytoid dendritic cells expressed RV non-structural proteins, at variable levels. Expression of these RV proteins was abolished if infection was done in the presence of anti-VP7 neutralizing antibodies or 10% autologous serum. Supernatants of RRV exposed PBMC contained TNF-alpha, IL-6, IFN-alpha, IFN-gamma, IL-2 and IL-10. Plasmacytoid DC were found to be the main source of IFN-alpha production, and in their absence the production of IFN-gamma and the frequency of RV specific T cells that secrete IFN-gamma diminished. Finally, we could not detect RV-antigen associated with the PBMC or expression of RV non-structural proteins in PBMC of acutely RV-infected children. Thus, although PBMC are susceptible to the initial steps of RV infection, most PBMC of children with RV-gastroenteritis are not infected.
Mechanism of allosteric regulation of β2-adrenergic receptor by cholesterol
Manna, Moutusi; Niemelä, Miia; Tynkkynen, Joona; Javanainen, Matti; Kulig, Waldemar; Müller, Daniel J; Rog, Tomasz; Vattulainen, Ilpo
2016-01-01
There is evidence that lipids can be allosteric regulators of membrane protein structure and activation. However, there are no data showing how exactly the regulation emerges from specific lipid-protein interactions. Here we show in atomistic detail how the human β2-adrenergic receptor (β2AR) – a prototypical G protein-coupled receptor – is modulated by cholesterol in an allosteric fashion. Extensive atomistic simulations show that cholesterol regulates β2AR by limiting its conformational variability. The mechanism of action is based on the binding of cholesterol at specific high-affinity sites located near the transmembrane helices 5–7 of the receptor. The alternative mechanism, where the β2AR conformation would be modulated by membrane-mediated interactions, plays only a minor role. Cholesterol analogues also bind to cholesterol binding sites and impede the structural flexibility of β2AR, however cholesterol generates the strongest effect. The results highlight the capacity of lipids to regulate the conformation of membrane receptors through specific interactions. DOI: http://dx.doi.org/10.7554/eLife.18432.001 PMID:27897972
Is Congo red an amyloid-specific dye?
Khurana, R; Uversky, V N; Nielsen, L; Fink, A L
2001-06-22
Congo red (CR) binding, monitored by characteristic yellow-green birefringence under crossed polarization has been used as a diagnostic test for the presence of amyloid in tissue sections for several decades. This assay is also widely used for the characterization of in vitro amyloid fibrils. In order to probe the structural specificity of Congo red binding to amyloid fibrils we have used an induced circular dichroism (CD) assay. Amyloid fibrils from insulin and the variable domain of Ig light chain demonstrate induced CD spectra upon binding to Congo red. Surprisingly, the native conformations of insulin and Ig light chain also induced Congo red circular dichroism, but with different spectral shapes than those from fibrils. In fact, a wide variety of native proteins exhibited induced CR circular dichroism indicating that CR bound to representative proteins from different classes of secondary structure such as alpha (citrate synthase), alpha + beta (lysozyme), beta (concavalin A), and parallel beta-helical proteins (pectate lyase). Partially folded intermediates of apomyoglobin induced different Congo red CD bands than the corresponding native conformation, however, no induced CD bands were observed with unfolded protein. Congo red was also found to induce oligomerization of native proteins, as demonstrated by covalent cross-linking and small angle x-ray scattering. Our data suggest that Congo red is sandwiched between two protein molecules causing protein oligomerization. The fact that Congo red binds to native, partially folded conformations and amyloid fibrils of several proteins shows that it must be used with caution as a diagnostic test for the presence of amyloid fibrils in vitro.
Ahmed, Aqeel; Rippmann, Friedrich; Barnickel, Gerhard; Gohlke, Holger
2011-07-25
A three-step approach for multiscale modeling of protein conformational changes is presented that incorporates information about preferred directions of protein motions into a geometric simulation algorithm. The first two steps are based on a rigid cluster normal-mode analysis (RCNMA). Low-frequency normal modes are used in the third step (NMSim) to extend the recently introduced idea of constrained geometric simulations of diffusive motions in proteins by biasing backbone motions of the protein, whereas side-chain motions are biased toward favorable rotamer states. The generated structures are iteratively corrected regarding steric clashes and stereochemical constraint violations. The approach allows performing three simulation types: unbiased exploration of conformational space; pathway generation by a targeted simulation; and radius of gyration-guided simulation. When applied to a data set of proteins with experimentally observed conformational changes, conformational variabilities are reproduced very well for 4 out of 5 proteins that show domain motions, with correlation coefficients r > 0.70 and as high as r = 0.92 in the case of adenylate kinase. In 7 out of 8 cases, NMSim simulations starting from unbound structures are able to sample conformations that are similar (root-mean-square deviation = 1.0-3.1 Å) to ligand bound conformations. An NMSim generated pathway of conformational change of adenylate kinase correctly describes the sequence of domain closing. The NMSim approach is a computationally efficient alternative to molecular dynamics simulations for conformational sampling of proteins. The generated conformations and pathways of conformational transitions can serve as input to docking approaches or as starting points for more sophisticated sampling techniques.
The role played by alternative splicing in antigenic variability in human endo-parasites.
Hull, Rodney; Dlamini, Zodwa
2014-01-28
Endo-parasites that affect humans include Plasmodium, the causative agent of malaria, which remains one of the leading causes of death in human beings. Despite decades of research, vaccines to this and other endo-parasites remain elusive. This is in part due to the hyper-variability of the parasites surface proteins. Generally these surface proteins are encoded by a large family of genes, with only one being dominantly expressed at certain life stages. Another layer of complexity can be introduced through the alternative splicing of these surface proteins. The resulting isoforms may differ from each other with regard to cell localisation, substrate affinities and functions. They may even differ in structure to the extent that they are no longer recognised by the host's immune system. In many cases this leads to changes in the N terminus of these proteins. The geographical localisation of endo-parasitic infections around the tropics and the highest incidences of HIV-1 infection in the same areas, adds a further layer of complexity as parasitic infections affect the host immune system resulting in higher HIV infection rates, faster disease progression, and an increase in the severity of infections and complications in HIV diagnosis. This review discusses some examples of parasite surface proteins that are alternatively spliced in trypanosomes, Plasmodium and the parasitic worm Schistosoma as well as what role alternate splicing may play in the interaction between HIV and these endo-parasites.
Kabat, E A; Wu, T T; Bilofsky, H
1976-02-01
From collected data on variable region sequences of heavy chains of immunoglobulins, the probability of random associations of any two amino-acid residues in the complementarity-determining segments was computed, and pairs of residues occurring significantly more frequently than expected were selected by computer. Significant associations between Phe 32 and Tyr 33, Phe 32 and Glu 35, and Tyr 33 and Glu 35 were found in six proteins, all of which were mouse myeloma proteins which bound phosphorylcholine (= phosphocholine). From the x-ray structure of McPC603, Tyr 33 and Glu 35 are contacting residues; a seventh phosphorylcholine-binding mouse myeloma protein also contained Phe 32 and Tyr 33 but position 35 had only been determined as Glx and thus this position had not been selected. Met 34 occurred in all seven phosphorylcholine-binding myeoma proteins but was also present at this position in 29 other proteins and thus was not selected; it is seen in the x-ray structure not to be a contacting residue. The role of Phe 32 is not obvious but it could have some conformational influence. A human phosphorylcholine-binding myeloma protien also had Phe, Tyr, and Met at positions 32, 33, and 34, but had Asp instead of Glu at position 35 and showed a lower binding constant. The ability to use sequence data to locate residues in complementarity-determing segments making contact with antigenic determinants and those playing essentially a structural role would contribute substantially to the understanding of antibody specificity.
Carrazco-Peña, Laura; Osuna-Castro, Juan A; De León-Rodríguez, Antonio; Maruyama, Nobuyuki; Toro-Vazquez, Jorge F; Morales-Rueda, Juan A; Barba de la Rosa, Ana P
2013-04-10
The primary structure of amaranth 11S globulin (Ah11S) was engineered with the aim to improve its functional properties. Four continuous methionines were inserted in variable region V, obtaining the Ah11Sr+4M construction. Changes on protein structure and surface characteristics were analyzed in silico. Solubility and heat-induced gelation of recombinant amaranth 11S proglobulin (Ah11Sr and Ah11Sr+4M) were compared with the native protein (Ah11Sn) purified from amaranth seed flour. The Ah11Sr+4 M showed the highest surface hydrophobicity, but as consequence the solubility was reduced. At low ionic strength (μ = 0.2) and acidic pH (<4.1), the recombinant proteins Ah11Sr and Ah11Sr+4 M had the highest and lowest solubility values, respectively. All globulins samples formed gels at 90 °C and low ionic strength, but Ah11Sn produced the weakest and Ah11Sr the strongest gels. Differential scanning calorimetry analysis under gel forming conditions revealed only exothermic transitions for all amaranth 11S globulins analyzed. In conclusion, the 3D structure analysis has revealed interesting molecular features that could explain the thermal resistance and gel forming ability of amaranth 11S globulins. The incorporation of four continuous methionines in amaranth increased the hydrophobicity, and self-supporting gels formed had intermediate hardness between Ah11Sn and Ah11Sr. These functional properties could be used in the food industry for the development of new products based on amaranth proteins.
Towers, Rebecca J.; Fagan, Peter K.; Talay, Susanne R.; Currie, Bart J.; Sriprakash, Kadaba S.; Walker, Mark J.; Chhatwal, Gursharan S.
2003-01-01
Streptococcal fibronectin-binding protein is an important virulence factor involved in colonization and invasion of epithelial cells and tissues by Streptococcus pyogenes. In order to investigate the mechanisms involved in the evolution of sfbI, the sfbI genes from 54 strains were sequenced. Thirty-four distinct alleles were identified. Three principal mechanisms appear to have been involved in the evolution of sfbI. The amino-terminal aromatic amino acid-rich domain is the most variable region and is apparently generated by intergenic recombination of horizontally acquired DNA cassettes, resulting in a genetic mosaic in this region. Two distinct and divergent sequence types that shared only 61 to 70% identity were identified in the central proline-rich region, while variation at the 3′ end of the gene is due to deletion or duplication of defined repeat units. Potential antigenic and functional variabilities in SfbI imply significant selective pressure in vivo with direct implications for the microbial pathogenesis of S. pyogenes. PMID:14662917
Human heavy chain disease protein WIS: implications for the organization of immunoglobulin genes.
Franklin, E C; Prelli, F; Frangione, B
1979-01-01
Protein WIS is a human gamma3 heavy (H) chain disease immunoglobulin variant whose amino acid sequence is most readily interpreted by postulating that three residues of the amino terminus are followed by a deletion of most of the variable (VH) domain, which ends at the variable-constant (VC) joining region. Then there is a stretch of eight residues, three of which are unusual, while the other five have striking homology to the VC junction sequence. This is followed by a second deletion, which ends at the beginning of the quadruplicated hinge region. These findings are consistent with mutations resulting in deletions of most of the gene coding for the V region and CH1 domain followed by splicing at the VC joining region and at the hinge. These structural features fit well the notion of genetic discontinuity between V and C genes and also suggest similar mechanisms of excision and splicing in the interdomain regions of the C gene of the heavy chain. PMID:106391
Packing Regularities in Biological Structures Relate to Their Dynamics
Jernigan, Robert L.; Kloczkowski, Andrzej
2007-01-01
The high packing density inside proteins leads to certain geometric regularities and also is one of the most important contributors to the high extent of cooperativity manifested by proteins in their cohesive domain motions. The orientations between neighboring non-bonded residues in proteins substantially follow the similar geometric regularities, regardless of whether the residues are on the surface or buried - a direct result of hydrophobicity forces. These orientations are relatively fixed and correspond closely to small deformations from those of the face-centered cubic lattice, which is the way in which identical spheres pack at the highest density. Packing density also is related to the extent of conservation of residues, and we show this relationship for residue packing densities by averaging over a large sample or residue packings. There are three regimes: 1) over a broad range of packing densities the relationship between sequence entropy and inverse packing density is nearly linear, 2) over a limited range of low packing densities the sequence entropy is nearly constant, and 3) at extremely low packing densities the sequence entropy is highly variable. These packing results provide important justification for the simple elastic network models that have been shown for a large number of proteins to represent protein dynamics so successfully, even when the models are extremely coarse-grained. Elastic network models for polymeric chains are simple and could be combined with these protein elastic networks to represent partially denatured parts of proteins. Finally, we show results of applications of the elastic network model to study the functional motions of the ribosome, based on its known structure. These results indicate expected correlations among its components for the step-wise processing steps in protein synthesis, and suggest ways to use these elastic network models to develop more detailed mechanisms - an important possibility, since most experiments yield only static structures. PMID:16957327
Inaba, Satomi; Numoto, Nobutaka; Ogawa, Shuhei; Morii, Hisayuki; Ikura, Teikichi; Abe, Ryo; Ito, Nobutoshi; Oda, Masayuki
2017-01-01
Full activation of T cells and differentiation into effector T cells are essential for many immune responses and require co-stimulatory signaling via the CD28 receptor. Extracellular ligand binding to CD28 recruits protein-tyrosine kinases to its cytoplasmic tail, which contains a YMNM motif. Following phosphorylation of the tyrosine, the proteins growth factor receptor-bound protein 2 (Grb2), Grb2-related adaptor downstream of Shc (Gads), and p85 subunit of phosphoinositide 3-kinase may bind to pYMNM (where pY is phosphotyrosine) via their Src homology 2 (SH2) domains, leading to downstream signaling to distinct immune pathways. These three adaptor proteins bind to the same site on CD28 with variable affinity, and all are important for CD28-mediated co-stimulatory function. However, the mechanism of how these proteins recognize and compete for CD28 is unclear. To visualize their interactions with CD28, we have determined the crystal structures of Gads SH2 and two p85 SH2 domains in complex with a CD28-derived phosphopeptide. The high resolution structures obtained revealed that, whereas the CD28 phosphopeptide bound to Gads SH2 is in a bent conformation similar to that when bound to Grb2 SH2, it adopts a more extended conformation when bound to the N- and C-terminal SH2 domains of p85. These differences observed in the peptide-protein interactions correlated well with the affinity and other thermodynamic parameters for each interaction determined by isothermal titration calorimetry. The detailed insight into these interactions reported here may inform the development of compounds that specifically inhibit the association of CD28 with these adaptor proteins to suppress excessive T cell responses, such as in allergies and autoimmune diseases. PMID:27927989
Inaba, Satomi; Numoto, Nobutaka; Ogawa, Shuhei; Morii, Hisayuki; Ikura, Teikichi; Abe, Ryo; Ito, Nobutoshi; Oda, Masayuki
2017-01-20
Full activation of T cells and differentiation into effector T cells are essential for many immune responses and require co-stimulatory signaling via the CD28 receptor. Extracellular ligand binding to CD28 recruits protein-tyrosine kinases to its cytoplasmic tail, which contains a YMNM motif. Following phosphorylation of the tyrosine, the proteins growth factor receptor-bound protein 2 (Grb2), Grb2-related adaptor downstream of Shc (Gads), and p85 subunit of phosphoinositide 3-kinase may bind to pYMNM (where pY is phosphotyrosine) via their Src homology 2 (SH2) domains, leading to downstream signaling to distinct immune pathways. These three adaptor proteins bind to the same site on CD28 with variable affinity, and all are important for CD28-mediated co-stimulatory function. However, the mechanism of how these proteins recognize and compete for CD28 is unclear. To visualize their interactions with CD28, we have determined the crystal structures of Gads SH2 and two p85 SH2 domains in complex with a CD28-derived phosphopeptide. The high resolution structures obtained revealed that, whereas the CD28 phosphopeptide bound to Gads SH2 is in a bent conformation similar to that when bound to Grb2 SH2, it adopts a more extended conformation when bound to the N- and C-terminal SH2 domains of p85. These differences observed in the peptide-protein interactions correlated well with the affinity and other thermodynamic parameters for each interaction determined by isothermal titration calorimetry. The detailed insight into these interactions reported here may inform the development of compounds that specifically inhibit the association of CD28 with these adaptor proteins to suppress excessive T cell responses, such as in allergies and autoimmune diseases. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Binding regularities in complexes of transcription factors with operator DNA: homeodomain family.
Chirgadze, Yu N; Zheltukhin, E I; Polozov, R V; Sivozhelezov, V S; Ivanov, V V
2009-06-01
In order to disclose general regularities of binding in homeodomain-DNA complexes we considered five of them and extended the observed regularities over the entire homeodomain family. The five complexes have been selected by similarity of protein structures and patterns of contacting residues. Their long range interactions and interfaces were compared. The long-range stage of the recognition process was characterized by electrostatic potentials about 5 Angstrom away from molecular surfaces of protein or DNA. For proteins, clear positive potential is displayed only at the side contacting the DNA. The double-chained DNA molecule displays a rather strong negative potential, especially in their grooves. Thus, a functional role of electrostatics is a guiding of the protein into the DNA major groove, so the protein and DNA could form a loose non-specific complex. At the close-range stage, neutralization of the phosphate charges by positively charged residues is necessary for decreasing the strong electrostatic potential of DNA, allowing nucleotide bases to participate in the formation of protein-DNA atomic contacts in the interface. The recognizing alpha-helix of protein was shown to form both invariant and variable groups of contacts with DNA by means of certain specific side groups. The invariant contacts included highly specific protein-DNA hydrogen bonds between asparagine and adenine, nonpolar contacts of hydrophobic amino acids serving as a stereochemical barrier for fixing the protein factor on DNA, and an interface cluster of water molecules providing local conformational mobility necessary for the dissociation process. There is a unique water molecule within the interface that is conservative and located at the interface center. Invariant contacts of the proteins are mostly formed with the TAAT motif of the promoter DNA forward strand. While the invariant contacts specify the family of homeodomains, the variable contacts that are formed with the reverse strand of DNA provide specificity of individual complexes within the homeodomain family.
Stoops, Janelle; Byrd, Samantha; Hasegawa, Haruki
2012-10-01
Russell bodies are intracellular aggregates of immunoglobulins. Although the mechanism of Russell body biogenesis has been extensively studied by using truncated mutant heavy chains, the importance of the variable domain sequences in this process and in immunoglobulin biosynthesis remains largely unknown. Using a panel of structurally and functionally normal human immunoglobulin Gs, we show that individual immunoglobulin G clones possess distinctive Russell body inducing propensities that can surface differently under normal and abnormal cellular conditions. Russell body inducing predisposition unique to each immunoglobulin G clone was corroborated by the intrinsic physicochemical properties encoded in the heavy chain variable domain/light chain variable domain sequence combinations that define each immunoglobulin G clone. While the sequence based intrinsic factors predispose certain immunoglobulin G clones to be more prone to induce Russell bodies, extrinsic factors such as stressful cell culture conditions also play roles in unmasking Russell body propensity from immunoglobulin G clones that are normally refractory to developing Russell bodies. By taking advantage of heterologous expression systems, we dissected the roles of individual subunit chains in Russell body formation and examined the effect of non-cognate subunit chain pair co-expression on Russell body forming propensity. The results suggest that the properties embedded in the variable domain of individual light chain clones and their compatibility with the partnering heavy chain variable domain sequences underscore the efficiency of immunoglobulin G biosynthesis, the threshold for Russell body induction, and the level of immunoglobulin G secretion. We propose that an interplay between the unique properties encoded in variable domain sequences and the state of protein homeostasis determines whether an immunoglobulin G expressing cell will develop the Russell body phenotype in a dynamic cellular setting. Copyright © 2012 Elsevier B.V. All rights reserved.
Structure-based function prediction of the expanding mollusk tyrosinase family
NASA Astrophysics Data System (ADS)
Huang, Ronglian; Li, Li; Zhang, Guofan
2017-11-01
Tyrosinase (Ty) is a common enzyme found in many different animal groups. In our previous study, genome sequencing revealed that the Ty family is expanded in the Pacific oyster ( Crassostrea gigas). Here, we examine the larger number of Ty family members in the Pacific oyster by high-level structure prediction to obtain more information about their function and evolution, especially the unknown role in biomineralization. We verified 12 Ty gene sequences from Crassostrea gigas genome and Pinctada fucata martensii transcriptome. By using phylogenetic analysis of these Tys with functionally known Tys from other molluscan species, eight subgroups were identified (CgTy_s1, CgTy_s2, MolTy_s1, MolTy-s2, MolTy-s3, PinTy-s1, PinTy-s2 and PviTy). Structural data and surface pockets of the dinuclear copper center in the eight subgroups of molluscan Ty were obtained using the latest versions of prediction online servers. Structural comparison with other Ty proteins from the protein databank revealed functionally important residues (HA1, HA2, HA3, HB1, HB2, HB3, Z1-Z9) and their location within these protein structures. The structural and chemical features of these pockets which may related to the substrate binding showed considerable variability among mollusks, which undoubtedly defines Ty substrate binding. Finally, we discuss the potential driving forces of Ty family evolution in mollusks. Based on these observations, we conclude that the Ty family has rapidly evolved as a consequence of substrate adaptation in mollusks.
Four structural risk factors identify most fibril-forming kappa light chains.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stevens, F. J.; Biosciences Division
2000-09-01
Antibody light chains (LCs) comprise the most structurally diverse family of proteins involved in amyloidosis. Many antibody LCs incorporate structural features that impair their stability and solubility, leading to their assembly into fibrils and to their subsequent pathological deposition when produced in excess during multiple myeloma and primary amyloidosis. The particular amino acid variations in antibody LCs that account for fibril formation and amyloidogenesis have not been identified. This study focuses on amyloidogenesis within the Kl family of human LCs. Reanalysis of the current database of primary structures of proteins from more than 100 patients who produced Kl LCS, 37more » of which were amyloidogenic, reveals apparent structural features that may contribute to amyloidosis. These features include loss of conserved residues or the gain of particular residues through mutation at sites involving a repertoire of approximately 20% of the amino acid positions in the light chain variable domain (V{sub L}). Moreover, 80% of all K1 amyloidogenic V{sub L}s are identifiable by the presence of at least one of three single-site substitutions or the acquisition of an N-linked glycosylation site through mutations. These findings suggest that it is feasible to predict fibril propensity by analysis of primary structure.« less
Modeling the SHG activities of diverse protein crystals
DOE Office of Scientific and Technical Information (OSTI.GOV)
Haupert, Levi M.; DeWalt, Emma L.; Simpson, Garth J., E-mail: gsimpson@purdue.edu
2012-11-01
The origins of the diversity in the SHG signal from protein crystals are investigated and potential protein-crystal coverage by SHG microscopy is assessed. A symmetry-additive ab initio model for second-harmonic generation (SHG) activity of protein crystals was applied to assess the likely protein-crystal coverage of SHG microscopy. Calculations were performed for 250 proteins in nine point-group symmetries: a total of 2250 crystals. The model suggests that the crystal symmetry and the limit of detection of the instrument are expected to be the strongest predictors of coverage of the factors considered, which also included secondary-structural content and protein size. Much ofmore » the diversity in SHG activity is expected to arise primarily from the variability in the intrinsic protein response as well as the orientation within the crystal lattice. Two or more orders-of-magnitude variation in intensity are expected even within protein crystals of the same symmetry. SHG measurements of tetragonal lysozyme crystals confirmed detection, from which a protein coverage of ∼84% was estimated based on the proportion of proteins calculated to produce SHG responses greater than that of tetragonal lysozyme. Good agreement was observed between the measured and calculated ratios of the SHG intensity from lysozyme in tetragonal and monoclinic lattices.« less
Sander, Adam F.; Lavstsen, Thomas; Rask, Thomas S.; Lisby, Michael; Salanti, Ali; Fordyce, Sarah L.; Jespersen, Jakob S.; Carter, Richard; Deitsch, Kirk W.; Theander, Thor G.; Pedersen, Anders Gorm; Arnot, David E.
2014-01-01
Many bacterial, viral and parasitic pathogens undergo antigenic variation to counter host immune defense mechanisms. In Plasmodium falciparum, the most lethal of human malaria parasites, switching of var gene expression results in alternating expression of the adhesion proteins of the Plasmodium falciparum-erythrocyte membrane protein 1 class on the infected erythrocyte surface. Recombination clearly generates var diversity, but the nature and control of the genetic exchanges involved remain unclear. By experimental and bioinformatic identification of recombination events and genome-wide recombination hotspots in var genes, we show that during the parasite’s sexual stages, ectopic recombination between isogenous var paralogs occurs near low folding free energy DNA 50-mers and that these sequences are heavily concentrated at the boundaries of regions encoding individual Plasmodium falciparum-erythrocyte membrane protein 1 structural domains. The recombinogenic potential of these 50-mers is not parasite-specific because these sequences also induce recombination when transferred to the yeast Saccharomyces cerevisiae. Genetic cross data suggest that DNA secondary structures (DSS) act as inducers of recombination during DNA replication in P. falciparum sexual stages, and that these DSS-regulated genetic exchanges generate functional and diverse P. falciparum adhesion antigens. DSS-induced recombination may represent a common mechanism for optimizing the evolvability of virulence gene families in pathogens. PMID:24253306
Taghizadeh, Mohammad; Goliaei, Bahram; Madadkar-Sobhani, Armin
2016-06-01
Protein flexibility, which has been referred as a dynamic behavior has various roles in proteins' functions. Furthermore, for some developed tools in bioinformatics, such as protein-protein docking software, considering the protein flexibility, causes a higher degree of accuracy. Through undertaking the present work, we have accomplished the quantification plus analysis of the variations in the human Cyclin Dependent Kinase 2 (hCDK2) protein flexibility without affecting a significant change in its initial environment or the protein per se. The main goal of the present research was to calculate variations in the flexibility for each residue of the hCDK2, analysis of their flexibility variations through clustering, and to investigate the functional aspects of the residues with high flexibility variations. Using Gromacs package (version 4.5.4), three independent molecular dynamics (MD) simulations of the hCDK2 protein (PDB ID: 1HCL) was accomplished with no significant changes in their initial environments, structures, or conformations, followed by Root Mean Square Fluctuations (RMSF) calculation of these MD trajectories. The amount of variations in these three curves of RMSF was calculated using two formulas. More than 50% of the variation in the flexibility (the distance between the maximum and the minimum amount of the RMSF) was found at the region of Val-154. As well, there are other major flexibility fluctuations in other residues. These residues were mostly positioned in the vicinity of the functional residues. The subsequent works were done, as followed by clustering all hCDK2 residues into four groups considering the amount of their variability with respect to flexibility and their position in the RMSF curves. This work has introduced a new class of flexibility aspect of the proteins' residues. It could also help designing and engineering proteins, with introducing a new dynamic aspect of hCDK2, and accordingly, for the other similar globular proteins. In addition, it could provide a better computational calculation of the protein flexibility, which is, especially important in the comparative studies of the proteins' flexibility.
Smirnov, Alexey; Zubrienė, Asta; Manakova, Elena; Gražulis, Saulius
2018-01-01
The structure-thermodynamics correlation analysis was performed for a series of fluorine- and chlorine-substituted benzenesulfonamide inhibitors binding to several human carbonic anhydrase (CA) isoforms. The total of 24 crystal structures of 16 inhibitors bound to isoforms CA I, CA II, CA XII, and CA XIII provided the structural information of selective recognition between a compound and CA isoform. The binding thermodynamics of all structures was determined by the analysis of binding-linked protonation events, yielding the intrinsic parameters, i.e., the enthalpy, entropy, and Gibbs energy of binding. Inhibitor binding was compared within structurally similar pairs that differ by para- or meta-substituents enabling to obtain the contributing energies of ligand fragments. The pairs were divided into two groups. First, similar binders—the pairs that keep the same orientation of the benzene ring exhibited classical hydrophobic effect, a less exothermic enthalpy and a more favorable entropy upon addition of the hydrophobic fragments. Second, dissimilar binders—the pairs of binders that demonstrated altered positions of the benzene rings exhibited the non-classical hydrophobic effect, a more favorable enthalpy and variable entropy contribution. A deeper understanding of the energies contributing to the protein-ligand recognition should lead toward the eventual goal of rational drug design where chemical structures of ligands could be designed based on the target protein structure. PMID:29503769
The Effect of Natural Osmolyte Mixtures on the Temperature-Pressure Stability of the Protein RNase A
NASA Astrophysics Data System (ADS)
Arns, Loana; Schuabb, Vitor; Meichsner, Shari; Berghaus, Melanie; Winter, Roland
2018-05-01
In biological cells, osmolytes appear as complex mixtures with variable compositions, depending on the particular environmental conditions of the organism. Based on various spectroscopic, thermodynamic and small-angle scattering data, we explored the effect of two different natural osmolyte mixtures, which are found in shallow-water and deep-sea shrimps, on the temperature and pressure stability of a typical monomeric protein, RNase A. Both natural osmolyte mixtures stabilize the protein against thermal and pressure denaturation. This effect seems to be mainly caused by the major osmolyte components of the osmolyte mixtures, i.e. by glycine and trimethylamine-N-oxide (TMAO), respectively. A minor compaction of the structure, in particular in the unfolded state, seems to be largely due to TMAO. Differences in thermodynamic properties observed for glycine and TMAO, and hence also for the two osmolyte mixtures, are most likely due to different solvation properties and interactions with the protein. Different from TMAO, glycine seems to interact with the amino acid side chains and/or the backbone of the protein, thus competing with hydration water and leading to a less hydrated protein surface.
Haikonen, Tuuli; Rajamäki, Minna-Liisa; Tian, Yan-Ping; Valkonen, Jari P T
2013-07-01
Helper component proteinase (HCpro) is a multifunctional protein of potyviruses (genus Potyvirus). HCpro of Potato virus A (PVA) interacts with the microtubule-associated protein HIP2 in host cells, and depletion of HIP2 reduces virus accumulation. This study shows that HCpro of Potato virus Y and Tobacco etch virus also interact with HIP2. The C-proximal portion of PVA HCpro determines the interaction with HIP2 and was found to contain a stretch of six residues comprising a highly variable region (HVR) in potyviruses. Mutations in HVR reduced PVA accumulation in tobacco plants and induced necrotic symptoms novel to PVA. Microarray and quantitative reverse transcription polymerase chain reaction analyses revealed induction of many defense-related genes including ethylene- and jasmonic acid-inducible pathways in systemically infected leaves at necrosis onset. Salicylic acid-mediated signaling was dispensable for the response. Genes related to microtubule functions were down-regulated. Structural modeling of HCpro suggested that all mutations in HVR caused conformational changes in adjacent regions containing functionally important motifs conserved in potyviruses. Those mutations, which also caused conformational changes in HVR, led to the greatest reduction of fitness. Our results implicate HVR in the regulation of HCpro conformation and virus-host interactions and suggest that mutation of HVR induces host defense.
Pheromone Recognition and Selectivity by ComR Proteins among Streptococcus Species
Morrison, Donald A.; Talagas, Antoine; Nessler, Sylvie; Federle, Michael J.; Prehna, Gerd
2016-01-01
Natural transformation, or competence, is an ability inherent to bacteria for the uptake of extracellular DNA. This process is central to bacterial evolution and allows for the rapid acquirement of new traits, such as antibiotic resistance in pathogenic microorganisms. For the Gram-positive bacteria genus Streptococcus, genes required for competence are under the regulation of quorum sensing (QS) mediated by peptide pheromones. One such system, ComRS, consists of a peptide (ComS) that is processed (XIP), secreted, and later imported into the cytoplasm, where it binds and activates the transcription factor ComR. ComR then engages in a positive feedback loop for the expression of ComS and the alternative sigma-factor SigX. Although ComRS are present in the majority of Streptococcus species, the sequence of both ComS/XIP and ComR diverge significantly, suggesting a mechanism for species-specific communication. To study possible cross-talk between streptococcal species in the regulation of competence, and to explore in detail the molecular interaction between ComR and XIP we undertook an interdisciplinary approach. We developed a ‘test-bed’ assay to measure the activity of different ComR proteins in response to cognate and heterologous XIP peptides in vivo, revealing distinct ComR classes of strict, intermediate, and promiscuous specificity among species. We then solved an X-ray crystal structure of ComR from S. suis to further understand the interaction with XIP and to search for structural features in ComR proteins that may explain XIP recognition. Using the structure as a guide, we probed the apo conformation of the XIP-binding pocket by site-directed mutagenesis, both in test-bed cultures and biochemically in vitro. In alignments with ComR proteins from other species, we find that the pocket is lined by a variable and a conserved face, where residues of the conserved face contribute to ligand binding and the variable face discriminate among XIP peptides. Together, our results not only provide a model for XIP recognition and specificity, but also allow for the prediction of novel XIP peptides that induce ComR activity. PMID:27907154
2010-01-01
Background Intragenic tandem repeats occur throughout all domains of life and impart functional and structural variability to diverse translation products. Repeat proteins confer distinctive surface phenotypes to many unicellular organisms, including those with minimal genomes such as the wall-less bacterial monoderms, Mollicutes. One such repeat pattern in this clade is distributed in a manner suggesting its exchange by horizontal gene transfer (HGT). Expanding genome sequence databases reveal the pattern in a widening range of bacteria, and recently among eucaryotic microbes. We examined the genomic flux and consequences of the motif by determining its distribution, predicted structural features and association with membrane-targeted proteins. Results Using a refined hidden Markov model, we document a 25-residue protein sequence motif tandemly arrayed in variable-number repeats in ORFs lacking assigned functions. It appears sporadically in unicellular microbes from disparate bacterial and eucaryotic clades, representing diverse lifestyles and ecological niches that include host parasitic, marine and extreme environments. Tracts of the repeats predict a malleable configuration of recurring domains, with conserved hydrophobic residues forming an amphipathic secondary structure in which hydrophilic residues endow extensive sequence variation. Many ORFs with these domains also have membrane-targeting sequences that predict assorted topologies; others may comprise reservoirs of sequence variants. We demonstrate expressed variants among surface lipoproteins that distinguish closely related animal pathogens belonging to a subgroup of the Mollicutes. DNA sequences encoding the tandem domains display dyad symmetry. Moreover, in some taxa the domains occur in ORFs selectively associated with mobile elements. These features, a punctate phylogenetic distribution, and different patterns of dispersal in genomes of related taxa, suggest that the repeat may be disseminated by HGT and intra-genomic shuffling. Conclusions We describe novel features of PARCELs (Palindromic Amphipathic Repeat Coding ELements), a set of widely distributed repeat protein domains and coding sequences that were likely acquired through HGT by diverse unicellular microbes, further mobilized and diversified within genomes, and co-opted for expression in the membrane proteome of some taxa. Disseminated by multiple gene-centric vehicles, ORFs harboring these elements enhance accessory gene pools as part of the "mobilome" connecting genomes of various clades, in taxa sharing common niches. PMID:20626840
Biesiekierski, Jessica R
2017-03-01
Gluten is the main storage protein of wheat grains. Gluten is a complex mixture of hundreds of related but distinct proteins, mainly gliadin and glutenin. Similar storage proteins exist as secalin in rye, hordein in barley, and avenins in oats and are collectively referred to as "gluten." The objective was to discuss the biochemical and functional properties of the gluten proteins, including structure, sources, and dietary intakes. Literature was reviewed from food science and nutrition journals. The gluten protein networks vary because of different components and sizes, and variability caused by genotype, growing conditions, and technological processes. The structures and interactions of this matrix contribute to the unique properties of gluten. The resulting functions are essential to determining the dough quality of bread and other baked products. Gluten is heat stable and has the capacity to act as a binding and extending agent and is commonly used as an additive in processed foods for improved texture, moisture retention, and flavor. Gliadin contains peptide sequences that are highly resistant to gastric, pancreatic, and intestinal proteolytic digestion in the gastrointestinal tract. The average daily gluten intake in a Western diet is thought to be 5-20 g/day and has been implicated in several disorders. Gluten containing grains (wheat, rye, barley, and oats) are important staple foods. Gluten is among the most complex protein networks and plays a key role in determining the rheological dough properties. © 2017 Journal of Gastroenterology and Hepatology Foundation and John Wiley & Sons Australia, Ltd.
McClintock, Carlee S; Hettich, Robert L.
2012-01-01
Oxidative protein surface mapping has become a powerful approach for measuring the solvent accessibility of folded protein structures. A variety of techniques exist for generating the key reagent – hydroxyl radicals – for these measurements; however, these approaches range significantly in their complexity and expense of operation. This research expands upon earlier work to enhance the controllability of boron-doped diamond (BDD) electrochemistry as an easily accessible tool for producing hydroxyl radicals in order to oxidize a range of intact proteins. Efforts to modulate oxidation level while minimizing the adsorption of protein to the electrode involved the use of relatively high flow rates to reduce protein residence time inside the electrochemical flow chamber. Additionally, a different cell activation approach using variable voltage to supply a controlled current allowed us to precisely tune the extent of oxidation in a protein-dependent manner. In order to gain perspective on the level of protein adsorption onto the electrode surface, studies were conducted to monitor protein concentration during electrolysis and gauge changes in the electrode surface between cell activation events. This report demonstrates the successful use of BDD electrochemistry for greater precision in generating a target number of oxidation events upon intact proteins. PMID:23210708
Liu, Ye; Li, Nan; Zhang, Shoufeng; Zhang, Fei; Lian, Hai; Wang, Ying; Zhang, Jinxia; Hu, Rongliang
2013-12-01
The genome of Irkut virus, isolate IRKV-THChina12, the first non-rabies lyssavirus from China (of bat origin), has been completely sequenced. In general, coding and non-coding regions of this viral genome are similar to those of other lyssaviruses. However, alignment of the deduced amino acid sequences of the structural proteins of IRKV-THChina12 with those of other lyssavirus representatives revealed significant variability between viral species. The nucleoprotein and matrix protein were found to be the most conserved, followed by the large protein, glycoprotein and phosphoprotein. Differences in the antigenic sites in glycoprotein may result in only partial protection of the available rabies biologics against Irkut virus, which is of particular concern for pre- and post-exposure rabies prophylaxis. Copyright © 2013 Elsevier Inc. All rights reserved.
Accelerating the Conformational Sampling of Intrinsically Disordered Proteins.
Do, Trang Nhu; Choy, Wing-Yiu; Karttunen, Mikko
2014-11-11
Intrinsically disordered proteins (IDPs) are a class of proteins lacking a well-defined secondary structure. Instead, they are able to attain multiple conformations, bind to multiple targets, and respond to changes in their surroundings. Functionally, IDPs have been associated with molecular recognition, cell regulation, and signal transduction. The dynamic conformational ensemble of IDPs is highly environmental and binding partner dependent, rendering the characterization of IDPs extremely challenging. Here, we compare the sampling efficiencies of conventional molecular dynamics (MD), well-tempered metadynamics (WT-META), and bias-exchange metadynamics (BE-META). The total simulation time was over 10 μs, and a 20-mer peptide derived from the Neh2 domain of the Nuclear factor erythroid 2-related factor 2 (Nrf2) protein was simulated. BE-META, with a neutral replica and seven biased replicas employing a set of seven relevant collective variables (CVs), provided the most reliable and efficient sampling. Finally, we propose a free-energy reconstruction method based on the probability distribution of the secondary structure contents. This postprocessing analysis confirms the presence of not only the β-hairpin conformation of the free Neh2 peptide but also its rare bound-state-like conformation, both of that have been experimentally observed. In addition, our simulations also predict other possible conformations to be verified with future experiments.
Megger, Dominik A; Rosowski, Kristin; Radunsky, Christian; Kösters, Jutta; Sitek, Barbara; Müller, Jens
2017-04-05
Three new complexes bearing the tridentate hydrazone-based ligand 2-(2-(1-(pyridin-2-yl)ethylidene)hydrazinyl)pyridine (L) were synthesized and structurally characterized. Biological tests indicate that the Zn(ii) complex [ZnCl 2 (L)] is of low cytotoxicity against the hepatocellular carcinoma cell line HepG2. In contrast, the Cu(ii) and Mn(ii) complexes [CuCl 2 (L)] and [MnCl 2 (L)] are highly cytotoxic with EC 50 values of 1.25 ± 0.01 μM and 20 ± 1 μM, respectively. A quantitative proteome analysis reveals that treatment of the cells with the Cu(ii) complex leads to a significantly altered abundance of 102 apoptosis-related proteins, whereas 38 proteins were up- or down-regulated by the Mn(ii) complex. A closer inspection of those proteins regulated only by the Cu(ii) complex suggests that the superior cytotoxic activity of this complex is likely to be related to an initiation of the caspase-independent cell death (CICD). In addition, an increased generation of reactive oxygen species (ROS) and a strong up-regulation of proteins responsive to oxidative stress suggest that alterations of the cellular redox metabolism likely contribute to the cytotoxicity of the Cu(ii) complex.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Han, Seungil; Mistry, Anil; Chang, Jeanne S.
Proline-rich tyrosine kinase 2 (PYK2) is a cytoplasmic, non-receptor tyrosine kinase implicated in multiple signaling pathways. It is a negative regulator of osteogenesis and considered a viable drug target for osteoporosis treatment. The high-resolution structures of the human PYK2 kinase domain with different inhibitor complexes establish the conventional bilobal kinase architecture and show the conformational variability of the DFG loop. The basis for the lack of selectivity for the classical kinase inhibitor, PF-431396, within the FAK family is explained by our structural analyses. Importantly, the novel DFG-out conformation with two diarylurea inhibitors (BIRB796, PF-4618433) reveals a distinct subclass of non-receptormore » tyrosine kinases identifiable by the gatekeeper Met-502 and the unique hinge loop conformation of Leu-504. This is the first example of a leucine residue in the hinge loop that blocks the ATP binding site in the DFG-out conformation. Our structural, biophysical, and pharmacological studies suggest that the unique features of the DFG motif, including Leu-504 hinge-loop variability, can be exploited for the development of selective protein kinase inhibitors.« less
Preservation of the metaproteome: variability of protein preservation in ancient dental calculus.
Mackie, Meaghan; Hendy, Jessica; Lowe, Abigail D; Sperduti, Alessandra; Holst, Malin; Collins, Matthew J; Speller, Camilla F
2017-01-01
Proteomic analysis of dental calculus is emerging as a powerful tool for disease and dietary characterisation of archaeological populations. To better understand the variability in protein results from dental calculus, we analysed 21 samples from three Roman-period populations to compare: 1) the quantity of extracted protein; 2) the number of mass spectral queries; and 3) the number of peptide spectral matches and protein identifications. We found little correlation between the quantity of calculus analysed and total protein identifications, as well as no systematic trends between site location and protein preservation. We identified a wide range of individual variability, which may be associated with the mechanisms of calculus formation and/or post-depositional contamination, in addition to taphonomic factors. Our results suggest dental calculus is indeed a stable, long-term reservoir of proteins as previously reported, but further systematic studies are needed to identify mechanisms associated with protein entrapment and survival in dental calculus.
Sequence Analysis and Domain Motifs in the Porcine Skin Decorin Glycosaminoglycan Chain*
Zhao, Xue; Yang, Bo; Solakylidirim, Kemal; Joo, Eun Ji; Toida, Toshihiko; Higashi, Kyohei; Linhardt, Robert J.; Li, Lingyun
2013-01-01
Decorin proteoglycan is comprised of a core protein containing a single O-linked dermatan sulfate/chondroitin sulfate glycosaminoglycan (GAG) chain. Although the sequence of the decorin core protein is determined by the gene encoding its structure, the structure of its GAG chain is determined in the Golgi. The recent application of modern MS to bikunin, a far simpler chondroitin sulfate proteoglycans, suggests that it has a single or small number of defined sequences. On this basis, a similar approach to sequence the decorin of porcine skin much larger and more structurally complex dermatan sulfate/chondroitin sulfate GAG chain was undertaken. This approach resulted in information on the consistency/variability of its linkage region at the reducing end of the GAG chain, its iduronic acid-rich domain, glucuronic acid-rich domain, and non-reducing end. A general motif for the porcine skin decorin GAG chain was established. A single small decorin GAG chain was sequenced using MS/MS analysis. The data obtained in the study suggest that the decorin GAG chain has a small or a limited number of sequences. PMID:23423381
Brockmann, Eeva-Christine; Huovinen, Tuomas; Guglielmetti, Simone; Mora, Diego; Taverniti, Valentina; Arioli, Stefania; De Noni, Ivano; Lamminmäki, Urpo
2014-01-01
Single-chain variable-fragment antibodies (scFvs) have considerable potential in immunological detection and localization of bacterial surface structures. In this study, synthetic phage-displayed antibody libraries were used to select scFvs against immunologically active S-layer protein of Lactobacillus helveticus MIMLh5. After three rounds of panning, five relevant phage clones were obtained, of which four were specific for the S-layer protein of L. helveticus MIMLh5 and one was also capable of binding to the S-layer protein of L. helveticus ATCC 15009. All five anti-S-layer scFvs were expressed in Escherichia coli XL1-Blue, and their specificity profiles were characterized by Western blotting. The anti-S-layer scFv PolyH4, with the highest specificity for the S-layer protein of L. helveticus MIMLh5, was used to detect the S-layer protein in Grana Padano protected-designation-of-origin (PDO) cheese extracts by Western blotting. These results showed promising applications of this monoclonal antibody for the detection of immunomodulatory S-layer protein in dairy (and dairy-based) foods. PMID:24242242
Cerminati, Sebastián; Paoletti, Luciana; Peirú, Salvador; Menzella, Hugo G; Castelli, María Eugenia
2018-06-16
βγ-crystallin has emerged as a superfamily of structurally homologous proteins with representatives across all domains of life. A major portion of this superfamily is constituted by microbial members. This superfamily has also been recognized as a novel group of Ca 2+ -binding proteins with a large diversity and variable properties in Ca 2+ binding and stability. We have recently described a new phosphatidylinositol phospholipase C from Lysinibacillus sphaericus (LS-PIPLC) which was shown to efficiently remove phosphatidylinositol from crude vegetable oil. Here, the role of the C-terminal βγ-crystallin domain of LS-PIPLC was analyzed in the context of the whole protein. A truncated protein in which the C-terminal βγ-crystallin domain was deleted (LS-PIPLC ΔCRY ) is catalytically as efficient as the full-length protein (LS-PIPLC). However, the thermal and chemical stability of LS-PIPLC ΔCRY are highly affected, demonstrating a stabilizing role for this domain. It is also shown that the presence of Ca 2+ increases the thermal and chemical stability of the protein both in aqueous media and in oil, making LS-PIPLC an excellent candidate for use in industrial soybean oil degumming.
Staphylococcal Immune Evasion Proteins: Structure, Function, and Host Adaptation.
Koymans, Kirsten J; Vrieling, Manouk; Gorham, Ronald D; van Strijp, Jos A G
2017-01-01
Staphylococcus aureus is a successful human and animal pathogen. Its pathogenicity is linked to its ability to secrete a large amount of virulence factors. These secreted proteins interfere with many critical components of the immune system, both innate and adaptive, and hamper proper immune functioning. In recent years, numerous studies have been conducted in order to understand the molecular mechanism underlying the interaction of evasion molecules with the host immune system. Structural studies have fundamentally contributed to our understanding of the mechanisms of action of the individual factors. Furthermore, such studies revealed one of the most striking characteristics of the secreted immune evasion molecules: their conserved structure. Despite high-sequence variability, most immune evasion molecules belong to a small number of structural categories. Another remarkable characteristic is that S. aureus carries most of these virulence factors on mobile genetic elements (MGE) or ex-MGE in its accessory genome. Coevolution of pathogen and host has resulted in immune evasion molecules with a highly host-specific function and prevalence. In this review, we explore how these shared structures and genomic locations relate to function and host specificity. This is discussed in the context of therapeutic options for these immune evasion molecules in infectious as well as in inflammatory diseases.
Structural insights into binding of small molecule inhibitors to Enhancer of Zeste Homolog 2
NASA Astrophysics Data System (ADS)
Kalinić, Marko; Zloh, Mire; Erić, Slavica
2014-11-01
Enhancer of Zeste Homolog 2 (EZH2) is a SET domain protein lysine methyltransferase (PKMT) which has recently emerged as a chemically tractable and therapeutically promising epigenetic target, evidenced by the discovery and characterization of potent and highly selective EZH2 inhibitors. However, no experimental structures of the inhibitors co-crystallized to EZH2 have been resolved, and the structural basis for their activity and selectivity remains unknown. Considering the need to minimize cross-reactivity between prospective PKMT inhibitors, much can be learned from understanding the molecular basis for selective inhibition of EZH2. Thus, to elucidate the binding of small-molecule inhibitors to EZH2, we have developed a model of its fully-formed cofactor binding site and used it to carry out molecular dynamics simulations of protein-ligand complexes, followed by molecular mechanics/generalized born surface area calculations. The obtained results are in good agreement with biochemical inhibition data and reflect the structure-activity relationships of known ligands. Our findings suggest that the variable and flexible post-SET domain plays an important role in inhibitor binding, allowing possibly distinct binding modes of inhibitors with only small variations in their structure. Insights from this study present a good basis for design of novel and optimization of existing compounds targeting the cofactor binding site of EZH2.
Rossi, Omar; Maggiore, Luana; Necchi, Francesca; Koeberling, Oliver; MacLennan, Calman A; Saul, Allan; Gerke, Christiane
2015-01-01
Genetically induced outer membrane particles from Gram-negative bacteria, called Generalized Modules for Membrane Antigens (GMMA), are being investigated as vaccines. Rapid methods are required for estimating the protein content for in-process assays during production. Since GMMA are complex biological structures containing lipid and polysaccharide as well as protein, protein determinations are not necessarily straightforward. We compared protein quantification by Bradford, Lowry, and Non-Interfering assays using bovine serum albumin (BSA) as standard with quantitative amino acid (AA) analysis, the most accurate currently available method for protein quantification. The Lowry assay has the lowest inter- and intra-assay variation and gives the best linearity between protein amount and absorbance. In all three assays, the color yield (optical density per mass of protein) of GMMA was markedly different from that of BSA with a ratio of approximately 4 for the Bradford assay, and highly variable between different GMMA; and approximately 0.7 for the Lowry and Non-Interfering assays, highlighting the need for calibrating the standard used in the colorimetric assay against GMMA quantified by AA analysis. In terms of a combination of ease, reproducibility, and proportionality of protein measurement, and comparability between samples, the Lowry assay was superior to Bradford and Non-Interfering assays for GMMA quantification.
Determining Membrane Protein-Lipid Binding Thermodynamics Using Native Mass Spectrometry.
Cong, Xiao; Liu, Yang; Liu, Wen; Liang, Xiaowen; Russell, David H; Laganowsky, Arthur
2016-04-06
Membrane proteins are embedded in the biological membrane where the chemically diverse lipid environment can modulate their structure and function. However, the thermodynamics governing the molecular recognition and interaction of lipids with membrane proteins is poorly understood. Here, we report a method using native mass spectrometry (MS), to determine thermodynamics of individual ligand binding events to proteins. Unlike conventional methods, native MS can resolve individual ligand binding events and, coupled with an apparatus to control the temperature, determine binding thermodynamic parameters, such as for protein-lipid interactions. We validated our approach using three soluble protein-ligand systems (maltose binding protein, lysozyme, and nitrogen regulatory protein) and obtained similar results to those using isothermal titration calorimetry and surface plasmon resonance. We also determined for the first time the thermodynamics of individual lipid binding to the ammonia channel (AmtB), an integral membrane protein from Escherichia coli. Remarkably, we observed distinct thermodynamic signatures for the binding of different lipids and entropy-enthalpy compensation for binding lipids of variable chain length. Additionally, using a mutant form of AmtB that abolishes a specific phosphatidylglycerol (PG) binding site, we observed distinct changes in the thermodynamic signatures for binding PG, implying these signatures can identify key residues involved in specific lipid binding and potentially differentiate between specific lipid binding sites.
A structural view of egg coat architecture and function in fertilization.
Monné, Magnus; Jovine, Luca
2011-10-01
Species-restricted interaction between gametes at the beginning of fertilization is mediated by the extracellular coat of the egg, a matrix of cross-linked glycoprotein filaments called the zona pellucida (ZP) in mammals and the vitelline envelope in nonmammals. All egg coat subunits contain a conserved protein-protein interaction module-the "ZP domain"-that allows them to polymerize upon dissociation of a C-terminal propeptide containing an external hydrophobic patch (EHP). Recently, the first crystal structures of a ZP domain protein, sperm receptor ZP subunit zona pellucida glycoprotein 3 (ZP3), have been reported, giving a glimpse of the structural organization of the ZP at the atomic level and the molecular basis of gamete recognition in vertebrates. The ZP module is divided in two related immunoglobulin-like domains, ZP-N and ZP-C, that contain characteristic disulfide bond patterns and, in the case of ZP-C, also incorporate the EHP. This segment lies at the interface between the two domains, which are connected by a long loop carrying a conserved O-glycan important for binding to sperm in vitro. The structures explain several apparently contradictory observations by reconciling the variable disulfide bond patterns found in different homologues of ZP3 as well as the multiple ZP3 determinants alternatively involved in gamete interaction. These findings have implications for our understanding of ZP subunit biogenesis; egg coat assembly, architecture, and interaction with sperm; structural rearrangements leading to postfertilization hardening of the ZP and the block to sperm binding; and the evolutionary origin of egg coats.
Structural re-alignment in an immunologic surface region of ricin A chain
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zemla, A T; Zhou, C E
2007-07-24
We compared structure alignments generated by several protein structure comparison programs to determine whether existing methods would satisfactorily align residues at a highly conserved position within an immunogenic loop in ribosome inactivating proteins (RIPs). Using default settings, structure alignments generated by several programs (CE, DaliLite, FATCAT, LGA, MAMMOTH, MATRAS, SHEBA, SSM) failed to align the respective conserved residues, although LGA reported correct residue-residue (R-R) correspondences when the beta-carbon (Cb) position was used as the point of reference in the alignment calculations. Further tests using variable points of reference indicated that points distal from the beta carbon along a vector connectingmore » the alpha and beta carbons yielded rigid structural alignments in which residues known to be highly conserved in RIPs were reported as corresponding residues in structural comparisons between ricin A chain, abrin-A, and other RIPs. Results suggest that approaches to structure alignment employing alternate point representations corresponding to side chain position may yield structure alignments that are more consistent with observed conservation of functional surface residues than do standard alignment programs, which apply uniform criteria for alignment (i.e., alpha carbon (Ca) as point of reference) along the entirety of the peptide chain. We present the results of tests that suggest the utility of allowing user-specified points of reference in generating alternate structural alignments, and we present a web server for automatically generating such alignments.« less
Automated Assignment of MS/MS Cleavable Cross-Links in Protein 3D-Structure Analysis
NASA Astrophysics Data System (ADS)
Götze, Michael; Pettelkau, Jens; Fritzsche, Romy; Ihling, Christian H.; Schäfer, Mathias; Sinz, Andrea
2015-01-01
CID-MS/MS cleavable cross-linkers hold an enormous potential for an automated analysis of cross-linked products, which is essential for conducting structural proteomics studies. The created characteristic fragment ion patterns can easily be used for an automated assignment and discrimination of cross-linked products. To date, there are only a few software solutions available that make use of these properties, but none allows for an automated analysis of cleavable cross-linked products. The MeroX software fills this gap and presents a powerful tool for protein 3D-structure analysis in combination with MS/MS cleavable cross-linkers. We show that MeroX allows an automatic screening of characteristic fragment ions, considering static and variable peptide modifications, and effectively scores different types of cross-links. No manual input is required for a correct assignment of cross-links and false discovery rates are calculated. The self-explanatory graphical user interface of MeroX provides easy access for an automated cross-link search platform that is compatible with commonly used data file formats, enabling analysis of data originating from different instruments. The combination of an MS/MS cleavable cross-linker with a dedicated software tool for data analysis provides an automated workflow for 3D-structure analysis of proteins. MeroX is available at
Structure of a Pheromone Receptor-Associated Mhc Molecule With An Open And Empty Groove
DOE Office of Scientific and Technical Information (OSTI.GOV)
Olson, R.; Huey-Tubman, K.E.; Dulac, C.
2006-10-06
Neurons in the murine vomeronasal organ (VNO) express a family of class Ib major histocompatibility complex (MHC) proteins (M10s) that interact with the V2R class of VNO receptors. This interaction may play a direct role in the detection of pheromonal cues that initiate reproductive and territorial behaviors. The crystal structure of M10.5, an M10 family member, is similar to that of classical MHC molecules. However, the M10.5 counterpart of the MHC peptide-binding groove is open and unoccupied, revealing the first structure of an empty class I MHC molecule. Similar to empty MHC molecules, but unlike peptide-filled MHC proteins and non-peptide-bindingmore » MHC homologs, M10.5 is thermally unstable, suggesting that its groove is normally occupied. However, M10.5 does not bind endogenous peptides when expressed in mammalian cells or when offered a mixture of class I-binding peptides. The F pocket side of the M10.5 groove is open, suggesting that ligands larger than 8-10-mer class I-binding peptides could fit by extending out of the groove. Moreover, variable residues point up from the groove helices, rather than toward the groove as in classical MHC structures. These data suggest that M10s are unlikely to provide specific recognition of class I MHC-binding peptides, but are consistent with binding to other ligands, including proteins such as the V2Rs.« less
Horst, Reto; Wüthrich, Kurt
2015-07-20
Reconstitution of integral membrane proteins (IMP) in aqueous solutions of detergent micelles has been extensively used in structural biology, using either X-ray crystallography or NMR in solution. Further progress could be achieved by establishing a rational basis for the selection of detergent and buffer conditions, since the stringent bottleneck that slows down the structural biology of IMPs is the preparation of diffracting crystals or concentrated solutions of stable isotope labeled IMPs. Here, we describe procedures to monitor the quality of aqueous solutions of [ 2 H, 15 N]-labeled IMPs reconstituted in detergent micelles. This approach has been developed for studies of β-barrel IMPs, where it was successfully applied for numerous NMR structure determinations, and it has also been adapted for use with α-helical IMPs, in particular GPCRs, in guiding crystallization trials and optimizing samples for NMR studies (Horst et al ., 2013). 2D [ 15 N, 1 H]-correlation maps are used as "fingerprints" to assess the foldedness of the IMP in solution. For promising samples, these "inexpensive" data are then supplemented with measurements of the translational and rotational diffusion coefficients, which give information on the shape and size of the IMP/detergent mixed micelles. Using microcoil equipment for these NMR experiments enables data collection with only micrograms of protein and detergent. This makes serial screens of variable solution conditions viable, enabling the optimization of parameters such as the detergent concentration, sample temperature, pH and the composition of the buffer.
Moreno, I M; Malpica, J M; Díaz-Pendón, J A; Moriones, E; Fraile, A; García-Arenal, F
2004-01-05
The genetic structure of the population of Watermelon mosaic virus (WMV) in Spain was analysed by the biological and molecular characterisation of isolates sampled from its main host plant, melon. The population was a highly homogeneous one, built of a single pathotype, and comprising isolates closely related genetically. There was indication of temporal replacement of genotypes, but not of spatial structure of the population. Analyses of nucleotide sequences in three genomic regions, that is, in the cistrons for the P1, cylindrical inclusion (CI) and capsid (CP) proteins, showed lower similar values of nucleotide diversity for the P1 than for the CI or CP cistrons. The CI protein and the CP were under tighter evolutionary constraints than the P1 protein. Also, for the CI and CP cistrons, but not for the P1 cistron, two groups of sequences, defining two genetic strains, were apparent. Thus, different genomic regions of WMV show different evolutionary dynamics. Interestingly, for the CI and CP cistrons, sequences were clustered into two regions of the sequence space, defining the two strains above, and no intermediary sequences were identified. Recombinant isolates were found, accounting for at least 7% of the population. These recombinants presented two interesting features: (i) crossover points were detected between the analysed regions in the CI and CP cistrons, but not between those in the P1 and CI cistrons, (ii) crossover points were not observed within the analysed coding regions for the P1, CI or CP proteins. This indicates strong selection against isolates with recombinant proteins, even when originated from closely related strains. Hence, data indicate that genotypes of WMV, generated by mutation or recombination, outside of acceptable, discrete, regions in the evolutionary space, are eliminated from the virus population by negative selection.
Patterns of IgE responses to multiple allergen components and clinical symptoms at age 11 years
Simpson, Angela; Lazic, Nevena; Belgrave, Danielle C.M.; Johnson, Phil; Bishop, Christopher; Mills, Clare; Custovic, Adnan
2015-01-01
Background The relationship between sensitization to allergens and disease is complex. Objective We sought to identify patterns of response to a broad range of allergen components and investigate associations with asthma, eczema, and hay fever. Methods Serum specific IgE levels to 112 allergen components were measured by using a multiplex array (Immuno Solid-phase Allergen Chip) in a population-based birth cohort. Latent variable modeling was used to identify underlying patterns of component-specific IgE responses; these patterns were then related to asthma, eczema, and hay fever. Results Two hundred twenty-one of 461 children had IgE to 1 or more components. Seventy-one of the 112 components were recognized by 3 or more children. By using latent variable modeling, 61 allergen components clustered into 3 component groups (CG1, CG2, and CG3); protein families within each CG were exclusive to that group. CG1 comprised 27 components from 8 plant protein families. CG2 comprised 7 components of mite allergens from 3 protein families. CG3 included 27 components of plant, animal, and fungal origin from 12 protein families. Each CG included components from different biological sources with structural homology and also nonhomologous proteins arising from the same biological source. Sensitization to CG3 was most strongly associated with asthma (odds ratio [OR], 8.20; 95% CI, 3.49-19.24; P < .001) and lower FEV1 (P < .001). Sensitization to CG1 was associated with hay fever (OR, 12.79; 95% CI, 6.84-23.90; P < .001). Sensitization to CG2 was associated with both asthma (OR, 3.60; 95% CI, 2.05-6.29) and hay fever (OR, 2.52; 95% CI, 1.38-4.61). Conclusions Latent variable modeling with a large number of allergen components identified 3 patterns of IgE responses, each including different protein families. In 11-year-old children the pattern of response to components of multiple allergens appeared to be associated with current asthma and hay fever but not eczema. PMID:25935108
Gobin-Limballe, Stéphanie; McAndrew, Ryan P; Djouadi, Fatima; Kim, Jung-Ja; Bastin, Jean
2010-05-01
Very-Long-Chain Acyl-CoA Dehydrogenase deficiency (VLCADD) is an autosomal recessive disorder considered as one of the more common ss-oxidation defects, possibly associated with neonatal cardiomyopathy, infantile hepatic coma, or adult-onset myopathy. Numerous gene missense mutations have been described in these VLCADD phenotypes, but only few of them have been structurally and functionally analyzed, and the molecular basis of disease variability is still poorly understood. To address this question, we first analyzed fourteen disease-causing amino acid changes using the recently described crystal structure of VLCAD. The predicted effects varied from the replacement of amino acid residues lining the substrate binding cavity, involved in holoenzyme-FAD interactions or in enzyme dimerisation, predicted to have severe functional consequences, up to amino acid substitutions outside key enzyme domains or lying on near enzyme surface, with predicted milder consequences. These data were combined with functional analysis of residual fatty acid oxidation (FAO) and VLCAD protein levels in patient cells harboring these mutations, before and after pharmacological stimulation by bezafibrate. Mutations identified as detrimental to the protein structure in the 3-D model were generally associated to profound FAO and VLCAD protein deficiencies in the patient cells, however, some mutations affecting FAD binding or monomer-monomer interactions allowed a partial response to bezafibrate. On the other hand, bezafibrate restored near-normal FAO rates in some mutations predicted to have milder consequences on enzyme structure. Overall, combination of structural, biochemical, and pharmacological analysis allowed assessment of the relative severity of individual mutations, with possible applications for disease management and therapeutic approach. Copyright 2010 Elsevier B.V. All rights reserved.
Frequency Response of a Protein to Local Conformational Perturbations
Eren, Dilek; Alakent, Burak
2013-01-01
Signals created by local perturbations are known to propagate long distances through proteins via backbone connectivity and nonbonded interactions. In the current study, signal propagation from the flexible ligand binding loop to the rest of Protein Tyrosine Phosphatase 1B (PTP1B) was investigated using frequency response techniques. Using restrained Targeted Molecular Dynamics (TMD) potential on WPD and R loops, PTP1B was driven between its crystal structure conformations at different frequencies. Propagation of the local perturbation signal was manifested via peaks at the fundamental frequency and upper harmonics of 1/f distributed spectral density of atomic variables, such as Cα atoms, dihedral angles, or polar interaction distances. Frequency of perturbation was adjusted high enough (simulation length >∼10×period of a perturbation cycle) not to be clouded by random diffusional fluctuations, and low enough (<∼0.8 ns−1) not to attenuate the propagating signal and enhance the contribution of the side-chains to the dissipation of the signals. Employing Discrete Fourier Transform (DFT) to TMD simulation trajectories of 16 cycles of conformational transitions at periods of 1.2 to 5 ns yielded Cα displacements consistent with those obtained from crystal structures. Identification of the perturbed atomic variables by statistical t-tests on log-log scale spectral densities revealed the extent of signal propagation in PTP1B, while phase angles of the filtered trajectories at the fundamental frequency were used to cluster collectively fluctuating elements. Hydrophobic interactions were found to have a higher contribution to signal transduction between side-chains compared to the role of polar interactions. Most of in-phase fluctuating residues on the signaling pathway were found to have high identity among PTP domains, and located over a wide region of PTP1B including the allosteric site. Due to its simplicity and efficiency, the suggested technique may find wide applications in identification of signaling pathways of different proteins. PMID:24086121
Kangur, Liina; Timpmann, Kõu; Freiberg, Arvi
2008-07-03
The bacteriochlorophyll a-containing LH2 and LH3 antenna complexes are the integral membrane proteins that catalyze the photosynthetic process in purple photosynthetic bacteria. The LH2 complex from Rhodobacter sphaeroides shows characteristic strong absorbance at 800 and 850 nm due to the pigment molecules confined in two separate areas of the protein. In the LH3 complex from Rhodopesudomonas acidophila the corresponding bands peak at 800 and 820 nm. Using the bacteriochlorophyll a cofactors as intrinsic probes to monitor local changes in the protein structure, we investigate spectral responses of the antenna complexes to very high hydrostatic pressures up to 2.5 GPa when embedded into natural membrane environment or extracted with detergent. We first demonstrate that high pressure does induce significant alterations to the tertiary structure of the proteins not only in proximity of the 800 nm-absorbing bacteriochlorophyll a molecules known previously (Gall, A.; et al. Biochemistry 2003, 42, 13019) but also of the 850 nm- and 820 nm-absorbing molecules, including breakage of the hydrogen bond they are involved in. The membrane-protected complexes appear more resilient to damaging effects of the compression compared with the complexes extracted into mixed detergent-buffer environment. Increased resistance of the isolated complexes is observed at high protein concentration resulting aggregation as well as when cosolvent (glycerol) is added into the solution. These stability variations correlate with ability of penetration of the surrounding polar solvent (water) into the hydrophobic protein interiors, being thus the principal reason of the pressure-induced denaturation of the proteins. Considerable variability of elastic properties of the isolated complexes was also observed, tentatively assigned to heterogeneous protein packing in detergent micelles. While a number of the isolated complexes release most of their bacteriochlorophyll a content under high pressure, quite some of them remain apparently intact. The pigmented photosynthetic antenna complexes thus constitute a suitable model system for studying in detail the stability of integral membrane proteins.
Gagaoua, Mohammed; Terlouw, E M Claudia; Micol, Didier; Boudjellal, Abdelghani; Hocquette, Jean-François; Picard, Brigitte
2015-08-05
Many studies on color biochemistry and protein biomarkers were undertaken in post-mortem beef muscles after ≥24 hours. The present study was conducted on Longissimus thoracis muscles of 21 Blond d'Aquitaine young bulls to evaluate the relationships between protein biomarkers present during the early post-mortem and known to be related to tenderness and pH decline and color development. pH values at 45 min, 3 h, and 30 h post-mortem were correlated with three, seven, and six biomarkers, respectively. L*a*b* color coordinates 24 h post-mortem were correlated with nine, five, and eight protein biomarkers, respectively. Regression models included Hsp proteins and explained between 47 and 59% of the variability between individuals in pH and between 47 and 65% of the variability in L*a*b* color coordinates. Proteins correlated with pH and/or color coordinates were involved in apoptosis or had antioxidative or chaperone activities. The main results include the negative correlations between pH45 min, pH3 h, and pHu and Prdx6, which may be explained by the antioxidative and phospholipase activities of this biomarker. Similarly, inducible Hsp70-1A/B and μ-calpain were correlated with L*a*b* coordinates, due to the protective action of Hsp70-1A/B on the proteolytic activities of μ-calpain on structural proteins. Correlations existed further between MDH1, ENO3, and LDH-B and pH decline and color stability probably due to the involvement of these enzymes in the glycolytic pathway and, thus, the energy status of the cell. The present results show that research using protein indicators may increase the understanding of early post-mortem biological mechanisms involved in pH and beef color development.
Richards, Mark W; O'Regan, Laura; Roth, Daniel; Montgomery, Jessica M; Straube, Anne; Fry, Andrew M; Bayliss, Richard
2015-05-01
Proteins of the echinoderm microtubule (MT)-associated protein (EMAP)-like (EML) family contribute to formation of the mitotic spindle and interphase MT network. EML1-4 consist of Trp-Asp 40 (WD40) repeats and an N-terminal region containing a putative coiled-coil. Recurrent gene rearrangements in non-small cell lung cancer (NSCLC) fuse EML4 to anaplastic lymphoma kinase (ALK) causing expression of several oncogenic fusion variants. The fusions have constitutive ALK activity due to self-association through the EML4 coiled-coil. We have determined crystal structures of the coiled-coils from EML2 and EML4, which describe the structural basis of both EML self-association and oncogenic EML4-ALK activation. The structures reveal a trimeric oligomerization state directed by a conserved pattern of hydrophobic residues and salt bridges. We show that the trimerization domain (TD) of EML1 is necessary and sufficient for self-association. The TD is also essential for MT binding; however, this property requires an adjacent basic region. These observations prompted us to investigate MT association of EML4-ALK and EML1-ABL1 (Abelson 1) fusions in which variable portions of the EML component are present. Uniquely, EML4-ALK variant 3, which includes the TD and basic region of EML4 but none of the WD40 repeats, was localized to MTs, both when expressed recombinantly and when expressed in a patient-derived NSCLC cell line (H2228). This raises the question of whether the mislocalization of ALK activity to MTs might influence downstream signalling and malignant properties of cells. Furthermore, the structure of EML4 TD may enable the development of protein-protein interaction inhibitors targeting the trimerization interface, providing a possible avenue towards therapeutic intervention in EML4-ALK NSCLC.
Neumann, Wilma; Hadley, Rose C.; Nolan, Elizabeth M.
2017-01-01
Transition metals are essential nutrients for all organisms and important players in the host-microbe interaction. During bacterial infection, a tug-of-war between the host and microbe for nutrient metals occurs: the host innate immune system responds to the pathogen by reducing metal availability and the pathogen tries to outmaneuver this response. The outcome of this competition, which involves metal-sequestering host-defense proteins and microbial metal acquisition machinery, is an important variable for whether infection occurs. One strategy bacterial pathogens employ to overcome metal restriction involves hijacking abundant host metalloproteins. The obligate human pathogens Neisseria spp. express TonB-dependent transport systems that capture human metalloproteins, extract the bound metal ions, and deliver these nutrients into the bacterial cell. This Essay highlights structural and mechanistic investigations that provide insights into how Neisseria acquire iron from the Fe(III)-transport protein transferrin, the Fe(III)-chelating host-defense protein lactoferrin, and the oxygen-transport protein hemoglobin, and obtain zinc from the metal-sequestering antimicrobial protein calprotectin. PMID:28487398
Taghizadeh, Mohammad; Goliaei, Bahram; Madadkar-Sobhani, Armin
2016-01-01
Background Protein flexibility, which has been referred as a dynamic behavior has various roles in proteins’ functions. Furthermore, for some developed tools in bioinformatics, such as protein-protein docking software, considering the protein flexibility, causes a higher degree of accuracy. Through undertaking the present work, we have accomplished the quantification plus analysis of the variations in the human Cyclin Dependent Kinase 2 (hCDK2) protein flexibility without affecting a significant change in its initial environment or the protein per se. Objectives The main goal of the present research was to calculate variations in the flexibility for each residue of the hCDK2, analysis of their flexibility variations through clustering, and to investigate the functional aspects of the residues with high flexibility variations. Materials and Methods Using Gromacs package (version 4.5.4), three independent molecular dynamics (MD) simulations of the hCDK2 protein (PDB ID: 1HCL) was accomplished with no significant changes in their initial environments, structures, or conformations, followed by Root Mean Square Fluctuations (RMSF) calculation of these MD trajectories. The amount of variations in these three curves of RMSF was calculated using two formulas. Results More than 50% of the variation in the flexibility (the distance between the maximum and the minimum amount of the RMSF) was found at the region of Val-154. As well, there are other major flexibility fluctuations in other residues. These residues were mostly positioned in the vicinity of the functional residues. The subsequent works were done, as followed by clustering all hCDK2 residues into four groups considering the amount of their variability with respect to flexibility and their position in the RMSF curves. Conclusions This work has introduced a new class of flexibility aspect of the proteins’ residues. It could also help designing and engineering proteins, with introducing a new dynamic aspect of hCDK2, and accordingly, for the other similar globular proteins. In addition, it could provide a better computational calculation of the protein flexibility, which is, especially important in the comparative studies of the proteins’ flexibility. PMID:28959320
Tapia, Lorena I; Shaw, Chad A; Aideyan, Letisha O; Jewell, Alan M; Dawson, Brian C; Haq, Taha R; Piedra, Pedro A
2014-01-01
Human respiratory syncytial virus (HRSV) has three surface glycoproteins: small hydrophobic (SH), attachment (G) and fusion (F), encoded by three consecutive genes (SH-G-F). A 270-nt fragment of the G gene is used to genotype HRSV isolates. This study genotyped and investigated the variability of the gene and amino acid sequences of the three surface proteins of HRSV strains collected from 1987 to 2005 from one center. Sixty original clinical isolates and 5 prototype strains were analyzed. Sequences containing SH, F and G genes were generated, and multiple alignments and phylogenetic trees were analyzed. Genetic variability by protein domains comparing virus genotypes was assessed. Complete sequences of the SH-G-F genes were obtained for all 65 samples: HRSV-A = 35; HRSV-B = 30. In group A strains, genotypes GA5 and GA2 were predominant. For HRSV-B strains, the genotype GB4 was predominant from 1992 to 1994 and only genotype BA viruses were detected in 2004-2005. Different genetic variability at nucleotide level was detected between the genes, with G gene being the most variable and the highest variability detected in the 270-nt G fragment that is frequently used to genotype the virus. High variability (>10%) was also detected in the signal peptide and transmembrane domains of the F gene of HRSV A strains. Variability among the HRSV strains resulting in non-synonymous changes was detected in hypervariable domains of G protein, the signal peptide of the F protein, a not previously defined domain in the F protein, and the antigenic site Ø in the pre-fusion F. Divergent trends were observed between HRSV -A and -B groups for some functional domains. A diverse population of HRSV -A and -B genotypes circulated in Houston during an 18 year period. We hypothesize that diverse sequence variation of the surface protein genes provide HRSV strains a survival advantage in a partially immune-protected community.
Tapia, Lorena I.; Shaw, Chad A.; Aideyan, Letisha O.; Jewell, Alan M.; Dawson, Brian C.; Haq, Taha R.; Piedra, Pedro A.
2014-01-01
Human respiratory syncytial virus (HRSV) has three surface glycoproteins: small hydrophobic (SH), attachment (G) and fusion (F), encoded by three consecutive genes (SH-G-F). A 270-nt fragment of the G gene is used to genotype HRSV isolates. This study genotyped and investigated the variability of the gene and amino acid sequences of the three surface proteins of HRSV strains collected from 1987 to 2005 from one center. Sixty original clinical isolates and 5 prototype strains were analyzed. Sequences containing SH, F and G genes were generated, and multiple alignments and phylogenetic trees were analyzed. Genetic variability by protein domains comparing virus genotypes was assessed. Complete sequences of the SH-G-F genes were obtained for all 65 samples: HRSV-A = 35; HRSV-B = 30. In group A strains, genotypes GA5 and GA2 were predominant. For HRSV-B strains, the genotype GB4 was predominant from 1992 to 1994 and only genotype BA viruses were detected in 2004–2005. Different genetic variability at nucleotide level was detected between the genes, with G gene being the most variable and the highest variability detected in the 270-nt G fragment that is frequently used to genotype the virus. High variability (>10%) was also detected in the signal peptide and transmembrane domains of the F gene of HRSV A strains. Variability among the HRSV strains resulting in non-synonymous changes was detected in hypervariable domains of G protein, the signal peptide of the F protein, a not previously defined domain in the F protein, and the antigenic site Ø in the pre-fusion F. Divergent trends were observed between HRSV -A and -B groups for some functional domains. A diverse population of HRSV -A and -B genotypes circulated in Houston during an 18 year period. We hypothesize that diverse sequence variation of the surface protein genes provide HRSV strains a survival advantage in a partially immune-protected community. PMID:24625544
Motion of variable-length MreB filaments at the bacterial cell membrane influences cell morphology
Reimold, Christian; Defeu Soufo, Herve Joel; Dempwolff, Felix; Graumann, Peter L.
2013-01-01
The maintenance of rod-cell shape in many bacteria depends on actin-like MreB proteins and several membrane proteins that interact with MreB. Using superresolution microscopy, we show that at 50-nm resolution, Bacillus subtilis MreB forms filamentous structures of length up to 3.4 μm underneath the cell membrane, which run at angles diverging up to 40° relative to the cell circumference. MreB from Escherichia coli forms at least 1.4-μm-long filaments. MreB filaments move along various tracks with a maximal speed of 85 nm/s, and the loss of ATPase activity leads to the formation of extended and static filaments. Suboptimal growth conditions lead to formation of patch-like structures rather than extended filaments. Coexpression of wild-type MreB with MreB mutated in the subunit interface leads to formation of shorter MreB filaments and a strong effect on cell shape, revealing a link between filament length and cell morphology. Thus MreB has an extended-filament architecture with the potential to position membrane proteins over long distances, whose localization in turn may affect the shape of the cell wall. PMID:23783036
Motion of variable-length MreB filaments at the bacterial cell membrane influences cell morphology.
Reimold, Christian; Defeu Soufo, Herve Joel; Dempwolff, Felix; Graumann, Peter L
2013-08-01
The maintenance of rod-cell shape in many bacteria depends on actin-like MreB proteins and several membrane proteins that interact with MreB. Using superresolution microscopy, we show that at 50-nm resolution, Bacillus subtilis MreB forms filamentous structures of length up to 3.4 μm underneath the cell membrane, which run at angles diverging up to 40° relative to the cell circumference. MreB from Escherichia coli forms at least 1.4-μm-long filaments. MreB filaments move along various tracks with a maximal speed of 85 nm/s, and the loss of ATPase activity leads to the formation of extended and static filaments. Suboptimal growth conditions lead to formation of patch-like structures rather than extended filaments. Coexpression of wild-type MreB with MreB mutated in the subunit interface leads to formation of shorter MreB filaments and a strong effect on cell shape, revealing a link between filament length and cell morphology. Thus MreB has an extended-filament architecture with the potential to position membrane proteins over long distances, whose localization in turn may affect the shape of the cell wall.
NASA Astrophysics Data System (ADS)
Sicard, François; Senet, Patrick
2013-06-01
Well-Tempered Metadynamics (WTmetaD) is an efficient method to enhance the reconstruction of the free-energy surface of proteins. WTmetaD guarantees a faster convergence in the long time limit in comparison with the standard metadynamics. It still suffers, however, from the same limitation, i.e., the non-trivial choice of pertinent collective variables (CVs). To circumvent this problem, we couple WTmetaD with a set of CVs generated from a dihedral Principal Component Analysis (dPCA) on the Ramachandran dihedral angles describing the backbone structure of the protein. The dPCA provides a generic method to extract relevant CVs built from internal coordinates, and does not depend on the alignment to an arbitrarily chosen reference structure as usual in Cartesian PCA. We illustrate the robustness of this method in the case of a reference model protein, the small and very diffusive Met-enkephalin pentapeptide. We propose a justification a posteriori of the considered number of CVs necessary to bias the metadynamics simulation in terms of the one-dimensional free-energy profiles associated with Ramachandran dihedral angles along the amino-acid sequence.
Sicard, François; Senet, Patrick
2013-06-21
Well-Tempered Metadynamics (WTmetaD) is an efficient method to enhance the reconstruction of the free-energy surface of proteins. WTmetaD guarantees a faster convergence in the long time limit in comparison with the standard metadynamics. It still suffers, however, from the same limitation, i.e., the non-trivial choice of pertinent collective variables (CVs). To circumvent this problem, we couple WTmetaD with a set of CVs generated from a dihedral Principal Component Analysis (dPCA) on the Ramachandran dihedral angles describing the backbone structure of the protein. The dPCA provides a generic method to extract relevant CVs built from internal coordinates, and does not depend on the alignment to an arbitrarily chosen reference structure as usual in Cartesian PCA. We illustrate the robustness of this method in the case of a reference model protein, the small and very diffusive Met-enkephalin pentapeptide. We propose a justification a posteriori of the considered number of CVs necessary to bias the metadynamics simulation in terms of the one-dimensional free-energy profiles associated with Ramachandran dihedral angles along the amino-acid sequence.
Paz, S. Alexis; Vanden-Eijnden, Eric
2017-01-01
We study the thermodynamic stability of the native state of the human prion protein using a new free-energy method, replica-exchange on-the-fly parameterization. This method is designed to overcome hidden-variable sampling limitations to yield nearly error-free free-energy profiles along a conformational coordinate. We confirm that all four (M129V, D178N) polymorphs have a ground-state conformation with three intact β-sheet hydrogen bonds. Additionally, they are observed to have distinct metastabilities determined by the side-chain at position 129. We rationalize these findings with reference to the prion “strain” hypothesis, which links the variety of transmissible spongiform encephalopathy phenotypes to conformationally distinct infectious prion forms and classifies distinct phenotypes of sporadic Creutzfeldt-Jakob disease based solely on the 129 polymorphism. Because such metastable structures are not easily observed in structural experiments, our approach could potentially provide new insights into the conformational origins of prion diseases and other pathologies arising from protein misfolding and aggregation. PMID:28451263
Determination of ensemble-average pairwise root mean-square deviation from experimental B-factors.
Kuzmanic, Antonija; Zagrovic, Bojan
2010-03-03
Root mean-square deviation (RMSD) after roto-translational least-squares fitting is a measure of global structural similarity of macromolecules used commonly. On the other hand, experimental x-ray B-factors are used frequently to study local structural heterogeneity and dynamics in macromolecules by providing direct information about root mean-square fluctuations (RMSF) that can also be calculated from molecular dynamics simulations. We provide a mathematical derivation showing that, given a set of conservative assumptions, a root mean-square ensemble-average of an all-against-all distribution of pairwise RMSD for a single molecular species,
Distribution of genotype network sizes in sequence-to-structure genotype-phenotype maps.
Manrubia, Susanna; Cuesta, José A
2017-04-01
An essential quantity to ensure evolvability of populations is the navigability of the genotype space. Navigability, understood as the ease with which alternative phenotypes are reached, relies on the existence of sufficiently large and mutually attainable genotype networks. The size of genotype networks (e.g. the number of RNA sequences folding into a particular secondary structure or the number of DNA sequences coding for the same protein structure) is astronomically large in all functional molecules investigated: an exhaustive experimental or computational study of all RNA folds or all protein structures becomes impossible even for moderately long sequences. Here, we analytically derive the distribution of genotype network sizes for a hierarchy of models which successively incorporate features of increasingly realistic sequence-to-structure genotype-phenotype maps. The main feature of these models relies on the characterization of each phenotype through a prototypical sequence whose sites admit a variable fraction of letters of the alphabet. Our models interpolate between two limit distributions: a power-law distribution, when the ordering of sites in the prototypical sequence is strongly constrained, and a lognormal distribution, as suggested for RNA, when different orderings of the same set of sites yield different phenotypes. Our main result is the qualitative and quantitative identification of those features of sequence-to-structure maps that lead to different distributions of genotype network sizes. © 2017 The Author(s).
Determination of Ensemble-Average Pairwise Root Mean-Square Deviation from Experimental B-Factors
Kuzmanic, Antonija; Zagrovic, Bojan
2010-01-01
Abstract Root mean-square deviation (RMSD) after roto-translational least-squares fitting is a measure of global structural similarity of macromolecules used commonly. On the other hand, experimental x-ray B-factors are used frequently to study local structural heterogeneity and dynamics in macromolecules by providing direct information about root mean-square fluctuations (RMSF) that can also be calculated from molecular dynamics simulations. We provide a mathematical derivation showing that, given a set of conservative assumptions, a root mean-square ensemble-average of an all-against-all distribution of pairwise RMSD for a single molecular species,
Karageorgos, Ioannis; Gallagher, Elyssia S; Galvin, Connor; Gallagher, D Travis; Hudgens, Jeffrey W
2017-11-01
Monoclonal antibody pharmaceuticals are the fastest-growing class of therapeutics, with a wide range of clinical applications. To assure their safety, these protein drugs must demonstrate highly consistent purity and stability. Key to these objectives is higher order structure measurements validated by calibration to reference materials. We describe preparation, characterization, and crystal structure of the Fab fragment prepared from the NIST Reference Antibody RM 8671 (NISTmAb). NISTmAb is a humanized IgG1κ antibody, produced in murine cell culture and purified by standard biopharmaceutical production methods, developed at the National Institute of Standards and Technology (NIST) to serve as a reference material. The Fab fragment was derived from NISTmAb through papain cleavage followed by protein A based purification. The purified Fab fragment was characterized by SDS-PAGE, capillary gel electrophoresis, multi-angle light scattering, size exclusion chromatography, mass spectrometry, and x-ray crystallography. The crystal structure at 0.2 nm resolution includes four independent Fab molecules with complete light chains and heavy chains through Cys 223, enabling assessment of conformational variability and providing a well-characterized reference structure for research and engineering applications. This nonproprietary, publically available reference material of known higher-order structure can support metrology in biopharmaceutical applications, and it is a suitable platform for validation of molecular modeling studies. Published by Elsevier Ltd.
In Silico Analysis for the Study of Botulinum Toxin Structure
NASA Astrophysics Data System (ADS)
Suzuki, Tomonori; Miyazaki, Satoru
2010-01-01
Protein-protein interactions play many important roles in biological function. Knowledge of protein-protein complex structure is required for understanding the function. The determination of protein-protein complex structure by experimental studies remains difficult, therefore computational prediction of protein structures by structure modeling and docking studies is valuable method. In addition, MD simulation is also one of the most popular methods for protein structure modeling and characteristics. Here, we attempt to predict protein-protein complex structure and property using some of bioinformatic methods, and we focus botulinum toxin complex as target structure.
Probing short-range protein Brownian motion in the cytoplasm of living cells.
Di Rienzo, Carmine; Piazza, Vincenzo; Gratton, Enrico; Beltram, Fabio; Cardarelli, Francesco
2014-12-23
The translational motion of molecules in cells deviates from what is observed in dilute solutions. Theoretical models provide explanations for this effect but with predictions that drastically depend on the nanoscale organization assumed for macromolecular crowding agents. A conclusive test of the nature of the translational motion in cells is missing owing to the lack of techniques capable of probing crowding with the required temporal and spatial resolution. Here we show that fluorescence-fluctuation analysis of raster scans at variable timescales can provide this information. By using green fluorescent proteins in cells, we measure protein motion at the unprecedented timescale of 1 μs, unveiling unobstructed Brownian motion from 25 to 100 nm, and partially suppressed diffusion above 100 nm. Furthermore, experiments on model systems attribute this effect to the presence of relatively immobile structures rather than to diffusing crowding agents. We discuss the implications of these results for intracellular processes.
Amyloid fibers provide structural integrity to Bacillus subtilis biofilms.
Romero, Diego; Aguilar, Claudio; Losick, Richard; Kolter, Roberto
2010-02-02
Bacillus subtilis forms biofilms whose constituent cells are held together by an extracellular matrix. Previous studies have shown that the protein TasA and an exopolysaccharide are the main components of the matrix. Given the importance of TasA in biofilm formation, we characterized the physicochemical properties of this protein. We report that purified TasA forms fibers of variable length and 10-15 nm in width. Biochemical analyses, in combination with the use of specific dyes and microscopic analyses, indicate that TasA forms amyloid fibers. Consistent with this hypothesis, TasA fibers required harsh treatments (e.g., formic acid) to be depolymerized. When added to a culture of a tasA mutant, purified TasA restored wild-type biofilm morphology, indicating that the purified protein retained biological activity. We propose that TasA forms amyloid fibers that bind cells together in the biofilm.
Schaffert, David H; Okholm, Anders H; Sørensen, Rasmus S; Nielsen, Jesper S; Tørring, Thomas; Rosen, Christian B; Kodal, Anne Louise B; Mortensen, Michael R; Gothelf, Kurt V; Kjems, Jørgen
2016-05-01
DNA origami provides rapid access to easily functionalized, nanometer-sized structures making it an intriguing platform for the development of defined drug delivery and sensor systems. Low cellular uptake of DNA nanostructures is a major obstacle in the development of DNA-based delivery platforms. Herein, significant strong increase in cellular uptake in an established cancer cell line by modifying a planar DNA origami structure with the iron transport protein transferrin (Tf) is demonstrated. A variable number of Tf molecules are coupled to the origami structure using a DNA-directed, site-selective labeling technique to retain ligand functionality. A combination of confocal fluorescence microscopy and quantitative (qPCR) techniques shows up to 22-fold increased cytoplasmic uptake compared to unmodified structures and with an efficiency that correlates to the number of transferrin molecules on the origami surface. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Betker, Jamie L.; Gomez, Joe; Anchordoquy, Thomas J.
2013-01-01
The use of lipoplexes for the intracellular delivery of nucleic acids typically involves the optimization of several parameters that are known to affect delivery. Researchers commonly vary charge ratio, and often incorporate different amounts of helper lipids (e.g., cholesterol) to optimize formulations for transfection in cell culture and in vivo. The results of such experiments are often interpreted in the context of nuclease resistance and cell association, but effects on the protein corona are usually not considered. While many studies have demonstrated that lipoplex structure and function can be dramatically compromised in the presence of serum, little attention has been paid to the adsorption of specific proteins and how this might be affected by formulation parameters. In this study, we characterize changes in the protein corona that occur as DOTAP-based lipoplexes are formulated with different amounts of cholesterol and prepared at different charge ratios. Our results demonstrate a significant effect of lipid composition on both total protein adsorption as well as the individual proteins from fetal calf serum that are associated with lipoplexes. In addition, we show that PEGylation increases protein adsorption with our formulations; effects that depend on the type of PEG conjugate employed in the lipoplex. Attempts to identify a specific protein responsible for enhancing transfection were unsuccessful. PMID:23920037
Entropy in molecular recognition by proteins.
Caro, José A; Harpole, Kyle W; Kasinath, Vignesh; Lim, Jackwee; Granja, Jeffrey; Valentine, Kathleen G; Sharp, Kim A; Wand, A Joshua
2017-06-20
Molecular recognition by proteins is fundamental to molecular biology. Dissection of the thermodynamic energy terms governing protein-ligand interactions has proven difficult, with determination of entropic contributions being particularly elusive. NMR relaxation measurements have suggested that changes in protein conformational entropy can be quantitatively obtained through a dynamical proxy, but the generality of this relationship has not been shown. Twenty-eight protein-ligand complexes are used to show a quantitative relationship between measures of fast side-chain motion and the underlying conformational entropy. We find that the contribution of conformational entropy can range from favorable to unfavorable, which demonstrates the potential of this thermodynamic variable to modulate protein-ligand interactions. For about one-quarter of these complexes, the absence of conformational entropy would render the resulting affinity biologically meaningless. The dynamical proxy for conformational entropy or "entropy meter" also allows for refinement of the contributions of solvent entropy and the loss in rotational-translational entropy accompanying formation of high-affinity complexes. Furthermore, structure-based application of the approach can also provide insight into long-lived specific water-protein interactions that escape the generic treatments of solvent entropy based simply on changes in accessible surface area. These results provide a comprehensive and unified view of the general role of entropy in high-affinity molecular recognition by proteins.
Crystal structure of the 25 kDa subunit of human cleavage factor Im
Coseno, Molly; Martin, Georges; Berger, Christopher; Gilmartin, Gregory; Keller, Walter; Doublié, Sylvie
2008-01-01
Cleavage factor Im is an essential component of the pre-messenger RNA 3′-end processing machinery in higher eukaryotes, participating in both the polyadenylation and cleavage steps. Cleavage factor Im is an oligomer composed of a small 25 kDa subunit (CF Im25) and a variable larger subunit of either 59, 68 or 72 kDa. The small subunit also interacts with RNA, poly(A) polymerase, and the nuclear poly(A)-binding protein. These protein–protein interactions are thought to be facilitated by the Nudix domain of CF Im25, a hydrolase motif with a characteristic α/β/α fold and a conserved catalytic sequence or Nudix box. We present here the crystal structures of human CF Im25 in its free and diadenosine tetraphosphate (Ap4A) bound forms at 1.85 and 1.80 Å, respectively. CF Im25 crystallizes as a dimer and presents the classical Nudix fold. Results from crystallographic and biochemical experiments suggest that CF Im25 makes use of its Nudix fold to bind but not hydrolyze ATP and Ap4A. The complex and apo protein structures provide insight into the active oligomeric state of CF Im and suggest a possible role of nucleotide binding in either the polyadenylation and/or cleavage steps of pre-messenger RNA 3′-end processing. PMID:18445629
Preservation of the metaproteome: variability of protein preservation in ancient dental calculus
Mackie, Meaghan; Hendy, Jessica; Lowe, Abigail D.; Sperduti, Alessandra; Holst, Malin; Collins, Matthew J.; Speller, Camilla F.
2017-01-01
ABSTRACT Proteomic analysis of dental calculus is emerging as a powerful tool for disease and dietary characterisation of archaeological populations. To better understand the variability in protein results from dental calculus, we analysed 21 samples from three Roman-period populations to compare: 1) the quantity of extracted protein; 2) the number of mass spectral queries; and 3) the number of peptide spectral matches and protein identifications. We found little correlation between the quantity of calculus analysed and total protein identifications, as well as no systematic trends between site location and protein preservation. We identified a wide range of individual variability, which may be associated with the mechanisms of calculus formation and/or post-depositional contamination, in addition to taphonomic factors. Our results suggest dental calculus is indeed a stable, long-term reservoir of proteins as previously reported, but further systematic studies are needed to identify mechanisms associated with protein entrapment and survival in dental calculus. PMID:29098079
Amid, Bahareh Tabatabaee; Mirhosseini, Hamed
2012-01-01
In recent years, the demand for a natural plant-based polymer with potential functions from plant sources has increased considerably. The main objective of the current study was to study the effect of chemical extraction conditions on the rheological and functional properties of the heteropolysaccharide/protein biopolymer from durian (Durio zibethinus) seed. The efficiency of different extraction conditions was determined by assessing the extraction yield, protein content, solubility, rheological properties and viscoelastic behavior of the natural polymer from durian seed. The present study revealed that the soaking process had a more significant (p < 0.05) effect than the decolorizing process on the rheological and functional properties of the natural polymer. The considerable changes in the rheological and functional properties of the natural polymer could be due to the significant (p < 0.05) effect of the chemical extraction variables on the protein fraction present in the molecular structure of the natural polymer from durian seed. The natural polymer from durian seed had a more elastic (or gel like) behavior compared to the viscous (liquid like) behavior at low frequency. The present study revealed that the natural heteropolysaccharide/protein polymer from durian seed had a relatively low solubility ranging from 9.1% to 36.0%. This might be due to the presence of impurities, insoluble matter and large particles present in the chemical structure of the natural polymer from durian seed. PMID:23203099
Bashan, Anat; Yonath, Ada
2009-01-01
Crystallography of ribosomes, the universal cell nucleoprotein assemblies facilitating the translation of the genetic-code into proteins, met with severe problems owing to their large size, complex structure, inherent flexibility and high conformational variability. For the case of the small ribosomal subunit, which caused extreme difficulties, post crystallization treatment by minute amounts of a heteropolytungstate cluster allowed structure determination at atomic resolution. This cluster played a dual role in ribosomal crystallography: providing anomalous phasing power and dramatically increased the resolution, by stabilization of a selected functional conformation. Thus, four out of the fourteen clusters that bind to each of the crystallized small subunits are attached to a specific ribosomal protein in a fashion that may control a significant component of the subunit internal flexibility, by “gluing” symmetrical related subunits. Here we highlight basic issues in the relationship between metal ions and macromolecules and present common traits controlling in the interactions between polymetalates and various macromolecules, which may be extended towards the exploitation of polymetalates for therapeutical treatment. PMID:19915655
Flot, Jean-François; Tillier, Simon
2007-10-15
The complete mitochondrial genomes of two individuals attributed to different morphospecies of the scleractinian coral genus Pocillopora have been sequenced. Both genomes, respectively 17,415 and 17,422 nt long, share the presence of a previously undescribed ORF encoding a putative protein made up of 302 amino acids and of unknown function. Surprisingly, this ORF turns out to be the second most variable region of the mitochondrial genome (1% nucleotide sequence difference between the two individuals) after the putative control region (1.5% sequence difference). Except for the presence of this ORF and for the location of the putative control region, the mitochondrial genome of Pocillopora is organized in a fashion similar to the other scleractinian coral genomes published to date. For the first time in a cnidarian, a putative second origin of replication is described based on its secondary structure similar to the stem-loop structure of O(L), the origin of L-strand replication in vertebrates.
Zilio, Nicola; Boddy, Michael N
2017-03-01
The tandem affinity purification (TAP) method uses an epitope that contains two different affinity purification tags separated by a site-specific protease site to isolate a protein rapidly and easily. Proteins purified via the TAP tag are eluted under mild conditions, allowing them to be used for structural and biochemical analyses. The original TAP tag contains a calmodulin-binding peptide and the IgG-binding domain from protein A separated by a tobacco etch virus (TEV) protease cleavage site. After capturing the Protein A epitope on an IgG resin, bound proteins are released by incubation with the TEV protease and then isolated on a calmodulin matrix in the presence of calcium; elution from this resin is achieved by chelating calcium with EGTA. However, because the robustness of the calmodulin-binding step in this procedure is highly variable, we replaced the calmodulin-binding peptide with three copies of the FLAG epitope, (3× FLAG)-TEV-Protein A, which can be isolated using an anti-FLAG resin. Elution from this matrix is achieved in the presence of an excess of a 3× FLAG peptide. In addition to allowing proteins to be released under mild conditions, elution by the 3× FLAG peptide adds an extra layer of specificity to the TAP procedure, because it liberates only FLAG-tagged proteins. © 2017 Cold Spring Harbor Laboratory Press.
Shabalina, Svetlana A.; Ogurtsov, Aleksey Y.; Spiridonov, Nikolay A.; Koonin, Eugene V.
2014-01-01
Alternative splicing (AS), alternative transcription initiation (ATI) and alternative transcription termination (ATT) create the extraordinary complexity of transcriptomes and make key contributions to the structural and functional diversity of mammalian proteomes. Analysis of mammalian genomic and transcriptomic data shows that contrary to the traditional view, the joint contribution of ATI and ATT to the transcriptome and proteome diversity is quantitatively greater than the contribution of AS. Although the mean numbers of protein-coding constitutive and alternative nucleotides in gene loci are nearly identical, their distribution along the transcripts is highly non-uniform. On average, coding exons in the variable 5′ and 3′ transcript ends that are created by ATI and ATT contain approximately four times more alternative nucleotides than core protein-coding regions that diversify exclusively via AS. Short upstream exons that encompass alternative 5′-untranslated regions and N-termini of proteins evolve under strong nucleotide-level selection whereas in 3′-terminal exons that encode protein C-termini, protein-level selection is significantly stronger. The groups of genes that are subject to ATI and ATT show major differences in biological roles, expression and selection patterns. PMID:24792168
Gross, G; Snel, J; Boekhorst, J; Smits, M A; Kleerebezem, M
2010-03-01
Recently, we have identified the mannose-specific adhesin encoding gene (msa) of Lactobacillus plantarum. In the current study, structure and function of this potentially probiotic effector gene were further investigated, exploring genetic diversity of msa in L. plantarum in relation to mannose adhesion capacity. The results demonstrate that there is considerable variation in quantitative in vitro mannose adhesion capacity, which is paralleled by msa gene sequence variation. The msa genes of different L. plantarum strains encode proteins with variable domain composition. Construction of L. plantarum 299v mutant strains revealed that the msa gene product is the key-protein for mannose adhesion, also in a strain with high mannose adhering capacity. However, no straightforward correlation between adhesion capacity and domain composition of Msa in L. plantarum could be identified. Nevertheless, differences in Msa sequences in combination with variable genetic background of specific bacterial strains appears to determine mannose adhesion capacity and potentially affects probiotic properties. These findings exemplify the strain-specificity of probiotic characteristics and illustrate the need for careful and molecular selection of new candidate probiotics.
Tertiary structure of human {Lambda}6 light chains.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pokkuluri, P. R.; Solomon, A.; Weiss, D. T.
1999-01-01
AL amyloidosis is a disease process characterized by the pathologic deposition of monoclonal light chains in tissue. To date, only limited information has been obtained on the molecular features that render such light chains amyloidogenic. Although protein products of the major human V kappa and V lambda gene families have been identified in AL deposits, one particular subgroup--lambda 6--has been found to be preferentially associated with this disease. Notably, the variable region of lambda 6 proteins (V lambda 6) has distinctive primary structural features including the presence in the third framework region (FR3) of two additional amino acid residues thatmore » distinguish members of this subgroup from other types of light chains. However, the structural consequences of these alterations have not been elucidated. To determine if lambda 6 proteins possess unique tertiary structural features, as compared to light chains of other V lambda subgroups, we have obtained x-ray diffraction data on crystals prepared from two recombinant V lambda 6 molecules. These components, isolated from a bacterial expression system, were generated from lambda 6-related cDNAs cloned from bone marrow-derived plasma cells from a patient (Wil) who had documented AL amyloidosis and another (Jto) with multiple myeloma and tubular cast nephropathy, but no evident fibrillar deposits. The x-ray crystallographic analyses revealed that the two-residue insertion located between positions 68 and 69 (not between 66 and 67 as previously surmised) extended an existing loop region that effectively increased the surface area adjacent to the first complementarity determining region (CDR1). Further, an unusual interaction between the Arg 25 and Phe 2 residues commonly found in lambda 6 molecules was noted. However, the structures of V lambda 6 Wil and Jto also differed from each other, as evidenced by the presence in the latter of certain ionic and hydrophobic interactions that we posit increased protein stability and thus prevented amyloid formation.« less
Molecular architecture of polycomb repressive complexes
Chittock, Emily C.; Latwiel, Sebastian; Miller, Thomas C.R.
2017-01-01
The polycomb group (PcG) proteins are a large and diverse family that epigenetically repress the transcription of key developmental genes. They form three broad groups of polycomb repressive complexes (PRCs) known as PRC1, PRC2 and Polycomb Repressive DeUBiquitinase, each of which modifies and/or remodels chromatin by distinct mechanisms that are tuned by having variable compositions of core and accessory subunits. Until recently, relatively little was known about how the various PcG proteins assemble to form the PRCs; however, studies by several groups have now allowed us to start piecing together the PcG puzzle. Here, we discuss some highlights of recent PcG structures and the insights they have given us into how these complexes regulate transcription through chromatin. PMID:28202673
[Entification of the Rubella virus genotype 1H in Western Siberia].
Seregin, S V; Babkin, I V; Petrova, I D; Iashina, L N; Malkova, E M; Petrov, V S
2011-01-01
Molecular epidemiological study of novel strain of Rubella virus isolated during the outbreak in Western Siberia in 2004 was described. Detailed phylogenetic analysis performed based upon entire SP-region, which encodes all three Rubella structural proteins (C, E2, and E1), was implemented. This analysis provides characterization of this strain and classifies it as 1H genotype, thereby correcting previous classification of this strain based upon shorter nucleotide sequence, only encoding E1 protein. Therefore, this study identified the genotype of the Rubella virus not previously detected in Western Siberia (and even entire Russian Federation), which highlights the importance of more extensive characterization of genetic variability of the Rubella virus, especially with regard to potential influence of vaccination on the Rubella virus mutagenesis.
Grasping the nettle: A bacterial invasin that targets immunoglobulin variable domains.
Barlow, Paul
2018-06-01
In a new paper, the protein InvD from Yersinia pseudotuberculosis , a zoonotic pathogen, is shown to assist late-stage invasion of intestinal epithelia. Remarkably, InvD acts by binding the Fab region of IgG or IgA. It straddles adjacent light-chain and heavy-chain variable domains, but its binding is different from that of antigens in that complementarity-determining regions do not participate. Structure determination revealed that its Fab-interacting domain adopts an immunoglobulin-like fold, fused to the preceding immunoglobulin-like domain and carried on a long stalk anchored to the bacterial outer membrane. Possible roles of this unusual host-pathogen interaction include avoidance of clearance from the intestine by secretory IgA. © 2018 Barlow.
Clustering coefficients of protein-protein interaction networks
NASA Astrophysics Data System (ADS)
Miller, Gerald A.; Shi, Yi Y.; Qian, Hong; Bomsztyk, Karol
2007-05-01
The properties of certain networks are determined by hidden variables that are not explicitly measured. The conditional probability (propagator) that a vertex with a given value of the hidden variable is connected to k other vertices determines all measurable properties. We study hidden variable models and find an averaging approximation that enables us to obtain a general analytical result for the propagator. Analytic results showing the validity of the approximation are obtained. We apply hidden variable models to protein-protein interaction networks (PINs) in which the hidden variable is the association free energy, determined by distributions that depend on biochemistry and evolution. We compute degree distributions as well as clustering coefficients of several PINs of different species; good agreement with measured data is obtained. For the human interactome two different parameter sets give the same degree distributions, but the computed clustering coefficients differ by a factor of about 2. This shows that degree distributions are not sufficient to determine the properties of PINs.
Crowther, Gregory J.; Napuli, Alberto J.; Thomas, Andrew P.; Chung, Diana J.; Kovzun, Kuzma V.; Leibly, David J.; Castaneda, Lisa J.; Bhandari, Janhavi; Damman, Christopher J.; Hui, Raymond; Hol, Wim G. J.; Buckner, Frederick S.; Verlinde, Christophe L. M. J.; Zhang, Zhongsheng; Fan, Erkang; Van Voorhis, Wesley C.
2010-01-01
In the last decade, thermal melt/thermal shift assays have become a common tool for identifying ligands and other factors that stabilize specific proteins. Increased stability is indicated by an increase in the protein's melting temperature (Tm). In optimizing the assays for subsequent screening of compound libraries, it is important to minimize the variability of Tm measurements so as to maximize the assay's ability to detect potential ligands. Here we present an investigation of Tm variability in recombinant proteins from Plasmodium parasites. Ligands of Plasmodium proteins are particularly interesting as potential starting points for drugs for malaria, and new drugs are urgently needed. A single standard buffer (100 mM HEPES, pH 7.5, 150 mM NaCl) permitted estimation of Tm for 58 of 61 Plasmodium proteins tested. However, with several proteins, Tm could not be measured with a consistency suitable for high-throughput screening unless alternative protein-specific buffers were employed. We conclude that buffer optimization to minimize variability in Tm measurements increases the success of thermal melt screens involving proteins for which a standard buffer is suboptimal. PMID:19470714
NASA Astrophysics Data System (ADS)
Bhattacharya, Susmita; Ghosh, Sudeshna; Dasgupta, Swagata; Roy, Anushree
2013-10-01
The difference in molecular structure of native HEWL and its fibrils, grown at a pH value near physiological pH 7.4 and at a pH value just above the pI, 10.7 in presence and absence of Cu(II) ions, is discussed. We focus on differences between the molecular structure of the native protein and fibrils using principal component analysis of their Raman spectra. The overlap areas of the scores of each species are used to quantify the difference in the structure of the native HEWL and fibrils in different environments. The overall molecular structures are significantly different for fibrils grown at two pH values. However, in presence of Cu(II) ions, the fibrils have similarities in their molecular structures at these pH environments. Spectral variation within each species, as obtained from the standard deviations of the scores in PCA plots, reveals the variability in the structure within a particular species.
Crystal structure of a 3B3 variant - A broadly neutralizing HIV-1 scFv antibody
DOE Office of Scientific and Technical Information (OSTI.GOV)
Clark, K. Reed; Walsh, Scott T.R.; NCH)
2009-12-10
We present the crystal structure determination of an anti-HIV-1 gp120 single-chain variable fragment antibody variant, 3B3, at 2.5 {angstrom} resolution. This 3B3 variant was derived from the b12 antibody, using phage display and site-directed mutagenesis of the variable heavy chain (V{sub H}) complementary-determining regions (CDRs). 3B3 exhibits enhanced binding affinity and neutralization activity against several cross-clade primary isolates of HIV-1 by interaction with the recessed CD4-binding site on the gp120 envelope protein. Comparison with the structures of the unbound and bound forms of b12, the 3B3 structure closely resembles these structures with minimal differences with two notable exceptions. First, theremore » is a reorientation of the CDR-H3 of the V{sub H} domain where the primary sequences evolved from b12 to 3B3. The structural changes in CDR-H3 of 3B3, in light of the b12-gp120 complex structure, allow for positioning an additional Trp side chain in the binding interface with gp120. Finally, the second region of structural change involves two peptide bond flips in CDR-L3 of the variable light (VL) domain triggered by a point mutation in CDR-H3 of Q100eY resulting in changes in the intramolecular hydrogen bonding patterning between the VL and VH domains. Thus, the enhanced binding affinities and neutralization capabilities of 3B3 relative to b12 probably result from higher hydrophobic driving potential by burying more aromatic residues at the 3B3-gp120 interface and by indirect stabilization of intramolecular contacts of the core framework residues between the VL and VH domains possibly through more favorable entropic effect through the expulsion of water.« less
Tuncbag, Nurcan; Gursoy, Attila; Nussinov, Ruth; Keskin, Ozlem
2011-08-11
Prediction of protein-protein interactions at the structural level on the proteome scale is important because it allows prediction of protein function, helps drug discovery and takes steps toward genome-wide structural systems biology. We provide a protocol (termed PRISM, protein interactions by structural matching) for large-scale prediction of protein-protein interactions and assembly of protein complex structures. The method consists of two components: rigid-body structural comparisons of target proteins to known template protein-protein interfaces and flexible refinement using a docking energy function. The PRISM rationale follows our observation that globally different protein structures can interact via similar architectural motifs. PRISM predicts binding residues by using structural similarity and evolutionary conservation of putative binding residue 'hot spots'. Ultimately, PRISM could help to construct cellular pathways and functional, proteome-scale annotation. PRISM is implemented in Python and runs in a UNIX environment. The program accepts Protein Data Bank-formatted protein structures and is available at http://prism.ccbb.ku.edu.tr/prism_protocol/.
Characterization of protein folding by a Φ-value calculation with a statistical-mechanical model.
Wako, Hiroshi; Abe, Haruo
2016-01-01
The Φ-value analysis approach provides information about transition-state structures along the folding pathway of a protein by measuring the effects of an amino acid mutation on folding kinetics. Here we compared the theoretically calculated Φ values of 27 proteins with their experimentally observed Φ values; the theoretical values were calculated using a simple statistical-mechanical model of protein folding. The theoretically calculated Φ values reflected the corresponding experimentally observed Φ values with reasonable accuracy for many of the proteins, but not for all. The correlation between the theoretically calculated and experimentally observed Φ values strongly depends on whether the protein-folding mechanism assumed in the model holds true in real proteins. In other words, the correlation coefficient can be expected to illuminate the folding mechanisms of proteins, providing the answer to the question of which model more accurately describes protein folding: the framework model or the nucleation-condensation model. In addition, we tried to characterize protein folding with respect to various properties of each protein apart from the size and fold class, such as the free-energy profile, contact-order profile, and sensitivity to the parameters used in the Φ-value calculation. The results showed that any one of these properties alone was not enough to explain protein folding, although each one played a significant role in it. We have confirmed the importance of characterizing protein folding from various perspectives. Our findings have also highlighted that protein folding is highly variable and unique across different proteins, and this should be considered while pursuing a unified theory of protein folding.
Characterization of protein folding by a Φ-value calculation with a statistical-mechanical model
Wako, Hiroshi; Abe, Haruo
2016-01-01
The Φ-value analysis approach provides information about transition-state structures along the folding pathway of a protein by measuring the effects of an amino acid mutation on folding kinetics. Here we compared the theoretically calculated Φ values of 27 proteins with their experimentally observed Φ values; the theoretical values were calculated using a simple statistical-mechanical model of protein folding. The theoretically calculated Φ values reflected the corresponding experimentally observed Φ values with reasonable accuracy for many of the proteins, but not for all. The correlation between the theoretically calculated and experimentally observed Φ values strongly depends on whether the protein-folding mechanism assumed in the model holds true in real proteins. In other words, the correlation coefficient can be expected to illuminate the folding mechanisms of proteins, providing the answer to the question of which model more accurately describes protein folding: the framework model or the nucleation-condensation model. In addition, we tried to characterize protein folding with respect to various properties of each protein apart from the size and fold class, such as the free-energy profile, contact-order profile, and sensitivity to the parameters used in the Φ-value calculation. The results showed that any one of these properties alone was not enough to explain protein folding, although each one played a significant role in it. We have confirmed the importance of characterizing protein folding from various perspectives. Our findings have also highlighted that protein folding is highly variable and unique across different proteins, and this should be considered while pursuing a unified theory of protein folding. PMID:28409079
Deciphering the Hidden Informational Content of Protein Sequences
Liu, Ming; Hua, Qing-xin; Hu, Shi-Quan; Jia, Wenhua; Yang, Yanwu; Saith, Sunil Evan; Whittaker, Jonathan; Arvan, Peter; Weiss, Michael A.
2010-01-01
Protein sequences encode both structure and foldability. Whereas the interrelationship of sequence and structure has been extensively investigated, the origins of folding efficiency are enigmatic. We demonstrate that the folding of proinsulin requires a flexible N-terminal hydrophobic residue that is dispensable for the structure, activity, and stability of the mature hormone. This residue (PheB1 in placental mammals) is variably positioned within crystal structures and exhibits 1H NMR motional narrowing in solution. Despite such flexibility, its deletion impaired insulin chain combination and led in cell culture to formation of non-native disulfide isomers with impaired secretion of the variant proinsulin. Cellular folding and secretion were maintained by hydrophobic substitutions at B1 but markedly perturbed by polar or charged side chains. We propose that, during folding, a hydrophobic side chain at B1 anchors transient long-range interactions by a flexible N-terminal arm (residues B1–B8) to mediate kinetic or thermodynamic partitioning among disulfide intermediates. Evidence for the overall contribution of the arm to folding was obtained by alanine scanning mutagenesis. Together, our findings demonstrate that efficient folding of proinsulin requires N-terminal sequences that are dispensable in the native state. Such arm-dependent folding can be abrogated by mutations associated with β-cell dysfunction and neonatal diabetes mellitus. PMID:20663888
In silico study of amyloid -protein folding and oligomerization
NASA Astrophysics Data System (ADS)
Urbanc, B.; Cruz, L.; Yun, S.; Buldyrev, S. V.; Bitan, G.; Teplow, D. B.; Stanley, H. E.
2004-12-01
Experimental findings suggest that oligomeric forms of the amyloid protein (A) play a critical role in Alzheimer's disease. Thus, elucidating their structure and the mechanisms of their formation is critical for developing therapeutic agents. We use discrete molecular dynamics simulations and a four-bead protein model to study oligomerization of two predominant alloforms, A40 and A42, at the atomic level. The four-bead model incorporates backbone hydrogen-bond interactions and amino acid-specific interactions mediated through hydrophobic and hydrophilic elements of the side chains. During the simulations we observe monomer folding and aggregation of monomers into oligomers of variable sizes. A40 forms significantly more dimers than A42, whereas pentamers are significantly more abundant in A42 relative to A40. Structure analysis reveals a turn centered at Gly-37-Gly-38 that is present in a folded A42 monomer but not in a folded A40 monomer and is associated with the first contacts that form during monomer folding. Our results suggest that this turn plays an important role in A42 pentamer formation. A pentamers have a globular structure comprising hydrophobic residues within the pentamer's core and hydrophilic N-terminal residues at the surface of the pentamer. The N termini of A40 pentamers are more spatially restricted than A42 pentamers. A40 pentamers form a -strand structure involving Ala-2-Phe-4, which is absent in A42 pentamers. These structural differences imply a different degree of hydrophobic core exposure between pentamers of the two alloforms, with the hydrophobic core of the Aβ42 pentamer being more exposed and thus more prone to form larger oligomers.
Unique secreted–surface protein complex of Lactobacillus rhamnosus, identified by phage display
Gagic, Dragana; Wen, Wesley; Collett, Michael A; Rakonjac, Jasna
2013-01-01
Proteins are the most diverse structures on bacterial surfaces; hence, they are candidates for species- and strain-specific interactions of bacteria with the host, environment, and other microorganisms. Genomics has decoded thousands of bacterial surface and secreted proteins, yet the function of most cannot be predicted because of the enormous variability and a lack of experimental data that would allow deduction of function through homology. Here, we used phage display to identify a pair of interacting extracellular proteins in the probiotic bacterium Lactobacillus rhamnosus HN001. A secreted protein, SpcA, containing two bacterial immunoglobulin-like domains type 3 (Big-3) and a domain distantly related to plant pathogen response domain 1 (PR-1-like) was identified by screening of an L. rhamnosus HN001 library using HN001 cells as bait. The SpcA-“docking” protein, SpcB, was in turn detected by another phage display library screening, using purified SpcA as bait. SpcB is a 3275-residue cell-surface protein that contains general features of large glycosylated Serine-rich adhesins/fibrils from gram-positive bacteria, including the hallmark signal sequence motif KxYKxGKxW. Both proteins are encoded by genes within a L. rhamnosus-unique gene cluster that distinguishes this species from other lactobacilli. To our knowledge, this is the first example of a secreted-docking protein pair identified in lactobacilli. PMID:23233310
NASA Astrophysics Data System (ADS)
Vitarelli, Michael J.; Talaga, David S.
2013-09-01
Single solid-state nanopores find increasing use for electrical detection and/or manipulation of macromolecules. These applications exploit the changes in signals due to the geometry and electrical properties of the molecular species found within the nanopore. The sensitivity and resolution of such measurements are also influenced by the geometric and electrical properties of the nanopore. This paper continues the development of an analytical theory to predict the electrochemical impedance spectra of nanopores by including the influence of the presence of an unfolded protein using the variable topology finite Warburg impedance model previously published by the authors. The local excluded volume of, and charges present on, the segment of protein sampled by the nanopore are shown to influence the shape and peak frequency of the electrochemical impedance spectrum. An analytical theory is used to relate the capacitive response of the electrical double layer at the surface of the protein to both the charge density at the protein surface and the more commonly measured zeta potential. Illustrative examples show how the theory predicts that the varying sequential regions of surface charge density and excluded volume dictated by the protein primary structure may allow for an impedance-based approach to identifying unfolded proteins.
Bourgeois, Michael; Jacquin, Françoise; Cassecuelle, Florence; Savois, Vincent; Belghazi, Maya; Aubert, Grégoire; Quillien, Laurence; Huart, Myriam; Marget, Pascal; Burstin, Judith
2011-05-01
Legume seeds are a major source of dietary proteins for humans and animals. Deciphering the genetic control of their accumulation is thus of primary significance towards their improvement. At first, we analysed the genetic variability of the pea seed proteome of three genotypes over 3 years of cultivation. This revealed that seed protein composition variability was under predominant genetic control, with as much as 60% of the spots varying quantitatively among the three genotypes. Then, by combining proteomic and quantitative trait loci (QTL) mapping approaches, we uncovered the genetic architecture of seed proteome variability. Protein quantity loci (PQL) were searched for 525 spots detected on 2-D gels obtained for 157 recombinant inbred lines. Most protein quantity loci mapped in clusters, suggesting that the accumulation of the major storage protein families was under the control of a limited number of loci. While convicilin accumulation was mainly under the control of cis-regulatory regions, vicilins and legumins were controlled by both cis- and trans-regulatory regions. Some loci controlled both seed protein composition and protein content and a locus on LGIIa appears to be a major regulator of protein composition and of protein in vitro digestibility. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Statistical Analysis of Big Data on Pharmacogenomics
Fan, Jianqing; Liu, Han
2013-01-01
This paper discusses statistical methods for estimating complex correlation structure from large pharmacogenomic datasets. We selectively review several prominent statistical methods for estimating large covariance matrix for understanding correlation structure, inverse covariance matrix for network modeling, large-scale simultaneous tests for selecting significantly differently expressed genes and proteins and genetic markers for complex diseases, and high dimensional variable selection for identifying important molecules for understanding molecule mechanisms in pharmacogenomics. Their applications to gene network estimation and biomarker selection are used to illustrate the methodological power. Several new challenges of Big data analysis, including complex data distribution, missing data, measurement error, spurious correlation, endogeneity, and the need for robust statistical methods, are also discussed. PMID:23602905
Linking epigenetic function to electrostatics: The DNMT2 structural model example.
Vieira, Gilberto Cavalheiro; Vieira, Gustavo Fioravanti; Sinigaglia, Marialva; Silva Valente, Vera Lúcia da
2017-01-01
The amino acid sequence of DNMT2 is very similar to the catalytic domains of bacterial and eukaryotic proteins. However, there is great variability in the region of recognition of the target sequence. While bacterial DNMT2 acts as a DNA methyltransferase, previous studies have indicated low DNA methylation activity in eukaryotic DNMT2, with preference by tRNA methylation. Drosophilids are known as DNMT2-only species and the DNA methylation phenomenon is a not elucidated case yet, as well as the ontogenetic and physiologic importance of DNMT2 for this species group. In addition, more recently study showed that methylation in the genome in Drosophila melanogaster is independent in relation to DNMT2. Despite these findings, Drosophilidae family has more than 4,200 species with great ecological diversity and historical evolution, thus we, therefore, aimed to examine the drosophilids DNMT2 in order to verify its conservation at the physicochemical and structural levels in a functional context. We examined the twenty-six DNMT2 models generated by molecular modelling and five crystallographic structures deposited in the Protein Data Bank (PDB) using different approaches. Our results showed that despite sequence and structural similarity between species close related, we found outstanding differences when they are analyzed in the context of surface distribution of electrostatic properties. The differences found in the electrostatic potentials may be linked with different affinities and processivity of DNMT2 for its different substrates (DNA, RNA or tRNA) and even for interactions with other proteins involved in the epigenetic mechanisms.
Linking epigenetic function to electrostatics: The DNMT2 structural model example
Vieira, Gustavo Fioravanti; da Silva Valente, Vera Lúcia
2017-01-01
The amino acid sequence of DNMT2 is very similar to the catalytic domains of bacterial and eukaryotic proteins. However, there is great variability in the region of recognition of the target sequence. While bacterial DNMT2 acts as a DNA methyltransferase, previous studies have indicated low DNA methylation activity in eukaryotic DNMT2, with preference by tRNA methylation. Drosophilids are known as DNMT2-only species and the DNA methylation phenomenon is a not elucidated case yet, as well as the ontogenetic and physiologic importance of DNMT2 for this species group. In addition, more recently study showed that methylation in the genome in Drosophila melanogaster is independent in relation to DNMT2. Despite these findings, Drosophilidae family has more than 4,200 species with great ecological diversity and historical evolution, thus we, therefore, aimed to examine the drosophilids DNMT2 in order to verify its conservation at the physicochemical and structural levels in a functional context. We examined the twenty-six DNMT2 models generated by molecular modelling and five crystallographic structures deposited in the Protein Data Bank (PDB) using different approaches. Our results showed that despite sequence and structural similarity between species close related, we found outstanding differences when they are analyzed in the context of surface distribution of electrostatic properties. The differences found in the electrostatic potentials may be linked with different affinities and processivity of DNMT2 for its different substrates (DNA, RNA or tRNA) and even for interactions with other proteins involved in the epigenetic mechanisms. PMID:28575027
Characterization of casein gene complex and genetic diversity analysis in Indian goats.
Rout, P K; Kumar, A; Mandal, A; Laloe, D; Singh, S K; Roy, R
2010-04-01
Milk protein polymorphism plays an important role in genetic diversity analysis, phylogenetic studies, establishing geographical diversity, conservation decision, and improving breeding goals. Milk protein polymorphism in Indian goat breeds has not been well studied; therefore, an investigation was carried out to analyze the genetic structure of the casein gene and milk protein diversity at six milk protein loci in nine Indian goat breeds/genetic groups from varied agro-climatic zones. Milk protein genotyping was carried out in 1098 individual milk samples by SDS-PAGE at alphaS1-CN (CSN1S1), beta-CN (CSN2), alphaS2-CN (CSN1S2), kappa-CN (CSN3), beta-LG, and alpha-LA loci. Indian goats exhibited alphaS1-casein A allele in higher frequency in the majority of breeds except Ganjam and local goats. The alphaS1-casein A allele frequencies varied from 0.45 to 0.77. A total of 16 casein haplotypes were observed in seven breeds and breed specific haplotypes were observed with respect to geographic region. The average number of alleles was lowest in Ganjam (1.66 +/- 0.81) and highest in Sirohi goats (2.50 +/- 1.05). Expected heterozygosity at six different loci demonstrated genetic diversity and breed fragmentation. Neighbor-Joining tree was built basing on Nei's distance. There was about 16.95% variability due to differences between breeds, indicating a strong subdivision. Principal component analysis was carried out to highlight the relationship among breeds. The variability among goat breeds was contributed by alphaS2-CN, beta-LG and alphaS1-CN. The Indian goats exhibited alphaS1-CN (CSN1S1) A allele in higher frequency in all the breeds indicating the higher casein yield in their milk.
NASA Astrophysics Data System (ADS)
Zhao, Haiqing
Centromere protein A (CENP-A) is a centromere-specific H3 histone variant and shares only about 50% amino acid sequence identity with the canonical H3 protein. CENP-A is required for packaging the centromere and for the proper separation of chromosomes during mitosis. Despite their discrete functions, previously reported crystal structures of the CENP-A/H4 and H3/H4 dimers reveal surprising similarity. In this work, we characterize the structure and dynamics of CENP-A/H4 and H3/H4 dimers with a dual-resolution approach, using both all-atom and coarse-grained (CG) molecular dynamics (MD) simulations. Interestingly, the histone dimer containing CENP-A is more structurally variable than the canonical H3 dimer. Furthermore, our calculations revealed significant conformational distinctions between the interface profiles of CENP-A/H4 and H3/H4. In addition, the presence of the CENP-A-specific chaperone HJURP dramatically reduced the conformational heterogeneity of CENP-A/H4. Overall, these results are in general agreement with the available experimental data and provide new dynamic insights into the mechanisms underpinning the chaperone-mediated assembly of CENP-A nucleosomes in vivo.
Utilization of protein intrinsic disorder knowledge in structural proteomics
Oldfield, Christopher J.; Xue, Bin; Van, Ya-Yue; Ulrich, Eldon L.; Markley, John L.; Dunker, A. Keith; Uversky, Vladimir N.
2014-01-01
Intrinsically disordered proteins (IDPs) and proteins with long disordered regions are highly abundant in various proteomes. Despite their lack of well-defined ordered structure, these proteins and regions are frequently involved in crucial biological processes. Although in recent years these proteins have attracted the attention of many researchers, IDPs represent a significant challenge for structural characterization since these proteins can impact many of the processes in the structure determination pipeline. Here we investigate the effects of IDPs on the structure determination process and the utility of disorder prediction in selecting and improving proteins for structural characterization. Examination of the extent of intrinsic disorder in existing crystal structures found that relatively few protein crystal structures contain extensive regions of intrinsic disorder. Although intrinsic disorder is not the only cause of crystallization failures and many structured proteins cannot be crystallized, filtering out highly disordered proteins from structure-determination target lists is still likely to be cost effective. Therefore it is desirable to avoid highly disordered proteins from structure-determination target lists and we show that disorder prediction can be applied effectively to enrich structure determination pipelines with proteins more likely to yield crystal structures. For structural investigation of specific proteins, disorder prediction can be used to improve targets for structure determination. Finally, a framework for considering intrinsic disorder in the structure determination pipeline is proposed. PMID:23232152
Structure based alignment and clustering of proteins (STRALCP)
Zemla, Adam T.; Zhou, Carol E.; Smith, Jason R.; Lam, Marisa W.
2013-06-18
Disclosed are computational methods of clustering a set of protein structures based on local and pair-wise global similarity values. Pair-wise local and global similarity values are generated based on pair-wise structural alignments for each protein in the set of protein structures. Initially, the protein structures are clustered based on pair-wise local similarity values. The protein structures are then clustered based on pair-wise global similarity values. For each given cluster both a representative structure and spans of conserved residues are identified. The representative protein structure is used to assign newly-solved protein structures to a group. The spans are used to characterize conservation and assign a "structural footprint" to the cluster.
Tischer, Alexander; Auton, Matthew
2013-09-01
We have analyzed the thermodynamic properties of the von Willebrand factor (VWF) A3 domain using urea-induced unfolding at variable temperature and thermal unfolding at variable urea concentrations to generate a phase diagram that quantitatively describes the equilibrium between native and denatured states. From this analysis, we were able to determine consistent thermodynamic parameters with various spectroscopic and calorimetric methods that define the urea-temperature parameter plane from cold denaturation to heat denaturation. Urea and thermal denaturation are experimentally reversible and independent of the thermal scan rate indicating that all transitions are at equilibrium and the van't Hoff and calorimetric enthalpies obtained from analysis of individual thermal transitions are equivalent demonstrating two-state character. Global analysis of the urea-temperature phase diagram results in a significantly higher enthalpy of unfolding than obtained from analysis of individual thermal transitions and significant cross correlations describing the urea dependence of ΔH0 and ΔCP0 that define a complex temperature dependence of the m-value. Circular dichroism (CD) spectroscopy illustrates a large increase in secondary structure content of the urea-denatured state as temperature increases and a loss of secondary structure in the thermally denatured state upon addition of urea. These structural changes in the denatured ensemble make up ∼40% of the total ellipticity change indicating a highly compact thermally denatured state. The difference between the thermodynamic parameters obtained from phase diagram analysis and those obtained from analysis of individual thermal transitions illustrates that phase diagrams capture both contributions to unfolding and denatured state expansion and by comparison are able to decipher these contributions. © 2013 The Protein Society.
A data set from flash X-ray imaging of carboxysomes
NASA Astrophysics Data System (ADS)
Hantke, Max F.; Hasse, Dirk; Ekeberg, Tomas; John, Katja; Svenda, Martin; Loh, Duane; Martin, Andrew V.; Timneanu, Nicusor; Larsson, Daniel S. D.; van der Schot, Gijs; Carlsson, Gunilla H.; Ingelman, Margareta; Andreasson, Jakob; Westphal, Daniel; Iwan, Bianca; Uetrecht, Charlotte; Bielecki, Johan; Liang, Mengning; Stellato, Francesco; Deponte, Daniel P.; Bari, Sadia; Hartmann, Robert; Kimmel, Nils; Kirian, Richard A.; Seibert, M. Marvin; Mühlig, Kerstin; Schorb, Sebastian; Ferguson, Ken; Bostedt, Christoph; Carron, Sebastian; Bozek, John D.; Rolles, Daniel; Rudenko, Artem; Foucar, Lutz; Epp, Sascha W.; Chapman, Henry N.; Barty, Anton; Andersson, Inger; Hajdu, Janos; Maia, Filipe R. N. C.
2016-08-01
Ultra-intense femtosecond X-ray pulses from X-ray lasers permit structural studies on single particles and biomolecules without crystals. We present a large data set on inherently heterogeneous, polyhedral carboxysome particles. Carboxysomes are cell organelles that vary in size and facilitate up to 40% of Earth’s carbon fixation by cyanobacteria and certain proteobacteria. Variation in size hinders crystallization. Carboxysomes appear icosahedral in the electron microscope. A protein shell encapsulates a large number of Rubisco molecules in paracrystalline arrays inside the organelle. We used carboxysomes with a mean diameter of 115±26 nm from Halothiobacillus neapolitanus. A new aerosol sample-injector allowed us to record 70,000 low-noise diffraction patterns in 12 min. Every diffraction pattern is a unique structure measurement and high-throughput imaging allows sampling the space of structural variability. The different structures can be separated and phased directly from the diffraction data and open a way for accurate, high-throughput studies on structures and structural heterogeneity in biology and elsewhere.
NASA Technical Reports Server (NTRS)
He, X. M.; Ruker, F.; Casale, E.; Carter, D. C.
1992-01-01
The three-dimensional structure of a human monoclonal antibody (Fab), which binds specifically to a major epitope of the transmembrane protein gp41 of the human immunodeficiency virus type 1, has been determined by crystallographic methods to a resolution of 2.7 A. It has been previously determined that this antibody recognizes the epitope SGKLICTTAVPWNAS, belongs to the subclass IgG1 (kappa), and exhibits antibody-dependent cellular cytotoxicity. The quaternary structure of the Fab is in an extended conformation with an elbow bend angle between the constant and variable domains of 175 degrees. Structurally, four of the hypervariable loops can be classified according to previously recognized canonical structures. The third hypervariable loops of the heavy (H3) and light chain (L3) are structurally distinct. Hypervariable loop H3, residues 102H-109H, is unusually extended from the surface. The complementarity-determining region forms a hydrophobic binding pocket that is created primarily from hypervariable loops L3, H3, and H2.
NASA Technical Reports Server (NTRS)
He, Xiao M.; Rueker, Florian; Casale, Elena; Carter, Daniel C.
1992-01-01
The three-dimensional structure of a human monoclonal antibody (Fab), which binds specifically to a major epitope of the transmembrane protein gp41 of the human immunodeficiency virus type 1, has been determined by crystallographic methods to a resolution of 2.7 A. It has been previously determined that this antibody recognizes the epitope SGKLICTTAVPWNAS, belongs to the subclass IgG1 (kappa), and exhibits antibody-dependent cellular cytotoxicity. The quaternary structure of the Fab is in an extended conformation with an elbow bend angle between the constant and variable domains of 175 deg. Structurally, four of the hypervariable loops can be classified according to previously recognized canonical structures. The third hypervariable loops of the heavy (H3) and light chain (L3) are structurally distinct. Hypervariable loop H3, residues 102H-109H, is unusually extended from the surface. The complementarity-determining region forms a hydrophobic binding pocket that is created primarily from hypervariable loops L3, H3, and H2.
Protein construct storage: Bayesian variable selection and prediction with mixtures.
Clyde, M A; Parmigiani, G
1998-07-01
Determining optimal conditions for protein storage while maintaining a high level of protein activity is an important question in pharmaceutical research. A designed experiment based on a space-filling design was conducted to understand the effects of factors affecting protein storage and to establish optimal storage conditions. Different model-selection strategies to identify important factors may lead to very different answers about optimal conditions. Uncertainty about which factors are important, or model uncertainty, can be a critical issue in decision-making. We use Bayesian variable selection methods for linear models to identify important variables in the protein storage data, while accounting for model uncertainty. We also use the Bayesian framework to build predictions based on a large family of models, rather than an individual model, and to evaluate the probability that certain candidate storage conditions are optimal.
The Crystal Structure of TAL Effector PthXo1 Bound to Its DNA Target
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mak, Amanda Nga-Sze; Bradley, Philip; Cernadas, Raul A.
2012-02-10
DNA recognition by TAL effectors is mediated by tandem repeats, each 33 to 35 residues in length, that specify nucleotides via unique repeat-variable diresidues (RVDs). The crystal structure of PthXo1 bound to its DNA target was determined by high-throughput computational structure prediction and validated by heavy-atom derivatization. Each repeat forms a left-handed, two-helix bundle that presents an RVD-containing loop to the DNA. The repeats self-associate to form a right-handed superhelix wrapped around the DNA major groove. The first RVD residue forms a stabilizing contact with the protein backbone, while the second makes a base-specific contact to the DNA sense strand.more » Two degenerate amino-terminal repeats also interact with the DNA. Containing several RVDs and noncanonical associations, the structure illustrates the basis of TAL effector-DNA recognition.« less
Dutta, Soumita
2017-01-01
ABSTRACT The unicellular green alga Chlamydomonas reinhardtii is an ideal model organism for studies of ciliary function and assembly. In assays for biological and biochemical effects of various factors on flagellar structure and function, synchronous culture is advantageous for minimizing variability. Here, we have characterized a method in which 100% synchronization is achieved with respect to flagellar length but not with respect to the cell cycle. The method requires inducing flagellar regeneration by amputation of the entire cell population and limiting regeneration time. This results in a maximally homogeneous distribution of flagellar lengths at 3 h postamputation. We found that time-limiting new protein synthesis during flagellar synchronization limits variability in the unassembled pool of limiting flagellar protein and variability in flagellar length without affecting the range of cell volumes. We also found that long- and short-flagella mutants that regenerate normally require longer and shorter synchronization times, respectively. By minimizing flagellar length variability using a simple method requiring only hours and no changes in media, flagellar synchronization facilitates the detection of small changes in flagellar length resulting from both chemical and genetic perturbations in Chlamydomonas. This method increases our ability to probe the basic biology of ciliary size regulation and related disease etiologies. IMPORTANCE Cilia and flagella are highly conserved antenna-like organelles that found in nearly all mammalian cell types. They perform sensory and motile functions contributing to numerous physiological and developmental processes. Defects in their assembly and function are implicated in a wide range of human diseases ranging from retinal degeneration to cancer. Chlamydomonas reinhardtii is an algal model system for studying mammalian cilium formation and function. Here, we report a simple synchronization method that allows detection of small changes in ciliary length by minimizing variability in the population. We find that this method alters the key relationship between cell size and the amount of protein accumulated for flagellar growth. This provides a rapid alternative to traditional methods of cell synchronization for uncovering novel regulators of cilia. PMID:28289724
Dutta, Soumita; Avasthi, Prachee
2017-01-01
The unicellular green alga Chlamydomonas reinhardtii is an ideal model organism for studies of ciliary function and assembly. In assays for biological and biochemical effects of various factors on flagellar structure and function, synchronous culture is advantageous for minimizing variability. Here, we have characterized a method in which 100% synchronization is achieved with respect to flagellar length but not with respect to the cell cycle. The method requires inducing flagellar regeneration by amputation of the entire cell population and limiting regeneration time. This results in a maximally homogeneous distribution of flagellar lengths at 3 h postamputation. We found that time-limiting new protein synthesis during flagellar synchronization limits variability in the unassembled pool of limiting flagellar protein and variability in flagellar length without affecting the range of cell volumes. We also found that long- and short-flagella mutants that regenerate normally require longer and shorter synchronization times, respectively. By minimizing flagellar length variability using a simple method requiring only hours and no changes in media, flagellar synchronization facilitates the detection of small changes in flagellar length resulting from both chemical and genetic perturbations in Chlamydomonas . This method increases our ability to probe the basic biology of ciliary size regulation and related disease etiologies. IMPORTANCE Cilia and flagella are highly conserved antenna-like organelles that found in nearly all mammalian cell types. They perform sensory and motile functions contributing to numerous physiological and developmental processes. Defects in their assembly and function are implicated in a wide range of human diseases ranging from retinal degeneration to cancer. Chlamydomonas reinhardtii is an algal model system for studying mammalian cilium formation and function. Here, we report a simple synchronization method that allows detection of small changes in ciliary length by minimizing variability in the population. We find that this method alters the key relationship between cell size and the amount of protein accumulated for flagellar growth. This provides a rapid alternative to traditional methods of cell synchronization for uncovering novel regulators of cilia.
Structure and variation of the mitochondrial genome of fishes.
Satoh, Takashi P; Miya, Masaki; Mabuchi, Kohji; Nishida, Mutsumi
2016-09-07
The mitochondrial (mt) genome has been used as an effective tool for phylogenetic and population genetic analyses in vertebrates. However, the structure and variability of the vertebrate mt genome are not well understood. A potential strategy for improving our understanding is to conduct a comprehensive comparative study of large mt genome data. The aim of this study was to characterize the structure and variability of the fish mt genome through comparative analysis of large datasets. An analysis of the secondary structure of proteins for 250 fish species (248 ray-finned and 2 cartilaginous fishes) illustrated that cytochrome c oxidase subunits (COI, COII, and COIII) and a cytochrome bc1 complex subunit (Cyt b) had substantial amino acid conservation. Among the four proteins, COI was the most conserved, as more than half of all amino acid sites were invariable among the 250 species. Our models identified 43 and 58 stems within 12S rRNA and 16S rRNA, respectively, with larger numbers than proposed previously for vertebrates. The models also identified 149 and 319 invariable sites in 12S rRNA and 16S rRNA, respectively, in all fishes. In particular, the present result verified that a region corresponding to the peptidyl transferase center in prokaryotic 23S rRNA, which is homologous to mt 16S rRNA, is also conserved in fish mt 16S rRNA. Concerning the gene order, we found 35 variations (in 32 families) that deviated from the common gene order in vertebrates. These gene rearrangements were mostly observed in the area spanning the ND5 gene to the control region as well as two tRNA gene cluster regions (IQM and WANCY regions). Although many of such gene rearrangements were unique to a specific taxon, some were shared polyphyletically between distantly related species. Through a large-scale comparative analysis of 250 fish species mt genomes, we elucidated various structural aspects of the fish mt genome and the encoded genes. The present results will be important for understanding functions of the mt genome and developing programs for nucleotide sequence analysis. This study demonstrated the significance of extensive comparisons for understanding the structure of the mt genome.
2008-01-01
Background Freshwater unionoidean bivalves, and species representing two marine bivalve orders (Mytiloida and Veneroida), exhibit a mode of mtDNA inheritance involving distinct maternal (F) and paternal (M) transmission routes concomitant with highly divergent gender-associated mtDNA genomes. Additionally, male unionoidean bivalves have a ~550 bp 3' coding extension to the cox2 gene (Mcox2e), that is apparently absent from all other metazoan taxa. Results Our molecular sequence analyses of MCOX2e indicate that both the primary and secondary structures of the MCOX2e region are evolving much faster than other regions of the F and M COX2-COX1 gene junction. The near N-terminus ~2/3 of the MCOX2e region contains an interspecifically variable number of predicted transmembrane helices (TMH) and interhelical loops (IHL) whereas the C-terminus ~1/3 is relatively conserved and hydrophilic while containing conserved functional motifs. MCOX2e displays an overall pattern of purifying selection that leads to the preservation of TMH/IHL and C-terminus tail sub-regions. However, 14 amino acid positions in the MCOX2e TMH/IHL sub-region might be targeted by diversifying selection, each representing a site where there exists interspecific variation for the constituent amino acids residing in a TMH or IHL. Conclusion Our results indicate that Mcox2e is unique to unionoidean bivalves, likely the result of a single insertion event that took place over 65 MYA and that MCOX2e is functional. The predicted TMH number, length and position variability likely stems from substitution-based processes rather than the typically implicated insertion/deletion events. MCOX2e has relatively high rates of primary and secondary structure evolution, with some amino acid residues potentially subjected to site-specific positive selection, yet an overall pattern of purifying selection leading to the preservation of the TMH/IHL and hydrophilic C-terminus tail subregions. The more conserved C-terminus tail (relative to the TMH/IHL sub-region of MCOX2e) is likely biologically active because it contains functional motifs. The rapid evolution of primary and secondary structure in MCOX2e, combined with the action of both positive and purifying selection, provide supporting evidence for the hypothesis that MCOX2e has a novel reproductive function within unionoidean bivalves. All tolled, our data indicate that unionoidean bivalve MCOX2 is the first reported chimeric animal mtDNA-encoded protein. PMID:18513440
Kato, Koichi; Nakayoshi, Tomoki; Fukuyoshi, Shuichi; Kurimoto, Eiji; Oda, Akifumi
2017-10-12
Although various higher-order protein structure prediction methods have been developed, almost all of them were developed based on the three-dimensional (3D) structure information of known proteins. Here we predicted the short protein structures by molecular dynamics (MD) simulations in which only Newton's equations of motion were used and 3D structural information of known proteins was not required. To evaluate the ability of MD simulationto predict protein structures, we calculated seven short test protein (10-46 residues) in the denatured state and compared their predicted and experimental structures. The predicted structure for Trp-cage (20 residues) was close to the experimental structure by 200-ns MD simulation. For proteins shorter or longer than Trp-cage, root-mean square deviation values were larger than those for Trp-cage. However, secondary structures could be reproduced by MD simulations for proteins with 10-34 residues. Simulations by replica exchange MD were performed, but the results were similar to those from normal MD simulations. These results suggest that normal MD simulations can roughly predict short protein structures and 200-ns simulations are frequently sufficient for estimating the secondary structures of protein (approximately 20 residues). Structural prediction method using only fundamental physical laws are useful for investigating non-natural proteins, such as primitive proteins and artificial proteins for peptide-based drug delivery systems.
Chemical composition and ruminal nutrient degradability of fresh and ensiled amaranth forage.
Seguin, Philippe; Mustafa, Arif F; Donnelly, Danielle J; Gélinas, Bruce
2013-12-01
Amaranth is a crop with potential as a source of forage for ruminants that has not been well characterized. A study was conducted to determine the impact of ensiling on the nutritional quality and ruminal degradability of forage from two amaranth cultivars adapted to North America (i.e. Plainsman and D136). In particular, quantification and some microscopic characterization of oxalate found in amaranth were performed as it is an antiquality compound of concern. There were limited interactions between cultivars and ensiling for most variables. Differences in chemical composition between amaranth cultivars were also limited. Ensiling reduced non-structural carbohydrate and true protein contents. The proportion of acid detergent protein was high in fresh and ensiled forages of both cultivars (average of 177 g kg(-1) crude protein). Total oxalate content averaged 30 and 25 g kg(-1) in fresh and ensiled forages respectively. Ensiling reduced soluble oxalate content. Crystals observed in amaranth were calcium oxalate druses found mostly in idioblast cells in leaf mesophyll and parenchyma of primary and secondary veins. In situ ruminal degradability data indicated that both fresh and ensiled amaranth are highly degradable in the rumen. This study confirms that amaranth is a suitable forage for ruminant animals. Its chemical composition is comparable, for most variables, to that of other commonly used forage species. © 2013 Society of Chemical Industry.
Monajjemi, Majid
2015-12-01
Cell membrane has a unique feature of storing biological energies in a physiologically relevant environment. This study illustrates a capacitor model of biological cell membrane including DPPC structures. The electron density profile models, electron localization function (ELF) and local information entropy have been applied to study the interaction of proteins with lipid bilayers in the cell membrane. The quantum and coulomb blockade effects of different thicknesses in the membrane have also been specifically investigated. It has been exhibited the quantum effects can appear in a small region of the free space within the membrane thickness due to the number and type of phospholipid layers. In addition, from the viewpoint of quantum effects by Heisenberg rule, it is shown the quantum tunneling is allowed in some micro positions while it is forbidden in other forms of membrane capacitor systems. Due to the dynamical behavior of the cell membrane, its capacitance is not fixed which results a variable capacitor. In presence of the external fields through protein trance membrane or ions, charges exert forces that can influence the state of the cell membrane. This causes to appear the charge capacitive susceptibility that can resonate with self-induction of helical coils; the resonance of which is the main reason for various biological pulses. Copyright © 2015 Elsevier B.V. All rights reserved.
Jdila, Marwa Ben; Issa, Abir Ben; Khabou, Boudour; Rhouma, Bochra Ben; Kamoun, Fatma; Ammar-Keskes, Leila; Triki, Chahnez; Fakhfakh, Faiza
2017-10-01
West syndrome is a rare epileptic encephalopathy of early infancy, characterized by epileptic spasms, hypsarrhythmia, and psychomotor retardation beginning in the first year of life. The present study reports the clinical, molecular and bioinformatic investigation in the three studied West patients. The results revealed a complex genotype with more than one mutation in each patient including the known mutations c.1910C>G (P2, P3); c.2372A>C in P3 and c.2395C>G in P1 and novel variants including c.616G>A, shared by the three patients P1, P2 and P3; c.1403G>C shared by P2 and P3 and c.2288A>G in patient P1. All the mutations were at somatic mosaic state and were de novo in the patients except ones (c.2372A>C). To our knowledge; the somatic mosaic state is described for the first time in patients with West syndrome. Five identified mutations were located in the C-terminal domain of the protein, while the novel mutation (c.616G>A) was in the catalytic domain. Bioinformatic tools predicted that this latter is the most pathogenic substitution affecting 3D protein structure and the secondary mRNA structure. Complex genotype composed of different combinations of mutations in each patient seems to be related to the phenotype variability. Copyright © 2017. Published by Elsevier B.V.
Teruya, Kenta; Hattori, Yasunao; Shimamoto, Yasuhiro; Kobayashi, Kazuya; Sanjoh, Akira; Nakagawa, Atsushi; Yamashita, Eiki; Akaji, Kenichi
2016-11-04
Design of inhibitors against severe acute respiratory syndrome (SARS) chymotrypsin-like protease (3CL(pro) ) is a potentially important approach to fight against SARS. We have developed several synthetic inhibitors by structure-based drug design. In this report, we reveal two crystal structures of SARS 3CL(pro) complexed with two new inhibitors based on our previous work. These structures combined with six crystal structures complexed with a series of related ligands reported by us are collectively analyzed. To these eight complexes, the structural basis for inhibitor binding was analyzed by the COMBINE method, which is a chemometrical analysis optimized for the protein-ligand complex. The analysis revealed that the first two latent variables gave a cumulative contribution ratio of r(2) = 0.971. Interestingly, scores using the second latent variables for each complex were strongly correlated with root mean square deviations (RMSDs) of side-chain heavy atoms of Met(49) from those of the intact crystal structure of SARS-3CL(pro) (r = 0.77) enlarging the S2 pocket. The substantial contribution of this side chain (∼10%) for the explanation of pIC50 s was dependent on stereochemistry and the chemical structure of the ligand adapted to the S2 pocket of the protease. Thus, starting from a substrate mimic inhibitor, a design for a central scaffold for a low molecular weight inhibitor was evaluated to develop a further potent inhibitor. © 2015 Wiley Periodicals, Inc. Biopolymers (Pept Sci) 106: 391-403, 2016. © 2015 Wiley Periodicals, Inc.
Hu, Liyan; Pandey, Amit V; Eggimann, Sandra; Rüfenacht, Véronique; Möslinger, Dorothea; Nuoffer, Jean-Marc; Häberle, Johannes
2013-11-29
Argininosuccinic aciduria (ASA) is an autosomal recessive urea cycle disorder caused by deficiency of argininosuccinate lyase (ASL) with a wide clinical spectrum from asymptomatic to severe hyperammonemic neonatal onset life-threatening courses. We investigated the role of ASL transcript variants in the clinical and biochemical variability of ASA. Recombinant proteins for ASL wild type, mutant p.E189G, and the frequently occurring transcript variants with exon 2 or 7 deletions were (co-)expressed in human embryonic kidney 293T cells. We found that exon 2-deleted ASL forms a stable truncated protein with no relevant activity but a dose-dependent dominant negative effect on enzymatic activity after co-expression with wild type or mutant ASL, whereas exon 7-deleted ASL is unstable but seems to have, nevertheless, a dominant negative effect on mutant ASL. These findings were supported by structural modeling predictions for ASL heterotetramer/homotetramer formation. Illustrating the physiological relevance, the predominant occurrence of exon 7-deleted ASL was found in two patients who were both heterozygous for the ASL mutant p.E189G. Our results suggest that ASL transcripts can contribute to the highly variable phenotype in ASA patients if expressed at high levels. Especially, the exon 2-deleted ASL variant may form a heterotetramer with wild type or mutant ASL, causing markedly reduced ASL activity.
2010-01-01
Background Accurate diagnosis is essential for prompt and appropriate treatment of malaria. While rapid diagnostic tests (RDTs) offer great potential to improve malaria diagnosis, the sensitivity of RDTs has been reported to be highly variable. One possible factor contributing to variable test performance is the diversity of parasite antigens. This is of particular concern for Plasmodium falciparum histidine-rich protein 2 (PfHRP2)-detecting RDTs since PfHRP2 has been reported to be highly variable in isolates of the Asia-Pacific region. Methods The pfhrp2 exon 2 fragment from 458 isolates of P. falciparum collected from 38 countries was amplified and sequenced. For a subset of 80 isolates, the exon 2 fragment of histidine-rich protein 3 (pfhrp3) was also amplified and sequenced. DNA sequence and statistical analysis of the variation observed in these genes was conducted. The potential impact of the pfhrp2 variation on RDT detection rates was examined by analysing the relationship between sequence characteristics of this gene and the results of the WHO product testing of malaria RDTs: Round 1 (2008), for 34 PfHRP2-detecting RDTs. Results Sequence analysis revealed extensive variations in the number and arrangement of various repeats encoded by the genes in parasite populations world-wide. However, no statistically robust correlation between gene structure and RDT detection rate for P. falciparum parasites at 200 parasites per microlitre was identified. Conclusions The results suggest that despite extreme sequence variation, diversity of PfHRP2 does not appear to be a major cause of RDT sensitivity variation. PMID:20470441
Integrative Exploratory Analysis of Two or More Genomic Datasets.
Meng, Chen; Culhane, Aedin
2016-01-01
Exploratory analysis is an essential step in the analysis of high throughput data. Multivariate approaches such as correspondence analysis (CA), principal component analysis, and multidimensional scaling are widely used in the exploratory analysis of single dataset. Modern biological studies often assay multiple types of biological molecules (e.g., mRNA, protein, phosphoproteins) on a same set of biological samples, thereby creating multiple different types of omics data or multiassay data. Integrative exploratory analysis of these multiple omics data is required to leverage the potential of multiple omics studies. In this chapter, we describe the application of co-inertia analysis (CIA; for analyzing two datasets) and multiple co-inertia analysis (MCIA; for three or more datasets) to address this problem. These methods are powerful yet simple multivariate approaches that represent samples using a lower number of variables, allowing a more easily identification of the correlated structure in and between multiple high dimensional datasets. Graphical representations can be employed to this purpose. In addition, the methods simultaneously project samples and variables (genes, proteins) onto the same lower dimensional space, so the most variant variables from each dataset can be selected and associated with samples, which can be further used to facilitate biological interpretation and pathway analysis. We applied CIA to explore the concordance between mRNA and protein expression in a panel of 60 tumor cell lines from the National Cancer Institute. In the same 60 cell lines, we used MCIA to perform a cross-platform comparison of mRNA gene expression profiles obtained on four different microarray platforms. Last, as an example of integrative analysis of multiassay or multi-omics data we analyzed transcriptomic, proteomic, and phosphoproteomic data from pluripotent (iPS) and embryonic stem (ES) cell lines.
The evolution of an osmotically inducible dps in the genus Streptomyces.
Facey, Paul D; Hitchings, Matthew D; Williams, Jason S; Skibinski, David O F; Dyson, Paul J; Del Sol, Ricardo
2013-01-01
Dps proteins are found almost ubiquitously in bacterial genomes and there is now an appreciation of their multifaceted roles in various stress responses. Previous studies have shown that this family of proteins assemble into dodecamers and their quaternary structure is entirely critical to their function. Moreover, the numbers of dps genes per bacterial genome is variable; even amongst closely related species - however, for many genera this enigma is yet to be satisfactorily explained. We reconstruct the most probable evolutionary history of Dps in Streptomyces genomes. Typically, these bacteria encode for more than one Dps protein. We offer the explanation that variation in the number of dps per genome among closely related Streptomyces can be explained by gene duplication or lateral acquisition, and the former preceded a subsequent shift in expression patterns for one of the resultant paralogs. We show that the genome of S. coelicolor encodes for three Dps proteins including a tailless Dps. Our in vivo observations show that the tailless protein, unlike the other two Dps in S. coelicolor, does not readily oligomerise. Phylogenetic and bioinformatic analyses combined with expression studies indicate that in several Streptomyces species at least one Dps is significantly over-expressed during osmotic shock, but the identity of the ortholog varies. In silico analysis of dps promoter regions coupled with gene expression studies of duplicated dps genes shows that paralogous gene pairs are expressed differentially and this correlates with the presence of a sigB promoter. Lastly, we identify a rare novel clade of Dps and show that a representative of these proteins in S. coelicolor possesses a dodecameric quaternary structure of high stability.
Proline Hydroxylation in Cell Wall Proteins: Is It Yet Possible to Define Rules?
Duruflé, Harold; Hervé, Vincent; Balliau, Thierry; Zivy, Michel; Dunand, Christophe; Jamet, Elisabeth
2017-01-01
Cell wall proteins (CWPs) play critical and dynamic roles in plant cell walls by contributing to developmental processes and response to environmental cues. Since the CWPs go through the secretion pathway, most of them undergo post-translational modifications (PTMs) which can modify their biological activity. Glycosylation is one of the major PTMs of CWPs and refers to N -glycosylation, O -glycosylation and glypiation. Each of these PTMs occurs in different amino acid contexts which are not all well defined. This article deals with the hydroxylation of Pro residues which is a prerequisite for O -glycosylation of CWPs on hydroxyproline (Hyp) residues. The location of Hyp residues is well described in several structural CWPs, but yet rarely described in other CWPs. In this article, it is studied in detail in five Arabidopsis thaliana proteins using mass spectrometry data: one of them (At4g38770, AtPRP4) is a structural CWP containing 32.5% of Pro residues arranged in typical motifs, the others are either rich (27-28%, At1g31580 and At2g10940) or poor (6-8%, At1g09750 and At3g08030) in Pro residues. The known rules of Pro hydroxylation allowed a good prediction of Hyp location in AtPRP4. However, they could not be applied to the other proteins whatever their Pro content. In addition, variability of the Pro hydroxylation patterns was observed within some amino acid motifs in all the proteins and new patterns of Pro hydroxylation are described. Altogether, this work shows that Hyp residues are present in more protein families than initially described, and that Pro hydroxylation patterns could be different in each of them. This work paves the way for completing the existing Pro hydroxylation code.
SIMS: A Hybrid Method for Rapid Conformational Analysis
Gipson, Bryant; Moll, Mark; Kavraki, Lydia E.
2013-01-01
Proteins are at the root of many biological functions, often performing complex tasks as the result of large changes in their structure. Describing the exact details of these conformational changes, however, remains a central challenge for computational biology due the enormous computational requirements of the problem. This has engendered the development of a rich variety of useful methods designed to answer specific questions at different levels of spatial, temporal, and energetic resolution. These methods fall largely into two classes: physically accurate, but computationally demanding methods and fast, approximate methods. We introduce here a new hybrid modeling tool, the Structured Intuitive Move Selector (sims), designed to bridge the divide between these two classes, while allowing the benefits of both to be seamlessly integrated into a single framework. This is achieved by applying a modern motion planning algorithm, borrowed from the field of robotics, in tandem with a well-established protein modeling library. sims can combine precise energy calculations with approximate or specialized conformational sampling routines to produce rapid, yet accurate, analysis of the large-scale conformational variability of protein systems. Several key advancements are shown, including the abstract use of generically defined moves (conformational sampling methods) and an expansive probabilistic conformational exploration. We present three example problems that sims is applied to and demonstrate a rapid solution for each. These include the automatic determination of “active” residues for the hinge-based system Cyanovirin-N, exploring conformational changes involving long-range coordinated motion between non-sequential residues in Ribose-Binding Protein, and the rapid discovery of a transient conformational state of Maltose-Binding Protein, previously only determined by Molecular Dynamics. For all cases we provide energetic validations using well-established energy fields, demonstrating this framework as a fast and accurate tool for the analysis of a wide range of protein flexibility problems. PMID:23935893
Thrombin-like enzymes from snake venom: Structural characterization and mechanism of action.
Ullah, Anwar; Masood, Rehana; Ali, Ijaz; Ullah, Kifayat; Ali, Hamid; Akbar, Haji; Betzel, Christian
2018-07-15
Snake venom thrombin-like enzymes (SVTLEs) constitute the major portion (10-24%) of snake venom and these are the second most abundant enzymes present in the crude venom. During envenomation, these enzymes had shown prominently the various pathological effects, such as disturbance in hemostatic system, fibrinogenolysis, fibrinolysis, platelet aggregation, thrombosis, neurologic disorders, activation of coagulation factors, coagulant, procoagulant etc. These enzymes also been used as a therapeutic agent for the treatment of various diseases such as congestive heart failure, ischemic stroke, thrombotic disorders etc. Although the crystal structures of five SVTLEs are available in the Protein Data Bank (PDB), there is no single article present in the literature that has described all of them. The current work describes the structural aspects, structure-based mechanism of action, processing and inhibition of these enzymes. The sequence analysis indicates that these enzymes show a high sequence identity (57-85%) with each other and low sequence identity with trypsin (36-43%), human alpha-thrombin (29-36%) and other snake venom serine proteinases (57-85%). Three-dimensional structural analysis indicates that the loops surrounding the active site are variable both in amino acids composition and length that may convey variable substrate specificity to these enzymes. The surface charge distributions also vary in these enzymes. Docking analysis with suramin shows that this inhibitor preferably binds to the C-terminal region of these enzymes and causes the destabilization of their three-dimensional structure. Copyright © 2018 Elsevier B.V. All rights reserved.
Lee, Hasup; Baek, Minkyung; Lee, Gyu Rie; Park, Sangwoo; Seok, Chaok
2017-03-01
Many proteins function as homo- or hetero-oligomers; therefore, attempts to understand and regulate protein functions require knowledge of protein oligomer structures. The number of available experimental protein structures is increasing, and oligomer structures can be predicted using the experimental structures of related proteins as templates. However, template-based models may have errors due to sequence differences between the target and template proteins, which can lead to functional differences. Such structural differences may be predicted by loop modeling of local regions or refinement of the overall structure. In CAPRI (Critical Assessment of PRotein Interactions) round 30, we used recently developed features of the GALAXY protein modeling package, including template-based structure prediction, loop modeling, model refinement, and protein-protein docking to predict protein complex structures from amino acid sequences. Out of the 25 CAPRI targets, medium and acceptable quality models were obtained for 14 and 1 target(s), respectively, for which proper oligomer or monomer templates could be detected. Symmetric interface loop modeling on oligomer model structures successfully improved model quality, while loop modeling on monomer model structures failed. Overall refinement of the predicted oligomer structures consistently improved the model quality, in particular in interface contacts. Proteins 2017; 85:399-407. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Protein flexibility in the light of structural alphabets
Craveur, Pierrick; Joseph, Agnel P.; Esque, Jeremy; Narwani, Tarun J.; Noël, Floriane; Shinada, Nicolas; Goguet, Matthieu; Leonard, Sylvain; Poulain, Pierre; Bertrand, Olivier; Faure, Guilhem; Rebehmed, Joseph; Ghozlane, Amine; Swapna, Lakshmipuram S.; Bhaskara, Ramachandra M.; Barnoud, Jonathan; Téletchéa, Stéphane; Jallu, Vincent; Cerny, Jiri; Schneider, Bohdan; Etchebest, Catherine; Srinivasan, Narayanaswamy; Gelly, Jean-Christophe; de Brevern, Alexandre G.
2015-01-01
Protein structures are valuable tools to understand protein function. Nonetheless, proteins are often considered as rigid macromolecules while their structures exhibit specific flexibility, which is essential to complete their functions. Analyses of protein structures and dynamics are often performed with a simplified three-state description, i.e., the classical secondary structures. More precise and complete description of protein backbone conformation can be obtained using libraries of small protein fragments that are able to approximate every part of protein structures. These libraries, called structural alphabets (SAs), have been widely used in structure analysis field, from definition of ligand binding sites to superimposition of protein structures. SAs are also well suited to analyze the dynamics of protein structures. Here, we review innovative approaches that investigate protein flexibility based on SAs description. Coupled to various sources of experimental data (e.g., B-factor) and computational methodology (e.g., Molecular Dynamic simulation), SAs turn out to be powerful tools to analyze protein dynamics, e.g., to examine allosteric mechanisms in large set of structures in complexes, to identify order/disorder transition. SAs were also shown to be quite efficient to predict protein flexibility from amino-acid sequence. Finally, in this review, we exemplify the interest of SAs for studying flexibility with different cases of proteins implicated in pathologies and diseases. PMID:26075209
Katsura, Kazushige; Matsuda, Takayoshi; Tomabechi, Yuri; Yonemochi, Mayumi; Hanada, Kazuharu; Ohsawa, Noboru; Sakamoto, Kensaku; Takemoto, Chie; Shirouzu, Mikako
2017-11-01
Cell-free protein synthesis is a useful method for preparing proteins for functional or structural analyses. However, batch-to-batch variability with regard to protein synthesis activity remains a problem for large-scale production of cell extract in the laboratory. To address this issue, we have developed a novel procedure for large-scale preparation of bacterial cell extract with high protein synthesis activity. The developed procedure comprises cell cultivation using a fermentor, harvesting and washing of cells by tangential flow filtration, cell disruption with high-pressure homogenizer and continuous diafiltration. By optimizing and combining these methods, ∼100 ml of the cell extract was prepared from 150 g of Escherichia coli cells. The protein synthesis activities, defined as the yield of protein per unit of absorbance at 260 nm of the cell extract, were shown to be reproducible, and the average activity of several batches was twice that obtained using a previously reported method. In addition, combinatorial use of the high-pressure homogenizer and diafiltration increased the scalability, indicating that the cell concentration at disruption varies from 0.04 to 1 g/ml. Furthermore, addition of Gam protein and examinations of the N-terminal sequence rendered the extract prepared here useful for rapid screening with linear DNA templates. © The Authors 2017. Published by Oxford University Press on behalf of the Japanese Biochemical Society. All rights reserved.
Murdin, A D; Su, H; Klein, M H; Caldwell, H D
1995-01-01
Trachoma and sexually transmitted diseases caused by Chlamydia trachomatis are major health problems worldwide. Epitopes from the variable domains of the major outer membrane protein are candidates for vaccine development. We have constructed hybrid polioviruses expressing sequences from major outer membrane protein variable domains I and IV. Antisera to the hybrids could, in combination, strongly neutralize 8 of the 12 C. trachomatis serovars most commonly associated with oculogenital infections and weakly neutralize the others. PMID:7532625
Hayashi, Yukiko
2013-01-01
Myofibrillar myopathy (MFM) is a group of hereditary disorders pathologically characterized by focal disorganizations of myofibril structures with cytoplasmic inclusions. Most of the diseases so-called desmin-related or storage myopathy, cytoplasmic body myopathy, spheroid body myopathy, reducing body myopathy, and hyaline body myopathy are included in MFM. Several causative genes have been identified such as DES, CRYAB, MYOT, ZASP, BAG3, FLNC, DNAJB6, FHL1, TTN, and VCP. Most of these genes encode Z-line related proteins or proteins associated with protein quality control. Since MFM is the name from pathological characteristics, clinical features of the patients including the age at disease onset, affected muscles, disease course, and complications are quite variable. In this paper, characteristic clinical and pathological features of each causative gene are summarized. Unexpectedly, hereditary myopathy with early respiratory failure (HMERF) caused by mutation in the A-band region of TTN is the most common cause of MFM in our cohort. Despite of intensive mutation screening, the causative gene of more than 60% of MFM patients is still unknown. Further identification of novel causative genes and elucidate pathomechanisms of protein aggregation in necessary.
Gorelik, Maryna; Orlicky, Stephen; Sartori, Maria A.; ...
2016-03-14
Skp1–Cul1–F-box (SCF) E3 ligases play key roles in multiple cellular processes through ubiquitination and subsequent degradation of substrate proteins. Although Skp1 and Cul1 are invariant components of all SCF complexes, the 69 different human F-box proteins are variable substrate binding modules that determine specificity. SCF E3 ligases are activated in many cancers and inhibitors could have therapeutic potential. Here, we used phage display to develop specific ubiquitin-based inhibitors against two F-box proteins, Fbw7 and Fbw11. Unexpectedly, the ubiquitin variants bind at the interface of Skp1 and F-box proteins and inhibit ligase activity by preventing Cul1 binding to the same surface.more » Using structure-based design and phage display, we modified the initial inhibitors to generate broad-spectrum inhibitors that targeted many SCF ligases, or conversely, a highly specific inhibitor that discriminated between even the close homologs Fbw11 and Fbw1. We propose that most F-box proteins can be targeted by this approach for basic research and for potential cancer therapies.« less
Clarke, Richard W; Monnier, Nilah; Li, Haitao; Zhou, Dejian; Browne, Helena; Klenerman, David
2007-08-15
We present a single virion method to determine absolute distributions of copy number in the protein composition of viruses and apply it to herpes simplex virus type 1. Using two-color coincidence fluorescence spectroscopy, we determine the virion-to-virion variability in copy numbers of fluorescently labeled tegument and envelope proteins relative to a capsid protein by analyzing fluorescence intensity ratios for ensembles of individual dual-labeled virions and fitting the resulting histogram of ratios. Using EYFP-tagged capsid protein VP26 as a reference for fluorescence intensity, we are able to calculate the mean and also, for the first time to our knowledge, the variation in numbers of gD, VP16, and VP22 tegument. The measurement of the number of glycoprotein D molecules was in good agreement with independent measurements of average numbers of these glycoproteins in bulk virus preparations, validating the method. The accuracy, straightforward data processing, and high throughput of this technique make it widely applicable to the analysis of the molecular composition of large complexes in general, and it is particularly suited to providing insights into virus structure, assembly, and infectivity.
A Template-Based Protein Structure Reconstruction Method Using Deep Autoencoder Learning.
Li, Haiou; Lyu, Qiang; Cheng, Jianlin
2016-12-01
Protein structure prediction is an important problem in computational biology, and is widely applied to various biomedical problems such as protein function study, protein design, and drug design. In this work, we developed a novel deep learning approach based on a deeply stacked denoising autoencoder for protein structure reconstruction. We applied our approach to a template-based protein structure prediction using only the 3D structural coordinates of homologous template proteins as input. The templates were identified for a target protein by a PSI-BLAST search. 3DRobot (a program that automatically generates diverse and well-packed protein structure decoys) was used to generate initial decoy models for the target from the templates. A stacked denoising autoencoder was trained on the decoys to obtain a deep learning model for the target protein. The trained deep model was then used to reconstruct the final structural model for the target sequence. With target proteins that have highly similar template proteins as benchmarks, the GDT-TS score of the predicted structures is greater than 0.7, suggesting that the deep autoencoder is a promising method for protein structure reconstruction.
Berjón-Otero, Mónica; Lechuga, Ana; Mehla, Jitender; Uetz, Peter
2017-01-01
ABSTRACT The family Tectiviridae comprises a group of tailless, icosahedral, membrane-containing bacteriophages that can be divided into two groups by their hosts, either Gram-negative or Gram-positive bacteria. While the first group is composed of PRD1 and nearly identical well-characterized lytic viruses, the second one includes more variable temperate phages, like GIL16 or Bam35, whose hosts are Bacillus cereus and related Gram-positive bacteria. In the genome of Bam35, nearly half of the 32 annotated open reading frames (ORFs) have no homologs in databases (ORFans), being putative proteins of unknown function, which hinders the understanding of their biology. With the aim of increasing knowledge about the viral proteome, we carried out a comprehensive yeast two-hybrid analysis of all the putative proteins encoded by the Bam35 genome. The resulting protein interactome comprised 76 unique interactions among 24 proteins, of which 12 have an unknown function. These results suggest that the P17 protein is the minor capsid protein of Bam35 and P24 is the penton protein, with the latter finding also being supported by iterative threading protein modeling. Moreover, the inner membrane transglycosylase protein P26 could have an additional structural role. We also detected interactions involving nonstructural proteins, such as the DNA-binding protein P1 and the genome terminal protein (P4), which was confirmed by coimmunoprecipitation of recombinant proteins. Altogether, our results provide a functional view of the Bam35 viral proteome, with a focus on the composition and organization of the viral particle. IMPORTANCE Tailless viruses of the family Tectiviridae can infect commensal and pathogenic Gram-positive and Gram-negative bacteria. Moreover, they have been proposed to be at the evolutionary origin of several groups of large eukaryotic DNA viruses and self-replicating plasmids. However, due to their ancient origin and complex diversity, many tectiviral proteins are ORFans of unknown function. Comprehensive protein-protein interaction (PPI) analysis of viral proteins can eventually disclose biological mechanisms and thus provide new insights into protein function unattainable by studying proteins one by one. Here we comprehensively describe intraviral PPIs among tectivirus Bam35 proteins determined using multivector yeast two-hybrid screening, and these PPIs were further supported by the results of coimmunoprecipitation assays and protein structural models. This approach allowed us to propose new functions for known proteins and hypothesize about the biological role of the localization of some viral ORFan proteins within the viral particle that will be helpful for understanding the biology of tectiviruses infecting Gram-positive bacteria. PMID:28747494
Protein structure similarity from Principle Component Correlation analysis.
Zhou, Xiaobo; Chou, James; Wong, Stephen T C
2006-01-25
Owing to rapid expansion of protein structure databases in recent years, methods of structure comparison are becoming increasingly effective and important in revealing novel information on functional properties of proteins and their roles in the grand scheme of evolutionary biology. Currently, the structural similarity between two proteins is measured by the root-mean-square-deviation (RMSD) in their best-superimposed atomic coordinates. RMSD is the golden rule of measuring structural similarity when the structures are nearly identical; it, however, fails to detect the higher order topological similarities in proteins evolved into different shapes. We propose new algorithms for extracting geometrical invariants of proteins that can be effectively used to identify homologous protein structures or topologies in order to quantify both close and remote structural similarities. We measure structural similarity between proteins by correlating the principle components of their secondary structure interaction matrix. In our approach, the Principle Component Correlation (PCC) analysis, a symmetric interaction matrix for a protein structure is constructed with relationship parameters between secondary elements that can take the form of distance, orientation, or other relevant structural invariants. When using a distance-based construction in the presence or absence of encoded N to C terminal sense, there are strong correlations between the principle components of interaction matrices of structurally or topologically similar proteins. The PCC method is extensively tested for protein structures that belong to the same topological class but are significantly different by RMSD measure. The PCC analysis can also differentiate proteins having similar shapes but different topological arrangements. Additionally, we demonstrate that when using two independently defined interaction matrices, comparison of their maximum eigenvalues can be highly effective in clustering structurally or topologically similar proteins. We believe that the PCC analysis of interaction matrix is highly flexible in adopting various structural parameters for protein structure comparison.
Loving, Kathryn A.; Lin, Andy; Cheng, Alan C.
2014-01-01
Advances reported over the last few years and the increasing availability of protein crystal structure data have greatly improved structure-based druggability approaches. However, in practice, nearly all druggability estimation methods are applied to protein crystal structures as rigid proteins, with protein flexibility often not directly addressed. The inclusion of protein flexibility is important in correctly identifying the druggability of pockets that would be missed by methods based solely on the rigid crystal structure. These include cryptic pockets and flexible pockets often found at protein-protein interaction interfaces. Here, we apply an approach that uses protein modeling in concert with druggability estimation to account for light protein backbone movement and protein side-chain flexibility in protein binding sites. We assess the advantages and limitations of this approach on widely-used protein druggability sets. Applying the approach to all mammalian protein crystal structures in the PDB results in identification of 69 proteins with potential druggable cryptic pockets. PMID:25079060
Diehl, Carl; Engström, Olof; Delaine, Tamara; Håkansson, Maria; Genheden, Samuel; Modig, Kristofer; Leffler, Hakon; Ryde, Ulf; Nilsson, Ulf J; Akke, Mikael
2010-10-20
Rational drug design is predicated on knowledge of the three-dimensional structure of the protein-ligand complex and the thermodynamics of ligand binding. Despite the fundamental importance of both enthalpy and entropy in driving ligand binding, the role of conformational entropy is rarely addressed in drug design. In this work, we have probed the conformational entropy and its relative contribution to the free energy of ligand binding to the carbohydrate recognition domain of galectin-3. Using a combination of NMR spectroscopy, isothermal titration calorimetry, and X-ray crystallography, we characterized the binding of three ligands with dissociation constants ranging over 2 orders of magnitude. (15)N and (2)H spin relaxation measurements showed that the protein backbone and side chains respond to ligand binding by increased conformational fluctuations, on average, that differ among the three ligand-bound states. Variability in the response to ligand binding is prominent in the hydrophobic core, where a distal cluster of methyl groups becomes more rigid, whereas methyl groups closer to the binding site become more flexible. The results reveal an intricate interplay between structure and conformational fluctuations in the different complexes that fine-tunes the affinity. The estimated change in conformational entropy is comparable in magnitude to the binding enthalpy, demonstrating that it contributes favorably and significantly to ligand binding. We speculate that the relatively weak inherent protein-carbohydrate interactions and limited hydrophobic effect associated with oligosaccharide binding might have exerted evolutionary pressure on carbohydrate-binding proteins to increase the affinity by means of conformational entropy.
Fu, Xinmiao; Chang, Zengyi
2004-04-02
Small heat shock proteins (sHsps) usually exist as oligomers that undergo dynamic oligomeric dissociation/re-association, with the dissociated oligomers as active forms to bind substrate proteins under heat shock conditions. In this study, however, we found that Hsp16.3, one sHsp from Mycobacterium tuberculosis, is able to sensitively modulate its chaperone-like activity in a range of physiological temperatures (from 25 to 37.5 degrees C) while its native oligomeric size is still maintained. Further analysis demonstrated that Hsp16.3 exposes higher hydrophobic surfaces upon temperatures increasing and that a large soluble complex between Hsp16.3 and substrate is formed only in the condition of heating temperature up to 35 and 37.5 degrees C. Structural analysis by fluorescence anisotropy showed that Hsp16.3 nonameric structure becomes more dynamic and variable at elevated temperatures. Moreover, subunit exchange between Hsp16.3 oligomers was found to occur faster upon temperatures increasing as revealed by fluorescence energy resonance transfer. These observations indicate that Hsp16.3 is able to modulate its chaperone activity by adjusting the dynamics of oligomeric dissociation/re-association process while maintaining its static oligomeric size unchangeable. A kinetic model is therefore proposed to explain the mechanism of sHsps-binding substrate proteins through oligomeric dissociation. The present study also implied that Hsp16.3 is at least capable of binding non-native proteins in vivo while expressing in the host organism that survives at 37 degrees C.
Dynamics of the Tec‐family tyrosine kinase SH3 domains
Roberts, Justin M.; Tarafdar, Sreya; Joseph, Raji E.; Andreotti, Amy H.; Smithgall, Thomas E.; Engen, John R.
2016-01-01
Abstract The Src Homology 3 (SH3) domain is an important regulatory domain found in many signaling proteins. X‐ray crystallography and NMR structures of SH3 domains are generally conserved but other studies indicate that protein flexibility and dynamics are not. We previously reported that based on hydrogen exchange mass spectrometry (HX MS) studies, there is variable flexibility and dynamics among the SH3 domains of the Src‐family tyrosine kinases and related proteins. Here we have extended our studies to the SH3 domains of the Tec family tyrosine kinases (Itk, Btk, Tec, Txk, Bmx). The SH3 domains of members of this family augment the variety in dynamics observed in previous SH3 domains. Txk and Bmx SH3 were found to be highly dynamic in solution by HX MS and Bmx was unstructured by NMR. Itk and Btk SH3 underwent a clear EX1 cooperative unfolding event, which was localized using pepsin digestion and mass spectrometry after hydrogen exchange labeling. The unfolding was localized to peptide regions that had been previously identified in the Src‐family and related protein SH3 domains, yet the kinetics of unfolding were not. Sequence alignment does not provide an easy explanation for the observed dynamics behavior, yet the similarity of location of EX1 unfolding suggests that higher‐order structural properties may play a role. While the exact reason for such dynamics is not clear, such motions can be exploited in intra‐ and intermolecular binding assays of proteins containing the domains. PMID:26808198
Ramírez-Anguita, Juan Manuel; Rodríguez-Espigares, Ismael; Guixà-González, Ramon; Bruno, Agostino; Torrens-Fontanals, Mariona; Varela-Rial, Alejandro; Selent, Jana
2018-01-01
The serotonin 5-hydroxytryptamine 2A (5-HT 2A ) receptor is a G-protein-coupled receptor (GPCR) relevant for the treatment of CNS disorders. In this regard, neuronal membrane composition in the brain plays a crucial role in the modulation of the receptor functioning. Since cholesterol is an essential component of neuronal membranes, we have studied its effect on the 5-HT 2A receptor dynamics through all-atom MD simulations. We find that the presence of cholesterol in the membrane increases receptor conformational variability in most receptor segments. Importantly, detailed structural analysis indicates that conformational variability goes along with the destabilization of hydrogen bonding networks not only within the receptor but also between receptor and lipids. In addition to increased conformational variability, we also find receptor segments with reduced variability. Our analysis suggests that this increased stabilization is the result of stabilizing effects of tightly bound cholesterol molecules to the receptor surface. Our finding contributes to a better understanding of membrane-induced alterations of receptor dynamics and points to cholesterol-induced stabilizing and destabilizing effects on the conformational variability of GPCRs. © 2017 International Union of Biochemistry and Molecular Biology, Inc.
Crystal structure of Spy0129, a Streptococcus pyogenes class B sortase involved in pilus assembly.
Kang, Hae Joo; Coulibaly, Fasséli; Proft, Thomas; Baker, Edward N
2011-01-11
Sortase enzymes are cysteine transpeptidases that mediate the covalent attachment of substrate proteins to the cell walls of gram-positive bacteria, and thereby play a crucial role in virulence, infection and colonisation by pathogens. Many cell-surface proteins are anchored by the housekeeping sortase SrtA but other more specialised sortases exist that attach sub-sets of proteins or function in pilus assembly. The sortase Spy0129, or SrtC1, from the M1 SF370 strain of Streptococcus pyogenes is responsible for generating the covalent linkages between the pilin subunits in the pili of this organism. The crystal structure of Spy0129 has been determined at 2.3 Å resolution (R = 20.4%, Rfree = 26.0%). The structure shows that Spy0129 is a class B sortase, in contrast to other characterised pilin polymerases, which belong to class C. Spy0129 lacks a flap believed to function in substrate recognition in class C enzymes and instead has an elaborated β6/β7 loop. The two independent Spy0129 molecules in the crystal show differences in the positions and orientations of the catalytic Cys and His residues, Cys221 and His126, correlated with movements of the β7/β8 and β4/β5 loops that respectively follow these residues. Bound zinc ions stabilise these alternative conformations in the crystal. This conformational variability is likely to be important for function although there is no evidence that zinc is involved in vivo.
Sperm-Associated Antigen–17 Gene Is Essential for Motile Cilia Function and Neonatal Survival
Teves, Maria Eugenia; Zhang, Zhibing; Costanzo, Richard M.; Henderson, Scott C.; Corwin, Frank D.; Zweit, Jamal; Sundaresan, Gobalakrishnan; Subler, Mark; Salloum, Fadi N.; Rubin, Bruce K.
2013-01-01
Primary ciliary dyskinesia (PCD), resulting from defects in cilia assembly or motility, is caused by mutations in a number of genes encoding axonemal proteins. PCD phenotypes are variable, and include recurrent respiratory tract infections, bronchiectasis, hydrocephaly, situs inversus, and male infertility. We generated knockout mice for the sperm-associated antigen–17 (Spag17) gene, which encodes a central pair (CP) protein present in the axonemes of cells with “9 + 2” motile cilia or flagella. The targeting of Spag17 resulted in a severe phenotype characterized by immotile nasal and tracheal cilia, reduced clearance of nasal mucus, profound respiratory distress associated with lung fluid accumulation and disruption of the alveolar epithelium, cerebral ventricular expansion consistent with emerging hydrocephalus, failure to suckle, and neonatal demise within 12 hours of birth. Ultrastructural analysis revealed the loss of one CP microtubule in approximately one quarter of tracheal cilia axonemes, an absence of a C1 microtubule projection, and other less frequent CP structural abnormalities. SPAG6 and SPAG16 (CP proteins that interact with SPAG17) were increased in tracheal tissue from SPAG17-deficient mice. We conclude that Spag17 plays a critical role in the function and structure of motile cilia, and that neonatal lethality is likely explained by impaired airway mucociliary clearance. PMID:23418344
2015-01-01
Biological assays formatted as microarrays have become a critical tool for the generation of the comprehensive data sets required for systems-level understanding of biological processes. Manual annotation of data extracted from images of microarrays, however, remains a significant bottleneck, particularly for protein microarrays due to the sensitivity of this technology to weak artifact signal. In order to automate the extraction and curation of data from protein microarrays, we describe an algorithm called Crossword that logically combines information from multiple approaches to fully automate microarray segmentation. Automated artifact removal is also accomplished by segregating structured pixels from the background noise using iterative clustering and pixel connectivity. Correlation of the location of structured pixels across image channels is used to identify and remove artifact pixels from the image prior to data extraction. This component improves the accuracy of data sets while reducing the requirement for time-consuming visual inspection of the data. Crossword enables a fully automated protocol that is robust to significant spatial and intensity aberrations. Overall, the average amount of user intervention is reduced by an order of magnitude and the data quality is increased through artifact removal and reduced user variability. The increase in throughput should aid the further implementation of microarray technologies in clinical studies. PMID:24417579
Tarafder, Sumit; Toukir Ahmed, Md; Iqbal, Sumaiya; Tamjidul Hoque, Md; Sohel Rahman, M
2018-03-14
Accessible surface area (ASA) of a protein residue is an effective feature for protein structure prediction, binding region identification, fold recognition problems etc. Improving the prediction of ASA by the application of effective feature variables is a challenging but explorable task to consider, specially in the field of machine learning. Among the existing predictors of ASA, REGAd 3 p is a highly accurate ASA predictor which is based on regularized exact regression with polynomial kernel of degree 3. In this work, we present a new predictor RBSURFpred, which extends REGAd 3 p on several dimensions by incorporating 58 physicochemical, evolutionary and structural properties into 9-tuple peptides via Chou's general PseAAC, which allowed us to obtain higher accuracies in predicting both real-valued and binary ASA. We have compared RBSURFpred for both real and binary space predictions with state-of-the-art predictors, such as REGAd 3 p and SPIDER2. We also have carried out a rigorous analysis of the performance of RBSURFpred in terms of different amino acids and their properties, and also with biologically relevant case-studies. The performance of RBSURFpred establishes itself as a useful tool for the community. Copyright © 2018 Elsevier Ltd. All rights reserved.
What are the structural features that drive partitioning of proteins in aqueous two-phase systems?
Wu, Zhonghua; Hu, Gang; Wang, Kui; Zaslavsky, Boris Yu; Kurgan, Lukasz; Uversky, Vladimir N
2017-01-01
Protein partitioning in aqueous two-phase systems (ATPSs) represents a convenient, inexpensive, and easy to scale-up protein separation technique. Since partition behavior of a protein dramatically depends on an ATPS composition, it would be highly beneficial to have reliable means for (even qualitative) prediction of partitioning of a target protein under different conditions. Our aim was to understand which structural features of proteins contribute to partitioning of a query protein in a given ATPS. We undertook a systematic empirical analysis of relations between 57 numerical structural descriptors derived from the corresponding amino acid sequences and crystal structures of 10 well-characterized proteins and the partition behavior of these proteins in 29 different ATPSs. This analysis revealed that just a few structural characteristics of proteins can accurately determine behavior of these proteins in a given ATPS. However, partition behavior of proteins in different ATPSs relies on different structural features. In other words, we could not find a unique set of protein structural features derived from their crystal structures that could be used for the description of the protein partition behavior of all proteins in all ATPSs analyzed in this study. We likely need to gain better insight into relationships between protein-solvent interactions and protein structure peculiarities, in particular given limitations of the used here crystal structures, to be able to construct a model that accurately predicts protein partition behavior across all ATPSs. Copyright © 2016 Elsevier B.V. All rights reserved.
In Silico Studies of Medicinal Compounds Against Hepatitis C Capsid Protein from North India
Mathew, Shilu; Faheem, Muhammad; Archunan, Govindaraju; Ilyas, Muhammad; Begum, Nargis; Jahangir, Syed; Qadri, Ishtiaq; Qahtani, Mohammad Al; Mathew, Shiny
2014-01-01
Hepatitis viral infection is a leading cause of chronic hepatitis, cirrhosis, and hepatocellular carcinoma (HCC). Over one million people are estimated to be persistently infected with hepatitis C virus (HCV) worldwide. As capsid core protein is the key element in spreading HCV; hence, it is considered to be the superlative target of antiviral compounds. Novel drug inhibitors of HCV are in need to complement or replace the current treatments such as pegylated interferon’s and ribavirin as they are partially booming and beset with various side effects. Our study was conducted to predict 3D structure of capsid core protein of HCV from northern part of India. Core, the capsid protein of HCV, handles the assembly and packaging of HCV RNA genome and is the least variable of all the ten HCV proteins among the six HCV genotypes. Therefore, we screened four phytochemicals inhibitors that are known to disrupt the interactions of core and other HCV proteins such as (a) epigallocatechin gallate (EGCG), (b) ladanein, (c) naringenin, and (d) silybin extracted from medicinal plants; targeted against active site of residues of HCV-genotype 3 (G3) (Q68867) and its subtypes 3b (Q68861) and 3g (Q68865) from north India. To study the inhibitory activity of the recruited flavonoids, we conducted a quantitative structure–activity relationship (QSAR). Furthermore, docking interaction suggests that EGCG showed a maximum number of hydrogen bond (H-bond) interactions with all the three modeled capsid proteins with high interaction energy followed by naringenin and silybin. Thus, our results strongly correlate the inhibitory activity of the selected bioflavonoid. Finally, the dynamic predicted capsid protein molecule of HCV virion provides a general avenue to target structure-based antiviral compounds that support the hypothesis that the screened inhibitors for viral capsid might constitute new class of potent agents but further confirmation is necessary using in vitro and in vivo studies. PMID:25002815
NASA Astrophysics Data System (ADS)
An, Bo
The extraordinary mechanical properties of orb-weaving spider silks have served spiders for over 400 million years. However, only in the late 20th century did we start to understand the molecular nature of spider silk that contributes to its incredible properties as biomaterials. Among all seven types of spider silks, major ampullate silk from typical orb-weaving spiders is the toughest of all, it consists of primarily two proteins: MaSp1 and MaSp2. Variable ratios and conserved motifs of these two proteins in all the native spider silks demonstrate the significant role of MaSp1 and MaSp2 in controlling the mechanical properties of the fiber. The amino acid sequences of the orb weaving spider silk proteins have remained almost unchanged for more than 100 million years. Interestingly, MaSp1 and MaSp2 are the only two components in all studied dragline silk fibers from these spiders. The mechanical properties of native dragline silk vary slightly between species, which are believed to relate to the ratio of MaSp1 to MaSp2 in the silk. Both of these facts clearly indicate the importance of these two proteins to the mechanical properties of the fiber. Various types of synthetic spider silk fibers have been produced and studied in an effort to mass-produce man-made fibers with qualities comparable to native spider silk. To investigate the roles of MaSp1 and MaSp2 in silk fiber, synthetic MaSp1 (major abundant protein in Nephila clavipes major ampullate silks) only fibers, MaSp1/MaSp2 protein mixture fibers and chimeric protein fibers with both MaSp1 and MaSp2 sequence features have been produced and tested for mechanical properties. Solid-State Nuclear Magnetic Resonance was used to characterize the structure of silk fibers and reveal the relation between fiber spatial structure and mechanical properties.
Unified Deep Learning Architecture for Modeling Biology Sequence.
Wu, Hongjie; Cao, Chengyuan; Xia, Xiaoyan; Lu, Qiang
2017-10-09
Prediction of the spatial structure or function of biological macromolecules based on their sequence remains an important challenge in bioinformatics. When modeling biological sequences using traditional sequencing models, characteristics, such as long-range interactions between basic units, the complicated and variable output of labeled structures, and the variable length of biological sequences, usually lead to different solutions on a case-by-case basis. This study proposed the use of bidirectional recurrent neural networks based on long short-term memory or a gated recurrent unit to capture long-range interactions by designing the optional reshape operator to adapt to the diversity of the output labels and implementing a training algorithm to support the training of sequence models capable of processing variable-length sequences. Additionally, the merge and pooling operators enhanced the ability to capture short-range interactions between basic units of biological sequences. The proposed deep-learning model and its training algorithm might be capable of solving currently known biological sequence-modeling problems through the use of a unified framework. We validated our model on one of the most difficult biological sequence-modeling problems currently known, with our results indicating the ability of the model to obtain predictions of protein residue interactions that exceeded the accuracy of current popular approaches by 10% based on multiple benchmarks.
Ban, Yajing; L Prates, Luciana; Yu, Peiqiang
2017-10-18
This study was conducted to (1) determine protein and carbohydrate molecular structure profiles and (2) quantify the relationship between structural features and protein bioavailability of newly developed carinata and canola seeds for dairy cows by using Fourier transform infrared molecular spectroscopy. Results showed similarity in protein structural makeup within the entire protein structural region between carinata and canola seeds. The highest area ratios related to structural CHO, total CHO, and cellulosic compounds were obtained for carinata seeds. Carinata and canola seeds showed similar carbohydrate and protein molecular structures by multivariate analyses. Carbohydrate molecular structure profiles were highly correlated to protein rumen degradation and intestinal digestion characteristics. In conclusion, the molecular spectroscopy can detect inherent structural characteristics in carinata and canola seeds in which carbohydrate-relative structural features are related to protein metabolism and utilization. Protein and carbohydrate spectral profiles could be used as predictors of rumen protein bioavailability in cows.
Exploring Human Diseases and Biological Mechanisms by Protein Structure Prediction and Modeling.
Wang, Juexin; Luttrell, Joseph; Zhang, Ning; Khan, Saad; Shi, NianQing; Wang, Michael X; Kang, Jing-Qiong; Wang, Zheng; Xu, Dong
2016-01-01
Protein structure prediction and modeling provide a tool for understanding protein functions by computationally constructing protein structures from amino acid sequences and analyzing them. With help from protein prediction tools and web servers, users can obtain the three-dimensional protein structure models and gain knowledge of functions from the proteins. In this chapter, we will provide several examples of such studies. As an example, structure modeling methods were used to investigate the relation between mutation-caused misfolding of protein and human diseases including epilepsy and leukemia. Protein structure prediction and modeling were also applied in nucleotide-gated channels and their interaction interfaces to investigate their roles in brain and heart cells. In molecular mechanism studies of plants, rice salinity tolerance mechanism was studied via structure modeling on crucial proteins identified by systems biology analysis; trait-associated protein-protein interactions were modeled, which sheds some light on the roles of mutations in soybean oil/protein content. In the age of precision medicine, we believe protein structure prediction and modeling will play more and more important roles in investigating biomedical mechanism of diseases and drug design.
Clinico-pathological and biological prognostic variables in squamous cell carcinoma of the vulva.
Gadducci, Angiolo; Tana, Roberta; Barsotti, Cecilia; Guerrieri, Maria Elena; Genazzani, Andrea Riccardo
2012-07-01
Several clinical-pathological parameters have been related to survival of patients with invasive squamous cell carcinoma of the vulva, whereas few studies have investigated the ability of biological variables to predict the clinical outcome of these patients. The present paper reviews the literature data on the prognostic relevance of lymph node-related parameters, primary tumor-related parameters, FIGO stage, blood variables, and tissue biological variables. Regarding these latter, the paper takes into account the analysis of DNA content, cell cycle-regulatory proteins, apoptosis-related proteins, epidermal growth factor receptor [EGFR], and proteins that are involved in tumor invasiveness, metastasis and angiogenesis. At present, the lymph node status and FIGO stage according to the new 2009 classification system are the main predictors for vulvar squamous cell carcinoma, whereas biological variables do not have yet a clinical relevance and their role is still investigational. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Alakent, Burak; Camurdan, Mehmet C.; Doruker, Pemra
2005-10-01
Time series models, which are constructed from the projections of the molecular-dynamics (MD) runs on principal components (modes), are used to mimic the dynamics of two proteins: tendamistat and immunity protein of colicin E7 (ImmE7). Four independent MD runs of tendamistat and three independent runs of ImmE7 protein in vacuum are used to investigate the energy landscapes of these proteins. It is found that mean-square displacements of residues along the modes in different time scales can be mimicked by time series models, which are utilized in dividing protein dynamics into different regimes with respect to the dominating motion type. The first two regimes constitute the dominance of intraminimum motions during the first 5ps and the random walk motion in a hierarchically higher-level energy minimum, which comprise the initial time period of the trajectories up to 20-40ps for tendamistat and 80-120ps for ImmE7. These are also the time ranges within which the linear nonstationary time series are completely satisfactory in explaining protein dynamics. Encountering energy barriers enclosing higher-level energy minima constrains the random walk motion of the proteins, and pseudorelaxation processes at different levels of minima are detected in tendamistat, depending on the sampling window size. Correlation (relaxation) times of 30-40ps and 150-200ps are detected for two energy envelopes of successive levels for tendamistat, which gives an overall idea about the hierarchical structure of the energy landscape. However, it should be stressed that correlation times of the modes are highly variable with respect to conformational subspaces and sampling window sizes, indicating the absence of an actual relaxation. The random-walk step sizes and the time length of the second regime are used to illuminate an important difference between the dynamics of the two proteins, which cannot be clarified by the investigation of relaxation times alone: ImmE7 has lower-energy barriers enclosing the higher-level energy minimum, preventing the protein to relax and letting it move in a random-walk fashion for a longer period of time.
Shen, Hong-Bin; Yi, Dong-Liang; Yao, Li-Xiu; Yang, Jie; Chou, Kuo-Chen
2008-10-01
In the postgenomic age, with the avalanche of protein sequences generated and relatively slow progress in determining their structures by experiments, it is important to develop automated methods to predict the structure of a protein from its sequence. The membrane proteins are a special group in the protein family that accounts for approximately 30% of all proteins; however, solved membrane protein structures only represent less than 1% of known protein structures to date. Although a great success has been achieved for developing computational intelligence techniques to predict secondary structures in both globular and membrane proteins, there is still much challenging work in this regard. In this review article, we firstly summarize the recent progress of automation methodology development in predicting protein secondary structures, especially in membrane proteins; we will then give some future directions in this research field.
Genshaft, Alexander; Moser, Joe-Ann S.; D'Antonio, Edward L.; Bowman, Christine M.; Christianson, David W.
2013-01-01
The reversible acetylation of lysine to form N6-acetyllysine in the regulation of protein function is a hallmark of epigenetics. Acetylation of the positively charged amino group of the lysine side chain generates a neutral N-alkylacetamide moiety that serves as a molecular “switch” for the modulation of protein function and protein-protein interactions. We now report the analysis of 381 N6-acetyllysine side chain amide conformations as found in 79 protein crystal structures and 11 protein NMR structures deposited in the Protein Data Bank (PDB) of the Research Collaboratory for Structural Bioinformatics. We find that only 74.3% of N6-acetyllysine residues in protein crystal structures and 46.5% in protein NMR structures contain amide groups with energetically preferred trans or generously trans conformations. Surprisingly, 17.6% of N6-acetyllysine residues in protein crystal structures and 5.3% in protein NMR structures contain amide groups with energetically unfavorable cis or generously cis conformations. Even more surprisingly, 8.1% of N6-acetyllysine residues in protein crystal structures and 48.2% in NMR structures contain amide groups with energetically prohibitive twisted conformations that approach the transition state structure for cis-trans isomerization. In contrast, 109 unique N-alkylacetamide groups contained in 84 highly-accurate small molecule crystal structures retrieved from the Cambridge Structural Database exclusively adopt energetically preferred trans conformations. Therefore, we conclude that cis and twisted N6-acetyllysine amides in protein structures deposited in the PDB are erroneously modeled due to their energetically unfavorable or prohibitive conformations. PMID:23401043
Structure of the OsSERK2 leucine-rich repeat extracellular domain.
McAndrew, Ryan; Pruitt, Rory N; Kamita, Shizuo G; Pereira, Jose Henrique; Majumdar, Dipali; Hammock, Bruce D; Adams, Paul D; Ronald, Pamela C
2014-11-01
Somatic embryogenesis receptor kinases (SERKs) are leucine-rich repeat (LRR)-containing integral membrane receptors that are involved in the regulation of development and immune responses in plants. It has recently been shown that rice SERK2 (OsSERK2) is essential for XA21-mediated resistance to the pathogen Xanthomonas oryzae pv. oryzae. OsSERK2 is also required for the BRI1-mediated, FLS2-mediated and EFR-mediated responses to brassinosteroids, flagellin and elongation factor Tu (EF-Tu), respectively. Here, crystal structures of the LRR domains of OsSERK2 and a D128N OsSERK2 mutant, expressed as hagfish variable lymphocyte receptor (VLR) fusions, are reported. These structures suggest that the aspartate mutation does not generate any significant conformational change in the protein, but instead leads to an altered interaction with partner receptors.
The Prediction of Botulinum Toxin Structure Based on in Silico and in Vitro Analysis
NASA Astrophysics Data System (ADS)
Suzuki, Tomonori; Miyazaki, Satoru
2011-01-01
Many of biological system mediated through protein-protein interactions. Knowledge of protein-protein complex structure is required for understanding the function. The determination of huge size and flexible protein-protein complex structure by experimental studies remains difficult, costly and five-consuming, therefore computational prediction of protein structures by homolog modeling and docking studies is valuable method. In addition, MD simulation is also one of the most powerful methods allowing to see the real dynamics of proteins. Here, we predict protein-protein complex structure of botulinum toxin to analyze its property. These bioinformatics methods are useful to report the relation between the flexibility of backbone structure and the activity.
Music, Nedzad; Gagnon, Carl A
2010-12-01
Porcine reproductive and respiratory syndrome (PRRS) is an economically devastating viral disease affecting the swine industry worldwide. The etiological agent, PRRS virus (PRRSV), possesses a RNA viral genome with nine open reading frames (ORFs). The ORF1a and ORF1b replicase-associated genes encode the polyproteins pp1a and pp1ab, respectively. The pp1a is processed in nine non-structural proteins (nsps): nsp1α, nsp1β, and nsp2 to nsp8. Proteolytic cleavage of pp1ab generates products nsp9 to nsp12. The proteolytic pp1a cleavage products process and cleave pp1a and pp1ab into nsp products. The nsp9 to nsp12 are involved in virus genome transcription and replication. The 3' end of the viral genome encodes four minor and three major structural proteins. The GP(2a), GP₃ and GP₄ (encoded by ORF2a, 3 and 4), are glycosylated membrane associated minor structural proteins. The fourth minor structural protein, the E protein (encoded by ORF2b), is an unglycosylated membrane associated protein. The viral envelope contains two major structural proteins: a glycosylated major envelope protein GP₅ (encoded by ORF5) and an unglycosylated membrane M protein (encoded by ORF6). The third major structural protein is the nucleocapsid N protein (encoded by ORF7). All PRRSV non-structural and structural proteins are essential for virus replication, and PRRSV infectivity is relatively intolerant to subtle changes within the structural proteins. PRRSV virulence is multigenic and resides in both the non-structural and structural viral proteins. This review discusses the molecular characteristics, biological and immunological functions of the PRRSV structural and nsps and their involvement in the virus pathogenesis.
Joven, Alberto; Morona, Ruth; Moreno, Nerea; González, Agustín
2013-07-01
The sequence of appearance of calretinin and calbindin-D28k immunoreactive (CRir and CBir, respectively) cells and fibers has been studied in the brain of the urodele amphibian Pleurodeles waltl. Embryonic, larval and juvenile stages were studied. The early expression and the dynamics of the distribution of CBir and CRir structures have been used as markers for developmental aspects of distinct neuronal populations, highlighting the accurate extent of many regions in the developing brain, not observed on the basis of cytoarchitecture alone. CR and, to a lesser extent, CB are expressed early in the central nervous system and show a progressively increasing expression from the embryonic stages throughout the larval life and, in general, the labeled structures in the developing brain retain their ability to express these proteins in the adult brain. The onset of CRir cells primarily served to follow the development of the olfactory bulbs, subpallium, thalamus, alar hypothalamus, mesencephalic tegmentum, and distinct cell populations in the rhombencephalic reticular formation. CBir cells highlighted the development of, among others, the pallidum, hypothalamus, dorsal habenula, midbrain tegmentum, cerebellum, and central gray of the rostral rhombencephalon. However, it was the relative and mostly segregated distribution of both proteins in distinct cell populations which evidenced the developing regionalization of the brain. The results have shown the usefulness in neuroanatomy of the analysis during development of the onset of CBir and CRir structures, but the comparison with previous data has shown extensive variability across vertebrate classes. Therefore, one should be cautious when comparing possible homologue structures across species only on the basis of the expression of these proteins, due to the variation of the content of calcium-binding proteins observed in well-established homologous regions in the brain of different vertebrates.
Batra, Ritu; Saripalli, Gautam; Mohan, Amita; Gupta, Saurabh; Gill, Kulvinder S.; Varadwaj, Pritish K.; Balyan, Harindra S.; Gupta, Pushpendra K.
2017-01-01
ADP-glucose pyrophosphorylase (AGPase) is a heterotetrameric enzyme with two large subunits (LS) and two small subunits (SS). It plays a critical role in starch biosynthesis. We are reporting here detailed structure, function and evolution of the genes encoding the LS and the SS among monocots and dicots. “True” orthologs of maize Sh2 (AGPase LS) and Bt2 (AGPase SS) were identified in seven other monocots and three dicots; structure of the enzyme at protein level was also studied. Novel findings of the current study include the following: (i) at the DNA level, the genes controlling the SS are more conserved than those controlling the LS; the variation in both is mainly due to intron number, intron length and intron phase distribution; (ii) at protein level, the SS genes are more conserved relative to those for LS; (iii) “QTCL” motif present in SS showed evolutionary differences in AGPase belonging to wheat 7BS, T. urartu, rice and sorghum, while “LGGG” motif in LS was present in all species except T. urartu and chickpea; SS provides thermostability to AGPase, while LS is involved in regulation of AGPase activity; (iv) heterotetrameric structure of AGPase was predicted and analyzed in real time environment through molecular dynamics simulation for all the species; (v) several cis-acting regulatory elements were identified in the AGPase promoters with their possible role in regulating spatial and temporal expression (endosperm and leaf tissue) and also the expression, in response to abiotic stresses; and (vi) expression analysis revealed downregulation of both subunits under conditions of heat and drought stress. The results of the present study have allowed better understanding of structure and evolution of the genes and the encoded proteins and provided clues for exploitation of variability in these genes for engineering thermostable AGPase. PMID:28174576
von Jeinsen, Beatrice; Short, Meghan I; Xanthakis, Vanessa; Carneiro, Herman; Cheng, Susan; Mitchell, Gary F; Vasan, Ramachandran S
2018-06-21
Adipokines mediate cardiometabolic risk associated with obesity but their role in the pathogenesis of obesity-associated heart failure remains uncertain. We investigated the associations between circulating adipokine concentrations and echocardiographic measures in a community-based sample. We evaluated 3514 Framingham Heart Study participants (mean age 40 years, 53.8% women) who underwent routine echocardiography and had select circulating adipokines measured, ie, leptin, soluble leptin receptor, fatty acid-binding protein 4, retinol-binding protein 4, fetuin-A, and adiponectin. We used multivariable linear regression, adjusting for known correlates (including weight), to relate adipokine concentrations (independent variables) to the following echocardiographic measures (dependent variables): left ventricular mass index, left atrial diameter in end systole, fractional shortening, and E/e'. In multivariable-adjusted analysis, left ventricular mass index was inversely related to circulating leptin and fatty acid-binding protein 4 concentrations but positively related to retinol-binding protein 4 and leptin receptor levels ( P ≤0.002 for all). Left atrial end-systolic dimension was inversely related to leptin but positively related to retinol-binding protein 4 concentrations ( P ≤0.0001). E/e' was inversely related to leptin receptor levels ( P =0.0002). We observed effect modification by body weight for select associations (leptin receptor and fatty acid-binding protein 4 with left ventricular mass index, and leptin with left atrial diameter in end systole; P <0.05 for interactions). Fractional shortening was not associated with any of the adipokines. No echocardiographic trait was associated with fetuin-A or adiponectin concentrations. In our cross-sectional study of a large, young to middle-aged, relatively healthy community-based sample, key indices of subclinical cardiac remodeling were associated with higher or lower circulating concentrations of prohypertrophic and antihypertrophic adipokines in a context-specific manner. These observations may offer insights into the pathogenesis of the cardiomyopathy of obesity. © 2018 The Authors. Published on behalf of the American Heart Association, Inc., by Wiley.
Martin, Juliette; Regad, Leslie; Etchebest, Catherine; Camproux, Anne-Claude
2008-11-15
Interresidue protein contacts in proteins structures and at protein-protein interface are classically described by the amino acid types of interacting residues and the local structural context of the contact, if any, is described using secondary structures. In this study, we present an alternate analysis of interresidue contact using local structures defined by the structural alphabet introduced by Camproux et al. This structural alphabet allows to describe a 3D structure as a sequence of prototype fragments called structural letters, of 27 different types. Each residue can then be assigned to a particular local structure, even in loop regions. The analysis of interresidue contacts within protein structures defined using Voronoï tessellations reveals that pairwise contact specificity is greater in terms of structural letters than amino acids. Using a simple heuristic based on specificity score comparison, we find that 74% of the long-range contacts within protein structures are better described using structural letters than amino acid types. The investigation is extended to a set of protein-protein complexes, showing that the similar global rules apply as for intraprotein contacts, with 64% of the interprotein contacts best described by local structures. We then present an evaluation of pairing functions integrating structural letters to decoy scoring and show that some complexes could benefit from the use of structural letter-based pairing functions.
Classification of proteins: available structural space for molecular modeling.
Andreeva, Antonina
2012-01-01
The wealth of available protein structural data provides unprecedented opportunity to study and better understand the underlying principles of protein folding and protein structure evolution. A key to achieving this lies in the ability to analyse these data and to organize them in a coherent classification scheme. Over the past years several protein classifications have been developed that aim to group proteins based on their structural relationships. Some of these classification schemes explore the concept of structural neighbourhood (structural continuum), whereas other utilize the notion of protein evolution and thus provide a discrete rather than continuum view of protein structure space. This chapter presents a strategy for classification of proteins with known three-dimensional structure. Steps in the classification process along with basic definitions are introduced. Examples illustrating some fundamental concepts of protein folding and evolution with a special focus on the exceptions to them are presented.
A single determinant dominates the rate of yeast protein evolution.
Drummond, D Allan; Raval, Alpan; Wilke, Claus O
2006-02-01
A gene's rate of sequence evolution is among the most fundamental evolutionary quantities in common use, but what determines evolutionary rates has remained unclear. Here, we carry out the first combined analysis of seven predictors (gene expression level, dispensability, protein abundance, codon adaptation index, gene length, number of protein-protein interactions, and the gene's centrality in the interaction network) previously reported to have independent influences on protein evolutionary rates. Strikingly, our analysis reveals a single dominant variable linked to the number of translation events which explains 40-fold more variation in evolutionary rate than any other, suggesting that protein evolutionary rate has a single major determinant among the seven predictors. The dominant variable explains nearly half the variation in the rate of synonymous and protein evolution. We show that the two most commonly used methods to disentangle the determinants of evolutionary rate, partial correlation analysis and ordinary multivariate regression, produce misleading or spurious results when applied to noisy biological data. We overcome these difficulties by employing principal component regression, a multivariate regression of evolutionary rate against the principal components of the predictor variables. Our results support the hypothesis that translational selection governs the rate of synonymous and protein sequence evolution in yeast.
NASA Astrophysics Data System (ADS)
Bozkurt, Ozlem; Haman Bayari, Sevgi; Severcan, Mete; Krafft, Christoph; Popp, Jürgen; Severcan, Feride
2012-07-01
The relation between protein structural alterations and tissue dysfunction is a major concern as protein fibrillation and/or aggregation due to structural alterations has been reported in many disease states. In the current study, Fourier transform infrared microspectroscopic imaging has been used to investigate diabetes-induced changes on protein secondary structure and macromolecular content in streptozotocin-induced diabetic rat liver. Protein secondary structural alterations were predicted using neural network approach utilizing the amide I region. Moreover, the role of selenium in the recovery of diabetes-induced alterations on macromolecular content and protein secondary structure was also studied. The results revealed that diabetes induced a decrease in lipid to protein and glycogen to protein ratios in diabetic livers. Significant alterations in protein secondary structure were observed with a decrease in α-helical and an increase in β-sheet content. Both doses of selenium restored diabetes-induced changes in lipid to protein and glycogen to protein ratios. However, low-dose selenium supplementation was not sufficient to recover the effects of diabetes on protein secondary structure, while a higher dose of selenium fully restored diabetes-induced alterations in protein structure.
Single-Point Mutation with a Rotamer Library Toolkit: Toward Protein Engineering.
Pottel, Joshua; Moitessier, Nicolas
2015-12-28
Protein engineers have long been hard at work to harness biocatalysts as a natural source of regio-, stereo-, and chemoselectivity in order to carry out chemistry (reactions and/or substrates) not previously achieved with these enzymes. The extreme labor demands and exponential number of mutation combinations have induced computational advances in this domain. The first step in our virtual approach is to predict the correct conformations upon mutation of residues (i.e., rebuilding side chains). For this purpose, we opted for a combination of molecular mechanics and statistical data. In this work, we have developed automated computational tools to extract protein structural information and created conformational libraries for each amino acid dependent on a variable number of parameters (e.g., resolution, flexibility, secondary structure). We have also developed the necessary tool to apply the mutation and optimize the conformation accordingly. For side-chain conformation prediction, we obtained overall average root-mean-square deviations (RMSDs) of 0.91 and 1.01 Å for the 18 flexible natural amino acids within two distinct sets of over 3000 and 1500 side-chain residues, respectively. The commonly used dihedral angle differences were also evaluated and performed worse than the state of the art. These two metrics are also compared. Furthermore, we generated a family-specific library for kinases that produced an average 2% lower RMSD upon side-chain reconstruction and a residue-specific library that yielded a 17% improvement. Ultimately, since our protein engineering outlook involves using our docking software, Fitted/Impacts, we applied our mutation protocol to a benchmarked data set for self- and cross-docking. Our side-chain reconstruction does not hinder our docking software, demonstrating differences in pose prediction accuracy of approximately 2% (RMSD cutoff metric) for a set of over 200 protein/ligand structures. Similarly, when docking to a set of over 100 kinases, side-chain reconstruction (using both general and biased conformation libraries) had minimal detriment to the docking accuracy.
Gay, Sean C; Roberts, Arthur G; Halpert, James R
2010-09-01
Cytochromes P450 (P450s) play a major role in the clearance of drugs, toxins, and environmental pollutants. Additionally, metabolism by P450s can result in toxic or carcinogenic products. The metabolism of pharmaceuticals by P450s is a major concern during the design of new drug candidates. Determining the interactions between P450s and compounds of very diverse structures is complicated by the variability in P450-ligand interactions. Understanding the protein structural elements and the chemical attributes of ligands that dictate their orientation in the P450 active site will aid in the development of effective and safe therapeutic agents. The goal of this review is to describe P450-ligand interactions from two perspectives. The first is the various structural elements that microsomal P450s have at their disposal to assume the different conformations observed in X-ray crystal structures. The second is P450-ligand dynamics analyzed by NMR relaxation studies.
Crystal structures of the components of the Staphylococcus aureus leukotoxin ED
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nocadello, S.; Minasov, G.; Shuvalova, L.
Staphylococcal leukotoxins are a family of β-barrel, bicomponent, pore-forming toxins with membrane-damaging functions. These bacterial exotoxins share sequence and structural homology and target several host-cell types. Leukotoxin ED (LukED) is one of these bicomponent pore-forming toxins thatStaphylococcus aureusproduces in order to suppress the ability of the host to contain the infection. The recent delineation of the important role that LukED plays inS. aureuspathogenesis and the identification of its protein receptors, combined with its presence inS. aureusmethicillin-resistant epidemic strains, establish this leukocidin as a possible target for the development of novel therapeutics. Here, the crystal structures of the water-soluble LukE andmore » LukD components of LukED have been determined. Lastly, the two structures illustrate the tertiary-structural variability with respect to the other leukotoxins while retaining the conservation of the residues involved in the interaction of the protomers in the bipartite leukotoxin in the pore complex.« less
Crystal structures of the components of the Staphylococcus aureus leukotoxin ED
Nocadello, S.; Minasov, G.; Shuvalova, L.; ...
2016-01-01
Staphylococcal leukotoxins are a family of β-barrel, bicomponent, pore-forming toxins with membrane-damaging functions. These bacterial exotoxins share sequence and structural homology and target several host-cell types. Leukotoxin ED (LukED) is one of these bicomponent pore-forming toxins thatStaphylococcus aureusproduces in order to suppress the ability of the host to contain the infection. The recent delineation of the important role that LukED plays inS. aureuspathogenesis and the identification of its protein receptors, combined with its presence inS. aureusmethicillin-resistant epidemic strains, establish this leukocidin as a possible target for the development of novel therapeutics. Here, the crystal structures of the water-soluble LukE andmore » LukD components of LukED have been determined. Lastly, the two structures illustrate the tertiary-structural variability with respect to the other leukotoxins while retaining the conservation of the residues involved in the interaction of the protomers in the bipartite leukotoxin in the pore complex.« less
Pascual, Elena; Mata, Carlos P; Gómez-Blanco, Josué; Moreno, Noelia; Bárcena, Juan; Blanco, Esther; Rodríguez-Frandsen, Ariel; Nieto, Amelia; Carrascosa, José L; Castón, José R
2015-03-01
Bioengineering of viruses and virus-like particles (VLPs) is a well-established approach in the development of new and improved vaccines against viral and bacterial pathogens. We report here that the capsid of a major avian pathogen, infectious bursal disease virus (IBDV), can accommodate heterologous proteins to induce protective immunity. The structural units of the ~70-nm-diameter T=13 IBDV capsid are trimers of VP2, which is made as a precursor (pVP2). The pVP2 C-terminal domain has an amphipathic α helix that controls VP2 polymorphism. In the absence of the VP3 scaffolding protein, 466-residue pVP2 intermediates bearing this α helix assemble into genuine VLPs only when expressed with an N-terminal His6 tag (the HT-VP2-466 protein). HT-VP2-466 capsids are optimal for protein insertion, as they are large enough (cargo space, ~78,000 nm(3)) and are assembled from a single protein. We explored HT-VP2-466-based chimeric capsids initially using enhanced green fluorescent protein (EGFP). The VLP assembly yield was efficient when we coexpressed EGFP-HT-VP2-466 and HT-VP2-466 from two recombinant baculoviruses. The native EGFP structure (~240 copies/virion) was successfully inserted in a functional form, as VLPs were fluorescent, and three-dimensional cryo-electron microscopy showed that the EGFP molecules incorporated at the inner capsid surface. Immunization of mice with purified EGFP-VLPs elicited anti-EGFP antibodies. We also inserted hemagglutinin (HA) and matrix (M2) protein epitopes derived from the mouse-adapted A/PR/8/34 influenza virus and engineered several HA- and M2-derived chimeric capsids. Mice immunized with VLPs containing the HA stalk, an M2 fragment, or both antigens developed full protection against viral challenge. Virus-like particles (VLPs) are multimeric protein cages that mimic the infectious virus capsid and are potential candidates as nonliving vaccines that induce long-lasting protection. Chimeric VLPs can display or include foreign antigens, which could be a conserved epitope to elicit broadly neutralizing antibodies or several variable epitopes effective against a large number of viral strains. We report the biochemical, structural, and immunological characterization of chimeric VLPs derived from infectious bursal disease virus (IBDV), an important poultry pathogen. To test the potential of IBDV VLPs as a vaccine vehicle, we used the enhanced green fluorescent protein and two fragments derived from the hemagglutinin and the M2 matrix protein of the human murine-adapted influenza virus. The IBDV capsid protein fused to influenza virus peptides formed assemblies able to protect mice against viral challenge. Our studies establish the basis for a new generation of multivalent IBDV-based vaccines. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Bui, Huyen T.; Karren, Mary A.; Bhar, Debjani
2012-01-01
To initiate mitochondrial fission, dynamin-related proteins (DRPs) must bind specific adaptors on the outer mitochondrial membrane. The structural features underlying this interaction are poorly understood. Using yeast as a model, we show that the Insert B domain of the Dnm1 guanosine triphosphatase (a DRP) contains a novel motif required for association with the mitochondrial adaptor Mdv1. Mutation of this conserved motif specifically disrupted Dnm1–Mdv1 interactions, blocking Dnm1 recruitment and mitochondrial fission. Suppressor mutations in Mdv1 that restored Dnm1–Mdv1 interactions and fission identified potential protein-binding interfaces on the Mdv1 β-propeller domain. These results define the first known function for Insert B in DRP–adaptor interactions. Based on the variability of Insert B sequences and adaptor proteins, we propose that Insert B domains and mitochondrial adaptors have coevolved to meet the unique requirements for mitochondrial fission of different organisms. PMID:23148233
Probing short-range protein Brownian motion in the cytoplasm of living cells
Di Rienzo, Carmine; Piazza, Vincenzo; Gratton, Enrico; Beltram, Fabio; Cardarelli, Francesco
2014-01-01
The translational motion of molecules in cells deviates from what is observed in dilute solutions. Theoretical models provide explanations for this effect but with predictions that drastically depend on the nanoscale organization assumed for macromolecular crowding agents. A conclusive test of the nature of the translational motion in cells is missing owing to the lack of techniques capable of probing crowding with the required temporal and spatial resolution. Here we show that fluorescence-fluctuation analysis of raster scans at variable timescales can provide this information. By using green fluorescent proteins in cells, we measure protein motion at the unprecedented timescale of 1 μs, unveiling unobstructed Brownian motion from 25 to 100 nm, and partially suppressed diffusion above 100 nm. Furthermore, experiments on model systems attribute this effect to the presence of relatively immobile structures rather than to diffusing crowding agents. We discuss the implications of these results for intracellular processes. PMID:25532887
Multicollinearity in canonical correlation analysis in maize.
Alves, B M; Cargnelutti Filho, A; Burin, C
2017-03-30
The objective of this study was to evaluate the effects of multicollinearity under two methods of canonical correlation analysis (with and without elimination of variables) in maize (Zea mays L.) crop. Seventy-six maize genotypes were evaluated in three experiments, conducted in a randomized block design with three replications, during the 2009/2010 crop season. Eleven agronomic variables (number of days from sowing until female flowering, number of days from sowing until male flowering, plant height, ear insertion height, ear placement, number of plants, number of ears, ear index, ear weight, grain yield, and one thousand grain weight), 12 protein-nutritional variables (crude protein, lysine, methionine, cysteine, threonine, tryptophan, valine, isoleucine, leucine, phenylalanine, histidine, and arginine), and 6 energetic-nutritional variables (apparent metabolizable energy, apparent metabolizable energy corrected for nitrogen, ether extract, crude fiber, starch, and amylose) were measured. A phenotypic correlation matrix was first generated among the 29 variables for each of the experiments. A multicollinearity diagnosis was later performed within each group of variables using methodologies such as variance inflation factor and condition number. Canonical correlation analysis was then performed, with and without the elimination of variables, among groups of agronomic and protein-nutritional, and agronomic and energetic-nutritional variables. The canonical correlation analysis in the presence of multicollinearity (without elimination of variables) overestimates the variability of canonical coefficients. The elimination of variables is an efficient method to circumvent multicollinearity in canonical correlation analysis.
Insight into the Evolution of the Histidine Triad Protein (HTP) Family in Streptococcus
Pan, Xiu-Zhen; Wang, Bin; Chen, Jian-Qun
2013-01-01
The Histidine Triad Proteins (HTPs), also known as Pht proteins in Streptococcus pneumoniae, constitute a family of surface-exposed proteins that exist in many pathogenic streptococcal species. Although many studies have revealed the importance of HTPs in streptococcal physiology and pathogenicity, little is known about their origin and evolution. In this study, after identifying all htp homologs from 105 streptococcal genomes representing 38 different species/subspecies, we analyzed their domain structures, positions in genome, and most importantly, their evolutionary histories. By further projecting this information onto the streptococcal phylogeny, we made several major findings. First, htp genes originated earlier than the Streptococcus genus and gene-loss events have occurred among three streptococcal groups, resulting in the absence of the htp gene in the Bovis, Mutans and Salivarius groups. Second, the copy number of htp genes in other groups of Streptococcus is variable, ranging from one to four functional copies. Third, both phylogenetic evidence and domain structure analyses support the division of two htp subfamilies, designated as htp I and htp II. Although present mainly in the pyogenic group and in Streptococcus suis, htp II members are distinct from htp I due to the presence of an additional leucine-rich-repeat domain at the C-terminus. Finally, htp genes exhibit a faster nucleotide substitution rate than do housekeeping genes. Specifically, the regions outside the HTP domains are under strong positive selection. This distinct evolutionary pattern likely helped Streptococcus to easily escape from recognition by host immunity. PMID:23527301
Variation and molecular evolution of HmbR, the Neisseria meningitidis haemoglobin receptor
Evans, Nicholas J.; Harrison, Odile B.; Clow, Kirsten; Derrick, Jeremy P.; Feavers, Ian M.; Maiden, Martin C. J.
2010-01-01
Meningococcal disease caused by serogroup B Neisseria meningitidis remains an important health problem in many parts of the world, and there are currently no comprehensive vaccines. Poor immunogenicity, combined with immunological identity to human sialic acids, have hindered the development of a serogroup B conjugate vaccine, resulting in the development of alternative vaccine candidates, including many outer-membrane protein (OMP)-based formulations. However, the design of protein-based meningococcal vaccines is complicated by the high level of genetic and antigenic diversity of the meningococcus. Knowledge of the extent and structuring of this diversity can have implications for the use of particular proteins as potential vaccine candidates. With this in mind, the diversity of the meningococcal OMP HmbR was investigated among N. meningitidis isolates representative of major hyper-invasive lineages. In common with other meningococcal antigens, the genetic diversity of hmbR resulted from a combination of intraspecies horizontal genetic exchange and de novo mutation. Furthermore, genealogical analysis showed an association of hmbR genes with clonal complexes and the occurrence of two hmbR families, A and B. Three variable regions (VR1–VR3), located in loops 2, 3 and 4, were observed with clonal complex structuring of VR types. A minority of codons (3.9 %), located within putative surface-exposed loop regions of a 2D model, were under diversifying selection, indicating regions of the protein likely to be subject to immune attack. PMID:20150237
An ANOVA approach for statistical comparisons of brain networks.
Fraiman, Daniel; Fraiman, Ricardo
2018-03-16
The study of brain networks has developed extensively over the last couple of decades. By contrast, techniques for the statistical analysis of these networks are less developed. In this paper, we focus on the statistical comparison of brain networks in a nonparametric framework and discuss the associated detection and identification problems. We tested network differences between groups with an analysis of variance (ANOVA) test we developed specifically for networks. We also propose and analyse the behaviour of a new statistical procedure designed to identify different subnetworks. As an example, we show the application of this tool in resting-state fMRI data obtained from the Human Connectome Project. We identify, among other variables, that the amount of sleep the days before the scan is a relevant variable that must be controlled. Finally, we discuss the potential bias in neuroimaging findings that is generated by some behavioural and brain structure variables. Our method can also be applied to other kind of networks such as protein interaction networks, gene networks or social networks.
Boosted structured additive regression for Escherichia coli fed-batch fermentation modeling.
Melcher, Michael; Scharl, Theresa; Luchner, Markus; Striedner, Gerald; Leisch, Friedrich
2017-02-01
The quality of biopharmaceuticals and patients' safety are of highest priority and there are tremendous efforts to replace empirical production process designs by knowledge-based approaches. Main challenge in this context is that real-time access to process variables related to product quality and quantity is severely limited. To date comprehensive on- and offline monitoring platforms are used to generate process data sets that allow for development of mechanistic and/or data driven models for real-time prediction of these important quantities. Ultimate goal is to implement model based feed-back control loops that facilitate online control of product quality. In this contribution, we explore structured additive regression (STAR) models in combination with boosting as a variable selection tool for modeling the cell dry mass, product concentration, and optical density on the basis of online available process variables and two-dimensional fluorescence spectroscopic data. STAR models are powerful extensions of linear models allowing for inclusion of smooth effects or interactions between predictors. Boosting constructs the final model in a stepwise manner and provides a variable importance measure via predictor selection frequencies. Our results show that the cell dry mass can be modeled with a relative error of about ±3%, the optical density with ±6%, the soluble protein with ±16%, and the insoluble product with an accuracy of ±12%. Biotechnol. Bioeng. 2017;114: 321-334. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Amonsou, Eric Oscar; Sakyi-Dawson, Esther; Saalia, Firibu Kwesi; Houssou, Paul
2008-12-01
Griddled cowpea paste foods have high nutritional potential because they are low in fat but high in protein. A good understanding of process and product characteristics of kpejigaou is necessary to improve its quality and enhance acceptability. To describe the product, evaluate critical variables in traditional processing, and determine consumer quality criteria and preferences for kpejigaou. A survey of kpejigaou processing was carried out among processors and regular consumers of kpejigaou. Kpejigaou is flat and circular in shape, with uniform thickness and porous structure. The production process of kpejigaou was found to be simple and rapid, but the quality of the finished product varied among processors and among batches. Critical processing variables affecting quality were dehulling of the cowpeas, type of griddling equipment, and griddling temperature. Texture (sponginess) is the most important quality index that determines the preference and acceptability of kpejigaou by consumers. Traditionally processed kpejigaou does not meet current standards for high-quality foods. This study provides the basis for efforts to standardize the kpejigaou process to ensure consistent product quality and enhance the acceptability of kpejigaou among consumers. Kpejigaou has a potential for success if marketed as a low-fat, nutritious fast food.
Diversity of the P2 protein among nontypeable Haemophilus influenzae isolates.
Bell, J; Grass, S; Jeanteur, D; Munson, R S
1994-01-01
The genes for outer membrane protein P2 of four nontypeable Haemophilus influenzae strains were cloned and sequenced. The derived amino acid sequences were compared with the outer membrane protein P2 sequence from H. influenzae type b MinnA and the sequences of P2 from three additional nontypeable H. influenzae strains. The sequences were 76 to 94% identical. The sequences had regions with considerable variability separated by regions which were highly conserved. The variable regions mapped to putative surface-exposed loops of the protein. PMID:8188390
Gaia: automated quality assessment of protein structure models.
Kota, Pradeep; Ding, Feng; Ramachandran, Srinivas; Dokholyan, Nikolay V
2011-08-15
Increasing use of structural modeling for understanding structure-function relationships in proteins has led to the need to ensure that the protein models being used are of acceptable quality. Quality of a given protein structure can be assessed by comparing various intrinsic structural properties of the protein to those observed in high-resolution protein structures. In this study, we present tools to compare a given structure to high-resolution crystal structures. We assess packing by calculating the total void volume, the percentage of unsatisfied hydrogen bonds, the number of steric clashes and the scaling of the accessible surface area. We assess covalent geometry by determining bond lengths, angles, dihedrals and rotamers. The statistical parameters for the above measures, obtained from high-resolution crystal structures enable us to provide a quality-score that points to specific areas where a given protein structural model needs improvement. We provide these tools that appraise protein structures in the form of a web server Gaia (http://chiron.dokhlab.org). Gaia evaluates the packing and covalent geometry of a given protein structure and provides quantitative comparison of the given structure to high-resolution crystal structures. dokh@unc.edu Supplementary data are available at Bioinformatics online.
Zelaya, Víctor Martín; Fernández, Paula Virginia; Vega, Andrea Susana; Mantese, Anita Ida; Federico, Ana Ailén; Ciancia, Marina
2017-07-01
Young shoots of Phyllostachys aurea showed glucuronoarabinoxylans (GAX) as the major hemicellulosic components, being extracted in major amounts with 1M KOH (ratio Xyl:Ara:GlcA, 100:67:8), but also with water, showing a broad structural variability. Mixed linkage glucans were also present, but in minor amounts, mostly concentrated in the 4M KOH extracts, while pectin polymers were very scarce. Arabinogalactan proteins were an important part of water extracts, determined by the presence of the typical arabinogalactan structures (3- and 6-linked Gal p; terminal and 5-linked Ara f), in addition to small amounts of hydroxyproline (2-3% of total protein) and positive reaction to Yariv's reagent. Morphological and anatomical characteristics of young shoots are described, as well as localization of some cell wall components, and related with chemical analysis. A method for determination of uronic acids as their N-propylaldonamide acetates and separation and quantification by GC/MS was adapted for its use with grass cell wall fractions. Copyright © 2017 Elsevier Ltd. All rights reserved.
Expression of endogenous proteins in maize hybrids in a multi-location field trial in India.
Gutha, Linga R; Purushottam, Divakar; Veeramachaneni, Aruna; Tigulla, Sarita; Kodappully, Vikas; Enjala, Chandana; Rajput, Hitendrasinh; Anderson, Jennifer; Hong, Bonnie; Schmidt, Jean; Bagga, Shveta
2018-05-17
Genetically modified (GM) crops undergo large scale multi-location field trials to characterize agronomics, composition, and the concentration of newly expressed protein(s) [herein referred to as transgenic protein(s)]. The concentration of transgenic proteins in different plant tissues and across the developmental stages of the plant is considered in the safety assessment of GM crops. Reference or housekeeping proteins are expected to maintain a relatively stable expression pattern in healthy plants given their role in cellular functions. Understanding the effects of genotype, growth stage and location on the concentration of endogenous housekeeping proteins may provide insight into the contribution these factors could have on transgenic protein concentrations in GM crops. The concentrations of three endogenous proteins (actin, elongation factor 1-alpha, and glyceraldehyde 3-phosphate dehydrogenase) were measured in several different maize hybrids grown across multiple field locations over 2 years. Leaf samples were collected from healthy plants at three developmental stages across the growing seasons, and protein concentrations were quantified by indirect enzyme-linked immunosorbent assay (ELISA) for each protein. In general, the concentrations of these three endogenous proteins were relatively consistent across hybrid backgrounds, when compared within one growth stage and location (2-26%CV), whereas the concentrations of proteins in the same hybrid and growth stage across different locations were more variable (12-64%CV). In general, the protein concentrations in 2013 and 2014 show similar trends in variability. Some degree of variability in protein concentrations should be expected for both transgenic and endogenous plant-expressed proteins. In the case of GM crops, the potential variation in protein concentrations due to location effects is captured in the current model of multi-location field testing.
Cargo selection by specific kinesin light chain 1 isoforms.
Woźniak, Marcin J; Allan, Victoria J
2006-11-29
Kinesin-1 drives the movement of diverse cargoes, and it has been proposed that specific kinesin light chain (KLC) isoforms target kinesin-1 to these different structures. Here, we test this hypothesis using two in vitro motility assays, which reconstitute the movement of rough endoplasmic reticulum (RER) and vesicles present in a Golgi membrane fraction. We generated GST-tagged fusion proteins of KLC1B and KLC1D that included the tetratricopeptide repeat domain and the variable C-terminus. We find that preincubation of RER with KLC1B inhibits RER motility, whereas KLC1D does not. In contrast, Golgi fraction vesicle movement is inhibited by KLC1D but not KLC1B reagents. Both RER and vesicle movement is inhibited by preincubation with the GST-tagged C-terminal domain of ubiquitous kinesin heavy chain (uKHC), which binds to the N-terminal domain of uKHC and alters its interaction with microtubules. We propose that although the TRR domains are required for cargo binding, it is the variable C-terminal region of KLCs that are vital for targeting kinesin-1 to different cellular structures.
Cargo selection by specific kinesin light chain 1 isoforms
Woźniak, Marcin J; Allan, Victoria J
2006-01-01
Kinesin-1 drives the movement of diverse cargoes, and it has been proposed that specific kinesin light chain (KLC) isoforms target kinesin-1 to these different structures. Here, we test this hypothesis using two in vitro motility assays, which reconstitute the movement of rough endoplasmic reticulum (RER) and vesicles present in a Golgi membrane fraction. We generated GST-tagged fusion proteins of KLC1B and KLC1D that included the tetratricopeptide repeat domain and the variable C-terminus. We find that preincubation of RER with KLC1B inhibits RER motility, whereas KLC1D does not. In contrast, Golgi fraction vesicle movement is inhibited by KLC1D but not KLC1B reagents. Both RER and vesicle movement is inhibited by preincubation with the GST-tagged C-terminal domain of ubiquitous kinesin heavy chain (uKHC), which binds to the N-terminal domain of uKHC and alters its interaction with microtubules. We propose that although the TRR domains are required for cargo binding, it is the variable C-terminal region of KLCs that are vital for targeting kinesin-1 to different cellular structures. PMID:17093494
Protein Structure Prediction by Protein Threading
NASA Astrophysics Data System (ADS)
Xu, Ying; Liu, Zhijie; Cai, Liming; Xu, Dong
The seminal work of Bowie, Lüthy, and Eisenberg (Bowie et al., 1991) on "the inverse protein folding problem" laid the foundation of protein structure prediction by protein threading. By using simple measures for fitness of different amino acid types to local structural environments defined in terms of solvent accessibility and protein secondary structure, the authors derived a simple and yet profoundly novel approach to assessing if a protein sequence fits well with a given protein structural fold. Their follow-up work (Elofsson et al., 1996; Fischer and Eisenberg, 1996; Fischer et al., 1996a,b) and the work by Jones, Taylor, and Thornton (Jones et al., 1992) on protein fold recognition led to the development of a new brand of powerful tools for protein structure prediction, which we now term "protein threading." These computational tools have played a key role in extending the utility of all the experimentally solved structures by X-ray crystallography and nuclear magnetic resonance (NMR), providing structural models and functional predictions for many of the proteins encoded in the hundreds of genomes that have been sequenced up to now.
3D-SURFER 2.0: web platform for real-time search and characterization of protein surfaces.
Xiong, Yi; Esquivel-Rodriguez, Juan; Sael, Lee; Kihara, Daisuke
2014-01-01
The increasing number of uncharacterized protein structures necessitates the development of computational approaches for function annotation using the protein tertiary structures. Protein structure database search is the basis of any structure-based functional elucidation of proteins. 3D-SURFER is a web platform for real-time protein surface comparison of a given protein structure against the entire PDB using 3D Zernike descriptors. It can smoothly navigate the protein structure space in real-time from one query structure to another. A major new feature of Release 2.0 is the ability to compare the protein surface of a single chain, a single domain, or a single complex against databases of protein chains, domains, complexes, or a combination of all three in the latest PDB. Additionally, two types of protein structures can now be compared: all-atom-surface and backbone-atom-surface. The server can also accept a batch job for a large number of database searches. Pockets in protein surfaces can be identified by VisGrid and LIGSITE (csc) . The server is available at http://kiharalab.org/3d-surfer/.
Protein Structure Determination using Metagenome sequence data
Ovchinnikov, Sergey; Park, Hahnbeom; Varghese, Neha; Huang, Po-Ssu; Pavlopoulos, Georgios A.; Kim, David E.; Kamisetty, Hetunandan; Kyrpides, Nikos C.; Baker, David
2017-01-01
Despite decades of work by structural biologists, there are still ~5200 protein families with unknown structure outside the range of comparative modeling. We show that Rosetta structure prediction guided by residue-residue contacts inferred from evolutionary information can accurately model proteins that belong to large families, and that metagenome sequence data more than triples the number of protein families with sufficient sequences for accurate modeling. We then integrate metagenome data, contact based structure matching and Rosetta structure calculations to generate models for 614 protein families with currently unknown structures; 206 are membrane proteins and 137 have folds not represented in the PDB. This approach provides the representative models for large protein families originally envisioned as the goal of the protein structure initiative at a fraction of the cost. PMID:28104891
PDZ-containing proteins: alternative splicing as a source of functional diversity.
Sierralta, Jimena; Mendoza, Carolina
2004-12-01
Scaffold proteins allow specific protein complexes to be assembled in particular regions of the cell at which they organize subcellular structures and signal transduction complexes. This characteristic is especially important for neurons, which are highly polarized cells. Among the domains contained by scaffold proteins, the PSD-95, Discs-large, ZO-1 (PDZ) domains are of particular relevance in signal transduction processes and maintenance of neuronal and epithelial polarity. These domains are specialized in the binding of the carboxyl termini of proteins allowing membrane proteins to be localized by the anchoring to the cytoskeleton mediated by PDZ-containing scaffold proteins. In vivo studies carried out in Drosophila have taught that the role of many scaffold proteins is not limited to a single process; thus, in many cases the same genes are expressed in different tissues and participate in apparently very diverse processes. In addition to the differential expression of interactors of scaffold proteins, the expression of variants of these molecular scaffolds as the result of the alternative processing of the genes that encode them is proving to be a very important source of variability and complexity on a main theme. Alternative splicing in the nervous system is well documented, where specific isoforms play roles in neurotransmission, ion channel function, neuronal cell recognition, and are developmentally regulated making it a major mechanism of functional diversity. Here we review the current state of knowledge about the diversity and the known function of PDZ-containing proteins in Drosophila with emphasis in the role played by alternatively processed forms in the diversity of functions attributed to this family of proteins.