Sample records for improves protein identification

  1. Identification of Extracellular Segments by Mass Spectrometry Improves Topology Prediction of Transmembrane Proteins.

    PubMed

    Langó, Tamás; Róna, Gergely; Hunyadi-Gulyás, Éva; Turiák, Lilla; Varga, Julia; Dobson, László; Várady, György; Drahos, László; Vértessy, Beáta G; Medzihradszky, Katalin F; Szakács, Gergely; Tusnády, Gábor E

    2017-02-13

    Transmembrane proteins play a crucial role in signaling, ion transport, nutrient uptake, as well as in maintaining the dynamic equilibrium between the internal and external environment of cells. Despite their important biological functions and abundance, less than 2% of all determined structures are transmembrane proteins. Given the persisting technical difficulties associated with high resolution structure determination of transmembrane proteins, additional methods, including computational and experimental techniques, remain vital in promoting our understanding of their topologies, 3D structures, functions and interactions. Here we report a method for the high-throughput determination of extracellular segments of transmembrane proteins based on the identification of surface-labeled and biotin-captured peptide fragments by LC/MS/MS. We show that reliable identification of extracellular protein segments increases the accuracy and reliability of existing topology prediction algorithms. Using the experimental topology data as constraints, our improved prediction tool provides accurate and reliable topology models for hundreds of human transmembrane proteins.

  2. Proteomics meets blood banking: identification of protein targets for the improvement of platelet quality.

    PubMed

    Schubert, Peter; Devine, Dana V

    2010-01-03

    Proteomics has brought new perspectives to the fields of hematology and transfusion medicine in the last decade. The steady improvement of proteomic technology is propelling novel discoveries of molecular mechanisms by studying protein expression, post-translational modifications and protein interactions. This review article focuses on the application of proteomics to the identification of molecular mechanisms leading to the deterioration of blood platelets during storage - a critical aspect in the provision of platelet transfusion products. Several proteomic approaches have been employed to analyse changes in the platelet protein profile during storage and the obtained data now need to be translated into platelet biochemistry in order to connect the results to platelet function. Targeted biochemical applications then allow the identification of points for intervention in signal transduction pathways. Once validated and placed in a transfusion context, these data will provide further understanding of the underlying molecular mechanisms leading to platelet storage lesion. Future aspects of proteomics in blood banking will aim to make use of protein markers identified for platelet storage lesion development to monitor proteome changes when alterations such as the use of additive solutions or pathogen reduction strategies are put in place in order to improve platelet quality for patients.

  3. Method optimization for proteomic analysis of soybean leaf: Improvements in identification of new and low-abundance proteins

    PubMed Central

    Mesquita, Rosilene Oliveira; de Almeida Soares, Eduardo; de Barros, Everaldo Gonçalves; Loureiro, Marcelo Ehlers

    2012-01-01

    The most critical step in any proteomic study is protein extraction and sample preparation. Better solubilization increases the separation and resolution of gels, allowing identification of a higher number of proteins and more accurate quantitation of differences in gene expression. Despite the existence of published results for the optimization of proteomic analyses of soybean seeds, no comparable data are available for proteomic studies of soybean leaf tissue. In this work we have tested the effects of modification of a TCA-acetone method on the resolution of 2-DE gels of leaves and roots of soybean. Better focusing was obtained when both mercaptoethanol and dithiothreitol were used in the extraction buffer simultaneously. Increasing the number of washes of TCA precipitated protein with acetone, using a final wash with 80% ethanol and using sonication to resuspend the pellet increased the number of detected proteins as well as the resolution of the 2-DE gels. Using this approach we have constructed a soybean protein map. The major group of identified proteins corresponded to genes of unknown function. The second and third most abundant groups of proteins were composed of photosynthesis and metabolism related genes. The resulting protocol improved protein solubility and gel resolution allowing the identification of 122 soybean leaf proteins, 72 of which were not detected in other published soybean leaf 2-DE gel datasets, including a transcription factor and several signaling proteins. PMID:22802721

  4. Improving automatic peptide mass fingerprint protein identification by combining many peak sets.

    PubMed

    Rögnvaldsson, Thorsteinn; Häkkinen, Jari; Lindberg, Claes; Marko-Varga, György; Potthast, Frank; Samuelsson, Jim

    2004-08-05

    An automated peak picking strategy is presented where several peak sets with different signal-to-noise levels are combined to form a more reliable statement on the protein identity. The strategy is compared against both manual peak picking and industry standard automated peak picking on a set of mass spectra obtained after tryptic in gel digestion of 2D-gel samples from human fetal fibroblasts. The set of spectra contain samples ranging from strong to weak spectra, and the proposed multiple-scale method is shown to be much better on weak spectra than the industry standard method and a human operator, and equal in performance to these on strong and medium strong spectra. It is also demonstrated that peak sets selected by a human operator display a considerable variability and that it is impossible to speak of a single "true" peak set for a given spectrum. The described multiple-scale strategy both avoids time-consuming parameter tuning and exceeds the human operator in protein identification efficiency. The strategy therefore promises reliable automated user-independent protein identification using peptide mass fingerprints.

  5. Systematic Errors in Peptide and Protein Identification and Quantification by Modified Peptides*

    PubMed Central

    Bogdanow, Boris; Zauber, Henrik; Selbach, Matthias

    2016-01-01

    The principle of shotgun proteomics is to use peptide mass spectra in order to identify corresponding sequences in a protein database. The quality of peptide and protein identification and quantification critically depends on the sensitivity and specificity of this assignment process. Many peptides in proteomic samples carry biochemical modifications, and a large fraction of unassigned spectra arise from modified peptides. Spectra derived from modified peptides can erroneously be assigned to wrong amino acid sequences. However, the impact of this problem on proteomic data has not yet been investigated systematically. Here we use combinations of different database searches to show that modified peptides can be responsible for 20–50% of false positive identifications in deep proteomic data sets. These false positive hits are particularly problematic as they have significantly higher scores and higher intensities than other false positive matches. Furthermore, these wrong peptide assignments lead to hundreds of false protein identifications and systematic biases in protein quantification. We devise a “cleaned search” strategy to address this problem and show that this considerably improves the sensitivity and specificity of proteomic data. In summary, we show that modified peptides cause systematic errors in peptide and protein identification and quantification and should therefore be considered to further improve the quality of proteomic data annotation. PMID:27215553

  6. Improved Recovery and Identification of Membrane Proteins from Rat Hepatic Cells using a Centrifugal Proteomic Reactor*

    PubMed Central

    Zhou, Hu; Wang, Fangjun; Wang, Yuwei; Ning, Zhibin; Hou, Weimin; Wright, Theodore G.; Sundaram, Meenakshi; Zhong, Shumei; Yao, Zemin; Figeys, Daniel

    2011-01-01

    Despite their importance in many biological processes, membrane proteins are underrepresented in proteomic analysis because of their poor solubility (hydrophobicity) and often low abundance. We describe a novel approach for the identification of plasma membrane proteins and intracellular microsomal proteins that combines membrane fractionation, a centrifugal proteomic reactor for streamlined protein extraction, protein digestion and fractionation by centrifugation, and high performance liquid chromatography-electrospray ionization-tandem MS. The performance of this approach was illustrated for the study of the proteome of ER and Golgi microsomal membranes in rat hepatic cells. The centrifugal proteomic reactor identified 945 plasma membrane proteins and 955 microsomal membrane proteins, of which 63 and 47% were predicted as bona fide membrane proteins, respectively. Among these proteins, >800 proteins were undetectable by the conventional in-gel digestion approach. The majority of the membrane proteins only identified by the centrifugal proteomic reactor were proteins with ≥2 transmembrane segments or proteins with high molecular mass (e.g. >150 kDa) and hydrophobicity. The improved proteomic reactor allowed the detection of a group of endocytic and/or signaling receptor proteins on the plasma membrane, as well as apolipoproteins and glycerolipid synthesis enzymes that play a role in the assembly and secretion of apolipoprotein B100-containing very low density lipoproteins. Thus, the centrifugal proteomic reactor offers a new analytical tool for structure and function studies of membrane proteins involved in lipid and lipoprotein metabolism. PMID:21749988

  7. Direct Maximization of Protein Identifications from Tandem Mass Spectra*

    PubMed Central

    Spivak, Marina; Weston, Jason; Tomazela, Daniela; MacCoss, Michael J.; Noble, William Stafford

    2012-01-01

    The goal of many shotgun proteomics experiments is to determine the protein complement of a complex biological mixture. For many mixtures, most methodological approaches fall significantly short of this goal. Existing solutions to this problem typically subdivide the task into two stages: first identifying a collection of peptides with a low false discovery rate and then inferring from the peptides a corresponding set of proteins. In contrast, we formulate the protein identification problem as a single optimization problem, which we solve using machine learning methods. This approach is motivated by the observation that the peptide and protein level tasks are cooperative, and the solution to each can be improved by using information about the solution to the other. The resulting algorithm directly controls the relevant error rate, can incorporate a wide variety of evidence and, for complex samples, provides 18–34% more protein identifications than the current state of the art approaches. PMID:22052992

  8. Preprocessing Significantly Improves the Peptide/Protein Identification Sensitivity of High-resolution Isobarically Labeled Tandem Mass Spectrometry Data*

    PubMed Central

    Sheng, Quanhu; Li, Rongxia; Dai, Jie; Li, Qingrun; Su, Zhiduan; Guo, Yan; Li, Chen; Shyr, Yu; Zeng, Rong

    2015-01-01

    Isobaric labeling techniques coupled with high-resolution mass spectrometry have been widely employed in proteomic workflows requiring relative quantification. For each high-resolution tandem mass spectrum (MS/MS), isobaric labeling techniques can be used not only to quantify the peptide from different samples by reporter ions, but also to identify the peptide it is derived from. Because the ions related to isobaric labeling may act as noise in database searching, the MS/MS spectrum should be preprocessed before peptide or protein identification. In this article, we demonstrate that there are a lot of high-frequency, high-abundance isobaric related ions in the MS/MS spectrum, and removing isobaric related ions combined with deisotoping and deconvolution in MS/MS preprocessing procedures significantly improves the peptide/protein identification sensitivity. The user-friendly software package TurboRaw2MGF (v2.0) has been implemented for converting raw TIC data files to mascot generic format files and can be downloaded for free from https://github.com/shengqh/RCPA.Tools/releases as part of the software suite ProteomicsTools. The data have been deposited to the ProteomeXchange with identifier PXD000994. PMID:25435543
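
The preprocessing idea in this abstract, dropping label-related ions before database search, can be illustrated with a minimal Python sketch. It is not the TurboRaw2MGF implementation: the function name, the tolerance, and the reporter m/z list (approximate iTRAQ 4-plex values) are assumptions chosen for illustration, and the deisotoping and deconvolution steps mentioned in the abstract are not shown.

```python
# Illustrative sketch only; names and values are assumptions, not TurboRaw2MGF code.

# Approximate iTRAQ 4-plex reporter ion m/z values (assumed for this example).
ITRAQ4_REPORTERS = (114.1107, 115.1077, 116.1111, 117.1144)


def remove_isobaric_ions(peaks, tag_mzs=ITRAQ4_REPORTERS, tol=0.01):
    """Return the peaks whose m/z is farther than `tol` from every tag ion.

    peaks   -- list of (mz, intensity) tuples from one MS/MS spectrum
    tag_mzs -- m/z values of label-related ions to strip
    tol     -- absolute m/z tolerance in Th
    """
    return [
        (mz, intensity)
        for mz, intensity in peaks
        if all(abs(mz - tag) > tol for tag in tag_mzs)
    ]


spectrum = [(114.111, 1000.0), (175.119, 200.0), (117.115, 900.0)]
cleaned = remove_isobaric_ions(spectrum)
print(cleaned)
```

Keeping the tag list as a parameter lets the same filter be reused for other labeling chemistries, provided the caller supplies the appropriate m/z values.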

  9. Applications of graph theory in protein structure identification

    PubMed Central

    2011-01-01

    There is a growing interest in the identification of proteins on the proteome-wide scale. Among the different kinds of protein structure identification methods, graph-theoretic methods are particularly effective. Due to their lower costs, higher effectiveness and many other advantages, they have drawn increasing attention from researchers. Specifically, graph-theoretic methods have been widely used in homology identification, side-chain cluster identification, peptide sequencing and so on. This paper reviews several methods for solving protein structure identification problems using graph theory. We mainly introduce classical methods and mathematical models including homology modeling based on clique finding, identification of side-chain clusters in protein structures upon graph spectrum, and de novo peptide sequencing via tandem mass spectrometry using the spectrum graph model. In addition, concluding remarks and future priorities of each method are given. PMID:22165974

  10. Protein denaturation improves enzymatic digestion efficiency for direct tissue analysis using mass spectrometry

    NASA Astrophysics Data System (ADS)

    Setou, M.; Hayasaka, T.; Shimma, S.; Sugiura, Y.; Matsumoto, M.

    2008-12-01

    Molecular identification using high-sensitivity tandem mass spectrometry is essential for protein analysis on the tissue surface. Here we report an improved digestion protocol for protein identification directly on the tissue surface using mass spectrometry. Using a denaturation step and a detergent-supplemented trypsin solution, we successfully detected and identified many molecules such as tubulin, neurofilament, and synaptosomal-associated 25 kDa protein directly from a mouse cerebellum section.

  11. Protein Identification Using Top-Down Spectra*

    PubMed Central

    Liu, Xiaowen; Sirotkin, Yakov; Shen, Yufeng; Anderson, Gordon; Tsai, Yihsuan S.; Ting, Ying S.; Goodlett, David R.; Smith, Richard D.; Bafna, Vineet; Pevzner, Pavel A.

    2012-01-01

    In the last two years, because of advances in protein separation and mass spectrometry, top-down mass spectrometry moved from analyzing single proteins to analyzing complex samples and identifying hundreds and even thousands of proteins. However, computational tools for database search of top-down spectra against protein databases are still in their infancy. We describe MS-Align+, a fast algorithm for top-down protein identification based on spectral alignment that enables searches for unexpected post-translational modifications. We also propose a method for evaluating statistical significance of top-down protein identifications and further benchmark various software tools on two top-down data sets from Saccharomyces cerevisiae and Salmonella typhimurium. We demonstrate that MS-Align+ significantly increases the number of identified spectra as compared with MASCOT and OMSSA on both data sets. Although MS-Align+ and ProSightPC have similar performance on the Salmonella typhimurium data set, MS-Align+ outperforms ProSightPC on the (more complex) Saccharomyces cerevisiae data set. PMID:22027200

  12. Distilling the essential features of a protein surface for improving protein-ligand docking, scoring, and virtual screening

    NASA Astrophysics Data System (ADS)

    Zavodszky, Maria I.; Sanschagrin, Paul C.; Kuhn, Leslie A.; Korde, Rajesh S.

    2002-12-01

    For the successful identification and docking of new ligands to a protein target by virtual screening, the essential features of the protein and ligand surfaces must be captured and distilled in an efficient representation. Since the running time for docking increases exponentially with the number of points representing the protein and each ligand candidate, it is important to place these points where the best interactions can be made between the protein and the ligand. This definition of favorable points of interaction can also guide protein structure-based ligand design, which typically focuses on which chemical groups provide the most energetically favorable contacts. In this paper, we present an alternative method of protein template and ligand interaction point design that identifies the most favorable points for making hydrophobic and hydrogen-bond interactions by using a knowledge base. The knowledge-based protein and ligand representations have been incorporated in version 2.0 of SLIDE and resulted in dockings closer to the crystal structure orientations when screening a set of 57 known thrombin and glutathione S-transferase (GST) ligands against the apo structures of these proteins. There was also improved scoring enrichment of the dockings, meaning better differentiation between the chemically diverse known ligands and a ~15,000-molecule dataset of randomly-chosen small organic molecules. This approach for identifying the most important points of interaction between proteins and their ligands can equally well be used in other docking and design techniques. While much recent effort has focused on improving scoring functions for protein-ligand docking, our results indicate that improving the representation of the chemistry of proteins and their ligands is another avenue that can lead to significant improvements in the identification, docking, and scoring of ligands.

  13. DeepSig: deep learning improves signal peptide detection in proteins.

    PubMed

    Savojardo, Castrense; Martelli, Pier Luigi; Fariselli, Piero; Casadio, Rita

    2018-05-15

    The identification of signal peptides in protein sequences is an important step toward protein localization and function characterization. Here, we present DeepSig, an improved approach for signal peptide detection and cleavage-site prediction based on deep learning methods. Comparative benchmarks performed on an updated independent dataset of proteins show that DeepSig is the current best performing method, scoring better than other available state-of-the-art approaches on both signal peptide detection and precise cleavage-site identification. DeepSig is available as both a standalone program and a web server at https://deepsig.biocomp.unibo.it. All datasets used in this study can be obtained from the same website. Contact: pierluigi.martelli@unibo.it. Supplementary data are available at Bioinformatics online.

  14. enDNA-Prot: identification of DNA-binding proteins by applying ensemble learning.

    PubMed

    Xu, Ruifeng; Zhou, Jiyun; Liu, Bin; Yao, Lin; He, Yulan; Zou, Quan; Wang, Xiaolong

    2014-01-01

    DNA-binding proteins are crucial for various cellular processes, such as recognition of specific nucleotides, regulation of transcription, and regulation of gene expression. Developing an effective model for identifying DNA-binding proteins is an urgent research problem. Up to now, many methods have been proposed, but most of them focus on only one classifier and cannot make full use of the large number of negative samples to improve predicting performance. This study proposed a predictor called enDNA-Prot for DNA-binding protein identification by employing the ensemble learning technique. Experimental results showed that enDNA-Prot was comparable with DNA-Prot and outperformed DNAbinder and iDNA-Prot with performance improvement in the range of 3.97-9.52% in ACC and 0.08-0.19 in MCC. Furthermore, when the benchmark dataset was expanded with negative samples, the performance of enDNA-Prot outperformed the three existing methods by 2.83-16.63% in terms of ACC and 0.02-0.16 in terms of MCC. These results indicate that enDNA-Prot is an effective method for DNA-binding protein identification and that expanding the training dataset with negative samples can improve its performance. For the convenience of the vast majority of experimental scientists, we developed a user-friendly web-server for enDNA-Prot which is freely accessible to the public.
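
The abstract reports improvements in ACC (accuracy) and MCC (Matthews correlation coefficient). For readers unfamiliar with these metrics, the following Python sketch computes both from a binary confusion matrix using their standard definitions. The function name and the example counts are hypothetical and are not taken from the enDNA-Prot study.

```python
import math


def accuracy_and_mcc(tp, tn, fp, fn):
    """Compute accuracy and Matthews correlation coefficient.

    tp, tn, fp, fn -- counts of true positives, true negatives,
                      false positives and false negatives.
    MCC is reported as 0.0 when its denominator is zero (a common convention).
    """
    total = tp + tn + fp + fn
    acc = (tp + tn) / total
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return acc, mcc


# Hypothetical counts, for illustration only.
acc, mcc = accuracy_and_mcc(tp=40, tn=45, fp=5, fn=10)
print(f"ACC = {acc:.3f}, MCC = {mcc:.3f}")
```

MCC is often reported alongside accuracy because it stays informative when positive and negative classes are unbalanced, which is relevant when a benchmark is expanded with many negative samples.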

  15. A Graph-Centric Approach for Metagenome-Guided Peptide and Protein Identification in Metaproteomics

    PubMed Central

    Tang, Haixu; Li, Sujun; Ye, Yuzhen

    2016-01-01

    Metaproteomic studies adopt the common bottom-up proteomics approach to investigate the protein composition and the dynamics of protein expression in microbial communities. When matched metagenomic and/or metatranscriptomic data of the microbial communities are available, metaproteomic data analyses often employ a metagenome-guided approach, in which complete or fragmental protein-coding genes are first directly predicted from metagenomic (and/or metatranscriptomic) sequences or from their assemblies, and the resulting protein sequences are then used as the reference database for peptide/protein identification from MS/MS spectra. This approach is often limited because protein coding genes predicted from metagenomes are incomplete and fragmental. In this paper, we present a graph-centric approach to improving metagenome-guided peptide and protein identification in metaproteomics. Our method exploits the de Bruijn graph structure reported by metagenome assembly algorithms to generate a comprehensive database of protein sequences encoded in the community. We tested our method using several public metaproteomic datasets with matched metagenomic and metatranscriptomic sequencing data acquired from complex microbial communities in a biological wastewater treatment plant. The results showed that many more peptides and proteins can be identified when assembly graphs were utilized, improving the characterization of the proteins expressed in the microbial communities. The additional proteins we identified contribute to the characterization of important pathways such as those involved in degradation of chemical hazards. Our tools are released as open-source software on github at https://github.com/COL-IU/Graph2Pro. PMID:27918579
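
To make the role of the de Bruijn graph concrete, here is a toy Python sketch that builds such a graph from short reads and spells out every sequence reachable from a start node. It is not the Graph2Pro code: the function names are hypothetical, the reads are invented, and the real workflow operates on assembler output and derives protein sequences rather than listing nucleotide paths.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Map each (k-1)-mer prefix to the set of (k-1)-mer suffixes that follow it."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])
    return graph


def spell_paths(graph, start, max_nodes):
    """Yield sequences spelled by paths from `start`.

    A path ends at a node with no successors or after `max_nodes` nodes,
    which also bounds traversal through cycles.
    """
    stack = [(start, start, 1)]
    while stack:
        node, seq, n_nodes = stack.pop()
        successors = sorted(graph.get(node, ()))
        if not successors or n_nodes == max_nodes:
            yield seq
            continue
        for nxt in successors:
            stack.append((nxt, seq + nxt[-1], n_nodes + 1))


# Two invented reads that share a region and then diverge.
graph = build_de_bruijn(["ATGGCGT", "GGCGA"], k=4)
print(sorted(spell_paths(graph, "ATG", max_nodes=10)))
```

The branch at node `GCG` yields two alternative sequences, which hints at why traversing the graph can recover variants that a single linear contig would omit.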

  16. Size-Sorting Combined with Improved Nanocapillary-LC-MS for Identification of Intact Proteins up to 80 kDa

    PubMed Central

    Vellaichamy, Adaikkalam; Tran, John C.; Catherman, Adam D.; Lee, Ji Eun; Kellie, John F.; Sweet, Steve M.M.; Zamdborg, Leonid; Thomas, Paul M.; Ahlf, Dorothy R.; Durbin, Kenneth R.; Valaskovic, Gary A.; Kelleher, Neil L.

    2010-01-01

    Despite the availability of ultra-high resolution mass spectrometers, methods for separation and detection of intact proteins for proteome-scale analyses are still in a developmental phase. Here we report robust protocols for on-line LC-MS to drive high-throughput top-down proteomics in a fashion similar to bottom-up. Comparative work on protein standards showed that a polymeric stationary phase led to superior sensitivity over a silica-based medium in reversed-phase nanocapillary-LC, with detection of proteins >50 kDa routinely accomplished in the linear ion trap of a hybrid Fourier-Transform mass spectrometer. Protein identification was enabled by nozzle-skimmer dissociation (NSD) and detection of fragment ions with <5 ppm mass accuracy for highly-specific database searching using custom software. This overall approach led to identification of proteins up to 80 kDa, with 10-60 proteins identified in single LC-MS runs of samples from yeast and human cell lines pre-fractionated by their molecular weight using a gel-based sieving system. PMID:20073486

  17. Stable isotope, site-specific mass tagging for protein identification

    DOEpatents

    Chen, Xian

    2006-10-24

    Proteolytic peptide mass mapping as measured by mass spectrometry provides an important method for the identification of proteins, which are usually identified by matching the measured and calculated m/z values of the proteolytic peptides. A unique identification is, however, heavily dependent upon the mass accuracy and sequence coverage of the fragment ions generated by peptide ionization. The present invention describes a method for increasing the specificity, accuracy and efficiency of the assignments of particular proteolytic peptides and consequent protein identification, by the incorporation of selected amino acid residue(s) enriched with stable isotope(s) into the protein sequence without the need for ultrahigh instrumental accuracy. Selected amino acid(s) are labeled with ¹³C/¹⁵N/²H and incorporated into proteins in a sequence-specific manner during cell culturing. Each of these labeled amino acids carries a defined mass change encoded in its monoisotopic distribution pattern. Through their characteristic patterns, the peptides with mass tag(s) can then be readily distinguished from other peptides in mass spectra. The present method of identifying unique proteins can also be extended to protein complexes and will significantly increase data search specificity, efficiency and accuracy for protein identifications.
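
The matching of measured and calculated m/z values, and the extra constraint a residue-specific mass tag provides, can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than the patented method: the function names are hypothetical, and the example label (a +3.01883 Da shift on leucine, corresponding to three deuterium atoms) is chosen only for demonstration.

```python
# Standard monoisotopic residue masses in Da.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.010565   # mass of H2O added for the peptide termini
PROTON = 1.007276   # mass of the charge carrier


def peptide_mass(seq, label_shifts=None):
    """Monoisotopic neutral mass; label_shifts maps residue -> added mass."""
    label_shifts = label_shifts or {}
    return WATER + sum(MONO[aa] + label_shifts.get(aa, 0.0) for aa in seq)


def mz(mass, charge=1):
    """m/z of the protonated peptide at the given charge state."""
    return (mass + charge * PROTON) / charge


def count_labeled_residues(mass_unlabeled, mass_labeled, shift):
    """Infer how many labeled residues a peptide carries from its mass shift."""
    return round((mass_labeled - mass_unlabeled) / shift)


LEU_D3 = {"L": 3.01883}  # assumed example label, for illustration only
light = peptide_mass("LGAL")
heavy = peptide_mass("LGAL", LEU_D3)
print(f"light m/z {mz(light):.4f}, heavy m/z {mz(heavy):.4f}")
print(count_labeled_residues(light, heavy, LEU_D3["L"]))
```

Knowing the number of tagged residues narrows the set of candidate peptides with a matching mass, which is how a tag can improve assignment specificity without requiring higher instrument accuracy.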

  18. Improved gel electrophoresis matrix for hydrophobic protein separation and identification.

    PubMed

    Tokarski, Caroline; Fillet, Marianne; Rolando, Christian

    2011-03-01

    We propose an improved acrylamide gel for the separation of hydrophobic proteins. The separation strategy is based on the incorporation of N-alkylated and N,N'-dialkylated acrylamide monomers in the gel composition in order to increase hydrophobic interactions between the gel matrix and the membrane proteins. Focusing on the most efficient monomer, N,N'-dimethylacrylamide, the potentiality of the new matrix was evaluated on membrane proteins of the human colon HCT-116 cell line. Protein analysis was performed using an adapted analytical strategy based on FT-ICR tandem mass spectrometry. As a result of this comparative study, including advanced reproducibility experiments, more hydrophobic proteins were identified in the new gel (average GRAVY: -0.085) than in the classical gel (average GRAVY: -0.411). Highly hydrophobic peptides were identified reaching a GRAVY value up to 1.450, therefore indicating their probable locations in the membrane. Focusing on predicted transmembrane domains, it can be pointed out that 27 proteins were identified in the hydrophobic gel containing up to 11 transmembrane domains; in the classical gel, only 5 proteins containing 1 transmembrane domain were successfully identified. For example, multiple ionic channels and receptors were characterized in the hydrophobic gel such as the sodium/potassium channel and the glutamate or the transferrin receptors whereas they are traditionally detected using specific enrichment techniques such as immunoprecipitation. In total, membrane proteins identified in the classical gel are well documented in the literature, while most of the membrane proteins only identified on the hydrophobic gel have rarely or never been described using a proteomic-based approach.
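
The GRAVY values quoted in this abstract are grand averages of hydropathy. A GRAVY score is conventionally the mean Kyte-Doolittle hydropathy value over all residues of a sequence, with positive values indicating more hydrophobic sequences. The Python sketch below shows that calculation; the function name and test peptides are hypothetical, and it is assumed (not stated in the abstract) that the authors used the Kyte-Doolittle scale.

```python
# Kyte-Doolittle hydropathy values per residue.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence):
    """Grand average of hydropathy: mean Kyte-Doolittle value of the residues."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


# Invented peptides: one hydrophobic, one charged and hydrophilic.
print(f"{gravy('AILVFL'):.3f}")
print(f"{gravy('KDERKD'):.3f}")
```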

  19. Identification of Protein-Protein Interactions with Glutathione-S-Transferase (GST) Fusion Proteins.

    PubMed

    Einarson, Margret B; Pugacheva, Elena N; Orlinick, Jason R

    2007-08-01

    INTRODUCTION: Glutathione-S-transferase (GST) fusion proteins have had a wide range of applications since their introduction as tools for synthesis of recombinant proteins in bacteria. GST was originally selected as a fusion moiety because of several desirable properties. First and foremost, when expressed in bacteria alone, or as a fusion, GST is not sequestered in inclusion bodies (in contrast to previous fusion protein systems). Second, GST can be affinity-purified without denaturation because it binds to immobilized glutathione, which provides the basis for simple purification. Consequently, GST fusion proteins are routinely used for antibody generation and purification, protein-protein interaction studies, and biochemical analysis. This article describes the use of GST fusion proteins as probes for the identification of protein-protein interactions.

  20. A peptide-retrieval strategy enables significant improvement of quantitative performance without compromising confidence of identification.

    PubMed

    Tu, Chengjian; Shen, Shichen; Sheng, Quanhu; Shyr, Yu; Qu, Jun

    2017-01-30

    Reliable quantification of low-abundance proteins in complex proteomes is challenging largely owing to the limited number of spectra/peptides identified. In this study we developed a straightforward method to improve the quantitative accuracy and precision of proteins by strategically retrieving the less confident peptides that were previously filtered out using the standard target-decoy search strategy. The filtered-out MS/MS spectra matched to confidently-identified proteins were recovered, and the peptide-spectrum-match FDR was re-calculated and controlled at a confident level of FDR≤1%, while the protein FDR was maintained at ~1%. We evaluated the performance of this strategy in both spectral count- and ion current-based methods. A >60% increase of total quantified spectra/peptides was achieved for both a spike-in sample set and a public dataset from CPTAC. Incorporating the peptide retrieval strategy significantly improved the quantitative accuracy and precision, especially for low-abundance proteins (e.g. one-hit proteins). Moreover, the capacity of confidently discovering significantly-altered proteins was also enhanced substantially, as demonstrated with two spike-in datasets. In summary, improved quantitative performance was achieved by this peptide recovery strategy without compromising confidence of protein identification, which can be readily implemented in a broad range of quantitative proteomics techniques including label-free or labeling approaches. We hypothesize that more quantifiable spectra and peptides in a protein, even including less confident peptides, could help reduce variations and improve protein quantification. Hence the peptide retrieval strategy was developed and evaluated in two spike-in sample sets with different LC-MS/MS variations using both MS1- and MS2-based quantitative approach. The list of confidently identified proteins using the standard target-decoy search strategy was fixed and more spectra/peptides with less
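
The standard target-decoy filtering that this record builds on can be sketched as follows. This Python example shows only the generic estimate, in which the FDR at a score cutoff is approximated as decoy hits divided by target hits. It is not the authors' peptide-retrieval procedure: the function name, the simple estimator, and the toy scores are assumptions for illustration.

```python
def score_threshold_at_fdr(psms, max_fdr=0.01):
    """Return the lowest score cutoff whose estimated FDR is <= max_fdr.

    psms -- iterable of (score, is_decoy) pairs, higher score = better match
    The FDR above a cutoff is estimated as (#decoys / #targets).
    Returns None if no cutoff satisfies the limit. Score ties are not
    treated specially in this sketch.
    """
    ranked = sorted(psms, key=lambda psm: psm[0], reverse=True)
    targets = decoys = 0
    best_cutoff = None
    for score, is_decoy in ranked:
        if is_decoy:
            decoys += 1
        else:
            targets += 1
        if targets and decoys / targets <= max_fdr:
            best_cutoff = score
    return best_cutoff


# Toy peptide-spectrum matches: (score, is_decoy).
psms = [(10.0, False), (9.0, False), (8.0, True), (7.0, False)]
print(score_threshold_at_fdr(psms, max_fdr=0.01))
print(score_threshold_at_fdr(psms, max_fdr=0.5))
```

With the strict 1% limit only the two top-scoring matches pass, while a looser limit admits lower-scoring spectra. Lower-scoring spectra of this kind are what the abstract describes recovering under a re-calculated FDR.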

  1. Comprehensive Identification of Proteins from MALDI Imaging*

    PubMed Central

    Maier, Stefan K.; Hahne, Hannes; Gholami, Amin Moghaddas; Balluff, Benjamin; Meding, Stephan; Schoene, Cédrik; Walch, Axel K.; Kuster, Bernhard

    2013-01-01

    Matrix-assisted laser desorption/ionization imaging mass spectrometry (MALDI IMS) is a powerful tool for the visualization of proteins in tissues and has demonstrated considerable diagnostic and prognostic value. One main challenge is that the molecular identity of such potential biomarkers mostly remains unknown. We introduce a generic method that removes this issue by systematically identifying the proteins embedded in the MALDI matrix using a combination of bottom-up and top-down proteomics. The analyses of ten human tissues led to the identification of 1400 abundant and soluble proteins constituting the set of proteins detectable by MALDI IMS including >90% of all IMS biomarkers reported in the literature. Top-down analysis of the matrix proteome identified 124 mostly N- and C-terminally fragmented proteins indicating considerable protein processing activity in tissues. All protein identification data from this study as well as the IMS literature have been deposited into MaTisse, a new publicly available database, which we anticipate will become a valuable resource for the IMS community. PMID:23782541

  2. Identification of secreted bacterial proteins by noncanonical amino acid tagging

    PubMed Central

    Mahdavi, Alborz; Szychowski, Janek; Ngo, John T.; Sweredoski, Michael J.; Graham, Robert L. J.; Hess, Sonja; Schneewind, Olaf; Mazmanian, Sarkis K.; Tirrell, David A.

    2014-01-01

    Pathogenic microbes have evolved complex secretion systems to deliver virulence factors into host cells. Identification of these factors is critical for understanding the infection process. We report a powerful and versatile approach to the selective labeling and identification of secreted pathogen proteins. Selective labeling of microbial proteins is accomplished via translational incorporation of azidonorleucine (Anl), a methionine surrogate that requires a mutant form of the methionyl-tRNA synthetase for activation. Secreted pathogen proteins containing Anl can be tagged by azide-alkyne cycloaddition and enriched by affinity purification. Application of the method to analysis of the type III secretion system of the human pathogen Yersinia enterocolitica enabled efficient identification of secreted proteins, identification of distinct secretion profiles for intracellular and extracellular bacteria, and determination of the order of substrate injection into host cells. This approach should be widely useful for the identification of virulence factors in microbial pathogens and the development of potential new targets for antimicrobial therapy. PMID:24347637

  3. A standardized framing for reporting protein identifications in mzIdentML 1.2

    PubMed Central

    Seymour, Sean L.; Farrah, Terry; Binz, Pierre-Alain; Chalkley, Robert J.; Cottrell, John S.; Searle, Brian C.; Tabb, David L.; Vizcaíno, Juan Antonio; Prieto, Gorka; Uszkoreit, Julian; Eisenacher, Martin; Martínez-Bartolomé, Salvador; Ghali, Fawaz; Jones, Andrew R.

    2015-01-01

    Inferring which protein species have been detected in bottom-up proteomics experiments has been a challenging problem for which solutions have been maturing over the past decade. While many inference approaches now function well in isolation, comparing and reconciling the results generated across different tools remains difficult. It presently stands as one of the greatest barriers in collaborative efforts such as the Human Proteome Project and public repositories like the PRoteomics IDEntifications (PRIDE) database. Here we present a framework for reporting protein identifications that seeks to improve capabilities for comparing results generated by different inference tools. This framework standardizes the terminology for describing protein identification results, associated with the HUPO-Proteomics Standards Initiative (PSI) mzIdentML standard, while still allowing for differing methodologies to reach that final state. It is proposed that developers of software for reporting identification results will adopt this terminology in their outputs. While the new terminology does not require any changes to the core mzIdentML model, it represents a significant change in practice, and, as such, the rules will be released via a new version of the mzIdentML specification (version 1.2) so that consumers of files are able to determine whether the new guidelines have been adopted by export software. PMID:25092112

  4. Improvement in Protein Domain Identification Is Reached by Breaking Consensus, with the Agreement of Many Profiles and Domain Co-occurrence

    PubMed Central

    Bernardes, Juliana; Zaverucha, Gerson; Vaquero, Catherine; Carbone, Alessandra

    2016-01-01

    Traditional protein annotation methods describe known domains with probabilistic models representing consensus among homologous domain sequences. However, when relevant signals become too weak to be identified by a global consensus, attempts for annotation fail. Here we address the fundamental question of domain identification for highly divergent proteins. By using high performance computing, we demonstrate that the limits of state-of-the-art annotation methods can be bypassed. We design a new strategy based on the observation that many structural and functional protein constraints are not globally conserved through all species but might be locally conserved in separate clades. We propose a novel exploitation of the large amount of data available: 1. for each known protein domain, several probabilistic clade-centered models are constructed from a large and differentiated panel of homologous sequences, 2. a decision-making protocol combines outcomes obtained from multiple models, 3. a multi-criteria optimization algorithm finds the most likely protein architecture. The method is evaluated for domain and architecture prediction over several datasets and statistical testing hypotheses. Its performance is compared against HMMScan and HHblits, two widely used search methods based on sequence-profile and profile-profile comparison. Due to their closeness to actual protein sequences, clade-centered models are shown to be more specific and functionally predictive than the broadly used consensus models. Based on them, we improved annotation of Plasmodium falciparum protein sequences on a scale not previously possible. We successfully predict at least one domain for 72% of P. falciparum proteins against 63% achieved previously, corresponding to 30% of improvement over the total number of Pfam domain predictions on the whole genome. The method is applicable to any genome and opens new avenues to tackle evolutionary questions such as the reconstruction of ancient domain

  5. Automatic poisson peak harvesting for high throughput protein identification.

    PubMed

    Breen, E J; Hopwood, F G; Williams, K L; Wilkins, M R

    2000-06-01

    High throughput identification of proteins by peptide mass fingerprinting requires an efficient means of picking peaks from mass spectra. Here, we report the development of a peak harvester to automatically pick monoisotopic peaks from spectra generated on matrix-assisted laser desorption/ionisation time of flight (MALDI-TOF) mass spectrometers. The peak harvester uses advanced mathematical morphology and watershed algorithms to first process spectra to stick representations. Subsequently, Poisson modelling is applied to determine which peak in an isotopically resolved group represents the monoisotopic mass of a peptide. We illustrate the features of the peak harvester with mass spectra of standard peptides, digests of gel-separated bovine serum albumin, and with Escherichia coli proteins prepared by two-dimensional polyacrylamide gel electrophoresis. In all cases, the peak harvester proved effective in its ability to pick similar monoisotopic peaks as an experienced human operator, and also proved effective in the identification of monoisotopic masses in cases where isotopic distributions of peptides were overlapping. The peak harvester can be operated in an interactive mode, or can be completely automated and linked through to peptide mass fingerprinting protein identification tools to achieve high throughput automated protein identification.
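
    The Poisson step described in this abstract can be illustrated with a minimal sketch. It is not the authors' peak harvester: it assumes singly charged ions (so m/z is treated as mass), peaks spaced about 1 Da apart, and a Poisson mean that grows linearly with mass. The function names and the constant HEAVY_ISOTOPES_PER_DA are hypothetical, and the constant is an illustrative placeholder rather than a value taken from the paper.

```python
import math

# Illustrative placeholder (NOT a value from the paper): assumed mean number
# of heavy-isotope substitutions per dalton of peptide mass.
HEAVY_ISOTOPES_PER_DA = 0.0006


def poisson_pmf(k, lam):
    """Probability of observing k heavy isotopes when the mean is lam."""
    return math.exp(-lam) * lam ** k / math.factorial(k)


def pick_monoisotopic(masses, intensities, rate=HEAVY_ISOTOPES_PER_DA):
    """Return the index of the peak that best fits as the monoisotopic peak.

    masses and intensities describe one isotopically resolved group,
    sorted by mass. Each peak is tried as the candidate monoisotopic peak;
    the Poisson envelope predicted from that start is compared with the
    observed, normalised intensities by squared error.
    """
    total = sum(intensities)
    observed = [value / total for value in intensities]
    n = len(masses)
    best_index, best_error = 0, float("inf")
    for start in range(n):
        lam = rate * masses[start]
        raw = [poisson_pmf(k, lam) for k in range(n - start)]
        norm = sum(raw)
        # Peaks below the candidate start are not explained by this envelope.
        expected = [0.0] * start + [p / norm for p in raw]
        error = sum((o - e) ** 2 for o, e in zip(observed, expected))
        if error < best_error:
            best_index, best_error = start, error
    return best_index
```

    Scoring every candidate start against the whole group penalises envelopes that leave intense low-mass peaks unexplained, which is one simple way to avoid picking the second isotope peak by mistake.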

  6. Enhancing Membrane Protein Identification Using a Simplified Centrifugation and Detergent-Based Membrane Extraction Approach.

    PubMed

    Zhou, Yanting; Gao, Jing; Zhu, Hongwen; Xu, Jingjing; He, Han; Gu, Lei; Wang, Hui; Chen, Jie; Ma, Danjun; Zhou, Hu; Zheng, Jing

    2018-02-20

    Membrane proteins may act as transporters, receptors, enzymes, and adhesion-anchors, accounting for nearly 70% of pharmaceutical drug targets. Difficulties in efficient enrichment, extraction, and solubilization still exist because of their relatively low abundance and poor solubility. A simplified membrane protein extraction approach with advantages of user-friendly sample processing procedures, good repeatability and significant effectiveness was developed in the current research for enhancing enrichment and identification of membrane proteins. This approach, combining centrifugation and detergent along with LC-MS/MS, successfully identified a higher proportion of membrane proteins, integral proteins and transmembrane proteins in the membrane fraction (76.6%, 48.1%, and 40.6%) than in total cell lysate (41.6%, 16.4%, and 13.5%), respectively. Moreover, our method tended to capture membrane proteins with a high degree of hydrophobicity and number of transmembrane domains, as 486 out of 2106 (23.0%) had GRAVY > 0 in the membrane fraction and 488 out of 2106 (23.1%) had TMs ≥ 2. It also provided for improved identification of membrane proteins, as more than 60.6% of the commonly identified membrane proteins in two cell samples were better identified in the membrane fraction with higher sequence coverage. Data are available via ProteomeXchange with identifier PXD008456.
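
    The GRAVY criterion used in this abstract (GRAVY > 0 indicating a hydrophobic protein) is the grand average of hydropathy: the mean Kyte-Doolittle hydropathy value over all residues in a sequence. A minimal sketch follows; the function name gravy and the example peptides are illustrative and do not come from the study.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence):
    """Return the grand average of hydropathy (GRAVY) of a protein sequence.

    Raises ValueError for an empty sequence and KeyError for residues
    outside the 20 standard amino acids.
    """
    residues = sequence.upper()
    if not residues:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)


# Hypothetical example: a leucine/valine-rich stretch is hydrophobic,
# a lysine/aspartate-rich stretch is hydrophilic.
is_hydrophobic = gravy("LLVVAIL") > 0
is_hydrophilic = gravy("KKDDEEK") < 0
```

    Positive scores indicate net hydrophobic sequences, which is why the abstract uses GRAVY > 0 as a marker of hydrophobic membrane proteins.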

  7. Seed storage proteins as a system for teaching protein identification by mass spectrometry in biochemistry laboratory.

    PubMed

    Wilson, Karl A; Tan-Wilson, Anna

    2013-01-01

    Mass spectrometry (MS) has become an important tool in studying biological systems. One application is the identification of proteins and peptides by the matching of peptide and peptide fragment masses to the sequences of proteins in protein sequence databases. Often prior protein separation of complex protein mixtures by 2D-PAGE is needed, requiring more time and expertise than instructors of large laboratory classes can devote. We have developed an experimental module for our Biochemistry Laboratory course that engages students in MS-based protein identification following protein separation by one-dimensional SDS-PAGE, a technique that is usually taught in this type of course. The module is based on soybean seed storage proteins, a relatively simple mixture of proteins present in high levels in the seed, allowing the identification of the main protein bands by MS/MS and in some cases, even by peptide mass fingerprinting. Students can identify their protein bands using software available on the Internet, and are challenged to deduce post-translational modifications that have occurred upon germination. A collection of mass spectral data and tutorials that can be used as a stand-alone computer-based laboratory module were also assembled. Copyright © 2013 International Union of Biochemistry and Molecular Biology, Inc.

  8. Identification of Modules in Protein-Protein Interaction Networks

    NASA Astrophysics Data System (ADS)

    Erten, Sinan; Koyutürk, Mehmet

    In biological systems, most processes are carried out through orchestration of multiple interacting molecules. These interactions are often abstracted using network models. A key feature of cellular networks is their modularity, which contributes significantly to the robustness, as well as adaptability of biological systems. Therefore, modularization of cellular networks is likely to be useful in obtaining insights into the working principles of cellular systems, as well as building tractable models of cellular organization and dynamics. A common, high-throughput source of data on molecular interactions is in the form of physical interactions between proteins, which are organized into protein-protein interaction (PPI) networks. This chapter provides an overview on identification and analysis of functional modules in PPI networks, which has been an active area of research in the last decade.

  9. General M13 phage display: M13 phage display in identification and characterization of protein-protein interactions.

    PubMed

    Hertveldt, Kirsten; Beliën, Tim; Volckaert, Guido

    2009-01-01

    In M13 phage display, proteins and peptides are exposed on one of the surface proteins of filamentous phage particles and become accessible to affinity enrichment against a bait of interest. We describe the construction of fragmented whole genome and gene fragment phage display libraries and interaction selection by panning. This strategy allows the identification and characterization of interacting proteins on a genomic scale by screening the fragmented "proteome" against protein baits. Gene fragment libraries allow a more in depth characterization of the protein-protein interaction site by identification of the protein region involved in the interaction.

  10. Identification of Trypanosome Proteins in Plasma from African Sleeping Sickness Patients Infected with T. b. rhodesiense

    PubMed Central

    Enyaru, John C.; Carr, Steven A.; Pearson, Terry W.

    2013-01-01

    Control of human African sleeping sickness, caused by subspecies of the protozoan parasite Trypanosoma brucei, is based on preventing transmission by elimination of the tsetse vector and by active diagnostic screening and treatment of infected patients. To identify trypanosome proteins that have potential as biomarkers for detection and monitoring of African sleeping sickness, we have used a "deep-mining" proteomics approach to identify trypanosome proteins in human plasma. Abundant human plasma proteins were removed by immunodepletion. Depleted plasma samples were then digested to peptides with trypsin, fractionated by basic reversed phase and each fraction analyzed by liquid chromatography-tandem mass spectrometry (LC-MS/MS). This sample processing and analysis method enabled identification of low levels of trypanosome proteins in pooled plasma from late stage sleeping sickness patients infected with Trypanosoma brucei rhodesiense. A total of 254 trypanosome proteins were confidently identified. Many of the parasite proteins identified were of unknown function, although metabolic enzymes, chaperones, proteases and ubiquitin-related/acting proteins were found. This approach to the identification of conserved, soluble trypanosome proteins in human plasma offers a possible route to improved disease diagnosis and monitoring, since these molecules are potential biomarkers for the development of a new generation of antigen-detection assays. The combined immuno-depletion/mass spectrometric approach can be applied to a variety of infectious diseases for unbiased biomarker identification. PMID:23951171

  11. Identification of Trypanosome proteins in plasma from African sleeping sickness patients infected with T. b. rhodesiense.

    PubMed

    Eyford, Brett A; Ahmad, Rushdy; Enyaru, John C; Carr, Steven A; Pearson, Terry W

    2013-01-01

    Control of human African sleeping sickness, caused by subspecies of the protozoan parasite Trypanosoma brucei, is based on preventing transmission by elimination of the tsetse vector and by active diagnostic screening and treatment of infected patients. To identify trypanosome proteins that have potential as biomarkers for detection and monitoring of African sleeping sickness, we have used a "deep-mining" proteomics approach to identify trypanosome proteins in human plasma. Abundant human plasma proteins were removed by immunodepletion. Depleted plasma samples were then digested to peptides with trypsin, fractionated by basic reversed phase and each fraction analyzed by liquid chromatography-tandem mass spectrometry (LC-MS/MS). This sample processing and analysis method enabled identification of low levels of trypanosome proteins in pooled plasma from late stage sleeping sickness patients infected with Trypanosoma brucei rhodesiense. A total of 254 trypanosome proteins were confidently identified. Many of the parasite proteins identified were of unknown function, although metabolic enzymes, chaperones, proteases and ubiquitin-related/acting proteins were found. This approach to the identification of conserved, soluble trypanosome proteins in human plasma offers a possible route to improved disease diagnosis and monitoring, since these molecules are potential biomarkers for the development of a new generation of antigen-detection assays. The combined immuno-depletion/mass spectrometric approach can be applied to a variety of infectious diseases for unbiased biomarker identification.

  12. 34 CFR 200.32 - Identification for school improvement.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 34 Education 1 2010-07-01 2010-07-01 false Identification for school improvement. 200.32 Section... Improving Basic Programs Operated by Local Educational Agencies Lea and School Improvement § 200.32 Identification for school improvement. (a)(1)(i) An LEA must identify for school improvement any elementary or...

  13. 34 CFR 200.32 - Identification for school improvement.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 34 Education 1 2011-07-01 2011-07-01 false Identification for school improvement. 200.32 Section... Improving Basic Programs Operated by Local Educational Agencies Lea and School Improvement § 200.32 Identification for school improvement. (a)(1)(i) An LEA must identify for school improvement any elementary or...

  14. IFPTarget: A Customized Virtual Target Identification Method Based on Protein-Ligand Interaction Fingerprinting Analyses.

    PubMed

    Li, Guo-Bo; Yu, Zhu-Jun; Liu, Sha; Huang, Lu-Yi; Yang, Ling-Ling; Lohans, Christopher T; Yang, Sheng-Yong

    2017-07-24

    Small-molecule target identification is an important and challenging task for chemical biology and drug discovery. Structure-based virtual target identification has been widely used, which infers and prioritizes potential protein targets for the molecule of interest (MOI) principally via a scoring function. However, current "universal" scoring functions may not always accurately identify targets to which the MOI binds from the retrieved target database, in part due to a lack of consideration of the important binding features for an individual target. Here, we present IFPTarget, a customized virtual target identification method, which uses an interaction fingerprinting (IFP) method for target-specific interaction analyses and a comprehensive index (Cvalue) for target ranking. Evaluation results indicate that the IFP method enables substantially improved binding pose prediction, and Cvalue has an excellent performance in target ranking for the test set. When applied to screen against our established target library that contains 11,863 protein structures covering 2842 unique targets, IFPTarget could retrieve known targets within the top-ranked list and identified new potential targets for chemically diverse drugs. IFPTarget prediction led to the identification of the metallo-β-lactamase VIM-2 as a target for quercetin as validated by enzymatic inhibition assays. This study provides a new in silico target identification tool and will aid future efforts to develop new target-customized methods for target identification.

  15. De novo identification of highly diverged protein repeats by probabilistic consistency.

    PubMed

    Biegert, A; Söding, J

    2008-03-15

    An estimated 25% of all eukaryotic proteins contain repeats, which underlines the importance of duplication for evolving new protein functions. Internal repeats often correspond to structural or functional units in proteins. Methods capable of identifying diverged repeated segments or domains at the sequence level can therefore assist in predicting domain structures, inferring hypotheses about function and mechanism, and investigating the evolution of proteins from smaller fragments. We present HHrepID, a method for the de novo identification of repeats in protein sequences. It is able to detect the sequence signature of structural repeats in many proteins that have not yet been known to possess internal sequence symmetry, such as outer membrane beta-barrels. HHrepID uses HMM-HMM comparison to exploit evolutionary information in the form of multiple sequence alignments of homologs. In contrast to a previous method, the new method (1) generates a multiple alignment of repeats; (2) utilizes the transitive nature of homology through a novel merging procedure with fully probabilistic treatment of alignments; (3) improves alignment quality through an algorithm that maximizes the expected accuracy; (4) is able to identify different kinds of repeats within complex architectures by a probabilistic domain boundary detection method and (5) improves sensitivity through a new approach to assess statistical significance. Server: http://toolkit.tuebingen.mpg.de/hhrepid; Executables: ftp://ftp.tuebingen.mpg.de/pub/protevo/HHrepID

  16. Identification of DNA-binding proteins using multi-features fusion and binary firefly optimization algorithm.

    PubMed

    Zhang, Jian; Gao, Bo; Chai, Haiting; Ma, Zhiqiang; Yang, Guifu

    2016-08-26

    DNA-binding proteins (DBPs) play fundamental roles in many biological processes. Therefore, the development of effective computational tools for identifying DBPs is becoming highly desirable. In this study, we proposed an accurate method for the prediction of DBPs. Firstly, we focused on the challenge of improving DBP prediction accuracy with information solely from the sequence. Secondly, we used multiple informative features to encode the protein. These features included evolutionary conservation profile, secondary structure motifs, and physicochemical properties. Thirdly, we introduced a novel improved Binary Firefly Algorithm (BFA) to remove redundant or noisy features as well as select optimal parameters for the classifier. In experiments on two benchmark datasets, our predictor outperformed many state-of-the-art predictors, which revealed the effectiveness of our method. The promising prediction performance on a newly compiled independent testing dataset from PDB and a large-scale dataset from UniProt proved the good generalization ability of our method. In addition, the BFA forged in this research would be of great potential in practical applications in optimization fields, especially in feature selection problems. A highly accurate method was proposed for the identification of DBPs. A user-friendly web-server named iDbP (identification of DNA-binding Proteins) was constructed and provided for academic use.

  17. Choosing an Optimal Database for Protein Identification from Tandem Mass Spectrometry Data.

    PubMed

    Kumar, Dhirendra; Yadav, Amit Kumar; Dash, Debasis

    2017-01-01

    Database searching is the preferred method for protein identification from digital spectra of mass-to-charge ratios (m/z) detected for protein samples through mass spectrometers. The search database is one of the major influencing factors in discovering proteins present in the sample and thus in deriving biological conclusions. In most cases the choice of search database is arbitrary. Here we describe common search databases used in proteomic studies and their impact on the final list of identified proteins. We also elaborate upon factors like composition and size of the search database that can influence the protein identification process. In conclusion, we suggest that the choice of database depends on the type of inferences to be derived from proteomics data. However, making additional efforts to build a compact and concise database for a targeted question should generally be rewarding in achieving confident protein identifications.

  18. Identification of shed proteins from Chinese hamster ovary cells: Application of statistical confidence using human and mouse protein databases

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ahram, Mamoun; Strittmatter, Eric F.; Monroe, Matthew E.

    The shedding process releases ligands, receptors, and other proteins from the surface of the cell and is a mechanism whereby cells communicate. Even though altered regulation of this process has been implicated in several diseases, global approaches to evaluate shed proteins have not been developed. A goal of this study was to identify global changes in shed proteins in media taken from cells exposed to low doses of radiation in an effort to develop a fundamental understanding of the bystander response. CHO cells were chosen for this study because they have been widely used for radiation studies and since they have been reported to respond to radiation by releasing factors into the media that cause genomic instability and cytotoxicity in unexposed cells, i.e., a bystander effect. Media samples taken for irradiated cells were evaluated using a combination of tandem- and FTICR-mass spectrometry analysis. Since the hamster genome has not been sequenced, mass spectrometry data were searched against the mouse and human protein databases. Nearly 150 proteins that were identified by tandem mass spectrometry were confirmed by FTICR. When both types of mass spectrometry data were evaluated with a new confidence scoring tool, which is based on discriminant analyses, about 500 proteins were identified. Approximately 20% of these identifications were either integral membrane proteins or membrane associated proteins, suggesting that they were derived from the cell surface, hence were likely shed. However, estimates of quantitative changes, based on two independent mass spectrometry approaches, did not identify any protein abundance changes attributable to the bystander effect. Results from this study demonstrate the feasibility of global evaluation of shed proteins using mass spectrometry in conjunction with cross-species protein databases and that significant improvement in peptide/protein identifications is provided by the confidence scoring tool.

  19. Identification of Conserved Water Sites in Protein Structures for Drug Design.

    PubMed

    Jukič, Marko; Konc, Janez; Gobec, Stanislav; Janežič, Dušanka

    2017-12-26

    Identification of conserved waters in protein structures is a challenging task with applications in molecular docking and protein stability prediction. As an alternative to computationally demanding simulations of proteins in water, experimental cocrystallized waters in the Protein Data Bank (PDB) in combination with a local structure alignment algorithm can be used for reliable prediction of conserved water sites. We developed the ProBiS H2O approach based on the previously developed ProBiS algorithm, which enables identification of conserved water sites in proteins using experimental protein structures from the PDB or a set of custom protein structures available to the user. With a protein structure, a binding site, or an individual water molecule as a query, ProBiS H2O collects similar proteins from the PDB and performs local or binding site-specific superimpositions of the query structure with similar proteins using the ProBiS algorithm. It collects the experimental water molecules from the similar proteins and transposes them to the query protein. Transposed waters are clustered by their mutual proximity, which enables identification of discrete sites in the query protein with high water conservation. ProBiS H2O is a robust and fast new approach that uses existing experimental structural data to identify conserved water sites on the interfaces of protein complexes, for example protein-small molecule interfaces, and elsewhere on the protein structures. It has been successfully validated in several reported proteins in which conserved water molecules were found to play an important role in ligand binding with applications in drug design.
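
    The clustering step described in this abstract (grouping transposed waters by mutual proximity to find conserved sites) can be sketched as follows. This is not the ProBiS H2O implementation: it assumes simple single-linkage clustering with a distance cutoff, and both the function name cluster_waters and the 1.0 Å default cutoff are hypothetical choices made only for illustration.

```python
import math
from collections import defaultdict


def cluster_waters(coords, cutoff=1.0):
    """Group water oxygen positions into clusters by mutual proximity.

    coords is a list of (x, y, z) tuples in angstroms. Two waters are
    linked when they lie within cutoff of each other (an assumed value),
    and linked waters end up in the same cluster (single linkage).
    Returns clusters as lists of indices, largest cluster first.
    """
    parent = list(range(len(coords)))

    def find(i):
        # Follow parent links to the cluster root, compressing the path.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(coords)):
        for j in range(i + 1, len(coords)):
            if math.dist(coords[i], coords[j]) <= cutoff:
                parent[find(i)] = find(j)

    clusters = defaultdict(list)
    for i in range(len(coords)):
        clusters[find(i)].append(i)
    return sorted(clusters.values(), key=len, reverse=True)
```

    In this sketch the size of each cluster stands in for water conservation at that site: a site populated by waters from many superimposed structures yields a large cluster, while an isolated water forms a cluster of one.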

  20. Establishment of a protein frequency library and its application in the reliable identification of specific protein interaction partners.

    PubMed

    Boulon, Séverine; Ahmad, Yasmeen; Trinkle-Mulcahy, Laura; Verheggen, Céline; Cobley, Andy; Gregor, Peter; Bertrand, Edouard; Whitehorn, Mark; Lamond, Angus I

    2010-05-01

    The reliable identification of protein interaction partners and how such interactions change in response to physiological or pathological perturbations is a key goal in most areas of cell biology. Stable isotope labeling with amino acids in cell culture (SILAC)-based mass spectrometry has been shown to provide a powerful strategy for characterizing protein complexes and identifying specific interactions. Here, we show how SILAC can be combined with computational methods drawn from the business intelligence field for multidimensional data analysis to improve the discrimination between specific and nonspecific protein associations and to analyze dynamic protein complexes. A strategy is shown for developing a protein frequency library (PFL) that improves on previous use of static "bead proteomes." The PFL annotates the frequency of detection in co-immunoprecipitation and pulldown experiments for all proteins in the human proteome. It can provide a flexible and objective filter for discriminating between contaminants and specifically bound proteins and can be used to normalize data values and facilitate comparisons between data obtained in separate experiments. The PFL is a dynamic tool that can be filtered for specific experimental parameters to generate a customized library. It will be continuously updated as data from each new experiment are added to the library, thereby progressively enhancing its utility. The application of the PFL to pulldown experiments is especially helpful in identifying either lower abundance or less tightly bound specific components of protein complexes that are otherwise lost among the large, nonspecific background.
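
    The core idea of a protein frequency library, as described in this abstract, is to record how often each protein is detected across many pulldown experiments and to use that frequency as a filter against nonspecific binders. The sketch below illustrates only that idea; the function names, the example protein identifiers and the 0.5 frequency threshold are hypothetical and are not taken from the published PFL.

```python
from collections import Counter


def build_frequency_library(experiments):
    """Map each protein to the fraction of experiments in which it was detected.

    experiments is a list of collections of protein identifiers, one
    collection per co-immunoprecipitation or pulldown experiment.
    """
    if not experiments:
        return {}
    counts = Counter(protein for exp in experiments for protein in set(exp))
    total = len(experiments)
    return {protein: count / total for protein, count in counts.items()}


def likely_specific(hits, library, max_frequency=0.5):
    """Keep hits whose detection frequency does not exceed max_frequency.

    Proteins absent from the library are treated as never seen before
    (frequency 0.0). The threshold is an assumed, adjustable value.
    """
    return [p for p in hits if library.get(p, 0.0) <= max_frequency]


# Hypothetical usage with made-up identifiers.
past_experiments = [
    {"HSP70", "TUBB", "A"},
    {"HSP70", "TUBB", "B"},
    {"HSP70", "C"},
    {"HSP70", "D"},
]
library = build_frequency_library(past_experiments)
candidates = likely_specific(["HSP70", "A", "TUBB", "X"], library)
```

    Because the library is recomputed from the list of experiments, adding each new experiment updates the frequencies, which mirrors the abstract's point that the PFL grows more useful as data accumulate.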

  1. Seed Storage Proteins as a System for Teaching Protein Identification by Mass Spectrometry in Biochemistry Laboratory

    ERIC Educational Resources Information Center

    Wilson, Karl A.; Tan-Wilson, Anna

    2013-01-01

    Mass spectrometry (MS) has become an important tool in studying biological systems. One application is the identification of proteins and peptides by the matching of peptide and peptide fragment masses to the sequences of proteins in protein sequence databases. Often prior protein separation of complex protein mixtures by 2D-PAGE is needed,…

  2. Identification of Potent ACE Inhibitory Peptides from Wild Almond Proteins.

    PubMed

    Mirzapour, Mozhgan; Rezaei, Karamatollah; Sentandreu, Miguel Angel

    2017-10-01

    In this study, the production, fractionation, purification and identification of ACE (angiotensin-I-converting enzyme) inhibitory peptides from wild almond (Amygdalus scoparia) proteins were investigated. Wild almond proteins were hydrolyzed using 5 different enzymes (pepsin, trypsin, chymotrypsin, alcalase and flavourzyme) and assayed for their ACE inhibitory activities. The degree of ACE inhibiting activity obtained after hydrolysis was found to be in the following order: alcalase > chymotrypsin > trypsin/pepsin > flavourzyme. The hydrolysates obtained from alcalase (IC50 = 0.8 mg/mL) were fractionated by sequential ultrafiltration at 10 and 3 kDa cutoff values and the most active fraction (<3 kDa) was further separated using reversed phase high-performance liquid chromatography (RP-HPLC). Peptide sequence identifications were carried out on the most promising fractions obtained from RP-HPLC by means of liquid chromatography coupled to electrospray ionization and tandem mass spectrometry (LC-ESI-MS/MS). Sequencing of ACE inhibitory peptides present in fraction 26 of RP-HPLC resulted in the identification of 3 peptide sequences (VVNE, VVTR, and VVGVD) not reported previously in the literature. Sequence identification of fractions 40 and 42 from RP-HPLC, which showed the highest ACE inhibitory activities (84.1% and 86.9%, respectively), resulted in the identification of more than 40 potential ACE inhibitory sequences. The results indicate that wild almond protein is a rich source of potential antihypertensive peptides and can be suggested for applications in functional foods and drinks with respect to hindrance and mitigation of hypertension after in vivo assessment. This study has shown the potential of wild almond proteins as good sources for producing ACE-inhibitory active peptides. According to this finding, peptides with higher ACE inhibitory activities could be released during the gastrointestinal digestion and contribute to the health-promoting

  3. Improved Proteomic Analysis Following Trichloroacetic Acid Extraction of Bacillus anthracis Spore Proteins

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kaiser, Brooke LD; Wunschel, David S.; Sydor, Michael A.

    2015-08-07

    Proteomic analysis of bacterial samples provides valuable information about cellular responses and functions under different environmental pressures. Proteomic analysis is dependent upon efficient extraction of proteins from bacterial samples without introducing bias toward extraction of particular protein classes. While no single method can recover 100% of the bacterial proteins, selected protocols can improve overall protein isolation, peptide recovery, or enrich for certain classes of proteins. The method presented here is technically simple and does not require specialized equipment such as a mechanical disrupter. Our data reveal that for particularly challenging samples, such as B. anthracis Sterne spores, trichloroacetic acid extraction improved the number of proteins identified within a sample compared to bead beating (714 vs 660, respectively). Further, TCA extraction enriched for 103 known spore specific proteins whereas bead beating resulted in 49 unique proteins. Analysis of C. botulinum samples grown to 5 days, composed of vegetative biomass and spores, showed a similar trend with improved protein yields and identification using our method compared to bead beating. Interestingly, easily lysed samples, such as B. anthracis vegetative cells, were equally as effectively processed via TCA and bead beating, but TCA extraction remains the easiest and most cost effective option. As with all assays, supplemental methods such as implementation of an alternative preparation method may provide additional insight to the protein biology of the bacteria being studied.

  4. Effective Identification of Akt Interacting Proteins by Two-Step Chemical Crosslinking, Co-Immunoprecipitation and Mass Spectrometry

    PubMed Central

    Huang, Bill X.; Kim, Hee-Yong

    2013-01-01

    Akt is a critical protein for cell survival and known to interact with various proteins. However, Akt binding partners that modulate or regulate Akt activation have not been fully elucidated. Identification of Akt-interacting proteins has been customarily achieved by co-immunoprecipitation combined with western blot and/or MS analysis. An intrinsic problem of the method is loss of interacting proteins during procedures to remove non-specific proteins. Moreover, antibody contamination often interferes with the detection of less abundant proteins. Here, we developed a novel two-step chemical crosslinking strategy to overcome these problems which resulted in a dramatic improvement in identifying Akt interacting partners. Akt antibody was first immobilized on protein A/G beads using disuccinimidyl suberate and allowed to bind to cellular Akt along with its interacting proteins. Subsequently, dithiobis[succinimidylpropionate], a cleavable crosslinker, was introduced to produce stable complexes between Akt and binding partners prior to the SDS-PAGE and nanoLC-MS/MS analysis. This approach enabled identification of ten Akt partners from cell lysates containing as low as 1.5 mg proteins, including two new potential Akt interacting partners. None of these but one protein was detectable without crosslinking procedures. The present method provides a sensitive and effective tool to probe Akt-interacting proteins. This strategy should also prove useful for other protein interactions, particularly those involving less abundant or weakly associating partners. PMID:23613850

  5. Identification of AOSC-binding proteins in neurons

    NASA Astrophysics Data System (ADS)

    Liu, Ming; Nie, Qin; Xin, Xianliang; Geng, Meiyu

    2008-11-01

    Acidic oligosaccharide sugar chain (AOSC), a D-mannuronic acid oligosaccharide derived from brown algae polysaccharide, has completed a Phase I clinical trial in China as an anti-Alzheimer’s Disease (AD) drug candidate. The identification of AOSC-binding protein(s) in neurons is very important for understanding its mechanism of action. To determine the binding protein(s) of AOSC in neurons mediating its anti-AD activities, confocal microscopy, affinity chromatography, and liquid chromatography-tandem mass spectrometry (LC-MS/MS) analysis were used. Confocal microscopy analysis showed that AOSC binds to SH-SY5Y cells in a concentration-, time-, and temperature-dependent fashion. The AOSC binding proteins were purified by affinity chromatography and identified by LC-MS/MS analysis. The results showed that 349 proteins bind AOSC, including clathrin, adaptor protein-2 (AP-2) and amyloid precursor protein (APP). These results suggest that the binding/entrance of AOSC to neurons is probably responsible for its anti-AD activities.

  6. Biochemical component identification by plasmonic improved whispering gallery mode optical resonance based sensor

    NASA Astrophysics Data System (ADS)

    Saetchnikov, Vladimir A.; Tcherniavskaia, Elina A.; Saetchnikov, Anton V.; Schweiger, Gustav; Ostendorf, Andreas

    2014-05-01

    Experimental data are presented on the detection and identification of a variety of biochemical agents, such as proteins, microelements and antibiotics of different generations, in both single- and multi-component solutions over a wide range of concentrations, analyzed using the light scattering parameters of a whispering gallery mode optical resonance based sensor. Multiplexing on parameters and components has been realized using a developed fluidic sensor cell with dielectric microspheres fixed in an adhesive layer, together with data processing. Biochemical component identification has been performed by the developed network analysis techniques. The developed approach is demonstrated to be applicable both for single-agent and for multi-component biochemical analysis. A novel technique based on optical resonance on microring structures, plasmon resonance and identification tools has been developed. To improve the sensitivity of the microring structures, microspheres fixed by adhesive were treated beforehand with a gold nanoparticle solution. Another technique used thin film gold layers deposited on the substrate below the adhesive. Both biomolecule and nanoparticle injections caused considerable changes in the optical resonance spectra. Plasmonic gold layers of optimized thickness also improve the parameters of the optical resonance spectra. Thus, the advantages of plasmon-enhanced optical microcavity resonance combined with multiparameter identification tools are used for the development of a new platform for an ultra-sensitive label-free biomedical sensor.

  7. Identification of PDC-109-like protein(s) in buffalo seminal plasma.

    PubMed

    Harshan, Hiron M; Sankar, Surya; Singh, L P; Singh, Manish Kumar; Sudharani, S; Ansari, M R; Singh, S K; Majumdar, A C; Joshi, P

    2009-10-01

    The FN-2 family of seminal plasma proteins represents the major protein fraction of bovine seminal plasma. These proteins also constitute the major seminal plasma protein fraction in horse, goat and bison seminal plasma and are present in pig, rat, mouse, hamster and human seminal plasma. BSP-A1 and BSP-A2, the predominant proteins of the FN-2 family, are collectively termed PDC-109. FN-2 proteins play an important role in fertilization, including sperm capacitation and formation of oviductal sperm reservoirs. Significantly, BSP proteins were also shown to have negative effects in the context of sperm storage. No conclusive evidence for the presence of buffalo seminal plasma protein(s) similar to PDC-109 exists. Studies with buffalo seminal plasma indicated that isolation and identification of PDC-109-like protein(s) from buffalo seminal plasma by conventional methods might be difficult. Thus, antibodies raised against PDC-109 isolated and purified from cattle seminal plasma were used for investigating the presence of PDC-109-like protein(s) in buffalo seminal plasma. Buffalo seminal plasma proteins were resolved on SDS-PAGE, blotted to nitrocellulose membranes and probed for the presence of PDC-109-like protein(s) using the PDC-109 antisera raised in rabbits. A distinct immunoreactive band well below the 20-kDa region indicated the presence of PDC-109-like protein(s) in buffalo seminal plasma.

  8. Deglycosylation systematically improves N-glycoprotein identification in liquid chromatography-tandem mass spectrometry proteomics for analysis of cell wall stress responses in Saccharomyces cerevisiae lacking Alg3p.

    PubMed

    Bailey, Ulla-Maja; Schulz, Benjamin L

    2013-04-01

    Post-translational modification of proteins with glycosylation is of key importance in many biological systems in eukaryotes, influencing fundamental biological processes and regulating protein function. Changes in glycosylation are therefore of interest in understanding these processes and are also useful as clinical biomarkers of disease. The presence of glycosylation can also inhibit protease digestion and lower the quality and confidence of protein identification by mass spectrometry. While deglycosylation can improve the efficiency of subsequent protease digest and increase protein coverage, this step is often excluded from proteomic workflows. Here, we performed a systematic analysis that showed that deglycosylation with peptide-N-glycosidase F (PNGase F) prior to protease digestion with AspN or trypsin improved the quality of identification of the yeast cell wall proteome. The improvement in the confidence of identification of glycoproteins following PNGase F deglycosylation correlated with a higher density of glycosylation sites. Optimal identification across the proteome was achieved with PNGase F deglycosylation and complementary proteolysis with either AspN or trypsin. We used this combination of deglycosylation and complementary protease digest to identify changes in the yeast cell wall proteome caused by lack of the Alg3p protein, a key component of the biosynthetic pathway of protein N-glycosylation. The cell wall of yeast lacking Alg3p showed specifically increased levels of Cis3p, a protein important for cell wall integrity. Our results showed that deglycosylation prior to protease digestion improved the quality of proteomic analyses even if protein glycosylation is not of direct relevance to the study at hand. Copyright © 2013 Elsevier B.V. All rights reserved.

  9. Proteomic analysis of human aqueous humor using multidimensional protein identification technology

    PubMed Central

    Richardson, Matthew R.; Price, Marianne O.; Price, Francis W.; Pardo, Jennifer C.; Grandin, Juan C.; You, Jinsam; Wang, Mu

    2009-01-01

    Aqueous humor (AH) supports avascular tissues in the anterior segment of the eye, maintains intraocular pressure, and potentially influences the pathogenesis of ocular diseases. Nevertheless, the AH proteome is still poorly defined despite several previous efforts, which were hindered by interfering high abundance proteins, inadequate animal models, and limited proteomic technologies. To facilitate future investigations into AH function, the AH proteome was extensively characterized using an advanced proteomic approach. Samples from patients undergoing cataract surgery were pooled and depleted of interfering abundant proteins and thereby divided into two fractions: albumin-bound and albumin-depleted. Multidimensional Protein Identification Technology (MudPIT) was utilized for each fraction; this incorporates strong cation exchange chromatography to reduce sample complexity before reversed-phase liquid chromatography and tandem mass spectrometric analysis. Twelve proteins had multi-peptide, high confidence identifications in the albumin-bound fraction and 50 proteins had multi-peptide, high confidence identifications in the albumin-depleted fraction. Gene ontological analyses were performed to determine which cellular components and functions were enriched. Many proteins were previously identified in the AH and for several their potential role in the AH has been investigated; however, the majority of identified proteins were novel and only speculative roles can be suggested. The AH was abundant in anti-oxidant and immunoregulatory proteins as well as anti-angiogenic proteins, which may be involved in maintaining the avascular tissues. This is the first known report to extensively characterize and describe the human AH proteome and lays the foundation for future work regarding its function in homeostatic and pathologic states. PMID:20019884

  10. A Simple and Practical Dictionary-based Approach for Identification of Proteins in Medline Abstracts

    PubMed Central

    Egorov, Sergei; Yuryev, Anton; Daraselia, Nikolai

    2004-01-01

    Objective: The aim of this study was to develop a practical and efficient protein identification system for biomedical corpora. Design: The developed system, called ProtScan, utilizes a carefully constructed dictionary of mammalian proteins in conjunction with a specialized tokenization algorithm to identify and tag protein name occurrences in biomedical texts and also takes advantage of Medline “Name-of-Substance” (NOS) annotation. The dictionaries for ProtScan were constructed in a semi-automatic way from various public-domain sequence databases followed by an intensive expert curation step. Measurements: The recall and precision of the system have been determined using 1,000 randomly selected and hand-tagged Medline abstracts. Results: The developed system is capable of identifying protein occurrences in Medline abstracts with a 98% precision and 88% recall. It was also found to be capable of processing approximately 300 abstracts per second. Without utilization of NOS annotation, precision and recall were found to be 98.5% and 84%, respectively. Conclusion: The developed system appears to be well suited for protein-based Medline indexing and can help to improve biomedical information retrieval. Further approaches to ProtScan's recall improvement also are discussed. PMID:14764613
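    The dictionary lookup and the precision/recall measurement described in this abstract can be illustrated with a minimal sketch. This is not the ProtScan implementation: the tokenizer, the three-entry dictionary, the sentence and the hand-tagged set are all hypothetical, chosen only to show how the two metrics are computed.

```python
import re

def tag_proteins(text, dictionary):
    """Return the set of tokens whose lowercase form is in the dictionary.

    Hypothetical stand-in for a dictionary-based tagger; the real system
    uses a specialized tokenization algorithm not reproduced here.
    """
    tokens = re.findall(r"[A-Za-z0-9\-]+", text)
    return {tok for tok in tokens if tok.lower() in dictionary}

def precision_recall(predicted, gold):
    """Precision = TP / predicted, recall = TP / gold."""
    tp = len(predicted & gold)
    precision = tp / len(predicted) if predicted else 0.0
    recall = tp / len(gold) if gold else 0.0
    return precision, recall

# Illustrative data only (not taken from the study).
dictionary = {"akt", "p53", "insulin"}
text = "Akt phosphorylation was reduced, while p53 and BRCA1 levels were unchanged."
gold = {"Akt", "p53", "BRCA1"}  # hand-tagged protein names

predicted = tag_proteins(text, dictionary)
p, r = precision_recall(predicted, gold)
print(f"precision={p:.2f} recall={r:.2f}")
```

    In this toy case every tagged name is correct, but BRCA1 is missing from the dictionary, so precision is high and recall is lower. That is the same direction of trade-off the abstract reports.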

  11. Improved Identification of Membrane Proteins by MALDI-TOF MS/MS Using Vacuum Sublimated Matrix Spots on an Ultraphobic Chip Surface

    PubMed Central

    Poetsch, Ansgar; Schlüsener, Daniela; Florizone, Christine; Eltis, Lindsay; Menzel, Christoph; Rögner, Matthias; Steinert, Kerstin; Roth, Udo

    2008-01-01

    Integral membrane proteins are notoriously difficult to identify and analyze by mass spectrometry because of their low abundance and limited number of trypsin cleavage sites. Our strategy to address this problem is based on a novel technology for MALDI-MS peptide sample preparation that increases the success rate of membrane protein identification by increasing the sensitivity of the MALDI-TOF system. For this, we used sample plates with predeposited matrix spots of CHCA crystals prepared by vacuum sublimation onto an extremely low wettable (ultraphobic) surface. In experiments using standard peptides, an up to 10-fold gain of sensitivity was found for on-chip preparations compared with classical dried-droplet preparations on a steel target. In order to assess the performance of the chips with membrane proteins, three model proteins (bacteriorhodopsin, subunit IV(a) of ATP synthase, and the cp47 subunit from photosystem II) were analyzed. To mimic realistic analysis conditions, purified proteins were separated by SDS-PAGE and digested with trypsin. The digest MALDI samples were prepared either by dried-droplet technique on steel plates using CHCA as matrix, or applied directly onto the matrix spots of the chip surface. Significantly higher signal-to-noise ratios were observed for all of the spectra resulting from on-chip preparations of different peptides. In a second series of experiments, the membrane proteome of Rhodococcus jostii RHA1 was investigated by AIEC/SDS-PAGE in combination with MALDI-TOF MS/MS. As in the first experiments, Coomassie-stained SDS-PAGE bands were digested and the two different preparation methods were compared. For preparations on the Mass·Spec·Turbo Chip, 43 of 60 proteins were identified, whereas only 30 proteins were reliably identified after classical sample preparation. Comparison of the obtained Mascot scores, which reflect the confidence level of the protein identifications, revealed that for 70% of the identified proteins

  12. Identification of DNA-binding proteins by combining auto-cross covariance transformation and ensemble learning.

    PubMed

    Liu, Bin; Wang, Shanyi; Dong, Qiwen; Li, Shumin; Liu, Xuan

    2016-04-20

    DNA-binding proteins play a pivotal role in various intra- and extra-cellular activities ranging from DNA replication to gene expression control. With the rapid development of next-generation sequencing techniques, the number of protein sequences is increasing at an unprecedented rate. Thus it is necessary to develop computational methods to identify DNA-binding proteins based only on protein sequence information. In this study, a novel method called iDNA-KACC is presented, which combines the Support Vector Machine (SVM) and the auto-cross covariance transformation. The protein sequences are first converted into a profile-based protein representation, and then converted into a series of fixed-length vectors by the auto-cross covariance transformation with Kmer composition. The sequence order effect can be effectively captured by this scheme. These vectors are then fed into an SVM to discriminate the DNA-binding proteins from the non-DNA-binding ones. iDNA-KACC achieves an overall accuracy of 75.16% and a Matthews correlation coefficient of 0.5 in a rigorous jackknife test. Its performance is further improved by employing an ensemble learning approach, and the improved predictor is called iDNA-KACC-EL. Experimental results on an independent dataset show that iDNA-KACC-EL outperforms all the other state-of-the-art predictors, indicating that it would be a useful computational tool for DNA-binding protein identification.
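    The two figures of merit quoted in this abstract, overall accuracy and the Matthews correlation coefficient (MCC), are both computed from a binary confusion matrix. The sketch below shows the standard formulas; the counts are made up for illustration and are not the study's data.

```python
import math

def accuracy(tp, tn, fp, fn):
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)

def matthews_corrcoef(tp, tn, fp, fn):
    """MCC = (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN))."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0  # common convention when a row or column is empty
    return (tp * tn - fp * fn) / denom

# Hypothetical balanced confusion matrix, for illustration only.
tp, tn, fp, fn = 75, 75, 25, 25
acc = accuracy(tp, tn, fp, fn)
mcc = matthews_corrcoef(tp, tn, fp, fn)
print(f"accuracy={acc:.2f} MCC={mcc:.2f}")
```

    On a balanced set with symmetric errors, MCC equals 2 × accuracy − 1, so an accuracy near 75% corresponds to an MCC near 0.5.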

  13. Cy5 maleimide labelling for sensitive detection of free thiols in native protein extracts: identification of seed proteins targeted by barley thioredoxin h isoforms.

    PubMed Central

    Maeda, Kenji; Finnie, Christine; Svensson, Birte

    2004-01-01

    Barley thioredoxin h isoforms HvTrxh1 and HvTrxh2 differ in temporal and spatial distribution and in kinetic properties. Target proteins of HvTrxh1 and HvTrxh2 were identified in mature seeds and in seeds after 72 h of germination. Improvement of the established method for identification of thioredoxin-targeted proteins based on two-dimensional electrophoresis and fluorescence labelling of thiol groups was achieved by application of a highly sensitive Cy5 maleimide dye and large-format two-dimensional gels, resulting in a 10-fold increase in the observed number of labelled protein spots. The technique also provided information about accessible thiol groups in the proteins identified in the barley seed proteome. In total, 16 different putative target proteins were identified from 26 spots using tryptic in-gel digestion, matrix-assisted laser-desorption ionization-time-of-flight MS and database search. HvTrxh1 and HvTrxh2 were shown to have similar target specificity. Barley alpha-amylase/subtilisin inhibitor, previously demonstrated to be reduced by both HvTrxh1 and HvTrxh2, was among the identified target proteins, confirming the suitability of the method. Several alpha-amylase/trypsin inhibitors, some of which are already known as target proteins of thioredoxin h, and cyclophilin known as a target protein of m-type thioredoxin were also identified. Lipid transfer protein, embryospecific protein, three chitinase isoenzymes, a single-domain glyoxalase-like protein and superoxide dismutase were novel identifications of putative target proteins, suggesting new physiological roles of thioredoxin h in barley seeds. PMID:14636158

  14. The Protein Interactome of Streptococcus pneumoniae and Bacterial Meta-interactomes Improve Function Predictions.

    PubMed

    Wuchty, S; Rajagopala, S V; Blazie, S M; Parrish, J R; Khuri, S; Finley, R L; Uetz, P

    2017-01-01

    The functions of roughly a third of all proteins in Streptococcus pneumoniae, a significant human-pathogenic bacterium, are unknown. Using a yeast two-hybrid approach, we have determined more than 2,000 novel protein interactions in this organism. We augmented this network with meta-interactome data that we defined as the pool of all interactions between evolutionarily conserved proteins in other bacteria. We found that such interactions significantly improved our ability to predict a protein's function, allowing us to provide functional predictions for 299 S. pneumoniae proteins with previously unknown functions. IMPORTANCE Identification of protein interactions in bacterial species can help define the individual roles that proteins play in cellular pathways and pathogenesis. Very few protein interactions have been identified for the important human pathogen S. pneumoniae. We used an experimental approach to identify over 2,000 new protein interactions for S. pneumoniae, the most extensive interactome data for this bacterium to date. To predict protein function, we used our interactome data augmented with interactions from other closely related bacteria. The combination of the experimental data and meta-interactome data significantly improved the prediction results, allowing us to assign possible functions to a large number of poorly characterized proteins.

  15. Isolation and identification of peanut leaf proteins regulated by water stress.

    PubMed

    Akkasaeng, Chutipong; Tantisuwichwong, Napaporn; Chairam, Issariya; Prakrongrak, Narumon; Jogloy, Sanun; Pathanothai, Aran

    2007-05-15

    Water deficits trigger signaling cascades leading to modulation of protein expression in plant tissues. Identification of peanut leaf proteins regulated by water stress provides some insight into the cellular and molecular response of peanut plants to drought stress. Peanut variety Khon Kaen 4, a water-stress sensitive variety, was grown in a growth chamber under a controlled environment. Water stress was imposed on day 30 after seedling emergence by withholding water from the plants for 6 days, and stressed plants were compared with plants adequately supplied with water. Total proteins were prepared from a leaflet of a fully expanded leaf on the main stem. Proteins were separated in duplicate gels using two-dimensional gel electrophoresis and visualized by silver nitrate staining. Image analysis was performed using ImageMaster 2D Platinum 5.0 to determine proteins regulated by water stress. The molecular mass and isoelectric point of each regulated protein were used in database queries for protein identification. One protein was induced under water stress and the homologous protein was identified as serine/threonine-protein phosphatase PP1. Five proteins were down-regulated by water deficit. The homologous proteins were chaperone protein DNAJ, auxin-responsive protein IAA29, peroxidase 43, caffeoyl-CoA O-methyltransferase and SNF1-related protein kinase regulatory subunit beta-2. Down-regulated proteins may be associated with the sensitivity of the peanut variety to water stress.

  16. Experimental Methods for Protein Interaction Identification and Characterization

    NASA Astrophysics Data System (ADS)

    Uetz, Peter; Titz, Björn; Cagney, Gerard

    There are dozens of methods for the detection of protein-protein interactions but they fall into a few broad categories. Fragment complementation assays such as the yeast two-hybrid (Y2H) system are based on split proteins that are functionally reconstituted by fusions of interacting proteins. Biophysical methods include structure determination and mass spectrometric (MS) identification of proteins in complexes. Biochemical methods include methods such as far western blotting and peptide arrays. Only the Y2H and protein complex purification combined with MS have been used on a larger scale. Due to the lack of data it is still difficult to compare these methods with respect to their efficiency and error rates. Current data does not favor any particular method and thus multiple experimental approaches are necessary to maximally cover the interactome of any target cell or organism.

  17. The Protein Interactome of Streptococcus pneumoniae and Bacterial Meta-interactomes Improve Function Predictions

    PubMed Central

    Rajagopala, S. V.; Blazie, S. M.; Parrish, J. R.; Khuri, S.; Finley, R. L.

    2017-01-01

    ABSTRACT The functions of roughly a third of all proteins in Streptococcus pneumoniae, a significant human-pathogenic bacterium, are unknown. Using a yeast two-hybrid approach, we have determined more than 2,000 novel protein interactions in this organism. We augmented this network with meta-interactome data that we defined as the pool of all interactions between evolutionarily conserved proteins in other bacteria. We found that such interactions significantly improved our ability to predict a protein’s function, allowing us to provide functional predictions for 299 S. pneumoniae proteins with previously unknown functions. IMPORTANCE Identification of protein interactions in bacterial species can help define the individual roles that proteins play in cellular pathways and pathogenesis. Very few protein interactions have been identified for the important human pathogen S. pneumoniae. We used an experimental approach to identify over 2,000 new protein interactions for S. pneumoniae, the most extensive interactome data for this bacterium to date. To predict protein function, we used our interactome data augmented with interactions from other closely related bacteria. The combination of the experimental data and meta-interactome data significantly improved the prediction results, allowing us to assign possible functions to a large number of poorly characterized proteins. PMID:28744484

  18. Strategies for the enrichment and identification of basic proteins in proteome projects.

    PubMed

    Bae, Soo-Han; Harris, Andrew G; Hains, Peter G; Chen, Hong; Garfin, David E; Hazell, Stuart L; Paik, Young-Ki; Walsh, Bradley J; Cordwell, Stuart J

    2003-05-01

    Two-dimensional gel electrophoresis (2-DE) is currently the method of choice for separating complex mixtures of proteins for visual comparison in proteome analysis. This technology, however, is biased against certain classes of proteins including low abundance and hydrophobic proteins. Proteins with extremely alkaline isoelectric points (pI) are often very poorly represented using 2-DE technology, even when complex mixtures are separated using commercially available pH 6-11 or pH 7-10 immobilized pH gradients. The genome of the human gut pathogen, Helicobacter pylori, is dominated by genes encoding basic proteins, and is therefore a useful model for examining methodology suitable for separating such proteins. H. pylori proteins were separated on pH 6-11 and novel pH 9-12 immobilized pH gradients and 65 protein spots were subjected to matrix-assisted laser desorption/ionization-time of flight mass spectrometry, leading to the identification of 49 unique proteins. No proteins were characterized with a theoretical pI of greater than 10.23. A second approach to examine extremely alkaline proteins (pI > 9.0) utilized a prefractionation isoelectric focusing. Proteins were separated into two fractions using Gradiflow technology, and the extremely basic fraction subjected to both sodium dodecyl sulphate-polyacrylamide gel electrophoresis and liquid chromatography (LC) - tandem mass spectrometry post-tryptic digest, allowing the identification of 17 and 13 proteins, respectively. Gradiflow separations were highly specific for proteins with pI > 9.0, however, a single LC separation only allowed the identification of peptides from highly abundant proteins. These methods and those encompassing multiple LC 'dimensions' may be a useful complement to 2-DE for 'near-to-total' proteome coverage in the alkaline pH range.

  19. Automated protein identification by the combination of MALDI MS and MS/MS spectra from different instruments.

    PubMed

    Levander, Fredrik; James, Peter

    2005-01-01

    The identification of proteins separated on two-dimensional gels is most commonly performed by trypsin digestion and subsequent matrix-assisted laser desorption ionization (MALDI) with time-of-flight (TOF). Recently, atmospheric pressure (AP) MALDI coupled to an ion trap (IT) has emerged as a convenient method to obtain tandem mass spectra (MS/MS) from samples on MALDI target plates. In the present work, we investigated the feasibility of using the two methodologies in line as a standard method for protein identification. In this setup, the high mass accuracy MALDI-TOF spectra are used to calibrate the peptide precursor masses in the lower mass accuracy AP-MALDI-IT MS/MS spectra. Several software tools were developed to automate the analysis process. Two sets of MALDI samples, consisting of 142 and 421 gel spots, respectively, were analyzed in a highly automated manner. In the first set, the protein identification rate increased from 61% for MALDI-TOF only to 85% for MALDI-TOF combined with AP-MALDI-IT. In the second data set the increase in protein identification rate was from 44% to 58%. AP-MALDI-IT MS/MS spectra were in general less effective than the MALDI-TOF spectra for protein identification, but the combination of the two methods clearly enhanced the confidence in protein identification.

  20. Identification of cell wall proteins in the flax (Linum usitatissimum) stem.

    PubMed

    Day, Arnaud; Fénart, Stéphane; Neutelings, Godfrey; Hawkins, Simon; Rolando, Christian; Tokarski, Caroline

    2013-03-01

    Sequential salt (CaCl2, LiCl) extractions were used to obtain fractions enriched in cell wall proteins (CWPs) from the stem of 60-day-old flax (Linum usitatissimum) plants. High-resolution FT-ICR MS analysis and the use of recently published genomic data allowed the identification of 11 912 peptides corresponding to a total of 1418 different proteins. Subcellular localization using TargetP, Predotar, and WoLF PSORT led to the identification of 152 putative flax CWPs that were classified into nine different functional classes previously established for Arabidopsis thaliana. Examination of different functional classes revealed the presence of a number of proteins known to be involved in, or potentially involved in cell-wall metabolism in plants. The flax stem cell wall proteome was also compared with transcriptomic data previously obtained on comparable samples. This study represents a major contribution to the identification of CWPs in flax and will lead to a better understanding of cell wall biology in this species. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  1. PconsFold: improved contact predictions improve protein models.

    PubMed

    Michel, Mirco; Hayat, Sikander; Skwark, Marcin J; Sander, Chris; Marks, Debora S; Elofsson, Arne

    2014-09-01

    Recently it has been shown that the quality of protein contact prediction from evolutionary information can be improved significantly if direct and indirect information is separated. Given sufficiently large protein families, the contact predictions contain sufficient information to predict the structure of many protein families. However, since the first studies, contact prediction methods have improved. Here, we ask how much the final models are improved if improved contact predictions are used. In a small benchmark of 15 proteins, we show that the TM-scores of top-ranked models are improved by on average 33% using PconsFold compared with the original version of EVfold. In a larger benchmark, we find that the quality is improved by 15-30% when using PconsC in comparison with earlier contact prediction methods. Further, using Rosetta instead of CNS does not significantly improve global model accuracy, but the chemistry of models generated with Rosetta is improved. PconsFold is a fully automated pipeline for ab initio protein structure prediction based on evolutionary information. PconsFold is based on PconsC contact prediction and uses the Rosetta folding protocol. Due to its modularity, the contact prediction tool can be easily exchanged. The source code of PconsFold is available on GitHub at https://www.github.com/ElofssonLab/pcons-fold under the MIT license. PconsC is available from http://c.pcons.net/. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.

  2. Proteomic identification of erythrocyte membrane protein deficiency in hereditary spherocytosis.

    PubMed

    Peker, Selen; Akar, Nejat; Demiralp, Duygu Ozel

    2012-03-01

    Hereditary spherocytosis (HS) is the most common congenital hemolytic anemia in Caucasians, with an estimated prevalence ranging from 1:2000 to 1:5000. A molecular defect in one of the erythrocyte (RBC) membrane proteins underlying HS, such as spectrin-α, spectrin-β, ankyrin, band 3 and protein 4.2, leads to membrane destabilization and vesiculation and may change the RBCs into denser and more rigid cells (spherocytes), which are removed by the spleen, leading to the development of hemolytic anemia. HS is classified as mild, moderate or severe, according to the degree of the hemolytic anemia and the associated symptoms. Two-dimensional gel electrophoresis (2-DE) is a potentially valuable method for studying heritable disorders such as HS that involve membrane proteins. This technique, which separates proteins based on two biophysically unrelated parameters, molecular weight and charge, is a good option in clinical proteomics because of its ability to separate complex mixtures and to display post-translational modifications and changes after phosphorylation. In this study, we used contemporary methods with some modifications for the solubilisation, separation and identification of erythrocyte membrane proteins in normal and in HS RBCs. Expression differences in spectrin alpha and beta chains, ankyrin and band 3 proteins were found with PDQuest software 8.0.1, and peptide mass fingerprinting (PMF) analysis was performed to identify the proteins.

  3. Proteogenomic Analysis Greatly Expands the Identification of Proteins Related to Reproduction in the Apogamous Fern Dryopteris affinis ssp. affinis.

    PubMed

    Grossmann, Jonas; Fernández, Helena; Chaubey, Pururawa M; Valdés, Ana E; Gagliardini, Valeria; Cañal, María J; Russo, Giancarlo; Grossniklaus, Ueli

    2017-01-01

    Performing proteomic studies on non-model organisms with little or no genomic information is still difficult. However, many specific processes and biochemical pathways occur only in species that are poorly characterized at the genomic level. For example, many plants can reproduce both sexually and asexually, the former allowing the generation of new genotypes and the latter their fixation. Thus, both modes of reproduction are of great agronomic value. However, the molecular basis of asexual reproduction is not well understood in any plant. In ferns, it combines the production of unreduced spores (diplospory) and the formation of sporophytes from somatic cells (apogamy). To set the basis to study these processes, we performed transcriptomics by next-generation sequencing (NGS) and shotgun proteomics by tandem mass spectrometry in the apogamous fern D. affinis ssp. affinis. For protein identification we used the public viridiplantae database (VPDB) to identify orthologous proteins from other plant species and new transcriptomics data to generate a "species-specific transcriptome database" (SSTDB). In total 1,397 protein clusters with 5,865 unique peptide sequences were identified (13 decoy proteins out of 1,410, protFDR 0.93% on protein cluster level). We show that using the SSTDB for protein identification increases the number of identified peptides almost four times compared to using only the publicly available VPDB. We identified homologs of proteins involved in reproduction of higher plants, including proteins with a potential role in apogamy. With the increasing availability of genomic data from non-model species, similar proteogenomics approaches will improve the sensitivity in protein identification for species only distantly related to models.
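
    The protein-level FDR quoted in this abstract is consistent with a simple target-decoy estimate, in which decoy hits are divided by target hits. The following is a minimal sketch under that assumption; the abstract does not state the exact estimator used, and the function name `target_decoy_fdr` is hypothetical.

```python
def target_decoy_fdr(n_total: int, n_decoy: int) -> float:
    """Estimate FDR as decoy hits / target hits (assumed estimator)."""
    n_target = n_total - n_decoy
    if n_target <= 0:
        raise ValueError("need at least one target hit")
    return n_decoy / n_target


# Counts from the abstract: 1,410 protein clusters in total, 13 of them decoys,
# which leaves the 1,397 reported target clusters.
fdr = target_decoy_fdr(1410, 13)
print(f"{fdr:.2%}")
```

    Dividing by the total number of hits instead of the target hits would give a slightly lower value, so the choice of denominator is worth stating whenever an FDR is reported.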

  4. Identification of compound-protein interactions through the analysis of gene ontology, KEGG enrichment for proteins and molecular fragments of compounds.

    PubMed

    Chen, Lei; Zhang, Yu-Hang; Zheng, Mingyue; Huang, Tao; Cai, Yu-Dong

    2016-12-01

    Compound-protein interactions play important roles in every cell via the recognition and regulation of specific functional proteins. The correct identification of compound-protein interactions can lead to a good comprehension of this complicated system and provide useful input for the investigation of various attributes of compounds and proteins. In this study, we attempted to understand this system by extracting properties from both proteins and compounds, in which proteins were represented by gene ontology and KEGG pathway enrichment scores and compounds were represented by molecular fragments. Advanced feature selection methods, including minimum redundancy maximum relevance and incremental feature selection, together with the basic machine learning algorithm random forest, were used to analyze these properties and extract core factors for the determination of actual compound-protein interactions. Compound-protein interactions reported in The Binding Databases were used as positive samples. To improve the reliability of the results, the analytic procedure was executed five times using different negative samples. Simultaneously, five optimal prediction methods based on a random forest and yielding maximum MCCs of approximately 77.55% were constructed and may be useful tools for the prediction of compound-protein interactions. This work provides new clues to understanding the system of compound-protein interactions by analyzing extracted core features. Our results indicate that compound-protein interactions are related to biological processes involving immune, developmental and hormone-associated pathways.
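
    The MCC (Matthews correlation coefficient) reported in this abstract is a standard measure for binary classifiers computed from the confusion matrix. As a rough illustration of how such a value is obtained, here is a minimal sketch; the counts are hypothetical and are not taken from the paper.

```python
import math


def matthews_corrcoef(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0  # common convention when a row or column of the matrix is empty
    return (tp * tn - fp * fn) / denom


# Hypothetical counts for a set of interacting / non-interacting pairs.
mcc = matthews_corrcoef(tp=90, tn=85, fp=15, fn=10)
print(round(mcc, 4))
```

    The coefficient ranges from -1 (every prediction wrong) to 1 (every prediction right), so a value quoted as a percentage, as in the abstract, corresponds to the coefficient multiplied by 100.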

  5. A water-soluble conjugated polymer for protein identification and denaturation detection.

    PubMed

    Xu, Qingling; Wu, Chunxian; Zhu, Chunlei; Duan, Xinrui; Liu, Libing; Han, Yuchun; Wang, Yilin; Wang, Shu

    2010-12-03

    Rapid and sensitive methods to detect proteins and protein denaturation have become increasingly necessary in the fields of proteomics, medical diagnostics, and biology. In this paper, we report the synthesis of a new cationic water-soluble conjugated polymer that contains fluorene and diene moieties in the backbone (PFDE) for protein identification by sensing an array of PFDE solutions in different ionic strengths using the linear discriminant analysis technique (LDA). The PFDE can form complexes with proteins by electrostatic and/or hydrophobic interactions and exhibits different fluorescence responses. Three main factors contribute to the fluorescence response of PFDE, namely, the net charge density on the protein surface, the hydrophobic nature of the protein, and the metalloprotein characteristics. The denaturation of proteins can also be detected using PFDE as a fluorescent probe. The interactions between PFDE and proteins were also studied by dynamic light scattering (DLS) and isothermal titration microcalorimetry (ITC) techniques. In contrast to other methods based on conjugated polymers, the synthesis of a series of quencher or dye-labeled acceptors or protein substrates has been avoided in our method, which significantly reduces the cost and the synthetic complexity. Our method provides promising applications in protein identification and denaturation detection in a simple, fast, and label-free manner based on non-specific interaction-induced perturbation of PFDE fluorescence response.

  6. Mass spectrometry and animal science: protein identification strategies and particularities of farm animal species.

    PubMed

    Soares, Renata; Franco, Catarina; Pires, Elisabete; Ventosa, Miguel; Palhinhas, Rui; Koci, Kamila; Martinho de Almeida, André; Varela Coelho, Ana

    2012-07-19

    Proteomic approaches are gaining increasing importance in all fields of animal and veterinary sciences, including physiology, productive characterization, and disease/parasite tolerance, among others. Proteomic studies mainly aim at characterizing the proteome of a certain organ, tissue, cell type or organism, either in a specific condition or by comparing differential protein expression between two or more selected situations. Due to the high complexity of samples, usually total protein extracts, proteomics relies heavily on separation procedures, 2D-electrophoresis and HPLC being the most common, as well as on protein identification using mass spectrometry (MS) based methodologies. Despite the increasing importance of MS in the context of animal and veterinary science studies, the usefulness of such tools is still poorly perceived by the animal science community. This is primarily due to the limited knowledge of mass spectrometry among animal scientists. Additionally, confidence and success in protein identification are hindered by the lack of information in public databases for most farm animal species and their pathogens, with the exception of cattle (Bos taurus), pig (Sus scrofa) and chicken (Gallus gallus). In this article, we will briefly summarize the main methodologies available for protein identification using mass spectrometry, providing a case study of specific applications in the field of animal science. We will also address the difficulties inherent to protein identification using MS, with particular reference to experiments using animal species poorly described in public databases. Additionally, we will suggest strategies to increase the rate of successful identifications when working with farm animal species. Copyright © 2012 Elsevier B.V. All rights reserved.

  7. Efficient identification of tubby-binding proteins by an improved system of T7 phage display.

    PubMed

    Caberoy, Nora B; Zhou, Yixiong; Jiang, Xiaoyu; Alvarado, Gabriela; Li, Wei

    2010-01-01

    Mutation in the tubby gene causes adult-onset obesity and progressive retinal and cochlear degeneration through an unknown mechanism. In contrast, mutations in tubby-like protein 1 (Tulp1), whose C-terminus is highly homologous to tubby, only lead to retinal degeneration. We speculate that their diverse N-termini may define their distinct disease profiles. To elucidate the binding partners of tubby, we used tubby N-terminus (tubby-N) as bait to identify unknown binding proteins with open-reading-frame (ORF) phage display. T7 phage display was engineered with three improvements: high-quality ORF phage display cDNA library, specific phage elution by protease cleavage, and dual phage display for sensitive high throughput screening. The new system is capable of identifying unknown bait-binding proteins in as little as approximately 4-7 days. While phage display with conventional cDNA libraries identifies a high percentage of out-of-frame unnatural short peptides, all 28 tubby-N-binding clones identified by ORF phage display were ORFs. They encode 16 proteins, including 8 nuclear proteins. Fourteen proteins were analyzed by yeast two-hybrid assay and protein pull-down assay, with ten of them independently verified. Comparative binding analyses revealed several proteins binding to both tubby and Tulp1 as well as one tubby-specific binding protein. These data suggest that tubby-N is capable of interacting with multiple nuclear and cytoplasmic protein binding partners. These results demonstrated that the newly-engineered ORF phage display is a powerful technology to identify unknown protein-protein interactions. (c) 2009 John Wiley & Sons, Ltd.

  8. In silico re-identification of properties of drug target proteins.

    PubMed

    Kim, Baeksoo; Jo, Jihoon; Han, Jonghyun; Park, Chungoo; Lee, Hyunju

    2017-05-31

    Computational approaches in the identification of drug targets are expected to reduce time and effort in drug development. Advances in genomics and proteomics provide the opportunity to uncover properties of druggable genomes. Although several studies have been conducted for distinguishing drug targets from non-drug targets, they mainly focus on the sequences and functional roles of proteins. Many other properties of proteins have not been fully investigated. Using the DrugBank (version 3.0) database containing nearly 6,816 drug entries including 760 FDA-approved drugs and 1822 of their targets and human UniProt/Swiss-Prot databases, we defined 1578 non-redundant drug target and 17,575 non-drug target proteins. To select these non-redundant protein datasets, we built four datasets (A, B, C, and D) by considering clustering of paralogous proteins. We first reassessed the widely used properties of drug target proteins. We confirmed and extended that drug target proteins (1) are likely to have more hydrophobic, less polar, less PEST sequences, and more signal peptide sequences and (2) are more involved in enzyme catalysis, oxidation and reduction in cellular respiration, and operational genes. In this study, we proposed new properties (essentiality, expression pattern, PTMs, and solvent accessibility) for effectively identifying drug target proteins. We found that (1) drug targetability and protein essentiality are decoupled, (2) druggable proteins have high expression levels and tissue specificity, and (3) functional post-translational modification residues are enriched in drug target proteins. In addition, to predict the drug targetability of proteins, we exploited two machine learning methods (Support Vector Machine and Random Forest). When we predicted drug targets by combining previously known protein properties and proposed new properties, an F-score of 0.8307 was obtained.
When the newly proposed properties are integrated, the prediction performance

  9. Identification and modification of dynamical regions in proteins for alteration of enzyme catalytic effect

    DOEpatents

    Agarwal, Pratul K.

    2015-11-24

    A method for analysis, control, and manipulation for improvement of the chemical reaction rate of a protein-mediated reaction is provided. Enzymes, which typically comprise protein molecules, are very efficient catalysts that enhance chemical reaction rates by many orders of magnitude. Enzymes are widely used for a number of functions in chemical, biochemical, pharmaceutical, and other purposes. The method identifies key protein vibration modes that control the chemical reaction rate of the protein-mediated reaction, providing identification of the factors that enable the enzymes to achieve the high rate of reaction enhancement. By controlling these factors, the function of enzymes may be modulated, i.e., the activity can either be increased for faster enzyme reaction or it can be decreased when a slower enzyme is desired. This method provides an inexpensive and efficient solution by utilizing computer simulations, in combination with available experimental data, to build suitable models and investigate the enzyme activity.

  10. Identification and modification of dynamical regions in proteins for alteration of enzyme catalytic effect

    DOEpatents

    Agarwal, Pratul K.

    2013-04-09

    A method for analysis, control, and manipulation for improvement of the chemical reaction rate of a protein-mediated reaction is provided. Enzymes, which typically comprise protein molecules, are very efficient catalysts that enhance chemical reaction rates by many orders of magnitude. Enzymes are widely used for a number of functions in chemical, biochemical, pharmaceutical, and other purposes. The method identifies key protein vibration modes that control the chemical reaction rate of the protein-mediated reaction, providing identification of the factors that enable the enzymes to achieve the high rate of reaction enhancement. By controlling these factors, the function of enzymes may be modulated, i.e., the activity can either be increased for faster enzyme reaction or it can be decreased when a slower enzyme is desired. This method provides an inexpensive and efficient solution by utilizing computer simulations, in combination with available experimental data, to build suitable models and investigate the enzyme activity.

  11. Pooled protein immunization for identification of cell surface antigens in Streptococcus sanguinis.

    PubMed

    Ge, Xiuchun; Kitten, Todd; Munro, Cindy L; Conrad, Daniel H; Xu, Ping

    2010-07-26

    Available bacterial genomes provide opportunities for screening vaccines by reverse vaccinology. Efficient identification of surface antigens is required to reduce time and animal cost in this technology. We developed an approach to identify surface antigens rapidly in Streptococcus sanguinis, a common infective endocarditis causative species. We applied bioinformatics for antigen prediction and pooled antigens for immunization. Forty-seven surface-exposed proteins including 28 lipoproteins and 19 cell wall-anchored proteins were chosen based on computer algorithms and comparative genomic analyses. Eight proteins among these candidates and 2 other proteins were pooled together to immunize rabbits. The antiserum reacted strongly with each protein and with S. sanguinis whole cells. Affinity chromatography was used to purify the antibodies to 9 of the antigen pool components. Competitive ELISA and FACS results indicated that these 9 proteins were exposed on S. sanguinis cell surfaces. The purified antibodies had demonstrable opsonic activity. The results indicate that immunization with pooled proteins, in combination with affinity purification, and comprehensive immunological assays may facilitate cell surface antigen identification to combat infectious diseases.

  12. Pooled Protein Immunization for Identification of Cell Surface Antigens in Streptococcus sanguinis

    PubMed Central

    Ge, Xiuchun; Kitten, Todd; Munro, Cindy L.; Conrad, Daniel H.; Xu, Ping

    2010-01-01

    Background Available bacterial genomes provide opportunities for screening vaccines by reverse vaccinology. Efficient identification of surface antigens is required to reduce time and animal cost in this technology. We developed an approach to identify surface antigens rapidly in Streptococcus sanguinis, a common infective endocarditis causative species. Methods and Findings We applied bioinformatics for antigen prediction and pooled antigens for immunization. Forty-seven surface-exposed proteins including 28 lipoproteins and 19 cell wall-anchored proteins were chosen based on computer algorithms and comparative genomic analyses. Eight proteins among these candidates and 2 other proteins were pooled together to immunize rabbits. The antiserum reacted strongly with each protein and with S. sanguinis whole cells. Affinity chromatography was used to purify the antibodies to 9 of the antigen pool components. Competitive ELISA and FACS results indicated that these 9 proteins were exposed on S. sanguinis cell surfaces. The purified antibodies had demonstrable opsonic activity. Conclusions The results indicate that immunization with pooled proteins, in combination with affinity purification, and comprehensive immunological assays may facilitate cell surface antigen identification to combat infectious diseases. PMID:20668678

  13. Protein social behavior makes a stronger signal for partner identification than surface geometry

    PubMed Central

    Laine, Elodie

    2016-01-01

    ABSTRACT Cells are interactive living systems where protein movements, interactions and regulation are substantially free from centralized management. How protein physico‐chemical and geometrical properties determine who interacts with whom remains far from fully understood. We show that characterizing how a protein behaves with many potential interactors in a complete cross‐docking study leads to a sharp identification of its cellular/true/native partner(s). We define a sociability index, or S‐index, reflecting whether a protein likes or not to pair with other proteins. Formally, we propose a suitable normalization function that accounts for protein sociability and we combine it with a simple interface‐based (ranking) score to discriminate partners from non‐interactors. We show that sociability is an important factor and that the normalization makes it possible to reach a much higher discriminative power than shape complementarity docking scores. The social effect is also observed with more sophisticated docking algorithms. Docking conformations are evaluated using experimental binding sites. These latter approximate in the best possible way binding site predictions, which have reached high accuracy in recent years. This makes our analysis helpful for a global understanding of partner identification and for suggesting discriminating strategies. These results contradict previous findings claiming that the partner identification problem is solvable solely with geometrical docking. Proteins 2016; 85:137–154. © 2016 Wiley Periodicals, Inc. PMID:27802579

  14. A Protein Preparation Method for the High-throughput Identification of Proteins Interacting with a Nuclear Cofactor Using LC-MS/MS Analysis.

    PubMed

    Tsuchiya, Megumi; Karim, M Rezaul; Matsumoto, Taro; Ogawa, Hidesato; Taniguchi, Hiroaki

    2017-01-24

    Transcriptional coregulators are vital to the efficient transcriptional regulation of nuclear chromatin structure. Coregulators play a variety of roles in regulating transcription. These include the direct interaction with transcription factors, the covalent modification of histones and other proteins, and the occasional chromatin conformation alteration. Accordingly, establishing relatively quick methods for identifying proteins that interact within this network is crucial to enhancing our understanding of the underlying regulatory mechanisms. LC-MS/MS-mediated protein binding partner identification is a validated technique used to analyze protein-protein interactions. By immunoprecipitating a previously-identified member of a protein complex with an antibody (occasionally with an antibody for a tagged protein), it is possible to identify its unknown protein interactions via mass spectrometry analysis. Here, we present a method of protein preparation for the LC-MS/MS-mediated high-throughput identification of protein interactions involving nuclear cofactors and their binding partners. This method allows for a better understanding of the transcriptional regulatory mechanisms of the targeted nuclear factors.

  15. PROTEOMIC IDENTIFICATION OF CARBONYLATED PROTEINS AND THEIR OXIDATION SITES

    PubMed Central

    Madian, Ashraf G.; Regnier, Fred E.

    2011-01-01

    Excessive oxidative stress leaves a protein carbonylation fingerprint in biological systems. Carbonylation is an irreversible post-translational modification (PTM) that often leads to the loss of protein function and can be a component of multiple diseases. Protein carbonyl groups can be generated directly (by amino acid oxidation and the α-amidation pathway) or indirectly by forming adducts with lipid peroxidation products or glycation and advanced glycation end-products. Studies of oxidative stress are complicated by the low concentration of oxidation products and the wide array of routes by which proteins are carbonylated. The development of new selection and enrichment techniques coupled with advances in mass spectrometry is allowing identification of hundreds of new carbonylated protein products from a broad range of proteins located at many sites in biological systems. The focus of this review is on the use of proteomics tools and methods to identify oxidized proteins along with specific sites of oxidative damage and the consequences of protein oxidation. PMID:20521848

  16. Protein social behavior makes a stronger signal for partner identification than surface geometry.

    PubMed

    Laine, Elodie; Carbone, Alessandra

    2017-01-01

    Cells are interactive living systems where protein movements, interactions and regulation are substantially free from centralized management. How protein physico-chemical and geometrical properties determine who interacts with whom remains far from fully understood. We show that characterizing how a protein behaves with many potential interactors in a complete cross-docking study leads to a sharp identification of its cellular/true/native partner(s). We define a sociability index, or S-index, reflecting whether a protein likes or not to pair with other proteins. Formally, we propose a suitable normalization function that accounts for protein sociability and we combine it with a simple interface-based (ranking) score to discriminate partners from non-interactors. We show that sociability is an important factor and that the normalization makes it possible to reach a much higher discriminative power than shape complementarity docking scores. The social effect is also observed with more sophisticated docking algorithms. Docking conformations are evaluated using experimental binding sites. These latter approximate in the best possible way binding site predictions, which have reached high accuracy in recent years. This makes our analysis helpful for a global understanding of partner identification and for suggesting discriminating strategies. These results contradict previous findings claiming that the partner identification problem is solvable solely with geometrical docking. Proteins 2016; 85:137-154. © 2016 Wiley Periodicals, Inc. © 2016 The Authors Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc.

  17. Recombinant blood group proteins for use in antibody screening and identification tests.

    PubMed

    Seltsam, Axel; Blasczyk, Rainer

    2009-11-01

    The present review elucidates the potentials of recombinant blood group proteins (BGPs) for red blood cell (RBC) antibody detection and identification in pretransfusion testing and the achievements in this field so far. Many BGPs have been eukaryotically and prokaryotically expressed in sufficient quantity and quality for RBC antibody testing. Recombinant BGPs can be incorporated in soluble protein reagents or solid-phase assays such as ELISA, color-coded microsphere and protein microarray chip-based techniques. Because novel recombinant protein-based assays use single antigens, a positive reaction of a serum with the recombinant protein directly indicates the presence and specificity of the target antibody. Inversely, conventional RBC-based assays use panels of human RBCs carrying a huge number of blood group antigens at the same time and require negative reactions of samples with antigen-negative cells for indirect determination of antibody specificity. Because of their capacity for single-step, direct RBC antibody determination, recombinant protein-based assays may greatly facilitate and accelerate the identification of common and rare RBC antibodies.

  18. Identification of Abiotic Stress Protein Biomarkers by Proteomic Screening of Crop Cultivar Diversity

    PubMed Central

    Barkla, Bronwyn J.

    2016-01-01

    Modern day agriculture practice is narrowing the genetic diversity in our food supply. This may compromise the ability to obtain high yield under extreme climatic conditions, threatening food security for a rapidly growing world population. To identify genetic diversity, tolerance mechanisms of cultivars, landraces and wild relatives of major crops can be identified and ultimately exploited for yield improvement. Quantitative proteomics allows for the identification of proteins that may contribute to tolerance mechanisms by directly comparing protein abundance under stress conditions between genotypes differing in their stress responses. In this review, a summary is provided of the data accumulated from quantitative proteomic comparisons of crop genotypes/cultivars which present different stress tolerance responses when exposed to various abiotic stress conditions, including drought, salinity, high/low temperature, nutrient deficiency and UV-B irradiation. This field of research aims to identify molecular features that can be developed as biomarkers for crop improvement; however, without accurate phenotyping, careful experimental design, statistical robustness and appropriate biomarker validation and verification it will be challenging to deliver what is promised. PMID:28248236

  19. Identification of Abiotic Stress Protein Biomarkers by Proteomic Screening of Crop Cultivar Diversity.

    PubMed

    Barkla, Bronwyn J

    2016-09-08

    Modern day agriculture practice is narrowing the genetic diversity in our food supply. This may compromise the ability to obtain high yield under extreme climatic conditions, threatening food security for a rapidly growing world population. To identify genetic diversity, tolerance mechanisms of cultivars, landraces and wild relatives of major crops can be identified and ultimately exploited for yield improvement. Quantitative proteomics allows for the identification of proteins that may contribute to tolerance mechanisms by directly comparing protein abundance under stress conditions between genotypes differing in their stress responses. In this review, a summary is provided of the data accumulated from quantitative proteomic comparisons of crop genotypes/cultivars which present different stress tolerance responses when exposed to various abiotic stress conditions, including drought, salinity, high/low temperature, nutrient deficiency and UV-B irradiation. This field of research aims to identify molecular features that can be developed as biomarkers for crop improvement; however, without accurate phenotyping, careful experimental design, statistical robustness and appropriate biomarker validation and verification it will be challenging to deliver what is promised.

  20. High-throughput identification of proteins with AMPylation using self-assembled human protein (NAPPA) microarrays.

    PubMed

    Yu, Xiaobo; LaBaer, Joshua

    2015-05-01

    AMPylation (adenylylation) has been recognized as an important post-translational modification that is used by pathogens to regulate host cellular proteins and their associated signaling pathways. AMPylation has potential functions in various cellular processes, and it is widely conserved across both prokaryotes and eukaryotes. However, despite the identification of many AMPylators, relatively few candidate substrates of AMPylation are known. This is changing with the recent development of a robust and reliable method for identifying new substrates using protein microarrays, which can markedly expand the list of potential substrates. Here we describe procedures for detecting AMPylated and auto-AMPylated proteins in a sensitive, high-throughput and nonradioactive manner. The approach uses high-density protein microarrays fabricated using nucleic acid programmable protein array (NAPPA) technology, which enables the highly successful display of fresh recombinant human proteins in situ. The modification of target proteins is determined via copper-catalyzed azide-alkyne cycloaddition (CuAAC). The assay can be accomplished within 11 h.

  1. Hidden Markov models incorporating fuzzy measures and integrals for protein sequence identification and alignment.

    PubMed

    Bidargaddi, Niranjan P; Chetty, Madhu; Kamruzzaman, Joarder

    2008-06-01

    Profile hidden Markov models (HMMs) based on classical HMMs have been widely applied for protein sequence identification. The formulation of the forward and backward variables in profile HMMs relies on the statistical independence assumption of probability theory. We propose a fuzzy profile HMM to overcome the limitations of that assumption and to achieve an improved alignment for protein sequences belonging to a given family. The proposed model fuzzifies the forward and backward variables by incorporating Sugeno fuzzy measures and Choquet integrals, thus further extending the generalized HMM. Based on the fuzzified forward and backward variables, we propose a fuzzy Baum-Welch parameter estimation algorithm for profiles. The strong correlations and the sequence preference involved in protein structures make this fuzzy-architecture-based model a suitable candidate for building profiles of a given family, since fuzzy sets can handle uncertainties better than classical methods.
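
    For orientation, the classical forward variable that this abstract proposes to fuzzify can be sketched as follows. This is the standard forward recursion for a plain discrete HMM, not a profile HMM and not the authors' fuzzy variant (which, per the abstract, brings in Sugeno fuzzy measures and Choquet integrals); the two-state model parameters are hypothetical.

```python
def forward(obs, start, trans, emit):
    """Classical forward algorithm: returns P(obs) under a discrete HMM."""
    n_states = len(start)
    # alpha[i] = P(o_1..o_t, state_t = i), initialised for t = 1
    alpha = [start[i] * emit[i][obs[0]] for i in range(n_states)]
    for o in obs[1:]:
        # Sum over predecessor states, then weight by the emission probability
        alpha = [
            sum(alpha[i] * trans[i][j] for i in range(n_states)) * emit[j][o]
            for j in range(n_states)
        ]
    return sum(alpha)


# Hypothetical two-state model over a two-symbol alphabet (0 and 1).
start = [0.6, 0.4]
trans = [[0.7, 0.3], [0.4, 0.6]]
emit = [[0.9, 0.1], [0.2, 0.8]]
likelihood = forward([0, 1, 0], start, trans, emit)
print(likelihood)
```

    The sum over predecessor states inside the loop is the step that depends on the independence assumption mentioned in the abstract.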

  2. Algorithm improvement program nuclide identification algorithm scoring criteria and scoring application.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Enghauser, Michael

    2016-02-01

    The goal of the Domestic Nuclear Detection Office (DNDO) Algorithm Improvement Program (AIP) is to facilitate gamma-radiation detector nuclide identification algorithm development, improvement, and validation. Accordingly, scoring criteria have been developed to objectively assess the performance of nuclide identification algorithms. In addition, a Microsoft Excel spreadsheet application for automated nuclide identification scoring has been developed. This report provides an overview of the equations, nuclide weighting factors, nuclide equivalencies, and configuration weighting factors used by the application for scoring nuclide identification algorithm performance. Furthermore, this report presents a general overview of the nuclide identification algorithm scoring application including illustrative examples.

  3. Identification of ATM Protein Kinase Phosphorylation Sites by Mass Spectrometry.

    PubMed

    Graham, Mark E; Lavin, Martin F; Kozlov, Sergei V

    2017-01-01

    ATM (ataxia-telangiectasia mutated) protein kinase is a key regulator of cellular responses to DNA damage and oxidative stress. DNA damage triggers a complex cascade of signaling events leading to numerous posttranslational modifications on a multitude of proteins. Understanding the regulation of ATM kinase is therefore critical not only for understanding the human genetic disorder ataxia-telangiectasia and potential treatment strategies, but also for deciphering physiological responses of cells to stress. These responses play an important role in carcinogenesis, neurodegeneration, and aging. We focus here on the identification of DNA damage inducible ATM phosphorylation sites to understand the importance of autophosphorylation in the mechanism of ATM kinase activation. We demonstrate the utility of using immunoprecipitated ATM in a quantitative LC-MS/MS workflow with stable isotope dimethyl labeling of ATM peptides for identification of phosphorylation sites.

  4. Improved method for identification of low abundance proteins using 2D-gel electrophoresis, MALDI-TOF and TOF/TOF

    EPA Science Inventory

    Introduction: Differential protein expression studies have been routinely performed in our laboratory to determine the health effects of environmentally-important chemicals. In this abstract, improvements in the in-gel protein digestion, MALDI plate spotting and data acquisition...

  5. Identification of novel lysosomal matrix proteins by proteome analysis.

    PubMed

    Kollmann, Katrin; Mutenda, Kudzai E; Balleininger, Martina; Eckermann, Ellen; von Figura, Kurt; Schmidt, Bernhard; Lübke, Torben

    2005-10-01

    The lysosomal matrix is estimated to contain about 50 different proteins. Most of the matrix proteins are acid hydrolases that depend on mannose 6-phosphate receptors (MPR) for targeting to lysosomes. Here, we describe a comprehensive proteome analysis of MPR-binding proteins from mouse. Mouse embryonic fibroblasts defective in both MPR (MPR 46-/- and MPR 300-/-) are known to secrete the lysosomal matrix proteins. Secretions of these cells were affinity purified using an affinity matrix derivatized with MPR46 and MPR300. In the protein fraction bound to the affinity matrix and eluted with mannose 6-phosphate, 34 known lysosomal matrix proteins, 4 candidate proteins of the lysosomal matrix and 4 non-lysosomal contaminants were identified by mass spectrometry after separation by two-dimensional gel electrophoresis or by multidimensional protein identification technology. For 3 of the candidate proteins, mammalian ependymin-related protein-2 (MERP-2), retinoid-inducible serine carboxypeptidase (RISC) and the hypothetical 66.3-kDa protein we could verify that C-terminally tagged forms bound in an M6P-dependent manner to an MPR-affinity matrix and were internalized via MPR-mediated endocytosis. Hence these 3 proteins are likely to represent hitherto unrecognized lysosomal matrix proteins.

  6. ProteinInferencer: Confident protein identification and multiple experiment comparison for large scale proteomics projects.

    PubMed

    Zhang, Yaoyang; Xu, Tao; Shan, Bing; Hart, Jonathan; Aslanian, Aaron; Han, Xuemei; Zong, Nobel; Li, Haomin; Choi, Howard; Wang, Dong; Acharya, Lipi; Du, Lisa; Vogt, Peter K; Ping, Peipei; Yates, John R

    2015-11-03

    Shotgun proteomics generates valuable information from large-scale and target protein characterizations, including protein expression, protein quantification, protein post-translational modifications (PTMs), protein localization, and protein-protein interactions. Typically, peptides derived from proteolytic digestion, rather than intact proteins, are analyzed by mass spectrometers because peptides are more readily separated, ionized and fragmented. The amino acid sequences of peptides can be interpreted by matching the observed tandem mass spectra to theoretical spectra derived from a protein sequence database. Identified peptides serve as surrogates for their proteins and are often used to establish what proteins were present in the original mixture and to quantify protein abundance. Two major issues exist for assigning peptides to their originating protein. The first issue is maintaining a desired false discovery rate (FDR) when comparing or combining multiple large datasets generated by shotgun analysis and the second issue is properly assigning peptides to proteins when homologous proteins are present in the database. Herein we demonstrate a new computational tool, ProteinInferencer, which can be used for protein inference with both small- or large-scale data sets to produce a well-controlled protein FDR. In addition, ProteinInferencer introduces confidence scoring for individual proteins, which makes protein identifications evaluable. This article is part of a Special Issue entitled: Computational Proteomics. Copyright © 2015. Published by Elsevier B.V.
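
    The abstract does not say how ProteinInferencer estimates its FDR. A common approach in shotgun proteomics is the target-decoy strategy, sketched below under that assumption: matches against a decoy database are counted alongside target matches, and the ratio of decoys to targets above a score cutoff estimates the FDR. The function name, input layout, and example scores are hypothetical.

```python
def score_cutoff_for_fdr(matches, max_fdr=0.01):
    """Return the lowest score cutoff whose estimated FDR is <= max_fdr.

    matches: iterable of (score, is_decoy) pairs, higher score = better.
    FDR at a cutoff is estimated as decoys / targets among all matches
    scoring at or above that cutoff. Returns None if no cutoff qualifies.
    """
    ranked = sorted(matches, key=lambda m: m[0], reverse=True)
    targets = decoys = 0
    cutoff = None
    for score, is_decoy in ranked:
        if is_decoy:
            decoys += 1
        else:
            targets += 1
        # Keep extending the accepted list while the estimate stays in bounds.
        if targets > 0 and decoys / targets <= max_fdr:
            cutoff = score
    return cutoff


# Hypothetical identifications: (score, is_decoy)
example = [(10, False), (9, False), (8, True), (7, False), (6, False)]
strict_cutoff = score_cutoff_for_fdr(example, max_fdr=0.10)
loose_cutoff = score_cutoff_for_fdr(example, max_fdr=0.34)
```

    The same counting can be applied at the peptide or the protein level; controlling it at the protein level is the harder problem the abstract describes.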

  7. Odour discrimination and identification are improved in early blindness.

    PubMed

    Cuevas, Isabel; Plaza, Paula; Rombaux, Philippe; De Volder, Anne G; Renier, Laurent

    2009-12-01

    Previous studies showed that early blind humans develop superior abilities in the use of their remaining senses, hypothetically due to a functional reorganization of the deprived visual brain areas. While auditory and tactile functions have been investigated for long, little is known about the effects of early visual deprivation on olfactory processing. However, blind humans make an extensive use of olfactory information in their daily life. Here we investigated olfactory discrimination and identification abilities in early blind subjects and age-matched sighted controls. Three levels of cuing were used in the identification task, i.e., free-identification (no cue), categorization (semantic cues) and multiple choice (semantic and phonological cues). Early blind subjects significantly outperformed the controls in odour discrimination, free-identification and categorization. In addition, the larger group difference was observed in the free-identification as compared to the categorization and the multiple choice conditions. This indicated that a better access to the semantic information from odour perception accounted for part of the improved olfactory performances in odour identification in the blind. We concluded that early blind subjects have both improved perceptual abilities and a better access to the information stored in semantic memory than sighted subjects.

  8. Modified filter-aided sample preparation (FASP) method increases peptide and protein identifications for shotgun proteomics.

    PubMed

    Ni, Mao-Wei; Wang, Lu; Chen, Wei; Mou, Han-Zhou; Zhou, Jie; Zheng, Zhi-Guo

    2017-01-30

    Mass spectrometry (MS)-based protein identification depends mainly on protein extraction and digestion. Although sodium dodecyl sulfate (SDS) can preclude enzymatic digestion and interfere with MS analysis, it is still the most widely used surfactant in these steps. To overcome these disadvantages, an SDS-compatible proteomic technique for SDS removal prior to MS-based analyses was developed, namely filter-aided sample preparation (FASP). Herein, based on the effectiveness of sodium deoxycholate and a detergent removal spin column, we developed a modified FASP (mFASP) method and compared its overall performance, total number of peptides and proteins identified for shotgun proteomic experiments with that of the FASP method. Identification of 4570 ± 392 and 9139 ± 317 peptides and description of 862 ± 46 and 1377 ± 33 protein groups with two or more peptides from the ovarian cancer cell line A2780 was accomplished by FASP and mFASP methods, respectively. The mFASP method (21.2 ± 0.2%) had higher average peptide to protein coverage than FASP method (13.2 ± 0.5%). More hydrophobic peptides were identified by mFASP than by FASP, as indicated by the GRAVY score distribution. The reported method enables reliable and efficient identification of proteins and peptides in whole-cell extracts containing SDS. The new approach allows for higher throughput (the simultaneous identification of more proteins), a more comprehensive investigation of proteins, and potentially the discovery of new biomarkers. Copyright © 2016 John Wiley & Sons, Ltd.
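
    The GRAVY score used above to compare peptide hydrophobicity is the grand average of hydropathy: the mean Kyte-Doolittle hydropathy value over all residues of a sequence. A minimal sketch follows; the function name and the example peptides are assumptions, and the sketch only handles the 20 standard amino acids.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(peptide):
    """Grand average of hydropathy: mean Kyte-Doolittle value per residue."""
    if not peptide:
        raise ValueError("peptide must not be empty")
    return sum(KYTE_DOOLITTLE[residue] for residue in peptide) / len(peptide)


# Hypothetical peptides: one hydrophobic, one hydrophilic.
hydrophobic_score = gravy("IL")
hydrophilic_score = gravy("KR")
```

    Positive scores indicate hydrophobic peptides and negative scores hydrophilic ones, so a distribution shifted toward higher values corresponds to the "more hydrophobic peptides" reported for mFASP.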

  9. SITEHOUND-web: a server for ligand binding site identification in protein structures.

    PubMed

    Hernandez, Marylens; Ghersi, Dario; Sanchez, Roberto

    2009-07-01

    SITEHOUND-web (http://sitehound.sanchezlab.org) is a binding-site identification server powered by the SITEHOUND program. Given a protein structure in PDB format, SITEHOUND-web will identify regions of the protein characterized by favorable interactions with a probe molecule. These regions correspond to putative ligand binding sites. Depending on the probe used in the calculation, sites with preference for different ligands will be identified. Currently, a carbon probe for identification of binding sites for drug-like molecules, and a phosphate probe for phosphorylated ligands (ATP, phosphopeptides, etc.) have been implemented. SITEHOUND-web will display the results in HTML pages including an interactive 3D representation of the protein structure and the putative sites using the Jmol java applet. Various downloadable data files are also provided for offline data analysis.

  10. Identification of immunodominant proteins of the microalgae Prototheca by proteomic analysis

    PubMed Central

    Irrgang, A.; Weise, C.; Murugaiyan, J.; Roesler, U.

    2014-01-01

    Prototheca zopfii, associated with bovine mastitis and human protothecosis, exists as two genotypes, of which genotype 1 is considered non-infectious and genotype 2 infectious. The mechanism of infection has not yet been described. The present study aimed to identify genotype 2-specific immunodominant proteins. Prototheca proteins were separated using two-dimensional gel electrophoresis. Subsequent western blotting with rabbit hyperimmune serum revealed 28 protein spots. Matrix-assisted laser desorption ionization time-of-flight mass spectrometry analysis resulted in the identification of 15 proteins including malate dehydrogenase, elongation factor 1-alpha, heat shock protein 70, and 14-3-3 protein, which were previously described as immunogenic proteins of other eukaryotic pathogens. PMID:25755891

  11. Advances in identification and validation of protein targets of natural products without chemical modification.

    PubMed

    Chang, J; Kim, Y; Kwon, H J

    2016-05-04

    Covering: up to February 2016. Identification of the target proteins of natural products is pivotal to understanding the mechanisms of action to develop natural products for use as molecular probes and potential therapeutic drugs. Affinity chromatography of immobilized natural products has been conventionally used to identify target proteins, and has yielded good results. However, this method has limitations, in that labeling or tagging for immobilization and affinity purification often result in reduced or altered activity of the natural product. New strategies have recently been developed and applied to identify the target proteins of natural products and synthetic small molecules without chemical modification of the natural product. These direct and indirect methods for target identification of label-free natural products include drug affinity responsive target stability (DARTS), stability of proteins from rates of oxidation (SPROX), cellular thermal shift assay (CETSA), thermal proteome profiling (TPP), and bioinformatics-based analysis of connectivity. This review focuses on and reports case studies of the latest advances in target protein identification methods for label-free natural products. The integration of newly developed technologies will provide new insights and highlight the value of natural products for use as biological probes and new drug candidates.

  12. Targeted nanodiamonds for identification of subcellular protein assemblies in mammalian cells

    PubMed Central

    Lake, Michael P.; Bouchard, Louis-S.

    2017-01-01

    Transmission electron microscopy (TEM) can be used to successfully determine the structures of proteins. However, such studies are typically done ex situ after extraction of the protein from the cellular environment. Here we describe an application for nanodiamonds as targeted intensity contrast labels in biological TEM, using the nuclear pore complex (NPC) as a model macroassembly. We demonstrate that delivery of antibody-conjugated nanodiamonds to live mammalian cells using maltotriose-conjugated polypropylenimine dendrimers results in efficient localization of nanodiamonds to the intended cellular target. We further identify signatures of nanodiamonds under TEM that allow for unambiguous identification of individual nanodiamonds from a resin-embedded, OsO4-stained environment. This is the first demonstration of nanodiamonds as labels for nanoscale TEM-based identification of subcellular protein assemblies. These results, combined with the unique fluorescence properties and biocompatibility of nanodiamonds, represent an important step toward the use of nanodiamonds as markers for correlated optical/electron bioimaging. PMID:28636640

  13. Identification of proteins from tuberculin purified protein derivative (PPD) by LC-MS/MS.

    PubMed

    Borsuk, Sibele; Newcombe, Jane; Mendum, Tom A; Dellagostin, Odir A; McFadden, Johnjoe

    2009-11-01

    The tuberculin purified protein derivative (PPD) is a widely used diagnostic antigen for tuberculosis; however, it is poorly defined. Most mycobacterial proteins are extensively denatured by the procedure employed in its preparation, which explains previous difficulties in identifying constituents from PPD to characterize their behaviour in B- and T-cell reactions. Here we describe a proteomics-based characterization of PPD from several different sources by LC-MS/MS, which combines the solute separation power of HPLC with the detection power of a mass spectrometer. The technique is able to identify proteins from complex mixtures of peptide fragments. A total of 171 different proteins were identified among the four PPD samples (two bovine PPD and two avium PPD) from Brazil and the UK. The majority of the proteins were cytoplasmic (77.9%) and involved in intermediary metabolism and respiration (24.25%), but there was a preponderance of proteins involved in lipid metabolism. We identified a group of 21 proteins that are present in both bovine PPD preparations but were not detected in the avium PPD preparations. In addition, four proteins found in bovine PPD are absent in the Mycobacterium bovis BCG vaccine strain. This study provides a better understanding of the tuberculin PPD components, leading to the identification of additional antigens useful as reagents for specific diagnosis of tuberculosis.

  14. Leptospiral outer membrane protein microarray, a novel approach to identification of host ligand-binding proteins.

    PubMed

    Pinne, Marija; Matsunaga, James; Haake, David A

    2012-11-01

    Leptospirosis is a zoonosis with worldwide distribution caused by pathogenic spirochetes belonging to the genus Leptospira. The leptospiral life cycle involves transmission via freshwater and colonization of the renal tubules of their reservoir hosts. Infection requires adherence to cell surfaces and extracellular matrix components of host tissues. These host-pathogen interactions involve outer membrane proteins (OMPs) expressed on the bacterial surface. In this study, we developed a Leptospira interrogans serovar Copenhageni strain Fiocruz L1-130 OMP microarray containing all predicted lipoproteins and transmembrane OMPs. A total of 401 leptospiral genes or their fragments were transcribed and translated in vitro and printed on nitrocellulose-coated glass slides. We investigated the potential of this protein microarray to screen for interactions between leptospiral OMPs and fibronectin (Fn). This approach resulted in the identification of the recently described fibronectin-binding protein, LIC10258 (MFn8, Lsa66), and 14 novel Fn-binding proteins, denoted Microarray Fn-binding proteins (MFns). We confirmed Fn binding of purified recombinant LIC11612 (MFn1), LIC10714 (MFn2), LIC11051 (MFn6), LIC11436 (MFn7), LIC10258 (MFn8, Lsa66), and LIC10537 (MFn9) by far-Western blot assays. Moreover, we obtained specific antibodies to MFn1, MFn7, MFn8 (Lsa66), and MFn9 and demonstrated that MFn1, MFn7, and MFn9 are expressed and surface exposed under in vitro growth conditions. Further, we demonstrated that MFn1, MFn4 (LIC12631, Sph2), and MFn7 enable leptospires to bind fibronectin when expressed in the saprophyte, Leptospira biflexa. Protein microarrays are valuable tools for high-throughput identification of novel host ligand-binding proteins that have the potential to play key roles in the virulence mechanisms of pathogens.

  15. 34 CFR 200.39 - Responsibilities resulting from identification for school improvement.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 34 Education 1 2011-07-01 2011-07-01 false Responsibilities resulting from identification for school improvement. 200.39 Section 200.39 Education Regulations of the Offices of the Department of... Lea and School Improvement § 200.39 Responsibilities resulting from identification for school...

  16. 34 CFR 200.39 - Responsibilities resulting from identification for school improvement.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 34 Education 1 2014-07-01 2014-07-01 false Responsibilities resulting from identification for school improvement. 200.39 Section 200.39 Education Regulations of the Offices of the Department of... Lea and School Improvement § 200.39 Responsibilities resulting from identification for school...

  17. 34 CFR 200.39 - Responsibilities resulting from identification for school improvement.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 34 Education 1 2012-07-01 2012-07-01 false Responsibilities resulting from identification for school improvement. 200.39 Section 200.39 Education Regulations of the Offices of the Department of... Lea and School Improvement § 200.39 Responsibilities resulting from identification for school...

  18. 34 CFR 200.39 - Responsibilities resulting from identification for school improvement.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 34 Education 1 2010-07-01 2010-07-01 false Responsibilities resulting from identification for school improvement. 200.39 Section 200.39 Education Regulations of the Offices of the Department of... Lea and School Improvement § 200.39 Responsibilities resulting from identification for school...

  19. Improved chemical identification from sensor arrays using intelligent algorithms

    NASA Astrophysics Data System (ADS)

    Roppel, Thaddeus A.; Wilson, Denise M.

    2001-02-01

    Intelligent signal processing algorithms are shown to improve identification rates significantly in chemical sensor arrays. This paper focuses on the use of independently derived sensor status information to modify the processing of sensor array data by using a fast, easily-implemented "best-match" approach to filling in missing sensor data. Most fault conditions of interest (e.g., stuck high, stuck low, sudden jumps, excess noise, etc.) can be detected relatively simply by adjunct data processing, or by on-board circuitry. The objective then is to devise, implement, and test methods for using this information to improve the identification rates in the presence of faulted sensors. In one typical example studied, utilizing separately derived, a-priori knowledge about the health of the sensors in the array improved the chemical identification rate by an artificial neural network from below 10 percent correct to over 99 percent correct. While this study focuses experimentally on chemical sensor arrays, the results are readily extensible to other types of sensor platforms.
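
    The abstract gives no implementation details for the "best-match" approach, so the following is only one plausible reading: compare the healthy sensors of a reading against stored reference patterns, pick the closest pattern, and copy its values into the positions of the faulted sensors before classification. The function name, the squared-distance criterion, and the example numbers are assumptions.

```python
def fill_faulted_sensors(reading, healthy, references):
    """Replace faulted sensor values with those of the best-matching reference.

    reading:    list of sensor values (faulted entries may hold garbage)
    healthy:    list of booleans, True where the sensor is trusted
    references: list of known-good patterns, same length as `reading`
    """
    def distance(reference):
        # Squared Euclidean distance over the healthy sensors only.
        return sum(
            (ref_value - value) ** 2
            for ref_value, value, ok in zip(reference, reading, healthy)
            if ok
        )

    best = min(references, key=distance)
    return [
        value if ok else ref_value
        for value, ref_value, ok in zip(reading, best, healthy)
    ]


# Hypothetical three-sensor array with the middle sensor stuck high.
reference_patterns = [[1, 2, 3], [4, 5, 6]]
repaired = fill_faulted_sensors(
    reading=[4.1, 99, 5.8],
    healthy=[True, False, True],
    references=reference_patterns,
)
```

    The repaired vector can then be passed to the classifier unchanged, which is why independently derived fault flags are enough to apply this kind of correction.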

  20. Identification of proteins in the aqueous humor associated with cataract development using iTRAQ methodology.

    PubMed

    Xiang, Minhong; Zhang, Xingru; Li, Qingsong; Wang, Hanmin; Zhang, Zhenyong; Han, Zhumei; Ke, Meiqing; Chen, Xingxing

    2017-05-01

    Proteins in the aqueous humor (AH) are important in the induction of cataract development. The identification of cataract-associated proteins assists in identifying patients predisposed to the condition and in improving treatment efficacy. Proteomics analysis has previously been used for identifying protein markers associated with eye diseases; however, few studies have examined the proteomic alterations in cataract development due to high myopia, glaucoma and diabetes. The present study, using the isobaric tagging for relative and absolute protein quantification methodology, aimed to examine cataract-associated proteins in the AH from patients with high myopia, glaucoma or diabetes, and controls. The results revealed that 445 proteins were identified in the AH groups, compared with the control groups, and 146, 264 and 130 proteins were differentially expressed in the three groups of patients, respectively. In addition, 44 of these proteins were determined to be cataract-associated, and the alterations of five randomly selected proteins were confirmed using enzyme-linked immunosorbent assays. The biological functions of these 44 cataract-associated proteins were analyzed using Gene Ontology/pathways annotation, in addition to protein-protein interaction network analysis. The results aimed to expand current knowledge of the pathophysiologic characteristics of cataract development and provided a panel of candidates for biomarkers of the disease, which may assist in further diagnosis and the monitoring of cataract development.

  1. Algorithm Improvement Program Nuclide Identification Algorithm Scoring Criteria And Scoring Application - DNDO.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Enghauser, Michael

    2015-02-01

    The goal of the Domestic Nuclear Detection Office (DNDO) Algorithm Improvement Program (AIP) is to facilitate gamma-radiation detector nuclide identification algorithm development, improvement, and validation. Accordingly, scoring criteria have been developed to objectively assess the performance of nuclide identification algorithms. In addition, a Microsoft Excel spreadsheet application for automated nuclide identification scoring has been developed. This report provides an overview of the equations, nuclide weighting factors, nuclide equivalencies, and configuration weighting factors used by the application for scoring nuclide identification algorithm performance. Furthermore, this report presents a general overview of the nuclide identification algorithm scoring application including illustrative examples.

  2. A rapid identification system for metallothionein proteins using expert system

    PubMed Central

    Praveen, Bhoopathi; Vincent, Savariar; Murty, Upadhyayula Suryanarayana; Krishna, Amirapu Radha; Jamil, Kaiser

    2005-01-01

    Metallothioneins (MT) are low molecular weight proteins mostly rich in cysteine residues with high metal content. Generally, MT proteins are responsible for regulating the intracellular supply of biologically essential metal ions and they protect cells from the deleterious effects of non-essential polarizable transition and post-transition metal ions. Due to their biological importance, proper characterization of MT is necessary. Here we describe a computer program (ID3 algorithm, a part of Artificial Intelligence) developed using available data for the rapid identification of MT. Tissue samples contain several low molecular weight proteins with different physical, chemical and biological characteristics. The described software solution proposes to categorize MT proteins without aromatic amino acids and high metal content. The proposed solution can be expanded to other types of proteins with specific known characteristics. PMID:17597844
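
    ID3 builds a decision tree by repeatedly splitting on the attribute with the highest information gain. The sketch below shows only that selection step on a toy table; the attribute names (`cys_rich`, `aromatic`) and the four example rows are hypothetical and are not taken from the paper's data.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, labels, attribute):
    """Entropy reduction obtained by splitting `rows` on `attribute`."""
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[attribute], []).append(label)
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


# Hypothetical training table: two attributes, two classes.
rows = [
    {"cys_rich": True, "aromatic": False},
    {"cys_rich": True, "aromatic": True},
    {"cys_rich": False, "aromatic": False},
    {"cys_rich": False, "aromatic": True},
]
labels = ["MT", "MT", "other", "other"]

# ID3 would split first on the attribute with the largest gain.
gains = {name: information_gain(rows, labels, name) for name in rows[0]}
best_attribute = max(gains, key=gains.get)
```

    In this toy table `cys_rich` separates the classes perfectly, so it is chosen as the root split; a full ID3 implementation would recurse on each resulting subset.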

  3. Identification of Major Outer Surface Proteins of Streptococcus agalactiae

    PubMed Central

    Hughes, Martin J. G.; Moore, Joanne C.; Lane, Jonathan D.; Wilson, Rebecca; Pribul, Philippa K.; Younes, Zabin N.; Dobson, Richard J.; Everest, Paul; Reason, Andrew J.; Redfern, Joanne M.; Greer, Fiona M.; Paxton, Thanai; Panico, Maria; Morris, Howard R.; Feldman, Robert G.; Santangelo, Joseph D.

    2002-01-01

    To identify the major outer surface proteins of Streptococcus agalactiae (group B streptococcus), a proteomic analysis was undertaken. An extract of the outer surface proteins was separated by two-dimensional electrophoresis. The visualized spots were identified through a combination of peptide sequencing and reverse genetic methodologies. Of the 30 major spots identified as S. agalactiae specific, 27 have been identified. Six of these proteins, previously unidentified in S. agalactiae, were sequenced and cloned. These were ornithine carbamoyltransferase, phosphoglycerate kinase, nonphosphorylating glyceraldehyde-3-phosphate dehydrogenase, purine nucleoside phosphorylase, enolase, and glucose-6-phosphate isomerase. Using a gram-positive expression system, we have overexpressed two of these proteins in an in vitro system. These recombinant, purified proteins were used to raise antisera. The identification of these proteins as residing on the outer surface was confirmed by the ability of the antisera to react against whole, live bacteria. Further, in a neonatal-animal model system, we demonstrate that some of these sera are protective against lethal doses of bacteria. These studies demonstrate the successful application of proteomics as a technique for identifying vaccine candidates. PMID:11854208

  4. Mass spectrometry-based protein identification by integrating de novo sequencing with database searching.

    PubMed

    Wang, Penghao; Wilson, Susan R

    2013-01-01

    Mass spectrometry-based protein identification is a very challenging task. The main identification approaches include de novo sequencing and database searching. Both approaches have shortcomings, so an integrative approach has been developed. The integrative approach first infers partial peptide sequences, known as tags, directly from tandem spectra through de novo sequencing, and then puts these sequences into a database search to see if a close peptide match can be found. However, the current implementation of this integrative approach has several limitations. Firstly, simplistic de novo sequencing is applied and only very short sequence tags are used. Secondly, most integrative methods apply an algorithm similar to BLAST to search for exact sequence matches and do not accommodate sequence errors well. Thirdly, by applying these methods the integrated de novo sequencing makes a limited contribution to the scoring model, which is still largely based on database searching. We have developed a new integrative protein identification method which can integrate de novo sequencing more efficiently into database searching. Evaluated on large real datasets, our method outperforms popular identification methods.
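
    The tag-then-search idea described here can be sketched as an error-tolerant substring lookup: a de novo tag is slid along each database peptide and a hit is reported when the number of mismatching residues stays within a tolerance. This is a simplified illustration, not the authors' method; the function name, the mismatch-count criterion, and the example sequences are assumptions.

```python
def find_tag_hits(tag, peptides, max_mismatches=1):
    """Return (peptide, start, mismatches) for peptides containing the tag.

    For each peptide the best-aligned window (fewest mismatches, earliest
    position on ties) is reported if it has at most `max_mismatches`
    differing residues. Peptides shorter than the tag are skipped.
    """
    hits = []
    for peptide in peptides:
        candidates = []
        for start in range(len(peptide) - len(tag) + 1):
            window = peptide[start:start + len(tag)]
            mismatches = sum(a != b for a, b in zip(tag, window))
            candidates.append((mismatches, start))
        if candidates:
            mismatches, start = min(candidates)
            if mismatches <= max_mismatches:
                hits.append((peptide, start, mismatches))
    return hits


# Hypothetical database peptides and a four-residue tag.
database = ["AAPEPTIDE", "AAPEQTIDE", "GGGGGG", "PE"]
tolerant_hits = find_tag_hits("PEPT", database, max_mismatches=1)
exact_hits = find_tag_hits("PEPT", database, max_mismatches=0)
```

    Allowing one mismatch recovers the second peptide, which an exact-match search would miss; that is the kind of sequencing error the abstract says BLAST-like exact matching handles poorly.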

  5. Improving identification of traumatic brain injury after nonmilitary bomb blasts.

    PubMed

    Rutland-Brown, Wesley; Langlois, Jean A; Bazarian, Jeffrey J; Warden, Deborah

    2008-01-01

    To improve identification of traumatic brain injury (TBI) in survivors of nonmilitary bomb blasts during the acute care phase. The Centers for Disease Control and Prevention convened a meeting of experts in TBI, emergency medicine, and disaster response to review the recent literature and make recommendations. Seven key recommendations were proposed: (1) increase TBI awareness among medical professionals; (2) encourage use of standard definitions and consistent terminology; (3) improve screening methods for TBI in the acute care setting; (4) clarify the distinction between TBI and acute stress disorder; (5) encourage routine screening of hospitalized trauma patients for TBI; (6) improve identification of nonhospitalized TBI patients; and (7) integrate the appropriate level of TBI identification into all-hazards mass casualty preparedness. By adopting these recommendations, the United States could be better prepared to identify and respond to TBI following future bombing events.

  6. Discovering functional interdependence relationship in PPI networks for protein complex identification.

    PubMed

    Lam, Winnie W M; Chan, Keith C C

    2012-04-01

    Protein molecules interact with each other in protein complexes to perform many vital functions, and different computational techniques have been developed to identify protein complexes in protein-protein interaction (PPI) networks. These techniques are developed to search for subgraphs of high connectivity in PPI networks under the assumption that the proteins in a protein complex are highly interconnected. While these techniques have been shown to be quite effective, it is also possible that the matching rate between the protein complexes they discover and those previously determined experimentally is relatively low and that the "false-alarm" rate is relatively high. This is especially the case when the assumption that proteins in protein complexes are more highly interconnected does not hold. To increase the matching rate and reduce the false-alarm rate, we have developed a technique that can work effectively without having to make this assumption. The technique, called protein complex identification by discovering functional interdependence (PCIFI), searches for protein complexes in PPI networks by taking into consideration both the functional interdependence relationship between protein molecules and the topology of the network. PCIFI works in several steps. The first step is to construct a multiple-function protein network graph by labeling each vertex with one or more of the molecular functions it performs. The second step is to filter out protein interactions between protein pairs that are not functionally interdependent in the statistical sense. The third step is to make use of an information-theoretic measure to determine the strength of the functional interdependence between all remaining interacting protein pairs. The last step is to form protein complexes based on the measure of the strength of functional interdependence and the connectivity between proteins. For performance evaluation
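
    The abstract does not name the information-theoretic measure used in the third step. As a hedged illustration of what such a measure can look like, the sketch below computes the mutual information between the function labels found at the two ends of a set of interactions; the function name, the input layout, and the label names are all assumptions and are not taken from PCIFI.

```python
from collections import Counter
from math import log2


def mutual_information(label_pairs):
    """Mutual information (bits) between the two sides of labeled edges.

    label_pairs: list of (function_of_protein_a, function_of_protein_b),
                 one entry per interaction.
    """
    total = len(label_pairs)
    joint = Counter(label_pairs)
    left = Counter(a for a, _ in label_pairs)
    right = Counter(b for _, b in label_pairs)
    return sum(
        (count / total)
        * log2((count / total) / ((left[a] / total) * (right[b] / total)))
        for (a, b), count in joint.items()
    )


# Hypothetical edge lists. In the first, one side's function fully
# determines the other's; in the second, the two sides are independent.
dependent_edges = [("kinase", "substrate")] * 2 + [("transporter", "channel")] * 2
independent_edges = [("a", "x"), ("a", "y"), ("b", "x"), ("b", "y")]

dependent_score = mutual_information(dependent_edges)
independent_score = mutual_information(independent_edges)
```

    A score near zero suggests the function labels co-occur no more often than chance, which is the kind of interaction the second step would filter out.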

  7. Enhancing bioactive peptide release and identification using targeted enzymatic hydrolysis of milk proteins.

    PubMed

    Nongonierma, Alice B; FitzGerald, Richard J

    2018-06-01

    Milk proteins have been extensively studied for their ability to yield a range of bioactive peptides following enzymatic hydrolysis/digestion. However, many hurdles still exist regarding the widespread utilization of milk protein-derived bioactive peptides as health enhancing agents for humans. These mostly arise from the fact that most milk protein-derived bioactive peptides are not highly potent. In addition, they may be degraded during gastrointestinal digestion and/or have a low intestinal permeability. The targeted release of bioactive peptides during the enzymatic hydrolysis of milk proteins may allow the generation of particularly potent bioactive hydrolysates and peptides. Therefore, the development of milk protein hydrolysates capable of improving human health requires, in the first instance, optimized targeted release of specific bioactive peptides. The targeted hydrolysis of milk proteins has been aided by a range of in silico tools. These include peptide cutters and predictive modeling linking bioactivity to peptide structure [i.e., molecular docking, quantitative structure activity relationship (QSAR)], or hydrolysis parameters [design of experiments (DOE)]. Different targeted enzymatic release strategies employed during the generation of milk protein hydrolysates are reviewed herein and their limitations are outlined. In addition, specific examples are provided to demonstrate how in silico tools may help in the identification and discovery of potent milk protein-derived peptides. It is anticipated that the development of novel strategies employing a range of in silico tools may help in the generation of milk protein hydrolysates containing potent and bioavailable peptides, which in turn may be used to validate their health promoting effects in humans. Graphical abstract The targeted enzymatic hydrolysis of milk proteins may allow the generation of highly potent and bioavailable bioactive peptides.

  8. Free fatty acid particles in protein formulations, part 1: microspectroscopic identification.

    PubMed

    Cao, Xiaolin; Fesinmeyer, R Matthew; Pierini, Christopher J; Siska, Christine C; Litowski, Jennifer R; Brych, Stephen; Wen, Zai-Qing; Kleemann, Gerd R

    2015-02-01

    We report, for the first time, the identification of fatty acid particles in formulations containing the surfactant polysorbate 20. These fatty acid particles were observed in multiple mAb formulations during their expected shelf life under recommended storage conditions. The fatty acid particles were granular or sand-like in morphology and were several microns in size. They could be identified by distinct IR bands, with additional confirmation from energy-dispersive X-ray spectroscopy analysis. The particles were readily distinguishable from protein particles by these methods. In addition, particles containing a mixture of protein and fatty acids were also identified, suggesting that the particulation pathways for the two particle types may not be distinct. The techniques and observations described will be useful for the correct identification of proteinaceous versus nonproteinaceous particles in pharmaceutical products. © 2014 Wiley Periodicals, Inc. and the American Pharmacists Association.

  9. Advances in the Study of Aptamer-Protein Target Identification Using the Chromatographic Approach.

    PubMed

    Drabik, Anna; Ner-Kluza, Joanna; Mielczarek, Przemyslaw; Civit, Laia; Mayer, Günter; Silberring, Jerzy

    2018-06-01

    Ever since the development of the process known as the systematic evolution of ligands by exponential enrichment (SELEX), aptamers have been widely used in a variety of studies, including the exploration of new diagnostic tools and the discovery of new treatment methods. Aptamers' ability to bind to proteins with high affinity and specificity, often compared to that of antibodies, enables the search for potential cancer biomarkers and helps us understand the mechanisms of carcinogenesis. The blind spot of those investigations is usually the difficulty in the selective extraction of targets attached to the aptamer. There are many studies describing the cell SELEX for the prime choice of aptamers toward living cancer cells or even whole tumors in the animal models. However, a dilemma arises when a large number of proteins are being identified as potential targets, which is often the case. In this article, we present a new analytical approach designed to selectively target proteins bound to aptamers. During studies, we have focused on the unambiguous identification of the molecular targets of aptamers characterized by high specificity to the prostate cancer cells. We have compared four assay approaches using electrophoretic and chromatographic methods for "fishing out" aptamer protein targets followed by mass spectrometry identification. We have established a new methodology, based on the fluorescent-tagged oligonucleotides commonly used for flow-cytometry experiments or as optic aptasensors, that allowed the detection of specific aptamer-protein interactions by mass spectrometry. The use of atto488-labeled aptamers for the tracking of the formation of specific aptamer-target complexes provides the possibility of studying putative protein counterparts without needing to apply enrichment techniques. 
Significantly, changes in the hydrophobic properties of atto488-labeled aptamer-protein complexes facilitate their separation by reverse-phase chromatography combined with

  10. Identification of Small RNA-Protein Partners in Plant Symbiotic Bacteria.

    PubMed

    Robledo, Marta; Matia-González, Ana M; García-Tomsig, Natalia I; Jiménez-Zurdo, José I

    2018-01-01

    The identification of the protein partners of bacterial small noncoding RNAs (sRNAs) is essential to understand the mechanistic principles and functions of riboregulation in prokaryotic cells. Here, we describe an optimized affinity chromatography protocol that enables purification of in vivo formed sRNA-protein complexes in Sinorhizobium meliloti, a genetically tractable nitrogen-fixing plant symbiotic bacterium. The procedure requires the tagging of the desired sRNA with the MS2 aptamer, which is affinity-captured by the MS2-MBP protein conjugated to an amylose resin. As proof of principle, we show recovery of the RNA chaperone Hfq associated to the strictly Hfq-dependent AbcR2 trans-sRNA. This method can be applied for the investigation of sRNA-protein interactions on a broad range of genetically tractable α-proteobacteria.

  11. Identification of novel direct protein-protein interactions by irradiating living cells with femtosecond UV laser pulses.

    PubMed

    Itri, Francesco; Monti, Daria Maria; Chino, Marco; Vinciguerra, Roberto; Altucci, Carlo; Lombardi, Angela; Piccoli, Renata; Birolo, Leila; Arciello, Angela

    2017-10-07

    The identification of protein-protein interaction networks in living cells is becoming increasingly fundamental to elucidate main biological processes and to understand disease molecular bases on a system-wide level. We recently described a method (LUCK, Laser UV Cross-linKing) to cross-link interacting protein surfaces in living cells by UV laser irradiation. By using this innovative methodology, that does not require any protein modification or cell engineering, here we demonstrate that, upon UV laser irradiation of HeLa cells, a direct interaction between GAPDH and alpha-enolase was "frozen" by a cross-linking event. We validated the occurrence of this direct interaction by co-immunoprecipitation and Immuno-FRET analyses. This represents a proof of principle of the LUCK capability to reveal direct protein interactions in their physiological environment. Copyright © 2017 Elsevier Inc. All rights reserved.

  12. Compositions and methods for improved protein production

    DOEpatents

    Bodie, Elizabeth A [San Carlos, CA]; Kim, Steve [San Francisco, CA]

    2012-07-10

    The present invention relates to the identification of novel nucleic acid sequences, designated herein as 7p, 8k, 7E, 9G, 8Q and 203, in a host cell which effect protein production. The present invention also provides host cells having a mutation or deletion of part or all of the gene encoding 7p, 8k, 7E, 9G, 8Q and 203, which are presented in FIG. 1, and are SEQ ID NOS.: 1-6, respectively. The present invention also provides host cells further comprising a nucleic acid encoding a desired heterologous protein such as an enzyme.

  13. Compositions and methods for improved protein production

    DOEpatents

    Bodie, Elizabeth A.; Kim, Steve Sungjin

    2014-06-03

    The present invention relates to the identification of novel nucleic acid sequences, designated herein as 7p, 8k, 7E, 9G, 8Q and 203, in a host cell which effect protein production. The present invention also provides host cells having a mutation or deletion of part or all of the gene encoding 7p, 8k, 7E, 9G, 8Q and 203, which are presented in FIG. 1, and are SEQ ID NOS.: 1-6, respectively. The present invention also provides host cells further comprising a nucleic acid encoding a desired heterologous protein such as an enzyme.

  14. Protein markers for identification of Yersinia pestis and their variation related to culture

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wunschel, David S.; Engelmann, Heather E.; Victry, Kristin D.

    2013-12-11

    The detection of high consequence pathogens, such as Yersinia pestis, is well established in biodefense laboratories for bioterror situations. Laboratory protocols are well established using specified culture media and a growth temperature of 37 °C for expression of specific antigens. Direct detection of Y. pestis protein markers, without prior culture, depends on their expression. Unfortunately protein expression can be impacted by the culture medium which cannot be predicted ahead of time. Furthermore, higher biomass yields are obtained at the optimal growth temperature (i.e. 28 °C–30 °C) and therefore are more likely to be used for bulk production. Analysis of Y. pestis grown on several types of media at 30 °C showed that several protein markers were found to be differentially detected in different media. Analysis of the identified proteins against a comprehensive database provided an additional level of organism identification. Peptides corresponding to variable regions of some proteins could separate large groups of strains and aid in organism identification. This work illustrates the need to understand variability of protein expression for detection targets. The potential for relating expression changes of known proteins to specific media factors, even in nutrient rich and chemically complex culture medium, may provide the opportunity to draw forensic information from protein profiles.

  15. Identification of novel phosphatidic acid-binding proteins in the rat brain.

    PubMed

    Park, ChiHu; Kang, Du-Seock; Shin, Geon-Hoon; Seo, Jeongkon; Kim, Hyein; Suh, Pann-Ghill; Bae, Chang-Dae; Shin, Joo-Ho

    2015-05-19

    Phosphatidic acid (PA) is an abundant negatively-charged phospholipid and has long been considered to be an important signaling molecule in diverse cellular events. Thus, the identification of proteins that specifically interact with PA is of considerable interest to understand the regulatory roles of PA. Herein, lipid-affinity purification and mass spectrometric analysis reveal 43 proteins, 19 known and 24 novel, as PA-binding proteins. A lipid-protein overlay assay confirmed that GDI1, PACSIN1, and DPYSL2 interact not only with PA but also with other phospholipids. These results might be helpful for deciphering the functional effect of PA in the brain. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  16. Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology.

    PubMed

    Zhang, Jieru; Ju, Ying; Lu, Huijuan; Xuan, Ping; Zou, Quan

    2016-01-01

    Cancerlectins are cancer-related proteins that function as lectins. They have been identified through computational identification techniques, but these techniques have sometimes failed to identify proteins because of sequence diversity among the cancerlectins. Advanced machine learning identification methods, such as support vector machine and basic sequence features (n-gram), have also been used to identify cancerlectins. In this study, various protein fingerprint features and advanced classifiers, including ensemble learning techniques, were utilized to identify this group of proteins. We improved the prediction accuracy of the original feature extraction methods and classification algorithms by more than 10% on average. Our work provides a basis for the computational identification of cancerlectins and reveals the power of hybrid machine learning techniques in computational proteomics.
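
    The basic sequence features (n-gram) mentioned in this abstract can be illustrated with a short sketch. It is not the authors' feature extraction: the function name `ngram_features`, the 20-letter alphabet constant, and the choice of relative frequencies are assumptions made for illustration, and the protein fingerprint features and ensemble classifiers used in the study are not reproduced here.

```python
from collections import Counter
from itertools import product

# Assumption: the 20 standard amino acids, in alphabetical one-letter order.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def ngram_features(seq, n=2):
    """Return relative frequencies of all 20**n amino-acid n-grams in seq.

    The vector is ordered like itertools.product(AMINO_ACIDS, repeat=n).
    Windows containing non-standard letters are counted in the total
    but have no slot in the vector.
    """
    seq = seq.upper()
    total = len(seq) - n + 1  # number of overlapping windows
    keys = ["".join(p) for p in product(AMINO_ACIDS, repeat=n)]
    if total <= 0:
        return [0.0] * len(keys)
    counts = Counter(seq[i:i + n] for i in range(total))
    return [counts[k] / total for k in keys]


# Hypothetical toy sequence: windows are AC, CA, AC.
vec = ngram_features("ACAC", n=2)
```

    A fixed-length vector of this kind could then be passed to any standard classifier, such as a support vector machine.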

  17. Identification of Host Proteins Associated with Retroviral Vector Particles by Proteomic Analysis of Highly Purified Vector Preparations

    PubMed Central

    Segura, María Mercedes; Garnier, Alain; Di Falco, Marcos Rafael; Whissell, Gavin; Meneses-Acosta, Angélica; Arcand, Normand; Kamen, Amine

    2008-01-01

    The Moloney murine leukemia virus (MMLV) belongs to the Retroviridae family of enveloped viruses, which is known to acquire minute amounts of host cellular proteins both on the surface and inside the virion. Despite the extensive use of retroviral vectors in experimental and clinical applications, the repertoire of host proteins incorporated into MMLV vector particles remains unexplored. We report here the identification of host proteins from highly purified retroviral vector preparations obtained by rate-zonal ultracentrifugation. Viral proteins were fractionated by one-dimensional sodium dodecyl sulfate-polyacrylamide gel electrophoresis, in-gel tryptic digested, and subjected to liquid chromatography/tandem mass spectrometry analysis. Immunogold electron microscopy studies confirmed the presence of several host membrane proteins exposed at the vector surface. These studies led to the identification of 27 host proteins on MMLV vector particles derived from 293 HEK cells, including 5 proteins previously described as part of wild-type MMLV. Nineteen host proteins identified corresponded to intracellular proteins. A total of eight host membrane proteins were identified, including cell adhesion proteins integrin β1 (fibronectin receptor subunit beta) and HMFG-E8, tetraspanins CD81 and CD9, and late endosomal markers CD63 and Lamp-2. Identification of membrane proteins on the retroviral surface is particularly attractive, since they can serve as anchoring sites for the insertion of tags for targeting or purification purposes. The implications of our findings for retrovirus-mediated gene therapy are discussed. PMID:18032515

  18. Screening Fusion Tags for Improved Recombinant Protein Expression in E. coli with the Expresso® Solubility and Expression Screening System.

    PubMed

    Steinmetz, Eric J; Auldridge, Michele E

    2017-11-01

    The simplicity, speed, and low cost of bacterial culture make E. coli the system of choice for most initial trials of recombinant protein expression. However, many heterologous proteins are either poorly expressed in bacteria, or are produced as incorrectly folded, insoluble aggregates that lack the activity of the native protein. In many cases, fusion to a partner protein can allow for improved expression and/or solubility of a difficult target protein. Although several different fusion partners have gained favor, none are universally effective, and identifying the one that best improves soluble expression of a given target protein is an empirical process. This unit presents a strategy for parallel screening of fusion partners for enhanced expression or solubility. The Expresso® Solubility and Expression Screening System includes a panel of seven distinct fusion partners and utilizes an extremely simple cloning strategy to enable rapid screening and identification of the most effective fusion partner. © 2017 by John Wiley & Sons, Inc.

  19. Rapid Identification of Sequences for Orphan Enzymes to Power Accurate Protein Annotation

    PubMed Central

    Ojha, Sunil; Watson, Douglas S.; Bomar, Martha G.; Galande, Amit K.; Shearer, Alexander G.

    2013-01-01

    The power of genome sequencing depends on the ability to understand what those genes and their protein products actually do. The automated methods used to assign functions to putative proteins in newly sequenced organisms are limited by the size of our library of proteins with both known function and sequence. Unfortunately this library grows slowly, lagging well behind the rapid increase in novel protein sequences produced by modern genome sequencing methods. One potential source for rapidly expanding this functional library is the “back catalog” of enzymology – “orphan enzymes,” those enzymes that have been characterized and yet lack any associated sequence. There are hundreds of orphan enzymes in the Enzyme Commission (EC) database alone. In this study, we demonstrate how this orphan enzyme “back catalog” is a fertile source for rapidly advancing the state of protein annotation. Starting from three orphan enzyme samples, we applied mass-spectrometry based analysis and computational methods (including sequence similarity networks, sequence and structural alignments, and operon context analysis) to rapidly identify the specific sequence for each orphan while avoiding the most time- and labor-intensive aspects of typical sequence identifications. We then used these three new sequences to more accurately predict the catalytic function of 385 previously uncharacterized or misannotated proteins. We expect that this kind of rapid sequence identification could be efficiently applied on a larger scale to make enzymology’s “back catalog” another powerful tool to drive accurate genome annotation. PMID:24386392

  20. Rapid identification of sequences for orphan enzymes to power accurate protein annotation.

    PubMed

    Ramkissoon, Kevin R; Miller, Jennifer K; Ojha, Sunil; Watson, Douglas S; Bomar, Martha G; Galande, Amit K; Shearer, Alexander G

    2013-01-01

    The power of genome sequencing depends on the ability to understand what those genes and their protein products actually do. The automated methods used to assign functions to putative proteins in newly sequenced organisms are limited by the size of our library of proteins with both known function and sequence. Unfortunately this library grows slowly, lagging well behind the rapid increase in novel protein sequences produced by modern genome sequencing methods. One potential source for rapidly expanding this functional library is the "back catalog" of enzymology--"orphan enzymes," those enzymes that have been characterized and yet lack any associated sequence. There are hundreds of orphan enzymes in the Enzyme Commission (EC) database alone. In this study, we demonstrate how this orphan enzyme "back catalog" is a fertile source for rapidly advancing the state of protein annotation. Starting from three orphan enzyme samples, we applied mass-spectrometry based analysis and computational methods (including sequence similarity networks, sequence and structural alignments, and operon context analysis) to rapidly identify the specific sequence for each orphan while avoiding the most time- and labor-intensive aspects of typical sequence identifications. We then used these three new sequences to more accurately predict the catalytic function of 385 previously uncharacterized or misannotated proteins. We expect that this kind of rapid sequence identification could be efficiently applied on a larger scale to make enzymology's "back catalog" another powerful tool to drive accurate genome annotation.

  1. Improving substructure identification accuracy of shear structures using virtual control system

    NASA Astrophysics Data System (ADS)

    Zhang, Dongyu; Yang, Yang; Wang, Tingqiang; Li, Hui

    2018-02-01

    Substructure identification is a powerful tool to identify the parameters of a complex structure. Previously, the authors developed an inductive substructure identification method for shear structures. The identification error analysis showed that the identification accuracy of this method is significantly influenced by the magnitudes of two key structural responses near a certain frequency; if these responses are unfavorable, the method cannot provide accurate estimation results. In this paper, a novel method is proposed to improve the substructure identification accuracy by introducing a virtual control system (VCS) into the structure. A virtual control system is a self-balanced system, which consists of some control devices and a set of self-balanced forces. The self-balanced forces counterbalance the forces that the control devices apply on the structure. The control devices are combined with the structure to form a controlled structure used to replace the original structure in the substructure identification; and the self-balance forces are treated as known external excitations to the controlled structure. By optimally tuning the VCS’s parameters, the dynamic characteristics of the controlled structure can be changed such that the original structural responses become more favorable for the substructure identification and, thus, the identification accuracy is improved. A numerical example of 6-story shear structure is utilized to verify the effectiveness of the VCS based controlled substructure identification method. Finally, shake table tests are conducted on a 3-story structural model to verify the efficacy of the VCS to enhance the identification accuracy of the structural parameters.

  2. Functional Module Search in Protein Networks based on Semantic Similarity Improves the Analysis of Proteomics Data

    PubMed Central

    Boyanova, Desislava; Nilla, Santosh; Klau, Gunnar W.; Dandekar, Thomas; Müller, Tobias; Dittrich, Marcus

    2014-01-01

    The continuously evolving field of proteomics produces increasing amounts of data while improving the quality of protein identifications. Although quantitative measurements are becoming more popular, many proteomic studies are still based on non-quantitative methods for protein identification. These studies result in potentially large sets of identified proteins, where the biological interpretation of proteins can be challenging. Systems biology develops innovative network-based methods, which allow an integrated analysis of these data. Here we present a novel approach, which combines prior knowledge of protein-protein interactions (PPI) with proteomics data using functional similarity measurements of interacting proteins. This integrated network analysis exactly identifies network modules with a maximal consistent functional similarity reflecting biological processes of the investigated cells. We validated our approach on small (H9N2 virus-infected gastric cells) and large (blood constituents) proteomic data sets. Using this novel algorithm, we identified characteristic functional modules in virus-infected cells, comprising key signaling proteins (e.g. the stress-related kinase RAF1) and demonstrate that this method allows a module-based functional characterization of cell types. Analysis of a large proteome data set of blood constituents resulted in clear separation of blood cells according to their developmental origin. A detailed investigation of the T-cell proteome further illustrates how the algorithm partitions large networks into functional subnetworks each representing specific cellular functions. These results demonstrate that the integrated network approach not only allows a detailed analysis of proteome networks but also yields a functional decomposition of complex proteomic data sets and thereby provides deeper insights into the underlying cellular processes of the investigated system. PMID:24807868

  3. Method for identification of rigid domains and hinge residues in proteins based on exhaustive enumeration.

    PubMed

    Sim, Jaehyun; Sim, Jun; Park, Eunsung; Lee, Julian

    2015-06-01

    Many proteins undergo large-scale motions where relatively rigid domains move against each other. The identification of rigid domains, as well as the hinge residues important for their relative movements, is important for various applications including flexible docking simulations. In this work, we develop a method for protein rigid domain identification based on an exhaustive enumeration of maximal rigid domains, the rigid domains not fully contained within other domains. The computation is performed by mapping the problem to that of finding maximal cliques in a graph. A minimal set of rigid domains are then selected, which cover most of the protein with minimal overlap. In contrast to the results of existing methods that partition a protein into non-overlapping domains using approximate algorithms, the rigid domains obtained from exact enumeration naturally contain overlapping regions, which correspond to the hinges of the inter-domain bending motion. The performance of the algorithm is demonstrated on several proteins. © 2015 Wiley Periodicals, Inc.
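
    The mapping to maximal-clique enumeration described in this abstract can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not say how vertices and edges are defined, so the sketch assumes that each vertex is a residue and that an edge joins two residues whose mutual distance stays fixed across conformations. The function name `maximal_cliques` and the toy graph are hypothetical, and the enumeration uses the standard Bron-Kerbosch algorithm with pivoting.

```python
def maximal_cliques(adj):
    """Enumerate all maximal cliques of an undirected graph.

    adj maps each vertex to the set of its neighbours.
    Uses Bron-Kerbosch with pivoting; returns a list of frozensets.
    """
    cliques = []

    def expand(r, p, x):
        # r: current clique, p: candidates, x: already-processed vertices
        if not p and not x:
            cliques.append(frozenset(r))
            return
        # Pivot on the vertex with the most neighbours among the candidates.
        pivot = max(p | x, key=lambda v: len(adj[v] & p))
        for v in list(p - adj[pivot]):
            expand(r | {v}, p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(adj), set())
    return cliques


# Hypothetical toy "rigidity graph": residues 0-3 move as one rigid body,
# residues 3-5 as another, so residue 3 belongs to both.
adj = {
    0: {1, 2, 3},
    1: {0, 2, 3},
    2: {0, 1, 3},
    3: {0, 1, 2, 4, 5},
    4: {3, 5},
    5: {3, 4},
}
domains = maximal_cliques(adj)
# Residues shared by all enumerated domains (here, the single overlap).
hinges = frozenset.intersection(*domains)
```

    In this toy graph the two maximal cliques overlap at residue 3, which mirrors the abstract's point that exactly enumerated rigid domains naturally share their hinge regions.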

  4. Identification of Immunoreactive Leishmania infantum Protein Antigens to Asymptomatic Dog Sera through Combined Immunoproteomics and Bioinformatics Analysis

    PubMed Central

    Samiotaki, Martina; Panayotou, George; Karagouni, Evdokia

    2016-01-01

    Leishmania infantum is the etiologic agent of zoonotic visceral leishmaniasis (VL) in countries in the Mediterranean basin, where dogs are the domestic reservoirs and represent important elements in the transmission of the disease. Since the major focal areas of human VL exhibit a high prevalence of seropositive dogs, the control of canine VL could reduce the infection rate in humans. Efforts toward this have focused on the improvement of diagnostic tools, as well as on vaccine development. The identification of parasite antigens including suitable major histocompatibility complex (MHC) class I- and/or II-restricted epitopes is very important since disease protection is characterized by strong and long-lasting CD8+ T and CD4+ Th1 cell-dominated immunity. In the present study, total protein extract from late-log phase L. infantum promastigotes was analyzed by two-dimensional western blots and probed with sera from asymptomatic and symptomatic dogs. A total of 42 protein spots were found to differentially react with IgG from asymptomatic dogs, while 17 of these identified by Coomassie stain were extracted and analyzed. Of these, 21 proteins were identified by mass spectrometry; they were mainly involved in metabolism and stress responses. An in silico analysis predicted that the chaperonin HSP60, dihydrolipoamide dehydrogenase, enolase, cyclophilin 2, cyclophilin 40, and one hypothetical protein contain promiscuous MHCI and/or MHCII epitopes. Our results suggest that the combination of immunoproteomics and bioinformatics analyses is a promising method for the identification of novel candidate antigens for vaccine development or with potential use in the development of sensitive diagnostic tests. PMID:26906226

  5. Identification and Characterization of Arabidopsis Seed Coat Mucilage Proteins.

    PubMed

    Tsai, Allen Yi-Lun; Kunieda, Tadashi; Rogalski, Jason; Foster, Leonard J; Ellis, Brian E; Haughn, George W

    2017-02-01

    Plant cell wall proteins are important regulators of cell wall architecture and function. However, because cell wall proteins are difficult to extract and analyze, they are generally poorly understood. Here, we describe the identification and characterization of proteins integral to the Arabidopsis (Arabidopsis thaliana) seed coat mucilage, a specialized layer of the extracellular matrix composed of plant cell wall carbohydrates that is used as a model for cell wall research. The proteins identified in mucilage include those previously identified by genetic analysis, and several mucilage proteins are reduced in mucilage-deficient mutant seeds, suggesting that these proteins are genuinely associated with the mucilage. Arabidopsis mucilage has both nonadherent and adherent layers. Both layers have similar protein profiles except for proteins involved in lipid metabolism, which are present exclusively in the adherent mucilage. The most abundant mucilage proteins include a family of proteins named TESTA ABUNDANT1 (TBA1) to TBA3; a less abundant fourth homolog was named TBA-LIKE (TBAL). TBA and TBAL transcripts and promoter activities were detected in developing seed coats, and their expression requires seed coat differentiation regulators. TBA proteins are secreted to the mucilage pocket during differentiation. Although reverse genetics failed to identify a function for TBAs/TBAL, the TBA promoters are highly expressed and cell type specific and so should be very useful tools for targeting proteins to the seed coat epidermis. Altogether, these results highlight the mucilage proteome as a model for cell walls in general, as it shares similarities with other cell wall proteomes while also containing mucilage-specific features. © 2017 American Society of Plant Biologists. All Rights Reserved.

  6. Identification of Phosphorylated Proteins on a Global Scale.

    PubMed

    Iliuk, Anton

    2018-05-31

    Liquid chromatography (LC) coupled with tandem mass spectrometry (MS/MS) has enabled researchers to analyze complex biological samples with unprecedented depth. It facilitates the identification and quantification of modifications within thousands of proteins in a single large-scale proteomic experiment. Analysis of phosphorylation, one of the most common and important post-translational modifications, has particularly benefited from such progress in the field. Here, detailed protocols are provided for a few well-regarded, common sample preparation methods for an effective phosphoproteomic experiment. © 2018 by John Wiley & Sons, Inc.

  7. Peptide Array X-Linking (PAX): A New Peptide-Protein Identification Approach

    PubMed Central

    Okada, Hirokazu; Uezu, Akiyoshi; Soderblom, Erik J.; Moseley, M. Arthur; Gertler, Frank B.; Soderling, Scott H.

    2012-01-01

    Many protein interaction domains bind short peptides based on canonical sequence consensus motifs. Here we report the development of a peptide array-based proteomics tool to identify proteins directly interacting with ligand peptides from cell lysates. Array-formatted bait peptides containing an amino acid-derived cross-linker are photo-induced to crosslink with interacting proteins from lysates of interest. Indirect associations are removed by high stringency washes under denaturing conditions. Covalently trapped proteins are subsequently identified by LC-MS/MS and screened by cluster analysis and domain scanning. We apply this methodology to peptides with different proline-containing consensus sequences and show successful identifications from brain lysates of known and novel proteins containing polyproline motif-binding domains such as EH, EVH1, SH3, WW domains. These results suggest the capacity of arrayed peptide ligands to capture and subsequently identify proteins by mass spectrometry is relatively broad and robust. Additionally, the approach is rapid and applicable to cell or tissue fractions from any source, making the approach a flexible tool for initial protein-protein interaction discovery. PMID:22606326

  8. Identification of mycobacterial surface proteins released into subcellular compartments of infected macrophages.

    PubMed

    Beatty, W L; Russell, D G

    2000-12-01

    Considerable effort has focused on the identification of proteins secreted from Mycobacterium spp. that contribute to the development of protective immunity. Little is known, however, about the release of mycobacterial proteins from the bacterial phagosome and the potential role of these molecules in chronically infected macrophages. In the present study, the release of mycobacterial surface proteins from the bacterial phagosome into subcellular compartments of infected macrophages was analyzed. Mycobacterium bovis BCG was surface labeled with fluorescein-tagged succinimidyl ester, an amine-reactive probe. The fluorescein tag was then used as a marker for the release of bacterial proteins in infected macrophages. Fractionation studies revealed bacterial proteins within subcellular compartments distinct from mycobacteria and mycobacterial phagosomes. To identify these proteins, subcellular fractions free of bacteria were probed with mycobacterium-specific antibodies. The fibronectin attachment protein and proteins of the antigen 85-kDa complex were identified among the mycobacterial proteins released from the bacterial phagosome.

  9. Identification of Mycobacterial Surface Proteins Released into Subcellular Compartments of Infected Macrophages

    PubMed Central

    Beatty, Wandy L.; Russell, David G.

    2000-01-01

    Considerable effort has focused on the identification of proteins secreted from Mycobacterium spp. that contribute to the development of protective immunity. Little is known, however, about the release of mycobacterial proteins from the bacterial phagosome and the potential role of these molecules in chronically infected macrophages. In the present study, the release of mycobacterial surface proteins from the bacterial phagosome into subcellular compartments of infected macrophages was analyzed. Mycobacterium bovis BCG was surface labeled with fluorescein-tagged succinimidyl ester, an amine-reactive probe. The fluorescein tag was then used as a marker for the release of bacterial proteins in infected macrophages. Fractionation studies revealed bacterial proteins within subcellular compartments distinct from mycobacteria and mycobacterial phagosomes. To identify these proteins, subcellular fractions free of bacteria were probed with mycobacterium-specific antibodies. The fibronectin attachment protein and proteins of the antigen 85-kDa complex were identified among the mycobacterial proteins released from the bacterial phagosome. PMID:11083824

  10. Identification of host proteins, Spata3 and Dkk2, interacting with Toxoplasma gondii micronemal protein MIC3.

    PubMed

    Wang, Yifan; Fang, Rui; Yuan, Yuan; Pan, Ming; Hu, Min; Zhou, Yanqin; Shen, Bang; Zhao, Junlong

    2016-07-01

    As an obligate intracellular protozoan, Toxoplasma gondii is a successful pathogen infecting a variety of animals, including humans. As an adhesin involved in host invasion, the micronemal protein MIC3 plays important roles in host cell attachment, as well as in modulation of the host EGFR signaling cascade. However, the specific host proteins that interact with MIC3 are unknown, and the identification of such proteins will increase our understanding of how MIC3 exerts its functions. This study was designed to identify host proteins interacting with MIC3 by yeast two-hybrid screens. Using MIC3 as bait, a library expressing mouse proteins was screened, uncovering eight mouse proteins that showed positive interactions with MIC3. Two of these, spermatogenesis-associated protein 3 (Spata3) and dickkopf-related protein 2 (Dkk2), were further confirmed to interact with MIC3 by additional protein-protein interaction tests. The results also revealed that the tandem repeat EGF domains of MIC3 were critical in mediating the interactions with the identified host proteins. This is the first study to show that MIC3 interacts with host proteins that are involved in reproduction, growth, and development. The results will provide a clearer understanding of the functions of adhesion-associated micronemal proteins in T. gondii.

  11. Identification of proteins in renaissance paintings by proteomics.

    PubMed

    Tokarski, Caroline; Martin, Elisabeth; Rolando, Christian; Cren-Olivé, Cécile

    2006-03-01

    The presented work proposes a new methodology based on proteomics techniques to identify proteins in old art paintings. The main challenges of this work were (i) to find appropriate conditions for extracting proteins from the binding media without hydrolyzing the proteins into amino acids and (ii) to develop analytical methods adapted to the small sample quantity available. Starting from microsamples of painting models (paintings based on ovalbumin, the major egg white protein, and egg-based paintings), multiple extraction solutions (HCl, HCOOH, NH3, NaOH) and conditions (ultrasonic bath, mortar and pestle, grinding resin) were evaluated. The best results were obtained using a commercial kit including a synthetic resin, mortar and pestle to grind the sample in an aqueous solution acidified with 1% trifluoroacetic acid, with additional multiple steps of ultrasonic baths. The resulting supernatant was analyzed by MALDI-TOF in linear mode to verify the efficiency of the extraction solution. An enzymatic hydrolysis step was also performed for protein identification; the peptide mixture was analyzed by nanoLC/nanoESI/Q-q-TOF MS/MS with an adapted chromatographic run for the low sample quantity. Finally, the developed methodology was successfully applied to Renaissance art painting microsamples of approximately 10 µg from Benedetto Bonfigli's triptych, The Virgin and Child, St. John the Baptist, St. Sebastian (XVth century), and Niccolo di Pietro Gerini's painting, The Virgin and Child (XIVth century), identifying, for the first time and without ambiguity, the presence of whole egg proteins (egg yolk and egg white) in a painting binder.

  12. Domain selection combined with improved cloning strategy for high throughput expression of higher eukaryotic proteins

    PubMed Central

    Chen, Yunjia; Qiu, Shihong; Luan, Chi-Hao; Luo, Ming

    2007-01-01

    Background: Expression of higher eukaryotic genes as soluble, stable recombinant proteins is still a bottleneck step in biochemical and structural studies of novel proteins today. Correct identification of stable domains/fragments within the open reading frame (ORF), combined with proper cloning strategies, can greatly enhance the success rate when higher eukaryotic proteins are expressed as these domains/fragments. Furthermore, a HTP cloning pipeline incorporated with bioinformatics domain/fragment selection methods will be beneficial to studies of structure and function genomics/proteomics. Results: With bioinformatics tools, we developed a domain/domain boundary prediction (DDBP) method, which was trained by available experimental data. Combined with an improved cloning strategy, DDBP had been applied to 57 proteins from C. elegans. Expression and purification results showed there was a 10-fold increase in terms of obtaining purified proteins. Based on the DDBP method, the improved GATEWAY cloning strategy and a robotic platform, we constructed a high throughput (HTP) cloning pipeline, including PCR primer design, PCR, BP reaction, transformation, plating, colony picking and entry clones extraction, which have been successfully applied to 90 C. elegans genes, 88 Brucella genes, and 188 human genes. More than 97% of the targeted genes were obtained as entry clones. This pipeline has a modular design and can adopt different operations for a variety of cloning/expression strategies. Conclusion: The DDBP method and improved cloning strategy were satisfactory. The cloning pipeline, combined with our recombinant protein HTP expression pipeline and the crystal screening robots, constitutes a complete platform for structure genomics/proteomics. This platform will increase the success rate of purification and crystallization dramatically and promote the further advancement of structure genomics/proteomics. PMID:17663785

  13. Identification of Contractile Vacuole Proteins in Trypanosoma cruzi

    PubMed Central

    Park, Miyoung; Martins, Vicente P.; Atwood, James; Moles, Kristen; Collins, Dalis; Rohloff, Peter; Tarleton, Rick; Moreno, Silvia N. J.; Orlando, Ron; Docampo, Roberto

    2011-01-01

    Contractile vacuole complexes are critical components of cell volume regulation and have been shown to have other functional roles in several free-living protists. However, very little is known about the functions of the contractile vacuole complex of the parasite Trypanosoma cruzi, the etiologic agent of Chagas disease, other than a role in osmoregulation. Identification of the protein composition of these organelles is important for understanding their physiological roles. We applied a combined proteomic and bioinformatic approach to identify proteins localized to the contractile vacuole. Proteomic analysis of a T. cruzi fraction enriched for contractile vacuoles and analyzed by one-dimensional gel electrophoresis and LC-MS/MS resulted in the addition of 109 newly detected proteins to the group of expressed proteins of epimastigotes. We also identified different peptides that map to at least 39 members of the dispersed gene family 1 (DGF-1) providing evidence that many members of this family are simultaneously expressed in epimastigotes. Of the proteins present in the fraction we selected several homologues with known localizations in contractile vacuoles of other organisms and others that we expected to be present in these vacuoles on the basis of their potential roles. We determined the localization of each by expression as GFP-fusion proteins or with specific antibodies. Six of these putative proteins (Rab11, Rab32, AP180, ATPase subunit B, VAMP1, and phosphate transporter) predominantly localized to the vacuole bladder. TcSNARE2.1, TcSNARE2.2, and calmodulin localized to the spongiome. Calmodulin was also cytosolic. Our results demonstrate the utility of combining subcellular fractionation, proteomic analysis, and bioinformatic approaches for localization of organellar proteins that are difficult to detect with whole cell methodologies. The CV localization of the proteins investigated revealed potential novel roles of these organelles in phosphate metabolism

  14. Genome-Wide Identification of Arabidopsis Coiled-Coil Proteins and Establishment of the ARABI-COIL Database

    PubMed Central

    Rose, Annkatrin; Manikantan, Sankaraganesh; Schraegle, Shannon J.; Maloy, Michael A.; Stahlberg, Eric A.; Meier, Iris

    2004-01-01

    Increasing evidence demonstrates the importance of long coiled-coil proteins for the spatial organization of cellular processes. Although several protein classes with long coiled-coil domains have been studied in animals and yeast, our knowledge about plant long coiled-coil proteins is very limited. The repeat nature of the coiled-coil sequence motif often prevents the simple identification of homologs of animal coiled-coil proteins by generic sequence similarity searches. As a consequence, counterparts of many animal proteins with long coiled-coil domains, like lamins, golgins, or microtubule organization center components, have not been identified yet in plants. Here, all Arabidopsis proteins predicted to contain long stretches of coiled-coil domains were identified by applying the algorithm MultiCoil to a genome-wide screen. A searchable protein database, ARABI-COIL (http://www.coiled-coil.org/arabidopsis), was established that integrates information on number, size, and position of predicted coiled-coil domains with subcellular localization signals, transmembrane domains, and available functional annotations. ARABI-COIL serves as a tool to sort and browse Arabidopsis long coiled-coil proteins to facilitate the identification and selection of candidate proteins of potential interest for specific research areas. Using the database, candidate proteins were identified for Arabidopsis membrane-bound, nuclear, and organellar long coiled-coil proteins. PMID:15020757

  15. Identification of bovine sperm acrosomal proteins that interact with a 32-kDa acrosomal matrix protein.

    PubMed

    Nagdas, Subir K; Smith, Linda; Medina-Ortiz, Ilza; Hernandez-Encarnacion, Luisa; Raychoudhury, Samir

    2016-03-01

    Mammalian fertilization is accomplished by the interaction between sperm and egg. Previous studies from this laboratory have identified a stable acrosomal matrix assembly from the bovine sperm acrosome termed the outer acrosomal membrane-matrix complex (OMC). This stable matrix assembly exhibits precise binding activity for acrosin and N-acetylglucosaminidase. A highly purified OMC fraction comprises three major (54, 50, and 45 kDa) and several minor (38-19 kDa) polypeptides. The set of minor polypeptides (38-19 kDa) termed "OMCrpf polypeptides" is selectively solubilized by high-pH extraction (pH 10.5), while the three major polypeptides (55, 50, and 45 kDa) remain insoluble. Proteomic identification of the OMC32 polypeptide (32 kDa polypeptide isolated from high-pH soluble fraction of OMC) yielded two peptides that matched the NCBI database sequence of acrosin-binding protein. Anti-OMC32 recognized an antigenically related family of polypeptides (OMCrpf polypeptides) in the 38-19-kDa range with isoelectric points ranging between 4.0 and 5.1. Other than glycohydrolases, OMC32 may also be complexed to other acrosomal proteins. The present study was undertaken to identify and localize the OMC32 binding polypeptides and to elucidate the potential role of the acrosomal protein complex in sperm function. OMC32 affinity chromatography of a detergent-soluble fraction of bovine cauda sperm acrosome followed by mass spectrometry-based identification of bound proteins identified acrosin, lactadherin, SPACA3, and IZUMO1. Co-immunoprecipitation analysis also demonstrated the interaction of OMC32 with acrosin, lactadherin, SPACA3, and IZUMO1. Our immunofluorescence studies revealed the presence of SPACA3 and lactadherin over the apical segment, whereas IZUMO1 is localized over the equatorial segment of Triton X-100 permeabilized cauda sperm. Immunoblot analysis showed that a significant portion of SPACA3 was released after the lysophosphatidylcholine (LPC)-induced acrosome

  16. Identification of protein-interacting nucleotides in a RNA sequence using composition profile of tri-nucleotides.

    PubMed

    Panwar, Bharat; Raghava, Gajendra P S

    2015-04-01

    RNA-protein interactions play diverse roles in cells, so identifying the RNA-protein interface is essential for understanding their function. In the past, several methods have been developed for predicting RNA-interacting residues in proteins, but limited efforts have been made to identify protein-interacting nucleotides in RNAs. In order to discriminate protein-interacting and non-interacting nucleotides, we used various classifiers (NaiveBayes, NaiveBayesMultinomial, BayesNet, ComplementNaiveBayes, MultilayerPerceptron, J48, SMO, RandomForest, SMO and SVM(light)) for prediction model development using various features, and achieved a maximum of 83.92% sensitivity, 84.82% specificity, 84.62% accuracy and a Matthews correlation coefficient of 0.62 with SVM(light)-based models. We observed that certain tri-nucleotides like ACA, ACC, AGA, CAC, CCA, GAG, UGA, and UUU are preferred in protein interaction. All the models have been developed using a non-redundant dataset and are evaluated using a five-fold cross-validation technique. A web server called RNApin has been developed for the scientific community (http://crdd.osdd.net/raghava/rnapin/). Copyright © 2015 Elsevier Inc. All rights reserved.
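
    As an illustration of the quantities named in this abstract, the following is a minimal Python sketch of a tri-nucleotide composition profile and of the four reported evaluation metrics computed from a confusion matrix. The function names and the toy inputs are hypothetical; this is not the RNApin implementation, whose exact feature encoding is not described in the record.

```python
from itertools import product
from math import sqrt


def trinucleotide_composition(seq):
    """Fraction of each of the 64 tri-nucleotides over all overlapping windows.

    Hypothetical helper for illustration; windows containing characters
    other than A, C, G, U are skipped.
    """
    seq = seq.upper().replace("T", "U")
    counts = {"".join(p): 0 for p in product("ACGU", repeat=3)}
    total = 0
    for i in range(len(seq) - 2):
        kmer = seq[i:i + 3]
        if kmer in counts:
            counts[kmer] += 1
            total += 1
    return {k: (c / total if total else 0.0) for k, c in counts.items()}


def binary_metrics(tp, fp, tn, fn):
    """Sensitivity, specificity, accuracy and Matthews correlation coefficient."""
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "mcc": (tp * tn - fp * fn) / denom if denom else 0.0,
    }


if __name__ == "__main__":
    profile = trinucleotide_composition("ACACA")
    print(profile["ACA"], profile["CAC"])
    print(binary_metrics(tp=8, fp=2, tn=8, fn=2))
```

    The composition vector has a fixed length of 64 regardless of sequence length, which is what makes it usable as input to the standard classifiers listed above.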

  17. [Progress in the spectral library based protein identification strategy].

    PubMed

    Yu, Derui; Ma, Jie; Xie, Zengyan; Bai, Mingze; Zhu, Yunping; Shu, Kunxian

    2018-04-25

    With the rapid development of mass spectrometry (MS)-based proteomics, the volume of MS data has grown exponentially. Developing fast, accurate and reproducible methods to identify peptides and proteins is a great challenge. Spectral library searching has become a mature strategy for tandem mass spectra-based protein identification in proteomics. It searches the experimental spectra against a collection of confidently identified MS/MS spectra that have been observed previously, and fully utilizes the peak abundances in the spectrum, peaks from non-canonical fragment ions, and other features. This review provides an overview of the implementation of the spectral library search strategy, covers its two key steps, spectral library construction and spectral library searching, comprehensively, and discusses the progress and challenges of the library search strategy.
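
    The core matching step of a spectral library search can be sketched as follows. This is a minimal Python illustration, assuming spectra are given as lists of (m/z, intensity) pairs, a fixed 1.0 m/z bin width, and a plain cosine (normalized dot product) score. The function names, the library entries and the bin width are hypothetical, and real tools use more elaborate preprocessing and scoring than shown here.

```python
from math import sqrt


def bin_spectrum(peaks, bin_width=1.0):
    """Sum peak intensities into fixed-width m/z bins (sparse vector as dict)."""
    vec = {}
    for mz, intensity in peaks:
        b = int(mz / bin_width)
        vec[b] = vec.get(b, 0.0) + intensity
    return vec


def cosine(a, b):
    """Normalized dot product between two sparse binned spectra."""
    dot = sum(v * b.get(k, 0.0) for k, v in a.items())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search_library(query_peaks, library, bin_width=1.0):
    """Return (best_entry_name, score) for the library spectrum most similar to the query."""
    q = bin_spectrum(query_peaks, bin_width)
    best_name, best_score = None, -1.0
    for name, peaks in library.items():
        score = cosine(q, bin_spectrum(peaks, bin_width))
        if score > best_score:
            best_name, best_score = name, score
    return best_name, best_score


if __name__ == "__main__":
    # Toy library with made-up peak lists (hypothetical entries).
    library = {
        "PEPTIDEA": [(100.3, 10.0), (200.4, 5.0)],
        "PEPTIDEB": [(150.0, 10.0), (300.0, 5.0)],
    }
    print(search_library([(100.2, 10.0), (200.5, 5.0)], library))
```

    Because the score uses the observed intensities directly, it reflects the point made in the abstract that library searching exploits peak abundances rather than only the presence of predicted fragment ions.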

  18. YahO protein as a calibrant for top-down proteomic identification of Shiga toxin using MALDI-TOF-TOF-MS/MS and post-source decay

    USDA-ARS's Scientific Manuscript database

    Matrix-assisted laser desorption/ionization tandem time-of-flight (MALDI-TOF-TOF) mass spectrometry is increasingly utilized for rapid top-down proteomic identification of proteins. This identification may involve analysis of either a pure protein or a protein mixture. For analysis of a pure protein...

  19. Investigation and identification of functional post-translational modification sites associated with drug binding and protein-protein interactions.

    PubMed

    Su, Min-Gang; Weng, Julia Tzu-Ya; Hsu, Justin Bo-Kai; Huang, Kai-Yao; Chi, Yu-Hsiang; Lee, Tzong-Yi

    2017-12-21

    Protein post-translational modification (PTM) plays an essential role in various cellular processes that modulates the physical and chemical properties, folding, conformation, stability and activity of proteins, thereby modifying the functions of proteins. The improved throughput of mass spectrometry (MS) or MS/MS technology has not only brought about a surge in proteome-scale studies, but also contributed to a fruitful list of identified PTMs. However, with the increase in the number of identified PTMs, perhaps the more crucial question is what kind of biological mechanisms these PTMs are involved in. This is particularly important in light of the fact that most protein-based pharmaceuticals deliver their therapeutic effects through some form of PTM. Yet, our understanding is still limited with respect to the local effects and frequency of PTM sites near pharmaceutical binding sites and the interfaces of protein-protein interaction (PPI). Understanding PTM's function is critical to our ability to manipulate the biological mechanisms of protein. In this study, to understand the regulation of protein functions by PTMs, we mapped 25,835 PTM sites to proteins with available three-dimensional (3D) structural information in the Protein Data Bank (PDB), including 1785 modified PTM sites on the 3D structure. Based on the acquired structural PTM sites, we proposed to use five properties for the structural characterization of PTM substrate sites: the spatial composition of amino acids, residues and side-chain orientations surrounding the PTM substrate sites, as well as the secondary structure, division of acidity and alkaline residues, and solvent-accessible surface area. We further mapped the structural PTM sites to the structures of drug binding and PPI sites, identifying a total of 1917 PTM sites that may affect PPI and 3951 PTM sites associated with drug-target binding. 
    An integrated analytical platform (CruxPTM), with a variety of methods and online molecular docking

  20. Identification of Optimal Epitopes for Plasmodium falciparum Rapid Diagnostic Tests That Target Histidine-Rich Proteins 2 and 3

    PubMed Central

    Lee, Nelson; Gatton, Michelle L.; Pelecanos, Anita; Bubb, Martin; Gonzalez, Iveth; Bell, David; Cheng, Qin

    2012-01-01

    Rapid diagnostic tests (RDTs) represent important tools to diagnose malaria infection. To improve understanding of the variable performance of RDTs that detect the major target in Plasmodium falciparum, namely, histidine-rich protein 2 (HRP2), and to inform the design of better tests, we undertook detailed mapping of the epitopes recognized by eight HRP-specific monoclonal antibodies (MAbs). To investigate the geographic skewing of this polymorphic protein, we analyzed the distribution of these epitopes in parasites from geographically diverse areas. To identify an ideal amino acid motif for a MAb to target in HRP2 and in the related protein HRP3, we used a purpose-designed script to perform bioinformatic analysis of 448 distinct gene sequences from pfhrp2 and from 99 sequences from the closely related gene pfhrp3. The frequency and distribution of these motifs were also compared to the MAb epitopes. Heat stability testing of MAbs immobilized on nitrocellulose membranes was also performed. Results of these experiments enabled the identification of MAbs with the most desirable characteristics for inclusion in RDTs, including copy number and coverage of target epitopes, geographic skewing, heat stability, and match with the most abundant amino acid motifs identified. This study therefore informs the selection of MAbs to include in malaria RDTs as well as in the generation of improved MAbs that should improve the performance of HRP-detecting malaria RDTs. PMID:22259210

  1. Mass spectrometric identification of proteins in complex post-genomic projects. Soluble proteins of the metabolically versatile, denitrifying 'Aromatoleum' sp. strain EbN1.

    PubMed

    Hufnagel, Peter; Rabus, Ralf

    2006-01-01

    The rapidly developing proteomics technologies help to advance the global understanding of physiological and cellular processes. The lifestyle of a study organism determines the type and complexity of a given proteomic project. The complexity of this study is characterized by a broad collection of pathway-specific subproteomes, reflecting the metabolic versatility as well as the regulatory potential of the aromatic-degrading, denitrifying bacterium 'Aromatoleum' sp. strain EbN1. Differences in protein profiles were determined using a gel-based approach. Protein identification was based on a progressive application of MALDI-TOF-MS, MALDI-TOF-MS/MS and LC-ESI-MS/MS. This progression was result-driven and automated by software control. The identification rate was increased by the assembly of a project-specific list of background signals that was used for internal calibration of the MS spectra, and by the combination of two search engines using a dedicated MetaScoring algorithm. In total, intelligent bioinformatics could increase the identification yield from 53 to 70% of the analyzed 5,050 gel spots; a total of 556 different proteins were identified. MS identification was highly reproducible: most proteins were identified more than twice from parallel 2DE gels with an average sequence coverage of >50% and rather restrictive score thresholds (Mascot ≥95, ProFound ≥2.2, MetaScore ≥97). The MS technologies and bioinformatics tools that were implemented and integrated to handle this complex proteomic project are presented. In addition, we describe the basic principles and current developments of the applied technologies and provide an overview of the current state of microbial proteome research. Copyright (c) 2006 S. Karger AG, Basel.

  2. Improving membrane protein expression by optimizing integration efficiency

    PubMed Central

    2017-01-01

    The heterologous overexpression of integral membrane proteins in Escherichia coli often yields insufficient quantities of purifiable protein for applications of interest. The current study leverages a recently demonstrated link between co-translational membrane integration efficiency and protein expression levels to predict protein sequence modifications that improve expression. Membrane integration efficiencies, obtained using a coarse-grained simulation approach, robustly predicted effects on expression of the integral membrane protein TatC for a set of 140 sequence modifications, including loop-swap chimeras and single-residue mutations distributed throughout the protein sequence. Mutations that improve simulated integration efficiency were 4-fold enriched with respect to improved experimentally observed expression levels. Furthermore, the effects of double mutations on both simulated integration efficiency and experimentally observed expression levels were cumulative and largely independent, suggesting that multiple mutations can be introduced to yield higher levels of purifiable protein. This work provides a foundation for a general method for the rational overexpression of integral membrane proteins based on computationally simulated membrane integration efficiencies. PMID:28918393

  3. PPCM: Combing multiple classifiers to improve protein-protein interaction prediction

    DOE PAGES

    Yao, Jianzhuang; Guo, Hong; Yang, Xiaohan

    2015-08-01

    Determining protein-protein interaction (PPI) in biological systems is of considerable importance, and prediction of PPI has become a popular research area. Although different classifiers have been developed for PPI prediction, no single classifier seems to be able to predict PPI with high confidence. We postulated that by combining individual classifiers the accuracy of PPI prediction could be improved. We developed a method called protein-protein interaction prediction classifiers merger (PPCM), and this method combines output from two PPI prediction tools, GO2PPI and Phyloprof, using Random Forests algorithm. The performance of PPCM was tested by area under the curve (AUC) using an assembled Gold Standard database that contains both positive and negative PPI pairs. Our AUC test showed that PPCM significantly improved the PPI prediction accuracy over the corresponding individual classifiers. We found that additional classifiers incorporated into PPCM could lead to further improvement in the PPI prediction accuracy. Furthermore, cross species PPCM could achieve competitive and even better prediction accuracy compared to the single species PPCM. This study established a robust pipeline for PPI prediction by integrating multiple classifiers using Random Forests algorithm. Ultimately, this pipeline will be useful for predicting PPI in nonmodel species.
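
    The AUC evaluation mentioned in this abstract can be sketched in a few lines of Python. The sketch below computes the area under the ROC curve as the probability that a randomly chosen positive pair receives a higher score than a randomly chosen negative pair, with ties counted as one half. It covers only the evaluation step, not the Random Forests merging itself, and the function name and toy scores are hypothetical rather than taken from PPCM.

```python
def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison of positives and negatives.

    scores: classifier outputs, higher meaning more likely to interact.
    labels: 1 for a positive (interacting) pair, 0 for a negative pair.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # ties count as half a win
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    labels = [1, 1, 0, 0]
    print(auc([0.9, 0.8, 0.3, 0.1], labels))  # perfectly separated toy scores
    print(auc([0.9, 0.2, 0.8, 0.1], labels))  # one positive ranked below a negative
```

    Comparing this value for each individual classifier's scores against the value for the merged scores, on the same labeled pairs, is the kind of comparison the abstract describes.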

  4. Current algorithmic solutions for peptide-based proteomics data generation and identification.

    PubMed

    Hoopmann, Michael R; Moritz, Robert L

    2013-02-01

    Peptide-based proteomic data sets are ever increasing in size and complexity. These data sets provide computational challenges when attempting to quickly analyze spectra and obtain correct protein identifications. Database search and de novo algorithms must consider high-resolution MS/MS spectra and alternative fragmentation methods. Protein inference is a tricky problem when analyzing large data sets of degenerate peptide identifications. Combining multiple algorithms for improved peptide identification puts significant strain on computational systems when investigating large data sets. This review highlights some of the recent developments in peptide and protein identification algorithms for analyzing shotgun mass spectrometry data when encountering the aforementioned hurdles. Also explored are the roles that analytical pipelines, public spectral libraries, and cloud computing play in the evolution of peptide-based proteomics. Copyright © 2012 Elsevier Ltd. All rights reserved.

  5. Identification of divergent protein domains by combining HMM-HMM comparisons and co-occurrence detection.

    PubMed

    Ghouila, Amel; Florent, Isabelle; Guerfali, Fatma Zahra; Terrapon, Nicolas; Laouini, Dhafer; Yahia, Sadok Ben; Gascuel, Olivier; Bréhélin, Laurent

    2014-01-01

    Identification of protein domains is a key step for understanding protein function. Hidden Markov Models (HMMs) have proved to be a powerful tool for this task. The Pfam database notably provides a large collection of HMMs which are widely used for the annotation of proteins in sequenced organisms. This is done via sequence/HMM comparisons. However, this approach may lack sensitivity when searching for domains in divergent species. Recently, methods for HMM/HMM comparisons have been proposed and proved to be more sensitive than sequence/HMM approaches in certain cases. However, these approaches are usually not used for protein domain discovery at a genome scale, and the benefit that could be expected from their utilization for this problem has not been investigated. Using proteins of P. falciparum and L. major as examples, we investigate the extent to which HMM/HMM comparisons can identify new domain occurrences not already identified by sequence/HMM approaches. We show that although HMM/HMM comparisons are much more sensitive than sequence/HMM comparisons, they are not sufficiently accurate to be used as a standalone complement of sequence/HMM approaches at the genome scale. Hence, we propose to use domain co-occurrence--the general domain tendency to preferentially appear along with some favorite domains in the proteins--to improve the accuracy of the approach. We show that the combination of HMM/HMM comparisons and co-occurrence domain detection boosts protein annotations. At an estimated False Discovery Rate of 5%, it revealed 901 and 1098 new domains in Plasmodium and Leishmania proteins, respectively. Manual inspection of part of these predictions shows that it contains several domain families that were missing in the two organisms. All new domain occurrences have been integrated in the EuPathDomains database, along with the GO annotations that can be deduced.

  6. Identification of Divergent Protein Domains by Combining HMM-HMM Comparisons and Co-Occurrence Detection

    PubMed Central

    Ghouila, Amel; Florent, Isabelle; Guerfali, Fatma Zahra; Terrapon, Nicolas; Laouini, Dhafer; Yahia, Sadok Ben; Gascuel, Olivier; Bréhélin, Laurent

    2014-01-01

    Identification of protein domains is a key step for understanding protein function. Hidden Markov Models (HMMs) have proved to be a powerful tool for this task. The Pfam database notably provides a large collection of HMMs which are widely used for the annotation of proteins in sequenced organisms. This is done via sequence/HMM comparisons. However, this approach may lack sensitivity when searching for domains in divergent species. Recently, methods for HMM/HMM comparisons have been proposed and proved to be more sensitive than sequence/HMM approaches in certain cases. However, these approaches are usually not used for protein domain discovery at a genome scale, and the benefit that could be expected from their utilization for this problem has not been investigated. Using proteins of P. falciparum and L. major as examples, we investigate the extent to which HMM/HMM comparisons can identify new domain occurrences not already identified by sequence/HMM approaches. We show that although HMM/HMM comparisons are much more sensitive than sequence/HMM comparisons, they are not sufficiently accurate to be used as a standalone complement of sequence/HMM approaches at the genome scale. Hence, we propose to use domain co-occurrence — the general domain tendency to preferentially appear along with some favorite domains in the proteins — to improve the accuracy of the approach. We show that the combination of HMM/HMM comparisons and co-occurrence domain detection boosts protein annotations. At an estimated False Discovery Rate of 5%, it revealed 901 and 1098 new domains in Plasmodium and Leishmania proteins, respectively. Manual inspection of part of these predictions shows that it contains several domain families that were missing in the two organisms. All new domain occurrences have been integrated in the EuPathDomains database, along with the GO annotations that can be deduced. PMID:24901648

  7. iProphet: Multi-level Integrative Analysis of Shotgun Proteomic Data Improves Peptide and Protein Identification Rates and Error Estimates*

    PubMed Central

    Shteynberg, David; Deutsch, Eric W.; Lam, Henry; Eng, Jimmy K.; Sun, Zhi; Tasman, Natalie; Mendoza, Luis; Moritz, Robert L.; Aebersold, Ruedi; Nesvizhskii, Alexey I.

    2011-01-01

    The combination of tandem mass spectrometry and sequence database searching is the method of choice for the identification of peptides and the mapping of proteomes. Over the last several years, the volume of data generated in proteomic studies has increased dramatically, which challenges the computational approaches previously developed for these data. Furthermore, a multitude of search engines have been developed that identify different, overlapping subsets of the sample peptides from a particular set of tandem mass spectrometry spectra. We present iProphet, the new addition to the widely used open-source suite of proteomic data analysis tools Trans-Proteomics Pipeline. Applied in tandem with PeptideProphet, it provides more accurate representation of the multilevel nature of shotgun proteomic data. iProphet combines the evidence from multiple identifications of the same peptide sequences across different spectra, experiments, precursor ion charge states, and modified states. It also allows accurate and effective integration of the results from multiple database search engines applied to the same data. The use of iProphet in the Trans-Proteomics Pipeline increases the number of correctly identified peptides at a constant false discovery rate as compared with both PeptideProphet and another state-of-the-art tool Percolator. As the main outcome, iProphet permits the calculation of accurate posterior probabilities and false discovery rate estimates at the level of sequence identical peptide identifications, which in turn leads to more accurate probability estimates at the protein level. Fully integrated with the Trans-Proteomics Pipeline, it supports all commonly used MS instruments, search engines, and computer platforms. 
    The performance of iProphet is demonstrated on two publicly available data sets: data from a human whole cell lysate proteome profiling experiment representative of typical proteomic data sets, and from a set of Streptococcus pyogenes experiments
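
    The abstract above refers to false discovery rate estimates derived from posterior probabilities. One common way to relate the two, sketched here under stated assumptions, is to treat 1 - p as the chance that an accepted identification with probability p is wrong and to average that over everything accepted at a threshold. The function name, the threshold, and the probabilities below are hypothetical; this is an illustration of the general idea, not iProphet's implementation.

```python
def estimated_fdr(probabilities, threshold):
    """Model-based FDR estimate: mean error probability of accepted identifications."""
    accepted = [p for p in probabilities if p >= threshold]
    if not accepted:
        # Nothing accepted, so no false discoveries can be counted.
        return 0.0
    return sum(1.0 - p for p in accepted) / len(accepted)


# Hypothetical posterior probabilities for five peptide identifications.
probs = [0.99, 0.95, 0.90, 0.60, 0.20]

# Accept identifications with probability >= 0.9 (three of the five).
fdr = estimated_fdr(probs, 0.9)
print(f"estimated FDR at p >= 0.9: {fdr:.4f}")
```

    Raising the threshold accepts fewer identifications and lowers the estimate; lowering it does the reverse.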

  8. Identification of Differentially Abundant Proteins of Edwardsiella ictaluri during Iron Restriction

    PubMed Central

    Dumpala, Pradeep R.; Peterson, Brian C.; Lawrence, Mark L.; Karsi, Attila

    2015-01-01

    Edwardsiella ictaluri is a Gram-negative, facultatively anaerobic, intracellular bacterium that causes enteric septicemia in channel catfish. Iron is an essential inorganic nutrient of bacteria and is crucial for bacterial invasion. Reduced availability of iron in the host may cause significant stress for bacterial pathogens and is considered a signal that leads to significant alteration in virulence gene expression. However, the precise effect of iron restriction on E. ictaluri protein abundance is unknown. The purpose of this study was to identify differentially abundant proteins of E. ictaluri under in vitro iron-restricted conditions. We applied two-dimensional difference gel electrophoresis (2D-DIGE) for determining differentially abundant proteins and matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI TOF/TOF MS) for protein identification. Gene ontology and pathway-based functional modeling of differentially abundant proteins was also conducted. A total of 50 unique differentially abundant proteins with at least a 2-fold (p ≤ 0.05) difference in abundance due to iron restriction were detected. The numbers of up- and down-regulated proteins were 37 and 13, respectively. We noted that several proteins, including EsrB, LamB, MalM, MalE, FdaA, and TonB-dependent heme/hemoglobin receptor family proteins, responded to iron restriction in E. ictaluri. PMID:26168192

  9. [Identification of proteins interacting with the circadian clock protein PER1 in tumors using bacterial two-hybrid system technique].

    PubMed

    Zhang, Yu; Yao, Youlin; Jiang, Siyuan; Lu, Yilu; Liu, Yunqiang; Tao, Dachang; Zhang, Sizhong; Ma, Yongxin

    2015-04-01

    The aim was to identify protein-protein interaction partners of PER1 (period circadian protein homolog 1), a key component of the molecular oscillation system of the circadian rhythm, in tumors using the bacterial two-hybrid system technique. A human cervical carcinoma HeLa cell library was used. The recombinant bait plasmid pBT-PER1 and the pTRG cDNA plasmid library were cotransformed into the two-hybrid system reporter strain, which was cultured in a special selective medium. Target clones were screened. After the positive clones were isolated, the target clones were sequenced and analyzed. Fourteen protein-coding genes were identified, 4 of which were found to contain whole coding regions of genes, including optic atrophy 3 protein (OPA3), associated with mitochondrial dynamics, and Homo sapiens cutA divalent cation tolerance homolog of E. coli (CUTA), associated with copper metabolism. There were also proteins related to cellular events, proteins involved in biochemical reactions, and signal transduction-related proteins. Identification of proteins that potentially interact with PER1 in tumors may provide new insights into the functions of the circadian clock protein PER1 during tumorigenesis.

  10. MASCOT HTML and XML parser: an implementation of a novel object model for protein identification data.

    PubMed

    Yang, Chunguang G; Granite, Stephen J; Van Eyk, Jennifer E; Winslow, Raimond L

    2006-11-01

    Protein identification using MS is an important technique in proteomics as well as a major generator of proteomics data. We have designed the protein identification data object model (PDOM) and developed a parser based on this model to facilitate the analysis and storage of these data. The parser works with HTML or XML files saved or exported from MASCOT MS/MS ions search in peptide summary report or MASCOT PMF search in protein summary report. The program creates PDOM objects, eliminates redundancy in the input file, and has the capability to output any PDOM object to a relational database. This program facilitates additional analysis of MASCOT search results and aids the storage of protein identification information. The implementation is extensible and can serve as a template to develop parsers for other search engines. The parser can be used as a stand-alone application or can be driven by other Java programs. It is currently being used as the front end for a system that loads HTML and XML result files of MASCOT searches into a relational database. The source code is freely available at http://www.ccbm.jhu.edu and the program uses only free and open-source Java libraries.

  11. Detachable strong cation exchange monolith, integrated with capillary zone electrophoresis and coupled with pH gradient elution, produces improved sensitivity and numbers of peptide identifications during bottom-up analysis of complex proteomes.

    PubMed

    Zhang, Zhenbin; Yan, Xiaojing; Sun, Liangliang; Zhu, Guijie; Dovichi, Norman J

    2015-04-21

    A detachable sulfonate-silica hybrid strong cation-exchange monolith was synthesized in a fused silica capillary, and used for solid phase extraction with online pH gradient elution during capillary zone electrophoresis-tandem mass spectrometry (CZE-MS/MS) proteomic analysis. Tryptic digests were prepared in 50 mM formic acid and loaded onto the strong cation-exchange monolith. Fractions were eluted using a series of buffers with lower concentration but higher pH values than the 50 mM formic acid background electrolyte. This combination of elution and background electrolytes results in both sample stacking and formation of a dynamic pH junction and allows use of relatively large elution buffer volumes while maintaining reasonable peak efficiency and resolution. A series of five pH bumps were applied to elute E. coli tryptic peptides from the monolith, followed by analysis using CZE coupled to an LTQ-Orbitrap Velos mass spectrometer; 799 protein groups and 3381 peptides were identified from 50 ng of the digest in a 2.5 h analysis, which approaches the identification rate for this organism that was obtained with an Orbitrap Fusion. We attribute the improved numbers of peptide and protein identifications to the efficient fractionation by the online pH gradient elution, which decreased the complexity of the sample in each elution step and improved the signal intensity of low abundance peptides. We also performed a comparative analysis using a nanoACQUITY UltraPerformance LCH system. Similar numbers of protein and peptide identifications were produced by the two methods. Protein identifications showed significant overlap between the two methods, whereas peptide identifications were complementary.

  12. An Improved Algorithm of Congruent Matching Cells (CMC) Method for Firearm Evidence Identifications.

    PubMed

    Tong, Mingsi; Song, John; Chu, Wei

    2015-01-01

    The Congruent Matching Cells (CMC) method was invented at the National Institute of Standards and Technology (NIST) for firearm evidence identifications. The CMC method divides the measured image of a surface area, such as a breech face impression from a fired cartridge case, into small correlation cells and uses four identification parameters to identify correlated cell pairs originating from the same firearm. The CMC method was validated by identification tests using both 3D topography images and optical images captured from breech face impressions of 40 cartridge cases fired from a pistol with 10 consecutively manufactured slides. In this paper, we discuss the processing of the cell correlations and propose an improved algorithm of the CMC method which takes advantage of the cell correlations at a common initial phase angle and combines the forward and backward correlations to improve the identification capability. The improved algorithm is tested by 780 pairwise correlations using the same optical images and 3D topography images as the initial validation.
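
    As a quick check on the test size quoted above: comparing 40 cartridge cases pairwise, without repeating a pair, gives 40 x 39 / 2 = 780 comparisons, which matches the 780 pairwise correlations reported. A minimal sketch of that count follows; the case labels are hypothetical placeholders, not the study's data.

```python
from itertools import combinations
from math import comb

# Hypothetical labels standing in for the 40 cartridge cases.
cases = [f"case_{i:02d}" for i in range(1, 41)]

# Each unordered pair of distinct cases corresponds to one pairwise correlation.
pairs = list(combinations(cases, 2))

# The enumerated count agrees with the closed form "40 choose 2".
n_pairs = len(pairs)
print(n_pairs, comb(40, 2))
```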

  13. An Improved Algorithm of Congruent Matching Cells (CMC) Method for Firearm Evidence Identifications

    PubMed Central

    Tong, Mingsi; Song, John; Chu, Wei

    2015-01-01

    The Congruent Matching Cells (CMC) method was invented at the National Institute of Standards and Technology (NIST) for firearm evidence identifications. The CMC method divides the measured image of a surface area, such as a breech face impression from a fired cartridge case, into small correlation cells and uses four identification parameters to identify correlated cell pairs originating from the same firearm. The CMC method was validated by identification tests using both 3D topography images and optical images captured from breech face impressions of 40 cartridge cases fired from a pistol with 10 consecutively manufactured slides. In this paper, we discuss the processing of the cell correlations and propose an improved algorithm of the CMC method which takes advantage of the cell correlations at a common initial phase angle and combines the forward and backward correlations to improve the identification capability. The improved algorithm is tested by 780 pairwise correlations using the same optical images and 3D topography images as the initial validation. PMID:26958441

  14. The cassava (Manihot esculenta Crantz) root proteome: protein identification and differential expression.

    PubMed

    Sheffield, Jeanne; Taylor, Nigel; Fauquet, Claude; Chen, Sixue

    2006-03-01

    Using high-resolution 2-DE, we resolved proteins extracted from fibrous and tuberous root tissues of 3-month-old cassava plants. Gel image analysis revealed an average of 1467 electrophoretically resolved spots on the fibrous gels and 1595 spots on the tuberous gels in pH 3-10 range. Protein spots from both sets of gels were digested with trypsin. The digests were subjected to nanoelectrospray quadrupole TOF tandem mass analysis. Currently, we have obtained 299 protein identifications for 292 gel spots corresponding to 237 proteins. The proteins span various functional categories from energy, primary and secondary metabolism, disease and defense, destination and storage, transport, signal transduction, protein synthesis, cell structure, and transcription to cell growth and division. Gel image analysis has shown unique, as well as up- and down-regulated proteins, present in the tuberous and the fibrous tissues. Quantitative and qualitative analysis of the cassava root proteome is an important step towards further characterization of differentially expressed proteins and the elucidation of the mechanisms underlying the development and biological functions of the two types of roots.

  15. Identification of immunodominant proteins from Mannheimia haemolytica and Histophilus somni by an immunoproteomic approach.

    PubMed

    Alvarez, Angel H; Gutiérrez-Ortega, Abel; Hernández-Gutiérrez, Rodolfo

    2015-10-01

    Mannheimia haemolytica and Histophilus somni are frequently isolated from diseased cattle with bovine respiratory disease (BRD). They compromise animal lung function and the immune responses generated are not sufficient to limit infection. Identification of specific immunogenic antigens for vaccine development represents a great challenge. Immunogenic proteins were identified by immunoproteomic approach with sera from cattle immunized with a commercial cellular vaccine of M. haemolytica and H. somni. Proteins of M. haemolytica were identified as solute ABC transporter, iron-binding protein, and hypothetical protein of capsular biosynthesis. Histophilus somni proteins correspond to porin, amino acid ABC transporter, hypothetical outer membrane protein, cysteine synthase, and outer membrane protein P6. Although these antigens share strong similarities with other proteins from animal pathogens, the ABC system proteins have been associated with virulence and these proteins could be considered as potential vaccine candidates for BRD.

  16. Identification of immunodominant proteins from Mannheimia haemolytica and Histophilus somni by an immunoproteomic approach

    PubMed Central

    Alvarez, Angel H.; Gutiérrez-Ortega, Abel; Hernández-Gutiérrez, Rodolfo

    2015-01-01

    Mannheimia haemolytica and Histophilus somni are frequently isolated from diseased cattle with bovine respiratory disease (BRD). They compromise animal lung function and the immune responses generated are not sufficient to limit infection. Identification of specific immunogenic antigens for vaccine development represents a great challenge. Immunogenic proteins were identified by immunoproteomic approach with sera from cattle immunized with a commercial cellular vaccine of M. haemolytica and H. somni. Proteins of M. haemolytica were identified as solute ABC transporter, iron-binding protein, and hypothetical protein of capsular biosynthesis. Histophilus somni proteins correspond to porin, amino acid ABC transporter, hypothetical outer membrane protein, cysteine synthase, and outer membrane protein P6. Although these antigens share strong similarities with other proteins from animal pathogens, the ABC system proteins have been associated with virulence and these proteins could be considered as potential vaccine candidates for BRD. PMID:26424916

  17. Multidimensional protein identification technology (MudPIT): technical overview of a profiling method optimized for the comprehensive proteomic investigation of normal and diseased heart tissue.

    PubMed

    Kislinger, Thomas; Gramolini, Anthony O; MacLennan, David H; Emili, Andrew

    2005-08-01

    An optimized analytical expression profiling strategy based on gel-free multidimensional protein identification technology (MudPIT) is reported for the systematic investigation of biochemical (mal)-adaptations associated with healthy and diseased heart tissue. Enhanced shotgun proteomic detection coverage and improved biological inference is achieved by pre-fractionation of excised mouse cardiac muscle into subcellular components, with each organellar fraction investigated exhaustively using multiple repeat MudPIT analyses. Functional-enrichment, high-confidence identification, and relative quantification of hundreds of organelle- and tissue-specific proteins are achieved readily, including detection of low abundance transcriptional regulators, signaling factors, and proteins linked to cardiac disease. Important technical issues relating to data validation, including minimization of artifacts stemming from biased under-sampling and spurious false discovery, together with suggestions for further fine-tuning of sample preparation, are discussed. A framework for follow-up bioinformatic examination, pattern recognition, and data mining is also presented in the context of a stringent application of MudPIT for probing fundamental aspects of heart muscle physiology as well as the discovery of perturbations associated with heart failure.

  18. Mass spectrometry-based protein identification with accurate statistical significance assignment.

    PubMed

    Alves, Gelio; Yu, Yi-Kuo

    2015-03-01

    Assigning statistical significance accurately has become increasingly important as metadata of many types, often assembled in hierarchies, are constructed and combined for further biological analyses. Statistical inaccuracy of metadata at any level may propagate to downstream analyses, undermining the validity of scientific conclusions thus drawn. From the perspective of mass spectrometry-based proteomics, even though accurate statistics for peptide identification can now be achieved, accurate protein level statistics remain challenging. We have constructed a protein ID method that combines peptide evidences of a candidate protein based on a rigorous formula derived earlier; in this formula the database P-value of every peptide is weighted, prior to the final combination, according to the number of proteins it maps to. We have also shown that this protein ID method provides accurate protein level E-value, eliminating the need of using empirical post-processing methods for type-I error control. Using a known protein mixture, we find that this protein ID method, when combined with the Sorić formula, yields accurate values for the proportion of false discoveries. In terms of retrieval efficacy, the results from our method are comparable with other methods tested. The source code, implemented in C++ on a linux system, is available for download at ftp://ftp.ncbi.nlm.nih.gov/pub/qmbp/qmbp_ms/RAId/RAId_Linux_64Bit. Published by Oxford University Press 2014. This work is written by US Government employees and is in the public domain in the US.
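
    The abstract does not spell out how E-values are turned into a proportion of false discoveries. A rough sketch of the underlying idea, assuming only the standard meaning of an E-value (the expected number of false hits scoring at least as well as the cutoff), is to divide the cutoff by the number of identifications that pass it. The function name and the numbers below are hypothetical, and this should not be read as the authors' formula or implementation.

```python
def estimated_pfd(e_value_cutoff, num_identified):
    """Rough proportion of false discoveries among hits with E-value <= cutoff.

    Assumes the expected number of false hits at the cutoff is about the
    cutoff itself, which is the usual interpretation of an E-value.
    """
    if num_identified == 0:
        # No identifications, so no false discoveries to report.
        return 0.0
    # A proportion cannot exceed 1, so cap the ratio.
    return min(1.0, e_value_cutoff / num_identified)


# Hypothetical example: 50 proteins reported with E-value <= 1.0.
pfd = estimated_pfd(1.0, 50)
print(f"estimated proportion of false discoveries: {pfd:.2f}")
```

    The estimate is only as good as the E-values themselves, which is why the abstract stresses accurate protein-level statistics.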

  19. Identification of new intrinsic proteins in Arabidopsis plasma membrane proteome.

    PubMed

    Marmagne, Anne; Rouet, Marie-Aude; Ferro, Myriam; Rolland, Norbert; Alcon, Carine; Joyard, Jacques; Garin, Jérome; Barbier-Brygoo, Hélène; Ephritikhine, Geneviève

    2004-07-01

    Identification and characterization of anion channel genes in plants represent a goal for a better understanding of their central role in cell signaling, osmoregulation, nutrition, and metabolism. Though channel activities have been well characterized in plasma membrane by electrophysiology, the corresponding molecular entities are little documented. Indeed, the hydrophobic protein equipment of plant plasma membrane still remains largely unknown, though several proteomic approaches have been reported. To identify new putative transport systems, we developed a new proteomic strategy based on mass spectrometry analyses of a plasma membrane fraction enriched in hydrophobic proteins. We produced from Arabidopsis cell suspensions a highly purified plasma membrane fraction and characterized it in detail by immunological and enzymatic tests. Using complementary methods for the extraction of hydrophobic proteins and mass spectrometry analyses on mono-dimensional gels, about 100 proteins have been identified, 95% of which had never been found in previous proteomic studies. The inventory of the plasma membrane proteome generated by this approach contains numerous plasma membrane integral proteins, one-third displaying at least four transmembrane segments. The plasma membrane localization was confirmed for several proteins, therefore validating such proteomic strategy. An in silico analysis shows a correlation between the putative functions of the identified proteins and the expected roles for plasma membrane in transport, signaling, cellular traffic, and metabolism. This analysis also reveals 10 proteins that display structural properties compatible with transport functions and will constitute interesting targets for further functional studies.

  20. Improving Pharmaceutical Protein Production in Oryza sativa

    PubMed Central

    Kuo, Yu-Chieh; Tan, Chia-Chun; Ku, Jung-Ting; Hsu, Wei-Cho; Su, Sung-Chieh; Lu, Chung-An; Huang, Li-Fen

    2013-01-01

    Application of plant expression systems in the production of recombinant proteins has several advantages, such as low maintenance cost, absence of human pathogens, and possession of complex post-translational glycosylation capabilities. Plants have been successfully used to produce recombinant cytokines, vaccines, antibodies, and other proteins, and rice (Oryza sativa) is a potential recombinant protein expression system. After successful transformation, transgenic rice cells can be either regenerated into whole plants or grown as cell cultures that can be upscaled into bioreactors. This review summarizes recent advances in the production of different recombinant proteins in rice and describes their production methods as well as methods to improve protein yield and quality. Glycosylation and its impact on plant development and protein production are discussed, and several methods of improving yield and quality that have not been incorporated in rice expression systems are also proposed. Finally, different bioreactor options are explored and their advantages are analyzed. PMID:23615467

  1. Improving homology modeling of G-protein coupled receptors through multiple-template derived conserved inter-residue interactions

    NASA Astrophysics Data System (ADS)

    Chaudhari, Rajan; Heim, Andrew J.; Li, Zhijun

    2015-05-01

    As evidenced by the three rounds of G-protein coupled receptors (GPCR) Dock competitions, improving homology modeling methods of helical transmembrane proteins including the GPCRs, based on templates of low sequence identity, remains an eminent challenge. Current approaches addressing this challenge adopt the philosophy of "modeling first, refinement next". In the present work, we developed an alternative modeling approach through the novel application of available multiple templates. First, conserved inter-residue interactions are derived from each additional template through conservation analysis of each template-target pairwise alignment. Then, these interactions are converted into distance restraints and incorporated in the homology modeling process. This approach was applied to modeling of the human β2 adrenergic receptor using the bovine rhodopsin and the human protease-activated receptor 1 as templates, and improved model quality was demonstrated compared to the homology models generated by standard single-template and multiple-template methods. This method of "refined restraints first, modeling next" provides a fast approach that complements the current modeling approaches. It allows rational identification and implementation of additional conserved distance restraints extracted from multiple templates and/or experimental data, and has the potential to be applicable to modeling of all helical transmembrane proteins.

  2. Protein-protein interface analysis and hot spots identification for chemical ligand design.

    PubMed

    Chen, Jing; Ma, Xiaomin; Yuan, Yaxia; Pei, Jianfeng; Lai, Luhua

    2014-01-01

    Rational design of chemical compounds targeting protein-protein interactions has grown from a dream to reality after a decade of efforts. There are an increasing number of successful examples, though major challenges remain in the field. In this paper, we will first give a brief review of the available methods that can be used to analyze protein-protein interfaces and predict hot spots for chemical ligand design. New developments in binding site detection, ligandability, and hot spot prediction from the authors' group will also be described. Pocket V.3 is an improved program for identifying hot spots in a protein-protein interface using only an apo protein structure. It has been developed based on Pocket V.2, which can derive a receptor-based pharmacophore model for a ligand binding cavity. Given the similarities and differences between pharmacophores and hot spots for guiding the design of chemical compounds, not only energetic but also spatial properties of the protein-protein interface are used in Pocket V.3. In order to illustrate the capability of Pocket V.3, two datasets have been used. One is taken from ASEdb and BID, which contain experimental alanine scanning results, for testing hot spot prediction. The other is taken from the 2P2I database, which contains complex structures of protein-ligand binding at the original protein-protein interface, for testing the application of hot spots in ligand design.

  3. Identification and characterization of moonlighting long non-coding RNAs based on RNA and protein interactome.

    PubMed

    Cheng, Lixin; Leung, Kwong-Sak

    2018-05-16

    Moonlighting proteins are a class of proteins having multiple distinct functions, which play essential roles in a variety of cellular and enzymatic functioning systems. Although there have long been calls for computational algorithms for the identification of moonlighting proteins, research on approaches to identify moonlighting long non-coding RNAs (lncRNAs) has never been undertaken. Here, we introduce a novel methodology, MoonFinder, for the identification of moonlighting lncRNAs. MoonFinder is a statistical algorithm identifying moonlighting lncRNAs without a priori knowledge through the integration of protein interactome, RNA-protein interactions, and functional annotation of proteins. We identify 155 moonlighting lncRNA candidates and uncover that they are a distinct class of lncRNAs characterized by specific sequence and cellular localization features. The non-coding genes that transcribe moonlighting lncRNAs tend to have shorter but more exons, and the moonlighting lncRNAs have a variable localization pattern with a high chance of residing in the cytoplasmic compartment in comparison to the other lncRNAs. Moreover, moonlighting lncRNAs and moonlighting proteins are rather mutually exclusive in terms of both their direct interactions and interacting partners. Our results also shed light on how the moonlighting candidates and their interacting proteins are implicated in the formation and development of cancers and other diseases. The code implementing MoonFinder is supplied as an R package in the supplementary material. Contact: lxcheng@cse.cuhk.edu.hk or ksleung@cse.cuhk.edu.hk. Supplementary data are available at Bioinformatics online.

  4. Identification of a dual-specificity protein phosphatase that inactivates a MAP kinase from Arabidopsis

    NASA Technical Reports Server (NTRS)

    Gupta, R.; Huang, Y.; Kieber, J.; Luan, S.; Evans, M. L. (Principal Investigator)

    1998-01-01

    Mitogen-activated protein kinases (MAPKs) play a key role in plant responses to stress and pathogens. Activation and inactivation of MAPKs involve phosphorylation and dephosphorylation on both threonine and tyrosine residues in the kinase domain. Here we report the identification of an Arabidopsis gene encoding a dual-specificity protein phosphatase capable of hydrolysing both phosphoserine/threonine and phosphotyrosine in protein substrates. This enzyme, designated AtDsPTP1 (Arabidopsis thaliana dual-specificity protein tyrosine phosphatase), dephosphorylated and inactivated AtMPK4, a MAPK member from the same plant. Replacement of a highly conserved cysteine by serine abolished phosphatase activity of AtDsPTP1, indicating a conserved catalytic mechanism of dual-specificity protein phosphatases from all eukaryotes.

  5. A practical guide for the identification of membrane and plasma membrane proteins in human embryonic stem cells and human embryonal carcinoma cells.

    PubMed

    Dormeyer, Wilma; van Hoof, Dennis; Mummery, Christine L; Krijgsveld, Jeroen; Heck, Albert J R

    2008-10-01

    The identification of (plasma) membrane proteins in cells can provide valuable insights into the regulation of their biological processes. Pluripotent cells such as human embryonic stem cells and embryonal carcinoma cells are capable of unlimited self-renewal and share many of the biological mechanisms that regulate proliferation and differentiation. The comparison of their membrane proteomes will help unravel the biological principles of pluripotency, and the identification of biomarker proteins in their plasma membranes is considered a crucial step to fully exploit pluripotent cells for therapeutic purposes. For these tasks, membrane proteomics is the method of choice, but as indicated by the scarce identification of membrane and plasma membrane proteins in global proteomic surveys it is not an easy task. In this minireview, we first describe the general challenges of membrane proteomics. We then review current sample preparation steps and discuss protocols that we found particularly beneficial for the identification of large numbers of (plasma) membrane proteins in human tumour- and embryo-derived stem cells. Our optimized assembled protocol led to the identification of a large number of membrane proteins. However, as the composition of cells and membranes is highly variable we still recommend adapting the sample preparation protocol for each individual system.

  6. Modified Protein Improves Vitiligo Symptoms in Mice

    MedlinePlus

    ... Vitiligo Symptoms in Mice Spotlight on Research Modified Protein Improves Vitiligo Symptoms in Mice By Colleen Labbe, ... D., Ph.D., Rush University. Altering a key protein involved in the development of vitiligo may protect ...

  7. Improving membrane protein expression and function using genomic edits

    DOE PAGES

    Jensen, Heather M.; Eng, Thomas; Chubukov, Victor; ...

    2017-10-12

    Expression of membrane proteins often leads to growth inhibition and perturbs central metabolism and this burden varies with the protein being overexpressed. There are also known strain backgrounds that allow greater expression of membrane proteins but that differ in efficacy across proteins. Here, we hypothesized that for any membrane protein, it may be possible to identify a modified strain background where its expression can be accommodated with less burden. To directly test this hypothesis, we used a bar-coded transposon insertion library in tandem with cell sorting to assess genome-wide impact of gene deletions on membrane protein expression. The expression of five membrane proteins (CyoB, CydB, MdlB, YidC, and LepI) and one soluble protein (GST), each fused to GFP, was examined. We identified Escherichia coli mutants that demonstrated increased membrane protein expression relative to that in wild type. For two of the proteins (CyoB and CydB), we conducted functional assays to confirm that the increase in protein expression also led to phenotypic improvement in function. This study represents a systematic approach to broadly identify genetic loci that can be used to improve membrane protein expression, and our method can be used to improve expression of any protein that poses a cellular burden.

  8. Improving membrane protein expression and function using genomic edits

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jensen, Heather M.; Eng, Thomas; Chubukov, Victor

    Expression of membrane proteins often leads to growth inhibition and perturbs central metabolism and this burden varies with the protein being overexpressed. There are also known strain backgrounds that allow greater expression of membrane proteins but that differ in efficacy across proteins. Here, we hypothesized that for any membrane protein, it may be possible to identify a modified strain background where its expression can be accommodated with less burden. To directly test this hypothesis, we used a bar-coded transposon insertion library in tandem with cell sorting to assess genome-wide impact of gene deletions on membrane protein expression. The expression of five membrane proteins (CyoB, CydB, MdlB, YidC, and LepI) and one soluble protein (GST), each fused to GFP, was examined. We identified Escherichia coli mutants that demonstrated increased membrane protein expression relative to that in wild type. For two of the proteins (CyoB and CydB), we conducted functional assays to confirm that the increase in protein expression also led to phenotypic improvement in function. This study represents a systematic approach to broadly identify genetic loci that can be used to improve membrane protein expression, and our method can be used to improve expression of any protein that poses a cellular burden.

  9. Identification of the protein components displaying immunomodulatory activity in aged garlic extract.

    PubMed

    Chandrashekar, P M; Venkatesh, Y P

    2009-07-30

    Traditionally, garlic (Allium sativum L.; Alliaceae) has been known to boost the immune system. Aged garlic has more potent immunomodulatory effects than raw garlic. These effects have been attributed to the transformed organosulfur compounds; the identity of the immunomodulatory proteins in aged garlic extract (AGE) is not known. The major aims are to examine the changes occurring in the protein fraction during ageing of garlic and to identify the immunomodulatory proteins. Changes occurring in garlic during ageing have been examined by protein quantitation and gel electrophoresis. Purification and identification of the immunomodulatory proteins have been achieved by Q-Sepharose chromatography and mitogenic activity. Only two major proteins (12-14 kDa range by SDS-PAGE) are observed in AGE. The purified protein components QA-1, QA-2, and QA-3 display immunomodulatory and mannose-binding activity; QA-2 shows the highest mitogenic activity. The identity of QA-2 and QA-1 proteins with the garlic lectins ASA I and ASA II, respectively, has been confirmed by hemagglutination analysis. QA-3 exhibits mitogenic activity, but no hemagglutination activity. The immunomodulatory activity of AGE is also contributed by immunomodulatory proteins. The major immunomodulatory proteins have been identified as the well-known garlic lectins.

  10. Identification of DNA-Binding Proteins Using Structural, Electrostatic and Evolutionary Features

    PubMed Central

    Nimrod, Guy; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir

    2009-01-01

    DNA binding proteins (DBPs) often take part in various crucial processes of the cell's life cycle. Therefore, the identification and characterization of these proteins are of great importance. We present here a random forests classifier for identifying DBPs among proteins with known three-dimensional structures. First, clusters of evolutionarily conserved regions (patches) on the protein's surface are detected using the PatchFinder algorithm; previous studies showed that these regions are typically the proteins' functionally important regions. Next, we train a classifier using features like the electrostatic potential, cluster-based amino acid conservation patterns and the secondary structure content of the patches, as well as features of the whole protein including its dipole moment. Using 10-fold cross validation on a dataset of 138 DNA-binding proteins and 110 proteins which do not bind DNA, the classifier achieved a sensitivity and a specificity of 0.90, which is overall better than the performance of previously published methods. Furthermore, when we tested 5 different methods on 11 new DBPs which did not appear in the original dataset, only our method annotated all correctly. The resulting classifier was applied to a collection of 757 proteins of known structure and unknown function. Of these proteins, 218 were predicted to bind DNA, and we anticipate that some of them interact with DNA using new structural motifs. The use of complementary computational tools supports the notion that at least some of them do bind DNA. PMID:19233205
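
    The sensitivity and specificity quoted above are ratios over a confusion matrix. The sketch below shows the arithmetic only; the counts are hypothetical values chosen to be consistent with the stated class sizes (138 binders, 110 non-binders) and are not taken from the paper.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)  # fraction of true DNA binders recovered
    specificity = tn / (tn + fp)  # fraction of non-binders correctly rejected
    return sensitivity, specificity


# Hypothetical counts: 124 + 14 = 138 binders, 99 + 11 = 110 non-binders.
# The abstract does not report the actual confusion matrix.
sens, spec = sensitivity_specificity(tp=124, fn=14, tn=99, fp=11)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
```

    With unbalanced classes, reporting both ratios is more informative than a single accuracy figure, which may be why the abstract quotes them separately.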

  11. P185-M Protein Identification and Validation of Results in Workflows that Integrate over Various Instruments, Datasets, Search Engines

    PubMed Central

    Hufnagel, P.; Glandorf, J.; Körting, G.; Jabs, W.; Schweiger-Hufnagel, U.; Hahner, S.; Lubeck, M.; Suckau, D.

    2007-01-01

    Analysis of complex proteomes often results in long protein lists, but falls short in measuring the validity of identification and quantification results on a greater number of proteins. Biological and technical replicates are mandatory, as is the combination of the MS data from various workflows (gels, 1D-LC, 2D-LC), instruments (TOF/TOF, trap, qTOF or FTMS), and search engines. We describe a database-driven study that combines two workflows, two mass spectrometers, and four search engines with protein identification following a decoy database strategy. The sample was a tryptically digested lysate (10,000 cells) of a human colorectal cancer cell line. Data from two LC-MALDI-TOF/TOF runs and a 2D-LC-ESI-trap run using capillary and nano-LC columns were submitted to the proteomics software platform ProteinScape. The combined MALDI data and the ESI data were searched using Mascot (Matrix Science), Phenyx (GeneBio), ProteinSolver (Bruker and Protagen), and Sequest (Thermo) against a decoy database generated from IPI-human in order to obtain one protein list across all workflows and search engines at a defined maximum false-positive rate of 5%. ProteinScape combined the data to one LC-MALDI and one LC-ESI dataset. The initial separate searches from the two combined datasets generated eight independent peptide lists. These were compiled into an integrated protein list using the ProteinExtractor algorithm. An initial evaluation of the generated data led to the identification of approximately 1200 proteins. Result integration on a peptide level allowed discrimination of protein isoforms that would not have been possible with a mere combination of protein lists.
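
    A decoy database strategy is commonly used to pick a score cutoff at a chosen error rate. The sketch below is a minimal illustration under stated assumptions: it estimates the error rate as decoys divided by targets among accepted hits, which is one common convention and not necessarily the rule applied by ProteinScape or ProteinExtractor. The function name and the toy scores are hypothetical.

```python
def threshold_at_fdr(hits, max_fdr=0.05):
    """Return the lowest score cutoff whose estimated FDR is <= max_fdr.

    hits is a list of (score, is_decoy) pairs. The FDR estimate used here is
    decoys / targets among all hits scoring at or above the cutoff.
    Returns None if no cutoff satisfies the limit.
    """
    best = None
    targets = decoys = 0
    for score, is_decoy in sorted(hits, key=lambda h: h[0], reverse=True):
        if is_decoy:
            decoys += 1
        else:
            targets += 1
        if targets and decoys / targets <= max_fdr:
            best = score  # keep lowering the cutoff while the limit holds
    return best


# Toy data (hypothetical): 30 target hits and 3 decoy hits.
hits = (
    [(s, False) for s in range(76, 101)]
    + [(75, True)]
    + [(s, False) for s in range(70, 75)]
    + [(69, True), (68, True)]
)
cutoff = threshold_at_fdr(hits, max_fdr=0.05)
print("score cutoff at 5% estimated FDR:", cutoff)
```

    Applying one cutoff rule to every search engine's output is what makes lists from different engines comparable before they are merged.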

  12. Genetically modified proteins: functional improvement and chimeragenesis

    PubMed Central

    Balabanova, Larissa; Golotin, Vasily; Podvolotskaya, Anna; Rasskazov, Valery

    2015-01-01

    This review focuses on the emerging role of site-specific mutagenesis and chimeragenesis for the functional improvement of proteins in areas where traditional protein engineering methods have been extensively used and practically exhausted. This new path to creating novel proteins builds on the further development of structure and sequence optimization algorithms that generate and design accurate structure models from x-ray crystallography studies of many proteins and their mutant forms. Artificial genetic modifications aim to expand nature's repertoire of biomolecules. Among the most exciting potential outcomes of mutagenesis or chimeragenesis is the design of effective diagnostics, bio-therapeutics and biocatalysts. A sampling of recent examples is listed below for the in vivo and in vitro genetic improvement of various binding protein and enzyme functions, with references for more in-depth study provided for the reader's benefit. PMID:26211369

  13. Identification, sequencing and expression of an integral membrane protein of the trans-Golgi network (TGN38).

    PubMed Central

    Luzio, J P; Brake, B; Banting, G; Howell, K E; Braghetta, P; Stanley, K K

    1990-01-01

    Organelle-specific integral membrane proteins were identified by a novel strategy which gives rise to monospecific antibodies to these proteins as well as to the cDNA clones encoding them. A cDNA expression library was screened with a polyclonal antiserum raised against Triton X-114-extracted organelle proteins and clones were then grouped using antibodies affinity-purified on individual fusion proteins. The identification, molecular cloning and sequencing are described of a type 1 membrane protein (TGN38) which is located specifically in the trans-Golgi network. PMID:2204342

  14. Identification and Characterization of Arabidopsis Seed Coat Mucilage Proteins

    PubMed Central

    Tsai, Allen Yi-Lun; Kunieda, Tadashi; Rogalski, Jason; Foster, Leonard J.; Ellis, Brian E.

    2017-01-01

    Plant cell wall proteins are important regulators of cell wall architecture and function. However, because cell wall proteins are difficult to extract and analyze, they are generally poorly understood. Here, we describe the identification and characterization of proteins integral to the Arabidopsis (Arabidopsis thaliana) seed coat mucilage, a specialized layer of the extracellular matrix composed of plant cell wall carbohydrates that is used as a model for cell wall research. The proteins identified in mucilage include those previously identified by genetic analysis, and several mucilage proteins are reduced in mucilage-deficient mutant seeds, suggesting that these proteins are genuinely associated with the mucilage. Arabidopsis mucilage has both nonadherent and adherent layers. Both layers have similar protein profiles except for proteins involved in lipid metabolism, which are present exclusively in the adherent mucilage. The most abundant mucilage proteins include a family of proteins named TESTA ABUNDANT1 (TBA1) to TBA3; a less abundant fourth homolog was named TBA-LIKE (TBAL). TBA and TBAL transcripts and promoter activities were detected in developing seed coats, and their expression requires seed coat differentiation regulators. TBA proteins are secreted to the mucilage pocket during differentiation. Although reverse genetics failed to identify a function for TBAs/TBAL, the TBA promoters are highly expressed and cell type specific and so should be very useful tools for targeting proteins to the seed coat epidermis. Altogether, these results highlight the mucilage proteome as a model for cell walls in general, as it shares similarities with other cell wall proteomes while also containing mucilage-specific features. PMID:28003327

  15. Identification of lipopolysaccharide-interacting plasma membrane-type proteins in Arabidopsis thaliana.

    PubMed

    Vilakazi, Cornelius S; Dubery, Ian A; Piater, Lizelle A

    2017-02-01

    Lipopolysaccharide (LPS) is an amphipathic bacterial glycoconjugate found on the external membrane of Gram-negative bacteria. This endotoxin is considered a microbe-associated molecular pattern (MAMP) molecule and has been shown to elicit defense responses in plants. Here, LPS-interacting proteins from Arabidopsis thaliana plasma membrane (PM)-type fractions were captured and identified in order to investigate those involved in LPS perception and linked to triggering of innate immune responses. A novel proteomics-based affinity-capture strategy coupled to liquid chromatography-tandem mass spectrometry (LC-MS/MS) was employed for the enrichment and identification of LPS-interacting proteins. As such, LPS isolated from Burkholderia cepacia (LPS B.cep.) was immobilized on three independent and distinct affinity-based matrices to serve as bait for interacting proteins from A. thaliana leaf and callus tissue. These were resolved by 1D electrophoresis and identified by mass spectrometry. Proteins specifically bound to LPS B.cep. have been implicated in membrane structure (e.g. COBRA-like and tubulin proteins), membrane trafficking and/or transport (e.g. soluble NSF attachment protein receptor (SNARE) proteins, patellin, aquaporin, PM intrinsic proteins (PIP) and H+-ATPase), signal transduction (receptor-like kinases and calcium-dependent protein kinases) as well as defense/stress responses (e.g. hypersensitive-induced response (HIR) proteins, jacalin-like lectin domain-containing protein and myrosinase-binding proteins). The novel affinity-capture strategy for the enrichment of LPS-interacting proteins proved to be effective, especially in the binding of proteins involved in plant defense responses, and can thus be used to elucidate LPS-mediated molecular recognition and disease mechanism(s). Copyright © 2016 Elsevier Masson SAS. All rights reserved.

  16. Proteomic platform for the identification of proteins in olive (Olea europaea) pulp.

    PubMed

    Capriotti, Anna Laura; Cavaliere, Chiara; Foglia, Patrizia; Piovesana, Susy; Samperi, Roberto; Stampachiacchiere, Serena; Laganà, Aldo

    2013-10-24

    The nutritional and cancer-protective properties of the oil extracted mechanically from the ripe fruits of Olea europaea trees are attracting constantly more attention worldwide. The preparation of high-quality protein samples from plant tissues for proteomic analysis poses many challenging problems. In this study we employed a proteomic platform based on two different extraction methods, SDS and CHAPS based protocols, followed by two precipitation protocols, TCA/acetone and MeOH precipitation, in order to increase the final number of identified proteins. The use of advanced MS techniques in combination with the Swissprot and NCBI Viridiplantae databases and TAIR10 Arabidopsis database allowed us to identify 1265 proteins, of which 22 belong to O. europaea. The application of this proteomic platform for protein extraction and identification will be useful also for other proteomic studies on recalcitrant plant/fruit tissues. Copyright © 2013. Published by Elsevier B.V.

  17. Identification of DNA-binding proteins using structural, electrostatic and evolutionary features.

    PubMed

    Nimrod, Guy; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir

    2009-04-10

    DNA-binding proteins (DBPs) participate in various crucial processes in the life-cycle of the cells, and the identification and characterization of these proteins is of great importance. We present here a random forests classifier for identifying DBPs among proteins with known 3D structures. First, clusters of evolutionarily conserved regions (patches) on the surface of proteins were detected using the PatchFinder algorithm; earlier studies showed that these regions are typically the functionally important regions of proteins. Next, we trained a classifier using features like the electrostatic potential, cluster-based amino acid conservation patterns and the secondary structure content of the patches, as well as features of the whole protein, including its dipole moment. Using 10-fold cross-validation on a dataset of 138 DBPs and 110 proteins that do not bind DNA, the classifier achieved a sensitivity and a specificity of 0.90, which is overall better than the performance of published methods. Furthermore, when we tested five different methods on 11 new DBPs that did not appear in the original dataset, only our method annotated all correctly. The resulting classifier was applied to a collection of 757 proteins of known structure and unknown function. Of these proteins, 218 were predicted to bind DNA, and we anticipate that some of them interact with DNA using new structural motifs. The use of complementary computational tools supports the notion that at least some of them do bind DNA.
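
    The 10-fold cross-validation mentioned above can be illustrated by how the 248 structures might be partitioned. The sketch below assumes a stratified split, in which each fold keeps roughly the same binder to non-binder ratio; the abstract does not say whether the authors stratified, and the function name and seed are hypothetical.

```python
import random


def stratified_folds(labels, k=10, seed=0):
    """Assign every sample index to one of k folds, class by class."""
    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    for cls in sorted(set(labels)):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        for pos, i in enumerate(idx):
            folds[pos % k].append(i)  # deal indices round-robin across folds
    return folds


# Class sizes from the abstract: 138 DBPs (label 1), 110 non-binders (label 0).
labels = [1] * 138 + [0] * 110
folds = stratified_folds(labels, k=10)
for n, fold in enumerate(folds):
    positives = sum(labels[i] for i in fold)
    print(f"fold {n}: {len(fold)} samples, {positives} DBPs")
```

    In each round, one fold is held out for testing and the other nine are used for training, so every structure is tested exactly once.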

  18. A pseudo MS3 approach for identification of disulfide-bonded proteins: uncommon product ions and database search.

    PubMed

    Chen, Jianzhong; Shiyanov, Pavel; Schlager, John J; Green, Kari B

    2012-02-01

    It has previously been reported that disulfide and backbone bonds of native intact proteins can be concurrently cleaved using electrospray ionization (ESI) and collision-induced dissociation (CID) tandem mass spectrometry (MS/MS). However, the cleavages of disulfide bonds result in different cysteine modifications in product ions, making it difficult to identify the disulfide-bonded proteins via database search. To solve this identification problem, we have developed a pseudo MS(3) approach by combining nozzle-skimmer dissociation (NSD) and CID on a quadrupole time-of-flight (Q-TOF) mass spectrometer using chicken lysozyme as a model. Although many of the product ions were similar to those typically seen in MS/MS spectra of enzymatically derived peptides, additional uncommon product ions were detected including c(i-1) ions (the i(th) residue being aspartic acid, arginine, lysine and dehydroalanine) as well as those from a scrambled sequence. The formation of these uncommon types of product ions, likely caused by the lack of mobile protons, was proposed to involve bond rearrangements via a six-membered ring transition state and/or salt bridge(s). A search of 20 pseudo MS(3) spectra against the Gallus gallus (chicken) database using Batch-Tag, a program originally designed for bottom-up MS/MS analysis, identified chicken lysozyme as the only hit, with expectation values less than 0.02 for 12 of the spectra. The pseudo MS(3) approach may help to identify disulfide-bonded proteins and determine the associated post-translational modifications (PTMs); the confidence in the identification may be improved by incorporating the fragmentation characteristics into currently available search programs. © American Society for Mass Spectrometry, 2011

  19. Genome-Wide Identification and Expression of Xenopus F-Box Family of Proteins.

    PubMed

    Saritas-Yildirim, Banu; Pliner, Hannah A; Ochoa, Angelica; Silva, Elena M

    2015-01-01

    Protein degradation via the multistep ubiquitin/26S proteasome pathway is a rapid way to alter the protein profile and drive cell processes and developmental changes. Many key regulators of embryonic development are targeted for degradation by E3 ubiquitin ligases. The most studied family of E3 ubiquitin ligases is the SCF ubiquitin ligases, which use F-box adaptor proteins to recognize and recruit target proteins. Here, we used a bioinformatics screen and phylogenetic analysis to identify and annotate the family of F-box proteins in the Xenopus tropicalis genome. To shed light on the function of the F-box proteins, we analyzed expression of F-box genes during early stages of Xenopus development. Many F-box genes are broadly expressed with expression domains localized to diverse tissues including brain, spinal cord, eye, neural crest derivatives, somites, kidneys, and heart. All together, our genome-wide identification and expression profiling of the Xenopus F-box family of proteins provide a foundation for future research aimed to identify the precise role of F-box dependent E3 ubiquitin ligases and their targets in the regulatory circuits of development.

  20. Identification and characterization of intracellular proteins that bind oligonucleotides with phosphorothioate linkages

    PubMed Central

    Liang, Xue-hai; Sun, Hong; Shen, Wen; Crooke, Stanley T.

    2015-01-01

    Although the RNase H-dependent mechanism of inhibition of gene expression by chemically modified antisense oligonucleotides (ASOs) has been well characterized, little is known about the interactions between ASOs and intracellular proteins that may alter cellular localization and/or potency of ASOs. Here, we report the identification of 56 intracellular ASO-binding proteins using multi-step affinity selection approaches. Many of the tested proteins had no significant effect on ASO activity; however, some proteins, including La/SSB, NPM1, ANXA2, VARS and PC4, appeared to enhance ASO activities, likely through mechanisms related to subcellular distribution. VARS and ANXA2 co-localized with ASOs in endocytic organelles, and reduction in the level of VARS altered lysosome/ASO localization patterns, implying that these proteins may facilitate ASO release from the endocytic pathway. Depletion of La and NPM1 reduced nuclear ASO levels, suggesting potential roles in ASO nuclear accumulation. On the other hand, Ku70 and Ku80 proteins inhibited ASO activity, most likely by competition with RNase H1 for ASO/RNA duplex binding. Our results demonstrate that phosphorothioate-modified ASOs bind a set of cellular proteins that affect ASO activity via different mechanisms. PMID:25712094

  1. Identification of a nuclear localization signal in the retinitis pigmentosa-mutated RP26 protein, ceramide kinase-like protein

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Inagaki, Yuichi; Mitsutake, Susumu; Igarashi, Yasuyuki

    2006-05-12

    Retinitis pigmentosa (RP) is a genetically heterogeneous disease characterized by degeneration of the retina. A mutation in a new ceramide kinase (CERK) homologous gene, named CERK-like protein (CERKL), was found to cause autosomal recessive retinitis pigmentosa (RP26). Here, we show that a point mutation in one of two putative nuclear localization signal (NLS) sequences inhibited the nuclear localization of the protein. Furthermore, the tetra-GFP-tagged NLS, which cannot passively enter the nucleus, was observed not only in the nucleus but also in the nucleolus. Our results provide the first evidence of the active nuclear import of CERKL and suggest that the identified NLS might be responsible for nucleolar retention of the protein. As recent studies have shown other RP-related proteins are localized in the nucleus or the nucleolus, our identification of NLS in CERKL suggests that CERKL likely plays important roles for retinal functions in the nucleus and the nucleolus.

  2. Identification of ZASP, a novel protein associated to Zona occludens-2.

    PubMed

    Lechuga, Susana; Alarcón, Lourdes; Solano, Jesús; Huerta, Miriam; Lopez-Bayghen, Esther; González-Mariscal, Lorenza

    2010-11-15

    With the aim of discovering new molecular interactions of the tight junction protein ZO-2, a two-hybrid screen was performed on a human kidney cDNA library using as bait the middle segment of ZO-2. Through this assay we identified a 24-kDa novel protein herein named ZASP for ZO-2 associated speckle protein. The ZO-2/ZASP interaction, further confirmed by pull-down and immunoprecipitation experiments, requires the presence of the intact PDZ binding motif SQV of ZASP and the third PDZ domain of ZO-2. ZASP mRNA and protein are present in the kidney and in several epithelial cell lines. Endogenous ZASP is expressed primarily in nuclear speckles in co-localization with splicing factor SC-35. Nocodazole treatment and washout reveal that ZASP disappears from the nucleus during mitosis, in accordance with speckle disassembly during metaphase. The ZASP amino acid sequence exhibits a canonical nuclear exportation signal and, in agreement, the protein exits the nucleus through a process mediated by exportin/CRM1. ZASP over-expression blocks the inhibitory activity of ZO-2 on cyclin D1 gene transcription and protein expression. The identification of ZASP helps to unfold the complex nuclear molecular arrays that form on ZO-2 scaffolds. Copyright © 2010 Elsevier Inc. All rights reserved.

  3. THE IDENTIFICATION AND CHARACTERIZATION OF AN IGE-INDUCING PROTEIN IN METARHIZIUM ANISOPLIAE EXTRACT

    EPA Science Inventory

    Marsha D.W. Ward1, Lisa B. Copeland1, Maura J. Donahue2, and Jody A. Shoemaker3
    1ORD, NHEERL, US EPA, RTP, NC; 2Oak Ridge Institute for Science and Education, Cincinnati...

  4. Comparative Evaluation of Small Molecular Additives and Their Effects on Peptide/Protein Identification.

    PubMed

    Gao, Jing; Zhong, Shaoyun; Zhou, Yanting; He, Han; Peng, Shuying; Zhu, Zhenyun; Liu, Xing; Zheng, Jing; Xu, Bin; Zhou, Hu

    2017-06-06

    Detergents and salts are widely used in lysis buffers to enhance protein extraction from biological samples, facilitating in-depth proteomic analysis. However, these detergents and salt additives must be efficiently removed from the digested samples prior to LC-MS/MS analysis to obtain high-quality mass spectra. Although filter-aided sample preparation (FASP), acetone precipitation (AP), followed by in-solution digestion, and strong cation exchange-based centrifugal proteomic reactors (CPRs) are commonly used for proteomic sample processing, little is known about their efficiencies at removing detergents and salt additives. In this study, we (i) developed an integrative workflow for the quantification of small molecular additives in proteomic samples, developing a multiple reaction monitoring (MRM)-based LC-MS approach for the quantification of six additives (i.e., Tris, urea, CHAPS, SDS, SDC, and Triton X-100) and (ii) systematically evaluated the relationships between the level of additive remaining in samples following sample processing and the number of peptides/proteins identified by mass spectrometry. Although FASP outperformed the other two methods, the results were complementary in terms of peptide/protein identification, as well as the GRAVY index and amino acid distributions. This is the first systematic and quantitative study of the effect of detergents and salt additives on protein identification. This MRM-based approach can be used for an unbiased evaluation of the performance of new sample preparation methods. Data are available via ProteomeXchange under identifier PXD005405.

  5. Improved protein hydrogen/deuterium exchange mass spectrometry platform with fully automated data processing.

    PubMed

    Zhang, Zhongqi; Zhang, Aming; Xiao, Gang

    2012-06-05

    Protein hydrogen/deuterium exchange (HDX) followed by protease digestion and mass spectrometric (MS) analysis is accepted as a standard method for studying protein conformation and conformational dynamics. In this article, an improved HDX MS platform with fully automated data processing is described. The platform significantly reduces systematic and random errors in the measurement by introducing two types of corrections in HDX data analysis. First, a mixture of short peptides with fast HDX rates is introduced as internal standards to adjust the variations in the extent of back exchange from run to run. Second, a designed unique peptide (PPPI) with slow intrinsic HDX rate is employed as another internal standard to reflect the possible differences in protein intrinsic HDX rates when protein conformations at different solution conditions are compared. HDX data processing is achieved with a comprehensive HDX model to simulate the deuterium labeling and back exchange process. The HDX model is implemented into the in-house developed software MassAnalyzer and enables fully unattended analysis of the entire protein HDX MS data set starting from ion detection and peptide identification to final processed HDX output, typically within 1 day. The final output of the automated data processing is a set (or the average) of the most possible protection factors for each backbone amide hydrogen. The utility of the HDX MS platform is demonstrated by exploring the conformational transition of a monoclonal antibody by increasing concentrations of guanidine.

  6. Identification of neuronal target genes for CCAAT/Enhancer Binding Proteins

    PubMed Central

    Kfoury, N.; Kapatos, G.

    2009-01-01

    CCAAT/Enhancer Binding Proteins (C/EBPs) play pivotal roles in development and plasticity of the nervous system. Identification of the physiological targets of C/EBPs (C/EBP target genes) should therefore provide insight into the underlying biology of these processes. We used unbiased genome-wide mapping to identify 115 C/EBPβ target genes in PC12 cells that include transcription factors, neurotransmitter receptors, ion channels, protein kinases and synaptic vesicle proteins. C/EBPβ binding sites were located primarily within introns, suggesting novel regulatory functions, and were associated with binding sites for other developmentally important transcription factors. Experiments using dominant negatives showed C/EBPβ to repress transcription of a subset of target genes. Target genes in rat brain were subsequently found to preferentially bind C/EBPα, β and δ. Analysis of the hippocampal transcriptome of C/EBPβ knockout mice revealed dysregulation of a high percentage of transcripts identified as C/EBP target genes. These results support the hypothesis that C/EBPs play non-redundant roles in the brain. PMID:19103292

  7. Identification of ADAM 31: a protein expressed in Leydig cells and specialized epithelia.

    PubMed

    Liu, L; Smith, J W

    2000-06-01

    A family of proteins containing a disintegrin and metalloproteinase domain (ADAMs) has been identified recently. Here, we report the identification of a novel member of the ADAM protein family from mouse. This protein is designated ADAM 31. The complementary DNA sequence of ADAM 31 predicts a transmembrane protein with metalloproteinase, disintegrin, cysteine-rich, and cytoplasmic domains. Messenger RNA encoding ADAM 31 was most abundant in testes, but was also detected in many other tissues. More significantly, the antibodies raised against ADAM 31 reveal that the protein has a unique and restricted expression pattern. ADAM 31 is expressed in Leydig cells of the testes, but unlike many other ADAMs, it is not found on developing sperm. Furthermore, ADAM 31 is highly expressed on four types of specialized epithelia: the cauda epididymidis, the vas deferens, the convoluted tubules of the kidney, and the parietal cells of the stomach.

  8. Identification of 24h Ixodes scapularis immunogenic tick saliva proteins.

    PubMed

    Lewis, Lauren A; Radulović, Željko M; Kim, Tae K; Porter, Lindsay M; Mulenga, Albert

    2015-04-01

    Ixodes scapularis is arguably the most medically important tick species in the United States. This tick transmits 5 of the 14 human tick-borne disease (TBD) agents in the USA: Borrelia burgdorferi, Anaplasma phagocytophilum, B. miyamotoi, Babesia microti, and Powassan virus disease. Except for the Powassan virus disease, I. scapularis-vectored TBD agents require more than 24h post attachment to be transmitted. This study describes identification of 24h immunogenic I. scapularis tick saliva proteins, which could provide opportunities to develop strategies to stop tick feeding before transmission of the majority of pathogens. A 24h fed female I. scapularis phage display cDNA expression library was biopanned using rabbit antibodies to 24h fed I. scapularis female tick saliva proteins, subjected to next generation sequencing, de novo assembly, and bioinformatic analyses. A total of 182 contigs were assembled, of which ∼19% (35/182) are novel and did not show identity to any known proteins in GenBank. The remaining ∼81% (147/182) of contigs were provisionally identified based on matches in GenBank including ∼18% (27/147) that matched protein sequences previously annotated as hypothetical and putative tick saliva proteins. Others include proteases and protease inhibitors (∼3%, 5/147), transporters and/or ligand binding proteins (∼6%, 9/147), immunogenic tick saliva housekeeping enzyme-like (17%, 25/147), ribosomal protein-like (∼31%, 46/147), and those classified as miscellaneous (∼24%, 35/147). Notable among the miscellaneous class are antimicrobial peptides (microplusin and ricinusin), myosin-like proteins that have been previously found in tick saliva, and heat shock tick saliva protein. The data in this study provide the foundation for in-depth analysis of I. scapularis feeding during the first 24h, before the majority of TBD agents can be transmitted. Copyright © 2015 Elsevier GmbH. All rights reserved.
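
    The contig breakdown above can be checked with a few lines of arithmetic. The sketch below uses only the counts stated in the abstract; the shortened category labels are paraphrases, and the variable names are illustrative.

```python
total_contigs = 182
novel = 35
matched = total_contigs - novel  # contigs with a GenBank match

# Counts per category as given in the abstract (labels shortened).
categories = {
    "hypothetical/putative tick saliva proteins": 27,
    "proteases and protease inhibitors": 5,
    "transporters and/or ligand binding proteins": 9,
    "housekeeping enzyme-like": 25,
    "ribosomal protein-like": 46,
    "miscellaneous": 35,
}

# Percentage of matched contigs in each category, rounded to whole percent.
shares = {name: round(100 * n / matched) for name, n in categories.items()}

print(f"novel: {novel}/{total_contigs} = {100 * novel / total_contigs:.0f}%")
print(f"matched: {matched}/{total_contigs} = {100 * matched / total_contigs:.0f}%")
for name, n in categories.items():
    print(f"{name}: {n}/{matched} = {shares[name]}%")
```

    The six category counts sum to 147, so the categories as listed account for every matched contig.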

  9. A systematic identification of species-specific protein succinylation sites using joint element features information.

    PubMed

    Hasan, Md Mehedi; Khatun, Mst Shamima; Mollah, Md Nurul Haque; Yong, Cao; Guo, Dianjing

    2017-01-01

    Lysine succinylation, an important type of protein posttranslational modification, plays significant roles in many cellular processes. Accurate identification of succinylation sites can facilitate our understanding of the molecular mechanism and potential roles of lysine succinylation. However, even in well-studied systems, a majority of the succinylation sites remain undetected because the traditional experimental approaches to succinylation site identification are often costly, time-consuming, and laborious. An in silico approach, on the other hand, is potentially an alternative strategy to predict succinylation substrates. In this paper, a novel computational predictor, SuccinSite2.0, was developed for predicting generic and species-specific protein succinylation sites. This predictor takes the composition of profile-based amino acid and orthogonal binary features, which were used to train a random forest classifier. We demonstrated that the proposed SuccinSite2.0 predictor outperformed other currently existing implementations on a complementarily independent dataset. Furthermore, the important features that make visible contributions to species-specific and cross-species-specific prediction of protein succinylation sites were analyzed. The proposed predictor is anticipated to be a useful computational resource for lysine succinylation site prediction. The integrated species-specific online tool of SuccinSite2.0 is publicly accessible.
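
    Orthogonal binary features are usually understood as a one-hot encoding of each residue in a sequence window centred on the candidate lysine. The sketch below illustrates that general idea only; the window size, the padding character, the residue ordering, and both function names are assumptions, and the abstract does not specify how SuccinSite2.0 builds its windows.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, fixed order


def orthogonal_binary(window):
    """Encode a peptide window as concatenated 20-bit one-hot vectors.

    Characters outside the 20 standard residues (such as the '-' padding
    used below) are encoded as all zeros.
    """
    vec = []
    for residue in window:
        vec.extend(1 if residue == aa else 0 for aa in AMINO_ACIDS)
    return vec


def lysine_windows(sequence, flank=3):
    """Yield (position, window) for every lysine, padding ends with '-'."""
    padded = "-" * flank + sequence + "-" * flank
    for i, residue in enumerate(sequence):
        if residue == "K":
            yield i, padded[i : i + 2 * flank + 1]


# Toy sequence (hypothetical), with a flank of 2 residues on each side.
windows = list(lysine_windows("MKTAYK", flank=2))
for pos, window in windows:
    features = orthogonal_binary(window)
    print(pos, window, len(features), sum(features))
```

    Each window of length w yields a vector of 20·w bits, which could then be concatenated with other feature groups before training a classifier.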

  10. Improving CID, HCD, and ETD FT MS/MS degradome-peptidome identifications using high accuracy mass information

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Shen, Yufeng; Tolic, Nikola; Purvine, Samuel O.

    2011-11-07

    The peptidome (i.e. processed and degraded forms of proteins) of e.g. blood can potentially provide insights into disease processes, as well as a source of candidate biomarkers that are unobtainable using conventional bottom-up proteomics approaches. MS dissociation methods, including CID, HCD, and ETD, can each contribute distinct identifications using conventional peptide identification methods (Shen et al. J. Proteome Res. 2011), but such samples still pose significant analysis and informatics challenges. In this work, we explored a simple approach for better utilization of high accuracy fragment ion mass measurements provided e.g. by FT MS/MS and demonstrated significant improvements relative to conventional descriptive and probabilistic scoring methods. For example, at the same FDR level we identified 20-40% more peptides than SEQUEST and Mascot scoring methods using high accuracy fragment ion information (e.g., <10 mass errors) from CID, HCD, and ETD spectra. Species identified covered >90% of all those identified from SEQUEST, Mascot, and MS-GF scoring methods. Additionally, we found that merging the different fragment spectra provided >60% more species using the UStags method than achieved previously, and enabled >1000 peptidome components to be identified from a single human blood plasma sample with a 0.6% peptide-level FDR, providing an improved basis for investigation of potentially disease-related peptidome components.

  11. How enhanced molecular ions in Cold EI improve compound identification by the NIST library.

    PubMed

    Alon, Tal; Amirav, Aviv

    2015-12-15

    Library-based compound identification with electron ionization (EI) mass spectrometry (MS) is a well-established identification method which provides the names and structures of sample compounds up to the isomer level. The library (such as NIST) search algorithm compares different EI mass spectra in the library's database with the measured EI mass spectrum, assigning each of them a similarity score called 'Match' and an overall identification probability. Cold EI, electron ionization of vibrationally cold molecules in supersonic molecular beams, provides mass spectra with all the standard EI fragment ions combined with enhanced Molecular Ions and high-mass fragments. As a result, Cold EI mass spectra differ from those provided by standard EI and tend to yield lower matching scores. However, in most cases, library identification actually improves with Cold EI, as library identification probabilities for the correct library mass spectra increase, despite the lower matching factors. This research examined the way that enhanced molecular ion abundances affect library identification probability and the way that Cold EI mass spectra, which include enhanced molecular ions and high-mass fragment ions, typically improve library identification results. It involved several computer simulations, which incrementally modified the relative abundances of the various ions and analyzed the resulting mass spectra. The simulation results support previous measurements, showing that while enhanced molecular ion and high-mass fragment ions lower the matching factor of the correct library compound, the matching factors of the incorrect library candidates are lowered even more, resulting in a rise in the identification probability for the correct compound. This behavior, which was previously observed by analyzing Cold EI mass spectra, can be explained by the fact that high-mass ions, and especially the molecular ion, characterize a compound more than low-mass ions and therefore carry more
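
    The effect described above can be illustrated with a toy calculation. The sketch below uses plain cosine similarity as a stand-in for a library match score; it is not the NIST search algorithm, and all spectra and abundances are invented for illustration.

```python
# Illustrative sketch: enhancing the molecular ion lowers the match score of
# the correct library entry, but lowers an incorrect entry's score even more.
# Cosine similarity is an assumed stand-in for the library "Match" score.
import math


def cosine_match(a: dict, b: dict) -> float:
    """Cosine similarity between two spectra given as {m/z: abundance}."""
    dot = sum(a.get(mz, 0.0) * b.get(mz, 0.0) for mz in set(a) | set(b))
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


# Hypothetical library entries: same low-mass fragments, different molecular ion.
library_correct = {43: 100.0, 57: 80.0, 200: 5.0}
library_incorrect = {43: 100.0, 57: 80.0, 186: 5.0}

# Hypothetical measured spectra of the correct compound.
standard_ei = {43: 100.0, 57: 80.0, 200: 5.0}
cold_ei = {43: 100.0, 57: 80.0, 200: 100.0}  # enhanced molecular ion

std_gap = cosine_match(standard_ei, library_correct) - cosine_match(standard_ei, library_incorrect)
cold_gap = cosine_match(cold_ei, library_correct) - cosine_match(cold_ei, library_incorrect)
```

    In this toy case both scores fall for the Cold EI spectrum, yet the gap between the correct and incorrect candidates widens, which is the qualitative behavior the abstract reports.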

  12. Improvement on a simplified model for protein folding simulation.

    PubMed

    Zhang, Ming; Chen, Changjun; He, Yi; Xiao, Yi

    2005-11-01

    Improvements were made on a simplified protein model (the Ramachandran model) to achieve better computer simulation of protein folding. To check the validity of such improvements, we chose the ultrafast folding protein Engrailed Homeodomain as an example and explored several aspects of its folding. The engrailed homeodomain is a mainly alpha-helical protein of 61 residues from Drosophila melanogaster. We found that the simplified model of Engrailed Homeodomain can fold into a global minimum state with a tertiary structure in good agreement with its native structure.

  13. Identification of Important Process Variables for Fiber Spinning of Protein Nanotubes Generated from Waste Materials

    DTIC Science & Technology

    2012-01-11

    nanotubes, which sold at the same current cost as carbon nanotubes, this would equate to a $788 million industry. In the USA, the potential to source eye...advantages over carbon nanotubes due to the ability to functionalized them 31. The nanotubes are a highly ordered, insoluble form of protein. Fibrils...1756 Identification of important process variables for fiber spinning of protein nanotubes generated from waste materials. Research Team (listed

  14. Identification of GPCR-Interacting Cytosolic Proteins Using HDL Particles and Mass Spectrometry-Based Proteomic Approach

    PubMed Central

    Chung, Ka Young; Day, Peter W.; Vélez-Ruiz, Gisselle; Sunahara, Roger K.; Kobilka, Brian K.

    2013-01-01

    G protein-coupled receptors (GPCRs) have critical roles in various physiological and pathophysiological processes, and more than 40% of marketed drugs target GPCRs. Although the canonical downstream target of an agonist-activated GPCR is a G protein heterotrimer, there is a growing body of evidence suggesting that other signaling molecules interact, directly or indirectly, with GPCRs. However, due to the low abundance in the intact cell system and poor solubility of GPCRs, identification of these GPCR-interacting molecules remains challenging. Here, we establish a strategy to overcome these difficulties by using high-density lipoprotein (HDL) particles. We used the β2-adrenergic receptor (β2AR), a GPCR involved in regulating cardiovascular physiology, as a model system. We reconstituted purified β2AR in HDL particles, to mimic the plasma membrane environment, and used the reconstituted receptor as bait to pull down binding partners from rat heart cytosol. A total of 293 proteins were identified in the full agonist-activated β2AR pull-down, 242 proteins in the inverse agonist-activated β2AR pull-down, and 210 proteins were commonly identified in both pull-downs. A small subset of the β2AR-interacting proteins isolated was confirmed by Western blot: three known β2AR-interacting proteins (Gsα, NHERF-2, and Grb2) and three newly identified β2AR-interacting proteins (AMPKα, acetyl-CoA carboxylase, and UBC-13). Profiling of the identified proteins showed a clear bias toward intracellular signal transduction pathways, which is consistent with the role of β2AR as a cell signaling molecule. This study suggests that HDL particle-reconstituted GPCRs can provide an effective platform method for the identification of GPCR binding partners coupled with a mass spectrometry-based proteomic analysis. PMID:23372797

  15. Highly efficient proteome analysis with combination of protein pre-fractionation by preparative microscale solution isoelectric focusing and identification by μRPLC-MS/MS with serially coupled long microcolumn.

    PubMed

    Tao, Dingyin; Sun, Liangliang; Zhu, Guijie; Liang, Yu; Liang, Zhen; Zhang, Lihua; Zhang, Yukui

    2011-01-01

    To improve the efficiency of proteome analysis, a strategy combining protein pre-fractionation by preparative microscale solution isoelectric focusing, peptide separation by μRPLC with a serially coupled long microcolumn, and protein identification by ESI-MS/MS was proposed. By the preparative microscale solution isoelectric focusing technique, proteins extracted from whole cell lysates of Escherichia coli were fractionated into five chambers divided by isoelectric membranes, with pH ranges from 3.0 to 4.6, 4.6 to 5.4, 5.4 to 6.2, 6.2 to 7.0 and 7.0 to 10.0, respectively. Compared to the traditional on-gel IEF, the protein recovery was clearly improved to over 95%. Subsequently, the enriched and fractionated proteins in each chamber were digested, and further separated by a 30-cm long serially coupled RP microcolumn. Through the detection by ESI-MS/MS, about 200 proteins were identified in each fraction, and in total 835 proteins were identified even with a one-dimensional μRPLC-MS/MS system. All these results demonstrate that by such a combination strategy, highly efficient proteome analysis could be achieved, not only due to the in-solution protein enrichment and pre-fractionation with improved protein recovery but also owing to the increased separation capacity of serially coupled long μRPLC columns. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
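
    The five pH ranges listed above can be read as a simple binning of proteins by isoelectric point (pI). The sketch below shows that binning only; treating a pI exactly on a membrane boundary as belonging to the higher chamber is an assumption, and real fractionation depends on more than a calculated pI.

```python
# Illustrative sketch: assign a protein to one of the five chambers by pI,
# using the pH boundaries given in the abstract. Boundary handling is assumed.
from bisect import bisect_right

PH_BOUNDARIES = [3.0, 4.6, 5.4, 6.2, 7.0, 10.0]  # edges of chambers 1 to 5


def chamber_for_pi(pi: float):
    """Return the chamber number (1-5) for a pI, or None if outside pH 3.0-10.0."""
    if not PH_BOUNDARIES[0] <= pi <= PH_BOUNDARIES[-1]:
        return None
    index = bisect_right(PH_BOUNDARIES, pi) - 1
    # A pI equal to the upper limit (10.0) still belongs to the last chamber.
    return min(index, len(PH_BOUNDARIES) - 2) + 1


# Hypothetical pI values for illustration only.
assignments = {pi: chamber_for_pi(pi) for pi in (3.0, 5.0, 6.5, 10.0, 11.2)}
```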

  16. Increasing protein stability by improving beta-turns.

    PubMed

    Fu, Hailong; Grimsley, Gerald R; Razvi, Abbas; Scholtz, J Martin; Pace, C Nick

    2009-11-15

    Our goal was to gain a better understanding of how protein stability can be increased by improving beta-turns. We studied 22 beta-turns in nine proteins with 66-370 residues by replacing other residues with proline and glycine and measuring the stability. These two residues are statistically preferred in some beta-turn positions. We studied: Cold shock protein B (CspB), Histidine-containing phosphocarrier protein, Ubiquitin, Ribonucleases Sa2, Sa3, T1, and HI, Tryptophan synthetase alpha-subunit, and Maltose binding protein. Of the 15 single proline mutations, 11 increased stability (Average = 0.8 +/- 0.3; Range = 0.3-1.5 kcal/mol), and the stabilizing effect of double proline mutants was additive. On the basis of this and our previous work, we conclude that proteins can generally be stabilized by replacing nonproline residues with proline residues at the i + 1 position of Type I and II beta-turns and at the i position in Type II beta-turns. Other turn positions can sometimes be used if the phi angle is near -60 degrees for the residue replaced. It is important that the side chain of the residue replaced is less than 50% buried. Identical substitutions in beta-turns in related proteins give similar results. Proline substitutions increase stability mainly by decreasing the entropy of the denatured state. In contrast, the large, diverse group of proteins considered here had almost no residues in beta-turns that could be replaced by Gly to increase protein stability. Improving beta-turns by substituting Pro residues is a generally useful way of increasing protein stability. 2009 Wiley-Liss, Inc.

  17. INCREASING PROTEIN STABILITY BY IMPROVING BETA-TURNS

    PubMed Central

    Fu, Hailong; Grimsley, Gerald R.; Razvi, Abbas; Scholtz, J. Martin; Pace, C. Nick

    2009-01-01

    Our goal was to gain a better understanding of how protein stability can be increased by improving β-turns. We studied 22 β-turns in nine proteins with 66 to 370 residues by replacing other residues with proline and glycine and measuring the stability. These two residues are statistically preferred in some β-turn positions. We studied: Cold shock protein B (CspB), Histidine-containing phosphocarrier protein (HPr), Ubiquitin, Ribonucleases Sa2, Sa3, T1, and HI, Tryptophan synthetase α-subunit (TSα), and Maltose binding protein (MBP). Of the fifteen single proline mutations, 11 increased stability (Average = 0.8 ± 0.3; Range = 0.3 – 1.5 kcal/mol), and the stabilizing effect of double proline mutants was additive. Based on this and our previous work, we conclude that proteins can generally be stabilized by replacing non-proline residues with proline residues at the i + 1 position of Type I and II β-turns and at the i position in Type II β-turns. Other turn positions can sometimes be used if the φ angle is near −60° for the residue replaced. It is important that the side chain of the residue replaced is less than 50% buried. Identical substitutions in β-turns in related proteins give similar results. Proline substitutions increase stability mainly by decreasing the entropy of the denatured state. In contrast, the large, diverse group of proteins considered here had almost no residues in β-turns that could be replaced by Gly to increase protein stability. Improving β-turns by substituting Pro residues is a generally useful way of increasing protein stability. PMID:19626709

  18. Identification and characterization of intracellular proteins that bind oligonucleotides with phosphorothioate linkages.

    PubMed

    Liang, Xue-hai; Sun, Hong; Shen, Wen; Crooke, Stanley T

    2015-03-11

    Although the RNase H-dependent mechanism of inhibition of gene expression by chemically modified antisense oligonucleotides (ASOs) has been well characterized, little is known about the interactions between ASOs and intracellular proteins that may alter cellular localization and/or potency of ASOs. Here, we report the identification of 56 intracellular ASO-binding proteins using multi-step affinity selection approaches. Many of the tested proteins had no significant effect on ASO activity; however, some proteins, including La/SSB, NPM1, ANXA2, VARS and PC4, appeared to enhance ASO activities, likely through mechanisms related to subcellular distribution. VARS and ANXA2 co-localized with ASOs in endocytic organelles, and reduction in the level of VARS altered lysosome/ASO localization patterns, implying that these proteins may facilitate ASO release from the endocytic pathway. Depletion of La and NPM1 reduced nuclear ASO levels, suggesting potential roles in ASO nuclear accumulation. On the other hand, Ku70 and Ku80 proteins inhibited ASO activity, most likely by competition with RNase H1 for ASO/RNA duplex binding. Our results demonstrate that phosphorothioate-modified ASOs bind a set of cellular proteins that affect ASO activity via different mechanisms. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  19. Identification of O-linked β-d-N-acetylglucosamine-Modified Proteins from Arabidopsis

    PubMed Central

    Xu, Shou-Ling; Chalkley, Robert J.; Wang, Zhi-Yong; Burlingame, Alma L.

    2013-01-01

    The posttranslational modification of proteins with O-linked β-d-N-acetylglucosamine (O-GlcNAc) on serine and threonine residues occurs in all animals and plants. This modification is dynamic and ubiquitous, regulates many cellular processes, including transcription, signaling and cytokinesis, and is associated with several diseases. Cycling of O-GlcNAc is tightly regulated by O-GlcNAc transferase (OGT) and O-GlcNAcase (OGA). Plants have two OGTs, SPINDLY (SPY) and SECRET AGENT (SEC); disruption of both causes embryo lethality. Despite O-GlcNAc modification of proteins being discovered more than 20 years ago, identification and mapping of protein GlcNAcylation is still a challenging task. Here we describe the use of lectin affinity chromatography combined with electron transfer dissociation mass spectrometry to enrich and to detect O-GlcNAc modified peptides from Arabidopsis. PMID:22576084

  20. Improved corn protein based articles

    USDA-ARS's Scientific Manuscript database

    Developing higher value uses for zein (corn protein), a potential major co-product of the bio-ethanol industry, will improve the economics of this business. Historically, zein was predominantly used in the textile fiber industry. Unfortunately the techniques used at that time to modify the zein cann...

  1. A Proof of Concept to Bridge the Gap between Mass Spectrometry Imaging, Protein Identification and Relative Quantitation: MSI~LC-MS/MS-LF.

    PubMed

    Théron, Laëtitia; Centeno, Delphine; Coudy-Gandilhon, Cécile; Pujos-Guillot, Estelle; Astruc, Thierry; Rémond, Didier; Barthelemy, Jean-Claude; Roche, Frédéric; Feasson, Léonard; Hébraud, Michel; Béchet, Daniel; Chambon, Christophe

    2016-10-26

    Mass spectrometry imaging (MSI) is a powerful tool to visualize the spatial distribution of molecules on a tissue section. The main limitation of MALDI-MSI of proteins is the lack of direct identification. Therefore, this study focuses on a MSI~LC-MS/MS-LF workflow to link the results from MALDI-MSI with potential peak identification and label-free quantitation, using only one tissue section. At first, we studied the impact of matrix deposition and laser ablation on protein extraction from the tissue section. Then, we did a back-correlation of the m/z of the proteins detected by MALDI-MSI to those identified by label-free quantitation. This allowed us to compare the label-free quantitation of proteins obtained in LC-MS/MS with the peak intensities observed in MALDI-MSI. We managed to link identification to nine peaks observed by MALDI-MSI. The results showed that the MSI~LC-MS/MS-LF workflow (i) allowed us to study a representative muscle proteome compared to a classical bottom-up workflow; and (ii) was sparsely impacted by matrix deposition and laser ablation. This workflow, performed as a proof-of-concept, suggests that a single tissue section can be used to perform MALDI-MSI and protein extraction, identification, and relative quantitation.

  2. A Proof of Concept to Bridge the Gap between Mass Spectrometry Imaging, Protein Identification and Relative Quantitation: MSI~LC-MS/MS-LF

    PubMed Central

    Théron, Laëtitia; Centeno, Delphine; Coudy-Gandilhon, Cécile; Pujos-Guillot, Estelle; Astruc, Thierry; Rémond, Didier; Barthelemy, Jean-Claude; Roche, Frédéric; Feasson, Léonard; Hébraud, Michel; Béchet, Daniel; Chambon, Christophe

    2016-01-01

    Mass spectrometry imaging (MSI) is a powerful tool to visualize the spatial distribution of molecules on a tissue section. The main limitation of MALDI-MSI of proteins is the lack of direct identification. Therefore, this study focuses on a MSI~LC-MS/MS-LF workflow to link the results from MALDI-MSI with potential peak identification and label-free quantitation, using only one tissue section. At first, we studied the impact of matrix deposition and laser ablation on protein extraction from the tissue section. Then, we did a back-correlation of the m/z of the proteins detected by MALDI-MSI to those identified by label-free quantitation. This allowed us to compare the label-free quantitation of proteins obtained in LC-MS/MS with the peak intensities observed in MALDI-MSI. We managed to link identification to nine peaks observed by MALDI-MSI. The results showed that the MSI~LC-MS/MS-LF workflow (i) allowed us to study a representative muscle proteome compared to a classical bottom-up workflow; and (ii) was sparsely impacted by matrix deposition and laser ablation. This workflow, performed as a proof-of-concept, suggests that a single tissue section can be used to perform MALDI-MSI and protein extraction, identification, and relative quantitation. PMID:28248242

  3. A targeted mass spectrometry-based approach for the identification and characterization of proteins containing α-aminoadipic and γ-glutamic semialdehyde residues

    PubMed Central

    Chavez, Juan D.; Bisson, William H.

    2011-01-01

    The site-specific identification of α-aminoadipic semialdehyde (AAS) and γ-glutamic semialdehyde (GGS) residues in proteins is reported. Semialdehydic protein modifications result from the metal-catalyzed oxidation of Lys, or of Arg and Pro residues, respectively. Most of the analytical methods for the analysis of protein carbonylation measure changes in the global level of carbonylation and fail to provide details regarding protein identity, site, and chemical nature of the carbonylation. In this work, we used a targeted approach, which combines chemical labeling, enrichment, and tandem mass spectrometric analysis, for the site-specific identification of AAS and GGS sites in proteins. The approach is applied to in vitro oxidized glyceraldehyde-3-phosphate dehydrogenase (GAPDH) and an untreated biological sample, namely cardiac mitochondrial proteins. The analysis of GAPDH resulted in the site-specific identification of two AAS and four GGS residues. Computational evaluation of the identified AAS and GGS sites in GAPDH indicated that these sites are located in flexible regions, show high solvent accessibility values, and are in proximity with possible metal ion binding sites. The targeted proteomic analysis of semialdehydic modifications in cardiac mitochondria yielded nine AAS modification sites which were unambiguously assigned to distinct lysine residues in the following proteins: ADP/ATP translocase isoforms 1 and 2, ubiquinol cytochrome-c reductase core protein 2, and ATP synthase α-subunit. PMID:20957471

  4. Comprehensive identification of novel proteins and N-glycosylation sites in royal jelly

    PubMed Central

    2014-01-01

    Background: Royal jelly (RJ) is a proteinaceous secretion produced from the hypopharyngeal and mandibular glands of nurse bees. It plays vital roles in honeybee biology and in the improvement of human health. However, some proteins remain unknown in RJ, and mapping N-glycosylation modification sites on RJ proteins demands further investigation. We used two different liquid chromatography-tandem mass spectrometry techniques, complementary N-glycopeptide enrichment strategies, and bioinformatic approaches to gain a better understanding of novel and glycosylated proteins in RJ. Results: A total of 25 N-glycosylated proteins, carrying 53 N-glycosylation sites, were identified in RJ proteins, of which 42 N-linked glycosylation sites were mapped as novel on RJ proteins. Most of the glycosylated proteins were related to metabolic activities and health improvement. The 13 newly identified proteins were also mainly associated with metabolic processes and health improvement activities. Conclusion: Our in-depth, large-scale mapping of novel glycosylation sites represents a crucial step toward systematically revealing the functionality of N-glycosylated RJ proteins, and is potentially useful for producing a protein with desirable pharmacokinetic and biological activity using a genetic engineering approach. The newly identified proteins significantly extend the proteome coverage of RJ. These findings contribute vital and new knowledge to our understanding of the innate biochemical nature of RJ at both the proteome and glycoproteome levels. PMID:24529077

  5. Sialome of a Generalist Lepidopteran Herbivore: Identification of Transcripts and Proteins from Helicoverpa armigera Labial Salivary Glands

    PubMed Central

    Celorio-Mancera, Maria de la Paz; Courtiade, Juliette; Muck, Alexander; Heckel, David G.; Musser, Richard O.; Vogel, Heiko

    2011-01-01

    Although the importance of insect saliva in insect-host plant interactions has been acknowledged, there is very limited information on the nature and complexity of the salivary proteome in lepidopteran herbivores. We inspected the labial salivary transcriptome and proteome of Helicoverpa armigera, an important polyphagous pest species. To identify the majority of the salivary proteins we have randomly sequenced 19,389 expressed sequence tags (ESTs) from a normalized cDNA library of salivary glands. In parallel, a non-cytosolic enriched protein fraction was obtained from labial salivary glands and subjected to two-dimensional gel electrophoresis (2-DE) and de novo peptide sequencing. This procedure allowed comparison of peptides and EST sequences and enabled us to identify 65 protein spots from the secreted labial saliva 2DE proteome. The mass spectrometry analysis revealed ecdysone, glucose oxidase, fructosidase, carboxyl/cholinesterase and an uncharacterized protein previously detected in H. armigera midgut proteome. Consistently, their corresponding transcripts are among the most abundant in our cDNA library. We did find redundancy of sequence identification of saliva-secreted proteins suggesting multiple isoforms. As expected, we found several enzymes responsible for digestion and plant offense. In addition, we identified non-digestive proteins such as an arginine kinase and abundant proteins of unknown function. This identification of secreted salivary gland proteins allows a more comprehensive understanding of insect feeding and poses new challenges for the elucidation of protein function. PMID:22046331

  6. Identification of two bvg-repressed surface proteins of Bordetella pertussis.

    PubMed Central

    Stenson, T H; Peppler, M S

    1995-01-01

    Bordetella pertussis, the etiological agent of whooping cough, has the ability to modulate its phenotype in response to environmental conditions by using the BvgAS sensory transduction system which is encoded by the vir locus (now known as bvg). The BvgAS system is part of a large family of two-component sensory transduction systems which are common to a number of pathogenic bacteria. Although much is known about the proteins which exist in the B. pertussis virulent (X-mode or phase I) phenotype, relatively little is known about the proteins produced in the avirulent (C-mode or phase III) phenotype. We used sodium dodecyl sulfate-polyacrylamide gel electrophoresis and isoelectric focusing techniques to demonstrate the existence of at least 22 vir-repressed molecules which are increased in the avirulent phenotype. In addition, a series of monoclonal antibodies which are specific for the surface of avirulent B. pertussis were developed. Using immunological and protein techniques, we characterized two of these antigens as surface-exposed proteins. One of these antigens is expressed only in B. pertussis but not in the related species B. parapertussis and B. bronchiseptica. The other antigen is also present in B. parapertussis and B. bronchiseptica but is expressed at lower levels which are not regulated by bvg. The identification and characterization of vir-repressed proteins (and the genes which encode and regulate them) may help elucidate a physiological role for modulation of this obligate human pathogen. PMID:7558280

  7. Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks.

    PubMed

    Wang, Jian; Xie, Dong; Lin, Hongfei; Yang, Zhihao; Zhang, Yijia

    2012-06-21

    Protein complexes are important in many biological processes, and various computational approaches have been developed to identify complexes from protein-protein interaction (PPI) networks. However, the high false-positive rate of PPIs makes identification challenging. A protein semantic similarity measure is proposed in this study, based on the ontology structure of Gene Ontology (GO) terms and GO annotations, to estimate the reliability of interactions in PPI networks. Interaction pairs with low GO semantic similarity are removed from the network as unreliable interactions. Then, a cluster-expanding algorithm is used to detect complexes with core-attachment structure on the filtered network. Our method is applied to three different yeast PPI networks. The effectiveness of our method is examined on two benchmark complex datasets. Experimental results show that our method performed better than other state-of-the-art approaches in most evaluation metrics. The method detects protein complexes from large-scale PPI networks by filtering on GO semantic similarity. Removing interactions with low GO similarity significantly improves the performance of complex identification. The expanding strategy is also effective in identifying attachment proteins of complexes.
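
    The filtering step described above can be sketched as follows. This is an illustration under stated assumptions only: Jaccard overlap of annotation sets is used as a simple stand-in for the study's ontology-structure-based similarity measure, and the threshold, protein names, and GO term labels are hypothetical.

```python
# Illustrative sketch: drop PPI edges whose endpoints have low annotation
# similarity. Jaccard overlap is an assumed stand-in for the GO semantic
# similarity measure; the 0.5 threshold is also an assumption.

def jaccard(a: set, b: set) -> float:
    """Overlap of two annotation sets, 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def filter_interactions(edges, annotations, threshold: float = 0.5):
    """Keep only interactions whose proteins reach the similarity threshold."""
    kept = []
    for p, q in edges:
        similarity = jaccard(annotations.get(p, set()), annotations.get(q, set()))
        if similarity >= threshold:
            kept.append((p, q))
    return kept


# Hypothetical proteins and placeholder GO term labels.
annotations = {
    "P1": {"GO:termA", "GO:termB"},
    "P2": {"GO:termA", "GO:termB", "GO:termC"},
    "P3": {"GO:termZ"},
}
edges = [("P1", "P2"), ("P1", "P3"), ("P2", "P3")]
reliable_edges = filter_interactions(edges, annotations)
```

    A complex-detection algorithm would then run on the retained edges rather than on the full network.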

  8. Microbial Protein-Antigenome Determination (MAD) Technology: A Proteomics-Based Strategy for Rapid Identification of Microbial Targets of Host Humoral Immune Responses

    USDA-ARS's Scientific Manuscript database

    Immunogenic, pathogen-specific proteins have excellent potential for development of novel management modalities. Here, we describe an innovative application of proteomics called Microbial protein-Antigenome Determination (MAD) Technology for rapid identification of native microbial proteins that el...

  9. Microbial Protein-Antigenome Determination (MAD) Technology: A Proteomics-Based Strategy for Rapid Identification of Microbial Targets of Host Humoral Immune Responses

    USDA-ARS's Scientific Manuscript database

    Immunogenic, pathogen-specific proteins have excellent potential for development of novel management modalities. Here, we describe an innovative application of proteomics called Microbial protein-Antigenome Determination (MAD) Technology for rapid identification of native microbial proteins that eli...

  10. Evaluating an Art-Based Intervention to Improve Practicing Nurses' Observation, Description, and Problem Identification Skills.

    PubMed

    Nease, Beth M; Haney, Tina S

    Astute observation, description, and problem identification skills provide the underpinning for nursing assessment, surveillance, and prevention of failure to rescue events. Art-based education has been effective in nursing schools for improving observation, description, and problem identification. The authors describe a randomized controlled pilot study testing the effectiveness of an art-based educational intervention aimed at improving these skills in practicing nurses.

  11. New molecular markers and cytogenetic probes enable chromosome identification of wheat-Thinopyrum intermedium introgression lines for improving protein and gluten contents.

    PubMed

    Li, Guangrong; Wang, Hongjin; Lang, Tao; Li, Jianbo; La, Shixiao; Yang, Ennian; Yang, Zujun

    2016-10-01

    New molecular markers were developed for targeting the Thinopyrum intermedium 1St#2 chromosome, and a novel FISH probe representing the terminal repeats was produced for identification of Thinopyrum chromosomes. Thinopyrum intermedium has been used as a valuable resource for improving the disease resistance and yield potential of wheat. A wheat-Th. intermedium ssp. trichophorum chromosome 1St#2 substitution and translocation has displayed superior grain protein and wet gluten content. With the aim of developing a number of chromosome 1St#2 specific molecular and cytogenetic markers, a high-throughput, low-cost specific-locus amplified fragment sequencing (SLAF-seq) technology was used to compare the sequences between a wheat-Thinopyrum 1St#2 (1D) substitution and the related species Pseudoroegneria spicata (St genome, 2n = 14). A total of 5142 polymorphic fragments were analyzed and 359 different SLAF markers for 1St#2 were predicted. Thirty-seven specific molecular markers were validated by PCR from 50 randomly selected SLAFs. Meanwhile, the distribution of transposable elements (TEs) at the family level between wheat and St genomes was compared using the SLAFs. A new oligonucleotide probe named Oligo-pSt122 from high SLAF reads was produced for fluorescence in situ hybridization (FISH), and was observed to hybridize to the terminal region of 1St#L and also onto the terminal heterochromatic region of Th. intermedium genomes. The genome-wide markers and the repeat-based probe Oligo-pSt122 will be valuable for identifying Thinopyrum chromosome segments in wheat backgrounds.

  12. Evaluation of protein spectra cluster analysis for Streptococcus spp. identification from various swine clinical samples.

    PubMed

    Matajira, Carlos E C; Moreno, Luisa Z; Gomes, Vasco T M; Silva, Ana Paula S; Mesquita, Renan E; Doto, Daniela S; Calderaro, Franco F; de Souza, Fernando N; Christ, Ana Paula G; Sato, Maria Inês Z; Moreno, Andrea M

    2017-03-01

    Traditional microbiological methods enable genus-level identification of Streptococcus spp. isolates. However, as the species of this genus show broad phenotypic variation, species-level identification or even differentiation within the genus is difficult. Herein we report the evaluation of protein spectra cluster analysis for the identification of Streptococcus species associated with disease in swine by means of matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS). A total of 250 S. suis-like isolates obtained from pigs with clinical signs of encephalitis, arthritis, pneumonia, metritis, and urinary or septicemic infection were studied. The isolates came from pigs in different Brazilian states from 2001 to 2014. The MALDI-TOF MS analysis identified 86% (215 of 250) as S. suis and 14% (35 of 250) as S. alactolyticus, S. dysgalactiae, S. gallinaceus, S. gallolyticus, S. gordonii, S. henryi, S. hyointestinalis, S. hyovaginalis, S. mitis, S. oralis, S. pluranimalium, and S. sanguinis. The MALDI-TOF MS identification was confirmed in 99.2% of the isolates by 16S rDNA sequencing, with MALDI-TOF MS misidentifying 2 S. pluranimalium as S. hyovaginalis. Isolates were also tested by a biochemical automated system that correctly identified all isolates of 8 of the 10 species in the database. Neither the isolates of the 3 species not in the database (S. gallinaceus, S. henryi, and S. hyovaginalis) nor the isolates of 2 species that were in the database (S. oralis and S. pluranimalium) could be identified. The topology of the protein spectra cluster analysis appears to sustain the species phylogenetic similarities, further supporting identification by MALDI-TOF MS examination as a rapid and accurate alternative to 16S rDNA sequencing.

  13. Identification of Plant Ice-binding Proteins Through Assessment of Ice-recrystallization Inhibition and Isolation Using Ice-affinity Purification.

    PubMed

    Bredow, Melissa; Tomalty, Heather E; Walker, Virginia K

    2017-05-05

    Ice-binding proteins (IBPs) belong to a family of stress-induced proteins that are synthesized by certain organisms exposed to subzero temperatures. In plants, freeze damage occurs when extracellular ice crystals grow, resulting in the rupture of plasma membranes and possible cell death. Adsorption of IBPs to ice crystals restricts further growth by a process known as ice-recrystallization inhibition (IRI), thereby reducing cellular damage. IBPs also demonstrate the ability to depress the freezing point of a solution below the equilibrium melting point, a property known as thermal hysteresis (TH) activity. These protective properties have raised interest in the identification of novel IBPs due to their potential use in industrial, medical and agricultural applications. This paper describes the identification of plant IBPs through 1) the induction and extraction of IBPs in plant tissue, 2) the screening of extracts for IRI activity, and 3) the isolation and purification of IBPs. Following the induction of IBPs by low temperature exposure, extracts are tested for IRI activity using a 'splat assay', which allows the observation of ice crystal growth using a standard light microscope. This assay requires a low protein concentration and generates results that are quickly obtained and easily interpreted, providing an initial screen for ice binding activity. IBPs can then be isolated from contaminating proteins by utilizing the property of IBPs to adsorb to ice, through a technique called 'ice-affinity purification'. Using cell lysates collected from plant extracts, an ice hemisphere can be slowly grown on a brass probe. This incorporates IBPs into the crystalline structure of the polycrystalline ice. Requiring no a priori biochemical or structural knowledge of the IBP, this method allows for recovery of active protein. Ice-purified protein fractions can be used for downstream applications including the identification of peptide sequences by mass spectrometry and the

  14. Template-based structure modeling of protein-protein interactions

    PubMed Central

    Szilagyi, Andras; Zhang, Yang

    2014-01-01

    The structure of protein-protein complexes can be constructed by using the known structure of other protein complexes as a template. The complex structure templates are generally detected either by homology-based sequence alignments or, given the structure of monomer components, by structure-based comparisons. Critical improvements have been made in recent years by utilizing interface recognition and by recombining monomer and complex template libraries. Encouraging progress has also been witnessed in genome-wide applications of template-based modeling, with modeling accuracy comparable to high-throughput experimental data. Nevertheless, bottlenecks exist due to the incompleteness of the protein-protein complex structure library and the lack of methods for distant homologous template identification and full-length complex structure refinement. PMID:24721449

  15. Identification of proteins with the CDw75 epitope in human colorectal cancer

    PubMed Central

    Mariño-Crespo, Óscar; Fernández-Briera, Almudena; Gil-Martín, Emilio

    2018-01-01

    The CDw75 epitope is an α(2,6) sialylated antigen overexpressed in colorectal cancer (CRC), where its expression correlates with the progression of the disease. The CDw75 epitope is located mainly in N-glycoproteins, whose identity remains unknown. The aim of the present study was to identify proteins with the CDw75 epitope as a strategy to deepen the understanding of molecular pathogenesis of CRC and to identify novel biomarkers for this disease. For this purpose, a two-dimensional electrophoresis approach was employed. Protein spots in the gels were matched to the corresponding CDw75 positive spots in the immunoblotted polyvinylidene difluoride membranes, and further identification of the protein species was performed by mass spectrometry. Additionally, one-dimensional western blotting experiments were performed to verify the expression of these candidate proteins in the colorectal tissue and their coincidence in molecular mass with the CDw75-positive bands. The findings of the present study indicate that haptoglobin and the keratins 8 (K8) and 18 (K18) are proteins with the CDw75 epitope in the colorectal tissue from CRC patients and also suggest novel functions and cellular locations for these proteins in the colorectal tissue and in relation to CRC. PMID:29391890

  16. Binomial probability distribution model-based protein identification algorithm for tandem mass spectrometry utilizing peak intensity information.

    PubMed

    Xiao, Chuan-Le; Chen, Xiao-Zhou; Du, Yang-Li; Sun, Xuesong; Zhang, Gong; He, Qing-Yu

    2013-01-04

    Mass spectrometry has become one of the most important technologies in proteomic analysis. Tandem mass spectrometry (LC-MS/MS) is a major tool for the analysis of peptide mixtures from protein samples. The key step of MS data processing is the identification of peptides from experimental spectra by searching public sequence databases. Although a number of algorithms to identify peptides from MS/MS data have been already proposed, e.g. Sequest, OMSSA, X!Tandem, Mascot, etc., they are mainly based on statistical models considering only peak-matches between experimental and theoretical spectra, but not peak intensity information. Moreover, different algorithms gave different results from the same MS data, implying their probable incompleteness and questionable reproducibility. We developed a novel peptide identification algorithm, ProVerB, based on a binomial probability distribution model of protein tandem mass spectrometry combined with a new scoring function, making full use of peak intensity information and, thus, enhancing the ability of identification. Compared with Mascot, Sequest, and SQID, ProVerB identified significantly more peptides from LC-MS/MS data sets than the current algorithms at 1% False Discovery Rate (FDR) and provided more confident peptide identifications. ProVerB is also compatible with various platforms and experimental data sets, showing its robustness and versatility. The open-source program ProVerB is available at http://bioinformatics.jnu.edu.cn/software/proverb/ .
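
    The idea of scoring a peptide-spectrum match with a binomial model can be illustrated with a minimal sketch. It is not ProVerB's scoring function: it treats each theoretical fragment as an independent trial that matches a random peak with probability p_random, and it leaves out the peak intensity information that the abstract says ProVerB also uses. The function names and the value of p_random are assumptions made for illustration.

```python
from math import comb, log10


def binomial_tail(n: int, k: int, p: float) -> float:
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def match_score(n_theoretical: int, n_matched: int, p_random: float) -> float:
    """Score a match as -log10 of the chance of seeing at least
    n_matched of n_theoretical fragments matched purely at random.

    Hypothetical scoring rule for illustration only; peak intensities
    are ignored here.
    """
    return -log10(binomial_tail(n_theoretical, n_matched, p_random))


if __name__ == "__main__":
    # More matched fragments make a random explanation less likely,
    # so the score rises.
    for matched in (3, 6, 10):
        print(matched, round(match_score(20, matched, 0.05), 2))
```

    A higher score means the observed number of matches is harder to explain by chance, which is the general intuition behind probability-based peptide scores.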

  17. Identification of ubiquitin/ubiquitin-like protein modification from tandem mass spectra with various PTMs

    PubMed Central

    2011-01-01

    Background Various solutions have been introduced for the identification of post-translational modifications (PTMs) from tandem mass spectrometry (MS/MS) in the proteomics field, but the identification of peptide modifiers, such as ubiquitin (Ub) and ubiquitin-like proteins (Ubls), is still a challenge. The fragmentation of a peptide modifier produces complex shifted ion mass patterns in combination with other PTMs, which makes it difficult to identify and locate the PTMs on a protein sequence. Currently, most PTM identification methods either do not consider the complex fragmentation of peptide modifiers or deal with it separately from the other PTMs. Results We developed an advanced PTM identification method that inspects possible ion patterns of most known peptide modifiers as well as other known biological and chemical PTMs to reach more comprehensive and accurate conclusions. The proposed method searches all detectable mass differences of measured peaks from their theoretical values, and the mass differences within the mass tolerance range are grouped as mass shift classes. The most probable locations of multiple PTMs, including peptide modifiers, can be determined by evaluating all possible scenarios generated by the combination of the qualified mass shift classes. The proposed method showed excellent performance in the test with simulated spectra having various PTMs including peptide modifiers and in the comparison with recently developed methods such as QuickMod and SUMmOn. In the analysis of HUPO Brain Proteome Project (BPP) datasets, the proposed method could find ubiquitin modification sites that were not identified by other conventional methods. Conclusions This work presents a novel method for identifying, from tandem mass spectra, both peptide modifiers that generate complex fragmentation patterns and PTMs that are not fragmented during the fragmentation process. PMID:22373085
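
    The grouping of mass differences into mass shift classes can be sketched as follows. This is a simplified illustration, not the authors' algorithm: it takes all pairwise differences between measured and theoretical peak masses, sorts them, and starts a new class whenever a difference lies more than a tolerance away from the first member of the current class. The function name, the tolerance, and the toy masses are assumptions.

```python
def group_mass_shifts(observed, theoretical, tol=0.5):
    """Group (observed - theoretical) mass differences into classes.

    Hypothetical helper: a difference joins the current class when it
    is within `tol` of that class's smallest member, otherwise it
    opens a new class.
    """
    diffs = sorted(o - t for o in observed for t in theoretical)
    classes = []
    for d in diffs:
        if classes and d - classes[-1][0] <= tol:
            classes[-1].append(d)
        else:
            classes.append([d])
    return classes


if __name__ == "__main__":
    # Toy masses, not real fragment ions: both observed peaks sit about
    # 14 units above a theoretical peak, so that shift is seen twice.
    classes = group_mass_shifts([114.1, 214.2], [100.0, 200.0])
    for c in classes:
        print(len(c), [round(d, 1) for d in c])
```

    A class supported by many peak pairs is a candidate mass shift; the method in the abstract then evaluates combinations of such classes to place modifications on the sequence, a step this sketch does not attempt.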

  18. Three new sensitive and specific heat-shock protein 70 PCRs for global Leishmania species identification.

    PubMed

    Montalvo, A M; Fraga, J; Maes, I; Dujardin, J-C; Van der Auwera, G

    2012-07-01

    The heat-shock protein 70 gene (hsp70) has been exploited for Leishmania species identification in the New and Old World, using PCR followed by restriction fragment length polymorphism (RFLP) analysis. Current PCR presents limitations in terms of sensitivity, which hampers its use for analyzing clinical and biological samples, and specificity, which makes it inappropriate to discriminate between Leishmania and other trypanosomatids. The aim of the study was to improve the sensitivity and specificity of a previously reported hsp70 PCR using alternative PCR primers and RFLPs. Following in silico analysis of available sequences, three new PCR primer sets and restriction digest schemes were tested on a globally representative panel of 114 Leishmania strains, various other infectious agents, and clinical samples. The largest new PCR fragment retained the discriminatory power from RFLP, while two smaller fragments discriminated fewer species. The detection limit of the new PCRs was between 0.05 and 0.5 parasite genomes, they amplified clinical samples more efficiently, and were Leishmania specific. We succeeded in significantly improving the specificity and sensitivity of the PCRs for hsp70 Leishmania species typing. The improved PCR-RFLP assays can impact diagnosis, treatment, and epidemiological studies of leishmaniasis in any setting worldwide.

  19. An approach to large scale identification of non-obvious structural similarities between proteins

    PubMed Central

    Cherkasov, Artem; Jones, Steven JM

    2004-01-01

    Background A new sequence independent bioinformatics approach allowing genome-wide search for proteins with similar three dimensional structures has been developed. By utilizing the numerical output of the sequence threading it establishes putative non-obvious structural similarities between proteins. When applied to the testing set of proteins with known three dimensional structures the developed approach was able to recognize structurally similar proteins with high accuracy. Results The method has been developed to identify pathogenic proteins with low sequence identity and high structural similarity to host analogues. Such protein structure relationships would be hypothesized to arise through convergent evolution or through ancient horizontal gene transfer events, now undetectable using current sequence alignment techniques. The pathogen proteins, which could mimic or interfere with host activities, would represent candidate virulence factors. The developed approach utilizes the numerical outputs from the sequence-structure threading. It identifies the potential structural similarity between a pair of proteins by correlating the threading scores of the corresponding two primary sequences against the library of the standard folds. This approach allowed up to 64% sensitivity and 99.9% specificity in distinguishing protein pairs with high structural similarity. Conclusion Preliminary results obtained by comparison of the genomes of Homo sapiens and several strains of Chlamydia trachomatis have demonstrated the potential usefulness of the method in the identification of bacterial proteins with known or potential roles in virulence. PMID:15147578
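
    The core comparison described here, correlating two sequences' threading scores across a library of standard folds, can be sketched as below. The abstract does not say which correlation measure was used; Pearson correlation is assumed, and the score vectors are invented toy values.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length score vectors."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two vectors of equal length >= 2")
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        raise ValueError("a constant vector has no defined correlation")
    return cov / sqrt(vx * vy)


if __name__ == "__main__":
    # Hypothetical threading scores of two proteins against five folds.
    protein_a = [12.1, 3.4, 8.8, 1.0, 5.5]
    protein_b = [11.0, 2.9, 9.5, 1.2, 4.8]
    print(round(pearson(protein_a, protein_b), 3))
```

    Under this reading, a pair whose score profiles correlate strongly would be flagged as structurally similar even when their sequences share little identity.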

  20. Identification of proteins from Mycobacterium tuberculosis missing in attenuated Mycobacterium bovis BCG strains.

    PubMed

    Mattow, J; Jungblut, P R; Schaible, U E; Mollenkopf, H J; Lamer, S; Zimny-Arndt, U; Hagens, K; Müller, E C; Kaufmann, S H

    2001-08-01

    A proteome approach, combining high-resolution two-dimensional electrophoresis (2-DE) with mass spectrometry, was used to compare the cellular protein composition of two virulent strains of Mycobacterium tuberculosis with two attenuated strains of Mycobacterium bovis Bacillus Calmette-Guerin (BCG), in order to identify unique proteins of these strains. Emphasis was given to the identification of M. tuberculosis specific proteins, because we consider these proteins to represent putative virulence factors and interesting candidates for vaccination and diagnosis of tuberculosis. The genome of M. tuberculosis strain H37Rv comprises nearly 4000 predicted open reading frames. In contrast, the separation of proteins from whole mycobacterial cells by 2-DE resulted in silver-stained patterns comprising about 1800 distinct protein spots. Amongst these, 96 spots were exclusively detected either in the virulent (56 spots) or in the attenuated (40 spots) mycobacterial strains. Fifty-three of these spots were analyzed by mass spectrometry, of which 41 were identified, including 32 M. tuberculosis specific spots. Twelve M. tuberculosis specific spots were identified as proteins, encoded by genes previously reported to be deleted in M. bovis BCG. The remaining 20 spots unique for M. tuberculosis were identified as proteins encoded by genes that are not known to be missing in M. bovis BCG.

  1. Rapid identification of fluorochrome modification sites in proteins by LC ESI-Q-TOF mass spectrometry.

    PubMed

    Manikwar, Prakash; Zimmerman, Tahl; Blanco, Francisco J; Williams, Todd D; Siahaan, Teruna J

    2011-07-20

    Conjugation of either a fluorescent dye or a drug molecule to the ε-amino groups of lysine residues of proteins has many applications in biology and medicine. However, this type of conjugation produces a heterogeneous population of protein conjugates. Because conjugation of fluorochrome or drug molecule to a protein may have deleterious effects on protein function, the identification of conjugation sites is necessary. Unfortunately, the identification process can be time-consuming and laborious; therefore, there is a need to develop a rapid and reliable way to determine the conjugation sites of the fluorescent label or drug molecule. In this study, the sites of conjugation of fluorescein-5'-isothiocyanate and rhodamine-B-isothiocyanate to free amino groups on the insert-domain (I-domain) protein derived from the α-subunit of lymphocyte function-associated antigen-1 (LFA-1) were determined by electrospray ionization quadrupole time-of-flight mass spectrometry (ESI-Q-TOF MS) along with peptide mapping using trypsin digestion. A reporter fragment of the fluorochrome moiety that is generated in the collision cell of the Q-TOF without explicit MS/MS precursor selection was used to identify the conjugation site. Selected ion plots of the reporter ion readily mark modified peptides in chromatograms of the complex digest. Interrogation of these spectra reveals a neutral loss/precursor pair that identifies the modified peptide. The results show that one to seven fluorescein molecules or one to four rhodamine molecules were attached to the lysine residue(s) of the I-domain protein. No modifications were found in the metal ion-dependent adhesion site (MIDAS), which is an important binding region of the I-domain.

  2. Top-Down and Bottom-Up Identification of Proteins by Liquid Extraction Surface Analysis Mass Spectrometry of Healthy and Diseased Human Liver Tissue

    NASA Astrophysics Data System (ADS)

    Sarsby, Joscelyn; Martin, Nicholas J.; Lalor, Patricia F.; Bunch, Josephine; Cooper, Helen J.

    2014-09-01

    Liquid extraction surface analysis mass spectrometry (LESA MS) has the potential to become a useful tool in the spatially-resolved profiling of proteins in substrates. Here, the approach has been applied to the analysis of thin tissue sections from human liver. The aim was to determine whether LESA MS was a suitable approach for the detection of protein biomarkers of nonalcoholic liver disease (nonalcoholic steatohepatitis, NASH), with a view to the eventual development of LESA MS for imaging NASH pathology. Two approaches were considered. In the first, endogenous proteins were extracted from liver tissue sections by LESA, subjected to automated trypsin digestion, and the resulting peptide mixture was analyzed by liquid chromatography tandem mass spectrometry (LC-MS/MS) (bottom-up approach). In the second (top-down approach), endogenous proteins were extracted by LESA, and analyzed intact. Selected protein ions were subjected to collision-induced dissociation (CID) and/or electron transfer dissociation (ETD) mass spectrometry. The bottom-up approach resulted in the identification of over 500 proteins; however identification of key protein biomarkers, liver fatty acid binding protein (FABP1), and its variant (Thr→Ala, position 94), was unreliable and irreproducible. Top-down LESA MS analysis of healthy and diseased liver tissue revealed peaks corresponding to multiple (~15-25) proteins. MS/MS of four of these proteins identified them as FABP1, its variant, α-hemoglobin, and 10 kDa heat shock protein. The reliable identification of FABP1 and its variant by top-down LESA MS suggests that the approach may be suitable for imaging NASH pathology in sections from liver biopsies.

  3. Top-down and bottom-up identification of proteins by liquid extraction surface analysis mass spectrometry of healthy and diseased human liver tissue.

    PubMed

    Sarsby, Joscelyn; Martin, Nicholas J; Lalor, Patricia F; Bunch, Josephine; Cooper, Helen J

    2014-11-01

    Liquid extraction surface analysis mass spectrometry (LESA MS) has the potential to become a useful tool in the spatially-resolved profiling of proteins in substrates. Here, the approach has been applied to the analysis of thin tissue sections from human liver. The aim was to determine whether LESA MS was a suitable approach for the detection of protein biomarkers of nonalcoholic liver disease (nonalcoholic steatohepatitis, NASH), with a view to the eventual development of LESA MS for imaging NASH pathology. Two approaches were considered. In the first, endogenous proteins were extracted from liver tissue sections by LESA, subjected to automated trypsin digestion, and the resulting peptide mixture was analyzed by liquid chromatography tandem mass spectrometry (LC-MS/MS) (bottom-up approach). In the second (top-down approach), endogenous proteins were extracted by LESA, and analyzed intact. Selected protein ions were subjected to collision-induced dissociation (CID) and/or electron transfer dissociation (ETD) mass spectrometry. The bottom-up approach resulted in the identification of over 500 proteins; however identification of key protein biomarkers, liver fatty acid binding protein (FABP1), and its variant (Thr→Ala, position 94), was unreliable and irreproducible. Top-down LESA MS analysis of healthy and diseased liver tissue revealed peaks corresponding to multiple (~15-25) proteins. MS/MS of four of these proteins identified them as FABP1, its variant, α-hemoglobin, and 10 kDa heat shock protein. The reliable identification of FABP1 and its variant by top-down LESA MS suggests that the approach may be suitable for imaging NASH pathology in sections from liver biopsies.

  4. Mass spectrometry compatible surfactant for optimized in-gel protein digestion.

    PubMed

    Saveliev, Sergei V; Woodroofe, Carolyn C; Sabat, Grzegorz; Adams, Christopher M; Klaubert, Dieter; Wood, Keith; Urh, Marjeta

    2013-01-15

    Identification of proteins resolved by SDS-PAGE depends on robust in-gel protein digestion and efficient peptide extraction, requirements that are often difficult to achieve. A lengthy and laborious procedure is an additional challenge of protein identification in gel. We show here that with the use of the mass spectrometry compatible surfactant sodium 3-((1-(furan-2-yl)undecyloxy)carbonylamino)propane-1-sulfonate, the challenges of in-gel protein digestion are effectively addressed. Peptide quantitation based on stable isotope labeling showed that the surfactant induced 1.5-2 fold increase in peptide recovery. Consequently, protein sequence coverage was increased by 20-30%, on average, and the number of identified proteins saw a substantial boost. The surfactant also accelerated the digestion process. Maximal in-gel digestion was achieved in as little as one hour, depending on incubation temperature, and peptides were readily recovered from gel eliminating the need for postdigestion extraction. This study shows that the surfactant provides an efficient means of improving protein identification in gel and streamlining the in-gel digestion procedure requiring no extra handling steps or special equipment.

  5. Pepitome: evaluating improved spectral library search for identification complementarity and quality assessment

    PubMed Central

    Dasari, Surendra; Chambers, Matthew C.; Martinez, Misti A.; Carpenter, Kristin L.; Ham, Amy-Joan L.; Vega-Montoto, Lorenzo J.; Tabb, David L.

    2012-01-01

    Spectral libraries have emerged as a viable alternative to protein sequence databases for peptide identification. These libraries contain previously detected peptide sequences and their corresponding tandem mass spectra (MS/MS). Search engines can then identify peptides by comparing experimental MS/MS scans to those in the library. Many of these algorithms employ the dot product score for measuring the quality of a spectrum-spectrum match (SSM). This scoring system does not offer a clear statistical interpretation and ignores fragment ion m/z discrepancies in the scoring. We developed a new spectral library search engine, Pepitome, which employs statistical systems for scoring SSMs. Pepitome outperformed the leading library search tool, SpectraST, when analyzing data sets acquired on three different mass spectrometry platforms. We characterized the reliability of spectral library searches by confirming shotgun proteomics identifications through RNA-Seq data. Applying spectral library and database searches on the same sample revealed their complementary nature. Pepitome identifications enabled the automation of quality analysis and quality control (QA/QC) for shotgun proteomics data acquisition pipelines. PMID:22217208
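
    The dot product score that the abstract criticizes can be illustrated with a minimal sketch. This is a generic normalized dot product over binned spectra, not Pepitome's or SpectraST's implementation; the bin width and the toy peak lists are assumptions.

```python
from collections import defaultdict
from math import sqrt


def bin_spectrum(peaks, bin_width=1.0):
    """Sum intensities of (m/z, intensity) peaks into fixed-width bins."""
    binned = defaultdict(float)
    for mz, intensity in peaks:
        binned[int(mz // bin_width)] += intensity
    return binned


def dot_product_score(spec_a, spec_b, bin_width=1.0):
    """Normalized dot product (cosine similarity) of two binned spectra."""
    va = bin_spectrum(spec_a, bin_width)
    vb = bin_spectrum(spec_b, bin_width)
    dot = sum(va[k] * vb[k] for k in va.keys() & vb.keys())
    norm_a = sqrt(sum(v * v for v in va.values()))
    norm_b = sqrt(sum(v * v for v in vb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    library = [(100.2, 10.0), (200.7, 5.0)]
    # Same bins as the library spectrum, but each peak is shifted in m/z.
    shifted = [(100.9, 10.0), (200.1, 5.0)]
    print(round(dot_product_score(library, shifted), 3))
```

    Because peaks are compared only after binning, the shifted spectrum scores the same as an exact copy, which shows how this kind of score ignores fragment ion m/z discrepancies within a bin.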

  6. "Plasmo2D": an ancillary proteomic tool to aid identification of proteins from Plasmodium falciparum.

    PubMed

    Khachane, Amit; Kumar, Ranjit; Jain, Sanyam; Jain, Samta; Banumathy, Gowrishankar; Singh, Varsha; Nagpal, Saurabh; Tatu, Utpal

    2005-01-01

    Bioinformatics tools to aid gene and protein sequence analysis have become an integral part of biology in the post-genomic era. Release of the Plasmodium falciparum genome sequence has allowed biologists to define the gene and the predicted protein content as well as their sequences in the parasite. Using pI and molecular weight as characteristics unique to each protein, we have developed a bioinformatics tool to aid identification of proteins from Plasmodium falciparum. The tool makes use of a Virtual 2-DE generated by plotting all of the proteins from the Plasmodium database on a pI versus molecular weight scale. Proteins are identified by comparing the position of migration of desired protein spots from an experimental 2-DE and that on a virtual 2-DE. The procedure has been automated in the form of user-friendly software called "Plasmo2D". The tool can be downloaded from http://144.16.89.25/Plasmo2D.zip.
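
    The virtual 2-DE lookup can be sketched as a search for database proteins whose predicted pI and molecular weight fall near an experimental spot. This is an illustration only and does not reproduce Plasmo2D; the tolerances, the function name, and the protein entries are hypothetical.

```python
def match_spot(spot_pi, spot_mw, proteins, pi_tol=0.3, mw_tol_frac=0.1):
    """Return names of proteins close to a gel spot on the pI/MW plane.

    proteins: iterable of (name, predicted_pI, predicted_MW_in_Da).
    A protein is a candidate when its pI is within pi_tol units and its
    MW is within mw_tol_frac (as a fraction) of the spot's MW.
    """
    hits = []
    for name, pi, mw in proteins:
        if abs(pi - spot_pi) <= pi_tol and abs(mw - spot_mw) <= mw_tol_frac * spot_mw:
            hits.append(name)
    return hits


if __name__ == "__main__":
    # Hypothetical entries standing in for a predicted proteome.
    virtual_gel = [
        ("protA", 5.2, 45000.0),
        ("protB", 8.9, 45000.0),
        ("protC", 5.3, 90000.0),
    ]
    print(match_spot(5.1, 46000.0, virtual_gel))
```

    Experimental pI and mass estimates from a gel are approximate, so a lookup like this is best read as producing a shortlist of candidates rather than a single identification.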

  7. Identification of structural proteins of koi herpesvirus.

    PubMed

    Fuchs, Walter; Granzow, Harald; Dauber, Malte; Fichtner, Dieter; Mettenleiter, Thomas C

    2014-12-01

    As a prerequisite for development of improved vaccines and diagnostic tools for control of the fish pathogen koi herpesvirus, or cyprinid herpesvirus 3 (CyHV-3), we have started to identify putative viral envelope and capsid proteins. The complete or partial CyHV-3 open reading frames ORF25, ORF65, ORF92, ORF99, ORF136, ORF138, ORF146, ORF148, and ORF149 were expressed as bacterial fusion proteins, which were then used for preparation of monospecific rabbit antisera. All of the sera that were obtained detected their target proteins in cells transfected with the corresponding eukaryotic expression plasmids. However, only the type I membrane proteins pORF25, pORF65, pORF99, pORF136 and pORF149 and the major capsid protein pORF92 were sufficiently abundant and immunogenic to permit unambiguous detection in CyHV-3-infected cells. In indirect immunofluorescence tests (IIFT), sera from naturally or experimentally CyHV-3-infected carp and koi predominantly reacted with cells transfected with expression plasmids encoding pORF25, pORF65, pORF148, and pORF149, which represent a family of related CyHV-3 membrane proteins. Moreover, several neutralizing monoclonal antibodies raised against CyHV-3 virions proved to be specific for pORF149 in IIFT of transfected cells and in immunoelectron microscopic analysis of CyHV-3 particles. Since pORF149 appears to be an immunorelevant envelope protein of CyHV-3, a recombinant baculovirus was generated for its expression in insect cells, and pORF149 was shown to be incorporated into pseudotyped baculovirus particles, which might be suitable as diagnostic tools or subunit vaccines.

  8. Combinatorial Labeling Method for Improving Peptide Fragmentation in Mass Spectrometry

    NASA Astrophysics Data System (ADS)

    Kuchibhotla, Bhanuramanand; Kola, Sankara Rao; Medicherla, Jagannadham V.; Cherukuvada, Swamy V.; Dhople, Vishnu M.; Nalam, Madhusudhana Rao

    2017-06-01

    Annotation of peptide sequence from tandem mass spectra constitutes the central step of mass spectrometry-based proteomics. Peptide mass spectra are obtained upon gas-phase fragmentation. Identification of the protein from a set of experimental peptide spectral matches is usually referred to as protein inference. Occurrence and intensity of these fragment ions in the MS/MS spectra are dependent on many factors such as amino acid composition, peptide basicity, activation mode, protease, etc. Particularly, chemical derivatizations of peptides were known to alter their fragmentation. In this study, the influence of acetylation, guanidinylation, and their combination on peptide fragmentation was assessed initially on a lipase (LipA) from Bacillus subtilis followed by a bovine six protein mix digest. The dual modification resulted in improved fragment ion occurrence and intensity changes, and this resulted in the equivalent representation of b- and y-type fragment ions in an ion trap MS/MS spectrum. The improved representation has allowed us to accurately annotate the peptide sequences de novo. Dual labeling has significantly reduced the false positive protein identifications in standard bovine six peptide digest. Our study suggests that the combinatorial labeling of peptides is a useful method to validate protein identifications for high confidence protein inference.

  9. DiffSLC: A graph centrality method to detect essential proteins of a protein-protein interaction network.

    PubMed

    Mistry, Divya; Wise, Roger P; Dickerson, Julie A

    2017-01-01

    Identification of central genes and proteins in biomolecular networks provides credible candidates for pathway analysis, functional analysis, and essentiality prediction. The DiffSLC centrality measure predicts central and essential genes and proteins using a protein-protein interaction network. Network centrality measures prioritize nodes and edges based on their importance to the network topology. These measures helped identify critical genes and proteins in biomolecular networks. The proposed centrality measure, DiffSLC, combines the number of interactions of a protein and the gene coexpression values of genes from which those proteins were translated, as a weighting factor to bias the identification of essential proteins in a protein interaction network. Potentially essential proteins with low node degree are promoted through eigenvector centrality. Thus, the gene coexpression values are used in conjunction with the eigenvector of the network's adjacency matrix and edge clustering coefficient to improve essentiality prediction. The outcome of this prediction is shown using three variations: (1) inclusion or exclusion of gene co-expression data, (2) impact of different coexpression measures, and (3) impact of different gene expression data sets. For a total of seven networks, DiffSLC is compared to other centrality measures using Saccharomyces cerevisiae protein interaction networks and gene expression data. Comparisons are also performed for the top ranked proteins against the known essential genes from the Saccharomyces Gene Deletion Project, which show that DiffSLC detects more essential proteins and has a higher area under the ROC curve than other compared methods. This makes DiffSLC a stronger alternative to other centrality methods for detecting essential genes using a protein-protein interaction network that obeys the centrality-lethality principle. DiffSLC is implemented using the igraph package in R, and networkx package in Python. The Python package can be

  10. Multilevel biological characterization of exomic variants at the protein level significantly improves the identification of their deleterious effects.

    PubMed

    Raimondi, Daniele; Gazzo, Andrea M; Rooman, Marianne; Lenaerts, Tom; Vranken, Wim F

    2016-06-15

    There are now many predictors capable of identifying the likely phenotypic effects of single nucleotide variants (SNVs) or short in-frame Insertions or Deletions (INDELs) on the increasing amount of genome sequence data. Most of these predictors focus on SNVs and use a combination of features related to sequence conservation, biophysical, and/or structural properties to link the observed variant to either neutral or disease phenotype. Despite notable successes, the mapping between genetic variants and their phenotypic effects is riddled with levels of complexity that are not yet fully understood and that are often not taken into account in the predictions, despite their promise of significantly improving the prediction of deleterious mutants. We present DEOGEN, a novel variant effect predictor that can handle both missense SNVs and in-frame INDELs. By integrating information from different biological scales and mimicking the complex mixture of effects that lead from the variant to the phenotype, we obtain significant improvements in the variant-effect prediction results. Next to the typical variant-oriented features based on the evolutionary conservation of the mutated positions, we added a collection of protein-oriented features that are based on functional aspects of the gene affected. We cross-validated DEOGEN on 36 825 polymorphisms, 20 821 deleterious SNVs, and 1038 INDELs from SwissProt. The multilevel contextualization of each (variant, protein) pair in DEOGEN provides a 10% improvement of MCC with respect to current state-of-the-art tools. The software and the data presented here are publicly available at http://ibsquare.be/deogen. Contact: wvranken@vub.ac.be. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
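
    The abstract reports its gain as MCC, presumably the Matthews correlation coefficient commonly used to evaluate binary classifiers. A minimal sketch of that metric, computed from confusion-matrix counts, is given below; the counts in the example are invented and are not DEOGEN results.

```python
from math import sqrt


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient from confusion-matrix counts.

    Returns 0.0 when any marginal total is zero, a common convention
    for the otherwise undefined case.
    """
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


if __name__ == "__main__":
    # Invented counts for a deleterious-vs-neutral classifier.
    print(round(mcc(tp=80, tn=70, fp=30, fn=20), 3))
```

    MCC ranges from -1 to 1 and uses all four cells of the confusion matrix, which makes it a reasonable summary when the deleterious and neutral classes differ in size.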

  11. Study of cellular oncometabolism via multidimensional protein identification technology.

    PubMed

    Aukim-Hastie, Claire; Garbis, Spiros D

    2014-01-01

    Cellular proteomics is becoming a widespread clinical application, matching the definition of bench-to-bedside translation. Among various fields of investigation, this approach can be applied to the study of the metabolic alterations that accompany oncogenesis and tumor progression, which are globally referred to as oncometabolism. Here, we describe a multidimensional protein identification technology (MuDPIT)-based strategy that can be employed to study the cellular proteome of malignant cells and tissues. This method has previously been shown to be compatible with the reproducible, in-depth analysis of up to a thousand proteins in clinical samples. The possibility to employ this technique to study clinical specimens demonstrates its robustness. MuDPIT is advantageous as compared to other approaches because it is direct, highly sensitive, and reproducible, it provides high resolution with ultra-high mass accuracy, it allows for relative quantifications, and it is compatible with multiplexing (thus limiting costs). This method enables the direct assessment of the proteomic profile of neoplastic cells and tissues and could be employed in the near future as a high-throughput, rapid, quantitative, and cost-effective screening platform for clinical samples. © 2014 Elsevier Inc. All rights reserved.

  12. Isolation and Analysis of Keratins and Keratin-Associated Proteins from Hair and Wool.

    PubMed

    Deb-Choudhury, Santanu; Plowman, Jeffrey E; Harland, Duane P

    2016-01-01

    The presence of highly cross-linked protein networks in hair and wool makes them very difficult substrates for protein extraction, a prerequisite for further protein analysis and characterization. It is therefore imperative that these cross-links formed by disulfide bridges are first disrupted for the efficient extraction of proteins. Chaotropes such as urea are commonly used as efficient extractants. However, a combination of urea and thiourea not only improves recovery of proteins but also results in improved resolution of the keratins in 2DE gels. Reductants also play an important role in protein dissolution. Dithiothreitol effectively removes keratinous material from the cortex, whereas phosphines, like Tris(2-carboxyethyl)phosphine, remove material from the exocuticle. The relative extractability of the keratins and keratin-associated proteins is also dependent on the concentration of chaotropes, reductants, and pH, thus providing a means to preferentially extract these proteins. Ionic liquids such as 1-butyl-3-methylimidazolium chloride (BMIM(+)[Cl](-)) are known to solubilize wool by disrupting noncovalent interactions, specifically intermolecular hydrogen bonds. BMIM(+)[Cl](-) proved to be an effective extractant of wool proteins and complementary in nature to chaotropes such as urea and thiourea for identifying unique peptides of wool proteins using mass spectrometry (MS). Successful identification of proteins resolved by one- or two-dimensional electrophoresis and MS is highly dependent on the optimal recovery of its protease-digested peptides with an efficient removal of interfering substances. The detergent sodium deoxycholate used in conjunction with Empore™ disks improved identification of proteins by mass spectrometry leading to higher percentage sequence coverage, identification of unique peptides and higher score. Copyright © 2016 Elsevier Inc. All rights reserved.

  13. Using Avatars for Improving Speaker Identification in Captioning

    NASA Astrophysics Data System (ADS)

    Vy, Quoc V.; Fels, Deborah I.

    Captioning is the main method for accessing television and film content by people who are deaf or hard-of-hearing. One major difficulty consistently identified by the community is that of knowing who is speaking particularly for an off screen narrator. A captioning system was created using a participatory design method to improve speaker identification. The final prototype contained avatars and a coloured border for identifying specific speakers. Evaluation results were very positive; however participants also wanted to customize various components such as caption and avatar location.

  14. Demonstration of protein-based human identification using the hair shaft proteome [Protein-based human identification: A proof of concept using the hair shaft proteome]

    DOE PAGES

    Parker, Glendon J.; Leppert, Tami; Anex, Deon S.; ...

    2016-09-07

    Human identification from biological material is largely dependent on the ability to characterize genetic polymorphisms in DNA. Unfortunately, DNA can degrade in the environment, sometimes below the level at which it can be amplified by PCR. Protein however is chemically more robust than DNA and can persist for longer periods. Protein also contains genetic variation in the form of single amino acid polymorphisms. These can be used to infer the status of non-synonymous single nucleotide polymorphism alleles. To demonstrate this, we used mass spectrometry-based shotgun proteomics to characterize hair shaft proteins in 66 European-American subjects. A total of 596 single nucleotide polymorphism alleles were correctly imputed in 32 loci from 22 genes of subjects’ DNA and directly validated using Sanger sequencing. Estimates of the probability of resulting individual non-synonymous single nucleotide polymorphism allelic profiles in the European population, using the product rule, resulted in a maximum power of discrimination of 1 in 12,500. Imputed non-synonymous single nucleotide polymorphism profiles from European–American subjects were considerably less frequent in the African population (maximum likelihood ratio = 11,000). The converse was true for hair shafts collected from an additional 10 subjects with African ancestry, where some profiles were more frequent in the African population. Genetically variant peptides were also identified in hair shaft datasets from six archaeological skeletal remains (up to 260 years old). Furthermore, this study demonstrates that quantifiable measures of identity discrimination and biogeographic background can be obtained from detecting genetically variant peptides in hair shaft protein, including hair from bioarchaeological contexts.

  15. Demonstration of protein-based human identification using the hair shaft proteome [Protein-based human identification: A proof of concept using the hair shaft proteome]

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Parker, Glendon J.; Leppert, Tami; Anex, Deon S.

    Human identification from biological material is largely dependent on the ability to characterize genetic polymorphisms in DNA. Unfortunately, DNA can degrade in the environment, sometimes below the level at which it can be amplified by PCR. Protein however is chemically more robust than DNA and can persist for longer periods. Protein also contains genetic variation in the form of single amino acid polymorphisms. These can be used to infer the status of non-synonymous single nucleotide polymorphism alleles. To demonstrate this, we used mass spectrometry-based shotgun proteomics to characterize hair shaft proteins in 66 European-American subjects. A total of 596 single nucleotide polymorphism alleles were correctly imputed in 32 loci from 22 genes of subjects’ DNA and directly validated using Sanger sequencing. Estimates of the probability of resulting individual non-synonymous single nucleotide polymorphism allelic profiles in the European population, using the product rule, resulted in a maximum power of discrimination of 1 in 12,500. Imputed non-synonymous single nucleotide polymorphism profiles from European–American subjects were considerably less frequent in the African population (maximum likelihood ratio = 11,000). The converse was true for hair shafts collected from an additional 10 subjects with African ancestry, where some profiles were more frequent in the African population. Genetically variant peptides were also identified in hair shaft datasets from six archaeological skeletal remains (up to 260 years old). Furthermore, this study demonstrates that quantifiable measures of identity discrimination and biogeographic background can be obtained from detecting genetically variant peptides in hair shaft protein, including hair from bioarchaeological contexts.

  16. Intuitive, but not simple: including explicit water molecules in protein-protein docking simulations improves model quality.

    PubMed

    Parikh, Hardik I; Kellogg, Glen E

    2014-06-01

    Characterizing the nature of interaction between proteins that have not been experimentally cocrystallized requires a computational docking approach that can successfully predict the spatial conformation adopted in the complex. In this work, the Hydropathic INTeractions (HINT) force field model was used for scoring docked models in a data set of 30 high-resolution crystallographically characterized "dry" protein-protein complexes and was shown to reliably identify native-like models. However, most current protein-protein docking algorithms fail to explicitly account for water molecules involved in bridging interactions that mediate and stabilize the association of the protein partners, so we used HINT to illuminate the physical and chemical properties of bridging waters and account for their energetic stabilizing contributions. The HINT water Relevance metric identified the "truly" bridging waters at the 30 protein-protein interfaces and we utilized them in "solvated" docking by manually inserting them into the input files for the rigid body ZDOCK program. By accounting for these interfacial waters, a statistically significant improvement of ∼24% in the average hit-count within the top-10 predictions for the protein-protein dataset was seen, compared to standard "dry" docking. The results also show scoring improvement, with medium and high accuracy models ranking much better than incorrect ones. These improvements can be attributed to the physical presence of water molecules that alter surface properties and better represent native shape and hydropathic complementarity between interacting partners, with concomitantly more accurate native-like structure predictions. © 2013 Wiley Periodicals, Inc.

  17. A photometric mode identification method, including an improved non-adiabatic treatment of the atmosphere

    NASA Astrophysics Data System (ADS)

    Dupret, M.-A.; De Ridder, J.; De Cat, P.; Aerts, C.; Scuflaire, R.; Noels, A.; Thoul, A.

    2003-02-01

    We present an improved version of the method of photometric mode identification of Heynderickx et al. Our new version is based on the inclusion of precise non-adiabatic eigenfunctions determined in the outer stellar atmosphere according to the formalism recently proposed by Dupret et al. Our improved photometric mode identification technique is therefore no longer dependent on ad hoc parameters for the non-adiabatic effects. It contains the complete physical conditions of the outer atmosphere of the star, provided that rotation does not play a key role. We apply our method to the two slowly pulsating B stars HD 74560 and HD 138764 and to the beta Cephei star EN (16) Lac. Besides identifying the degree l of the pulsating stars, our method is also a tool for improving the knowledge of stellar interiors and atmospheres, by imposing constraints on parameters such as the metallicity and the mixing-length parameter alpha (a procedure we label non-adiabatic asteroseismology). The non-adiabatic eigenfunctions needed for the mode identification are available upon request from the authors.

  18. Neutrophil CD64, C-reactive protein, and procalcitonin in the identification of sepsis in the ICU - Post-test probabilities.

    PubMed

    Jämsä, Joel; Ala-Kokko, Tero; Huotari, Virva; Ohtonen, Pasi; Savolainen, Eeva-Riitta; Syrjälä, Hannu

    2018-02-01

    We were interested in whether C-reactive protein (CRP) and procalcitonin (PCT) distinguish sepsis from non-septic controls and whether a combination of CRP, PCT, and neutrophil CD64 improves identification of sepsis in the intensive care unit (ICU). We analyzed the CRP and PCT concentrations from 27 patients with sepsis and 15 ICU controls. In addition, CD64 on neutrophils was measured using quantitative flow cytometry. We present a multiple marker analysis for sepsis diagnostics combining neutrophil CD64, CRP, and PCT using post-test analysis. The CRP and PCT values separated sepsis and non-septic ICU patients. In post-test analysis, CRP provided a positive probability of 0.48 and a negative probability of 0.053 for sepsis in the ICU, while the corresponding values were 0.35 and 0.0059, respectively, for PCT and 0.62 and 0.0013, respectively, for neutrophil CD64. When neutrophil CD64 was analyzed with PCT and CRP, the probabilities were 0.98 and <0.001, respectively. Neutrophil CD64 expression was superior to PCT and CRP for the identification of sepsis in the ICU. Positive post-test probability for any combination of simultaneously analyzed CRP, PCT and CD64 showed improved diagnostic accuracy for sepsis. This approach may be useful for guiding antibiotic treatment in the ICU. Copyright © 2017 Elsevier Inc. All rights reserved.
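
    The post-test probabilities quoted above follow the standard Bayesian update used in diagnostic testing: pre-test odds are multiplied by a likelihood ratio and converted back to a probability. The sketch below shows that arithmetic only. The function names, the sensitivity, the specificity, and the pre-test probability are all hypothetical values chosen for illustration; the abstract does not report them, and this is not necessarily the exact procedure the authors used.

```python
def likelihood_ratios(sensitivity, specificity):
    """Return (LR+, LR-) for a test; both inputs must lie strictly in (0, 1)."""
    lr_positive = sensitivity / (1.0 - specificity)
    lr_negative = (1.0 - sensitivity) / specificity
    return lr_positive, lr_negative


def post_test_probability(pretest_probability, likelihood_ratio):
    """Convert a pre-test probability (strictly below 1) into a post-test one."""
    pretest_odds = pretest_probability / (1.0 - pretest_probability)
    post_test_odds = pretest_odds * likelihood_ratio
    return post_test_odds / (1.0 + post_test_odds)


if __name__ == "__main__":
    # Hypothetical marker: 90% sensitivity, 80% specificity, 10% prevalence.
    lr_pos, lr_neg = likelihood_ratios(0.90, 0.80)
    print("positive result:", post_test_probability(0.10, lr_pos))
    print("negative result:", post_test_probability(0.10, lr_neg))
```

    A positive result raises the probability above the pre-test value and a negative result lowers it, which is the pattern of paired "positive" and "negative" probabilities reported for each marker in the abstract.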

  19. Technical advance: identification of plant actin-binding proteins by F-actin affinity chromatography

    NASA Technical Reports Server (NTRS)

    Hu, S.; Brady, S. R.; Kovar, D. R.; Staiger, C. J.; Clark, G. B.; Roux, S. J.; Muday, G. K.

    2000-01-01

    Proteins that interact with the actin cytoskeleton often modulate the dynamics or organization of the cytoskeleton or use the cytoskeleton to control their localization. In plants, very few actin-binding proteins have been identified and most are thought to modulate cytoskeleton function. To identify actin-binding proteins that are unique to plants, the development of new biochemical procedures will be critical. Affinity columns using actin monomers (globular actin, G-actin) or actin filaments (filamentous actin, F-actin) have been used to identify actin-binding proteins from a wide variety of organisms. Monomeric actin from zucchini (Cucurbita pepo L.) hypocotyl tissue was purified to electrophoretic homogeneity and shown to be native and competent for polymerization to actin filaments. G-actin, F-actin and bovine serum albumin affinity columns were prepared and used to separate samples enriched in either soluble or membrane-associated actin-binding proteins. Extracts of soluble actin-binding proteins yield distinct patterns when eluted from the G-actin and F-actin columns, respectively, leading to the identification of a putative F-actin-binding protein of approximately 40 kDa. When plasma membrane-associated proteins were applied to these columns, two abundant polypeptides eluted selectively from the F-actin column and cross-reacted with antiserum against pea annexins. Additionally, a protein that binds auxin transport inhibitors, the naphthylphthalamic acid binding protein, which has been previously suggested to associate with the actin cytoskeleton, was eluted in a single peak from the F-actin column. These experiments provide a new approach that may help to identify novel actin-binding proteins from plants.

  20. Technical advance: identification of plant actin-binding proteins by F-actin affinity chromatography.

    PubMed

    Hu, S; Brady, S R; Kovar, D R; Staiger, C J; Clark, G B; Roux, S J; Muday, G K

    2000-10-01

    Proteins that interact with the actin cytoskeleton often modulate the dynamics or organization of the cytoskeleton or use the cytoskeleton to control their localization. In plants, very few actin-binding proteins have been identified and most are thought to modulate cytoskeleton function. To identify actin-binding proteins that are unique to plants, the development of new biochemical procedures will be critical. Affinity columns using actin monomers (globular actin, G-actin) or actin filaments (filamentous actin, F-actin) have been used to identify actin-binding proteins from a wide variety of organisms. Monomeric actin from zucchini (Cucurbita pepo L.) hypocotyl tissue was purified to electrophoretic homogeneity and shown to be native and competent for polymerization to actin filaments. G-actin, F-actin and bovine serum albumin affinity columns were prepared and used to separate samples enriched in either soluble or membrane-associated actin-binding proteins. Extracts of soluble actin-binding proteins yield distinct patterns when eluted from the G-actin and F-actin columns, respectively, leading to the identification of a putative F-actin-binding protein of approximately 40 kDa. When plasma membrane-associated proteins were applied to these columns, two abundant polypeptides eluted selectively from the F-actin column and cross-reacted with antiserum against pea annexins. Additionally, a protein that binds auxin transport inhibitors, the naphthylphthalamic acid binding protein, which has been previously suggested to associate with the actin cytoskeleton, was eluted in a single peak from the F-actin column. These experiments provide a new approach that may help to identify novel actin-binding proteins from plants.

  1. Identification of syntrophic acetate-oxidizing bacteria in anaerobic digesters by combined protein-based stable isotope probing and metagenomics

    PubMed Central

    Mosbæk, Freya; Kjeldal, Henrik; Mulat, Daniel G; Albertsen, Mads; Ward, Alastair J; Feilberg, Anders; Nielsen, Jeppe L

    2016-01-01

    Inhibition of anaerobic digestion through accumulation of volatile fatty acids occasionally occurs as the result of unbalanced growth between acidogenic bacteria and methanogens. A fast recovery is a prerequisite for establishing an economical production of biogas. However, very little is known about the microorganisms facilitating this recovery. In this study, we investigated the organisms involved by a novel approach of mapping protein-stable isotope probing (protein-SIP) onto a binned metagenome. Under simulation of acetate accumulation conditions, formations of 13C-labeled CO2 and CH4 were detected immediately following incubation with [U-13C]acetate, indicating high turnover rate of acetate. The identified 13C-labeled peptides were mapped onto a binned metagenome for improved identification of the organisms involved. The results revealed that Methanosarcina and Methanoculleus were actively involved in acetate turnover, as were five subspecies of Clostridia. The acetate-consuming organisms affiliating with Clostridia all contained the FTFHS gene for formyltetrahydrofolate synthetase, a key enzyme for reductive acetogenesis, indicating that these organisms are possible syntrophic acetate-oxidizing (SAO) bacteria that can facilitate acetate consumption via SAO, coupled with hydrogenotrophic methanogenesis (SAO-HM). This study represents the first study applying protein-SIP for analysis of complex biogas samples, a promising method for identifying key microorganisms utilizing specific pathways. PMID:27128991

  2. Identification of syntrophic acetate-oxidizing bacteria in anaerobic digesters by combined protein-based stable isotope probing and metagenomics.

    PubMed

    Mosbæk, Freya; Kjeldal, Henrik; Mulat, Daniel G; Albertsen, Mads; Ward, Alastair J; Feilberg, Anders; Nielsen, Jeppe L

    2016-10-01

    Inhibition of anaerobic digestion through accumulation of volatile fatty acids occasionally occurs as the result of unbalanced growth between acidogenic bacteria and methanogens. A fast recovery is a prerequisite for establishing an economical production of biogas. However, very little is known about the microorganisms facilitating this recovery. In this study, we investigated the organisms involved by a novel approach of mapping protein-stable isotope probing (protein-SIP) onto a binned metagenome. Under simulation of acetate accumulation conditions, formations of (13)C-labeled CO2 and CH4 were detected immediately following incubation with [U-(13)C]acetate, indicating high turnover rate of acetate. The identified (13)C-labeled peptides were mapped onto a binned metagenome for improved identification of the organisms involved. The results revealed that Methanosarcina and Methanoculleus were actively involved in acetate turnover, as were five subspecies of Clostridia. The acetate-consuming organisms affiliating with Clostridia all contained the FTFHS gene for formyltetrahydrofolate synthetase, a key enzyme for reductive acetogenesis, indicating that these organisms are possible syntrophic acetate-oxidizing (SAO) bacteria that can facilitate acetate consumption via SAO, coupled with hydrogenotrophic methanogenesis (SAO-HM). This study represents the first study applying protein-SIP for analysis of complex biogas samples, a promising method for identifying key microorganisms utilizing specific pathways.

  3. Identification of functional interactome of a key cell division regulatory protein CedA of E.coli.

    PubMed

    Sharma, Pankaj; Tomar, Anil Kumar; Kundu, Bishwajit

    2018-01-01

    Cell division is compromised in DnaAcos mutant Escherichia coli cells that results in filamentous cell morphology. This is countered by over-expression of CedA protein that induces cytokinesis and thus, regular cell morphology is regained; however via an unknown mechanism. To understand the process systematically, exact role of CedA should be deciphered. Protein interactions are crucial for functional organization of a cell and their identification helps in revealing exact function(s) of a protein and its binding partners. Thus, this study was intended to identify CedA binding proteins (CBPs) to gain more clues of CedA function. We isolated CBPs by pull down assay using purified recombinant CedA and identified nine CBPs by mass spectrometric analysis (MALDI-TOF MS and LC-MS/MS), viz. PDHA1, RL2, DNAK, LPP, RPOB, G6PD, GLMS, RL3 and YBCJ. Based on CBPs identified, we hypothesize that CedA plays a crucial and multifaceted role in cell cycle regulation and specific pathways in which CedA participates may include transcription and energy metabolism. However, further validation through in-vitro and in-vivo experiments is necessary. In conclusion, identification of CBPs may help us in deciphering mechanism of CedA mediated cell division during chromosomal DNA over-replication. Copyright © 2017 Elsevier B.V. All rights reserved.

  4. A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics

    PubMed Central

    Nesvizhskii, Alexey I.

    2010-01-01

    This manuscript provides a comprehensive review of the peptide and protein identification process using tandem mass spectrometry (MS/MS) data generated in shotgun proteomic experiments. The commonly used methods for assigning peptide sequences to MS/MS spectra are critically discussed and compared, from basic strategies to advanced multi-stage approaches. Particular attention is paid to the problem of false-positive identifications. Existing statistical approaches for assessing the significance of peptide to spectrum matches are surveyed, ranging from single-spectrum approaches such as expectation values to global error rate estimation procedures such as false discovery rates and posterior probabilities. The importance of using auxiliary discriminant information (mass accuracy, peptide separation coordinates, digestion properties, etc.) is discussed, and advanced computational approaches for joint modeling of multiple sources of information are presented. This review also includes a detailed analysis of the issues affecting the interpretation of data at the protein level, including the amplification of error rates when going from peptide to protein level, and the ambiguities in inferring the identities of sample proteins in the presence of shared peptides. Commonly used methods for computing protein-level confidence scores are discussed in detail. The review concludes with a discussion of several outstanding computational issues. PMID:20816881
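
    To make the idea of a global false discovery rate concrete, the sketch below uses the target-decoy strategy, one widely used way to estimate it for peptide-spectrum matches (PSMs). The abstract does not name this strategy, so it is an assumption here, as are the function name `estimate_fdr`, the `(score, is_decoy)` tuple layout, and the made-up scores.

```python
def estimate_fdr(psms, threshold):
    """Estimate FDR among target PSMs scoring at or above `threshold`.

    `psms` is an iterable of (score, is_decoy) pairs, where higher scores
    are better and is_decoy marks matches to a decoy (e.g. reversed) database.
    Uses the simple estimator: decoy hits / target hits.
    """
    targets = sum(1 for score, is_decoy in psms if score >= threshold and not is_decoy)
    decoys = sum(1 for score, is_decoy in psms if score >= threshold and is_decoy)
    if targets == 0:
        return 0.0  # nothing accepted, so no false discoveries to report
    return min(1.0, decoys / targets)


if __name__ == "__main__":
    # Made-up PSM scores for illustration only.
    example = [(10, False), (9, False), (8, True), (7, False), (6, False), (5, True)]
    print(estimate_fdr(example, threshold=6))
```

    Lowering the threshold accepts more PSMs but typically admits more decoy hits, so the estimated FDR rises; a search is usually reported at the lowest threshold that keeps the estimate under a chosen level such as 1%.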

  5. Identification of methyllysine peptides binding to chromobox protein homolog 6 chromodomain in the human proteome.

    PubMed

    Li, Nan; Stein, Richard S L; He, Wei; Komives, Elizabeth; Wang, Wei

    2013-10-01

    Methylation is one of the important post-translational modifications that play critical roles in regulating protein functions. Proteomic identification of this post-translational modification and understanding how it affects protein activity remain great challenges. We tackled this problem from the aspect of methylation mediating protein-protein interaction. Using the chromodomain of human chromobox protein homolog 6 as a model system, we developed a systematic approach that integrates structure modeling, bioinformatics analysis, and peptide microarray experiments to identify lysine residues that are methylated and recognized by the chromodomain in the human proteome. Given the important role of chromobox protein homolog 6 as a reader of histone modifications, it was interesting to find that the majority of its interacting partners identified via this approach function in chromatin remodeling and transcriptional regulation. Our study not only illustrates a novel angle for identifying methyllysines on a proteome-wide scale and elucidating their potential roles in regulating protein function, but also suggests possible strategies for engineering the chromodomain-peptide interface to enhance the recognition of and manipulate the signal transduction mediated by such interactions.

  6. Cross-reaction between Formosan Termite (Coptotermes formosanus) Proteins and Cockroach Allergens

    USDA-ARS's Scientific Manuscript database

    Edible insects, such as cockroaches and termites, are beginning to be popularized as an alternate source of protein and have high nutritional value. Identification of cross-reactivity between commonly consumed food proteins and edible insects is important for food safety and to enable improvements ...

  7. Improving foaming properties of yolk-contaminated egg albumen by basic soy protein.

    PubMed

    Wang, Guang; Wang, Tong

    2009-10-01

    Yolk contamination of egg white is a common problem in the egg breaking industry. Foaming properties of egg white protein are affected by such contamination, but proteins of basic nature may restore the foaming properties of the yolk-contaminated egg white protein. The purpose of this study was to chemically modify a soy protein, that is, to esterify the acidic groups on the protein and to study the potential of such modified protein in improving foaming. We showed that the modification changed the isoelectric point of soy protein isolate (SPI) from 4.5 to about 10. Sonication was proven to be a very effective means to redisperse the methanol-denatured soy protein during reaction, as shown by the improved solubility profile. Such modified basic protein, that is, the sonicated-modified SPI (SMSPI), when added to the yolk-contaminated (at 0.4% level, as-is basis) egg white, gave significantly improved foaming properties. We have shown that the slight change in pH due to the addition of SMSPI was not the reason for improved foaming performance; instead, the modified protein itself was the main reason for such improvement. Addition of SMSPI increased the foaming performance of both pure egg white and yolk-contaminated egg white. SMSPI consistently performed better than the unmodified SPI for improving foaming. Addition of SMSPI (16%, based on dry egg white, and 1.6% based on liquid egg white) fully restored foam expansion and foam liquid stability of 0.4% yolk-contaminated egg white, and it even out-performed the foaming of pure white protein. Therefore, a feasible solution to restore the foaming properties of yolk-contaminated egg white has been identified. It is expected that such modified SPI can be used as an additive or ingredient in foaming formulation, especially when the egg white protein is suspected of lipid contamination.

  8. Protein-based forensic identification using genetically variant peptides in human bone.

    PubMed

    Mason, Katelyn Elizabeth; Anex, Deon; Grey, Todd; Hart, Bradley; Parker, Glendon

    2018-04-22

    Bone tissue contains organic material that is useful for forensic investigations and may contain preserved endogenous protein that can persist in the environment for extended periods of time over a range of conditions. Single amino acid polymorphisms in these proteins reflect genetic information since they result from non-synonymous single nucleotide polymorphisms (SNPs) in DNA. Detection of genetically variant peptides (GVPs) - those peptides that contain amino acid polymorphisms - in digests of bone proteins allows for the corresponding SNP alleles to be inferred. Resulting genetic profiles can be used to calculate statistical measures of association between a bone sample and an individual. In this study proteomic analysis on rib cortical bone samples from 10 recently deceased individuals demonstrates this concept. A straightforward acidic demineralization protocol yielded proteins that were digested with trypsin. Tryptic digests were analyzed by liquid chromatography mass spectrometry. A total of 1736 different proteins were identified across all resulting datasets. On average, individual samples contained 454 ± 121 (x̄ ± σ) proteins. Thirty-five genetically variant peptides were identified from 15 observed proteins. Overall, 134 SNP inferences were made based on proteomically detected GVPs, which were confirmed by sequencing of subject DNA. Inferred individual SNP genetic profiles ranged in random match probability (RMP) from 1/6 to 1/42,472 when calculated with European population frequencies in the 1000 Genomes Project, Phase 3. Similarly, RMPs based on African population frequencies were calculated for each SNP genetic profile and likelihood ratios (LR) were obtained by dividing each European RMP by the corresponding African RMP. Resulting LR values ranged from 1.4 to 825 with a median value of 16. GVP markers offer a basis for the identification of compromised skeletal remains independent of the presence of DNA template. Published by Elsevier B.V.
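
    The random match probability and likelihood ratio described above reduce to simple arithmetic. The sketch below multiplies per-locus genotype frequencies (the product rule, which assumes the loci are statistically independent) and then divides the European RMP by the African RMP as the abstract states. The function names and all frequencies are hypothetical and are not values from the study.

```python
from math import prod


def random_match_probability(genotype_frequencies):
    """Product-rule RMP: multiply per-locus genotype frequencies.

    Assumes independence between loci; frequencies are population-specific.
    """
    return prod(genotype_frequencies)


def likelihood_ratio(rmp_european, rmp_african):
    """LR as described in the abstract: European RMP divided by African RMP."""
    return rmp_european / rmp_african


if __name__ == "__main__":
    # Hypothetical genotype frequencies for the same three SNP loci.
    european = [0.50, 0.40, 0.25]
    african = [0.25, 0.20, 0.10]
    rmp_eur = random_match_probability(european)
    rmp_afr = random_match_probability(african)
    print("European RMP: 1 in", round(1 / rmp_eur))
    print("African RMP: 1 in", round(1 / rmp_afr))
    print("LR:", likelihood_ratio(rmp_eur, rmp_afr))
```

    Because the RMP is a product, each additional inferred SNP genotype can only keep the probability the same or shrink it, which is why profiles with more detected GVPs reach much smaller RMPs.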

  9. Application of an Improved Proteomics Method for Abundant Protein Cleanup: Molecular and Genomic Mechanisms Study in Plant Defense*

    PubMed Central

    Zhang, Yixiang; Gao, Peng; Xing, Zhuo; Jin, Shumei; Chen, Zhide; Liu, Lantao; Constantino, Nasie; Wang, Xinwang; Shi, Weibing; Yuan, Joshua S.; Dai, Susie Y.

    2013-01-01

    High abundance proteins like ribulose-1,5-bisphosphate carboxylase oxygenase (Rubisco) impose a consistent challenge for the whole proteome characterization using shot-gun proteomics. To address this challenge, we developed and evaluated Polyethyleneimine Assisted Rubisco Cleanup (PARC) as a new method by combining both abundant protein removal and fractionation. The new approach was applied to a plant insect interaction study to validate the platform and investigate mechanisms for plant defense against herbivorous insects. Our results indicated that PARC can effectively remove Rubisco, improve the protein identification, and discover almost three times more differentially regulated proteins. The significantly enhanced shot-gun proteomics performance was translated into in-depth proteomic and molecular mechanisms for plant insect interaction, where carbon re-distribution was used to play an essential role. Moreover, the transcriptomic validation also confirmed the reliability of PARC analysis. Finally, functional studies were carried out for two differentially regulated genes as revealed by PARC analysis. Insect resistance was induced by over-expressing either jacalin-like or cupin-like genes in rice. The results further highlighted that PARC can serve as an effective strategy for proteomics analysis and gene discovery. PMID:23943779

  10. Identification of herpesvirus proteins that contribute to G1/S arrest.

    PubMed

    Paladino, Patrick; Marcon, Edyta; Greenblatt, Jack; Frappier, Lori

    2014-04-01

    Lytic infection by herpesviruses induces cell cycle arrest at the G1/S transition. This appears to be a function of multiple herpesvirus proteins, but only a minority of herpesvirus proteins have been examined for cell cycle effects. To gain a more comprehensive understanding of the viral proteins that contribute to G1/S arrest, we screened a library of over 200 proteins from herpes simplex virus type 1, human cytomegalovirus, and Epstein-Barr virus (EBV) for effects on the G1/S interface, using HeLa fluorescent, ubiquitination-based cell cycle indicator (Fucci) cells in which G1/S can be detected colorimetrically. Proteins from each virus were identified that induce accumulation of G1/S cells, predominantly tegument, early, and capsid proteins. The identification of several capsid proteins in this screen suggests that incoming viral capsids may function to modulate cellular processes. The cell cycle effects of selected EBV proteins were further verified and examined for effects on p53 and p21 as regulators of the G1/S transition. Two EBV replication proteins (BORF2 and BMRF1) were found to induce p53 but not p21, while a previously uncharacterized tegument protein (BGLF2) was found to induce p21 protein levels in a p53-independent manner. Proteomic analyses of BGLF2-interacting proteins identified interactions with the NIMA-related protein kinase (NEK9) and GEM-interacting protein (GMIP). Silencing of either NEK9 or GMIP induced p21 without affecting p53 and abrogated the ability of BGLF2 to further induce p21. Collectively, these results suggest multiple viral proteins contribute to G1/S arrest, including BGLF2, which induces p21 levels likely by interfering with the functions of NEK9 and GMIP. Most people are infected with multiple herpesviruses, whose proteins alter the infected cells in several ways. During lytic infection, the viral proteins block cell proliferation just before the cellular DNA replicates. 
We used a novel screening method to identify proteins

  11. MSblender: A probabilistic approach for integrating peptide identifications from multiple database search engines.

    PubMed

    Kwon, Taejoon; Choi, Hyungwon; Vogel, Christine; Nesvizhskii, Alexey I; Marcotte, Edward M

    2011-07-01

    Shotgun proteomics using mass spectrometry is a powerful method for protein identification but suffers limited sensitivity in complex samples. Integrating peptide identifications from multiple database search engines is a promising strategy to increase the number of peptide identifications and reduce the volume of unassigned tandem mass spectra. Existing methods pool statistical significance scores such as p-values or posterior probabilities of peptide-spectrum matches (PSMs) from multiple search engines after high scoring peptides have been assigned to spectra, but these methods lack reliable control of identification error rates as data are integrated from different search engines. We developed a statistically coherent method for integrative analysis, termed MSblender. MSblender converts raw search scores from search engines into a probability score for every possible PSM and properly accounts for the correlation between search scores. The method reliably estimates false discovery rates and identifies more PSMs than any single search engine at the same false discovery rate. Increased identifications increment spectral counts for most proteins and allow quantification of proteins that would not have been quantified by individual search engines. We also demonstrate that enhanced quantification contributes to improve sensitivity in differential expression analyses.
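    The abstract above describes turning search scores into a probability for each PSM and then estimating false discovery rates. A minimal sketch of that last step follows, assuming each PSM already carries a posterior probability of being correct. This is the generic probability-based FDR estimate (the mean of 1 - p over the accepted set), not MSblender's own implementation; the function name and the example PSM identifiers are hypothetical.

```python
def accept_psms_at_fdr(psm_probs, max_fdr=0.01):
    """Return (accepted_ids, estimated_fdr) for the largest top-ranked set within max_fdr.

    psm_probs: dict mapping a (hypothetical) PSM identifier to its posterior
    probability of being a correct match.
    """
    # Rank PSMs from most to least confident.
    ranked = sorted(psm_probs.items(), key=lambda kv: kv[1], reverse=True)
    accepted, best_fdr = [], 0.0
    expected_false = 0.0
    for n, (psm_id, prob) in enumerate(ranked, start=1):
        # Each accepted PSM contributes (1 - p) expected false matches.
        expected_false += 1.0 - prob
        fdr = expected_false / n
        # The running mean of (1 - p) never decreases on a descending ranking,
        # so the first violation is a safe place to stop.
        if fdr > max_fdr:
            break
        accepted = [p for p, _ in ranked[:n]]
        best_fdr = fdr
    return accepted, best_fdr


if __name__ == "__main__":
    example = {"psm_a": 0.99, "psm_b": 0.95, "psm_c": 0.50}
    ids, fdr = accept_psms_at_fdr(example, max_fdr=0.05)
    print(ids, round(fdr, 3))
```

    Comparing search engines "at the same false discovery rate", as the abstract does, then amounts to counting how many PSMs each scoring scheme lets through at a fixed `max_fdr`.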

  12. MSblender: a probabilistic approach for integrating peptide identifications from multiple database search engines

    PubMed Central

    Kwon, Taejoon; Choi, Hyungwon; Vogel, Christine; Nesvizhskii, Alexey I.; Marcotte, Edward M.

    2011-01-01

    Shotgun proteomics using mass spectrometry is a powerful method for protein identification but suffers limited sensitivity in complex samples. Integrating peptide identifications from multiple database search engines is a promising strategy to increase the number of peptide identifications and reduce the volume of unassigned tandem mass spectra. Existing methods pool statistical significance scores such as p-values or posterior probabilities of peptide-spectrum matches (PSMs) from multiple search engines after high scoring peptides have been assigned to spectra, but these methods lack reliable control of identification error rates as data are integrated from different search engines. We developed a statistically coherent method for integrative analysis, termed MSblender. MSblender converts raw search scores from search engines into a probability score for all possible PSMs and properly accounts for the correlation between search scores. The method reliably estimates false discovery rates and identifies more PSMs than any single search engine at the same false discovery rate. Increased identifications increment spectral counts for all detected proteins and allow quantification of proteins that would not have been quantified by individual search engines. We also demonstrate that enhanced quantification contributes to improve sensitivity in differential expression analyses. PMID:21488652

  13. Proteomic identification of early salicylate- and flg22-responsive redox-sensitive proteins in Arabidopsis

    PubMed Central

    Liu, Pei; Zhang, Huoming; Yu, Boying; Xiong, Liming; Xia, Yiji

    2015-01-01

    Accumulation of reactive oxygen species (ROS) is one of the early defense responses against pathogen infection in plants. The mechanism about the initial and direct regulation of the defense signaling pathway by ROS remains elusive. Perturbation of cellular redox homeostasis by ROS is believed to alter functions of redox-sensitive proteins through their oxidative modifications. Here we report an OxiTRAQ-based proteomic study in identifying proteins whose cysteines underwent oxidative modifications in Arabidopsis cells during the early response to salicylate or flg22, two defense pathway elicitors that are known to disturb cellular redox homeostasis. Among the salicylate- and/or flg22-responsive redox-sensitive proteins are those involved in transcriptional regulation, chromatin remodeling, RNA processing, post-translational modifications, and nucleocytoplasmic shuttling. The identification of the salicylate-/flg22-responsive redox-sensitive proteins provides a foundation from which further study can be conducted toward understanding biological significance of their oxidative modifications during the plant defense response. PMID:25720653

  14. Identification of a putative protein profile associating with tamoxifen therapy resistance in breast cancer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Umar, Arzu; Kang, Hyuk; Timmermans, A. M.

    2009-06-01

    Tamoxifen-resistance is a major cause of death in patients with recurrent breast cancer. Current clinical factors can correctly predict therapy response in only half of the treated patients. Identification of proteins that associate with tamoxifen-resistance is a first step towards better response prediction and tailored treatment of patients. In the present study we intended to identify putative protein biomarkers indicative of tamoxifen therapy-resistance in breast cancer, using nanoLC coupled with FTICR MS. Comparative proteome analysis was performed on ~5,500 pooled tumor cells (corresponding to ~550 ng protein lysate/analysis) obtained through laser capture microdissection (LCM) from two independently processed data sets (n=24 and n=27) containing both tamoxifen therapy-sensitive and therapy-resistant tumors. Peptides and proteins were identified by matching mass and elution time of newly acquired LC-MS features to information in previously generated accurate mass and time tag (AMT) reference databases.

  15. Detection of protein-protein interactions by ribosome display and protein in situ immobilisation.

    PubMed

    He, Mingyue; Liu, Hong; Turner, Martin; Taussig, Michael J

    2009-12-31

    We describe a method for identification of protein-protein interactions by combining two cell-free protein technologies, namely ribosome display and protein in situ immobilisation. The method requires only PCR fragments as the starting material, the target proteins being made through cell-free protein synthesis, either associated with their encoding mRNA as ribosome complexes or immobilised on a solid surface. The use of ribosome complexes allows identification of interacting protein partners from their attached coding mRNA. To demonstrate the procedures, we have employed the lymphocyte signalling proteins Vav1 and Grb2 and confirmed the interaction between Grb2 and the N-terminal SH3 domain of Vav1. The method has promise for library screening of pairwise protein interactions, down to the analytical level of individual domain or motif mapping.

  16. Improving protein quality of bread - nutritional benefits and realities.

    PubMed

    Betschart, A A

    1978-01-01

    The bases for improving bread protein quality are critically examined. Protein consumption is shown to be directly related to total calorie intake in many countries, with a correlation coefficient (r) of greater than or equal to 0.90. Concentration of protein in bread, % kilocalories, is similar to that of mixed diets in many parts of the world. Quality of bread protein, when evaluated by male weanling rats, may be improved by supplementation with lysine and threonine, as well as with many protein sources. Human adults, on bread diets, may be maintained in nitrogen equilibrium or slightly positive nitrogen balance. Increases, however, in nitrogen retention have been reported when lysine was added to bread. Laboratory studies with infants and young children, often hospitalized and recovering from severe malnutrition, show that lysine supplementation of wheat flour and gluten diets enhanced nitrogen retention and weight gain. No effect was observed when whole wheat diets were supplemented with lysine. Several field studies with children indicate that the addition of lysine to either supplemental breads provided at school, or to all wheat products consumed, resulted in no observed beneficial effects. Other field studies report an increase in either weight or height with addition of lysine to breads. A laboratory study with human adults suggests that a wheat flour: soy flour mixture has a higher biological value than wheat flour alone. The role, in human nutrition, of breads with improved protein quality remains somewhat obscure.

  17. Proteomic identification of altered cerebral proteins in the complex regional pain syndrome animal model.

    PubMed

    Nahm, Francis Sahngun; Park, Zee-Yong; Nahm, Sang-Soep; Kim, Yong Chul; Lee, Pyung Bok

    2014-01-01

    Complex regional pain syndrome (CRPS) is a rare but debilitating pain disorder. Although the exact pathophysiology of CRPS is not fully understood, central and peripheral mechanisms might be involved in the development of this disorder. To reveal the central mechanism of CRPS, we conducted a proteomic analysis of rat cerebrum using the chronic postischemia pain (CPIP) model, a novel experimental model of CRPS. After generating the CPIP animal model, we performed a proteomic analysis of the rat cerebrum using a multidimensional protein identification technology, and screened the proteins differentially expressed between the CPIP and control groups. Results. A total of 155 proteins were differentially expressed between the CPIP and control groups: 125 increased and 30 decreased; expression of proteins related to cell signaling, synaptic plasticity, regulation of cell proliferation, and cytoskeletal formation was increased in the CPIP group. However, proenkephalin A, cereblon, and neuroserpin were decreased in the CPIP group. Altered expression of cerebral proteins in the CPIP model indicates cerebral involvement in the pathogenesis of CRPS. Further study is required to elucidate the roles of these proteins in the development and maintenance of CRPS.

  18. Protein S-nitrosylation: specificity and identification strategies in plants

    NASA Astrophysics Data System (ADS)

    Lamotte, Olivier; Bertoldo, Jean; Besson-Bard, Angélique; Rosnoblet, Claire; Aimé, Sébastien; Hichami, Siham; Terenzi, Hernan; Wendehenne, David

    2014-12-01

    The role of nitric oxide (NO) as a major regulator of plant physiological functions has become increasingly evident. To further improve our understanding of its role, within the last few years plant biologists have begun to embrace the exciting opportunity of investigating protein S-nitrosylation, a major reversible NO-dependent post-translational modification (PTM) targeting specific Cys residues and widely studied in animals. Thanks to the development of dedicated proteomic approaches, in particular the use of the Biotin Switch Technique (BST) combined with mass spectrometry, hundreds of plant protein candidates for S-nitrosylation have been identified. Functional studies focused on specific proteins provided preliminary comprehensive views of how this PTM impacts the structure and function of proteins and, more generally, of how NO might regulate biological plant processes. The aim of this review is to detail the basic principle of protein S-nitrosylation, to provide information on the biochemical and structural features of the S-nitrosylation sites and to describe the proteomic strategies adopted to investigate this PTM in plants. Limits of the current approaches and tomorrow's challenges are also discussed.

  19. Proteomic identification of plant proteins probed by mammalian nitric oxide synthase antibodies.

    PubMed

    Butt, Yoki Kwok-Chu; Lum, John Hon-Kei; Lo, Samuel Chun-Lap

    2003-03-01

    Several studies suggest that a mammalian-like nitric oxide synthase (NOS) exists in plants. Researchers have attempted to verify its presence using two approaches: (i) determination of NOS functional activity and (ii) probing with mammalian NOS antibodies. However, up to now, neither a NOS-like gene nor a protein has been found in plants. While there is still some controversy over whether the NOS functional activity seen is due to nitrate reductase, using the mammalian NOS antibodies in western blot analysis, several groups have reported the presence of immunoreactive protein bands in plant homogenates. Based on these results, immunohistochemical studies using these antibodies have also been used to localize NOS in plant tissues. However, plant NOS has never been positively identified or characterized. Thus, we used a proteomic approach to verify the identities of plant proteins that cross-reacted with the mammalian NOS antibodies. Proteins extracted from maize (Zea mays L.) embryonic axes were separated by two-dimensional gel electrophoresis and subjected to western blot analysis with the mammalian neuronal NOS and inducible NOS antibodies. Twenty immunoreactive protein spots recognized on a corresponding Coomassie blue-stained two-dimensional gel were subjected to tryptic digestion, followed by identification using matrix-assisted laser desorption/ionization-time of flight mass spectrometry. Fifteen proteins were successfully identified and they have described functions that are unrelated to NO metabolism. The remaining five proteins could not be identified. The amino acid sequences of these identified proteins and those used to raise the antibodies were aligned. However, no homologous region could be found. Our results demonstrate that the mammalian NOS antibodies recognize many NOS-unrelated plant proteins. Therefore, it is inappropriate to infer the presence of plant NOS using this immunological technique.

  20. Large-scale identification of target proteins of a glycosyltransferase isozyme by Lectin-IGOT-LC/MS, an LC/MS-based glycoproteomic approach

    PubMed Central

    Sugahara, Daisuke; Kaji, Hiroyuki; Sugihara, Kazushi; Asano, Masahide; Narimatsu, Hisashi

    2012-01-01

    Model organisms containing deletion or mutation in a glycosyltransferase-gene exhibit various physiological abnormalities, suggesting that specific glycan motifs on certain proteins play important roles in vivo. Identification of the target proteins of glycosyltransferase isozymes is the key to understand the roles of glycans. Here, we demonstrated the proteome-scale identification of the target proteins specific for a glycosyltransferase isozyme, β1,4-galactosyltransferase-I (β4GalT-I). Although β4GalT-I is the most characterized glycosyltransferase, its distinctive contribution to β1,4-galactosylation has been hardly described so far. We identified a large number of candidates for the target proteins specific to β4GalT-I by comparative analysis of β4GalT-I-deleted and wild-type mice using the LC/MS-based technique with the isotope-coded glycosylation site-specific tagging (IGOT) of lectin-captured N-glycopeptides. Our approach to identify the target proteins in a proteome-scale offers common features and trends in the target proteins, which facilitate understanding of the mechanism that controls assembly of a particular glycan motif on specific proteins. PMID:23002422

  1. Mass Spectrometric Identification of the Arginine and Lysine deficient Proline Rich Glutamine Rich Wheat Storage Proteins

    USDA-ARS's Scientific Manuscript database

    Tandem mass spectrometry (MS/MS) of enzymatic digest has made possible identification of a wide variety of proteins and complex samples prepared by such techniques as RP-HPLC or 2-D gel electrophoresis. Success requires peptide fragmentation to be indicative of the peptide amino acid sequence. The f...

  2. Improvement of genome assembly completeness and identification of novel full-length protein-coding genes by RNA-seq in the giant panda genome.

    PubMed

    Chen, Meili; Hu, Yibo; Liu, Jingxing; Wu, Qi; Zhang, Chenglin; Yu, Jun; Xiao, Jingfa; Wei, Fuwen; Wu, Jiayan

    2015-12-11

    High-quality and complete gene models are the basis of whole genome analyses. The giant panda (Ailuropoda melanoleuca) genome was the first genome sequenced on the basis of solely short reads, but the genome annotation had lacked the support of transcriptomic evidence. In this study, we applied RNA-seq to globally improve the genome assembly completeness and to detect novel expressed transcripts in 12 tissues from giant pandas, by using a transcriptome reconstruction strategy that combined reference-based and de novo methods. Several aspects of genome assembly completeness in the transcribed regions were effectively improved by the de novo assembled transcripts, including genome scaffolding, the detection of small-size assembly errors, the extension of scaffold/contig boundaries, and gap closure. Through expression and homology validation, we detected three groups of novel full-length protein-coding genes. A total of 12.62% of the novel protein-coding genes were validated by proteomic data. GO annotation analysis showed that some of the novel protein-coding genes were involved in pigmentation, anatomical structure formation and reproduction, which might be related to the development and evolution of the black-white pelage, pseudo-thumb and delayed embryonic implantation of giant pandas. The updated genome annotation will help further giant panda studies from both structural and functional perspectives.

  3. Carbene footprinting accurately maps binding sites in protein-ligand and protein-protein interactions

    NASA Astrophysics Data System (ADS)

    Manzi, Lucio; Barrow, Andrew S.; Scott, Daniel; Layfield, Robert; Wright, Timothy G.; Moses, John E.; Oldham, Neil J.

    2016-11-01

    Specific interactions between proteins and their binding partners are fundamental to life processes. The ability to detect protein complexes, and map their sites of binding, is crucial to understanding basic biology at the molecular level. Methods that employ sensitive analytical techniques such as mass spectrometry have the potential to provide valuable insights with very little material and on short time scales. Here we present a differential protein footprinting technique employing an efficient photo-activated probe for use with mass spectrometry. Using this methodology the location of a carbohydrate substrate was accurately mapped to the binding cleft of lysozyme, and in a more complex example, the interactions between a 100 kDa, multi-domain deubiquitinating enzyme, USP5 and a diubiquitin substrate were located to different functional domains. The much improved properties of this probe make carbene footprinting a viable method for rapid and accurate identification of protein binding sites utilizing benign, near-UV photoactivation.

  4. Improving arrival time identification in transient elastography

    NASA Astrophysics Data System (ADS)

    Klein, Jens; McLaughlin, Joyce; Renzi, Daniel

    2012-04-01

    In this paper, we improve the first step in the arrival time algorithm used for shear wave speed recovery in transient elastography. In transient elastography, a shear wave is initiated at the boundary and the interior displacement of the propagating shear wave is imaged with an ultrasound ultra-fast imaging system. The first step in the arrival time algorithm finds the arrival times of the shear wave by cross correlating displacement time traces (the time history of the displacement at a single point) with a reference time trace located near the shear wave source. The second step finds the shear wave speed from the arrival times. In performing the first step, we observe that the wave pulse decorrelates as it travels through the medium, which leads to inaccurate estimates of the arrival times and ultimately to blurring and artifacts in the shear wave speed image. In particular, wave ‘spreading’ accounts for much of this decorrelation. Here we remove most of the decorrelation by allowing the reference wave pulse to spread during the cross correlation. This dramatically improves the images obtained from arrival time identification. We illustrate the improvement of this method on phantom and in vivo data obtained from the laboratory of Mathias Fink at ESPCI, Paris.
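    The first step described above, finding arrival times by cross-correlating a displacement time trace with a reference trace, can be sketched as follows. This is only the plain cross-correlation baseline, assuming uniformly sampled traces; it does not include the authors' improvement of letting the reference pulse spread. The function names and the sample interval are hypothetical.

```python
def arrival_lag(reference, trace, max_lag):
    """Return the non-negative lag (in samples) at which `trace` best matches `reference`."""
    best_lag, best_score = 0, float("-inf")
    for lag in range(max_lag + 1):
        # Cross-correlation at this lag: overlap the reference with the
        # trace shifted back by `lag` samples and sum the products.
        score = sum(
            reference[i] * trace[i + lag]
            for i in range(len(reference))
            if i + lag < len(trace)
        )
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def arrival_time(reference, trace, max_lag, sample_interval):
    """Convert the best-matching lag into a time delay relative to the reference."""
    return arrival_lag(reference, trace, max_lag) * sample_interval


if __name__ == "__main__":
    reference = [0, 1, 2, 1, 0, 0, 0, 0, 0, 0]   # pulse near the source
    trace = [0, 0, 0, 0, 1, 2, 1, 0, 0, 0]       # same pulse, three samples later
    print(arrival_lag(reference, trace, max_lag=5))
```

    When the pulse changes shape as it travels, the peak of this correlation becomes less distinct, which is the decorrelation problem the abstract sets out to reduce.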

  5. The Yeast Saccharomyces cerevisiae: a versatile model system for the identification and characterization of bacterial virulence proteins.

    PubMed

    Siggers, Keri A; Lesser, Cammie F

    2008-07-17

    Microbial pathogens utilize complex secretion systems to deliver proteins into host cells. These effector proteins target and usurp host cell processes to promote infection and cause disease. While secretion systems are conserved, each pathogen delivers its own unique set of effectors. The identification and characterization of these effector proteins has been difficult, often limited by the lack of detectable signal sequences and functional redundancy. Model systems including yeast, worms, flies, and fish are being used to circumvent these issues. This technical review details the versatility and utility of yeast Saccharomyces cerevisiae as a system to identify and characterize bacterial effectors.

  6. Proteins improving recombinant antibody production in mammalian cells.

    PubMed

    Nishimiya, Daisuke

    2014-02-01

    Mammalian cells have been successfully used for the industrial manufacture of antibodies due to their ability to synthesize antibodies correctly. Nascent polypeptides must be subjected to protein folding and assembly in the ER and the Golgi to be secreted as mature proteins. If these reactions do not proceed appropriately, unfolded or misfolded proteins are degraded by the ER-associated degradation (ERAD) pathway. The accumulation of unfolded proteins or intracellular antibody crystals accompanied by this failure triggers the unfolded protein response (UPR), which can considerably attenuate the levels of translation, folding, assembly, and secretion, resulting in reduction of antibody productivity. Accumulating studies by omics-based analysis of recombinant mammalian cells suggest that not only protein secretion processes including protein folding and assembly but also translation are likely to be the rate-limiting factors for increasing antibody production. Here, this review describes the mechanism of antibody folding and assembly and recent advances which could improve recombinant antibody production in mammalian cells by utilizing proteins such as ER chaperones or UPR-related proteins.

  7. High-throughput screening of T7 phage display and protein microarrays as a methodological approach for the identification of IgE-reactive components.

    PubMed

    San Segundo-Acosta, Pablo; Garranzo-Asensio, María; Oeo-Santos, Carmen; Montero-Calle, Ana; Quiralte, Joaquín; Cuesta-Herranz, Javier; Villalba, Mayte; Barderas, Rodrigo

    2018-05-01

    Olive pollen and yellow mustard seeds are major allergenic sources with high clinical relevance. To aid with the identification of IgE-reactive components, the development of sensitive methodological approaches is required. Here, we have combined T7 phage display and protein microarrays for the identification of allergenic peptides and mimotopes from olive pollen and mustard seeds. The identification of these allergenic sequences involved the construction and biopanning of T7 phage display libraries of mustard seeds and olive pollen using sera from allergic patients to both biological sources together with the construction of phage microarrays printed with 1536 monoclonal phages from the third/fourth rounds of biopanning. The screening of the phage microarrays with individual sera from allergic patients enabled the identification of 10 and 9 IgE-reactive unique amino acid sequences from olive pollen and mustard seeds, respectively. Five immunoreactive amino acid sequences displayed on phages were selected for their expression as His6-GST tag fusion proteins and validation. After immunological characterization, we assessed the IgE-reactivity of the constructs. Our results show that protein microarrays printed with T7 phages displaying peptides from allergenic sources might be used to identify allergenic components -peptides, proteins or mimotopes- through their screening with specific IgE antibodies from allergic patients. Copyright © 2018 Elsevier B.V. All rights reserved.

  8. Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry as a tool for fast identification of protein binders in color layers of paintings.

    PubMed

    Hynek, Radovan; Kuckova, Stepanka; Hradilova, Janka; Kodicek, Milan

    2004-01-01

    Identification of materials in color layers of paintings is necessary for correct decisions concerning restoration procedures as well as proving the authenticity of the painting. The proteins are usually important components of the painting layers. In this paper it has been demonstrated that matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOFMS) can be used for fast and reliable identification of proteins in color layers even in old, highly aged matrices. The digestion can be easily performed directly on silica wafers which are routinely used for infrared analysis. The amount of material necessary for such an analysis is extremely small. Peptide mass mapping using digestion with trypsin followed by MALDI-TOFMS and identification of the protein was successfully used for determination of the binder from a painting of the 19th century. Copyright 2004 John Wiley & Sons, Ltd.

  9. Designing and benchmarking the MULTICOM protein structure prediction system

    PubMed Central

    2013-01-01

    Background: Predicting protein structure from sequence is one of the most significant and challenging problems in bioinformatics. Numerous bioinformatics techniques and tools have been developed to tackle almost every aspect of protein structure prediction ranging from structural feature prediction, template identification and query-template alignment to structure sampling, model quality assessment, and model refinement. How to synergistically select, integrate and improve the strengths of the complementary techniques at each prediction stage and build a high-performance system is becoming a critical issue for constructing a successful, competitive protein structure predictor. Results: Over the past several years, we have constructed a standalone protein structure prediction system MULTICOM that combines multiple sources of information and complementary methods at all five stages of the protein structure prediction process including template identification, template combination, model generation, model assessment, and model refinement. The system was blindly tested during the ninth Critical Assessment of Techniques for Protein Structure Prediction (CASP9) in 2010 and yielded very good performance. In addition to studying the overall performance on the CASP9 benchmark, we thoroughly investigated the performance and contributions of each component at each stage of prediction. Conclusions: Our comprehensive and comparative study not only provides useful and practical insights about how to select, improve, and integrate complementary methods to build a cutting-edge protein structure prediction system but also identifies a few new sources of information that may help improve the design of a protein structure prediction system. Several components used in the MULTICOM system are available at: http://sysbio.rnet.missouri.edu/multicom_toolbox/. PMID:23442819

  10. An improved cyan fluorescent protein variant useful for FRET.

    PubMed

    Rizzo, Mark A; Springer, Gerald H; Granada, Butch; Piston, David W

    2004-04-01

    Many genetically encoded biosensors use Förster resonance energy transfer (FRET) between fluorescent proteins to report biochemical phenomena in living cells. Most commonly, the enhanced cyan fluorescent protein (ECFP) is used as the donor fluorophore, coupled with one of several yellow fluorescent protein (YFP) variants as the acceptor. ECFP is used despite several spectroscopic disadvantages, namely a low quantum yield, a low extinction coefficient and a fluorescence lifetime that is best fit by a double exponential. To improve the characteristics of ECFP for FRET measurements, we used a site-directed mutagenesis approach to overcome these disadvantages. The resulting variant, which we named Cerulean (ECFP/S72A/Y145A/H148D), has a greatly improved quantum yield, a higher extinction coefficient and a fluorescence lifetime that is best fit by a single exponential. Cerulean is 2.5-fold brighter than ECFP and replacement of ECFP with Cerulean substantially improves the signal-to-noise ratio of a FRET-based sensor for glucokinase activation.

  11. APRICOT: an integrated computational pipeline for the sequence-based identification and characterization of RNA-binding proteins.

    PubMed

    Sharan, Malvika; Förstner, Konrad U; Eulalio, Ana; Vogel, Jörg

    2017-06-20

    RNA-binding proteins (RBPs) have been established as core components of several post-transcriptional gene regulation mechanisms. Experimental techniques such as cross-linking and co-immunoprecipitation have enabled the identification of RBPs, RNA-binding domains (RBDs) and their regulatory roles in the eukaryotic species such as human and yeast in large-scale. In contrast, our knowledge of the number and potential diversity of RBPs in bacteria is poorer due to the technical challenges associated with the existing global screening approaches. We introduce APRICOT, a computational pipeline for the sequence-based identification and characterization of proteins using RBDs known from experimental studies. The pipeline identifies functional motifs in protein sequences using position-specific scoring matrices and Hidden Markov Models of the functional domains and statistically scores them based on a series of sequence-based features. Subsequently, APRICOT identifies putative RBPs and characterizes them by several biological properties. Here we demonstrate the application and adaptability of the pipeline on large-scale protein sets, including the bacterial proteome of Escherichia coli. APRICOT showed better performance on various datasets compared to other existing tools for the sequence-based prediction of RBPs by achieving an average sensitivity and specificity of 0.90 and 0.91 respectively. The command-line tool and its documentation are available at https://pypi.python.org/pypi/bio-apricot. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. Identification of immunogenic proteins and evaluation of four recombinant proteins as potential vaccine antigens from Vibrio anguillarum in flounder (Paralichthys olivaceus).

    PubMed

    Xing, Jing; Xu, Hongsen; Wang, Yang; Tang, Xiaoqian; Sheng, Xiuzhen; Zhan, Wenbin

    2017-05-31

    Vibrio anguillarum is a severe bacterial pathogen that can infect a wide range of fish species. Identification of immunogenic proteins and development of vaccine are essential for disease prevention. In this study, immunogenic proteins were screened and identified from V. anguillarum, and then protective efficacy of the immunogenic proteins was evaluated. Immunogenic proteins in V. anguillarum whole cell were detected by Western blotting (WB) using immunized flounder (Paralichthys olivaceus) serum, and then identified by Mass spectrometry (MS). The recombinant proteins of four identified immunogenic proteins were produced and immunized to fish, and then percentages of surface membrane immunoglobulin-positive (sIg+) cells in peripheral blood lymphocytes (PBL), total antibodies, antibodies against V. anguillarum, antibodies against recombinant proteins and relative percent survival (RPS) were measured, respectively. The results showed that five immunogenic proteins, VAA, Groel, OmpU, PteF and SpK, were identified; their recombinant proteins, rOmpU, rGroel, rSpK and rVAA, could induce the proliferation of sIg+ cells in PBL and production of total antibodies, antibodies against V. anguillarum and antibodies against the recombinant proteins; their protection against V. anguillarum showed 64.86%, 72.97%, 21.62% and 78.38% RPS, respectively. The results revealed that the immunoproteomic technique using fish anti-V. anguillarum serum provided an efficient way to screen the immunogenic protein for vaccine antigen. Moreover, the rVAA, rGroel and rOmpU had potential to be vaccine candidates against V. anguillarum infection. Copyright © 2017 Elsevier Ltd. All rights reserved.

  13. Computational Identification and Comparative Analysis of Secreted and Transmembrane Proteins in Six Burkholderia Species.

    PubMed

    Nguyen, Thao Thi; Lee, Hyun-Hee; Park, Jungwook; Park, Inmyoung; Seo, Young-Su

    2017-04-01

    As a step towards discovering novel pathogenesis-related proteins, we performed a genome scale computational identification and characterization of secreted and transmembrane (TM) proteins, which are mainly responsible for bacteria-host interactions and interactions with other bacteria, in the genomes of six representative Burkholderia species. The species comprised plant pathogens (B. glumae BGR1, B. gladioli BSR3), human pathogens (B. pseudomallei K96243, B. cepacia LO6), and plant-growth promoting endophytes (Burkholderia sp. KJ006, B. phytofirmans PsJN). The proportions of putative classically secreted proteins (CSPs) and TM proteins among the species were relatively high, up to approximately 20%. Lower proportions of putative type 3 non-classically secreted proteins (T3NCSPs) (~10%) and unclassified non-classically secreted proteins (NCSPs) (~5%) were observed. The numbers of TM proteins among the three clusters (plant pathogens, human pathogens, and endophytes) were different, while the distribution of these proteins according to the number of TM domains was conserved in which TM proteins possessing 1, 2, 4, or 12 TM domains were the dominant groups in all species. In addition, we observed conservation in the protein size distribution of the secreted protein groups among the species. There were species-specific differences in the functional characteristics of these proteins in the various groups of CSPs, T3NCSPs, and unclassified NCSPs. Furthermore, we assigned the complete sets of the conserved and unique NCSP candidates of the collected Burkholderia species using sequence similarity searching. This study could provide new insights into the relationship among plant-pathogenic, human-pathogenic, and endophytic bacteria.

  14. M13 bacteriophage coat proteins engineered for improved phage display.

    PubMed

    Sidhu, Sachdev S; Feld, Birte K; Weiss, Gregory A

    2007-01-01

    This chapter describes a method for increasing levels of protein fusions displayed on the surfaces of M13 bacteriophage particles. By introducing mutations into the anchoring M13 coat protein, protein display levels can be increased by up to two orders of magnitude. Experimental methods are presented for the design, construction, and screening of phage-displayed libraries for improved protein display.

  15. Fast digestive, leucine-rich, soluble milk proteins improve muscle protein anabolism, and mitochondrial function in undernourished old rats.

    PubMed

    Salles, Jérôme; Chanet, Audrey; Berry, Alexandre; Giraudet, Christophe; Patrac, Véronique; Domingues-Faria, Carla; Rocher, Christophe; Guillet, Christelle; Denis, Philippe; Pouyet, Corinne; Bonhomme, Cécile; Le Ruyet, Pascale; Rolland, Yves; Boirie, Yves; Walrand, Stéphane

    2017-11-01

    One strategy to manage malnutrition in older patients is to increase protein and energy intake. Here, we evaluate the influence of protein quality during refeeding on improvement in muscle protein and energy metabolism. Twenty-month-old male rats (n = 40) were fed 50% of their spontaneous intake for 12 weeks to induce malnutrition, then refed ad libitum with a standard diet enriched with casein or soluble milk proteins (22%) for 4 weeks. A 13C-valine was infused to measure muscle protein synthesis and expression of MuRF1, and MAFbx was measured to evaluate muscle proteolysis. mTOR pathway activation and mitochondrial function were assessed in muscle. Malnutrition was associated with a decrease in body weight, fat mass, and lean mass, particularly muscle mass. Malnutrition decreased muscle mTOR pathway activation and protein FSR associated with increased MuRF1 mRNA levels, and decreased mitochondrial function. The refeeding period partially restored fat mass and lean mass. Unlike the casein diet, the soluble milk protein diet improved muscle protein metabolism and mitochondrial function in old malnourished rats. These results suggest that providing better-quality proteins during refeeding may improve efficacy of renutrition in malnourished older patients. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  16. Improved Functional Characteristics of Whey Protein Hydrolysates in Food Industry

    PubMed Central

    Jeewanthi, Renda Kankanamge Chaturika; Lee, Na-Kyoung; Paik, Hyun-Dong

    2015-01-01

    This review focuses on the enhanced functional characteristics of enzymatic hydrolysates of whey proteins (WPHs) in food applications compared to intact whey proteins (WPs). WPs are applied in foods as whey protein concentrates (WPCs), whey protein isolates (WPIs), and WPHs. WPs are byproducts of cheese production, used in a wide range of food applications due to their nutritional validity, functional activities, and cost effectiveness. Enzymatic hydrolysis yields improved functional and nutritional benefits in contrast to heat denaturation or native applications. WPHs improve solubility over a wide range of pH, create viscosity through water binding, and promote cohesion, adhesion, and elasticity. WPHs form stronger but more flexible edible films than WPC or WPI. WPHs enhance emulsification, bind fat, and facilitate whipping, compared to intact WPs. Extensively hydrolyzed WPHs with proper heat applications are the best emulsifiers, and the addition of polysaccharides improves the emulsification ability of WPHs. Also, WPHs improve sensorial properties such as color, flavor, and texture but impart a bitter taste in cases of extensive hydrolysis (degree of hydrolysis greater than 8%). It is important to consider the type of enzyme, hydrolysis conditions, and WPH production method based on the nature of the food application. PMID:26761849

  17. Improved Functional Characteristics of Whey Protein Hydrolysates in Food Industry.

    PubMed

    Jeewanthi, Renda Kankanamge Chaturika; Lee, Na-Kyoung; Paik, Hyun-Dong

    2015-01-01

    This review focuses on the enhanced functional characteristics of enzymatic hydrolysates of whey proteins (WPHs) in food applications compared to intact whey proteins (WPs). WPs are applied in foods as whey protein concentrates (WPCs), whey protein isolates (WPIs), and WPHs. WPs are byproducts of cheese production, used in a wide range of food applications due to their nutritional validity, functional activities, and cost effectiveness. Enzymatic hydrolysis yields improved functional and nutritional benefits in contrast to heat denaturation or native applications. WPHs improve solubility over a wide range of pH, create viscosity through water binding, and promote cohesion, adhesion, and elasticity. WPHs form stronger but more flexible edible films than WPC or WPI. WPHs enhance emulsification, bind fat, and facilitate whipping, compared to intact WPs. Extensively hydrolyzed WPHs with proper heat applications are the best emulsifiers, and the addition of polysaccharides improves the emulsification ability of WPHs. Also, WPHs improve sensorial properties such as color, flavor, and texture but impart a bitter taste in cases of extensive hydrolysis (degree of hydrolysis greater than 8%). It is important to consider the type of enzyme, hydrolysis conditions, and WPH production method based on the nature of the food application.

  18. Identification of a novel heteroglycan-interacting protein, HIP 1.3, from Arabidopsis thaliana.

    PubMed

    Fettke, Joerg; Nunes-Nesi, Adriano; Fernie, Alisdair R; Steup, Martin

    2011-08-15

    Plastidial degradation of transitory starch yields mainly maltose and glucose. Following the export into the cytosol, maltose acts as donor for a glucosyl transfer to cytosolic heteroglycans as mediated by a cytosolic transglucosidase (DPE2; EC 2.4.1.25) and the second glucosyl residue is liberated as glucose. The cytosolic phosphorylase (Pho2/PHS2; EC 2.4.1.1) also interacts with heteroglycans using the same intramolecular sites as DPE2. Thus, the two glucosyl transferases interconnect the cytosolic pools of glucose and glucose 1-phosphate. Due to the complex monosaccharide pattern, other heteroglycan-interacting proteins (HIPs) are expected to exist. Identification of those proteins was approached by using two types of affinity chromatography. Heteroglycans from leaves of Arabidopsis thaliana (Col-0) covalently bound to Sepharose served as ligands that were reacted with a complex mixture of buffer-soluble proteins from Arabidopsis leaves. Binding proteins were eluted by sodium chloride. For identification, SDS-PAGE, tryptic digestion and MALDI-TOF analyses were applied. A strongly interacting polypeptide (approximately 40 kDa; designated as HIP1.3) was observed as product of locus At1g09340. Arabidopsis mutants deficient in HIP1.3 were reduced in growth and contained heteroglycans displaying an altered monosaccharide pattern. Wild type plants express HIP1.3 most strongly in leaves. As revealed by immunofluorescence, HIP1.3 is located in the cytosol of mesophyll cells but mostly associated with the cytosolic surface of the chloroplast envelope membranes. In an HIP1.3-deficient mutant the immunosignal was undetectable. Metabolic profiles from leaves of this mutant and of wild type plants were determined by GC-MS. As compared to the wild type control, more than ten metabolites, such as ascorbic acid, fructose, fructose bisphosphate, glucose, glycine, were elevated in darkness but decreased in the light. Although the biochemical function of HIP1.3 has not yet

  19. A peptide affinity column for the identification of integrin alpha IIb-binding proteins.

    PubMed

    Daxecker, Heide; Raab, Markus; Bernard, Elise; Devocelle, Marc; Treumann, Achim; Moran, Niamh

    2008-03-01

    To understand the regulation of integrin alpha(IIb)beta(3), a critical platelet adhesion molecule, we have developed a peptide affinity chromatography method using the known integrin regulatory motif, LAMWKVGFFKR. Using standard Fmoc chemistry, this peptide was synthesized onto a Toyopearl AF-Amino-650 M resin on a 6-aminohexanoic acid (Ahx) linker. Peptide density was controlled by acetylation of 83% of the Ahx amino groups. Four recombinant human proteins (CIB1, PP1, ICln and RN181), previously identified as binding to this integrin regulatory motif, were specifically retained by the column containing the integrin peptide but not by a column presenting an irrelevant peptide. Hemoglobin, creatine kinase, bovine serum albumin, fibrinogen and alpha-tubulin failed to bind under the chosen conditions. Immunodetection methods confirmed the binding of endogenous platelet proteins, including CIB1, PP1, ICln, RN181, AUP-1 and beta3-integrin, from a detergent-free platelet lysate. Thus, we describe a reproducible method that facilitates the reliable extraction of specific integrin-binding proteins from complex biological matrices. This methodology may enable the sensitive and specific identification of proteins that interact with linear, membrane-proximal peptide motifs such as the integrin regulatory motif LAMWKVGFFKR.

  20. HIPPI: highly accurate protein family classification with ensembles of HMMs.

    PubMed

    Nguyen, Nam-Phuong; Nute, Michael; Mirarab, Siavash; Warnow, Tandy

    2016-11-11

    Given a new biological sequence, detecting membership in a known family is a basic step in many bioinformatics analyses, with applications to protein structure and function prediction and metagenomic taxon identification and abundance profiling, among others. Yet family identification of sequences that are distantly related to sequences in public databases or that are fragmentary remains one of the more difficult analytical problems in bioinformatics. We present a new technique for family identification called HIPPI (Hierarchical Profile Hidden Markov Models for Protein family Identification). HIPPI uses a novel technique to represent a multiple sequence alignment for a given protein family or superfamily by an ensemble of profile hidden Markov models computed using HMMER. An evaluation of HIPPI on the Pfam database shows that HIPPI has better overall precision and recall than blastp, HMMER, and pipelines based on HHsearch, and maintains good accuracy even for fragmentary query sequences and for protein families with low average pairwise sequence identity, both conditions where other methods degrade in accuracy. HIPPI provides accurate protein family identification and is robust to difficult model conditions. Our results, combined with observations from previous studies, show that ensembles of profile Hidden Markov models can better represent multiple sequence alignments than a single profile Hidden Markov model, and thus can improve downstream analyses for various bioinformatic tasks. Further research is needed to determine the best practices for building the ensemble of profile Hidden Markov models. HIPPI is available on GitHub at https://github.com/smirarab/sepp.

  1. Methodology for identification of pore forming antimicrobial peptides from soy protein subunits β-conglycinin and glycinin.

    PubMed

    Xiang, Ning; Lyu, Yuan; Zhu, Xiao; Bhunia, Arun K; Narsimhan, Ganesan

    2016-11-01

    Antimicrobial peptides (AMPs) inactivate microbial cells through pore formation in cell membrane. Because of their different mode of action compared to antibiotics, AMPs can be effectively used to combat drug resistant bacteria in human health. AMPs can also be used to replace antibiotics in animal feed and immobilized on food packaging films. In this research, we developed a methodology based on mechanistic evaluation of peptide-lipid bilayer interaction to identify AMPs from soy protein. Production of AMPs from soy protein is an attractive, cost-saving alternative for commercial consideration, because soy protein is an abundant and common protein resource. This methodology is also applicable for identification of AMPs from any protein. Initial screening of peptide segments from soy glycinin (11S) and soy β-conglycinin (7S) subunits was based on their hydrophobicity, hydrophobic moment and net charge. Delicate balance between hydrophilic and hydrophobic interactions is necessary for pore formation. High hydrophobicity decreases the peptide solubility in aqueous phase whereas high hydrophilicity limits binding of the peptide to the bilayer. Out of several candidates chosen from the initial screening, two peptides satisfied the criteria for antimicrobial activity, viz. (i) lipid-peptide binding in surface state and (ii) pore formation in transmembrane state of the aggregate. This method of identification of antimicrobial activity via molecular dynamics simulation was shown to be robust in that it is insensitive to the number of peptides employed in the simulation, initial peptide structure and force field. Their antimicrobial activity against Listeria monocytogenes and Escherichia coli was further confirmed by spot-on-lawn test. Copyright © 2016 Elsevier Inc. All rights reserved.
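
    The initial screening step described above (hydrophobicity, hydrophobic moment and net charge) can be illustrated with a minimal sketch. The abstract does not say which hydropathy scale, helix angle or charge model the authors used, so the Kyte-Doolittle scale, the 100 degree per residue angle of an ideal alpha helix and the simplified side-chain charges below are assumptions, and the function names are hypothetical.

```python
import math

# Kyte-Doolittle hydropathy scale (assumed here; the abstract names no scale).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Simplified side-chain charges near neutral pH (His and termini ignored).
CHARGE = {"K": 1, "R": 1, "D": -1, "E": -1}


def mean_hydrophobicity(seq):
    """Average hydropathy per residue."""
    return sum(KD[aa] for aa in seq) / len(seq)


def hydrophobic_moment(seq, angle_deg=100.0):
    """Per-residue hydrophobic moment for a helix with the given turn angle."""
    delta = math.radians(angle_deg)
    sin_sum = sum(KD[aa] * math.sin(i * delta) for i, aa in enumerate(seq))
    cos_sum = sum(KD[aa] * math.cos(i * delta) for i, aa in enumerate(seq))
    return math.hypot(sin_sum, cos_sum) / len(seq)


def net_charge(seq):
    """Sum of the simplified side-chain charges."""
    return sum(CHARGE.get(aa, 0) for aa in seq)


def screen(seq):
    """Collect the three screening descriptors for one peptide segment."""
    return {
        "hydrophobicity": mean_hydrophobicity(seq),
        "moment": hydrophobic_moment(seq),
        "charge": net_charge(seq),
    }
```

    Candidate segments would then be ranked or filtered on these three values; the thresholds are not given in the abstract and are left out here.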

  2. Identification of Naegleria fowleri proteins linked to primary amoebic meningoencephalitis.

    PubMed

    Jamerson, Melissa; Schmoyer, Jacqueline A; Park, Jay; Marciano-Cabral, Francine; Cabral, Guy A

    2017-03-01

    Naegleria fowleri (N. fowleri) causes primary amoebic meningoencephalitis, a rapidly fatal disease of the central nervous system. N. fowleri can exist in cyst, flagellate or amoebic forms, depending on environmental conditions. The amoebic form can invade the brain following introduction into the nasal passages. When applied intranasally to a mouse model, cultured N. fowleri amoebae exhibit low virulence. However, upon serial passage in mouse brain, the amoebae acquire a highly virulent state. In the present study, a proteomics approach was applied to the identification of N. fowleri amoeba proteins whose expression was associated with the highly virulent state in mice. Mice were inoculated intranasally with axenically cultured amoebae or with mouse-passaged amoebae. Examination by light and electron microscopy revealed no morphological differences. However, mouse-passaged amoebae were more virulent in mice as indicated by exhibiting a two log10 titre decrease in median infective dose 50 (ID50). Scatter plot analysis of amoebic lysates revealed a subset of proteins, the expression of which was associated with highly virulent amoebae. MS-MS indicated that this subset contained proteins that shared homology with those linked to cytoskeletal rearrangement and the invasion process. Invasion assays were performed in the presence of a select inhibitor to expand on the findings. The collective results suggest that N. fowleri gene products linked to cytoskeletal rearrangement and invasion may be candidate targets in the management of primary amoebic meningoencephalitis.

  3. Disruption of prion protein-HOP engagement impairs glioblastoma growth and cognitive decline and improves overall survival.

    PubMed

    Lopes, M H; Santos, T G; Rodrigues, B R; Queiroz-Hazarbassanov, N; Cunha, I W; Wasilewska-Sampaio, A P; Costa-Silva, B; Marchi, F A; Bleggi-Torres, L F; Sanematsu, P I; Suzuki, S H; Oba-Shinjo, S M; Marie, S K N; Toulmin, E; Hill, A F; Martins, V R

    2015-06-01

    Glioblastomas (GBMs) are resistant to current therapy protocols and identification of molecules that target these tumors is crucial. Interaction of secreted heat-shock protein 70 (Hsp70)-Hsp90-organizing protein (HOP) with cellular prion protein (PrP(C)) triggers a large number of trophic effects in the nervous system. We found that both PrP(C) and HOP are highly expressed in human GBM samples relative to non-tumoral tissue or astrocytoma grades I-III. High levels of PrP(C) and HOP were associated with greater GBM proliferation and lower patient survival. HOP-PrP(C) binding increased GBM proliferation in vitro via phosphatidylinositide 3-kinase and extracellular-signal-regulated kinase pathways, and a HOP peptide mimicking the PrP(C) binding site (HOP230-245) abrogates this effect. PrP(C) knockdown impaired tumor growth and increased survival of mice with tumors. In mice, intratumor delivery of HOP230-245 peptide impaired proliferation and promoted apoptosis of GBM cells. In addition, treatment with HOP230-245 peptide inhibited tumor growth, maintained cognitive performance and improved survival. Thus, together, the present results indicate that interfering with PrP(C)-HOP engagement is a promising approach for GBM therapy.

  4. Identification of the Kelch Family Protein Nd1-L as a Novel Molecular Interactor of KRIT1

    PubMed Central

    Cutano, Valentina; Martino, Chiara

    2012-01-01

    Loss-of-function mutations of the KRIT1 gene (CCM1) have been associated with the Cerebral Cavernous Malformation (CCM) disease, which is characterized by serious alterations of brain capillary architecture. The KRIT1 protein contains multiple interaction domains and motifs, suggesting that it might act as a scaffold for the assembly of functional protein complexes involved in signaling networks. In previous work, we defined structure-function relationships underlying KRIT1 intramolecular and intermolecular interactions and nucleocytoplasmic shuttling, and found that KRIT1 plays an important role in molecular mechanisms involved in the maintenance of the intracellular Reactive Oxygen Species (ROS) homeostasis to prevent oxidative cellular damage. Here we report the identification of the Kelch family protein Nd1-L as a novel molecular interactor of KRIT1. This interaction was discovered through yeast two-hybrid screening of a mouse embryo cDNA library, and confirmed by pull-down and co-immunoprecipitation assays of recombinant proteins, as well as by co-immunoprecipitation of endogenous proteins in human endothelial cells. Furthermore, using distinct KRIT1 isoforms and mutants, we defined the role of KRIT1 domains in the Nd1-L/KRIT1 interaction. Finally, functional assays showed that Nd1-L may contribute to the regulation of KRIT1 nucleocytoplasmic shuttling and cooperate with KRIT1 in modulating the expression levels of the antioxidant protein SOD2, opening a novel avenue for future mechanistic studies. The identification of Nd1-L as a novel KRIT1 interacting protein provides a novel piece of the molecular puzzle involving KRIT1 and suggests a potential functional cooperation in cellular responses to oxidative stress, thus expanding the framework of molecular complexes and mechanisms that may underlie the pathogenesis of CCM disease. PMID:22970292

  5. On plate graphite supported sample processing for simultaneous lipid and protein identification by matrix assisted laser desorption ionization mass spectrometry.

    PubMed

    Calvano, Cosima Damiana; van der Werf, Inez Dorothé; Sabbatini, Luigia; Palmisano, Francesco

    2015-05-01

    The simultaneous identification of lipids and proteins by matrix assisted laser desorption ionization-mass spectrometry (MALDI-MS) after direct on-plate processing of micro-samples supported on colloidal graphite is demonstrated. Taking advantages of large surface area and thermal conductivity, graphite provided an ideal substrate for on-plate proteolysis and lipid extraction. Indeed proteins could be efficiently digested on-plate within 15 min, providing sequence coverages comparable to those obtained by conventional in-solution overnight digestion. Interestingly, detection of hydrophilic phosphorylated peptides could be easily achieved without any further enrichment step. Furthermore, lipids could be simultaneously extracted/identified without any additional treatment/processing step as demonstrated for model complex samples such as milk and egg. The present approach is simple, efficient, of large applicability and offers great promise for protein and lipid identification in very small samples. Copyright © 2015 Elsevier B.V. All rights reserved.

  6. Maximizing Selective Cleavages at Aspartic Acid and Proline Residues for the Identification of Intact Proteins

    NASA Astrophysics Data System (ADS)

    Foreman, David J.; Dziekonski, Eric T.; McLuckey, Scott A.

    2018-04-01

    A new approach for the identification of intact proteins has been developed that relies on the generation of relatively few abundant products from specific cleavage sites. This strategy is intended to complement standard approaches that seek to generate many fragments relatively non-selectively. Specifically, this strategy seeks to maximize selective cleavage at aspartic acid and proline residues via collisional activation of precursor ions formed via electrospray ionization (ESI) under denaturing conditions. A statistical analysis of the SWISS-PROT database was used to predict the number of arginine residues for a given intact protein mass and predict a m/z range where the protein carries a similar charge to the number of arginine residues thereby enhancing cleavage at aspartic acid residues by limiting proton mobility. Cleavage at aspartic acid residues is predicted to be most favorable in the m/z range of 1500-2500, a range higher than that normally generated by ESI at low pH. Gas-phase proton transfer ion/ion reactions are therefore used for precursor ion concentration from relatively high charge states followed by ion isolation and subsequent generation of precursor ions within the optimal m/z range via a second proton transfer reaction step. It is shown that the majority of product ion abundance is concentrated into cleavages C-terminal to aspartic acid residues and N-terminal to proline residues for ions generated by this process. Implementation of a scoring system that weights both ion fragment type and ion fragment area demonstrated identification of standard proteins, ranging in mass from 8.5 to 29.0 kDa.

  7. Maximizing Selective Cleavages at Aspartic Acid and Proline Residues for the Identification of Intact Proteins.

    PubMed

    Foreman, David J; Dziekonski, Eric T; McLuckey, Scott A

    2018-04-30

    A new approach for the identification of intact proteins has been developed that relies on the generation of relatively few abundant products from specific cleavage sites. This strategy is intended to complement standard approaches that seek to generate many fragments relatively non-selectively. Specifically, this strategy seeks to maximize selective cleavage at aspartic acid and proline residues via collisional activation of precursor ions formed via electrospray ionization (ESI) under denaturing conditions. A statistical analysis of the SWISS-PROT database was used to predict the number of arginine residues for a given intact protein mass and predict a m/z range where the protein carries a similar charge to the number of arginine residues thereby enhancing cleavage at aspartic acid residues by limiting proton mobility. Cleavage at aspartic acid residues is predicted to be most favorable in the m/z range of 1500-2500, a range higher than that normally generated by ESI at low pH. Gas-phase proton transfer ion/ion reactions are therefore used for precursor ion concentration from relatively high charge states followed by ion isolation and subsequent generation of precursor ions within the optimal m/z range via a second proton transfer reaction step. It is shown that the majority of product ion abundance is concentrated into cleavages C-terminal to aspartic acid residues and N-terminal to proline residues for ions generated by this process. Implementation of a scoring system that weights both ion fragment type and ion fragment area demonstrated identification of standard proteins, ranging in mass from 8.5 to 29.0 kDa.
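
    The two ideas in this abstract, selective backbone cleavage C-terminal to Asp and N-terminal to Pro, and a precursor charge matched to the number of arginine residues, can be sketched as follows. This is only an illustration under stated assumptions: the function names are hypothetical, the intact mass is supplied by the caller, and the authors' scoring system is not reproduced.

```python
PROTON_MASS = 1.007276  # mass of a proton in Da


def selective_cleavage_sites(seq):
    """Return sorted bond positions expected to cleave preferentially.

    Position i means the bond between residue i and residue i + 1
    (1-based residue numbering).
    """
    sites = set()
    for i, aa in enumerate(seq, start=1):
        if aa == "D" and i < len(seq):
            sites.add(i)       # bond C-terminal to aspartic acid
        if aa == "P" and i > 1:
            sites.add(i - 1)   # bond N-terminal to proline
    return sorted(sites)


def mz_at_arginine_charge(seq, mass_da):
    """m/z of the precursor whose charge equals the number of Arg residues.

    Returns None when the sequence has no arginine.
    """
    z = seq.count("R")
    if z == 0:
        return None
    return (mass_da + z * PROTON_MASS) / z
```

    Comparing the returned m/z with the 1500-2500 window quoted in the abstract gives a rough check of whether a given charge state is in the range where Asp cleavage is predicted to be favorable.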

  8. Improved feed protein fractionation schemes for formulating rations with the Cornell Net Carbohydrate and Protein System.

    PubMed

    Lanzas, C; Broderick, G A; Fox, D G

    2008-12-01

    Adequate predictions of rumen-degradable protein (RDP) and rumen-undegradable protein (RUP) supplies are necessary to optimize performance while minimizing losses of excess nitrogen (N). The objectives of this study were to evaluate the original Cornell Net Carbohydrate Protein System (CNCPS) protein fractionation scheme and to develop and evaluate alternatives designed to improve its adequacy in predicting RDP and RUP. The CNCPS version 5 fractionates CP into 5 fractions based on solubility in protein precipitant agents, buffers, and detergent solutions: A represents the soluble nonprotein N, B1 is the soluble true protein, B2 represents protein with intermediate rates of degradation, B3 is the CP insoluble in neutral detergent solution but soluble in acid detergent solution, and C is the unavailable N. Model predictions were evaluated with studies that measured N flow data at the omasum. The N fractionation scheme in version 5 of the CNCPS explained 78% of the variation in RDP with a root mean square prediction error (RMSPE) of 275 g/d, and 51% of the RUP variation with RMSPE of 248 g/d. Neutral detergent insoluble CP flows were overpredicted with a mean bias of 128 g/d (40% of the observed mean). The greatest improvements in the accuracy of RDP and RUP predictions were obtained with the following 2 alternative schemes. Alternative 1 used the inhibitory in vitro system to measure the fractional rate of degradation for the insoluble protein fraction in which A = nonprotein N, B1 = true soluble protein, B2 = insoluble protein, C = unavailable protein (RDP: R² = 0.84 and RMSPE = 167 g/d; RUP: R² = 0.61 and RMSPE = 209 g/d), whereas alternative 2 redefined A and B1 fractions as the non-amino-N and amino-N in the soluble fraction respectively (RDP: R² = 0.79 with RMSPE = 195 g/d and RUP: R² = 0.54 with RMSPE = 225 g/d). We concluded that implementing alternative 1 or 2 will improve the accuracy of predicting RDP and RUP within the CNCPS framework.
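
    The evaluation statistics quoted above (R², RMSPE and mean bias) can be computed from paired observed and predicted flows as in the sketch below. It assumes that R² is the squared Pearson correlation between observed and predicted values and that a positive mean bias denotes overprediction; the abstract does not spell out either convention, and the function names are hypothetical.

```python
import math


def rmspe(observed, predicted):
    """Root mean square prediction error, in the units of the data (e.g. g/d)."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)


def mean_bias(observed, predicted):
    """Mean of (predicted - observed); positive values indicate overprediction."""
    n = len(observed)
    return sum(p - o for o, p in zip(observed, predicted)) / n


def r_squared(observed, predicted):
    """Squared Pearson correlation between observed and predicted values."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov * cov / (var_o * var_p)
```

    Note that a model can have a high R² and still carry a large mean bias, which is why the abstract reports both kinds of statistic.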

  9. An improved method for identification of small non-coding RNAs in bacteria using support vector machine

    NASA Astrophysics Data System (ADS)

    Barman, Ranjan Kumar; Mukhopadhyay, Anirban; Das, Santasabuj

    2017-04-01

    Bacterial small non-coding RNAs (sRNAs) are not translated into proteins, but act as functional RNAs. They are involved in diverse biological processes like virulence, stress response and quorum sensing. Several high-throughput techniques have enabled identification of sRNAs in bacteria, but experimental detection remains challenging and is grossly incomplete for most species. Thus, there is a need to develop computational tools to predict bacterial sRNAs. Here, we propose a computational method to identify sRNAs in bacteria using a support vector machine (SVM) classifier. The primary sequence and secondary structure features of experimentally-validated sRNAs of Salmonella Typhimurium LT2 (SLT2) were used to build the optimal SVM model. We found that a tri-nucleotide composition feature of sRNAs achieved an accuracy of 88.35% for SLT2. We also validated the SVM model on the experimentally-detected sRNAs of E. coli and Salmonella Typhi. The proposed model attained accuracies of 81.25% and 88.82% for E. coli K-12 and S. Typhi Ty2, respectively. We confirmed that this method significantly improved the identification of sRNAs in bacteria. Furthermore, we used a sliding window-based method and identified sRNAs from complete genomes of SLT2, S. Typhi Ty2 and E. coli K-12 with sensitivities of 89.09%, 83.33% and 67.39%, respectively.
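
    A tri-nucleotide composition feature of the kind mentioned above can be sketched as a 64-dimensional frequency vector. The exact feature definition used by the authors is not given in the abstract, so the overlapping-window counting, the RNA alphabet and the handling of ambiguous bases below are assumptions.

```python
from itertools import product

# All 64 trinucleotides over the RNA alphabet, in a fixed order.
TRINUCLEOTIDES = ["".join(p) for p in product("ACGU", repeat=3)]


def trinucleotide_composition(seq):
    """Return the relative frequency of each trinucleotide in seq.

    Overlapping windows are counted; DNA input is converted to RNA, and
    windows containing characters outside A, C, G, U are skipped.
    """
    seq = seq.upper().replace("T", "U")
    counts = dict.fromkeys(TRINUCLEOTIDES, 0)
    total = 0
    for i in range(len(seq) - 2):
        window = seq[i:i + 3]
        if window in counts:
            counts[window] += 1
            total += 1
    if total == 0:
        return [0.0] * len(TRINUCLEOTIDES)
    return [counts[k] / total for k in TRINUCLEOTIDES]
```

    Vectors like this, computed for known sRNAs and for negative examples, could then be passed to any SVM implementation; the same function can be applied to each window in a sliding-window scan of a genome.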

  10. Evaluation of mass spectrometric data using principal component analysis for determination of the effects of organic lakes on protein binder identification.

    PubMed

    Hrdlickova Kuckova, Stepanka; Rambouskova, Gabriela; Hynek, Radovan; Cejnar, Pavel; Oltrogge, Doris; Fuchs, Robert

    2015-11-01

    Matrix-assisted laser desorption/ionisation-time of flight (MALDI-TOF) mass spectrometry is commonly used for the identification of proteinaceous binders and their mixtures in artworks. The determination of protein binders is based on a comparison between the m/z values of tryptic peptides in the unknown sample and a reference one (egg, casein, animal glues etc.), but this method has greater potential to study changes due to ageing and the influence of organic/inorganic components on protein identification. However, it is necessary to then carry out statistical evaluation on the obtained data. Before now, it has been complicated to routinely convert the mass spectrometric data into a statistical programme, to extract and match the appropriate peaks. Only several 'homemade' computer programmes without user-friendly interfaces are available for these purposes. In this paper, we would like to present our completely new, publicly available, non-commercial software, ms-alone and multiMS-toolbox, for principal component analyses of MALDI-TOF MS data for R software, and their application to the study of the influence of heterogeneous matrices (organic lakes) for protein identification. Using this new software, we determined the main factors that influence the protein analyses of artificially aged model mixtures of organic lakes and fish glue, prepared according to historical recipes that were used for book illumination, using MALDI-TOF peptide mass mapping. Copyright © 2015 John Wiley & Sons, Ltd.

  11. Optimization of mass spectrometric parameters improve the identification performance of capillary zone electrophoresis for single-shot bottom-up proteomics analysis.

    PubMed

    Zhang, Zhenbin; Dovichi, Norman J

    2018-02-25

    The effects of MS1 injection time, MS2 injection time, dynamic exclusion time, intensity threshold, and isolation width were investigated on the numbers of peptide and protein identifications for single-shot bottom-up proteomics analysis using CZE-MS/MS analysis of a Xenopus laevis tryptic digest. An electrokinetically pumped nanospray interface was used to couple a linear-polyacrylamide coated capillary to a Q Exactive HF mass spectrometer. A sensitive method that used a 1.4 Th isolation width, 60,000 MS2 resolution, 110 ms MS2 injection time, and a top 7 fragmentation produced the largest number of identifications when the CZE loading amount was less than 100 ng. A programmable autogain control method (pAGC) that used a 1.4 Th isolation width, 15,000 MS2 resolution, 110 ms MS2 injection time, and top 10 fragmentation produced the largest number of identifications for CZE loading amounts greater than 100 ng; 7218 unique peptides and 1653 protein groups were identified from 200 ng by using the pAGC method. The effect of mass spectrometer conditions on the performance of UPLC-MS/MS was also investigated. A fast method that used a 1.4 Th isolation width, 30,000 MS2 resolution, 45 ms MS2 injection time, and top 12 fragmentation produced the largest number of identifications for 200 ng UPLC loading amount (6025 unique peptides and 1501 protein groups). This is the first report where the identification number for CZE surpasses that of the UPLC at the 200 ng loading level. However, more peptides (11476) and protein groups (2378) were identified by using UPLC-MS/MS when the sample loading amount was increased to 2 μg with the fast method. To exploit the fast scan speed of the Q-Exactive HF mass spectrometer, higher sample loading amounts are required for single-shot bottom-up proteomics analysis using CZE-MS/MS. Copyright © 2017 Elsevier B.V. All rights reserved.

  12. Improved hybrid optimization algorithm for 3D protein structure prediction.

    PubMed

    Zhou, Changjun; Hou, Caixia; Wei, Xiaopeng; Zhang, Qiang

    2014-07-01

    A new improved hybrid optimization algorithm, the PGATS algorithm, which is based on the toy off-lattice model, is presented for three-dimensional protein structure prediction problems. The algorithm combines the particle swarm optimization (PSO), genetic algorithm (GA), and tabu search (TS) algorithms. In addition, we adopt several improvement strategies. A stochastic disturbance factor is added to the particle swarm optimization to improve its search ability; the crossover and mutation operations of the genetic algorithm are changed to a kind of random linear method; finally, the tabu search algorithm is improved by appending a mutation operator. Through the combination of a variety of strategies and algorithms, protein structure prediction (PSP) in a 3D off-lattice model is achieved. The PSP problem is NP-hard, but it can be cast as a global optimization problem with many extrema and many parameters. This is the theoretical principle of the hybrid optimization algorithm proposed in this paper. The algorithm combines local search and global search, which overcomes the shortcomings of a single algorithm and gives full play to the advantages of each algorithm. The algorithm is tested on the commonly used standard sequences: Fibonacci sequences and real protein sequences. Experiments show that the proposed method outperforms single algorithms in the accuracy of the calculated protein sequence energy value, which shows it to be an effective way to predict the structure of proteins.

  13. Evaluation of techniques for increasing recall in a dictionary approach to gene and protein name identification.

    PubMed

    Schuemie, Martijn J; Mons, Barend; Weeber, Marc; Kors, Jan A

    2007-06-01

    Gene and protein name identification in text requires a dictionary approach to relate synonyms to the same gene or protein, and to link names to external databases. However, existing dictionaries are incomplete. We investigate two complementary methods for automatic generation of a comprehensive dictionary: combination of information from existing gene and protein databases and rule-based generation of spelling variations. Both methods have been reported in literature before, but have hitherto not been combined and evaluated systematically. We combined gene and protein names from several existing databases of four different organisms. The combined dictionaries showed a substantial increase in recall on three different test sets, as compared to any single database. Application of 23 spelling variation rules to the combined dictionaries further increased recall. However, many rules appeared to have no effect and some appear to have a detrimental effect on precision.
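
    The rule-based generation of spelling variations described in this abstract can be illustrated with a minimal sketch. The three rules below (hyphen/space interchange, separator removal, lower-casing) are illustrative assumptions, not the 23 rules evaluated by the authors, and the function name `spelling_variants` is hypothetical.

```python
import re


def spelling_variants(name):
    """Return a set of simple spelling variants of a gene/protein name.

    Illustrative rules only (assumed, not taken from the study):
      1. interchange hyphens and spaces
      2. drop hyphens and whitespace entirely
      3. add a lower-case form of every variant
    """
    variants = {name}
    variants.add(name.replace("-", " "))      # rule 1: hyphen -> space
    variants.add(name.replace(" ", "-"))      # rule 1: space -> hyphen
    variants.add(re.sub(r"[-\s]", "", name))  # rule 2: no separators
    variants |= {v.lower() for v in variants}  # rule 3: case folding
    return variants


if __name__ == "__main__":
    for variant in sorted(spelling_variants("IL-2")):
        print(variant)
```

    Each added variant can raise recall by matching more surface forms, but, as the abstract notes, some rules may hurt precision, so in practice each rule would need to be evaluated separately.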

  14. Improvement of proteolytic efficiency towards low-level proteins by an antifouling surface of alumina gel in a microchannel.

    PubMed

    Liu, Yun; Wang, Huixiang; Liu, Qingping; Qu, Haiyun; Liu, Baohong; Yang, Pengyuan

    2010-11-07

    A microfluidic reactor has been developed for rapid enhancement of protein digestion by constructing an alumina network within a poly(ethylene terephthalate) (PET) microchannel. Trypsin is stably immobilized in a sol-gel network on the PET channel surface after pretreatment, which produces a protein-resistant interface to reduce memory effects, as characterized by X-ray fluorescence spectrometry and electroosmotic flow. The gel-derived network within a microchannel provides a large surface-to-volume ratio stationary phase for highly efficient proteolysis of proteins existing both at a low level and in complex extracts. The maximum reaction rate of the encapsulated trypsin reactor, measured by kinetic analysis, is much faster than in bulk solution. Due to the microscopic confinement effect, high levels of enzyme entrapment and the biocompatible microenvironment provided by the alumina gel network, the low-level proteins can be efficiently digested using such a microreactor within a very short residence time of a few seconds. The on-chip microreactor is further applied to the identification of a mixture of proteins extracted from normal mouse liver cytoplasm sample via integration with 2D-LC-ESI-MS/MS to show its potential application for large-scale protein identification.

  15. Spectroscopic Analysis of Red Fluorescent Proteins and Development of a Microfluidic Cell Sorter for the Generation of Improved Variants

    NASA Astrophysics Data System (ADS)

    Lubbeck, Jennifer L.

    The discovery of the green fluorescent protein (GFP) launched the development of a wide variety of fluorescent protein (FP) mutants whose spectral and photophysical diversity revolutionized in vivo imaging. The excitation and emission spectra of red fluorescent proteins (RFPs), in particular, have been ideally tuned to a window optically favorable for in vivo work. However, their quantum yields, photostabilities and fluorescence intermittency properties require improvement if they are to be broadly employed for low-copy or single-molecule measurements. Attempts to engineer improved RFPs often result in optimization of one photophysical property at the expense of others. We developed a microfluidic-based cytometer for screening HeLa cell-based genetic RFP-libraries simultaneously on the basis of fluorescence lifetime (a proxy for quantum yield), photostability, and brightness. Ten 532 nm excitation beams interrogate each cell in flow. The first is electro-optically modulated (30 MHz) to enable lifetime measurement with phase fluorimetry. The remaining beams act as a pulse sequence for isolating the irreversible photobleaching time constant. Optical-force switching is employed to sort cells based on any combination of the photophysical parameters. Screening with this instrument enables identification of regions of the structure that synergistically affect quantum yield and photostability and the sorting capability provides a new tool for accelerating the development of next generation RFPs.

  16. Using Variable-Length Aligned Fragment Pairs and an Improved Transition Function for Flexible Protein Structure Alignment.

    PubMed

    Cao, Hu; Lu, Yonggang

    2017-01-01

    With the rapid growth of known protein 3D structures in number, how to efficiently compare protein structures becomes an essential and challenging problem in computational structural biology. At present, many protein structure alignment methods have been developed. Among all these methods, flexible structure alignment methods are shown to be superior to rigid structure alignment methods in identifying structure similarities between proteins, which have gone through conformational changes. It is also found that the methods based on aligned fragment pairs (AFPs) have a special advantage over other approaches in balancing global structure similarities and local structure similarities. Accordingly, we propose a new flexible protein structure alignment method based on variable-length AFPs. Compared with other methods, the proposed method possesses three main advantages. First, it is based on variable-length AFPs. The length of each AFP is separately determined to maximally represent a local similar structure fragment, which reduces the number of AFPs. Second, it uses local coordinate systems, which simplify the computation at each step of the expansion of AFPs during the AFP identification. Third, it decreases the number of twists by rewarding the situation where nonconsecutive AFPs share the same transformation in the alignment, which is realized by dynamic programming with an improved transition function. The experimental data show that compared with FlexProt, FATCAT, and FlexSnap, the proposed method can achieve comparable results by introducing fewer twists. Meanwhile, it can generate results similar to those of the FATCAT method in much less running time due to the reduced number of AFPs.

  17. Specific identification of Bacillus anthracis strains

    NASA Astrophysics Data System (ADS)

    Krishnamurthy, Thaiya; Deshpande, Samir; Hewel, Johannes; Liu, Hongbin; Wick, Charles H.; Yates, John R., III

    2007-01-01

    Accurate identification of human pathogens is the vital first step in treating civilian victims of terrorism and military personnel affected in biological threat situations. We have applied a powerful multi-dimensional protein identification technology (MudPIT), along with newly developed software termed Profiler, to identify the sequences of specific proteins observed for a few strains of Bacillus anthracis, a human pathogen. Profiler was created to initially screen the MudPIT data of B. anthracis strains and establish the observed proteins specific to its strains. A database containing marker proteins of B. anthracis and its strains was also generated using Profiler; it could in turn be used for detecting the organism and its corresponding strains in samples. Analysis of the unknowns by our methodology, combining MudPIT and Profiler, led to the accurate identification of the B. anthracis strains present in samples. Thus, a new approach for the identification of B. anthracis strains in unknown samples, based on the molecular mass and sequences of marker proteins, has been established.

  18. Identification of membrane-associated proteins with pathogenic potential expressed by Corynebacterium pseudotuberculosis grown in animal serum.

    PubMed

    Raynal, José Tadeu; Bastos, Bruno Lopes; Vilas-Boas, Priscilla Carolinne Bagano; Sousa, Thiago de Jesus; Costa-Silva, Marcos; de Sá, Maria da Conceição Aquino; Portela, Ricardo Wagner; Moura-Costa, Lília Ferreira; Azevedo, Vasco; Meyer, Roberto

    2018-01-25

    Previous works defining antigens that might be used as vaccine targets against Corynebacterium pseudotuberculosis, which is the causative agent of sheep and goat caseous lymphadenitis, have focused on secreted proteins produced in a chemically defined culture medium. Considering that such antigens might not reflect the repertoire of proteins expressed during infection conditions, this experiment aimed to investigate the membrane-associated proteins with pathogenic potential expressed by C. pseudotuberculosis grown directly in animal serum. Its membrane-associated proteins were extracted using an organic solvent enrichment methodology, followed by LC-MS/MS and bioinformatics analysis for protein identification and classification. The results revealed 22 membrane-associated proteins characterized as potentially pathogenic. An interaction network analysis indicated that the four potentially pathogenic proteins ciuA, fagA, OppA4 and OppCD were biologically connected within two distinct network pathways, which were both associated with the ABC Transporters KEGG pathway. These results suggest that C. pseudotuberculosis pathogenesis might be associated with the transport and uptake of nutrients; seven other identified potentially pathogenic membrane proteins also suggest that pathogenesis might involve events of bacterial resistance and adhesion. The proteins herein reported potentially reflect part of the protein repertoire expressed during real infection conditions and might be tested as vaccine antigens.

  19. Generic framework for mining cellular automata models on protein-folding simulations.

    PubMed

    Diaz, N; Tischer, I

    2016-05-13

    Cellular automata model identification is an important way of building simplified simulation models. In this study, we describe a generic architectural framework to ease the development of new metaheuristic-based algorithms for cellular automata model identification in protein-folding trajectories. Our framework was developed using a methodology based on design patterns, which simplifies the development of new algorithms. The usefulness of the proposed framework is demonstrated by the implementation of four algorithms, able to obtain extremely precise cellular automata models of the protein-folding process with a protein contact map representation. Dynamic rules obtained by the proposed approach are discussed, and future use for the new tool is outlined.

  20. Improved Facial Nerve Identification During Parotidectomy With Fluorescently Labeled Peptide

    PubMed Central

    Hussain, Timon; Nguyen, Linda T.; Whitney, Michael; Hasselmann, Jonathan; Nguyen, Quyen T.

    2016-01-01

    Objectives/Hypothesis: Additional intraoperative guidance could reduce the risk of iatrogenic injury during parotid gland cancer surgery. We evaluated the intraoperative use of fluorescently labeled nerve binding peptide NP41 to aid facial nerve identification and preservation during parotidectomy in an orthotopic model of murine parotid gland cancer. We also quantified the accuracy of intraoperative nerve detection for surface and buried nerves in the head and neck with NP41 versus white light (WL) alone. Study Design: Twenty-eight mice underwent parotid gland cancer surgeries with additional fluorescence (FL) guidance versus WL reflectance (WLR) alone. Eight mice were used for additional nerve-imaging experiments. Methods: Twenty-eight parotid tumor-bearing mice underwent parotidectomy. Eight mice underwent imaging of both sides of the face after skin removal. Postoperative assessment of facial nerve function, measured by automated whisker tracking, was compared between FL guidance (n = 13) and WL alone (n = 15). In eight mice, nerve to surrounding tissue contrast was measured under FL versus WLR for all nerve branches detectable in the field of view. Results: Postoperative facial nerve function after parotid gland cancer surgery tended to be better with additional FL guidance. Fluorescent labeling significantly improved nerve to surrounding tissue contrast for both large and smaller buried nerve branches compared to WLR visualization and improved detection sensitivity and specificity. Conclusions: NP41 FL imaging significantly aids the intraoperative identification of nerve branches otherwise nearly invisible to the naked eye. Its application in a murine model of parotid gland cancer surgery tended to improve functional preservation of the facial nerve. PMID:27171862

  1. Demonstration of Protein-Based Human Identification Using the Hair Shaft Proteome

    PubMed Central

    Leppert, Tami; Anex, Deon S.; Hilmer, Jonathan K.; Matsunami, Nori; Baird, Lisa; Stevens, Jeffery; Parsawar, Krishna; Durbin-Johnson, Blythe P.; Rocke, David M.; Nelson, Chad; Fairbanks, Daniel J.; Wilson, Andrew S.; Rice, Robert H.; Woodward, Scott R.; Bothner, Brian; Hart, Bradley R.; Leppert, Mark

    2016-01-01

    Human identification from biological material is largely dependent on the ability to characterize genetic polymorphisms in DNA. Unfortunately, DNA can degrade in the environment, sometimes below the level at which it can be amplified by PCR. Protein however is chemically more robust than DNA and can persist for longer periods. Protein also contains genetic variation in the form of single amino acid polymorphisms. These can be used to infer the status of non-synonymous single nucleotide polymorphism alleles. To demonstrate this, we used mass spectrometry-based shotgun proteomics to characterize hair shaft proteins in 66 European-American subjects. A total of 596 single nucleotide polymorphism alleles were correctly imputed in 32 loci from 22 genes of subjects’ DNA and directly validated using Sanger sequencing. Estimates of the probability of resulting individual non-synonymous single nucleotide polymorphism allelic profiles in the European population, using the product rule, resulted in a maximum power of discrimination of 1 in 12,500. Imputed non-synonymous single nucleotide polymorphism profiles from European–American subjects were considerably less frequent in the African population (maximum likelihood ratio = 11,000). The converse was true for hair shafts collected from an additional 10 subjects with African ancestry, where some profiles were more frequent in the African population. Genetically variant peptides were also identified in hair shaft datasets from six archaeological skeletal remains (up to 260 years old). This study demonstrates that quantifiable measures of identity discrimination and biogeographic background can be obtained from detecting genetically variant peptides in hair shaft protein, including hair from bioarchaeological contexts. PMID:27603779
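
    The product rule mentioned in this abstract multiplies per-locus genotype frequencies, under the assumption that the loci are statistically independent, to estimate how often a complete profile is expected in a population. A minimal sketch follows; the frequencies are made-up placeholders rather than values from the study, and `profile_probability` is a hypothetical helper name.

```python
from math import prod


def profile_probability(genotype_freqs):
    """Estimate a profile's population frequency with the product rule.

    genotype_freqs: per-locus genotype frequencies, each in (0, 1].
    Assumes the loci are statistically independent.
    """
    freqs = list(genotype_freqs)
    if not freqs:
        raise ValueError("at least one locus is required")
    for f in freqs:
        if not 0.0 < f <= 1.0:
            raise ValueError(f"frequency out of range: {f}")
    return prod(freqs)


if __name__ == "__main__":
    # Placeholder frequencies for four loci (illustrative only).
    p = profile_probability([0.5, 0.25, 0.1, 0.2])
    print(f"profile probability: {p:.4g} (about 1 in {1 / p:.0f})")
```

    The reciprocal of the profile probability gives a "1 in N" figure of the kind quoted in the abstract as the power of discrimination.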

  2. Text Mining Improves Prediction of Protein Functional Sites

    PubMed Central

    Cohn, Judith D.; Ravikumar, Komandur E.

    2012-01-01

    We present an approach that integrates protein structure analysis and text mining for protein functional site prediction, called LEAP-FS (Literature Enhanced Automated Prediction of Functional Sites). The structure analysis was carried out using Dynamics Perturbation Analysis (DPA), which predicts functional sites at control points where interactions greatly perturb protein vibrations. The text mining extracts mentions of residues in the literature, and predicts that residues mentioned are functionally important. We assessed the significance of each of these methods by analyzing their performance in finding known functional sites (specifically, small-molecule binding sites and catalytic sites) in about 100,000 publicly available protein structures. The DPA predictions recapitulated many of the functional site annotations and preferentially recovered binding sites annotated as biologically relevant vs. those annotated as potentially spurious. The text-based predictions were also substantially supported by the functional site annotations: compared to other residues, residues mentioned in text were roughly six times more likely to be found in a functional site. The overlap of predictions with annotations improved when the text-based and structure-based methods agreed. Our analysis also yielded new high-quality predictions of many functional site residues that were not catalogued in the curated data sources we inspected. We conclude that both DPA and text mining independently provide valuable high-throughput protein functional site predictions, and that integrating the two methods using LEAP-FS further improves the quality of these predictions. PMID:22393388
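
    To illustrate what extracting mentions of residues from text can look like, here is a minimal regular-expression sketch. It is an assumption-laden toy that recognises only three-letter amino acid codes followed by a position (for example "Ser195" or "Asp-102"); it is not the text-mining method used in LEAP-FS, and `find_residue_mentions` is a hypothetical name.

```python
import re

# Three-letter codes of the 20 standard amino acids.
_AA3 = ("Ala|Arg|Asn|Asp|Cys|Gln|Glu|Gly|His|Ile|"
        "Leu|Lys|Met|Phe|Pro|Ser|Thr|Trp|Tyr|Val")

# Code, optional hyphen or space, then the residue number.
_RESIDUE = re.compile(rf"\b({_AA3})[- ]?(\d+)\b")


def find_residue_mentions(text):
    """Return (amino_acid, position) pairs mentioned in the text."""
    return [(aa, int(pos)) for aa, pos in _RESIDUE.findall(text)]


if __name__ == "__main__":
    sentence = "The catalytic triad Ser195, His57 and Asp-102 is conserved."
    print(find_residue_mentions(sentence))
```

    A real system would also need to handle one-letter codes, full residue names, and mapping of the mentioned positions onto the correct protein structure.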

  3. Identification of Tyrosine Phosphorylated Proteins by SH2 Domain Affinity Purification and Mass Spectrometry.

    PubMed

    Buhs, Sophia; Gerull, Helwe; Nollau, Peter

    2017-01-01

    Phosphotyrosine signaling plays a major role in the control of many important biological functions such as cell proliferation and apoptosis. Deciphering phosphotyrosine-dependent signaling is therefore of great interest, paving the way for the understanding of physiological and pathological processes of signal transduction. On the basis of the specific binding of SH2 domains to phosphotyrosine residues, we here present an experimental workflow for affinity purification and subsequent identification of tyrosine phosphorylated proteins by mass spectrometry. In combination with SH2 profiling, a broadly applicable platform for the characterization of phosphotyrosine profiles in cell extracts, our pull-down strategy now enables researchers to identify proteins in signaling cascades that are differentially phosphorylated and selectively recognized by distinct SH2 domains.

  4. Identification of a novel protein for memory regulation in the hippocampus.

    PubMed

    Zhang, Xue-Han; Zhang, Hui; Tu, Yanyang; Gao, Xiang; Zhou, Changfu; Jin, Meilei; Zhao, Guoping; Jing, Naihe; Li, Bao-Ming; Yu, Lei

    2005-08-26

    Memory formation, maintenance, and retrieval are a dynamic process, reflecting a combined outcome of new memory formation on one hand, and older memory suppression/clearance on the other. Although much knowledge has been gained regarding new memory formation, less is known about the molecular components and processes that serve the function of memory suppression/clearance. Here, we report the identification of a novel protein, termed hippyragranin (HGN), that is expressed in the rat hippocampus and its expression is reduced by hippocampal denervation. Inhibition of HGN by antisense oligonucleotide in area CA1 results in enhanced performance in Morris water maze, as well as elevated long-term potentiation. These results suggest that HGN is involved in negative memory regulation.

  5. Quantitative identification of proteins that influence miRNA biogenesis by RNA pull-down-SILAC mass spectrometry (RP-SMS).

    PubMed

    Choudhury, Nila Roy; Michlewski, Gracjan

    2018-06-08

    RNA-binding proteins mediate and control gene expression. As some examples, they regulate pre-mRNA synthesis and processing; mRNA localisation, translation and decay; and microRNA (miRNA) biogenesis and function. Here, we present a detailed protocol for RNA pull-down coupled to stable isotope labelling by amino acids in cell culture (SILAC) mass spectrometry (RP-SMS) that enables quantitative, fast and specific detection of RNA-binding proteins that regulate miRNA biogenesis. In general, this method allows for the identification of RNA-protein complexes formed using in vitro or chemically synthesized RNAs and protein extracts derived from cultured cells. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  6. Protein-Protein Interactions in a Crowded Environment: An Analysis via Cross-Docking Simulations and Evolutionary Information

    PubMed Central

    Lopes, Anne; Sacquin-Mora, Sophie; Dimitrova, Viktoriya; Laine, Elodie; Ponty, Yann; Carbone, Alessandra

    2013-01-01

    Large-scale analyses of protein-protein interactions based on coarse-grain molecular docking simulations and binding site predictions resulting from evolutionary sequence analysis, are possible and realizable on hundreds of proteins with variate structures and interfaces. We demonstrated this on the 168 proteins of the Mintseris Benchmark 2.0. On the one hand, we evaluated the quality of the interaction signal and the contribution of docking information compared to evolutionary information showing that the combination of the two improves partner identification. On the other hand, since protein interactions usually occur in crowded environments with several competing partners, we realized a thorough analysis of the interactions of proteins with true partners but also with non-partners to evaluate whether proteins in the environment, competing with the true partner, affect its identification. We found three populations of proteins: strongly competing, never competing, and interacting with different levels of strength. Populations and levels of strength are numerically characterized and provide a signature for the behavior of a protein in the crowded environment. We showed that partner identification, to some extent, does not depend on the competing partners present in the environment, that certain biochemical classes of proteins are intrinsically easier to analyze than others, and that small proteins are not more promiscuous than large ones. Our approach brings to light that the knowledge of the binding site can be used to reduce the high computational cost of docking simulations with no consequence in the quality of the results, demonstrating the possibility to apply coarse-grain docking to datasets made of thousands of proteins. Comparison with all available large-scale analyses aimed to partner predictions is realized. 
We release the complete decoys set issued by coarse-grain docking simulations of both true and false interacting partners, and their evolutionary

  7. Targeted Identification of SUMOylation Sites in Human Proteins Using Affinity Enrichment and Paralog-specific Reporter Ions*

    PubMed Central

    Lamoliatte, Frederic; Bonneil, Eric; Durette, Chantal; Caron-Lizotte, Olivier; Wildemann, Dirk; Zerweck, Johannes; Wenshuk, Holger; Thibault, Pierre

    2013-01-01

    Protein modification by small ubiquitin-like modifier (SUMO) modulates the activities of numerous proteins involved in different cellular functions such as gene transcription, cell cycle, and DNA repair. Comprehensive identification of SUMOylated sites is a prerequisite to determine how SUMOylation regulates protein function. However, mapping SUMOylated Lys residues by mass spectrometry (MS) is challenging because of the dynamic nature of this modification, the existence of three functionally distinct human SUMO paralogs, and the large SUMO chain remnant that remains attached to tryptic peptides. To overcome these problems, we created HEK293 cell lines that stably express functional SUMO paralogs with an N-terminal His6-tag and an Arg residue near the C terminus that leave a short five amino acid SUMO remnant upon tryptic digestion. We determined the fragmentation patterns of our short SUMO remnant peptides by collisional activation and electron transfer dissociation using synthetic peptide libraries. Activation using higher energy collisional dissociation on the LTQ-Orbitrap Elite identified SUMO paralog-specific fragment ions and neutral losses of the SUMO remnant with high mass accuracy (< 5 ppm). We exploited these features to detect SUMO modified tryptic peptides in complex cell extracts by correlating mass measurements of precursor and fragment ions using a data independent acquisition method. We also generated bioinformatics tools to retrieve MS/MS spectra containing characteristic fragment ions to the identification of SUMOylated peptide by conventional Mascot database searches. In HEK293 cell extracts, this MS approach uncovered low abundance SUMOylated peptides and 37 SUMO3-modified Lys residues in target proteins, most of which were previously unknown. Interestingly, we identified mixed SUMO-ubiquitin chains with ubiquitylated SUMO proteins (K20 and K32) and SUMOylated ubiquitin (K63), suggesting a complex crosstalk between these two modifications. 
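
    The mass accuracy figure quoted in this abstract (< 5 ppm) is a relative error between a measured and a theoretical m/z value. A minimal sketch of that arithmetic follows; the function names and the example m/z values are illustrative assumptions, not data from the study.

```python
def ppm_error(measured_mz, theoretical_mz):
    """Signed mass error in parts per million (ppm)."""
    return (measured_mz - theoretical_mz) / theoretical_mz * 1e6


def within_tolerance(measured_mz, theoretical_mz, tol_ppm=5.0):
    """True if the absolute mass error is below tol_ppm."""
    return abs(ppm_error(measured_mz, theoretical_mz)) < tol_ppm


if __name__ == "__main__":
    # Illustrative values: an ion expected at m/z 1000.000.
    for measured in (1000.004, 1000.006):
        err = ppm_error(measured, 1000.0)
        print(f"{measured}: {err:.2f} ppm, "
              f"accepted={within_tolerance(measured, 1000.0)}")
```

    A filter of this kind is one way characteristic fragment ions could be matched against expected masses before a database search.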

  8. Novel, improved sample preparation for rapid, direct identification from positive blood cultures using matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry.

    PubMed

    Schubert, Sören; Weinert, Kirsten; Wagner, Chris; Gunzl, Beatrix; Wieser, Andreas; Maier, Thomas; Kostrzewa, Markus

    2011-11-01

    Matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) is widely used for rapid and reliable identification of bacteria and yeast grown on agar plates. Moreover, MALDI-TOF MS also holds promise for bacterial identification from blood culture (BC) broths in hospital laboratories. The most important technical step for the identification of bacteria from positive BCs by MALDI-TOF MS is sample preparation to remove blood cells and host proteins. We present a method for novel, rapid sample preparation using differential lysis of blood cells. We demonstrate the efficacy and ease of use of this sample preparation and subsequent MALDI-TOF MS identification, applying it to a total of 500 aerobic and anaerobic BCs reported to be positive by a Bactec 9240 system. In 86.5% of all BCs, the microorganism species were correctly identified. Moreover, in 18/27 mixed cultures at least one isolate was correctly identified. A novel method that adjusts the score value for MALDI-TOF MS results is proposed, further improving the proportion of correctly identified samples. The results of the present study show that the MALDI-TOF MS-based method allows rapid (<20 minutes) bacterial identification directly from positive BCs and with high accuracy. Copyright © 2011 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.

  9. Identification, Nomenclature, and Evolutionary Relationships of Mitogen-Activated Protein Kinase (MAPK) Genes in Soybean

    PubMed Central

    Neupane, Achal; Nepal, Madhav P.; Piya, Sarbottam; Subramanian, Senthil; Rohila, Jai S.; Reese, R. Neil; Benson, Benjamin V.

    2013-01-01

    Mitogen-activated protein kinase (MAPK) genes in eukaryotes regulate various developmental and physiological processes including those associated with biotic and abiotic stresses. Although MAPKs in some plant species including Arabidopsis have been identified, they are yet to be identified in soybean. Major objectives of this study were to identify GmMAPKs, assess their evolutionary relationships, and analyze their functional divergence. We identified a total of 38 MAPKs, eleven MAPKKs, and 150 MAPKKKs in soybean. Within the GmMAPK family, we also identified a new clade of six genes: four genes with TEY and two genes with TQY motifs requiring further investigation into possible legume-specific functions. The results indicated the expansion of the GmMAPK families attributable to the ancestral polyploidy events followed by chromosomal rearrangements. The GmMAPK and GmMAPKKK families were substantially larger than those in other plant species. The duplicated GmMAPK members presented complex evolutionary relationships and functional divergence when compared to their counterparts in Arabidopsis. We also highlighted existing nomenclatural issues, stressing the need for nomenclatural consistency. GmMAPK identification is vital to soybean crop improvement, and novel insights into the evolutionary relationships will enhance our understanding about plant genome evolution. PMID:24137047

  10. Mass Defect Labeling of Cysteine for Improving Peptide Assignment in Shotgun Proteomic Analyses

    PubMed Central

    Hernandez, Hilda; Niehauser, Sarah; Boltz, Stacey A.; Gawandi, Vijay; Phillips, Robert S.; Amster, I. Jonathan

    2006-01-01

    A method for improving the identification of peptides in a shotgun proteome analysis using accurate mass measurement has been developed. The improvement is based upon the derivatization of cysteine residues with a novel reagent, 2,4-dibromo-(2′-iodo)acetanilide. The derivatization changes the mass defect of cysteine-containing proteolytic peptides in a manner that increases their identification specificity. Peptide masses were measured using matrix-assisted laser desorption/ionization Fourier transform ion cyclotron mass spectrometry. Reactions with protein standards show that the derivatization of cysteine is rapid and quantitative, and the data suggest that the derivatized peptides are more easily ionized or detected than unlabeled cysteine-containing peptides. The reagent was tested on a 15N-metabolically labeled proteome from M. maripaludis. Proteins were identified by their accurate mass values and from their nitrogen stoichiometry. A total of 47% of the labeled peptides are identified versus 27% for the unlabeled peptides. This procedure permits the identification of proteins from the M. maripaludis proteome that are not usually observed by the standard protocol and shows that better protein coverage is obtained with this methodology. PMID:16689545
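
    The mass defect that this labeling strategy exploits is the difference between a species' monoisotopic mass and its nominal (integer) mass. The sketch below, offered as an illustration rather than the authors' procedure, shows why bromine and iodine shift the defect: their isotopes lie well below the integer mass, whereas hydrogen lies above it. The helper name `mass_defect` and the example compositions are assumptions; the compositions are not the reagent's formula.

```python
# Monoisotopic mass (u) and nominal mass of the most abundant isotope.
ISOTOPES = {
    "H": (1.007825, 1),
    "C": (12.000000, 12),
    "N": (14.003074, 14),
    "O": (15.994915, 16),
    "S": (31.972071, 32),
    "Br": (78.918338, 79),   # 79Br
    "I": (126.904473, 127),  # 127I
}


def mass_defect(composition):
    """Monoisotopic mass minus nominal mass for an elemental composition.

    composition: dict mapping element symbol to atom count.
    """
    exact = sum(ISOTOPES[el][0] * n for el, n in composition.items())
    nominal = sum(ISOTOPES[el][1] * n for el, n in composition.items())
    return exact - nominal


if __name__ == "__main__":
    # Illustrative compositions only.
    print(f"H2O    : {mass_defect({'H': 2, 'O': 1}):+.6f} u")
    print(f"Br2 + I: {mass_defect({'Br': 2, 'I': 1}):+.6f} u")
```

    Because typical peptides are rich in hydrogen, their mass defects are positive, so a tag carrying two bromines and an iodine moves labeled peptides into a mass-defect range that unlabeled peptides rarely occupy.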

  11. Separation and identification of Musa acuminate Colla (banana) leaf proteins by two-dimensional gel electrophoresis and mass spectrometry.

    PubMed

    Lu, Y; Qi, Y X; Zhang, H; Zhang, H Q; Pu, J J; Xie, Y X

    2013-12-19

    To establish a proteomic reference map of Musa acuminate Colla (banana) leaf, we separated and identified leaf proteins using two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) and mass spectrometry (MS). Tryptic digests of 44 spots were subjected to peptide mass fingerprinting (PMF) by matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) MS. Three spots that were not identified by MALDI-TOF MS analysis were identified by searching against the NCBInr, SwissProt, and expressed sequence tag (EST) databases. We identified 41 unique proteins. The majority of the identified leaf proteins were found to be involved in energy metabolism. The results indicate that 2D-PAGE is a sensitive and powerful technique for the separation and identification of Musa leaf proteins. A summary of the identified proteins and their putative functions is discussed.

  12. Improving the identification and management of chronic kidney disease in primary care: lessons from a staged improvement collaborative.

    PubMed

    Harvey, Gill; Oliver, Kathryn; Humphreys, John; Rothwell, Katy; Hegarty, Janet

    2015-02-01

    Undiagnosed chronic kidney disease (CKD) contributes to a high cost and care burden in secondary care. Uptake of evidence-based guidelines in primary care is inconsistent, resulting in variation in the detection and management of CKD. Routinely collected general practice data in one UK region suggested a CKD prevalence of 4.1%, compared with an estimated national prevalence of 8.5%. Of patients on CKD registers, ∼ 30% were estimated to have suboptimal management according to Public Health Observatory analyses. An evidence-based framework for implementation was developed. This informed the design of an improvement collaborative to work with a sample of 30 general practices. A two-phase collaborative was implemented between September 2009 and March 2012. Key elements of the intervention included learning events, improvement targets, Plan-Do-Study-Act cycles, benchmarking of audit data, facilitator support and staff time reimbursement. Outcomes were evaluated against two indicators: number of patients with CKD on practice registers; percentage of patients achieving evidence-based blood pressure (BP) targets, as a marker for CKD care. In Phase 1, recorded prevalence of CKD in collaborative practices increased ∼ 2-fold more than that in comparator local practices; in Phase 2, this increased to 4-fold, indicating improved case identification. Management of BP according to guideline recommendations also improved. An improvement collaborative with tailored facilitation support appears to promote the uptake of evidence-based guidance on the identification and management of CKD in primary care. A controlled evaluation study is needed to rigorously evaluate the impact of this promising improvement intervention. © The Author 2014. Published by Oxford University Press in association with the International Society for Quality in Health Care.

  13. Improving the identification and management of chronic kidney disease in primary care: lessons from a staged improvement collaborative

    PubMed Central

    Harvey, Gill; Oliver, Kathryn; Humphreys, John; Rothwell, Katy; Hegarty, Janet

    2015-01-01

    Quality problem: Undiagnosed chronic kidney disease (CKD) contributes to a high cost and care burden in secondary care. Uptake of evidence-based guidelines in primary care is inconsistent, resulting in variation in the detection and management of CKD. Initial assessment: Routinely collected general practice data in one UK region suggested a CKD prevalence of 4.1%, compared with an estimated national prevalence of 8.5%. Of patients on CKD registers, ∼30% were estimated to have suboptimal management according to Public Health Observatory analyses. Choice of solution: An evidence-based framework for implementation was developed. This informed the design of an improvement collaborative to work with a sample of 30 general practices. Implementation: A two-phase collaborative was implemented between September 2009 and March 2012. Key elements of the intervention included learning events, improvement targets, Plan-Do-Study-Act cycles, benchmarking of audit data, facilitator support and staff time reimbursement. Evaluation: Outcomes were evaluated against two indicators: number of patients with CKD on practice registers; percentage of patients achieving evidence-based blood pressure (BP) targets, as a marker for CKD care. In Phase 1, recorded prevalence of CKD in collaborative practices increased ∼2-fold more than that in comparator local practices; in Phase 2, this increased to 4-fold, indicating improved case identification. Management of BP according to guideline recommendations also improved. Lessons learned: An improvement collaborative with tailored facilitation support appears to promote the uptake of evidence-based guidance on the identification and management of CKD in primary care. A controlled evaluation study is needed to rigorously evaluate the impact of this promising improvement intervention. PMID:25525148

  14. Purification, identification and preliminary crystallographic studies of Pru du amandin, an allergenic protein from Prunus dulcis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gaur, Vineet; Sethi, Dhruv K.; Salunke, Dinakar M., E-mail: dinakar@nii.res.in

    The purification, identification, crystallization and preliminary crystallographic studies of an allergy-related protein, Pru du amandin, from P. dulcis nuts are reported. Food allergies appear to be one of the foremost causes of hypersensitivity reactions. Nut allergies account for most food allergies and are often permanent. The 360 kDa hexameric protein Pru du amandin, a known allergen, was purified from almonds (Prunus dulcis) by ammonium sulfate fractionation and ion-exchange chromatography. The protein was identified by a BLAST homology search against the nonredundant sequence database. Pru du amandin belongs to the 11S legumin family of seed storage proteins characterized by the presence of a cupin motif. Crystals were obtained by the hanging-drop vapour-diffusion method. The crystals belong to space group P4₁ (or P4₃), with unit-cell parameters a = b = 150.7, c = 164.9 Å.

  15. Experimental strategies for the identification and characterization of adhesive proteins in animals: a review

    PubMed Central

    Hennebert, Elise; Maldonado, Barbara; Ladurner, Peter; Flammang, Patrick; Santos, Romana

    2015-01-01

    Adhesive secretions occur in both aquatic and terrestrial animals, in which they perform diverse functions. Biological adhesives can therefore be remarkably complex and involve a large range of components with different functions and interactions. However, being mainly protein based, biological adhesives can be characterized by classical molecular methods. This review compiles experimental strategies that were successfully used to identify, characterize and obtain the full-length sequence of adhesive proteins from nine biological models: echinoderms, barnacles, tubeworms, mussels, sticklebacks, slugs, velvet worms, spiders and ticks. A brief description and practical examples are given for a variety of tools used to study adhesive molecules at different levels from genes to secreted proteins. In most studies, proteins, extracted from secreted materials or from adhesive organs, are analysed for the presence of post-translational modifications and submitted to peptide sequencing. The peptide sequences are then used directly for a BLAST search in genomic or transcriptomic databases, or to design degenerate primers to perform RT-PCR, both allowing the recovery of the sequence of the cDNA coding for the investigated protein. These sequences can then be used for functional validation and recombinant production. In recent years, the dual proteomic and transcriptomic approach has emerged as the best way leading to the identification of novel adhesive proteins and retrieval of their complete sequences. PMID:25657842

  16. Ammonium Bicarbonate Addition Improves the Detection of Proteins by Desorption Electrospray Ionization Mass Spectrometry

    NASA Astrophysics Data System (ADS)

    Honarvar, Elahe; Venter, Andre R.

    2017-06-01

    The analysis of protein by desorption electrospray ionization mass spectrometry (DESI-MS) is considered impractical due to a mass-dependent loss in sensitivity with increase in protein molecular weights. With the addition of ammonium bicarbonate to the DESI-MS analysis the sensitivity towards proteins by DESI was improved. The signal to noise ratio (S/N) improvement for a variety of proteins increased between 2- to 3-fold relative to solvent systems containing formic acid and more than seven times relative to aqueous methanol spray solvents. Three methods for ammonium bicarbonate addition during DESI-MS were investigated. The additive delivered improvements in S/N whether it was mixed with the analyte prior to sample deposition, applied over pre-prepared samples, or simply added to the desorption spray solvent. The improvement correlated well with protein pI but not with protein size. Other ammonium or bicarbonate salts did not produce similar improvements in S/N, nor was this improvement in S/N observed for ESI of the same samples. As was previously described for ESI, DESI also caused extensive protein unfolding upon the addition of ammonium bicarbonate.

  17. Construction Project Performance Improvement through Radio Frequency Identification Technology Application on a Project Supply Chain

    ERIC Educational Resources Information Center

    Wang, Heng

    2017-01-01

    Construction project productivity typically lags other industries and it has been the focus of numerous studies in order to improve the project performance. This research investigated the application of Radio Frequency Identification (RFID) technology on construction projects' supply chain and determined that RFID technology can improve the…

  18. Protein-Level Integration Strategy of Multiengine MS Spectra Search Results for Higher Confidence and Sequence Coverage.

    PubMed

    Zhao, Panpan; Zhong, Jiayong; Liu, Wanting; Zhao, Jing; Zhang, Gong

    2017-12-01

    Multiple search engines based on various models have been developed to search MS/MS spectra against a reference database, providing different results for the same data set. How to integrate these results efficiently with minimal compromise on false discoveries is an open question due to the lack of an independent, reliable, and highly sensitive standard. We took advantage of the translating mRNA sequencing (RNC-seq) result as a standard to evaluate the integration strategies of the protein identifications from various search engines. We used seven mainstream search engines (Andromeda, Mascot, OMSSA, X!Tandem, pFind, InsPecT, and ProVerB) to search the same label-free MS data sets of human cell lines Hep3B, MHCCLM3, and MHCC97H from the Chinese C-HPP Consortium for Chromosomes 1, 8, and 20. As expected, the union of seven engines resulted in boosted false identifications, whereas the intersection of seven engines remarkably decreased the identification power. We found that accepting identifications made by at least two out of seven engines maximized the protein identification power while minimizing the ratio of suspicious/translation-supported identifications (STR), as monitored by our STR index, based on RNC-Seq. Furthermore, this strategy also significantly improves the peptide coverage of the protein amino acid sequence. In summary, we demonstrated a simple strategy to significantly improve the performance of shotgun mass spectrometry by integrating multiple search engines at the protein level, maximizing the utilization of the current MS spectra without additional experimental work.
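
    The union, intersection, and "at least two engines" strategies compared in this abstract can be illustrated with a small voting sketch. This is only a minimal illustration under stated assumptions, not the authors' pipeline: the function name `integrate_identifications`, the protein labels, and the toy engine outputs are hypothetical, and the RNC-seq based STR evaluation is not modeled.

```python
from collections import Counter


def integrate_identifications(engine_results, min_engines=2):
    """Return the proteins reported by at least `min_engines` search engines.

    `engine_results` maps an engine name to the set of protein IDs it reported.
    (Hypothetical helper for illustration; not taken from the paper.)
    """
    votes = Counter()
    for proteins in engine_results.values():
        votes.update(set(proteins))  # each engine votes at most once per protein
    return {protein for protein, n in votes.items() if n >= min_engines}


# Toy, made-up identifications from three engines (the study used seven).
engine_results = {
    "Mascot": {"P1", "P2", "P3"},
    "X!Tandem": {"P2", "P3", "P4"},
    "OMSSA": {"P3", "P5"},
}

union = integrate_identifications(engine_results, min_engines=1)
intersection = integrate_identifications(engine_results, min_engines=len(engine_results))
at_least_two = integrate_identifications(engine_results, min_engines=2)
```

    With `min_engines=1` the function reduces to the union and with `min_engines` equal to the number of engines it reduces to the intersection, so the threshold is the single knob that trades identification power against false identifications.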

  19. Reduced dimensionality tailored HN(C)N experiments for facile backbone resonance assignment of proteins through unambiguous identification of sequential HSQC peaks

    NASA Astrophysics Data System (ADS)

    Kumar, Dinesh

    2013-12-01

    Two novel reduced dimensionality (RD) tailored HN(C)N [S.C. Panchal, N.S. Bhavesh, R.V. Hosur, Improved 3D triple resonance experiments, HNN and HN(C)N, for HN and 15N sequential correlations in (13C, 15N) labeled proteins: application to unfolded proteins, J. Biomol. NMR 20 (2001) 135-147] experiments are proposed to facilitate the backbone resonance assignment of proteins both in terms of its accuracy and speed. These experiments - referred to here as (4,3)D-hNCOcaNH and (4,3)D-hNcoCANH - exploit the linear combination of backbone 15N and 13C′/13Cα chemical shifts simultaneously to achieve higher peak dispersion and randomness along their respective F1 dimensions. Simply, this has been achieved by modulating the backbone 15N(i) chemical shifts with that of 13C′(i - 1)/13Cα(i - 1) spins following the established reduced dimensionality NMR approach [T. Szyperski, D.C. Yeh, D.K. Sukumaran, H.N. Moseley, G.T. Montelione, Reduced-dimensionality NMR spectroscopy for high-throughput protein resonance assignment, Proc. Natl. Acad. Sci. USA 99 (2002) 8009-8014]. Though the modification is simple, it has resulted in an ingenious improvement of HN(C)N both in terms of peak dispersion and ease of establishing the sequential connectivities. The increased dispersion along the F1 dimension serves two purposes here: (i) it resolves the ambiguities arising because of degenerate 15N chemical shifts and (ii) it reduces the signal overlap in F2(15N)-F3(1H) planes (an important requisite in HN(C)N based assignment protocol for facile and unambiguous identification of sequentially connected HSQC peaks). The performance of both these experiments and the assignment protocol has been demonstrated using bovine apo Calbindin-d9k (75 aa) and urea denatured UNC60B (a 152 amino acid ADF/cofilin family protein of Caenorhabditis elegans), as representatives of folded and unfolded protein systems, respectively.

  20. Improving the representation of peptide-like inhibitor and antibiotic molecules in the Protein Data Bank

    PubMed Central

    Dutta, Shuchismita; Dimitropoulos, Dimitris; Feng, Zukang; Persikova, Irina; Sen, Sanchayita; Shao, Chenghua; Westbrook, John; Young, Jasmine; Zhuravleva, Marina A; Kleywegt, Gerard J; Berman, Helen M

    2014-01-01

    With the accumulation of a large number and variety of molecules in the Protein Data Bank (PDB) comes the need on occasion to review and improve their representation. The Worldwide PDB (wwPDB) partners have periodically updated various aspects of structural data representation to improve the integrity and consistency of the archive. The remediation effort described here was focused on improving the representation of peptide-like inhibitor and antibiotic molecules so that they can be easily identified and analyzed. Peptide-like inhibitors or antibiotics were identified in over 1000 PDB entries, systematically reviewed and represented either as peptides with polymer sequence or as single components. For the majority of the single-component molecules, their peptide-like composition was captured in a new representation, called the subcomponent sequence. A novel concept called “group” was developed for representing complex peptide-like antibiotics and inhibitors that are composed of multiple polymer and nonpolymer components. In addition, a reference dictionary was developed with detailed information about these peptide-like molecules to aid in their annotation, identification and analysis. Based on the experience gained in this remediation, guidelines, procedures, and tools were developed to annotate new depositions containing peptide-like inhibitors and antibiotics accurately and consistently. © 2013 Wiley Periodicals, Inc. Biopolymers 101: 659–668, 2014. PMID:24173824

  1. In situ identification of the synthrophic protein fermentative Coprothermobacter spp. involved in the thermophilic anaerobic digestion process.

    PubMed

    Gagliano, Maria Cristina; Braguglia, Camilla Maria; Rossetti, Simona

    2014-09-01

    Thermophilic bacteria have recently attracted great attention because of their potential application in improving different biochemical processes such as anaerobic digestion of various substrates, wastewater treatment or hydrogen production. In this study we report on the design of a specific 16S rRNA-targeted oligonucleotide probe for detecting members of Coprothermobacter genus characterized by a strong protease activity to degrade proteins and peptides. The newly designed CTH485 probe and helper probes hCTH429 and hCTH439 were optimized for use in fluorescence in situ hybridization (FISH) on thermophilic anaerobic sludge samples. In situ probing revealed that thermo-adaptive mechanisms shaping the 16S rRNA gene may affect the identification of thermophilic microorganisms. The novel developed FISH probe extends the possibility to study the widespread thermophilic syntrophic interaction of Coprothermobacter spp. with hydrogenotrophic methanogenic archaea, whose establishment is a great benefit for the whole anaerobic system. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.

  2. Improving the Identification of Neonatal Encephalopathy: Utility of a Web-Based Video Tool.

    PubMed

    Ivy, Autumn S; Clark, Catherine L; Bahm, Sarah M; Meurs, Krisa P Van; Wusthoff, Courtney J

    2017-04-01

    Objective: This study tested the effectiveness of a video teaching tool in improving identification and classification of encephalopathy in infants. Study Design: We developed an innovative video teaching tool to help clinicians improve their skills in interpreting the neonatal neurological examination for grading encephalopathy. Pediatric residents were shown 1-minute video clips demonstrating exam findings in normal neonates and neonates with various degrees of encephalopathy. Findings from five domains were demonstrated: spontaneous activity, level of alertness, posture/tone, reflexes, and autonomic responses. After each clip, subjects were asked to identify whether the exam finding was normal or consistent with mild, moderate, or severe abnormality. Subjects were then directed to a web-based teaching toolkit, containing a compilation of videos demonstrating normal and abnormal findings on the neonatal neurological examination. Immediately after training, subjects underwent posttesting, again identifying exam findings as normal, mild, moderate, or severe abnormality. Results: Residents improved in their overall ability to identify and classify neonatal encephalopathy after viewing the teaching tool. In particular, the identification of abnormal spontaneous activity, reflexes, and autonomic responses was most improved. Conclusion: This pretest/posttest evaluation of an educational tool demonstrates that after viewing our toolkit, pediatric residents were able to improve their overall ability to detect neonatal encephalopathy.

  3. Bacillus anthracis secretome time course under host-simulated conditions and identification of immunogenic proteins.

    PubMed

    Walz, Alexander; Mujer, Cesar V; Connolly, Joseph P; Alefantis, Tim; Chafin, Ryan; Dake, Clarissa; Whittington, Jessica; Kumar, Srikanta P; Khan, Akbar S; DelVecchio, Vito G

    2007-07-27

    progression of pathogenicity, identification of therapeutics and diagnostic markers, and vaccine development. This study also adds to the continuously growing list of identified Bacillus anthracis secretome proteins.

  4. Bacillus anthracis secretome time course under host-simulated conditions and identification of immunogenic proteins

    PubMed Central

    Walz, Alexander; Mujer, Cesar V; Connolly, Joseph P; Alefantis, Tim; Chafin, Ryan; Dake, Clarissa; Whittington, Jessica; Kumar, Srikanta P; Khan, Akbar S; DelVecchio, Vito G

    2007-01-01

    relevant in elucidation of the progression of pathogenicity, identification of therapeutics and diagnostic markers, and vaccine development. This study also adds to the continuously growing list of identified Bacillus anthracis secretome proteins. PMID:17662140

  5. Weight and See: Loading Working Memory Improves Incidental Identification of Irrelevant Faces

    PubMed Central

    Carmel, David; Fairnie, Jake; Lavie, Nilli

    2012-01-01

    Are task-irrelevant stimuli processed to a level enabling individual identification? This question is central both for perceptual processing models and for applied settings (e.g., eye-witness testimony). Lavie’s load theory proposes that working memory actively maintains attentional prioritization of relevant over irrelevant information. Loading working memory thus impairs attentional prioritization, leading to increased processing of task-irrelevant stimuli. Previous research has shown that increased working memory load leads to greater interference effects from response-competing distractors. Here we test the novel prediction that increased processing of irrelevant stimuli under high working memory load should lead to a greater likelihood of incidental identification of entirely irrelevant stimuli. To test this, we asked participants to perform a word-categorization task while ignoring task-irrelevant images. The categorization task was performed during the retention interval of a working memory task with either low or high load (defined by memory set size). Following the final experimental trial, a surprise question assessed incidental identification of the irrelevant image. Loading working memory was found to improve identification of task-irrelevant faces, but not of building stimuli (shown in a separate experiment to be less distracting). These findings suggest that working memory plays a critical role in determining whether distracting stimuli will be subsequently identified. PMID:22912623

  6. Identification of Surface-Exposed Protein Radicals and A Substrate Oxidation Site in A-Class Dye-Decolorizing Peroxidase from Thermomonospora curvata

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Shrestha, Ruben; Chen, Xuejie; Ramyar, Kasra X.

    Dye-decolorizing peroxidases (DyPs) are a family of heme peroxidases in which a catalytic distal aspartate is involved in H2O2 activation to catalyze oxidations under acidic conditions. They have received much attention due to their potential applications in lignin compound degradation and biofuel production from biomass. However, the mode of oxidation in bacterial DyPs remains unknown. We have recently reported that the bacterial TcDyP from Thermomonospora curvata is among the most active DyPs and shows activity toward phenolic lignin model compounds. On the basis of the X-ray crystal structure solved at 1.75 Å, sigmoidal steady-state kinetics with Reactive Blue 19 (RB19), and formation of compound II like product in the absence of reducing substrates observed with stopped-flow spectroscopy and electron paramagnetic resonance (EPR), we hypothesized that the TcDyP catalyzes oxidation of large-size substrates via multiple surface-exposed protein radicals. Among 7 tryptophans and 3 tyrosines in TcDyP consisting of 376 residues for the matured protein, W263, W376, and Y332 were identified as surface-exposed protein radicals. Only the W263 was also characterized as one of the surface-exposed oxidation sites. SDS-PAGE and size-exclusion chromatography demonstrated that W376 represents an off-pathway destination for electron transfer, resulting in the cross-linking of proteins in the absence of substrates. Mutation of W376 improved compound I stability and overall catalytic efficiency toward RB19. While Y332 is highly conserved across all four classes of DyPs, its catalytic function in A-class TcDyP is minimal, possibly due to its extremely small solvent-accessible areas. Identification of surface-exposed protein radicals and substrate oxidation sites is important for understanding the DyP mechanism and modulating its catalytic functions for improved activity on phenolic lignin.

  7. Incorporation of unnatural sugars for the identification of glycoproteins.

    PubMed

    Zaro, Balyn W; Hang, Howard C; Pratt, Matthew R

    2013-01-01

    Glycosylation is an abundant post-translational modification that alters the fate and function of its substrate proteins. To aid in understanding the significance of protein glycosylation, identification of target proteins is key. As with all proteomics experiments, mass spectrometry has been established as the desired method for substrate identification. However, these approaches require selective enrichment and purification of modified proteins. Chemical reporters in combination with bioorthogonal reactions have emerged as robust tools for identifying post-translational modifications including glycosylation. We provide here a method for the use of bioorthogonal chemical reporters for isolation and identification of glycosylated proteins. More specifically, this protocol is a representative procedure from our own work using an alkyne-bearing O-GlcNAc chemical reporter (GlcNAlk) and a chemically cleavable azido-azo-biotin probe for the identification of O-GlcNAc-modified proteins.

  8. Transmembrane proteins in the Protein Data Bank: identification and classification.

    PubMed

    Tusnády, Gábor E; Dosztányi, Zsuzsanna; Simon, István

    2004-11-22

    Integral membrane proteins play important roles in living cells. Although these proteins are estimated to constitute 25% of proteins at a genomic scale, the Protein Data Bank (PDB) contains only a few hundred membrane proteins due to the difficulties with experimental techniques. The presence of transmembrane proteins in the structure data bank, however, is quite invisible, as the annotation of these entries is rather poor. Even if a protein is identified as a transmembrane one, the possible location of the lipid bilayer is not indicated in the PDB because these proteins are crystallized without their natural lipid bilayer, and currently no method is publicly available to detect the possible membrane plane using the atomic coordinates of membrane proteins. Here, we present a new geometrical approach to distinguish between transmembrane and globular proteins using structural information only and to locate the most likely position of the lipid bilayer. An automated algorithm (TMDET) is given to determine the membrane planes relative to the position of atomic coordinates, together with a discrimination function which is able to separate transmembrane and globular proteins even in cases of low resolution or incomplete structures such as fragments or parts of large multi chain complexes. This method can be used for the proper annotation of protein structures containing transmembrane segments and paves the way to an up-to-date database containing the structure of all known transmembrane proteins and fragments (PDB_TM) which can be automatically updated. The algorithm is equally important for the purpose of constructing databases purely of globular proteins.

  9. Identification of Putative ORF5 Protein of Porcine Circovirus Type 2 and Functional Analysis of GFP-Fused ORF5 Protein

    PubMed Central

    Xu, Han; Wang, Tao; Zhang, Yanming

    2015-01-01

    Porcine circovirus type 2 (PCV2) is the essential infectious agent responsible for causing porcine circovirus-associated diseases in pigs. To date, eleven RNAs and five viral proteins of PCV2 have been detected. Here, we identified a novel viral gene within the PCV2 genome, termed ORF5, that exists at both the transcriptional and translational level during productive infection of PCV2 in porcine alveolar macrophages 3D4/2 (PAMs). Northern blot analysis was used to demonstrate that the ORF5 gene measures 180 bp in length and overlaps completely with ORF1 when read in the same direction. Site-directed mutagenesis was used to show that the ORF5 protein is not essential for PCV2 replication. To investigate the biological functions of the novel protein, we constructed a recombinant eukaryotic expression plasmid capable of expressing PCV2 ORF5. The results show that the GFP-tagged PCV2 ORF5 protein localizes to the endoplasmic reticulum (ER), is degraded via the proteasome, inhibits PAM growth and prolongs the S-phase of the cell cycle. Further studies show that the GFP-tagged PCV2 ORF5 protein induces ER stress and activates NF-κB, which was further confirmed by a significant upregulation in IL-6, IL-8 and COX-2 expression. In addition, five cellular proteins (GPNMB, CYP1A1, YWHAB, ZNF511 and SRSF3) were found to interact with ORF5 via yeast two-hybrid assay. These findings provide novel information on the identification and functional analysis of the PCV2 ORF5 protein and are likely to be of benefit in elucidating the molecular mechanisms of PCV2 pathogenicity. However, additional experiments are needed to validate the expression and function of the ORF5 protein during PCV2 infection in vitro before any definitive conclusion can be drawn. PMID:26035722

  10. Improved facial nerve identification during parotidectomy with fluorescently labeled peptide.

    PubMed

    Hussain, Timon; Nguyen, Linda T; Whitney, Michael; Hasselmann, Jonathan; Nguyen, Quyen T

    2016-12-01

    Additional intraoperative guidance could reduce the risk of iatrogenic injury during parotid gland cancer surgery. We evaluated the intraoperative use of fluorescently labeled nerve binding peptide NP41 to aid facial nerve identification and preservation during parotidectomy in an orthotopic model of murine parotid gland cancer. We also quantified the accuracy of intraoperative nerve detection for surface and buried nerves in the head and neck with NP41 versus white light (WL) alone. Twenty-eight mice underwent parotid gland cancer surgeries with additional fluorescence (FL) guidance versus WL reflectance (WLR) alone. Eight mice were used for additional nerve-imaging experiments. Twenty-eight parotid tumor-bearing mice underwent parotidectomy. Eight mice underwent imaging of both sides of the face after skin removal. Postoperative assessment of facial nerve function measured by automated whisker tracking was compared between FL guidance (n = 13) versus WL alone (n = 15). In eight mice, nerve to surrounding tissue contrast was measured under FL versus WLR for all nerve branches detectable in the field of view. Postoperative facial nerve function after parotid gland cancer surgery tended to be better with additional FL guidance. Fluorescent labeling significantly improved nerve to surrounding tissue contrast for both large and smaller buried nerve branches compared to WLR visualization and improved detection sensitivity and specificity. NP41 FL imaging significantly aids the intraoperative identification of nerve branches otherwise nearly invisible to the naked eye. Its application in a murine model of parotid gland cancer surgery tended to improve functional preservation of the facial nerve. NA Laryngoscope, 126:2711-2717, 2016. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.

  11. [Electrophoretic patterns of cell wall protein as a criterion for the identification and classification of Corynebacteria].

    PubMed

    Mykhal's'kyĭ, L O; Furtat, I M; Dem'ianenko, F P; Kostiuchyk, A A

    2001-01-01

    Electrophoretic patterns of cell wall proteins of three industrial strains used for the production of lysine and eight collection strains from the genus Corynebacterium were studied to analyze their similarity and to assess whether this parameter could serve as an additional criterion for the identification and classification of corynebacteria. Similarity coefficients of the overall and main cell wall protein electrophoretic patterns were determined by a specially created computer program. Electrophoretic analysis showed that every species had an individual protein profile. Among the major and minor proteins overall, biopolymers common to the species, common to the genus, and specific to individual strains were identified. The results showed that the patterns of the main proteins were more conservative and informative than those of the overall proteins. The similarity coefficient determined from the main protein patterns correlated with the protein profile characteristics of every analyzed strain and made it possible to distribute the strains into separate groups. The similarity coefficient based on the main protein patterns allows one species or strain to be separated from another, which suggests that this parameter could be used as an additional criterion for differentiating corynebacteria and assigning them to a certain taxonomic group.
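
    The abstract does not say how its similarity coefficient is defined, so the sketch below is an assumption rather than the authors' program: it uses the Dice coefficient, a common choice for comparing band patterns, and matches bands whose positions agree within a tolerance. The function name `dice_similarity`, the tolerance, and the band positions (given as apparent molecular masses in kDa) are all hypothetical.

```python
def dice_similarity(bands_a, bands_b, tolerance=1.0):
    """Dice coefficient 2*shared / (n_a + n_b) for two lists of band positions.

    Two bands count as shared when their positions differ by at most
    `tolerance`. Matching is a single pass over the sorted lists, so each
    band is paired at most once. (Assumed definition, for illustration only.)
    """
    a, b = sorted(bands_a), sorted(bands_b)
    i = j = shared = 0
    while i < len(a) and j < len(b):
        if abs(a[i] - b[j]) <= tolerance:
            shared += 1
            i += 1
            j += 1
        elif a[i] < b[j]:
            i += 1  # band in A has no partner in B
        else:
            j += 1  # band in B has no partner in A
    total = len(a) + len(b)
    return 2 * shared / total if total else 0.0


# Made-up band positions (kDa) for two hypothetical strains.
strain_a = [14.0, 31.5, 45.0, 66.0]
strain_b = [14.4, 45.2, 52.0, 66.5]
similarity = dice_similarity(strain_a, strain_b)
```

    A coefficient of 1.0 means every band has a partner in the other pattern and 0.0 means none do; pairwise values of this kind could then be used to cluster strains into groups.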

  12. Retinal identification based on an Improved Circular Gabor Filter and Scale Invariant Feature Transform.

    PubMed

    Meng, Xianjing; Yin, Yilong; Yang, Gongping; Xi, Xiaoming

    2013-07-18

    Retinal identification based on retinal vasculatures in the retina provides the most secure and accurate means of authentication among biometrics and has primarily been used in combination with access control systems at high security facilities. Recently, there has been much interest in retina identification. As digital retina images always suffer from deformations, the Scale Invariant Feature Transform (SIFT), which is known for its distinctiveness and invariance for scale and rotation, has been introduced to retinal based identification. However, some shortcomings like the difficulty of feature extraction and mismatching exist in SIFT-based identification. To solve these problems, a novel preprocessing method based on the Improved Circular Gabor Transform (ICGF) is proposed. After further processing by the iterated spatial anisotropic smooth method, the number of uninformative SIFT keypoints is decreased dramatically. Tested on the VARIA and eight simulated retina databases combining rotation and scaling, the developed method presents promising results and shows robustness to rotations and scale changes.

  13. Retinal Identification Based on an Improved Circular Gabor Filter and Scale Invariant Feature Transform

    PubMed Central

    Meng, Xianjing; Yin, Yilong; Yang, Gongping; Xi, Xiaoming

    2013-01-01

    Retinal identification based on retinal vasculatures in the retina provides the most secure and accurate means of authentication among biometrics and has primarily been used in combination with access control systems at high security facilities. Recently, there has been much interest in retina identification. As digital retina images always suffer from deformations, the Scale Invariant Feature Transform (SIFT), which is known for its distinctiveness and invariance for scale and rotation, has been introduced to retinal based identification. However, some shortcomings like the difficulty of feature extraction and mismatching exist in SIFT-based identification. To solve these problems, a novel preprocessing method based on the Improved Circular Gabor Transform (ICGF) is proposed. After further processing by the iterated spatial anisotropic smooth method, the number of uninformative SIFT keypoints is decreased dramatically. Tested on the VARIA and eight simulated retina databases combining rotation and scaling, the developed method presents promising results and shows robustness to rotations and scale changes. PMID:23873409

  14. Bayesian module identification from multiple noisy networks.

    PubMed

    Zamani Dadaneh, Siamak; Qian, Xiaoning

    2016-12-01

    Module identification has been studied extensively in order to gain deeper understanding of complex systems, such as social networks as well as biological networks. Modules are often defined as groups of vertices in these networks that are topologically cohesive with similar interaction patterns with the rest of the vertices. Most of the existing module identification algorithms assume that the given networks are faithfully measured without errors. However, in many real-world applications, for example, when analyzing protein-protein interaction networks from high-throughput profiling techniques, there is significant noise with both false positive and missing links between vertices. In this paper, we propose a new model for more robust module identification by taking advantage of multiple observed networks with significant noise so that signals in multiple networks can be strengthened and help improve the solution quality by combining information from various sources. We adopt a hierarchical Bayesian model to integrate multiple noisy snapshots that capture the underlying modular structure of the networks under study. By introducing a latent root assignment matrix and its relations to instantaneous module assignments in all the observed networks to capture the underlying modular structure and combine information across multiple networks, an efficient variational Bayes algorithm can be derived to accurately and robustly identify the underlying modules from multiple noisy networks. Experiments on synthetic and protein-protein interaction data sets show that our proposed model enhances both the accuracy and resolution in detecting cohesive modules, and it is less vulnerable to noise in the observed data. In addition, it shows higher power in predicting missing edges compared to individual-network methods.

  15. An improved method to unravel phosphoacceptors in Ser/Thr protein kinase-phosphorylated substrates.

    PubMed

    Molle, Virginie; Leiba, Jade; Zanella-Cléon, Isabelle; Becchi, Michel; Kremer, Laurent

    2010-11-01

    Identification of the phosphorylated residues of bacterial Ser/Thr protein kinase (STPK) substrates still represents a challenging task. Herein, we present a new strategy allowing the rapid determination of phosphoacceptors in kinase substrates, essentially based on the dual expression of the kinase with its substrate in the surrogate E. coli, followed by MS analysis in a single-step procedure. The performance of this strategy is illustrated using two distinct proteins from Mycobacterium tuberculosis as model substrates, the GroEL2 and HspX chaperones. A comparative analysis with a standard method that includes mass spectrometry analysis of in vitro phosphorylated substrates is also addressed.

  16. Prediction of hot regions in protein-protein interaction by combining density-based incremental clustering with feature-based classification.

    PubMed

    Hu, Jing; Zhang, Xiaolong; Liu, Xiaoming; Tang, Jinshan

    2015-06-01

    Discovering hot regions in protein-protein interaction is important for drug and protein design, while experimental identification of hot regions is a time-consuming and labor-intensive effort; thus, the development of predictive models can be very helpful. In hot region prediction research, some models are based on structure information, and others are based on a protein interaction network. However, the prediction accuracy of these methods can still be improved. In this paper, a new method is proposed for hot region prediction, which combines density-based incremental clustering with feature-based classification. The method uses density-based incremental clustering to obtain rough hot regions, and uses feature-based classification to remove the non-hot spot residues from the rough hot regions. Experimental results show that the proposed method significantly improves the prediction performance of hot regions. Copyright © 2015 Elsevier Ltd. All rights reserved.

  17. Application of Machine Learning Approaches for Protein-protein Interactions Prediction.

    PubMed

    Zhang, Mengying; Su, Qiang; Lu, Yi; Zhao, Manman; Niu, Bing

    2017-01-01

    Proteomics endeavors to study the structures, functions and interactions of proteins. Information on protein-protein interactions (PPIs) helps to improve our knowledge of the functions and the 3D structures of proteins. Thus, determining PPIs is essential for the study of proteomics. In this review, to study the application of machine learning in predicting PPIs, some machine learning approaches such as support vector machine (SVM), artificial neural networks (ANNs) and random forest (RF) were selected, and examples of their applications to PPIs were listed. SVM and RF are two commonly used methods. Nowadays, more researchers predict PPIs by combining more than two methods. This review presents the application of machine learning approaches in predicting PPIs. Many examples of success in identification and prediction in the area of PPI prediction have been discussed, and PPI research is still in progress. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  18. Serum microRNA biomarker identification in a residential cohort with elevated polychlorinated biphenyl exposures

    EPA Science Inventory

    Exposure to liver toxicants can result in or exacerbate fatty liver disease. Recent evidence suggests that serum-derived microRNAs (miRs) may improve identification of chemical-induced fatty liver disease relative to traditional protein-based biomarkers alone. Historical serum sa...

  19. Comparison of identification methods for oral asaccharolytic Eubacterium species.

    PubMed

    Wade, W G; Slayne, M A; Aldred, M J

    1990-12-01

    Thirty-one strains of oral, asaccharolytic Eubacterium spp. and the type strains of E. brachy, E. nodatum and E. timidum were subjected to three identification techniques--protein-profile analysis, determination of metabolic end-products, and the API ATB32A identification kit. Five clusters were obtained from numerical analysis of protein profiles and excellent correlations were seen with the other two methods. Protein profiles alone allowed unequivocal identification.

  20. Breast cancer and protein biomarkers

    PubMed Central

    Gam, Lay-Harn

    2012-01-01

    Breast cancer is a healthcare concern of women worldwide. Despite procedures being available for diagnosis, prognosis and treatment of breast cancer, researchers are working intensively on the disease in order to improve the life quality of breast cancer patients. At present, there is no single treatment known to bring a definite cure for breast cancer. One of the possible solutions for combating breast cancer is through identification of reliable protein biomarkers that can be effectively used for early detection, prognosis and treatments of the cancer. Therefore, the task of identification of biomarkers for breast cancer has become the focus of many researchers worldwide. PMID:24520539

  1. Estimating the Efficiency of Phosphopeptide Identification by Tandem Mass Spectrometry

    NASA Astrophysics Data System (ADS)

    Hsu, Chuan-Chih; Xue, Liang; Arrington, Justine V.; Wang, Pengcheng; Paez Paez, Juan Sebastian; Zhou, Yuan; Zhu, Jian-Kang; Tao, W. Andy

    2017-06-01

    Mass spectrometry has played a significant role in the identification of unknown phosphoproteins and sites of phosphorylation in biological samples. Analyses of protein phosphorylation, particularly large scale phosphoproteomic experiments, have recently been enhanced by efficient enrichment, fast and accurate instrumentation, and better software, but challenges remain because of the low stoichiometry of phosphorylation and poor phosphopeptide ionization efficiency and fragmentation due to neutral loss. Phosphoproteomics has become an important dimension in systems biology studies, and it is essential to have efficient analytical tools to cover a broad range of signaling events. To evaluate current mass spectrometric performance, we present here a novel method to estimate the efficiency of phosphopeptide identification by tandem mass spectrometry. Phosphopeptides were directly isolated from whole plant cell extracts, dephosphorylated, and then incubated with one of three purified kinases (casein kinase II, mitogen-activated protein kinase 6, and SNF-related protein kinase 2.6) along with 16O4- and 18O4-ATP separately for in vitro kinase reactions. Phosphopeptides were enriched and analyzed by LC-MS. The phosphopeptide identification rate was estimated by comparing phosphopeptides identified by tandem mass spectrometry with phosphopeptide pairs generated by stable isotope labeled kinase reactions. Overall, we found that current high speed and high accuracy mass spectrometers can only identify 20%-40% of total phosphopeptides primarily due to relatively poor fragmentation, additional modifications, and low abundance, highlighting the urgent need for continuous efforts to improve phosphopeptide identification efficiency.

  2. A Multifaceted Study of Scedosporium boydii Cell Wall Changes during Germination and Identification of GPI-Anchored Proteins

    PubMed Central

    Ghamrawi, Sarah; Gastebois, Amandine; Zykwinska, Agata; Vandeputte, Patrick; Marot, Agnès; Mabilleau, Guillaume; Cuenot, Stéphane; Bouchara, Jean-Philippe

    2015-01-01

    Scedosporium boydii is a pathogenic filamentous fungus that causes a wide range of human infections, notably respiratory infections in patients with cystic fibrosis. The development of new therapeutic strategies targeting S. boydii necessitates a better understanding of the physiology of this fungus and the identification of new molecular targets. In this work, we studied the conidium-to-germ tube transition using a variety of techniques including scanning and transmission electron microscopy, atomic force microscopy, two-phase partitioning, microelectrophoresis and cationized ferritin labeling, chemical force spectroscopy, lectin labeling, and nanoLC-MS/MS for cell wall GPI-anchored protein analysis. We demonstrated that the cell wall undergoes structural changes with germination accompanied with a lower hydrophobicity, electrostatic charge and binding capacity to cationized ferritin. Changes during germination also included a higher accessibility of some cell wall polysaccharides to lectins and less CH3/CH3 interactions (hydrophobic adhesion forces mainly due to glycoproteins). We also extracted and identified 20 GPI-anchored proteins from the cell wall of S. boydii, among which one was detected only in the conidial wall extract and 12 only in the mycelial wall extract. The identified sequences belonged to protein families involved in virulence in other fungi like Gelp/Gasp, Crhp, Bglp/Bgtp families and a superoxide dismutase. These results highlighted the cell wall remodeling during germination in S. boydii with the identification of a substantial number of cell wall GPI-anchored conidial or hyphal specific proteins, which provides a basis to investigate the role of these molecules in the host-pathogen interaction and fungal virulence. PMID:26038837

  3. A Multifaceted Study of Scedosporium boydii Cell Wall Changes during Germination and Identification of GPI-Anchored Proteins.

    PubMed

    Ghamrawi, Sarah; Gastebois, Amandine; Zykwinska, Agata; Vandeputte, Patrick; Marot, Agnès; Mabilleau, Guillaume; Cuenot, Stéphane; Bouchara, Jean-Philippe

    2015-01-01

    Scedosporium boydii is a pathogenic filamentous fungus that causes a wide range of human infections, notably respiratory infections in patients with cystic fibrosis. The development of new therapeutic strategies targeting S. boydii necessitates a better understanding of the physiology of this fungus and the identification of new molecular targets. In this work, we studied the conidium-to-germ tube transition using a variety of techniques including scanning and transmission electron microscopy, atomic force microscopy, two-phase partitioning, microelectrophoresis and cationized ferritin labeling, chemical force spectroscopy, lectin labeling, and nanoLC-MS/MS for cell wall GPI-anchored protein analysis. We demonstrated that the cell wall undergoes structural changes with germination accompanied with a lower hydrophobicity, electrostatic charge and binding capacity to cationized ferritin. Changes during germination also included a higher accessibility of some cell wall polysaccharides to lectins and less CH3/CH3 interactions (hydrophobic adhesion forces mainly due to glycoproteins). We also extracted and identified 20 GPI-anchored proteins from the cell wall of S. boydii, among which one was detected only in the conidial wall extract and 12 only in the mycelial wall extract. The identified sequences belonged to protein families involved in virulence in other fungi like Gelp/Gasp, Crhp, Bglp/Bgtp families and a superoxide dismutase. These results highlighted the cell wall remodeling during germination in S. boydii with the identification of a substantial number of cell wall GPI-anchored conidial or hyphal specific proteins, which provides a basis to investigate the role of these molecules in the host-pathogen interaction and fungal virulence.

  4. An improved in-house lysis-filtration protocol for bacterial identification from positive blood culture bottles with high identification rates by MALDI-TOF MS.

    PubMed

    Tsuchida, Sachio; Murata, Syota; Miyabe, Akiko; Satoh, Mamoru; Takiwaki, Masaki; Matsushita, Kazuyuki; Nomura, Fumio

    2018-05-01

    Matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) is now a well-established method for identification of microorganisms from positive blood cultures. Pretreatments to effectively remove non-bacterial proteins are a prerequisite for successful identification, and a variety of protocols have been reported. Although commercially available kits, mainly the Sepsityper Kit, are increasingly used, the identification rates reported often are not satisfactory, particularly for Gram-positive isolates. We developed a new, in-house lysis-filtration protocol and prospectively evaluated its performance compared to the Sepsityper kit. The in-house protocol consists of three simple steps: lysis by ammonium chloride, aspiration with a syringe fitted with a 0.45-μm membrane, and centrifugation to collect microbes. The novel protocol requires only 20 min. Performance of the in-house protocol was evaluated using a total of 117 monomicrobial cases of positive blood culture. Medium from blood culture bottles was pretreated by the in-house protocol or the commercial kit, and isolated cells were subjected to direct identification by mass spectrometry fingerprinting in parallel with conventional subculturing for reference identification. The overall MALDI-TOF MS-based identification rates with score > 1.7 and > 2.0 obtained using the in-house protocol were 99.2% and 85.5%, respectively, whereas those obtained using the Sepsityper Kit were 85.4% and 61.5%, respectively. For Gram-positive cases, the in-house protocol yielded scores >1.7 and > 2.0 at 98.5% and 76.1%, respectively, whereas the commercial kit yielded these scores at 76.1% and 43.3%, respectively. Although these are preliminary results, these values suggest that this easy lysis-filtration protocol deserves assessment in a larger-scale test. Copyright © 2018 Elsevier B.V. All rights reserved.

  5. Improving photoelectron counting and particle identification in scintillation detectors with Bayesian techniques

    NASA Astrophysics Data System (ADS)

    Akashi-Ronquest, M.; Amaudruz, P.-A.; Batygov, M.; Beltran, B.; Bodmer, M.; Boulay, M. G.; Broerman, B.; Buck, B.; Butcher, A.; Cai, B.; Caldwell, T.; Chen, M.; Chen, Y.; Cleveland, B.; Coakley, K.; Dering, K.; Duncan, F. A.; Formaggio, J. A.; Gagnon, R.; Gastler, D.; Giuliani, F.; Gold, M.; Golovko, V. V.; Gorel, P.; Graham, K.; Grace, E.; Guerrero, N.; Guiseppe, V.; Hallin, A. L.; Harvey, P.; Hearns, C.; Henning, R.; Hime, A.; Hofgartner, J.; Jaditz, S.; Jillings, C. J.; Kachulis, C.; Kearns, E.; Kelsey, J.; Klein, J. R.; Kuźniak, M.; LaTorre, A.; Lawson, I.; Li, O.; Lidgard, J. J.; Liimatainen, P.; Linden, S.; McFarlane, K.; McKinsey, D. N.; MacMullin, S.; Mastbaum, A.; Mathew, R.; McDonald, A. B.; Mei, D.-M.; Monroe, J.; Muir, A.; Nantais, C.; Nicolics, K.; Nikkel, J. A.; Noble, T.; O'Dwyer, E.; Olsen, K.; Orebi Gann, G. D.; Ouellet, C.; Palladino, K.; Pasuthip, P.; Perumpilly, G.; Pollmann, T.; Rau, P.; Retière, F.; Rielage, K.; Schnee, R.; Seibert, S.; Skensved, P.; Sonley, T.; Vázquez-Jáuregui, E.; Veloce, L.; Walding, J.; Wang, B.; Wang, J.; Ward, M.; Zhang, C.

    2015-05-01

    Many current and future dark matter and neutrino detectors are designed to measure scintillation light with a large array of photomultiplier tubes (PMTs). The energy resolution and particle identification capabilities of these detectors depend in part on the ability to accurately identify individual photoelectrons in PMT waveforms despite large variability in pulse amplitudes and pulse pileup. We describe a Bayesian technique that can identify the times of individual photoelectrons in a sampled PMT waveform without deconvolution, even when pileup is present. To demonstrate the technique, we apply it to the general problem of particle identification in single-phase liquid argon dark matter detectors. Using the output of the Bayesian photoelectron counting algorithm described in this paper, we construct several test statistics for rejection of backgrounds for dark matter searches in argon. Compared to simpler methods based on either observed charge or peak finding, the photoelectron counting technique improves both energy resolution and particle identification of low energy events in calibration data from the DEAP-1 detector and simulation of the larger MiniCLEAN dark matter detector.

  6. Extracting features from protein sequences to improve deep extreme learning machine for protein fold recognition.

    PubMed

    Ibrahim, Wisam; Abadeh, Mohammad Saniee

    2017-05-21

    Protein fold recognition is an important problem in bioinformatics to predict the three-dimensional structure of a protein. One of the most challenging tasks in the protein fold recognition problem is the extraction of efficient features from the amino-acid sequences to obtain better classifiers. In this paper, we have proposed six descriptors to extract features from protein sequences. These descriptors are applied in the first stage of a three-stage framework, PCA-DELM-LDA, to extract feature vectors from the amino-acid sequences. Principal Component Analysis (PCA) has been implemented to reduce the number of extracted features. The extracted feature vectors have been used with original features to improve the performance of the Deep Extreme Learning Machine (DELM) in the second stage. Four new features have been extracted from the second stage and used in the third stage by Linear Discriminant Analysis (LDA) to classify the instances into 27 folds. The proposed framework is implemented on the independent and combined feature sets in SCOP datasets. The experimental results show that extracted feature vectors in the first stage could improve the performance of DELM in extracting new useful features in the second stage. Copyright © 2017 Elsevier Ltd. All rights reserved.

  7. Identification of TOEFAZ1-interacting proteins reveals key regulators of Trypanosoma brucei cytokinesis.

    PubMed

    Hilton, Nicholas A; Sladewski, Thomas E; Perry, Jenna A; Pataki, Zemplen; Sinclair-Davis, Amy N; Muniz, Richard S; Tran, Holly L; Wurster, Jenna I; Seo, Jiwon; de Graffenried, Christopher L

    2018-05-21

    The protist parasite Trypanosoma brucei is an obligate extracellular pathogen that retains its highly-polarized morphology during cell division and has evolved a novel cytokinetic process independent of non-muscle myosin II. The polo-like kinase homolog TbPLK is essential for transmission of cell polarity during division and for cytokinesis. We previously identified a putative TbPLK substrate named Tip of the Extending FAZ 1 (TOEFAZ1) as an essential kinetoplastid-specific component of the T. brucei cytokinetic machinery. We performed a proximity-dependent biotinylation (BioID) screen using TOEFAZ1 as a means to identify additional proteins that are involved in cytokinesis. Using quantitative proteomic methods, we identified nearly 500 TOEFAZ1-proximal proteins and characterized 59 in further detail. Among the candidates, we identified an essential putative phosphatase that regulates the expression level and localization of both TOEFAZ1 and TbPLK, a previously uncharacterized protein that is necessary for the assembly of a new cell posterior, and a microtubule plus-end directed orphan kinesin that is required for completing cleavage furrow ingression. The identification of these proteins provides new insight into T. brucei cytokinesis and establishes TOEFAZ1 as a key component of this essential and uniquely-configured process in kinetoplastids. This article is protected by copyright. All rights reserved. © 2018 John Wiley & Sons Ltd.

  8. Improving protein complex classification accuracy using amino acid composition profile.

    PubMed

    Huang, Chien-Hung; Chou, Szu-Yu; Ng, Ka-Lok

    2013-09-01

    Protein complex prediction approaches are based on the assumptions that complexes have dense protein-protein interactions and high functional similarity between their subunits. We investigated those assumptions by studying the subunits' interaction topology, sequence similarity and molecular function for human and yeast protein complexes. Inclusion of amino acids' physicochemical properties can provide better understanding of protein complex properties. Principal component analysis is carried out to determine the major features. Adopting amino acid composition profile information with the SVM classifier serves as an effective post-processing step for complexes classification. Improvement is based on primary sequence information only, which is easy to obtain. Copyright © 2013 Elsevier Ltd. All rights reserved.
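
    The abstract names an amino acid composition profile as the input to an SVM post-processing step but gives no formula. A minimal sketch of the usual definition, the relative frequency of each of the 20 standard residues, follows. Averaging over subunits to get one vector per complex is an assumption made here for illustration, the helper names and toy sequences are hypothetical, and the SVM training itself is left out.

```python
from collections import Counter

# The 20 standard amino acids in a fixed order, so every profile
# has the same layout and can be fed to a classifier as a feature vector.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(sequence):
    """Return the 20-dimensional relative frequency vector of a protein."""
    counts = Counter(sequence.upper())
    total = sum(counts[aa] for aa in AMINO_ACIDS)
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    # Non-standard symbols (X, B, Z, ...) are ignored rather than counted.
    return [counts[aa] / total for aa in AMINO_ACIDS]


def complex_profile(subunit_sequences):
    """Average the subunit compositions into one profile per complex.

    Averaging is an assumed aggregation, not taken from the abstract.
    """
    vectors = [composition(seq) for seq in subunit_sequences]
    return [sum(column) / len(vectors) for column in zip(*vectors)]


# Hypothetical two-subunit complex built from toy sequences.
profile = complex_profile(["MKKLLAE", "MSTAVLE"])
```

    Because the profile depends only on primary sequence, it can be computed for any candidate complex without structural or interaction data, which matches the abstract's point that the information is easy to obtain.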

  9. Identification of small peptides arising from hydrolysis of meat proteins in dry fermented sausages.

    PubMed

    López, Constanza M; Bru, Elena; Vignolo, Graciela M; Fadda, Silvina G

    2015-06-01

    In this study, proteolysis and low molecular weight (LMW) peptides (<3 kDa) from commercial Argentinean fermented sausages were characterized by applying a peptidomic approach. Protein profiles and peptides obtained by Tricine-SDS-PAGE and RP-HPLC-MS, respectively, allowed two different types of fermented sausages to be distinguished, although no specific biomarkers relating to commercial brands or quality were recognized. Electrophoresis showed that α-actin, myoglobin, creatine kinase M-type and L-lactate dehydrogenase were degraded to different extents. In addition, a partial characterization of the fermented sausage peptidome was achieved through the identification of 36 peptides, in the range of 1000-2100 Da, arising from sarcoplasmic (28) and myofibrillar (8) proteins. These peptides originated from α-actin, myoglobin, and creatine kinase M-type, but also from the hydrolysis of other proteins not previously reported. Although muscle enzymes played a major role in peptidogenesis, a microbial contribution, as postulated herein, cannot be excluded. This work represents a first peptidomic approach for fermented sausages, thereby providing a baseline to define key peptides acting as potential biomarkers. Copyright © 2015 Elsevier Ltd. All rights reserved.

  10. Whey Protein Components - Lactalbumin and Lactoferrin - Improve Energy Balance and Metabolism.

    PubMed

    Zapata, Rizaldy C; Singh, Arashdeep; Pezeshki, Adel; Nibber, Traj; Chelikani, Prasanth K

    2017-08-30

    Whey protein promotes weight loss and improves diabetic control; however, less is known of its bioactive components that produce such benefits. We compared the effects of normal protein (control) diet with high protein diets containing whey, or its fractions lactalbumin and lactoferrin, on energy balance and metabolism. Diet-induced obese rats were randomized to isocaloric diets: Control, Whey, Lactalbumin, Lactoferrin, or pair-fed to lactoferrin. Whey and lactalbumin produced transient hypophagia, whereas lactoferrin caused prolonged hypophagia; the hypophagia was likely due to decreased preference. Lactalbumin decreased weight and fat gain. Notably, lactoferrin produced sustained weight and fat loss, and attenuated the reduction in energy expenditure associated with calorie restriction. Lactalbumin and lactoferrin decreased plasma leptin and insulin, and lactalbumin increased peptide YY. Whey, lactalbumin and lactoferrin improved glucose clearance partly through differential upregulation of glucoregulatory transcripts in the liver and skeletal muscle. Interestingly, lactalbumin and lactoferrin decreased hepatic lipidosis partly through downregulation of lipogenic and/or upregulation of β-oxidation transcripts, and differentially modulated cecal bacterial populations. Our findings demonstrate that protein quantity and quality are important for improving energy balance. Dietary lactalbumin and lactoferrin improved energy balance and metabolism, and decreased adiposity, with the effects of lactoferrin being partly independent of caloric intake.

  11. Identification of Sequence Specificity of 5-Methylcytosine Oxidation by Tet1 Protein with High-Throughput Sequencing.

    PubMed

    Kizaki, Seiichiro; Chandran, Anandhakumar; Sugiyama, Hiroshi

    2016-03-02

    Tet (ten-eleven translocation) family proteins have the ability to oxidize 5-methylcytosine (mC) to 5-hydroxymethylcytosine (hmC), 5-formylcytosine (fC), and 5-carboxycytosine (caC). However, the oxidation reaction of Tet is not understood completely. Evaluation of genomic-level epigenetic changes by Tet protein requires unbiased identification of the highly selective oxidation sites. In this study, we used high-throughput sequencing to investigate the sequence specificity of mC oxidation by Tet1. A 6.6×10^4-member mC-containing random DNA-sequence library was constructed. The library was subjected to Tet-reactive pulldown followed by high-throughput sequencing. Analysis of the obtained sequence data identified the Tet1-reactive sequences. We identified mCpG as a highly reactive sequence of Tet1 protein. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  12. Identification of S-glutathionylation sites in species-specific proteins by incorporating five sequence-derived features into the general pseudo-amino acid composition.

    PubMed

    Zhao, Xiaowei; Ning, Qiao; Ai, Meiyue; Chai, Haiting; Yang, Guifu

    2016-06-07

    As a selective and reversible protein post-translational modification, S-glutathionylation generates mixed disulfides between glutathione (GSH) and cysteine residues, and plays an important role in regulating protein activity, stability, and redox regulation. To fully understand S-glutathionylation mechanisms, identification of substrates and specific S-glutathionylated sites is crucial. Experimental identification of S-glutathionylated sites is labor-intensive and time-consuming, so an effective computational method is highly desirable because of its convenience and speed. Therefore, in this study, a new bioinformatics tool named SSGlu (Species-Specific identification of Protein S-glutathionylation Sites) was developed to identify species-specific protein S-glutathionylated sites, utilizing support vector machines that combine multiple sequence-derived features with a two-step feature selection. By 5-fold cross validation, the performance of SSGlu was measured with an AUC of 0.8105 and 0.8041 for Homo sapiens and Mus musculus, respectively. Additionally, SSGlu was compared with the existing methods, and its higher MCC and AUC demonstrated that SSGlu is very promising for predicting S-glutathionylated sites. Furthermore, a site-specific analysis showed that S-glutathionylation intimately correlated with the features derived from its surrounding sites. The conclusions derived from this study might help to understand more of the S-glutathionylation mechanism and guide the related experimental validation. For public access, SSGlu is freely accessible at http://59.73.198.144:8080/SSGlu/. Copyright © 2016 Elsevier Ltd. All rights reserved.
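
    The two figures of merit quoted above, MCC and AUC, have standard definitions that can be sketched in a few lines. The code below is a generic illustration of those definitions and is not SSGlu's own evaluation code; the function names and the toy scores are hypothetical.

```python
import math


def mcc(tp, fp, tn, fn):
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention MCC is reported as 0 when any marginal count is empty.
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom


def auc(scores, labels):
    """Area under the ROC curve via pairwise ranking.

    Equals the probability that a randomly chosen positive site scores
    higher than a randomly chosen negative site (ties count one half).
    """
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical classifier scores for four candidate cysteine sites.
toy_auc = auc([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0])
```

    In a 5-fold cross validation, these metrics would be computed on each held-out fold, or on the pooled held-out predictions; the abstract does not state which variant was used.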

  13. Protein cleavage strategies for an improved analysis of the membrane proteome

    PubMed Central

    Fischer, Frank; Poetsch, Ansgar

    2006-01-01

    Background: Membrane proteins still remain elusive in proteomic studies. This is in part due to the distribution of the amino acids lysine and arginine, which are less frequent in integral membrane proteins and almost absent in transmembrane helices. As these amino acids are cleavage targets for the commonly used protease trypsin, alternative cleavage conditions, which should improve membrane protein analysis, were tested by in silico digestion for the three organisms Saccharomyces cerevisiae, Halobacterium sp. NRC-1, and Corynebacterium glutamicum as hallmarks for eukaryotes, archaea and eubacteria. Results: For the membrane proteomes from all three analyzed organisms, we identified cleavage conditions that achieve better sequence and proteome coverage than trypsin. Greater improvement was obtained for bacteria than for yeast, which was attributed to differences in protein size and GRAVY. It was demonstrated for bacteriorhodopsin that the in silico predictions agree well with the experimental observations. Conclusion: For all three examined organisms, it was found that a combination of chymotrypsin and staphylococcal peptidase I gave significantly better results than trypsin. As some of the improved cleavage conditions are not more elaborate than trypsin digestion and have been proven useful in practice, we suppose that the cleavage at both hydrophilic and hydrophobic amino acids should facilitate in general the analysis of membrane proteins for all organisms. PMID:16512920
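
    The in silico digestion described above can be sketched as follows. This is an illustration under simplified, assumed cleavage rules and not the authors' pipeline: trypsin cuts after K or R unless proline follows, chymotrypsin after F, W or Y unless proline follows, and staphylococcal peptidase I (Glu-C) after E. The peptide length window used for coverage and the toy sequence are likewise hypothetical.

```python
import re

# Assumed, simplified cleavage rules. Each zero-width pattern matches
# at the position immediately after a cleavable residue.
CLEAVAGE_RULES = {
    "trypsin": r"(?<=[KR])(?!P)",
    "chymotrypsin": r"(?<=[FWY])(?!P)",
    "glu_c": r"(?<=E)",  # staphylococcal peptidase I
}


def digest(sequence, proteases):
    """Cut a protein at every site recognized by any listed protease."""
    cut_sites = set()
    for name in proteases:
        for match in re.finditer(CLEAVAGE_RULES[name], sequence):
            # Ignore a "cut" at the very end of the chain.
            if 0 < match.start() < len(sequence):
                cut_sites.add(match.start())
    bounds = [0] + sorted(cut_sites) + [len(sequence)]
    return [sequence[a:b] for a, b in zip(bounds, bounds[1:])]


def coverage(sequence, peptides, min_len=6, max_len=30):
    """Fraction of residues that fall in peptides of a detectable length."""
    usable = sum(len(p) for p in peptides if min_len <= len(p) <= max_len)
    return usable / len(sequence)


# Hypothetical hydrophobic segment with a single C-terminal lysine:
# trypsin finds no internal site, the enzyme combination finds several.
segment = "MLLAVIFGWLAEVLLFGYAALK"
tryptic = digest(segment, ["trypsin"])
combined = digest(segment, ["chymotrypsin", "glu_c"])
```

    Comparing `coverage` for each digestion over a whole membrane proteome is the kind of calculation the abstract summarizes as sequence and proteome coverage, although the exact peptide filters the authors applied are not given here.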

  14. Fluorescent protein vectors for pancreatic islet cell identification in live-cell imaging.

    PubMed

    Shuai, Hongyan; Xu, Yunjian; Yu, Qian; Gylfe, Erik; Tengholm, Anders

    2016-10-01

    The islets of Langerhans contain different types of endocrine cells, which are crucial for glucose homeostasis. β- and α-cells that release insulin and glucagon, respectively, are most abundant, whereas somatostatin-producing δ-cells and particularly pancreatic polypeptide-releasing PP-cells are scarcer. Studies of islet cell function are hampered by difficulties in identifying the different cell types, especially in live-cell imaging experiments when immunostaining is unsuitable. The aim of the present study was to create a set of vectors for fluorescent protein expression with cell-type-specific promoters and evaluate their applicability in functional islet imaging. We constructed six adenoviral vectors for expression of red and green fluorescent proteins controlled by the insulin, preproglucagon, somatostatin, or pancreatic polypeptide promoters. After transduction of mouse and human islets or dispersed islet cells, a majority of the fluorescent cells also immunostained for the appropriate hormone. Recordings of the sub-plasma membrane Ca(2+) and cAMP concentrations with a fluorescent indicator and a protein biosensor, respectively, showed that labeled cells respond to glucose and other modulators of secretion and revealed a striking variability in Ca(2+) signaling among α-cells. The measurements allowed comparison of the phase relationship of Ca(2+) oscillations between different types of cells within intact islets. We conclude that the fluorescent protein vectors allow easy identification of specific islet cell types and can be used in live-cell imaging together with organic dyes and genetically encoded biosensors. This approach will facilitate studies of normal islet physiology and help to clarify molecular defects and disturbed cell interactions in diabetic islets.

  15. Identification of liver protein targets modified by tienilic acid metabolites using a two-dimensional Western blot-mass spectrometry approach

    NASA Astrophysics Data System (ADS)

    Methogo, Ruth Menque; Dansette, Patrick M.; Klarskov, Klaus

    2007-12-01

    A combined approach based on two-dimensional electrophoresis-immuno-blotting and nanoliquid chromatography coupled on-line with electrospray ionization mass spectrometry (nLC-MS/MS) was used to identify proteins modified by a reactive intermediate of tienilic acid (TA). Liver homogenates from rats exposed to TA were fractionated using ultracentrifugation; four fractions were obtained and subjected to 2D electrophoresis. Following transfer to PVDF membranes, modified proteins were visualized after India ink staining, using an anti-serum raised against TA and ECL detection. Immuno-reactive spots were localized on the PVDF membrane by superposition of the ECL image; protein spots of interest were excised and digested on the membrane with trypsin, followed by nLC-MS/MS analysis and protein identification. A total of 15 proteins were identified as likely targets modified by a TA reactive metabolite. These include selenium binding protein 2, senescence marker protein SMP-30, adenosine kinase, Acy1 protein, adenosylhomocysteinase, capping protein (actin filament), protein disulfide isomerase, fumarylacetoacetase, arginase chain A, ketohexokinase, proteasome endopeptidase complex, triosephosphate isomerase, superoxide dismutase, dna-type molecular chaperone hsc73 and malate dehydrogenase.

  16. Affinity purification combined with mass spectrometry to identify herpes simplex virus protein-protein interactions.

    PubMed

    Meckes, David G

    2014-01-01

    The identification and characterization of herpes simplex virus protein interaction complexes are fundamental to understanding the molecular mechanisms governing the replication and pathogenesis of the virus. Recent advances in affinity-based methods, mass spectrometry configurations, and bioinformatics tools have greatly increased the quantity and quality of protein-protein interaction datasets. In this chapter, detailed and reliable methods that can easily be implemented are presented for the identification of protein-protein interactions using cryogenic cell lysis, affinity purification, trypsin digestion, and mass spectrometry.

  17. Swirling cavitation improves the emulsifying properties of commercial soy protein isolate.

    PubMed

    Yang, Feng; Liu, Xue; Ren, Xian'e; Huang, Yongchun; Huang, Chengdu; Zhang, Kunming

    2018-04-01

    Since emulsifying properties are important functional properties of soy protein, many physical, chemical, and enzymatic methods have been applied to treat soy protein to improve emulsifying properties. In this study, we investigated the effects of swirling cavitation at different pressures and for different times on emulsifying and physicochemical properties of soy protein isolate (SPI). The SPI treated with swirling cavitation showed a significant decrease in particle size and increase in solubility. Emulsions formed from treated SPI had higher emulsifying activity and emulsifying stability indexes, smaller oil droplet sizes, lower flocculation indexes, higher adsorbed proteins, lower interfacial protein concentrations, and lower creaming indexes than those formed from untreated SPI, indicating that swirling cavitation improved the emulsifying properties of the SPI. Furthermore, swirling cavitation treatment significantly enhanced the surface hydrophobicity, altered the disulfide bond and exposed sulfhydryl group contents of the SPI. The secondary structure of the SPI was also influenced by swirling cavitation, with an increase in β-sheet content and a decrease in α-helix, β-turn, and random coil contents. In addition, several significant correlations between physicochemical and emulsifying properties were revealed by Pearson correlation analysis, suggesting that the physicochemical changes observed in treated SPI, including the decreased particle size, increased solubility and surface hydrophobicity, and enhanced β-sheet formation, may explain the improved emulsifying properties of the isolate. Thus, our findings implied that swirling cavitation treatment may be an effective technique to improve the emulsifying properties of SPI. Copyright © 2017 Elsevier B.V. All rights reserved.

  18. Identification of T1D susceptibility genes within the MHC region by combining protein interaction networks and SNP genotyping data

    PubMed Central

    Brorsson, C.; Hansen, N. T.; Lage, K.; Bergholdt, R.; Brunak, S.; Pociot, F.

    2009-01-01

    Aim To develop novel methods for identifying new genes that contribute to the risk of developing type 1 diabetes within the Major Histocompatibility Complex (MHC) region on chromosome 6, independently of the known linkage disequilibrium (LD) between human leucocyte antigen (HLA)-DRB1, -DQA1, -DQB1 genes. Methods We have developed a novel method that combines single nucleotide polymorphism (SNP) genotyping data with protein–protein interaction (ppi) networks to identify disease-associated network modules enriched for proteins encoded from the MHC region. Approximately 2500 SNPs located in the 4 Mb MHC region were analysed in 1000 affected offspring trios generated by the Type 1 Diabetes Genetics Consortium (T1DGC). The most associated SNP in each gene was chosen and genes were mapped to ppi networks for identification of interaction partners. The association testing and resulting interacting protein modules were statistically evaluated using permutation. Results A total of 151 genes could be mapped to nodes within the protein interaction network and their interaction partners were identified. Five protein interaction modules reached statistical significance using this approach. The identified proteins are well known in the pathogenesis of T1D, but the modules also contain additional candidates that have been implicated in β-cell development and diabetic complications. Conclusions The extensive LD within the MHC region makes it important to develop new methods for analysing genotyping data for identification of additional risk genes for T1D. Combining genetic data with knowledge about functional pathways provides new insight into mechanisms underlying T1D. PMID:19143816
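
    The permutation-based evaluation of protein interaction modules mentioned above can be illustrated with a generic sketch. This is not necessarily the permutation scheme used in the study: here a module's summed gene score is compared with random gene sets of the same size, and `permutation_p_value`, `gene_scores` and the default parameters are all hypothetical.

```python
import random


def permutation_p_value(module, gene_scores, n_perm=1000, seed=0):
    """Empirical p-value for the summed association score of a gene module.

    module: collection of gene names.
    gene_scores: dict mapping gene -> association score (higher = stronger).
    """
    rng = random.Random(seed)
    genes = list(gene_scores)
    observed = sum(gene_scores[g] for g in module)
    hits = 0
    for _ in range(n_perm):
        # Draw a random gene set of the same size as the module.
        sampled = rng.sample(genes, len(module))
        if sum(gene_scores[g] for g in sampled) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_perm + 1)
```

    A module would be called significant when this empirical p-value falls below a chosen threshold, ideally after correcting for the number of modules tested.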

  19. Fatty Acid-binding Proteins Interact with Comparative Gene Identification-58 Linking Lipolysis with Lipid Ligand Shuttling*

    PubMed Central

    Hofer, Peter; Boeszoermenyi, Andras; Jaeger, Doris; Feiler, Ursula; Arthanari, Haribabu; Mayer, Nicole; Zehender, Fabian; Rechberger, Gerald; Oberer, Monika; Zimmermann, Robert; Lass, Achim; Haemmerle, Guenter; Breinbauer, Rolf; Zechner, Rudolf; Preiss-Landl, Karina

    2015-01-01

    The coordinated breakdown of intracellular triglyceride (TG) stores requires the exquisitely regulated interaction of lipolytic enzymes with regulatory, accessory, and scaffolding proteins. Together they form a dynamic multiprotein network designated as the “lipolysome.” Adipose triglyceride lipase (Atgl) catalyzes the initiating step of TG hydrolysis and requires comparative gene identification-58 (Cgi-58) as a potent activator of enzyme activity. Here, we identify adipocyte-type fatty acid-binding protein (A-Fabp) and other members of the fatty acid-binding protein (Fabp) family as interaction partners of Cgi-58. Co-immunoprecipitation, microscale thermophoresis, and solid phase assays proved direct protein/protein interaction between A-Fabp and Cgi-58. Using nuclear magnetic resonance titration experiments and site-directed mutagenesis, we located a potential contact region on A-Fabp. In functional terms, A-Fabp stimulates Atgl-catalyzed TG hydrolysis in a Cgi-58-dependent manner. Additionally, transcriptional transactivation assays with a luciferase reporter system revealed that Fabps enhance the ability of Atgl/Cgi-58-mediated lipolysis to induce the activity of peroxisome proliferator-activated receptors. Our studies identify Fabps as crucial structural and functional components of the lipolysome. PMID:25953897

  20. Generic comparison of protein inference engines.

    PubMed

    Claassen, Manfred; Reiter, Lukas; Hengartner, Michael O; Buhmann, Joachim M; Aebersold, Ruedi

    2012-04-01

    Protein identifications, instead of peptide-spectrum matches, constitute the biologically relevant result of shotgun proteomics studies. How to appropriately infer and report protein identifications has triggered a still ongoing debate. This debate has so far suffered from the lack of appropriate performance measures that allow us to objectively assess protein inference approaches. This study describes an intuitive, generic and yet formal performance measure and demonstrates how it enables experimentalists to select an optimal protein inference strategy for a given collection of fragment ion spectra. We applied the performance measure to systematically explore the benefit of excluding possibly unreliable protein identifications, such as single-hit wonders. Therefore, we defined a family of protein inference engines by extending a simple inference engine by thousands of pruning variants, each excluding a different specified set of possibly unreliable identifications. We benchmarked these protein inference engines on several data sets representing different proteomes and mass spectrometry platforms. Optimally performing inference engines retained all high confidence spectral evidence, without posterior exclusion of any type of protein identifications. Despite the diversity of studied data sets consistently supporting this rule, other data sets might behave differently. In order to ensure maximal reliable proteome coverage for data sets arising in other studies we advocate abstaining from rigid protein inference rules, such as exclusion of single-hit wonders, and instead consider several protein inference approaches and assess these with respect to the presented performance measure in the specific application context.
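
    The idea of a simple inference engine with a pruning variant, such as exclusion of single-hit wonders, can be sketched as follows. This is a toy illustration, not the engine family or the performance measure from the study; `infer_proteins`, `score_cutoff` and `min_peptides` are hypothetical names and defaults.

```python
from collections import defaultdict


def infer_proteins(psms, score_cutoff=0.9, min_peptides=1):
    """Report proteins supported by confident peptide-spectrum matches.

    psms: iterable of (peptide, protein, score) tuples.
    min_peptides=2 prunes "single-hit wonders", i.e. proteins supported
    by only one distinct confident peptide.
    """
    peptides_by_protein = defaultdict(set)
    for peptide, protein, score in psms:
        if score >= score_cutoff:
            peptides_by_protein[protein].add(peptide)
    return {protein for protein, peptides in peptides_by_protein.items()
            if len(peptides) >= min_peptides}
```

    Running the same PSM list with `min_peptides=1` and `min_peptides=2` yields two inference variants whose outputs could then be compared under whatever performance measure is chosen.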

  1. Molecular Dynamics Information Improves cis-Peptide-Based Function Annotation of Proteins.

    PubMed

    Das, Sreetama; Bhadra, Pratiti; Ramakumar, Suryanarayanarao; Pal, Debnath

    2017-08-04

    cis-Peptide bonds, whose occurrence in proteins is rare but evolutionarily conserved, are implicated to play an important role in protein function. This has led to their previous use in a homology-independent, fragment-match-based protein function annotation method. However, proteins are not static molecules; dynamics is integral to their activity. This is nicely epitomized by the geometric isomerization of cis-peptide to trans form for molecular activity. Hence we have incorporated both static (cis-peptide) and dynamics information to improve the prediction of protein molecular function. Our results show that cis-peptide information alone cannot detect functional matches in cases where cis-trans isomerization exists but 3D coordinates have been obtained for only the trans isomer or when the cis-peptide bond is incorrectly assigned as trans. On the contrary, use of dynamics information alone includes false-positive matches for cases where fragments with similar secondary structure show similar dynamics, but the proteins do not share a common function. Combining the two methods reduces errors while detecting the true matches, thereby enhancing the utility of our method in function annotation. A combined approach, therefore, opens up new avenues of improving existing automated function annotation methodologies.

  2. Multi-level machine learning prediction of protein-protein interactions in Saccharomyces cerevisiae.

    PubMed

    Zubek, Julian; Tatjewski, Marcin; Boniecki, Adam; Mnich, Maciej; Basu, Subhadip; Plewczynski, Dariusz

    2015-01-01

    Accurate identification of protein-protein interactions (PPI) is the key step in understanding proteins' biological functions, which are typically context-dependent. Many existing PPI predictors rely on aggregated features from protein sequences; however, only a few methods exploit local information about specific residue contacts. In this work we present a two-stage machine learning approach for prediction of protein-protein interactions. We start with the carefully filtered data on protein complexes available for Saccharomyces cerevisiae in the Protein Data Bank (PDB) database. First, we build linear descriptions of interacting and non-interacting sequence segment pairs based on their inter-residue distances. Secondly, we train machine learning classifiers to predict binary segment interactions for any two short sequence fragments. The final prediction of the protein-protein interaction is done using the 2D matrix representation of all-against-all possible interacting sequence segments of both analysed proteins. The level-I predictor achieves 0.88 AUC for micro-scale, i.e., residue-level prediction. The level-II predictor improves the results further by a more complex learning paradigm. We perform a 30-fold macro-scale, i.e., protein-level cross-validation experiment. The level-II predictor using PSIPRED-predicted secondary structure reaches 0.70 precision, 0.68 recall, and 0.70 AUC, whereas other popular methods provide results below the 0.6 threshold (recall, precision, AUC). Our results demonstrate that the multi-scale sequence feature aggregation procedure is able to improve the machine learning results by more than 10% as compared to other sequence representations. Prepared datasets and source code for our experimental pipeline are freely available for download from: http://zubekj.github.io/mlppi/ (open source Python implementation, OS independent).
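
    The all-against-all segment matrix described in this abstract can be sketched in a few lines. The scoring function below is a placeholder standing in for a trained level-I classifier, and the maximum-based aggregation stands in for the level-II predictor; both, along with every function name, are assumptions and not the published pipeline.

```python
def segments(seq, width):
    """All overlapping windows of length `width` in `seq`."""
    return [seq[i:i + width] for i in range(len(seq) - width + 1)]


def toy_segment_score(a, b):
    """Placeholder level-I score: fraction of aligned positions with opposite charge."""
    positive, negative = set("KR"), set("DE")
    opposite = sum(
        (x in positive and y in negative) or (x in negative and y in positive)
        for x, y in zip(a, b)
    )
    return opposite / len(a)


def interaction_matrix(seq_a, seq_b, width, score_pair=toy_segment_score):
    """2D matrix of segment-pair scores (rows: seq_a windows, columns: seq_b windows)."""
    return [[score_pair(a, b) for b in segments(seq_b, width)]
            for a in segments(seq_a, width)]


def protein_level_score(matrix):
    """Toy level-II aggregation: best segment-pair score, 0.0 if the matrix is empty."""
    return max((s for row in matrix for s in row), default=0.0)
```

    In a real two-level setup, the matrix would be summarized into features for a second classifier rather than reduced to its maximum.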

  3. Identification of a novel homolog of the Drosophila staufen protein in the chromosome 8q13-q21.1 region.

    PubMed

    Buchner, G; Bassi, M T; Andolfi, G; Ballabio, A; Franco, B

    1999-11-15

    We report the identification of a new transcript homologous to the Drosophila staufen protein. This transcript, named STAU2 (HGMW-approved gene symbol and name), maps to the chromosome 8q13-q21 region. The full-length STAU2 cDNA is 4058 bp and contains an open reading frame of 479 amino acids. Analysis of the predicted protein product indicated the presence of three double-stranded RNA-binding domains. Best-fit analysis revealed a 48.5% similarity to the Drosophila protein and a 59.9% similarity to the recently described mammalian homolog hStau, indicating that at least two different transcripts with homologies to the fly protein are present in mammals. Copyright 1999 Academic Press.

  4. Rapid identification and classification of Mycobacterium spp. using whole-cell protein barcodes with matrix assisted laser desorption ionization time of flight mass spectrometry in comparison with multigene phylogenetic analysis.

    PubMed

    Wang, Jun; Chen, Wen Feng; Li, Qing X

    2012-02-24

    The need for rapid diagnostics and the increasing number of bacterial species isolated necessitate the development of a rapid and effective phenotypic identification method. Mass spectrometry (MS) profiling of whole-cell proteins has the potential to satisfy these requirements. The genus Mycobacterium contains more than 154 species that are taxonomically very close and require the use of multiple genes, including 16S rDNA, for phylogenetic identification and classification. Six strains of five Mycobacterium species were selected as model bacteria in the present study because of their 16S rDNA similarity (98.4-99.8%) and the high similarity of the concatenated 16S rDNA, rpoB and hsp65 gene sequences (95.9-99.9%), requiring high identification resolution. The classification of the six strains by MALDI TOF MS protein barcodes was consistent with, but at much higher resolution than, that of the multi-locus sequence analysis using 16S rDNA, rpoB and hsp65. The species were well differentiated using MALDI TOF MS and MALDI BioTyper™ software after quick preparation of whole-cell proteins. Several proteins were selected as diagnostic markers for species confirmation. An integration of MALDI TOF MS, MALDI BioTyper™ software and diagnostic protein fragments provides a robust phenotypic approach for bacterial identification and classification. Copyright © 2011 Elsevier B.V. All rights reserved.

  5. HFIP Extraction Followed by 2D CTAB/SDS-PAGE Separation: A New Methodology for Protein Identification from Tissue Sections after MALDI Mass Spectrometry Profiling for Personalized Medicine Research

    PubMed Central

    Longuespée, Rémi; Tastet, Christophe; Desmons, Annie; Kerdraon, Olivier; Day, Robert

    2014-01-01

    Matrix-assisted laser desorption ionization mass spectrometry imaging (MALDI-MSI) and profiling technology have become the easiest methods for quickly accessing the protein composition of a tissue area. Unfortunately, the demand for the identification of these proteins remains unmet. To overcome this bottleneck, we combined several strategies to identify the proteins detected via MALDI profiling including on-tissue protein extraction using hexafluoroisopropanol (1,1,1,3,3,3-hexafluoro-2-propanol, HFIP) coupled with two-dimensional cetyl trimethylammonium bromide/sodium dodecyl sulfate–polyacrylamide gel electrophoresis (2D CTAB/SDS-PAGE) for separation followed by trypsin digestion and MALDI-MS analyses for identification. This strategy was compared with an on-tissue bottom-up strategy that we previously developed. The data reflect the complementarity of the approaches. An increase in the number of specific proteins identified has been established. This approach demonstrates the potential of adapted extraction procedures and the combination of parallel identification approaches for personalized medicine applications. The anatomical context provides important insight into identifying biomarkers and may be considered a first step for tissue-based biomarker research, as well as the extemporaneous examination of biopsies during surgery. PMID:24841221

  6. A novel strategy for global mapping of O-GlcNAc proteins and peptides using selective enzymatic deglycosylation, HILIC enrichment and mass spectrometry identification.

    PubMed

    Shen, Bingquan; Zhang, Wanjun; Shi, Zhaomei; Tian, Fang; Deng, Yulin; Sun, Changqing; Wang, Guangshun; Qin, Weijie; Qian, Xiaohong

    2017-07-01

    O-GlcNAcylation is a kind of dynamic O-linked glycosylation of nucleocytoplasmic and mitochondrial proteins. It serves as a major nutrient sensor to regulate numerous biological processes including transcriptional regulation, cell metabolism, cellular signaling, and protein degradation. Dysregulation of cellular O-GlcNAcylation levels contributes to the etiologies of many diseases such as diabetes, neurodegenerative disease and cancer. However, deeper insight into the biological mechanism of O-GlcNAcylation is hampered by its extremely low stoichiometry and the lack of efficient enrichment approaches for large-scale identification by mass spectrometry. Herein, we developed a novel strategy for the global identification of O-GlcNAc proteins and peptides using selective enzymatic deglycosylation, HILIC enrichment and mass spectrometry analysis. Standard O-GlcNAc peptides can be efficiently enriched even in the presence of 500-fold more abundant non-O-GlcNAc peptides and identified by mass spectrometry with a low nanogram detection sensitivity. This strategy successfully achieved the first large-scale enrichment and characterization of O-GlcNAc proteins and peptides in human urine. A total of 474 O-GlcNAc peptides corresponding to 457 O-GlcNAc proteins were identified by mass spectrometry analysis, which is at least three times more than that obtained by commonly used enrichment methods. A large number of unreported O-GlcNAc proteins related to cell cycle, biological regulation, metabolic and developmental process were found in our data. The above results demonstrated that this novel strategy is highly efficient in the global enrichment and identification of O-GlcNAc peptides. These data provide new insights into the biological function of O-GlcNAcylation in human urine, which is correlated with the physiological states and pathological changes of human body and therefore indicate the potential of this strategy for biomarker discovery from human urine.

  7. Rational identification of aggregation hotspots based on secondary structure and amino acid hydrophobicity.

    PubMed

    Matsui, Daisuke; Nakano, Shogo; Dadashipour, Mohammad; Asano, Yasuhisa

    2017-08-25

    Insolubility of proteins expressed in the Escherichia coli expression system hinders the progress of both basic and applied research. Insoluble proteins contain residues that decrease their solubility (aggregation hotspots). Mutating these hotspots to optimal amino acids is expected to improve protein solubility. To date, however, the identification of these hotspots has proven difficult. In this study, using a combination of approaches involving directed evolution and primary sequence analysis, we found two rules to help inductively identify hotspots: the α-helix rule, which focuses on the hydrophobicity of amino acids in the α-helix structure, and the hydropathy contradiction rule, which focuses on the difference in hydrophobicity relative to the corresponding amino acid in the consensus protein. By properly applying these two rules, we succeeded in improving the probability that expressed proteins would be soluble. Our methods should facilitate research on various insoluble proteins that were previously difficult to study due to their low solubility.
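
    The hydropathy contradiction rule, as summarized above, compares each residue with the corresponding consensus residue. A minimal sketch follows, assuming the Kyte-Doolittle hydropathy scale and an arbitrary difference threshold; the abstract does not state which scale or cutoff the authors used, so both are assumptions, as is the function name.

```python
# Kyte-Doolittle hydropathy index per amino acid (one-letter codes).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_contradictions(target, consensus, threshold=3.0):
    """Flag aligned positions where target and consensus hydropathy differ strongly.

    Returns (index, target_residue, consensus_residue) tuples. Gap or unknown
    characters are skipped. The threshold is an illustrative assumption.
    """
    if len(target) != len(consensus):
        raise ValueError("sequences must be aligned to the same length")
    flagged = []
    for i, (t, c) in enumerate(zip(target, consensus)):
        if t in KD and c in KD and abs(KD[t] - KD[c]) >= threshold:
            flagged.append((i, t, c))
    return flagged
```

    Flagged positions would be candidates for mutation toward the consensus residue; whether a given substitution actually improves solubility still has to be tested experimentally.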

  8. Identification of increased amounts of eppin protein complex components in sperm cells of diabetic and obese individuals by difference gel electrophoresis.

    PubMed

    Paasch, Uwe; Heidenreich, Falk; Pursche, Theresia; Kuhlisch, Eberhard; Kettner, Karina; Grunewald, Sonja; Kratzsch, Jürgen; Dittmar, Gunnar; Glander, Hans-Jürgen; Hoflack, Bernard; Kriegel, Thomas M

    2011-08-01

    Metabolic disorders like diabetes mellitus and obesity may compromise the fertility of men and women. To unveil disease-associated proteomic changes potentially affecting male fertility, the proteomes of sperm cells from type-1 diabetic, type-2 diabetic, non-diabetic obese and clinically healthy individuals were comparatively analyzed by difference gel electrophoresis. The adaptation of a general protein extraction procedure to the solubilization of proteins from sperm cells allowed for the resolution of 3187 fluorescent spots in the difference gel electrophoresis image of the master gel, which contained the entirety of solubilized sperm proteins. Comparison of the pathological and reference proteomes by applying an average abundance ratio setting of 1.6 and a p ≤ 0.05 criterion resulted in the identification of 79 fluorescent spots containing proteins that were present at significantly changed levels in the sperm cells. Biometric evaluation of the fluorescence data followed by mass spectrometric protein identification revealed altered levels of 12, 71, and 13 protein species in the proteomes of the type-1 diabetic, type-2 diabetic, and non-diabetic obese patients, respectively, with considerably enhanced amounts of the same set of one molecular form of semenogelin-1, one form of clusterin, and two forms of lactotransferrin in each group of pathologic samples. Remarkably, β-galactosidase-1-like protein was the only protein that was detected at decreased levels in all three pathologic situations. The former three proteins are part of the eppin (epididymal proteinase inhibitor) protein complex, which is thought to fulfill fertilization-related functions, such as ejaculate sperm protection, motility regulation and gain of competence for acrosome reaction, whereas the putative role of the latter protein to function as a glycosyl hydrolase during sperm maturation remains to be explored at the protein/enzyme level. The strikingly similar differences detected in the

  9. The plastid ribosomal proteins. Identification of all the proteins in the 30 S subunit of an organelle ribosome (chloroplast).

    PubMed

    Yamaguchi, K; von Knoblauch, K; Subramanian, A R

    2000-09-15

    Identification of all the protein components of a plastid (chloroplast) ribosomal 30 S subunit has been achieved, using two-dimensional gel electrophoresis, high performance liquid chromatography purification, N-terminal sequencing, polymerase chain reaction-based screening of cDNA library, nucleotide sequencing, and mass spectrometry (electrospray ionization, matrix-assisted laser desorption/ionization time-of-flight, and reversed-phase HPLC coupled with electrospray ionization mass spectrometry). 25 proteins were identified, of which 21 are orthologues of all Escherichia coli 30 S ribosomal proteins (S1-S21), and 4 are plastid-specific ribosomal proteins (PSRPs) that have no homologues in the mitochondrial, archaebacterial, or cytosolic ribosomal protein sequences in databases. 12 of the 25 plastid 30 S ribosomal proteins (PRPs) are encoded in the plastid genome, whereas the remaining 13 are encoded by the nuclear genome. Post-translational transit peptide cleavage sites for the maturation of the 13 cytosolically synthesized PRPs, and post-translational N-terminal processing in the maturation of the 12 plastid synthesized PRPs are described. Post-translational modifications in several PRPs were observed: alpha-N-acetylation of S9, N-terminal processings leading to five mature forms of S6 and two mature forms of S10, C-terminal and/or internal modifications in S1, S14, S18, and S19, leading to two distinct forms differing in mass and/or charge (the corresponding modifications are not observed in E. coli). The four PSRPs in spinach plastid 30 S ribosomal subunit (PSRP-1, 26.8 kDa, pI 6.2; PSRP-2, 21.7 kDa, pI 5.0; PSRP-3, 13.8 kDa, pI 4.9; PSRP-4, 5.2 kDa, pI 11.8) comprise 16% (67.6 kDa) of the total protein mass of the 30 S subunit (429.3 kDa). PSRP-1 and PSRP-3 show sequence similarities with hypothetical photosynthetic bacterial proteins, indicating their possible origins in photosynthetic bacteria. We propose the hypothesis that PSRPs form a "plastid

  10. The on-bead digestion of protein corona on nanoparticles by trypsin immobilized on the magnetic nanoparticle.

    PubMed

    Hu, Zhengyan; Zhao, Liang; Zhang, Hongyan; Zhang, Yi; Wu, Ren'an; Zou, Hanfa

    2014-03-21

    Proteins interacting with nanoparticles form protein coronas on the nanoparticle surface in biological systems, which critically affect the biological identity of the nanoparticles and/or result in physiological and pathological consequences. Enzymatic digestion of the protein corona is the primary step in identifying its protein components in bottom-up proteomic approaches. In this study, tryptic digestion of the protein corona by trypsin immobilized on a magnetic nanoparticle was investigated for the first time. Compared with the usual overnight digestion and the severe self-digestion of free trypsin, on-bead digestion of the protein corona by the immobilized trypsin could be accomplished within 1 h, with significantly reduced trypsin self-digestion and improved reproducibility of protein identification by the mass spectrometry-based proteomic approach. The number of identified bovine serum (BS) proteins on the commercial Fe3O4 nanoparticles increased by 13% for the immobilized trypsin with 1 h digestion compared with free trypsin with overnight digestion. In addition, on-bead digestion using the immobilized trypsin was applied to the identification of the human plasma protein corona on the commercial Fe3O4 nanoparticles, which led to efficient digestion of the human plasma proteins and the identification of 149 human plasma proteins corresponding to putative critical pathways and biological processes. Copyright © 2014 Elsevier B.V. All rights reserved.

  11. Do recommended protein intakes improve neurodevelopment in extremely preterm babies?

    PubMed

    Cester, E A; Bloomfield, F H; Taylor, J; Smith, S; Cormack, B E

    2015-05-01

    To determine whether achieving recommended protein intakes for extremely low birthweight (ELBW; birth weight <1000 g) babies, resulting in better growth, improves neurodevelopmental outcomes. A prospective cohort study of ELBW babies before and after the introduction of a new nutritional policy designed to meet international consensus protein recommendations. Forty-five children born 'before' and 42 born 'after' the policy change were assessed at 2 years' corrected age (CA). Associations between nutritional intakes, growth and neurodevelopmental outcome (Bayley Scales of Infant and Toddler Development, Third edition (Bayley-III), motor and sensory impairment) were assessed using univariate and multivariate analyses. Bayley-III cognitive (mean (SD) 96 (12) vs 96 (15)), motor (96 (13) vs 95 (15)) or language scores (89 (11) vs 91 (17)) were not different between the 'before' and 'after' cohorts. In the 'before' cohort, motor scores were positively associated with enteral nutrition intakes and growth velocity. Neither were sensory impairments different between groups (visual impairment 4 vs 2, hearing impairment 2 vs 0) nor was the gross motor function classification score (any cerebral palsy 2 vs 1). In this prospective cohort study, increasing intravenous and enteral protein intakes to recommended levels in the first month after birth was not associated with improved cognitive, language or motor scores or decreased sensory impairments at 2 years' CA despite significantly improved early growth and reduced postnatal faltering growth. Appropriate randomised controlled trials are needed to answer definitively whether higher early protein intakes improve neurodevelopmental outcome in this population. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.

  12. Insights from quantitative metaproteomics and protein-stable isotope probing into microbial ecology.

    PubMed

    von Bergen, Martin; Jehmlich, Nico; Taubert, Martin; Vogt, Carsten; Bastida, Felipe; Herbst, Florian-Alexander; Schmidt, Frank; Richnow, Hans-Hermann; Seifert, Jana

    2013-10-01

    The recent development of metaproteomics has enabled the direct identification and quantification of expressed proteins from microbial communities in situ, without the need for microbial enrichment. This became possible by (1) significant increases in quality and quantity of metagenome data and by improvements of (2) accuracy and (3) sensitivity of modern mass spectrometers (MS). The identification of physiologically relevant enzymes can help to understand the role of specific species within a community or an ecological niche. Besides identification, relative and absolute quantitation is also crucial. We will review label-free and label-based methods of quantitation in MS-based proteome analysis and the contribution of quantitative proteome data to microbial ecology. Additionally, approaches of protein-based stable isotope probing (protein-SIP) for deciphering community structures are reviewed. Information on the species-specific metabolic activity can be obtained when substrates or nutrients are labeled with stable isotopes in a protein-SIP approach. The stable isotopes ((13)C, (15)N, (36)S) are incorporated into proteins and the rate of incorporation can be used for assessing the metabolic activity of the corresponding species. We will focus on the relevance of the metabolic and phylogenetic information retrieved with protein-SIP studies and for detecting and quantifying the carbon flux within microbial consortia. Furthermore, the combination of protein-SIP with established tools in microbial ecology, such as other stable isotope probing techniques, is discussed.

  13. Identification of Open Stomata1-Interacting Proteins Reveals Interactions with Sucrose Non-fermenting1-Related Protein Kinases2 and with Type 2A Protein Phosphatases That Function in Abscisic Acid Responses

    DOE PAGES

    Waadt, Rainer; Manalansan, Bianca; Rauniyar, Navin; ...

    2015-09-04

    The plant hormone abscisic acid (ABA) controls growth and development and regulates plant water status through an established signaling pathway. In the presence of ABA, pyrabactin resistance/regulatory component of ABA receptor proteins inhibit type 2C protein phosphatases (PP2Cs). This, in turn, enables the activation of Sucrose Nonfermenting1-Related Protein Kinases2 (SnRK2). Open Stomata1 (OST1)/SnRK2.6/SRK2E is a major SnRK2-type protein kinase responsible for mediating ABA responses. Arabidopsis (Arabidopsis thaliana) expressing an epitope-tagged OST1 in the recessive ost1-3 mutant background was used for the copurification and identification of OST1-interacting proteins after osmotic stress and ABA treatments. These analyses, which were confirmed using bimolecular fluorescence complementation and coimmunoprecipitation, unexpectedly revealed homo- and heteromerization of OST1 with SnRK2.2, SnRK2.3, OST1, and SnRK2.8. Furthermore, several OST1-complexed proteins were identified as type 2A protein phosphatase (PP2A) subunits and as proteins involved in lipid and galactolipid metabolism. More detailed analyses suggested an interaction network between ABA-activated SnRK2-type protein kinases and several PP2A-type protein phosphatase regulatory subunits. pp2a double mutants exhibited a reduced sensitivity to ABA during seed germination and stomatal closure and an enhanced ABA sensitivity in root growth regulation. Our analyses add PP2A-type protein phosphatases as another class of protein phosphatases to the interaction network of SnRK2-type protein kinases.

  14. Optical monitoring of proteins at solid interfaces

    NASA Astrophysics Data System (ADS)

    Dunne, G.; McDonnell, L.; Miller, R.; McMillan, N. D.; O'Rourke, B.; Mitchell, C. I.

    2005-06-01

    The adsorption properties of polymers are of great importance for implant studies. A better understanding of these properties can lead to improved implant materials. In this study the surface energy of different polymers was derived from contact angle measurements taken using profile analysis tensiometry (PAT) of sessile drops of water. The contact angles were measured for advancing and receding water drops on polished polymer surfaces and also on polymer surfaces modified by adsorbing protein to the surface prior to analysis of the sessile drop. The protein used was bovine serum albumin (BSA) and the surfaces were poly-methylmethacrylate (PMMA), poly-ether-ether-ketone (PEEK) and stainless steel. The polymer surfaces were also studied using atomic force microscopy (AFM). Images of the surfaces were taken in different states: rough, smooth and with albumin adsorbed. To make the proteins on the surface easier to identify, anti-albumin antibodies with 30 nm nano gold particles attached were adsorbed to the albumin on the surfaces. The nano gold particles made the imaging more straightforward and thus made the protein on the surface easier to identify. The results from this work show the differing hydrophobicities of polymer surfaces under different conditions and a new nanotechnological method of protein identification.

  15. Cell-Free Expression and In Situ Immobilization of Parasite Proteins from Clonorchis sinensis for Rapid Identification of Antigenic Candidates

    PubMed Central

    Ju, Jung Won; Kim, Ho-Cheol; Shin, Hyun-Il; Kim, Yu Jung; Kim, Dong-Myung

    2015-01-01

    Progress towards genetic sequencing of human parasites has provided the groundwork for a post-genomic approach to develop novel antigens for the diagnosis and treatment of parasite infections. To fully utilize the genomic data, however, high-throughput methodologies are required for functional analysis of the proteins encoded in the genomic sequences. In this study, we investigated cell-free expression and in situ immobilization of parasite proteins as a novel platform for the discovery of antigenic proteins. PCR-amplified parasite DNA was immobilized on microbeads that were also functionalized to capture synthesized proteins. When the microbeads were incubated in a reaction mixture for cell-free synthesis, proteins expressed from the microbead-immobilized DNA were instantly immobilized on the same microbeads, providing a physical linkage between the genetic information and encoded proteins. This approach of in situ expression and isolation enables streamlined recovery and analysis of cell-free synthesized proteins and also allows facile identification of the genes coding antigenic proteins through direct PCR of the microbead-bound DNA. PMID:26599101

  16. Mapping protein-protein interactions using yeast two-hybrid assays.

    PubMed

    Mehla, Jitender; Caufield, J Harry; Uetz, Peter

    2015-05-01

    Yeast two-hybrid (Y2H) screens are an efficient system for mapping protein-protein interactions and whole interactomes. The screens can be performed using random libraries or collections of defined open reading frames (ORFs) called ORFeomes. This protocol describes both library and array-based Y2H screening, with an emphasis on array-based assays. Array-based Y2H is commonly used to test a number of "prey" proteins for interactions with a single "bait" (target) protein or pool of proteins. The advantage of this approach is the direct identification of interacting protein pairs without further downstream experiments: The identity of the preys is known and does not require further confirmation. In contrast, constructing and screening a random prey library requires identification of individual prey clones and systematic retesting. Retesting is typically performed in an array format. © 2015 Cold Spring Harbor Laboratory Press.

  17. Effects of protein transduction domain (PTD) selection and position for improved intracellular delivery of PTD-Hsp27 fusion protein formulations.

    PubMed

    Ul Ain, Qurrat; Lee, Jong Hwan; Woo, Young Sun; Kim, Yong-Hee

    2016-09-01

    Protein drugs have attracted considerable attention as therapeutic agents due to their diversity and biocompatibility. However, hydrophilic proteins have difficulty penetrating the lipophilic cell membrane. Although protein transduction domains (PTDs) have shown effectiveness in protein delivery, the importance of selection and position of PTDs in recombinant protein vector constructs has not been investigated. This study investigates the significance of PTD selection and position for therapeutic protein delivery. Heat shock protein 27 (Hsp27) would be a therapeutic protein for the treatment of ischemic heart diseases, but on its own it is insufficient to prevent systemic degradation and overcome biochemical barriers during cellular transport. Among all PTD-Hsp27 fusion proteins we cloned, Tat-Hsp27 fusion protein showed the highest efficacy. Nona-arginine (9R) conjugation to the N-terminal of Hsp27 (Hsp27-T) showed higher efficacy than conjugation to the C-terminal. To test the synergistic effect of two PTDs, Tat was inserted at the N-terminal of Hsp27-9R. Tat-Hsp27-9R exhibited enhanced transduction efficiency and significant improvement against oxidative stress and apoptosis. PTD-Hsp27 fusion proteins have strong potential to be developed as therapeutic proteins for the treatment of ischemic heart diseases. The selection and position of PTDs for improved efficacy of PTD-fusion proteins need to be optimized considering the protein's nature, transduction efficiency and stability.

  18. Nanobiocatalysis for protein digestion in proteomic analysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kim, Jungbae; Kim, Byoung Chan; Lopez-Ferrer, Daniel

    2010-02-01

    The process of protein digestion is a critical step for successful protein identification in bottom-up proteomic analysis. To replace the present practice of in-solution protein digestion, which is long, tedious, and difficult to automate, much effort has been dedicated to the development of a rapid, recyclable and automated digestion system. Recent advances in nanobiocatalytic approaches have improved the performance of protein digestion by using various nanomaterials such as nanoporous materials, magnetic nanoparticles, and polymer nanofibers. In particular, the unprecedented success of trypsin stabilization in the form of trypsin-coated nanofibers, showing no activity decrease under repeated uses for one year and retaining good resistance to proteolysis, has demonstrated its great potential to be employed in the development of automated, high-throughput, and on-line digestion systems. This review discusses recent developments of nanobiocatalytic approaches for the improved performance of protein digestion in speed, detection sensitivity, recyclability, and trypsin stability. In addition, we also introduce protein digestion under unconventional energy inputs for protein denaturation and the development of microfluidic enzyme reactors that can benefit from recent successes of these nanobiocatalytic approaches.

  19. Top-down proteomic identification of bacterial protein biomarkers and toxins using MALDI-TOF-TOF-MS/MS and post-source decay

    USDA-ARS's Scientific Manuscript database

    Matrix-assisted laser desorption/ionization time-of-flight-time-of-flight mass spectrometry (MALDI-TOF-TOF-MS) has provided new capabilities for the rapid identification of digested and non-digested proteins. The tandem (MS/MS) capability of TOF-TOF instruments allows precursor ion selection/isolation...

  20. Identification of a "glycine-loop"-like coiled structure in the 34 AA Pro,Gly,Met repeat domain of the biomineral-associated protein, PM27.

    PubMed

    Wustman, Brandon A; Santos, Rudolpho; Zhang, Bo; Evans, John Spencer

    2002-12-05

    Fracture resistance in biomineralized structures has been linked to the presence of proteins, some of which possess sequences that are associated with elastic behavior. One such protein superfamily, the Pro,Gly-rich sea urchin intracrystalline spicule matrix proteins, form protein-protein supramolecular assemblies that modify the microstructure and fracture-resistant properties of the calcium carbonate mineral phase within embryonic sea urchin spicules and adult sea urchin spines. In this report, we detail the identification of a repetitive keratin-like "glycine-loop"- or coil-like structure within the 34-AA (AA: amino acid) N-terminal domain, (PGMG)(8)PG, of the spicule matrix protein, PM27. The identification of this repetitive structural motif was accomplished using two capped model peptides: a 9-AA sequence, GPGMGPGMG, and a 34-AA peptide representing the entire motif. Using CD, NMR spectrometry, and molecular dynamics simulated annealing/minimization simulations, we have determined that the 9-AA model peptide adopts a loop-like structure at pH 7.4. The structure of the 34-AA polypeptide resembles a coil structure consisting of repeating loop motifs that do not exhibit long-range ordering. Given that loop structures have been associated with protein elastic behavior and protein motion, it is plausible that the 34-AA Pro,Gly,Met repeat sequence motif in PM27 represents a putative elastic or mobile domain. Copyright 2002 Wiley Periodicals, Inc.

  1. Computational identification of binding energy hot spots in protein-RNA complexes using an ensemble approach.

    PubMed

    Pan, Yuliang; Wang, Zixiang; Zhan, Weihua; Deng, Lei

    2018-05-01

    Identifying RNA-binding residues, especially energetically favored hot spots, can provide valuable clues for understanding the mechanisms and functional importance of protein-RNA interactions. Yet, the limited availability of experimentally recognized energy hot spots in protein-RNA crystal structures makes it difficult to develop empirical identification approaches. Computational prediction of RNA-binding hot spot residues is still in its infancy. Here, we describe a computational method, PrabHot (Prediction of protein-RNA binding hot spots), that can effectively detect hot spot residues on protein-RNA binding interfaces using an ensemble of conceptually different machine learning classifiers. Residue interaction network features and new solvent exposure characteristics are combined and selected for classification with the Boruta algorithm. In particular, two new reference datasets (benchmark and independent) have been generated containing 107 hot spots from 47 known protein-RNA complex structures. In 10-fold cross-validation on the training dataset, PrabHot achieves promising performance with an AUC score of 0.86 and a sensitivity of 0.78, which are significantly better than those of the pioneer RNA-binding hot spot prediction method HotSPRing. We also demonstrate the capability of our proposed method on the independent test dataset and gain a competitive advantage as a result. The PrabHot webserver is freely available at http://denglab.org/PrabHot/. Contact: leideng@csu.edu.cn. Supplementary data are available at Bioinformatics online.

  2. Basophile: Accurate Fragment Charge State Prediction Improves Peptide Identification Rates

    DOE PAGES

    Wang, Dong; Dasari, Surendra; Chambers, Matthew C.; ...

    2013-03-07

    In shotgun proteomics, database search algorithms rely on fragmentation models to predict fragment ions that should be observed for a given peptide sequence. The most widely used strategy (Naive model) is oversimplified, cleaving all peptide bonds with equal probability to produce fragments of all charges below that of the precursor ion. More accurate models, based on fragmentation simulation, are too computationally intensive for on-the-fly use in database search algorithms. We have created an ordinal-regression-based model called Basophile that takes fragment size and basic residue distribution into account when determining the charge retention during CID/higher-energy collision induced dissociation (HCD) of charged peptides. This model improves the accuracy of predictions by reducing the number of unnecessary fragments that are routinely predicted for highly-charged precursors. Basophile increased the identification rates by 26% (on average) over the Naive model, when analyzing triply-charged precursors from ion trap data. Basophile achieves simplicity and speed by solving the prediction problem with an ordinal regression equation, which can be incorporated into any database search software for shotgun proteomic identification.
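
    The Naive model described in this abstract can be illustrated with a short sketch. This is not the authors' code and not Basophile itself: the function name `naive_fragments`, the restriction to b- and y-type ions, and the choice to allow a singly charged fragment even for a singly charged precursor are all assumptions made for illustration. The residue masses are standard monoisotopic values.

```python
# Monoisotopic residue masses (Da) of the 20 standard amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
PROTON = 1.007276   # mass of a proton (Da)
WATER = 18.010565   # mass of H2O (Da), carried by y-type ions


def naive_fragments(peptide, precursor_charge):
    """Enumerate b/y fragment ions the way a naive model would.

    Every peptide bond is cleaved, and every fragment is predicted at
    every charge below the precursor charge (assumption: at least 1+).
    Returns a list of (ion_label, charge, m/z) tuples.
    """
    max_charge = max(1, precursor_charge - 1)
    fragments = []
    for cut in range(1, len(peptide)):
        # N-terminal (b) and C-terminal (y) neutral fragment masses.
        b_mass = sum(RESIDUE_MASS[r] for r in peptide[:cut])
        y_mass = sum(RESIDUE_MASS[r] for r in peptide[cut:]) + WATER
        for z in range(1, max_charge + 1):
            fragments.append((f"b{cut}", z, (b_mass + z * PROTON) / z))
            fragments.append(
                (f"y{len(peptide) - cut}", z, (y_mass + z * PROTON) / z)
            )
    return fragments


ions = naive_fragments("PEPTIDE", 3)
```

    For a triply charged 7-residue peptide the sketch predicts every b and y ion at both 1+ and 2+, regardless of fragment size or basic residue content. This is the kind of over-prediction the abstract says Basophile is designed to reduce.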

  3. Protein-Nanoparticle Interactions: Improving Immobilized Lytic Enzyme Activity and Surface Energy Effects

    NASA Astrophysics Data System (ADS)

    Downs, Emily Elizabeth

    Protein-nanostructure conjugates, particularly particles, are a subject of significant interest due to changes in their fundamental behavior compared to bulk surfaces. As the size scales of nanostructured materials and proteins are of the same order of magnitude, nanomaterial properties can heavily influence how proteins adsorb and conform to the surface. Previous work has demonstrated the ability of nanoscale surfaces to modulate protein activity, conformation, and retention by modifying the particle surface curvature, morphology, and surface charge. This work has improved our understanding of protein-material interactions, but a complete understanding is still lacking. The goal of this thesis is to investigate two missing areas of understanding using two distinct systems. The first system utilizes a particle with controlled surface energy to observe the impact of surface energy on protein-particle interactions, while the second system uses a modified Listeria-specific protein to determine how protein structure and flexibility affect protein adsorption and activity on particles. Spherical, amorphous, and uniformly doped Zn-silica particles with tailored surface energies were synthesized to understand the impact of surface energy on protein adsorption behavior. Particle surface energy increased with a decrease in particle size and greater dopant concentrations. Protein adsorption and structural loss increased with both particle size and particle surface energy. Higher surface energies promoted protein-particle association and increased protein unfolding. Particle curvature and protein steric hindrance effects limited adsorption and structural loss on smaller particles. Protein surface charge heterogeneity was also found to be linked to both protein adsorption and unfolding behavior on larger particles. Greater surface charge heterogeneity led to higher adsorption concentrations and multilayer formation. These multilayers transitioned from protein

  4. IEF-2DE Analysis and Protein Identification

    USDA-ARS's Scientific Manuscript database

    Isoelectric focusing followed by SDS-PAGE (IEF-2DE) separates proteins in a two-dimensional matrix of protein pI (Protein Isoelectric Point) and molecular weight (MW). The technique is particularly useful to distinguish protein isoforms (Radwan et al., 2012) and proteins that contain post-translatio...

  5. HIGH-THROUGHPUT IDENTIFICATION OF CATALYTIC REDOX-ACTIVE CYSTEINE RESIDUES

    EPA Science Inventory

    Cysteine (Cys) residues often play critical roles in proteins; however, identification of their specific functions has been limited to case-by-case experimental approaches. We developed a procedure for high-throughput identification of catalytic redox-active Cys in proteins by se...

  6. Improving protein-protein interactions prediction accuracy using protein evolutionary information and relevance vector machine model.

    PubMed

    An, Ji-Yong; Meng, Fan-Rong; You, Zhu-Hong; Chen, Xing; Yan, Gui-Ying; Hu, Ji-Pu

    2016-10-01

    Predicting protein-protein interactions (PPIs) is a challenging task and essential to constructing protein interaction networks, which are important for facilitating our understanding of the mechanisms of biological systems. Although a number of high-throughput technologies have been proposed to predict PPIs, they have unavoidable shortcomings, including high cost, time intensity, and inherently high false positive rates. For these reasons, many computational methods have been proposed for predicting PPIs. However, the problem is still far from being solved. In this article, we propose a novel computational method called RVM-BiGP that combines the relevance vector machine (RVM) model and Bi-gram Probabilities (BiGP) for PPI detection from protein sequences. The major improvements include: (1) protein sequences are represented using the Bi-gram Probabilities (BiGP) feature representation on a Position Specific Scoring Matrix (PSSM), in which the protein evolutionary information is contained; (2) to reduce the influence of noise, the Principal Component Analysis (PCA) method is used to reduce the dimension of the BiGP vector; (3) the powerful and robust Relevance Vector Machine (RVM) algorithm is used for classification. Five-fold cross-validation experiments on yeast and Helicobacter pylori datasets achieved very high accuracies of 94.57 and 90.57%, respectively. Experimental results are significantly better than previous methods. To further evaluate the proposed method, we compare it with the state-of-the-art support vector machine (SVM) classifier on the yeast dataset. The experimental results demonstrate that our RVM-BiGP method is significantly better than the SVM-based method. In addition, we achieved 97.15% accuracy on the imbalanced yeast dataset, which is higher than that on the balanced yeast dataset. The promising experimental results show the efficiency and robustness of the proposed method, which can be an automatic decision support tool for future
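
    Step (1) of this abstract, the bi-gram feature built on a PSSM, can be sketched as follows. The abstract does not give the formula, so this assumes the commonly used definition in which entry (m, n) sums, over consecutive sequence positions i, the product of the PSSM value for residue type m at position i and for residue type n at position i+1. The function name `bigram_features` is hypothetical, and the PCA and RVM steps are not shown.

```python
def bigram_features(pssm):
    """Flattened bi-gram feature vector from a PSSM.

    pssm: list of L rows (one per sequence position), each holding one
    score or probability per residue type (20 for a protein PSSM).
    Assumed definition: B[m][n] = sum_i pssm[i][m] * pssm[i+1][n].
    Returns the B matrix flattened row by row (400 values for 20 types).
    """
    n_types = len(pssm[0])
    bigram = [[0.0] * n_types for _ in range(n_types)]
    # Walk over consecutive position pairs (i, i+1).
    for row, next_row in zip(pssm, pssm[1:]):
        for m in range(n_types):
            for n in range(n_types):
                bigram[m][n] += row[m] * next_row[n]
    return [value for bigram_row in bigram for value in bigram_row]


# Toy example with 2 residue types and 3 positions.
toy = bigram_features([[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]])
```

    Under this assumed definition the feature length depends only on the number of residue types, not on sequence length, so proteins of different lengths map to vectors of the same size before dimension reduction.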

  7. Sequestration of PDC-109 protein improves freezability of crossbred bull spermatozoa.

    PubMed

    Srivastava, N; Srivastava, S K; Ghosh, S K; Singh, L P; Prasad, J K; Kumar, Amit; Perumal, P; Jerome, A; Thamizharasan, A

    2012-03-01

    A study was carried out to assess the effect of sequestration of PDC-109 protein, a major constituent of the heparin binding proteins (HBP) of seminal plasma, on freezability and in vitro fertilizing ability of crossbred bull spermatozoa after cryopreservation. The study consisted of isolation and characterization of PDC-109 protein to raise anti-sera against it in rabbits. The antibodies raised against PDC-109 protein were then quantitated and coated in tubes used for collection of ejaculates. Semen ejaculates thus collected were cryopreserved using EYTG extender. Physico-morphological characteristics, viz. motility, viability, acrosomal integrity and HOS response as an indicator of freezability of cryopreserved spermatozoa, were determined at the pre-freeze as well as the post-thaw stage. At the pre-freeze stage, a significant (p<0.05) improvement in viability (83.83 ± 2.18 vs 75.17 ± 2.42) and acrosome integrity (81.33 ± 2.38 vs 72.83 ± 2.39) was observed in the antibody-treated group compared with control. Similarly, the increase in HOS responsive spermatozoa over control was highly significant (p<0.01) (78.83 ± 1.69 vs 67.5 ± 1.75). At the post-thaw stage, a significant (p<0.05) improvement in viability (69.50 ± 2.16 vs 60.33 ± 2.19) and HOS responsive spermatozoa (68.67 ± 1.62 vs 58.50 ± 1.32) and a highly significant (p<0.01) increase in individual motility (56.17 ± 1.83 vs 47.00 ± 1.86) and acrosome integrity (75.17 ± 2.38 vs 61.83 ± 2.1) were observed in the antibody-treated group compared with control. The results from the study revealed that sequestration of PDC-109 protein from semen samples leads to significant improvement in pre-freeze and post-thaw values of the above parameters in cryopreserved spermatozoa. It is thus concluded that sequestration of PDC-109 protein from ejaculates improves freezability of crossbred bull spermatozoa. Copyright © 2012 Elsevier B.V. All rights reserved.

  8. Indigenous species barcode database improves the identification of zooplankton

    PubMed Central

    Yang, Jianghua; Zhang, Wanwan; Sun, Jingying; Xie, Yuwei; Zhang, Yimin; Burton, G. Allen; Yu, Hongxia

    2017-01-01

    Incompleteness and inaccuracy of DNA barcode databases are considered an important hindrance to the use of metabarcoding in biodiversity analysis of zooplankton at the species level. Species barcoding by Sanger sequencing is inefficient for organisms with small body sizes, such as zooplankton. Here mitochondrial cytochrome c oxidase I (COI) fragment barcodes from 910 freshwater zooplankton specimens (87 morphospecies) were recovered by a high-throughput sequencing platform, Ion Torrent PGM. Intraspecific divergence of most zooplankton was < 5%, except Branchionus leydign (Rotifer, 14.3%), Trichocerca elongate (Rotifer, 11.5%), Lecane bulla (Rotifer, 15.9%), Synchaeta oblonga (Rotifer, 5.95%) and Schmackeria forbesi (Copepod, 6.5%). Metabarcoding data of 28 environmental samples from Lake Tai were annotated by both an indigenous database and the NCBI Genbank database. The indigenous database improved the taxonomic assignment of metabarcoding of zooplankton. Most zooplankton (81%) with barcode sequences in the indigenous database were identified by metabarcoding monitoring. Furthermore, the frequency and distribution of zooplankton were also consistent between metabarcoding and morphology identification. Overall, the indigenous database improved the taxonomic assignment of zooplankton. PMID:28977035

  9. Identification of marker proteins for the adulteration of meat products with soybean proteins by multidimensional liquid chromatography-tandem mass spectrometry.

    PubMed

    Leitner, Alexander; Castro-Rubio, Florentina; Marina, Maria Luisa; Lindner, Wolfgang

    2006-09-01

    Soybean proteins are frequently added to processed meat products for economic reasons and to improve their functional properties. Monitoring of the addition of soybean protein to meat products is of high interest due to the existence of regulations forbidding or limiting the amount of soybean proteins that can be added during the processing of meat products. We have used chromatographic prefractionation on the protein level by perfusion liquid chromatography to isolate peaks of interest from extracts of soybean protein isolate (SPI) and of meat products containing SPI. After enzymatic digestion using trypsin, the collected fractions were analyzed by nanoflow liquid chromatography-tandem mass spectrometry. Several variants and subunits of the major seed proteins, glycinin and beta-conglycinin, were identified in SPI, along with two other proteins. In soybean-protein-containing meat samples, different glycinin A subunits could be identified from the peak discriminating between samples with and without soybean proteins added. Among those, glycinin G4 subunit A4 was consistently found in all samples. Consequently, this protein (subunit) can be used as a target for new analytical techniques in the course of identifying the addition of soybean protein to meat products.

  10. Identification of Surface Protein Biomarkers of Listeria monocytogenes via Bioinformatics and Antibody-Based Protein Detection Tools

    PubMed Central

    Zhang, Cathy X. Y.; Brooks, Brian W.; Huang, Hongsheng; Pagotto, Franco

    2016-01-01

    ABSTRACT The Gram-positive bacterium Listeria monocytogenes causes a significant percentage of the fatalities among foodborne illnesses in humans. Surface proteins specifically expressed in a wide range of L. monocytogenes serotypes under selective enrichment culture conditions could serve as potential biomarkers for detection and isolation of this pathogen via antibody-based methods. Our study aimed to identify such biomarkers. Interrogation of the L. monocytogenes serotype 4b strain F2365 genome identified 130 putative or known surface proteins. The homologues of four surface proteins, LMOf2365_0578, LMOf2365_0581, LMOf2365_0639, and LMOf2365_2117, were assessed as biomarkers due to the presence of conserved regions among strains of L. monocytogenes which are variable among other Listeria species. Rabbit polyclonal antibodies against the four recombinant proteins revealed the expression of only LMOf2365_0639 on the surface of serotype 4b strain LI0521 cells despite PCR detection of mRNA transcripts for all four proteins in the organism. Three of 35 monoclonal antibodies (MAbs) to LMOf2365_0639, MAbs M3643, M3644, and M3651, specifically recognized 42 (91.3%) of 46 L. monocytogenes lineage I and II isolates grown in nonselective brain heart infusion medium. While M3644 and M3651 reacted with 14 to 15 (82.4 to 88.2%) of 17 L. monocytogenes lineage I and II isolates, M3643 reacted with 22 (91.7%) of 24 lineage I, II, and III isolates grown in selective enrichment media (UVM1, modified Fraser, Palcam, and UVM2 media). The three MAbs exhibited only weak reactivities (the optical densities at 414 nm were close to the cutoff value) to some other Listeria species grown in selective enrichment media. Collectively, the data indicate the potential of LMOf2365_0639 as a surface biomarker of L. monocytogenes, with the aid of specific MAbs, for pathogen detection, identification, and isolation in clinical, environmental, and food samples. IMPORTANCE L. monocytogenes is

  11. Selecting fillers on emotional appearance improves lineup identification accuracy.

    PubMed

    Flowe, Heather D; Klatt, Thimna; Colloff, Melissa F

    2014-12-01

    Mock witnesses sometimes report using criminal stereotypes to identify a face from a lineup, a tendency known as criminal face bias. Faces are perceived as criminal-looking if they appear angry. We tested whether matching the emotional appearance of the fillers to an angry suspect can reduce criminal face bias. In Study 1, mock witnesses (n = 226) viewed lineups in which the suspect had an angry, happy, or neutral expression, and we varied whether the fillers matched the expression. An additional group of participants (n = 59) rated the faces on criminal and emotional appearance. As predicted, mock witnesses tended to identify suspects who appeared angrier and more criminal-looking than the fillers. This tendency was reduced when the lineup fillers matched the emotional appearance of the suspect. Study 2 extended the results, testing whether the emotional appearance of the suspect and fillers affects recognition memory. Participants (n = 1,983) studied faces and took a lineup test in which the emotional appearance of the target and fillers was varied between subjects. Discrimination accuracy was enhanced when the fillers matched an angry target's emotional appearance. We conclude that lineup member emotional appearance plays a critical role in the psychology of lineup identification. The fillers should match an angry suspect's emotional appearance to improve lineup identification accuracy. PsycINFO Database Record (c) 2014 APA, all rights reserved.

  12. Identification of Open Stomata1-Interacting Proteins Reveals Interactions with Sucrose Non-fermenting1-Related Protein Kinases2 and with Type 2A Protein Phosphatases That Function in Abscisic Acid Responses

    PubMed Central

    Waadt, Rainer; Manalansan, Bianca; Rauniyar, Navin; Munemasa, Shintaro; Booker, Matthew A.; Brandt, Benjamin; Waadt, Christian; Nusinow, Dmitri A.; Kay, Steve A.; Kunz, Hans-Henning; Schumacher, Karin; DeLong, Alison; Yates, John R.; Schroeder, Julian I.

    2015-01-01

    The plant hormone abscisic acid (ABA) controls growth and development and regulates plant water status through an established signaling pathway. In the presence of ABA, pyrabactin resistance/regulatory component of ABA receptor proteins inhibit type 2C protein phosphatases (PP2Cs). This, in turn, enables the activation of Sucrose Nonfermenting1-Related Protein Kinases2 (SnRK2). Open Stomata1 (OST1)/SnRK2.6/SRK2E is a major SnRK2-type protein kinase responsible for mediating ABA responses. Arabidopsis (Arabidopsis thaliana) expressing an epitope-tagged OST1 in the recessive ost1-3 mutant background was used for the copurification and identification of OST1-interacting proteins after osmotic stress and ABA treatments. These analyses, which were confirmed using bimolecular fluorescence complementation and coimmunoprecipitation, unexpectedly revealed homo- and heteromerization of OST1 with SnRK2.2, SnRK2.3, OST1, and SnRK2.8. Furthermore, several OST1-complexed proteins were identified as type 2A protein phosphatase (PP2A) subunits and as proteins involved in lipid and galactolipid metabolism. More detailed analyses suggested an interaction network between ABA-activated SnRK2-type protein kinases and several PP2A-type protein phosphatase regulatory subunits. pp2a double mutants exhibited a reduced sensitivity to ABA during seed germination and stomatal closure and an enhanced ABA sensitivity in root growth regulation. These analyses add PP2A-type protein phosphatases as another class of protein phosphatases to the interaction network of SnRK2-type protein kinases. PMID:26175513

  13. Identification of a multi-protein reductive dehalogenase complex in Dehalococcoides mccartyi strain CBDB1 suggests a protein-dependent respiratory electron transport chain obviating quinone involvement.

    PubMed

    Kublik, Anja; Deobald, Darja; Hartwig, Stefanie; Schiffmann, Christian L; Andrades, Adarelys; von Bergen, Martin; Sawers, R Gary; Adrian, Lorenz

    2016-09-01

    Dehalococcoides mccartyi strain CBDB1 is an obligate organohalide-respiring bacterium using only hydrogen as electron donor and halogenated organics as electron acceptor. Here, we studied proteins involved in the respiratory chain under non-denaturing conditions. Using blue native gel electrophoresis (BN-PAGE), gel filtration and ultrafiltration an active dehalogenating protein complex with a molecular mass of 250-270 kDa was identified. The active subunit of reductive dehalogenase (RdhA) colocalised with a complex iron-sulfur molybdoenzyme (CISM) subunit (CbdbA195) and an iron-sulfur cluster containing subunit (CbdbA131) of the hydrogen uptake hydrogenase (Hup). No colocalisation between the catalytically active subunits of hydrogenase and reductive dehalogenase was found. By two-dimensional BN/SDS-PAGE the stability of the complex towards detergents was assessed, demonstrating stepwise disintegration with increasing detergent concentrations. Chemical cross-linking confirmed the presence of a higher molecular mass reductive dehalogenase protein complex composed of RdhA, CISM I and Hup hydrogenase and proved to be a potential tool for stabilising protein-protein interactions of the dehalogenating complex prior to membrane solubilisation. Taken together, the identification of the respiratory dehalogenase protein complex and the absence of indications for quinone participation in the respiration suggest a quinone-independent protein-based respiratory electron transfer chain in D. mccartyi. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.

  14. Improvement of foreign-protein production in Aspergillus niger var. awamori by constitutive induction of the unfolded-protein response.

    PubMed

    Valkonen, Mari; Ward, Michael; Wang, Huaming; Penttilä, Merja; Saloheimo, Markku

    2003-12-01

    Unfolded-protein response (UPR) denotes the upregulation of endoplasmic reticulum (ER)-resident chaperone and foldase genes and numerous other genes involved in secretory functions during the accumulation of unfolded proteins into the ER. Overexpression of individual foldases and chaperones has been used in attempts to improve protein production in different production systems. We describe here a novel strategy to improve foreign-protein production. We show that the constitutive induction of the UPR pathway in Aspergillus niger var. awamori can be achieved by expressing the activated form of the transcription factor hacA. This induction enhances the production of Trametes versicolor laccase by up to sevenfold and of bovine preprochymosin by up to 2.8-fold in this biotechnically important fungus. The regulatory range of UPR was studied by analyzing the mRNA levels of novel A. niger var. awamori genes involved in different secretory functions. This revealed both similarities and differences to corresponding studies in Saccharomyces cerevisiae.

  15. Ligand screening system using fusion proteins of G protein-coupled receptors with G protein alpha subunits.

    PubMed

    Suga, Hinako; Haga, Tatsuya

    2007-01-01

    G protein-coupled receptors (GPCRs) constitute one of the largest families of genes in the human genome, and are the largest targets for drug development. Although a large number of GPCR genes have recently been identified, ligands have not yet been identified for many of them. Various assay systems have been employed to identify ligands for orphan GPCRs, but there is still no simple and general method to screen for ligands of such GPCRs, particularly of G(i)-coupled receptors. We have examined whether fusion proteins of GPCRs with the G protein alpha subunit (Galpha) could be utilized for ligand screening and showed that the fusion proteins provide an effective method for the purpose. This article focuses on the following: (1) characterization of GPCR genes and GPCRs, (2) identification of ligands for orphan GPCRs, (3) characterization of GPCR-Galpha fusion proteins, and (4) identification of ligands for orphan GPCRs using GPCR-Galpha fusion proteins.

  16. Experience with a mouse intranasal test for the predictive identification of respiratory sensitization potential of proteins.

    PubMed

    Blaikie, L; Basketter, D A

    1999-08-01

    The predictive identification of respiratory allergenic potential is an important primary step in the safety evaluation of (novel) proteins, such as the enzymes used in a range of consumer laundry products. In the past this has been achieved by assessing the relative ability of proteins to give rise to the formation of anaphylactic antibody in the guinea pig. Recently, an alternative model has been proposed which assesses the formation of specific IgG1 antibody in a mouse intranasal test (MINT), the assumption being that specific IgG1 antibody is a surrogate for anaphylactic antibody in the mouse. This procedure has undergone successful initial intralaboratory and interlaboratory assessment. In the present work, the MINT has been evaluated in a more thorough intralaboratory study using eight enzymes plus ovalbumin. While the data generated with a reference enzyme protein, Alcalase, showed good reproducibility, results with the remaining eight proteins led to estimates of their relative antigenic or sensitization potential, several of which were at variance with those derived from the guinea pig/human experience. In consequence, it is concluded that the MINT requires substantial further investigation before it can be adopted as a model for the assessment of the relative ability of proteins to behave as respiratory allergens.

  17. Intradialytic protein supplementation reduces inflammation and improves physical function in maintenance hemodialysis patients.

    PubMed

    Tomayko, Emily J; Kistler, Brandon M; Fitschen, Peter J; Wilund, Kenneth R

    2015-05-01

    Protein malnutrition is both a cause and consequence of inflammation and related comorbidities for maintenance hemodialysis (MHD) patients. This study sought to determine if oral supplementation with soy or whey protein during dialysis treatment reduces inflammation and improves physical function and body composition in MHD patients. The study was a randomized controlled trial set in hemodialysis clinics in Champaign and Chicago, Illinois. Patients who received treatment ≥3 days/week, were aged ≥30 years, did not have congestive heart failure or chronic obstructive pulmonary disease, and had been receiving dialysis treatment for ≥3 months were eligible for inclusion. Patients were randomized to oral supplementation with a whey protein, soy protein, or placebo beverage. Patients (WHEY, n = 11; SOY, n = 12; CON, n = 15) consumed their assigned beverage before every dialysis session for 6 months. Body composition was measured by dual-energy x-ray absorptiometry, physical function by gait speed and shuttle walk test, and markers of inflammation (C-reactive protein and interleukin 6) using commercially available enzyme-linked immunosorbent assay kits before and after the 6-month intervention. Dietary intake was assessed by 24-hour dietary recalls. Six months of whey or soy supplementation significantly reduced predialysis interleukin 6 levels (P < .05 for both), whereas there was a trend for a reduction in C-reactive protein when both protein groups were combined (P = .062). Gait speed and shuttle walk test performance also significantly improved in the protein groups (P < .05 for both). No changes in body composition were observed. However, alkaline phosphatase, a marker of bone turnover, was significantly reduced in the protein groups. Intradialytic protein supplementation during a 6-month intervention reduced inflammation and improved physical function and represents an affordable intervention to improve the health

  18. Parametric system identification of catamaran for improving controller design

    NASA Astrophysics Data System (ADS)

    Timpitak, Surasak; Prempraneerach, Pradya; Pengwang, Eakkachai

    2018-01-01

    This paper presents the estimation of a simplified dynamic model for only the surge and yaw motions of a catamaran, using system identification (SI) techniques to determine the associated unknown parameters. These methods will enhance the design process for the motion control system of an Unmanned Surface Vehicle (USV). The simulation results demonstrate an effective way to solve for damping forces and to determine added masses by applying least-square and AutoRegressive Exogenous (ARX) methods. Both methods are then evaluated according to the estimated parametric errors from the vehicle’s dynamic model. The ARX method, which yields better estimation accuracy, can then be applied to identify unknown parameters as well as to help improve the controller design of a real unmanned catamaran.
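
    As a rough illustration of the ARX idea mentioned in this abstract, the sketch below fits a first-order ARX model by ordinary least squares. The model form y[k] = a*y[k-1] + b*u[k-1], the function names `fit_arx1` and `simulate_arx1`, and the sample data are all assumptions made for illustration; the abstract does not state the model order or the regressors the authors used.

```python
# Minimal sketch (assumed model, not the authors' formulation):
# first-order ARX  y[k] = a*y[k-1] + b*u[k-1], fitted by least squares.

def simulate_arx1(a, b, u, y0=0.0):
    """Generate the noise-free output of the assumed ARX(1,1) model."""
    y = [y0]
    for k in range(1, len(u)):
        y.append(a * y[-1] + b * u[k - 1])
    return y


def fit_arx1(u, y):
    """Least-squares estimate of (a, b) from input u and output y."""
    # Accumulate the 2x2 normal equations (Phi^T Phi) theta = Phi^T y,
    # where each regressor row is [y[k-1], u[k-1]].
    s_yy = s_yu = s_uu = r_y = r_u = 0.0
    for k in range(1, len(y)):
        y_prev, u_prev = y[k - 1], u[k - 1]
        s_yy += y_prev * y_prev
        s_yu += y_prev * u_prev
        s_uu += u_prev * u_prev
        r_y += y_prev * y[k]
        r_u += u_prev * y[k]
    det = s_yy * s_uu - s_yu * s_yu
    if abs(det) < 1e-12:
        raise ValueError("data do not excite both parameters")
    # Solve the 2x2 system by Cramer's rule.
    a = (r_y * s_uu - r_u * s_yu) / det
    b = (s_yy * r_u - s_yu * r_y) / det
    return a, b


if __name__ == "__main__":
    u = [1, 0, 1, 1, 0, 1, 0, 0, 1, 1]      # made-up input sequence
    y = simulate_arx1(0.8, 0.5, u)           # made-up "true" parameters
    print(fit_arx1(u, y))
```

    With noise-free data the fit recovers the parameters used in the simulation; with measured data the estimates would carry error, which is the kind of parametric error the abstract says was used to compare methods.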

  19. Robust enzyme design: bioinformatic tools for improved protein stability.

    PubMed

    Suplatov, Dmitry; Voevodin, Vladimir; Švedas, Vytas

    2015-03-01

    The ability of proteins and enzymes to maintain a functionally active conformation under adverse environmental conditions is an important feature of biocatalysts, vaccines, and biopharmaceutical proteins. From an evolutionary perspective, robust stability of proteins improves their biological fitness and allows for further optimization. Viewed from an industrial perspective, enzyme stability is crucial for the practical application of enzymes under the required reaction conditions. In this review, we analyze bioinformatic-driven strategies that are used to predict structural changes that can be applied to wild type proteins in order to produce more stable variants. The most commonly employed techniques can be classified into stochastic approaches, empirical or systematic rational design strategies, and design of chimeric proteins. We conclude that bioinformatic analysis can be efficiently used to study large protein superfamilies systematically as well as to predict particular structural changes which increase enzyme stability. Evolution has created a diversity of protein properties that are encoded in genomic sequences and structural data. Bioinformatics has the power to uncover this evolutionary code and provide a reproducible selection of hotspots - key residues to be mutated in order to produce more stable and functionally diverse proteins and enzymes. Further development of systematic bioinformatic procedures is needed to organize and analyze sequences and structures of proteins within large superfamilies and to link them to function, as well as to provide knowledge-based predictions for experimental evaluation. Copyright © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  20. Improving protein-protein interaction prediction using evolutionary information from low-quality MSAs.

    PubMed

    Várnai, Csilla; Burkoff, Nikolas S; Wild, David L

    2017-01-01

    Evolutionary information stored in multiple sequence alignments (MSAs) has been used to identify the interaction interface of protein complexes, by measuring either co-conservation or co-mutation of amino acid residues across the interface. Recently, maximum entropy related correlated mutation measures (CMMs) such as direct information, decoupling direct from indirect interactions, have been developed to identify residue pairs interacting across the protein complex interface. These studies have focussed on carefully selected protein complexes with large, good-quality MSAs. In this work, we study protein complexes with a more typical MSA consisting of fewer than 400 sequences, using a set of 79 intramolecular protein complexes. Using a maximum entropy based CMM at the residue level, we develop an interface level CMM score to be used in re-ranking docking decoys. We demonstrate that our interface level CMM score compares favourably to the complementarity trace score, an evolutionary information-based score measuring co-conservation, when combined with the number of interface residues, a knowledge-based potential and the variability score of individual amino acid sites. We also demonstrate that, since co-mutation and co-complementarity in the MSA contain orthogonal information, the best prediction performance using evolutionary information can be achieved by combining the co-mutation information of the CMM with co-conservation information of a complementarity trace score, predicting a near-native structure as the top prediction for 41% of the dataset. The method presented is not restricted to small MSAs, and will likely improve interface prediction also for complexes with large and good-quality MSAs.

  1. ProtPhylo: identification of protein-phenotype and protein-protein functional associations via phylogenetic profiling.

    PubMed

    Cheng, Yiming; Perocchi, Fabiana

    2015-07-01

    ProtPhylo is a web-based tool to identify proteins that are functionally linked to either a phenotype or a protein of interest based on co-evolution. ProtPhylo infers functional associations by comparing protein phylogenetic profiles (co-occurrence patterns of orthology relationships) for more than 9.7 million non-redundant protein sequences from all three domains of life. Users can query any of 2048 fully sequenced organisms, including 1678 bacteria, 255 eukaryotes and 115 archaea. In addition, they can tailor ProtPhylo to a particular kind of biological question by choosing among four main orthology inference methods based either on pair-wise sequence comparisons (One-way Best Hits and Best Reciprocal Hits) or clustering of orthologous proteins across multiple species (OrthoMCL and eggNOG). Next, ProtPhylo ranks phylogenetic neighbors of query proteins or phenotypic properties using the Hamming distance as a measure of similarity between pairs of phylogenetic profiles. Candidate hits can be easily and flexibly prioritized by complementary clues on subcellular localization, known protein-protein interactions, membrane spanning regions and protein domains. The resulting protein list can be quickly exported into a csv text file for further analyses. ProtPhylo is freely available at http://www.protphylo.org. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
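
    The ranking step described in this abstract can be pictured with a small sketch: each protein is represented by a presence/absence profile across organisms, and candidates are ordered by Hamming distance to the query profile. The function names, the toy profiles and the tie-breaking by name are assumptions for illustration only; this is not ProtPhylo's code or API.

```python
# Minimal sketch (hypothetical names and data, not ProtPhylo's implementation):
# rank proteins by Hamming distance between phylogenetic profiles.

def hamming(p, q):
    """Number of organisms in which two presence/absence profiles differ."""
    if len(p) != len(q):
        raise ValueError("profiles must cover the same organisms")
    return sum(a != b for a, b in zip(p, q))


def rank_neighbors(query, profiles):
    """Return protein names sorted from most to least similar to `query`."""
    # Ties are broken alphabetically so the ordering is deterministic.
    return sorted(profiles, key=lambda name: (hamming(query, profiles[name]), name))


if __name__ == "__main__":
    # 1 = ortholog present in that organism, 0 = absent (toy data).
    profiles = {
        "protA": (1, 0, 1, 1),
        "protB": (1, 1, 1, 1),
        "protC": (0, 1, 0, 0),
    }
    query = (1, 0, 1, 1)
    for name in rank_neighbors(query, profiles):
        print(name, hamming(query, profiles[name]))
```

    A smaller distance means the two proteins tend to be gained and lost together across organisms, which is the co-evolution signal the abstract refers to.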

  2. Improved bacterial identification directly from urine samples with matrix-assisted laser desorption/ionization time-of-flight mass spectrometry.

    PubMed

    Kitagawa, Koichi; Shigemura, Katsumi; Onuma, Ken-Ichiro; Nishida, Masako; Fujiwara, Mayu; Kobayashi, Saori; Yamasaki, Mika; Nakamura, Tatsuya; Yamamichi, Fukashi; Shirakawa, Toshiro; Tokimatsu, Issei; Fujisawa, Masato

    2018-03-01

    Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) contributes to rapid identification of pathogens in the clinic but has not yet performed especially well for Gram-positive cocci (GPC) causing complicated urinary tract infection (UTI). The goal of this study was to investigate the possible clinical use of MALDI-TOF MS as a rapid method for bacterial identification directly from urine in complicated UTI. MALDI-TOF MS was applied to urine samples gathered from 142 suspected complicated UTI patients in 2015-2017. We modified the standard procedure (Method 1) for sample preparation by adding an initial 10 minutes of ultrasonication followed by centrifugation at 500 g for 1 minute to remove debris such as epithelial cells and leukocytes from the urine (Method 2). For the 133 urine culture-positive bacteria, the rate of agreement with urine culture for GPC by MALDI-TOF MS with standard sample preparation (Method 1) was 16.7%, but the modified sample preparation (Method 2) significantly improved that rate to 52.2% (P=.045). Method 2 also improved the identification accuracy for Gram-negative rods (GNR) from 77.1% to 94.2% (P=.022). The modified Method 2 significantly improved the average MALDI score from 1.408±0.153 to 2.166±0.045 (P=.000) for GPC and slightly improved the score from 2.107±0.061 to 2.164±0.037 for GNR. The modified sample preparation for MALDI-TOF MS can improve identification accuracy for complicated UTI causative bacteria. This simple modification offers a rapid and accurate routine diagnosis for UTI, and may possibly be a substitute for urine cultures. © 2017 Wiley Periodicals, Inc.

  3. Improved packing of protein side chains with parallel ant colonies.

    PubMed

    Quan, Lijun; Lü, Qiang; Li, Haiou; Xia, Xiaoyan; Wu, Hongjie

    2014-01-01

    The accurate packing of protein side chains is important for many computational biology problems, such as ab initio protein structure prediction, homology modelling, and protein design and ligand docking applications. Many existing solutions are modelled as a computational optimisation problem. As well as the design of search algorithms, most solutions suffer from an inaccurate energy function for judging whether a prediction is good or bad. Even if the search has found the lowest energy, there is no certainty of obtaining the protein structures with correct side chains. We present a side-chain modelling method, pacoPacker, which uses a parallel ant colony optimisation strategy based on sharing a single pheromone matrix. This parallel approach combines different sources of energy functions and generates protein side-chain conformations with the lowest energies jointly determined by the various energy functions. We further optimised the selected rotamers to construct subrotamers by rotamer minimisation, which reasonably improved the discreteness of the rotamer library. We focused on improving the accuracy of side-chain conformation prediction. For a testing set of 442 proteins, 87.19% of χ1 and 77.11% of χ1+2 angles were predicted correctly within 40° of the X-ray positions. We compared the accuracy of pacoPacker with state-of-the-art methods, such as CIS-RR and SCWRL4. We analysed the results from different perspectives, in terms of protein chain and individual residues. In this comprehensive benchmark testing, 51.5% of proteins within a length of 400 amino acids predicted by pacoPacker were superior to the results of CIS-RR and SCWRL4 simultaneously. Finally, we also showed the advantage of using the subrotamers strategy. All results confirmed that our parallel approach is competitive with state-of-the-art solutions for packing side chains. This parallel approach combines various sources of searching intelligence and energy functions to pack protein side chains
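
    The accuracy figures quoted in this abstract are fractions of side-chain dihedral angles that fall within 40° of the crystal-structure values. A minimal sketch of such a scoring routine follows. It assumes the common convention that a residue counts toward the combined χ1+2 score only when both χ1 and χ2 are within tolerance, and that a deviation of exactly 40° counts as correct; neither detail is stated in the abstract, and the function names and sample angles are hypothetical.

```python
# Minimal sketch (assumed conventions, not the authors' evaluation code):
# fraction of chi1 and chi1+2 angles predicted within a tolerance.

def angle_diff(a, b):
    """Smallest absolute difference between two angles in degrees (0..180)."""
    d = abs(a - b) % 360.0
    return min(d, 360.0 - d)


def chi_accuracy(predicted, reference, tol=40.0):
    """Each item is (chi1, chi2), with chi2 = None if the residue has none.

    Returns (chi1 accuracy, chi1+2 accuracy) as fractions.
    """
    n1 = ok1 = n12 = ok12 = 0
    for (p1, p2), (r1, r2) in zip(predicted, reference):
        good1 = angle_diff(p1, r1) <= tol
        n1 += 1
        ok1 += good1
        if p2 is not None and r2 is not None:
            n12 += 1
            # chi1+2 is counted correct only if both angles are correct.
            ok12 += good1 and angle_diff(p2, r2) <= tol
    acc1 = ok1 / n1 if n1 else 0.0
    acc12 = ok12 / n12 if n12 else 0.0
    return acc1, acc12


if __name__ == "__main__":
    predicted = [(-60.0, 180.0), (175.0, None), (60.0, 60.0)]   # toy angles
    reference = [(-65.0, 170.0), (-178.0, None), (-60.0, 65.0)]
    print(chi_accuracy(predicted, reference))
```

    The wrap-around in `angle_diff` matters because 175° and -178° are only 7° apart on the circle even though their numerical difference is 353°.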

  4. Improved packing of protein side chains with parallel ant colonies

    PubMed Central

    2014-01-01

    Introduction: The accurate packing of protein side chains is important for many computational biology problems, such as ab initio protein structure prediction, homology modelling, and protein design and ligand docking applications. Many existing solutions are modelled as a computational optimisation problem. As well as the design of search algorithms, most solutions suffer from an inaccurate energy function for judging whether a prediction is good or bad. Even if the search has found the lowest energy, there is no certainty of obtaining the protein structures with correct side chains. Methods: We present a side-chain modelling method, pacoPacker, which uses a parallel ant colony optimisation strategy based on sharing a single pheromone matrix. This parallel approach combines different sources of energy functions and generates protein side-chain conformations with the lowest energies jointly determined by the various energy functions. We further optimised the selected rotamers to construct subrotamers by rotamer minimisation, which reasonably improved the discreteness of the rotamer library. Results: We focused on improving the accuracy of side-chain conformation prediction. For a testing set of 442 proteins, 87.19% of χ1 and 77.11% of χ1+2 angles were predicted correctly within 40° of the X-ray positions. We compared the accuracy of pacoPacker with state-of-the-art methods, such as CIS-RR and SCWRL4. We analysed the results from different perspectives, in terms of protein chain and individual residues. In this comprehensive benchmark testing, 51.5% of proteins within a length of 400 amino acids predicted by pacoPacker were superior to the results of CIS-RR and SCWRL4 simultaneously. Finally, we also showed the advantage of using the subrotamers strategy. All results confirmed that our parallel approach is competitive with state-of-the-art solutions for packing side chains. Conclusions: This parallel approach combines various sources of searching intelligence and energy

  5. Performance of VITEK mass spectrometry V3.0 for rapid identification of clinical Aspergillus fumigatus in different culture conditions based on ribosomal proteins

    PubMed Central

    Zhou, Longrong; Chen, Yongquan; Xu, Yuanhong

    2017-01-01

    Fast and accurate discrimination of Aspergillus fumigatus is significant, since misidentification may lead to inappropriate clinical therapy. This study assessed VITEK mass spectrometry (MS) V3.0 for A. fumigatus identification using extracted fungal ribosomal proteins. A total of 52 isolates preliminarily identified as A. fumigatus by traditional morphological methods were inoculated in three different culture media and cultured at two different temperatures. The specific spectral fingerprints of different culture time points (48, 72, 96, and 120 h) were obtained. Of all strains, 88.5% (46/52) were discriminated as A. fumigatus, while the remaining 11.5% (6/52) produced results inconsistent with morphological analysis. Molecular sequencing, as a reference method for species identification, was used to validate the morphological analysis and matrix-assisted laser desorption/ionization time of flight MS. Chi-square tests (χ2 test, P=0.05) demonstrated that the culture medium and incubation temperature had no effects on identification accuracy; however, identification accuracy of the strains in the 48-h group was lower than that in other groups. In addition, we found that ribosomal proteins extracted from A. fumigatus can be stored in different environments for at least 1 week, with their profiles remaining stable and strain identification results showing no change. This is beneficial for medical institutions with no mass spectrometer at hand. Overall, this study showed the powerful ability of VITEK MS V 3.0 in identifying A. fumigatus. PMID:29263685

  6. Performance of VITEK mass spectrometry V3.0 for rapid identification of clinical Aspergillus fumigatus in different culture conditions based on ribosomal proteins.

    PubMed

    Zhou, Longrong; Chen, Yongquan; Xu, Yuanhong

    2017-01-01

    Fast and accurate discrimination of Aspergillus fumigatus is significant, since misidentification may lead to inappropriate clinical therapy. This study assessed VITEK mass spectrometry (MS) V3.0 for A. fumigatus identification using extracted fungal ribosomal proteins. A total of 52 isolates preliminarily identified as A. fumigatus by traditional morphological methods were inoculated in three different culture media and cultured at two different temperatures. The specific spectral fingerprints of different culture time points (48, 72, 96, and 120 h) were obtained. Of all strains, 88.5% (46/52) were discriminated as A. fumigatus, while the remaining 11.5% (6/52) produced results inconsistent with morphological analysis. Molecular sequencing, as a reference method for species identification, was used to validate the morphological analysis and matrix-assisted laser desorption/ionization time of flight MS. Chi-square tests (χ2 test, P=0.05) demonstrated that the culture medium and incubation temperature had no effects on identification accuracy; however, identification accuracy of the strains in the 48-h group was lower than that in other groups. In addition, we found that ribosomal proteins extracted from A. fumigatus can be stored in different environments for at least 1 week, with their profiles remaining stable and strain identification results showing no change. This is beneficial for medical institutions with no mass spectrometer at hand. Overall, this study showed the powerful ability of VITEK MS V 3.0 in identifying A. fumigatus.

  7. A Proteomic Approach for the Identification of Up-Regulated Proteins Involved in the Metabolic Process of the Leiomyoma.

    PubMed

    Ura, Blendi; Scrimin, Federica; Arrigoni, Giorgio; Franchin, Cinzia; Monasta, Lorenzo; Ricci, Giuseppe

    2016-04-09

    Uterine leiomyoma is the most common benign smooth muscle cell tumor of the uterus. Proteomics is a powerful tool for the analysis of complex mixtures of proteins. In our study, we focused on proteins that were upregulated in the leiomyoma compared to the myometrium. Paired samples of eight leiomyomas and adjacent myometrium were obtained and submitted to two-dimensional gel electrophoresis (2-DE) and mass spectrometry for protein identification and to Western blotting for 2-DE data validation. The comparison between the patterns revealed 24 significantly upregulated (p < 0.05) protein spots, 12 of which were found to be associated with the metabolic processes of the leiomyoma and not with the normal myometrium. The overexpression of seven proteins involved in the metabolic processes of the leiomyoma was further validated by Western blotting and 2D Western blotting. Four of these proteins have never been associated with the leiomyoma before. The 2-DE approach coupled with mass spectrometry, which is among the methods of choice for comparative proteomic studies, identified a number of proteins overexpressed in the leiomyoma and involved in several biological processes, including metabolic processes. A better understanding of the mechanism underlying the overexpression of these proteins may be important for therapeutic purposes.

  8. A Proteomic Approach for the Identification of Up-Regulated Proteins Involved in the Metabolic Process of the Leiomyoma

    PubMed Central

    Ura, Blendi; Scrimin, Federica; Arrigoni, Giorgio; Franchin, Cinzia; Monasta, Lorenzo; Ricci, Giuseppe

    2016-01-01

    Uterine leiomyoma is the most common benign smooth muscle cell tumor of the uterus. Proteomics is a powerful tool for the analysis of complex mixtures of proteins. In our study, we focused on proteins that were upregulated in the leiomyoma compared to the myometrium. Paired samples of eight leiomyomas and adjacent myometrium were obtained and submitted to two-dimensional gel electrophoresis (2-DE) and mass spectrometry for protein identification and to Western blotting for 2-DE data validation. The comparison between the patterns revealed 24 significantly upregulated (p < 0.05) protein spots, 12 of which were found to be associated with the metabolic processes of the leiomyoma and not with the normal myometrium. The overexpression of seven proteins involved in the metabolic processes of the leiomyoma was further validated by Western blotting and 2D Western blotting. Four of these proteins have never been associated with the leiomyoma before. The 2-DE approach coupled with mass spectrometry, which is among the methods of choice for comparative proteomic studies, identified a number of proteins overexpressed in the leiomyoma and involved in several biological processes, including metabolic processes. A better understanding of the mechanism underlying the overexpression of these proteins may be important for therapeutic purposes. PMID:27070597

  9. Directed evolution methods for improving polypeptide folding and solubility and superfolder fluorescent proteins generated thereby

    DOEpatents

    Waldo, Geoffrey S.

    2007-09-18

    The current invention provides methods of improving folding of polypeptides using a poorly folding domain as a component of a fusion protein comprising the poorly folding domain and a polypeptide of interest to be improved. The invention also provides novel green fluorescent proteins (GFPs) and red fluorescent proteins that have enhanced folding properties.

  10. Identification of human microRNA targets from isolated argonaute protein complexes.

    PubMed

    Beitzinger, Michaela; Peters, Lasse; Zhu, Jia Yun; Kremmer, Elisabeth; Meister, Gunter

    2007-06-01

    MicroRNAs (miRNAs) constitute a class of small non-coding RNAs that regulate gene expression on the level of translation and/or mRNA stability. Mammalian miRNAs associate with members of the Argonaute (Ago) protein family and bind to partially complementary sequences in the 3' untranslated region (UTR) of specific target mRNAs. Computer algorithms based on factors such as free binding energy or sequence conservation have been used to predict miRNA target mRNAs. Based on such predictions, up to one third of all mammalian mRNAs seem to be under miRNA regulation. However, due to the low degree of complementarity between the miRNA and its target, such computer programs are often imprecise and therefore not very reliable. Here we report the first biochemical identification approach of miRNA targets from human cells. Using highly specific monoclonal antibodies against members of the Ago protein family, we co-immunoprecipitate Ago-bound mRNAs and identify them by cloning. Interestingly, most of the identified targets are also predicted by different computer programs. Moreover, we randomly analyzed six different target candidates and were able to experimentally validate five as miRNA targets. Our data clearly indicate that miRNA targets can be experimentally identified from Ago complexes and therefore provide a new tool to directly analyze miRNA function.

  11. Identification of a key structural element for protein folding within beta-hairpin turns.

    PubMed

    Kim, Jaewon; Brych, Stephen R; Lee, Jihun; Logan, Timothy M; Blaber, Michael

    2003-05-09

    Specific residues in a polypeptide may be key contributors to the stability and foldability of the unique native structure. Identification and prediction of such residues is, therefore, an important area of investigation in solving the protein folding problem. Atypical main-chain conformations can help identify strains within a folded protein, and by inference, positions where unique amino acids may have a naturally high frequency of occurrence due to favorable contributions to stability and folding. Non-Gly residues located near the left-handed alpha-helical region (L-alpha) of the Ramachandran plot are a potential indicator of structural strain. Although many investigators have studied mutations at such positions, no consistent energetic or kinetic contributions to stability or folding have been elucidated. Here we report a study of the effects of Gly, Ala and Asn substitutions found within the L-alpha region at a characteristic position in defined beta-hairpin turns within human acidic fibroblast growth factor, and demonstrate consistent effects upon stability and folding kinetics. The thermodynamic and kinetic data are compared to available data for similar mutations in other proteins, with excellent agreement. The results have identified that Gly at the i+3 position within a subset of beta-hairpin turns is a key contributor towards increasing the rate of folding to the native state of the polypeptide while leaving the rate of unfolding largely unchanged.

  12. Modal Identification of Tsing Ma Bridge by Using Improved Eigensystem Realization Algorithm

    NASA Astrophysics Data System (ADS)

    QIN, Q.; LI, H. B.; QIAN, L. Z.; LAU, C.-K.

    2001-10-01

    This paper presents the results of research work on modal identification of Tsing Ma bridge ambient testing data by using an improved eigensystem realization algorithm. The testing was carried out before the bridge was open to traffic and after the completion of surfacing. Without traffic load, ambient excitations were much less intensive, and the bridge responses to such ambient excitation were also less intensive. Consequently, the bridge responses were significantly influenced by the random movement of heavy construction vehicles on the deck. To cut off noise in the testing data and make the ambient signals more stationary, the Chebyshev digital filter was used instead of the digital filter with a Hanning window. Random decrement (RD) functions were built to convert the ambient responses to free vibrations. An improved eigensystem realization algorithm was employed to improve the accuracy and the efficiency of modal identification. It uses cross-correlation functions of RD functions to form the Hankel matrix instead of RD functions themselves and uses eigenvalue decomposition instead of singular value decomposition. The data for response accelerations were acquired group by group because of the limited number of high-quality accelerometers and channels of data loggers available. The modes were identified group by group and then assembled by using response accelerations acquired at reference points to form modes of the complete bridge. Seventy-nine modes of the Tsing Ma bridge were identified, including five complex modes formed in accordance with unevenly distributed damping in the bridge. The identified modes in time domain were then compared with those identified in frequency domain and finite element analytical results.
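
    Two building blocks named in this abstract, the random decrement (RD) function and the Hankel matrix, can be sketched in a few lines. The sketch assumes a single measurement channel and a level up-crossing trigger, and it builds the Hankel matrix directly from the RD function rather than from cross-correlations of RD functions as the paper describes. The function names, the trigger rule and the test signal are illustrative assumptions, not the authors' procedure.

```python
# Minimal sketch (single channel, assumed trigger rule; not the paper's
# improved algorithm): random decrement averaging and a Hankel matrix.

def random_decrement(x, trigger, length):
    """Average all `length`-sample segments that start where x up-crosses `trigger`.

    Returns (rd_function, number_of_segments_averaged).
    """
    starts = [k for k in range(1, len(x) - length + 1)
              if x[k - 1] < trigger <= x[k]]
    if not starts:
        raise ValueError("no trigger points found")
    rd = [0.0] * length
    for s in starts:
        for j in range(length):
            rd[j] += x[s + j]
    return [v / len(starts) for v in rd], len(starts)


def hankel(seq, rows, cols):
    """Hankel matrix H[i][j] = seq[i + j] as a list of rows."""
    if rows + cols - 1 > len(seq):
        raise ValueError("sequence too short for requested shape")
    return [[seq[i + j] for j in range(cols)] for i in range(rows)]


if __name__ == "__main__":
    signal = [0, 1, 0, -1] * 5            # toy periodic "response"
    rd, n = random_decrement(signal, trigger=0.5, length=4)
    print(rd, n)
    print(hankel(rd, rows=2, cols=3))
```

    Averaging many triggered segments cancels the random part of the response and leaves a free-decay-like signature, which is why the abstract describes RD functions as converting ambient responses to free vibrations.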

  13. Biomarker Candidates of Chlamydophila pneumoniae Proteins and Protein Fragments Identified by Affinity-Proteomics Using FTICR-MS and LC-MS/MS

    NASA Astrophysics Data System (ADS)

    Susnea, Iuliana; Bunk, Sebastian; Wendel, Albrecht; Hermann, Corinna; Przybylski, Michael

    2011-04-01

    We report here an affinity-proteomics approach that combines 2D-gel electrophoresis and immunoblotting with high performance mass spectrometry for the identification of both full length protein antigens and antigenic fragments of Chlamydophila pneumoniae (C. pneumoniae). The present affinity-mass spectrometry approach effectively utilized high resolution FTICR mass spectrometry and LC-tandem-MS for protein identification, and enabled the identification of several new highly antigenic C. pneumoniae proteins that were not hitherto reported or previously detected only in other Chlamydia species, such as Chlamydia trachomatis. Moreover, high resolution affinity-MS provided the identification of several neo-antigenic protein fragments containing N- and C-terminal, and central domains such as fragments of the membrane protein Pmp21 and the secreted chlamydial proteasome-like factor (Cpaf), representing specific biomarker candidates.

  14. Strategies for analysing and improving the expression and quality of recombinant proteins made in mammalian cells.

    PubMed

    Jenkins, Nigel; Meleady, Paula; Tyther, Raymond; Murphy, Lisa

    2009-05-06

    The production of monoclonal antibodies and other recombinant proteins is one of the highest growth areas in the pharmaceutical industry. Mammalian cells are used to manufacture the majority of biotherapeutics, largely due to their ability to perform complex post-translational modifications. Although significant progress has been made recently in improving product yields and protein quality, many challenges still lie ahead to achieve consistently high yields while avoiding potentially damaging protein modifications. The present review first considers the strategies used to analyse and improve recombinant protein expression of industrial cell lines, with an emphasis on proteomic technologies. Next, cellular and environmental influences on protein production and quality are examined, and strategies for improvements in product yield and quality are reviewed. The analytical techniques required to detect these protein changes are also described, together with prospects for assay improvements.

  15. Resistance training reduces whole-body protein turnover and improves net protein retention in untrained young males.

    PubMed

    Hartman, Joseph W; Moore, Daniel R; Phillips, Stuart M

    2006-10-01

    It is thought that resistance exercise results in an increased need for dietary protein; however, data also exist to support the opposite conclusion. The purpose of this study was to determine the impact of resistance exercise training on protein metabolism in novices with the hypothesis that resistance training would reduce protein turnover and improve whole-body protein retention. Healthy males (n = 8, 22 +/- 1 y, BMI = 25.3 +/- 1.8 kg.m(-2)) participated in a progressive whole-body split routine resistance-training program 5d/week for 12 weeks. Before (PRE) and after (POST) the training, oral [15N]-glycine ingestion was used to assess nitrogen flux (Q), protein synthesis (PS), protein breakdown (PB), and net protein balance (NPB = PS-PB). Macronutrient intake was controlled over a 5d period PRE and POST, while estimates of protein turnover and urinary nitrogen balance (N(bal) = N(in) - urine N(out)) were conducted. Bench press and leg press increased 40% and 50%, respectively (p < 0.01). Fat- and bone-free mass (i.e., lean muscle mass) increased from PRE to POST (2.5 +/- 0.8 kg, p < 0.05). Significant PRE to POST decreases (p < 0.05) occurred in Q (0.9 +/- 0.1 vs. 0.6 +/- 0.1 g N.kg(-1).d(-1)), PS (4.6 +/- 0.7 vs. 2.9 +/- 0.3 g.kg(-1).d(-1)), and PB (4.3 +/- 0.7 vs. 2.4 +/- 0.2 g.kg(-1).d(-1)). Significant training-induced increases in both NPB (PRE = 0.22 +/- 0.13 g.kg(-1).d(-1); POST = 0.54 +/- 0.08 g.kg(-1).d(-1)) and urinary nitrogen balance (PRE = 2.8 +/- 1.7 g N.d(-1); POST = 6.5 +/- 0.9 g N.d(-1)) were observed. A program of resistance training that induced significant muscle hypertrophy resulted in reductions of both whole-body PS and PB, but an improved NPB, which favoured the accretion of skeletal muscle protein. Urinary nitrogen balance increased after training. The reduction in PS and PB and a higher NPB in combination with an increased nitrogen balance after training suggest that dietary requirements for protein in novice resistance-trained athletes

  16. Biosynthetically directed fractional 13C labeling facilitates identification of Phe and Tyr aromatic signals in proteins.

    PubMed

    Jacob, Jaison; Louis, John M; Nesheiwat, Issa; Torchia, Dennis A

    2002-11-01

    Analysis of 2D [(13)C,(1)H]-HSQC spectra of biosynthetic fractionally (13)C labeled proteins is a reliable, straightforward means to obtain stereospecific assignments of Val and Leu methyl sites in proteins. Herein we show that the same fractionally labeled protein sample facilitates observation and identification of Phe and Tyr aromatic signals. This is the case, in part, because the fractional (13)C labeling yields aromatic rings in which some of the (13)C-(13)C J-couplings, present in uniformly labeled samples, are absent. Also, the number of homonuclear J-coupling partners differs for the delta-, epsilon- and zeta-carbons. This enabled us to vary their signal intensities in distinctly different ways by appropriately setting the (13)C constant-time period in 2D [(13)C,(1)H]-HSQC spectra. We illustrate the application of this approach to an 18 kDa protein, c-VIAF, a modulator of apoptosis. In addition, we show that cancellation of the aromatic (13)C CSA and (13)C-(1)H dipolar interactions can be fruitfully utilized in the case of the fractionally labeled sample to obtain high resolution (13)C constant-time spectra with good sensitivity.

  17. ContaMiner and ContaBase: a webserver and database for early identification of unwantedly crystallized protein contaminants

    PubMed Central

    Hungler, Arnaud; Momin, Afaque; Diederichs, Kay; Arold, Stefan, T.

    2016-01-01

    Solving the phase problem in protein X-ray crystallography relies heavily on the identity of the crystallized protein, especially when molecular replacement (MR) methods are used. Yet, it is not uncommon that a contaminant crystallizes instead of the protein of interest. Such contaminants may be proteins from the expression host organism, protein fusion tags or proteins added during the purification steps. Many contaminants co-purify easily, crystallize and give good diffraction data. Identification of contaminant crystals may take time, since the presence of the contaminant is unexpected and its identity unknown. A webserver (ContaMiner) and a contaminant database (ContaBase) have been established, to allow fast MR-based screening of crystallographic data against currently 62 known contaminants. The web-based ContaMiner (available at http://strube.cbrc.kaust.edu.sa/contaminer/) currently produces results in 5 min to 4 h. The program is also available in a github repository and can be installed locally. ContaMiner enables screening of novel crystals at synchrotron beamlines, and it would be valuable as a routine safety check for ‘crystallization and preliminary X-ray analysis’ publications. Thus, in addition to potentially saving X-ray crystallographers much time and effort, ContaMiner might considerably lower the risk of publishing erroneous data. PMID:27980519

  18. [Patient identification errors and biological samples in the analytical process: Is it possible to improve patient safety?].

    PubMed

    Cuadrado-Cenzual, M A; García Briñón, M; de Gracia Hills, Y; González Estecha, M; Collado Yurrita, L; de Pedro Moro, J A; Fernández Pérez, C; Arroyo Fernández, M

    2015-01-01

    Errors in the identification of patients and biological samples are among the problems carrying the highest risk of causing an adverse event in the patient. The aim was to detect and analyse the causes of patient identification errors in analytical requests (PIEAR) from emergency departments, and to develop improvement strategies. A process and protocol were designed, to be followed by all professionals involved in requesting and performing laboratory tests. Evaluation and monitoring indicators of PIEAR were determined before and after the implementation of these improvement measures (years 2010-2014). A total of 316 PIEAR were detected in a total of 483,254 emergency service requests during the study period, representing a mean of 6.80/10,000 requests. Patient identification failure was the most frequent in all the 6-monthly periods assessed, with a significant difference (P<.0001). The improvement strategies applied proved effective in detecting PIEAR, as well as in preventing such errors. However, we must continue working with this strategy, promoting a culture of safety for all the professionals involved, and trying to achieve the goal that 100% of the analytical requests and samples are properly identified. Copyright © 2015 SECA. Published by Elsevier Espana. All rights reserved.

  19. Coupling Protein Side-Chain and Backbone Flexibility Improves the Re-design of Protein-Ligand Specificity.

    PubMed

    Ollikainen, Noah; de Jong, René M; Kortemme, Tanja

    2015-01-01

    Interactions between small molecules and proteins play critical roles in regulating and facilitating diverse biological functions, yet our ability to accurately re-engineer the specificity of these interactions using computational approaches has been limited. One main difficulty, in addition to inaccuracies in energy functions, is the exquisite sensitivity of protein-ligand interactions to subtle conformational changes, coupled with the computational problem of sampling the large conformational search space of degrees of freedom of ligands, amino acid side chains, and the protein backbone. Here, we describe two benchmarks for evaluating the accuracy of computational approaches for re-engineering protein-ligand interactions: (i) prediction of enzyme specificity altering mutations and (ii) prediction of sequence tolerance in ligand binding sites. After finding that current state-of-the-art "fixed backbone" design methods perform poorly on these tests, we develop a new "coupled moves" design method in the program Rosetta that couples changes to protein sequence with alterations in both protein side-chain and protein backbone conformations, and allows for changes in ligand rigid-body and torsion degrees of freedom. We show significantly increased accuracy in both predicting ligand specificity altering mutations and binding site sequences. These methodological improvements should be useful for many applications of protein-ligand design. The approach also provides insights into the role of subtle conformational adjustments that enable functional changes not only in engineering applications but also in natural protein evolution.

  20. Predicting cancer-relevant proteins using an improved molecular similarity ensemble approach.

    PubMed

    Zhou, Bin; Sun, Qi; Kong, De-Xin

    2016-05-31

    In this study, we proposed an improved algorithm for identifying proteins relevant to cancer. The algorithm was named two-layer molecular similarity ensemble approach (TL-SEA). We applied TL-SEA to analyzing the correlation between anticancer compounds (against cell lines K562, MCF7 and A549) and active compounds against separate target proteins listed in BindingDB. Several associations between cancer types and related proteins were revealed using this chemoinformatics approach. An analysis of the literature showed that 26 of 35 predicted proteins were correlated with cancer cell proliferation, apoptosis or differentiation. Additionally, interactions between proteins in BindingDB and anticancer chemicals were also predicted. We discuss the roles of the most important predicted proteins in cancer biology and conclude that TL-SEA could be a useful tool for inferring novel proteins involved in cancer and revealing underlying molecular mechanisms.

  1. Probiotic Bacillus coagulans GBI-30, 6086 Improves Protein Absorption and Utilization.

    PubMed

    Jäger, Ralf; Purpura, Martin; Farmer, Sean; Cash, Howard A; Keller, David

    2017-12-01

    Probiotics offer numerous health benefits, including digestive and immune health. Improved digestive health is linked to a more efficient absorption of important nutrients from our diet. This review focused on the rationale of using the probiotic Bacillus coagulans GBI-30, 6086 to aid protein absorption and utilization. B. coagulans GBI-30, 6086 can withstand the acidic environment of the stomach to reach the intestine where it germinates. Once active in the small intestine after germination, it has been shown to aid the digestion of carbohydrates and proteins. Co-administration of B. coagulans GBI-30, 6086 with protein has been shown to increase protein absorption and to maximize the health benefits associated with protein supplementation.

  2. Identification and application of self-binding zipper-like sequences in SARS-CoV spike protein.

    PubMed

    Zhang, Si Min; Liao, Ying; Neo, Tuan Ling; Lu, Yanning; Liu, Ding Xiang; Vahlne, Anders; Tam, James P

    2018-05-22

    Self-binding peptides containing zipper-like sequences, such as the Leu/Ile zipper sequence within the coiled coil regions of proteins and the cross-β spine steric zippers within the amyloid-like fibrils, could bind to the protein-of-origin through homophilic sequence-specific zipper motifs. These self-binding sequences represent opportunities for the development of biochemical tools and/or therapeutics. Here, we report on the identification of a putative self-binding β-zipper-forming peptide within the severe acute respiratory syndrome-associated coronavirus spike (S) protein and its application in viral detection. Peptide array scanning of overlapping peptides covering the entire length of S protein identified 34 putative self-binding peptides of six clusters, five of which contained octapeptide core consensus sequences. The Cluster I consensus octapeptide sequence GINITNFR was predicted by Eisenberg's 3D profile method to have high amyloid-like fibrillation potential through steric β-zipper formation. Peptide C6 containing the Cluster I consensus sequence was shown to oligomerize and form amyloid-like fibrils. Taking advantage of this, C6 was further applied to detect the S protein expression in vitro by fluorescence staining. Meanwhile, the coiled-coil-forming Leu/Ile heptad repeat sequences within the S protein were under-represented during peptide array scanning, consistent with the requirement for long peptide lengths to attain high helix-mediated interaction avidity. The data suggest that short β-zipper-like self-binding peptides within the S protein could be identified through combining the peptide scanning and predictive methods, and could be exploited as biochemical detection reagents for viral infection. Copyright © 2018. Published by Elsevier Ltd.

  3. Hub Protein Controversy: Taking a Closer Look at Plant Stress Response Hubs

    PubMed Central

    Vandereyken, Katy; Van Leene, Jelle; De Coninck, Barbara; Cammue, Bruno P. A.

    2018-01-01

    Plant stress responses involve numerous changes at the molecular and cellular level and are regulated by highly complex signaling pathways. Studying protein-protein interactions (PPIs) and the resulting networks is therefore becoming increasingly important in understanding these responses. Crucial in PPI networks are the so-called hubs or hub proteins, commonly defined as the most highly connected central proteins in scale-free PPI networks. However, despite their importance, a growing amount of confusion and controversy seems to exist regarding hub protein identification, characterization and classification. In order to highlight these inconsistencies and stimulate further clarification, this review critically analyses the current knowledge on hub proteins in the plant interactome field. We focus on current hub protein definitions, including the properties generally seen as hub-defining, and the challenges and approaches associated with hub protein identification. Furthermore, we give an overview of the most important large-scale plant PPI studies of the last decade that identified hub proteins, pointing out the lack of overlap between different studies. As such, it appears that although major advances are being made in the plant interactome field, defining hub proteins is still heavily dependent on the quality, origin and interpretation of the acquired PPI data. Nevertheless, many hub proteins seem to have a reported role in the plant stress response, including transcription factors, protein kinases and phosphatases, ubiquitin proteasome system related proteins, (co-)chaperones and redox signaling proteins. A significant number of identified plant stress hubs are however still functionally uncharacterized, making them interesting targets for future research. This review clearly shows the ongoing improvements in the plant interactome field but also calls attention to the need for a more comprehensive and precise identification of hub proteins, allowing a more

  4. Mitochondrial and nuclear localization of a novel pea thioredoxin: identification of its mitochondrial target proteins.

    PubMed

    Martí, María C; Olmos, Enrique; Calvete, Juan J; Díaz, Isabel; Barranco-Medina, Sergio; Whelan, James; Lázaro, Juan J; Sevilla, Francisca; Jiménez, Ana

    2009-06-01

    Plants contain several genes encoding thioredoxins (Trxs), small proteins involved in the regulation of the activity of many enzymes through dithiol-disulfide exchange. In addition to chloroplastic and cytoplasmic Trx systems, plant mitochondria contain a reduced nicotinamide adenine dinucleotide phosphate-dependent Trx reductase and a specific Trx o, and to date, there have been no reports of a gene encoding a plant nuclear Trx. We report here the presence in pea (Pisum sativum) mitochondria and nuclei of a Trx isoform (PsTrxo1) that seems to belong to the Trx o group, although it differs from this Trx type by its absence of introns in the genomic sequence. Western-blot analysis with isolated mitochondria and nuclei, immunogold labeling, and green fluorescent protein fusion constructs all indicated that PsTrxo1 is present in both cell compartments. Moreover, the identification by tandem mass spectrometry of the native mitochondrial Trx after gel filtration using the fast-protein liquid chromatography system of highly purified mitochondria and the in vitro uptake assay into isolated mitochondria also corroborated a mitochondrial location for this protein. The recombinant PsTrxo1 protein has been shown to be reduced more effectively by the Saccharomyces cerevisiae mitochondrial Trx reductase Trr2 than by the wheat (Triticum aestivum) cytoplasmic reduced nicotinamide adenine dinucleotide phosphate-dependent Trx reductase. PsTrxo1 was able to activate alternative oxidase, and it was shown to interact with a number of mitochondrial proteins, including peroxiredoxin and enzymes mainly involved in the photorespiratory process.

  5. Peptide reranking with protein-peptide correspondence and precursor peak intensity information.

    PubMed

    Yang, Chao; He, Zengyou; Yang, Can; Yu, Weichuan

    2012-01-01

    Searching tandem mass spectra against a protein database has been a mainstream method for peptide identification. Improving peptide identification results by ranking true Peptide-Spectrum Matches (PSMs) over their false counterparts has led to the development of various reranking algorithms. In peptide reranking, discriminative information is essential to distinguish true PSMs from false PSMs. Generally, most peptide reranking methods obtain discriminative information directly from database search scores or by training machine learning models. Information in the protein database and MS1 spectra (i.e., single stage MS spectra) is ignored. In this paper, we propose to use information in the protein database and MS1 spectra to rerank peptide identification results. To quantitatively analyze their effects on peptide reranking results, three peptide reranking methods are proposed: PPMRanker, PPIRanker, and MIRanker. PPMRanker only uses Protein-Peptide Map (PPM) information from the protein database, PPIRanker only uses Precursor Peak Intensity (PPI) information, and MIRanker employs both PPM information and PPI information. According to our experiments on a standard protein mixture data set, a human data set and a mouse data set, PPMRanker and MIRanker achieve better peptide reranking results than PeptideProphet, PeptideProphet+NSP (number of sibling peptides) and a score regularization method SRPI. The source codes of PPMRanker, PPIRanker, and MIRanker, and all supplementary documents are available at our website: http://bioinformatics.ust.hk/pepreranking/. Alternatively, these documents can also be downloaded from: http://sourceforge.net/projects/pepreranking/.

  6. Identification and Characterization of the UL37 Protein of Herpes Simplex Virus Type 1 and Demonstration that it Interacts with ICP8, the Major DNA Binding Protein of Herpes Simplex Virus

    DTIC Science & Technology

    1992-10-20

    Identification of ORFs; HSV DNA binding proteins; reps and its role in HSV replication; Biochemical properties...Figures: Structural model of the herpesvirus virion; Schematic diagram of HSV pathogenesis; Diagram of the main...vaccinia virus-; Autoradiogram of an immunoblot of HSV-1-infected cell proteins harvested at various times postinfection probed with anti-UL42

  7. Identification of ovarian cancer-associated proteins in symptomatic women: A novel method for semi-quantitative plasma proteomics.

    PubMed

    Shield-Artin, Kristy L; Bailey, Mark J; Oliva, Karen; Liovic, Ana K; Barker, Gillian; Dellios, Nicole L; Reisman, Simone; Ayhan, Mustafa; Rice, Gregory E

    2012-04-01

    To evaluate the utility of an enhanced biomarker discovery approach in order to identify potential biomarkers relevant to ovarian cancer detection. We combined immuno-depletion, liquid-phase IEF, 1D-DIGE, MALDI-TOF/MS and LC-MS/MS to identify differentially expressed proteins in the plasma of symptomatic ovarian cancer patients, stratified by stage, compared to samples obtained from normal subjects. We demonstrate that this approach is a practical alternative to traditional 2D gel techniques and that it has some advantages, most notably increased protein capacity. Proteins were identified in all 76 bands excised from the gels in this project and confirmed the cancer-associated expression of several well-established biomarkers of ovarian cancer. These included C-reactive protein (CRP), haptoglobin, alpha-2 macroglobulin and A1A2. We also identified new ovarian cancer candidate biomarkers, Protein S100-A9 (S100A9) and multimerin-2. The cancer-associated differential expression of CRP and S100A9 was further confirmed by Western blot and ELISA. The methods developed in this study allow for the increased loading of plasma proteins into the analytical stream when compared to traditional 2D-DIGE. This increased protein identification sensitivity allowed us to identify new putative ovarian cancer biomarkers. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  8. Protein identification and quantification from riverbank grape, Vitis riparia: Comparing SDS-PAGE and FASP-GPF techniques for shotgun proteomic analysis.

    PubMed

    George, Iniga S; Fennell, Anne Y; Haynes, Paul A

    2015-09-01

    Protein sample preparation optimisation is critical for establishing reproducible high throughput proteomic analysis. In this study, two different fractionation sample preparation techniques (in-gel digestion and in-solution digestion) for shotgun proteomics were used to quantitatively compare proteins identified in Vitis riparia leaf samples. The total number of proteins and peptides identified were compared between filter aided sample preparation (FASP) coupled with gas phase fractionation (GPF) and SDS-PAGE methods. There was a 24% increase in the total number of reproducibly identified proteins when FASP-GPF was used. FASP-GPF is more reproducible, less expensive and a better method than SDS-PAGE for shotgun proteomics of grapevine samples as it significantly increases protein identification across biological replicates. Total peptide and protein information from the two fractionation techniques is available in PRIDE with the identifier PXD001399 (http://proteomecentral.proteomexchange.org/dataset/PXD001399). © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  9. An improved method to detect correct protein folds using partial clustering.

    PubMed

    Zhou, Jianjun; Wishart, David S

    2013-01-16

    Structure-based clustering is commonly used to identify correct protein folds among candidate folds (also called decoys) generated by protein structure prediction programs. However, traditional clustering methods exhibit a poor runtime performance on large decoy sets. We hypothesized that a more efficient "partial" clustering approach in combination with an improved scoring scheme could significantly improve both the speed and performance of existing candidate selection methods. We propose a new scheme that performs rapid but incomplete clustering on protein decoys. Our method detects structurally similar decoys (measured using either C(α) RMSD or GDT-TS score) and extracts representatives from them without assigning every decoy to a cluster. We integrated our new clustering strategy with several different scoring functions to assess both the performance and speed in identifying correct or near-correct folds. Experimental results on 35 Rosetta decoy sets and 40 I-TASSER decoy sets show that our method can improve the correct fold detection rate as assessed by two different quality criteria. This improvement is significantly better than two recently published clustering methods, Durandal and Calibur-lite. Speed and efficiency testing shows that our method can handle much larger decoy sets and is up to 22 times faster than Durandal and Calibur-lite. The new method, named HS-Forest, avoids the computationally expensive task of clustering every decoy, yet still allows superior correct-fold selection. Its improved speed, efficiency and decoy-selection performance should enable structure prediction researchers to work with larger decoy sets and significantly improve their ab initio structure prediction performance.
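    The Cα RMSD used above as a decoy similarity measure can be written out in a few lines. This is a minimal sketch that assumes the two decoys have the same number of Cα atoms and have already been superimposed; the function name and the toy coordinates are illustrative assumptions, and the sketch does not reproduce the HS-Forest clustering itself.

```python
import math


def ca_rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equal-length lists of
    (x, y, z) C-alpha coordinates, assumed to be superimposed already."""
    if not coords_a or len(coords_a) != len(coords_b):
        raise ValueError("coordinate lists must be non-empty and equal in length")
    # Sum of squared distances between matched atoms.
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


# Usage: the second toy decoy is the first one shifted by (3, 4, 0),
# so every matched atom pair is 5 length units apart.
decoy_1 = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
decoy_2 = [(x + 3.0, y + 4.0, z) for (x, y, z) in decoy_1]
distance = ca_rmsd(decoy_1, decoy_2)
```

    A clustering scheme would typically treat two decoys as structurally similar when this value falls below a chosen cutoff; the cutoff is a free parameter that the abstract does not state.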

  10. An improved method to detect correct protein folds using partial clustering

    PubMed Central

    2013-01-01

    Background: Structure-based clustering is commonly used to identify correct protein folds among candidate folds (also called decoys) generated by protein structure prediction programs. However, traditional clustering methods exhibit a poor runtime performance on large decoy sets. We hypothesized that a more efficient “partial” clustering approach in combination with an improved scoring scheme could significantly improve both the speed and performance of existing candidate selection methods. Results: We propose a new scheme that performs rapid but incomplete clustering on protein decoys. Our method detects structurally similar decoys (measured using either Cα RMSD or GDT-TS score) and extracts representatives from them without assigning every decoy to a cluster. We integrated our new clustering strategy with several different scoring functions to assess both the performance and speed in identifying correct or near-correct folds. Experimental results on 35 Rosetta decoy sets and 40 I-TASSER decoy sets show that our method can improve the correct fold detection rate as assessed by two different quality criteria. This improvement is significantly better than two recently published clustering methods, Durandal and Calibur-lite. Speed and efficiency testing shows that our method can handle much larger decoy sets and is up to 22 times faster than Durandal and Calibur-lite. Conclusions: The new method, named HS-Forest, avoids the computationally expensive task of clustering every decoy, yet still allows superior correct-fold selection. Its improved speed, efficiency and decoy-selection performance should enable structure prediction researchers to work with larger decoy sets and significantly improve their ab initio structure prediction performance. PMID:23323835

  11. DIGE Analysis Software and Protein Identification Approaches.

    PubMed

    Hmmier, Abduladim; Dowling, Paul

    2018-01-01

    DIGE is a high-resolution two-dimensional gel electrophoresis method, with excellent dynamic range obtained by fluorescent tag labeling of protein samples. Scanned images of DIGE gels show thousands of protein spots, each spot representing a single or a group of protein isoforms. By using commercially available software, each protein spot is defined by an outline, which is digitized and correlated with the quantity of proteins present in each spot. Software packages include DeCyder, SameSpots, and Dymension 3. In addition, proteins of interest can be excised from post-stained gels and identified with conventional mass spectrometry techniques. High-throughput mass spectrometry is performed using sophisticated instrumentation including matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF), MALDI-TOF/TOF, and liquid chromatography tandem mass spectrometry (LC-MS/MS). Tandem MS (MALDI-TOF/TOF or LC-MS/MS) analyzes fragmented peptides, resulting in amino acid sequence information, which is especially useful when protein spots are of low abundance or where a mixture of proteins is present.

  12. Characterization of Plasma Membrane Proteins from Ovarian Cancer Cells Using Mass Spectrometry

    DOE PAGES

    Springer, David L.; Auberry, Deanna L.; Ahram, Mamoun; ...

    2004-01-01

    To determine how the repertoire of plasma membrane proteins change with disease state, specifically related to cancer, several methods for preparation of plasma membrane proteins were evaluated. Cultured cells derived from stage IV ovarian tumors were grown to 90% confluence and harvested in buffer containing CHAPS detergent. This preparation was centrifuged at low speed to remove insoluble cellular debris resulting in a crude homogenate. Glycosylated proteins in the crude homogenate were selectively enriched using lectin affinity chromatography. The crude homogenate and the lectin purified sample were prepared for mass spectrometric evaluation. The general procedure for protein identification began with trypsin digestion of protein fractions followed by separation by reversed phase liquid chromatography that was coupled directly to a conventional tandem mass spectrometer (i.e. LCQ ion trap). Mass and fragmentation data for the peptides were searched against a human proteome data base using the informatics program SEQUEST. Using this procedure 398 proteins were identified with high confidence, including receptors, membrane-associated ligands, proteases, phosphatases, as well as structural and adhesion proteins. Results indicate that lectin chromatography provides a select subset of proteins and that the number and quality of the identifications improve as does the confidence of the protein identifications for this subset. These results represent the first step in development of methods to separate and successfully identify plasma membrane proteins from advanced ovarian cancer cells. Further characterization of plasma membrane proteins will contribute to our understanding of the mechanisms underlying progression of this deadly disease and may lead to new targeted interventions as well as new biomarkers for diagnosis.

  13. Identification of a follistatin-related protein from the tick Haemaphysalis longicornis and its effect on tick oviposition.

    PubMed

    Zhou, Jinlin; Liao, Min; Hatta, Takeshi; Tanaka, Miho; Xuan, Xuenan; Fujisaki, Kozo

    2006-05-10

    The identification of ovary-associated molecules will lead to a better understanding of the physiology of tick reproduction and vector-pathogen interactions. A gene encoding a follistatin-related protein (FRP) was obtained by random sequencing from the ovary cDNA library of the tick Haemaphysalis longicornis. The full-length cDNA is 1157 bp, including an intact ORF encoding an expected protein with 289 amino acids. Three distinct domains were present in the deduced amino acids, namely, the follistatin-like domain, KAZAL, and two calcium-binding motifs, EFh. The sequence shows homology with the follistatin-related protein (FRP), which was thought to play some roles in the negative regulation of cellular growth. RT-PCR showed that the gene was expressed throughout the developing stages and mainly in the ovary as well as in fat body, hemocytes, salivary glands, and midgut. This gene was expressed in GST-fused recombinant protein with an expected size. The mouse antiserum against the recombinant protein recognized a 56-kDa native protein in both tick ovary and hemolymph. The recombinant proteins were found to have binding activity for both activin A and bone morphogenetic protein-2 (BMP-2). Silencing of FRP by RNAi showed a decrease in tick oviposition, which is consistent with the effect of a recombinant protein vaccine on the adult tick. These results showed that the tick FRP might be involved in tick oviposition. This is the first report of a member of follistatin family proteins in Chelicerata, which include ticks, spiders, and scorpions.

  14. Identification and characterisation of seed storage protein transcripts from Lupinus angustifolius.

    PubMed

    Foley, Rhonda C; Gao, Ling-Ling; Spriggs, Andrew; Soo, Lena Y C; Goggin, Danica E; Smith, Penelope M C; Atkins, Craig A; Singh, Karam B

    2011-04-04

    In legumes, seed storage proteins are important for the developing seedling and are an important source of protein for humans and animals. Lupinus angustifolius (L.), also known as narrow-leaf lupin (NLL) is a grain legume crop that is gaining recognition as a potential human health food as the grain is high in protein and dietary fibre, gluten-free and low in fat and starch. Genes encoding the seed storage proteins of NLL were characterised by sequencing cDNA clones derived from developing seeds. Four families of seed storage proteins were identified and comprised three unique α, seven β, two γ and four δ conglutins. This study added eleven new expressed storage protein genes for the species. A comparison of the deduced amino acid sequences of NLL conglutins with those available for the storage proteins of Lupinus albus (L.), Pisum sativum (L.), Medicago truncatula (L.), Arachis hypogaea (L.) and Glycine max (L.) permitted the analysis of phylogenetic relationships between proteins and demonstrated, in general, that the strongest conservation occurred within species. In the case of 7S globulin (β conglutins) and 2S sulphur-rich albumin (δ conglutins), the analysis suggests that gene duplication occurred after legume speciation. This contrasted with 11S globulin (α conglutin) and basic 7S (γ conglutin) sequences where some of these sequences appear to have diverged prior to speciation. The most abundant NLL conglutin family was β (56%), followed by α (24%), δ (15%) and γ (6%) and the transcript levels of these genes increased 10^3 to 10^6 fold during seed development. We used the 16 NLL conglutin sequences identified here to determine that for individuals specifically allergic to lupin, all seven members of the β conglutin family were potential allergens. This study has characterised 16 seed storage protein genes in NLL including 11 newly-identified members. It has helped lay the foundation for efforts to use molecular breeding approaches to improve

  15. Identification of urine protein biomarkers with the potential for early detection of lung cancer.

    PubMed

    Zhang, Hongjuan; Cao, Jing; Li, Lin; Liu, Yanbin; Zhao, Hong; Li, Nan; Li, Bo; Zhang, Aiqun; Huang, Huanwei; Chen, She; Dong, Mengqiu; Yu, Lei; Zhang, Jian; Chen, Liang

    2015-07-02

    Lung cancer is the leading cause of cancer-related deaths and has an overall 5-year survival rate lower than 15%. Large-scale clinical trials have demonstrated a significant relative reduction in mortality in high-risk individuals with low-dose computed tomography screening. However, biomarkers capable of identifying the most at-risk population and detecting lung cancer before it becomes clinically apparent are urgently needed in the clinic. Here, we report the identification of urine biomarkers capable of detecting lung cancer. Using the well-characterized inducible Kras (G12D) mouse model of lung cancer, we identified alterations in the urine proteome in tumor-bearing mice compared with sibling controls. Marked differences at the proteomic level were also detected between the urine of patients and that of healthy population controls. Importantly, we identified 7 proteins commonly found to be significantly up-regulated in both tumor-bearing mice and patients. In an independent cohort, we showed that 2 of the 7 proteins were up-regulated in urine samples from lung cancer patients but not in those from controls. The kinetics of these proteins correlated with the disease state in the mouse model. These tumor biomarkers could potentially aid in the early detection of lung cancer.

  16. Identification of immunogenic polypeptides from a Mycoplasma hyopneumoniae genome library by phage display.

    PubMed

    Kügler, Jonas; Nieswandt, Simone; Gerlach, Gerald F; Meens, Jochen; Schirrmann, Thomas; Hust, Michael

    2008-09-01

    The identification of immunogenic polypeptides of pathogens is helpful for the development of diagnostic assays and therapeutic applications like vaccines. Routinely, these proteins are identified by two-dimensional polyacrylamide gel electrophoresis and Western blot using convalescent serum, followed by mass spectrometry. This technology, however, is limited, because low or differentially expressed proteins, e.g. dependent on pathogen-host interaction, cannot be identified. In this work, we developed and improved an M13 genomic phage display-based method for the selection of immunogenic polypeptides of Mycoplasma hyopneumoniae, a pathogen causing porcine enzootic pneumonia. The fragmented genome of M. hyopneumoniae was cloned into a phage display vector, and the genomic library was packaged using the helper phage Hyperphage to enrich open reading frames (ORFs). Afterwards, the phage display library was screened by panning using convalescent serum. The analysis of individual phage clones resulted in the identification of five genes encoding immunogenic proteins, only two of which had been previously identified and described as immunogenic. This M13 genomic phage display, directly combining ORF enrichment and the presentation of the corresponding polypeptide on the phage surface, complements proteome-based methods for the identification of immunogenic polypeptides and is particularly well suited for use in mycoplasma species.

  17. Identification of lipid- and protein-based binders in paintings by direct on-plate wet chemistry and matrix-assisted laser desorption ionization mass spectrometry.

    PubMed

    Calvano, Cosima Damiana; van der Werf, Inez Dorothé; Palmisano, Francesco; Sabbatini, Luigia

    2015-01-01

    Direct on-target plate processing of small (ca. 100 μg) fragments of paint samples for MALDI-MS identification of lipid- and protein-based binders is described. Fragments were fixed on a conventional stainless steel target plate by colloidal graphite followed by in situ fast tryptic digestion and matrix addition. The new protocol was first developed on paint replicas composed of chicken egg, collagen, and cow milk mixed with inorganic pigments and then successfully applied on historical paint samples taken from a fifteenth century Italian panel painting. The present work contributes a step forward in the simplification of binder identification in very small paint samples since no conventional solvent extraction is required, speeding up the whole sample preparation to 10 min and reducing lipid/protein loss.

  18. Improved protein surface comparison and application to low-resolution protein structure data.

    PubMed

    Sael, Lee; Kihara, Daisuke

    2010-12-14

    Recent advancements of experimental techniques for determining protein tertiary structures raise significant challenges for protein bioinformatics. With the number of known structures of unknown function expanding at a rapid pace, an urgent task is to provide reliable clues to their biological function on a large scale. Conventional approaches for structure comparison are not suitable for a real-time database search due to their slow speed. Moreover, a new challenge has arisen from recent techniques such as electron microscopy (EM), which provide low-resolution structure data. Previously, we have introduced a method for protein surface shape representation using the 3D Zernike descriptors (3DZDs). The 3DZD enables fast structure database searches, taking advantage of its rotation invariance and compact representation. The search results of protein surfaces represented with the 3DZD have shown good agreement with the existing structure classifications, but some discrepancies were also observed. The three new surface representations of backbone atoms, originally devised all-atom-surface representation, and the combination of all-atom surface with the backbone representation are examined. All representations are encoded with the 3DZD. Also, we have investigated the applicability of the 3DZD for searching protein EM density maps of varying resolutions. The surface representations are evaluated on structure retrieval using two existing classifications, SCOP and the CE-based classification. Overall, the 3DZDs representing backbone atoms show better retrieval performance than the original all-atom surface representation. The performance further improved when the two representations are combined. Moreover, we observed that the 3DZD is also powerful in comparing low-resolution structures obtained by electron microscopy.

  19. Improved protein surface comparison and application to low-resolution protein structure data

    PubMed Central

    2010-01-01

    Background: Recent advancements of experimental techniques for determining protein tertiary structures raise significant challenges for protein bioinformatics. With the number of known structures of unknown function expanding at a rapid pace, an urgent task is to provide reliable clues to their biological function on a large scale. Conventional approaches for structure comparison are not suitable for a real-time database search due to their slow speed. Moreover, a new challenge has arisen from recent techniques such as electron microscopy (EM), which provide low-resolution structure data. Previously, we have introduced a method for protein surface shape representation using the 3D Zernike descriptors (3DZDs). The 3DZD enables fast structure database searches, taking advantage of its rotation invariance and compact representation. The search results of protein surfaces represented with the 3DZD have shown good agreement with the existing structure classifications, but some discrepancies were also observed. Results: The three new surface representations of backbone atoms, originally devised all-atom-surface representation, and the combination of all-atom surface with the backbone representation are examined. All representations are encoded with the 3DZD. Also, we have investigated the applicability of the 3DZD for searching protein EM density maps of varying resolutions. The surface representations are evaluated on structure retrieval using two existing classifications, SCOP and the CE-based classification. Conclusions: Overall, the 3DZDs representing backbone atoms show better retrieval performance than the original all-atom surface representation. The performance further improved when the two representations are combined. Moreover, we observed that the 3DZD is also powerful in comparing low-resolution structures obtained by electron microscopy. PMID:21172052

  20. An improved monomeric infrared fluorescent protein for neuronal and tumour brain imaging.

    PubMed

    Yu, Dan; Gustafson, William Clay; Han, Chun; Lafaye, Céline; Noirclerc-Savoye, Marjolaine; Ge, Woo-Ping; Thayer, Desiree A; Huang, Hai; Kornberg, Thomas B; Royant, Antoine; Jan, Lily Yeh; Jan, Yuh Nung; Weiss, William A; Shu, Xiaokun

    2014-05-15

    Infrared fluorescent proteins (IFPs) are ideal for in vivo imaging, and monomeric versions of these proteins can be advantageous as protein tags or for sensor development. In contrast to GFP, which requires only molecular oxygen for chromophore maturation, phytochrome-derived IFPs incorporate biliverdin (BV) as the chromophore. However, BV varies in concentration in different cells and organisms. Here we engineered cells to express the haeme oxygenase responsible for BV biosynthesis and a brighter monomeric IFP mutant (IFP2.0). Together, these tools improve the imaging capabilities of IFP2.0 compared with monomeric IFP1.4 and dimeric iRFP. By targeting IFP2.0 to the plasma membrane, we demonstrate robust labelling of neuronal processes in Drosophila larvae. We also show that this strategy improves the sensitivity when imaging brain tumours in whole mice. Our work shows promise in the application of IFPs for protein labelling and in vivo imaging.

  1. Sequence Identification, Recombinant Production, and Analysis of the Self-Assembly of Egg Stalk Silk Proteins from Lacewing Chrysoperla carnea.

    PubMed

    Neuenfeldt, Martin; Scheibel, Thomas

    2017-06-13

    Egg stalk silks of the common green lacewing Chrysoperla carnea likely comprise at least three different silk proteins. Based on the natural spinning process, it was hypothesized that these proteins self-assemble without shear stress, as adult lacewings do not use a spinneret. To examine this, the first sequence identification and determination of the gene expression profile of several silk proteins and various transcript variants thereof was conducted, and then the three major proteins were recombinantly produced in Escherichia coli encoded by their native complementary DNA (cDNA) sequences. Circular dichroism measurements indicated that the silk proteins in aqueous solutions had a mainly intrinsically disordered structure. The largest silk protein, which we named ChryC1, exhibited a lower critical solution temperature (LCST) behavior and self-assembled into fibers or film morphologies, depending on the conditions used. The second silk protein, ChryC2, self-assembled into nanofibrils and subsequently formed hydrogels. Circular dichroism and Fourier transform infrared spectroscopy confirmed conformational changes of both proteins into beta sheet rich structures upon assembly. ChryC3 did not self-assemble into any morphology under the tested conditions. Thereby, through this work, it could be shown that recombinant lacewing silk proteins can be produced and further used for studying the fiber formation of lacewing egg stalks.

  2. Leveraging 3D chemical similarity, target and phenotypic data in the identification of drug-protein and drug-adverse effect associations.

    PubMed

    Vilar, Santiago; Hripcsak, George

    2016-01-01

    Drug-target identification is crucial to discover novel applications for existing drugs and provide more insights about mechanisms of biological actions, such as adverse drug effects (ADEs). Computational methods along with the integration of current big data sources provide a useful framework for drug-target and drug-adverse effect discovery. In this article, we propose a method based on the integration of 3D chemical similarity, target and adverse effect data to generate a drug-target-adverse effect predictor along with a simple leveraging system to improve identification of drug-targets and drug-adverse effects. In the first step, we generated a system for multiple drug-target identification based on the application of 3D drug similarity to a large target dataset extracted from ChEMBL. Next, we developed a target-adverse effect predictor combining targets from ChEMBL with phenotypic information provided by the SIDER data source. Both modules were linked to generate a final predictor that establishes hypotheses about new drug-target-adverse effect candidates. Additionally, we showed that leveraging drug-target candidates with phenotypic data is very useful to improve the identification of drug-targets. The integration of phenotypic data into drug-target candidates yielded up to twofold precision improvement. In the opposite direction, leveraging drug-phenotype candidates with target data also yielded a significant enhancement in the performance. The modeling described in the current study is simple and efficient and has applications at large scale in drug repurposing and drug safety through the identification of mechanism of action of biological effects.

  3. A combinatorial perspective of the protein inference problem.

    PubMed

    Yang, Chao; He, Zengyou; Yu, Weichuan

    2013-01-01

    In a shotgun proteomics experiment, proteins are the most biologically meaningful output. The success of proteomics studies depends on the ability to accurately and efficiently identify proteins. Many methods have been proposed to facilitate the identification of proteins from peptide identification results. However, the relationship between protein identification and peptide identification has not been thoroughly explained before. In this paper, we devote ourselves to a combinatorial perspective of the protein inference problem. We employ combinatorial mathematics to calculate the conditional protein probabilities (protein probability means the probability that a protein is correctly identified) under three assumptions, which lead to a lower bound, an upper bound, and an empirical estimation of protein probabilities, respectively. The combinatorial perspective enables us to obtain an analytical expression for protein inference. Our method achieves comparable results with ProteinProphet in a more efficient manner in experiments on two data sets of standard protein mixtures and two data sets of real samples. Based on our model, we study the impact of unique peptides and degenerate peptides (degenerate peptides are peptides shared by at least two proteins) on protein probabilities. Meanwhile, we also study the relationship between our model and ProteinProphet. We name our program ProteinInfer. Its Java source code, our supplementary document and experimental results are available at: http://bioinformatics.ust.hk/proteininfer.
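
    As a rough illustration of how peptide-level evidence can be rolled up into a protein probability, the sketch below uses the common independence approximation P(protein) = 1 - prod(1 - w_i * p_i). This is not the combinatorial expression derived in the paper, nor the ProteinInfer or ProteinProphet implementation. The function name and the 1/n down-weighting of degenerate (shared) peptides are assumptions made only for this example.

```python
from math import prod


def protein_probability(peptide_probs, sharing_counts=None):
    """Naive protein probability from peptide probabilities.

    peptide_probs  -- probability that each matched peptide is correct
    sharing_counts -- number of proteins each peptide maps to (1 = unique);
                      a shared peptide is down-weighted by 1/n, which is an
                      illustrative assumption, not the paper's model
    """
    if sharing_counts is None:
        sharing_counts = [1] * len(peptide_probs)
    if len(sharing_counts) != len(peptide_probs):
        raise ValueError("one sharing count is needed per peptide")
    # The protein is treated as absent only if every peptide is a false match.
    return 1.0 - prod(1.0 - p / n for p, n in zip(peptide_probs, sharing_counts))


if __name__ == "__main__":
    # Two unique peptides versus one unique and one peptide shared by two proteins.
    print(protein_probability([0.9, 0.8]))
    print(protein_probability([0.9, 0.8], [1, 2]))
```

    Under this approximation a degenerate peptide contributes less evidence than a unique one, which is the qualitative effect the abstract says the authors study with their own model.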

  4. The Search Engine for Multi-Proteoform Complexes: An Online Tool for the Identification and Stoichiometry Determination of Protein Complexes.

    PubMed

    Skinner, Owen S; Schachner, Luis F; Kelleher, Neil L

    2016-12-08

    Recent advances in top-down mass spectrometry using native electrospray now enable the analysis of intact protein complexes with relatively small sample amounts in an untargeted mode. Here, we describe how to characterize both homo- and heteropolymeric complexes with high molecular specificity using input data produced by tandem mass spectrometry of whole protein assemblies. The tool described is a "search engine for multi-proteoform complexes," (SEMPC) and is available for free online. The output is a list of candidate multi-proteoform complexes and scoring metrics, which are used to define a distinct set of one or more unique protein subunits, their overall stoichiometry in the intact complex, and their pre- and post-translational modifications. Thus, we present an approach for the identification and characterization of intact protein complexes from native mass spectrometry data. © 2016 by John Wiley & Sons, Inc.

  5. Identification of Crosslinked Peptides after Click-based Enrichment Using Sequential CID and ETD Tandem Mass Spectrometry

    PubMed Central

    Chowdhury, Saiful M.; Du, Xiuxia; Tolić, Nikola; Wu, Si; Moore, Ronald J.; Mayer, M. Uljana; Smith, Richard D.; Adkins, Joshua N.

    2010-01-01

    Chemical crosslinking combined with mass spectrometry can be a powerful approach for the identification of protein-protein interactions and for providing constraints on protein structures. However, enrichment of crosslinked peptides is crucial to reduce sample complexity before mass spectrometric analysis. In addition, compact crosslinkers are often preferred because they provide short spacer lengths and surface accessibility to the protein complexes, and they must have reasonable solubility under conditions where the native complex structure is stable. In this study, we present a novel compact crosslinker that contains two distinct features: 1) an alkyne tag and 2) a small molecule detection tag (NO2-) to maintain reasonable solubility in water. The alkyne tag enables enrichment of the crosslinked peptide after proteolytic cleavage and coupling of an affinity tag using alkyne-azido click chemistry. Neutral loss of the small NO2- moiety provides a secondary means of detecting crosslinked peptides in MS/MS analyses, providing additional confidence in peptide identifications. We show the labeling efficiency of this crosslinker, which we termed CLIP (Click-enabled Linker for Interacting Proteins), using ubiquitin. The enrichment capability of CLIP is demonstrated for crosslinked ubiquitin in highly complex E. coli cell lysates. Sequential CID-MS/MS and ETD-MS/MS of inter-crosslinked peptides (two peptides connected with a crosslinker) are also demonstrated for improved automated identification of crosslinked peptides. PMID:19496583

  6. Verification of Ribosomal Proteins of Aspergillus fumigatus for Use as Biomarkers in MALDI-TOF MS Identification.

    PubMed

    Nakamura, Sayaka; Sato, Hiroaki; Tanaka, Reiko; Yaguchi, Takashi

    2016-01-01

    We have previously proposed a rapid identification method for bacterial strains based on the profiles of their ribosomal subunit proteins (RSPs), observed using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS). This method can perform phylogenetic characterization based on the mass of housekeeping RSP biomarkers, ideally calculated from amino acid sequence information registered in public protein databases. With the aim of extending its field of application to medical mycology, this study investigates the actual state of information of RSPs of eukaryotic fungi registered in public protein databases through the characterization of ribosomal protein fractions extracted from genome-sequenced Aspergillus fumigatus strains Af293 and A1163 as a model. In this process, we have found that the public protein databases harbor problems. The RSP names are inconsistent, so we have provisionally unified them using the yeast naming system. The most serious problem is that many incorrect sequences are registered in the public protein databases. Surprisingly, more than half of the sequences are incorrect, due chiefly to mis-annotation of exon/intron structures. These errors could be corrected by a combination of in silico inspection by sequence homology analysis and MALDI-TOF MS measurements. We were also able to confirm conserved post-translational modifications in eleven RSPs. After these verifications, the masses of 31 expressed RSPs under 20,000 Da could be accurately confirmed. These RSPs have the potential to be useful biomarkers for identifying clinical isolates of A. fumigatus.

  7. Identification of DNA-Binding Proteins Using Mixed Feature Representation Methods.

    PubMed

    Qu, Kaiyang; Han, Ke; Wu, Song; Wang, Guohua; Wei, Leyi

    2017-09-22

    DNA-binding proteins play vital roles in cellular processes, such as DNA packaging, replication, transcription, regulation, and other DNA-associated activities. The current main prediction method is based on machine learning, and its accuracy mainly depends on the feature extraction method. Therefore, using an efficient feature representation method is important to enhance the classification accuracy. However, existing feature representation methods cannot efficiently distinguish DNA-binding proteins from non-DNA-binding proteins. In this paper, a multi-feature representation method, which combines three feature representation methods, namely, K-Skip-N-Grams, Information theory, and Sequential and structural features (SSF), is used to represent the protein sequences and improve feature representation ability. In addition, the classifier is a support vector machine. The mixed-feature representation method is evaluated using 10-fold cross-validation and a test set. Feature vectors, which are obtained from a combination of three feature extractions, show the best performance in 10-fold cross-validation both under non-dimensional reduction and dimensional reduction by max-relevance-max-distance. Moreover, the reduced mixed feature method performs better than the non-reduced mixed feature technique. The feature vectors, which are a combination of SSF and K-Skip-N-Grams, show the best performance in the test set. Among these methods, mixed features exhibit superiority over the single features.
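
    To make the idea of skip-gram sequence features concrete, the sketch below counts residue pairs separated by up to k skipped positions (the n = 2 case). The abstract does not give the exact K-Skip-N-Grams definition used by the authors, so the gap convention and the function name here are assumptions for illustration only.

```python
from collections import Counter


def k_skip_bigrams(sequence, k):
    """Count residue pairs with 0..k residues skipped between them.

    Illustrative reading of a k-skip-2-gram; the cited paper may define
    its features differently.
    """
    counts = Counter()
    for gap in range(k + 1):      # gap = number of skipped residues
        step = gap + 1            # distance between the two residues
        for i in range(len(sequence) - step):
            counts[sequence[i] + sequence[i + step]] += 1
    return counts


if __name__ == "__main__":
    # Adjacent pairs plus pairs with one residue skipped.
    print(k_skip_bigrams("ACDA", 1))
```

    Counts like these could be normalized and concatenated with other descriptors before being passed to a classifier such as the support vector machine mentioned in the abstract.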

  8. Incorporating conditional random fields and active learning to improve sentiment identification.

    PubMed

    Zhang, Kunpeng; Xie, Yusheng; Yang, Yi; Sun, Aaron; Liu, Hengchang; Choudhary, Alok

    2014-10-01

    Many machine learning, statistical, and computational linguistic methods have been developed to identify sentiment of sentences in documents, yielding promising results. However, most state-of-the-art methods focus on individual sentences and ignore the impact of context on the meaning of a sentence. In this paper, we propose a method based on conditional random fields to incorporate sentence structure and context information in addition to syntactic information for improving sentiment identification. We also investigate how human interaction affects the accuracy of sentiment labeling using limited training data. We propose and evaluate two different active learning strategies for labeling sentiment data. Our experiments with the proposed approach demonstrate a 5%-15% improvement in accuracy on Amazon customer reviews compared to existing supervised learning and rule-based methods. Copyright © 2014 Elsevier Ltd. All rights reserved.

  9. Characterization of the human submandibular/sublingual saliva glycoproteome using lectin affinity chromatography coupled to Multidimensional Protein Identification Technology

    PubMed Central

    Gonzalez-Begne, Mireya; Lu, Bingwen; Liao, Lujian; Xu, Tao; Bedi, Gurrinder; Melvin, James E.; Yates, John R.

    2011-01-01

    In-depth analysis of the salivary proteome is fundamental to understanding the functions of salivary proteins in the oral cavity and to revealing disease biomarkers involved in different pathophysiological conditions, with the ultimate goal of improving patient diagnosis and prognosis. Submandibular and sublingual glands contribute saliva rich in glycoproteins to the total saliva output, making them valuable sources for glycoproteomic analysis. Lectin-affinity chromatography coupled to mass spectrometry-based shotgun proteomics was used to explore the submandibular/sublingual (SM/SL) saliva glycoproteome. A total of 262 N- and O-linked glycoproteins were identified by multidimensional protein identification technology (MudPIT). Only 38 were previously described in SM and SL salivas from the human salivary N-linked glycoproteome, while 224 were unique. Further comparison with SM/SL saliva of the human saliva proteome revealed 125 glycoproteins not formerly reported in this secretion. KEGG pathway analyses demonstrated that many of these glycoproteins are involved in processes such as complement and coagulation cascades, cell communication, glycosphingolipid biosynthesis neo-lactoseries, O-glycan biosynthesis, glycan structures-biosynthesis 2, starch and sucrose metabolism, peptidoglycan biosynthesis or other pathways. In summary, lectin-affinity chromatography coupled to MudPIT mass spectrometry identified many novel glycoproteins in SM/SL saliva. These new additions to the salivary proteome may prove to be a critical step for providing reliable biomarkers in the diagnosis of a myriad of oral and systemic diseases. PMID:21936497

  10. rTANDEM, an R/Bioconductor package for MS/MS protein identification.

    PubMed

    Fournier, Frédéric; Joly Beauparlant, Charles; Paradis, René; Droit, Arnaud

    2014-08-01

    rTANDEM is an R/Bioconductor package that interfaces the X!Tandem protein identification algorithm. The package can run the multi-threaded algorithm on proteomic data files directly from R. It also provides functions to convert search parameters and results to/from R as well as functions to manipulate parameters and automate searches. An associated R package, shinyTANDEM, provides a web-based graphical interface to visualize and interpret the results. Together, those two packages form an entry point for a general MS/MS-based proteomic pipeline in R/Bioconductor. rTANDEM and shinyTANDEM are distributed in R/Bioconductor, http://bioconductor.org/packages/release/bioc/. The packages are under open licenses (GPL-3 and Artistic-1.0). Contact: frederic.fournier@crchuq.ulaval.ca or arnaud.droit@crchuq.ulaval.ca. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  11. Improved method for predicting protein fold patterns with ensemble classifiers.

    PubMed

    Chen, W; Liu, X; Huang, Y; Jiang, Y; Zou, Q; Lin, C

    2012-01-27

    Protein folding is recognized as a critical problem in the field of biophysics in the 21st century. Predicting protein-folding patterns is challenging due to the complex structure of proteins. In an attempt to solve this problem, we employed ensemble classifiers to improve prediction accuracy. In our experiments, 188-dimensional features were extracted based on the composition and physical-chemical property of proteins and 20-dimensional features were selected using a coupled position-specific scoring matrix. Compared with traditional prediction methods, these methods were superior in terms of prediction accuracy. The 188-dimensional feature-based method achieved 71.2% accuracy in five cross-validations. The accuracy rose to 77% when we used a 20-dimensional feature vector. These methods were used on recent data, with 54.2% accuracy. Source codes and dataset, together with web server and software tools for prediction, are available at: http://datamining.xmu.edu.cn/main/~cwc/ProteinPredict.html.

  12. Identification of a putative protein profile associated with tamoxifen therapy resistance in breast cancer.

    PubMed

    Umar, Arzu; Kang, Hyuk; Timmermans, Annemieke M; Look, Maxime P; Meijer-van Gelder, Marion E; den Bakker, Michael A; Jaitly, Navdeep; Martens, John W M; Luider, Theo M; Foekens, John A; Pasa-Tolić, Ljiljana

    2009-06-01

    Tamoxifen resistance is a major cause of death in patients with recurrent breast cancer. Current clinical factors can correctly predict therapy response in only half of the treated patients. Identification of proteins that are associated with tamoxifen resistance is a first step toward better response prediction and tailored treatment of patients. In the present study we intended to identify putative protein biomarkers indicative of tamoxifen therapy resistance in breast cancer using nano-LC coupled with FTICR MS. Comparative proteome analysis was performed on approximately 5,500 pooled tumor cells (corresponding to approximately 550 ng of protein lysate/analysis) obtained through laser capture microdissection (LCM) from two independently processed data sets (n = 24 and n = 27) containing both tamoxifen therapy-sensitive and therapy-resistant tumors. Peptides and proteins were identified by matching mass and elution time of newly acquired LC-MS features to information in previously generated accurate mass and time tag reference databases. A total of 17,263 unique peptides were identified that corresponded to 2,556 non-redundant proteins identified with ≥ 2 peptides. 1,713 overlapping proteins between the two data sets were used for further analysis. Comparative proteome analysis revealed 100 putatively differentially abundant proteins between tamoxifen-sensitive and tamoxifen-resistant tumors. The presence and relative abundance for 47 differentially abundant proteins were verified by targeted nano-LC-MS/MS in a selection of unpooled, non-microdissected discovery set tumor tissue extracts. ENPP1, EIF3E, and GNB4 were significantly associated with progression-free survival upon tamoxifen treatment for recurrent disease. Differential abundance of our top discriminating protein, extracellular matrix metalloproteinase inducer, was validated by tissue microarray in an independent patient cohort (n = 156). Extracellular matrix metalloproteinase inducer levels were

  13. Identification of Proteins in Promastigote and Amastigote-like Leishmania Using an Immunoproteomic Approach

    PubMed Central

    Coelho, Vinicio T. S.; Oliveira, Jamil S.; Valadares, Diogo G.; Chávez-Fumagalli, Miguel A.; Duarte, Mariana C.; Lage, Paula S.; Soto, Manuel; Santoro, Marcelo M.; Tavares, Carlos A. P.; Fernandes, Ana Paula; Coelho, Eduardo A. F.

    2012-01-01

    Background: The present study aims to identify antigens in protein extracts of promastigote and amastigote-like Leishmania (Leishmania) chagasi syn. L. (L.) infantum recognized by antibodies present in the sera of dogs with asymptomatic and symptomatic visceral leishmaniasis (VL). Methodology/Principal Findings: Proteins recognized by sera samples were separated by two-dimensional electrophoresis (2DE) and identified by mass spectrometry. A total of 550 spots were observed in the 2DE gels, and approximately 104 proteins were identified. Several stage-specific proteins could be identified by either or both classes of sera, including, as expected, previously known proteins identified as diagnostic markers, virulence factors, drug targets, or vaccine candidates. Three, seven, and five hypothetical proteins could be identified in promastigote antigenic extracts, while two, eleven, and three hypothetical proteins could be identified in amastigote-like antigenic extracts by asymptomatic and symptomatic sera, as well as a combination of both, respectively. Conclusions/Significance: The present study represents a significant contribution not only in identifying stage-specific L. infantum molecules, but also in revealing the expression of a large number of hypothetical proteins. Moreover, when combined, the identified proteins constitute a significant source of information for the improvement of diagnostic tools and/or vaccine development for VL. PMID:22272364

  14. Large-scale identification of c-MYC-associated proteins using a combined TAP/MudPIT approach.

    PubMed

    Koch, Heike B; Zhang, Ru; Verdoodt, Berlinda; Bailey, Aaron; Zhang, Chang-Dong; Yates, John R; Menssen, Antje; Hermeking, Heiko

    2007-01-15

    The c-MYC oncogene encodes a transcription factor, which is sufficient and necessary for the induction of cellular proliferation. However, the c-MYC protein is a relatively weak transactivator suggesting that it may have other functions. To identify protein interactors which may reveal new functions or represent regulators of c-MYC we systematically identified proteins associated with c-MYC in vivo using a proteomic approach. We combined tandem affinity purification (TAP) with the mass spectral multidimensional protein identification technology (MudPIT). Thereby, 221 c-MYC-associated proteins were identified. Among them were 17 previously known c-MYC-interactors. Selected new c-MYC-associated proteins (DBC-1, FBX29, KU70, MCM7, Mi2-beta/CHD4, RNA Pol II, RFC2, RFC3, SV40 Large T Antigen, TCP1alpha, U5-116kD, ZNF281) were confirmed independently. For association with MCM7, SV40 Large T Antigen and DBC-1 the functionally important MYC-box II region was required, whereas FBX29 and Mi2-beta interacted via MYC-box II and the BR-HLH-LZ motif. In addition, regulators of c-MYC activity were identified: ectopic expression of FBX29, an E3 ubiquitin ligase, decreased c-MYC protein levels and inhibited c-MYC transactivation, whereas knock-down of FBX29 elevated the concentration of c-MYC. Furthermore, sucrose gradient analysis demonstrated that c-MYC is present in numerous complexes with varying size and composition, which may accommodate the large number of new c-MYC-associated proteins identified here and mediate the diverse functions of c-MYC. Our results suggest that c-MYC, besides acting as a mitogenic transcription factor, regulates cellular proliferation by direct association with protein complexes involved in multiple synthetic processes required for cell division, as for example DNA-replication/repair and RNA-processing. 
Furthermore, this first comprehensive description of the c-MYC-associated sub-proteome will facilitate further studies aimed to elucidate the biology

  15. Identification and correction of abnormal, incomplete and mispredicted proteins in public databases.

    PubMed

    Nagy, Alinda; Hegyi, Hédi; Farkas, Krisztina; Tordai, Hedvig; Kozma, Evelin; Bányai, László; Patthy, László

    2008-08-27

    Despite significant improvements in computational annotation of genomes, sequences of abnormal, incomplete or incorrectly predicted genes and proteins remain abundant in public databases. Since the majority of incomplete, abnormal or mispredicted entries are not annotated as such, these errors seriously affect the reliability of these databases. Here we describe the MisPred approach that may provide an efficient means for the quality control of databases. The current version of the MisPred approach uses five distinct routines for identifying abnormal, incomplete or mispredicted entries based on the principle that a sequence is likely to be incorrect if some of its features conflict with our current knowledge about protein-coding genes and proteins: (i) conflict between the predicted subcellular localization of proteins and the absence of the corresponding sequence signals; (ii) presence of extracellular and cytoplasmic domains and the absence of transmembrane segments; (iii) co-occurrence of extracellular and nuclear domains; (iv) violation of domain integrity; (v) chimeras encoded by two or more genes located on different chromosomes. Analyses of predicted EnsEMBL protein sequences of nine deuterostome (Homo sapiens, Mus musculus, Rattus norvegicus, Monodelphis domestica, Gallus gallus, Xenopus tropicalis, Fugu rubripes, Danio rerio and Ciona intestinalis) and two protostome species (Caenorhabditis elegans and Drosophila melanogaster) have revealed that the absence of expected signal peptides and violation of domain integrity account for the majority of mispredictions. Analyses of sequences predicted by NCBI's GNOMON annotation pipeline show that the rates of mispredictions are comparable to those of EnsEMBL. Interestingly, even the manually curated UniProtKB/Swiss-Prot dataset is contaminated with mispredicted or abnormal proteins, although to a much lesser extent than UniProtKB/TrEMBL or the EnsEMBL or GNOMON-predicted entries. MisPred works efficiently in

  16. Systematic analysis of protein turnover in primary cells.

    PubMed

    Mathieson, Toby; Franken, Holger; Kosinski, Jan; Kurzawa, Nils; Zinn, Nico; Sweetman, Gavain; Poeckel, Daniel; Ratnu, Vikram S; Schramm, Maike; Becher, Isabelle; Steidel, Michael; Noh, Kyung-Min; Bergamini, Giovanna; Beck, Martin; Bantscheff, Marcus; Savitski, Mikhail M

    2018-02-15

    A better understanding of proteostasis in health and disease requires robust methods to determine protein half-lives. Here we improve the precision and accuracy of peptide ion intensity-based quantification, enabling more accurate protein turnover determination in non-dividing cells by dynamic SILAC-based proteomics. This approach allows exact determination of protein half-lives ranging from 10 to >1000 h. We identified 4000-6000 proteins in several non-dividing cell types, corresponding to 9699 unique protein identifications over the entire data set. We observed similar protein half-lives in B-cells, natural killer cells and monocytes, whereas hepatocytes and mouse embryonic neurons show substantial differences. Our data set extends and statistically validates the previous observation that subunits of protein complexes tend to have coherent turnover. Moreover, analysis of different proteasome and nuclear pore complex assemblies suggests that their turnover rate is architecture dependent. These results illustrate that our approach allows investigating protein turnover and its implications in various cell types.
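
    The half-life determination described above can be illustrated with a minimal sketch. It assumes a single-exponential decay of the pre-existing (light) protein fraction in non-dividing cells, with no correction for label recycling; the function name and the fitting choice (least squares through the origin on the log-transformed fraction) are illustrative assumptions, not the authors' published pipeline.

```python
import math


def half_life_hours(times_h, light_fraction):
    """Estimate a protein half-life from dynamic SILAC time points.

    Assumes the light (pre-existing) fraction decays as exp(-k * t).
    Fits -ln(fraction) = k * t through the origin by least squares
    and returns ln(2) / k, in the same time unit as times_h.
    """
    if len(times_h) != len(light_fraction) or not times_h:
        raise ValueError("need equally long, non-empty inputs")
    num = 0.0
    den = 0.0
    for t, f in zip(times_h, light_fraction):
        if t <= 0 or not (0.0 < f <= 1.0):
            raise ValueError("times must be > 0 and fractions in (0, 1]")
        num += t * (-math.log(f))  # sum of t * y, with y = -ln(f)
        den += t * t               # sum of t^2
    k = num / den                  # through-origin least-squares slope
    if k <= 0:
        raise ValueError("no measurable decay")
    return math.log(2) / k
```

    With synthetic fractions generated from a known rate constant, the function recovers the corresponding half-life; real data would additionally need noise handling and filtering of poorly quantified peptides.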

  17. Identification of the DotL Coupling Protein Subcomplex of the Legionella Dot/Icm Type IV Secretion System

    PubMed Central

    Vincent, Carr D.; Friedman, Jonathan R.; Jeong, Kwang Cheol; Sutherland, Molly C.; Vogel, Joseph P.

    2012-01-01

    Summary: Legionella pneumophila, the causative agent of Legionnaires’ disease, survives in macrophages by altering the endocytic pathway of its host cell. To accomplish this, the bacterium utilizes a type IVB secretion system to deliver effector molecules into the host cell cytoplasm. In a previous report, we performed an extensive characterization of the L. pneumophila type IVB secretion system that resulted in the identification of a critical five-protein subcomplex that forms the core of the secretion apparatus. Here we describe a second Dot/Icm protein subassembly composed of the type IV coupling protein DotL, the apparatus proteins DotM and DotN, and the secretion adaptor proteins IcmS and IcmW. In the absence of IcmS or IcmW, DotL becomes destabilized at the transition from the exponential to stationary phases of growth, concurrent with the expression of many secreted substrates. Loss of DotL is dependent on ClpA, a regulator of the cytoplasmic protease ClpP. The resulting decreased levels of DotL in the icmS and icmW mutants exacerbate the intracellular defects of these strains and can be partially suppressed by overproduction of DotL. Thus, in addition to their role as chaperones for Legionella T4SS substrates, IcmS and IcmW perform a second function as part of the Dot/Icm type IV coupling protein subcomplex. PMID:22694730

  18. Improving functional properties of pea protein isolate for microencapsulation of flaxseed oil.

    PubMed

    Bajaj, Poonam R; Bhunia, Kanishka; Kleiner, Leslie; Joyner Melito, Helen S; Smith, Denise; Ganjyal, Girish; Sablani, Shyam S

    2017-03-01

    Unhydrolysed pea protein (UN) forms very viscous emulsions when used at higher concentrations. To overcome this, UN was hydrolysed using enzymes alcalase, flavourzyme, neutrase, alcalase-flavourzyme, and neutrase-flavourzyme at 50 °C for 0 min, 30 min, 60 min, and 120 min to form hydrolysed proteins A, F, N, AF, and NF, respectively. All hydrolysed proteins had lower apparent viscosity and higher solubility than UN. Foaming capacity of A was the highest, followed by NF, N, and AF. Hydrolysed proteins N60, A60, NF60, and AF60 were prepared by hydrolysing UN for 60 min and used further for microencapsulation. At 20% oil loading (on a total solid basis), the encapsulated powder N60 had the highest microencapsulation efficiency (ME = 56.2). A decrease in ME occurred as oil loading increased to 40%. To improve the ME of N60 to >90%, UN and maltodextrin were added. Flowability and particle size distribution of microencapsulated powders with >90% microencapsulation efficiency and morphology of all powders were investigated. This study identified a new way to improve pea protein functionality in emulsions, as well as a new application of hydrolysed pea protein as wall material for microencapsulation.

  19. MoRFchibi SYSTEM: software tools for the identification of MoRFs in protein sequences.

    PubMed

    Malhis, Nawar; Jacobson, Matthew; Gsponer, Jörg

    2016-07-08

    Molecular recognition features, MoRFs, are short segments within longer disordered protein regions that bind to globular protein domains in a process known as disorder-to-order transition. MoRFs have been found to play a significant role in signaling and regulatory processes in cells. High-confidence computational identification of MoRFs remains an important challenge. In this work, we introduce MoRFchibi SYSTEM that contains three MoRF predictors: MoRFCHiBi, a basic predictor best suited as a component in other applications, MoRFCHiBi_Light, ideal for high-throughput predictions and MoRFCHiBi_Web, slower than the other two but best for high accuracy predictions. Results show that MoRFchibi SYSTEM provides more than double the precision of other predictors. MoRFchibi SYSTEM is available in three different forms: as HTML web server, RESTful web server and downloadable software at: http://www.chibi.ubc.ca/faculty/joerg-gsponer/gsponer-lab/software/morf_chibi/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  20. The Protein Information Resource: an integrated public resource of functional annotation of proteins

    PubMed Central

    Wu, Cathy H.; Huang, Hongzhan; Arminski, Leslie; Castro-Alvear, Jorge; Chen, Yongxing; Hu, Zhang-Zhi; Ledley, Robert S.; Lewis, Kali C.; Mewes, Hans-Werner; Orcutt, Bruce C.; Suzek, Baris E.; Tsugita, Akira; Vinayaka, C. R.; Yeh, Lai-Su L.; Zhang, Jian; Barker, Winona C.

    2002-01-01

    The Protein Information Resource (PIR) serves as an integrated public resource of functional annotation of protein data to support genomic/proteomic research and scientific discovery. The PIR, in collaboration with the Munich Information Center for Protein Sequences (MIPS) and the Japan International Protein Information Database (JIPID), produces the PIR-International Protein Sequence Database (PSD), the major annotated protein sequence database in the public domain, containing about 250 000 proteins. To improve protein annotation and the coverage of experimentally validated data, a bibliography submission system is developed for scientists to submit, categorize and retrieve literature information. Comprehensive protein information is available from iProClass, which includes family classification at the superfamily, domain and motif levels, structural and functional features of proteins, as well as cross-references to over 40 biological databases. To provide timely and comprehensive protein data with source attribution, we have introduced a non-redundant reference protein database, PIR-NREF. The database consists of about 800 000 proteins collected from PIR-PSD, SWISS-PROT, TrEMBL, GenPept, RefSeq and PDB, with composite protein names and literature data. To promote database interoperability, we provide XML data distribution and open database schema, and adopt common ontologies. The PIR web site (http://pir.georgetown.edu/) features data mining and sequence analysis tools for information retrieval and functional identification of proteins based on both sequence and annotation information. The PIR databases and other files are also available by FTP (ftp://nbrfa.georgetown.edu/pir_databases). PMID:11752247

  1. Identification of the ESKAPE pathogens by mass spectrometric analysis of microbial membrane glycolipids.

    PubMed

    Leung, Lisa M; Fondrie, William E; Doi, Yohei; Johnson, J Kristie; Strickland, Dudley K; Ernst, Robert K; Goodlett, David R

    2017-07-25

    Rapid diagnostics that enable identification of infectious agents improve patient outcomes, antimicrobial stewardship, and length of hospital stay. Current methods for pathogen detection in the clinical laboratory include biological culture, nucleic acid amplification, ribosomal protein characterization, and genome sequencing. Pathogen identification from single colonies by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF-MS) analysis of high abundance proteins is gaining popularity in clinical laboratories. Here, we present a novel and complementary approach that utilizes essential microbial glycolipids as chemical fingerprints for identification of individual bacterial species. Gram-positive and negative bacterial glycolipids were extracted using a single optimized protocol. Extracts of the clinically significant ESKAPE pathogens: Enterococcus faecium, Staphylococcus aureus, Klebsiella pneumoniae, Acinetobacter baumannii, Pseudomonas aeruginosa, and Enterobacter spp. were analyzed by MALDI-TOF-MS in negative ion mode to obtain glycolipid mass spectra. A library of glycolipid mass spectra from 50 microbial entries was developed that allowed bacterial speciation of the ESKAPE pathogens, as well as identification of pathogens directly from blood bottles without culture on solid medium and determination of antimicrobial peptide resistance. These results demonstrate that bacterial glycolipid mass spectra represent chemical barcodes that identify pathogens, potentially providing a useful alternative to existing diagnostics.
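
    Matching a query spectrum against a library of reference spectra, as in the record above, can be sketched as follows. The abstract does not state the scoring scheme that was used; cosine similarity on binned spectra is a common generic choice and is an assumption here, as are the function names, the 1 m/z bin width, and the library layout.

```python
import math
from collections import defaultdict


def bin_spectrum(peaks, bin_width=1.0):
    """Sum (m/z, intensity) peaks into fixed-width m/z bins."""
    binned = defaultdict(float)
    for mz, intensity in peaks:
        binned[int(mz // bin_width)] += intensity
    return dict(binned)


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse binned spectra (0.0 if either is empty)."""
    dot = sum(value * b.get(key, 0.0) for key, value in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def best_match(query_peaks, library, bin_width=1.0):
    """Return (score, name) of the library entry most similar to the query.

    library maps an entry name to a list of (m/z, intensity) peaks.
    """
    query = bin_spectrum(query_peaks, bin_width)
    scored = [
        (cosine_similarity(query, bin_spectrum(peaks, bin_width)), name)
        for name, peaks in library.items()
    ]
    return max(scored)
```

    A practical implementation would also need intensity normalization, a score threshold below which no identification is reported, and handling of mixed spectra; none of that is shown here.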

  2. Detecting protein-protein interactions in the intact cell of Bacillus subtilis (ATCC 6633).

    PubMed

    Winters, Michael S; Day, R A

    2003-07-01

    The salt bridge, paired group-specific reagent cyanogen (ethanedinitrile; C(2)N(2)) converts naturally occurring pairs of functional groups into covalently linked products. Cyanogen readily permeates cell walls and membranes. When the paired groups are shared between associated proteins, isolation of the covalently linked proteins allows their identity to be assigned. Examination of organisms of known genome sequence permits identification of the linked proteins by mass spectrometric techniques applied to peptides derived from them. The cyanogen-linked proteins were isolated by polyacrylamide gel electrophoresis. Digestion of the isolated proteins with proteases of known specificity afforded sets of peptides that could be analyzed by mass spectrometry. These data were compared with those derived theoretically from the Swiss Protein Database by computer-based comparisons (Protein Prospector; http://prospector.ucsf.edu). Identification of associated proteins in the ribosome of Bacillus subtilis strain ATCC 6633 showed that there is an association homology with the association patterns of the ribosomal proteins of Haloarcula marismortui and Thermus thermophilus. In addition, other proteins involved in protein biosynthesis were shown to be associated with ribosomal proteins.
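
    The comparison of observed peptide masses with masses derived theoretically from a sequence database, as described above, can be illustrated with a minimal sketch. It assumes trypsin as the protease (the abstract only says "proteases of known specificity"), no missed cleavages, unmodified residues, and neutral monoisotopic masses; it does not model cyanogen-linked peptide pairs. Function names and the tolerance are illustrative.

```python
# Monoisotopic residue masses in Da (standard values).
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056


def tryptic_peptides(sequence):
    """Cleave C-terminal to K or R unless the next residue is P."""
    peptides, start = [], 0
    for i, residue in enumerate(sequence):
        next_residue = sequence[i + 1] if i + 1 < len(sequence) else ""
        if residue in "KR" and next_residue != "P":
            peptides.append(sequence[start:i + 1])
            start = i + 1
    if start < len(sequence):
        peptides.append(sequence[start:])
    return peptides


def peptide_mass(peptide):
    """Neutral monoisotopic mass: residue masses plus one water."""
    return sum(MONO[residue] for residue in peptide) + WATER


def match_masses(observed, sequence, tol_da=0.01):
    """Pair each observed mass with theoretical peptides within tol_da."""
    theoretical = [(p, peptide_mass(p)) for p in tryptic_peptides(sequence)]
    return [
        (mass, peptide)
        for mass in observed
        for peptide, theo in theoretical
        if abs(mass - theo) <= tol_da
    ]
```

    Tools such as Protein Prospector additionally account for missed cleavages, modifications, and charge states, which this sketch leaves out.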

  3. Detecting Protein-Protein Interactions in the Intact Cell of Bacillus subtilis (ATCC 6633)

    PubMed Central

    Winters, Michael S.; Day, R. A.

    2003-01-01

    The salt bridge, paired group-specific reagent cyanogen (ethanedinitrile; C2N2) converts naturally occurring pairs of functional groups into covalently linked products. Cyanogen readily permeates cell walls and membranes. When the paired groups are shared between associated proteins, isolation of the covalently linked proteins allows their identity to be assigned. Examination of organisms of known genome sequence permits identification of the linked proteins by mass spectrometric techniques applied to peptides derived from them. The cyanogen-linked proteins were isolated by polyacrylamide gel electrophoresis. Digestion of the isolated proteins with proteases of known specificity afforded sets of peptides that could be analyzed by mass spectrometry. These data were compared with those derived theoretically from the Swiss Protein Database by computer-based comparisons (Protein Prospector; http://prospector.ucsf.edu). Identification of associated proteins in the ribosome of Bacillus subtilis strain ATCC 6633 showed that there is an association homology with the association patterns of the ribosomal proteins of Haloarcula marismortui and Thermus thermophilus. In addition, other proteins involved in protein biosynthesis were shown to be associated with ribosomal proteins. PMID:12837803

  4. Identification of StARD3 as a Lutein-binding Protein in the Macula of the Primate Retina

    PubMed Central

    Li, Binxing; Vachali, Preejith; Frederick, Jeanne M.; Bernstein, Paul S.

    2011-01-01

    Lutein, zeaxanthin and their metabolites are the xanthophyll carotenoids that form the macular pigment of the human retina. Epidemiological evidence suggests that high levels of these carotenoids in the diet, serum and macula are associated with decreased risk of age-related macular degeneration (AMD), and the AREDS2 study is prospectively testing this hypothesis. Understanding the biochemical mechanisms underlying the selective uptakes of lutein and zeaxanthin into the human macula may provide important insights into the physiology of the human macula in health and disease. GSTP1 is the macular zeaxanthin-binding protein, but the identity of the human macular lutein-binding protein has remained elusive. Prior identification of the silkworm lutein-binding protein (CBP) as a member of the steroidogenic acute regulatory domain (StARD) protein family, and selective labeling of monkey photoreceptor inner segments by anti-CBP antibody provided an important clue toward identifying the primate retina lutein-binding protein. Homology of CBP to all 15 human StARD proteins was analyzed using database searches, western blotting and immunohistochemistry, and we here provide evidence to identify StARD3 (also known as MLN64) as a human retinal lutein-binding protein. Further, recombinant StARD3 selectively binds lutein with high affinity (KD = 0.45 micromolar) when assessed by surface plasmon resonance (SPR) binding assays. Our results demonstrate previously unrecognized, specific interactions of StARD3 with lutein and provide novel avenues to explore its roles in human macular physiology and disease. PMID:21322544

  5. Improved Methods for the Enrichment and Analysis of Glycated Peptides

    PubMed Central

    Zhang, Qibin; Schepmoes, Athena A.; Brock, Jonathan W. C.; Wu, Si; Moore, Ronald J.; Purvine, Samuel O.; Baynes, John W.; Smith, Richard D.; Metz, Thomas O.

    2009-01-01

    Nonenzymatic glycation of tissue proteins has important implications in the development of complications of diabetes mellitus. Herein we report improved methods for the enrichment and analysis of glycated peptides using boronate affinity chromatography and electron-transfer dissociation mass spectrometry, respectively. The enrichment of glycated peptides was improved by replacing an off-line desalting step with an online wash of column-bound glycated peptides using 50 mM ammonium acetate, followed by elution with 100 mM acetic acid. The analysis of glycated peptides by MS/MS was improved by considering only higher charged (≥3) precursor ions during data-dependent acquisition, which increased the number of glycated peptide identifications. Similarly, the use of supplemental collisional activation after electron transfer (ETcaD) resulted in more glycated peptide identifications when the MS survey scan was acquired with enhanced resolution. Acquiring ETD-MS/MS data at a normal MS survey scan rate, in conjunction with the rejection of both 1+ and 2+ precursor ions, increased the number of identified glycated peptides relative to ETcaD or the enhanced MS survey scan rate. Finally, an evaluation of trypsin, Arg-C, and Lys-C showed that tryptic digestion of glycated proteins was comparable to digestion with Lys-C and that both were better than Arg-C in terms of the number of glycated peptides and corresponding glycated proteins identified by LC–MS/MS. PMID:18989935

  6. Tandem SUMO fusion vectors for improving soluble protein expression and purification.

    PubMed

    Guerrero, Fernando; Ciragan, Annika; Iwaï, Hideo

    2015-12-01

    Availability of highly purified proteins in quantity is crucial for detailed biochemical and structural investigations. Fusion tags are versatile tools to facilitate efficient protein purification and to improve soluble overexpression of proteins. Various purification and fusion tags have been widely used for overexpression in Escherichia coli. However, these tags might interfere with biological functions and/or structural investigations of the protein of interest. Therefore, an additional purification step to remove fusion tags by proteolytic digestion might be required. Here, we describe a set of new vectors in which yeast SUMO (SMT3) was used as the highly specific recognition sequence of ubiquitin-like protease 1, together with other commonly used solubility enhancing proteins, such as glutathione S-transferase, maltose binding protein, thioredoxin and trigger factor for optimizing soluble expression of protein of interest. This tandem SUMO (T-SUMO) fusion system was tested for soluble expression of the C-terminal domain of TonB from different organisms and for the antiviral protein scytovirin. Copyright © 2015 Elsevier Inc. All rights reserved.

  7. An improved procedure, involving mass spectrometry, for N-terminal amino acid sequence determination of proteins which are N alpha-blocked.

    PubMed Central

    Rose, K; Kocher, H P; Blumberg, B M; Kolakofsky, D

    1984-01-01

    A modification to a previously described procedure [Gray & del Valle (1970) Biochemistry 9, 2134-2137; Rose, Simona & Offord (1983) Biochem. J. 215, 261-272] for mass-spectral identification of the N-terminal regions of proteins is shown to be useful in cases where the N-terminus is blocked. Three proteins were studied: vesicular-stomatitis-virus N protein, Sendai-virus NP protein, and a rabbit immunoglobulin lambda-light chain. These proteins, found to be blocked at the N-terminus with either the acetyl group or a pyroglutamic acid residue, had all failed to yield to attempted Edman degradation, in one case even after attempted enzymic removal of the pyroglutamic acid residue. The N-terminal regions of all three proteins were sequenced by using the new procedure. PMID:6421284

  8. Identification and characterisation of seed storage protein transcripts from Lupinus angustifolius

    PubMed Central

    2011-01-01

    breeding approaches to improve lupins, for example by reducing allergens or increasing the expression of specific seed storage protein(s) with desirable nutritional properties. PMID:21457583

  9. Druggable orthosteric and allosteric hot spots to target protein-protein interactions.

    PubMed

    Ma, Buyong; Nussinov, Ruth

    2014-01-01

    Designing drugs that target protein-protein interactions is challenging. Because structural elucidation and computational analysis have revealed the importance of hot spot residues in stabilizing these interactions, there have been on-going efforts to develop drugs which bind the hot spots and out-compete the native protein partners. The question arises as to what are the key 'druggable' properties of hot spots in protein-protein interactions and whether these mimic the general hot spot definition. Identification of orthosteric (at the protein-protein interaction site) and allosteric (elsewhere) druggable hot spots is expected to help in discovering compounds that can more effectively modulate protein-protein interactions. For example, are there any other significant features beyond their location in pockets in the interface? The interactions of protein-protein hot spots are coupled with conformational dynamics of protein complexes. Currently, increasing efforts focus on allosteric drug discovery. Allosteric drugs bind away from the native binding site and can modulate the native interactions. We propose that identification of allosteric hot spots could similarly help in more effective allosteric drug discovery. While detection of allosteric hot spots is challenging, targeting drugs to these residues has the potential of greatly increasing the hot spot and protein druggability.

  10. Identification of RNAIII-binding proteins in Staphylococcus aureus using tethered RNAs and streptavidin aptamers based pull-down assay.

    PubMed

    Zhang, Xu; Zhu, Qing; Tian, Tian; Zhao, Changlong; Zang, Jianye; Xue, Ting; Sun, Baolin

    2015-05-15

    It has been widely recognized that small RNAs (sRNAs) play important roles in physiology and virulence control in bacteria. In Staphylococcus aureus, many sRNAs have been identified and some of them have been functionally studied. Since it is difficult to identify RNA-binding proteins (RBPs), very little is known about the RBPs in S. aureus, especially those associated with sRNAs. Here we adopted a tRNA scaffold streptavidin aptamer based pull-down assay to identify RBPs in S. aureus. The tethered RNA was successfully captured by the streptavidin magnetic beads, and proteins binding to RNAIII were isolated and analyzed by mass spectrometry. We have identified 81 proteins, and heterologously expressed 9 of them in Escherichia coli. The binding ability of the recombinant proteins with RNAIII was further analyzed by electrophoretic mobility shift assay, and the result indicates that proteins CshA, RNase J2, Era, Hu, WalR, Pyk, and FtsZ can bind to RNAIII. This study suggests that some proteins can bind to RNAIII in S. aureus, and may be involved in RNAIII function. The tRSA-based pull-down assay is an effective method to search for RBPs in bacteria, which should facilitate the identification and functional study of RBPs in diverse bacterial species.

  11. An improved wavelet-Galerkin method for dynamic response reconstruction and parameter identification of shear-type frames

    NASA Astrophysics Data System (ADS)

    Bu, Haifeng; Wang, Dansheng; Zhou, Pin; Zhu, Hongping

    2018-04-01

    An improved wavelet-Galerkin (IWG) method based on the Daubechies wavelet is proposed for reconstructing the dynamic responses of shear structures. The proposed method flexibly manages wavelet resolution level according to excitation, thereby avoiding the weakness of the wavelet-Galerkin multiresolution analysis (WGMA) method in terms of resolution and the requirement of external excitation. IWG is implemented by this work in certain case studies, involving single- and n-degree-of-freedom frame structures subjected to a determined discrete excitation. Results demonstrate that IWG performs better than WGMA in terms of accuracy and computation efficiency. Furthermore, a new method for parameter identification based on IWG and an optimization algorithm are also developed for shear frame structures, and a simultaneous identification of structural parameters and excitation is implemented. Numerical results demonstrate that the proposed identification method is effective for shear frame structures.

  12. The verbal facilitation effect: re-reading person descriptions as a system variable to improve identification performance.

    PubMed

    Sporer, Siegfried L; Kaminski, Kristina S; Davids, Maike C; McQuiston, Dawn

    2016-11-01

    When witnesses report a crime, police usually ask for a description of the perpetrator. Several studies suggested that verbalising faces leads to a detriment in identification performance (verbal overshadowing effect [VOE]) but the effect has been difficult to replicate. Here, we sought to reverse the VOE by inducing context reinstatement as a system variable through re-reading one's own description before an identification task. Participants (N = 208) watched a video film and were then dismissed (control group), only described the perpetrator, or described and later re-read their own descriptions before identification in either target-present or target-absent lineups after a 2-day or a 5-week delay. Identification accuracy was significantly higher after re-reading (85.0%) than in the no description control group (62.5%) irrespective of target presence. Data were internally replicated using a second target and corroborated by several small meta-analyses. Identification accuracy was related to description quality. Moreover, there was a tendency towards a verbal facilitation effect (VFE) rather than a VOE. Receiver operating characteristic (ROC) curve analyses confirm that our findings are not due to a shift in response bias but truly reflect improvement of recognition performance. Differences in the ecological validity of study paradigms are discussed.

  13. Approaches to Optimizing Animal Cell Culture Process: Substrate Metabolism Regulation and Protein Expression Improvement

    NASA Astrophysics Data System (ADS)

    Zhang, Yuanxing

    Some high value proteins and vaccines for medical and veterinary applications by animal cell culture have an increasing market in China. In order to meet the demands of large-scale productions of proteins and vaccines, animal cell culture technology has been widely developed. In general, an animal cell culture process can be divided into two stages in a batch culture. In cell growth stage a high specific growth rate is expected to achieve a high cell density. In production stage a high specific production rate is stressed for the expression and secretion of qualified protein or replication of virus. It is always critical to maintain high cell viability in fed-batch and perfusion cultures. More concern has been focused on two points by the researchers in China. First, the cell metabolism of substrates is analyzed and the accumulation of toxic by-products is decreased through regulating cell metabolism in the culture process. Second, some important factors affecting protein expression are understood at the molecular level and the production ability of protein is improved. In pace with the rapid development of large-scale cell culture for the production of vaccines, antibodies and other recombinant proteins in China, the medium design and process optimization based on cell metabolism regulation and protein expression improvement will play an important role. The chapter outlines the main advances in metabolic regulation of cell and expression improvement of protein in animal cell culture in recent years.

  14. Improving low-level plasma protein mass spectrometry-based detection for candidate biomarker discovery and validation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Page, Jason S.; Kelly, Ryan T.; Camp, David G.

    2008-09-01

    Methods. To improve the detection of low-abundance proteins for candidate biomarker discovery and validation, particularly in complex biological fluids such as blood plasma, mass spectrometry (MS)-based instrumentation with increased sensitivity is desired. A key current limitation on the sensitivity of electrospray ionization (ESI) MS is that many sample molecules in solution are never ionized, and the vast majority of the ions that are created are lost during transmission from atmospheric pressure to the low pressure region of the mass analyzer. Two key technologies, multi-nanoelectrospray emitters and the electrodynamic ion funnel, have recently been developed and refined at Pacific Northwest National Laboratory (PNNL) to greatly improve the ionization and transmission efficiency of ESI MS based analyses. Multi-emitter based ESI enables the flow from a single source (typically a liquid chromatography [LC] column) to be divided among an array of emitters (Figure 1). The flow rate delivered to each emitter is thus reduced, allowing the well-documented benefits of nanoelectrospray [1] for both sensitivity and quantitation to be realized for higher flow rate separations. To complement the increased ionization efficiency afforded by multi-ESI, tandem electrodynamic ion funnels have also been developed at PNNL, and shown to greatly improve ion transmission efficiency in the ion source interface [2, 3]. These technologies have been integrated into a triple quadrupole mass spectrometer for multiple reaction monitoring (MRM) of probable biomarker candidates in blood plasma and show promise for the identification of new species even at low level concentrations.

  15. Immuno-affinity Capture Followed by TMPP N-Terminus Tagging to Study Catabolism of Therapeutic Proteins.

    PubMed

    Kullolli, Majlinda; Rock, Dan A; Ma, Ji

    2017-02-03

    Characterization of in vitro and in vivo catabolism of therapeutic proteins has increasingly become an integral part of the discovery and development process for novel proteins. Unambiguous and efficient identification of catabolites not only facilitates accurate understanding of pharmacokinetic profiles of drug candidates, but also enables follow-up protein engineering to generate more catabolically stable molecules with improved properties (pharmacokinetics and pharmacodynamics). Immunoaffinity capture (IC) followed by top-down intact protein analysis using either matrix-assisted laser desorption/ionization or electrospray ionization mass spectrometry analysis has been the primary method of choice for catabolite identification. However, the sensitivity and efficiency of these methods are not always sufficient for characterization of novel proteins from complex biomatrices such as plasma or serum. In this study a novel bottom-up targeted protein workflow was optimized for analysis of proteolytic degradation of therapeutic proteins. Selective and sensitive tagging of the alpha-amine at the N-terminus of proteins of interest was performed by immunoaffinity capture of therapeutic protein and its catabolites followed by on-bead succinimidyloxycarbonylmethyl tri-(2,4,6-trimethoxyphenyl) N-terminus (TMPP-NTT) tagging. The positively charged hydrophobic TMPP tag facilitates unambiguous sequence identification of all N-terminus peptides from complex tryptic digestion samples via data-dependent liquid chromatography-tandem mass spectrometry. Utility of the workflow was illustrated by definitive analysis of the in vitro catabolic profile of neurotensin human Fc (NTs-huFc) protein in mouse serum. The results from this study demonstrated that the IC-TMPP-NTT workflow is a simple and efficient method for catabolite formation in therapeutic proteins.

  16. Screening and identification of resistance related proteins from apple leaves inoculated with Marssonina coronaria (EII. & J. J. Davis)

    PubMed Central

    2014-01-01

    Background: Apple, an invaluable fruit crop worldwide, is often prone to infection by pathogenic fungi. Identification of potentially resistance-conferring apple proteins is one of the most important aims for studying apple resistance mechanisms and promoting the development of disease-resistant apple strains. In order to find proteins which promote resistance to Marssonina coronaria, a deadly pathogen which has been related to premature apple maturation, proteomes from apple leaves inoculated with M. coronaria were characterized at 3 and 6 days post-inoculation by two-dimensional electrophoresis (2-DE). Results: Overall, 59 protein spots differentially accumulated between inoculation and non-inoculation were successfully identified and aligned as 35 different proteins or protein families that are involved in photosynthesis, amino acid metabolism, transport, energy metabolism, carbohydrate metabolism, binding, antioxidant, defense and stress. Quantitative real-time PCR (qRT-PCR) was also used to examine changes in the abundance of some defense- and stress-related genes under inoculated conditions. Conclusions: In conclusion, different proteins responding to Marssonina coronaria were identified by proteomic analysis. Among these proteins, there are some PR proteins, for example class III endo-chitinase, beta-1,3-glucanase and thaumatin-like protein, and some antioxidant-related proteins, including aldo/keto reductase AKR, ascorbate peroxidase and phi class glutathione S-transferase protein, that were associated with disease resistance. The transcription levels of class III endo-chitinase (L13) and beta-1,3-glucanase (L17) correlate well with the abundance of the encoded proteins; however, the mRNA abundance of thaumatin-like protein (L22) and ascorbate peroxidase (L28) is not correlated with the abundance of their encoded proteins. To elucidate the resistant mechanism, the data in the present study will promote us to investigate further the

  17. Identification of Human N-Myristoylated Proteins from Human Complementary DNA Resources by Cell-Free and Cellular Metabolic Labeling Analyses.

    PubMed

    Takamitsu, Emi; Otsuka, Motoaki; Haebara, Tatsuki; Yano, Manami; Matsuzaki, Kanako; Kobuchi, Hirotsugu; Moriya, Koko; Utsumi, Toshihiko

    2015-01-01

    To identify physiologically important human N-myristoylated proteins, 90 cDNA clones predicted to encode human N-myristoylated proteins were selected from a human cDNA resource (4,369 Kazusa ORFeome project human cDNA clones) by two bioinformatic N-myristoylation prediction systems, NMT-The MYR Predictor and Myristoylator. After database searches to exclude known human N-myristoylated proteins, 37 cDNA clones were selected as potential human N-myristoylated proteins. The susceptibility of these cDNA clones to protein N-myristoylation was first evaluated using fusion proteins in which the N-terminal ten amino acid residues were fused to an epitope-tagged model protein. Then, protein N-myristoylation of the gene products of full-length cDNAs was evaluated by metabolic labeling experiments both in an insect cell-free protein synthesis system and in transfected human cells. As a result, the products of 13 cDNA clones (FBXL7, PPM1B, SAMM50, PLEKHN, AIFM3, C22orf42, STK32A, FAM131C, DRICH1, MCC1, HID1, P2RX5, STK32B) were found to be human N-myristoylated proteins. Analysis of the role of protein N-myristoylation on the intracellular localization of SAMM50, a mitochondrial outer membrane protein, revealed that protein N-myristoylation was required for proper targeting of SAMM50 to mitochondria. Thus, the strategy used in this study is useful for the identification of physiologically important human N-myristoylated proteins from human cDNA resources.

  18. Identification of Human N-Myristoylated Proteins from Human Complementary DNA Resources by Cell-Free and Cellular Metabolic Labeling Analyses

    PubMed Central

    Takamitsu, Emi; Otsuka, Motoaki; Haebara, Tatsuki; Yano, Manami; Matsuzaki, Kanako; Kobuchi, Hirotsugu; Moriya, Koko; Utsumi, Toshihiko

    2015-01-01

    To identify physiologically important human N-myristoylated proteins, 90 cDNA clones predicted to encode human N-myristoylated proteins were selected from a human cDNA resource (4,369 Kazusa ORFeome project human cDNA clones) by two bioinformatic N-myristoylation prediction systems, NMT-The MYR Predictor and Myristoylator. After database searches to exclude known human N-myristoylated proteins, 37 cDNA clones were selected as potential human N-myristoylated proteins. The susceptibility of these cDNA clones to protein N-myristoylation was first evaluated using fusion proteins in which the N-terminal ten amino acid residues were fused to an epitope-tagged model protein. Then, protein N-myristoylation of the gene products of full-length cDNAs was evaluated by metabolic labeling experiments both in an insect cell-free protein synthesis system and in transfected human cells. As a result, the products of 13 cDNA clones (FBXL7, PPM1B, SAMM50, PLEKHN, AIFM3, C22orf42, STK32A, FAM131C, DRICH1, MCC1, HID1, P2RX5, STK32B) were found to be human N-myristoylated proteins. Analysis of the role of protein N-myristoylation on the intracellular localization of SAMM50, a mitochondrial outer membrane protein, revealed that protein N-myristoylation was required for proper targeting of SAMM50 to mitochondria. Thus, the strategy used in this study is useful for the identification of physiologically important human N-myristoylated proteins from human cDNA resources. PMID:26308446

  19. Proteomics: Protein Identification Using Online Databases

    ERIC Educational Resources Information Center

    Eurich, Chris; Fields, Peter A.; Rice, Elizabeth

    2012-01-01

    Proteomics is an emerging area of systems biology that allows simultaneous study of thousands of proteins expressed in cells, tissues, or whole organisms. We have developed this activity to enable high school or college students to explore proteomic databases using mass spectrometry data files generated from yeast proteins in a college laboratory…

  20. OpenKnowledge for peer-to-peer experimentation in protein identification by MS/MS

    PubMed Central

    2011-01-01

    Background: Traditional scientific workflow platforms usually run individual experiments with little evaluation and analysis of performance, as required by automated experimentation in which scientists are allowed to access numerous applicable workflows rather than being committed to a single one. Experimental protocols and data in a peer-to-peer environment could potentially be shared freely without any single point of authority to dictate how experiments should be run. In such an environment it is necessary to have mechanisms by which each individual scientist (peer) can assess, locally, how he or she wants to be involved with others in experiments. This study aims to implement and demonstrate simple peer ranking under the OpenKnowledge peer-to-peer infrastructure by both simulated and real-world bioinformatics experiments involving multi-agent interactions. Methods: A simulated experiment environment with a peer ranking capability was specified by the Lightweight Coordination Calculus (LCC) and automatically executed under the OpenKnowledge infrastructure. Peers such as MS/MS protein identification services (including web-enabled and independent programs) were made accessible as OpenKnowledge Components (OKCs) for automated execution as peers in the experiments. The performance of the peers in these automated experiments was monitored and evaluated by simple peer ranking algorithms. Results: Peer ranking experiments with simulated peers exhibited characteristic behaviours, e.g., a power law effect (a few peers dominate), similar to that observed in the traditional Web. Real-world experiments were run using an interaction model in LCC involving two different types of MS/MS protein identification peers, viz., peptide fragment fingerprinting (PFF) and de novo sequencing, with another peer ranking algorithm simply based on counting the successful and failed runs. This study demonstrated a novel integration and useful evaluation of specific proteomic

  1. Genome engineering for improved recombinant protein expression in Escherichia coli.

    PubMed

    Mahalik, Shubhashree; Sharma, Ashish K; Mukherjee, Krishna J

    2014-12-19

    A metabolic engineering perspective which views recombinant protein expression as a multistep pathway allows us to move beyond vector design and identify the downstream rate-limiting steps in expression. In E. coli these are typically at the translational level and in the supply of precursors in the form of energy, amino acids and nucleotides. Further, recombinant protein production triggers a global cellular stress response which feedback inhibits both growth and product formation. Countering this requires a system-level analysis followed by rational host cell engineering to sustain expression for longer time periods. Another strategy to increase protein yields could be to divert the metabolic flux away from biomass formation and towards recombinant protein production. This would require a growth stoppage mechanism which does not affect the metabolic activity of the cell or the transcriptional or translational efficiencies. Finally, cells have to be designed for efficient export to prevent buildup of proteins inside the cytoplasm and also to simplify downstream processing. The rational and high-throughput strategies that can be used for the construction of such improved host cell platforms for recombinant protein expression are the focus of this review.

  2. Improving isotopic identification with INDRA Silicon-CsI(Tl) telescopes

    NASA Astrophysics Data System (ADS)

    Lopez, O.; Pârlog, M.; Borderie, B.; Rivet, M. F.; Lehaut, G.; Tabacaru, G.; Tassan-Got, L.; Pawłowski, P.; Bonnet, E.; Bougault, R.; Chbihi, A.; Dell'Aquila, D.; Frankland, J. D.; Galichet, E.; Gruyer, D.; La Commara, M.; Le Neindre, N.; Lombardo, I.; Manduci, L.; Marini, P.; Steckmeyer, J. C.; Verde, G.; Vient, E.; Wieleczko, J. P.; Indra Collaboration

    2018-03-01

    Profiting from previous works done with the INDRA multidetector on the description of the light response L of the CsI(Tl) crystals to different impinging nuclei, we propose an improved ΔE - L identification-calibration procedure for Silicon-Caesium Iodide (Si-CsI) telescopes, namely an Advanced Mass Estimate (AME) method. AME is compared to the usual, simple visual analysis of the corresponding two-dimensional map of ΔE - E type, by using INDRA experimental data from nuclear reactions induced by heavy ions in the Fermi energy regime. We show that the capability of such telescopes to identify both the atomic Z and the mass A numbers of light and heavy reaction products, can be quantitatively improved thanks to the proposed approach. This conclusion opens new possibilities to use INDRA for studying these reactions especially with radioactive beams. Indeed, the determination of the mass for charged reaction products becomes of paramount importance to shed light on the role of the isospin degree of freedom in the nuclear equation of state [1,2].

  3. Verification of Ribosomal Proteins of Aspergillus fumigatus for Use as Biomarkers in MALDI-TOF MS Identification

    PubMed Central

    Nakamura, Sayaka; Sato, Hiroaki; Tanaka, Reiko; Yaguchi, Takashi

    2016-01-01

    We have previously proposed a rapid identification method for bacterial strains based on the profiles of their ribosomal subunit proteins (RSPs), observed using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS). This method can perform phylogenetic characterization based on the mass of housekeeping RSP biomarkers, ideally calculated from amino acid sequence information registered in public protein databases. With the aim of extending its field of application to medical mycology, this study investigates the actual state of information of RSPs of eukaryotic fungi registered in public protein databases through the characterization of ribosomal protein fractions extracted from genome-sequenced Aspergillus fumigatus strains Af293 and A1163 as a model. In this process, we have found that the public protein databases harbor problems. The RSP names are in confusion, so we have provisionally unified them using the yeast naming system. The most serious problem is that many incorrect sequences are registered in the public protein databases. Surprisingly, more than half of the sequences are incorrect, due chiefly to mis-annotation of exon/intron structures. These errors could be corrected by a combination of in silico inspection by sequence homology analysis and MALDI-TOF MS measurements. We were also able to confirm conserved post-translational modifications in eleven RSPs. After these verifications, the masses of 31 expressed RSPs under 20,000 Da could be accurately confirmed. These RSPs have a potential to be useful biomarkers for identifying clinical isolates of A. fumigatus. PMID:27843740

  4. Accurate disulfide-bonding network predictions improve ab initio structure prediction of cysteine-rich proteins

    PubMed Central

    Yang, Jing; He, Bao-Ji; Jang, Richard; Zhang, Yang; Shen, Hong-Bin

    2015-01-01

    Abstract Motivation: Cysteine-rich proteins cover many important families in nature but there are currently no methods specifically designed for modeling the structure of these proteins. The accuracy of disulfide connectivity pattern prediction, particularly for the proteins of higher-order connections, e.g. >3 bonds, is too low to effectively assist structure assembly simulations. Results: We propose a new hierarchical order reduction protocol called Cyscon for disulfide-bonding prediction. The most confident disulfide bonds are first identified and bonding prediction is then focused on the remaining cysteine residues based on SVR training. Compared with purely machine learning-based approaches, Cyscon improved the average accuracy of connectivity pattern prediction by 21.9%. For proteins with more than 5 disulfide bonds, Cyscon improved the accuracy by 585% on the benchmark set of PDBCYS. When applied to 158 non-redundant cysteine-rich proteins, Cyscon predictions helped increase (or decrease) the TM-score (or RMSD) of the ab initio QUARK modeling by 12.1% (or 14.4%). This result demonstrates a new avenue to improve the ab initio structure modeling for cysteine-rich proteins. Availability and implementation: http://www.csbio.sjtu.edu.cn/bioinf/Cyscon/ Contact: zhng@umich.edu or hbshen@sjtu.edu.cn Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26254435

  5. Identification of a Functional Plasmodesmal Localization Signal in a Plant Viral Cell-To-Cell-Movement Protein.

    PubMed

    Yuan, Cheng; Lazarowitz, Sondra G; Citovsky, Vitaly

    2016-01-19

    Our fundamental knowledge of the protein-sorting pathways required for plant cell-to-cell trafficking and communication via the intercellular connections termed plasmodesmata has been severely limited by the paucity of plasmodesmal targeting sequences that have been identified to date. To address this limitation, we have identified the plasmodesmal localization signal (PLS) in the Tobacco mosaic virus (TMV) cell-to-cell-movement protein (MP), which has emerged as the paradigm for dissecting the molecular details of cell-to-cell transport through plasmodesmata. We report here the identification of a bona fide functional TMV MP PLS, which encompasses amino acid residues between positions 1 and 50, with residues Val-4 and Phe-14 potentially representing critical sites for PLS function that most likely affect protein conformation or protein interactions. We then demonstrated that this PLS is both necessary and sufficient for protein targeting to plasmodesmata. Importantly, as TMV MP traffics to plasmodesmata by a mechanism that is distinct from those of the three plant cell proteins in which PLSs have been reported, our findings provide important new insights to expand our understanding of protein-sorting pathways to plasmodesmata. The science of virology began with the discovery of Tobacco mosaic virus (TMV). Since then, TMV has served as an experimental and conceptual model for studies of viruses and dissection of virus-host interactions. Indeed, the TMV cell-to-cell-movement protein (MP) has emerged as the paradigm for dissecting the molecular details of cell-to-cell transport through the plant intercellular connections termed plasmodesmata. However, one of the most fundamental and key functional features of TMV MP, its putative plasmodesmal localization signal (PLS), has not been identified. Here, we fill this gap in our knowledge and identify the TMV MP PLS. Copyright © 2016 Yuan et al.

  6. An improved approach to infer protein-protein interaction based on a hierarchical vector space model.

    PubMed

    Zhang, Jiongmin; Jia, Ke; Jia, Jinmeng; Qian, Ying

    2018-04-27

    Comparing and classifying functions of gene products are important in today's biomedical research. The semantic similarity derived from the Gene Ontology (GO) annotation has been regarded as one of the most widely used indicators for protein interaction. Among the various approaches proposed, those based on the vector space model are relatively simple, but their effectiveness is far from satisfying. We propose a Hierarchical Vector Space Model (HVSM) for computing semantic similarity between different genes or their products, which enhances the basic vector space model by introducing the relation between GO terms. Besides the directly annotated terms, HVSM also takes their ancestors and descendants related by "is_a" and "part_of" relations into account. Moreover, HVSM introduces the concept of a Certainty Factor to calibrate the semantic similarity based on the number of terms annotated to genes. To assess the performance of our method, we applied HVSM to Homo sapiens and Saccharomyces cerevisiae protein-protein interaction datasets. Compared with TCSS, Resnik, and other classic similarity measures, HVSM achieved significant improvement for distinguishing positive from negative protein interactions. We also tested its correlation with sequence, EC, and Pfam similarity using online tool CESSM. HVSM showed an improvement of up to 4% compared to TCSS, 8% compared to IntelliGO, 12% compared to basic VSM, 6% compared to Resnik, 8% compared to Lin, 11% compared to Jiang, 8% compared to Schlicker, and 11% compared to SimGIC using AUC scores. CESSM test showed HVSM was comparable to SimGIC, and superior to all other similarity measures in CESSM as well as TCSS. Supplementary information and the software are available at https://github.com/kejia1215/HVSM.
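
    To make the contrast between a basic vector space model and one that also uses GO hierarchy concrete, the following is a minimal sketch, not the HVSM implementation. It assumes a hypothetical toy ontology (`PARENTS`), binary term vectors, and cosine similarity; it expands annotations with ancestors only and omits descendants and the Certainty Factor described in the abstract.

```python
import math

# Hypothetical toy ontology (not real GO data): term -> parent terms,
# standing in for "is_a" / "part_of" edges.
PARENTS = {
    "GO:C": ["GO:B"],
    "GO:D": ["GO:B"],
    "GO:B": ["GO:A"],
    "GO:E": ["GO:A"],
    "GO:A": [],
}

def with_ancestors(terms):
    """Return the annotated terms plus every ancestor reachable via PARENTS."""
    seen = set()
    stack = list(terms)
    while stack:
        term = stack.pop()
        if term not in seen:
            seen.add(term)
            stack.extend(PARENTS.get(term, []))
    return seen

def cosine(a, b):
    """Cosine similarity of two binary term vectors represented as sets."""
    if not a or not b:
        return 0.0
    return len(a & b) / math.sqrt(len(a) * len(b))

def basic_similarity(terms1, terms2):
    """Basic VSM: compare only the directly annotated terms."""
    return cosine(set(terms1), set(terms2))

def expanded_similarity(terms1, terms2):
    """Hierarchy-aware variant: compare terms after adding their ancestors."""
    return cosine(with_ancestors(terms1), with_ancestors(terms2))
```

    In this toy setting, two genes annotated with the sibling terms "GO:C" and "GO:D" share no direct terms, so the basic score is zero, while the expanded score is positive because both inherit "GO:B" and "GO:A".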

  7. Scorecard implementation improves identification of postpartum patients at risk for venous thromboembolism.

    PubMed

    Berkin, Jill A; Lee, Colleen; Landsberger, Ellen; Chazotte, Cynthia; Bernstein, Peter S; Goffman, Dena

    2016-07-01

    To evaluate if an intensive educational intervention in the use of a standardized venous thromboembolism (VTE) risk assessment tool (scorecard) improves physicians' identification and chemoprophylaxis of postpartum patients at risk for VTE. After implementation of a VTE scorecard and prior to an intensive educational intervention, postpartum patients (n = 140) were evaluated to assess scorecard completion, risk factors, and chemoprophylaxis. A performance improvement campaign focusing on patient safety, VTE prevention, and scorecard utilization was then conducted. Evaluation of the same parameters was subsequently performed for a similar group of patients (n = 133). Differences in scorecard utilization and risk assessment were tested for statistical significance. Population-at-risk rates were similar in both assessment periods (31.4% vs 28.6%; p = NS). The greatest risk factors included cesarean delivery, body mass index (BMI) >30 and age >35. Scorecard completion rates for all patients increased in the postintervention period (15.7% vs 67.7%; p < .001). Postintervention scorecard completion rates for the at-risk population also improved (20% vs 79%; p < .001). In the postintervention group, those at risk with completed scorecards had higher prophylaxis rates than those at risk without scorecards (73% vs 25%; p = .03). At-risk patients with completed scorecards had 2.6 times more orders for chemoprophylaxis than at-risk patients without scorecards in both time periods (odds ratio [OR] = 8.4; 95% confidence interval [CI] 3.1-22.8). Utilization of a VTE scorecard coupled with an educational intervention for health care providers increases detection and chemoprophylaxis orders for at-risk patients. Encouraging universal scorecard assessment standardizes identification and chemoprophylaxis of at-risk patients who were otherwise not perceived to be at risk. © 2016 American Society for Healthcare Risk Management of the American Hospital Association.

  8. The Efficacy of Short-term Gated Audiovisual Speech Training for Improving Auditory Sentence Identification in Noise in Elderly Hearing Aid Users

    PubMed Central

    Moradi, Shahram; Wahlin, Anna; Hällgren, Mathias; Rönnberg, Jerker; Lidestam, Björn

    2017-01-01

    This study aimed to examine the efficacy and maintenance of short-term (one-session) gated audiovisual speech training for improving auditory sentence identification in noise in experienced elderly hearing-aid users. Twenty-five hearing aid users (16 men and 9 women), with an average age of 70.8 years, were randomly divided into an experimental (audiovisual training, n = 14) and a control (auditory training, n = 11) group. Participants underwent gated speech identification tasks comprising Swedish consonants and words presented at 65 dB sound pressure level with a 0 dB signal-to-noise ratio (steady-state broadband noise), in audiovisual or auditory-only training conditions. The Hearing-in-Noise Test was employed to measure participants’ auditory sentence identification in noise before the training (pre-test), promptly after training (post-test), and 1 month after training (one-month follow-up). The results showed that audiovisual training improved auditory sentence identification in noise promptly after the training (post-test vs. pre-test scores); furthermore, this improvement was maintained 1 month after the training (one-month follow-up vs. pre-test scores). Such improvement was not observed in the control group, neither promptly after the training nor at the one-month follow-up. However, no significant between-groups difference nor an interaction between groups and session was observed. Conclusion: Audiovisual training may be considered in aural rehabilitation of hearing aid users to improve listening capabilities in noisy conditions. However, the lack of a significant between-groups effect (audiovisual vs. auditory) or an interaction between group and session calls for further research. PMID:28348542

  9. The Efficacy of Short-term Gated Audiovisual Speech Training for Improving Auditory Sentence Identification in Noise in Elderly Hearing Aid Users.

    PubMed

    Moradi, Shahram; Wahlin, Anna; Hällgren, Mathias; Rönnberg, Jerker; Lidestam, Björn

    2017-01-01

    This study aimed to examine the efficacy and maintenance of short-term (one-session) gated audiovisual speech training for improving auditory sentence identification in noise in experienced elderly hearing-aid users. Twenty-five hearing aid users (16 men and 9 women), with an average age of 70.8 years, were randomly divided into an experimental (audiovisual training, n = 14) and a control (auditory training, n = 11) group. Participants underwent gated speech identification tasks comprising Swedish consonants and words presented at 65 dB sound pressure level with a 0 dB signal-to-noise ratio (steady-state broadband noise), in audiovisual or auditory-only training conditions. The Hearing-in-Noise Test was employed to measure participants' auditory sentence identification in noise before the training (pre-test), promptly after training (post-test), and 1 month after training (one-month follow-up). The results showed that audiovisual training improved auditory sentence identification in noise promptly after the training (post-test vs. pre-test scores); furthermore, this improvement was maintained 1 month after the training (one-month follow-up vs. pre-test scores). Such improvement was not observed in the control group, neither promptly after the training nor at the one-month follow-up. However, no significant between-groups difference nor an interaction between groups and session was observed. Audiovisual training may be considered in aural rehabilitation of hearing aid users to improve listening capabilities in noisy conditions. However, the lack of a significant between-groups effect (audiovisual vs. auditory) or an interaction between group and session calls for further research.

  10. A numerical study of sensory-guided multiple views for improved object identification

    NASA Astrophysics Data System (ADS)

    Blakeslee, B. A.; Zelnio, E. G.; Koditschek, D. E.

    2014-06-01

    We explore the potential on-line adjustment of sensory controls for improved object identification and discrimination in the context of a simulated high resolution camera system carried onboard a maneuverable robotic platform that can actively choose its observational position and pose. Our early numerical studies suggest the significant efficacy and enhanced performance achieved by even very simple feedback-driven iteration of the view in contrast to identification from a fixed pose, uninformed by any active adaptation. Specifically, we contrast the discriminative performance of the same conventional classification system when informed by: a random glance at a vehicle; two random glances at a vehicle; or a random glance followed by a guided second look. After each glance, edge detection algorithms isolate the most salient features of the image and template matching is performed through the use of the Hausdorff distance, comparing the simulated sensed images with reference images of the vehicles. We present initial simulation statistics that overwhelmingly favor the third scenario. We conclude with a sketch of our near-future steps in this study that will entail: the incorporation of more sophisticated image processing and template matching algorithms; more complex discrimination tasks such as distinguishing between two similar vehicles or vehicles in motion; more realistic models of the observer's mobility including platform dynamics and eventually environmental constraints; and expanding the sensing task beyond the identification of a specified object selected from a pre-defined library of alternatives.
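
    The template-matching step mentioned above can be illustrated with a minimal sketch of the symmetric Hausdorff distance between two sets of edge-pixel coordinates. This is an assumption-laden illustration, not the authors' code: the point sets, the template names, and the helper `best_template` are hypothetical, and a brute-force search is used for clarity rather than speed.

```python
import math

def directed_hausdorff(a, b):
    """Largest distance from any point in `a` to its nearest point in `b`."""
    return max(min(math.dist(p, q) for q in b) for p in a)

def hausdorff(a, b):
    """Symmetric Hausdorff distance between two non-empty point sets."""
    return max(directed_hausdorff(a, b), directed_hausdorff(b, a))

def best_template(sensed, templates):
    """Return the name of the reference template closest to the sensed edges.

    `templates` maps a (hypothetical) vehicle name to its edge-point list.
    """
    return min(templates, key=lambda name: hausdorff(sensed, templates[name]))

# Hypothetical edge points extracted from a sensed image and two references.
sensed = [(0, 0), (1, 0)]
templates = {
    "truck": [(0, 0), (4, 0)],
    "car": [(0, 0), (1, 1)],
}
closest = best_template(sensed, templates)
```

    A smaller distance means every edge point in one image has a nearby counterpart in the other, so the template with the minimum distance is taken as the match.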

  11. Study on Urban Heat Island Intensity Level Identification Based on an Improved Restricted Boltzmann Machine.

    PubMed

    Zhang, Yang; Jiang, Ping; Zhang, Hongyan; Cheng, Peng

    2018-01-23

    Thermal infrared remote sensing has become one of the main technology methods used for urban heat island research. When applying urban land surface temperature inversion of the thermal infrared band, problems with intensity level division arise because the method is subjective. However, this method is one of the few that performs heat island intensity level identification. This paper will build an intensity level identifier for an urban heat island, by using weak supervision and thought-based learning in an improved, restricted Boltzmann machine (RBM) model. The identifier automatically initializes the annotation and optimizes the model parameters sequentially until the target identifier is completed. The algorithm needs very little information about the weak labeling of the target training sample and generates an urban heat island intensity spatial distribution map. This study can provide reliable decision-making support for urban ecological planning and effective protection of urban ecological security. The experimental results showed the following: (1) The heat island effect in Wuhan is existent and intense. Heat island areas are widely distributed. The largest heat island area is in Wuhan, followed by the sub-green island. The total area encompassed by heat island and strong island levels accounts for 54.16% of the land in Wuhan. (2) Partially based on improved RBM identification, this method meets the research demands of determining the spatial distribution characteristics of the internal heat island effect; its identification accuracy is superior to that of comparable methods.

  12. Study on Urban Heat Island Intensity Level Identification Based on an Improved Restricted Boltzmann Machine

    PubMed Central

    Jiang, Ping; Zhang, Hongyan; Cheng, Peng

    2018-01-01

    Thermal infrared remote sensing has become one of the main technology methods used for urban heat island research. When applying urban land surface temperature inversion of the thermal infrared band, problems with intensity level division arise because the method is subjective. However, this method is one of the few that performs heat island intensity level identification. This paper will build an intensity level identifier for an urban heat island, by using weak supervision and thought-based learning in an improved, restricted Boltzmann machine (RBM) model. The identifier automatically initializes the annotation and optimizes the model parameters sequentially until the target identifier is completed. The algorithm needs very little information about the weak labeling of the target training sample and generates an urban heat island intensity spatial distribution map. This study can provide reliable decision-making support for urban ecological planning and effective protection of urban ecological security. The experimental results showed the following: (1) The heat island effect in Wuhan is existent and intense. Heat island areas are widely distributed. The largest heat island area is in Wuhan, followed by the sub-green island. The total area encompassed by heat island and strong island levels accounts for 54.16% of the land in Wuhan. (2) Partially based on improved RBM identification, this method meets the research demands of determining the spatial distribution characteristics of the internal heat island effect; its identification accuracy is superior to that of comparable methods. PMID:29360786

  13. Fast tandem mass spectra-based protein identification regardless of the number of spectra or potential modifications examined.

    PubMed

    Falkner, Jayson; Andrews, Philip

    2005-05-15

    Comparing tandem mass spectra (MSMS) against a known dataset of protein sequences is a common method for identifying unknown proteins; however, the processing of MSMS by current software often limits certain applications, including comprehensive coverage of post-translational modifications, non-specific searches and real-time searches to allow result-dependent instrument control. This problem deserves attention as new mass spectrometers provide the ability for higher throughput and as known protein datasets rapidly grow in size. New software algorithms need to be devised in order to address the performance issues of conventional MSMS protein dataset-based protein identification. This paper describes a novel algorithm based on converting a collection of monoisotopic, centroided spectra to a new data structure, named 'peptide finite state machine' (PFSM), which may be used to rapidly search a known dataset of protein sequences, regardless of the number of spectra searched or the number of potential modifications examined. The algorithm is verified using a set of commercially available tryptic digest protein standards analyzed using an ABI 4700 MALDI TOFTOF mass spectrometer, and a free, open source PFSM implementation. It is illustrated that a PFSM can accurately search large collections of spectra against large datasets of protein sequences (e.g. NCBI nr) using a regular desktop PC; however, this paper only details the method for identifying peptide and subsequently protein candidates from a dataset of known protein sequences. The concept of using a PFSM as a peptide pre-screening technique for MSMS-based search engines is validated by using PFSM with Mascot and XTandem. Complete source code, documentation and examples for the reference PFSM implementation are freely available at the Proteome Commons, http://www.proteomecommons.org and source code may be used both commercially and non-commercially as long as the original authors are credited for their work.

  14. Identification of Borrelia protein candidates in mouse skin for potential diagnosis of disseminated Lyme borreliosis.

    PubMed

    Grillon, Antoine; Westermann, Benoît; Cantero, Paola; Jaulhac, Benoît; Voordouw, Maarten J; Kapps, Delphine; Collin, Elody; Barthel, Cathy; Ehret-Sabatier, Laurence; Boulanger, Nathalie

    2017-12-01

    In vector-borne diseases, the skin plays an essential role in the transmission of vector-borne pathogens between the vertebrate host and blood-feeding arthropods and in pathogen persistence. Borrelia burgdorferi sensu lato is a tick-borne bacterium that causes Lyme borreliosis (LB) in humans. This pathogen may establish a long-lasting infection in its natural vertebrate host where it can persist in the skin and some other organs. Using a mouse model, we demonstrate that Borrelia targets the skin regardless of the route of inoculation, and can persist there at low densities that are difficult to detect via qPCR, but that were infective for blood-feeding ticks. Application of immunosuppressive dermocorticoids at 40 days post-infection (PI) significantly enhanced the Borrelia population size in the mouse skin. We used non-targeted (Ge-LC-MS/MS) and targeted (SRM-MS) proteomics to detect several Borrelia-specific proteins in the mouse skin at 40 days PI. Detected Borrelia proteins included flagellin, VlsE and GAPDH. An important problem in LB is the lack of diagnosis methods capable of detecting active infection in humans suffering from disseminated LB. The identification of Borrelia proteins in skin biopsies may provide new approaches for assessing active infection in disseminated manifestations.

  15. Identify High-Quality Protein Structural Models by Enhanced K-Means.

    PubMed

    Wu, Hongjie; Li, Haiou; Jiang, Min; Chen, Cheng; Lv, Qiang; Wu, Chuang

    2017-01-01

    Background. One critical issue in protein three-dimensional structure prediction using either ab initio or comparative modeling involves identification of high-quality protein structural models from generated decoys. Currently, clustering algorithms are widely used to identify near-native models; however, their performance is dependent upon different conformational decoys, and, for some algorithms, the accuracy declines when the decoy population increases. Results. Here, we proposed two enhanced K-means clustering algorithms capable of robustly identifying high-quality protein structural models. The first one employs the clustering algorithm SPICKER to determine the initial centroids for basic K-means clustering (SK-means), whereas the other employs squared distance to optimize the initial centroids (K-means++). Our results showed that SK-means and K-means++ were more robust as compared with SPICKER alone, detecting 33 (59%) and 42 (75%) of 56 targets, respectively, with template modeling scores better than or equal to those of SPICKER. Conclusions. We observed that the classic K-means algorithm showed a similar performance to that of SPICKER, which is a widely used algorithm for protein-structure identification. Both SK-means and K-means++ demonstrated substantial improvements relative to results from SPICKER and classical K-means.
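
    The squared-distance seeding that the abstract attributes to K-means++ can be sketched as follows. This is the generic K-means++ initialization, not the authors' code: the points are hypothetical low-dimensional coordinates, whereas the paper's decoys would presumably be compared with a structural distance measure, and the function names are illustrative.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length coordinate tuples."""
    return sum((x - y) ** 2 for x, y in zip(p, q))


def kmeans_pp_init(points, k, rng):
    """Choose k initial centroids, favouring points far from earlier choices."""
    centroids = [rng.choice(points)]
    while len(centroids) < k:
        # Weight each point by its squared distance to the nearest centroid.
        weights = [min(squared_distance(p, c) for c in centroids) for p in points]
        if sum(weights) == 0:
            raise ValueError("fewer distinct points than requested centroids")
        centroids.append(rng.choices(points, weights=weights, k=1)[0])
    return centroids


# Hypothetical data: two well-separated groups of points (illustrative only).
points = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (9.0, 9.0), (9.1, 8.8), (8.9, 9.2)]
seeds = kmeans_pp_init(points, 2, random.Random(0))
print(seeds)
```

    Because an already chosen point has weight zero, it is not drawn again, so the seeds tend to spread across the data before the ordinary K-means iterations begin.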

  16. Identify High-Quality Protein Structural Models by Enhanced K-Means

    PubMed Central

    Li, Haiou; Chen, Cheng; Lv, Qiang; Wu, Chuang

    2017-01-01

    Background. One critical issue in protein three-dimensional structure prediction using either ab initio or comparative modeling involves identification of high-quality protein structural models from generated decoys. Currently, clustering algorithms are widely used to identify near-native models; however, their performance is dependent upon different conformational decoys, and, for some algorithms, the accuracy declines when the decoy population increases. Results. Here, we proposed two enhanced K-means clustering algorithms capable of robustly identifying high-quality protein structural models. The first one employs the clustering algorithm SPICKER to determine the initial centroids for basic K-means clustering (SK-means), whereas the other employs squared distance to optimize the initial centroids (K-means++). Our results showed that SK-means and K-means++ were more robust as compared with SPICKER alone, detecting 33 (59%) and 42 (75%) of 56 targets, respectively, with template modeling scores better than or equal to those of SPICKER. Conclusions. We observed that the classic K-means algorithm showed a similar performance to that of SPICKER, which is a widely used algorithm for protein-structure identification. Both SK-means and K-means++ demonstrated substantial improvements relative to results from SPICKER and classical K-means. PMID:28421198

  17. Identification of differentially expressed proteins in Ostrinia furnacalis adults after exposure to ultraviolet A.

    PubMed

    Zhang, Changyu; Meng, Jianyu

    2018-06-23

    Ultraviolet A (UVA), the major component of solar UV irradiation, is an important environmental factor inducing damage to insects including cell death, photoreceptor damage, and oxidative stress. In order to improve understanding of the adaptation mechanisms of insect after UVA exposure, a comparative proteomic analysis was carried out to reveal differential protein expression in Ostrinia furnacalis. Three-day-old adults were treated with UVA for 1 h. Total proteins of control and UVA-treated insects were examined using two-dimensional electrophoresis (2-DE). 2-DE analysis demonstrated that 19 proteins were increased and 18 proteins were decreased significantly in O. furnacalis after UVA exposure, respectively. Thirty differentially expressed proteins were successfully identified by mass spectrometry. The identified proteins were involved in diverse biological processes, such as signal transduction, transport processing, cellular stress, metabolisms, and cytoskeleton organization. Our results reveal that the response patterns of O. furnacalis to UVA irradiation are complex and provide novel insights into the adaptation response to UVA irradiation stress.

  18. Optimization of Search Engines and Postprocessing Approaches to Maximize Peptide and Protein Identification for High-Resolution Mass Data.

    PubMed

    Tu, Chengjian; Sheng, Quanhu; Li, Jun; Ma, Danjun; Shen, Xiaomeng; Wang, Xue; Shyr, Yu; Yi, Zhengping; Qu, Jun

    2015-11-06

    The two key steps for analyzing proteomic data generated by high-resolution MS are database searching and postprocessing. While the two steps are interrelated, studies on their combinatory effects and the optimization of these procedures have not been adequately conducted. Here, we investigated the performance of three popular search engines (SEQUEST, Mascot, and MS Amanda) in conjunction with five filtering approaches, including respective score-based filtering, a group-based approach, local false discovery rate (LFDR), PeptideProphet, and Percolator. A total of eight data sets from various proteomes (e.g., E. coli, yeast, and human) produced by various instruments with high-accuracy survey scan (MS1) and high- or low-accuracy fragment ion scan (MS2) (LTQ-Orbitrap, Orbitrap-Velos, Orbitrap-Elite, Q-Exactive, Orbitrap-Fusion, and Q-TOF) were analyzed. It was found that combinations involving Percolator achieved markedly more peptide and protein identifications at the same FDR level than the other 12 combinations for all data sets. Among these, combinations of SEQUEST-Percolator and MS Amanda-Percolator provided slightly better performances for data sets with low-accuracy MS2 (ion trap or IT) and high accuracy MS2 (Orbitrap or TOF), respectively, than did other methods. For approaches without Percolator, SEQUEST-group performs the best for data sets with MS2 produced by collision-induced dissociation (CID) and IT analysis; Mascot-LFDR gives more identifications for data sets generated by higher-energy collisional dissociation (HCD) and analyzed in Orbitrap (HCD-OT) and in Orbitrap Fusion (HCD-IT); MS Amanda-Group excels for the Q-TOF data set and the Orbitrap Velos HCD-OT data set. Therefore, if Percolator was not used, a specific combination should be applied for each type of data set. Moreover, a higher percentage of multiple-peptide proteins and lower variation of protein spectral counts were observed when analyzing technical replicates using Percolator.
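
    Comparing methods "at the same FDR level" can be made concrete with a small sketch of score-based filtering. The abstract does not say how FDR was estimated, so this example assumes the common target-decoy estimate (decoy hits divided by target hits above a score cutoff); the scores and the function name are hypothetical, and none of the five filtering approaches named above is reproduced here.

```python
def filter_at_fdr(psms, max_fdr):
    """Return target scores accepted at the largest cutoff meeting max_fdr.

    psms: list of (score, is_decoy) pairs, where a higher score is better.
    FDR at a cutoff is estimated as decoys / targets (an assumption here).
    """
    ranked = sorted(psms, key=lambda psm: psm[0], reverse=True)
    targets = decoys = 0
    best_cut = 0
    for position, (_, is_decoy) in enumerate(ranked, start=1):
        if is_decoy:
            decoys += 1
        else:
            targets += 1
        # Remember the deepest position at which the estimate is acceptable.
        if targets and decoys / targets <= max_fdr:
            best_cut = position
    return [score for score, is_decoy in ranked[:best_cut] if not is_decoy]


# Hypothetical peptide-spectrum match scores (illustrative only).
psms = [(9.1, False), (8.7, False), (7.9, False),
        (7.2, True), (6.8, False), (5.0, True)]

print(filter_at_fdr(psms, 0.25))
```

    Under this scheme, a method that separates targets from decoys more cleanly accepts more identifications at the same threshold, which is the kind of comparison the abstract reports.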

  19. Distyrylbenzene-aldehydes: identification of proteins in water.

    PubMed

    Kumpf, Jan; Freudenberg, Jan; Bunz, Uwe H F

    2015-05-07

    Three different, water soluble, aldehyde-appended distyrylbenzene (DSB) derivatives were prepared. Their interaction with different albumin variants (human, porcine, bovine, lactalbumin, ovalbumin) was investigated (pH 11). All three fluorophores exhibit graded, protein-dependent fluorescence turn-on at slightly differing wavelengths. Linear discriminant analysis (LDA) differentiated all of the investigated albumins and was used to discern commercially available protein shakes. The three DSB derivatives barely react with the constituting amino acids but cysteine. In the proteins significant fluorescence signals are generated, probably due to a combination of imine/N,S-aminal formation and hydrophobic interactions between the DSBs and the proteins.

  20. P2P proteomics -- data sharing for enhanced protein identification

    PubMed Central

    2012-01-01

    Background In order to tackle the important and challenging problem in proteomics of identifying known and new protein sequences using high-throughput methods, we propose a data-sharing platform that uses fully distributed P2P technologies to share specifications of peer-interaction protocols and service components. By using such a platform, information to be searched is no longer centralised in a few repositories but gathered from experiments in peer proteomics laboratories, which can subsequently be searched by fellow researchers. Methods The system distributively runs a data-sharing protocol specified in the Lightweight Communication Calculus underlying the system through which researchers interact via message passing. For this, researchers interact with the system through particular components that link to database querying systems based on BLAST and/or OMSSA and GUI-based visualisation environments. We have tested the proposed platform with data drawn from preexisting MS/MS data reservoirs from the 2006 ABRF (Association of Biomolecular Resource Facilities) test sample, which was extensively tested during the ABRF Proteomics Standards Research Group 2006 worldwide survey. In particular we have taken the data available from a subset of proteomics laboratories of Spain's National Institute for Proteomics, ProteoRed, a network for the coordination, integration and development of the Spanish proteomics facilities. Results and Discussion We performed queries against nine databases including seven ProteoRed proteomics laboratories, the NCBI Swiss-Prot database and the local database of the CSIC/UAB Proteomics Laboratory. A detailed analysis of the results indicated the presence of a protein that was supported by other NCBI matches and highly scored matches in several proteomics labs. The analysis clearly indicated that the protein was a relatively high concentrated contaminant that could be present in the ABRF sample. 
This fact is evident from the information that